Sczyrba, Alex
2018-02-13
DOE JGI's Alex Sczyrba on "Evaluation of the Cow Rumen Metagenome" and "Assembly by Single Copy Gene Analysis and Single Cell Genome Assemblies" at the Metagenomics Informatics Challenges Workshop held at the DOE JGI on October 12-13, 2011.
DOE Office of Scientific and Technical Information (OSTI.GOV)
Sczyrba, Alex
2011-10-13
DOE JGI's Alex Sczyrba on "Evaluation of the Cow Rumen Metagenome" and "Assembly by Single Copy Gene Analysis and Single Cell Genome Assemblies" at the Metagenomics Informatics Challenges Workshop held at the DOE JGI on October 12-13, 2011.
Functional analysis of regulatory single-nucleotide polymorphisms.
Pampín, Sandra; Rodríguez-Rey, José C
2007-04-01
The identification of regulatory polymorphisms has become a key problem in human genetics. In the past few years there has been a conceptual change in the way in which regulatory single-nucleotide polymorphisms are studied. We revise the new approaches and discuss how gene expression studies can contribute to a better knowledge of the genetics of common diseases. New techniques for the association of single-nucleotide polymorphisms with changes in gene expression have been recently developed. This, together with a more comprehensive use of the old in-vitro methods, has produced a great amount of genetic information. When added to current databases, it will help to design better tools for the detection of regulatory single-nucleotide polymorphisms. The identification of functional regulatory single-nucleotide polymorphisms cannot be done by the simple inspection of DNA sequence. In-vivo techniques, based on primer-extension, and the more recently developed 'haploChIP' allow the association of gene variants to changes in gene expression. Gene expression analysis by conventional in-vitro techniques is the only way to identify the functional consequences of regulatory single-nucleotide polymorphisms. The amount of information produced in the last few years will help to refine the tools for the future analysis of regulatory gene variants.
Winterhoff, Boris J; Maile, Makayla; Mitra, Amit Kumar; Sebe, Attila; Bazzaro, Martina; Geller, Melissa A; Abrahante, Juan E; Klein, Molly; Hellweg, Raffaele; Mullany, Sally A; Beckman, Kenneth; Daniel, Jerry; Starr, Timothy K
2017-03-01
The purpose of this study was to determine the level of heterogeneity in high grade serous ovarian cancer (HGSOC) by analyzing RNA expression in single epithelial and cancer associated stromal cells. In addition, we explored the possibility of identifying subgroups based on pathway activation and pre-defined signatures from cancer stem cells and chemo-resistant cells. A fresh, HGSOC tumor specimen derived from ovary was enzymatically digested and depleted of immune infiltrating cells. RNA sequencing was performed on 92 single cells and 66 of these single cell datasets passed quality control checks. Sequences were analyzed using multiple bioinformatics tools, including clustering, principle components analysis, and geneset enrichment analysis to identify subgroups and activated pathways. Immunohistochemistry for ovarian cancer, stem cell and stromal markers was performed on adjacent tumor sections. Analysis of the gene expression patterns identified two major subsets of cells characterized by epithelial and stromal gene expression patterns. The epithelial group was characterized by proliferative genes including genes associated with oxidative phosphorylation and MYC activity, while the stromal group was characterized by increased expression of extracellular matrix (ECM) genes and genes associated with epithelial-to-mesenchymal transition (EMT). Neither group expressed a signature correlating with published chemo-resistant gene signatures, but many cells, predominantly in the stromal subgroup, expressed markers associated with cancer stem cells. Single cell sequencing provides a means of identifying subpopulations of cancer cells within a single patient. Single cell sequence analysis may prove to be critical for understanding the etiology, progression and drug resistance in ovarian cancer. Copyright © 2017 Elsevier Inc. All rights reserved.
Gene and domain duplication in the chordate Otx gene family: insights from amphioxus Otx.
Williams, N A; Holland, P W
1998-05-01
We report the genomic organization and deduced protein sequence of a cephalochordate member of the Otx homeobox gene family (AmphiOtx) and show its probable single-copy state in the genome. We also present molecular phylogenetic analysis indicating that there was single ancestral Otx gene in the first chordates which was duplicated in the vertebrate lineage after it had split from the lineage leading to the cephalochordates. Duplication of a C-terminal protein domain has occurred specifically in the vertebrate lineage, strengthening the case for a single Otx gene in an ancestral chordate whose gene structure has been retained in an extant cephalochordate. Comparative analysis of protein sequences and published gene expression patterns suggest that the ancestral chordate Otx gene had roles in patterning the anterior mesendoderm and central nervous system. These roles were elaborated following Otx gene duplication in vertebrates, accompanied by regulatory and structural divergence, particularly of Otx1 descendant genes.
A kernel regression approach to gene-gene interaction detection for case-control studies.
Larson, Nicholas B; Schaid, Daniel J
2013-11-01
Gene-gene interactions are increasingly being addressed as a potentially important contributor to the variability of complex traits. Consequently, attentions have moved beyond single locus analysis of association to more complex genetic models. Although several single-marker approaches toward interaction analysis have been developed, such methods suffer from very high testing dimensionality and do not take advantage of existing information, notably the definition of genes as functional units. Here, we propose a comprehensive family of gene-level score tests for identifying genetic elements of disease risk, in particular pairwise gene-gene interactions. Using kernel machine methods, we devise score-based variance component tests under a generalized linear mixed model framework. We conducted simulations based upon coalescent genetic models to evaluate the performance of our approach under a variety of disease models. These simulations indicate that our methods are generally higher powered than alternative gene-level approaches and at worst competitive with exhaustive SNP-level (where SNP is single-nucleotide polymorphism) analyses. Furthermore, we observe that simulated epistatic effects resulted in significant marginal testing results for the involved genes regardless of whether or not true main effects were present. We detail the benefits of our methods and discuss potential genome-wide analysis strategies for gene-gene interaction analysis in a case-control study design. © 2013 WILEY PERIODICALS, INC.
Genetic mapping of the rice resistance-breaking gene of the brown planthopper Nilaparvata lugens
Kobayashi, Tetsuya; Yamamoto, Kimiko; Suetsugu, Yoshitaka; Kuwazaki, Seigo; Hattori, Makoto; Jairin, Jirapong; Sanada-Morimura, Sachiyo; Matsumura, Masaya
2014-01-01
Host plant resistance has been widely used for controlling the major rice pest brown planthopper (BPH, Nilaparvata lugens). However, adaptation of the wild BPH population to resistance limits the effective use of resistant rice varieties. Quantitative trait locus (QTL) analysis was conducted to identify resistance-breaking genes against the anti-feeding mechanism mediated by the rice resistance gene Bph1. QTL analysis in iso-female BPH lines with single-nucleotide polymorphism (SNP) markers detected a single region on the 10th linkage group responsible for the virulence. The QTL explained from 57 to 84% of the total phenotypic variation. Bulked segregant analysis with next-generation sequencing in F2 progenies identified five SNPs genetically linked to the virulence. These analyses showed that virulence to Bph1 was controlled by a single recessive gene. In contrast to previous studies, the gene-for-gene relationship between the major resistance gene Bph1 and virulence gene of BPH was confirmed. Identified markers are available for map-based cloning of the major gene controlling BPH virulence to rice resistance. PMID:24870048
Single-Cell and Single-Molecule Analysis of Gene Expression Regulation.
Vera, Maria; Biswas, Jeetayu; Senecal, Adrien; Singer, Robert H; Park, Hye Yoon
2016-11-23
Recent advancements in single-cell and single-molecule imaging technologies have resolved biological processes in time and space that are fundamental to understanding the regulation of gene expression. Observations of single-molecule events in their cellular context have revealed highly dynamic aspects of transcriptional and post-transcriptional control in eukaryotic cells. This approach can relate transcription with mRNA abundance and lifetimes. Another key aspect of single-cell analysis is the cell-to-cell variability among populations of cells. Definition of heterogeneity has revealed stochastic processes, determined characteristics of under-represented cell types or transitional states, and integrated cellular behaviors in the context of multicellular organisms. In this review, we discuss novel aspects of gene expression of eukaryotic cells and multicellular organisms revealed by the latest advances in single-cell and single-molecule imaging technology.
Non-biased and efficient global amplification of a single-cell cDNA library
Huang, Huan; Goto, Mari; Tsunoda, Hiroyuki; Sun, Lizhou; Taniguchi, Kiyomi; Matsunaga, Hiroko; Kambara, Hideki
2014-01-01
Analysis of single-cell gene expression promises a more precise understanding of molecular mechanisms of a living system. Most techniques only allow studies of the expressions for limited numbers of gene species. When amplification of cDNA was carried out for analysing more genes, amplification biases were frequently reported. A non-biased and efficient global-amplification method, which uses a single-cell cDNA library immobilized on beads, was developed for analysing entire gene expressions for single cells. Every step in this analysis from reverse transcription to cDNA amplification was optimized. By removing degrading excess primers, the bias due to the digestion of cDNA was prevented. Since the residual reagents, which affect the efficiency of each subsequent reaction, could be removed by washing beads, the conditions for uniform and maximized amplification of cDNAs were achieved. The differences in the amplification rates for randomly selected eight genes were within 1.5-folds, which could be negligible for most of the applications of single-cell analysis. The global amplification gives a large amount of amplified cDNA (>100 μg) from a single cell (2-pg mRNA), and that amount is enough for downstream analysis. The proposed global-amplification method was used to analyse transcript ratios of multiple cDNA targets (from several copies to several thousand copies) quantitatively. PMID:24141095
Fox, Bridget C; Devonshire, Alison S; Baradez, Marc-Olivier; Marshall, Damian; Foy, Carole A
2012-08-15
Single cell gene expression analysis can provide insights into development and disease progression by profiling individual cellular responses as opposed to reporting the global average of a population. Reverse transcription-quantitative polymerase chain reaction (RT-qPCR) is the "gold standard" for the quantification of gene expression levels; however, the technical performance of kits and platforms aimed at single cell analysis has not been fully defined in terms of sensitivity and assay comparability. We compared three kits using purification columns (PicoPure) or direct lysis (CellsDirect and Cells-to-CT) combined with a one- or two-step RT-qPCR approach using dilutions of cells and RNA standards to the single cell level. Single cell-level messenger RNA (mRNA) analysis was possible using all three methods, although the precision, linearity, and effect of lysis buffer and cell background differed depending on the approach used. The impact of using a microfluidic qPCR platform versus a standard instrument was investigated for potential variability introduced by preamplification of template or scaling down of the qPCR to nanoliter volumes using laser-dissected single cell samples. The two approaches were found to be comparable. These studies show that accurate gene expression analysis is achievable at the single cell level and highlight the importance of well-validated experimental procedures for low-level mRNA analysis. Copyright © 2012 Elsevier Inc. All rights reserved.
Genetic mapping of the rice resistance-breaking gene of the brown planthopper Nilaparvata lugens.
Kobayashi, Tetsuya; Yamamoto, Kimiko; Suetsugu, Yoshitaka; Kuwazaki, Seigo; Hattori, Makoto; Jairin, Jirapong; Sanada-Morimura, Sachiyo; Matsumura, Masaya
2014-07-22
Host plant resistance has been widely used for controlling the major rice pest brown planthopper (BPH, Nilaparvata lugens). However, adaptation of the wild BPH population to resistance limits the effective use of resistant rice varieties. Quantitative trait locus (QTL) analysis was conducted to identify resistance-breaking genes against the anti-feeding mechanism mediated by the rice resistance gene Bph1. QTL analysis in iso-female BPH lines with single-nucleotide polymorphism (SNP) markers detected a single region on the 10th linkage group responsible for the virulence. The QTL explained from 57 to 84% of the total phenotypic variation. Bulked segregant analysis with next-generation sequencing in F2 progenies identified five SNPs genetically linked to the virulence. These analyses showed that virulence to Bph1 was controlled by a single recessive gene. In contrast to previous studies, the gene-for-gene relationship between the major resistance gene Bph1 and virulence gene of BPH was confirmed. Identified markers are available for map-based cloning of the major gene controlling BPH virulence to rice resistance. © 2014 The Author(s) Published by the Royal Society. All rights reserved.
Ehlers, Claudia; Veit, Katharina; Gottschalk, Gerhard; Schmitz, Ruth A.
2002-01-01
The mesophilic methanogenic archaeon Methanosarcina mazei strain Gö1 is able to utilize molecular nitrogen (N2) as its sole nitrogen source. We have identified and characterized a single nitrogen fixation (nif) gene cluster in M. mazei Gö1 with an approximate length of 9 kbp. Sequence analysis revealed seven genes with sequence similarities to nifH, nifI1, nifI2, nifD, nifK, nifE and nifN, similar to other diazotrophic methanogens and certain bacteria such as Clostridium acetobutylicum, with the two glnB-like genes (nifI1 and nifI2) located between nifH and nifD. Phylogenetic analysis of deduced amino acid sequences for the nitrogenase structural genes of M. mazei Gö1 showed that they are most closely related to Methanosarcina barkeri nif2 genes, and also closely resemble those for the corresponding nif products of the gram-positive bacterium C. acetobutylicum. Northern blot analysis and reverse transcription PCR analysis demonstrated that the M. mazei nif genes constitute an operon transcribed only under nitrogen starvation as a single 8 kb transcript. Sequence analysis revealed a palindromic sequence at the transcriptional start site in front of the M. mazei nifH gene, which may have a function in transcriptional regulation of the nif operon. PMID:15803652
Single Cell Gene Expression Profiling of Skeletal Muscle-Derived Cells.
Gatto, Sole; Puri, Pier Lorenzo; Malecova, Barbora
2017-01-01
Single cell gene expression profiling is a fundamental tool for studying the heterogeneity of a cell population by addressing the phenotypic and functional characteristics of each cell. Technological advances that have coupled microfluidic technologies with high-throughput quantitative RT-PCR analyses have enabled detailed analyses of single cells in various biological contexts. In this chapter, we describe the procedure for isolating the skeletal muscle interstitial cells termed Fibro-Adipogenic Progenitors (FAPs ) and their gene expression profiling at the single cell level. Moreover, we accompany our bench protocol with bioinformatics analysis designed to process raw data as well as to visualize single cell gene expression data. Single cell gene expression profiling is therefore a useful tool in the investigation of FAPs heterogeneity and their contribution to muscle homeostasis.
Deep sequencing reveals cell-type-specific patterns of single-cell transcriptome variation.
Dueck, Hannah; Khaladkar, Mugdha; Kim, Tae Kyung; Spaethling, Jennifer M; Francis, Chantal; Suresh, Sangita; Fisher, Stephen A; Seale, Patrick; Beck, Sheryl G; Bartfai, Tamas; Kuhn, Bernhard; Eberwine, James; Kim, Junhyong
2015-06-09
Differentiation of metazoan cells requires execution of different gene expression programs but recent single-cell transcriptome profiling has revealed considerable variation within cells of seeming identical phenotype. This brings into question the relationship between transcriptome states and cell phenotypes. Additionally, single-cell transcriptomics presents unique analysis challenges that need to be addressed to answer this question. We present high quality deep read-depth single-cell RNA sequencing for 91 cells from five mouse tissues and 18 cells from two rat tissues, along with 30 control samples of bulk RNA diluted to single-cell levels. We find that transcriptomes differ globally across tissues with regard to the number of genes expressed, the average expression patterns, and within-cell-type variation patterns. We develop methods to filter genes for reliable quantification and to calibrate biological variation. All cell types include genes with high variability in expression, in a tissue-specific manner. We also find evidence that single-cell variability of neuronal genes in mice is correlated with that in rats consistent with the hypothesis that levels of variation may be conserved. Single-cell RNA-sequencing data provide a unique view of transcriptome function; however, careful analysis is required in order to use single-cell RNA-sequencing measurements for this purpose. Technical variation must be considered in single-cell RNA-sequencing studies of expression variation. For a subset of genes, biological variability within each cell type appears to be regulated in order to perform dynamic functions, rather than solely molecular noise.
Granatum: a graphical single-cell RNA-Seq analysis pipeline for genomics scientists.
Zhu, Xun; Wolfgruber, Thomas K; Tasato, Austin; Arisdakessian, Cédric; Garmire, David G; Garmire, Lana X
2017-12-05
Single-cell RNA sequencing (scRNA-Seq) is an increasingly popular platform to study heterogeneity at the single-cell level. Computational methods to process scRNA-Seq data are not very accessible to bench scientists as they require a significant amount of bioinformatic skills. We have developed Granatum, a web-based scRNA-Seq analysis pipeline to make analysis more broadly accessible to researchers. Without a single line of programming code, users can click through the pipeline, setting parameters and visualizing results via the interactive graphical interface. Granatum conveniently walks users through various steps of scRNA-Seq analysis. It has a comprehensive list of modules, including plate merging and batch-effect removal, outlier-sample removal, gene-expression normalization, imputation, gene filtering, cell clustering, differential gene expression analysis, pathway/ontology enrichment analysis, protein network interaction visualization, and pseudo-time cell series construction. Granatum enables broad adoption of scRNA-Seq technology by empowering bench scientists with an easy-to-use graphical interface for scRNA-Seq data analysis. The package is freely available for research use at http://garmiregroup.org/granatum/app.
Single cell transcriptomics of neighboring hyphae of Aspergillus niger
2011-01-01
Single cell profiling was performed to assess differences in RNA accumulation in neighboring hyphae of the fungus Aspergillus niger. A protocol was developed to isolate and amplify RNA from single hyphae or parts thereof. Microarray analysis resulted in a present call for 4 to 7% of the A. niger genes, of which 12% showed heterogeneous RNA levels. These genes belonged to a wide range of gene categories. PMID:21816052
Krawitz, Peter M; Schiska, Daniela; Krüger, Ulrike; Appelt, Sandra; Heinrich, Verena; Parkhomchuk, Dmitri; Timmermann, Bernd; Millan, Jose M; Robinson, Peter N; Mundlos, Stefan; Hecht, Jochen; Gross, Manfred
2014-01-01
Usher syndrome is an autosomal recessive disorder characterized both by deafness and blindness. For the three clinical subtypes of Usher syndrome causal mutations in altogether 12 genes and a modifier gene have been identified. Due to the genetic heterogeneity of Usher syndrome, the molecular analysis is predestined for a comprehensive and parallelized analysis of all known genes by next-generation sequencing (NGS) approaches. We describe here the targeted enrichment and deep sequencing for exons of Usher genes and compare the costs and workload of this approach compared to Sanger sequencing. We also present a bioinformatics analysis pipeline that allows us to detect single-nucleotide variants, short insertions and deletions, as well as copy number variations of one or more exons on the same sequence data. Additionally, we present a flexible in silico gene panel for the analysis of sequence variants, in which newly identified genes can easily be included. We applied this approach to a cohort of 44 Usher patients and detected biallelic pathogenic mutations in 35 individuals and monoallelic mutations in eight individuals of our cohort. Thirty-nine of the sequence variants, including two heterozygous deletions comprising several exons of USH2A, have not been reported so far. Our NGS-based approach allowed us to assess single-nucleotide variants, small indels, and whole exon deletions in a single test. The described diagnostic approach is fast and cost-effective with a high molecular diagnostic yield. PMID:25333064
Krawitz, Peter M; Schiska, Daniela; Krüger, Ulrike; Appelt, Sandra; Heinrich, Verena; Parkhomchuk, Dmitri; Timmermann, Bernd; Millan, Jose M; Robinson, Peter N; Mundlos, Stefan; Hecht, Jochen; Gross, Manfred
2014-09-01
Usher syndrome is an autosomal recessive disorder characterized both by deafness and blindness. For the three clinical subtypes of Usher syndrome causal mutations in altogether 12 genes and a modifier gene have been identified. Due to the genetic heterogeneity of Usher syndrome, the molecular analysis is predestined for a comprehensive and parallelized analysis of all known genes by next-generation sequencing (NGS) approaches. We describe here the targeted enrichment and deep sequencing for exons of Usher genes and compare the costs and workload of this approach compared to Sanger sequencing. We also present a bioinformatics analysis pipeline that allows us to detect single-nucleotide variants, short insertions and deletions, as well as copy number variations of one or more exons on the same sequence data. Additionally, we present a flexible in silico gene panel for the analysis of sequence variants, in which newly identified genes can easily be included. We applied this approach to a cohort of 44 Usher patients and detected biallelic pathogenic mutations in 35 individuals and monoallelic mutations in eight individuals of our cohort. Thirty-nine of the sequence variants, including two heterozygous deletions comprising several exons of USH2A, have not been reported so far. Our NGS-based approach allowed us to assess single-nucleotide variants, small indels, and whole exon deletions in a single test. The described diagnostic approach is fast and cost-effective with a high molecular diagnostic yield.
Ben Said, Mourad; Ben Asker, Alaa; Belkahia, Hanène; Ghribi, Raoua; Selmi, Rachid; Messadi, Lilia
2018-05-12
Anaplasma marginale, which is responsible for bovine anaplasmosis in tropical and subtropical regions, is a tick-borne obligatory intraerythrocytic bacterium of cattle and wild ruminants. In Tunisia, information about the genetic diversity and the phylogeny of A. marginale strains are limited to the msp4 gene analysis. The purpose of this study is to investigate A. marginale isolates infecting 16 cattle located in different bioclimatic areas of northern Tunisia with single gene analysis and multilocus sequence typing methods on the basis of seven partial genes (dnaA, ftsZ, groEL, lipA, secY, recA and sucB). The single gene analysis confirmed the presence of different and novel heterogenic A. marginale strains infecting cattle from the north of Tunisia. The concatenated sequence analysis showed a phylogeographical resolution at the global level and that most of the Tunisian sequence types (STs) formed a separate cluster from a South African isolate and from all New World isolates and strains. By combining the characteristics of each single locus with those of the multi-loci scheme, these results provide a more detailed understanding on the diversity and the evolution of Tunisian A. marginale strains. Copyright © 2018 Elsevier GmbH. All rights reserved.
Evaluation of tools for highly variable gene discovery from single-cell RNA-seq data.
Yip, Shun H; Sham, Pak Chung; Wang, Junwen
2018-02-21
Traditional RNA sequencing (RNA-seq) allows the detection of gene expression variations between two or more cell populations through differentially expressed gene (DEG) analysis. However, genes that contribute to cell-to-cell differences are not discoverable with RNA-seq because RNA-seq samples are obtained from a mixture of cells. Single-cell RNA-seq (scRNA-seq) allows the detection of gene expression in each cell. With scRNA-seq, highly variable gene (HVG) discovery allows the detection of genes that contribute strongly to cell-to-cell variation within a homogeneous cell population, such as a population of embryonic stem cells. This analysis is implemented in many software packages. In this study, we compare seven HVG methods from six software packages, including BASiCS, Brennecke, scLVM, scran, scVEGs and Seurat. Our results demonstrate that reproducibility in HVG analysis requires a larger sample size than DEG analysis. Discrepancies between methods and potential issues in these tools are discussed and recommendations are made.
Meta-analysis of gene-level associations for rare variants based on single-variant statistics.
Hu, Yi-Juan; Berndt, Sonja I; Gustafsson, Stefan; Ganna, Andrea; Hirschhorn, Joel; North, Kari E; Ingelsson, Erik; Lin, Dan-Yu
2013-08-08
Meta-analysis of genome-wide association studies (GWASs) has led to the discoveries of many common variants associated with complex human diseases. There is a growing recognition that identifying "causal" rare variants also requires large-scale meta-analysis. The fact that association tests with rare variants are performed at the gene level rather than at the variant level poses unprecedented challenges in the meta-analysis. First, different studies may adopt different gene-level tests, so the results are not compatible. Second, gene-level tests require multivariate statistics (i.e., components of the test statistic and their covariance matrix), which are difficult to obtain. To overcome these challenges, we propose to perform gene-level tests for rare variants by combining the results of single-variant analysis (i.e., p values of association tests and effect estimates) from participating studies. This simple strategy is possible because of an insight that multivariate statistics can be recovered from single-variant statistics, together with the correlation matrix of the single-variant test statistics, which can be estimated from one of the participating studies or from a publicly available database. We show both theoretically and numerically that the proposed meta-analysis approach provides accurate control of the type I error and is as powerful as joint analysis of individual participant data. This approach accommodates any disease phenotype and any study design and produces all commonly used gene-level tests. An application to the GWAS summary results of the Genetic Investigation of ANthropometric Traits (GIANT) consortium reveals rare and low-frequency variants associated with human height. The relevant software is freely available. Copyright © 2013 The American Society of Human Genetics. Published by Elsevier Inc. All rights reserved.
Dimitrova, Irina K.; Richer, Jennifer K.; Rudolph, Michael C.; Spoelstra, Nicole S.; Reno, Elaine M.; Medina, Theresa M.; Bradford, Andrew P.
2009-01-01
Objective To identify differentially expressed genes between fibroid and adjacent normal myometrium in an identical hormonal and genetic background. Design Array analysis of 3 leiomyomata and matched adjacent normal myometrium in a single patient. Setting University of Colorado Hospital. Patient(s) A single female undergoing medically indicated hysterectomy for symptomatic fibroids. Interventions(s) mRNA isolation and microarray analysis, reverse-transcriptase polymerase chain reaction, western blotting and immunohistochemistry. Main Outcome Measure(s) Changes in mRNA and protein levels in leiomyomata and matched normal myometrium. Result(s) Expression of 197 genes was increased and 619 decreased, significantly by at least 2 fold, in leiomyomata relative to normal myometrium. Expression profiles between tumors were similar and normal myometrial samples showed minimal variation. Changes in, and variation of, expression of selected genes were confirmed in additional normal and leiomyoma samples from multiple patients. Conclusion(s) Analysis of multiple tumors from a single patient confirmed changes in expression of genes described in previous, apparently disparate, studies and identified novel targets. Gene expression profiles in leiomyomata are consistent with increased activation of mitogenic pathways and inhibition of apoptosis. Down-regulation of genes implicated in invasion and metastasis, of cancers, was observed in fibroids. This expression pattern may underlie the benign nature of uterine leiomyomata and may aid in the differential diagnosis of leiomyosarcoma. PMID:18672237
Zhai, Rong-Lin; Xu, Fei; Zhang, Pei; Zhang, Wan-Li; Wang, Hui; Wang, Ji-Liang; Cai, Kai-Lin; Long, Yue-Ping; Lu, Xiao-Ming; Tao, Kai-Xiong; Wang, Guo-Bin
2016-02-01
This meta-analysis was designed to evaluate the diagnostic performance of stool DNA testing for colorectal cancer (CRC) and compare the performance between single-gene and multiple-gene tests.MEDLINE, Cochrane, EMBASE databases were searched using keywords colorectal cancers, stool/fecal, sensitivity, specificity, DNA, and screening. Sensitivity analysis, quality assessments, and performance bias were performed for the included studies.Fifty-three studies were included in the analysis with a total sample size of 7524 patients. The studies were heterogeneous with regard to the genes being analyzed for fecal genetic biomarkers of CRC, as well as the laboratory methods being used for each assay. The sensitivity of the different assays ranged from 2% to 100% and the specificity ranged from 81% to 100%. The meta-analysis found that the pooled sensitivities for single- and multigene assays were 48.0% and 77.8%, respectively, while the pooled specificities were 97.0% and 92.7%. Receiver operator curves and diagnostic odds ratios showed no significant difference between both tests with regard to sensitivity or specificity.This meta-analysis revealed that using assays that evaluated multiple genes compared with single-gene assays did not increase the sensitivity or specificity of stool DNA testing in detecting CRC.
Ishikawa, Akira
2017-11-27
Large numbers of quantitative trait loci (QTL) affecting complex diseases and other quantitative traits have been reported in humans and model animals. However, the genetic architecture of these traits remains elusive due to the difficulty in identifying causal quantitative trait genes (QTGs) for common QTL with relatively small phenotypic effects. A traditional strategy based on techniques such as positional cloning does not always enable identification of a single candidate gene for a QTL of interest because it is difficult to narrow down a target genomic interval of the QTL to a very small interval harboring only one gene. A combination of gene expression analysis and statistical causal analysis can greatly reduce the number of candidate genes. This integrated approach provides causal evidence that one of the candidate genes is a putative QTG for the QTL. Using this approach, I have recently succeeded in identifying a single putative QTG for resistance to obesity in mice. Here, I outline the integration approach and discuss its usefulness using my studies as an example.
Zuo, Erwei; Cai, Yi-Jun; Li, Kui; Wei, Yu; Wang, Bang-An; Sun, Yidi; Liu, Zhen; Liu, Jiwei; Hu, Xinde; Wei, Wei; Huo, Xiaona; Shi, Linyu; Tang, Cheng; Liang, Dan; Wang, Yan; Nie, Yan-Hong; Zhang, Chen-Chen; Yao, Xuan; Wang, Xing; Zhou, Changyang; Ying, Wenqin; Wang, Qifang; Chen, Ren-Chao; Shen, Qi; Xu, Guo-Liang; Li, Jinsong; Sun, Qiang; Xiong, Zhi-Qi; Yang, Hui
2017-07-01
The CRISPR/Cas9 system is an efficient gene-editing method, but the majority of gene-edited animals showed mosaicism, with editing occurring only in a portion of cells. Here we show that single gene or multiple genes can be completely knocked out in mouse and monkey embryos by zygotic injection of Cas9 mRNA and multiple adjacent single-guide RNAs (spaced 10-200 bp apart) that target only a single key exon of each gene. Phenotypic analysis of F0 mice following targeted deletion of eight genes on the Y chromosome individually demonstrated the robustness of this approach in generating knockout mice. Importantly, this approach delivers complete gene knockout at high efficiencies (100% on Arntl and 91% on Prrt2) in monkey embryos. Finally, we could generate a complete Prrt2 knockout monkey in a single step, demonstrating the usefulness of this approach in rapidly establishing gene-edited monkey models.
Multi-variant study of obesity risk genes in African Americans: The Jackson Heart Study.
Liu, Shijian; Wilson, James G; Jiang, Fan; Griswold, Michael; Correa, Adolfo; Mei, Hao
2016-11-30
Genome-wide association study (GWAS) has been successful in identifying obesity risk genes by single-variant association analysis. For this study, we designed steps of analysis strategy and aimed to identify multi-variant effects on obesity risk among candidate genes. Our analyses were focused on 2137 African American participants with body mass index measured in the Jackson Heart Study and 657 common single nucleotide polymorphisms (SNPs) genotyped at 8 GWAS-identified obesity risk genes. Single-variant association test showed that no SNPs reached significance after multiple testing adjustment. The following gene-gene interaction analysis, which was focused on SNPs with unadjusted p-value<0.10, identified 6 significant multi-variant associations. Logistic regression showed that SNPs in these associations did not have significant linear interactions; examination of genetic risk score evidenced that 4 multi-variant associations had significant additive effects of risk SNPs; and haplotype association test presented that all multi-variant associations contained one or several combinations of particular alleles or haplotypes, associated with increased obesity risk. Our study evidenced that obesity risk genes generated multi-variant effects, which can be additive or non-linear interactions, and multi-variant study is an important supplement to existing GWAS for understanding genetic effects of obesity risk genes. Copyright © 2016 Elsevier B.V. All rights reserved.
Single-feature polymorphism discovery in the barley transcriptome
Rostoks, Nils; Borevitz, Justin O; Hedley, Peter E; Russell, Joanne; Mudie, Sharon; Morris, Jenny; Cardle, Linda; Marshall, David F; Waugh, Robbie
2005-01-01
A probe-level model for analysis of GeneChip gene-expression data is presented which identified more than 10,000 single-feature polymorphisms (SFP) between two barley genotypes. The method has good sensitivity, as 67% of known single-nucleotide polymorphisms (SNP) were called as SFPs. This method is applicable to all oligonucleotide microarray data, accounts for SNP effects in gene-expression data and represents an efficient and versatile approach for highly parallel marker identification in large genomes. PMID:15960806
Marian, Ali J.; van Rooij, Eva; Roberts, Robert
2016-01-01
This is the first of 2 review papers on genetics and genomics appearing as part of the series on “omics.” Genomics pertains to all components of an organism’s genes, whereas genetics involves analysis of a specific gene(s) in the context of heredity. The paper provides introductory comments, describes the basis of human genetic diversity, and addresses the phenotypic consequences of genetic variants. Rare variants with large effect sizes are responsible for single-gene disorders, whereas complex polygenic diseases are typically due to multiple genetic variants, each exerting a modest effect size. To illustrate the clinical implications of genetic variants with large effect sizes, 3 common forms of hereditary cardiomyopathies are discussed as prototypic examples of single-gene disorders, including their genetics, clinical manifestations, pathogenesis, and treatment. The genetic basis of complex traits is discussed in a separate paper. PMID:28007145
Quantitative high-resolution genomic analysis of single cancer cells.
Hannemann, Juliane; Meyer-Staeckling, Sönke; Kemming, Dirk; Alpers, Iris; Joosse, Simon A; Pospisil, Heike; Kurtz, Stefan; Görndt, Jennifer; Püschel, Klaus; Riethdorf, Sabine; Pantel, Klaus; Brandt, Burkhard
2011-01-01
During cancer progression, specific genomic aberrations arise that can determine the scope of the disease and can be used as predictive or prognostic markers. The detection of specific gene amplifications or deletions in single blood-borne or disseminated tumour cells that may give rise to the development of metastases is of great clinical interest but technically challenging. In this study, we present a method for quantitative high-resolution genomic analysis of single cells. Cells were isolated under permanent microscopic control followed by high-fidelity whole genome amplification and subsequent analyses by fine tiling array-CGH and qPCR. The assay was applied to single breast cancer cells to analyze the chromosomal region centred by the therapeutical relevant EGFR gene. This method allows precise quantitative analysis of copy number variations in single cell diagnostics.
Isoform-level gene expression patterns in single-cell RNA-sequencing data.
Vu, Trung Nghia; Wills, Quin F; Kalari, Krishna R; Niu, Nifang; Wang, Liewei; Pawitan, Yudi; Rantalainen, Mattias
2018-02-27
RNA sequencing of single cells enables characterization of transcriptional heterogeneity in seemingly homogeneous cell populations. Single-cell sequencing has been applied in a wide range of researches fields. However, few studies have focus on characterization of isoform-level expression patterns at the single-cell level. In this study we propose and apply a novel method, ISOform-Patterns (ISOP), based on mixture modeling, to characterize the expression patterns of isoform pairs from the same gene in single-cell isoform-level expression data. We define six principal patterns of isoform expression relationships and describe a method for differential-pattern analysis. We demonstrate ISOP through analysis of single-cell RNA-sequencing data from a breast cancer cell line, with replication in three independent datasets. We assigned the pattern types to each of 16,562 isoform-pairs from 4,929 genes. Among those, 26% of the discovered patterns were significant (p<0.05), while remaining patterns are possibly effects of transcriptional bursting, drop-out and stochastic biological heterogeneity. Furthermore, 32% of genes discovered through differential-pattern analysis were not detected by differential-expression analysis. The effect of drop-out events, mean expression level, and properties of the expression distribution on the performances of ISOP were also investigated through simulated datasets. To conclude, ISOP provides a novel approach for characterization of isoformlevel preference, commitment and heterogeneity in single-cell RNA-sequencing data. The ISOP method has been implemented as a R package and is available at https://github.com/nghiavtr/ISOP under a GPL-3 license. mattias.rantalainen@ki.se. Supplementary data are available at Bioinformatics online.
Aoun, Meriem; Kolmer, James A; Rouse, Matthew N; Chao, Shiaoman; Bulbula, Worku Denbel; Elias, Elias M; Acevedo, Maricelis
2017-12-01
Leaf rust, caused by Puccinia triticina, and stem rust, caused by P. graminis f. sp. tritici, are important diseases of durum wheat. This study determined the inheritance and genomic locations of leaf rust resistance (Lr) genes to P. triticina race BBBQJ and stem rust resistance (Sr) genes to P. graminis f. sp. tritici race TTKSK in durum accessions. Eight leaf-rust-resistant genotypes were used to develop biparental populations. Accessions PI 192051 and PI 534304 were also resistant to P. graminis f. sp. tritici race TTKSK. The resulting progenies were phenotyped for leaf rust and stem rust response at seedling stage. The Lr and Sr genes were mapped in five populations using single-nucleotide polymorphisms and bulked segregant analysis. Five leaf-rust-resistant genotypes carried single dominant Lr genes whereas, in the remaining accessions, there was deviation from the expected segregation ratio of a single dominant Lr gene. Seven genotypes carried Lr genes different from those previously characterized in durum. The single dominant Lr genes in PI 209274, PI 244061, PI387263, and PI 313096 were mapped to chromosome arms 6BS, 2BS, 6BL, and 6BS, respectively. The Sr gene in PI 534304 mapped to 6AL and is most likely Sr13, while the Sr gene in PI 192051 could be uncharacterized in durum.
Zhao, Dejian; Lin, Mingyan; Pedrosa, Erika; Lachman, Herbert M; Zheng, Deyou
2017-11-10
Monoallelic expression of autosomal genes has been implicated in human psychiatric disorders. However, there is a paucity of allelic expression studies in human brain cells at the single cell and genome wide levels. In this report, we reanalyzed a previously published single-cell RNA-seq dataset from several postmortem human brains and observed pervasive monoallelic expression in individual cells, largely in a random manner. Examining single nucleotide variants with a predicted functional disruption, we found that the "damaged" alleles were overall expressed in fewer brain cells than their counterparts, and at a lower level in cells where their expression was detected. We also identified many brain cell type-specific monoallelically expressed genes. Interestingly, many of these cell type-specific monoallelically expressed genes were enriched for functions important for those brain cell types. In addition, function analysis showed that genes displaying monoallelic expression and correlated expression across neuronal cells from different individual brains were implicated in the regulation of synaptic function. Our findings suggest that monoallelic gene expression is prevalent in human brain cells, which may play a role in generating cellular identity and neuronal diversity and thus increasing the complexity and diversity of brain cell functions.
Direct observation of frequency modulated transcription in single cells using light activation
Larson, Daniel R; Fritzsch, Christoph; Sun, Liang; Meng, Xiuhau; Lawrence, David S; Singer, Robert H
2013-01-01
Single-cell analysis has revealed that transcription is dynamic and stochastic, but tools are lacking that can determine the mechanism operating at a single gene. Here we utilize single-molecule observations of RNA in fixed and living cells to develop a single-cell model of steroid-receptor mediated gene activation. We determine that steroids drive mRNA synthesis by frequency modulation of transcription. This digital behavior in single cells gives rise to the well-known analog dose response across the population. To test this model, we developed a light-activation technology to turn on a single steroid-responsive gene and follow dynamic synthesis of RNA from the activated locus. DOI: http://dx.doi.org/10.7554/eLife.00750.001 PMID:24069527
Zooplankton community analysis in the Changjiang River estuary by single-gene-targeted metagenomics
NASA Astrophysics Data System (ADS)
Cheng, Fangping; Wang, Minxiao; Li, Chaolun; Sun, Song
2014-07-01
DNA barcoding provides accurate identification of zooplankton species through all life stages. Single-gene-targeted metagenomic analysis based on DNA barcode databases can facilitate longterm monitoring of zooplankton communities. With the help of the available zooplankton databases, the zooplankton community of the Changjiang (Yangtze) River estuary was studied using a single-gene-targeted metagenomic method to estimate the species richness of this community. A total of 856 mitochondrial cytochrome oxidase subunit 1 (cox1) gene sequences were determined. The environmental barcodes were clustered into 70 molecular operational taxonomic units (MOTUs). Forty-two MOTUs matched barcoded marine organisms with more than 90% similarity and were assigned to either the species (similarity>96%) or genus level (similarity<96%). Sibling species could also be distinguished. Many species that were overlooked by morphological methods were identified by molecular methods, especially gelatinous zooplankton and merozooplankton that were likely sampled at different life history phases. Zooplankton community structures differed significantly among all of the samples. The MOTU spatial distributions were influenced by the ecological habits of the corresponding species. In conclusion, single-gene-targeted metagenomic analysis is a useful tool for zooplankton studies, with which specimens from all life history stages can be identified quickly and effectively with a comprehensive database.
Guedes, Ana M V; Henrique, Domingos; Abranches, Elsa
2016-01-01
Mouse Embryonic Stem cells (mESCs) show heterogeneous and dynamic expression of important pluripotency regulatory factors. Single-cell analysis has revealed the existence of cell-to-cell variability in the expression of individual genes in mESCs. Understanding how these heterogeneities are regulated and what their functional consequences are is crucial to obtain a more comprehensive view of the pluripotent state.In this chapter we describe how to analyze transcriptional heterogeneity by monitoring gene expression of Nanog, Oct4, and Sox2, using single-molecule RNA FISH in single mESCs grown in different cell culture medium. We describe in detail all the steps involved in the protocol, from RNA detection to image acquisition and processing, as well as exploratory data analysis.
Noninvasive prenatal diagnosis for single gene disorders.
Allen, Stephanie; Young, Elizabeth; Bowns, Benjamin
2017-04-01
Noninvasive prenatal diagnosis for single gene disorders is coming to fruition in its clinical utility. The presence of cell-free DNA in maternal plasma has been recognized for many years, and a number of applications have developed from this. Noninvasive prenatal diagnosis for single gene disorders has lagged behind due to complexities of technology development, lack of investment and the need for validation samples for rare disorders. Publications are emerging demonstrating a variety of technical approaches and feasibility of clinical application. Techniques for analysis of cell-free DNA including digital PCR, next-generation sequencing and relative haplotype dosage have been used most often for assay development. Analysis of circulating fetal cells in the maternal blood is still being investigated as a viable alternative and more recently transcervical trophoblast cells. Studies exploring ethical and social issues are generally positive but raise concerns around the routinization of prenatal testing. Further work is necessary to make testing available to all patients with a pregnancy at risk of a single gene disorder, and it remains to be seen if the development of more powerful technologies such as isolation and analysis of single cells will shift the emphasis of noninvasive prenatal diagnosis. As testing becomes possible for a wider range of conditions, more ethical questions will become relevant.
Single cell digital polymerase chain reaction on self-priming compartmentalization chip
Zhu, Qiangyuan; Qiu, Lin; Xu, Yanan; Li, Guang; Mu, Ying
2017-01-01
Single cell analysis provides a new framework for understanding biology and disease, however, an absolute quantification of single cell gene expression still faces many challenges. Microfluidic digital polymerase chain reaction (PCR) provides a unique method to absolutely quantify the single cell gene expression, but only limited devices are developed to analyze a single cell with detection variation. This paper describes a self-priming compartmentalization (SPC) microfluidic digital polymerase chain reaction chip being capable of performing single molecule amplification from single cell. The chip can be used to detect four single cells simultaneously with 85% of sample digitization. With the optimized protocol for the SPC chip, we first tested the ability, precision, and sensitivity of our SPC digital PCR chip by assessing β-actin DNA gene expression in 1, 10, 100, and 1000 cells. And the reproducibility of the SPC chip is evaluated by testing 18S rRNA of single cells with 1.6%–4.6% of coefficient of variation. At last, by detecting the lung cancer related genes, PLAU gene expression of A549 cells at the single cell level, the single cell heterogeneity was demonstrated. So, with the power-free, valve-free SPC chip, the gene copy number of single cells can be quantified absolutely with higher sensitivity, reduced labor time, and reagent. We expect that this chip will enable new studies for biology and disease. PMID:28191267
Single cell digital polymerase chain reaction on self-priming compartmentalization chip.
Zhu, Qiangyuan; Qiu, Lin; Xu, Yanan; Li, Guang; Mu, Ying
2017-01-01
Single cell analysis provides a new framework for understanding biology and disease, however, an absolute quantification of single cell gene expression still faces many challenges. Microfluidic digital polymerase chain reaction (PCR) provides a unique method to absolutely quantify the single cell gene expression, but only limited devices are developed to analyze a single cell with detection variation. This paper describes a self-priming compartmentalization (SPC) microfluidic digital polymerase chain reaction chip being capable of performing single molecule amplification from single cell. The chip can be used to detect four single cells simultaneously with 85% of sample digitization. With the optimized protocol for the SPC chip, we first tested the ability, precision, and sensitivity of our SPC digital PCR chip by assessing β-actin DNA gene expression in 1, 10, 100, and 1000 cells. And the reproducibility of the SPC chip is evaluated by testing 18S rRNA of single cells with 1.6%-4.6% of coefficient of variation. At last, by detecting the lung cancer related genes, PLAU gene expression of A549 cells at the single cell level, the single cell heterogeneity was demonstrated. So, with the power-free, valve-free SPC chip, the gene copy number of single cells can be quantified absolutely with higher sensitivity, reduced labor time, and reagent. We expect that this chip will enable new studies for biology and disease.
Quantitative High-Resolution Genomic Analysis of Single Cancer Cells
Hannemann, Juliane; Meyer-Staeckling, Sönke; Kemming, Dirk; Alpers, Iris; Joosse, Simon A.; Pospisil, Heike; Kurtz, Stefan; Görndt, Jennifer; Püschel, Klaus; Riethdorf, Sabine; Pantel, Klaus; Brandt, Burkhard
2011-01-01
During cancer progression, specific genomic aberrations arise that can determine the scope of the disease and can be used as predictive or prognostic markers. The detection of specific gene amplifications or deletions in single blood-borne or disseminated tumour cells that may give rise to the development of metastases is of great clinical interest but technically challenging. In this study, we present a method for quantitative high-resolution genomic analysis of single cells. Cells were isolated under permanent microscopic control followed by high-fidelity whole genome amplification and subsequent analyses by fine tiling array-CGH and qPCR. The assay was applied to single breast cancer cells to analyze the chromosomal region centred by the therapeutical relevant EGFR gene. This method allows precise quantitative analysis of copy number variations in single cell diagnostics. PMID:22140428
NASA Astrophysics Data System (ADS)
Tsyganov, M. M.; Ibragimova, M. K.; Karabut, I. V.; Freydin, M. B.; Choinzonov, E. L.; Litvyakov, N. V.
2015-11-01
Our previous research establishes that changes of expression of the ATP-binding cassette genes family is connected with the neoadjuvant chemotherapy effect. However, the mechanism of regulation of resistance gene expression remains unclear. As many researchers believe, single nucleotide polymorphisms can be involved in this process. Thereupon, microarray analysis is used to study polymorphisms in ATP-binding cassette genes. It is thus found that MDR gene expression is connected with 5 polymorphisms, i.e. rs241432, rs241429, rs241430, rs3784867, rs59409230, which participate in the regulation of expression of own genes.
Zhang, Jing; Zhang, Lu; Zhang, Yan; Yang, Jing; Guo, Mengbiao; Sun, Liangdan; Pan, Hai-Feng; Hirankarn, Nattiya; Ying, Dingge; Zeng, Shuai; Lee, Tsz Leung; Lau, Chak Sing; Chan, Tak Mao; Leung, Alexander Moon Ho; Mok, Chi Chiu; Wong, Sik Nin; Lee, Ka Wing; Ho, Marco Hok Kung; Lee, Pamela Pui Wah; Chung, Brian Hon-Yin; Chong, Chun Yin; Wong, Raymond Woon Sing; Mok, Mo Yin; Wong, Wilfred Hing Sang; Tong, Kwok Lung; Tse, Niko Kei Chiu; Li, Xiang-Pei; Avihingsanon, Yingyos; Rianthavorn, Pornpimol; Deekajorndej, Thavatchai; Suphapeetiporn, Kanya; Shotelersuk, Vorasuk; Ying, Shirley King Yee; Fung, Samuel Ka Shun; Lai, Wai Ming; Garcia-Barceló, Maria-Mercè; Cherny, Stacey S; Sham, Pak Chung; Cui, Yong; Yang, Sen; Ye, Dong Qing; Zhang, Xue-Jun; Lau, Yu Lung; Yang, Wanling
2015-11-01
Previous genome-wide association studies (GWAS), which were mainly based on single-variant analysis, have identified many systemic lupus erythematosus (SLE) susceptibility loci. However, the genetic architecture of this complex disease is far from being understood. The aim of this study was to investigate whether using a gene-based analysis may help to identify novel loci, by considering global evidence of association from a gene or a genomic region rather than focusing on evidence for individual variants. Based on the results of a meta-analysis of 2 GWAS of SLE conducted in 2 Asian cohorts, we performed an in-depth gene-based analysis followed by replication in a total of 4,626 patients and 7,466 control subjects of Asian ancestry. Differential allelic expression was measured by pyrosequencing. More than one-half of the reported SLE susceptibility loci showed evidence of independent effects, and this finding is important for understanding the mechanisms of association and explaining disease heritability. ANXA6 was detected as a novel SLE susceptibility gene, with several single-nucleotide polymorphisms (SNPs) contributing independently to the association with disease. The risk allele of rs11960458 correlated significantly with increased expression of ANXA6 in peripheral blood mononuclear cells from heterozygous healthy control subjects. Several other associated SNPs may also regulate ANXA6 expression, according to data obtained from public databases. Higher expression of ANXA6 in patients with SLE was also reported previously. Our study demonstrated the merit of using gene-based analysis to identify novel susceptibility loci, especially those with independent effects, and also demonstrated the widespread presence of loci with independent effects in SLE susceptibility genes. © 2015, American College of Rheumatology.
Exome Array Analysis of Nuclear Lens Opacity.
Loomis, Stephanie J; Klein, Alison P; Lee, Kristine E; Chen, Fei; Bomotti, Samantha; Truitt, Barbara; Iyengar, Sudha K; Klein, Ronald; Klein, Barbara E K; Duggal, Priya
2018-06-01
Nuclear cataract is the most common subtype of age-related cataract, the leading cause of blindness worldwide. It results from advanced nuclear sclerosis, or opacity in the center of the optic lens, and is affected by both genetic and environmental risk factors, including smoking. We sought to understand the genetic factors associated with nuclear sclerosis through interrogation of rare and low frequency coding variants using exome array data. We analyzed Illumina Human Exome Array data for 1,488 participants of European ancestry in the Beaver Dam Eye Study who were without cataract surgery for association with nuclear sclerosis grade, controlling for age and sex. We performed single-variant regression analysis for 32,138 variants with minor allele frequency (MAF) ≥0.003. In addition, gene-based analysis of 11,844 genes containing at least two variants with MAF < 0.05 was performed using a gene-based unified burden and non-burden sequence kernel association test (SKAT-O). Additionally, both single-variant and gene-based analyses were analyzed stratified by smoking status. No single-variant test was statistically significant after Bonferroni correction (p < 1.6 × 10 -6 ; top single nucleotide polymorphism (SNP): rs144458991, p = 2.83 × 10 -5 ). Gene-based tests were suggestively associated with the gene RNF149 overall (p = 8.29 × 10 -6 ) and among never smokers (N = 790, p = 2.67 × 10 -6 ). This study did not find a significant genetic association with nuclear sclerosis, the possible association with the RNF149 gene highlights a potential candidate gene for future studies that aim to understand the genetic architecture of nuclear sclerosis.
Single-Cell RNA-Sequencing: Assessment of Differential Expression Analysis Methods.
Dal Molin, Alessandra; Baruzzo, Giacomo; Di Camillo, Barbara
2017-01-01
The sequencing of the transcriptomes of single-cells, or single-cell RNA-sequencing, has now become the dominant technology for the identification of novel cell types and for the study of stochastic gene expression. In recent years, various tools for analyzing single-cell RNA-sequencing data have been proposed, many of them with the purpose of performing differentially expression analysis. In this work, we compare four different tools for single-cell RNA-sequencing differential expression, together with two popular methods originally developed for the analysis of bulk RNA-sequencing data, but largely applied to single-cell data. We discuss results obtained on two real and one synthetic dataset, along with considerations about the perspectives of single-cell differential expression analysis. In particular, we explore the methods performance in four different scenarios, mimicking different unimodal or bimodal distributions of the data, as characteristic of single-cell transcriptomics. We observed marked differences between the selected methods in terms of precision and recall, the number of detected differentially expressed genes and the overall performance. Globally, the results obtained in our study suggest that is difficult to identify a best performing tool and that efforts are needed to improve the methodologies for single-cell RNA-sequencing data analysis and gain better accuracy of results.
A single-molecule view of gene regulation in cancer
NASA Astrophysics Data System (ADS)
Larson, Daniel
2013-03-01
Single-cell analysis has revealed that transcription is dynamic and stochastic, but tools are lacking that can determine the mechanism operating at a single gene. Here we utilize single-molecule observations of RNA in fixed and living cells to develop a single-cell model of steroid-receptor mediated gene activation. Steroid receptors coordinate a diverse range of responses in higher eukaryotes and are involved in a wide range of human diseases, including cancer. Steroid receptor response elements are present throughout the human genome and modulate chromatin remodeling and transcription in both a local and long-range fashion. As such, steroid receptor-mediated transcription is a paradigm of genetic control in the metazoan nucleus. Moreover, the ligand-dependent nature of these transcription factors makes them appealing targets for therapeutic intervention, necessitating a quantitative understanding of how receptors control output from target genes. We determine that steroids drive mRNA synthesis by frequency modulation of transcription. This digital behavior in single cells gives rise to the well-known analog dose response across the population. To test this model, we developed a light-activation technology to turn on a single gene and follow dynamic synthesis of RNA from the activated locus. The response delay is a measure of time required for chromatin remodeling at a single gene.
Highly Multiplexed, Single Cell Transcriptomic Analysis of T-Cells by Microfluidic PCR.
Dominguez, Maria; Roederer, Mario; Chattopadhyay, Pratip K
2017-01-01
Recently, technologies have been developed to measure expression of 96 (or more) mRNA transcripts at once from a single cell. Here we describe methods and important considerations for use of Fluidigm's BioMark platform for multiplexed single cell gene expression. We describe how to qualify primer/probes, select genes to examine in 96-parameter panels, perform the reverse transcription/cDNA synthesis step, and operate the instrument. In addition, we describe data analysis considerations. This technology has enormous value for characterizing the heterogeneity of T-cells, thereby providing a useful tool for immune monitoring.
BASiCS: Bayesian Analysis of Single-Cell Sequencing Data
Vallejos, Catalina A.; Marioni, John C.; Richardson, Sylvia
2015-01-01
Single-cell mRNA sequencing can uncover novel cell-to-cell heterogeneity in gene expression levels in seemingly homogeneous populations of cells. However, these experiments are prone to high levels of unexplained technical noise, creating new challenges for identifying genes that show genuine heterogeneous expression within the population of cells under study. BASiCS (Bayesian Analysis of Single-Cell Sequencing data) is an integrated Bayesian hierarchical model where: (i) cell-specific normalisation constants are estimated as part of the model parameters, (ii) technical variability is quantified based on spike-in genes that are artificially introduced to each analysed cell’s lysate and (iii) the total variability of the expression counts is decomposed into technical and biological components. BASiCS also provides an intuitive detection criterion for highly (or lowly) variable genes within the population of cells under study. This is formalised by means of tail posterior probabilities associated to high (or low) biological cell-to-cell variance contributions, quantities that can be easily interpreted by users. We demonstrate our method using gene expression measurements from mouse Embryonic Stem Cells. Cross-validation and meaningful enrichment of gene ontology categories within genes classified as highly (or lowly) variable supports the efficacy of our approach. PMID:26107944
BASiCS: Bayesian Analysis of Single-Cell Sequencing Data.
Vallejos, Catalina A; Marioni, John C; Richardson, Sylvia
2015-06-01
Single-cell mRNA sequencing can uncover novel cell-to-cell heterogeneity in gene expression levels in seemingly homogeneous populations of cells. However, these experiments are prone to high levels of unexplained technical noise, creating new challenges for identifying genes that show genuine heterogeneous expression within the population of cells under study. BASiCS (Bayesian Analysis of Single-Cell Sequencing data) is an integrated Bayesian hierarchical model where: (i) cell-specific normalisation constants are estimated as part of the model parameters, (ii) technical variability is quantified based on spike-in genes that are artificially introduced to each analysed cell's lysate and (iii) the total variability of the expression counts is decomposed into technical and biological components. BASiCS also provides an intuitive detection criterion for highly (or lowly) variable genes within the population of cells under study. This is formalised by means of tail posterior probabilities associated to high (or low) biological cell-to-cell variance contributions, quantities that can be easily interpreted by users. We demonstrate our method using gene expression measurements from mouse Embryonic Stem Cells. Cross-validation and meaningful enrichment of gene ontology categories within genes classified as highly (or lowly) variable supports the efficacy of our approach.
Ocular findings associated with a Cys39Arg mutation in the Norrie disease gene.
Joos, K M; Kimura, A E; Vandenburgh, K; Bartley, J A; Stone, E M
1994-12-01
To diagnose the carriers and noncarriers in a family affected with Norrie disease based on molecular analysis. Family members from three generations, including one affected patient, two obligate carriers, one carrier identified with linkage analysis, one noncarrier identified with linkage analysis, and one female family member with indeterminate carrier status, were examined clinically and electrophysiologically. Linkage analysis had previously failed to determine the carrier status of one female family member in the third generation. Blood samples were screened for mutations in the Norrie disease gene with single-strand conformation polymorphism analysis. The mutation was characterized by dideoxy-termination sequencing. Ophthalmoscopy and electroretinographic examination failed to detect the carrier state. The affected individuals and carriers in this family were found to have a transition from thymidine to cytosine in the first nucleotide of codon 39 of the Norrie disease gene, causing a cysteine-to-arginine mutation. Single-strand conformation polymorphism analysis identified a patient of indeterminate status (by linkage) to be a noncarrier of Norrie disease. Ophthalmoscopy and electroretinography could not identify carriers of this Norrie disease mutation. Single-strand conformation polymorphism analysis was more sensitive and specific than linkage analysis in identifying carriers in this family.
The nif Gene Operon of the Methanogenic Archaeon Methanococcus maripaludis
Kessler, Peter S.; Blank, Carrine; Leigh, John A.
1998-01-01
Nitrogen fixation occurs in two domains, Archaea and Bacteria. We have characterized a nif (nitrogen fixation) gene cluster in the methanogenic archaeon Methanococcus maripaludis. Sequence analysis revealed eight genes, six with sequence similarity to known nif genes and two with sequence similarity to glnB. The gene order, nifH, ORF105 (similar to glnB), ORF121 (similar to glnB), nifD, nifK, nifE, nifN, and nifX, was the same as that found in part in other diazotrophic methanogens and except for the presence of the glnB-like genes, also resembled the order found in many members of the Bacteria. Using transposon insertion mutagenesis, we determined that an 8-kb region required for nitrogen fixation corresponded to the nif gene cluster. Northern analysis revealed the presence of either a single 7.6-kb nif mRNA transcript or 10 smaller mRNA species containing portions of the large transcript. Polar effects of transposon insertions demonstrated that all of these mRNAs arose from a single promoter region, where transcription initiated 80 bp 5′ to nifH. Distinctive features of the nif gene cluster include the presence of the six primary nif genes in a single operon, the placement of the two glnB-like genes within the cluster, the apparent physical separation of the cluster from any other nif genes that might be in the genome, the fragmentation pattern of the mRNA, and the regulation of expression by a repression mechanism described previously. Our study and others with methanogenic archaea reporting multiple mRNAs arising from gene clusters with only a single putative promoter sequence suggest that mRNA processing following transcription may be a common occurrence in methanogens. PMID:9515920
NASA Astrophysics Data System (ADS)
Streets, Aaron M.; Cao, Chen; Zhang, Xiannian; Huang, Yanyi
2016-03-01
Phenotype classification of single cells reveals biological variation that is masked in ensemble measurement. This heterogeneity is found in gene and protein expression as well as in cell morphology. Many techniques are available to probe phenotypic heterogeneity at the single cell level, for example quantitative imaging and single-cell RNA sequencing, but it is difficult to perform multiple assays on the same single cell. In order to directly track correlation between morphology and gene expression at the single cell level, we developed a microfluidic platform for quantitative coherent Raman imaging and immediate RNA sequencing (RNA-Seq) of single cells. With this device we actively sort and trap cells for analysis with stimulated Raman scattering microscopy (SRS). The cells are then processed in parallel pipelines for lysis, and preparation of cDNA for high-throughput transcriptome sequencing. SRS microscopy offers three-dimensional imaging with chemical specificity for quantitative analysis of protein and lipid distribution in single cells. Meanwhile, the microfluidic platform facilitates single-cell manipulation, minimizes contamination, and furthermore, provides improved RNA-Seq detection sensitivity and measurement precision, which is necessary for differentiating biological variability from technical noise. By combining coherent Raman microscopy with RNA sequencing, we can better understand the relationship between cellular morphology and gene expression at the single-cell level.
Quantitative gene expression analysis in Caenorhabditis elegans using single molecule RNA FISH.
Bolková, Jitka; Lanctôt, Christian
2016-04-01
Advances in fluorescent probe design and synthesis have allowed the uniform in situ labeling of individual RNA molecules. In a technique referred to as single molecule RNA FISH (smRNA FISH), the labeled RNA molecules can be imaged as diffraction-limited spots and counted using image analysis algorithms. Single RNA counting has provided valuable insights into the process of gene regulation. This microscopy-based method has often revealed a high cell-to-cell variability in expression levels, which has in turn led to a growing interest in investigating the biological significance of gene expression noise. Here we describe the application of the smRNA FISH technique to samples of Caenorhabditis elegans, a well-characterized model organism. Copyright © 2015 Elsevier Inc. All rights reserved.
Modeling Bi-modality Improves Characterization of Cell Cycle on Gene Expression in Single Cells
Danaher, Patrick; Finak, Greg; Krouse, Michael; Wang, Alice; Webster, Philippa; Beechem, Joseph; Gottardo, Raphael
2014-01-01
Advances in high-throughput, single cell gene expression are allowing interrogation of cell heterogeneity. However, there is concern that the cell cycle phase of a cell might bias characterizations of gene expression at the single-cell level. We assess the effect of cell cycle phase on gene expression in single cells by measuring 333 genes in 930 cells across three phases and three cell lines. We determine each cell's phase non-invasively without chemical arrest and use it as a covariate in tests of differential expression. We observe bi-modal gene expression, a previously-described phenomenon, wherein the expression of otherwise abundant genes is either strongly positive, or undetectable within individual cells. This bi-modality is likely both biologically and technically driven. Irrespective of its source, we show that it should be modeled to draw accurate inferences from single cell expression experiments. To this end, we propose a semi-continuous modeling framework based on the generalized linear model, and use it to characterize genes with consistent cell cycle effects across three cell lines. Our new computational framework improves the detection of previously characterized cell-cycle genes compared to approaches that do not account for the bi-modality of single-cell data. We use our semi-continuous modelling framework to estimate single cell gene co-expression networks. These networks suggest that in addition to having phase-dependent shifts in expression (when averaged over many cells), some, but not all, canonical cell cycle genes tend to be co-expressed in groups in single cells. We estimate the amount of single cell expression variability attributable to the cell cycle. We find that the cell cycle explains only 5%–17% of expression variability, suggesting that the cell cycle will not tend to be a large nuisance factor in analysis of the single cell transcriptome. PMID:25032992
Single gene and gene interaction effects on fertilization and embryonic survival rates in cattle.
Khatib, H; Huang, W; Wang, X; Tran, A H; Bindrim, A B; Schutzkus, V; Monson, R L; Yandell, B S
2009-05-01
Decrease in fertility and conception rates is a major cause of economic loss and cow culling in dairy herds. Conception rate is the product of fertilization rate and embryonic survival rate. Identification of genetic factors that cause the death of embryos is the first step in eliminating this problem from the population and thereby increasing reproductive efficiency. A candidate pathway approach was used to identify candidate genes affecting fertilization and embryo survival rates using an in vitro fertilization experimental system. A total of 7,413 in vitro fertilizations were performed using oocytes from 504 ovaries and semen samples from 10 different bulls. Fertilization rate was calculated as the number of cleaved embryos 48 h postfertilization out of the total number of oocytes exposed to sperm. Survival rate of embryos was calculated as the number of blastocysts on d 7 of development out of the number of total embryos cultured. All ovaries were genotyped for 8 genes in the POU1F1 signaling pathway. Single-gene analysis revealed significant associations of GHR, PRLR, STAT5A, and UTMP with survival rate and of POU1F1, GHR, STAT5A, and OPN with fertilization rate. To further characterize the contribution of the entire integrated POU1F1 pathway to fertilization and early embryonic survival, a model selection procedure was applied. Comparisons among the different models showed that interactions between adjacent genes in the pathway revealed a significant contribution to the variation in fertility traits compared with other models that analyzed only bull information or only genes without interactions. Moreover, some genes that were not significant in the single-gene analysis showed significant effects in the interaction analysis. Thus, we propose that single genes as well as an entire pathway can be used in selection programs to improve reproduction performance in dairy cattle.
Determining Physical Mechanisms of Gene Expression Regulation from Single Cell Gene Expression Data.
Ezer, Daphne; Moignard, Victoria; Göttgens, Berthold; Adryan, Boris
2016-08-01
Many genes are expressed in bursts, which can contribute to cell-to-cell heterogeneity. It is now possible to measure this heterogeneity with high throughput single cell gene expression assays (single cell qPCR and RNA-seq). These experimental approaches generate gene expression distributions which can be used to estimate the kinetic parameters of gene expression bursting, namely the rate that genes turn on, the rate that genes turn off, and the rate of transcription. We construct a complete pipeline for the analysis of single cell qPCR data that uses the mathematics behind bursty expression to develop more accurate and robust algorithms for analyzing the origin of heterogeneity in experimental samples, specifically an algorithm for clustering cells by their bursting behavior (Simulated Annealing for Bursty Expression Clustering, SABEC) and a statistical tool for comparing the kinetic parameters of bursty expression across populations of cells (Estimation of Parameter changes in Kinetics, EPiK). We applied these methods to hematopoiesis, including a new single cell dataset in which transcription factors (TFs) involved in the earliest branchpoint of blood differentiation were individually up- and down-regulated. We could identify two unique sub-populations within a seemingly homogenous group of hematopoietic stem cells. In addition, we could predict regulatory mechanisms controlling the expression levels of eighteen key hematopoietic transcription factors throughout differentiation. Detailed information about gene regulatory mechanisms can therefore be obtained simply from high throughput single cell gene expression data, which should be widely applicable given the rapid expansion of single cell genomics.
Wiengkum, Thanatcha; Srithep, Sarinee; Chainoi, Isarapong; Singboottra, Panthong; Wongwiwatthananukit, Sanchai
2011-01-01
Background Prevention and control of thalassemia requires simple, rapid, and accurate screening tests for carrier couples who are at risk of conceiving fetuses with severe thalassemia. Methods Single-tube multiplex real-time PCR with SYBR Green1 and high-resolution melting (HRM) analysis were used for the identification of α-thalassemia-1 Southeast Asian (SEA) and Thai type deletions and β-thalassemia 3.5-kb gene deletion. The results were compared with those obtained using conventional gap-PCR. DNA samples were derived from 28 normal individuals, 11 individuals with α-thalassemia-1 SEA type deletion, 2 with α-thalassemia-1 Thai type deletion, and 2 with heterozygous β-thalassemia 3.5-kb gene deletion. Results HRM analysis indicated that the amplified fragments from α-thalassemia-1 SEA type deletion, α-thalassemia-1 Thai type deletion, β-thalassemia 3.5-kb gene deletion, and the wild-type β-globin gene had specific peak heights at mean melting temperature (Tm) values of 86.89℃, 85.66℃, 77.24℃, and 74.92℃, respectively. The results obtained using single-tube multiplex real-time PCR with SYBR Green1 and HRM analysis showed 100% consistency with those obtained using conventional gap-PCR. Conclusions Single-tube multiplex real-time PCR with SYBR Green1 and HRM analysis is a potential alternative for routine clinical screening of the common types of α- and β-thalassemia large gene deletions, since it is simple, cost-effective, and highly accurate. PMID:21779184
Yue, M; Tian, Y G; Wang, Y J; Gu, Y; Bayaer, N; Hu, Q; Gu, W W
2014-02-27
The IGF-1 gene is an important regulating factor that has a growth-promoting effect on growth hormone. The IGF-1 gene promotes muscle cell differentiation in the muscle cell formation process. The IGF-1 gene also regulates the growth of skeletal muscle during skeletal muscle growth. In addition, the IGF-1 gene plays an important role in the formation of mammals and poultry embryos, and the process of postnatal growth. The IGF-1 gene has been implicated as a candidate gene for the regulation of pig growth traits. We analyzed exon 3 of the IGF-1 gene polymorphism in Tibetan miniature pigs (N = 128) by polymerase chain reaction-single-strand conformation polymorphism and DNA sequencing. One single nucleotide polymorphism (T40C) was found on exon 3 of the IGF-1 gene. Statistical analysis of genotype frequencies revealed that the T allele was dominant in Tibetan miniature pigs at the T40C locus. The association analysis showed that the IGF-1 mutation had an effect on the body weight, body length, and chest circumference of pigs aged 6-8 months. In addition, the IGF-1 mutation had an effect on body weight in pigs aged 9-11 months (P < 0.05). We speculated that the pigs with the TT genotype grow more rapidly compared to those with the TC genotype. The TC genotype of the Tibetan miniature pig has a smaller body type. This information provides a theoretical basis for the genetic background of Tibetan miniature pigs.
Analysis of Single-cell Gene Transcription by RNA Fluorescent In Situ Hybridization (FISH)
Ronander, Elena; Bengtsson, Dominique C.; Joergensen, Louise; Jensen, Anja T. R.; Arnot, David E.
2012-01-01
Adhesion of Plasmodium falciparum infected erythrocytes (IE) to human endothelial receptors during malaria infections is mediated by expression of PfEMP1 protein variants encoded by the var genes. The haploid P. falciparum genome harbors approximately 60 different var genes of which only one has been believed to be transcribed per cell at a time during the blood stage of the infection. How such mutually exclusive regulation of var gene transcription is achieved is unclear, as is the identification of individual var genes or sub-groups of var genes associated with different receptors and the consequence of differential binding on the clinical outcome of P. falciparum infections. Recently, the mutually exclusive transcription paradigm has been called into doubt by transcription assays based on individual P. falciparum transcript identification in single infected erythrocytic cells using RNA fluorescent in situ hybridization (FISH) analysis of var gene transcription by the parasite in individual nuclei of P. falciparum IE1. Here, we present a detailed protocol for carrying out the RNA-FISH methodology for analysis of var gene transcription in single-nuclei of P. falciparum infected human erythrocytes. The method is based on the use of digoxigenin- and biotin- labeled antisense RNA probes using the TSA Plus Fluorescence Palette System2 (Perkin Elmer), microscopic analyses and freshly selected P. falciparum IE. The in situ hybridization method can be used to monitor transcription and regulation of a variety of genes expressed during the different stages of the P. falciparum life cycle and is adaptable to other malaria parasite species and other organisms and cell types. PMID:23070076
A multicolor panel of novel lentiviral "gene ontology" (LeGO) vectors for functional gene analysis.
Weber, Kristoffer; Bartsch, Udo; Stocking, Carol; Fehse, Boris
2008-04-01
Functional gene analysis requires the possibility of overexpression, as well as downregulation of one, or ideally several, potentially interacting genes. Lentiviral vectors are well suited for this purpose as they ensure stable expression of complementary DNAs (cDNAs), as well as short-hairpin RNAs (shRNAs), and can efficiently transduce a wide spectrum of cell targets when packaged within the coat proteins of other viruses. Here we introduce a multicolor panel of novel lentiviral "gene ontology" (LeGO) vectors designed according to the "building blocks" principle. Using a wide spectrum of different fluorescent markers, including drug-selectable enhanced green fluorescent protein (eGFP)- and dTomato-blasticidin-S resistance fusion proteins, LeGO vectors allow simultaneous analysis of multiple genes and shRNAs of interest within single, easily identifiable cells. Furthermore, each functional module is flanked by unique cloning sites, ensuring flexibility and individual optimization. The efficacy of these vectors for analyzing multiple genes in a single cell was demonstrated in several different cell types, including hematopoietic, endothelial, and neural stem and progenitor cells, as well as hepatocytes. LeGO vectors thus represent a valuable tool for investigating gene networks using conditional ectopic expression and knock-down approaches simultaneously.
DEIVA: a web application for interactive visual analysis of differential gene expression profiles.
Harshbarger, Jayson; Kratz, Anton; Carninci, Piero
2017-01-07
Differential gene expression (DGE) analysis is a technique to identify statistically significant differences in RNA abundance for genes or arbitrary features between different biological states. The result of a DGE test is typically further analyzed using statistical software, spreadsheets or custom ad hoc algorithms. We identified a need for a web-based system to share DGE statistical test results, and locate and identify genes in DGE statistical test results with a very low barrier of entry. We have developed DEIVA, a free and open source, browser-based single page application (SPA) with a strong emphasis on being user friendly that enables locating and identifying single or multiple genes in an immediate, interactive, and intuitive manner. By design, DEIVA scales with very large numbers of users and datasets. Compared to existing software, DEIVA offers a unique combination of design decisions that enable inspection and analysis of DGE statistical test results with an emphasis on ease of use.
Schürch, A C; Arredondo-Alonso, S; Willems, R J L; Goering, R V
2018-04-01
Whole genome sequence (WGS)-based strain typing finds increasing use in the epidemiologic analysis of bacterial pathogens in both public health as well as more localized infection control settings. This minireview describes methodologic approaches that have been explored for WGS-based epidemiologic analysis and considers the challenges and pitfalls of data interpretation. Personal collection of relevant publications. When applying WGS to study the molecular epidemiology of bacterial pathogens, genomic variability between strains is translated into measures of distance by determining single nucleotide polymorphisms in core genome alignments or by indexing allelic variation in hundreds to thousands of core genes, assigning types to unique allelic profiles. Interpreting isolate relatedness from these distances is highly organism specific, and attempts to establish species-specific cutoffs are unlikely to be generally applicable. In cases where single nucleotide polymorphism or core gene typing do not provide the resolution necessary for accurate assessment of the epidemiology of bacterial pathogens, inclusion of accessory gene or plasmid sequences may provide the additional required discrimination. As with all epidemiologic analysis, realizing the full potential of the revolutionary advances in WGS-based approaches requires understanding and dealing with issues related to the fundamental steps of data generation and interpretation. Copyright © 2018 The Authors. Published by Elsevier Ltd.. All rights reserved.
Nakad, Rania; Snoek, L Basten; Yang, Wentao; Ellendt, Sunna; Schneider, Franziska; Mohr, Timm G; Rösingh, Lone; Masche, Anna C; Rosenstiel, Philip C; Dierking, Katja; Kammenga, Jan E; Schulenburg, Hinrich
2016-04-11
The invertebrate immune system comprises physiological mechanisms, physical barriers and also behavioral responses. It is generally related to the vertebrate innate immune system and widely believed to provide nonspecific defense against pathogens, whereby the response to different pathogen types is usually mediated by distinct signalling cascades. Recent work suggests that invertebrate immune defense can be more specific at least at the phenotypic level. The underlying genetic mechanisms are as yet poorly understood. We demonstrate in the model invertebrate Caenorhabditis elegans that a single gene, a homolog of the mammalian neuropeptide Y receptor gene, npr-1, mediates contrasting defense phenotypes towards two distinct pathogens, the Gram-positive Bacillus thuringiensis and the Gram-negative Pseudomonas aeruginosa. Our findings are based on combining quantitative trait loci (QTLs) analysis with functional genetic analysis and RNAseq-based transcriptomics. The QTL analysis focused on behavioral immune defense against B. thuringiensis, using recombinant inbred lines (RILs) and introgression lines (ILs). It revealed several defense QTLs, including one on chromosome X comprising the npr-1 gene. The wildtype N2 allele for the latter QTL was associated with reduced defense against B. thuringiensis and thus produced an opposite phenotype to that previously reported for the N2 npr-1 allele against P. aeruginosa. Analysis of npr-1 mutants confirmed these contrasting immune phenotypes for both avoidance behavior and nematode survival. Subsequent transcriptional profiling of C. elegans wildtype and npr-1 mutant suggested that npr-1 mediates defense against both pathogens through p38 MAPK signaling, insulin-like signaling, and C-type lectins. Importantly, increased defense towards P. aeruginosa seems to be additionally influenced through the induction of oxidative stress genes and activation of GATA transcription factors, while the repression of oxidative stress genes combined with activation of Ebox transcription factors appears to enhance susceptibility to B. thuringiensis. Our findings highlight the role of a single gene, npr-1, in fine-tuning nematode immune defense, showing the ability of the invertebrate immune system to produce highly specialized and potentially opposing immune responses via single regulatory genes.
Single-Copy Genes as Molecular Markers for Phylogenomic Studies in Seed Plants
De La Torre, Amanda R.; Sterck, Lieven; Cánovas, Francisco M.; Avila, Concepción; Merino, Irene; Cabezas, José Antonio; Cervera, María Teresa; Ingvarsson, Pär K.
2017-01-01
Phylogenetic relationships among seed plant taxa, especially within the gymnosperms, remain contested. In contrast to angiosperms, for which several genomic, transcriptomic and phylogenetic resources are available, there are few, if any, molecular markers that allow broad comparisons among gymnosperm species. With few gymnosperm genomes available, recently obtained transcriptomes in gymnosperms are a great addition to identifying single-copy gene families as molecular markers for phylogenomic analysis in seed plants. Taking advantage of an increasing number of available genomes and transcriptomes, we identified single-copy genes in a broad collection of seed plants and used these to infer phylogenetic relationships between major seed plant taxa. This study aims at extending the current phylogenetic toolkit for seed plants, assessing its ability for resolving seed plant phylogeny, and discussing potential factors affecting phylogenetic reconstruction. In total, we identified 3,072 single-copy genes in 31 gymnosperms and 2,156 single-copy genes in 34 angiosperms. All studied seed plants shared 1,469 single-copy genes, which are generally involved in functions like DNA metabolism, cell cycle, and photosynthesis. A selected set of 106 single-copy genes provided good resolution for the seed plant phylogeny except for gnetophytes. Although some of our analyses support a sister relationship between gnetophytes and other gymnosperms, phylogenetic trees from concatenated alignments without 3rd codon positions and amino acid alignments under the CAT + GTR model, support gnetophytes as a sister group to Pinaceae. Our phylogenomic analyses demonstrate that, in general, single-copy genes can uncover both recent and deep divergences of seed plant phylogeny. PMID:28460034
MAGMA: Generalized Gene-Set Analysis of GWAS Data
de Leeuw, Christiaan A.; Mooij, Joris M.; Heskes, Tom; Posthuma, Danielle
2015-01-01
By aggregating data for complex traits in a biologically meaningful way, gene and gene-set analysis constitute a valuable addition to single-marker analysis. However, although various methods for gene and gene-set analysis currently exist, they generally suffer from a number of issues. Statistical power for most methods is strongly affected by linkage disequilibrium between markers, multi-marker associations are often hard to detect, and the reliance on permutation to compute p-values tends to make the analysis computationally very expensive. To address these issues we have developed MAGMA, a novel tool for gene and gene-set analysis. The gene analysis is based on a multiple regression model, to provide better statistical performance. The gene-set analysis is built as a separate layer around the gene analysis for additional flexibility. This gene-set analysis also uses a regression structure to allow generalization to analysis of continuous properties of genes and simultaneous analysis of multiple gene sets and other gene properties. Simulations and an analysis of Crohn’s Disease data are used to evaluate the performance of MAGMA and to compare it to a number of other gene and gene-set analysis tools. The results show that MAGMA has significantly more power than other tools for both the gene and the gene-set analysis, identifying more genes and gene sets associated with Crohn’s Disease while maintaining a correct type 1 error rate. Moreover, the MAGMA analysis of the Crohn’s Disease data was found to be considerably faster as well. PMID:25885710
MAGMA: generalized gene-set analysis of GWAS data.
de Leeuw, Christiaan A; Mooij, Joris M; Heskes, Tom; Posthuma, Danielle
2015-04-01
By aggregating data for complex traits in a biologically meaningful way, gene and gene-set analysis constitute a valuable addition to single-marker analysis. However, although various methods for gene and gene-set analysis currently exist, they generally suffer from a number of issues. Statistical power for most methods is strongly affected by linkage disequilibrium between markers, multi-marker associations are often hard to detect, and the reliance on permutation to compute p-values tends to make the analysis computationally very expensive. To address these issues we have developed MAGMA, a novel tool for gene and gene-set analysis. The gene analysis is based on a multiple regression model, to provide better statistical performance. The gene-set analysis is built as a separate layer around the gene analysis for additional flexibility. This gene-set analysis also uses a regression structure to allow generalization to analysis of continuous properties of genes and simultaneous analysis of multiple gene sets and other gene properties. Simulations and an analysis of Crohn's Disease data are used to evaluate the performance of MAGMA and to compare it to a number of other gene and gene-set analysis tools. The results show that MAGMA has significantly more power than other tools for both the gene and the gene-set analysis, identifying more genes and gene sets associated with Crohn's Disease while maintaining a correct type 1 error rate. Moreover, the MAGMA analysis of the Crohn's Disease data was found to be considerably faster as well.
Dolezal, Tomas; Gazi, Michal; Zurovec, Michal; Bryant, Peter J
2003-10-01
Many Drosophila genes exist as members of multigene families and within each family the members can be functionally redundant, making it difficult to identify them by classical mutagenesis techniques based on phenotypic screening. We have addressed this problem in a genetic analysis of a novel family of six adenosine deaminase-related growth factors (ADGFs). We used ends-in targeting to introduce mutations into five of the six ADGF genes, taking advantage of the fact that five of the family members are encoded by a three-gene cluster and a two-gene cluster. We used two targeting constructs to introduce loss-of-function mutations into all five genes, as well as to isolate different combinations of multiple mutations, independent of phenotypic consequences. The results show that (1) it is possible to use ends-in targeting to disrupt gene clusters; (2) gene conversion, which is usually considered a complication in gene targeting, can be used to help recover different mutant combinations in a single screening procedure; (3) the reduction of duplication to a single copy by induction of a double-strand break is better explained by the single-strand annealing mechanism than by simple crossing over between repeats; and (4) loss of function of the most abundantly expressed family member (ADGF-A) leads to disintegration of the fat body and the development of melanotic tumors in mutant larvae.
Analysis of a genome-wide set of gene deletions in the fission yeast Schizosaccharomyces pombe
Duhig, Trevor; Nam, Miyoung; Palmer, Georgia; Han, Sangjo; Jeffery, Linda; Baek, Seung-Tae; Lee, Hyemi; Shim, Young Sam; Lee, Minho; Kim, Lila; Heo, Kyung-Sun; Noh, Eun Joo; Lee, Ah-Reum; Jang, Young-Joo; Chung, Kyung-Sook; Choi, Shin-Jung; Park, Jo-Young; Park, Youngwoo; Kim, Hwan Mook; Park, Song-Kyu; Park, Hae-Joon; Kang, Eun-Jung; Kim, Hyong Bai; Kang, Hyun-Sam; Park, Hee-Moon; Kim, Kyunghoon; Song, Kiwon; Song, Kyung Bin; Nurse, Paul; Hoe, Kwang-Lae
2014-01-01
SUMMARY We report the construction and analysis of 4,836 heterozygous diploid deletion mutants covering 98.4% of the fission yeast genome. This resource provides a powerful tool for biotechnological and eukaryotic cell biology research. Comprehensive gene dispensability comparisons with budding yeast, the first time such studies have been possible between two eukaryotes, revealed that 83% of single copy orthologues in the two yeasts had conserved dispensability. Gene dispensability differed for certain pathways between the two yeasts, including mitochondrial translation and cell cycle checkpoint control. We show that fission yeast has more essential genes than budding yeast and that essential genes are more likely than non-essential genes to be single copy, broadly conserved and to contain introns. Growth fitness analyses determined sets of haploinsufficient and haploproficient genes for fission yeast, and comparisons with budding yeast identified specific ribosomal proteins and RNA polymerase subunits, which may act more generally to regulate eukaryotic cell growth. PMID:20473289
Nguyen, Thong T; Suryamohan, Kushal; Kuriakose, Boney; Janakiraman, Vasantharajan; Reichelt, Mike; Chaudhuri, Subhra; Guillory, Joseph; Divakaran, Neethu; Rabins, P E; Goel, Ridhi; Deka, Bhabesh; Sarkar, Suman; Ekka, Preety; Tsai, Yu-Chih; Vargas, Derek; Santhosh, Sam; Mohan, Sangeetha; Chin, Chen-Shan; Korlach, Jonas; Thomas, George; Babu, Azariah; Seshagiri, Somasekar
2018-06-12
We sequenced the Hyposidra talaca NPV (HytaNPV) double stranded circular DNA genome using PacBio single molecule sequencing technology. We found that the HytaNPV genome is 139,089 bp long with a GC content of 39.6%. It encodes 141 open reading frames (ORFs) including the 37 baculovirus core genes, 25 genes conserved among lepidopteran baculoviruses, 72 genes known in baculovirus, and 7 genes unique to the HytaNPV genome. It is a group II alphabaculovirus that codes for the F protein and lacks the gp64 gene found in group I alphabaculovirus viruses. Using RNA-seq, we confirmed the expression of the ORFs identified in the HytaNPV genome. Phylogenetic analysis showed HytaNPV to be closest to BusuNPV, SujuNPV and EcobNPV that infect other tea pests, Buzura suppressaria, Sucra jujuba, and Ectropis oblique, respectively. We identified repeat elements and a conserved non-coding baculovirus element in the genome. Analysis of the putative promoter sequences identified motif consistent with the temporal expression of the genes observed in the RNA-seq data.
Unique Physiological and Transcriptional Shifts under Combinations of Salinity, Drought, and Heat.
Shaar-Moshe, Lidor; Blumwald, Eduardo; Peleg, Zvi
2017-05-01
Climate-change-driven stresses such as extreme temperatures, water deficit, and ion imbalance are projected to exacerbate and jeopardize global food security. Under field conditions, these stresses usually occur simultaneously and cause damages that exceed single stresses. Here, we investigated the transcriptional patterns and morpho-physiological acclimations of Brachypodium dystachion to single salinity, drought, and heat stresses, as well as their double and triple stress combinations. Hierarchical clustering analysis of morpho-physiological acclimations showed that several traits exhibited a gradually aggravating effect as plants were exposed to combined stresses. On the other hand, other morphological traits were dominated by salinity, while some physiological traits were shaped by heat stress. Response patterns of differentially expressed genes, under single and combined stresses (i.e. common stress genes), were maintained only among 37% of the genes, indicating a limited expression consistency among partially overlapping stresses. A comparison between common stress genes and genes that were uniquely expressed only under combined stresses (i.e. combination unique genes) revealed a significant shift from increased intensity to antagonistic responses, respectively. The different transcriptional signatures imply an alteration in the mode of action under combined stresses and limited ability to predict plant responses as different stresses are combined. Coexpression analysis coupled with enrichment analysis revealed that each gene subset was enriched with different biological processes. Common stress genes were enriched with known stress response pathways, while combination unique-genes were enriched with unique processes and genes with unknown functions that hold the potential to improve stress tolerance and enhance cereal productivity under suboptimal field conditions. © 2017 American Society of Plant Biologists. All Rights Reserved.
Localization of migraine susceptibility genes in human brain by single-cell RNA sequencing.
Renthal, William
2018-01-01
Background Migraine is a debilitating disorder characterized by severe headaches and associated neurological symptoms. A key challenge to understanding migraine has been the cellular complexity of the human brain and the multiple cell types implicated in its pathophysiology. The present study leverages recent advances in single-cell transcriptomics to localize the specific human brain cell types in which putative migraine susceptibility genes are expressed. Methods The cell-type specific expression of both familial and common migraine-associated genes was determined bioinformatically using data from 2,039 individual human brain cells across two published single-cell RNA sequencing datasets. Enrichment of migraine-associated genes was determined for each brain cell type. Results Analysis of single-brain cell RNA sequencing data from five major subtypes of cells in the human cortex (neurons, oligodendrocytes, astrocytes, microglia, and endothelial cells) indicates that over 40% of known migraine-associated genes are enriched in the expression profiles of a specific brain cell type. Further analysis of neuronal migraine-associated genes demonstrated that approximately 70% were significantly enriched in inhibitory neurons and 30% in excitatory neurons. Conclusions This study takes the next step in understanding the human brain cell types in which putative migraine susceptibility genes are expressed. Both familial and common migraine may arise from dysfunction of discrete cell types within the neurovascular unit, and localization of the affected cell type(s) in an individual patient may provide insight into to their susceptibility to migraine.
Molecular characterization of the vitamin D receptor (VDR) gene in Holstein cows.
Ali, Mayar O; El-Adl, Mohamed A; Ibrahim, Hussam M M; Elseedy, Youssef Y; Rizk, Mohamed A; El-Khodery, Sabry A
2018-06-01
Vitamin D plays a vital role in calcium homeostasis, growth, and immunoregulation. Because little is known about the vitamin D receptor (VDR) gene in cattle, the aim of the present investigation was to present the molecular characterization of exons 5 and 6 of the VDR gene in Holstein cows. DNA extraction, genomic sequencing, phylogenetic analysis, synteny mapping and single nucleotide gene polymorphism analysis of the VDR gene were performed to assess blood samples collected from 50 clinically healthy Holstein cows. The results revealed the presence of a 450-base pair (bp) nucleotide sequence that resembled exons 5 and 6 with intron 5 enclosed between these exons. Sequence alignment and phylogenetic analysis revealed a close relationship between the sequenced VDR region and that found in Hereford cattle. A close association between this region and the corresponding region in small ruminants was also documented. Moreover, a single nucleotide polymorphism (SNP) that caused the replacement of a glutamate with an arginine in the deduced amino acid sequence was detected at position 7 of exon 5. In conclusion, Holstein and Hereford cattle differ with respect to exon 5 of the VDR gene. Phylogenetic analysis of the VDR gene based on nucleotide sequence produced different results from prior analyses based on amino acid sequence. Copyright © 2018 Elsevier Ltd. All rights reserved.
A powerful score-based test statistic for detecting gene-gene co-association.
Xu, Jing; Yuan, Zhongshang; Ji, Jiadong; Zhang, Xiaoshuai; Li, Hongkai; Wu, Xuesen; Xue, Fuzhong; Liu, Yanxun
2016-01-29
The genetic variants identified by Genome-wide association study (GWAS) can only account for a small proportion of the total heritability for complex disease. The existence of gene-gene joint effects which contains the main effects and their co-association is one of the possible explanations for the "missing heritability" problems. Gene-gene co-association refers to the extent to which the joint effects of two genes differ from the main effects, not only due to the traditional interaction under nearly independent condition but the correlation between genes. Generally, genes tend to work collaboratively within specific pathway or network contributing to the disease and the specific disease-associated locus will often be highly correlated (e.g. single nucleotide polymorphisms (SNPs) in linkage disequilibrium). Therefore, we proposed a novel score-based statistic (SBS) as a gene-based method for detecting gene-gene co-association. Various simulations illustrate that, under different sample sizes, marginal effects of causal SNPs and co-association levels, the proposed SBS has the better performance than other existed methods including single SNP-based and principle component analysis (PCA)-based logistic regression model, the statistics based on canonical correlations (CCU), kernel canonical correlation analysis (KCCU), partial least squares path modeling (PLSPM) and delta-square (δ (2)) statistic. The real data analysis of rheumatoid arthritis (RA) further confirmed its advantages in practice. SBS is a powerful and efficient gene-based method for detecting gene-gene co-association.
Functional Regression Models for Epistasis Analysis of Multiple Quantitative Traits.
Zhang, Futao; Xie, Dan; Liang, Meimei; Xiong, Momiao
2016-04-01
To date, most genetic analyses of phenotypes have focused on analyzing single traits or analyzing each phenotype independently. However, joint epistasis analysis of multiple complementary traits will increase statistical power and improve our understanding of the complicated genetic structure of the complex diseases. Despite their importance in uncovering the genetic structure of complex traits, the statistical methods for identifying epistasis in multiple phenotypes remains fundamentally unexplored. To fill this gap, we formulate a test for interaction between two genes in multiple quantitative trait analysis as a multiple functional regression (MFRG) in which the genotype functions (genetic variant profiles) are defined as a function of the genomic position of the genetic variants. We use large-scale simulations to calculate Type I error rates for testing interaction between two genes with multiple phenotypes and to compare the power with multivariate pairwise interaction analysis and single trait interaction analysis by a single variate functional regression model. To further evaluate performance, the MFRG for epistasis analysis is applied to five phenotypes of exome sequence data from the NHLBI's Exome Sequencing Project (ESP) to detect pleiotropic epistasis. A total of 267 pairs of genes that formed a genetic interaction network showed significant evidence of epistasis influencing five traits. The results demonstrate that the joint interaction analysis of multiple phenotypes has a much higher power to detect interaction than the interaction analysis of a single trait and may open a new direction to fully uncovering the genetic structure of multiple phenotypes.
Wu, Meiye; Singh, Anup K
2012-01-01
Heterogeneity of cellular systems has been widely recognized but only recently have tools become available that allow probing of genes and proteins in single cells to understand it. While the advancement in single cell genomic analysis has been greatly aided by the power of amplification techniques (e.g., PCR), analysis of proteins in single cells has proven to be more challenging. However, recent advances in multi-parameter flow cytometry, microfluidics and other techniques have made it possible to measure wide variety of proteins in single cells. In this review, we highlight key recent developments in analysis of proteins in a single cell, and discuss their significance in biological research. PMID:22189001
Sun, Zhengda; Wang, Chih-Yang; Lawson, Devon A; Kwek, Serena; Velozo, Hugo Gonzalez; Owyong, Mark; Lai, Ming-Derg; Fong, Lawrence; Wilson, Mark; Su, Hua; Werb, Zena; Cooke, Daniel L
2018-02-16
Tumor endothelial cells (TEC) play an indispensible role in tumor growth and metastasis although much of the detailed mechanism still remains elusive. In this study we characterized and compared the global gene expression profiles of TECs and control ECs isolated from human breast cancerous tissues and reduction mammoplasty tissues respectively by single cell RNA sequencing (scRNA-seq). Based on the qualified scRNA-seq libraries that we made, we found that 1302 genes were differentially expressed between these two EC phenotypes. Both principal component analysis (PCA) and heat map-based hierarchical clustering separated the cancerous versus control ECs as two distinctive clusters, and MetaCore disease biomarker analysis indicated that these differentially expressed genes are highly correlated with breast neoplasm diseases. Gene Set Enrichment Analysis software (GSEA) enriched these genes to extracellular matrix (ECM) signal pathways and highlighted 127 ECM-associated genes. External validation verified some of these ECM-associated genes are not only generally overexpressed in various cancer tissues but also specifically overexpressed in colorectal cancer ECs and lymphoma ECs. In conclusion, our data demonstrated that ECM-associated genes play pivotal roles in breast cancer EC biology and some of them could serve as potential TEC biomarkers for various cancers.
Zhu, Ying; Zhang, Yun-Xia; Liu, Wen-Wen; Ma, Yan; Fang, Qun; Yao, Bo
2015-04-01
This paper describes a nanoliter droplet array-based single-cell reverse transcription quantitative PCR (RT-qPCR) assay method for quantifying gene expression in individual cells. By sequentially printing nanoliter-scale droplets on microchip using a microfluidic robot, all liquid-handling operations including cell encapsulation, lysis, reverse transcription, and quantitative PCR with real-time fluorescence detection, can be automatically achieved. The inhibition effect of cell suspension buffer on RT-PCR assay was comprehensively studied to achieve high-sensitivity gene quantification. The present system was applied in the quantitative measurement of expression level of mir-122 in single Huh-7 cells. A wide distribution of mir-122 expression in single cells from 3061 copies/cell to 79998 copies/cell was observed, showing a high level of cell heterogeneity. With the advantages of full-automation in liquid-handling, simple system structure, and flexibility in achieving multi-step operations, the present method provides a novel liquid-handling mode for single cell gene expression analysis, and has significant potentials in transcriptional identification and rare cell analysis.
Zhu, Ying; Zhang, Yun-Xia; Liu, Wen-Wen; Ma, Yan; Fang, Qun; Yao, Bo
2015-01-01
This paper describes a nanoliter droplet array-based single-cell reverse transcription quantitative PCR (RT-qPCR) assay method for quantifying gene expression in individual cells. By sequentially printing nanoliter-scale droplets on microchip using a microfluidic robot, all liquid-handling operations including cell encapsulation, lysis, reverse transcription, and quantitative PCR with real-time fluorescence detection, can be automatically achieved. The inhibition effect of cell suspension buffer on RT-PCR assay was comprehensively studied to achieve high-sensitivity gene quantification. The present system was applied in the quantitative measurement of expression level of mir-122 in single Huh-7 cells. A wide distribution of mir-122 expression in single cells from 3061 copies/cell to 79998 copies/cell was observed, showing a high level of cell heterogeneity. With the advantages of full-automation in liquid-handling, simple system structure, and flexibility in achieving multi-step operations, the present method provides a novel liquid-handling mode for single cell gene expression analysis, and has significant potentials in transcriptional identification and rare cell analysis. PMID:25828383
Lymphocyte signaling: beyond knockouts.
Saveliev, Alexander; Tybulewicz, Victor L J
2009-04-01
The analysis of lymphocyte signaling was greatly enhanced by the advent of gene targeting, which allows the selective inactivation of a single gene. Although this gene 'knockout' approach is often informative, in many cases, the phenotype resulting from gene ablation might not provide a complete picture of the function of the corresponding protein. If a protein has multiple functions within a single or several signaling pathways, or stabilizes other proteins in a complex, the phenotypic consequences of a gene knockout may manifest as a combination of several different perturbations. In these cases, gene targeting to 'knock in' subtle point mutations might provide more accurate insight into protein function. However, to be informative, such mutations must be carefully based on structural and biophysical data.
Malmstrom, Rex R; Rodrigue, Sébastien; Huang, Katherine H; Kelly, Libusha; Kern, Suzanne E; Thompson, Anne; Roggensack, Sara; Berube, Paul M; Henn, Matthew R; Chisholm, Sallie W
2013-01-01
Prochlorococcus is the numerically dominant photosynthetic organism throughout much of the world's oceans, yet little is known about the ecology and genetic diversity of populations inhabiting tropical waters. To help close this gap, we examined natural Prochlorococcus communities in the tropical Pacific Ocean using a single-cell whole-genome amplification and sequencing. Analysis of the gene content of just 10 single cells from these waters added 394 new genes to the Prochlorococcus pan-genome—that is, genes never before seen in a Prochlorococcus cell. Analysis of marker genes, including the ribosomal internal transcribed sequence, from dozens of individual cells revealed several representatives from two uncultivated clades of Prochlorococcus previously identified as HNLC1 and HNLC2. While the HNLC clades can dominate Prochlorococcus communities under certain conditions, their overall geographic distribution was highly restricted compared with other clades of Prochlorococcus. In the Atlantic and Pacific oceans, these clades were only found in warm waters with low Fe and high inorganic P levels. Genomic analysis suggests that at least one of these clades thrives in low Fe environments by scavenging organic-bound Fe, a process previously unknown in Prochlorococcus. Furthermore, the capacity to utilize organic-bound Fe appears to have been acquired horizontally and may be exchanged among other clades of Prochlorococcus. Finally, one of the single Prochlorococcus cells sequenced contained a partial genome of what appears to be a prophage integrated into the genome. PMID:22895163
A Minimally Invasive Method for Retrieving Single Adherent Cells of Different Types from Cultures
Zeng, Jia; Mohammadreza, Aida; Gao, Weimin; Merza, Saeed; Smith, Dean; Kelbauskas, Laimonas; Meldrum, Deirdre R.
2014-01-01
The field of single-cell analysis has gained a significant momentum over the last decade. Separation and isolation of individual cells is an indispensable step in almost all currently available single-cell analysis technologies. However, stress levels introduced by such manipulations remain largely unstudied. We present a method for minimally invasive retrieval of selected individual adherent cells of different types from cell cultures. The method is based on a combination of mechanical (shear flow) force and biochemical (trypsin digestion) treatment. We quantified alterations in the transcription levels of stress response genes in individual cells exposed to varying levels of shear flow and trypsinization. We report optimal temperature, RNA preservation reagents, shear force and trypsinization conditions necessary to minimize changes in the stress-related gene expression levels. The method and experimental findings are broadly applicable and can be used by a broad research community working in the field of single cell analysis. PMID:24957932
GSCALite: A Web Server for Gene Set Cancer Analysis.
Liu, Chun-Jie; Hu, Fei-Fei; Xia, Mengxuan; Han, Leng; Zhang, Qiong; Guo, An-Yuan
2018-05-22
The availability of cancer genomic data makes it possible to analyze genes related to cancer. Cancer is usually the result of a set of genes and the signal of a single gene could be covered by background noise. Here, we present a web server named Gene Set Cancer Analysis (GSCALite) to analyze a set of genes in cancers with the following functional modules. (i) Differential expression in tumor vs normal, and the survival analysis; (ii) Genomic variations and their survival analysis; (iii) Gene expression associated cancer pathway activity; (iv) miRNA regulatory network for genes; (v) Drug sensitivity for genes; (vi) Normal tissue expression and eQTL for genes. GSCALite is a user-friendly web server for dynamic analysis and visualization of gene set in cancer and drug sensitivity correlation, which will be of broad utilities to cancer researchers. GSCALite is available on http://bioinfo.life.hust.edu.cn/web/GSCALite/. guoay@hust.edu.cn or zhangqiong@hust.edu.cn. Supplementary data are available at Bioinformatics online.
Dominant selectable markers for Penicillium spp. transformation and gene function studies
USDA-ARS?s Scientific Manuscript database
Penicillium spp. has been genetically manipulated and gene function studies have utilized single gene deletion strains for phenotypic analysis. Fungal transformation experiments have relied on hygromycin and hygromycin phosphotransferase (hph) as the main dominant selectable marker (DSM) system in P...
A single determinant dominates the rate of yeast protein evolution.
Drummond, D Allan; Raval, Alpan; Wilke, Claus O
2006-02-01
A gene's rate of sequence evolution is among the most fundamental evolutionary quantities in common use, but what determines evolutionary rates has remained unclear. Here, we carry out the first combined analysis of seven predictors (gene expression level, dispensability, protein abundance, codon adaptation index, gene length, number of protein-protein interactions, and the gene's centrality in the interaction network) previously reported to have independent influences on protein evolutionary rates. Strikingly, our analysis reveals a single dominant variable linked to the number of translation events which explains 40-fold more variation in evolutionary rate than any other, suggesting that protein evolutionary rate has a single major determinant among the seven predictors. The dominant variable explains nearly half the variation in the rate of synonymous and protein evolution. We show that the two most commonly used methods to disentangle the determinants of evolutionary rate, partial correlation analysis and ordinary multivariate regression, produce misleading or spurious results when applied to noisy biological data. We overcome these difficulties by employing principal component regression, a multivariate regression of evolutionary rate against the principal components of the predictor variables. Our results support the hypothesis that translational selection governs the rate of synonymous and protein sequence evolution in yeast.
Li, Qike; Schissler, A Grant; Gardeux, Vincent; Achour, Ikbel; Kenost, Colleen; Berghout, Joanne; Li, Haiquan; Zhang, Hao Helen; Lussier, Yves A
2017-05-24
Transcriptome analytic tools are commonly used across patient cohorts to develop drugs and predict clinical outcomes. However, as precision medicine pursues more accurate and individualized treatment decisions, these methods are not designed to address single-patient transcriptome analyses. We previously developed and validated the N-of-1-pathways framework using two methods, Wilcoxon and Mahalanobis Distance (MD), for personal transcriptome analysis derived from a pair of samples of a single patient. Although, both methods uncover concordantly dysregulated pathways, they are not designed to detect dysregulated pathways with up- and down-regulated genes (bidirectional dysregulation) that are ubiquitous in biological systems. We developed N-of-1-pathways MixEnrich, a mixture model followed by a gene set enrichment test, to uncover bidirectional and concordantly dysregulated pathways one patient at a time. We assess its accuracy in a comprehensive simulation study and in a RNA-Seq data analysis of head and neck squamous cell carcinomas (HNSCCs). In presence of bidirectionally dysregulated genes in the pathway or in presence of high background noise, MixEnrich substantially outperforms previous single-subject transcriptome analysis methods, both in the simulation study and the HNSCCs data analysis (ROC Curves; higher true positive rates; lower false positive rates). Bidirectional and concordant dysregulated pathways uncovered by MixEnrich in each patient largely overlapped with the quasi-gold standard compared to other single-subject and cohort-based transcriptome analyses. The greater performance of MixEnrich presents an advantage over previous methods to meet the promise of providing accurate personal transcriptome analysis to support precision medicine at point of care.
Huang, D; Wu, W; Lu, L
2004-05-01
Amplification of resistance gene analogs (RGAs) is both a useful method for acquiring DNA markers closely linked to disease resistance (R) genes and a potential approach for the rapid cloning of R genes in plants. However, the screening of target sequences from among the numerous amplified RGAs can be very laborious. The amplification of RGAs from specific chromosomes could greatly reduce the number of RGAs to be screened and, consequently, speed up the identification of target RGAs. We have developed two methods for amplifying RGAs from single chromosomes. Method 1 uses products of Sau3A linker adaptor-mediated PCR (LAM-PCR) from a single chromosome as the templates for RGA amplification, while Method 2 directly uses a single chromosomal DNA molecule as the template. Using a pair of degenerate primers designed on the basis of the conserved nucleotide-binding-site motifs in many R genes, RGAs were successfully amplified from single chromosomes of pomelo using both these methods. Sequencing and cluster analysis of RGA clones obtained from single chromosomes revealed the number, type and organization of R-gene clusters on the chromosomes. We suggest that Method 1 is suitable for analyzing chromosomes that are unidentifiable under a microscope, while Method 2 is more appropriate when chromosomes can be clearly identified.
Sarkar, F H; Kupsky, W J; Li, Y W; Sreepathi, P
1994-03-01
Mutations in the p53 gene have been recognized in brain tumors, and clonal expansion of p53 mutant cells has been shown to be associated with glioma progression. However, studies on the p53 gene have been limited by the need for frozen tissues. We have developed a method utilizing polymerase chain reaction (PCR) for the direct analysis of p53 mutation by single-strand conformation polymorphism (SSCP) and by direct DNA sequencing of the p53 gene using a single 10-microns paraffin-embedded tissue section. We applied this method to screen for p53 gene mutations in exons 5-8 in human gliomas utilizing paraffin-embedded tissues. Twenty paraffin blocks containing tumor were selected from surgical specimens from 17 different adult patients. Tumors included six anaplastic astrocytomas (AAs), nine glioblastomas (GBs), and two mixed malignant gliomas (MMGs). The tissue section on the stained glass slide was used to guide microdissection of an unstained adjacent tissue section to ensure > 90% of the tumor cell population for p53 mutational analysis. Simultaneously, microdissection of the tissue was also carried out to obtain normal tissue from adjacent areas as a control. Mutations in the p53 gene were identified in 3 of 17 (18%) patients by PCR-SSCP analysis and subsequently confirmed by PCR-based DNA sequencing. Mutations in exon 5 resulting in amino acid substitution were found in one thalamic AA (codon 158, CGC > CTT: Arg > Leu) and one cerebral hemispheric GB (codon 151, CCG > CTG: Pro > Leu).(ABSTRACT TRUNCATED AT 250 WORDS)
Röschmann, K I L; van Kuijen, A-M; Luiten, S; Jonker, M J; Breit, T M; Fokkens, W J; Petersen, A; van Drunen, C M
2012-10-01
Seasonal allergic rhinitis (AR) is a global health problem and its prevalence has increased considerably in the last decades. As the allergic response with its clinical manifestations is triggered by only a few proteins within natural extracts, there is an increasing tendency for single-component-resolved diagnosis and immunotherapy. As natural exposure is not to single proteins, but to complex mixtures of molecules, we were interested in comparing the activation of respiratory epithelial cells induced by the purified major allergen Phl p 1 with the induction caused by a complete extract of Timothy grass pollen (GPE). NCI-H292 cells were exposed to GPE or Ph1 p 1 for 24 h, isolated RNA and cell culture supernatants were used for microarray analysis, multiplex enzyme-linked immunosorbant assay (ELISA) and subsequent analysis. We found 262 genes that showed a GPE-induced change of at least 3-fold, whereas Ph1 p 1-stimulation resulted in 71 genes with a fold induction of more than 3-fold. Besides genes that were regulated by both stimuli, we also detected genes displaying an opposite response after stimulation, suggesting that GPE might be more than purified major allergens with regard to induced immune responses. Additional components within GPE and the resulting modulation of general processes affecting gene transcription and signalling pathways might be crucial to maintain/overcome the diseased phenotype and to induce the influx of cells contributing to late-phase allergic responses. When the initial process of sensitization is the matter of interest or late-phase allergic responses, one might miss important immune modulatory molecules and their interaction with allergens by applying single components only. © 2012 Blackwell Publishing Ltd.
Matsunaga, Hiroko; Goto, Mari; Arikawa, Koji; Shirai, Masataka; Tsunoda, Hiroyuki; Huang, Huan; Kambara, Hideki
2015-02-15
Analyses of gene expressions in single cells are important for understanding detailed biological phenomena. Here, a highly sensitive and accurate method by sequencing (called "bead-seq") to obtain a whole gene expression profile for a single cell is proposed. A key feature of the method is to use a complementary DNA (cDNA) library on magnetic beads, which enables adding washing steps to remove residual reagents in a sample preparation process. By adding the washing steps, the next steps can be carried out under the optimal conditions without losing cDNAs. Error sources were carefully evaluated to conclude that the first several steps were the key steps. It is demonstrated that bead-seq is superior to the conventional methods for single-cell gene expression analyses in terms of reproducibility, quantitative accuracy, and biases caused during sample preparation and sequencing processes. Copyright © 2014 Elsevier Inc. All rights reserved.
Technique for quantitative RT-PCR analysis directly from single muscle fibers.
Wacker, Michael J; Tehel, Michelle M; Gallagher, Philip M
2008-07-01
The use of single-cell quantitative RT-PCR has greatly aided the study of gene expression in fields such as muscle physiology. For this study, we hypothesized that single muscle fibers from a biopsy can be placed directly into the reverse transcription buffer and that gene expression data can be obtained without having to first extract the RNA. To test this hypothesis, biopsies were taken from the vastus lateralis of five male subjects. Single muscle fibers were isolated and underwent RNA isolation (technique 1) or placed directly into reverse transcription buffer (technique 2). After cDNA conversion, individual fiber cDNA was pooled and quantitative PCR was performed using primer-probes for beta(2)-microglobulin, glyceraldehyde-3-phosphate dehydrogenase, insulin-like growth factor I receptor, and glucose transporter subtype 4. The no RNA extraction method provided similar quantitative PCR data as that of the RNA extraction method. A third technique was also tested in which we used one-quarter of an individual fiber's cDNA for PCR (not pooled) and the average coefficient of variation between fibers was <8% (cycle threshold value) for all genes studied. The no RNA extraction technique was tested on isolated muscle fibers using a gene known to increase after exercise (pyruvate dehydrogenase kinase 4). We observed a 13.9-fold change in expression after resistance exercise, which is consistent with what has been previously observed. These results demonstrate a successful method for gene expression analysis directly from single muscle fibers.
Gene expression profiling of single cells on large-scale oligonucleotide arrays
Hartmann, Claudia H.; Klein, Christoph A.
2006-01-01
Over the last decade, important insights into the regulation of cellular responses to various stimuli were gained by global gene expression analyses of cell populations. More recently, specific cell functions and underlying regulatory networks of rare cells isolated from their natural environment moved to the center of attention. However, low cell numbers still hinder gene expression profiling of rare ex vivo material in biomedical research. Therefore, we developed a robust method for gene expression profiling of single cells on high-density oligonucleotide arrays with excellent coverage of low abundance transcripts. The protocol was extensively tested with freshly isolated single cells of very low mRNA content including single epithelial, mature and immature dendritic cells and hematopoietic stem cells. Quantitative PCR confirmed that the PCR-based global amplification method did not change the relative ratios of transcript abundance and unsupervised hierarchical cluster analysis revealed that the histogenetic origin of an individual cell is correctly reflected by the gene expression profile. Moreover, the gene expression data from dendritic cells demonstrate that cellular differentiation and pathway activation can be monitored in individual cells. PMID:17071717
The GTPase Activating Rap/RanGAP Domain-Like 1 Gene Is Associated with Chicken Reproductive Traits
Shen, Xu; Zeng, Hua; Xie, Liang; He, Jun; Li, Jian; Xie, Xiujuan; Luo, Chenglong; Xu, Haiping; Zhou, Min; Nie, Qinghua; Zhang, Xiquan
2012-01-01
Background Abundant evidence indicates that chicken reproduction is strictly regulated by the hypothalamic-pituitary-gonad (HPG) axis, and the genes included in the HPG axis have been studied extensively. However, the question remains as to whether any other genes outside of the HPG system are involved in regulating chicken reproduction. The present study was aimed to identify, on a genome-wide level, novel genes associated with chicken reproductive traits. Methodology/Principal Finding Suppressive subtractive hybridization (SSH), genome-wide association study (GWAS), and gene-centric GWAS were used to identify novel genes underlying chicken reproduction. Single marker-trait association analysis with a large population and allelic frequency spectrum analysis were used to confirm the effects of candidate genes. Using two full-sib Ningdu Sanhuang (NDH) chickens, GARNL1 was identified as a candidate gene involved in chicken broodiness by SSH analysis. Its expression levels in the hypothalamus and pituitary were significantly higher in brooding chickens than in non-brooding chickens. GWAS analysis with a NDH two tail sample showed that 2802 SNPs were significantly associated with egg number at 300 d of age (EN300). Among the 2802 SNPs, 2 SNPs composed a block overlapping the GARNL1 gene. The gene-centric GWAS analysis with another two tail sample of NDH showed that GARNL1 was strongly associated with EN300 and age at first egg (AFE). Single marker-trait association analysis in 1301 female NDH chickens confirmed that variation in this gene was related to EN300 and AFE. The allelic frequency spectrum of the SNP rs15700989 among 5 different populations supported the above associations. Western blotting, RT-PCR, and qPCR were used to analyze alternative splicing of the GARNL1 gene. RT-PCR detected 5 transcripts and revealed that the transcript, which has a 141 bp insertion, was expressed in a tissue-specific manner. Conclusions/Significance Our findings demonstrate that the GARNL1 gene contributes to chicken reproductive traits. PMID:22496769
Mutational analysis of the transcriptional activator VirG of Agrobacterium tumefaciens.
Scheeren-Groot, E P; Rodenburg, K W; den Dulk-Ras, A; Turk, S C; Hooykaas, P J
1994-01-01
To find VirG proteins with altered properties, the virG gene was mutagenized. Random chemical mutagenesis of single-stranded DNA containing the Agrobacterium tumefaciens virG gene led with high frequency to the inactivation of the gene. Sequence analysis showed that 29% of the mutants contained a virG gene with one single-base-pair substitution somewhere in the open reading frame. Thirty-nine different mutations that rendered the VirG protein inactive were mapped. Besides these inactive mutants, two mutants in which the vir genes were active even in the absence of acetosyringone were found on indicator plates. A VirG protein with an N54D substitution turned out to be able to induce a virB-lacZ reporter gene to a high level even in the absence of the inducer acetosyringone. A VirG protein with an I77V substitution exhibited almost no induction in the absence of acetosyringone but showed a maximum induction level already at low concentrations of acetosyringone. Images PMID:7961391
Decoding the Regulatory Network for Blood Development from Single-Cell Gene Expression Measurements
Haghverdi, Laleh; Lilly, Andrew J.; Tanaka, Yosuke; Wilkinson, Adam C.; Buettner, Florian; Macaulay, Iain C.; Jawaid, Wajid; Diamanti, Evangelia; Nishikawa, Shin-Ichi; Piterman, Nir; Kouskoff, Valerie; Theis, Fabian J.; Fisher, Jasmin; Göttgens, Berthold
2015-01-01
Here we report the use of diffusion maps and network synthesis from state transition graphs to better understand developmental pathways from single cell gene expression profiling. We map the progression of mesoderm towards blood in the mouse by single-cell expression analysis of 3,934 cells, capturing cells with blood-forming potential at four sequential developmental stages. By adapting the diffusion plot methodology for dimensionality reduction to single-cell data, we reconstruct the developmental journey to blood at single-cell resolution. Using transitions between individual cellular states as input, we develop a single-cell network synthesis toolkit to generate a computationally executable transcriptional regulatory network model that recapitulates blood development. Model predictions were validated by showing that Sox7 inhibits primitive erythropoiesis, and that Sox and Hox factors control early expression of Erg. We therefore demonstrate that single-cell analysis of a developing organ coupled with computational approaches can reveal the transcriptional programs that control organogenesis. PMID:25664528
A Nonlinear Model for Gene-Based Gene-Environment Interaction.
Sa, Jian; Liu, Xu; He, Tao; Liu, Guifen; Cui, Yuehua
2016-06-04
A vast amount of literature has confirmed the role of gene-environment (G×E) interaction in the etiology of complex human diseases. Traditional methods are predominantly focused on the analysis of interaction between a single nucleotide polymorphism (SNP) and an environmental variable. Given that genes are the functional units, it is crucial to understand how gene effects (rather than single SNP effects) are influenced by an environmental variable to affect disease risk. Motivated by the increasing awareness of the power of gene-based association analysis over single variant based approach, in this work, we proposed a sparse principle component regression (sPCR) model to understand the gene-based G×E interaction effect on complex disease. We first extracted the sparse principal components for SNPs in a gene, then the effect of each principal component was modeled by a varying-coefficient (VC) model. The model can jointly model variants in a gene in which their effects are nonlinearly influenced by an environmental variable. In addition, the varying-coefficient sPCR (VC-sPCR) model has nice interpretation property since the sparsity on the principal component loadings can tell the relative importance of the corresponding SNPs in each component. We applied our method to a human birth weight dataset in Thai population. We analyzed 12,005 genes across 22 chromosomes and found one significant interaction effect using the Bonferroni correction method and one suggestive interaction. The model performance was further evaluated through simulation studies. Our model provides a system approach to evaluate gene-based G×E interaction.
Combining lipophilic dye, in situ hybridization, immunohistochemistry, and histology.
Duncan, Jeremy; Kersigo, Jennifer; Gray, Brian; Fritzsch, Bernd
2011-03-17
Going beyond single gene function to cut deeper into gene regulatory networks requires multiple mutations combined in a single animal. Such analysis of two or more genes needs to be complemented with in situ hybridization of other genes, or immunohistochemistry of their proteins, both in whole mounted developing organs or sections for detailed resolution of the cellular and tissue expression alterations. Combining multiple gene alterations requires the use of cre or flipase to conditionally delete genes and avoid embryonic lethality. Required breeding schemes dramatically enhance effort and cost proportional to the number of genes mutated, with an outcome of very few animals with the full repertoire of genetic modifications desired. Amortizing the vast amount of effort and time to obtain these few precious specimens that are carrying multiple mutations necessitates tissue optimization. Moreover, investigating a single animal with multiple techniques makes it easier to correlate gene deletion defects with expression profiles. We have developed a technique to obtain a more thorough analysis of a given animal; with the ability to analyze several different histologically recognizable structures as well as gene and protein expression all from the same specimen in both whole mounted organs and sections. Although mice have been utilized to demonstrate the effectiveness of this technique it can be applied to a wide array of animals. To do this we combine lipophilic dye tracing, whole mount in situ hybridization, immunohistochemistry, and histology to extract the maximal possible amount of data.
Combining Lipophilic dye, in situ Hybridization, Immunohistochemistry, and Histology
Duncan, Jeremy; Kersigo, Jennifer; Gray, Brian; Fritzsch, Bernd
2011-01-01
Going beyond single gene function to cut deeper into gene regulatory networks requires multiple mutations combined in a single animal. Such analysis of two or more genes needs to be complemented with in situ hybridization of other genes, or immunohistochemistry of their proteins, both in whole mounted developing organs or sections for detailed resolution of the cellular and tissue expression alterations. Combining multiple gene alterations requires the use of cre or flipase to conditionally delete genes and avoid embryonic lethality. Required breeding schemes dramatically enhance effort and cost proportional to the number of genes mutated, with an outcome of very few animals with the full repertoire of genetic modifications desired. Amortizing the vast amount of effort and time to obtain these few precious specimens that are carrying multiple mutations necessitates tissue optimization. Moreover, investigating a single animal with multiple techniques makes it easier to correlate gene deletion defects with expression profiles. We have developed a technique to obtain a more thorough analysis of a given animal; with the ability to analyze several different histologically recognizable structures as well as gene and protein expression all from the same specimen in both whole mounted organs and sections. Although mice have been utilized to demonstrate the effectiveness of this technique it can be applied to a wide array of animals. To do this we combine lipophilic dye tracing, whole mount in situ hybridization, immunohistochemistry, and histology to extract the maximal possible amount of data. PMID:21445047
Lin, Ping-I; Martin, Eden R; Browning-Large, Carrie A; Schmechel, Donald E; Welsh-Bohmer, Kathleen A; Doraiswamy, P Murali; Gilbert, John R; Haines, Jonathan L; Pericak-Vance, Margaret A
2006-07-01
Previous linkage studies have suggested that chromosome 12 may harbor susceptibility genes for late-onset Alzheimer disease (LOAD). No risk genes on chromosome 12 have been conclusively identified yet. We have reported that the linkage evidence for LOAD in a 12q region was significantly increased in autopsy-confirmed families particularly for those showing no linkage to alpha-T catenin gene, a LOAD candidate gene on chromosome 10 [LOD score increased from 0.1 in the autopsy-confirmed subset to 4.19 in the unlinked subset (optimal subset); p<0.0001 for the increase in LOD score], indicating a one-LOD support interval spanning 6 Mb. To further investigate this finding and to identify potential candidate LOAD risk genes for follow-up analysis, we analyzed 99 single nucleotide polymorphisms in this region, for the overall sample, the autopsy-confirmed subset, and the optimal subset, respectively, for comparison. We saw no significant association (p<0.01) in the overall sample. In the autopsy-confirmed subset, the best finding was obtained in the activation transcription factor 7 (ATF7) gene (single-locus association, p=0.002; haplotype association global, p=0.007). In the optimal subset, the best finding was obtained in the hypothetical protein FLJ20436 (FLJ20436) gene (single-locus association, p=0.0026). These results suggest that subset and covariate analyses may be one approach to help identify novel susceptibility genes on chromosome 12q for LOAD.
Chen, Tzu-Han; Shiau, Hsin-Chieh
2018-01-01
Single cell transcriptome (SCT) analysis provides superior resolution to illustrate tumor cell heterogeneity for clinical implications. We characterized four SCTs of MCF-7 using 143 housekeeping genes (HKGs) as control, of which lactate dehydrogenase B (LDHB) expression is silenced. These SCT libraries mapped to 11,423, 11,486, 10,380, and 11,306 RefSeq genes (UCSC), respectively. High consistency in HKG expression levels across all four SCTs, along with transcriptional silencing of LDHB, was observed, suggesting a high sensitivity and reproducibility of the SCT analysis. Cross-library comparison on expression levels by scatter plotting revealed a linear correlation and an 83–94% overlap in transcript isoforms and expressed genes were also observed. To gain insight of transcriptional diversity among the SCTs, expressed genes were split into consistently expressed (CE) (expressed in all SCTs) and inconsistently expressed (IE) (expressed in some but not all SCTs) genes for further characterization, along with the 142 expressed HKGs as a reference. Distinct transcriptional strengths were found among these groups, with averages of 1,612.0, 88.0 and 1.2 FPKM for HKGs, CE and IE, respectively. Comparison between CE and IE groups further indicated that expressions of CE genes vary more significantly than that of IE genes. Gene Ontology analysis indicated that proteins encoded by CE genes are mainly involved in fundamental intracellular activities, while proteins encoded by IE genes are mainly for extracellular activities, especially acting as receptors or ion channels. The diversified gene expressions, especially for those encoded by IE genes, may contribute to cancer drug resistance. PMID:29920548
Single cell transcriptomic analysis of prostate cancer cells.
Welty, Christopher J; Coleman, Ilsa; Coleman, Roger; Lakely, Bryce; Xia, Jing; Chen, Shu; Gulati, Roman; Larson, Sandy R; Lange, Paul H; Montgomery, Bruce; Nelson, Peter S; Vessella, Robert L; Morrissey, Colm
2013-02-16
The ability to interrogate circulating tumor cells (CTC) and disseminated tumor cells (DTC) is restricted by the small number detected and isolated (typically <10). To determine if a commercially available technology could provide a transcriptomic profile of a single prostate cancer (PCa) cell, we clonally selected and cultured a single passage of cell cycle synchronized C4-2B PCa cells. Ten sets of single, 5-, or 10-cells were isolated using a micromanipulator under direct visualization with an inverted microscope. Additionally, two groups of 10 individual DTC, each isolated from bone marrow of 2 patients with metastatic PCa were obtained. RNA was amplified using the WT-Ovation™ One-Direct Amplification System. The amplified material was hybridized on a 44K Whole Human Gene Expression Microarray. A high stringency threshold, a mean Alexa Fluor® 3 signal intensity above 300, was used for gene detection. Relative expression levels were validated for select genes using real-time PCR (RT-qPCR). Using this approach, 22,410, 20,423, and 17,009 probes were positive on the arrays from 10-cell pools, 5-cell pools, and single-cells, respectively. The sensitivity and specificity of gene detection on the single-cell analyses were 0.739 and 0.972 respectively when compared to 10-cell pools, and 0.814 and 0.979 respectively when compared to 5-cell pools, demonstrating a low false positive rate. Among 10,000 randomly selected pairs of genes, the Pearson correlation coefficient was 0.875 between the single-cell and 5-cell pools and 0.783 between the single-cell and 10-cell pools. As expected, abundant transcripts in the 5- and 10-cell samples were detected by RT-qPCR in the single-cell isolates, while lower abundance messages were not. Using the same stringency, 16,039 probes were positive on the patient single-cell arrays. Cluster analysis showed that all 10 DTC grouped together within each patient. A transcriptomic profile can be reliably obtained from a single cell using commercially available technology. As expected, fewer amplified genes are detected from a single-cell sample than from pooled-cell samples, however this method can be used to reliably obtain a transcriptomic profile from DTC isolated from the bone marrow of patients with PCa.
Lymphocyte signaling : beyond knockouts
Saveliev, Alexander; Tybulewicz, Victor L. J.
2016-01-01
The analysis of lymphocyte signaling was greatly enhanced by the advent of gene targeting, which allows the selective inactivation of a single gene. Whereas this gene ‘knockout’ approach is often informative, in many cases the phenotype resulting from gene ablation might not provide a complete picture of the function of the corresponding protein. If a protein has multiple functions within a single or several signaling pathways, or stabilizes other proteins in a complex, the phenotypic consequences of a gene knockout may manifest as a combination of several different perturbations. In these cases, gene targeting to ‘knockin’ subtle point mutations might provide more accurate insight into protein function. However, to be informative, such mutations must be carefully designed based on structural and biophysical data. PMID:19295633
Regularized rare variant enrichment analysis for case-control exome sequencing data.
Larson, Nicholas B; Schaid, Daniel J
2014-02-01
Rare variants have recently garnered an immense amount of attention in genetic association analysis. However, unlike methods traditionally used for single marker analysis in GWAS, rare variant analysis often requires some method of aggregation, since single marker approaches are poorly powered for typical sequencing study sample sizes. Advancements in sequencing technologies have rendered next-generation sequencing platforms a realistic alternative to traditional genotyping arrays. Exome sequencing in particular not only provides base-level resolution of genetic coding regions, but also a natural paradigm for aggregation via genes and exons. Here, we propose the use of penalized regression in combination with variant aggregation measures to identify rare variant enrichment in exome sequencing data. In contrast to marginal gene-level testing, we simultaneously evaluate the effects of rare variants in multiple genes, focusing on gene-based least absolute shrinkage and selection operator (LASSO) and exon-based sparse group LASSO models. By using gene membership as a grouping variable, the sparse group LASSO can be used as a gene-centric analysis of rare variants while also providing a penalized approach toward identifying specific regions of interest. We apply extensive simulations to evaluate the performance of these approaches with respect to specificity and sensitivity, comparing these results to multiple competing marginal testing methods. Finally, we discuss our findings and outline future research. © 2013 WILEY PERIODICALS, INC.
Zhang, Zhen; Shang, Haihong; Shi, Yuzhen; Huang, Long; Li, Junwen; Ge, Qun; Gong, Juwu; Liu, Aiying; Chen, Tingting; Wang, Dan; Wang, Yanling; Palanga, Koffi Kibalou; Muhammad, Jamshed; Li, Weijie; Lu, Quanwei; Deng, Xiaoying; Tan, Yunna; Song, Weiwu; Cai, Juan; Li, Pengtao; Rashid, Harun or; Gong, Wankui; Yuan, Youlu
2016-04-11
Upland Cotton (Gossypium hirsutum) is one of the most important worldwide crops it provides natural high-quality fiber for the industrial production and everyday use. Next-generation sequencing is a powerful method to identify single nucleotide polymorphism markers on a large scale for the construction of a high-density genetic map for quantitative trait loci mapping. In this research, a recombinant inbred lines population developed from two upland cotton cultivars 0-153 and sGK9708 was used to construct a high-density genetic map through the specific locus amplified fragment sequencing method. The high-density genetic map harbored 5521 single nucleotide polymorphism markers which covered a total distance of 3259.37 cM with an average marker interval of 0.78 cM without gaps larger than 10 cM. In total 18 quantitative trait loci of boll weight were identified as stable quantitative trait loci and were detected in at least three out of 11 environments and explained 4.15-16.70 % of the observed phenotypic variation. In total, 344 candidate genes were identified within the confidence intervals of these stable quantitative trait loci based on the cotton genome sequence. These genes were categorized based on their function through gene ontology analysis, Kyoto Encyclopedia of Genes and Genomes analysis and eukaryotic orthologous groups analysis. This research reported the first high-density genetic map for Upland Cotton (Gossypium hirsutum) with a recombinant inbred line population using single nucleotide polymorphism markers developed by specific locus amplified fragment sequencing. We also identified quantitative trait loci of boll weight across 11 environments and identified candidate genes within the quantitative trait loci confidence intervals. The results of this research would provide useful information for the next-step work including fine mapping, gene functional analysis, pyramiding breeding of functional genes as well as marker-assisted selection.
FastProject: a tool for low-dimensional analysis of single-cell RNA-Seq data.
DeTomaso, David; Yosef, Nir
2016-08-23
A key challenge in the emerging field of single-cell RNA-Seq is to characterize phenotypic diversity between cells and visualize this information in an informative manner. A common technique when dealing with high-dimensional data is to project the data to 2 or 3 dimensions for visualization. However, there are a variety of methods to achieve this result and once projected, it can be difficult to ascribe biological significance to the observed features. Additionally, when analyzing single-cell data, the relationship between cells can be obscured by technical confounders such as variable gene capture rates. To aid in the analysis and interpretation of single-cell RNA-Seq data, we have developed FastProject, a software tool which analyzes a gene expression matrix and produces a dynamic output report in which two-dimensional projections of the data can be explored. Annotated gene sets (referred to as gene 'signatures') are incorporated so that features in the projections can be understood in relation to the biological processes they might represent. FastProject provides a novel method of scoring each cell against a gene signature so as to minimize the effect of missed transcripts as well as a method to rank signature-projection pairings so that meaningful associations can be quickly identified. Additionally, FastProject is written with a modular architecture and designed to serve as a platform for incorporating and comparing new projection methods and gene selection algorithms. Here we present FastProject, a software package for two-dimensional visualization of single cell data, which utilizes a plethora of projection methods and provides a way to systematically investigate the biological relevance of these low dimensional representations by incorporating domain knowledge.
Coats' disease and congenital retinoschisis in a single eye: a case report and DNA analysis.
Berinstein, D M; Hiraoka, M; Trese, M T; Shastry, B S
2001-01-01
The clinical features of Coats' disease and congenital retinoschisis (RS) are distinctly different. Therefore, finding changes consistent with Coats' disease and congenital RS in a single eye is an unusual occurrence. The following report describes two cases with a Coats' telangiectatic lesion in one region of the retina separated by normal retina and the presence of central and peripheral congenital RS. Molecular genetic analysis of the Norrie disease and RS genes failed to identify disease-causing or polymorphic mutations in either of the genes, suggesting that the above condition is clinically and genetically a different disorder. Further studies are needed to identify the genes responsible for the above disorder and associated ocular manifestations. Copyright 2001 S. Karger AG, Basel.
Dynamic Network-Based Epistasis Analysis: Boolean Examples
Azpeitia, Eugenio; Benítez, Mariana; Padilla-Longoria, Pablo; Espinosa-Soto, Carlos; Alvarez-Buylla, Elena R.
2011-01-01
In this article we focus on how the hierarchical and single-path assumptions of epistasis analysis can bias the inference of gene regulatory networks. Here we emphasize the critical importance of dynamic analyses, and specifically illustrate the use of Boolean network models. Epistasis in a broad sense refers to gene interactions, however, as originally proposed by Bateson, epistasis is defined as the blocking of a particular allelic effect due to the effect of another allele at a different locus (herein, classical epistasis). Classical epistasis analysis has proven powerful and useful, allowing researchers to infer and assign directionality to gene interactions. As larger data sets are becoming available, the analysis of classical epistasis is being complemented with computer science tools and system biology approaches. We show that when the hierarchical and single-path assumptions are not met in classical epistasis analysis, the access to relevant information and the correct inference of gene interaction topologies is hindered, and it becomes necessary to consider the temporal dynamics of gene interactions. The use of dynamical networks can overcome these limitations. We particularly focus on the use of Boolean networks that, like classical epistasis analysis, relies on logical formalisms, and hence can complement classical epistasis analysis and relax its assumptions. We develop a couple of theoretical examples and analyze them from a dynamic Boolean network model perspective. Boolean networks could help to guide additional experiments and discern among alternative regulatory schemes that would be impossible or difficult to infer without the elimination of these assumption from the classical epistasis analysis. We also use examples from the literature to show how a Boolean network-based approach has resolved ambiguities and guided epistasis analysis. Our article complements previous accounts, not only by focusing on the implications of the hierarchical and single-path assumption, but also by demonstrating the importance of considering temporal dynamics, and specifically introducing the usefulness of Boolean network models and also reviewing some key properties of network approaches. PMID:22645556
Heendeniya, Ravindra G; Yu, Peiqiang
2017-03-20
Alfalfa ( Medicago sativa L.) genotypes transformed with Lc-bHLH and Lc transcription genes were developed with the intention of stimulating proanthocyanidin synthesis in the aerial parts of the plant. To our knowledge, there are no studies on the effect of single-gene and two-gene transformation on chemical functional groups and molecular structure changes in these plants. The objective of this study was to use advanced molecular spectroscopy with multivariate chemometrics to determine chemical functional group intensity and molecular structure changes in alfalfa plants when co-expressing Lc-bHLH and C1-MYB transcriptive flavanoid regulatory genes in comparison with non-transgenic (NT) and AC Grazeland (ACGL) genotypes. The results showed that compared to NT genotype, the presence of double genes ( Lc and C1 ) increased ratios of both the area and peak height of protein structural Amide I/II and the height ratio of α-helix to β-sheet. In carbohydrate-related spectral analysis, the double gene-transformed alfalfa genotypes exhibited lower peak heights at 1370, 1240, 1153, and 1020 cm -1 compared to the NT genotype. Furthermore, the effect of double gene transformation on carbohydrate molecular structure was clearly revealed in the principal component analysis of the spectra. In conclusion, single or double transformation of Lc and C1 genes resulted in changing functional groups and molecular structure related to proteins and carbohydrates compared to the NT alfalfa genotype. The current study provided molecular structural information on the transgenic alfalfa plants and provided an insight into the impact of transgenes on protein and carbohydrate properties and their molecular structure's changes.
Wang, Tianyu; Nabavi, Sheida
2018-04-24
Differential gene expression analysis is one of the significant efforts in single cell RNA sequencing (scRNAseq) analysis to discover the specific changes in expression levels of individual cell types. Since scRNAseq exhibits multimodality, large amounts of zero counts, and sparsity, it is different from the traditional bulk RNA sequencing (RNAseq) data. The new challenges of scRNAseq data promote the development of new methods for identifying differentially expressed (DE) genes. In this study, we proposed a new method, SigEMD, that combines a data imputation approach, a logistic regression model and a nonparametric method based on the Earth Mover's Distance, to precisely and efficiently identify DE genes in scRNAseq data. The regression model and data imputation are used to reduce the impact of large amounts of zero counts, and the nonparametric method is used to improve the sensitivity of detecting DE genes from multimodal scRNAseq data. By additionally employing gene interaction network information to adjust the final states of DE genes, we further reduce the false positives of calling DE genes. We used simulated datasets and real datasets to evaluate the detection accuracy of the proposed method and to compare its performance with those of other differential expression analysis methods. Results indicate that the proposed method has an overall powerful performance in terms of precision in detection, sensitivity, and specificity. Copyright © 2018 Elsevier Inc. All rights reserved.
MinePath: Mining for Phenotype Differential Sub-paths in Molecular Pathways
Koumakis, Lefteris; Kartsaki, Evgenia; Chatzimina, Maria; Zervakis, Michalis; Vassou, Despoina; Marias, Kostas; Moustakis, Vassilis; Potamias, George
2016-01-01
Pathway analysis methodologies couple traditional gene expression analysis with knowledge encoded in established molecular pathway networks, offering a promising approach towards the biological interpretation of phenotype differentiating genes. Early pathway analysis methodologies, named as gene set analysis (GSA), view pathways just as plain lists of genes without taking into account either the underlying pathway network topology or the involved gene regulatory relations. These approaches, even if they achieve computational efficiency and simplicity, consider pathways that involve the same genes as equivalent in terms of their gene enrichment characteristics. Most recent pathway analysis approaches take into account the underlying gene regulatory relations by examining their consistency with gene expression profiles and computing a score for each profile. Even with this approach, assessing and scoring single-relations limits the ability to reveal key gene regulation mechanisms hidden in longer pathway sub-paths. We introduce MinePath, a pathway analysis methodology that addresses and overcomes the aforementioned problems. MinePath facilitates the decomposition of pathways into their constituent sub-paths. Decomposition leads to the transformation of single-relations to complex regulation sub-paths. Regulation sub-paths are then matched with gene expression sample profiles in order to evaluate their functional status and to assess phenotype differential power. Assessment of differential power supports the identification of the most discriminant profiles. In addition, MinePath assess the significance of the pathways as a whole, ranking them by their p-values. Comparison results with state-of-the-art pathway analysis systems are indicative for the soundness and reliability of the MinePath approach. In contrast with many pathway analysis tools, MinePath is a web-based system (www.minepath.org) offering dynamic and rich pathway visualization functionality, with the unique characteristic to color regulatory relations between genes and reveal their phenotype inclination. This unique characteristic makes MinePath a valuable tool for in silico molecular biology experimentation as it serves the biomedical researchers’ exploratory needs to reveal and interpret the regulatory mechanisms that underlie and putatively govern the expression of target phenotypes. PMID:27832067
MinePath: Mining for Phenotype Differential Sub-paths in Molecular Pathways.
Koumakis, Lefteris; Kanterakis, Alexandros; Kartsaki, Evgenia; Chatzimina, Maria; Zervakis, Michalis; Tsiknakis, Manolis; Vassou, Despoina; Kafetzopoulos, Dimitris; Marias, Kostas; Moustakis, Vassilis; Potamias, George
2016-11-01
Pathway analysis methodologies couple traditional gene expression analysis with knowledge encoded in established molecular pathway networks, offering a promising approach towards the biological interpretation of phenotype differentiating genes. Early pathway analysis methodologies, named as gene set analysis (GSA), view pathways just as plain lists of genes without taking into account either the underlying pathway network topology or the involved gene regulatory relations. These approaches, even if they achieve computational efficiency and simplicity, consider pathways that involve the same genes as equivalent in terms of their gene enrichment characteristics. Most recent pathway analysis approaches take into account the underlying gene regulatory relations by examining their consistency with gene expression profiles and computing a score for each profile. Even with this approach, assessing and scoring single-relations limits the ability to reveal key gene regulation mechanisms hidden in longer pathway sub-paths. We introduce MinePath, a pathway analysis methodology that addresses and overcomes the aforementioned problems. MinePath facilitates the decomposition of pathways into their constituent sub-paths. Decomposition leads to the transformation of single-relations to complex regulation sub-paths. Regulation sub-paths are then matched with gene expression sample profiles in order to evaluate their functional status and to assess phenotype differential power. Assessment of differential power supports the identification of the most discriminant profiles. In addition, MinePath assess the significance of the pathways as a whole, ranking them by their p-values. Comparison results with state-of-the-art pathway analysis systems are indicative for the soundness and reliability of the MinePath approach. In contrast with many pathway analysis tools, MinePath is a web-based system (www.minepath.org) offering dynamic and rich pathway visualization functionality, with the unique characteristic to color regulatory relations between genes and reveal their phenotype inclination. This unique characteristic makes MinePath a valuable tool for in silico molecular biology experimentation as it serves the biomedical researchers' exploratory needs to reveal and interpret the regulatory mechanisms that underlie and putatively govern the expression of target phenotypes.
Single-cell regulome data analysis by SCRAT.
Ji, Zhicheng; Zhou, Weiqiang; Ji, Hongkai
2017-09-15
Emerging single-cell technologies (e.g. single-cell ATAC-seq, DNase-seq or ChIP-seq) have made it possible to assay regulome of individual cells. Single-cell regulome data are highly sparse and discrete. Analyzing such data is challenging. User-friendly software tools are still lacking. We present SCRAT, a Single-Cell Regulome Analysis Toolbox with a graphical user interface, for studying cell heterogeneity using single-cell regulome data. SCRAT can be used to conveniently summarize regulatory activities according to different features (e.g. gene sets, transcription factor binding motif sites, etc.). Using these features, users can identify cell subpopulations in a heterogeneous biological sample, infer cell identities of each subpopulation, and discover distinguishing features such as gene sets and transcription factors that show different activities among subpopulations. SCRAT is freely available at https://zhiji.shinyapps.io/scrat as an online web service and at https://github.com/zji90/SCRAT as an R package. hji@jhu.edu. Supplementary data are available at Bioinformatics online. © The Author(s) 2017. Published by Oxford University Press.
NASA Astrophysics Data System (ADS)
Werthmann, Britta; Marwan, Wolfgang
2017-11-01
The developmental switch to sporulation in Physarum polycephalum is a phytochrome-mediated far-red light-induced cell fate decision that synchronously encompasses the entire multinucleate plasmodial cell and is associated with extensive reprogramming of the transcriptome. By repeatedly taking samples of single cells after delivery of a light stimulus pulse, we analysed differential gene expression in two mutant strains and in a heterokaryon of the two strains all of which display a different propensity for making the cell fate decision. Multidimensional scaling of the gene expression data revealed individually different single cell trajectories eventually leading to sporulation. Characterization of the trajectories as walks through states of gene expression discretized by hierarchical clustering allowed the reconstruction of Petri nets that model and predict the observed behavior. Structural analyses of the Petri nets indicated stimulus- and genotype-dependence of both, single cell trajectories and of the quasipotential landscape through which these trajectories are taken. The Petri net-based approach to the analysis and decomposition of complex cellular responses and of complex mutant phenotypes may provide a scaffold for the data-driven reconstruction of causal molecular mechanisms that shape the topology of the quasipotential landscape.
Wu, Lei; He, Yao; Zhang, Di
2015-11-01
To systematically evaluate the association between single nucleotide polymorphism of rs2231142 genetic susceptibility and gout in East Asian population. The literature retrieval was conducted by using English databases (Medline, EMbase), Chinese databases (CNKI, Vip, Wanfang, SinaMed) and others to collect the published papers on the association between single nucleotide polymorphism of rs2231142 genetic susceptibility and gout by the end of December 2014. Meta-analysis was performed with software Stata 12.0. Nine studies were included. There were significant associations between increased risk of gout and single nucleotide polymorphism of rs2231142, the combined OR was 2.04 (95%CI: 1.82-2.28) for A allele and C allele, 1.97 (95%CI: 1.57-2.48) for CA and CC, 3.71 (95%CI: 3.07-4.47) for AA and CC. Sex and region specific subgroup analysis showed less heterogeneity. There is significant association between gout and single nucleotide polymorphism of rs2231142 in East Asian population, and A allele is a high risk gene for gout.
A survey of the sorghum transcriptome using single-molecule long reads
Abdel-Ghany, Salah E.; Hamilton, Michael; Jacobi, Jennifer L.; ...
2016-06-24
Alternative splicing and alternative polyadenylation (APA) of pre-mRNAs greatly contribute to transcriptome diversity, coding capacity of a genome and gene regulatory mechanisms in eukaryotes. Second-generation sequencing technologies have been extensively used to analyse transcriptomes. However, a major limitation of short-read data is that it is difficult to accurately predict full-length splice isoforms. Here we sequenced the sorghum transcriptome using Pacific Biosciences single-molecule real-time long-read isoform sequencing and developed a pipeline called TAPIS (Transcriptome Analysis Pipeline for Isoform Sequencing) to identify full-length splice isoforms and APA sites. Our analysis reveals transcriptome-wide full-length isoforms at an unprecedented scale with over 11,000 novelmore » splice isoforms. Additionally, we uncover APA ofB11,000 expressed genes and more than 2,100 novel genes. Lastly, these results greatly enhance sorghum gene annotations and aid in studying gene regulation in this important bioenergy crop. The TAPIS pipeline will serve as a useful tool to analyse Iso-Seq data from any organism.« less
A survey of the sorghum transcriptome using single-molecule long reads
Abdel-Ghany, Salah E.; Hamilton, Michael; Jacobi, Jennifer L.; Ngam, Peter; Devitt, Nicholas; Schilkey, Faye; Ben-Hur, Asa; Reddy, Anireddy S. N.
2016-01-01
Alternative splicing and alternative polyadenylation (APA) of pre-mRNAs greatly contribute to transcriptome diversity, coding capacity of a genome and gene regulatory mechanisms in eukaryotes. Second-generation sequencing technologies have been extensively used to analyse transcriptomes. However, a major limitation of short-read data is that it is difficult to accurately predict full-length splice isoforms. Here we sequenced the sorghum transcriptome using Pacific Biosciences single-molecule real-time long-read isoform sequencing and developed a pipeline called TAPIS (Transcriptome Analysis Pipeline for Isoform Sequencing) to identify full-length splice isoforms and APA sites. Our analysis reveals transcriptome-wide full-length isoforms at an unprecedented scale with over 11,000 novel splice isoforms. Additionally, we uncover APA of ∼11,000 expressed genes and more than 2,100 novel genes. These results greatly enhance sorghum gene annotations and aid in studying gene regulation in this important bioenergy crop. The TAPIS pipeline will serve as a useful tool to analyse Iso-Seq data from any organism. PMID:27339290
Prioritizing biological pathways by recognizing context in time-series gene expression data.
Lee, Jusang; Jo, Kyuri; Lee, Sunwon; Kang, Jaewoo; Kim, Sun
2016-12-23
The primary goal of pathway analysis using transcriptome data is to find significantly perturbed pathways. However, pathway analysis is not always successful in identifying pathways that are truly relevant to the context under study. A major reason for this difficulty is that a single gene is involved in multiple pathways. In the KEGG pathway database, there are 146 genes, each of which is involved in more than 20 pathways. Thus activation of even a single gene will result in activation of many pathways. This complex relationship often makes the pathway analysis very difficult. While we need much more powerful pathway analysis methods, a readily available alternative way is to incorporate the literature information. In this study, we propose a novel approach for prioritizing pathways by combining results from both pathway analysis tools and literature information. The basic idea is as follows. Whenever there are enough articles that provide evidence on which pathways are relevant to the context, we can be assured that the pathways are indeed related to the context, which is termed as relevance in this paper. However, if there are few or no articles reported, then we should rely on the results from the pathway analysis tools, which is termed as significance in this paper. We realized this concept as an algorithm by introducing Context Score and Impact Score and then combining the two into a single score. Our method ranked truly relevant pathways significantly higher than existing pathway analysis tools in experiments with two data sets. Our novel framework was implemented as ContextTRAP by utilizing two existing tools, TRAP and BEST. ContextTRAP will be a useful tool for the pathway based analysis of gene expression data since the user can specify the context of the biological experiment in a set of keywords. The web version of ContextTRAP is available at http://biohealth.snu.ac.kr/software/contextTRAP .
Large-Scale Femtoliter Droplet Array for Single Cell Efflux Assay of Bacteria.
Iino, Ryota; Sakakihara, Shouichi; Matsumoto, Yoshimi; Nishino, Kunihiko
2018-01-01
Large-scale femtoliter droplet array as a platform for single cell efflux assay of bacteria is described. Device microfabrication, femtoliter droplet array formation and concomitant enclosure of single bacterial cells, fluorescence-based detection of efflux activity at the single cell level, and collection of single cells from droplet and subsequent gene analysis are described in detail.
The complete chloroplast genome of Sinopodophyllum hexandrum (Berberidaceae).
Li, Huie; Guo, Qiqiang
2016-07-01
The complete chloroplast (cp) genome of the Sinopodophyllum hexandrum (Berberidaceae) was determined in this study. The circular genome is 157,940 bp in size, and comprises a pair of inverted repeat (IR) regions of 26,077 bp each, a large single-copy (LSC) region of 86,460 bp and a small single-copy (SSC) region of 19,326 bp. The GC content of the whole cp genome was 38.5%. A total of 133 genes were identified, including 88 protein-coding genes, 37 tRNA genes and eight rRNA genes. The whole cp genome consists of 114 unique genes, and 19 genes are duplicated in the IR regions. The phylogenetic analysis revealed that S. hexandrum is closely related to Nandina domestica within the family Berberidaceae.
The complete chloroplast genome sequence of Euonymus japonicus (Celastraceae).
Choi, Kyoung Su; Park, SeonJoo
2016-09-01
The complete chloroplast (cp) genome sequence of the Euonymus japonicus, the first sequenced of the genus Euonymus, was reported in this study. The total length was 157 637 bp, containing a pair of 26 678 bp inverted repeat region (IR), which were separated by small single copy (SSC) region and large single copy (LSC) region of 18 340 bp and 85 941 bp, respectively. This genome contains 107 unique genes, including 74 coding genes, four rRNA genes, and 29 tRNA genes. Seventeen genes contain intron of E. japonicus, of which three genes (clpP, ycf3, and rps12) include two introns. The maximum likelihood (ML) phylogenetic analysis revealed that E. japonicus was closely related to Manihot and Populus.
Integrating alternative splicing detection into gene prediction.
Foissac, Sylvain; Schiex, Thomas
2005-02-10
Alternative splicing (AS) is now considered as a major actor in transcriptome/proteome diversity and it cannot be neglected in the annotation process of a new genome. Despite considerable progresses in term of accuracy in computational gene prediction, the ability to reliably predict AS variants when there is local experimental evidence of it remains an open challenge for gene finders. We have used a new integrative approach that allows to incorporate AS detection into ab initio gene prediction. This method relies on the analysis of genomically aligned transcript sequences (ESTs and/or cDNAs), and has been implemented in the dynamic programming algorithm of the graph-based gene finder EuGENE. Given a genomic sequence and a set of aligned transcripts, this new version identifies the set of transcripts carrying evidence of alternative splicing events, and provides, in addition to the classical optimal gene prediction, alternative optimal predictions (among those which are consistent with the AS events detected). This allows for multiple annotations of a single gene in a way such that each predicted variant is supported by a transcript evidence (but not necessarily with a full-length coverage). This automatic combination of experimental data analysis and ab initio gene finding offers an ideal integration of alternatively spliced gene prediction inside a single annotation pipeline.
Amador, A; Papaceit, M; Juan, E
2001-06-01
The Adh locus of Drosophilidae is organized as a single gene transcribed from two spatially and temporally regulated promoters except in species of the repleta group, which have two single promoter genes. Here we show that in Drosophila funebris the Adh gene is transcribed from a single promoter, in both larva and adult, with qualitative and quantitative species specific-differences in tissue distribution. The gene is expressed in larval fat body but in other tissues such as gastric caeca, midgut and Malpighian tubules its expression is reduced compared to most Drosophilidae species, and in adults it is almost limited to the fat body. The comparative analysis of gene expression of two strains, which differ by a duplication, indicates that the cis elements necessary for this pattern of expression in larvae are included in the region of 1.55 kb upstream of the transcription initiation site. This new organization reveals the evolution of a different regulatory strategy to express the Adh gene in the subgenus Drosophila.
Woldesemayat, Adugna Abdi; Van Heusden, Peter; Ndimba, Bongani K; Christoffels, Alan
2017-12-22
Drought is the most disastrous abiotic stress that severely affects agricultural productivity worldwide. Understanding the biological basis of drought-regulated traits, requires identification and an in-depth characterization of genetic determinants using model organisms and high-throughput technologies. However, studies on drought tolerance have generally been limited to traditional candidate gene approach that targets only a single gene in a pathway that is related to a trait. In this study, we used sorghum, one of the model crops that is well adapted to arid regions, to mine genes and define determinants for drought tolerance using drought expression libraries and RNA-seq data. We provide an integrated and comparative in silico candidate gene identification, characterization and annotation approach, with an emphasis on genes playing a prominent role in conferring drought tolerance in sorghum. A total of 470 non-redundant functionally annotated drought responsive genes (DRGs) were identified using experimental data from drought responses by employing pairwise sequence similarity searches, pathway and interpro-domain analysis, expression profiling and orthology relation. Comparison of the genomic locations between these genes and sorghum quantitative trait loci (QTLs) showed that 40% of these genes were co-localized with QTLs known for drought tolerance. The genome reannotation conducted using the Program to Assemble Spliced Alignment (PASA), resulted in 9.6% of existing single gene models being updated. In addition, 210 putative novel genes were identified using AUGUSTUS and PASA based analysis on expression dataset. Among these, 50% were single exonic, 69.5% represented drought responsive and 5.7% were complete gene structure models. Analysis of biochemical metabolism revealed 14 metabolic pathways that are related to drought tolerance and also had a strong biological network, among categories of genes involved. Identification of these pathways, signifies the interplay of biochemical reactions that make up the metabolic network, constituting fundamental interface for sorghum defence mechanism against drought stress. This study suggests untapped natural variability in sorghum that could be used for developing drought tolerance. The data presented here, may be regarded as an initial reference point in functional and comparative genomics in the Gramineae family.
Model-based gene set analysis for Bioconductor.
Bauer, Sebastian; Robinson, Peter N; Gagneur, Julien
2011-07-01
Gene Ontology and other forms of gene-category analysis play a major role in the evaluation of high-throughput experiments in molecular biology. Single-category enrichment analysis procedures such as Fisher's exact test tend to flag large numbers of redundant categories as significant, which can complicate interpretation. We have recently developed an approach called model-based gene set analysis (MGSA), that substantially reduces the number of redundant categories returned by the gene-category analysis. In this work, we present the Bioconductor package mgsa, which makes the MGSA algorithm available to users of the R language. Our package provides a simple and flexible application programming interface for applying the approach. The mgsa package has been made available as part of Bioconductor 2.8. It is released under the conditions of the Artistic license 2.0. peter.robinson@charite.de; julien.gagneur@embl.de.
Prioritizing Genes Related to Nicotine Addiction Via a Multi-source-Based Approach.
Liu, Xinhua; Liu, Meng; Li, Xia; Zhang, Lihua; Fan, Rui; Wang, Ju
2015-08-01
Nicotine has a broad impact on both the central and peripheral nervous systems. Over the past decades, an increasing number of genes potentially involved in nicotine addiction have been identified by different technical approaches. However, the molecular mechanisms underlying nicotine addiction remain largely unknown. Under such situation, prioritizing the candidate genes for further investigation is becoming increasingly important. In this study, we presented a multi-source-based gene prioritization approach for nicotine addiction by utilizing the vast amounts of information generated from for nicotine addiction study during the past years. In this approach, we first collected and curated genes from studies in four categories, i.e., genetic association analysis, genetic linkage analysis, high-throughput gene/protein expression analysis, and literature search of single gene/protein-based studies. Based on these resources, the genes were scored and a weight value was determined for each category. Finally, the genes were ranked by their combined scores, and 220 genes were selected as the prioritized nicotine addiction-related genes. Evaluation suggested the prioritized genes were promising targets for further analysis and replication study.
Integrative sparse principal component analysis of gene expression data.
Liu, Mengque; Fan, Xinyan; Fang, Kuangnan; Zhang, Qingzhao; Ma, Shuangge
2017-12-01
In the analysis of gene expression data, dimension reduction techniques have been extensively adopted. The most popular one is perhaps the PCA (principal component analysis). To generate more reliable and more interpretable results, the SPCA (sparse PCA) technique has been developed. With the "small sample size, high dimensionality" characteristic of gene expression data, the analysis results generated from a single dataset are often unsatisfactory. Under contexts other than dimension reduction, integrative analysis techniques, which jointly analyze the raw data of multiple independent datasets, have been developed and shown to outperform "classic" meta-analysis and other multidatasets techniques and single-dataset analysis. In this study, we conduct integrative analysis by developing the iSPCA (integrative SPCA) method. iSPCA achieves the selection and estimation of sparse loadings using a group penalty. To take advantage of the similarity across datasets and generate more accurate results, we further impose contrasted penalties. Different penalties are proposed to accommodate different data conditions. Extensive simulations show that iSPCA outperforms the alternatives under a wide spectrum of settings. The analysis of breast cancer and pancreatic cancer data further shows iSPCA's satisfactory performance. © 2017 WILEY PERIODICALS, INC.
Gangavarapu, Kalyan J; Miller, Austin; Huss, Wendy J
2016-09-01
Defining biological signals at the single cell level can identify cancer initiating driver mutations. Techniques to isolate single cells such as microfluidics sorting and magnetic capturing systems have limitations such as: high cost, labor intense, and the requirement of a large number of cells. Therefore, the goal of our current study is to identify a cost and labor effective, reliable, and reproducible technique that allows single cell isolation for analysis to promote regular laboratory use, including standard reverse transcription PCR (RT-PCR). In the current study, we utilized single prostate cells isolated from the CWR-R1 prostate cancer cell line and human prostate clinical specimens, based on the ATP binding cassette (ABC) transporter efflux of dye cycle violet (DCV), side population assay. Expression of four genes: ABCG2; Aldehyde dehydrogenase1A1 (ALDH1A1); androgen receptor (AR); and embryonic stem cell marker, Oct-4, were determined. Results from the current study in the CWR-R1 cell line showed ABCG2 and ALDH1A1 gene expression in 67% of single side population cells and in 17% or 100% of non-side population cells respectively. Studies using single cells isolated from clinical specimens showed that the Oct-4 gene is detected in only 22% of single side population cells and in 78% of single non-side population cells. Whereas, AR gene expression is in 100% single side population and non-side population cells isolated from the same human prostate clinical specimen. These studies show that performing RT-PCR on single cells isolated by FACS can be successfully conducted to determine gene expression in single cells from cell lines and enzymatically digested tissue. While these studies provide a simple yes/no expression readout, the more sensitive quantitative RT-PCR would be able to provide even more information if necessary.
Gangavarapu, Kalyan J; Miller, Austin; Huss, Wendy J
2016-01-01
Defining biological signals at the single cell level can identify cancer initiating driver mutations. Techniques to isolate single cells such as microfluidics sorting and magnetic capturing systems have limitations such as: high cost, labor intense, and the requirement of a large number of cells. Therefore, the goal of our current study is to identify a cost and labor effective, reliable, and reproducible technique that allows single cell isolation for analysis to promote regular laboratory use, including standard reverse transcription PCR (RT-PCR). In the current study, we utilized single prostate cells isolated from the CWR-R1 prostate cancer cell line and human prostate clinical specimens, based on the ATP binding cassette (ABC) transporter efflux of dye cycle violet (DCV), side population assay. Expression of four genes: ABCG2; Aldehyde dehydrogenase1A1 (ALDH1A1); androgen receptor (AR); and embryonic stem cell marker, Oct-4, were determined. Results from the current study in the CWR-R1 cell line showed ABCG2 and ALDH1A1 gene expression in 67% of single side population cells and in 17% or 100% of non-side population cells respectively. Studies using single cells isolated from clinical specimens showed that the Oct-4 gene is detected in only 22% of single side population cells and in 78% of single non-side population cells. Whereas, AR gene expression is in 100% single side population and non-side population cells isolated from the same human prostate clinical specimen. These studies show that performing RT-PCR on single cells isolated by FACS can be successfully conducted to determine gene expression in single cells from cell lines and enzymatically digested tissue. While these studies provide a simple yes/no expression readout, the more sensitive quantitative RT-PCR would be able to provide even more information if necessary. PMID:27785389
Jaramillo, Luz Marina; Gutiérrez, Lina A; Luckhart, Shirley; Conn, Jan E; Correa, Margarita M
2012-01-01
To elucidate the Anopheles nuneztovari s.l. taxonomic status at a microgeographic level in four malaria endemic localities from Antioquia and Córdoba, Colombia, fragments of the Cytochrome oxidase subunit I (COI) and the white gene were used. The COI analysis showed low genetic differentiation with FST levels between −0.02 and 0.137 and Nm values between 3 and infinity, indicating the presence of high gene flow among An. nuneztovari s.l. populations from the four localities. The COI network showed a single most common haplotype, 1 (n=55), present in all localities, as the likely ancestral haplotype. Analysis of the white gene showed that An. nuneztovari s.l. populations from both departments grouped with haplotypes 19 and 20, which are part of lineage 3 previously reported. The results of the present study suggest that An. nuneztovari s.l. is a single taxon in the area of the present study. PMID:22241127
Wang, Zhuo; Jin, Shuilin; Liu, Guiyou; Zhang, Xiurui; Wang, Nan; Wu, Deliang; Hu, Yang; Zhang, Chiping; Jiang, Qinghua; Xu, Li; Wang, Yadong
2017-05-23
The development of single-cell RNA sequencing has enabled profound discoveries in biology, ranging from the dissection of the composition of complex tissues to the identification of novel cell types and dynamics in some specialized cellular environments. However, the large-scale generation of single-cell RNA-seq (scRNA-seq) data collected at multiple time points remains a challenge to effective measurement gene expression patterns in transcriptome analysis. We present an algorithm based on the Dynamic Time Warping score (DTWscore) combined with time-series data, that enables the detection of gene expression changes across scRNA-seq samples and recovery of potential cell types from complex mixtures of multiple cell types. The DTWscore successfully classify cells of different types with the most highly variable genes from time-series scRNA-seq data. The study was confined to methods that are implemented and available within the R framework. Sample datasets and R packages are available at https://github.com/xiaoxiaoxier/DTWscore .
Bayesian segregation analysis of production traits in two strains of laying chickens.
Szydłowski, M; Szwaczkowski, T
2001-02-01
A bayesian marker-free segregation analysis was applied to search for evidence of segregating genes affecting production traits in two strains of laying hens under long-term selection. The study used data from 6 generations of Leghorn (H77) and New Hampshire (N88) breeding nuclei. Estimation of marginal posterior means of variance components and parameters of a single autosomal locus was performed by use of the Gibbs sampler. The results showed evidence for a mixed major gene: -polygenic inheritance of BW and age at sexual maturity (ASM) in both strains. Single genes affecting BW and ASM explained one-third of the genetic variance. For ASM large overdominance effect at single locus was estimated. Initial egg production (IEP) and average egg weight (EW) showed a polygenic model of inheritance. The polygenic heritability estimates for BW, ASM, IEP, and EW were 0.32, 0.25, 0.23, and 0.08 in Strain H77 and 0.25, 0.24, 0.11, and 0.38 in Strain N88, respectively.
Ming, De-Song; Chen, Qing-Qing; Chen, Xiao-Tin
2018-05-14
To clarify the resistance mechanisms of Pannonibacter phragmitetus 31801, isolated from the blood of a liver abscess patient, at the genomic level, we performed whole genomic sequencing using a PacBio RS II single-molecule real-time long-read sequencer. Bioinformatic analysis of the resulting sequence was then carried out to identify any possible resistance genes. Analyses included Basic Local Alignment Search Tool searches against the Antibiotic Resistance Genes Database, ResFinder analysis of the genome sequence, and Resistance Gene Identifier analysis within the Comprehensive Antibiotic Resistance Database. Prophages, clustered regularly interspaced short palindromic repeats (CRISPR), and other putative virulence factors were also identified using PHAST, CRISPRfinder, and the Virulence Factors Database, respectively. The circular chromosome and single plasmid of P. phragmitetus 31801 contained multiple antibiotic resistance genes, including those coding for three different types of β-lactamase [NPS β-lactamase (EC 3.5.2.6), β-lactamase class C, and a metal-dependent hydrolase of β-lactamase superfamily I]. In addition, genes coding for subunits of several multidrug-resistance efflux pumps were identified, including those targeting macrolides (adeJ, cmeB), tetracycline (acrB, adeAB), fluoroquinolones (acrF, ceoB), and aminoglycosides (acrD, amrB, ceoB, mexY, smeB). However, apart from the tripartite macrolide efflux pump macAB-tolC, the genome did not appear to contain the complete complement of subunit genes required for production of most of the major multidrug-resistance efflux pumps.
Dai, Hongying; Wu, Guodong; Wu, Michael; Zhi, Degui
2016-01-01
Next-generation sequencing data pose a severe curse of dimensionality, complicating traditional "single marker-single trait" analysis. We propose a two-stage combined p-value method for pathway analysis. The first stage is at the gene level, where we integrate effects within a gene using the Sequence Kernel Association Test (SKAT). The second stage is at the pathway level, where we perform a correlated Lancaster procedure to detect joint effects from multiple genes within a pathway. We show that the Lancaster procedure is optimal in Bahadur efficiency among all combined p-value methods. The Bahadur efficiency,[Formula: see text], compares sample sizes among different statistical tests when signals become sparse in sequencing data, i.e. ε →0. The optimal Bahadur efficiency ensures that the Lancaster procedure asymptotically requires a minimal sample size to detect sparse signals ([Formula: see text]). The Lancaster procedure can also be applied to meta-analysis. Extensive empirical assessments of exome sequencing data show that the proposed method outperforms Gene Set Enrichment Analysis (GSEA). We applied the competitive Lancaster procedure to meta-analysis data generated by the Global Lipids Genetics Consortium to identify pathways significantly associated with high-density lipoprotein cholesterol, low-density lipoprotein cholesterol, triglycerides, and total cholesterol.
Carmona, Santiago J; Teichmann, Sarah A; Ferreira, Lauren; Macaulay, Iain C; Stubbington, Michael J T; Cvejic, Ana; Gfeller, David
2017-03-01
The immune system of vertebrate species consists of many different cell types that have distinct functional roles and are subject to different evolutionary pressures. Here, we first analyzed conservation of genes specific for all major immune cell types in human and mouse. Our results revealed higher gene turnover and faster evolution of trans -membrane proteins in NK cells compared with other immune cell types, and especially T cells, but similar conservation of nuclear and cytoplasmic protein coding genes. To validate these findings in a distant vertebrate species, we used single-cell RNA sequencing of lck:GFP cells in zebrafish and obtained the first transcriptome of specific immune cell types in a nonmammalian species. Unsupervised clustering and single-cell TCR locus reconstruction identified three cell populations, T cells, a novel type of NK-like cells, and a smaller population of myeloid-like cells. Differential expression analysis uncovered new immune-cell-specific genes, including novel immunoglobulin-like receptors, and neofunctionalization of recently duplicated paralogs. Evolutionary analyses confirmed the higher gene turnover of trans -membrane proteins in NK cells compared with T cells in fish species, suggesting that this is a general property of immune cell types across all vertebrates. © 2017 Carmona et al.; Published by Cold Spring Harbor Laboratory Press.
Ferreira, Lauren; Macaulay, Iain C.; Stubbington, Michael J.T.
2017-01-01
The immune system of vertebrate species consists of many different cell types that have distinct functional roles and are subject to different evolutionary pressures. Here, we first analyzed conservation of genes specific for all major immune cell types in human and mouse. Our results revealed higher gene turnover and faster evolution of trans-membrane proteins in NK cells compared with other immune cell types, and especially T cells, but similar conservation of nuclear and cytoplasmic protein coding genes. To validate these findings in a distant vertebrate species, we used single-cell RNA sequencing of lck:GFP cells in zebrafish and obtained the first transcriptome of specific immune cell types in a nonmammalian species. Unsupervised clustering and single-cell TCR locus reconstruction identified three cell populations, T cells, a novel type of NK-like cells, and a smaller population of myeloid-like cells. Differential expression analysis uncovered new immune-cell–specific genes, including novel immunoglobulin-like receptors, and neofunctionalization of recently duplicated paralogs. Evolutionary analyses confirmed the higher gene turnover of trans-membrane proteins in NK cells compared with T cells in fish species, suggesting that this is a general property of immune cell types across all vertebrates. PMID:28087841
Naushad, Sohail; Barkema, Herman W.; Luby, Christopher; Condas, Larissa A. Z.; Nobrega, Diego B.; Carson, Domonique A.; De Buck, Jeroen
2016-01-01
Non-aureus staphylococci (NAS), a heterogeneous group of a large number of species and subspecies, are the most frequently isolated pathogens from intramammary infections in dairy cattle. Phylogenetic relationships among bovine NAS species are controversial and have mostly been determined based on single-gene trees. Herein, we analyzed phylogeny of bovine NAS species using whole-genome sequencing (WGS) of 441 distinct isolates. In addition, evolutionary relationships among bovine NAS were estimated from multilocus data of 16S rRNA, hsp60, rpoB, sodA, and tuf genes and sequences from these and numerous other single genes/proteins. All phylogenies were created with FastTree, Maximum-Likelihood, Maximum-Parsimony, and Neighbor-Joining methods. Regardless of methodology, WGS-trees clearly separated bovine NAS species into five monophyletic coherent clades. Furthermore, there were consistent interspecies relationships within clades in all WGS phylogenetic reconstructions. Except for the Maximum-Parsimony tree, multilocus data analysis similarly produced five clades. There were large variations in determining clades and interspecies relationships in single gene/protein trees, under different methods of tree constructions, highlighting limitations of using single genes for determining bovine NAS phylogeny. However, based on WGS data, we established a robust phylogeny of bovine NAS species, unaffected by method or model of evolutionary reconstructions. Therefore, it is now possible to determine associations between phylogeny and many biological traits, such as virulence, antimicrobial resistance, environmental niche, geographical distribution, and host specificity. PMID:28066335
Phylogenomic Reconstruction of the Oomycete Phylogeny Derived from 37 Genomes
McCarthy, Charley G. P.
2017-01-01
ABSTRACT The oomycetes are a class of microscopic, filamentous eukaryotes within the Stramenopiles-Alveolata-Rhizaria (SAR) supergroup which includes ecologically significant animal and plant pathogens, most infamously the causative agent of potato blight Phytophthora infestans. Single-gene and concatenated phylogenetic studies both of individual oomycete genera and of members of the larger class have resulted in conflicting conclusions concerning species phylogenies within the oomycetes, particularly for the large Phytophthora genus. Genome-scale phylogenetic studies have successfully resolved many eukaryotic relationships by using supertree methods, which combine large numbers of potentially disparate trees to determine evolutionary relationships that cannot be inferred from individual phylogenies alone. With a sufficient amount of genomic data now available, we have undertaken the first whole-genome phylogenetic analysis of the oomycetes using data from 37 oomycete species and 6 SAR species. In our analysis, we used established supertree methods to generate phylogenies from 8,355 homologous oomycete and SAR gene families and have complemented those analyses with both phylogenomic network and concatenated supermatrix analyses. Our results show that a genome-scale approach to oomycete phylogeny resolves oomycete classes and individual clades within the problematic Phytophthora genus. Support for the resolution of the inferred relationships between individual Phytophthora clades varies depending on the methodology used. Our analysis represents an important first step in large-scale phylogenomic analysis of the oomycetes. IMPORTANCE The oomycetes are a class of eukaryotes and include ecologically significant animal and plant pathogens. Single-gene and multigene phylogenetic studies of individual oomycete genera and of members of the larger classes have resulted in conflicting conclusions concerning interspecies relationships among these species, particularly for the Phytophthora genus. The onset of next-generation sequencing techniques now means that a wealth of oomycete genomic data is available. For the first time, we have used genome-scale phylogenetic methods to resolve oomycete phylogenetic relationships. We used supertree methods to generate single-gene and multigene species phylogenies. Overall, our supertree analyses utilized phylogenetic data from 8,355 oomycete gene families. We have also complemented our analyses with superalignment phylogenies derived from 131 single-copy ubiquitous gene families. Our results show that a genome-scale approach to oomycete phylogeny resolves oomycete classes and clades. Our analysis represents an important first step in large-scale phylogenomic analysis of the oomycetes. PMID:28435885
Opazo, Juan C.; Toloza-Villalobos, Jessica; Burmester, Thorsten; Venkatesh, Byrappa; Storz, Jay F.
2015-01-01
Comparative analyses of vertebrate genomes continue to uncover a surprising diversity of genes in the globin gene superfamily, some of which have very restricted phyletic distributions despite their antiquity. Genomic analysis of the globin gene repertoire of cartilaginous fish (Chondrichthyes) should be especially informative about the duplicative origins and ancestral functions of vertebrate globins, as divergence between Chondrichthyes and bony vertebrates represents the most basal split within the jawed vertebrates. Here, we report a comparative genomic analysis of the vertebrate globin gene family that includes the complete globin gene repertoire of the elephant shark (Callorhinchus milii). Using genomic sequence data from representatives of all major vertebrate classes, integrated analyses of conserved synteny and phylogenetic relationships revealed that the last common ancestor of vertebrates possessed a repertoire of at least seven globin genes: single copies of androglobin and neuroglobin, four paralogous copies of globin X, and the single-copy progenitor of the entire set of vertebrate-specific globins. Combined with expression data, the genomic inventory of elephant shark globins yielded four especially surprising findings: 1) there is no trace of the neuroglobin gene (a highly conserved gene that is present in all other jawed vertebrates that have been examined to date), 2) myoglobin is highly expressed in heart, but not in skeletal muscle (reflecting a possible ancestral condition in vertebrates with single-circuit circulatory systems), 3) elephant shark possesses two highly divergent globin X paralogs, one of which is preferentially expressed in gonads, and 4) elephant shark possesses two structurally distinct α-globin paralogs, one of which is preferentially expressed in the brain. Expression profiles of elephant shark globin genes reveal distinct specializations of function relative to orthologs in bony vertebrates and suggest hypotheses about ancestral functions of vertebrate globins. PMID:25743544
Similarity of markers identified from cancer gene expression studies: observations from GEO.
Shi, Xingjie; Shen, Shihao; Liu, Jin; Huang, Jian; Zhou, Yong; Ma, Shuangge
2014-09-01
Gene expression profiling has been extensively conducted in cancer research. The analysis of multiple independent cancer gene expression datasets may provide additional information and complement single-dataset analysis. In this study, we conduct multi-dataset analysis and are interested in evaluating the similarity of cancer-associated genes identified from different datasets. The first objective of this study is to briefly review some statistical methods that can be used for such evaluation. Both marginal analysis and joint analysis methods are reviewed. The second objective is to apply those methods to 26 Gene Expression Omnibus (GEO) datasets on five types of cancers. Our analysis suggests that for the same cancer, the marker identification results may vary significantly across datasets, and different datasets share few common genes. In addition, datasets on different cancers share few common genes. The shared genetic basis of datasets on the same or different cancers, which has been suggested in the literature, is not observed in the analysis of GEO data. © The Author 2013. Published by Oxford University Press. For Permissions, please email: journals.permissions@oup.com.
Diverse Genome-wide Association Studies Associate the IL12/IL23 Pathway with Crohn Disease
Wang, Kai; Zhang, Haitao; Kugathasan, Subra; Annese, Vito; Bradfield, Jonathan P.; Russell, Richard K.; Sleiman, Patrick M.A.; Imielinski, Marcin; Glessner, Joseph; Hou, Cuiping; Wilson, David C.; Walters, Thomas; Kim, Cecilia; Frackelton, Edward C.; Lionetti, Paolo; Barabino, Arrigo; Van Limbergen, Johan; Guthery, Stephen; Denson, Lee; Piccoli, David; Li, Mingyao; Dubinsky, Marla; Silverberg, Mark; Griffiths, Anne; Grant, Struan F.A.; Satsangi, Jack; Baldassano, Robert; Hakonarson, Hakon
2009-01-01
Previous genome-wide association (GWA) studies typically focus on single-locus analysis, which may not have the power to detect the majority of genuinely associated loci. Here, we applied pathway analysis using Affymetrix SNP genotype data from the Wellcome Trust Case Control Consortium (WTCCC) and uncovered significant association between Crohn Disease (CD) and the IL12/IL23 pathway, harboring 20 genes (p = 8 × 10−5). Interestingly, the pathway contains multiple genes (IL12B and JAK2) or homologs of genes (STAT3 and CCR6) that were recently identified as genuine susceptibility genes only through meta-analysis of several GWA studies. In addition, the pathway contains other susceptibility genes for CD, including IL18R1, JUN, IL12RB1, and TYK2, which do not reach genome-wide significance by single-marker association tests. The observed pathway-specific association signal was subsequently replicated in three additional GWA studies of European and African American ancestry generated on the Illumina HumanHap550 platform. Our study suggests that examination beyond individual SNP hits, by focusing on genetic networks and pathways, is important to unleashing the true power of GWA studies. PMID:19249008
Tsuchida, Shuichi; Kagi, Akiko; Koyama, Hidekazu; Tagawa, Masahiro
2007-12-01
Xanthine urolithiasis was found in a 4-year-old spayed female Himalayan cat with a 10-month history of intermittent haematuria and dysuria. Ultrasonographs indicated the existence of several calculi in the bladder that were undetectable by survey radiographic examination. Four bladder stones were removed by cystotomy. The stones were spherical brownish-yellow and their surface was smooth and glossy. Quantitative mineral analysis showed a representative urolith to be composed of more than 95% xanthine. Ultrasonographic examination of the bladder 4.5 months postoperatively indicated the recurrence of urolithiasis. Analysis of purine concentration in urine and blood showed that the cat excreted excessive amounts of xanthine. In order to test the hypothesis that xanthinuria was caused by a homozygote of the inherited mutant allele of a gene responsible for deficiency of enzyme activity in purine degradation pathway, the allele composition of xanthine dehydrogenase (XDH) gene (one of the candidate genes for hereditary xanthinuria) was evaluated. The cat with xanthinuria was a heterozygote of the polymorphism. A single nucleotide polymorphism analysis of the cat XDH gene strongly indicated that the XDH gene of the patient cat was composed of two kinds of alleles and ruled out the hypothesis that the cat inherited the same recessive XDH allele suggesting no activity from a single ancestor.
Asaf, Sajjad; Khan, Abdul Latif; Khan, Muhammad Aaqil; Waqas, Muhammad; Kang, Sang-Mo; Yun, Byung-Wook; Lee, In-Jung
2017-08-08
We investigated the complete chloroplast (cp) genomes of non-model Arabidopsis halleri ssp. gemmifera and Arabidopsis lyrata ssp. petraea using Illumina paired-end sequencing to understand their genetic organization and structure. Detailed bioinformatics analysis revealed genome sizes of both subspecies ranging between 154.4~154.5 kbp, with a large single-copy region (84,197~84,158 bp), a small single-copy region (17,738~17,813 bp) and pair of inverted repeats (IRa/IRb; 26,264~26,259 bp). Both cp genomes encode 130 genes, including 85 protein-coding genes, eight ribosomal RNA genes and 37 transfer RNA genes. Whole cp genome comparison of A. halleri ssp. gemmifera and A. lyrata ssp. petraea, along with ten other Arabidopsis species, showed an overall high degree of sequence similarity, with divergence among some intergenic spacers. The location and distribution of repeat sequences were determined, and sequence divergences of shared genes were calculated among related species. Comparative phylogenetic analysis of the entire genomic data set and 70 shared genes between both cp genomes confirmed the previous phylogeny and generated phylogenetic trees with the same topologies. The sister species of A. halleri ssp. gemmifera is A. umezawana, whereas the closest relative of A. lyrata spp. petraea is A. arenicola.
Defining the Human Macula Transcriptome and Candidate Retinal Disease Genes UsingEyeSAGE
Rickman, Catherine Bowes; Ebright, Jessica N.; Zavodni, Zachary J.; Yu, Ling; Wang, Tianyuan; Daiger, Stephen P.; Wistow, Graeme; Boon, Kathy; Hauser, Michael A.
2009-01-01
Purpose To develop large-scale, high-throughput annotation of the human macula transcriptome and to identify and prioritize candidate genes for inherited retinal dystrophies, based on ocular-expression profiles using serial analysis of gene expression (SAGE). Methods Two human retina and two retinal pigment epithelium (RPE)/choroid SAGE libraries made from matched macula or midperipheral retina and adjacent RPE/choroid of morphologically normal 28- to 66-year-old donors and a human central retina longSAGE library made from 41- to 66-year-old donors were generated. Their transcription profiles were entered into a relational database, EyeSAGE, including microarray expression profiles of retina and publicly available normal human tissue SAGE libraries. EyeSAGE was used to identify retina- and RPE-specific and -associated genes, and candidate genes for retina and RPE disease loci. Differential and/or cell-type specific expression was validated by quantitative and single-cell RT-PCR. Results Cone photoreceptor-associated gene expression was elevated in the macula transcription profiles. Analysis of the longSAGE retina tags enhanced tag-to-gene mapping and revealed alternatively spliced genes. Analysis of candidate gene expression tables for the identified Bardet-Biedl syndrome disease gene (BBS5) in the BBS5 disease region table yielded BBS5 as the top candidate. Compelling candidates for inherited retina diseases were identified. Conclusions The EyeSAGE database, combining three different gene-profiling platforms including the authors’ multidonor-derived retina/RPE SAGE libraries and existing single-donor retina/RPE libraries, is a powerful resource for definition of the retina and RPE transcriptomes. It can be used to identify retina-specific genes, including alternatively spliced transcripts and to prioritize candidate genes within mapped retinal disease regions. PMID:16723438
Defining the human macula transcriptome and candidate retinal disease genes using EyeSAGE.
Bowes Rickman, Catherine; Ebright, Jessica N; Zavodni, Zachary J; Yu, Ling; Wang, Tianyuan; Daiger, Stephen P; Wistow, Graeme; Boon, Kathy; Hauser, Michael A
2006-06-01
To develop large-scale, high-throughput annotation of the human macula transcriptome and to identify and prioritize candidate genes for inherited retinal dystrophies, based on ocular-expression profiles using serial analysis of gene expression (SAGE). Two human retina and two retinal pigment epithelium (RPE)/choroid SAGE libraries made from matched macula or midperipheral retina and adjacent RPE/choroid of morphologically normal 28- to 66-year-old donors and a human central retina longSAGE library made from 41- to 66-year-old donors were generated. Their transcription profiles were entered into a relational database, EyeSAGE, including microarray expression profiles of retina and publicly available normal human tissue SAGE libraries. EyeSAGE was used to identify retina- and RPE-specific and -associated genes, and candidate genes for retina and RPE disease loci. Differential and/or cell-type specific expression was validated by quantitative and single-cell RT-PCR. Cone photoreceptor-associated gene expression was elevated in the macula transcription profiles. Analysis of the longSAGE retina tags enhanced tag-to-gene mapping and revealed alternatively spliced genes. Analysis of candidate gene expression tables for the identified Bardet-Biedl syndrome disease gene (BBS5) in the BBS5 disease region table yielded BBS5 as the top candidate. Compelling candidates for inherited retina diseases were identified. The EyeSAGE database, combining three different gene-profiling platforms including the authors' multidonor-derived retina/RPE SAGE libraries and existing single-donor retina/RPE libraries, is a powerful resource for definition of the retina and RPE transcriptomes. It can be used to identify retina-specific genes, including alternatively spliced transcripts and to prioritize candidate genes within mapped retinal disease regions.
Infrared laser-mediated local gene induction in medaka, zebrafish and Arabidopsis thaliana.
Deguchi, Tomonori; Itoh, Mariko; Urawa, Hiroko; Matsumoto, Tomohiro; Nakayama, Sohei; Kawasaki, Takashi; Kitano, Takeshi; Oda, Shoji; Mitani, Hiroshi; Takahashi, Taku; Todo, Takeshi; Sato, Junichi; Okada, Kiyotaka; Hatta, Kohei; Yuba, Shunsuke; Kamei, Yasuhiro
2009-12-01
Heat shock promoters are powerful tools for the precise control of exogenous gene induction in living organisms. In addition to the temporal control of gene expression, the analysis of gene function can also require spatial restriction. Recently, we reported a new method for in vivo, single-cell gene induction using an infrared laser-evoked gene operator (IR-LEGO) system in living nematodes (Caenorhabditis elegans). It was demonstrated that infrared (IR) irradiation could induce gene expression in single cells without incurring cellular damage. Here, we report the application of IR-LEGO to the small fish, medaka (Japanese killifish; Oryzias latipes) and zebrafish (Danio rerio), and a higher plant (Arabidopsis thaliana). Using easily observable reporter genes, we successfully induced gene expression in various tissues in these living organisms. IR-LEGO has the potential to be a useful tool in extensive research fields for cell/tissue marking or targeted gene expression in local tissues of small fish and plants.
Biased exonization of transposed elements in duplicated genes: A lesson from the TIF-IA gene.
Amit, Maayan; Sela, Noa; Keren, Hadas; Melamed, Ze'ev; Muler, Inna; Shomron, Noam; Izraeli, Shai; Ast, Gil
2007-11-29
Gene duplication and exonization of intronic transposed elements are two mechanisms that enhance genomic diversity. We examined whether there is less selection against exonization of transposed elements in duplicated genes than in single-copy genes. Genome-wide analysis of exonization of transposed elements revealed a higher rate of exonization within duplicated genes relative to single-copy genes. The gene for TIF-IA, an RNA polymerase I transcription initiation factor, underwent a humanoid-specific triplication, all three copies of the gene are active transcriptionally, although only one copy retains the ability to generate the TIF-IA protein. Prior to TIF-IA triplication, an Alu element was inserted into the first intron. In one of the non-protein coding copies, this Alu is exonized. We identified a single point mutation leading to exonization in one of the gene duplicates. When this mutation was introduced into the TIF-IA coding copy, exonization was activated and the level of the protein-coding mRNA was reduced substantially. A very low level of exonization was detected in normal human cells. However, this exonization was abundant in most leukemia cell lines evaluated, although the genomic sequence is unchanged in these cancerous cells compared to normal cells. The definition of the Alu element within the TIF-IA gene as an exon is restricted to certain types of cancers; the element is not exonized in normal human cells. These results further our understanding of the delicate interplay between gene duplication and alternative splicing and of the molecular evolutionary mechanisms leading to genetic innovations. This implies the existence of purifying selection against exonization in single copy genes, with duplicate genes free from such constrains.
Biased exonization of transposed elements in duplicated genes: A lesson from the TIF-IA gene
Amit, Maayan; Sela, Noa; Keren, Hadas; Melamed, Ze'ev; Muler, Inna; Shomron, Noam; Izraeli, Shai; Ast, Gil
2007-01-01
Background Gene duplication and exonization of intronic transposed elements are two mechanisms that enhance genomic diversity. We examined whether there is less selection against exonization of transposed elements in duplicated genes than in single-copy genes. Results Genome-wide analysis of exonization of transposed elements revealed a higher rate of exonization within duplicated genes relative to single-copy genes. The gene for TIF-IA, an RNA polymerase I transcription initiation factor, underwent a humanoid-specific triplication, all three copies of the gene are active transcriptionally, although only one copy retains the ability to generate the TIF-IA protein. Prior to TIF-IA triplication, an Alu element was inserted into the first intron. In one of the non-protein coding copies, this Alu is exonized. We identified a single point mutation leading to exonization in one of the gene duplicates. When this mutation was introduced into the TIF-IA coding copy, exonization was activated and the level of the protein-coding mRNA was reduced substantially. A very low level of exonization was detected in normal human cells. However, this exonization was abundant in most leukemia cell lines evaluated, although the genomic sequence is unchanged in these cancerous cells compared to normal cells. Conclusion The definition of the Alu element within the TIF-IA gene as an exon is restricted to certain types of cancers; the element is not exonized in normal human cells. These results further our understanding of the delicate interplay between gene duplication and alternative splicing and of the molecular evolutionary mechanisms leading to genetic innovations. This implies the existence of purifying selection against exonization in single copy genes, with duplicate genes free from such constrains. PMID:18047649
Hardy, W Reef; Moldovan, Nicanor I; Moldovan, Leni; Livak, Kenneth J; Datta, Krishna; Goswami, Chirayu; Corselli, Mirko; Traktuev, Dmitry O; Murray, Iain R; Péault, Bruno; March, Keith
2017-05-01
Adipose tissue is a rich source of multipotent mesenchymal stem-like cells, located in the perivascular niche. Based on their surface markers, these have been assigned to two main categories: CD31 - /CD45 - /CD34 + /CD146 - cells (adventitial stromal/stem cells [ASCs]) and CD31 - /CD45 - /CD34 - /CD146 + cells (pericytes [PCs]). These populations display heterogeneity of unknown significance. We hypothesized that aldehyde dehydrogenase (ALDH) activity, a functional marker of primitivity, could help to better define ASC and PC subclasses. To this end, the stromal vascular fraction from a human lipoaspirate was simultaneously stained with fluorescent antibodies to CD31, CD45, CD34, and CD146 antigens and the ALDH substrate Aldefluor, then sorted by fluorescence-activated cell sorting. Individual ASCs (n = 67) and PCs (n = 73) selected from the extremities of the ALDH-staining spectrum were transcriptionally profiled by Fluidigm single-cell quantitative polymerase chain reaction for a predefined set (n = 429) of marker genes. To these single-cell data, we applied differential expression and principal component and clustering analysis, as well as an original gene coexpression network reconstruction algorithm. Despite the stochasticity at the single-cell level, covariation of gene expression analysis yielded multiple network connectivity parameters suggesting that these perivascular progenitor cell subclasses possess the following order of maturity: (a) ALDH br ASC (most primitive); (b) ALDH dim ASC; (c) ALDH br PC; (d) ALDH dim PC (least primitive). This order was independently supported by specific combinations of class-specific expressed genes and further confirmed by the analysis of associated signaling pathways. In conclusion, single-cell transcriptional analysis of four populations isolated from fat by surface markers and enzyme activity suggests a developmental hierarchy among perivascular mesenchymal stem cells supported by markers and coexpression networks. Stem Cells 2017;35:1273-1289. © 2017 AlphaMed Press.
Schoeman, Elizna M; Lopez, Genghis H; McGowan, Eunike C; Millard, Glenda M; O'Brien, Helen; Roulis, Eileen V; Liew, Yew-Wah; Martin, Jacqueline R; McGrath, Kelli A; Powley, Tanya; Flower, Robert L; Hyland, Catherine A
2017-04-01
Blood group single nucleotide polymorphism genotyping probes for a limited range of polymorphisms. This study investigated whether massively parallel sequencing (also known as next-generation sequencing), with a targeted exome strategy, provides an extended blood group genotype and the extent to which massively parallel sequencing correctly genotypes in homologous gene systems, such as RH and MNS. Donor samples (n = 28) that were extensively phenotyped and genotyped using single nucleotide polymorphism typing, were analyzed using the TruSight One Sequencing Panel and MiSeq platform. Genes for 28 protein-based blood group systems, GATA1, and KLF1 were analyzed. Copy number variation analysis was used to characterize complex structural variants in the GYPC and RH systems. The average sequencing depth per target region was 66.2 ± 39.8. Each sample harbored on average 43 ± 9 variants, of which 10 ± 3 were used for genotyping. For the 28 samples, massively parallel sequencing variant sequences correctly matched expected sequences based on single nucleotide polymorphism genotyping data. Copy number variation analysis defined the Rh C/c alleles and complex RHD hybrids. Hybrid RHD*D-CE-D variants were correctly identified, but copy number variation analysis did not confidently distinguish between D and CE exon deletion versus rearrangement. The targeted exome sequencing strategy employed extended the range of blood group genotypes detected compared with single nucleotide polymorphism typing. This single-test format included detection of complex MNS hybrid cases and, with copy number variation analysis, defined RH hybrid genes along with the RHCE*C allele hitherto difficult to resolve by variant detection. The approach is economical compared with whole-genome sequencing and is suitable for a red blood cell reference laboratory setting. © 2017 AABB.
NASA Astrophysics Data System (ADS)
He, Feng; Wen, Haishen; Yu, Dahui; Li, Jifang; Shi, Bao; Chen, Caifang; Zhang, Jiaren; Jin, Guoxiong; Chen, Xiaoyan; Shi, Dan; Yang, Yanping
2010-12-01
Follicle stimulating hormone β (FSHβ) of Japanese flounder ( Paralichthys olivaceus) plays a key role in the regulation of gonadal development. This study aimed to investigate molecular genetic characteristics of the FSHβ gene and elucidate the effects of single nucleotide polymorphisms (SNPs) of FSHβ on reproductive traits in Japanese flounder. We used polymerase chain reaction single-strand conformation polymorphism (PCR-SSCP) and sequencing of the FSHβ gene in 60 individuals. We identified only an SNP (T/C) in the coding region of exon3 of FSHβ. The SNP (T/C) did not lead to amino acid changes at the position 340 bp of FSHβ gene. Statistical analysis showed that the SNP was significantly associated with testosterone (T) level and gonadosomatic index (GSI) ( P < 0.05). Individuals with genotype TC of the SNP had significantly higher serum T levels and GSI ( P < 0.05) than that of genotype CC. Therefore, FSHβ gene could be a useful molecular marker in selection for prominent reproductive trait in Japanese Flounder.
Yang, Yong; Wu, Zhihong; Zhao, Taimao; Wang, Hai; Zhao, Dong; Zhang, Jianguo; Wang, Yipeng; Ding, Yaozhong; Qiu, Guixing
2009-06-01
The etiology of adolescent idiopathic scoliosis is undetermined despite years of research. A number of hypotheses have been postulated to explain its development, including growth abnormalities. The irregular expression of growth hormone and insulin-like growth factor-1 (IGF-1) may disturb hormone metabolism, result in a gross asymmetry, and promote the progress of adolescent idiopathic scoliosis. Initial association studies in complex diseases have demonstrated the power of candidate gene association. Prior to our study, 1 study in this field had a negative result. A replicable study is vital for reliability. To determine the relationship of growth hormone receptor and IGF-1 genes with adolescent idiopathic scoliosis, a population-based association study was performed. Single nucleotide polymorphisms with potential function were selected from candidate genes and a distribution analysis was performed. A conclusion was made confirming the insufficiency of an association between adolescent idiopathic scoliosis and the single-nucleotide polymorphism of the growth hormone receptor and IGF-1 genes in Han Chinese.
The Metarhizium anisopliae trp1 gene: cloning and regulatory analysis.
Staats, Charley Christian; Silva, Marcia Suzana Nunes; Pinto, Paulo Marcos; Vainstein, Marilene Henning; Schrank, Augusto
2004-07-01
The trp1 gene from the entomopathogenic fungus Metarhizium anisopliae, cloned by heterologous hybridization with the plasmid carrying the trpC gene from Aspergillus nidulans, was sequence characterized. The predicted translation product has the conserved catalytic domains of glutamine amidotransferase (G domain), indoleglycerolphosphate synthase (C domain), and phosphoribosyl anthranilate isomerase (F domain) organized as NH2-G-C-F-COOH. The ORF is interrupted by a single intron of 60 nt that is position conserved in relation to trp genes from Ascomycetes and length conserved in relation to Basidiomycetes species. RT-PCR analysis suggests constitutive expression of trp1 gene in M. anisopliae.
Anderson, Olin D; Coleman-Derr, Devin; Gu, Yong Q; Heath, Sekou
2010-06-16
Among the dietary essential amino acids, the most severely limiting in the cereals is lysine. Since cereals make up half of the human diet, lysine limitation has quality/nutritional consequences. The breakdown of lysine is controlled mainly by the catabolic bifunctional enzyme lysine ketoglutarate reductase - saccharopine dehydrogenase (LKR/SDH). The LKR/SDH gene has been reported to produce transcripts for the bifunctional enzyme and separate monofunctional transcripts. In addition to lysine metabolism, this gene has been implicated in a number of metabolic and developmental pathways, which along with its production of multiple transcript types and complex exon/intron structure suggest an important node in plant metabolism. Understanding more about the LKR/SDH gene is thus interesting both from applied standpoint and for basic plant metabolism. The current report describes a wheat genomic fragment containing an LKR/SDH gene and adjacent genes. The wheat LKR/SDH genomic segment was found to originate from the A-genome of wheat, and EST analysis indicates all three LKR/SDH genes in hexaploid wheat are transcriptionally active. A comparison of a set of plant LKR/SDH genes suggests regions of greater sequence conservation likely related to critical enzymatic functions and metabolic controls. Although most plants contain only a single LKR/SDH gene per genome, poplar contains at least two functional bifunctional genes in addition to a monofunctional LKR gene. Analysis of ESTs finds evidence for monofunctional LKR transcripts in switchgrass, and monofunctional SDH transcripts in wheat, Brachypodium, and poplar. The analysis of a wheat LKR/SDH gene and comparative structural and functional analyses among available plant genes provides new information on this important gene. Both the structure of the LKR/SDH gene and the immediately adjacent genes show lineage-specific differences between monocots and dicots, and findings suggest variation in activity of LKR/SDH genes among plants. Although most plant genomes seem to contain a single conserved LKR/SDH gene per genome, poplar possesses multiple contiguous genes. A preponderance of SDH transcripts suggests the LKR region may be more rate-limiting. Only switchgrass has EST evidence for LKR monofunctional transcripts. Evidence for monofunctional SDH transcripts shows a novel intron in wheat, Brachypodium, and poplar.
Validation of high-throughput single cell analysis methodology.
Devonshire, Alison S; Baradez, Marc-Olivier; Morley, Gary; Marshall, Damian; Foy, Carole A
2014-05-01
High-throughput quantitative polymerase chain reaction (qPCR) approaches enable profiling of multiple genes in single cells, bringing new insights to complex biological processes and offering opportunities for single cell-based monitoring of cancer cells and stem cell-based therapies. However, workflows with well-defined sources of variation are required for clinical diagnostics and testing of tissue-engineered products. In a study of neural stem cell lines, we investigated the performance of lysis, reverse transcription (RT), preamplification (PA), and nanofluidic qPCR steps at the single cell level in terms of efficiency, precision, and limit of detection. We compared protocols using a separate lysis buffer with cell capture directly in RT-PA reagent. The two methods were found to have similar lysis efficiencies, whereas the direct RT-PA approach showed improved precision. Digital PCR was used to relate preamplified template copy numbers to Cq values and reveal where low-quality signals may affect the analysis. We investigated the impact of calibration and data normalization strategies as a means of minimizing the impact of inter-experimental variation on gene expression values and found that both approaches can improve data comparability. This study provides validation and guidance for the application of high-throughput qPCR workflows for gene expression profiling of single cells. Copyright © 2014 Elsevier Inc. All rights reserved.
Molecular mapping of stripe rust resistance gene Yr76 in winter club wheat cultivar Tyee
USDA-ARS?s Scientific Manuscript database
Tyee, one of the wheat cultivars used to differentiate races of Puccinia striiformis f. sp. tritici (Pst) in the United States, was identified to have a single gene for all-stage resistance, tentatively named YrTye. To map the gene, Tyee was crossed with ‘Avocet Susceptible’ (AvS). Genetic analysi...
Jena, Kshirod K; Hechanova, Sherry Lou; Verdeprado, Holden; Prahalada, G D; Kim, Sung-Ryul
2017-11-01
A first set of 25 NILs carrying ten BPH resistance genes and their pyramids was developed in the background of indica variety IR24 for insect resistance breeding in rice. Brown planthopper (Nilaparvata lugens Stal.) is one of the most destructive insect pests in rice. Development of near-isogenic lines (NILs) is an important strategy for genetic analysis of brown planthopper (BPH) resistance (R) genes and their deployment against diverse BPH populations. A set of 25 NILs with 9 single R genes and 16 multiple R gene combinations consisting of 11 two-gene pyramids and 5 three-gene pyramids in the genetic background of the susceptible indica rice cultivar IR24 was developed through marker-assisted selection. The linked DNA markers for each of the R genes were used for foreground selection and confirming the introgressed regions of the BPH R genes. Modified seed box screening and feeding rate of BPH were used to evaluate the spectrum of resistance. BPH reaction of each of the NILs carrying different single genes was variable at the antibiosis level with the four BPH populations of the Philippines. The NILs with two- to three-pyramided genes showed a stronger level of antibiosis (49.3-99.0%) against BPH populations compared with NILs with a single R gene NILs (42.0-83.5%) and IR24 (10.0%). Background genotyping by high-density SNPs markers revealed that most of the chromosome regions of the NILs (BC 3 F 5 ) had IR24 genome recovery of 82.0-94.2%. Six major agronomic data of the NILs showed a phenotypically comparable agronomic performance with IR24. These newly developed NILs will be useful as new genetic resources for BPH resistance breeding and are valuable sources of genes in monitoring against the emerging BPH biotypes in different rice-growing countries.
ADGO: analysis of differentially expressed gene sets using composite GO annotation.
Nam, Dougu; Kim, Sang-Bae; Kim, Seon-Kyu; Yang, Sungjin; Kim, Seon-Young; Chu, In-Sun
2006-09-15
Genes are typically expressed in modular manners in biological processes. Recent studies reflect such features in analyzing gene expression patterns by directly scoring gene sets. Gene annotations have been used to define the gene sets, which have served to reveal specific biological themes from expression data. However, current annotations have limited analytical power, because they are classified by single categories providing only unary information for the gene sets. Here we propose a method for discovering composite biological themes from expression data. We intersected two annotated gene sets from different categories of Gene Ontology (GO). We then scored the expression changes of all the single and intersected sets. In this way, we were able to uncover, for example, a gene set with the molecular function F and the cellular component C that showed significant expression change, while the changes in individual gene sets were not significant. We provided an exemplary analysis for HIV-1 immune response. In addition, we tested the method on 20 public datasets where we found many 'filtered' composite terms the number of which reached approximately 34% (a strong criterion, 5% significance) of the number of significant unary terms on average. By using composite annotation, we can derive new and improved information about disease and biological processes from expression data. We provide a web application (ADGO: http://array.kobic.re.kr/ADGO) for the analysis of differentially expressed gene sets with composite GO annotations. The user can analyze Affymetrix and dual channel array (spotted cDNA and spotted oligo microarray) data for four species: human, mouse, rat and yeast. chu@kribb.re.kr http://array.kobic.re.kr/ADGO.
Droplet barcoding for single cell transcriptomics applied to embryonic stem cells
Klein, Allon M; Mazutis, Linas; Akartuna, Ilke; Tallapragada, Naren; Veres, Adrian; Li, Victor; Peshkin, Leonid; Weitz, David A; Kirschner, Marc W
2015-01-01
Summary It has long been the dream of biologists to map gene expression at the single cell level. With such data one might track heterogeneous cell sub-populations, and infer regulatory relationships between genes and pathways. Recently, RNA sequencing has achieved single cell resolution. What is limiting is an effective way to routinely isolate and process large numbers of individual cells for quantitative in-depth sequencing. We have developed a high-throughput droplet-microfluidic approach for barcoding the RNA from thousands of individual cells for subsequent analysis by next-generation sequencing. The method shows a surprisingly low noise profile and is readily adaptable to other sequencing-based assays. We analyzed mouse embryonic stem cells, revealing in detail the population structure and the heterogeneous onset of differentiation after LIF withdrawal. The reproducibility of these high-throughput single cell data allowed us to deconstruct cell populations and infer gene expression relationships. PMID:26000487
Semrau, Stefan; Goldmann, Johanna E; Soumillon, Magali; Mikkelsen, Tarjei S; Jaenisch, Rudolf; van Oudenaarden, Alexander
2017-10-23
Gene expression heterogeneity in the pluripotent state of mouse embryonic stem cells (mESCs) has been increasingly well-characterized. In contrast, exit from pluripotency and lineage commitment have not been studied systematically at the single-cell level. Here we measure the gene expression dynamics of retinoic acid driven mESC differentiation from pluripotency to lineage commitment, using an unbiased single-cell transcriptomics approach. We find that the exit from pluripotency marks the start of a lineage transition as well as a transient phase of increased susceptibility to lineage specifying signals. Our study reveals several transcriptional signatures of this phase, including a sharp increase of gene expression variability and sequential expression of two classes of transcriptional regulators. In summary, we provide a comprehensive analysis of the exit from pluripotency and lineage commitment at the single cell level, a potential stepping stone to improved lineage manipulation through timing of differentiation cues.
Fundamental limits on dynamic inference from single-cell snapshots
Weinreb, Caleb; Tusi, Betsabeh K.; Socolovsky, Merav
2018-01-01
Single-cell expression profiling reveals the molecular states of individual cells with unprecedented detail. Because these methods destroy cells in the process of analysis, they cannot measure how gene expression changes over time. However, some information on dynamics is present in the data: the continuum of molecular states in the population can reflect the trajectory of a typical cell. Many methods for extracting single-cell dynamics from population data have been proposed. However, all such attempts face a common limitation: for any measured distribution of cell states, there are multiple dynamics that could give rise to it, and by extension, multiple possibilities for underlying mechanisms of gene regulation. Here, we describe the aspects of gene expression dynamics that cannot be inferred from a static snapshot alone and identify assumptions necessary to constrain a unique solution for cell dynamics from static snapshots. We translate these constraints into a practical algorithmic approach, population balance analysis (PBA), which makes use of a method from spectral graph theory to solve a class of high-dimensional differential equations. We use simulations to show the strengths and limitations of PBA, and then apply it to single-cell profiles of hematopoietic progenitor cells (HPCs). Cell state predictions from this analysis agree with HPC fate assays reported in several papers over the past two decades. By highlighting the fundamental limits on dynamic inference faced by any method, our framework provides a rigorous basis for dynamic interpretation of a gene expression continuum and clarifies best experimental designs for trajectory reconstruction from static snapshot measurements. PMID:29463712
Vibrio chromosomes share common history.
Kirkup, Benjamin C; Chang, LeeAnn; Chang, Sarah; Gevers, Dirk; Polz, Martin F
2010-05-10
While most gamma proteobacteria have a single circular chromosome, Vibrionales have two circular chromosomes. Horizontal gene transfer is common among Vibrios, and in light of this genetic mobility, it is an open question to what extent the two chromosomes themselves share a common history since their formation. Single copy genes from each chromosome (142 genes from chromosome I and 42 genes from chromosome II) were identified from 19 sequenced Vibrionales genomes and their phylogenetic comparison suggests consistent phylogenies for each chromosome. Additionally, study of the gene organization and phylogeny of the respective origins of replication confirmed the shared history. Thus, while elements within the chromosomes may have experienced significant genetic mobility, the backbones share a common history. This allows conclusions based on multilocus sequence analysis (MLSA) for one chromosome to be applied equally to both chromosomes.
DOE Office of Scientific and Technical Information (OSTI.GOV)
Morimoto, Yuji; Murayama, Nobuhiro; Kuwano, Akira
1995-12-18
The polymorphic allele of the monoamine oxidase B (MAO-B) gene detected by polymerase chain reaction (PCR) and single-stranded conformation polymorphism (SSCP) was associated with Parkinson`s disease (PD) in Caucasians. We characterized this polymorphic allele, allele 1, of the MAO-B gene using direct sequencing of PCR products. A single DNA substitution (G-A), resulting gain of Mae III restriction site was detected in intron 13 of the MAO-B gene. The allele associated with PD in Caucasians was twice as frequent as in healthy Japanese, but the association of the allele of the MAO-B gene was not observed in Japanese patients with PD.more » 7 refs., 2 figs., 1 tab.« less
Distinctive archaebacterial species associated with anaerobic rumen protozoan Entodinium caudatum.
Tóthová, T; Piknová, M; Kisidayová, S; Javorský, P; Pristas, P
2008-01-01
The diversity of archaebacteria associated with anaerobic rumen protozoan Entodinium caudatum in long term in vitro culture was investigated by denaturing gradient gel electrophoresis (DGGE) analysis of hypervariable V3 region of archaebacterial 16S rRNA gene. PCR was accomplished directly from DNA extracted from a single protozoal cell and from total community genomic DNA and the obtained fingerprints were compared. The analysis indicated the presence of a solitary intensive band present in Entodinium caudatum single cell DNA, which had no counterparts in the profile from total DNA. The identity of archaebacterium represented by this band was determined by sequence analysis which showed that the sequence fell to the cluster of ciliate symbiotic methanogens identified recently by 16S gene library approach.
Single molecule fluorescence microscopy for ultra-sensitive RNA expression profiling
NASA Astrophysics Data System (ADS)
Hesse, Jan; Jacak, Jaroslaw; Regl, Gerhard; Eichberger, Thomas; Aberger, Fritz; Schlapak, Robert; Howorka, Stefan; Muresan, Leila; Frischauf, Anna-Maria; Schütz, Gerhard J.
2007-02-01
We developed a microarray analysis platform for ultra-sensitive RNA expression profiling of minute samples. It utilizes a novel scanning system for single molecule fluorescence detection on cm2 size samples in combination with specialized biochips, optimized for low autofluorescence and weak unspecific adsorption. 20 μg total RNA was extracted from 10 6 cells of a human keratinocyte cell line (HaCaT) and reversely transcribed in the presence of Alexa647-aha-dUTP. 1% of the resulting labeled cDNA was used for complex hybridization to a custom-made oligonucleotide microarray representing a set of 125 different genes. For low abundant genes, individual cDNA molecules hybridized to the microarray spots could be resolved. Single cDNA molecules hybridized to the chip surface appeared as diffraction limited features in the fluorescence images. The à trous wavelet method was utilized for localization and counting of the separated cDNA signals. Subsequently, the degree of labeling of the localized cDNA molecules was determined by brightness analysis for the different genes. Variations by factors up to 6 were found, which in conventional microarray analysis would result in a misrepresentation of the relative abundance of mRNAs.
High-coverage methylation data of a gene model before and after DNA damage and homologous repair.
Pezone, Antonio; Russo, Giusi; Tramontano, Alfonso; Florio, Ermanno; Scala, Giovanni; Landi, Rosaria; Zuchegna, Candida; Romano, Antonella; Chiariotti, Lorenzo; Muller, Mark T; Gottesman, Max E; Porcellini, Antonio; Avvedimento, Enrico V
2017-04-11
Genome-wide methylation analysis is limited by its low coverage and the inability to detect single variants below 10%. Quantitative analysis provides accurate information on the extent of methylation of single CpG dinucleotide, but it does not measure the actual polymorphism of the methylation profiles of single molecules. To understand the polymorphism of DNA methylation and to decode the methylation signatures before and after DNA damage and repair, we have deep sequenced in bisulfite-treated DNA a reporter gene undergoing site-specific DNA damage and homologous repair. In this paper, we provide information on the data generation, the rationale for the experiments and the type of assays used, such as cytofluorimetry and immunoblot data derived during a previous work published in Scientific Reports, describing the methylation and expression changes of a model gene (GFP) before and after formation of a double-strand break and repair by homologous-recombination or non-homologous-end-joining. These data provide: 1) a reference for the analysis of methylation polymorphism at selected loci in complex cell populations; 2) a platform and the tools to compare transcription and methylation profiles.
High-coverage methylation data of a gene model before and after DNA damage and homologous repair
Pezone, Antonio; Russo, Giusi; Tramontano, Alfonso; Florio, Ermanno; Scala, Giovanni; Landi, Rosaria; Zuchegna, Candida; Romano, Antonella; Chiariotti, Lorenzo; Muller, Mark T.; Gottesman, Max E.; Porcellini, Antonio; Avvedimento, Enrico V.
2017-01-01
Genome-wide methylation analysis is limited by its low coverage and the inability to detect single variants below 10%. Quantitative analysis provides accurate information on the extent of methylation of single CpG dinucleotide, but it does not measure the actual polymorphism of the methylation profiles of single molecules. To understand the polymorphism of DNA methylation and to decode the methylation signatures before and after DNA damage and repair, we have deep sequenced in bisulfite-treated DNA a reporter gene undergoing site-specific DNA damage and homologous repair. In this paper, we provide information on the data generation, the rationale for the experiments and the type of assays used, such as cytofluorimetry and immunoblot data derived during a previous work published in Scientific Reports, describing the methylation and expression changes of a model gene (GFP) before and after formation of a double-strand break and repair by homologous-recombination or non-homologous-end-joining. These data provide: 1) a reference for the analysis of methylation polymorphism at selected loci in complex cell populations; 2) a platform and the tools to compare transcription and methylation profiles. PMID:28398335
Tang, Yidan; Lu, Baiyang; Zhu, Zhentong; Li, Bingling
2018-01-21
The polymerase chain reaction and many isothermal amplifications are able to achieve super gene amplification. Unfortunately, most commonly-used transduction methods, such as dye staining and Taqman-like probing, still suffer from shortcomings including false signals or difficult probe design, or are incompatible with multi-analysis. Here a universal and rational gene detection strategy has been established by translating isothermal amplicons to enzyme-free strand displacement circuits via three-way junction-based remote transduction. An assistant transduction probe was imported to form a partial hybrid with the target single-stranded nucleic acid. After systematic optimization the hybrid could serve as an associative trigger to activate a downstream circuit detector via a strand displacement reaction across the three-way junction. By doing so, the detection selectivity can be double-guaranteed through both amplicon-transducer recognition and the amplicon-circuit reaction. A well-optimized circuit can be immediately applied to a new target detection through simply displacing only 10-12 nt on only one component, according to the target. More importantly, this property for the first time enables multi-analysis and logic-analysis in a single reaction, sharing a single fluorescence reporter. In an applicable model, trace amounts of Cronobacter and Enterobacteria genes have been clearly distinguished from samples with no bacteria or one bacterium, with ultra-high sensitivity and selectivity.
Cohen, M M
1989-12-01
The role of chance using a stochastic single gene model has been shown to generate a continuous liability curve resembling that obtained from a multifactorial threshold model. Segregation of some malformations may be explained by a single defective gene that predisposes to, but does not necessarily result in, the malformation. Low penetrance and remarkably variable expressivity that characterize a number of presumed autosomal dominant malformation syndromes are possibly reflections of specific stochastic influences that are intrinsic to the embryonic process itself. Gene analysis is discussed and illustrated. Using polymorphic DNA probes to study cleft palate and ankyloglossia in males and ankyloglossia only in females in a large Icelandic family, the responsible gene was found to be located on the long arm of the X chromosome in the Xq21.1 region. In addition to gene analysis, some of the implications of transgenic analysis using mice are discussed. Among disorders of collagen metabolism, both the osteogenesis imperfectas and the Ehlers-Danlos syndromes are shown to represent genetically heterogeneous groups of connective tissue disorders. The days of thinking about osteogenesis imperfecta as one disorder and the Ehlers-Danlos syndrome as another are a thing of the past; persistence of such thinking is erroneous and misleading. Of the many disorders affecting bone mineral, the complexities of hypophosphatasia and pseudohypoparathyroidism are singled out for discussion. For lysosomal storage disorders, an overview of the mucopolysaccharidoses is provided. Finally, the recently delineated peroxisomal disorders--hyperpipecolic acidemia, rhizomelic chondrodysplasia, neonatal adrenoleukodystrophy, Zellweger syndrome, and infantile Refsum disease--are known to share a distinctive biochemical phenotype, although fibroblast complementation analysis suggests that some of these disorders are etiologically distinct.
Genetic Association of MPPED2 and ACTN2 with Dental Caries
Stanley, B.O.C.; Feingold, E.; Cooper, M.; Vanyukov, M.M.; Maher, B.S.; Slayton, R.L.; Willing, M.C.; Reis, S.E.; McNeil, D.W.; Crout, R.J.; Weyant, R.J.; Levy, S.M.; Vieira, A.R.; Marazita, M.L.; Shaffer, J.R.
2014-01-01
The first genome-wide association study of dental caries focused on primary teeth in children aged 3 to 12 yr and nominated several novel genes: ACTN2, EDARADD, EPHA7, LPO, MPPED2, MTR, and ZMPSTE24. Here we interrogated 156 single-nucleotide polymorphisms (SNPs) within these candidate genes for evidence of association with dental caries experience in 13 race- and age-stratified samples from 6 independent studies (n = 3600). Analysis was performed separately for each sample, and results were combined across samples via meta-analysis. MPPED2 was significantly associated with caries via meta-analysis across the 5 childhood samples, with 4 SNPs showing significant associations after gene-wise adjustment for multiple comparisons (p < .0026). These results corroborate the previous genome-wide association study, although the functional role of MPPED2 in caries etiology remains unknown. ACTN2 also showed significant association via meta-analysis across childhood samples (p = .0014). Moreover, in adults, genetic association was observed for ACTN2 SNPs in individual samples (p < .0025), but no single SNP was significant via meta-analysis across all 8 adult samples. Given its compelling biological role in organizing ameloblasts during amelogenesis, this study strengthens the hypothesis that ACTN2 influences caries risk. Results for the other candidate genes neither proved nor precluded their associations with dental caries. PMID:24810274
Gardeux, Vincent; David, Fabrice P. A.; Shajkofci, Adrian; Schwalie, Petra C.; Deplancke, Bart
2017-01-01
Abstract Motivation Single-cell RNA-sequencing (scRNA-seq) allows whole transcriptome profiling of thousands of individual cells, enabling the molecular exploration of tissues at the cellular level. Such analytical capacity is of great interest to many research groups in the world, yet these groups often lack the expertise to handle complex scRNA-seq datasets. Results We developed a fully integrated, web-based platform aimed at the complete analysis of scRNA-seq data post genome alignment: from the parsing, filtering and normalization of the input count data files, to the visual representation of the data, identification of cell clusters, differentially expressed genes (including cluster-specific marker genes), and functional gene set enrichment. This Automated Single-cell Analysis Pipeline (ASAP) combines a wide range of commonly used algorithms with sophisticated visualization tools. Compared with existing scRNA-seq analysis platforms, researchers (including those lacking computational expertise) are able to interact with the data in a straightforward fashion and in real time. Furthermore, given the overlap between scRNA-seq and bulk RNA-seq analysis workflows, ASAP should conceptually be broadly applicable to any RNA-seq dataset. As a validation, we demonstrate how we can use ASAP to simply reproduce the results from a single-cell study of 91 mouse cells involving five distinct cell types. Availability and implementation The tool is freely available at asap.epfl.ch and R/Python scripts are available at github.com/DeplanckeLab/ASAP. Contact bart.deplancke@epfl.ch Supplementary information Supplementary data are available at Bioinformatics online. PMID:28541377
Gardeux, Vincent; David, Fabrice P A; Shajkofci, Adrian; Schwalie, Petra C; Deplancke, Bart
2017-10-01
Single-cell RNA-sequencing (scRNA-seq) allows whole transcriptome profiling of thousands of individual cells, enabling the molecular exploration of tissues at the cellular level. Such analytical capacity is of great interest to many research groups in the world, yet these groups often lack the expertise to handle complex scRNA-seq datasets. We developed a fully integrated, web-based platform aimed at the complete analysis of scRNA-seq data post genome alignment: from the parsing, filtering and normalization of the input count data files, to the visual representation of the data, identification of cell clusters, differentially expressed genes (including cluster-specific marker genes), and functional gene set enrichment. This Automated Single-cell Analysis Pipeline (ASAP) combines a wide range of commonly used algorithms with sophisticated visualization tools. Compared with existing scRNA-seq analysis platforms, researchers (including those lacking computational expertise) are able to interact with the data in a straightforward fashion and in real time. Furthermore, given the overlap between scRNA-seq and bulk RNA-seq analysis workflows, ASAP should conceptually be broadly applicable to any RNA-seq dataset. As a validation, we demonstrate how we can use ASAP to simply reproduce the results from a single-cell study of 91 mouse cells involving five distinct cell types. The tool is freely available at asap.epfl.ch and R/Python scripts are available at github.com/DeplanckeLab/ASAP. bart.deplancke@epfl.ch. Supplementary data are available at Bioinformatics online. © The Author(s) 2017. Published by Oxford University Press.
Recombining overlapping BACs into a single larger BAC.
Kotzamanis, George; Huxley, Clare
2004-01-06
BAC clones containing entire mammalian genes including all the transcribed region and long range controlling elements are very useful for functional analysis. Sequenced BACs are available for most of the human and mouse genomes and in many cases these contain intact genes. However, large genes often span more than one BAC, and single BACs covering the entire region of interest are not available. Here we describe a system for linking two or more overlapping BACs into a single clone by homologous recombination. The method was used to link a 61-kb insert carrying the final 5 exons of the human CFTR gene onto a 160-kb BAC carrying the first 22 exons. Two rounds of homologous recombination were carried out in the EL350 strain of bacteria which can be induced for the Red genes. In the first round, the inserts of the two overlapping BACs were subcloned into modified BAC vectors using homologous recombination. In the second round, the BAC to be added was linearised with the very rare-cutting enzyme I-PpoI and electroporated into recombination efficient EL350 bacteria carrying the other BAC. Recombined BACs were identified by antibiotic selection and PCR screening and 10% of clones contained the correctly recombined 220-kb BAC. The system can be used to link the inserts from any overlapping BAC or PAC clones. The original orientation of the inserts is not important and desired regions of the inserts can be selected. The size limit for the fragments recombined may be larger than the 61 kb used here and multiple BACs in a contig could be combined by alternating use of the two pBACLink vectors. This system should be of use to many investigators wishing to carry out functional analysis on large mammalian genes which are not available in single BAC clones.
Vojinovic, Dina; Brison, Nathalie; Ahmad, Shahzad; Noens, Ilse; Pappa, Irene; Karssen, Lennart C; Tiemeier, Henning; van Duijn, Cornelia M; Peeters, Hilde; Amin, Najaf
2017-08-01
Autism spectrum disorder (ASD) is a highly heritable neurodevelopmental disorder with a complex genetic architecture. To identify genetic variants underlying ASD, we performed single-variant and gene-based genome-wide association studies using a dense genotyping array containing over 2.3 million single-nucleotide variants in a discovery sample of 160 families with at least one child affected with non-syndromic ASD using a binary (ASD yes/no) phenotype and a quantitative autistic trait. Replication of the top findings was performed in Psychiatric Genomics Consortium and Erasmus Rucphen Family (ERF) cohort study. Significant association of quantitative autistic trait was observed with the TTC25 gene at 17q21.2 (effect size=10.2, P-value=3.4 × 10 -7 ) in the gene-based analysis. The gene also showed nominally significant association in the cohort-based ERF study (effect=1.75, P-value=0.05). Meta-analysis of discovery and replication improved the association signal (P-value meta =1.5 × 10 -8 ). No genome-wide significant signal was observed in the single-variant analysis of either the binary ASD phenotype or the quantitative autistic trait. Our study has identified a novel gene TTC25 to be associated with quantitative autistic trait in patients with ASD. The replication of association in a cohort-based study and the effect estimate suggest that variants in TTC25 may also be relevant for broader ASD phenotype in the general population. TTC25 is overexpressed in frontal cortex and testis and is known to be involved in cilium movement and thus an interesting candidate gene for autistic trait.
Estimating intrinsic and extrinsic noise from single-cell gene expression measurements
Fu, Audrey Qiuyan; Pachter, Lior
2017-01-01
Gene expression is stochastic and displays variation (“noise”) both within and between cells. Intracellular (intrinsic) variance can be distinguished from extracellular (extrinsic) variance by applying the law of total variance to data from two-reporter assays that probe expression of identically regulated gene pairs in single cells. We examine established formulas [Elowitz, M. B., A. J. Levine, E. D. Siggia and P. S. Swain (2002): “Stochastic gene expression in a single cell,” Science, 297, 1183–1186.] for the estimation of intrinsic and extrinsic noise and provide interpretations of them in terms of a hierarchical model. This allows us to derive alternative estimators that minimize bias or mean squared error. We provide a geometric interpretation of these results that clarifies the interpretation in [Elowitz, M. B., A. J. Levine, E. D. Siggia and P. S. Swain (2002): “Stochastic gene expression in a single cell,” Science, 297, 1183–1186.]. We also demonstrate through simulation and re-analysis of published data that the distribution assumptions underlying the hierarchical model have to be satisfied for the estimators to produce sensible results, which highlights the importance of normalization. PMID:27875323
Fu, Wei; Xie, Wen; Zhang, Zhuo; Wang, Shaoli; Wu, Qingjun; Liu, Yong; Zhou, Xiaomao; Zhou, Xuguo; Zhang, Youjun
2013-01-01
Abstract: Quantitative real-time PCR (qRT-PCR), a primary tool in gene expression analysis, requires an appropriate normalization strategy to control for variation among samples. The best option is to compare the mRNA level of a target gene with that of reference gene(s) whose expression level is stable across various experimental conditions. In this study, expression profiles of eight candidate reference genes from the diamondback moth, Plutella xylostella, were evaluated under diverse experimental conditions. RefFinder, a web-based analysis tool, integrates four major computational programs including geNorm, Normfinder, BestKeeper, and the comparative ΔCt method to comprehensively rank the tested candidate genes. Elongation factor 1 (EF1) was the most suited reference gene for the biotic factors (development stage, tissue, and strain). In contrast, although appropriate reference gene(s) do exist for several abiotic factors (temperature, photoperiod, insecticide, and mechanical injury), we were not able to identify a single universal reference gene. Nevertheless, a suite of candidate reference genes were specifically recommended for selected experimental conditions. Our finding is the first step toward establishing a standardized qRT-PCR analysis of this agriculturally important insect pest. PMID:23983612
Birla, Bhagyashree S; Chou, Hui-Hsien
2015-01-01
Gene synthesis is frequently used in modern molecular biology research either to create novel genes or to obtain natural genes when the synthesis approach is more flexible and reliable than cloning. DNA chemical synthesis has limits on both its length and yield, thus full-length genes have to be hierarchically constructed from synthesized DNA fragments. Gibson Assembly and its derivatives are the simplest methods to assemble multiple double-stranded DNA fragments. Currently, up to 12 dsDNA fragments can be assembled at once with Gibson Assembly according to its vendor. In practice, the number of dsDNA fragments that can be assembled in a single reaction are much lower. We have developed a rational design method for gene construction that allows high-number dsDNA fragments to be assembled into full-length genes in a single reaction. Using this new design method and a modified version of the Gibson Assembly protocol, we have assembled 3 different genes from up to 45 dsDNA fragments at once. Our design method uses the thermodynamic analysis software Picky that identifies all unique junctions in a gene where consecutive DNA fragments are specifically made to connect to each other. Our novel method is generally applicable to most gene sequences, and can improve both the efficiency and cost of gene assembly.
Gene- and pathway-based association tests for multiple traits with GWAS summary statistics.
Kwak, Il-Youp; Pan, Wei
2017-01-01
To identify novel genetic variants associated with complex traits and to shed new insights on underlying biology, in addition to the most popular single SNP-single trait association analysis, it would be useful to explore multiple correlated (intermediate) traits at the gene- or pathway-level by mining existing single GWAS or meta-analyzed GWAS data. For this purpose, we present an adaptive gene-based test and a pathway-based test for association analysis of multiple traits with GWAS summary statistics. The proposed tests are adaptive at both the SNP- and trait-levels; that is, they account for possibly varying association patterns (e.g. signal sparsity levels) across SNPs and traits, thus maintaining high power across a wide range of situations. Furthermore, the proposed methods are general: they can be applied to mixed types of traits, and to Z-statistics or P-values as summary statistics obtained from either a single GWAS or a meta-analysis of multiple GWAS. Our numerical studies with simulated and real data demonstrated the promising performance of the proposed methods. The methods are implemented in R package aSPU, freely and publicly available at: https://cran.r-project.org/web/packages/aSPU/ CONTACT: weip@biostat.umn.eduSupplementary information: Supplementary data are available at Bioinformatics online. © The Author 2016. Published by Oxford University Press. All rights reserved. For Permissions, please e-mail: journals.permissions@oup.com.
Vivar, Juan C.; Sarzynski, Mark A.; Sung, Yun Ju; Timmons, James A.; Bouchard, Claude; Rankinen, Tuomo
2013-01-01
We previously reported the findings from a genome-wide association study of the response of maximal oxygen uptake (V̇o2max) to an exercise program. Here we follow up on these results to generate hypotheses on genes, pathways, and systems involved in the ability to respond to exercise training. A systems biology approach can help us better establish a comprehensive physiological description of what underlies V̇o2maxtrainability. The primary material for this exploration was the individual single-nucleotide polymorphism (SNP), SNP-gene mapping, and statistical significance levels. We aimed to generate novel hypotheses through analyses that go beyond statistical association of single-locus markers. This was accomplished through three complementary approaches: 1) building de novo evidence of gene candidacy through informatics-driven literature mining; 2) aggregating evidence from statistical associations to link variant enrichment in biological pathways to V̇o2max trainability; and 3) predicting possible consequences of variants residing in the pathways of interest. We started with candidate gene prioritization followed by pathway analysis focused on overrepresentation analysis and gene set enrichment analysis. Subsequently, leads were followed using in silico analysis of predicted SNP functions. Pathways related to cellular energetics (pantothenate and CoA biosynthesis; PPAR signaling) and immune functions (complement and coagulation cascades) had the highest levels of SNP burden. In particular, long-chain fatty acid transport and fatty acid oxidation genes and sequence variants were found to influence differences in V̇o2max trainability. Together, these methods allow for the hypothesis-driven ranking and prioritization of genes and pathways for future experimental testing and validation. PMID:23990238
Genetic association of SNPs in the FTO gene and predisposition to obesity in Malaysian Malays.
Apalasamy, Y D; Ming, M F; Rampal, S; Bulgiba, A; Mohamed, Z
2012-12-01
The common variants in the fat mass- and obesity-associated (FTO) gene have been previously found to be associated with obesity in various adult populations. The objective of the present study was to investigate whether the single nucleotide polymorphisms (SNPs) and linkage disequilibrium (LD) blocks in various regions of the FTO gene are associated with predisposition to obesity in Malaysian Malays. Thirty-one FTO SNPs were genotyped in 587 (158 obese and 429 non-obese) Malaysian Malay subjects. Obesity traits and lipid profiles were measured and single-marker association testing, LD testing, and haplotype association analysis were performed. LD analysis of the FTO SNPs revealed the presence of 57 regions with complete LD (D' = 1.0). In addition, we detected the association of rs17817288 with low-density lipoprotein cholesterol. The FTO gene may therefore be involved in lipid metabolism in Malaysian Malays. Two haplotype blocks were present in this region of the FTO gene, but no particular haplotype was found to be significantly associated with an increased risk of obesity in Malaysian Malays.
Li, Chun-Xiao; Jiang, Mei-Shan; Chen, Shi-Yi; Lai, Song-Jia
2008-07-01
Single nucleotide polymorphism (SNP) in exon 1 and 3 of fibroblast growth factor (FGF5) gene was studied by DNA sequencing in Yingjing angora rabbit, Tianfu black rabbit and California rabbit. A frameshift mutation (TCT insert) at base position 217 (site A) of exon 1 and a T/C missense mutation at base position 59 (site B) of exon 3 were found in Yingjing angora rabbit with a high frequency; a T/C same-sense mutation at base position 3 (site C) of exon 3 was found with similar frequency in three rabbit breeds. Least square analysis showed that different genotypes had no significant association with wool yield in site A, and had high significant association with wool yield in site B (P<0.01) and significant association with wool yield in site C (P<0.05). It was concluded from the results that FGF5 gene could be the potential major gene affecting wool yield or link with the major gene, and polymorphic loci B and C may be used as molecular markers for im-proving wool yield in angora rabbits.
Genetic association of SNPs in the FTO gene and predisposition to obesity in Malaysian Malays
Apalasamy, Y.D.; Ming, M.F.; Rampal, S.; Bulgiba, A.; Mohamed, Z.
2012-01-01
The common variants in the fat mass- and obesity-associated (FTO) gene have been previously found to be associated with obesity in various adult populations. The objective of the present study was to investigate whether the single nucleotide polymorphisms (SNPs) and linkage disequilibrium (LD) blocks in various regions of the FTO gene are associated with predisposition to obesity in Malaysian Malays. Thirty-one FTO SNPs were genotyped in 587 (158 obese and 429 non-obese) Malaysian Malay subjects. Obesity traits and lipid profiles were measured and single-marker association testing, LD testing, and haplotype association analysis were performed. LD analysis of the FTO SNPs revealed the presence of 57 regions with complete LD (D' = 1.0). In addition, we detected the association of rs17817288 with low-density lipoprotein cholesterol. The FTO gene may therefore be involved in lipid metabolism in Malaysian Malays. Two haplotype blocks were present in this region of the FTO gene, but no particular haplotype was found to be significantly associated with an increased risk of obesity in Malaysian Malays. PMID:22911346
Zhang, Xiaoshuai; Yang, Xiaowei; Yuan, Zhongshang; Liu, Yanxun; Li, Fangyu; Peng, Bin; Zhu, Dianwen; Zhao, Jinghua; Xue, Fuzhong
2013-01-01
For genome-wide association data analysis, two genes in any pathway, two SNPs in the two linked gene regions respectively or in the two linked exons respectively within one gene are often correlated with each other. We therefore proposed the concept of gene-gene co-association, which refers to the effects not only due to the traditional interaction under nearly independent condition but the correlation between two genes. Furthermore, we constructed a novel statistic for detecting gene-gene co-association based on Partial Least Squares Path Modeling (PLSPM). Through simulation, the relationship between traditional interaction and co-association was highlighted under three different types of co-association. Both simulation and real data analysis demonstrated that the proposed PLSPM-based statistic has better performance than single SNP-based logistic model, PCA-based logistic model, and other gene-based methods. PMID:23620809
Zhang, Xiaoshuai; Yang, Xiaowei; Yuan, Zhongshang; Liu, Yanxun; Li, Fangyu; Peng, Bin; Zhu, Dianwen; Zhao, Jinghua; Xue, Fuzhong
2013-01-01
For genome-wide association data analysis, two genes in any pathway, two SNPs in the two linked gene regions respectively or in the two linked exons respectively within one gene are often correlated with each other. We therefore proposed the concept of gene-gene co-association, which refers to the effects not only due to the traditional interaction under nearly independent condition but the correlation between two genes. Furthermore, we constructed a novel statistic for detecting gene-gene co-association based on Partial Least Squares Path Modeling (PLSPM). Through simulation, the relationship between traditional interaction and co-association was highlighted under three different types of co-association. Both simulation and real data analysis demonstrated that the proposed PLSPM-based statistic has better performance than single SNP-based logistic model, PCA-based logistic model, and other gene-based methods.
A Multiomics Approach to Identify Genes Associated with Childhood Asthma Risk and Morbidity.
Forno, Erick; Wang, Ting; Yan, Qi; Brehm, John; Acosta-Perez, Edna; Colon-Semidey, Angel; Alvarez, Maria; Boutaoui, Nadia; Cloutier, Michelle M; Alcorn, John F; Canino, Glorisa; Chen, Wei; Celedón, Juan C
2017-10-01
Childhood asthma is a complex disease. In this study, we aim to identify genes associated with childhood asthma through a multiomics "vertical" approach that integrates multiple analytical steps using linear and logistic regression models. In a case-control study of childhood asthma in Puerto Ricans (n = 1,127), we used adjusted linear or logistic regression models to evaluate associations between several analytical steps of omics data, including genome-wide (GW) genotype data, GW methylation, GW expression profiling, cytokine levels, asthma-intermediate phenotypes, and asthma status. At each point, only the top genes/single-nucleotide polymorphisms/probes/cytokines were carried forward for subsequent analysis. In step 1, asthma modified the gene expression-protein level association for 1,645 genes; pathway analysis showed an enrichment of these genes in the cytokine signaling system (n = 269 genes). In steps 2-3, expression levels of 40 genes were associated with intermediate phenotypes (asthma onset age, forced expiratory volume in 1 second, exacerbations, eosinophil counts, and skin test reactivity); of those, methylation of seven genes was also associated with asthma. Of these seven candidate genes, IL5RA was also significant in analytical steps 4-8. We then measured plasma IL-5 receptor α levels, which were associated with asthma age of onset and moderate-severe exacerbations. In addition, in silico database analysis showed that several of our identified IL5RA single-nucleotide polymorphisms are associated with transcription factors related to asthma and atopy. This approach integrates several analytical steps and is able to identify biologically relevant asthma-related genes, such as IL5RA. It differs from other methods that rely on complex statistical models with various assumptions.
Genomic organization of the rat alpha 2u-globulin gene cluster.
McFadyen, D A; Addison, W; Locke, J
1999-05-01
The alpha 2u-globulin are a group of similar proteins, belonging to the lipocalin superfamily of proteins, that are synthesized in a subset of secretory tissues in rats. The many alpha 2u-globulin isoforms are encoded by a multigene family that exhibits extensive homology. Despite a high degree of sequence identity, individual family members show diverse expression patterns involving complex hormonal, tissue-specific, and developmental regulation. Analysis suggests that there are approximately 20 alpha 2u-globulin genes in the rat genome. We have used fluorescence in situ hybridization (FISH) to show that the alpha 2u-globulin genes are clustered at a single site on rat Chromosome (Chr) 5 (5q22-24). Southern blots of rat genomic DNA separated by pulsed field gel electrophoresis indicated that the alpha 2u-globulin genes are contained on two NruI fragments with a total size of 880 kbp. Analysis of three P1 clones containing alpha 2u-globulin genes indicated that the alpha 2u-globulin genes are tandemly arranged in a head-to-tail fashion. The organization of the alpha 2u-globulin genes in the rat as a tandem array of single genes differs from the homologous major urinary protein genes in the mouse, which are organized as tandem arrays of divergently oriented gene pairs. The structure of these gene clusters may have consequences for the proposed function, as a pheromone transporter, for the protein products encoded by these genes.
McCormick, Mark A.; Delaney, Joe R.; Tsuchiya, Mitsuhiro; Tsuchiyama, Scott; Shemorry, Anna; Sim, Sylvia; Chou, Annie Chia-Zong; Ahmed, Umema; Carr, Daniel; Murakami, Christopher J.; Schleit, Jennifer; Sutphin, George L.; Wasko, Brian M.; Bennett, Christopher F.; Wang, Adrienne M.; Olsen, Brady; Beyer, Richard P.; Bammler, Theodor K.; Prunkard, Donna; Johnson, Simon C.; Pennypacker, Juniper K.; An, Elroy; Anies, Arieanna; Castanza, Anthony S.; Choi, Eunice; Dang, Nick; Enerio, Shiena; Fletcher, Marissa; Fox, Lindsay; Goswami, Sarani; Higgins, Sean A.; Holmberg, Molly A.; Hu, Di; Hui, Jessica; Jelic, Monika; Jeong, Ki-Soo; Johnston, Elijah; Kerr, Emily O.; Kim, Jin; Kim, Diana; Kirkland, Katie; Klum, Shannon; Kotireddy, Soumya; Liao, Eric; Lim, Michael; Lin, Michael S.; Lo, Winston C.; Lockshon, Dan; Miller, Hillary A.; Moller, Richard M.; Muller, Brian; Oakes, Jonathan; Pak, Diana N.; Peng, Zhao Jun; Pham, Kim M.; Pollard, Tom G.; Pradeep, Prarthana; Pruett, Dillon; Rai, Dilreet; Robison, Brett; Rodriguez, Ariana A.; Ros, Bopharoth; Sage, Michael; Singh, Manpreet K.; Smith, Erica D.; Snead, Katie; Solanky, Amrita; Spector, Benjamin L.; Steffen, Kristan K.; Tchao, Bie Nga; Ting, Marc K.; Wende, Helen Vander; Wang, Dennis; Welton, K. Linnea; Westman, Eric A.; Brem, Rachel B.; Liu, Xin-guang; Suh, Yousin; Zhou, Zhongjun; Kaeberlein, Matt; Kennedy, Brian K.
2015-01-01
SUMMARY Many genes that affect replicative lifespan (RLS) in the budding yeast Saccharomyces cerevisiae also affect aging in other organisms such as C. elegans and M. musculus. We performed a systematic analysis of yeast RLS in a set of 4,698 viable single-gene deletion strains. Multiple functional gene clusters were identified, and full genome-to-genome comparison demonstrated a significant conservation in longevity pathways between yeast and C. elegans. Among the mechanisms of aging identified, deletion of tRNA exporter LOS1 robustly extended lifespan. Dietary restriction (DR) and inhibition of mechanistic Target of Rapamycin (mTOR) exclude Los1 from the nucleus in a Rad53-dependent manner. Moreover, lifespan extension from deletion of LOS1 is non-additive with DR or mTOR inhibition, and results in Gcn4 transcription factor activation. Thus, the DNA damage response and mTOR converge on Los1-mediated nuclear tRNA export to regulate Gcn4 activity and aging. PMID:26456335
[Association between single-nucleotide polymorphisms in the IRAK-4 gene and allergic rhinitis].
Zhang, Yuan; Xi, Lin; Zhao, Yan-ming; Zhao, Li-ping; Zhang, Luo
2012-06-01
To investigate the genetic association pattern between single-nucleotide polymorphisms (SNP) in the interleukin-1 receptor-associated kinase 4 (IRAK-4) gene and allergic rhinitis (AR). A population of 379 patients with the diagnosis of AR and 333 healthy controls who lived in Beijing region was recruited. A total of 8 reprehensive marker SNP which were in IRAK-4 gene region were selected according to the Beijing people database from Hapmap website. The individual genotyping was performed by MassARRAY platform. SPSS 13.0 software was used for statistic analysis. Subgroup analysis for the presence of different allergen sensitivities displayed associations only in the house dust mite-allergic cohorts (rs3794262: P = 0.0034, OR = 1.7388; rs4251481: P = 0.0023, OR = 2.6593), but not in subjects who were allergic to pollens as well as mix allergens. The potential genetic contribution of the IRAK-4 gene to AR demonstrated an allergen-dependant association pattern in Chinese population.
Measuring single-cell gene expression dynamics in bacteria using fluorescence time-lapse microscopy
Young, Jonathan W; Locke, James C W; Altinok, Alphan; Rosenfeld, Nitzan; Bacarian, Tigran; Swain, Peter S; Mjolsness, Eric; Elowitz, Michael B
2014-01-01
Quantitative single-cell time-lapse microscopy is a powerful method for analyzing gene circuit dynamics and heterogeneous cell behavior. We describe the application of this method to imaging bacteria by using an automated microscopy system. This protocol has been used to analyze sporulation and competence differentiation in Bacillus subtilis, and to quantify gene regulation and its fluctuations in individual Escherichia coli cells. The protocol involves seeding and growing bacteria on small agarose pads and imaging the resulting microcolonies. Images are then reviewed and analyzed using our laboratory's custom MATLAB analysis code, which segments and tracks cells in a frame-to-frame method. This process yields quantitative expression data on cell lineages, which can illustrate dynamic expression profiles and facilitate mathematical models of gene circuits. With fast-growing bacteria, such as E. coli or B. subtilis, image acquisition can be completed in 1 d, with an additional 1–2 d for progressing through the analysis procedure. PMID:22179594
The complete chloroplast genome of a medicinal plant Epimedium koreanum Nakai (Berberidaceae).
Lee, Jung-Hoon; Kim, Kyunghee; Kim, Na-Rae; Lee, Sang-Choon; Yang, Tae-Jin; Kim, Young-Dong
2016-11-01
Epimedium koreanum is a perennial medicinal plant distributed in Eastern Asia. The complete chloroplast genome sequences of E. koreanum was obtained by de novo assembly using whole genome next-generation sequences. The chloroplast genome of E. koreanum was 157 218 bp in length and separated into four distinct regions such as large single copy region (89 600 bp), small single copy region (17 222 bp) and a pair of inverted repeat regions (25 198 bp). The genome contained a total of 112 genes including 78 protein-coding genes, 30 tRNA genes, and 4 rRNA genes. Phylogenetic analysis with the reported chloroplast genomes revealed that E. koreanum is most closely related to Berberis bealei, a traditional medicinal plant in the Berberidaceae family.
The complete chloroplast genome of salt cress (Eutrema salsugineum).
Guo, Xinyi; Hao, Guoqian; Ma, Tao
2016-07-01
The complete chloroplast (cp) sequence of the salt cress (Eutrema salsugineum), a plant well-adapted to salt stress, was presented in this study. The circular molecule is 153,407 bp in length and exhibit a typical quadripartite structure containing an 83,894 bp large single copy (LSC) region, a 17,607 bp small single copy (SSC) region, and the two 25,953 bp inverted repeats (IRs). The salt cress cp genome contains 135 known genes, including 87 protein-coding genes, 8 ribosomal RNA genes, and 40 tRNA genes; 21 of these are located in the inverted repeat region. As expected, phylogenetic analysis support the idea that E. salsugineum is sister to Brassiceae species within the Brassicaceae family.
Hahntow, Ines N; Mairuhu, Gideon; van Valkengoed, Irene Gm; Koopmans, Richard P; Michel, Martin C
2010-06-02
Genotype-phenotype association studies are typically based upon polymorphisms or haplotypes comprised of multiple polymorphisms within a single gene. It has been proposed that combinations of polymorphisms in distinct genes, which functionally impact the same phenotype, may have stronger phenotype associations than those within a single gene. We have tested this hypothesis using genes encoding components of the renin-angiotensin-aldosterone system and the high blood pressure phenotype. Our analysis is based on 1379 participants of the cross-sectional SUNSET study randomly selected from the population register of Amsterdam. Each subject was genotyped for the angiotensinogen M235T, the angiotensin-converting enzyme insertion/deletion and the angiotensin II type 1 receptor A1166C polymorphism. The phenotype high blood pressure was defined either as a categorical variable comparing hypertension versus normotension as in most previous studies or as a continuous variable using systolic, diastolic and mean blood pressure in a multiple regression analysis with gender, ethnicity, age, body-mass-index and antihypertensive medication as covariates. Genotype-phenotype relationships were explored for each polymorphism in isolation and for double and triple polymorphism combinations. At the single polymorphism level, only the A allele of the angiotensin II type 1 receptor was associated with a high blood pressure phenotype. Using combinations of polymorphisms of two or all three genes did not yield stronger/more consistent associations. We conclude that combinations of physiologically related polymorphisms of multiple genes, at least with regard to the renin-angiotensin-aldosterone system and the hypertensive phenotype, do not necessarily offer additional benefit in analyzing genotype/phenotype associations.
[Progress in porky genes and transcriptome and discussion of relative issues].
Zhu, Meng-Jin; Liu, Bang; Li, Kui
2005-01-01
To date, research on molecular base of porky molecular development was mainly involved in muscle growth and meat quality. Some functional genes including Hal gene and RN gene and some QTLs controlling or associated with porky growth and quality were detected through candidate gene approach and genome-wide scanning. Genic transcriptome pertinent to porcine muscle and adipose also came into study. At the same time, these researches have befallen some shortcomings to some extent. Research from molecular quantitative genetics showed shortcomings that single gene was devilishly emphasized and co-expression pattern of multi-genes was ignored. Research applying transcriptome analysis tool also met two of limitations, one was the singleness of type of molecular experimental techniques, and another was that genes of muscle and adipose were artificially divided into unattached two parts. Thus, porky genes were explored by parallel genetics based on systemic views and techniques to specially reveal the interactional mechanism of porky genes respectively controlling muscle and adipose, which would be important issues of genes and genome researches on porky development in the near future.
Tembrock, Luke R.; Zheng, Shaoyu; Wu, Zhiqiang
2018-01-01
Qat (Catha edulis, Celastraceae) is a woody evergreen species with great economic and cultural importance. It is cultivated for its stimulant alkaloids cathine and cathinone in East Africa and southwest Arabia. However, genome information, especially DNA sequence resources, for C. edulis are limited, hindering studies regarding interspecific and intraspecific relationships. Herein, the complete chloroplast (cp) genome of Catha edulis is reported. This genome is 157,960 bp in length with 37% GC content and is structurally arranged into two 26,577 bp inverted repeats and two single-copy areas. The size of the small single-copy and the large single-copy regions were 18,491 bp and 86,315 bp, respectively. The C. edulis cp genome consists of 129 coding genes including 37 transfer RNA (tRNA) genes, 8 ribosomal RNA (rRNA) genes, and 84 protein coding genes. For those genes, 112 are single copy genes and 17 genes are duplicated in two inverted regions with seven tRNAs, four rRNAs, and six protein coding genes. The phylogenetic relationships resolved from the cp genome of qat and 32 other species confirms the monophyly of Celastraceae. The cp genomes of C. edulis, Euonymus japonicus and seven Celastraceae species lack the rps16 intron, which indicates an intron loss took place among an ancestor of this family. The cp genome of C. edulis provides a highly valuable genetic resource for further phylogenomic research, barcoding and cp transformation in Celastraceae. PMID:29425128
Comprehensive Analysis of Transcription Dynamics from Brain Samples Following Behavioral Experience
Turm, Hagit; Mukherjee, Diptendu; Haritan, Doron; Tahor, Maayan; Citri, Ami
2014-01-01
The encoding of experiences in the brain and the consolidation of long-term memories depend on gene transcription. Identifying the function of specific genes in encoding experience is one of the main objectives of molecular neuroscience. Furthermore, the functional association of defined genes with specific behaviors has implications for understanding the basis of neuropsychiatric disorders. Induction of robust transcription programs has been observed in the brains of mice following various behavioral manipulations. While some genetic elements are utilized recurrently following different behavioral manipulations and in different brain nuclei, transcriptional programs are overall unique to the inducing stimuli and the structure in which they are studied1,2. In this publication, a protocol is described for robust and comprehensive transcriptional profiling from brain nuclei of mice in response to behavioral manipulation. The protocol is demonstrated in the context of analysis of gene expression dynamics in the nucleus accumbens following acute cocaine experience. Subsequent to a defined in vivo experience, the target neural tissue is dissected; followed by RNA purification, reverse transcription and utilization of microfluidic arrays for comprehensive qPCR analysis of multiple target genes. This protocol is geared towards comprehensive analysis (addressing 50-500 genes) of limiting quantities of starting material, such as small brain samples or even single cells. The protocol is most advantageous for parallel analysis of multiple samples (e.g. single cells, dynamic analysis following pharmaceutical, viral or behavioral perturbations). However, the protocol could also serve for the characterization and quality assurance of samples prior to whole-genome studies by microarrays or RNAseq, as well as validation of data obtained from whole-genome studies. PMID:25225819
Gong, Bin-Sheng; Zhang, Qing-Pu; Zhang, Guang-Mei; Zhang, Shao-Jun; Zhang, Wei; Lv, Hong-Chao; Zhang, Fan; Lv, Sa-Li; Li, Chuan-Xing; Rao, Shao-Qi; Li, Xia
2007-01-01
Gene expression profiles and single-nucleotide polymorphism (SNP) profiles are modern data for genetic analysis. It is possible to use the two types of information to analyze the relationships among genes by some genetical genomics approaches. In this study, gene expression profiles were used as expression traits. And relationships among the genes, which were co-linked to a common SNP(s), were identified by integrating the two types of information. Further research on the co-expressions among the co-linked genes was carried out after the gene-SNP relationships were established using the Haseman-Elston sib-pair regression. The results showed that the co-expressions among the co-linked genes were significantly higher if the number of connections between the genes and a SNP(s) was more than six. Then, the genes were interconnected via one or more SNP co-linkers to construct a gene-SNP intermixed network. The genes sharing more SNPs tended to have a stronger correlation. Finally, a gene-gene network was constructed with their intensities of relationships (the number of SNP co-linkers shared) as the weights for the edges. PMID:18466544
D'Addabbo, Annarita; Palmieri, Orazio; Maglietta, Rosalia; Latiano, Anna; Mukherjee, Sayan; Annese, Vito; Ancona, Nicola
2011-08-01
A meta-analysis has re-analysed previous genome-wide association scanning definitively confirming eleven genes and further identifying 21 new loci. However, the identified genes/loci still explain only the minority of genetic predisposition of Crohn's disease. To identify genes weakly involved in disease predisposition by analysing chromosomal regions enriched of single nucleotide polymorphisms with modest statistical association. We utilized the WTCCC data set evaluating 1748 CD and 2938 controls. The identification of candidate genes/loci was performed by a two-step procedure: first of all chromosomal regions enriched of weak association signals were localized; subsequently, weak signals clustered in gene regions were identified. The statistical significance was assessed by non parametric permutation tests. The cytoband enrichment analysis highlighted 44 regions (P≤0.05) enriched with single nucleotide polymorphisms significantly associated with the trait including 23 out of 31 previously confirmed and replicated genes. Importantly, we highlight further 20 novel chromosomal regions carrying approximately one hundred genes/loci with modest association. Amongst these we find compelling functional candidate genes such as MAPT, GRB2 and CREM, LCT, and IL12RB2. Our study suggests a different statistical perspective to discover genes weakly associated with a given trait, although further confirmatory functional studies are needed. Copyright © 2011 Editrice Gastroenterologica Italiana S.r.l. All rights reserved.
[Identification and polymorphism of pectinase genes PGU in the Saccharomyces bayanus complex].
Shalamitskiy, M Yu; Naumov, G I
2016-05-01
Pectinase (endo-polygalacturonase) is the key enzyme splitting plant pectin. The corresponding single gene PGU1 is documented for the yeast S. cerevisiae. On the basis of phylogenetic analysis of the PGU nucleotide sequence available in the GenBank, a family of divergent PGU genes is found in the species complex S. bayanus: S. bayanus var. uvarum, S. eubayanus, and hybrid taxon S. pastorianus. The PGU genes have different chromosome localization.
Flajnik, Martin F; Tlapakova, Tereza; Criscitiello, Michael F; Krylov, Vladimir; Ohta, Yuko
2012-08-01
The B7 family of genes is essential in the regulation of the adaptive immune system. Most B7 family members contain both variable (V)- and constant (C)-type domains of the immunoglobulin superfamily (IgSF). Through in silico screening of the Xenopus genome and subsequent phylogenetic analysis, we found novel genes belonging to the B7 family, one of which is the recently discovered B7H6. Humans and rats have a single B7H6 gene; however, many B7H6 genes were detected in a single large cluster in the Xenopus genome. The B7H6 expression patterns also varied in a species-specific manner. Human B7H6 binds to the activating natural killer receptor, NKp30. While the NKp30 gene is single-copy and maps to the MHC in most vertebrates, many Xenopus NKp30 genes were found in a cluster on a separate chromosome that does not harbor the MHC. Indeed, in all species so far analyzed from sharks to mammals, the number of NKp30 and B7H6 genes correlates well, suggestive of receptor-ligand co-evolution. Furthermore, we identified a Xenopus-specific B7 homolog (B7HXen) and revealed its close linkage to B2M, which we have demonstrated previously to have been originally encoded in the MHC. Thus, our study provides further proof that the B7 precursor was included in the proto MHC. Additionally, the comparative analysis revealed a new B7 family member, B7H7, which was previously designated in the literature as an unknown gene, HHLA2.
Gladka, Monika M; Molenaar, Bas; de Ruiter, Hesther; van der Elst, Stefan; Tsui, Hoyee; Versteeg, Danielle; Lacraz, Grègory P A; Huibers, Manon M H; van Oudenaarden, Alexander; van Rooij, Eva
2018-01-31
Background -Genome-wide transcriptome analysis has greatly advanced our understanding of the regulatory networks underlying basic cardiac biology and mechanisms driving disease. However, so far, the resolution of studying gene expression patterns in the adult heart has been limited to the level of extracts from whole tissues. The use of tissue homogenates inherently causes the loss of any information on cellular origin or cell type-specific changes in gene expression. Recent developments in RNA amplification strategies provide a unique opportunity to use small amounts of input RNA for genome-wide sequencing of single cells. Methods -Here, we present a method to obtain high quality RNA from digested cardiac tissue from adult mice for automated single-cell sequencing of both the healthy and diseased heart. Results -After optimization, we were able to perform single-cell sequencing on adult cardiac tissue under both homeostatic conditions and after ischemic injury. Clustering analysis based on differential gene expression unveiled known and novel markers of all main cardiac cell types. Based on differential gene expression we were also able to identify multiple subpopulations within a certain cell type. Furthermore, applying single-cell sequencing on both the healthy and the injured heart indicated the presence of disease-specific cell subpopulations. As such, we identified cytoskeleton associated protein 4 ( Ckap4 ) as a novel marker for activated fibroblasts that positively correlates with known myofibroblast markers in both mouse and human cardiac tissue. Ckap4 inhibition in activated fibroblasts treated with TGFβ triggered a greater increase in the expression of genes related to activated fibroblasts compared to control, suggesting a role of Ckap4 in modulating fibroblast activation in the injured heart. Conclusions -Single-cell sequencing on both the healthy and diseased adult heart allows us to study transcriptomic differences between cardiac cells, as well as cell type-specific changes in gene expression during cardiac disease. This new approach provides a wealth of novel insights into molecular changes that underlie the cellular processes relevant for cardiac biology and pathophysiology. Applying this technology could lead to the discovery of new therapeutic targets relevant for heart disease.
Quantification of multiple gene expression in individual cells.
Peixoto, António; Monteiro, Marta; Rocha, Benedita; Veiga-Fernandes, Henrique
2004-10-01
Quantitative gene expression analysis aims to define the gene expression patterns determining cell behavior. So far, these assessments can only be performed at the population level. Therefore, they determine the average gene expression within a population, overlooking possible cell-to-cell heterogeneity that could lead to different cell behaviors/cell fates. Understanding individual cell behavior requires multiple gene expression analyses of single cells, and may be fundamental for the understanding of all types of biological events and/or differentiation processes. We here describe a new reverse transcription-polymerase chain reaction (RT-PCR) approach allowing the simultaneous quantification of the expression of 20 genes in the same single cell. This method has broad application, in different species and any type of gene combination. RT efficiency is evaluated. Uniform and maximized amplification conditions for all genes are provided. Abundance relationships are maintained, allowing the precise quantification of the absolute number of mRNA molecules per cell, ranging from 2 to 1.28 x 10(9) for each individual gene. We evaluated the impact of this approach on functional genetic read-outs by studying an apparently homogeneous population (monoclonal T cells recovered 4 d after antigen stimulation), using either this method or conventional real-time RT-PCR. Single-cell studies revealed considerable cell-to-cell variation: All T cells did not express all individual genes. Gene coexpression patterns were very heterogeneous. mRNA copy numbers varied between different transcripts and in different cells. As a consequence, this single-cell assay introduces new and fundamental information regarding functional genomic read-outs. By comparison, we also show that conventional quantitative assays determining population averages supply insufficient information, and may even be highly misleading.
Xie, Jianbo; Shi, Haowen; Du, Zhenglin; Wang, Tianshu; Liu, Xiaomeng; Chen, Sanfeng
2016-01-01
Paenibacillus polymyxa has widely been studied as a model of plant-growth promoting rhizobacteria (PGPR). Here, the genome sequences of 9 P. polymyxa strains, together with 26 other sequenced Paenibacillus spp., were comparatively studied. Phylogenetic analysis of the concatenated 244 single-copy core genes suggests that the 9 P. polymyxa strains and 5 other Paenibacillus spp., isolated from diverse geographic regions and ecological niches, formed a closely related clade (here it is called Poly-clade). Analysis of single nucleotide polymorphisms (SNPs) reveals local diversification of the 14 Poly-clade genomes. SNPs were not evenly distributed throughout the 14 genomes and the regions with high SNP density contain the genes related to secondary metabolism, including genes coding for polyketide. Recombination played an important role in the genetic diversity of this clade, although the rate of recombination was clearly lower than mutation. Some genes relevant to plant-growth promoting traits, i.e. phosphate solubilization and IAA production, are well conserved, while some genes relevant to nitrogen fixation and antibiotics synthesis are evolved with diversity in this Poly-clade. This study reveals that both P. polymyxa and its closely related species have plant growth promoting traits and they have great potential uses in agriculture and horticulture as PGPR. PMID:26856413
Du, Qingzhang; Tian, Jiaxing; Yang, Xiaohui; Pan, Wei; Xu, Baohua; Li, Bailian; Ingvarsson, Pär K.; Zhang, Deqiang
2015-01-01
Economically important traits in many species generally show polygenic, quantitative inheritance. The components of genetic variation (additive, dominant and epistatic effects) of these traits conferred by multiple genes in shared biological pathways remain to be defined. Here, we investigated 11 full-length genes in cellulose biosynthesis, on 10 growth and wood-property traits, within a population of 460 unrelated Populus tomentosa individuals, via multi-gene association. To validate positive associations, we conducted single-marker analysis in a linkage population of 1,200 individuals. We identified 118, 121, and 43 associations (P< 0.01) corresponding to additive, dominant, and epistatic effects, respectively, with low to moderate proportions of phenotypic variance (R2). Epistatic interaction models uncovered a combination of three non-synonymous sites from three unique genes, representing a significant epistasis for diameter at breast height and stem volume. Single-marker analysis validated 61 associations (false discovery rate, Q ≤ 0.10), representing 38 SNPs from nine genes, and its average effect (R2 = 3.8%) nearly 2-fold higher than that identified with multi-gene association, suggesting that multi-gene association can capture smaller individual variants. Moreover, a structural gene–gene network based on tissue-specific transcript abundances provides a better understanding of the multi-gene pathway affecting tree growth and lignocellulose biosynthesis. Our study highlights the importance of pathway-based multiple gene associations to uncover the nature of genetic variance for quantitative traits and may drive novel progress in molecular breeding. PMID:25428896
Plessy, Charles; Desbois, Linda; Fujii, Teruo; Carninci, Piero
2013-02-01
Tissues contain complex populations of cells. Like countries, which are comprised of mixed populations of people, tissues are not homogeneous. Gene expression studies that analyze entire populations of cells from tissues as a mixture are blind to this diversity. Thus, critical information is lost when studying samples rich in specialized but diverse cells such as tumors, iPS colonies, or brain tissue. High throughput methods are needed to address, model and understand the constitutive and stochastic differences between individual cells. Here, we describe microfluidics technologies that utilize a combination of molecular biology and miniaturized labs on chips to study gene expression at the single cell level. We discuss how the characterization of the transcriptome of each cell in a sample will open a new field in gene expression analysis, population transcriptomics, that will change the academic and biomedical analysis of complex samples by defining them as quantified populations of single cells. Copyright © 2013 WILEY Periodicals, Inc.
One-Step and Stepwise Magnification of a BOBBED LETHAL Chromosome in DROSOPHILA MELANOGASTER
Endow, Sharyn A.; Komma, Donald J.
1986-01-01
Bobbed lethal (bbl) chromosomes carry too few ribosomal genes for homozygous flies to be viable. Reversion of bbl chromosomes to bb or nearly bb + occurs under magnifying conditions at a low frequency in a single generation. These reversions occur too rapidly to be accounted for by single unequal sister chromatid exchanges and seem unlikely to be due to multiple sister strand exchanges within a given cell lineage. Analysis of several one-step revertants indicates that they are X-Y recombinant chromosomes which probably arise from X-Y recombination at bb. The addition of ribosomal genes from the Y chromosome to the bbl chromosome explains the more rapid reversion of the bbl chromosome than is permitted by single events of unequal sister chromatid exchange. Analysis of stepwise bbl magnified chromosomes, which were selected over a period of 4–9 magnifying generations, shows ribosomal gene patterns that are closely similar to each other. Similarity in rDNA pattern among stepwise magnified products of the same parental chromosome is consistent with reversion by a mechanism of unequal sister strand exchange. PMID:3095184
Identification of genes associated with low furanocoumarin content in grapefruit
USDA-ARS?s Scientific Manuscript database
Some furanocoumarins in grapefruit (Citrus paradisi) are associated with the so-called grapefruit juice effect. Previous phytochemical quantification and genetic analysis suggested that the synthesis of these furanocoumarins may be controlled by a single gene in the pathway. In this study, cDNA-ampl...
Ohashi, Takao; Nakakita, Shin-ichi; Sumiyoshi, Wataru; Yamada, Naotaka; Ikeda, Yuka; Tanaka, Naotaka; Takegawa, Kaoru
2011-03-01
In the fission yeast Schizosaccharomyces pombe, galactose (Gal) residues are transferred to N- and O-linked oligosaccharides of glycoproteins by galactosyltransferases in the lumen of the Golgi apparatus. In S. pombe, the major in vitro α1,2-galactosyltransferase activity has been purified, the gma12(+) gene has been cloned, and three α-galactosyltransferase genes (gmh1(+)-gmh3(+)) have also been partially characterized. In this study, we found three additional uncharacterized genes with homology to gmh1(+) (gmh4(+)-gmh6(+)) in the fission yeast genome sequence. All possible single disruption mutants and the septuple disruption strain were constructed and characterized. The electrophoretic mobility of acid phosphatase prepared from gma12Δ, gmh2Δ, gmh3Δ and gmh6Δ mutants was higher than that from wild type, indicating that Gma12p, Gmh2p, Gmh3p and Gmh6p are required for the galactosylation of N-linked oligosaccharides. High-performance liquid chromatography (HPLC) analysis of pyridylaminated O-linked oligosaccharides from each single mutant showed that Gma12p, Gmh2p and Gmh6p are involved in galactosylation of O-linked oligosaccharides. The septuple mutant exhibited similar drug and temperature sensitivity as a gms1Δ mutant that is incapable of galactosylation. Oligosaccharide structural analysis based on HPLC and methylation analysis revealed that the septuple mutant still contained oligosaccharides consisting of α1,3-linked Gal residues, indicating that an unknown α1,3-galactosyltransferase activity was still present in the septuple mutant.
Fan, Qianrui; Wang, Wenyu; Hao, Jingcan; He, Awen; Wen, Yan; Guo, Xiong; Wu, Cuiyan; Ning, Yujie; Wang, Xi; Wang, Sen; Zhang, Feng
2017-08-01
Neuroticism is a fundamental personality trait with significant genetic determinant. To identify novel susceptibility genes for neuroticism, we conducted an integrative analysis of genomic and transcriptomic data of genome wide association study (GWAS) and expression quantitative trait locus (eQTL) study. GWAS summary data was driven from published studies of neuroticism, totally involving 170,906 subjects. eQTL dataset containing 927,753 eQTLs were obtained from an eQTL meta-analysis of 5311 samples. Integrative analysis of GWAS and eQTL data was conducted by summary data-based Mendelian randomization (SMR) analysis software. To identify neuroticism associated gene sets, the SMR analysis results were further subjected to gene set enrichment analysis (GSEA). The gene set annotation dataset (containing 13,311 annotated gene sets) of GSEA Molecular Signatures Database was used. SMR single gene analysis identified 6 significant genes for neuroticism, including MSRA (p value=2.27×10 -10 ), MGC57346 (p value=6.92×10 -7 ), BLK (p value=1.01×10 -6 ), XKR6 (p value=1.11×10 -6 ), C17ORF69 (p value=1.12×10 -6 ) and KIAA1267 (p value=4.00×10 -6 ). Gene set enrichment analysis observed significant association for Chr8p23 gene set (false discovery rate=0.033). Our results provide novel clues for the genetic mechanism studies of neuroticism. Copyright © 2017. Published by Elsevier Inc.
Transcriptome Analysis of a Premature Leaf Senescence Mutant of Common Wheat (Triticum aestivum L.)
Xia, Chuan; Zhang, Lichao; Dong, Chunhao; Liu, Xu; Kong, Xiuying
2018-01-01
Leaf senescence is an important agronomic trait that affects both crop yield and quality. In this study, we characterized a premature leaf senescence mutant of wheat (Triticum aestivum L.) obtained by ethylmethane sulfonate (EMS) mutagenesis, named m68. Genetic analysis showed that the leaf senescence phenotype of m68 is controlled by a single recessive nuclear gene. We compared the transcriptome of wheat leaves between the wild type (WT) and the m68 mutant at four time points. Differentially expressed gene (DEG) analysis revealed many genes that were closely related to senescence genes. Gene Ontology (GO) enrichment analysis suggested that transcription factors and protein transport genes might function in the beginning of leaf senescence, while genes that were associated with chlorophyll and carbon metabolism might function in the later stage. Kyoto Encyclopedia of Genes and Genomes (KEGG) pathway analysis showed that the genes that are involved in plant hormone signal transduction were significantly enriched. Through expression pattern clustering of DEGs, we identified 1012 genes that were induced during senescence, and we found that the WRKY family and zinc finger transcription factors might be more important than other transcription factors in the early stage of leaf senescence. These results will not only support further gene cloning and functional analysis of m68, but also facilitate the study of leaf senescence in wheat. PMID:29534430
Genome-Wide Analysis of the NAC Gene Family in Physic Nut (Jatropha curcas L.)
Wu, Zhenying; Xu, Xueqin; Xiong, Wangdan; Wu, Pingzhi; Chen, Yaping; Li, Meiru; Wu, Guojiang; Jiang, Huawu
2015-01-01
The NAC proteins (NAM, ATAF1/2 and CUC2) are plant-specific transcriptional regulators that have a conserved NAM domain in the N-terminus. They are involved in various biological processes, including both biotic and abiotic stress responses. In the present study, a total of 100 NAC genes (JcNAC) were identified in physic nut (Jatropha curcas L.). Based on phylogenetic analysis and gene structures, 83 JcNAC genes were classified as members of, or proposed to be diverged from, 39 previously predicted orthologous groups (OGs) of NAC sequences. Physic nut has a single intron-containing NAC gene subfamily that has been lost in many plants. The JcNAC genes are non-randomly distributed across the 11 linkage groups of the physic nut genome, and appear to be preferentially retained duplicates that arose from both ancient and recent duplication events. Digital gene expression analysis indicates that some of the JcNAC genes have tissue-specific expression profiles (e.g. in leaves, roots, stem cortex or seeds), and 29 genes differentially respond to abiotic stresses (drought, salinity, phosphorus deficiency and nitrogen deficiency). Our results will be helpful for further functional analysis of the NAC genes in physic nut. PMID:26125188
The Essential Genome of Escherichia coli K-12
2018-01-01
ABSTRACT Transposon-directed insertion site sequencing (TraDIS) is a high-throughput method coupling transposon mutagenesis with short-fragment DNA sequencing. It is commonly used to identify essential genes. Single gene deletion libraries are considered the gold standard for identifying essential genes. Currently, the TraDIS method has not been benchmarked against such libraries, and therefore, it remains unclear whether the two methodologies are comparable. To address this, a high-density transposon library was constructed in Escherichia coli K-12. Essential genes predicted from sequencing of this library were compared to existing essential gene databases. To decrease false-positive identification of essential genes, statistical data analysis included corrections for both gene length and genome length. Through this analysis, new essential genes and genes previously incorrectly designated essential were identified. We show that manual analysis of TraDIS data reveals novel features that would not have been detected by statistical analysis alone. Examples include short essential regions within genes, orientation-dependent effects, and fine-resolution identification of genome and protein features. Recognition of these insertion profiles in transposon mutagenesis data sets will assist genome annotation of less well characterized genomes and provides new insights into bacterial physiology and biochemistry. PMID:29463657
Genome-Wide Analysis of the NAC Gene Family in Physic Nut (Jatropha curcas L.).
Wu, Zhenying; Xu, Xueqin; Xiong, Wangdan; Wu, Pingzhi; Chen, Yaping; Li, Meiru; Wu, Guojiang; Jiang, Huawu
2015-01-01
The NAC proteins (NAM, ATAF1/2 and CUC2) are plant-specific transcriptional regulators that have a conserved NAM domain in the N-terminus. They are involved in various biological processes, including both biotic and abiotic stress responses. In the present study, a total of 100 NAC genes (JcNAC) were identified in physic nut (Jatropha curcas L.). Based on phylogenetic analysis and gene structures, 83 JcNAC genes were classified as members of, or proposed to be diverged from, 39 previously predicted orthologous groups (OGs) of NAC sequences. Physic nut has a single intron-containing NAC gene subfamily that has been lost in many plants. The JcNAC genes are non-randomly distributed across the 11 linkage groups of the physic nut genome, and appear to be preferentially retained duplicates that arose from both ancient and recent duplication events. Digital gene expression analysis indicates that some of the JcNAC genes have tissue-specific expression profiles (e.g. in leaves, roots, stem cortex or seeds), and 29 genes differentially respond to abiotic stresses (drought, salinity, phosphorus deficiency and nitrogen deficiency). Our results will be helpful for further functional analysis of the NAC genes in physic nut.
USDA-ARS?s Scientific Manuscript database
Background: DNA methylation is influenced by diet and single nucleotide polymorphisms (SNPs), and methylation modulates gene expression. Objective: We aimed to explore whether the gene-by-diet interactions on blood lipids act through DNA methylation. Design: We selected 7 SNPs on the basis of predic...
Joseph, S; Schmidt, L M; Danquah, W B; Timper, P; Mekete, T
2017-02-01
To generate single spore lines of a population of bacterial parasite of root-knot nematode (RKN), Pasteuria penetrans, isolated from Florida and examine genotypic variation and virulence characteristics exist within the population. Six single spore lines (SSP), 16SSP, 17SSP, 18SSP, 25SSP, 26SSP and 30SSP were generated. Genetic variability was evaluated by comparing single-nucleotide polymorphisms (SNPs) in six protein-coding genes and the 16S rRNA gene. An average of one SNP was observed for every 69 bp in the 16S rRNA, whereas no SNPs were observed in the protein-coding sequences. Hierarchical cluster analysis of 16S rRNA sequences placed the clones into three distinct clades. Bio-efficacy analysis revealed significant heterogeneity in the level virulence and host specificity between the individual clones. The SNP markers developed to the 5' hypervariable region of the 16S rRNA gene may be useful in biotype differentiation within a population of P. penetrans. This study demonstrates an efficient method for generating single spore lines of P. penetrans and gives a deep insight into genetic heterogeneity and varying level of virulence exists within a population parasitizing a specific Meloidogyne sp. host. The results also suggest that the application of generalist spore lines in nematode management may achieve broad RKN control. © 2016 The Society for Applied Microbiology.
DEsingle for detecting three types of differential expression in single-cell RNA-seq data.
Miao, Zhun; Deng, Ke; Wang, Xiaowo; Zhang, Xuegong
2018-04-24
The excessive amount of zeros in single-cell RNA-seq data include "real" zeros due to the on-off nature of gene transcription in single cells and "dropout" zeros due to technical reasons. Existing differential expression (DE) analysis methods cannot distinguish these two types of zeros. We developed an R package DEsingle which employed Zero-Inflated Negative Binomial model to estimate the proportion of real and dropout zeros and to define and detect 3 types of DE genes in single-cell RNA-seq data with higher accuracy. The R package DEsingle is freely available at https://github.com/miaozhun/DEsingle and is under Bioconductor's consideration now. zhangxg@tsinghua.edu.cn. Supplementary data are available at Bioinformatics online.
Gomes, S L; Gober, J W; Shapiro, L
1990-01-01
Caulobacter crescentus has a single dnaK gene that is highly homologous to the hsp70 family of heat shock genes. Analysis of the cloned and sequenced dnaK gene has shown that the deduced amino acid sequence could encode a protein of 67.6 kilodaltons that is 68% identical to the DnaK protein of Escherichia coli and 49% identical to the Drosophila and human hsp70 protein family. A partial open reading frame 165 base pairs 3' to the end of dnaK encodes a peptide of 190 amino acids that is 59% identical to DnaJ of E. coli. Northern blot analysis revealed a single 4.0-kilobase mRNA homologous to the cloned fragment. Since the dnaK coding region is 1.89 kilobases, dnaK and dnaJ may be transcribed as a polycistronic message. S1 mapping and primer extension experiments showed that transcription initiated at two sites 5' to the dnaK coding sequence. A single start site of transcription was identified during heat shock at 42 degrees C, and the predicted promoter sequence conformed to the consensus heat shock promoters of E. coli. At normal growth temperature (30 degrees C), a different start site was identified 3' to the heat shock start site that conformed to the E. coli sigma 70 promoter consensus sequence. S1 protection assays and analysis of expression of the dnaK gene fused to the lux transcription reporter gene showed that expression of dnaK is temporally controlled under normal physiological conditions and that transcription occurs just before the initiation of DNA replication. Thus, in both human cells (I. K. L. Milarski and R. I. Morimoto, Proc. Natl. Acad. Sci. USA 83:9517-9521, 1986) and in a simple bacterium, the transcription of a hsp70 gene is temporally controlled as a function of the cell cycle under normal growth conditions. Images PMID:2345134
DOE Office of Scientific and Technical Information (OSTI.GOV)
Ts'o, P.O.P.
1990-09-01
The main objectives of this Program Project is to develop strategy and technology for the study of gene structure, organization and function in a multi-disciplinary, highly coordinated manner. In Project I, Molecular Cytology, the establishment of all instrumentation for the computerized microscopic imaging system (CMIS) has been completed with the software in place, including measurement of the third dimension (along the Z-axis). The technique is now at hand to measure single copy DNA in the nucleus, single copy mRNA in the cell, and finally, we are in the process of developing mathematical approaches for the analysis of the relative spatialmore » 3-D relationship among the chromosomes and the individual genes in the interphasal nucleus. Also, we have a sensitive and reliable method for measuring single-stranded DNA breaks which will be useful for the determination of damage to DNA caused by ionizing radiation. In Project II, the mapping of restriction fragments by 2-D enzymatic and electrophoretic analysis has been perfected for application. In Project III, a major finding is that the binding constant and effectiveness of antisense oligonucleotide analogues, Matagen, can be significantly improved by substituting 2{prime}-O-methylribos methylphosphonate backbones for the current 2{prime}-deoxyribomethylphosphonate backbones. 15 refs., 10 figs., 2 tabs.« less
Single cell genome analysis of an uncultured heterotrophic stramenopile
NASA Astrophysics Data System (ADS)
Roy, Rajat S.; Price, Dana C.; Schliep, Alexander; Cai, Guohong; Korobeynikov, Anton; Yoon, Hwan Su; Yang, Eun Chan; Bhattacharya, Debashish
2014-04-01
A broad swath of eukaryotic microbial biodiversity cannot be cultivated in the lab and is therefore inaccessible to conventional genome-wide comparative methods. One promising approach to study these lineages is single cell genomics (SCG), whereby an individual cell is captured from nature and genome data are produced from the amplified total DNA. Here we tested the efficacy of SCG to generate a draft genome assembly from a single sample, in this case a cell belonging to the broadly distributed MAST-4 uncultured marine stramenopiles. Using de novo gene prediction, we identified 6,996 protein-encoding genes in the MAST-4 genome. This genetic inventory was sufficient to place the cell within the ToL using multigene phylogenetics and provided preliminary insights into the complex evolutionary history of horizontal gene transfer (HGT) in the MAST-4 lineage.
Wu, Liang; Zhang, Xiaolong; Zhao, Zhikun; Wang, Ling; Li, Bo; Li, Guibo; Dean, Michael; Yu, Qichao; Wang, Yanhui; Lin, Xinxin; Rao, Weijian; Mei, Zhanlong; Li, Yang; Jiang, Runze; Yang, Huan; Li, Fuqiang; Xie, Guoyun; Xu, Liqin; Wu, Kui; Zhang, Jie; Chen, Jianghao; Wang, Ting; Kristiansen, Karsten; Zhang, Xiuqing; Li, Yingrui; Yang, Huanming; Wang, Jian; Hou, Yong; Xu, Xun
2015-01-01
Viral infection causes multiple forms of human cancer, and HPV infection is the primary factor in cervical carcinomas. Recent single-cell RNA-seq studies highlight the tumor heterogeneity present in most cancers, but virally induced tumors have not been studied. HeLa is a well characterized HPV+ cervical cancer cell line. We developed a new high throughput platform to prepare single-cell RNA on a nanoliter scale based on a customized microwell chip. Using this method, we successfully amplified full-length transcripts of 669 single HeLa S3 cells and 40 of them were randomly selected to perform single-cell RNA sequencing. Based on these data, we obtained a comprehensive understanding of the heterogeneity of HeLa S3 cells in gene expression, alternative splicing and fusions. Furthermore, we identified a high diversity of HPV-18 expression and splicing at the single-cell level. By co-expression analysis we identified 283 E6, E7 co-regulated genes, including CDC25, PCNA, PLK4, BUB1B and IRF1 known to interact with HPV viral proteins. Our results reveal the heterogeneity of a virus-infected cell line. It not only provides a transcriptome characterization of HeLa S3 cells at the single cell level, but is a demonstration of the power of single cell RNA-seq analysis of virally infected cells and cancers.
Exploring the loblolly pine (Pinus taeda L.) genome by BAC sequencing and Cot analysis.
Perera, Dinum; Magbanua, Zenaida V; Thummasuwan, Supaphan; Mukherjee, Dipaloke; Arick, Mark; Chouvarine, Philippe; Nairn, Campbell J; Schmutz, Jeremy; Grimwood, Jane; Dean, Jeffrey F D; Peterson, Daniel G
2018-07-15
Loblolly pine (LP; Pinus taeda L.) is an economically and ecologically important tree in the southeastern U.S. To advance understanding of the loblolly pine (LP; Pinus taeda L.) genome, we sequenced and analyzed 100 BAC clones and performed a Cot analysis. The Cot analysis indicates that the genome is composed of 57, 24, and 10% highly-repetitive, moderately-repetitive, and single/low-copy sequences, respectively (the remaining 9% of the genome is a combination of fold back and damaged DNA). Although single/low-copy DNA only accounts for 10% of the LP genome, the amount of single/low-copy DNA in LP is still 14 times the size of the Arabidopsis genome. Since gene numbers in LP are similar to those in Arabidopsis, much of the single/low-copy DNA of LP would appear to be composed of DNA that is both gene- and repeat-poor. Macroarrays prepared from a LP bacterial artificial chromosome (BAC) library were hybridized with probes designed from cell wall synthesis/wood development cDNAs, and 50 of the "targeted" clones were selected for further analysis. An additional 25 clones were selected because they contained few repeats, while 25 more clones were selected at random. The 100 BAC clones were Sanger sequenced and assembled. Of the targeted BACs, 80% contained all or part of the cDNA used to target them. One targeted BAC was found to contain fungal DNA and was eliminated from further analysis. Combinations of similarity-based and ab initio gene prediction approaches were utilized to identify and characterize potential coding regions in the 99 BACs containing LP DNA. From this analysis, we identified 154 gene models (GMs) representing both putative protein-coding genes and likely pseudogenes. Ten of the GMs (all of which were specifically targeted) had enough support to be classified as intact genes. Interestingly, the 154 GMs had statistically indistinguishable (α = 0.05) distributions in the targeted and random BAC clones (15.18 and 12.61 GM/Mb, respectively), whereas the low-repeat BACs contained significantly fewer GMs (7.08 GM/Mb). However, when GM length was considered, the targeted BACs had a significantly greater percentage of their length in GMs (3.26%) when compared to random (1.63%) and low-repeat (0.62%) BACs. The results of our study provide insight into LP evolution and inform ongoing efforts to produce a reference genome sequence for LP, while characterization of genes involved in cell wall production highlights carbon metabolism pathways that can be leveraged for increasing wood production. Copyright © 2018 The Authors. Published by Elsevier B.V. All rights reserved.
Lai, Y C; Fujikawa, T; Ando, T; Kitahara, G; Koiwa, M; Kubota, C; Miura, N
2017-06-01
Our aim was to identify a suitable microRNA housekeeping gene for real-time PCR analysis of bovine mastitis-related microRNA in milk. We identified , , and as housekeeping gene candidates on the basis of previous Solexa sequencing results. Threshold cycle (CT) values for , , and did not differ between milk from control cows and milk from mastitis-affected cows. NormFinder software identified as the most stable single housekeeping gene. We evaluated the suitability of the housekeeping gene candidates by using them to assess expression levels of the inflammation-related gene . Regardless of the housekeeping gene candidates used for normalization, relative expression levels of were significantly higher in mastitis-affected samples than in control samples. However, of all the housekeeping genes and gene combinations investigated, normalization with alone generated the difference in relative expression between mastitis-affected and control samples with the highest significance. These results suggest that is suitable for use as a housekeeping gene for analysis of bovine mastitis-related microRNA in milk.
Taka, Hitomi; Asano, Shin-ichiro; Matsuura, Yoshiharu; Bando, Hisanori
2015-01-01
To infect their hosts, DNA viruses must successfully initiate the expression of viral genes that control subsequent viral gene expression and manipulate the host environment. Viral genes that are immediately expressed upon infection play critical roles in the early infection process. In this study, we investigated the expression and regulation of five canonical regulatory immediate-early (IE) genes of Autographa californica multicapsid nucleopolyhedrovirus: ie0, ie1, ie2, me53, and pe38. A systematic transient gene-expression analysis revealed that these IE genes are generally transactivators, suggesting the existence of a highly interactive regulatory network. A genetic analysis using gene knockout viruses demonstrated that the expression of these IE genes was tolerant to the single deletions of activator IE genes in the early stage of infection. A network graph analysis on the regulatory relationships observed in the transient expression analysis suggested that the robustness of IE gene expression is due to the organization of the IE gene regulatory network and how each IE gene is activated. However, some regulatory relationships detected by the genetic analysis were contradictory to those observed in the transient expression analysis, especially for IE0-mediated regulation. Statistical modeling, combined with genetic analysis using knockout alleles for ie0 and ie1, showed that the repressor function of ie0 was due to the interaction between ie0 and ie1, not ie0 itself. Taken together, these systematic approaches provided insight into the topology and nature of the IE gene regulatory network. PMID:25816136
Kouvelis, Vassili N; Sialakouma, Aphrodite; Typas, Milton A
2008-07-01
The recent revision of Verticillium sect. Prostrata led to the introduction of the genus Lecanicillium, which comprises the majority of the entomopathogenic strains. Sixty-five strains previously classified as Verticillium lecanii or Verticillium sp. from different geographical regions and hosts were examined and their phylogenetic relationships were determined using sequences from three mitochondrial (mt) genes [the small rRNA subunit (rns), the NADH dehydrogenase subunits 1 (nad1) and 3 (nad3)] and the ITS region. In general, single gene phylogenetic trees differentiated and placed the strains examined in well-supported (by BS analysis) groups of L. lecanii, L. longisporum, L. muscarium, and L. nodulosum, although in some cases a few uncertainties still remained. nad1 was the most informative single gene in phylogenetic analyses and was also found to contain group I introns with putative open reading frames (ORFs) encoding for GIY-YIG endonucleases. The combined use of mt gene sequences resolved taxonomic uncertainties arisen from ITS analysis and, alone or in combination with ITS sequences, helped in placing uncharacterised Verticillium lecanii and Verticillium sp. firmly into Lecanicillium species. Combined gene data from all the mt genes and all the mt genes and the ITS region together, were very similar. Furthermore, a relaxed correlation with host specificity -- at least for Homoptera -- was indicated for the rns and the combined mt gene sequences. Thus, the usefulness of mt gene sequences as a convenient molecular tool in phylogenetic studies of entomopathogenic fungi was demonstrated.
Kang, Yun; McMillan, Ian; Norris, Michael H; Hoang, Tung T
2015-07-01
Until recently, transcriptome analyses of single cells have been confined to eukaryotes. The information obtained from single-cell transcripts can provide detailed insight into spatiotemporal gene expression, and it could be even more valuable if expanded to prokaryotic cells. Transcriptome analysis of single prokaryotic cells is a recently developed and powerful tool. Here we describe a procedure that allows amplification of the total transcript of a single prokaryotic cell for in-depth analysis. This is performed by using a laser-capture microdissection instrument for single-cell isolation, followed by reverse transcription via Moloney murine leukemia virus, degradation of chromosomal DNA with McrBC and DpnI restriction enzymes, single-stranded cDNA (ss-cDNA) ligation using T4 polynucleotide kinase and CircLigase, and polymerization of ss-cDNA to double-stranded cDNA (ds-cDNA) by Φ29 polymerase. This procedure takes ∼5 d, and sufficient amounts of ds-cDNA can be obtained from single-cell RNA template for further microarray analysis.
Opazo, Juan C; Lee, Alison P; Hoffmann, Federico G; Toloza-Villalobos, Jessica; Burmester, Thorsten; Venkatesh, Byrappa; Storz, Jay F
2015-07-01
Comparative analyses of vertebrate genomes continue to uncover a surprising diversity of genes in the globin gene superfamily, some of which have very restricted phyletic distributions despite their antiquity. Genomic analysis of the globin gene repertoire of cartilaginous fish (Chondrichthyes) should be especially informative about the duplicative origins and ancestral functions of vertebrate globins, as divergence between Chondrichthyes and bony vertebrates represents the most basal split within the jawed vertebrates. Here, we report a comparative genomic analysis of the vertebrate globin gene family that includes the complete globin gene repertoire of the elephant shark (Callorhinchus milii). Using genomic sequence data from representatives of all major vertebrate classes, integrated analyses of conserved synteny and phylogenetic relationships revealed that the last common ancestor of vertebrates possessed a repertoire of at least seven globin genes: single copies of androglobin and neuroglobin, four paralogous copies of globin X, and the single-copy progenitor of the entire set of vertebrate-specific globins. Combined with expression data, the genomic inventory of elephant shark globins yielded four especially surprising findings: 1) there is no trace of the neuroglobin gene (a highly conserved gene that is present in all other jawed vertebrates that have been examined to date), 2) myoglobin is highly expressed in heart, but not in skeletal muscle (reflecting a possible ancestral condition in vertebrates with single-circuit circulatory systems), 3) elephant shark possesses two highly divergent globin X paralogs, one of which is preferentially expressed in gonads, and 4) elephant shark possesses two structurally distinct α-globin paralogs, one of which is preferentially expressed in the brain. Expression profiles of elephant shark globin genes reveal distinct specializations of function relative to orthologs in bony vertebrates and suggest hypotheses about ancestral functions of vertebrate globins. © The Author 2015. Published by Oxford University Press on behalf of the Society for Molecular Biology and Evolution. All rights reserved. For permissions, please e-mail: journals.permissions@oup.com.
Genome-wide and gene-based association implicates FRMD6 in Alzheimer disease.
Hong, Mun-Gwan; Reynolds, Chandra A; Feldman, Adina L; Kallin, Mikael; Lambert, Jean-Charles; Amouyel, Philippe; Ingelsson, Erik; Pedersen, Nancy L; Prince, Jonathan A
2012-03-01
Genome-wide association studies (GWAS) that allow for allelic heterogeneity may facilitate the discovery of novel genes not detectable by models that require replication of a single variant site. One strategy to accomplish this is to focus on genes rather than markers as units of association, and so potentially capture a spectrum of causal alleles that differ across populations. Here, we conducted a GWAS of Alzheimer disease (AD) in 2,586 Swedes and performed gene-based meta-analysis with three additional studies from France, Canada, and the United States, in total encompassing 4,259 cases and 8,284 controls. Implementing a newly designed gene-based algorithm, we identified two loci apart from the region around APOE that achieved study-wide significance in combined samples, the strongest finding being for FRMD6 on chromosome 14q (P = 2.6 × 10(-14)) and a weaker signal for NARS2 that is immediately adjacent to GAB2 on chromosome 11q (P = 7.8 × 10(-9)). Ontology-based pathway analyses revealed significant enrichment of genes involved in glycosylation. Results suggest that gene-based approaches that accommodate allelic heterogeneity in GWAS can provide a complementary avenue for gene discovery and may help to explain a portion of the missing heritability not detectable with single nucleotide polymorphisms (SNPs) derived from marker-specific meta-analysis. © 2011 Wiley Periodicals, Inc.
Ito, Katsuhiko; Kidokoro, Kurako; Katsuma, Susumu; Sezutsu, Hideki; Uchino, Keiro; Kobayashi, Isao; Tamura, Toshiki; Yamamoto, Kimiko; Mita, Kazuei; Shimada, Toru; Kadono-Okuda, Keiko
2018-05-09
Bombyx mori densovirus type 1 (BmDV) is a pathogen that causes flacherie disease in the silkworm. The absolute nonsusceptibility to BmDV among certain silkworm strains is determined independently by two genes, nsd-1 and Nid-1. However, neither of these genes has been molecularly identified to date. Here, we isolated the nsd-1 gene by positional cloning and characterized the properties of its product, NSD-1. Sequence and biochemical analyses revealed that this gene encodes a Bombyx-specific mucin-like glycoprotein with a single transmembrane domain. The NSD-1 protein was specifically expressed in the larval midgut epithelium, the known infection site of BmDV. Sequence analysis of the nsd-1 gene from 13 resistant and 12 susceptible strains suggested that a specific arginine residue in the extracellular tail of the NSD-1 protein was common among susceptible strains. Germline transformation of the susceptible-type nsd-1 (with a single nucleotide substitution) conferred partial susceptibility to resistant larvae, indicating that the + nsd-1 gene is required for the susceptibility of B. mori larvae to BmDV and the susceptibility is solely a result of the substitution of a single amino acid with arginine. Taken together, our results provide striking evidence that a novel membrane-bound mucin-like protein functions as a cell-surface receptor for a densovirus.
Zhang, Huifa; Jenkins, Gareth; Zou, Yuan; Zhu, Zhi; Yang, Chaoyong James
2012-04-17
A microfluidic device for performing single copy, emulsion Reverse Transcription Polymerase Chain Reaction (RT-PCR) within agarose droplets is presented. A two-aqueous-inlet emulsion droplet generator was designed and fabricated to produce highly uniform monodisperse picoliter agarose emulsion droplets with RT-PCR reagents in carrier oil. Template RNA or cells were delivered from one inlet with RT-PCR reagents/cell lysis buffer delivered separately from the other. Efficient RNA/cell encapsulation and RT-PCR at the single copy level was achieved in agarose-in-oil droplets, which, after amplification, can be solidified into agarose beads for further analysis. A simple and efficient method to graft primer to the polymer matrix using 5'-acrydite primer was developed to ensure highly efficient trapping of RT-PCR products in agarose. High-throughput single RNA molecule/cell RT-PCR was demonstrated in stochastically diluted solutions. Our results indicate that single-molecule RT-PCR can be efficiently carried out in agarose matrix. Single-cell RT-PCR was successfully performed which showed a clear difference in gene expression level of EpCAM, a cancer biomarker gene, at the single-cell level between different types of cancer cells. This work clearly demonstrates for the first time, single-copy RT-PCR in agarose droplets. We believe this will open up new possibilities for viral RNA detection and single-cell transcription analysis.
Rong, E G; Yang, H; Zhang, Z W; Wang, Z P; Yan, X H; Li, H; Wang, N
2015-10-01
Methionine synthase (MTR) plays a crucial role in maintaining homeostasis of intracellular methionine, folate, and homocysteine, and its activity correlates with DNA methylation in many mammalian tissues. Our previous genomewide association study identified that 1 SNP located in the gene was associated with several wool production and quality traits in Chinese Merino. To confirm the potential involvement of the gene in sheep wool production and quality traits, we performed sheep tissue expression profiling, SNP detection, and association analysis with sheep wool production and quality traits. The semiquantitative reverse transcription PCR analysis showed that the gene was differentially expressed in skin from Merino and Kazak sheep. The sequencing analysis identified a total of 13 SNP in the gene from Chinese Merino sheep. Comparison of the allele frequencies revealed that these 13 identified SNP were significantly different among the 6 tested Chinese Merino strains ( < 0.001). Linkage disequilibrium analysis showed that SNP 3 to 11 were strongly linked in a single haplotype block in the tested population. Association analysis showed that SNP 2 to 11 were significantly associated with the average wool fiber diameter and the fineness SD and that SNP 4 to 11 were significantly associated with the CV of fiber diameter trait ( < 0.05). Single nucleotide polymorphism 2 and SNP 5 to 12 were weakly associated with wool crimp. Similarly, the haplotypes derived from these 13 identified SNP were also significantly associated with the average wool fiber diameter, fineness SD, and the CV of fiber diameter ( < 0.05). Our results suggest that is a candidate gene for sheep wool production and quality traits, and the identified SNP might be used in sheep breeding.
Association of HS6ST3 gene polymorphisms with obesity and triglycerides: gene x gender interaction.
Wang, Ke-Sheng; Wang, Liang; Liu, Xuefeng; Zeng, Min
2013-12-01
The heparan sulfate 6-O-sulfotransferase 3 (HS6ST3) gene is involved in heparan sulphate and heparin metabolism, and has been reported to be associated with diabetic retinopathy in type 2 diabetes.We hypothesized that HS6ST3 gene polymorphisms might play an important role in obesity and related phenotypes (such as triglycerides). We examined genetic associations of 117 single-nucleotide polymorphisms (SNPs) within the HS6ST3 gene with obesity and triglycerides using two Caucasian samples: the Marshfield sample (1442 obesity cases and 2122 controls), and the Health aging and body composition (Health ABC) sample (305 cases and 1336 controls). Logistic regression analysis of obesity as a binary trait and linear regression analysis of triglycerides as a continuous trait, adjusted for age and sex, were performed using PLINK. Single marker analysis showed that six SNPs in the Marshfield sample and one SNP in the Health ABC sample were associated with obesity (P < 0.05). SNP rs535812 revealed a stronger association with obesity in meta-analysis of these two samples (P = 0.0105). The T-A haplotype from rs878950 and rs9525149 revealed significant association with obesity in the Marshfield sample (P = 0.012). Moreover, nine SNPs showed associations with triglycerides in the Marshfield sample (P < 0.05) and the best signal was rs1927796 (P = 0.00858). In addition, rs7331762 showed a strong gene x gender interaction (P = 0.00956) for obesity while rs1927796 showed a strong gene x gender interaction (P = 0.000625) for triglycerides in the Marshfield sample. These findings contribute new insights into the pathogenesis of obesity and triglycerides and demonstrate the importance of gender differences in the aetiology.
A Fast Multiple-Kernel Method With Applications to Detect Gene-Environment Interaction.
Marceau, Rachel; Lu, Wenbin; Holloway, Shannon; Sale, Michèle M; Worrall, Bradford B; Williams, Stephen R; Hsu, Fang-Chi; Tzeng, Jung-Ying
2015-09-01
Kernel machine (KM) models are a powerful tool for exploring associations between sets of genetic variants and complex traits. Although most KM methods use a single kernel function to assess the marginal effect of a variable set, KM analyses involving multiple kernels have become increasingly popular. Multikernel analysis allows researchers to study more complex problems, such as assessing gene-gene or gene-environment interactions, incorporating variance-component based methods for population substructure into rare-variant association testing, and assessing the conditional effects of a variable set adjusting for other variable sets. The KM framework is robust, powerful, and provides efficient dimension reduction for multifactor analyses, but requires the estimation of high dimensional nuisance parameters. Traditional estimation techniques, including regularization and the "expectation-maximization (EM)" algorithm, have a large computational cost and are not scalable to large sample sizes needed for rare variant analysis. Therefore, under the context of gene-environment interaction, we propose a computationally efficient and statistically rigorous "fastKM" algorithm for multikernel analysis that is based on a low-rank approximation to the nuisance effect kernel matrices. Our algorithm is applicable to various trait types (e.g., continuous, binary, and survival traits) and can be implemented using any existing single-kernel analysis software. Through extensive simulation studies, we show that our algorithm has similar performance to an EM-based KM approach for quantitative traits while running much faster. We also apply our method to the Vitamin Intervention for Stroke Prevention (VISP) clinical trial, examining gene-by-vitamin effects on recurrent stroke risk and gene-by-age effects on change in homocysteine level. © 2015 WILEY PERIODICALS, INC.
Jheng, Cheng-Fong; Chen, Tien-Chih; Lin, Jhong-Yi; Chen, Ting-Chieh; Wu, Wen-Luan; Chang, Ching-Chun
2012-07-01
The chloroplast genome of Phalaenopsis equestris was determined and compared to those of Phalaenopsis aphrodite and Oncidium Gower Ramsey in Orchidaceae. The chloroplast genome of P. equestris is 148,959 bp, and a pair of inverted repeats (25,846 bp) separates the genome into large single-copy (85,967 bp) and small single-copy (11,300 bp) regions. The genome encodes 109 genes, including 4 rRNA, 30 tRNA and 75 protein-coding genes, but loses four ndh genes (ndhA, E, F and H) and seven other ndh genes are pseudogenes. The rate of inter-species variation between the two moth orchids was 0.74% (1107 sites) for single nucleotide substitution and 0.24% for insertions (161 sites; 1388 bp) and deletions (189 sites; 1393 bp). The IR regions have a lower rate of nucleotide substitution (3.5-5.8-fold) and indels (4.3-7.1-fold) than single-copy regions. The intergenic spacers are the most divergent, and based on the length variation of the three intergenic spacers, 11 native Phalaenopsis orchids could be successfully distinguished. The coding genes, IR junction and RNA editing sites are relatively more conserved between the two moth orchids than between those of Phalaenopsis and Oncidium spp. Copyright © 2012 Elsevier Ireland Ltd. All rights reserved.
Larson, Nicholas B; McDonnell, Shannon; Cannon Albright, Lisa; Teerlink, Craig; Stanford, Janet; Ostrander, Elaine A; Isaacs, William B; Xu, Jianfeng; Cooney, Kathleen A; Lange, Ethan; Schleutker, Johanna; Carpten, John D; Powell, Isaac; Bailey-Wilson, Joan E; Cussenot, Olivier; Cancel-Tassin, Geraldine; Giles, Graham G; MacInnis, Robert J; Maier, Christiane; Whittemore, Alice S; Hsieh, Chih-Lin; Wiklund, Fredrik; Catalona, William J; Foulkes, William; Mandal, Diptasri; Eeles, Rosalind; Kote-Jarai, Zsofia; Ackerman, Michael J; Olson, Timothy M; Klein, Christopher J; Thibodeau, Stephen N; Schaid, Daniel J
2017-05-01
Next-generation sequencing technologies have afforded unprecedented characterization of low-frequency and rare genetic variation. Due to low power for single-variant testing, aggregative methods are commonly used to combine observed rare variation within a single gene. Causal variation may also aggregate across multiple genes within relevant biomolecular pathways. Kernel-machine regression and adaptive testing methods for aggregative rare-variant association testing have been demonstrated to be powerful approaches for pathway-level analysis, although these methods tend to be computationally intensive at high-variant dimensionality and require access to complete data. An additional analytical issue in scans of large pathway definition sets is multiple testing correction. Gene set definitions may exhibit substantial genic overlap, and the impact of the resultant correlation in test statistics on Type I error rate control for large agnostic gene set scans has not been fully explored. Herein, we first outline a statistical strategy for aggregative rare-variant analysis using component gene-level linear kernel score test summary statistics as well as derive simple estimators of the effective number of tests for family-wise error rate control. We then conduct extensive simulation studies to characterize the behavior of our approach relative to direct application of kernel and adaptive methods under a variety of conditions. We also apply our method to two case-control studies, respectively, evaluating rare variation in hereditary prostate cancer and schizophrenia. Finally, we provide open-source R code for public use to facilitate easy application of our methods to existing rare-variant analysis results. © 2017 WILEY PERIODICALS, INC.
Brauburger, Kristina; Boehmann, Yannik; Tsuda, Yoshimi; Hoenen, Thomas; Olejnik, Judith; Schümann, Michael; Ebihara, Hideki
2014-01-01
ABSTRACT Ebola virus (EBOV) belongs to the group of nonsegmented negative-sense RNA viruses. The seven EBOV genes are separated by variable gene borders, including short (4- or 5-nucleotide) intergenic regions (IRs), a single long (144-nucleotide) IR, and gene overlaps, where the neighboring gene end and start signals share five conserved nucleotides. The unique structure of the gene overlaps and the presence of a single long IR are conserved among all filoviruses. Here, we sought to determine the impact of the EBOV gene borders during viral transcription. We show that readthrough mRNA synthesis occurs in EBOV-infected cells irrespective of the structure of the gene border, indicating that the gene overlaps do not promote recognition of the gene end signal. However, two consecutive gene end signals at the VP24 gene might improve termination at the VP24-L gene border, ensuring efficient L gene expression. We further demonstrate that the long IR is not essential for but regulates transcription reinitiation in a length-dependent but sequence-independent manner. Mutational analysis of bicistronic minigenomes and recombinant EBOVs showed no direct correlation between IR length and reinitiation rates but demonstrated that specific IR lengths not found naturally in filoviruses profoundly inhibit downstream gene expression. Intriguingly, although truncation of the 144-nucleotide-long IR to 5 nucleotides did not substantially affect EBOV transcription, it led to a significant reduction of viral growth. IMPORTANCE Our current understanding of EBOV transcription regulation is limited due to the requirement for high-containment conditions to study this highly pathogenic virus. EBOV is thought to share many mechanistic features with well-analyzed prototype nonsegmented negative-sense RNA viruses. A single polymerase entry site at the 3′ end of the genome determines that transcription of the genes is mainly controlled by gene order and cis-acting signals found at the gene borders. Here, we examined the regulatory role of the structurally unique EBOV gene borders during viral transcription. Our data suggest that transcriptional regulation in EBOV is highly complex and differs from that in prototype viruses and further the understanding of this most fundamental process in the filovirus replication cycle. Moreover, our results with recombinant EBOVs suggest a novel role of the long IR found in all filovirus genomes during the viral replication cycle. PMID:25142600
Brauburger, Kristina; Boehmann, Yannik; Tsuda, Yoshimi; Hoenen, Thomas; Olejnik, Judith; Schümann, Michael; Ebihara, Hideki; Mühlberger, Elke
2014-11-01
Ebola virus (EBOV) belongs to the group of nonsegmented negative-sense RNA viruses. The seven EBOV genes are separated by variable gene borders, including short (4- or 5-nucleotide) intergenic regions (IRs), a single long (144-nucleotide) IR, and gene overlaps, where the neighboring gene end and start signals share five conserved nucleotides. The unique structure of the gene overlaps and the presence of a single long IR are conserved among all filoviruses. Here, we sought to determine the impact of the EBOV gene borders during viral transcription. We show that readthrough mRNA synthesis occurs in EBOV-infected cells irrespective of the structure of the gene border, indicating that the gene overlaps do not promote recognition of the gene end signal. However, two consecutive gene end signals at the VP24 gene might improve termination at the VP24-L gene border, ensuring efficient L gene expression. We further demonstrate that the long IR is not essential for but regulates transcription reinitiation in a length-dependent but sequence-independent manner. Mutational analysis of bicistronic minigenomes and recombinant EBOVs showed no direct correlation between IR length and reinitiation rates but demonstrated that specific IR lengths not found naturally in filoviruses profoundly inhibit downstream gene expression. Intriguingly, although truncation of the 144-nucleotide-long IR to 5 nucleotides did not substantially affect EBOV transcription, it led to a significant reduction of viral growth. Our current understanding of EBOV transcription regulation is limited due to the requirement for high-containment conditions to study this highly pathogenic virus. EBOV is thought to share many mechanistic features with well-analyzed prototype nonsegmented negative-sense RNA viruses. A single polymerase entry site at the 3' end of the genome determines that transcription of the genes is mainly controlled by gene order and cis-acting signals found at the gene borders. Here, we examined the regulatory role of the structurally unique EBOV gene borders during viral transcription. Our data suggest that transcriptional regulation in EBOV is highly complex and differs from that in prototype viruses and further the understanding of this most fundamental process in the filovirus replication cycle. Moreover, our results with recombinant EBOVs suggest a novel role of the long IR found in all filovirus genomes during the viral replication cycle. Copyright © 2014, American Society for Microbiology. All Rights Reserved.
Pena, S D; Barreto, G; Vago, A R; De Marco, L; Reinach, F C; Dias Neto, E; Simpson, A J
1994-01-01
Low-stringency single specific primer PCR (LSSP-PCR) is an extremely simple PCR-based technique that detects single or multiple mutations in gene-sized DNA fragments. A purified DNA fragment is subjected to PCR using high concentrations of a single specific oligonucleotide primer, large amounts of Taq polymerase, and a very low annealing temperature. Under these conditions the primer hybridizes specifically to its complementary region and nonspecifically to multiple sites within the fragment, in a sequence-dependent manner, producing a heterogeneous set of reaction products resolvable by electrophoresis. The complex banding pattern obtained is significantly altered by even a single-base change and thus constitutes a unique "gene signature." Therefore LSSP-PCR will have almost unlimited application in all fields of genetics and molecular medicine where rapid and sensitive detection of mutations and sequence variations is important. The usefulness of LSSP-PCR is illustrated by applications in the study of mutants of smooth muscle myosin light chain, analysis of a family with X-linked nephrogenic diabetes insipidus, and identity testing using human mitochondrial DNA. Images PMID:8127912
Droplet barcoding for single-cell transcriptomics applied to embryonic stem cells.
Klein, Allon M; Mazutis, Linas; Akartuna, Ilke; Tallapragada, Naren; Veres, Adrian; Li, Victor; Peshkin, Leonid; Weitz, David A; Kirschner, Marc W
2015-05-21
It has long been the dream of biologists to map gene expression at the single-cell level. With such data one might track heterogeneous cell sub-populations, and infer regulatory relationships between genes and pathways. Recently, RNA sequencing has achieved single-cell resolution. What is limiting is an effective way to routinely isolate and process large numbers of individual cells for quantitative in-depth sequencing. We have developed a high-throughput droplet-microfluidic approach for barcoding the RNA from thousands of individual cells for subsequent analysis by next-generation sequencing. The method shows a surprisingly low noise profile and is readily adaptable to other sequencing-based assays. We analyzed mouse embryonic stem cells, revealing in detail the population structure and the heterogeneous onset of differentiation after leukemia inhibitory factor (LIF) withdrawal. The reproducibility of these high-throughput single-cell data allowed us to deconstruct cell populations and infer gene expression relationships. VIDEO ABSTRACT. Copyright © 2015 Elsevier Inc. All rights reserved.
Mustafa, Saima; Fatima, Hira; Fatima, Sadia; Khosa, Tafheem; Akbar, Atif; Shaikh, Rehan Sadiq; Iqbal, Furhan
2018-01-01
To find out a correlation between the single nucleotide polymorphisms in cluster of differentiation 28 and cluster of differentiation 40 genes with Graves' disease, if any. This case-control study was conducted at the Multan Institute of Nuclear Medicine and Radiotherapy, Multan, Pakistan, and comprised blood samples of Graves' disease patients and controls. Various risk factors were also correlated either with the genotype at each single-nucleotide polymorphism or with various combinations of genotypes studied during present investigation. Of the 160 samples, there were 80(50%) each from patients and controls. Risk factor analysis revealed that gender (p=0.008), marital status (p<0.001), education (p<0.001), smoking (p<0.001), tri-iodothyronine (P <0.001), thyroxin (p<0.001) and thyroid-stimulating hormone (p<0.000) levels in blood were associated with Graves' disease. Both single-nucleotide polymorphisms in both genes were not associated with Graves' disease, either individually or in any combined form.
Bao, Weier; Greenwold, Matthew J; Sawyer, Roger H
2017-11-01
Gene co-expression network analysis has been a research method widely used in systematically exploring gene function and interaction. Using the Weighted Gene Co-expression Network Analysis (WGCNA) approach to construct a gene co-expression network using data from a customized 44K microarray transcriptome of chicken epidermal embryogenesis, we have identified two distinct modules that are highly correlated with scale or feather development traits. Signaling pathways related to feather development were enriched in the traditional KEGG pathway analysis and functional terms relating specifically to embryonic epidermal development were also enriched in the Gene Ontology analysis. Significant enrichment annotations were discovered from customized enrichment tools such as Modular Single-Set Enrichment Test (MSET) and Medical Subject Headings (MeSH). Hub genes in both trait-correlated modules showed strong specific functional enrichment toward epidermal development. Also, regulatory elements, such as transcription factors and miRNAs, were targeted in the significant enrichment result. This work highlights the advantage of this methodology for functional prediction of genes not previously associated with scale- and feather trait-related modules.
Chen, Mengqiang; Xu, Mengyun; Xiao, Yao; Cui, Dandan; Qin, Yongqiang; Wu, Jiaqi; Wang, Wenyi; Wang, Guoping
2018-01-01
Anthocyanins are the main pigments in flowers and fruits. These pigments are responsible for the red, red-purple, violet, and purple color in plants, and act as insect and animal attractants. In this study, phenotypic analysis of the purple flower color in eggplant indicated that the flower color is controlled by a single dominant gene, FAS. Using an F2 mapping population derived from a cross between purple-flowered ‘Blacknite’ and white-flowered ‘Small Round’, Flower Anthocyanidin Synthase (FAS) was fine mapped to an approximately 165.6-kb region between InDel marker Indel8-11 and Cleaved Amplified Polymorphic Sequences (CAPS) marker Efc8-32 on Chromosome 8. On the basis of bioinformatic analysis, 29 genes were subsequently located in the FAS target region, among which were two potential Anthocyanidin Synthase (ANS) gene candidates. Allelic sequence comparison results showed that one ANS gene (Sme2.5_01638.1_g00003.1) was conserved in promoter and coding sequences without any nucleotide change between parents, whereas four single-nucleotide polymorphisms were detected in another ANS gene (Sme2.5_01638.1_g00005.1). Crucially, a single base pair deletion at site 438 resulted in premature termination of FAS, leading to the loss of anthocyanin accumulation. In addition, FAS displayed strong expression in purple flowers compared with white flowers and other tissues. Collectively, our results indicate that Sme2.5_01638.1_g00005.1 is a good candidate gene for FAS, which controls anthocyanidin synthase in eggplant flowers. The present study provides information for further potential facilitate genetic engineering for improvement of anthocyanin levels in plants. PMID:29522465
Identification of innate lymphoid cells in single-cell RNA-Seq data.
Suffiotti, Madeleine; Carmona, Santiago J; Jandus, Camilla; Gfeller, David
2017-07-01
Innate lymphoid cells (ILCs) consist of natural killer (NK) cells and non-cytotoxic ILCs that are broadly classified into ILC1, ILC2, and ILC3 subtypes. These cells recently emerged as important early effectors of innate immunity for their roles in tissue homeostasis and inflammation. Over the last few years, ILCs have been extensively studied in mouse and human at the functional and molecular level, including gene expression profiling. However, sorting ILCs with flow cytometry for gene expression analysis is a delicate and time-consuming process. Here we propose and validate a novel framework for studying ILCs at the transcriptomic level using single-cell RNA-Seq data. Our approach combines unsupervised clustering and a new cell type classifier trained on mouse ILC gene expression data. We show that this approach can accurately identify different ILCs, especially ILC2 cells, in human lymphocyte single-cell RNA-Seq data. Our new model relies only on genes conserved across vertebrates, thereby making it in principle applicable in any vertebrate species. Considering the rapid increase in throughput of single-cell RNA-Seq technology, our work provides a computational framework for studying ILC2 cells in single-cell transcriptomic data and may help exploring their conservation in distant vertebrate species.
A microfluidic approach to parallelized transcriptional profiling of single cells.
Sun, Hao; Olsen, Timothy; Zhu, Jing; Tao, Jianguo; Ponnaiya, Brian; Amundson, Sally A; Brenner, David J; Lin, Qiao
2015-12-01
The ability to correlate single-cell genetic information with cellular phenotypes is of great importance to biology and medicine, as it holds the potential to gain insight into disease pathways that is unavailable from ensemble measurements. We present a microfluidic approach to parallelized, rapid, quantitative analysis of messenger RNA from single cells via RT-qPCR. The approach leverages an array of single-cell RT-qPCR analysis units formed by a set of parallel microchannels concurrently controlled by elastomeric pneumatic valves, thereby enabling parallelized handling and processing of single cells in a drastically simplified operation procedure using a relatively small number of microvalves. All steps for single-cell RT-qPCR, including cell isolation and immobilization, cell lysis, mRNA purification, reverse transcription and qPCR, are integrated on a single chip, eliminating the need for off-chip manual cell and reagent transfer and qPCR amplification as commonly used in existing approaches. Additionally, the approach incorporates optically transparent microfluidic components to allow monitoring of single-cell trapping without the need for molecular labeling that can potentially alter the targeted gene expression and utilizes a polycarbonate film as a barrier against evaporation to minimize the loss of reagents at elevated temperatures during the analysis. We demonstrate the utility of the approach by the transcriptional profiling for the induction of the cyclin-dependent kinase inhibitor 1a and the glyceraldehyde 3-phosphate dehydrogenase in single cells from the MCF-7 breast cancer cell line. Furthermore, the methyl methanesulfonate is employed to allow measurement of the expression of the genes in individual cells responding to a genotoxic stress.
Dong, Chongmei; Vincent, Kate; Sharp, Peter
2009-12-04
TILLING (Targeting Induced Local Lesions IN Genomes) is a powerful tool for reverse genetics, combining traditional chemical mutagenesis with high-throughput PCR-based mutation detection to discover induced mutations that alter protein function. The most popular mutation detection method for TILLING is a mismatch cleavage assay using the endonuclease CelI. For this method, locus-specific PCR is essential. Most wheat genes are present as three similar sequences with high homology in exons and low homology in introns. Locus-specific primers can usually be designed in introns. However, it is sometimes difficult to design locus-specific PCR primers in a conserved region with high homology among the three homoeologous genes, or in a gene lacking introns, or if information on introns is not available. Here we describe a mutation detection method which combines High Resolution Melting (HRM) analysis of mixed PCR amplicons containing three homoeologous gene fragments and sequence analysis using Mutation Surveyor software, aimed at simultaneous detection of mutations in three homoeologous genes. We demonstrate that High Resolution Melting (HRM) analysis can be used in mutation scans in mixed PCR amplicons containing three homoeologous gene fragments. Combining HRM scanning with sequence analysis using Mutation Surveyor is sensitive enough to detect a single nucleotide mutation in the heterozygous state in a mixed PCR amplicon containing three homoeoloci. The method was tested and validated in an EMS (ethylmethane sulfonate)-treated wheat TILLING population, screening mutations in the carboxyl terminal domain of the Starch Synthase II (SSII) gene. Selected identified mutations of interest can be further analysed by cloning to confirm the mutation and determine the genomic origin of the mutation. Polyploidy is common in plants. Conserved regions of a gene often represent functional domains and have high sequence similarity between homoeologous loci. The method described here is a useful alternative to locus-specific based methods for screening mutations in conserved functional domains of homoeologous genes. This method can also be used for SNP (single nucleotide polymorphism) marker development and eco-TILLING in polyploid species.
Raynal, Caroline; Ciccolini, Joseph; Mercier, Cédric; Boyer, Jean-Christophe; Polge, Anne; Lallemant, Benjamin; Mouzat, Kévin; Lumbroso, Serge; Brouillet, Jean-Paul; Evrard, Alexandre
2010-02-01
Gemcitabine (2',2'-difluorodeoxycytidine) is a major antimetabolite cytotoxic drug with a wide spectrum of activity against solid tumors. Hepatic elimination of gemcitabine depends on a catabolic pathway through a deamination step driven by the enzyme cytidine deaminase (CDA). Severe hematologic toxicity to gemcitabine was reported in patients harboring genetic polymorphisms in CDA gene. High-resolution melting (HRM) analysis of polymerase chain reaction amplicon emerges today as a powerful technique for both genotyping and gene scanning strategies. In this study, 46 DNA samples from gemcitabine-treated patients were subjected to HRM analysis on a LightCycler 480 platform. Residual serum CDA activity was assayed as a surrogate marker for the overall functionality of this enzyme. Genotyping of three well-described single nucleotide polymorphisms in coding region (c.79A>C, c.208G>A and c.435C>T) was successfully achieved by HRM analysis of small polymerase chain reaction fragments, whereas unknown single nucleotide polymorphisms were searched by a gene scanning strategy with longer amplicons (up to 622 bp). The gene scanning strategy allowed us to find a new intronic mutation c.246+37G>A in a female patient displaying marked CDA deficiency and who had an extreme toxic reaction with a fatal outcome to gemcitabine treatment. Our work demonstrates that HRM-based methods, owing to their simplicity, reliability, and speed, are useful tools for diagnosis of CDA deficiency and could be of interest for personalized medicine.
Stice, Shaun P; Stumpf, Spencer D; Gitaitis, Ron D; Kvitko, Brian H; Dutta, Bhabesh
2018-01-01
Pantoea ananatis is a member of the family Enterobacteriaceae and an enigmatic plant pathogen with a broad host range. Although P. ananatis strains can be aggressive on onion causing foliar necrosis and onion center rot, previous genomic analysis has shown that P. ananatis lacks the primary virulence secretion systems associated with other plant pathogens. We assessed a collection of fifty P. ananatis strains collected from Georgia over three decades to determine genetic factors that correlated with onion pathogenic potential. Previous genetic analysis studies have compared strains isolated from different hosts with varying diseases potential and isolation sources. Strains varied greatly in their pathogenic potential and aggressiveness on different cultivated Allium species like onion, leek, shallot, and chive. Using multi-locus sequence analysis (MLSA) and repetitive extragenic palindrome repeat (rep)-PCR techniques, we did not observe any correlation between onion pathogenic potential and genetic diversity among strains. Whole genome sequencing and pan-genomic analysis of a sub-set of 10 strains aided in the identification of a novel series of genetic regions, likely plasmid borne, and correlating with onion pathogenicity observed on single contigs of the genetic assemblies. We named these loci Onion Virulence Regions (OVR) A-D. The OVR loci contain genes involved in redox regulation as well as pectate lyase and rhamnogalacturonase genes. Previous studies have not identified distinct genetic loci or plasmids correlating with onion foliar pathogenicity or pathogenicity on a single host pathosystem. The lack of focus on a single host system for this phytopathgenic disease necessitates the pan-genomic analysis performed in this study.
Stice, Shaun P.; Stumpf, Spencer D.; Gitaitis, Ron D.; Kvitko, Brian H.; Dutta, Bhabesh
2018-01-01
Pantoea ananatis is a member of the family Enterobacteriaceae and an enigmatic plant pathogen with a broad host range. Although P. ananatis strains can be aggressive on onion causing foliar necrosis and onion center rot, previous genomic analysis has shown that P. ananatis lacks the primary virulence secretion systems associated with other plant pathogens. We assessed a collection of fifty P. ananatis strains collected from Georgia over three decades to determine genetic factors that correlated with onion pathogenic potential. Previous genetic analysis studies have compared strains isolated from different hosts with varying diseases potential and isolation sources. Strains varied greatly in their pathogenic potential and aggressiveness on different cultivated Allium species like onion, leek, shallot, and chive. Using multi-locus sequence analysis (MLSA) and repetitive extragenic palindrome repeat (rep)-PCR techniques, we did not observe any correlation between onion pathogenic potential and genetic diversity among strains. Whole genome sequencing and pan-genomic analysis of a sub-set of 10 strains aided in the identification of a novel series of genetic regions, likely plasmid borne, and correlating with onion pathogenicity observed on single contigs of the genetic assemblies. We named these loci Onion Virulence Regions (OVR) A-D. The OVR loci contain genes involved in redox regulation as well as pectate lyase and rhamnogalacturonase genes. Previous studies have not identified distinct genetic loci or plasmids correlating with onion foliar pathogenicity or pathogenicity on a single host pathosystem. The lack of focus on a single host system for this phytopathgenic disease necessitates the pan-genomic analysis performed in this study. PMID:29491851
Hammer, Alexandra; Steiner, Sabine
2013-09-01
Beyond pharmacological, endovascular and surgical treatment strategies for peripheral arterial disease (PAD), therapeutic angiogenesis has been advocated to relieve symptoms and support limb salvage, in particular in patients with critical limb ischemia. We aimed to systematically review randomized controlled trials (RCTs) of gene therapy in PAD. A systematic search of electronic databases was performed to identify RCTs studying local administration of pro-angiogenic growth factors (VEGF, FGF, HGF, Del-1, HIF-1alpha) using plasmid or viral gene transfer by intra-arterial or intra-muscular injections. Outcomes of interest comprised all-cause mortality, amputations, ulcer healing, walking distance and ankle-brachial index. If feasible, standard meta-analysis should be performed with subgroup analysis for claudicants and patients with critical limb ischemia (CLI). The systematic search yielded 12 RCTs for analysis from 1163 citations. In total, 1494 patients (29 % females) were included with the majority suffering from CLI (64 %). Various endpoints were improved by single studies, but none by a majority of studies. Meta-analysis showed neither a significant benefit nor harm for gene therapy when synthesizing data for all-cause mortality (OR 0.88, 95 % CI 0.62 - 1.26) amputations (OR 0.64, 95 % CI 0.31 - 1.31) or ulcer healing (OR 1.79, 95 % CI 0.8 - 4.01). No differences were seen between patients with intermittent claudication or CLI. Despite promising results in single studies, no clear benefit could be identified for gene therapy in PAD patients, irrespective of disease severity.
Liu, Yunjun; Cao, Gaoyi; Chen, Rongrong; Zhang, Shengxue; Ren, Yuan; Lu, Wei; Wang, Jianhua; Wang, Guoying
2015-08-01
5-Enolpyruvylshikimate-3-phosphate synthase (EPSPS) and glyphosate N-acetyltransferase (GAT) can detoxify glyphosate by alleviating the suppression of shikimate pathway. In this study, we obtained transgenic tobacco plants overexpressing AM79 aroA, GAT, and both of them, respectively, to evaluate whether overexpression of both genes could confer transgenic plants with higher glyphosate resistance. The transgenic plants harboring GAT or AM79 aroA, respectively, showed good glyphosate resistance. As expected, the hybrid plants containing both GAT and AM79 aroA exhibited improved glyphosate resistance than the transgenic plants overexpressing only a single gene. When grown on media with high concentration of glyphosate, seedlings containing a single gene were severely inhibited, whereas plants expressing both genes were affected less. When transgenic plants grown in the greenhouse were sprayed with glyphosate, less damage was observed for the plants containing both genes. Metabolomics analysis showed that transgenic plants containing two genes could maintain the metabolism balance better than those containing one gene after glyphosate treatment. Glyphosate treatment did not lead to a huge increase of shikimate contents of tobacco leaves in transgenic plants overexpressing two genes, whereas significant increase of shikimate contents in transgenic plants containing only a single gene was observed. These results demonstrated that pyramiding both aroA and GAT in transgenic plants can enhance glyphosate resistance, and this strategy can be used for the development of transgenic glyphosate-resistant crops.
Improved score statistics for meta-analysis in single-variant and gene-level association studies.
Yang, Jingjing; Chen, Sai; Abecasis, Gonçalo
2018-06-01
Meta-analysis is now an essential tool for genetic association studies, allowing them to combine large studies and greatly accelerating the pace of genetic discovery. Although the standard meta-analysis methods perform equivalently as the more cumbersome joint analysis under ideal settings, they result in substantial power loss under unbalanced settings with various case-control ratios. Here, we investigate the power loss problem by the standard meta-analysis methods for unbalanced studies, and further propose novel meta-analysis methods performing equivalently to the joint analysis under both balanced and unbalanced settings. We derive improved meta-score-statistics that can accurately approximate the joint-score-statistics with combined individual-level data, for both linear and logistic regression models, with and without covariates. In addition, we propose a novel approach to adjust for population stratification by correcting for known population structures through minor allele frequencies. In the simulated gene-level association studies under unbalanced settings, our method recovered up to 85% power loss caused by the standard methods. We further showed the power gain of our methods in gene-level tests with 26 unbalanced studies of age-related macular degeneration . In addition, we took the meta-analysis of three unbalanced studies of type 2 diabetes as an example to discuss the challenges of meta-analyzing multi-ethnic samples. In summary, our improved meta-score-statistics with corrections for population stratification can be used to construct both single-variant and gene-level association studies, providing a useful framework for ensuring well-powered, convenient, cross-study analyses. © 2018 WILEY PERIODICALS, INC.
Naaijen, J; Bralten, J; Poelmans, G; Glennon, J C; Franke, B; Buitelaar, J K
2017-01-10
Attention-deficit/hyperactivity disorder (ADHD) and autism spectrum disorders (ASD) often co-occur. Both are highly heritable; however, it has been difficult to discover genetic risk variants. Glutamate and GABA are main excitatory and inhibitory neurotransmitters in the brain; their balance is essential for proper brain development and functioning. In this study we investigated the role of glutamate and GABA genetics in ADHD severity, autism symptom severity and inhibitory performance, based on gene set analysis, an approach to investigate multiple genetic variants simultaneously. Common variants within glutamatergic and GABAergic genes were investigated using the MAGMA software in an ADHD case-only sample (n=931), in which we assessed ASD symptoms and response inhibition on a Stop task. Gene set analysis for ADHD symptom severity, divided into inattention and hyperactivity/impulsivity symptoms, autism symptom severity and inhibition were performed using principal component regression analyses. Subsequently, gene-wide association analyses were performed. The glutamate gene set showed an association with severity of hyperactivity/impulsivity (P=0.009), which was robust to correcting for genome-wide association levels. The GABA gene set showed nominally significant association with inhibition (P=0.04), but this did not survive correction for multiple comparisons. None of single gene or single variant associations was significant on their own. By analyzing multiple genetic variants within candidate gene sets together, we were able to find genetic associations supporting the involvement of excitatory and inhibitory neurotransmitter systems in ADHD and ASD symptom severity in ADHD.
Vancamelbeke, Maaike; Vanuytsel, Tim; Farré, Ricard; Verstockt, Sare; Ferrante, Marc; Van Assche, Gert; Rutgeerts, Paul; Schuit, Frans; Vermeire, Séverine; Arijs, Ingrid; Cleynen, Isabelle
2017-10-01
Intestinal barrier defects are common in patients with inflammatory bowel disease (IBD). To identify which components could underlie these changes, we performed an in-depth analysis of epithelial barrier genes in IBD. A set of 128 intestinal barrier genes was selected. Polygenic risk scores were generated based on selected barrier gene variants that were associated with Crohn's disease (CD) or ulcerative colitis (UC) in our study. Gene expression was analyzed using microarray and quantitative reverse transcription polymerase chain reaction. Influence of barrier gene variants on expression was studied by cis-expression quantitative trait loci mapping and comparing patients with low- and high-risk scores. Barrier risk scores were significantly higher in patients with IBD than controls. At single-gene level, the associated barrier single-nucleotide polymorphisms were most significantly enriched in PTGER4 for CD and HNF4A for UC. As a group, the regulating proteins were most enriched for CD and UC. Expression analysis showed that many epithelial barrier genes were significantly dysregulated in active CD and UC, with overrepresentation of mucus layer genes. In uninflamed CD ileum and IBD colon, most barrier gene levels restored to normal, except for MUC1 and MUC4 that remained persistently increased compared with controls. Expression levels did not depend on cis-regulatory variants nor combined genetic risk. We found genetic and transcriptomic dysregulations of key epithelial barrier genes and components in IBD. Of these, we believe that mucus genes, in particular MUC1 and MUC4, play an essential role in the pathogenesis of IBD and could represent interesting targets for treatment.
Cui, Lipeng; Qiu, Zhengkun; Wang, Zhirong; Gao, Jianchang; Guo, Yanmei; Huang, Zejun; Du, Yongchen; Wang, Xiaoxuan
2017-01-01
The hydrophobic cuticle that covers the surface of tomato (Solanum lycopersicum) fruit plays key roles in development and protection against biotic and abiotic stresses, including water loss, mechanical damage, UV radiation, pathogens, and pests. However, many details of the genes and regulatory mechanisms involved in cuticle biosynthesis in fleshy fruits are not well understood. In this study, we describe a novel tomato fruit phenotype, characterized by epidermal reticulation (ER) of green fruit and a higher water loss rate than wild type (WT) fruit. The ER phenotype is controlled by a single gene, ER4.1, derived from an introgressed chromosomal segment from the wild tomato species S. pennellii (LA0716). We performed fine mapping of the single dominant gene to an ~300 kb region and identified Solyc04g082540, Solyc04g082950, Solyc04g082630, and Solyc04g082910as potential candidate genes for the ER4.1 locus, based on comparative RNA-seq analysis of ER and WT fruit peels. In addition, the transcriptome analysis revealed that the expression levels of genes involved in cutin, wax and flavonoid biosynthesis were altered in the ER fruit compared with WT. This study provides new insights into the regulatory mechanisms and metabolism of the fruit cuticle. PMID:28798753
Sarkar, F H; Valdivieso, M; Borders, J; Yao, K L; Raval, M M; Madan, S K; Sreepathi, P; Shimoyama, R; Steiger, Z; Visscher, D W
1995-12-01
The p53 tumor suppressor gene has been found to be altered in almost all human solid tumors, whereas K-ras gene mutations have been observed in a limited number of human cancers (adenocarcinoma of colon, pancreas, and lung). Studies of mutational inactivation for both genes in the same patient's sample on non-small-cell lung cancer have been limited. In an effort to perform such an analysis, we developed and compared methods (for the mutational detection of p53 and K-ras gene) that represent a modified and universal protocol, in terms of DNA extraction, polymerase chain reaction (PCR) amplification, and nonradioisotopic PCR-single-strand conformation polymorphism (PCR-SSCP) analysis, which is readily applicable to either formalin-fixed, paraffin-embedded tissues or frozen tumor specimens. We applied this method to the evaluation of p53 (exons 5-8) and K-ras (codon 12 and 13) gene mutations in 55 cases of non-small-cell lung cancer. The mutational status in the p53 gene was evaluated by radioisotopic PCR-SSCP and compared with PCR-SSCP utilizing our standardized nonradioisotopic detection system using a single 6-microns tissue section. The mutational patterns observed by PCR-SSCP were subsequently confirmed by PCR-DNA sequencing. The mutational status in the K-ras gene was similarly evaluated by PCR-SSCP, and the specific mutation was confirmed by Southern slot-blot hybridization using 32P-labeled sequence-specific oligonucleotide probes for codons 12 and 13. Mutational changes in K-ras (codon 12) were found in 10 of 55 (18%) of non-small-cell lung cancers. Whereas adenocarcinoma showed K-ras mutation in 33% of the cases at codon 12, only one mutation was found at codon 13. As expected, squamous cell carcinoma samples (25 cases) did not show K-ras mutations. Mutations at exons 5-8 of the p53 gene were documented in 19 of 55 (34.5%) cases. Ten of the 19 mutations were single nucleotide point mutations, leading to amino acid substitution. Six showed insertional mutation, and three showed deletion mutations. Only three samples showed mutations of both K-ras and p53 genes. We conclude that although K-ras and p53 gene mutations are frequent in non-small-cell lung cancer, mutations of both genes in the same patient's samples are not common. We also conclude that this universal nonradioisotopic method is superior to other similar methods and is readily applicable to the rapid screening of large numbers of formalin-fixed, paraffin-embedded or frozen samples for the mutational analysis of multiple genes.
Walker, Louise A.; Martin-Yken, Hélène; Dague, Etienne; Legrand, Mélanie; Lee, Keunsook; Chauvel, Murielle; Firon, Arnaud; Rossignol, Tristan; Richard, Mathias L.; Munro, Carol A.; Bachellier-Bassi, Sophie; d'Enfert, Christophe
2014-01-01
Biofilm formation is an important virulence trait of the pathogenic yeast Candida albicans. We have combined gene overexpression, strain barcoding and microarray profiling to screen a library of 531 C. albicans conditional overexpression strains (∼10% of the genome) for genes affecting biofilm development in mixed-population experiments. The overexpression of 16 genes increased strain occupancy within a multi-strain biofilm, whereas overexpression of 4 genes decreased it. The set of 16 genes was significantly enriched for those encoding predicted glycosylphosphatidylinositol (GPI)-modified proteins, namely Ihd1/Pga36, Phr2, Pga15, Pga19, Pga22, Pga32, Pga37, Pga42 and Pga59; eight of which have been classified as pathogen-specific. Validation experiments using either individually- or competitively-grown overexpression strains revealed that the contribution of these genes to biofilm formation was variable and stage-specific. Deeper functional analysis of PGA59 and PGA22 at a single-cell resolution using atomic force microscopy showed that overexpression of either gene increased C. albicans ability to adhere to an abiotic substrate. However, unlike PGA59, PGA22 overexpression led to cell cluster formation that resulted in increased sensitivity to shear forces and decreased ability to form a single-strain biofilm. Within the multi-strain environment provided by the PGA22-non overexpressing cells, PGA22-overexpressing cells were protected from shear forces and fitter for biofilm development. Ultrastructural analysis, genome-wide transcript profiling and phenotypic analyses in a heterologous context suggested that PGA22 affects cell adherence through alteration of cell wall structure and/or function. Taken together, our findings reveal that several novel predicted GPI-modified proteins contribute to the cooperative behaviour between biofilm cells and are important participants during C. albicans biofilm formation. Moreover, they illustrate the power of using signature tagging in conjunction with gene overexpression for the identification of novel genes involved in processes pertaining to C. albicans virulence. PMID:25502890
Pang, Xiuhua; Aigle, Bertrand; Girardet, Jean-Michel; Mangenot, Sophie; Pernodet, Jean-Luc; Decaris, Bernard; Leblond, Pierre
2004-01-01
Streptomyces ambofaciens has an 8-Mb linear chromosome ending in 200-kb terminal inverted repeats. Analysis of the F6 cosmid overlapping the terminal inverted repeats revealed a locus similar to type II polyketide synthase (PKS) gene clusters. Sequence analysis identified 26 open reading frames, including genes encoding the β-ketoacyl synthase (KS), chain length factor (CLF), and acyl carrier protein (ACP) that make up the minimal PKS. These KS, CLF, and ACP subunits are highly homologous to minimal PKS subunits involved in the biosynthesis of angucycline antibiotics. The genes encoding the KS and ACP subunits are transcribed constitutively but show a remarkable increase in expression after entering transition phase. Five genes, including those encoding the minimal PKS, were replaced by resistance markers to generate single and double mutants (replacement in one and both terminal inverted repeats). Double mutants were unable to produce either diffusible orange pigment or antibacterial activity against Bacillus subtilis. Single mutants showed an intermediate phenotype, suggesting that each copy of the cluster was functional. Transformation of double mutants with a conjugative and integrative form of F6 partially restored both phenotypes. The pigmented and antibacterial compounds were shown to be two distinct molecules produced from the same biosynthetic pathway. High-pressure liquid chromatography analysis of culture extracts from wild-type and double mutants revealed a peak with an associated bioactivity that was absent from the mutants. Two additional genes encoding KS and CLF were present in the cluster. However, disruption of the second KS gene had no effect on either pigment or antibiotic production. PMID:14742212
2013-12-18
include interactive gene and methylation profiles, interactive heatmaps, cytoscape network views, integrative genomics viewer ( IGV ), and protein-protein...single chart. The website also provides an option to include multiple genes. Integrative Genomics Viewer ( IGV )1, is a high-performance desktop tool for
Gene expression in the tanoak-Phytophthora ramorum interaction
Katherine J. Hayden; Matteo Garbelotto; Hardeep Fai; Brian Knaus; Richard Cronn; Jessica W. Wright
2012-01-01
Disease processes are dynamic, involving a suite of gene expression changes in both the host and the pathogen, all within a single tissue. As such, they lend themselves well to transcriptomic analysis. Here we focus on a generalist invasive pathogen (Phytophthora ramorum) and its most susceptible California Floristic Province native host, tanoak (...
USDA-ARS?s Scientific Manuscript database
The family Rutaceae encompasses several genera including the economically important genus Citrus. In this study, we selected 22 citrus relatives belonging to the various sub groups of Rutaceae and compared the sequences of three gene fragments. The accessions selected belong to the subfamily Rutoide...
Genetic Analysis of Haploids from Industrial Strains of Baker's Yeast
Oda, Yuji; Ouchi, Kozo
1989-01-01
Strains of baker's yeast conventionally used by the baking industry in Japan were tested for the ability to sporulate and produce viable haploid spores. Three isolates which possessed the properties of baker's yeasts were obtained from single spores. Each strain was a haploid, and one of these strains, YOY34, was characterized. YOY34 fermented maltose and sucrose, but did not utilize galactose, unlike its parental strain. Genetic analysis showed that YOY34 carried two MAL genes, one functional and one cryptic; two SUC genes; and one defective gal gene. The genotype of YOY34 was identified as MATα MAL1 MAL3g SUC2 SUC4 gall. The MAL1 gene from this haploid was constitutively expressed, was dominant over other wild-type MAL tester genes, and gave a weak sucrose fermentation. YOY34 was suitable for both bakery products, like conventional baker's yeasts, and for genetic analysis, like laboratory strains. PMID:16347967
McCormick, Mark A; Delaney, Joe R; Tsuchiya, Mitsuhiro; Tsuchiyama, Scott; Shemorry, Anna; Sim, Sylvia; Chou, Annie Chia-Zong; Ahmed, Umema; Carr, Daniel; Murakami, Christopher J; Schleit, Jennifer; Sutphin, George L; Wasko, Brian M; Bennett, Christopher F; Wang, Adrienne M; Olsen, Brady; Beyer, Richard P; Bammler, Theodor K; Prunkard, Donna; Johnson, Simon C; Pennypacker, Juniper K; An, Elroy; Anies, Arieanna; Castanza, Anthony S; Choi, Eunice; Dang, Nick; Enerio, Shiena; Fletcher, Marissa; Fox, Lindsay; Goswami, Sarani; Higgins, Sean A; Holmberg, Molly A; Hu, Di; Hui, Jessica; Jelic, Monika; Jeong, Ki-Soo; Johnston, Elijah; Kerr, Emily O; Kim, Jin; Kim, Diana; Kirkland, Katie; Klum, Shannon; Kotireddy, Soumya; Liao, Eric; Lim, Michael; Lin, Michael S; Lo, Winston C; Lockshon, Dan; Miller, Hillary A; Moller, Richard M; Muller, Brian; Oakes, Jonathan; Pak, Diana N; Peng, Zhao Jun; Pham, Kim M; Pollard, Tom G; Pradeep, Prarthana; Pruett, Dillon; Rai, Dilreet; Robison, Brett; Rodriguez, Ariana A; Ros, Bopharoth; Sage, Michael; Singh, Manpreet K; Smith, Erica D; Snead, Katie; Solanky, Amrita; Spector, Benjamin L; Steffen, Kristan K; Tchao, Bie Nga; Ting, Marc K; Vander Wende, Helen; Wang, Dennis; Welton, K Linnea; Westman, Eric A; Brem, Rachel B; Liu, Xin-Guang; Suh, Yousin; Zhou, Zhongjun; Kaeberlein, Matt; Kennedy, Brian K
2015-11-03
Many genes that affect replicative lifespan (RLS) in the budding yeast Saccharomyces cerevisiae also affect aging in other organisms such as C. elegans and M. musculus. We performed a systematic analysis of yeast RLS in a set of 4,698 viable single-gene deletion strains. Multiple functional gene clusters were identified, and full genome-to-genome comparison demonstrated a significant conservation in longevity pathways between yeast and C. elegans. Among the mechanisms of aging identified, deletion of tRNA exporter LOS1 robustly extended lifespan. Dietary restriction (DR) and inhibition of mechanistic Target of Rapamycin (mTOR) exclude Los1 from the nucleus in a Rad53-dependent manner. Moreover, lifespan extension from deletion of LOS1 is nonadditive with DR or mTOR inhibition, and results in Gcn4 transcription factor activation. Thus, the DNA damage response and mTOR converge on Los1-mediated nuclear tRNA export to regulate Gcn4 activity and aging. Copyright © 2015 Elsevier Inc. All rights reserved.
Molecular and immunohistochemical analysis of P53 in phaeochromocytoma.
Dahia, P. L.; Aguiar, R. C.; Tsanaclis, A. M.; Bendit, I.; Bydlowski, S. P.; Abelin, N. M.; Toledo, S. P.
1995-01-01
We searched for mutations of the p53 gene in 25 phaeochromocytomas using polymerase chain reaction-single strand conformation polymorphism (PCR-SSCP) analysis of the entire conserved region of the gene, encompassing exons 4-8; expression of the p53 protein was assessed by immunohistochemistry. No mutations were found, while a polymorphism in codon 72 was observed. Immunohistochemistry revealed nuclear p53 overexpression in one tumour sample. We conclude that mutations of the 'hotspot' region of the p53 gene do not seem to play a role in the pathogenesis of phaeochromocytoma. Images Figure 1 Figure 2 Figure 3 PMID:7577469
Extensive complementarity between gene function prediction methods.
Vidulin, Vedrana; Šmuc, Tomislav; Supek, Fran
2016-12-01
The number of sequenced genomes rises steadily but we still lack the knowledge about the biological roles of many genes. Automated function prediction (AFP) is thus a necessity. We hypothesized that AFP approaches that draw on distinct genome features may be useful for predicting different types of gene functions, motivating a systematic analysis of the benefits gained by obtaining and integrating such predictions. Our pipeline amalgamates 5 133 543 genes from 2071 genomes in a single massive analysis that evaluates five established genomic AFP methodologies. While 1227 Gene Ontology (GO) terms yielded reliable predictions, the majority of these functions were accessible to only one or two of the methods. Moreover, different methods tend to assign a GO term to non-overlapping sets of genes. Thus, inferences made by diverse genomic AFP methods display a striking complementary, both gene-wise and function-wise. Because of this, a viable integration strategy is to rely on a single most-confident prediction per gene/function, rather than enforcing agreement across multiple AFP methods. Using an information-theoretic approach, we estimate that current databases contain 29.2 bits/gene of known Escherichia coli gene functions. This can be increased by up to 5.5 bits/gene using individual AFP methods or by 11 additional bits/gene upon integration, thereby providing a highly-ranking predictor on the Critical Assessment of Function Annotation 2 community benchmark. Availability of more sequenced genomes boosts the predictive accuracy of AFP approaches and also the benefit from integrating them. The individual and integrated GO predictions for the complete set of genes are available from http://gorbi.irb.hr/ CONTACT: fran.supek@irb.hrSupplementary information: Supplementary materials are available at Bioinformatics online. © The Author 2016. Published by Oxford University Press. All rights reserved. For Permissions, please e-mail: journals.permissions@oup.com.
Parallel gene analysis with allele-specific padlock probes and tag microarrays
Banér, Johan; Isaksson, Anders; Waldenström, Erik; Jarvius, Jonas; Landegren, Ulf; Nilsson, Mats
2003-01-01
Parallel, highly specific analysis methods are required to take advantage of the extensive information about DNA sequence variation and of expressed sequences. We present a scalable laboratory technique suitable to analyze numerous target sequences in multiplexed assays. Sets of padlock probes were applied to analyze single nucleotide variation directly in total genomic DNA or cDNA for parallel genotyping or gene expression analysis. All reacted probes were then co-amplified and identified by hybridization to a standard tag oligonucleotide array. The technique was illustrated by analyzing normal and pathogenic variation within the Wilson disease-related ATP7B gene, both at the level of DNA and RNA, using allele-specific padlock probes. PMID:12930977
Non-invasive prenatal testing for single gene disorders: exploring the ethics.
Deans, Zuzana; Hill, Melissa; Chitty, Lyn S; Lewis, Celine
2013-07-01
Non-invasive prenatal testing for single gene disorders is now clearly on the horizon. This new technology offers obvious clinical benefits such as safe testing early in pregnancy. Before widespread implementation, it is important to consider the possible ethical implications. Four hypothetical scenarios are presented that highlight how ethical ideals of respect for autonomy, privacy and fairness may come into play when offering non-invasive prenatal testing for single gene disorders. The first scenario illustrates the moral case for using these tests for 'information only', identifying a potential conflict between larger numbers of women seeking the benefits of the test and the wider social impact of funding tests that do not offer immediate clinical benefit. The second scenario shows how the simplicity and safety of non-invasive prenatal testing could lead to more autonomous decision-making and, conversely, how this could also lead to increased pressure on women to take up testing. In the third scenario we show how, unless strong safeguards are put in place, offering non-invasive prenatal testing could be subject to routinisation with informed consent undermined and that woman who are newly diagnosed as carriers may be particularly vulnerable. The final scenario introduces the possibility of a conflict of the moral rights of a woman and her partner through testing for single gene disorders. This analysis informs our understanding of the potential impacts of non-invasive prenatal testing for single gene disorders on clinical practice and has implications for future policy and guidelines for prenatal care.
High-throughput microfluidics to control and measure signaling dynamics in single yeast cells
Hansen, Anders S.; Hao, Nan; O'Shea, Erin K.
2015-01-01
Microfluidics coupled to quantitative time-lapse fluorescence microscopy is transforming our ability to control, measure, and understand signaling dynamics in single living cells. Here we describe a pipeline that incorporates multiplexed microfluidic cell culture, automated programmable fluid handling for cell perturbation, quantitative time-lapse microscopy, and computational analysis of time-lapse movies. We illustrate how this setup can be used to control the nuclear localization of the budding yeast transcription factor Msn2. Using this protocol, we generate oscillations of Msn2 localization and measure the dynamic gene expression response of individual genes in single cells. The protocol allows a single researcher to perform up to 20 different experiments in a single day, whilst collecting data for thousands of single cells. Compared to other protocols, the present protocol is relatively easy to adopt and higher-throughput. The protocol can be widely used to control and monitor single-cell signaling dynamics in other signal transduction systems in microorganisms. PMID:26158443
Dawes, Piers; Platt, Hazel; Horan, Michael; Ollier, William; Munro, Kevin; Pendleton, Neil; Payton, Antony
2015-01-01
Age-related hearing loss has a genetic component, but there have been limited genetic studies in this field. Both N-acetyltransferase 2 and apolipoprotein E genes have previously been associated. However, these studies have either used small sample sizes, examined a limited number of polymorphisms, or have produced conflicting results. Here we use a haplotype tagging approach to determine association with age-related hearing loss and investigate epistasis between these two genes. Candidate gene association study of a continuous phenotype. We investigated haplotype tagging single nucleotide polymorphisms in the N-acetyltransferase 2 gene and the presence/absence of the apolipoprotein E ε4 allele for association with age-related hearing loss in a cohort of 265 Caucasian elderly volunteers from Greater Manchester, United Kingdom. Hearing phenotypes were generated using principal component analysis of the hearing threshold levels for the better ear (severity, slope, and concavity). Genotype data for the N-acetyltransferase 2 gene was obtained from existing genome-wide association study data from the Illumina 610-Quadv1 chip. Apolipoprotein E genotyping was performed using Sequenom technology. Linear regression analysis was performed using Plink and Stata software. No significant associations (P value, > 0.05) were observed between the N-acetyltransferase 2 or apolipoprotein E gene polymorphisms and any hearing factor. No significant association was observed for epistasis analysis of apolipoprotein E ε4 and the N-acetyltransferase 2 single nucleotide polymorphism rs1799930 (NAT2*6A). We found no evidence to support that either N-acetyltransferase 2 or apolipoprotein E gene polymorphisms are associated with age-related hearing loss in a cohort of 265 elderly volunteers. © 2014 The American Laryngological, Rhinological and Otological Society, Inc.
Identifying stochastic oscillations in single-cell live imaging time series using Gaussian processes
Manning, Cerys; Rattray, Magnus
2017-01-01
Multiple biological processes are driven by oscillatory gene expression at different time scales. Pulsatile dynamics are thought to be widespread, and single-cell live imaging of gene expression has lead to a surge of dynamic, possibly oscillatory, data for different gene networks. However, the regulation of gene expression at the level of an individual cell involves reactions between finite numbers of molecules, and this can result in inherent randomness in expression dynamics, which blurs the boundaries between aperiodic fluctuations and noisy oscillators. This underlies a new challenge to the experimentalist because neither intuition nor pre-existing methods work well for identifying oscillatory activity in noisy biological time series. Thus, there is an acute need for an objective statistical method for classifying whether an experimentally derived noisy time series is periodic. Here, we present a new data analysis method that combines mechanistic stochastic modelling with the powerful methods of non-parametric regression with Gaussian processes. Our method can distinguish oscillatory gene expression from random fluctuations of non-oscillatory expression in single-cell time series, despite peak-to-peak variability in period and amplitude of single-cell oscillations. We show that our method outperforms the Lomb-Scargle periodogram in successfully classifying cells as oscillatory or non-oscillatory in data simulated from a simple genetic oscillator model and in experimental data. Analysis of bioluminescent live-cell imaging shows a significantly greater number of oscillatory cells when luciferase is driven by a Hes1 promoter (10/19), which has previously been reported to oscillate, than the constitutive MoMuLV 5’ LTR (MMLV) promoter (0/25). The method can be applied to data from any gene network to both quantify the proportion of oscillating cells within a population and to measure the period and quality of oscillations. It is publicly available as a MATLAB package. PMID:28493880
Simon, Marissa; Bruex, Angela; Kainkaryam, Raghunandan M.; Zheng, Xiaohua; Huang, Ling; Woolf, Peter J.; Schiefelbein, John
2013-01-01
Traditional genetic analysis relies on mutants with observable phenotypes. Mutants lacking visible abnormalities may nevertheless exhibit molecular differences useful for defining gene function. To examine this, we analyzed tissue-specific transcript profiles from Arabidopsis thaliana transcription factor gene mutants with known roles in root epidermis development, but lacking a single-gene mutant phenotype due to genetic redundancy. We discovered substantial transcriptional changes in each mutant, preferentially affecting root epidermal genes in a manner consistent with the known double mutant effects. Furthermore, comparing transcript profiles of single and double mutants, we observed remarkable variation in the sensitivity of target genes to the loss of one or both paralogous genes, including preferential effects on specific branches of the epidermal gene network, likely reflecting the pathways of paralog subfunctionalization during evolution. In addition, we analyzed the root epidermal transcriptome of the transparent testa glabra2 mutant to clarify its role in the network. These findings provide insight into the molecular basis of genetic redundancy and duplicate gene diversification at the level of a specific gene regulatory network, and they demonstrate the usefulness of tissue-specific transcript profiling to define gene function in mutants lacking informative visible changes in phenotype. PMID:24014549
Pan-Cancer Analysis of Mutation Hotspots in Protein Domains.
Miller, Martin L; Reznik, Ed; Gauthier, Nicholas P; Aksoy, Bülent Arman; Korkut, Anil; Gao, Jianjiong; Ciriello, Giovanni; Schultz, Nikolaus; Sander, Chris
2015-09-23
In cancer genomics, recurrence of mutations in independent tumor samples is a strong indicator of functional impact. However, rare functional mutations can escape detection by recurrence analysis owing to lack of statistical power. We enhance statistical power by extending the notion of recurrence of mutations from single genes to gene families that share homologous protein domains. Domain mutation analysis also sharpens the functional interpretation of the impact of mutations, as domains more succinctly embody function than entire genes. By mapping mutations in 22 different tumor types to equivalent positions in multiple sequence alignments of domains, we confirm well-known functional mutation hotspots, identify uncharacterized rare variants in one gene that are equivalent to well-characterized mutations in another gene, detect previously unknown mutation hotspots, and provide hypotheses about molecular mechanisms and downstream effects of domain mutations. With the rapid expansion of cancer genomics projects, protein domain hotspot analysis will likely provide many more leads linking mutations in proteins to the cancer phenotype. Copyright © 2015 Elsevier Inc. All rights reserved.
Salvianti, Francesca; Rotunno, Giada; Galardi, Francesca; De Luca, Francesca; Pestrin, Marta; Vannucchi, Alessandro Maria; Di Leo, Angelo; Pazzagli, Mario; Pinzani, Pamela
2015-09-01
The purpose of the study was to explore the feasibility of a protocol for the isolation and molecular characterization of single circulating tumor cells (CTCs) from cancer patients using a single-cell next generation sequencing (NGS) approach. To reach this goal we used as a model an artificial sample obtained by spiking a breast cancer cell line (MDA-MB-231) into the blood of a healthy donor. Tumor cells were enriched and enumerated by CellSearch(®) and subsequently isolated by DEPArray™ to obtain single or pooled pure samples to be submitted to the analysis of the mutational status of multiple genes involved in cancer. Upon whole genome amplification, samples were analysed by NGS on the Ion Torrent PGM™ system (Life Technologies) using the Ion AmpliSeq™ Cancer Hotspot Panel v2 (Life Technologies), designed to investigate genomic "hot spot" regions of 50 oncogenes and tumor suppressor genes. We successfully sequenced five single cells, a pool of 5 cells and DNA from a cellular pellet of the same cell line with a mean depth of the sequencing reaction ranging from 1581 to 3479 reads. We found 27 sequence variants in 18 genes, 15 of which already reported in the COSMIC or dbSNP databases. We confirmed the presence of two somatic mutations, in the BRAF and TP53 gene, which had been already reported for this cells line, but also found new mutations and single nucleotide polymorphisms. Three variants were common to all the analysed samples, while 18 were present only in a single cell suggesting a high heterogeneity within the same cell line. This paper presents an optimized workflow for the molecular characterization of multiple genes in single cells by NGS. The described pipeline can be easily transferred to the study of single CTCs from oncologic patients.
Martin, Eden R.; Lai, Eric H.; Gilbert, John R.; Rogala, Allison R.; Afshari, A. J.; Riley, John; Finch, K. L.; Stevens, J. F.; Livak, K. J.; Slotterbeck, Brandon D.; Slifer, Susan H.; Warren, Liling L.; Conneally, P. Michael; Schmechel, Donald E.; Purvis, Ian; Pericak-Vance, Margaret A.; Roses, Allen D.; Vance, Jeffery M.
2000-01-01
There has been great interest in the prospects of using single-nucleotide polymorphisms (SNPs) in the search for complex disease genes, and several initiatives devoted to the identification and mapping of SNPs throughout the human genome are currently underway. However, actual data investigating the use of SNPs for identification of complex disease genes are scarce. To begin to look at issues surrounding the use of SNPs in complex disease studies, we have initiated a collaborative SNP mapping study around APOE, the well-established susceptibility gene for late-onset Alzheimer disease (AD). Sixty SNPs in a 1.5-Mb region surrounding APOE were genotyped in samples of unrelated cases of AD, in controls, and in families with AD. Standard tests were conducted to look for association of SNP alleles with AD, in cases and controls. We also used family-based association analyses, including recently developed methods to look for haplotype association. Evidence of association (P⩽.05) was identified for 7 of 13 SNPs, including the APOE-4 polymorphism, spanning 40 kb on either side of APOE. As expected, very strong evidence for association with AD was seen for the APOE-4 polymorphism, as well as for two other SNPs that lie <16 kb from APOE. Haplotype analysis using family data increased significance over that seen in single-locus tests for some of the markers, and, for these data, improved localization of the gene. Our results demonstrate that associations can be detected at SNPs near a complex disease gene. We found that a high density of markers will be necessary in order to have a good chance of including SNPs with detectable levels of allelic association with the disease mutation, and statistical analysis based on haplotypes can provide additional information with respect to tests of significance and fine localization of complex disease genes. PMID:10869235
Martin, E R; Lai, E H; Gilbert, J R; Rogala, A R; Afshari, A J; Riley, J; Finch, K L; Stevens, J F; Livak, K J; Slotterbeck, B D; Slifer, S H; Warren, L L; Conneally, P M; Schmechel, D E; Purvis, I; Pericak-Vance, M A; Roses, A D; Vance, J M
2000-08-01
There has been great interest in the prospects of using single-nucleotide polymorphisms (SNPs) in the search for complex disease genes, and several initiatives devoted to the identification and mapping of SNPs throughout the human genome are currently underway. However, actual data investigating the use of SNPs for identification of complex disease genes are scarce. To begin to look at issues surrounding the use of SNPs in complex disease studies, we have initiated a collaborative SNP mapping study around APOE, the well-established susceptibility gene for late-onset Alzheimer disease (AD). Sixty SNPs in a 1.5-Mb region surrounding APOE were genotyped in samples of unrelated cases of AD, in controls, and in families with AD. Standard tests were conducted to look for association of SNP alleles with AD, in cases and controls. We also used family-based association analyses, including recently developed methods to look for haplotype association. Evidence of association (P=.05) was identified for 7 of 13 SNPs, including the APOE-4 polymorphism, spanning 40 kb on either side of APOE. As expected, very strong evidence for association with AD was seen for the APOE-4 polymorphism, as well as for two other SNPs that lie <16 kb from APOE. Haplotype analysis using family data increased significance over that seen in single-locus tests for some of the markers, and, for these data, improved localization of the gene. Our results demonstrate that associations can be detected at SNPs near a complex disease gene. We found that a high density of markers will be necessary in order to have a good chance of including SNPs with detectable levels of allelic association with the disease mutation, and statistical analysis based on haplotypes can provide additional information with respect to tests of significance and fine localization of complex disease genes.
Gaffoor, Iffa; Brown, Daren W.; Plattner, Ron; Proctor, Robert H.; Qi, Weihong; Trail, Frances
2005-01-01
Polyketides are a class of secondary metabolites that exhibit a vast diversity of form and function. In fungi, these compounds are produced by large, multidomain enzymes classified as type I polyketide synthases (PKSs). In this study we identified and functionally disrupted 15 PKS genes from the genome of the filamentous fungus Gibberella zeae. Five of these genes are responsible for producing the mycotoxins zearalenone, aurofusarin, and fusarin C and the black perithecial pigment. A comprehensive expression analysis of the 15 genes revealed diverse expression patterns during grain colonization, plant colonization, sexual development, and mycelial growth. Expression of one of the PKS genes was not detected under any of 18 conditions tested. This is the first study to genetically characterize a complete set of PKS genes from a single organism. PMID:16278459
The complete chloroplast genome sequence of Chikusichloa aquatica (Poaceae: Oryzeae).
Zhang, Jie; Zhang, Dan; Shi, Chao; Gao, Ju; Gao, Li-Zhi
2016-07-01
The complete chloroplast sequence of the Chikusichloa aquatica was determined in this study. The genome consists of 136 563 bp containing a pair of inverted repeats (IRs) of 20 837 bp, which was separated by a large single-copy region and a small single-copy region of 82 315 bp and 33 411 bp, respectively. The C. aquatica cp genome encodes 111 functional genes (71 protein-coding genes, four rRNA genes, and 36 tRNA genes): 92 are unique, while 19 are duplicated in the IR regions. The genic regions account for 58.9% of whole cp genome, and the GC content of the plastome is 39.0%. A phylogenomic analysis showed that C. aquatica is closely related to Rhynchoryza subulata that belongs to the tribe Oryzeae.
Pirulli, D; Giordano, M; Lessi, M; Spanò, A; Puzzer, D; Zezlina, S; Boniotto, M; Crovella, S; Florian, F; Marangella, M; Momigliano-Richiardi, P; Savoldi, S; Amoroso, A
2001-06-01
Primary hyperoxaluria type 1 is an autosomal recessive disorder of glyoxylate metabolism, caused by a deficiency of alanine:glyoxylate aminotransferase, which is encoded by a single copy gene (AGXT. The aim of this research was to standardize denaturing high-performance liquid chromatography, a new, sensitive, relatively inexpensive, and automated technique, for the detection of AGXT mutation. Denaturing high-performance liquid chromatography was used to analyze in blind the AGXT gene in 20 unrelated Italian patients with primary hyperoxaluria type I previously studied by other standard methods (single-strand conformation polymorphism analysis and direct sequencing) and 50 controls. Denaturing high-performance liquid chromatography allowed us to identify 13 mutations and the polymorphism at position 154 in exon I of the AGXT gene. Hence the method is more sensitive and less time consuming than single-strand conformation polymorphism analysis for the detection of AGXT mutations, thus representing a useful and reliable tool for detecting the mutations responsible for primary hyperoxaluria type 1. The new technology could also be helpful in the search for healthy carriers of AGXT mutations amongst family members and their partners, and for screening of AGXT polymorphisms in patients with nephrolithiasis and healthy populations.
Yamamoto, K; Oda, Y; Haseda, A; Fujito, S; Mikami, T; Onodera, Y
2014-01-01
Spinach (Spinacia oleracea L.) is widely known to be dioecious. However, monoecious plants can also occur in this species. Sex expression in dioecious spinach plants is controlled by a single gene pair termed X and Y. Our previous study showed that a single, incompletely dominant gene, which controls the monoecious condition in spinach line 03–336, should be allelic or linked to X/Y. Here, we developed 19 AFLP markers closely linked to the monoecious gene. The AFLP markers were mapped to a 38.2-cM chromosomal region that included the monoecious gene, which is bracketed between flanking markers with a distance of 7.1 cM. The four AFLP markers developed in our studies were converted into sequence-characterized amplified region (SCAR) markers, which are linked to both the monoecious gene and Y and are common to both populations segregating for the genes. Linkage analysis using the SCAR markers suggested that the monoecious gene (M) and Y are located in different intervals, between different marker pairs. Analysis of populations segregating for both M and Y also directly demonstrates linkage of the genes at a distance of ∼12 cM. The data presented in this study may be useful for breeding dioecious and highly male monoecious lines utilized as the pollen parents for hybrid seed production, as well as for studies of the evolutionary history of sexual systems in this species, and can provide a molecular basis for positional cloning of the sex-determining genes. PMID:24169648
Using a periclinal chimera to unravel layer-specific gene expression in plants
Filippis, Ioannis; Lopez-Cobollo, Rosa; Abbott, James; Butcher, Sarah; Bishop, Gerard J
2013-01-01
Plant organs are made from multiple cell types, and defining the expression level of a gene in any one cell or group of cells from a complex mixture is difficult. Dicotyledonous plants normally have three distinct layers of cells, L1, L2 and L3. Layer L1 is the single layer of cells making up the epidermis, layer L2 the single cell sub-epidermal layer and layer L3 constitutes the rest of the internal cells. Here we show how it is possible to harvest an organ and characterise the level of layer-specific expression by using a periclinal chimera that has its L1 layer from Solanum pennellii and its L2 and L3 layers from Solanum lycopersicum. This is possible by measuring the level of the frequency of species-specific transcripts. RNA-seq analysis enabled the genome-wide assessment of whether a gene is expressed in the L1 or L2/L3 layers. From 13 277 genes that are expressed in both the chimera and the parental lines and with at least one polymorphism between the parental alleles, we identified 382 genes that are preferentially expressed in L1 in contrast to 1159 genes in L2/L3. Gene ontology analysis shows that many genes preferentially expressed in L1 are involved in cutin and wax biosynthesis, whereas numerous genes that are preferentially expressed in L2/L3 tissue are associated with chloroplastic processes. These data indicate the use of such chimeras and provide detailed information on the level of layer-specific expression of genes. PMID:23725542
Weidner, Christopher; Steinfath, Matthias; Wistorf, Elisa; Oelgeschläger, Michael; Schneider, Marlon R; Schönfelder, Gilbert
2017-08-16
Recent studies that compared transcriptomic datasets of human diseases with datasets from mouse models using traditional gene-to-gene comparison techniques resulted in contradictory conclusions regarding the relevance of animal models for translational research. A major reason for the discrepancies between different gene expression analyses is the arbitrary filtering of differentially expressed genes. Furthermore, the comparison of single genes between different species and platforms often is limited by technical variance, leading to misinterpretation of the con/discordance between data from human and animal models. Thus, standardized approaches for systematic data analysis are needed. To overcome subjective gene filtering and ineffective gene-to-gene comparisons, we recently demonstrated that gene set enrichment analysis (GSEA) has the potential to avoid these problems. Therefore, we developed a standardized protocol for the use of GSEA to distinguish between appropriate and inappropriate animal models for translational research. This protocol is not suitable to predict how to design new model systems a-priori, as it requires existing experimental omics data. However, the protocol describes how to interpret existing data in a standardized manner in order to select the most suitable animal model, thus avoiding unnecessary animal experiments and misleading translational studies.
Saeed, Mohammad
2017-05-01
Systemic lupus erythematosus (SLE) is a complex disorder. Genetic association studies of complex disorders suffer from the following three major issues: phenotypic heterogeneity, false positive (type I error), and false negative (type II error) results. Hence, genes with low to moderate effects are missed in standard analyses, especially after statistical corrections. OASIS is a novel linkage disequilibrium clustering algorithm that can potentially address false positives and negatives in genome-wide association studies (GWAS) of complex disorders such as SLE. OASIS was applied to two SLE dbGAP GWAS datasets (6077 subjects; ∼0.75 million single-nucleotide polymorphisms). OASIS identified three known SLE genes viz. IFIH1, TNIP1, and CD44, not previously reported using these GWAS datasets. In addition, 22 novel loci for SLE were identified and the 5 SLE genes previously reported using these datasets were verified. OASIS methodology was validated using single-variant replication and gene-based analysis with GATES. This led to the verification of 60% of OASIS loci. New SLE genes that OASIS identified and were further verified include TNFAIP6, DNAJB3, TTF1, GRIN2B, MON2, LATS2, SNX6, RBFOX1, NCOA3, and CHAF1B. This study presents the OASIS algorithm, software, and the meta-analyses of two publicly available SLE GWAS datasets along with the novel SLE genes. Hence, OASIS is a novel linkage disequilibrium clustering method that can be universally applied to existing GWAS datasets for the identification of new genes.
Liu, Dongming; Tang, Jun; Liu, Zezhou; Dong, Xin; Zhuang, Mu; Zhang, Yangyong; Lv, Honghao; Sun, Peitian; Liu, Yumei; Li, Zhansheng; Ye, Zhibiao; Fang, Zhiyuan; Yang, Limei
2017-11-28
The aerial parts of most land plants are covered with cuticular wax which is important for plants to avoid harmful factors. There is still no cloning study about wax synthesis gene of the alcohol-forming pathway in Brassica species. Scanning electron microscopy (SEM) showed that, compared with wild type (WT), wax crystal are severely reduced in both the adaxial and abaxial sides of cabbage (Brassica oleracea L. var. capitata L.) leaves from the LD10GL mutant. Genetic analysis results revealed that the glossy trait of LD10GL is controlled by a single recessive gene, and fine mapping results revealed that the target gene Cgl2 (Cabbage glossy 2) is located within a physical region of 170 kb on chromosome 1. Based on sequence analysis of the genes in the mapped region, the gene designated Bol013612 was speculated to be the candidate gene. Gene Bol013612 is homologous to Arabidopsis CER4, which encodes fatty acyl-coenzyme A reductase. Sequencing identified a single nucleotide substitution at an intron/exon boundary that results in an insertion of six nucleotides in the cDNA of Bol013612 in LD10GL. The phenotypic defect of LD10GL was confirmed by a functional complementation test with Arabidopsis mutant cer4. Our results indicated that wax crystals of cabbage mutant LD10GL are severely reduced and mutation of gene Bol013612 causes a glossy phenotype in the LD10GL mutant.
Nicholas, B; Rudrasingham, V; Nash, S; Kirov, G; Owen, M J; Wimpory, D C
2007-06-01
Clock gene anomalies have been suggested as causative factors in autism. We screened eleven clock/clock-related genes in a predominantly high-functioning Autism Genetic Resource Exchange sample of strictly diagnosed autistic disorder progeny and their parents (110 trios) for association of clock gene variants with autistic disorder. We found significant association (P<0.05) for two single-nucleotide polymorphisms in per1 and two in npas2. Analysis of all possible combinations of two-marker haplotypes for each gene showed that in npas2 40 out of the 136 possible two-marker combinations were significant at the P<0.05 level, with the best result between markers rs1811399 and rs2117714, P=0.001. Haplotype analysis within per1 gave a single significant result: a global P=0.027 for the markers rs2253820-rs885747. No two-marker haplotype was significant in any of the other genes, despite the large number of tests performed. Our findings support the hypothesis that these epistatic clock genes may be involved in the etiology of autistic disorder. Problems in sleep, memory and timing are all characteristics of autistic disorder and aspects of sleep, memory and timing are each clock-gene-regulated in other species. We identify how our findings may be relevant to theories of autism that focus on the amygdala, cerebellum, memory and temporal deficits. We outline possible implications of these findings for developmental models of autism involving temporal synchrony/social timing.
Kerr, Jonathan R; Kaushik, Narendra; Fear, David; Baldwin, Don A; Nuwaysir, Emile F; Adcock, Ian M
2005-07-15
This study was undertaken to further examine the role of the host response to parvovirus B19 in the development of symptoms and consequences of viral persistence. Genomic DNA from 42 patients with symptomatic B19 infection was analyzed using the HuSNP assay (Affymetrix), and the results were compared with those from analysis of 53 healthy control individuals. Fifty-seven single-nucleotide polymorphisms were identified that were significantly associated with symptomatic infection. Total RNA from peripheral blood mononuclear cells from 57 B19-seropositive and 13 B19-seronegative donors was analyzed by hybridization to a single-color microarray representing 9522 human genes. Ninety-two genes were shown to be differentially expressed. Differential expression was confirmed in 6 of 38 genes (SKIP, MACF1, SPAG7, FLOT1, c6orf48, and RASSF5) tested using real-time quantitative polymerase chain reaction in a different group of healthy subjects. Genes identified in both studies play a functional role in the cytoskeleton, integrin signaling, and oncosuppression, themes that have been shown to be important in parvovirus infections.
A strategy to apply quantitative epistasis analysis on developmental traits.
Labocha, Marta K; Yuan, Wang; Aleman-Meza, Boanerges; Zhong, Weiwei
2017-05-15
Genetic interactions are keys to understand complex traits and evolution. Epistasis analysis is an effective method to map genetic interactions. Large-scale quantitative epistasis analysis has been well established for single cells. However, there is a substantial lack of such studies in multicellular organisms and their complex phenotypes such as development. Here we present a method to extend quantitative epistasis analysis to developmental traits. In the nematode Caenorhabditis elegans, we applied RNA interference on mutants to inactivate two genes, used an imaging system to quantitatively measure phenotypes, and developed a set of statistical methods to extract genetic interactions from phenotypic measurement. Using two different C. elegans developmental phenotypes, body length and sex ratio, as examples, we showed that this method could accommodate various metazoan phenotypes with performances comparable to those methods in single cell growth studies. Comparing with qualitative observations, this method of quantitative epistasis enabled detection of new interactions involving subtle phenotypes. For example, several sex-ratio genes were found to interact with brc-1 and brd-1, the orthologs of the human breast cancer genes BRCA1 and BARD1, respectively. We confirmed the brc-1 interactions with the following genes in DNA damage response: C34F6.1, him-3 (ortholog of HORMAD1, HORMAD2), sdc-1, and set-2 (ortholog of SETD1A, SETD1B, KMT2C, KMT2D), validating the effectiveness of our method in detecting genetic interactions. We developed a reliable, high-throughput method for quantitative epistasis analysis of developmental phenotypes.
Meta-analytic framework for liquid association.
Wang, Lin; Liu, Silvia; Ding, Ying; Yuan, Shin-Sheng; Ho, Yen-Yi; Tseng, George C
2017-07-15
Although coexpression analysis via pair-wise expression correlation is popularly used to elucidate gene-gene interactions at the whole-genome scale, many complicated multi-gene regulations require more advanced detection methods. Liquid association (LA) is a powerful tool to detect the dynamic correlation of two gene variables depending on the expression level of a third variable (LA scouting gene). LA detection from single transcriptomic study, however, is often unstable and not generalizable due to cohort bias, biological variation and limited sample size. With the rapid development of microarray and NGS technology, LA analysis combining multiple gene expression studies can provide more accurate and stable results. In this article, we proposed two meta-analytic approaches for LA analysis (MetaLA and MetaMLA) to combine multiple transcriptomic studies. To compensate demanding computing, we also proposed a two-step fast screening algorithm for more efficient genome-wide screening: bootstrap filtering and sign filtering. We applied the methods to five Saccharomyces cerevisiae datasets related to environmental changes. The fast screening algorithm reduced 98% of running time. When compared with single study analysis, MetaLA and MetaMLA provided stronger detection signal and more consistent and stable results. The top triplets are highly enriched in fundamental biological processes related to environmental changes. Our method can help biologists understand underlying regulatory mechanisms under different environmental exposure or disease states. A MetaLA R package, data and code for this article are available at http://tsenglab.biostat.pitt.edu/software.htm. ctseng@pitt.edu. Supplementary data are available at Bioinformatics online. © The Author (2017). Published by Oxford University Press. All rights reserved. For Permissions, please email: journals.permissions@oup.com
Jiang, Rong; French, John E.; Stober, Vandy P.; Kang-Sickel, Juei-Chuan C.; Zou, Fei
2012-01-01
Background: Individual genetic variation that results in differences in systemic response to xenobiotic exposure is not accounted for as a predictor of outcome in current exposure assessment models. Objective: We developed a strategy to investigate individual differences in single-nucleotide polymorphisms (SNPs) as genetic markers associated with naphthyl–keratin adduct (NKA) levels measured in the skin of workers exposed to naphthalene. Methods: The SNP-association analysis was conducted in PLINK using candidate-gene analysis and genome-wide analysis. We identified significant SNP–NKA associations and investigated the potential impact of these SNPs along with personal and workplace factors on NKA levels using a multiple linear regression model and the Pratt index. Results: In candidate-gene analysis, a SNP (rs4852279) located near the CYP26B1 gene contributed to the 2-naphthyl–keratin adduct (2NKA) level. In the multiple linear regression model, the SNP rs4852279, dermal exposure, exposure time, task replacing foam, age, and ethnicity all were significant predictors of 2NKA level. In genome-wide analysis, no single SNP reached genome-wide significance for NKA levels (all p ≥ 1.05 × 10–5). Pathway and network analyses of SNPs associated with NKA levels were predicted to be involved in the regulation of cellular processes and homeostasis. Conclusions: These results provide evidence that a quantitative biomarker can be used as an intermediate phenotype when investigating the association between genetic markers and exposure–dose relationship in a small, well-characterized exposed worker population. PMID:22391508
Yuan, Congying; Wang, Meinan; Skinner, Danniel Z; See, Deven R; Xia, Chongjing; Guo, Xinhong; Chen, Xianming
2018-01-01
Puccinia striiformis f. sp. tritici, the wheat stripe rust pathogen, is a dikaryotic, biotrophic, and macrocyclic fungus. Genetic study of P. striiformis f. sp. tritici virulence was not possible until the recent discovery of Berberis spp. and Mahonia spp. as alternate hosts. To determine inheritance of virulence and map virulence genes, a segregating population of 119 isolates was developed by self-fertilizing P. striiformis f. sp. tritici isolate 08-220 (race PSTv-11) on barberry leaves under controlled greenhouse conditions. The progeny isolates were phenotyped on a set of 29 wheat lines with single genes for race-specific resistance and genotyped with simple sequence repeat (SSR) markers, single nucleotide polymorphism (SNP) markers derived from secreted protein genes, and SNP markers from genotyping-by-sequencing (GBS). Using the GBS technique, 10,163 polymorphic GBS-SNP markers were identified. Clustering and principal component analysis grouped these markers into six genetic groups, and a genetic map, consisting of six linkage groups, was constructed with 805 markers. The six clusters or linkage groups resulting from these analyses indicated a haploid chromosome number of six in P. striiformis f. sp. tritici. Through virulence testing of the progeny isolates, the parental isolate was found to be homozygous for the avirulence loci corresponding to resistance genes Yr5, Yr10, Yr15, Yr24, Yr32, YrSP, YrTr1, Yr45, and Yr53 and homozygous for the virulence locus corresponding to resistance gene Yr41. Segregation was observed for virulence phenotypes in response to the remaining 19 single-gene lines. A single dominant gene or two dominant genes with different nonallelic gene interactions were identified for each of the segregating virulence phenotypes. Of 27 dominant virulence genes identified, 17 were mapped to two chromosomes. Markers tightly linked to some of the virulence loci may facilitate further studies to clone these genes. The virulence genes and their inheritance information are useful for understanding the host-pathogen interactions and for selecting effective resistance genes or gene combinations for developing stripe rust resistant wheat cultivars.
A robust method for RNA extraction and purification from a single adult mouse tendon.
Grinstein, Mor; Dingwall, Heather L; Shah, Rishita R; Capellini, Terence D; Galloway, Jenna L
2018-01-01
Mechanistic understanding of tendon molecular and cellular biology is crucial toward furthering our abilities to design new therapies for tendon and ligament injuries and disease. Recent transcriptomic and epigenomic studies in the field have harnessed the power of mouse genetics to reveal new insights into tendon biology. However, many mouse studies pool tendon tissues or use amplification methods to perform RNA analysis, which can significantly increase the experimental costs and limit the ability to detect changes in expression of low copy transcripts. Single Achilles tendons were harvested from uninjured, contralateral injured, and wild type mice between three and five months of age, and RNA was extracted. RNA Integrity Number (RIN) and concentration were determined, and RT-qPCR gene expression analysis was performed. After testing several RNA extraction approaches on single adult mouse Achilles tendons, we developed a protocol that was successful at obtaining high RIN and sufficient concentrations suitable for RNA analysis. We found that the RNA quality was sensitive to the time between tendon harvest and homogenization, and the RNA quality and concentration was dependent on the duration of homogenization. Using this method, we demonstrate that analysis of Scx gene expression in single mouse tendons reduces the biological variation caused by pooling tendons from multiple mice. We also show successful use of this approach to analyze Sox9 and Col1a2 gene expression changes in injured compared with uninjured control tendons. Our work presents a robust, cost-effective, and straightforward method to extract high quality RNA from a single adult mouse Achilles tendon at sufficient amounts for RT-qPCR as well as RNA-seq. We show this can reduce variation and decrease the overall costs associated with experiments. This approach can also be applied to other skeletal tissues, as well as precious human samples.
Molinaro, Alyssa M; Pearson, Bret J
2016-04-27
The planarian Schmidtea mediterranea is a master regenerator with a large adult stem cell compartment. The lack of transgenic labeling techniques in this animal has hindered the study of lineage progression and has made understanding the mechanisms of tissue regeneration a challenge. However, recent advances in single-cell transcriptomics and analysis methods allow for the discovery of novel cell lineages as differentiation progresses from stem cell to terminally differentiated cell. Here we apply pseudotime analysis and single-cell transcriptomics to identify adult stem cells belonging to specific cellular lineages and identify novel candidate genes for future in vivo lineage studies. We purify 168 single stem and progeny cells from the planarian head, which were subjected to single-cell RNA sequencing (scRNAseq). Pseudotime analysis with Waterfall and gene set enrichment analysis predicts a molecularly distinct neoblast sub-population with neural character (νNeoblasts) as well as a novel alternative lineage. Using the predicted νNeoblast markers, we demonstrate that a novel proliferative stem cell population exists adjacent to the brain. scRNAseq coupled with in silico lineage analysis offers a new approach for studying lineage progression in planarians. The lineages identified here are extracted from a highly heterogeneous dataset with minimal prior knowledge of planarian lineages, demonstrating that lineage purification by transgenic labeling is not a prerequisite for this approach. The identification of the νNeoblast lineage demonstrates the usefulness of the planarian system for computationally predicting cellular lineages in an adult context coupled with in vivo verification.
Calcium Signaling Pathway Genes RUNX2 and CACNA1C Are Associated With Calcific Aortic Valve Disease
Guauque-Olarte, Sandra; Messika-Zeitoun, David; Droit, Arnaud; Lamontagne, Maxime; Tremblay-Marchand, Joël; Lavoie-Charland, Emilie; Gaudreault, Nathalie; Arsenault, Benoit J.; Dubé, Marie-Pierre; Tardif, Jean-Claude; Body, Simon C.; Seidman, Jonathan G.; Boileau, Catherine; Mathieu, Patrick; Pibarot, Philippe; Bossé, Yohan
2016-01-01
Background Calcific aortic valve stenosis (AS) is a life-threatening disease with no medical therapy. The genetic architecture of AS remains elusive. This study combines genome-wide association studies, gene expression, and expression quantitative trait loci mapping in human valve tissues to identify susceptibility genes of AS. Methods and Results A meta-analysis was performed combining the results of 2 genome-wide association studies in 474 and 486 cases from Quebec City (Canada) and Paris (France), respectively. Corresponding controls consisted of 2988 and 1864 individuals with European ancestry from the database of genotypes and phenotypes. mRNA expression levels were evaluated in 9 calcified and 8 normal aortic valves by RNA sequencing. The results were integrated with valve expression quantitative trait loci data obtained from 22 AS patients. Twenty-five single-nucleotide polymorphisms had P<5×10−6 in the genome-wide association studies meta-analysis. The calcium signaling pathway was the top gene set enriched for genes mapped to moderately AS-associated single-nucleotide polymorphisms. Genes in this pathway were found differentially expressed in valves with and without AS. Two single-nucleotide polymorphisms located in RUNX2 (runt-related transcription factor 2), encoding an osteogenic transcription factor, demonstrated some association with AS (genome-wide association studies P=5.33×10−5). The mRNA expression levels of RUNX2 were upregulated in calcified valves and associated with eQTL-SNPs. CACNA1C encoding a subunit of a voltage-dependent calcium channel was upregulated in calcified valves. The eQTL-SNP with the most significant association with AS located in CACNA1C was associated with higher expression of the gene. Conclusions This integrative genomic study confirmed the role of RUNX2 as a potential driver of AS and identified a new AS susceptibility gene, CACNA1C, belonging to the calcium signaling pathway. PMID:26553695
Rinke, Jenny; Schäfer, Vivien; Schmidt, Mathias; Ziermann, Janine; Kohlmann, Alexander; Hochhaus, Andreas; Ernst, Thomas
2013-08-01
We sought to establish a convenient, sensitive next-generation sequencing (NGS) method for genotyping the 26 most commonly mutated leukemia-associated genes in a single work flow and to optimize this method for low amounts of input template DNA. We designed 184 PCR amplicons that cover all of the candidate genes. NGS was performed with genomic DNA (gDNA) from a cohort of 10 individuals with chronic myelomonocytic leukemia. The results were compared with NGS data obtained from sequencing of DNA generated by whole-genome amplification (WGA) of 20 ng template gDNA. Differences between gDNA and WGA samples in variant frequencies were determined for 2 different WGA kits. For gDNA samples, 25 of 26 genes were successfully sequenced with a sensitivity of 5%, which was achieved by a median coverage of 492 reads (range, 308-636 reads) per amplicon. We identified 24 distinct mutations in 11 genes. With WGA samples, we reliably detected all mutations above 5% sensitivity with a median coverage of 506 reads (range, 256-653 reads) per amplicon. With all variants included in the analysis, WGA amplification by the 2 kits tested yielded differences in variant frequencies that ranged from -28.19% to +9.94% [mean (SD) difference, -0.2% (4.08%)] and from -35.03% to +18.67% [mean difference, -0.75% (5.12%)]. Our method permits simultaneous analysis of a wide range of leukemia-associated target genes in a single sequencing run. NGS can be performed after WGA of template DNA for reliable detection of variants without introducing appreciable bias.
Zheng, Zhi; Luo, Yuling; McMaster, Gary K
2006-07-01
Accurate and precise quantification of mRNA in whole blood is made difficult by gene expression changes during blood processing, and by variations and biases introduced by sample preparations. We sought to develop a quantitative whole-blood mRNA assay that eliminates blood purification, RNA isolation, reverse transcription, and target amplification while providing high-quality data in an easy assay format. We performed single- and multiplex gene expression analysis with multiple hybridization probes to capture mRNA directly from blood lysate and used branched DNA to amplify the signal. The 96-well plate singleplex assay uses chemiluminescence detection, and the multiplex assay combines Luminex-encoded beads with fluorescent detection. The single- and multiplex assays could quantitatively measure as few as 6000 and 24,000 mRNA target molecules (0.01 and 0.04 amoles), respectively, in up to 25 microL of whole blood. Both formats had CVs < 10% and dynamic ranges of 3-4 logs. Assay sensitivities allowed quantitative measurement of gene expression in the minority of cells in whole blood. The signals from whole-blood lysate correlated well with signals from purified RNA of the same sample, and absolute mRNA quantification results from the assay were similar to those obtained by quantitative reverse transcription-PCR. Both single- and multiplex assay formats were compatible with common anticoagulants and PAXgene-treated samples; however, PAXgene preparations induced expression of known antiapoptotic genes in whole blood. Both the singleplex and the multiplex branched DNA assays can quantitatively measure mRNA expression directly from small volumes of whole blood. The assay offers an alternative to current technologies that depend on RNA isolation and is amenable to high-throughput gene expression analysis of whole blood.
Four Linked Genes Participate in Controlling Sporulation Efficiency in Budding Yeast
Ben-Ari, Giora; Zenvirth, Drora; Sherman, Amir; David, Lior; Klutstein, Michael; Lavi, Uri; Hillel, Jossi; Simchen, Giora
2006-01-01
Quantitative traits are conditioned by several genetic determinants. Since such genes influence many important complex traits in various organisms, the identification of quantitative trait loci (QTLs) is of major interest, but still encounters serious difficulties. We detected four linked genes within one QTL, which participate in controlling sporulation efficiency in Saccharomyces cerevisiae. Following the identification of single nucleotide polymorphisms by comparing the sequences of 145 genes between the parental strains SK1 and S288c, we analyzed the segregating progeny of the cross between them. Through reciprocal hemizygosity analysis, four genes, RAS2, PMS1, SWS2, and FKH2, located in a region of 60 kilobases on Chromosome 14, were found to be associated with sporulation efficiency. Three of the four “high” sporulation alleles are derived from the “low” sporulating strain. Two of these sporulation-related genes were verified through allele replacements. For RAS2, the causative variation was suggested to be a single nucleotide difference in the upstream region of the gene. This quantitative trait nucleotide accounts for sporulation variability among a set of ten closely related winery yeast strains. Our results provide a detailed view of genetic complexity in one “QTL region” that controls a quantitative trait and reports a single nucleotide polymorphism-trait association in wild strains. Moreover, these findings have implications on QTL identification in higher eukaryotes. PMID:17112318
Fekete, Tibor; Rásó, Erzsébet; Pete, Imre; Tegze, Bálint; Liko, István; Munkácsy, Gyöngyi; Sipos, Norbert; Rigó, János; Györffy, Balázs
2012-07-01
Transcriptomic analysis of global gene expression in ovarian carcinoma can identify dysregulated genes capable to serve as molecular markers for histology subtypes and survival. The aim of our study was to validate previous candidate signatures in an independent setting and to identify single genes capable to serve as biomarkers for ovarian cancer progression. As several datasets are available in the GEO today, we were able to perform a true meta-analysis. First, 829 samples (11 datasets) were downloaded, and the predictive power of 16 previously published gene sets was assessed. Of these, eight were capable to discriminate histology subtypes, and none was capable to predict survival. To overcome the differences in previous studies, we used the 829 samples to identify new predictors. Then, we collected 64 ovarian cancer samples (median relapse-free survival 24.5 months) and performed TaqMan Real Time Polimerase Chain Reaction (RT-PCR) analysis for the best 40 genes associated with histology subtypes and survival. Over 90% of subtype-associated genes were confirmed. Overall survival was effectively predicted by hormone receptors (PGR and ESR2) and by TSPAN8. Relapse-free survival was predicted by MAPT and SNCG. In summary, we successfully validated several gene sets in a meta-analysis in large datasets of ovarian samples. Additionally, several individual genes identified were validated in a clinical cohort. Copyright © 2011 UICC.
Cha, Kihoon; Hwang, Taeho; Oh, Kimin; Yi, Gwan-Su
2015-01-01
It has been reported that several brain diseases can be treated as transnosological manner implicating possible common molecular basis under those diseases. However, molecular level commonality among those brain diseases has been largely unexplored. Gene expression analyses of human brain have been used to find genes associated with brain diseases but most of those studies were restricted either to an individual disease or to a couple of diseases. In addition, identifying significant genes in such brain diseases mostly failed when it used typical methods depending on differentially expressed genes. In this study, we used a correlation-based biclustering approach to find coexpressed gene sets in five neurodegenerative diseases and three psychiatric disorders. By using biclustering analysis, we could efficiently and fairly identified various gene sets expressed specifically in both single and multiple brain diseases. We could find 4,307 gene sets correlatively expressed in multiple brain diseases and 3,409 gene sets exclusively specified in individual brain diseases. The function enrichment analysis of those gene sets showed many new possible functional bases as well as neurological processes that are common or specific for those eight diseases. This study introduces possible common molecular bases for several brain diseases, which open the opportunity to clarify the transnosological perspective assumed in brain diseases. It also showed the advantages of correlation-based biclustering analysis and accompanying function enrichment analysis for gene expression data in this type of investigation.
2015-01-01
Background It has been reported that several brain diseases can be treated as transnosological manner implicating possible common molecular basis under those diseases. However, molecular level commonality among those brain diseases has been largely unexplored. Gene expression analyses of human brain have been used to find genes associated with brain diseases but most of those studies were restricted either to an individual disease or to a couple of diseases. In addition, identifying significant genes in such brain diseases mostly failed when it used typical methods depending on differentially expressed genes. Results In this study, we used a correlation-based biclustering approach to find coexpressed gene sets in five neurodegenerative diseases and three psychiatric disorders. By using biclustering analysis, we could efficiently and fairly identified various gene sets expressed specifically in both single and multiple brain diseases. We could find 4,307 gene sets correlatively expressed in multiple brain diseases and 3,409 gene sets exclusively specified in individual brain diseases. The function enrichment analysis of those gene sets showed many new possible functional bases as well as neurological processes that are common or specific for those eight diseases. Conclusions This study introduces possible common molecular bases for several brain diseases, which open the opportunity to clarify the transnosological perspective assumed in brain diseases. It also showed the advantages of correlation-based biclustering analysis and accompanying function enrichment analysis for gene expression data in this type of investigation. PMID:26043779
USDA-ARS?s Scientific Manuscript database
Scope: Omega-3 PUFAs (n-3 PUFAs) reduce IL-6 gene expression, but their effects on transcription regulatory mechanisms are unknown. We aimed to conduct an integrated analysis with both population and in vitro studies to systematically explore the relationships among n-3 PUFA, DNA methylation, single...
Imaging Transcriptional Regulation of Eukaryotic mRNA Genes: Advances and Outlook.
Yao, Jie
2017-01-06
Regulation of eukaryotic transcription in vivo occurs at distinct stages. Previous research has identified many active or repressive transcription factors (TFs) and core transcription components and studied their functions in vitro and in vivo. Nonetheless, how individual TFs act in concert to regulate mRNA gene expression in a single cell remains poorly understood. Direct observation of TF assembly and disassembly and various biochemical reactions during transcription of a single-copy gene in vivo is the ideal approach to study this problem. Research in this area requires developing novel techniques for single-cell transcription imaging and integrating imaging studies into understanding the molecular biology of transcription. In the past decade, advanced cell imaging has enabled unprecedented capabilities to visualize individual TF molecules, to track single transcription sites, and to detect individual mRNA in fixed and living cells. These studies have raised several novel insights on transcriptional regulation such as the "hit-and-run" model and transcription bursting that could not be obtained by in vitro biochemistry analysis. At this point, the key question is how to achieve deeper understandings or discover novel mechanisms of eukaryotic transcriptional regulation by imaging transcription in single cells. Meanwhile, further technical advancements are likely required for visualizing distinct kinetic steps of transcription on a single-copy gene in vivo. This review article summarizes recent progress in the field and describes the challenges and opportunities ahead. Copyright © 2016 Elsevier Ltd. All rights reserved.
Estimation of gene induction enables a relevance-based ranking of gene sets.
Bartholomé, Kilian; Kreutz, Clemens; Timmer, Jens
2009-07-01
In order to handle and interpret the vast amounts of data produced by microarray experiments, the analysis of sets of genes with a common biological functionality has been shown to be advantageous compared to single gene analyses. Some statistical methods have been proposed to analyse the differential gene expression of gene sets in microarray experiments. However, most of these methods either require threshhold values to be chosen for the analysis, or they need some reference set for the determination of significance. We present a method that estimates the number of differentially expressed genes in a gene set without requiring a threshold value for significance of genes. The method is self-contained (i.e., it does not require a reference set for comparison). In contrast to other methods which are focused on significance, our approach emphasizes the relevance of the regulation of gene sets. The presented method measures the degree of regulation of a gene set and is a useful tool to compare the induction of different gene sets and place the results of microarray experiments into the biological context. An R-package is available.
Trujillo-Esquivel, Elías; Franco, Bernardo; Flores-Martínez, Alberto; Ponce-Noyola, Patricia; Mora-Montes, Héctor M
2016-08-02
Analysis of gene expression is a common research tool to study networks controlling gene expression, the role of genes with unknown function, and environmentally induced responses of organisms. Most of the analytical tools used to analyze gene expression rely on accurate cDNA synthesis and quantification to obtain reproducible and quantifiable results. Thus far, most commercial kits for isolation and purification of cDNA target double-stranded molecules, which do not accurately represent the abundance of transcripts. In the present report, we provide a simple and fast method to purify single-stranded cDNA, exhibiting high purity and yield. This method is based on the treatment with RNase H and RNase A after cDNA synthesis, followed by separation in silica spin-columns and ethanol precipitation. In addition, our method avoids the use of DNase I to eliminate genomic DNA from RNA preparations, which improves cDNA yield. As a case report, our method proved to be useful in the purification of single-stranded cDNA from the pathogenic fungus Sporothrix schenckii.
Use of the Fluidigm C1 platform for RNA sequencing of single mouse pancreatic islet cells.
Xin, Yurong; Kim, Jinrang; Ni, Min; Wei, Yi; Okamoto, Haruka; Lee, Joseph; Adler, Christina; Cavino, Katie; Murphy, Andrew J; Yancopoulos, George D; Lin, Hsin Chieh; Gromada, Jesper
2016-03-22
This study provides an assessment of the Fluidigm C1 platform for RNA sequencing of single mouse pancreatic islet cells. The system combines microfluidic technology and nanoliter-scale reactions. We sequenced 622 cells, allowing identification of 341 islet cells with high-quality gene expression profiles. The cells clustered into populations of α-cells (5%), β-cells (92%), δ-cells (1%), and pancreatic polypeptide cells (2%). We identified cell-type-specific transcription factors and pathways primarily involved in nutrient sensing and oxidation and cell signaling. Unexpectedly, 281 cells had to be removed from the analysis due to low viability, low sequencing quality, or contamination resulting in the detection of more than one islet hormone. Collectively, we provide a resource for identification of high-quality gene expression datasets to help expand insights into genes and pathways characterizing islet cell types. We reveal limitations in the C1 Fluidigm cell capture process resulting in contaminated cells with altered gene expression patterns. This calls for caution when interpreting single-cell transcriptomics data using the C1 Fluidigm system.
Fusagene vectors: a novel strategy for the expression of multiple genes from a single cistron.
Gäken, J; Jiang, J; Daniel, K; van Berkel, E; Hughes, C; Kuiper, M; Darling, D; Tavassoli, M; Galea-Lauri, J; Ford, K; Kemeny, M; Russell, S; Farzaneh, F
2000-12-01
Transduction of cells with multiple genes, allowing their stable and co-ordinated expression, is difficult with the available methodologies. A method has been developed for expression of multiple gene products, as fusion proteins, from a single cistron. The encoded proteins are post-synthetically cleaved and processed into each of their constituent proteins as individual, biologically active factors. Specifically, linkers encoding cleavage sites for the Golgi expressed endoprotease, furin, have been incorporated between in-frame cDNA sequences encoding different secreted or membrane bound proteins. With this strategy we have developed expression vectors encoding multiple proteins (IL-2 and B7.1, IL-4 and B7.1, IL-4 and IL-2, IL-12 p40 and p35, and IL-12 p40, p35 and IL-2 ). Transduction and analysis of over 100 individual clones, derived from murine and human tumour cell lines, demonstrate the efficient expression and biological activity of each of the encoded proteins. Fusagene vectors enable the co-ordinated expression of multiple gene products from a single, monocistronic, expression cassette.
Aukema, Sietse M; Kreuz, Markus; Kohler, Christian W; Rosolowski, Maciej; Hasenclever, Dirk; Hummel, Michael; Küppers, Ralf; Lenze, Dido; Ott, German; Pott, Christiane; Richter, Julia; Rosenwald, Andreas; Szczepanowski, Monika; Schwaenen, Carsten; Stein, Harald; Trautmann, Heiko; Wessendorf, Swen; Trümper, Lorenz; Loeffler, Markus; Spang, Rainer; Kluin, Philip M; Klapper, Wolfram; Siebert, Reiner
2014-04-01
Chromosomal translocations affecting the MYC oncogene are the biological hallmark of Burkitt lymphomas but also occur in a subset of other mature B-cell lymphomas. If accompanied by a chromosomal break targeting the BCL2 and/or BCL6 oncogene these MYC translocation-positive (MYC(+)) lymphomas are called double-hit lymphomas, otherwise the term single-hit lymphomas is applied. In order to characterize the biological features of these MYC(+) lymphomas other than Burkitt lymphoma we explored, after exclusion of molecular Burkitt lymphoma as defined by gene expression profiling, the molecular, pathological and clinical aspects of 80 MYC-translocation-positive lymphomas (31 single-hit, 46 double-hit and 3 MYC(+)-lymphomas with unknown BCL6 status). Comparison of single-hit and double-hit lymphomas revealed no difference in MYC partner (IG/non-IG), genomic complexity, MYC expression or gene expression profile. Double-hit lymphomas more frequently showed a germinal center B-cell-like gene expression profile and had higher IGH and MYC mutation frequencies. Gene expression profiling revealed 130 differentially expressed genes between BCL6(+)/MYC(+) and BCL2(+)/MYC(+) double-hit lymphomas. BCL2(+)/MYC(+) double-hit lymphomas more frequently showed a germinal center B-like gene expression profile. Analysis of all lymphomas according to MYC partner (IG/non-IG) revealed no substantial differences. In this series of lymphomas, in which immunochemotherapy was administered in only a minority of cases, single-hit and double-hit lymphomas had a similar poor outcome in contrast to the outcome of molecular Burkitt lymphoma and lymphomas without the MYC break. Our data suggest that, after excluding molecular Burkitt lymphoma and pediatric cases, MYC(+) lymphomas are biologically quite homogeneous with single-hit and double-hit lymphomas as well as IG-MYC and non-IG-MYC(+) lymphomas sharing various molecular characteristics.
Prabhakaran, Vasudevan; Drevets, Douglas A; Ramajayam, Govindan; Manoj, Josephine J; Anderson, Michael P; Hanas, Jay S; Rajshekhar, Vedantam; Oommen, Anna; Carabin, Hélène
2017-06-01
Neurocysticercosis (NCC), a neglected tropical disease, inflicts substantial health and economic costs on people living in endemic areas such as India. Nevertheless, accurate diagnosis using brain imaging remains poorly accessible and too costly in endemic countries. The goal of this study was to test if blood monocyte gene expression could distinguish patients with NCC-associated epilepsy, from NCC-negative imaging lesion-free patients presenting with idiopathic epilepsy or idiopathic headaches. Patients aged 18 to 51 were recruited from the Department of Neurological Sciences, Christian Medical College and Hospital, Vellore, India, between January 2013 and October 2014. mRNA from CD14+ blood monocytes was isolated from 76 patients with NCC, 10 Recovered NCC (RNCC), 29 idiopathic epilepsy and 17 idiopathic headaches patients. A preliminary microarray analysis was performed on six NCC, six idiopathic epilepsy and four idiopathic headaches patients to identify genes differentially expressed in NCC-associated epilepsy compared with other groups. This analysis identified 1411 upregulated and 733 downregulated genes in patients with NCC compared to Idiopathic Epilepsy. Fifteen genes up-regulated in NCC patients compared with other groups were selected based on possible relevance to NCC, and analyzed by qPCR in all patients' samples. Differential gene expression among patients was assessed using linear regression models. qPCR analysis of 15 selected genes showed generally higher gene expression among NCC patients, followed by RNCC, idiopathic headaches and Idiopathic Epilepsy. Gene expression was also generally higher among NCC patients with single cyst granulomas, followed by mixed lesions and single calcifications. Expression of certain genes in blood monocytes can distinguish patients with NCC-related epilepsy from patients with active Idiopathic Epilepsy and idiopathic headaches. These findings are significant because they may lead to the development of new tools to screen for and monitor NCC patients without brain imaging.
Kanakachari, Mogilicherla; Solanke, Amolkumar U; Prabhakaran, Narayanasamy; Ahmad, Israr; Dhandapani, Gurusamy; Jayabalan, Narayanasamy; Kumar, Polumetla Ananda
2016-02-01
Brinjal/eggplant/aubergine is one of the major solanaceous vegetable crops. Recent availability of genome information greatly facilitates the fundamental research on brinjal. Gene expression patterns during different stages of fruit development can provide clues towards the understanding of its biological functions. Quantitative real-time PCR (qPCR) has become one of the most widely used methods for rapid and accurate quantification of gene expression. However, its success depends on the use of a suitable reference gene for data normalization. For qPCR analysis, a single reference gene is not universally suitable for all experiments. Therefore, reference gene validation is a crucial step. Suitable reference genes for qPCR analysis of brinjal fruit development have not been investigated so far. In this study, we have selected 21 candidate reference genes from the Brinjal (Solanum melongena) Plant Gene Indices database (compbio.dfci.harvard.edu/tgi/plant.html) and studied their expression profiles by qPCR during six different fruit developmental stages (0, 5, 10, 20, 30, and 50 days post anthesis) along with leaf samples of the Pusa Purple Long (PPL) variety. To evaluate the stability of gene expression, geNorm and NormFinder analytical softwares were used. geNorm identified SAND (SAND family protein) and TBP (TATA binding protein) as the best pairs of reference genes in brinjal fruit development. The results showed that for brinjal fruit development, individual or a combination of reference genes should be selected for data normalization. NormFinder identified Expressed gene (expressed sequence) as the best single reference gene in brinjal fruit development. In this study, we have identified and validated for the first time reference genes to provide accurate transcript normalization and quantification at various fruit developmental stages of brinjal which can also be useful for gene expression studies in other Solanaceae plant species.
de Jong, Simone; Vidler, Lewis R; Mokrab, Younes; Collier, David A; Breen, Gerome
2016-08-01
Genome-wide association studies (GWAS) have identified thousands of novel genetic associations for complex genetic disorders, leading to the identification of potential pharmacological targets for novel drug development. In schizophrenia, 108 conservatively defined loci that meet genome-wide significance have been identified and hundreds of additional sub-threshold associations harbour information on the genetic aetiology of the disorder. In the present study, we used gene-set analysis based on the known binding targets of chemical compounds to identify the 'drug pathways' most strongly associated with schizophrenia-associated genes, with the aim of identifying potential drug repositioning opportunities and clues for novel treatment paradigms, especially in multi-target drug development. We compiled 9389 gene sets (2496 with unique gene content) and interrogated gene-based p-values from the PGC2-SCZ analysis. Although no single drug exceeded experiment wide significance (corrected p<0.05), highly ranked gene-sets reaching suggestive significance including the dopamine receptor antagonists metoclopramide and trifluoperazine and the tyrosine kinase inhibitor neratinib. This is a proof of principle analysis showing the potential utility of GWAS data of schizophrenia for the direct identification of candidate drugs and molecules that show polypharmacy. © The Author(s) 2016.
Fu, Glenn K; Wilhelmy, Julie; Stern, David; Fan, H Christina; Fodor, Stephen P A
2014-03-18
We present a new approach for the sensitive detection and accurate quantitation of messenger ribonucleic acid (mRNA) gene transcripts in single cells. First, the entire population of mRNAs is encoded with molecular barcodes during reverse transcription. After amplification of the gene targets of interest, molecular barcodes are counted by sequencing or scored on a simple hybridization detector to reveal the number of molecules in the starting sample. Since absolute quantities are measured, calibration to standards is unnecessary, and many of the relative quantitation challenges such as polymerase chain reaction (PCR) bias are avoided. We apply the method to gene expression analysis of minute sample quantities and demonstrate precise measurements with sensitivity down to sub single-cell levels. The method is an easy, single-tube, end point assay utilizing standard thermal cyclers and PCR reagents. Accurate and precise measurements are obtained without any need for cycle-to-cycle intensity-based real-time monitoring or physical partitioning into multiple reactions (e.g., digital PCR). Further, since all mRNA molecules are encoded with molecular barcodes, amplification can be used to generate more material for multiple measurements and technical replicates can be carried out on limited samples. The method is particularly useful for small sample quantities, such as single-cell experiments. Digital encoding of cellular content preserves true abundance levels and overcomes distortions introduced by amplification.
DU, Zhi-Heng; Liu, Zong-Yue; Bai, Xiu-Juan
2010-06-01
Using single-strand conformation polymorphism (PCR-SSCP) and DNA sequencing, single nucleotide polymorphisms (SNPs) of growth hormone receptor (GHR) gene were detected in an arctic fox population. Correlation analysis between GHR polymorphisms and growth traits were carried out using the appropriate model. Four SNPs, G3A in the 5'UTR, C99T in the first exon, T59C and G65A in the fifth exon were identified on the arctic fox GHR gene. The G3A and C99T polymorphisms of GHR were associated with female fox body weight (Pamp;0.05) and the T59C and G65A polymorphisms of GHR were associated with male fox body weight (Pamp;0.05) and the skin length of the female fox (Pamp;0.01). Therefore, marker assistant selection on body weight and skin length of arctic foxes using these SNPs can be applied to get big and high quality arctic foxes.
Consolandi, Clarissa
2009-01-01
One major goal of genetic research is to understand the role of genetic variation in living systems. In humans, by far the most common type of such variation involves differences in single DNA nucleotides, and is thus termed single nucleotide polymorphism (SNP). The need for improvement in throughput and reliability of traditional techniques makes it necessary to develop new technologies. Thus the past few years have witnessed an extraordinary surge of interest in DNA microarray technology. This new technology offers the first great hope for providing a systematic way to explore the genome. It permits a very rapid analysis of thousands genes for the purpose of gene discovery, sequencing, mapping, expression, and polymorphism detection. We generated a series of analytical tools to address the manufacturing, detection and data analysis components of a microarray experiment. In particular, we set up a universal array approach in combination with a PCR-LDR (polymerase chain reaction-ligation detection reaction) strategy for allele identification in the HLA gene.
Kiefer, Christiane; Koch, Marcus A.
2012-01-01
74 of the currently accepted 111 taxa of the North American genus Boechera (Brassicaceae) were subject to pyhlogenetic reconstruction and network analysis. The dataset comprised 911 accessions for which ITS sequences were analyzed. Phylogenetic analyses yielded largely unresolved trees. Together with the network analysis confirming this result this can be interpreted as an indication for multiple, independent, and rapid diversification events. Network analyses were superimposed with datasets describing i) geographical distribution, ii) taxonomy, iii) reproductive mode, and iv) distribution history based on phylogeographic evidence. Our results provide first direct evidence for enormous reticulate evolution in the entire genus and give further insights into the evolutionary history of this complex genus on a continental scale. In addition two novel single-copy gene markers, orthologues of the Arabidopsis thaliana genes At2g25920 and At3g18900, were analyzed for subsets of taxa and confirmed the findings obtained through the ITS data. PMID:22606266
Spatial transcriptomic survey of human embryonic cerebral cortex by single-cell RNA-seq analysis.
Fan, Xiaoying; Dong, Ji; Zhong, Suijuan; Wei, Yuan; Wu, Qian; Yan, Liying; Yong, Jun; Sun, Le; Wang, Xiaoye; Zhao, Yangyu; Wang, Wei; Yan, Jie; Wang, Xiaoqun; Qiao, Jie; Tang, Fuchou
2018-06-04
The cellular complexity of human brain development has been intensively investigated, although a regional characterization of the entire human cerebral cortex based on single-cell transcriptome analysis has not been reported. Here, we performed RNA-seq on over 4,000 individual cells from 22 brain regions of human mid-gestation embryos. We identified 29 cell sub-clusters, which showed different proportions in each region and the pons showed especially high percentage of astrocytes. Embryonic neurons were not as diverse as adult neurons, although they possessed important features of their destinies in adults. Neuron development was unsynchronized in the cerebral cortex, as dorsal regions appeared to be more mature than ventral regions at this stage. Region-specific genes were comprehensively identified in each neuronal sub-cluster, and a large proportion of these genes were neural disease related. Our results present a systematic landscape of the regionalized gene expression and neuron maturation of the human cerebral cortex.
Elasmobranch qPCR reference genes: a case study of hypoxia preconditioned epaulette sharks
2010-01-01
Background Elasmobranch fishes are an ancient group of vertebrates which have high potential as model species for research into evolutionary physiology and genomics. However, no comparative studies have established suitable reference genes for quantitative PCR (qPCR) in elasmobranchs for any physiological conditions. Oxygen availability has been a major force shaping the physiological evolution of vertebrates, especially fishes. Here we examined the suitability of 9 reference candidates from various functional categories after a single hypoxic insult or after hypoxia preconditioning in epaulette shark (Hemiscyllium ocellatum). Results Epaulette sharks were caught and exposed to hypoxia. Tissues were collected from 10 controls, 10 individuals with single hypoxic insult and 10 individuals with hypoxia preconditioning (8 hypoxic insults, 12 hours apart). We produced sequence information for reference gene candidates and monitored mRNA expression levels in four tissues: cerebellum, heart, gill and eye. The stability of the genes was examined with analysis of variance, geNorm and NormFinder. The best ranking genes in our study were eukaryotic translation elongation factor 1 beta (eef1b), ubiquitin (ubq) and polymerase (RNA) II (DNA directed) polypeptide F (polr2f). The performance of the ribosomal protein L6 (rpl6) was tissue-dependent. Notably, in one tissue the analysis of variance indicated statistically significant differences between treatments for genes that were ranked as the most stable candidates by reference gene software. Conclusions Our results indicate that eef1b and ubq are generally the most suitable reference genes for the conditions and tissues in the present epaulette shark studies. These genes could also be potential reference gene candidates for other physiological studies examining stress in elasmobranchs. The results emphasise the importance of inter-group variation in reference gene evaluation. PMID:20416043
Kujoth, Gregory C.; Sullivan, Thomas D.; Merkhofer, Richard; Lee, Taek-Jin; Wang, Huafeng; Brandhorst, Tristan; Wüthrich, Marcel
2018-01-01
ABSTRACT Blastomyces dermatitidis is a human fungal pathogen of the lung that can lead to disseminated disease in healthy and immunocompromised individuals. Genetic analysis of this fungus is hampered by the relative inefficiency of traditional recombination-based gene-targeting approaches. Here, we demonstrate the feasibility of applying CRISPR/Cas9-mediated gene editing to Blastomyces, including to simultaneously target multiple genes. We created targeting plasmid vectors expressing Cas9 and either one or two single guide RNAs and introduced these plasmids into Blastomyces via Agrobacterium gene transfer. We succeeded in disrupting several fungal genes, including PRA1 and ZRT1, which are involved in scavenging and uptake of zinc from the extracellular environment. Single-gene-targeting efficiencies varied by locus (median, 60% across four loci) but were approximately 100-fold greater than traditional methods of Blastomyces gene disruption. Simultaneous dual-gene targeting proceeded with efficiencies similar to those of single-gene-targeting frequencies for the respective targets. CRISPR/Cas9 disruption of PRA1 or ZRT1 had a variable impact on growth under zinc-limiting conditions, showing reduced growth at early time points in low-passage-number cultures and growth similar to wild-type levels by later passage. Individual impairment of PRA1 or ZRT1 resulted in a reduction of the fungal burden in a mouse model of Blastomyces infection by a factor of ~1 log (range, up to 3 logs), and combined disruption of both genes had no additional impact on the fungal burden. These results underscore the utility of CRISPR/Cas9 for efficient gene disruption in dimorphic fungi and reveal a role for zinc metabolism in Blastomyces fitness in vivo. PMID:29615501
Márquez, Lidia; Camarena, Beatriz; Hernández, Sandra; Lóyzaga, Cristina; Vargas, Luis; Nicolini, Humberto
2013-11-01
Obsessive-compulsive disorder (OCD) is a psychiatric disorder whose etiology is not yet known. We investigate the role of three variants of the BDNF gene (rs6265, rs1519480 and rs7124442) by single SNP and haplotype analysis in OCD Mexican patients using a case-control and family-based association design. BDNF gene variants were genotyped in 283 control subjects, 232 OCD patients and first degree relatives of 111 OCD subjects. Single SNP analysis in case-control study showed an association between rs6265 and OCD with a high frequency of Val/Val genotype and Val allele (p=0.0001 and p=0.0001, respectively). Also, genotype and allele analysis of rs1519480 showed significant differences (p=0.0001, p=0.0001; respectively) between OCD and control groups. Haplotype analysis showed a high frequency of A-T (rs6265-rs1519480) in OCD patients compared with the control group (OR=2.06 [1.18-3.59], p=0.0093) and a low frequency of haplotype A-C in the OCD patients (OR=0.04 [0.01-0.16], p=0.000002). The family-based association study showed no significant differences in the transmission of any variant. Our study replicated the association between BDNF Val66Met gene polymorphism and OCD. Also, we found a significant association of rs1519480 in OCD patients compared with a control group, region that has never been analyzed in OCD. In conclusion, our findings suggest that BDNF gene could be related to the development of OCD. © 2013 Elsevier B.V. and ECNP. All rights reserved.
Naaijen, J; Bralten, J; Poelmans, G; Faraone, Stephen; Asherson, Philip; Banaschewski, Tobias; Buitelaar, Jan; Franke, Barbara; P Ebstein, Richard; Gill, Michael; Miranda, Ana; D Oades, Robert; Roeyers, Herbert; Rothenberger, Aribert; Sergeant, Joseph; Sonuga-Barke, Edmund; Anney, Richard; Mulas, Fernando; Steinhausen, Hans-Christoph; Glennon, J C; Franke, B; Buitelaar, J K
2017-01-01
Attention-deficit/hyperactivity disorder (ADHD) and autism spectrum disorders (ASD) often co-occur. Both are highly heritable; however, it has been difficult to discover genetic risk variants. Glutamate and GABA are main excitatory and inhibitory neurotransmitters in the brain; their balance is essential for proper brain development and functioning. In this study we investigated the role of glutamate and GABA genetics in ADHD severity, autism symptom severity and inhibitory performance, based on gene set analysis, an approach to investigate multiple genetic variants simultaneously. Common variants within glutamatergic and GABAergic genes were investigated using the MAGMA software in an ADHD case-only sample (n=931), in which we assessed ASD symptoms and response inhibition on a Stop task. Gene set analysis for ADHD symptom severity, divided into inattention and hyperactivity/impulsivity symptoms, autism symptom severity and inhibition were performed using principal component regression analyses. Subsequently, gene-wide association analyses were performed. The glutamate gene set showed an association with severity of hyperactivity/impulsivity (P=0.009), which was robust to correcting for genome-wide association levels. The GABA gene set showed nominally significant association with inhibition (P=0.04), but this did not survive correction for multiple comparisons. None of single gene or single variant associations was significant on their own. By analyzing multiple genetic variants within candidate gene sets together, we were able to find genetic associations supporting the involvement of excitatory and inhibitory neurotransmitter systems in ADHD and ASD symptom severity in ADHD. PMID:28072412
Implication of common and disease specific variants in CLU, CR1, and PICALM.
Ferrari, Raffaele; Moreno, Jorge H; Minhajuddin, Abu T; O'Bryant, Sid E; Reisch, Joan S; Barber, Robert C; Momeni, Parastoo
2012-08-01
Two recent genome-wide association studies (GWAS) for late onset Alzheimer's disease (LOAD) revealed 3 new genes: clusterin (CLU), phosphatidylinositol binding clathrin assembly protein (PICALM), and complement receptor 1 (CR1). In order to evaluate association with these genome-wide association study-identified genes and to isolate the variants contributing to the pathogenesis of LOAD, we genotyped the top single nucleotide polymorphisms (SNPs), rs11136000 (CLU), rs3818361 (CR1), and rs3851179 (PICALM), and sequenced the entire coding regions of these genes in our cohort of 342 LOAD patients and 277 control subjects. We confirmed the association of rs3851179 (PICALM) (p = 7.4 × 10(-3)) with the disease status. Through sequencing we identified 18 variants in CLU, 3 of which were found exclusively in patients; 8 variants (out of 65) in CR1 gene were only found in patients and the 16 variants identified in PICALM gene were present in both patients and controls. In silico analysis of the variants in PICALM did not predict any damaging effect on the protein. The haplotype analysis of the variants in each gene predicted a common haplotype when the 3 single nucleotide polymorphisms rs11136000 (CLU), rs3818361 (CR1), and rs3851179 (PICALM), respectively, were included. For each gene the haplotype structure and size differed between patients and controls. In conclusion, we confirmed association of CLU, CR1, and PICALM genes with the disease status in our cohort through identification of a number of disease-specific variants among patients through the sequencing of the coding region of these genes. Published by Elsevier Inc.
Application of advanced cytometric and molecular technologies to minimal residual disease monitoring
NASA Astrophysics Data System (ADS)
Leary, James F.; He, Feng; Reece, Lisa M.
2000-04-01
Minimal residual disease monitoring presents a number of theoretical and practical challenges. Recently it has been possible to meet some of these challenges by combining a number of new advanced biotechnologies. To monitor the number of residual tumor cells requires complex cocktails of molecular probes that collectively provide sensitivities of detection on the order of one residual tumor cell per million total cells. Ultra-high-speed, multi parameter flow cytometry is capable of analyzing cells at rates in excess of 100,000 cells/sec. Residual tumor selection marker cocktails can be optimized by use of receiver operating characteristic analysis. New data minimizing techniques when combined with multi variate statistical or neural network classifications of tumor cells can more accurately predict residual tumor cell frequencies. The combination of these techniques can, under at least some circumstances, detect frequencies of tumor cells as low as one cell in a million with an accuracy of over 98 percent correct classification. Detection of mutations in tumor suppressor genes requires insolation of these rare tumor cells and single-cell DNA sequencing. Rare residual tumor cells can be isolated at single cell level by high-resolution single-cell cell sorting. Molecular characterization of tumor suppressor gene mutations can be accomplished using a combination of single- cell polymerase chain reaction amplification of specific gene sequences followed by TA cloning techniques and DNA sequencing. Mutations as small as a single base pair in a tumor suppressor gene of a single sorted tumor cell have been detected using these methods. Using new amplification procedures and DNA micro arrays it should be possible to extend the capabilities shown in this paper to screening of multiple DNA mutations in tumor suppressor and other genes on small numbers of sorted metastatic tumor cells.
Yu, T W; Bibb, M J; Revill, W P; Hopwood, D A
1994-01-01
A fragment of DNA was cloned from the Streptomyces griseus K-63 genome by using genes (act) for the actinorhodin polyketide synthase (PKS) of Streptomyces coelicolor as a probe. Sequencing of a 5.4-kb segment of the cloned DNA revealed a set of five gris open reading frames (ORFs), corresponding to the act PKS genes, in the following order: ORF1 for a ketosynthase, ORF2 for a chain length-determining factor, ORF3 for an acyl carrier protein, ORF5 for a ketoreductase, and ORF4 for a cyclase-dehydrase. Replacement of the gris genes with a marker gene in the S. griseus genome by using a single-stranded suicide vector propagated in Escherichia coli resulted in loss of the ability to produce griseusins A and B, showing that the five gris genes do indeed encode the type II griseusin PKS. These genes, encoding a PKS that is programmed differently from those for other aromatic PKSs so far available, will provide further valuable material for analysis of the programming mechanism by the construction and analysis of strains carrying hybrid PKS. Images PMID:8169211
Lessons from single-cell transcriptome analysis of oxygen-sensing cells.
Zhou, Ting; Matsunami, Hiroaki
2018-05-01
The advent of single-cell RNA-sequencing (RNA-Seq) technology has enabled transcriptome profiling of individual cells. Comprehensive gene expression analysis at the single-cell level has proven to be effective in characterizing the most fundamental aspects of cellular function and identity. This unbiased approach is revolutionary for small and/or heterogeneous tissues like oxygen-sensing cells in identifying key molecules. Here, we review the major methods of current single-cell RNA-Seq technology. We discuss how this technology has advanced the understanding of oxygen-sensing glomus cells in the carotid body and helped uncover novel oxygen-sensing cells and mechanisms in the mice olfactory system. We conclude by providing our perspective on future single-cell RNA-Seq research directed at oxygen-sensing cells.
Different approaches in the molecular analysis of the SHOX gene dysfunctions.
Stuppia, L; Gatta, V; Antonucci, I; Giuliani, R; Palka, G
2010-06-01
Deficit of the short stature homeobox containing gene (SHOX) accounts for 2.15% of cases of idiopathic short stature (ISS) and 50-100% of cases of Leri-Weill dyschondrosteosis (LWD). It has been demonstrated that patients with SHOX deficit show a good response to treatment with GH. Thus, the early identification of SHOX alterations is a crucial point in order to choose the best treatment for ISS and LWD patients. In this study, we analyze the most commonly used molecular techniques for the detection of SHOX gene alterations. multiple ligation-dependent probe amplification analysis appears to represent the gold standard for the detection of deletion involving the SHOX gene or the enhancer region, being able to show both alterations in a single assay.
Jia, Cheng; Hu, Yu; Kelly, Derek; Kim, Junhyong; Li, Mingyao; Zhang, Nancy R
2017-11-02
Recent technological breakthroughs have made it possible to measure RNA expression at the single-cell level, thus paving the way for exploring expression heterogeneity among individual cells. Current single-cell RNA sequencing (scRNA-seq) protocols are complex and introduce technical biases that vary across cells, which can bias downstream analysis without proper adjustment. To account for cell-to-cell technical differences, we propose a statistical framework, TASC (Toolkit for Analysis of Single Cell RNA-seq), an empirical Bayes approach to reliably model the cell-specific dropout rates and amplification bias by use of external RNA spike-ins. TASC incorporates the technical parameters, which reflect cell-to-cell batch effects, into a hierarchical mixture model to estimate the biological variance of a gene and detect differentially expressed genes. More importantly, TASC is able to adjust for covariates to further eliminate confounding that may originate from cell size and cell cycle differences. In simulation and real scRNA-seq data, TASC achieves accurate Type I error control and displays competitive sensitivity and improved robustness to batch effects in differential expression analysis, compared to existing methods. TASC is programmed to be computationally efficient, taking advantage of multi-threaded parallelization. We believe that TASC will provide a robust platform for researchers to leverage the power of scRNA-seq. © The Author(s) 2017. Published by Oxford University Press on behalf of Nucleic Acids Research.
Jia, Cheng; Hu, Yu; Kelly, Derek; Kim, Junhyong
2017-01-01
Abstract Recent technological breakthroughs have made it possible to measure RNA expression at the single-cell level, thus paving the way for exploring expression heterogeneity among individual cells. Current single-cell RNA sequencing (scRNA-seq) protocols are complex and introduce technical biases that vary across cells, which can bias downstream analysis without proper adjustment. To account for cell-to-cell technical differences, we propose a statistical framework, TASC (Toolkit for Analysis of Single Cell RNA-seq), an empirical Bayes approach to reliably model the cell-specific dropout rates and amplification bias by use of external RNA spike-ins. TASC incorporates the technical parameters, which reflect cell-to-cell batch effects, into a hierarchical mixture model to estimate the biological variance of a gene and detect differentially expressed genes. More importantly, TASC is able to adjust for covariates to further eliminate confounding that may originate from cell size and cell cycle differences. In simulation and real scRNA-seq data, TASC achieves accurate Type I error control and displays competitive sensitivity and improved robustness to batch effects in differential expression analysis, compared to existing methods. TASC is programmed to be computationally efficient, taking advantage of multi-threaded parallelization. We believe that TASC will provide a robust platform for researchers to leverage the power of scRNA-seq. PMID:29036714
Adhikari, Kiran; Otaki, Joji M
2016-02-01
It is often desirable but difficult to retrieve information on the mature phenotype of an immature tissue sample that has been subjected to gene expression analysis. This problem cannot be ignored when individual variation within a species is large. To circumvent this problem in the butterfly wing system, we developed a new surgical method for removing a single forewing from a pupa using Junonia orithya; the operated pupa was left to develop to an adult without eclosion. The removed right forewing was subjected to gene expression analysis, whereas the non-removed left forewing was examined for color patterns. As a test case, we focused on Distal-less (Dll), which likely plays an active role in inducing elemental patterns, including eyespots. The Dll expression level in forewings was paired with eyespot size data from the same individual. One third of the operated pupae survived and developed wing color patterns. Dll expression levels were significantly higher in males than in females, although male eyespots were smaller in size than female eyespots. Eyespot size data showed weak but significant correlations with the Dll expression level in females. These results demonstrate that a single-wing removal method was successfully applied to the butterfly wing system and suggest the weak and non-exclusive contribution of Dll to eyespot size determination in this butterfly. Our novel methodology for establishing correspondence between gene expression and phenotype can be applied to other candidate genes for color pattern development in butterflies. Conceptually similar methods may also be applicable in other developmental systems.
Polymorphism in and localization of the gene LCP2 (SLP-76) to chromosome 5q33.1-qter
DOE Office of Scientific and Technical Information (OSTI.GOV)
Sunden, S.L.F.; Carr, L.L.; Clements, J.L.
This report describes the localization of the human LCP2 gene to human chromosome 5q33.1-qter using single-stranded conformation polymorphisms analysis. This gene encodes an SH2 domain containing leukocyte protein of 76 kDa (SLP-76), which plays a functional role in T-cell activation. It remains to be determined whether mutations in this gene or translocations at this chromosome location are the genetic basis for various diseases, including lymphoblastic leukemia. 12 refs., 1 fig.
Schmale, H; Ivell, R; Breindl, M; Darmer, D; Richter, D
1984-01-01
The vasopressin gene from normal and diabetes insipidus (Brattleboro) rats has been isolated and sequenced. Except for a single deletion of a G residue in region coding for the neurophysin carrier protein the approximately 2300 nucleotides of both genes are identical. Blot analysis of hypothalamic RNA as well as transfection and microinjection experiments indicate that the mutant gene is correctly transcribed and spliced, however the resulting mRNA is not efficiently translated. Images Fig. 2. Fig. 3. PMID:6526016
Woodhouse, Steven; Piterman, Nir; Wintersteiger, Christoph M; Göttgens, Berthold; Fisher, Jasmin
2018-05-25
Reconstruction of executable mechanistic models from single-cell gene expression data represents a powerful approach to understanding developmental and disease processes. New ambitious efforts like the Human Cell Atlas will soon lead to an explosion of data with potential for uncovering and understanding the regulatory networks which underlie the behaviour of all human cells. In order to take advantage of this data, however, there is a need for general-purpose, user-friendly and efficient computational tools that can be readily used by biologists who do not have specialist computer science knowledge. The Single Cell Network Synthesis toolkit (SCNS) is a general-purpose computational tool for the reconstruction and analysis of executable models from single-cell gene expression data. Through a graphical user interface, SCNS takes single-cell qPCR or RNA-sequencing data taken across a time course, and searches for logical rules that drive transitions from early cell states towards late cell states. Because the resulting reconstructed models are executable, they can be used to make predictions about the effect of specific gene perturbations on the generation of specific lineages. SCNS should be of broad interest to the growing number of researchers working in single-cell genomics and will help further facilitate the generation of valuable mechanistic insights into developmental, homeostatic and disease processes.
Lamontagne, Jason; Mell, Joshua C; Bouchard, Michael J
2016-02-01
Globally, a chronic hepatitis B virus (HBV) infection remains the leading cause of primary liver cancer. The mechanisms leading to the development of HBV-associated liver cancer remain incompletely understood. In part, this is because studies have been limited by the lack of effective model systems that are both readily available and mimic the cellular environment of a normal hepatocyte. Additionally, many studies have focused on single, specific factors or pathways that may be affected by HBV, without addressing cell physiology as a whole. Here, we apply RNA-seq technology to investigate transcriptome-wide, HBV-mediated changes in gene expression to identify single factors and pathways as well as networks of genes and pathways that are affected in the context of HBV replication. Importantly, these studies were conducted in an ex vivo model of cultured primary hepatocytes, allowing for the transcriptomic characterization of this model system and an investigation of early HBV-mediated effects in a biologically relevant context. We analyzed differential gene expression within the context of time-mediated gene-expression changes and show that in the context of HBV replication a number of genes and cellular pathways are altered, including those associated with metabolism, cell cycle regulation, and lipid biosynthesis. Multiple analysis pipelines, as well as qRT-PCR and an independent, replicate RNA-seq analysis, were used to identify and confirm differentially expressed genes. HBV-mediated alterations to the transcriptome that we identified likely represent early changes to hepatocytes following an HBV infection, suggesting potential targets for early therapeutic intervention. Overall, these studies have produced a valuable resource that can be used to expand our understanding of the complex network of host-virus interactions and the impact of HBV-mediated changes to normal hepatocyte physiology on viral replication.
DOE Office of Scientific and Technical Information (OSTI.GOV)
Hiort, O.; Huang, Q.; Sinnecker, G.H.G.
Recent studies indicate that mutations in the androgen receptor gene are associated with androgen insensitivity syndromes, a heterogeneous group of related disorders involving defective sexual differentiation in karyotypic males. In this report, the authors address the possibility of rapid mutational analysis of the androgen receptor gene for initial diagnosis, genetic counseling, and molecular subclassification of affected patients and their families. DNA from peripheral blood leukocytes of six patients from five families with various degrees of androgen insensitivity was studied. Exons 2 to 8 of the androgen receptor gene were analyzed using a combination of single strand conformation polymorphism analysis andmore » direct DNA sequencing. Female family members were also studied to identify heterozygote carriers. Point mutations in the AR gene were identified in all six patients, and all mutations caused amino acid substitutions. One patient with incomplete androgen insensitivity was a mosaic for the mutation. Four of the five mothers, as well as a young sister of one patient, were carriers of the mutation present in the affected child. The data show that new mutations may occur in the androgen receptor gene leading to sporadic androgen insensitivity syndrome. Molecular genetic characterization of the variant allele can serve as a primary tool for diagnosis and subsequent therapy, and can provide a basis for distinguishing heterozygous carriers in familial androgen resistance. The identification of carriers is of substantial clinical importance for genetic counseling. 29 refs., 2 figs., 1 tab.« less
Matsumoto, Hirotaka; Kiryu, Hisanori
2016-06-08
Single-cell technologies make it possible to quantify the comprehensive states of individual cells, and have the power to shed light on cellular differentiation in particular. Although several methods have been developed to fully analyze the single-cell expression data, there is still room for improvement in the analysis of differentiation. In this paper, we propose a novel method SCOUP to elucidate differentiation process. Unlike previous dimension reduction-based approaches, SCOUP describes the dynamics of gene expression throughout differentiation directly, including the degree of differentiation of a cell (in pseudo-time) and cell fate. SCOUP is superior to previous methods with respect to pseudo-time estimation, especially for single-cell RNA-seq. SCOUP also successfully estimates cell lineage more accurately than previous method, especially for cells at an early stage of bifurcation. In addition, SCOUP can be applied to various downstream analyses. As an example, we propose a novel correlation calculation method for elucidating regulatory relationships among genes. We apply this method to a single-cell RNA-seq data and detect a candidate of key regulator for differentiation and clusters in a correlation network which are not detected with conventional correlation analysis. We develop a stochastic process-based method SCOUP to analyze single-cell expression data throughout differentiation. SCOUP can estimate pseudo-time and cell lineage more accurately than previous methods. We also propose a novel correlation calculation method based on SCOUP. SCOUP is a promising approach for further single-cell analysis and available at https://github.com/hmatsu1226/SCOUP.
Extraction and analysis of signatures from the Gene Expression Omnibus by the crowd
Wang, Zichen; Monteiro, Caroline D.; Jagodnik, Kathleen M.; Fernandez, Nicolas F.; Gundersen, Gregory W.; Rouillard, Andrew D.; Jenkins, Sherry L.; Feldmann, Axel S.; Hu, Kevin S.; McDermott, Michael G.; Duan, Qiaonan; Clark, Neil R.; Jones, Matthew R.; Kou, Yan; Goff, Troy; Woodland, Holly; Amaral, Fabio M R.; Szeto, Gregory L.; Fuchs, Oliver; Schüssler-Fiorenza Rose, Sophia M.; Sharma, Shvetank; Schwartz, Uwe; Bausela, Xabier Bengoetxea; Szymkiewicz, Maciej; Maroulis, Vasileios; Salykin, Anton; Barra, Carolina M.; Kruth, Candice D.; Bongio, Nicholas J.; Mathur, Vaibhav; Todoric, Radmila D; Rubin, Udi E.; Malatras, Apostolos; Fulp, Carl T.; Galindo, John A.; Motiejunaite, Ruta; Jüschke, Christoph; Dishuck, Philip C.; Lahl, Katharina; Jafari, Mohieddin; Aibar, Sara; Zaravinos, Apostolos; Steenhuizen, Linda H.; Allison, Lindsey R.; Gamallo, Pablo; de Andres Segura, Fernando; Dae Devlin, Tyler; Pérez-García, Vicente; Ma'ayan, Avi
2016-01-01
Gene expression data are accumulating exponentially in public repositories. Reanalysis and integration of themed collections from these studies may provide new insights, but requires further human curation. Here we report a crowdsourcing project to annotate and reanalyse a large number of gene expression profiles from Gene Expression Omnibus (GEO). Through a massive open online course on Coursera, over 70 participants from over 25 countries identify and annotate 2,460 single-gene perturbation signatures, 839 disease versus normal signatures, and 906 drug perturbation signatures. All these signatures are unique and are manually validated for quality. Global analysis of these signatures confirms known associations and identifies novel associations between genes, diseases and drugs. The manually curated signatures are used as a training set to develop classifiers for extracting similar signatures from the entire GEO repository. We develop a web portal to serve these signatures for query, download and visualization. PMID:27667448
Extraction and analysis of signatures from the Gene Expression Omnibus by the crowd.
Wang, Zichen; Monteiro, Caroline D; Jagodnik, Kathleen M; Fernandez, Nicolas F; Gundersen, Gregory W; Rouillard, Andrew D; Jenkins, Sherry L; Feldmann, Axel S; Hu, Kevin S; McDermott, Michael G; Duan, Qiaonan; Clark, Neil R; Jones, Matthew R; Kou, Yan; Goff, Troy; Woodland, Holly; Amaral, Fabio M R; Szeto, Gregory L; Fuchs, Oliver; Schüssler-Fiorenza Rose, Sophia M; Sharma, Shvetank; Schwartz, Uwe; Bausela, Xabier Bengoetxea; Szymkiewicz, Maciej; Maroulis, Vasileios; Salykin, Anton; Barra, Carolina M; Kruth, Candice D; Bongio, Nicholas J; Mathur, Vaibhav; Todoric, Radmila D; Rubin, Udi E; Malatras, Apostolos; Fulp, Carl T; Galindo, John A; Motiejunaite, Ruta; Jüschke, Christoph; Dishuck, Philip C; Lahl, Katharina; Jafari, Mohieddin; Aibar, Sara; Zaravinos, Apostolos; Steenhuizen, Linda H; Allison, Lindsey R; Gamallo, Pablo; de Andres Segura, Fernando; Dae Devlin, Tyler; Pérez-García, Vicente; Ma'ayan, Avi
2016-09-26
Gene expression data are accumulating exponentially in public repositories. Reanalysis and integration of themed collections from these studies may provide new insights, but requires further human curation. Here we report a crowdsourcing project to annotate and reanalyse a large number of gene expression profiles from Gene Expression Omnibus (GEO). Through a massive open online course on Coursera, over 70 participants from over 25 countries identify and annotate 2,460 single-gene perturbation signatures, 839 disease versus normal signatures, and 906 drug perturbation signatures. All these signatures are unique and are manually validated for quality. Global analysis of these signatures confirms known associations and identifies novel associations between genes, diseases and drugs. The manually curated signatures are used as a training set to develop classifiers for extracting similar signatures from the entire GEO repository. We develop a web portal to serve these signatures for query, download and visualization.
Dwivedi, Bhakti; Kowalski, Jeanne
2018-01-01
While many methods exist for integrating multi-omics data or defining gene sets, there is no one single tool that defines gene sets based on merging of multiple omics data sets. We present shinyGISPA, an open-source application with a user-friendly web-based interface to define genes according to their similarity in several molecular changes that are driving a disease phenotype. This tool was developed to help facilitate the usability of a previously published method, Gene Integrated Set Profile Analysis (GISPA), among researchers with limited computer-programming skills. The GISPA method allows the identification of multiple gene sets that may play a role in the characterization, clinical application, or functional relevance of a disease phenotype. The tool provides an automated workflow that is highly scalable and adaptable to applications that go beyond genomic data merging analysis. It is available at http://shinygispa.winship.emory.edu/shinyGISPA/.
Dwivedi, Bhakti
2018-01-01
While many methods exist for integrating multi-omics data or defining gene sets, there is no one single tool that defines gene sets based on merging of multiple omics data sets. We present shinyGISPA, an open-source application with a user-friendly web-based interface to define genes according to their similarity in several molecular changes that are driving a disease phenotype. This tool was developed to help facilitate the usability of a previously published method, Gene Integrated Set Profile Analysis (GISPA), among researchers with limited computer-programming skills. The GISPA method allows the identification of multiple gene sets that may play a role in the characterization, clinical application, or functional relevance of a disease phenotype. The tool provides an automated workflow that is highly scalable and adaptable to applications that go beyond genomic data merging analysis. It is available at http://shinygispa.winship.emory.edu/shinyGISPA/. PMID:29415010
Identification of essential genes and synthetic lethal gene combinations in Escherichia coli K-12.
Mori, Hirotada; Baba, Tomoya; Yokoyama, Katsushi; Takeuchi, Rikiya; Nomura, Wataru; Makishi, Kazuichi; Otsuka, Yuta; Dose, Hitomi; Wanner, Barry L
2015-01-01
Here we describe the systematic identification of single genes and gene pairs, whose knockout causes lethality in Escherichia coli K-12. During construction of precise single-gene knockout library of E. coli K-12, we identified 328 essential gene candidates for growth in complex (LB) medium. Upon establishment of the Keio single-gene deletion library, we undertook the development of the ASKA single-gene deletion library carrying a different antibiotic resistance. In addition, we developed tools for identification of synthetic lethal gene combinations by systematic construction of double-gene knockout mutants. We introduce these methods herein.
Wang, Shuo; Gao, Li-Zhi
2016-11-01
The complete chloroplast genome sequence of foxtail millet (Setaria italica), an important food and fodder crop in the family Poaceae, is first reported in this study. The genome consists of 1 35 516 bp containing a pair of inverted repeats (IRs) of 21 804 bp separated by a large single-copy (LSC) region and a small single-copy (SSC) region of 79 896 bp and 12 012 bp, respectively. Coding sequences constitute 58.8% of the genome harboring 111 unique genes, 71 of which are protein-coding genes, 4 are rRNA genes, and 36 are tRNA genes. Phylogenetic analysis indicated foxtail millet clustered with Panicum virgatum and Echinochloa crus-galli belonging to the tribe Paniceae of the subfamily Panicoideae. This newly determined chloroplast genome will provide valuable information for the future breeding programs of valuable cereal crops in the family Poaceae.
Silberstein, Lev; Goncalves, Kevin A; Kharchenko, Peter V; Turcotte, Raphael; Kfoury, Youmna; Mercier, Francois; Baryawno, Ninib; Severe, Nicolas; Bachand, Jacqueline; Spencer, Joel A; Papazian, Ani; Lee, Dongjun; Chitteti, Brahmananda Reddy; Srour, Edward F; Hoggatt, Jonathan; Tate, Tiffany; Lo Celso, Cristina; Ono, Noriaki; Nutt, Stephen; Heino, Jyrki; Sipilä, Kalle; Shioda, Toshihiro; Osawa, Masatake; Lin, Charles P; Hu, Guo-Fu; Scadden, David T
2016-10-06
Physiological stem cell function is regulated by secreted factors produced by niche cells. In this study, we describe an unbiased approach based on the differential single-cell gene expression analysis of mesenchymal osteolineage cells close to, and further removed from, hematopoietic stem/progenitor cells (HSPCs) to identify candidate niche factors. Mesenchymal cells displayed distinct molecular profiles based on their relative location. We functionally examined, among the genes that were preferentially expressed in proximal cells, three secreted or cell-surface molecules not previously connected to HSPC biology-the secreted RNase angiogenin, the cytokine IL18, and the adhesion molecule Embigin-and discovered that all of these factors are HSPC quiescence regulators. Therefore, our proximity-based differential single-cell approach reveals molecular heterogeneity within niche cells and can be used to identify novel extrinsic stem/progenitor cell regulators. Similar approaches could also be applied to other stem cell/niche pairs to advance the understanding of microenvironmental regulation of stem cell function. Copyright © 2016 Elsevier Inc. All rights reserved.
Bush, W S; McCauley, J L; DeJager, P L; Dudek, S M; Hafler, D A; Gibson, R A; Matthews, P M; Kappos, L; Naegelin, Y; Polman, C H; Hauser, S L; Oksenberg, J; Haines, J L; Ritchie, M D
2011-07-01
Gene-gene interactions are proposed as an important component of the genetic architecture of complex diseases, and are just beginning to be evaluated in the context of genome-wide association studies (GWAS). In addition to detecting epistasis, a benefit to interaction analysis is that it also increases power to detect weak main effects. We conducted a knowledge-driven interaction analysis of a GWAS of 931 multiple sclerosis (MS) trios to discover gene-gene interactions within established biological contexts. We identify heterogeneous signals, including a gene-gene interaction between CHRM3 (muscarinic cholinergic receptor 3) and MYLK (myosin light-chain kinase) (joint P=0.0002), an interaction between two phospholipase C-β isoforms, PLCβ1 and PLCβ4 (joint P=0.0098), and a modest interaction between ACTN1 (actinin alpha 1) and MYH9 (myosin heavy chain 9) (joint P=0.0326), all localized to calcium-signaled cytoskeletal regulation. Furthermore, we discover a main effect (joint P=5.2E-5) previously unidentified by single-locus analysis within another related gene, SCIN (scinderin), a calcium-binding cytoskeleton regulatory protein. This work illustrates that knowledge-driven interaction analysis of GWAS data is a feasible approach to identify new genetic effects. The results of this study are among the first gene-gene interactions and non-immune susceptibility loci for MS. Further, the implicated genes cluster within inter-related biological mechanisms that suggest a neurodegenerative component to MS.
Apigenin Impacts the Growth of the Gut Microbiota and Alters the Gene Expression of Enterococcus.
Wang, Minqian; Firrman, Jenni; Zhang, Liqing; Arango-Argoty, Gustavo; Tomasula, Peggy; Liu, LinShu; Xiao, Weidong; Yam, Kit
2017-08-03
Apigenin is a major dietary flavonoid with many bioactivities, widely distributed in plants. Apigenin reaches the colon region intact and interacts there with the human gut microbiota, however there is little research on how apigenin affects the gut bacteria. This study investigated the effect of pure apigenin on human gut bacteria, at both the single strain and community levels. The effect of apigenin on the single gut bacteria strains Bacteroides galacturonicus , Bifidobacterium catenulatum , Lactobacillus rhamnosus GG, and Enterococcus caccae , was examined by measuring their anaerobic growth profiles. The effect of apigenin on a gut microbiota community was studied by culturing a fecal inoculum under in vitro conditions simulating the human ascending colon. 16S rRNA gene sequencing and GC-MS analysis quantified changes in the community structure. Single molecule RNA sequencing was used to reveal the response of Enterococcus caccae to apigenin. Enterococcus caccae was effectively inhibited by apigenin when cultured alone, however, the genus Enterococcus was enhanced when tested in a community setting. Single molecule RNA sequencing found that Enterococcus caccae responded to apigenin by up-regulating genes involved in DNA repair, stress response, cell wall synthesis, and protein folding. Taken together, these results demonstrate that apigenin affects both the growth and gene expression of Enterococcus caccae .
Mutation analysis of the Smad3 gene in human osteoarthritis.
Yao, Jun-Yan; Wang, Yan; An, Jing; Mao, Chun-Ming; Hou, Ning; Lv, Ya-Xin; Wang, You-Liang; Cui, Fang; Huang, Min; Yang, Xiao
2003-09-01
Osteoarthritis (OA) is the most common joint disease worldwide. Recent studies have shown that targeted disruption of Smad3 in mouse results in OA. To reveal the possible association between the Smad3 gene mutation and human OA, we employed polymerase chain reaction-single strand conformation polymorphism and sequencing to screen mutations in all nine exons of the Smad3 gene in 32 patients with knee OA and 50 patients with only bone fracture. A missense mutation of the Smad3 gene was found in one patient. The single base mutation located in the linker region of the SMAD3 protein was A --> T change in the position 2 of codon 197 and resulted in an asparagine to isoleucine amino-acid substitution. The expressions of matrix metalloproteinase 2 (MMP-2) and MMP-9 in sera of the patient carrying the mutation were higher than other OA patients and controls. This is the first report showing that the Smad3 gene mutations could be associated with the pathogenesis of human OA.
Identification of the gene for disaggregatase from Methanosarcina mazei.
Osumi, Naoki; Kakehashi, Yoshihiro; Matsumoto, Shiho; Nagaoka, Kazunari; Sakai, Junichi; Miyashita, Kiyotaka; Kimura, Makoto; Asakawa, Susumu
2008-12-01
The gene sequences encoding disaggregatase (Dag), the enzyme responsible for dispersion of cell aggregates of Methanosarcina mazei to single cells, were determined for three strains of M. mazei (S-6(T), LYC and TMA). The dag genes of the three strains were 3234 bp in length and had almost the same sequences with 97% amino acid sequence identities. Dag was predicted to comprise 1077 amino acid residues and to have a molecular mass of 120 kDa containing three repeats of the DNRLRE domain in the C terminus, which is specific to the genus Methanosarcina and may be responsible for structural organization and cell wall function. Recombinant Dag was overexpressed in Escherichia coli and preparations of the expressed protein exhibited enzymatic activity. The RT-PCR analysis showed that dag was transcribed to mRNA in M. mazei LYC and indicated that the gene was expressed in vivo. This is the first time the gene involved in the morphological change of Methanosarcina spp. from aggregate to single cells has been identified.
Qiu, Ying-Hua; Deng, Fei-Yan; Li, Min-Jing; Lei, Shu-Feng
2014-11-01
Type 1 diabetes mellitus is a serious disorder characterized by destruction of pancreatic β-cells, culminating in absolute insulin deficiency. Genetic factors contribute to the susceptibility of type 1 diabetes mellitus. The aim of the present study was to identify more susceptibility genes of type 1 diabetes mellitus. We carried out an initial gene-based genome-wide association study in a total of 4,075 type 1 diabetes mellitus cases and 2,604 controls by using the Gene-based Association Test using Extended Simes procedure. Furthermore, we carried out replication studies, differential expression analysis and functional annotation clustering analysis to support the significance of the identified susceptibility genes. We identified 452 genes associated with type 1 diabetes mellitus, even after adapting the genome-wide threshold for significance (P < 9.05E-04). Among these genes, 171 were newly identified for type 1 diabetes mellitus, which were ignored in single-nucleotide polymorphism-based association analysis and were not previously reported. We found that 53 genes have supportive evidence from replication studies and/or differential expression studies. In particular, seven genes including four non-human leukocyte antigen (HLA) genes (RASIP1, STRN4, BCAR1 and MYL2) are replicated in at least one independent population and also differentially expressed in peripheral blood mononuclear cells or monocytes. Furthermore, the associated genes tend to enrich in immune-related pathways or Gene Ontology project terms. The present results suggest the high power of gene-based association analysis in detecting disease-susceptibility genes. Our findings provide more insights into the genetic basis of type 1 diabetes mellitus.
Complete Genome Sequence and Comparative Analysis of the Fish Pathogen Lactococcus garvieae
Oshima, Kenshiro; Yoshizaki, Mariko; Kawanishi, Michiko; Nakaya, Kohei; Suzuki, Takehito; Miyauchi, Eiji; Ishii, Yasuo; Tanabe, Soichi; Murakami, Masaru; Hattori, Masahira
2011-01-01
Lactococcus garvieae causes fatal haemorrhagic septicaemia in fish such as yellowtail. The comparative analysis of genomes of a virulent strain Lg2 and a non-virulent strain ATCC 49156 of L. garvieae revealed that the two strains shared a high degree of sequence identity, but Lg2 had a 16.5-kb capsule gene cluster that is absent in ATCC 49156. The capsule gene cluster was composed of 15 genes, of which eight genes are highly conserved with those in exopolysaccharide biosynthesis gene cluster often found in Lactococcus lactis strains. Sequence analysis of the capsule gene cluster in the less virulent strain L. garvieae Lg2-S, Lg2-derived strain, showed that two conserved genes were disrupted by a single base pair deletion, respectively. These results strongly suggest that the capsule is crucial for virulence of Lg2. The capsule gene cluster of Lg2 may be a genomic island from several features such as the presence of insertion sequences flanked on both ends, different GC content from the chromosomal average, integration into the locus syntenic to other lactococcal genome sequences, and distribution in human gut microbiomes. The analysis also predicted other potential virulence factors such as haemolysin. The present study provides new insights into understanding of the virulence mechanisms of L. garvieae in fish. PMID:21829716
Single-molecule dilution and multiple displacement amplification for molecular haplotyping.
Paul, Philip; Apgar, Josh
2005-04-01
Separate haploid analysis is frequently required for heterozygous genotyping to resolve phase ambiguity or confirm allelic sequence. We demonstrate a technique of single-molecule dilution followed by multiple strand displacement amplification to haplotype polymorphic alleles. Dilution of DNA to haploid equivalency, or a single molecule, is a simple method for separating di-allelic DNA. Strand displacement amplification is a robust method for non-specific DNA expansion that employs random hexamers and phage polymerase Phi29 for double-stranded DNA displacement and primer extension, resulting in high processivity and exceptional product length. Single-molecule dilution was followed by strand displacement amplification to expand separated alleles to microgram quantities of DNA for more efficient haplotype analysis of heterozygous genes.
Li, Man; Li, Yong; Weeks, Olivia; Mijatovic, Vladan; Teumer, Alexander; Huffman, Jennifer E; Tromp, Gerard; Fuchsberger, Christian; Gorski, Mathias; Lyytikäinen, Leo-Pekka; Nutile, Teresa; Sedaghat, Sanaz; Sorice, Rossella; Tin, Adrienne; Yang, Qiong; Ahluwalia, Tarunveer S; Arking, Dan E; Bihlmeyer, Nathan A; Böger, Carsten A; Carroll, Robert J; Chasman, Daniel I; Cornelis, Marilyn C; Dehghan, Abbas; Faul, Jessica D; Feitosa, Mary F; Gambaro, Giovanni; Gasparini, Paolo; Giulianini, Franco; Heid, Iris; Huang, Jinyan; Imboden, Medea; Jackson, Anne U; Jeff, Janina; Jhun, Min A; Katz, Ronit; Kifley, Annette; Kilpeläinen, Tuomas O; Kumar, Ashish; Laakso, Markku; Li-Gao, Ruifang; Lohman, Kurt; Lu, Yingchang; Mägi, Reedik; Malerba, Giovanni; Mihailov, Evelin; Mohlke, Karen L; Mook-Kanamori, Dennis O; Robino, Antonietta; Ruderfer, Douglas; Salvi, Erika; Schick, Ursula M; Schulz, Christina-Alexandra; Smith, Albert V; Smith, Jennifer A; Traglia, Michela; Yerges-Armstrong, Laura M; Zhao, Wei; Goodarzi, Mark O; Kraja, Aldi T; Liu, Chunyu; Wessel, Jennifer; Boerwinkle, Eric; Borecki, Ingrid B; Bork-Jensen, Jette; Bottinger, Erwin P; Braga, Daniele; Brandslund, Ivan; Brody, Jennifer A; Campbell, Archie; Carey, David J; Christensen, Cramer; Coresh, Josef; Crook, Errol; Curhan, Gary C; Cusi, Daniele; de Boer, Ian H; de Vries, Aiko P J; Denny, Joshua C; Devuyst, Olivier; Dreisbach, Albert W; Endlich, Karlhans; Esko, Tõnu; Franco, Oscar H; Fulop, Tibor; Gerhard, Glenn S; Glümer, Charlotte; Gottesman, Omri; Grarup, Niels; Gudnason, Vilmundur; Hansen, Torben; Harris, Tamara B; Hayward, Caroline; Hocking, Lynne; Hofman, Albert; Hu, Frank B; Husemoen, Lise Lotte N; Jackson, Rebecca D; Jørgensen, Torben; Jørgensen, Marit E; Kähönen, Mika; Kardia, Sharon L R; König, Wolfgang; Kooperberg, Charles; Kriebel, Jennifer; Launer, Lenore J; Lauritzen, Torsten; Lehtimäki, Terho; Levy, Daniel; Linksted, Pamela; Linneberg, Allan; Liu, Yongmei; Loos, Ruth J F; Lupo, Antonio; Meisinger, Christine; Melander, Olle; Metspalu, Andres; Mitchell, Paul; Nauck, Matthias; Nürnberg, Peter; Orho-Melander, Marju; Parsa, Afshin; Pedersen, Oluf; Peters, Annette; Peters, Ulrike; Polasek, Ozren; Porteous, David; Probst-Hensch, Nicole M; Psaty, Bruce M; Qi, Lu; Raitakari, Olli T; Reiner, Alex P; Rettig, Rainer; Ridker, Paul M; Rivadeneira, Fernando; Rossouw, Jacques E; Schmidt, Frank; Siscovick, David; Soranzo, Nicole; Strauch, Konstantin; Toniolo, Daniela; Turner, Stephen T; Uitterlinden, André G; Ulivi, Sheila; Velayutham, Dinesh; Völker, Uwe; Völzke, Henry; Waldenberger, Melanie; Wang, Jie Jin; Weir, David R; Witte, Daniel; Kuivaniemi, Helena; Fox, Caroline S; Franceschini, Nora; Goessling, Wolfram; Köttgen, Anna; Chu, Audrey Y
2017-03-01
Genome-wide association studies have identified >50 common variants associated with kidney function, but these variants do not fully explain the variation in eGFR. We performed a two-stage meta-analysis of associations between genotypes from the Illumina exome array and eGFR on the basis of serum creatinine (eGFRcrea) among participants of European ancestry from the CKDGen Consortium ( n Stage1 : 111,666; n Stage2 : 48,343). In single-variant analyses, we identified single nucleotide polymorphisms at seven new loci associated with eGFRcrea ( PPM1J , EDEM3, ACP1, SPEG, EYA4, CYP1A1 , and ATXN2L ; P Stage1 <3.7×10 -7 ), of which most were common and annotated as nonsynonymous variants. Gene-based analysis identified associations of functional rare variants in three genes with eGFRcrea, including a novel association with the SOS Ras/Rho guanine nucleotide exchange factor 2 gene, SOS2 ( P =5.4×10 -8 by sequence kernel association test). Experimental follow-up in zebrafish embryos revealed changes in glomerular gene expression and renal tubule morphology in the embryonic kidney of acp1- and sos2 -knockdowns. These developmental abnormalities associated with altered blood clearance rate and heightened prevalence of edema. This study expands the number of loci associated with kidney function and identifies novel genes with potential roles in kidney formation. Copyright © 2017 by the American Society of Nephrology.
Chen, Ying; Zhang, Zhijun; Xu, Zhi; Pu, Mengjia; Geng, Leiyu
2015-12-01
To explore the influence of interleukin-1 beta (IL1B) gene polymorphism and childhood maltreatment on antidepressant treatment. Two hundred and four patients with major depressive disorder (MDD) have received treatment with single antidepressant drugs and were followed up for 8 weeks. Hamilton depression scale-17 (HAMD-17) was used to evaluate the severity of depressive symptoms and therapeutic effect. Childhood maltreatment was assessed using Childhood Trauma Questionnaire, a 28-item Short Form (CTQ-SF). Single nucleotide polymorphism (SNP) of the IL1B gene was determined using a SNaPshot method. Correlation of rs16944 gene polymorphism with response to treatment was analyzed using Unphased 3.0.13 software. The main and interactive effects of SNP and childhood maltreatment on the antidepressant treatment were analyzed using Logistic regression analysis. No significant difference of gender, age, year of education, family history, episode time, and antidepressant agents was detected between the remitters and non-remitters. Association analysis has found that the SNP rs16944 in the IL1B AA genotype carriers antidepressant response was poorer (χ2=3.931, P=0.047). No significant difference was detected in the CTQ scores between the two groups. Genetic and environmental interaction analysis has demonstrated a significant correlation between rs16944 AA genotype and childhood maltreatment and poorer response to antidepressant treatment. The SNP rs16944 in the IL1B gene and its interaction with childhood maltreatment may influence the effect of antidepressant treatment for patients with MDD.
Zhang, Qiang; Wang, Tingting; Zhou, Qian; Zhang, Peng; Gong, Yanhai; Gou, Honglei; Xu, Jian; Ma, Bo
2017-01-23
Wider application of single-cell analysis has been limited by the lack of an easy-to-use and low-cost strategy for single-cell isolation that can be directly coupled to single-cell sequencing and single-cell cultivation, especially for small-size microbes. Herein, a facile droplet microfluidic platform was developed to dispense individual microbial cells into conventional standard containers for downstream analysis. Functional parts for cell encapsulation, droplet inspection and sorting, as well as a chip-to-tube capillary interface were integrated on one single chip with simple architecture, and control of the droplet sorting was achieved by a low-cost solenoid microvalve. Using microalgal and yeast cells as models, single-cell isolation success rate of over 90% and single-cell cultivation success rate of 80% were demonstrated. We further showed that the individual cells isolated can be used in high-quality DNA and RNA analyses at both gene-specific and whole-genome levels (i.e. real-time quantitative PCR and genome sequencing). The simplicity and reliability of the method should improve accessibility of single-cell analysis and facilitate its wider application in microbiology researches.
Zhang, Qiang; Wang, Tingting; Zhou, Qian; Zhang, Peng; Gong, Yanhai; Gou, Honglei; Xu, Jian; Ma, Bo
2017-01-01
Wider application of single-cell analysis has been limited by the lack of an easy-to-use and low-cost strategy for single-cell isolation that can be directly coupled to single-cell sequencing and single-cell cultivation, especially for small-size microbes. Herein, a facile droplet microfluidic platform was developed to dispense individual microbial cells into conventional standard containers for downstream analysis. Functional parts for cell encapsulation, droplet inspection and sorting, as well as a chip-to-tube capillary interface were integrated on one single chip with simple architecture, and control of the droplet sorting was achieved by a low-cost solenoid microvalve. Using microalgal and yeast cells as models, single-cell isolation success rate of over 90% and single-cell cultivation success rate of 80% were demonstrated. We further showed that the individual cells isolated can be used in high-quality DNA and RNA analyses at both gene-specific and whole-genome levels (i.e. real-time quantitative PCR and genome sequencing). The simplicity and reliability of the method should improve accessibility of single-cell analysis and facilitate its wider application in microbiology researches. PMID:28112223
Using a periclinal chimera to unravel layer-specific gene expression in plants.
Filippis, Ioannis; Lopez-Cobollo, Rosa; Abbott, James; Butcher, Sarah; Bishop, Gerard J
2013-09-01
Plant organs are made from multiple cell types, and defining the expression level of a gene in any one cell or group of cells from a complex mixture is difficult. Dicotyledonous plants normally have three distinct layers of cells, L1, L2 and L3. Layer L1 is the single layer of cells making up the epidermis, layer L2 the single cell sub-epidermal layer and layer L3 constitutes the rest of the internal cells. Here we show how it is possible to harvest an organ and characterise the level of layer-specific expression by using a periclinal chimera that has its L1 layer from Solanum pennellii and its L2 and L3 layers from Solanum lycopersicum. This is possible by measuring the level of the frequency of species-specific transcripts. RNA-seq analysis enabled the genome-wide assessment of whether a gene is expressed in the L1 or L2/L3 layers. From 13 277 genes that are expressed in both the chimera and the parental lines and with at least one polymorphism between the parental alleles, we identified 382 genes that are preferentially expressed in L1 in contrast to 1159 genes in L2/L3. Gene ontology analysis shows that many genes preferentially expressed in L1 are involved in cutin and wax biosynthesis, whereas numerous genes that are preferentially expressed in L2/L3 tissue are associated with chloroplastic processes. These data indicate the use of such chimeras and provide detailed information on the level of layer-specific expression of genes. © 2013 East Malling Research The Plant Journal © 2013 John Wiley & Sons Ltd.
Rios, Jonathan J; Perelygin, Andrey A; Long, Maureen T; Lear, Teri L; Zharkikh, Andrey A; Brinton, Margo A; Adelson, David L
2007-01-01
Background The mammalian OAS/RNASEL pathway plays an important role in antiviral host defense. A premature stop-codon within the murine Oas1b gene results in the increased susceptibility of mice to a number of flaviviruses, including West Nile virus (WNV). Mutations in either the OAS1 or RNASEL genes may also modulate the outcome of WNV-induced disease or other viral infections in horses. Polymorphisms in the human OAS gene cluster have been previously utilized for case-control analysis of virus-induced disease in humans. No polymorphisms have yet been identified in either the equine OAS1 or RNASEL genes for use in similar case-control studies. Results Genomic sequence for equine OAS1 was obtained from a contig assembly generated from a shotgun subclone library of CHORI-241 BAC 100I10. Specific amplification of regions of the OAS1 gene from 13 horses of various breeds identified 33 single nucleotide polymorphisms (SNP) and two microsatellites. RNASEL cDNA sequences were determined for 8 mammals and utilized in a phylogenetic analysis. The chromosomal location of the RNASEL gene was assigned by FISH to ECA5p17-p16 using two selected CHORI-241 BAC clones. The horse genomic RNASEL sequence was assembled. Specific amplification of regions of the RNASEL gene from 13 horses identified 31 SNPs. Conclusion In this report, two dinucleotide microsatellites and 64 single nucleotide polymorphisms within the equine OAS1 and RNASEL genes were identified. These polymorphisms are the first to be reported for these genes and will facilitate future case-control studies of horse susceptibility to infectious diseases. PMID:17822564
Deng, Hong-Zhu; You, Cong; Xing, Yu; Chen, Kai-Yun; Zou, Xiao-Bing
2016-05-01
Autism spectrum disorder is a group of neurodevelopmental disorders with the higher prevalence in males. Our previous studies have indicated lower progesterone levels in the children with autism spectrum disorder, suggesting involvement of the cytochrome P-450scc gene (CYP11A1) and cytochrome P-45011beta gene (CYP11B1) as candidate genes in autism spectrum disorder. The aim of this study was to investigate the family-based genetic association between single-nucleotide polymorphisms, rs2279357 in the CYP11A1 gene and rs4534 and rs4541 in the CYP11B1 gene and autism spectrum disorder in Chinese children, which were selected according to the location in the coding region and 5' and 3' regions and minor allele frequencies of greater than 0.05 in the Chinese populations. The transmission disequilibrium test and case-control association analyses were performed in 100 Chinese Han autism spectrum disorder family trios. The genotype and allele frequency of the 3 single-nucleotide polymorphisms had no statistical difference between the children with autism spectrum disorder and their parents (P> .05). Transmission disequilibrium test analysis showed transmission disequilibrium of CYP11A1 gene rs2279357 single-nucleotide polymorphisms (χ(2)= 5.038,P< .001). Our findings provide further support for the hypothesis that a susceptibility gene for autism spectrum disorder exists within or near the CYP11A1 gene in the Han Chinese population. © The Author(s) 2015.
Haitsma, Jack J.; Furmli, Suleiman; Masoom, Hussain; Liu, Mingyao; Imai, Yumiko; Slutsky, Arthur S.; Beyene, Joseph; Greenwood, Celia M. T.; dos Santos, Claudia
2012-01-01
Objectives To perform a meta-analysis of gene expression microarray data from animal studies of lung injury, and to identify an injury-specific gene expression signature capable of predicting the development of lung injury in humans. Methods We performed a microarray meta-analysis using 77 microarray chips across six platforms, two species and different animal lung injury models exposed to lung injury with or/and without mechanical ventilation. Individual gene chips were classified and grouped based on the strategy used to induce lung injury. Effect size (change in gene expression) was calculated between non-injurious and injurious conditions comparing two main strategies to pool chips: (1) one-hit and (2) two-hit lung injury models. A random effects model was used to integrate individual effect sizes calculated from each experiment. Classification models were built using the gene expression signatures generated by the meta-analysis to predict the development of lung injury in human lung transplant recipients. Results Two injury-specific lists of differentially expressed genes generated from our meta-analysis of lung injury models were validated using external data sets and prospective data from animal models of ventilator-induced lung injury (VILI). Pathway analysis of gene sets revealed that both new and previously implicated VILI-related pathways are enriched with differentially regulated genes. Classification model based on gene expression signatures identified in animal models of lung injury predicted development of primary graft failure (PGF) in lung transplant recipients with larger than 80% accuracy based upon injury profiles from transplant donors. We also found that better classifier performance can be achieved by using meta-analysis to identify differentially-expressed genes than using single study-based differential analysis. Conclusion Taken together, our data suggests that microarray analysis of gene expression data allows for the detection of “injury" gene predictors that can classify lung injury samples and identify patients at risk for clinically relevant lung injury complications. PMID:23071521
Polonikov, Alexey V.; Ivanov, Vladimir P.; Bogomazov, Alexey D.; Freidin, Maxim B.; Illig, Thomas; Solodilova, Maria A.
2014-01-01
Oxidative stress resulting from an increased amount of reactive oxygen species and an imbalance between oxidants and antioxidants plays an important role in the pathogenesis of asthma. The present study tested the hypothesis that genetic susceptibility to allergic and nonallergic variants of asthma is determined by complex interactions between genes encoding antioxidant defense enzymes (ADE). We carried out a comprehensive analysis of the associations between adult asthma and 46 single nucleotide polymorphisms of 34 ADE genes and 12 other candidate genes of asthma in Russian population using set association analysis and multifactor dimensionality reduction approaches. We found for the first time epistatic interactions between ADE genes underlying asthma susceptibility and the genetic heterogeneity between allergic and nonallergic variants of the disease. We identified GSR (glutathione reductase) and PON2 (paraoxonase 2) as novel candidate genes for asthma susceptibility. We observed gender-specific effects of ADE genes on the risk of asthma. The results of the study demonstrate complexity and diversity of interactions between genes involved in oxidative stress underlying susceptibility to allergic and nonallergic asthma. PMID:24895604
Shirts, Brian H; Salipante, Stephen J; Casadei, Silvia; Ryan, Shawnia; Martin, Judith; Jacobson, Angela; Vlaskin, Tatyana; Koehler, Karen; Livingston, Robert J; King, Mary-Claire; Walsh, Tom; Pritchard, Colin C
2014-10-01
Single-exon inversions have rarely been described in clinical syndromes and are challenging to detect using Sanger sequencing. We report the case of a 40-year-old woman with adenomatous colon polyps too numerous to count and who had a complex inversion spanning the entire exon 10 in APC (the gene encoding for adenomatous polyposis coli), causing exon skipping and resulting in a frameshift and premature protein truncation. In this study, we employed complete APC gene sequencing using high-coverage next-generation sequencing by ColoSeq, analysis with BreakDancer and SLOPE software, and confirmatory transcript analysis. ColoSeq identified a complex small genomic rearrangement consisting of an inversion that results in translational skipping of exon 10 in the APC gene. This mutation would not have been detected by traditional sequencing or gene-dosage methods. We report a case of adenomatous polyposis resulting from a complex single-exon inversion. Our report highlights the benefits of large-scale sequencing methods that capture intronic sequences with high enough depth of coverage-as well as the use of informatics tools-to enable detection of small pathogenic structural rearrangements.
Law, Yee-Song; Gudimella, Ranganath; Song, Beng-Kah; Ratnam, Wickneswari; Harikrishna, Jennifer Ann
2012-01-01
Many of the plant leucine rich repeat receptor-like kinases (LRR-RLKs) have been found to regulate signaling during plant defense processes. In this study, we selected and sequenced an LRR-RLK gene, designated as Oryza rufipogon receptor-like protein kinase 1 (OrufRPK1), located within yield QTL yld1.1 from the wild rice Oryza rufipogon (accession IRGC105491). A 2055 bp coding region and two exons were identified. Southern blotting determined OrufRPK1 to be a single copy gene. Sequence comparison with cultivated rice orthologs (OsI219RPK1, OsI9311RPK1 and OsJNipponRPK1, respectively derived from O. sativa ssp. indica cv. MR219, O. sativa ssp. indica cv. 9311 and O. sativa ssp. japonica cv. Nipponbare) revealed the presence of 12 single nucleotide polymorphisms (SNPs) with five non-synonymous substitutions, and 23 insertion/deletion sites. The biological role of the OrufRPK1 as a defense related LRR-RLK is proposed on the basis of cDNA sequence characterization, domain subfamily classification, structural prediction of extra cellular domains, cluster analysis and comparative gene expression. PMID:22942769
Inducible repression of multiple expansin genes leads to growth suppression during leaf development.
Goh, Hoe-Han; Sloan, Jennifer; Dorca-Fornell, Carmen; Fleming, Andrew
2012-08-01
Expansins are cell wall proteins implicated in the control of plant growth via loosening of the extracellular matrix. They are encoded by a large gene family, and data linked to loss of single gene function to support a role of expansins in leaf growth remain limited. Here, we provide a quantitative growth analysis of transgenics containing an inducible artificial microRNA construct designed to down-regulate the expression of a number of expansin genes that an expression analysis indicated are expressed during the development of Arabidopsis (Arabidopsis thaliana) leaf 6. The results support the hypothesis that expansins are required for leaf growth and show that decreased expansin gene expression leads to a more marked repression of growth during the later stage of leaf development. In addition, a histological analysis of leaves in which expansin gene expression was suppressed indicates that, despite smaller leaves, mean cell size was increased. These data provide functional evidence for a role of expansins in leaf growth, indicate the importance of tissue/organ developmental context for the outcome of altered expansin gene expression, and highlight the separation of the outcome of expansin gene expression at the cellular and organ levels.
Tagle, Analiza Grubanzo; Chuma, Izumi; Tosa, Yukio
2015-04-01
A single gene for resistance, designated Rmg7 (Resistance to Magnaporthe grisea 7), was identified in a tetraploid wheat accession, St24 (Triticum dicoccum, KU120), against Br48, a Triticum isolate of Pyricularia oryzae. Two other wheat accessions, St17 (T. dicoccum, KU112) and St25 (T. dicoccum, KU122), were also resistant against Br48 and showed a similar disease reaction pattern to St24. Crosses between these resistant accessions yielded no susceptible F2 seedlings, suggesting that St24, St17, and St25 carry the same resistance gene. Furthermore, a single avirulence gene corresponding to Rmg7 was detected in a segregation analysis of random F1 progenies between Br48 and MZ5-1-6, an Eleusine isolate virulent to St24 at a higher temperature. This avirulence gene was recognized not only by St24, but also by St17 and St25, thus supporting the preceding results indicating that all three accessions carry Rmg7. This resistance gene may have potential in future wheat breeding programs.
Foroushani, Amir B.K.; Brinkman, Fiona S.L.
2013-01-01
Motivation. Predominant pathway analysis approaches treat pathways as collections of individual genes and consider all pathway members as equally informative. As a result, at times spurious and misleading pathways are inappropriately identified as statistically significant, solely due to components that they share with the more relevant pathways. Results. We introduce the concept of Pathway Gene-Pair Signatures (Pathway-GPS) as pairs of genes that, as a combination, are specific to a single pathway. We devised and implemented a novel approach to pathway analysis, Signature Over-representation Analysis (SIGORA), which focuses on the statistically significant enrichment of Pathway-GPS in a user-specified gene list of interest. In a comparative evaluation of several published datasets, SIGORA outperformed traditional methods by delivering biologically more plausible and relevant results. Availability. An efficient implementation of SIGORA, as an R package with precompiled GPS data for several human and mouse pathway repositories is available for download from http://sigora.googlecode.com/svn/. PMID:24432194
High-Content Analysis of CRISPR-Cas9 Gene-Edited Human Embryonic Stem Cells.
Carlson-Stevermer, Jared; Goedland, Madelyn; Steyer, Benjamin; Movaghar, Arezoo; Lou, Meng; Kohlenberg, Lucille; Prestil, Ryan; Saha, Krishanu
2016-01-12
CRISPR-Cas9 gene editing of human cells and tissues holds much promise to advance medicine and biology, but standard editing methods require weeks to months of reagent preparation and selection where much or all of the initial edited samples are destroyed during analysis. ArrayEdit, a simple approach utilizing surface-modified multiwell plates containing one-pot transcribed single-guide RNAs, separates thousands of edited cell populations for automated, live, high-content imaging and analysis. The approach lowers the time and cost of gene editing and produces edited human embryonic stem cells at high efficiencies. Edited genes can be expressed in both pluripotent stem cells and differentiated cells. This preclinical platform adds important capabilities to observe editing and selection in situ within complex structures generated by human cells, ultimately enabling optical and other molecular perturbations in the editing workflow that could refine the specificity and versatility of gene editing. Copyright © 2016 The Authors. Published by Elsevier Inc. All rights reserved.
Namkung, Junghyun; Nam, Jin-Wu; Park, Taesung
2007-01-01
Many genes with major effects on quantitative traits have been reported to interact with other genes. However, finding a group of interacting genes from thousands of SNPs is challenging. Hence, an efficient and robust algorithm is needed. The genetic algorithm (GA) is useful in searching for the optimal solution from a very large searchable space. In this study, we show that genome-wide interaction analysis using GA and a statistical interaction model can provide a practical method to detect biologically interacting loci. We focus our search on transcriptional regulators by analyzing gene x gene interactions for cancer-related genes. The expression values of three cancer-related genes were selected from the expression data of the Genetic Analysis Workshop 15 Problem 1 data set. We implemented a GA to identify the expression quantitative trait loci that are significantly associated with expression levels of the cancer-related genes. The time complexity of the GA was compared with that of an exhaustive search algorithm. As a result, our GA, which included heuristic methods, such as archive, elitism, and local search, has greatly reduced computational time in a genome-wide search for gene x gene interactions. In general, the GA took one-fifth the computation time of an exhaustive search for the most significant pair of single-nucleotide polymorphisms.
Namkung, Junghyun; Nam, Jin-Wu; Park, Taesung
2007-01-01
Many genes with major effects on quantitative traits have been reported to interact with other genes. However, finding a group of interacting genes from thousands of SNPs is challenging. Hence, an efficient and robust algorithm is needed. The genetic algorithm (GA) is useful in searching for the optimal solution from a very large searchable space. In this study, we show that genome-wide interaction analysis using GA and a statistical interaction model can provide a practical method to detect biologically interacting loci. We focus our search on transcriptional regulators by analyzing gene × gene interactions for cancer-related genes. The expression values of three cancer-related genes were selected from the expression data of the Genetic Analysis Workshop 15 Problem 1 data set. We implemented a GA to identify the expression quantitative trait loci that are significantly associated with expression levels of the cancer-related genes. The time complexity of the GA was compared with that of an exhaustive search algorithm. As a result, our GA, which included heuristic methods, such as archive, elitism, and local search, has greatly reduced computational time in a genome-wide search for gene × gene interactions. In general, the GA took one-fifth the computation time of an exhaustive search for the most significant pair of single-nucleotide polymorphisms. PMID:18466570
Ye, Jun-jie; Ma, Li; Yang, Li-juan; Wang, Jin-huan; Wang, Yue-li; Guo, Hai; Gong, Ning; Nie, Wen-hui; Zhao, Shu-hua
2013-09-01
There are many reports on associations between spermatogenesis and partial azoospermia factor c (AZFc) deletions as well as duplications; however, results are conflicting, possibly due to differences in methodology and ethnic background. The purpose of this study is to investigate the association of AZFc polymorphisms and male infertility in the Yi ethnic population, residents within Yunnan Province, China. A total of 224 infertile patients and 153 fertile subjects were selected in the Yi ethnic population. The study was performed by sequence-tagged site plus/minus (STS+/-) analysis followed by gene dosage and gene copy definition analysis. Y haplotypes of 215 cases and 115 controls were defined by 12 binary markers using single nucleotide polymorphism on Y chromosome (Y-SNP) multiplex assays based on single base primer extension technology. The distribution of Y haplotypes was not significantly different between the case and control groups. The frequencies of both gr/gr (7.6% vs. 8.5%) and b2/b3 (6.3% vs. 8.5%) deletions do not show significant differences. Similarly, single nucleotide variant (SNV) analysis shows no significant difference of gene copy definition between the cases and controls. However, the frequency of partial duplications in the infertile group (4.0%) is significantly higher than that in the control group (0.7%). Further, we found a case with sY1206 deletion which had two CDY1 copies but removed half of DAZ genes. Our results show that male infertility is associated with partial AZFc duplications, but neither gr/gr nor b2/b3 deletions, suggesting that partial AZFc duplications rather than deletions are risk factors for male infertility in Chinese-Yi population.
Nanda, Arun M.; Heyer, Antonia; Krämer, Christina; Grünberger, Alexander; Kohlheyer, Dietrich
2014-01-01
The genome of the Gram-positive soil bacterium Corynebacterium glutamicum ATCC 13032 contains three integrated prophage elements (CGP1 to -3). Recently, it was shown that the large lysogenic prophage CGP3 (∼187 kbp) is excised spontaneously in a small number of cells. In this study, we provide evidence that a spontaneously induced SOS response is partly responsible for the observed spontaneous CGP3 induction. Whereas previous studies focused mainly on the induction of prophages at the population level, we analyzed the spontaneous CGP3 induction at the single-cell level using promoters of phage genes (Pint2 and Plysin) fused to reporter genes encoding fluorescent proteins. Flow-cytometric analysis revealed a spontaneous CGP3 activity in about 0.01 to 0.08% of the cells grown in standard minimal medium, which displayed a significantly reduced viability. A PrecA-eyfp promoter fusion revealed that a small fraction of C. glutamicum cells (∼0.2%) exhibited a spontaneous induction of the SOS response. Correlation of PrecA to the activity of downstream SOS genes (PdivS and PrecN) confirmed a bona fide induction of this stress response rather than stochastic gene expression. Interestingly, the reporter output of PrecA and CGP3 promoter fusions displayed a positive correlation at the single-cell level (ρ = 0.44 to 0.77). Furthermore, analysis of the PrecA-eyfp/Pint2-e2-crimson strain during growth revealed the highest percentage of spontaneous PrecA and Pint2 activity in the early exponential phase, when fast replication occurs. Based on these studies, we postulate that spontaneously occurring DNA damage induces the SOS response, which in turn triggers the induction of lysogenic prophages. PMID:24163339
Wang, Shi-Yuan; Zhang, Qi; Zhang, Xiang; Zhao, Pei-Quan
2016-01-01
To make a comprehensive analysis of the potential pathogenic genes related with Leber congenital amaurosis (LCA) in Chinese. LCA subjects and their families were retrospectively collected from 2013 to 2015. Firstly, whole-exome sequencing was performed in patients who had underwent gene mutation screening with nothing found, and then homozygous sites was selected, candidate sites were annotated, and pathogenic analysis was conducted using softwares including Sorting Tolerant from Intolerant (SIFT), Polyphen-2, Mutation assessor, Condel, and Functional Analysis through Hidden Markov Models (FATHMM). Furthermore, Gene Ontology function and Kyoto Encyclopedia of Genes and Genomes pathway enrichment analyses of pathogenic genes were performed followed by co-segregation analysis using Fisher exact Test. Sanger sequencing was used to validate single-nucleotide variations (SNVs). Expanded verification was performed in the rest patients. Totally 51 LCA families with 53 patients and 24 family members were recruited. A total of 104 SNVs (66 LCA-related genes and 15 co-segregated genes) were submitted for expand verification. The frequencies of homozygous mutation of KRT12 and CYP1A1 were simultaneously observed in 3 families. Enrichment analysis showed that the potential pathogenic genes were mainly enriched in functions related to cell adhesion, biological adhesion, retinoid metabolic process, and eye development biological adhesion. Additionally, WFS1 and STAU2 had the highest homozygous frequencies. LCA is a highly heterogeneous disease. Mutations in KRT12, CYP1A1, WFS1, and STAU2 may be involved in the development of LCA.
Miyakawa, Hiroe; Miyamoto, Toshinobu; Koh, Eitetsu; Tsujimura, Akira; Miyagawa, Yasushi; Saijo, Yasuaki; Namiki, Mikio; Sengoku, Kazuo
2012-01-01
Genetic mechanisms have been implicated as a cause of some cases of male infertility. Recently, 10 novel genes involved in human spermatogenesis, including human SEPTIN12, were identified by expression microarray analysis of human testicular tissue. Septin12 is a member of the septin family of conserved cytoskeletal GTPases that form heteropolymeric filamentous structures in interphase cells. It is expressed specifically in the testis. Therefore, we hypothesized that mutation or polymorphisms of SEPTIN12 participate in male infertility, especially Sertoli cell-only syndrome (SCOS). To investigate whether SEPTIN12 gene defects are associated with azoospermia caused by SCOS, mutational analysis was performed in 100 Japanese patients by direct sequencing of coding regions. Statistical analysis was performed in patients with SCOS and in 140 healthy control men. No mutations were found in SEPTIN12 ; however, 8 coding single-nucleotide polymorphisms (SNP1-SNP8) could be detected in the patients with SCOS. The genotype and allele frequencies in SNP3, SNP4, and SNP6 were notably higher in the SCOS group than in the control group (P < .001). These results suggest that SEPTIN12 might play a critical role in human spermatogenesis.
Tang, Clara S; Zhang, He; Cheung, Chloe Y Y; Xu, Ming; Ho, Jenny C Y; Zhou, Wei; Cherny, Stacey S; Zhang, Yan; Holmen, Oddgeir; Au, Ka-Wing; Yu, Haiyi; Xu, Lin; Jia, Jia; Porsch, Robert M; Sun, Lijie; Xu, Weixian; Zheng, Huiping; Wong, Lai-Yung; Mu, Yiming; Dou, Jingtao; Fong, Carol H Y; Wang, Shuyu; Hong, Xueyu; Dong, Liguang; Liao, Yanhua; Wang, Jiansong; Lam, Levina S M; Su, Xi; Yan, Hua; Yang, Min-Lee; Chen, Jin; Siu, Chung-Wah; Xie, Gaoqiang; Woo, Yu-Cho; Wu, Yangfeng; Tan, Kathryn C B; Hveem, Kristian; Cheung, Bernard M Y; Zöllner, Sebastian; Xu, Aimin; Eugene Chen, Y; Jiang, Chao Qiang; Zhang, Youyi; Lam, Tai-Hing; Ganesh, Santhi K; Huo, Yong; Sham, Pak C; Lam, Karen S L; Willer, Cristen J; Tse, Hung-Fat; Gao, Wei
2015-12-22
Blood lipids are important risk factors for coronary artery disease (CAD). Here we perform an exome-wide association study by genotyping 12,685 Chinese, using a custom Illumina HumanExome BeadChip, to identify additional loci influencing lipid levels. Single-variant association analysis on 65,671 single nucleotide polymorphisms reveals 19 loci associated with lipids at exome-wide significance (P<2.69 × 10(-7)), including three Asian-specific coding variants in known genes (CETP p.Asp459Gly, PCSK9 p.Arg93Cys and LDLR p.Arg257Trp). Furthermore, missense variants at two novel loci-PNPLA3 p.Ile148Met and PKD1L3 p.Thr429Ser-also influence levels of triglycerides and low-density lipoprotein cholesterol, respectively. Another novel gene, TEAD2, is found to be associated with high-density lipoprotein cholesterol through gene-based association analysis. Most of these newly identified coding variants show suggestive association (P<0.05) with CAD. These findings demonstrate that exome-wide genotyping on samples of non-European ancestry can identify additional population-specific possible causal variants, shedding light on novel lipid biology and CAD.
Mohammadi, Faezeh; Hashemi, Seyed Jamal; Zoll, Jan; Melchers, Willem J. G.; Rafati, Haleh; Dehghan, Parvin; Rezaie, Sasan; Tolooe, Ali; Tamadon, Yalda; van der Lee, Henrich A.; Verweij, Paul E.
2015-01-01
We employed an endpoint genotyping method to update the prevalence rate of positivity for the TR34/L98H mutation (a 34-bp tandem repeat mutation in the promoter region of the cyp51A gene in combination with a substitution at codon L98) and the TR46/Y121F/T289A mutation (a 46-bp tandem repeat mutation in the promoter region of the cyp51A gene in combination with substitutions at codons Y121 and T289) among clinical Aspergillus fumigatus isolates obtained from different regions of Iran over a recent 5-year period (2010 to 2014). The antifungal activities of itraconazole, voriconazole, and posaconazole against 172 clinical A. fumigatus isolates were investigated using the European Committee on Antimicrobial Susceptibility Testing (EUCAST) broth microdilution method. For the isolates with an azole resistance phenotype, the cyp51A gene and its promoter were amplified and sequenced. In addition, using a LightCycler 480 real-time PCR system, a novel endpoint genotyping analysis method targeting single-nucleotide polymorphisms was evaluated to detect the L98H and Y121F mutations in the cyp51A gene of all isolates. Of the 172 A. fumigatus isolates tested, the MIC values of itraconazole (≥16 mg/liter) and voriconazole (>4 mg/liter) were high for 6 (3.5%). Quantitative analysis of single-nucleotide polymorphisms showed the TR34/L98H mutation in the cyp51A genes of six isolates. No isolates harboring the TR46/Y121F/T289A mutation were detected. DNA sequencing of the cyp51A gene confirmed the results of the novel endpoint genotyping method. By microsatellite typing, all of the azole-resistant isolates had genotypes different from those previously recovered from Iran and from the Dutch TR34/L98H controls. In conclusion, there was not a significant increase in the prevalence of azole-resistant A. fumigatus isolates harboring the TR34/L98H resistance mechanism among isolates recovered over a recent 5-year period (2010 to 2014) in Iran. A quantitative assay detecting a single-nucleotide polymorphism in the cyp51A gene of A. fumigatus is a reliable tool for the rapid screening and monitoring of TR34/L98H- and TR46/Y121F/T289A-positive isolates and can easily be incorporated into clinical mycology algorithms. PMID:26525787
Single cell gene expression profiling in Alzheimer's disease.
Ginsberg, Stephen D; Che, Shaoli; Counts, Scott E; Mufson, Elliott J
2006-07-01
Development and implementation of microarray techniques to quantify expression levels of dozens to hundreds to thousands of transcripts simultaneously within select tissue samples from normal control subjects and neurodegenerative diseased brains has enabled scientists to create molecular fingerprints of vulnerable neuronal populations in Alzheimer's disease (AD) and related disorders. A goal is to sample gene expression from homogeneous cell types within a defined region without potential contamination by expression profiles of adjacent neuronal subpopulations and nonneuronal cells. The precise resolution afforded by single cell and population cell RNA analysis in combination with microarrays and real-time quantitative polymerase chain reaction (qPCR)-based analyses allows for relative gene expression level comparisons across cell types under different experimental conditions and disease progression. The ability to analyze single cells is an important distinction from global and regional assessments of mRNA expression and can be applied to optimally prepared tissues from animal models of neurodegeneration as well as postmortem human brain tissues. Gene expression analysis in postmortem AD brain regions including the hippocampal formation and neocortex reveals selectively vulnerable cell types share putative pathogenetic alterations in common classes of transcripts, for example, markers of glutamatergic neurotransmission, synaptic-related markers, protein phosphatases and kinases, and neurotrophins/neurotrophin receptors. Expression profiles of vulnerable regions and neurons may reveal important clues toward the understanding of the molecular pathogenesis of various neurological diseases and aid in identifying rational targets toward pharmacotherapeutic interventions for progressive, late-onset neurodegenerative disorders such as mild cognitive impairment (MCI) and AD.
DOE Office of Scientific and Technical Information (OSTI.GOV)
Hadano, S.; Ishida, Y.; Tomiyasu, H.
1994-09-01
To complete a transcription map of the 1 Mb region in human chromosome 4p16.3 containing the Huntington disease (HD) gene, the isolation of cDNA clones are being performed throughout. Our method relies on a direct screening of the cDNA libraries probed with single copy microclones from 3 YAC clones spanning 1 Mbp of the HD gene region. AC-DNAs were isolated by a preparative pulsed-field gel electrophoresis, amplified by both a single unique primer (SUP)-PCR and a linker ligation PCR, and 6 microclone-DNA libraries were generated. Then, 8,640 microclones from these libraries were independently amplified by PCR, and arrayed onto themore » membranes. 800-900 microclones that were not cross-hybridized with total human and yeast genomic DNA, TAC vector DNA, and ribosomal cDNA on a dot hybridization (putatively carrying single copy sequences) were pooled to make 9 probe pools. A total of {approximately}1.8x10{sup 7} plaques from the human brain cDNA libraries was screened with 9 pool-probes, and then 672 positive cDNA clones were obtained. So far, 597 cDNA clones were defined and arrayed onto a map of the 1 Mbp of the HD gene region by hybridization with HD region-specific cosmid contigs and YAC clones. Further characterization including a DNA sequencing and Northern blot analysis is currently underway.« less
General statistics of stochastic process of gene expression in eukaryotic cells.
Kuznetsov, V A; Knott, G D; Bonner, R F
2002-01-01
Thousands of genes are expressed at such very low levels (< or =1 copy per cell) that global gene expression analysis of rarer transcripts remains problematic. Ambiguity in identification of rarer transcripts creates considerable uncertainty in fundamental questions such as the total number of genes expressed in an organism and the biological significance of rarer transcripts. Knowing the distribution of the true number of genes expressed at each level and the corresponding gene expression level probability function (GELPF) could help resolve these uncertainties. We found that all observed large-scale gene expression data sets in yeast, mouse, and human cells follow a Pareto-like distribution model skewed by many low-abundance transcripts. A novel stochastic model of the gene expression process predicts the universality of the GELPF both across different cell types within a multicellular organism and across different organisms. This model allows us to predict the frequency distribution of all gene expression levels within a single cell and to estimate the number of expressed genes in a single cell and in a population of cells. A random "basal" transcription mechanism for protein-coding genes in all or almost all eukaryotic cell types is predicted. This fundamental mechanism might enhance the expression of rarely expressed genes and, thus, provide a basic level of phenotypic diversity, adaptability, and random monoallelic expression in cell populations. PMID:12136033
Behavioral Actions of Alcohol: Phenotypic Relations from Multivariate Analysis of Mutant Mouse Data
Blednov, Yuri A.; Mayfield, R. Dayne; Belknap, John; Harris, R. Adron
2012-01-01
Behavioral studies of genetically diverse mice have proven powerful for determining relationships between phenotypes and have been widely used in alcohol research. Most of these studies rely on naturally occurring genetic polymorphisms among inbred strains and selected lines. Another approach is to introduce variation by engineering single gene mutations in mice. We have tested 37 different mutant mice and their wild type controls for a variety (31) of behaviors and have mined this dataset by K-means clustering and analysis of correlations. We found a correlation between a stress-related response (activity in a novel environment) and alcohol consumption and preference for saccharin. We confirmed several relationships detected in earlier genetic studies including positive correlation of alcohol consumption with saccharin consumption, and negative correlations with conditioned taste aversion and alcohol withdrawal severity. Introduction of single gene mutations either eliminated or greatly diminished these correlations. The three tests of alcohol consumption used (continuous two bottle choice, and two limited access tests: Drinking In the Dark and Sustained High Alcohol Consumption) share a relationship with saccharin consumption, but differ from each other in their correlation networks. We suggest that alcohol consumption is controlled by multiple physiological systems where single gene mutations can disrupt the networks of such systems. PMID:22405477
Single cell Hi-C reveals cell-to-cell variability in chromosome structure
Schoenfelder, Stefan; Yaffe, Eitan; Dean, Wendy; Laue, Ernest D.; Tanay, Amos; Fraser, Peter
2013-01-01
Large-scale chromosome structure and spatial nuclear arrangement have been linked to control of gene expression and DNA replication and repair. Genomic techniques based on chromosome conformation capture assess contacts for millions of loci simultaneously, but do so by averaging chromosome conformations from millions of nuclei. Here we introduce single cell Hi-C, combined with genome-wide statistical analysis and structural modeling of single copy X chromosomes, to show that individual chromosomes maintain domain organisation at the megabase scale, but show variable cell-to-cell chromosome territory structures at larger scales. Despite this structural stochasticity, localisation of active gene domains to boundaries of territories is a hallmark of chromosomal conformation. Single cell Hi-C data bridge current gaps between genomics and microscopy studies of chromosomes, demonstrating how modular organisation underlies dynamic chromosome structure, and how this structure is probabilistically linked with genome activity patterns. PMID:24067610
Roubelakis, Maria G; Zotos, Pantelis; Papachristoudis, Georgios; Michalopoulos, Ioannis; Pappa, Kalliopi I; Anagnou, Nicholas P; Kossida, Sophia
2009-01-01
Background microRNAs (miRNAs) are single-stranded RNA molecules of about 20–23 nucleotides length found in a wide variety of organisms. miRNAs regulate gene expression, by interacting with target mRNAs at specific sites in order to induce cleavage of the message or inhibit translation. Predicting or verifying mRNA targets of specific miRNAs is a difficult process of great importance. Results GOmir is a novel stand-alone application consisting of two separate tools: JTarget and TAGGO. JTarget integrates miRNA target prediction and functional analysis by combining the predicted target genes from TargetScan, miRanda, RNAhybrid and PicTar computational tools as well as the experimentally supported targets from TarBase and also providing a full gene description and functional analysis for each target gene. On the other hand, TAGGO application is designed to automatically group gene ontology annotations, taking advantage of the Gene Ontology (GO), in order to extract the main attributes of sets of proteins. GOmir represents a new tool incorporating two separate Java applications integrated into one stand-alone Java application. Conclusion GOmir (by using up to five different databases) introduces miRNA predicted targets accompanied by (a) full gene description, (b) functional analysis and (c) detailed gene ontology clustering. Additionally, a reverse search initiated by a potential target can also be conducted. GOmir can freely be downloaded BRFAA. PMID:19534746
Roubelakis, Maria G; Zotos, Pantelis; Papachristoudis, Georgios; Michalopoulos, Ioannis; Pappa, Kalliopi I; Anagnou, Nicholas P; Kossida, Sophia
2009-06-16
microRNAs (miRNAs) are single-stranded RNA molecules of about 20-23 nucleotides length found in a wide variety of organisms. miRNAs regulate gene expression, by interacting with target mRNAs at specific sites in order to induce cleavage of the message or inhibit translation. Predicting or verifying mRNA targets of specific miRNAs is a difficult process of great importance. GOmir is a novel stand-alone application consisting of two separate tools: JTarget and TAGGO. JTarget integrates miRNA target prediction and functional analysis by combining the predicted target genes from TargetScan, miRanda, RNAhybrid and PicTar computational tools as well as the experimentally supported targets from TarBase and also providing a full gene description and functional analysis for each target gene. On the other hand, TAGGO application is designed to automatically group gene ontology annotations, taking advantage of the Gene Ontology (GO), in order to extract the main attributes of sets of proteins. GOmir represents a new tool incorporating two separate Java applications integrated into one stand-alone Java application. GOmir (by using up to five different databases) introduces miRNA predicted targets accompanied by (a) full gene description, (b) functional analysis and (c) detailed gene ontology clustering. Additionally, a reverse search initiated by a potential target can also be conducted. GOmir can freely be downloaded BRFAA.
Identification of genes associated with low furanocoumarin content in grapefruit.
Chen, Chunxian; Yu, Qibin; Wei, Xu; Cancalon, Paul F; Gmitter, Fred G
2014-10-01
Some furanocoumarins in grapefruit (Citrus paradisi) are associated with the so-called grapefruit juice effect. Previous phytochemical quantification and genetic analysis suggested that the synthesis of these furanocoumarins may be controlled by a single gene in the pathway. In this study, cDNA-amplified fragment length polymorphism (cDNA-AFLP) analysis of fruit tissues was performed to identify the candidate gene(s) likely associated with low furanocoumarin content in grapefruit. Fifteen tentative differentially expressed fragments were cloned through the cDNA-AFLP analysis of the grapefruit variety Foster and its spontaneous low-furanocoumarin mutant Low Acid Foster. Sequence analysis revealed a cDNA-AFLP fragment, Contig 6, was homologous to a substrate-proved psoralen synthase gene, CYP71A22, and was part of citrus unigenes Cit.3003 and Csi.1332, and predicted genes Ciclev10004717m in mandarin and orange1.1g041507m in sweet orange. The two predicted genes contained the highly conserved motifs at one of the substrate recognition sites of CYP71A22. Digital gene expression profile showed the unigenes were expressed only in fruit and seed. Quantitative real-time PCR also proved Contig 6 was down-regulated in Low Acid Foster. These results showed the differentially expressed Contig 6 was related to the reduced furanocoumarin levels in the mutant. The identified fragment, homologs, unigenes, and genes may facilitate further furanocoumarin genetic study and grapefruit variety improvement.
DOE Office of Scientific and Technical Information (OSTI.GOV)
Riess, O.; Weber, B.; Hayden, M.R.
1992-10-01
The finding of a mutation in the beta subunit of the cyclic GMP (cGMP) phosphodiesterase gene causing retinal degeneration in mice (the Pdeb gene) prompted a search for disease-causing mutations in the human phosphodiesterase gene (PDEB gene) in patients with retinitis pigmentosa. All 22 exons including 196 bp of the 5[prime] region of the PDEB gene have been assessed for mutations by using single-strand conformational polymorphism analysis in 14 patients from 13 unrelated families with autosomal recessive retinitis pigmentosa (ARRP). No disease-causing mutations were found in this group of affected individuals of seven different ancestries. However, a frequent intronic andmore » two exonic polymorphisms (Leu[sup 489][yields]Gln and Gly[sup 842][yields]Gly) were identified. Segregation analysis using these polymorphic sites excludes linkage of ARRP to the PDEB gene in a family with two affected children. 43 refs., 3 figs., 2 tabs.« less
Transcriptome analysis of PCOS arrested 2-cell embryos.
Lu, Cuiling; Chi, Hongbin; Wang, Yapeng; Feng, Xue; Wang, Lina; Huang, Shuo; Yan, Liying; Lin, Shengli; Liu, Ping; Qiao, Jie
2018-06-18
In an attempt to explore the early developmental arrest in embryos from polycystic ovarian syndrome (PCOS) patients, we sequenced the transcriptome profiles of PCOS arrested 2-cell embryos, non-PCOS arrested 2-cell embryos and non-arrested 2-cell embryos using single-cell RNA-Seq technique. Differential expression analysis was performed using the DEGSeq R package. Gene Ontology (GO) enrichment was analyzed using the GOseq R package. Data revealed 62 differentially expressed genes between non-PCOS arrested and PCOS arrested embryos and 2217 differentially expressed genes between PCOS arrested and non-arrested 2-cell embryos. A total of 49 differently expressed genes (DEGs) were annotated with GO terms in the up-regulated genes between PCOS arrested and non-PCOS arrested embryos after GO enrichment. A total of 29 DEGs were annotated with GO terms in the down-regulated genes between PCOS arrested and non-arrested 2-cell embryos after GO enrichment. These data can provide a reference for screening specific genes involved in the arrest of PCOS embryos.
Mácha, Jaroslav; Teichmanová, Radka; Sater, Amy K; Wells, Dan E; Tlapáková, Tereza; Zimmerman, Lyle B; Krylov, Vladimír
2012-07-16
The X and Y sex chromosomes are conspicuous features of placental mammal genomes. Mammalian sex chromosomes arose from an ordinary pair of autosomes after the proto-Y acquired a male-determining gene and degenerated due to suppression of X-Y recombination. Analysis of earlier steps in X chromosome evolution has been hampered by the long interval between the origins of teleost and amniote lineages as well as scarcity of X chromosome orthologs in incomplete avian genome assemblies. This study clarifies the genesis and remodelling of the Eutherian X chromosome by using a combination of sequence analysis, meiotic map information, and cytogenetic localization to compare amniote genome organization with that of the amphibian Xenopus tropicalis. Nearly all orthologs of human X genes localize to X. tropicalis chromosomes 2 and 8, consistent with an ancestral X-conserved region and a single X-added region precursor. This finding contradicts a previous hypothesis of three evolutionary strata in this region. Homologies between human, opossum, chicken and frog chromosomes suggest a single X-added region predecessor in therian mammals, corresponding to opossum chromosomes 4 and 7. A more ancient X-added ancestral region, currently extant as a major part of chicken chromosome 1, is likely to have been present in the progenitor of synapsids and sauropsids. Analysis of X chromosome gene content emphasizes conservation of single protein coding genes and the role of tandem arrays in formation of novel genes. Chromosomal regions orthologous to Therian X chromosomes have been located in the genome of the frog X. tropicalis. These X chromosome ancestral components experienced a series of fusion and breakage events to give rise to avian autosomes and mammalian sex chromosomes. The early branching tetrapod X. tropicalis' simple diploid genome and robust synteny to amniotes greatly enhances studies of vertebrate chromosome evolution.
2012-01-01
Background The X and Y sex chromosomes are conspicuous features of placental mammal genomes. Mammalian sex chromosomes arose from an ordinary pair of autosomes after the proto-Y acquired a male-determining gene and degenerated due to suppression of X-Y recombination. Analysis of earlier steps in X chromosome evolution has been hampered by the long interval between the origins of teleost and amniote lineages as well as scarcity of X chromosome orthologs in incomplete avian genome assemblies. Results This study clarifies the genesis and remodelling of the Eutherian X chromosome by using a combination of sequence analysis, meiotic map information, and cytogenetic localization to compare amniote genome organization with that of the amphibian Xenopus tropicalis. Nearly all orthologs of human X genes localize to X. tropicalis chromosomes 2 and 8, consistent with an ancestral X-conserved region and a single X-added region precursor. This finding contradicts a previous hypothesis of three evolutionary strata in this region. Homologies between human, opossum, chicken and frog chromosomes suggest a single X-added region predecessor in therian mammals, corresponding to opossum chromosomes 4 and 7. A more ancient X-added ancestral region, currently extant as a major part of chicken chromosome 1, is likely to have been present in the progenitor of synapsids and sauropsids. Analysis of X chromosome gene content emphasizes conservation of single protein coding genes and the role of tandem arrays in formation of novel genes. Conclusions Chromosomal regions orthologous to Therian X chromosomes have been located in the genome of the frog X. tropicalis. These X chromosome ancestral components experienced a series of fusion and breakage events to give rise to avian autosomes and mammalian sex chromosomes. The early branching tetrapod X. tropicalis’ simple diploid genome and robust synteny to amniotes greatly enhances studies of vertebrate chromosome evolution. PMID:22800176
Pham, Nikki T.; Wei, Tong; Schackwitz, Wendy S.; Lipzen, Anna M.; Duong, Phat Q.; Jones, Kyle C.; Ruan, Deling; Bauer, Diane; Peng, Yi; Schmutz, Jeremy
2017-01-01
The availability of a whole-genome sequenced mutant population and the cataloging of mutations of each line at a single-nucleotide resolution facilitate functional genomic analysis. To this end, we generated and sequenced a fast-neutron-induced mutant population in the model rice cultivar Kitaake (Oryza sativa ssp japonica), which completes its life cycle in 9 weeks. We sequenced 1504 mutant lines at 45-fold coverage and identified 91,513 mutations affecting 32,307 genes, i.e., 58% of all rice genes. We detected an average of 61 mutations per line. Mutation types include single-base substitutions, deletions, insertions, inversions, translocations, and tandem duplications. We observed a high proportion of loss-of-function mutations. We identified an inversion affecting a single gene as the causative mutation for the short-grain phenotype in one mutant line. This result reveals the usefulness of the resource for efficient, cost-effective identification of genes conferring specific phenotypes. To facilitate public access to this genetic resource, we established an open access database called KitBase that provides access to sequence data and seed stocks. This population complements other available mutant collections and gene-editing technologies. This work demonstrates how inexpensive next-generation sequencing can be applied to generate a high-density catalog of mutations. PMID:28576844
DOE Office of Scientific and Technical Information (OSTI.GOV)
Li, Guotian; Jain, Rashmi; Chern, Mawsheng
The availability of a whole-genome sequenced mutant population and the cataloging of mutations of each line at a single-nucleotide resolution facilitate functional genomic analysis. To this end, we generated and sequenced a fast-neutron-induced mutant population in the model rice cultivar Kitaake (Oryza sativa ssp japonica), which completes its life cycle in 9 weeks. We sequenced 1504 mutant lines at 45-fold coverage and identified 91,513 mutations affecting 32,307 genes, i.e., 58% of all rice genes. We detected an average of 61 mutations per line. Mutation types include single-base substitutions, deletions, insertions, inversions, translocations, and tandem duplications. We observed a high proportionmore » of loss-of-function mutations. We identified an inversion affecting a single gene as the causative mutation for the short-grain phenotype in one mutant line. This result reveals the usefulness of the resource for efficient, cost-effective identification of genes conferring specific phenotypes. To facilitate public access to this genetic resource, we established an open access database called KitBase that provides access to sequence data and seed stocks. This population complements other available mutant collections and gene-editing technologies. In conclusion, this work demonstrates how inexpensive next-generation sequencing can be applied to generate a high-density catalog of mutations.« less
Babaiants, L T; Dubinina, L A; Iushchenko, G M
2000-01-01
It was established by hybridological analysis that winter bread wheat lines 1/74-91, 3/36-91, 5/55-91 possess single dominant gene of resistance to bunt (Tilletia caries (DC) Tul.), but lines 8/2-91, 5/43-91, 4/11-91 and 8/16-91 have two independent dominant genes for this character. These genes originated from Aegilops cylindrica are not identical to Bt1-Bt17 genes and are unknown to date. The lines were obtained from crosses between winter bread wheat variety Odeskaya polukarlikovaya and Aegilops cylindrica.
Miyamoto, T; Koh, E; Tsujimura, A; Miyagawa, Y; Saijo, Y; Namiki, M; Sengoku, K
2014-04-01
Genetic mechanisms have been implicated as a cause of some cases of male infertility. Recently, ten novel genes involved in human spermatogenesis, including human LRWD1, have been identified by expression microarray analysis of human testictissue. The human LRWD1 protein mediates the origin recognition complex in chromatin, which is critical for the initiation of pre-replication complex assembly in G1 and chromatin organization in post-G1 cells. The Lrwd1 gene expression is specific to the testis in mice. Therefore, we hypothesized that mutation or polymorphisms of LRWD1 participate in male infertility, especially azoospermia. To investigate whether LRWD1 gene defects are associated with azoospermia caused by SCOS and meiotic arrest (MA), mutational analysis was performed in 100 and 30 Japanese patients by direct sequencing of the coding regions, respectively. Statistical analysis was performed for patients with SCOS and MA and in 100 healthy control men. No mutations were found in LRWD1; however, three coding single-nucleotide polymorphisms (SNP1-SNP3) could be detected in the patients. The genotype and allele frequencies in SNP1 and SNP2 were notably higher in the SCOS group than in the control group (P < 0.05). These results suggest the critical role of LRWD1 in human spermatogenesis. © 2013 Blackwell Verlag GmbH.
Carreno, R A; Barta, J R
1998-11-01
The small subunit ribosomal RNA (SSU rRNA) genes of hippoboscid (Ornithoica vicina Walker) and tabanid (Chrysops niger Macquart) Diptera were sequenced to determine their phylogenetic position within the order and to determine whether or not extensive hypervariable regions in this gene are widespread in the Diptera. A parsimony analysis of an alignment containing 8 dipteran sequences produced a single most parsimonious tree that placed O. vicina as sister group to Drosophila melanogaster Meigen. The tabanid Chrysops niger was sister group to the asilomorphan taxa, and the sister group to the Brachycera was a Tipula sp. although this relationship was not supported by bootstrap analysis. The hippoboscid and tabanid sequences contain extensive hypervariable regions in the V2, V4, V6, and V7 regions as do other Diptera. When these regions of the alignment were excluded from the phylogenetic analysis, a single most parsimonious tree was found. This tree had an identical overall topology to the tree obtained from the total data set. The hypervariable regions in parts of the dipteran SSU rRNA genes were more extensive in the nematocerous dipteran sequences used in this study than in the other dipteran representatives; these hypervariable regions may be of more utility in inferring relationship among species and subspecies than at the suprageneric level.
Maruyama, Kohei; Takeyama, Haruko; Nemoto, Etsuo; Tanaka, Tsuyoshi; Yoda, Kiyoshi; Matsunaga, Tadashi
2004-09-20
Single nucleotide polymorphism (SNP) detection for aldehyde dehydrogenase 2 (ALDH2) gene based on DNA thermal dissociation curve analysis was successfully demonstrated using an automated system with bacterial magnetic particles (BMPs) by developing a new method for avoiding light scattering caused by nanometer-size particles when using commercially available fluorescent dyes such as FITC, Cy3, and Cy5 as labeling chromophores. Biotin-labeled PCR products in ALDH2, two allele-specific probes (Cy3-labeled detection probe for ALDH2*1 and Cy5-labeled detection probe for ALDH2*2), streptavidin-immobilized BMPs (SA-BMPs) were simultaneously mixed. The mixture was denatured at 70 degrees C for 3 min, cooled slowly to 25 degrees C, and incubated for 10 min, allowing the DNA duplex to form between Cy3- or Cy5-labeled detection probes and biotin-labeled PCR products on SA-BMPs. Then duplex DNA-BMP complex was heated to 58 degrees C, a temperature determined by dissociation curve analysis and a dissociated single-base mismatched detection probe was removed at the same temperature under precise control. Furthermore, fluorescence signal from the detection probe was liberated into the supernatant from completely matched duplex DNA-BMP complex by heating to 80 degrees C and measured. In the homozygote target DNA (ALDH2*1/*1 and ALDH2*2/*2), the fluorescence signals from single-base mismatched were decreased to background level, indicating that mismatched hybridization was efficiently removed by the washing process. In the heterozygote target DNA (ALDH2*1/*2), each fluorescence signals was at a similar level. Therefore, three genotypes of SNP in ALDH2 gene were detected using the automated detection system with BMPs. Copyright 2004 Wiley Periodicals, Inc.
Hook, Sharon E; Lampi, Mark A; Febbo, Eric J; Ward, Jeff A; Parkerton, Thomas F
2010-09-01
Traditional biomarkers for hydrocarbon exposure are not induced by all petroleum substances. The objective of this study was to determine if exposure to a crude oil and different refined oils would generate a common hydrocarbon-specific response in gene expression profiles that could be used as generic biomarkers of hydrocarbon exposure. Juvenile rainbow trout (Oncorhynchus mykiss) were exposed to the water accommodated fraction (WAF) of either kerosene, gas oil, heavy fuel oil, or crude oil for 96 h. Tissue was collected for RNA extraction and microarray analysis. Exposure to each WAF resulted in a different list of differentially regulated genes, with few genes in common across treatments. Exposure to crude oil WAF changed the expression of genes including cytochrome P4501A (CYP1A) and glutathione-S-transferase (GST) with known roles in detoxification pathways. These gene expression profiles were compared to others from previous experiments that used a diverse suite of toxicants. Clustering algorithms successfully identified gene expression profiles resulting from hydrocarbon exposure. These preliminary analyses highlight the difficulties of using single genes as diagnostic of petroleum hydrocarbon exposures. Further work is needed to determine if multivariate transcriptomic-based biomarkers may be a more effective tool than single gene studies for exposure monitoring of different oils. Copyright 2010 SETAC.
Porcine MYF6 gene: sequence, homology analysis, and variation in the promoter region.
Wyszyńska-Koko, J; Kurył, J
2004-01-01
MYF6 gene codes for the bHLH transcription factor belonging to MyoD family. Its expression accompanies the processes of differentiation and maturation of myotubes during embriogenesis and continues on a relatively high level after birth, affecting the muscle phenotype. The porcine MYF6 gene was amplified and sequenced and compared with MYF6 gene sequences of other species. The amino acid sequence was deduced and an interspecies homology analysis was performed. Myf-6 protein shows a high conservation among species of 99 and 97% identity when comparing pig with cow and human, respectively, and of 93% when comparing pig with mouse and rat. The single nucleotide polymorphism (SNP) was revealed within the promoter region, which appeared to be T --> C transition recognized by a MspI restriction enzyme.
Hanif, Mubashir; Pardo, Alejandro Guillermo; Gorfer, Markus; Raudaskoski, Marjatta
2002-06-01
The T-DNA of Agrobacterium tumefaciens can be transferred to plants, yeasts, fungi and human cells. Using this system, dikaryotic mycelium of the ectomycorrhizal fungus Suillus bovinus was transformed with recombinant hygromycin B phosphotransferase (hph)and enhanced green fluorescent protein (EGFP) genes fused with a heterologous fungal promoter and CaMV35S terminator. Transformation resulted in hygromycin B-resistant clones, which were mitotically stable. Putative transformants were analysed for the presence of hph and EGFP genes by PCR and Southern analysis. The latter analysis proved both multiple- and single-copy integrations of the genes in the S. bovinus genome. A. tumeficiens transformation should make possible the development of tagged mutagenesis and targeted gene disruption technology for S. bovinus.
Whole-genome association studies of alcoholism with loci linked to schizophrenia susceptibility.
Namkung, Junghyun; Kim, Youngchul; Park, Taesung
2005-12-30
Alcoholism is a complex disease. There have been many reports on significant comorbidity between alcoholism and schizophrenia. For the genetic study of complex diseases, association analysis has been recommended because of its higher power than that of the linkage analysis for detecting genes with modest effects on disease. To identify alcoholism susceptibility loci, we performed genome-wide single-nucleotide polymorphisms (SNP) association tests, which yielded 489 significant SNPs at the 1% significance level. The association tests showed that tsc0593964 (P-value 0.000013) on chromosome 7 was most significantly associated with alcoholism. From 489 SNPs, 74 genes were identified. Among these genes, GABRA1 is a member of the same gene family with GABRA2 that was recently reported as alcoholism susceptibility gene. By comparing 74 genes to the published results of various linkage studies of schizophrenia, we identified 13 alcoholism associated genes that were located in the regions reported to be linked to schizophrenia. These 13 identified genes can be important candidate genes to study the genetic mechanism of co-occurrence of both diseases.
Joint mapping of genes and conditions via multidimensional unfolding analysis
Van Deun, Katrijn; Marchal, Kathleen; Heiser, Willem J; Engelen, Kristof; Van Mechelen, Iven
2007-01-01
Background Microarray compendia profile the expression of genes in a number of experimental conditions. Such data compendia are useful not only to group genes and conditions based on their similarity in overall expression over profiles but also to gain information on more subtle relations between genes and conditions. Getting a clear visual overview of all these patterns in a single easy-to-grasp representation is a useful preliminary analysis step: We propose to use for this purpose an advanced exploratory method, called multidimensional unfolding. Results We present a novel algorithm for multidimensional unfolding that overcomes both general problems and problems that are specific for the analysis of gene expression data sets. Applying the algorithm to two publicly available microarray compendia illustrates its power as a tool for exploratory data analysis: The unfolding analysis of a first data set resulted in a two-dimensional representation which clearly reveals temporal regulation patterns for the genes and a meaningful structure for the time points, while the analysis of a second data set showed the algorithm's ability to go beyond a mere identification of those genes that discriminate between different patient or tissue types. Conclusion Multidimensional unfolding offers a useful tool for preliminary explorations of microarray data: By relying on an easy-to-grasp low-dimensional geometric framework, relations among genes, among conditions and between genes and conditions are simultaneously represented in an accessible way which may reveal interesting patterns in the data. An additional advantage of the method is that it can be applied to the raw data without necessitating the choice of suitable genewise transformations of the data. PMID:17550582
Blanchard, Raymond K.; Moore, J. Bernadette; Green, Calvert L.; Cousins, Robert J.
2001-01-01
Mammalian nutritional status affects the homeostatic balance of multiple physiological processes and their associated gene expression. Although DNA array analysis can monitor large numbers of genes, there are no reports of expression profiling of a micronutrient deficiency in an intact animal system. In this report, we have tested the feasibility of using cDNA arrays to compare the global changes in expression of genes of known function that occur in the early stages of rodent zinc deficiency. The gene-modulating effects of this deficiency were demonstrated by real-time quantitative PCR measurements of altered mRNA levels for metallothionein 1, zinc transporter 2, and uroguanylin, all of which have been previously documented as zinc-regulated genes. As a result of the low level of inherent noise within this model system and application of a recently reported statistical tool for statistical analysis of microarrays [Tusher, V.G., Tibshirani, R. & Chu, G. (2001) Proc. Natl. Acad. Sci. USA 98, 5116–5121], we demonstrate the ability to reproducibly identify the modest changes in mRNA abundance produced by this single micronutrient deficiency. Among the genes identified by this array profile are intestinal genes that influence signaling pathways, growth, transcription, redox, and energy utilization. Additionally, the influence of dietary zinc supply on the expression of some of these genes was confirmed by real-time quantitative PCR. Overall, these data support the effectiveness of cDNA array expression profiling to investigate the pleiotropic effects of specific nutrients and may provide an approach to establishing markers for assessment of nutritional status. PMID:11717422
Single nucleotide polymorphism analysis using different colored dye dimer probes
NASA Astrophysics Data System (ADS)
Marmé, Nicole; Friedrich, Achim; Denapaite, Dalia; Hakenbeck, Regine; Knemeyer, Jens-Peter
2006-09-01
Fluorescence quenching by dye dimer formation has been utilized to develop hairpin-structured DNA probes for the detection of a single nucleotide polymorphism (SNP) in the penicillin target gene pbp2x, which is implicated in the penicillin resistance of Streptococcus pneumoniae. We designed two specific DNA probes for the identification of the pbp2x genes from a penicillin susceptible strain R6 and a resistant strain Streptococcus mitis 661 using green-fluorescent tetramethylrhodamine (TMR) and red-fluorescent DY-636, respectively. Hybridization of each of the probes to its respective target DNA sequence opened the DNA hairpin probes, consequently breaking the nonfluorescent dye dimers into fluorescent species. This hybridization of the target with the hairpin probe achieved single nucleotide specific detection at nanomolar concentrations via increased fluorescence.
duVerle, David A; Yotsukura, Sohiya; Nomura, Seitaro; Aburatani, Hiroyuki; Tsuda, Koji
2016-09-13
Single-cell RNA sequencing is fast becoming one the standard method for gene expression measurement, providing unique insights into cellular processes. A number of methods, based on general dimensionality reduction techniques, have been suggested to help infer and visualise the underlying structure of cell populations from single-cell expression levels, yet their models generally lack proper biological grounding and struggle at identifying complex differentiation paths. Here we introduce cellTree: an R/Bioconductor package that uses a novel statistical approach, based on document analysis techniques, to produce tree structures outlining the hierarchical relationship between single-cell samples, while identifying latent groups of genes that can provide biological insights. With cellTree, we provide experimentalists with an easy-to-use tool, based on statistically and biologically-sound algorithms, to efficiently explore and visualise single-cell RNA data. The cellTree package is publicly available in the online Bionconductor repository at: http://bioconductor.org/packages/cellTree/ .
The complete chloroplast genome sequence of Dodonaea viscosa: comparative and phylogenetic analyses.
Saina, Josphat K; Gichira, Andrew W; Li, Zhi-Zhong; Hu, Guang-Wan; Wang, Qing-Feng; Liao, Kuo
2018-02-01
The plant chloroplast (cp) genome is a highly conserved structure which is beneficial for evolution and systematic research. Currently, numerous complete cp genome sequences have been reported due to high throughput sequencing technology. However, there is no complete chloroplast genome of genus Dodonaea that has been reported before. To better understand the molecular basis of Dodonaea viscosa chloroplast, we used Illumina sequencing technology to sequence its complete genome. The whole length of the cp genome is 159,375 base pairs (bp), with a pair of inverted repeats (IRs) of 27,099 bp separated by a large single copy (LSC) 87,204 bp, and small single copy (SSC) 17,972 bp. The annotation analysis revealed a total of 115 unique genes of which 81 were protein coding, 30 tRNA, and four ribosomal RNA genes. Comparative genome analysis with other closely related Sapindaceae members showed conserved gene order in the inverted and single copy regions. Phylogenetic analysis clustered D. viscosa with other species of Sapindaceae with strong bootstrap support. Finally, a total of 249 SSRs were detected. Moreover, a comparison of the synonymous (Ks) and nonsynonymous (Ka) substitution rates in D. viscosa showed very low values. The availability of cp genome reported here provides a valuable genetic resource for comprehensive further studies in genetic variation, taxonomy and phylogenetic evolution of Sapindaceae family. In addition, SSR markers detected will be used in further phylogeographic and population structure studies of the species in this genus.
Whalen, M C; Innes, R W; Bent, A F; Staskawicz, B J
1991-01-01
To develop a model system for molecular genetic analysis of plant-pathogen interactions, we studied the interaction between Arabidopsis thaliana and the bacterial pathogen Pseudomonas syringae pv tomato (Pst). Pst strains were found to be virulent or avirulent on specific Arabidopsis ecotypes, and single ecotypes were resistant to some Pst strains and susceptible to others. In many plant-pathogen interactions, disease resistance is controlled by the simultaneous presence of single plant resistance genes and single pathogen avirulence genes. Therefore, we tested whether avirulence genes in Pst controlled induction of resistance in Arabidopsis. Cosmids that determine avirulence were isolated from Pst genomic libraries, and the Pst avirulence locus avrRpt2 was defined. This allowed us to construct pathogens that differed only by the presence or absence of a single putative avirulence gene. We found that Arabidopsis ecotype Col-0 was susceptible to Pst strain DC3000 but resistant to the same strain carrying avrRpt2, suggesting that a single locus in Col-0 determines resistance. As a first step toward genetically mapping the postulated resistance locus, an ecotype susceptible to infection by DC3000 carrying avrRpt2 was identified. The avrRpt2 locus from Pst was also moved into virulent strains of the soybean pathogen P. syringae pv glycinea to test whether this locus could determine avirulence on soybean. The resulting strains induced a resistant response in a cultivar-specific manner, suggesting that similar resistance mechanisms may function in Arabidopsis and soybean.
Proglucagons in vertebrates: Expression and processing of multiple genes in a bony fish.
Busby, Ellen R; Mommsen, Thomas P
2016-09-01
In contrast to mammals, where a single proglucagon (PG) gene encodes three peptides: glucagon, glucagon-like peptide 1 and glucagon-like peptide 2 (GLP-1; GLP-2), many non-mammalian vertebrates carry multiple PG genes. Here, we investigate proglucagon mRNA sequences, their tissue expression and processing in a diploid bony fish. Copper rockfish (Sebastes caurinus) express two independent genes coding for distinct proglucagon sequences (PG I, PG II), with PG II lacking the GLP-2 sequence. These genes are differentially transcribed in the endocrine pancreas, the brain, and the gastrointestinal tract. Alternative splicing identified in rockfish is only one part of this complex regulation of the PG transcripts: the system has the potential to produce two glucagons, four GLP-1s and a single GLP-2, or any combination of these peptides. Mass spectrometric analysis of partially purified PG-derived peptides in endocrine pancreas confirms translation of both PG transcripts and differential processing of the resulting peptides. The complex differential regulation of the two PG genes and their continued presence in this extant teleostean fish strongly suggests unique and, as yet largely unidentified, roles for the peptide products encoded in each gene. Copyright © 2016 Elsevier Inc. All rights reserved.
2012-01-01
High-dimensional gene expression data provide a rich source of information because they capture the expression level of genes in dynamic states that reflect the biological functioning of a cell. For this reason, such data are suitable to reveal systems related properties inside a cell, e.g., in order to elucidate molecular mechanisms of complex diseases like breast or prostate cancer. However, this is not only strongly dependent on the sample size and the correlation structure of a data set, but also on the statistical hypotheses tested. Many different approaches have been developed over the years to analyze gene expression data to (I) identify changes in single genes, (II) identify changes in gene sets or pathways, and (III) identify changes in the correlation structure in pathways. In this paper, we review statistical methods for all three types of approaches, including subtypes, in the context of cancer data and provide links to software implementations and tools and address also the general problem of multiple hypotheses testing. Further, we provide recommendations for the selection of such analysis methods. Reviewers This article was reviewed by Arcady Mushegian, Byung-Soo Kim and Joel Bader. PMID:23227854
de Santana Lopes, Amanda; Pacheco, Túlio Gomes; do Nascimento Vieira, Leila; Guerra, Miguel Pedro; Nodari, Rubens Onofre; de Souza, Emanuel Maltempi; de Oliveira Pedrosa, Fábio; Rogalski, Marcelo
2018-05-23
Crambe abyssinica is an important oilseed crop that accumulates high levels of erucic acid, which is being recognized as a potential oil platform for several industrial purposes. It belongs to the family Brassicaceae, assigned within the tribe Brassiceae. Both family and tribe have been the subject of several phylogenetic studies, but the relationship between some lineages and genera remains unclear. Here, we report the complete sequencing and characterization of the C. abyssinica plastome. Plastome structure, gene order, and gene content of C. abyssinica are similar to other species of the family Brassicaceae. The only exception is the rps16 gene, which is absent in many genera within the family Brassicaceae, but seems to be functional in the tribe Brassiceae, including C. abyssinica. However, the analysis of gene divergence shows that the rps16 is the most divergent gene in C. abyssinica and within the tribe Brassiceae. In addition, species of the tribe Brassiceae also show similar SSR loci distribution, with some regions containing a high number of SSRs, which are located mainly at the single copy regions. Six hotspots of nucleotide divergence among Brassiceae species were located in the single copy regions by sliding window analysis. Brassicaceae phylogenomic analysis, based on the complete plastomes of 72 taxa, resulted in a well-supported and well-resolved tree. The genus Crambe is positioned within the Brassiceae clade together with the genera Brassica, Raphanus, Sinapis, Cakile, Orychophragmus and Sinalliaria. Moreover, we report several losses and gains of RNA editing sites that occurred in plastomes of Brassiceae species during evolution. Copyright © 2017. Published by Elsevier B.V.
Cho, Young-Il; Ahn, Yul-Kyun; Tripathi, Swati; Kim, Jeong-Ho; Lee, Hye-Eun; Kim, Do-Sun
2015-01-01
Numerous studies using single nucleotide polymorphisms (SNPs) have been conducted in humans, and other animals, and in major crops, including rice, soybean, and Chinese cabbage. However, the number of SNP studies in cabbage is limited. In this present study, we evaluated whether 7,645 SNPs previously identified as molecular markers linked to disease resistance in the Brassica rapa genome could be applied to B. oleracea. In a BLAST analysis using the SNP sequences of B. rapa and B. oleracea genomic sequence data registered in the NCBI database, 256 genes for which SNPs had been identified in B. rapa were found in B. oleracea. These genes were classified into three functional groups: molecular function (64 genes), biological process (96 genes), and cellular component (96 genes). A total of 693 SNP markers, including 145 SNP markers [BRH—developed from the B. rapa genome for high-resolution melt (HRM) analysis], 425 SNP markers (BRP—based on the B. rapa genome that could be applied to B. oleracea), and 123 new SNP markers (BRS—derived from BRP and designed for HRM analysis), were investigated for their ability to amplify sequences from cabbage genomic DNA. In total, 425 of the SNP markers (BRP-based on B. rapa genome), selected from 7,645 SNPs, were successfully applied to B. oleracea. Using PCR, 108 of 145 BRH (74.5%), 415 of 425 BRP (97.6%), and 118 of 123 BRS (95.9%) showed amplification, suggesting that it is possible to apply SNP markers developed based on the B. rapa genome to B. oleracea. These results provide valuable information that can be utilized in cabbage genetics and breeding programs using molecular markers derived from other Brassica species. PMID:25790283
Effect of misspecification of gene frequency on the two-point LOD score.
Pal, D K; Durner, M; Greenberg, D A
2001-11-01
In this study, we used computer simulation of simple and complex models to ask: (1) What is the penalty in evidence for linkage when the assumed gene frequency is far from the true gene frequency? (2) If the assumed model for gene frequency and inheritance are misspecified in the analysis, can this lead to a higher maximum LOD score than that obtained under the true parameters? Linkage data simulated under simple dominant, recessive, dominant and recessive with reduced penetrance, and additive models, were analysed assuming a single locus with both the correct and incorrect dominance model and assuming a range of different gene frequencies. We found that misspecifying the analysis gene frequency led to little penalty in maximum LOD score in all models examined, especially if the assumed gene frequency was lower than the generating one. Analysing linkage data assuming a gene frequency of the order of 0.01 for a dominant gene, and 0.1 for a recessive gene, appears to be a reasonable tactic in the majority of realistic situations because underestimating the gene frequency, even when the true gene frequency is high, leads to little penalty in the LOD score.
Comparative analysis of genome-wide Mlo gene family in Cajanus cajan and Phaseolus vulgaris.
Deshmukh, Reena; Singh, V K; Singh, B D
2016-04-01
The Mlo gene was discovered in barley because the mutant 'mlo' allele conferred broad-spectrum, non-race-specific resistance to powdery mildew caused by Blumeria graminis f. sp. hordei. The Mlo genes also play important roles in growth and development of plants, and in responses to biotic and abiotic stresses. The Mlo gene family has been characterized in several crop species, but only a single legume species, soybean (Glycine max L.), has been investigated so far. The present report describes in silico identification of 18 CcMlo and 20 PvMlo genes in the important legume crops Cajanus cajan (L.) Millsp. and Phaseolus vulgaris L., respectively. In silico analysis of gene organization, protein properties and conserved domains revealed that the C. cajan and P. vulgaris Mlo gene paralogs are more divergent from each other than from their orthologous pairs. The comparative phylogenetic analysis classified CcMlo and PvMlo genes into three major clades. A comparative analysis of CcMlo and PvMlo proteins with the G. max Mlo proteins indicated close association of one CcMlo, one PvMlo with two GmMlo genes, indicating that there was no further expansion of the Mlo gene family after the separation of these species. Thus, most of the diploid species of eudicots might be expected to contain 15-20 Mlo genes. The genes CcMlo12 and 14, and PvMlo11 and 12 are predicted to participate in powdery mildew resistance. If this prediction were verified, these genes could be targeted by TILLING or CRISPR to isolate powdery mildew resistant mutants.
Van Assche, Evelien; Moons, Tim; Cinar, Ozan; Viechtbauer, Wolfgang; Oldehinkel, Albertine J; Van Leeuwen, Karla; Verschueren, Karine; Colpin, Hilde; Lambrechts, Diether; Van den Noortgate, Wim; Goossens, Luc; Claes, Stephan; van Winkel, Ruud
2017-12-01
Most gene-environment interaction studies (G × E) have focused on single candidate genes. This approach is criticized for its expectations of large effect sizes and occurrence of spurious results. We describe an approach that accounts for the polygenic nature of most psychiatric phenotypes and reduces the risk of false-positive findings. We apply this method focusing on the role of perceived parental support, psychological control, and harsh punishment in depressive symptoms in adolescence. Analyses were conducted on 982 adolescents of Caucasian origin (M age (SD) = 13.78 (.94) years) genotyped for 4,947 SNPs in 263 genes, selected based on a literature survey. The Leuven Adolescent Perceived Parenting Scale (LAPPS) and the Parental Behavior Scale (PBS) were used to assess perceived parental psychological control, harsh punishment, and support. The Center for Epidemiologic Studies Depression Scale (CES-D) was the outcome. We used gene-based testing taking into account linkage disequilibrium to identify genes containing SNPs exhibiting an interaction with environmental factors yielding a p-value per single gene. Significant results at the corrected p-value of p < 1.90 × 10 -4 were examined in an independent replication sample of Dutch adolescents (N = 1354). Two genes showed evidence for interaction with perceived support: GABRR1 (p = 4.62 × 10 -5 ) and GABRR2 (p = 9.05 × 10 -6 ). No genes interacted significantly with psychological control or harsh punishment. Gene-based analysis was unable to confirm the interaction of GABRR1 or GABRR2 with support in the replication sample. However, for GABRR2, but not GABRR1, the correlation of the estimates between the two datasets was significant (r (46) = .32; p = .027) and a gene-based analysis of the combined datasets supported GABRR2 × support interaction (p = 1.63 × 10 -4 ). We present a gene-based method for gene-environment interactions in a polygenic context and show that genes interact differently with particular aspects of parenting. This accentuates the importance of polygenic approaches and the need to accurately assess environmental exposure in G × E. © 2017 Association for Child and Adolescent Mental Health.
A nanobiosensor for dynamic single cell analysis during microvascular self-organization.
Wang, S; Sun, J; Zhang, D D; Wong, P K
2016-10-14
The formation of microvascular networks plays essential roles in regenerative medicine and tissue engineering. Nevertheless, the self-organization mechanisms underlying the dynamic morphogenic process are poorly understood due to a paucity of effective tools for mapping the spatiotemporal dynamics of single cell behaviors. By establishing a single cell nanobiosensor along with live cell imaging, we perform dynamic single cell analysis of the morphology, displacement, and gene expression during microvascular self-organization. Dynamic single cell analysis reveals that endothelial cells self-organize into subpopulations with specialized phenotypes to form microvascular networks and identifies the involvement of Notch1-Dll4 signaling in regulating the cell subpopulations. The cell phenotype correlates with the initial Dll4 mRNA expression level and each subpopulation displays a unique dynamic Dll4 mRNA expression profile. Pharmacological perturbations and RNA interference of Notch1-Dll4 signaling modulate the cell subpopulations and modify the morphology of the microvascular network. Taken together, a nanobiosensor enables a dynamic single cell analysis approach underscoring the importance of Notch1-Dll4 signaling in microvascular self-organization.
Expanding the horizons for single-cell applications on lab-on-a-chip devices.
Kim, Soo Hyeon; Fourmy, Dominique; Fujii, Teruo
2012-01-01
Stochastic events in gene expression, protein synthesis, and metabolite synthesis or degradation lead to cellular heterogeneity essential to life. In a tissue as we see in organs, there is strong heterogeneity among the constituting cells critical to its function. Thus, there exists a strong demand to develop new micro/nanosystems that would enable us to conduct single-cell analysis. This field is rapidly growing, as exemplified below with recent emerging technologies that now reveal sensitive single-cell "omics" analysis. We describe in the review some of the most promising technologies that will certainly transform our view of biology in the near future.
van der Vossen, E A; van der Voort, J N; Kanyuka, K; Bendahmane, A; Sandbrink, H; Baulcombe, D C; Bakker, J; Stiekema, W J; Klein-Lankhorst, R M
2000-09-01
The isolation of the nematode-resistance gene Gpa2 in potato is described, and it is demonstrated that highly homologous resistance genes of a single resistance-gene cluster can confer resistance to distinct pathogen species. Molecular analysis of the Gpa2 locus resulted in the identification of an R-gene cluster of four highly homologous genes in a region of approximately 115 kb. At least two of these genes are active: one corresponds to the previously isolated Rx1 gene that confers resistance to potato virus X, while the other corresponds to the Gpa2 gene that confers resistance to the potato cyst nematode Globodera pallida. The proteins encoded by the Gpa2 and the Rx1 genes share an overall homology of over 88% (amino-acid identity) and belong to the leucine-zipper, nucleotide-binding site, leucine-rich repeat (LZ-NBS-LRR)-containing class of plant resistance genes. From the sequence conservation between Gpa2 and Rx1 it is clear that there is a direct evolutionary relationship between the two proteins. Sequence diversity is concentrated in the LRR region and in the C-terminus. The putative effector domains are more conserved suggesting that, at least in this case, nematode and virus resistance cascades could share common components. These findings underline the potential of protein breeding for engineering new resistance specificities against plant pathogens in vitro.
Xu, Yiliang; Ren, Jun; Ye, Haihong
2018-04-20
Schizophrenia is a severe psychiatric disorder. Genetic and functional studies have strongly implicated the disrupted in schizophrenia 1 gene (DISC1) as a candidate susceptibility gene for schizophrenia. Moreover, recent association studies have indicated that several DISC1 single nucleotide polymorphisms (SNPs) are associated with schizophrenia. However, the association is hardly replicate in different ethnic group. Here, we performed a meta-analysis of the association between DISC1 SNPs and schizophrenia in which the samples were divided into subgroups according to ethnicity. Both rs3738401 and rs821616 showed not significantly association with schizophrenia in the Caucasian, Asian, Japanese or Han Chinese populations. Copyright © 2018 Elsevier B.V. All rights reserved.
Han, Xiaoping; Chen, Haide; Huang, Daosheng; Chen, Huidong; Fei, Lijiang; Cheng, Chen; Huang, He; Yuan, Guo-Cheng; Guo, Guoji
2018-04-05
Human pluripotent stem cells (hPSCs) provide powerful models for studying cellular differentiations and unlimited sources of cells for regenerative medicine. However, a comprehensive single-cell level differentiation roadmap for hPSCs has not been achieved. We use high throughput single-cell RNA-sequencing (scRNA-seq), based on optimized microfluidic circuits, to profile early differentiation lineages in the human embryoid body system. We present a cellular-state landscape for hPSC early differentiation that covers multiple cellular lineages, including neural, muscle, endothelial, stromal, liver, and epithelial cells. Through pseudotime analysis, we construct the developmental trajectories of these progenitor cells and reveal the gene expression dynamics in the process of cell differentiation. We further reprogram primed H9 cells into naïve-like H9 cells to study the cellular-state transition process. We find that genes related to hemogenic endothelium development are enriched in naïve-like H9. Functionally, naïve-like H9 show higher potency for differentiation into hematopoietic lineages than primed cells. Our single-cell analysis reveals the cellular-state landscape of hPSC early differentiation, offering new insights that can be harnessed for optimization of differentiation protocols.
Schiffer, Mario
2017-11-01
Single-cell RNA-sequence (RNA-seq) is a widely used tool to study biological questions in single cells. The discussed study identified 92 genes being predominantly expressed in podocytes based on a 5-fold higher expression compared with endothelial and mesangial cells. In addition to technical pitfalls, the question that is discussed in this commentary is whether results of a single-cell RNAseq study are able to deliver expression data that truly characterize a podocyte. Copyright © 2017 International Society of Nephrology. Published by Elsevier Inc. All rights reserved.
Kotze, M J; De Villiers, J N; Groenewald, J Z; Rooney, R N; Loubser, O; Thiart, R; Oosthuizen, C J; van Niekerk, M M; Groenewald, I M; Retief, A E; Warnich, L
1998-10-01
A subset of probands from 11 South African families with clinical and/or biochemical features of variegate porphyria (VP), but without the known protoporphyrinogen oxidase (PPOX) gene defects identified previously in the South African population, were subjected to mutation analysis. Disease-related mutation(s) could not be identified after screening virtually the entire PPOX gene by heteroduplex single-strand conformation polymorphism analysis (HEX-SSCP), although three new sequence variants were detected in exon 1 of the gene in three normal controls. The presence of these single base changes at nucleotide positions 22 (C/G), 27 (C/A) and 127 (C/A), in addition to the known exon 1 polymorphisms I-26 and I-150, indicates that this untranslated region of the PPOX gene is particularly mutation-prone. Furthermore, microsatellite markers flanking the PPOX and alpha-1 antitrypsin (PI) gene, on chromosomes 1 and 14, respectively, were used to assess the probability of involvement of these loci in disease presentation. Common alleles transmitted from affected parent to affected child were determined where possible in the mutation-negative index cases. Allelic frequencies of these
Dallman, Timothy J; Chattaway, Marie A; Cowley, Lauren A; Doumith, Michel; Tewolde, Rediat; Wooldridge, David J; Underwood, Anthony; Ready, Derren; Wain, John; Foster, Kirsty; Grant, Kathie A; Jenkins, Claire
2014-01-01
Following a large outbreak of foodborne gastrointestinal (GI) disease, a multiplex PCR approach was used retrospectively to investigate faecal specimens from 88 of the 413 reported cases. Gene targets from a range of bacterial GI pathogens were detected, including Salmonella species, Shigella species and Shiga toxin-producing Escherichia coli, with the majority (75%) of faecal specimens being PCR positive for aggR associated with the Enteroaggregative E. coli (EAEC) group. The 20 isolates of EAEC recovered from the outbreak specimens exhibited a range of serotypes, the most frequent being O104:H4 and O131:H27. None of the EAEC isolates had the Shiga toxin (stx) genes. Multilocus sequence typing and single nucleotide polymorphism analysis of the core genome confirmed the diverse phylogeny of the strains. The analysis also revealed a close phylogenetic relationship between the EAEC O104:H4 strains in this outbreak and the strain of E. coli O104:H4 associated with a large outbreak of haemolytic ureamic syndrome in Germany in 2011. Further analysis of the EAEC plasmids, encoding the key enteroaggregative virulence genes, showed diversity with respect to FIB/FII type, gene content and genomic architecture. Known EAEC virulence genes, such as aggR, aat and aap, were present in all but one of the strains. A variety of fimbrial genes were observed, including genes encoding all five known fimbrial types, AAF/1 to AAF/V. The AAI operon was present in its entirety in 15 of the EAEC strains, absent in three and present, but incomplete, in two isolates. EAEC is known to be a diverse pathotype and this study demonstrates that a high level of diversity in strains recovered from cases associated with a single outbreak. Although the EAEC in this study did not carry the stx genes, this outbreak provides further evidence of the pathogenic potential of the EAEC O104:H4 serotype.
DOE Office of Scientific and Technical Information (OSTI.GOV)
Noethen, M.M.; Eggermann, K.; Propping, P.
1995-10-01
It is well accepted that association studies are a major tool in investigating the contribution of single genes to the development of diseases that do not follow simple Mendelian inheritance pattern (so-called complex traits). Such major psychiatric diseases as bipolar affective disorder and schizophrenia clearly fall into this category of diseases. 7 refs., 1 tab.
USDA-ARS?s Scientific Manuscript database
Semigamy in cotton is a type of facultative apomixis controlled by a single incompletely dominant gene (Se) in which the sperm and egg nuclei fail to fuse after the sperm nucleus has entered the embryo sac, giving rise to diploid, haploid or even chimeral embryos comprised of paternal and maternal o...
Kogelman, Lisette J A; Zhernakova, Daria V; Westra, Harm-Jan; Cirera, Susanna; Fredholm, Merete; Franke, Lude; Kadarmideen, Haja N
2015-10-20
Obesity is a multi-factorial health problem in which genetic factors play an important role. Limited results have been obtained in single-gene studies using either genomic or transcriptomic data. RNA sequencing technology has shown its potential in gaining accurate knowledge about the transcriptome, and may reveal novel genes affecting complex diseases. Integration of genomic and transcriptomic variation (expression quantitative trait loci [eQTL] mapping) has identified causal variants that affect complex diseases. We integrated transcriptomic data from adipose tissue and genomic data from a porcine model to investigate the mechanisms involved in obesity using a systems genetics approach. Using a selective gene expression profiling approach, we selected 36 animals based on a previously created genomic Obesity Index for RNA sequencing of subcutaneous adipose tissue. Differential expression analysis was performed using the Obesity Index as a continuous variable in a linear model. eQTL mapping was then performed to integrate 60 K porcine SNP chip data with the RNA sequencing data. Results were restricted based on genome-wide significant single nucleotide polymorphisms, detected differentially expressed genes, and previously detected co-expressed gene modules. Further data integration was performed by detecting co-expression patterns among eQTLs and integration with protein data. Differential expression analysis of RNA sequencing data revealed 458 differentially expressed genes. The eQTL mapping resulted in 987 cis-eQTLs and 73 trans-eQTLs (false discovery rate < 0.05), of which the cis-eQTLs were associated with metabolic pathways. We reduced the eQTL search space by focusing on differentially expressed and co-expressed genes and disease-associated single nucleotide polymorphisms to detect obesity-related genes and pathways. Building a co-expression network using eQTLs resulted in the detection of a module strongly associated with lipid pathways. Furthermore, we detected several obesity candidate genes, for example, ENPP1, CTSL, and ABHD12B. To our knowledge, this is the first study to perform an integrated genomics and transcriptomics (eQTL) study using, and modeling, genomic and subcutaneous adipose tissue RNA sequencing data on obesity in a porcine model. We detected several pathways and potential causal genes for obesity. Further validation and investigation may reveal their exact function and association with obesity.
Mapping heterogeneity in patient-derived melanoma cultures by single-cell RNA-seq
Loeffler-Wirth, Henry; Hopp, Lydia; Schadendorf, Dirk; Schartl, Manfred; Anderegg, Ulf; Camp, Gray; Treutlein, Barbara; Binder, Hans; Kunz, Manfred
2017-01-01
Recent technological advances in single-cell genomics make it possible to analyze cellular heterogeneity of tumor samples. Here, we applied single-cell RNA-seq to measure the transcriptomes of 307 single cells cultured from three biopsies of three different patients with a BRAF/NRAS wild type, BRAF mutant/NRAS wild type and BRAF wild type/NRAS mutant melanoma metastasis, respectively. Analysis based on self-organizing maps identified sub-populations defined by multiple gene expression modules involved in proliferation, oxidative phosphorylation, pigmentation and cellular stroma. Gene expression modules had prognostic relevance when compared with gene expression data from published melanoma samples and patient survival data. We surveyed kinome expression patterns across sub-populations of the BRAF/NRAS wild type sample and found that CDK4 and CDK2 were consistently highly expressed in the majority of cells, suggesting that these kinases might be involved in melanoma progression. Treatment of cells with the CDK4 inhibitor palbociclib restricted cell proliferation to a similar, and in some cases greater, extent than MAPK inhibitors. Finally, we identified a low abundant sub-population in this sample that highly expressed a module containing ABC transporter ABCB5, surface markers CD271 and CD133, and multiple aldehyde dehydrogenases (ALDHs). Patient-derived cultures of the BRAF mutant/NRAS wild type and BRAF wild type/NRAS mutant metastases showed more homogeneous single-cell gene expression patterns with gene expression modules for proliferation and ABC transporters. Taken together, our results describe an intertumor and intratumor heterogeneity in melanoma short-term cultures which might be relevant for patient survival, and suggest promising targets for new treatment approaches in melanoma therapy. PMID:27903987
Evangelou, Marina; Smyth, Deborah J; Fortune, Mary D; Burren, Oliver S; Walker, Neil M; Guo, Hui; Onengut-Gumuscu, Suna; Chen, Wei-Min; Concannon, Patrick; Rich, Stephen S; Todd, John A; Wallace, Chris
2014-01-01
Pathway analysis can complement point-wise single nucleotide polymorphism (SNP) analysis in exploring genomewide association study (GWAS) data to identify specific disease-associated genes that can be candidate causal genes. We propose a straightforward methodology that can be used for conducting a gene-based pathway analysis using summary GWAS statistics in combination with widely available reference genotype data. We used this method to perform a gene-based pathway analysis of a type 1 diabetes (T1D) meta-analysis GWAS (of 7,514 cases and 9,045 controls). An important feature of the conducted analysis is the removal of the major histocompatibility complex gene region, the major genetic risk factor for T1D. Thirty-one of the 1,583 (2%) tested pathways were identified to be enriched for association with T1D at a 5% false discovery rate. We analyzed these 31 pathways and their genes to identify SNPs in or near these pathway genes that showed potentially novel association with T1D and attempted to replicate the association of 22 SNPs in additional samples. Replication P-values were skewed () with 12 of the 22 SNPs showing . Support, including replication evidence, was obtained for nine T1D associated variants in genes ITGB7 (rs11170466, ), NRP1 (rs722988, ), BAD (rs694739, ), CTSB (rs1296023, ), FYN (rs11964650, ), UBE2G1 (rs9906760, ), MAP3K14 (rs17759555, ), ITGB1 (rs1557150, ), and IL7R (rs1445898, ). The proposed methodology can be applied to other GWAS datasets for which only summary level data are available. PMID:25371288
A single-base deletion in soybean flavonol synthase gene is associated with magenta flower color.
Takahashi, Ryoji; Githiri, Stephen M; Hatayama, Kouta; Dubouzet, Emilyn G; Shimada, Norimoto; Aoki, Toshio; Ayabe, Shin-ichi; Iwashina, Tsukasa; Toda, Kyoko; Matsumura, Hisakazu
2007-01-01
The Wm locus of soybean [Glycine max (L.) Merr.] controls flower color. Dominant Wm and recessive wm allele of the locus produce purple and magenta flower, respectively. A putative full-length cDNA of flavonol synthase (FLS), gmfls1 was isolated by 5' RACE and end-to-end PCR from a cultivar Harosoy with purple flower (WmWm). Sequence analysis revealed that gmfls1 consisted of 1,208 nucleotides encoding 334 amino acids. It had 59-72% homology with FLS proteins of other plant species. Conserved dioxygenase domains A and B were found in the deduced polypeptide. Sequence comparison between Harosoy and Harosoy-wm (magenta flower mutant of Harosoy; wmwm) revealed that they differed by a single G deletion in the coding region of Harosoy-wm. The deletion changed the subsequent reading frame resulting in a truncated polypeptide consisting of 37 amino acids that lacked the dioxygenase domains A and B. Extracts of E. coli cells expressing gmfls1 of Harosoy catalyzed the formation of quercetin from dihydroquercetin, whereas cell extracts expressing gmfls1 of Harosoy-wm had no FLS activity. Genomic Southern analysis suggested the existence of three to four copies of the FLS gene in the soybean genome. CAPS analysis was performed to detect the single-base deletion. Harosoy and Clark (WmWm) exhibited longer fragments, while Harosoy-wm had shorter fragments due to the single-base deletion. The CAPS marker co-segregated with genotypes at Wm locus in a F(2) population segregating for the locus. Linkage mapping using SSR markers revealed that the Wm and gmfls1 were mapped at similar position in the molecular linkage group F. The above results strongly suggest that gmfls1 represents the Wm gene and that the single-base deletion may be responsible for magenta flower color.
The Essential Genome of Escherichia coli K-12.
Goodall, Emily C A; Robinson, Ashley; Johnston, Iain G; Jabbari, Sara; Turner, Keith A; Cunningham, Adam F; Lund, Peter A; Cole, Jeffrey A; Henderson, Ian R
2018-02-20
Transposon-directed insertion site sequencing (TraDIS) is a high-throughput method coupling transposon mutagenesis with short-fragment DNA sequencing. It is commonly used to identify essential genes. Single gene deletion libraries are considered the gold standard for identifying essential genes. Currently, the TraDIS method has not been benchmarked against such libraries, and therefore, it remains unclear whether the two methodologies are comparable. To address this, a high-density transposon library was constructed in Escherichia coli K-12. Essential genes predicted from sequencing of this library were compared to existing essential gene databases. To decrease false-positive identification of essential genes, statistical data analysis included corrections for both gene length and genome length. Through this analysis, new essential genes and genes previously incorrectly designated essential were identified. We show that manual analysis of TraDIS data reveals novel features that would not have been detected by statistical analysis alone. Examples include short essential regions within genes, orientation-dependent effects, and fine-resolution identification of genome and protein features. Recognition of these insertion profiles in transposon mutagenesis data sets will assist genome annotation of less well characterized genomes and provides new insights into bacterial physiology and biochemistry. IMPORTANCE Incentives to define lists of genes that are essential for bacterial survival include the identification of potential targets for antibacterial drug development, genes required for rapid growth for exploitation in biotechnology, and discovery of new biochemical pathways. To identify essential genes in Escherichia coli , we constructed a transposon mutant library of unprecedented density. Initial automated analysis of the resulting data revealed many discrepancies compared to the literature. We now report more extensive statistical analysis supported by both literature searches and detailed inspection of high-density TraDIS sequencing data for each putative essential gene for the E. coli model laboratory organism. This paper is important because it provides a better understanding of the essential genes of E. coli , reveals the limitations of relying on automated analysis alone, and provides a new standard for the analysis of TraDIS data. Copyright © 2018 Goodall et al.
Han, Yike; Wang, Xianyun; Zhao, Fengyue; Gao, Shang; Wei, Aimin; Chen, Zhengwu; Liu, Nan; Zhang, Zhenxian; Du, Shengli
2018-05-01
Cucumber ( Cucumis sativus L. ) pollen development involves a diverse range of gene interactions between sporophytic and gametophytic tissues. Previous studies in our laboratory showed that male sterility was controlled by a single recessive nuclear gene, and occurred in pollen mother cell meiophase. To fully explore the global gene expression and identify genes related to male sterility, a RNA-seq analysis was adopted in this study. Young male flower-buds (1-2 mm in length) from genetic male sterility (GMS) mutant and homozygous fertile cucumber (WT) were collected for two sequencing libraries. Total 545 differentially expressed genes (DEGs), including 142 up-regulated DEGs and 403 down-regulated DEGs, were detected in two libraries (Fold Change ≥ 2, FDR < 0.01). These genes were involved in a variety of metabolic pathways, like ethylene-activated signaling pathway, sporopollenin biosynthetic pathway, cell cycle and DNA damage repair pathway. qRT-PCR analysis was performed and showed that the correlation between RNA-Seq and qRT-PCR was 0.876. These findings contribute to a better understanding of the mechanism that leads to GMS in cucumber.
van der Ley, P
1988-11-01
Gonococci express a family of related outer membrane proteins designated protein II (P.II). These surface proteins are subject to both phase variation and antigenic variation. The P.II gene repertoire of Neisseria gonorrhoeae strain JS3 was found to consist of at least ten genes, eight of which were cloned. Sequence analysis and DNA hybridization studies revealed that one particular P.II-encoding sequence is present in three distinct, but almost identical, copies in the JS3 genome. These genes encode the P.II protein that was previously identified as P.IIc. Comparison of their sequences shows that the multiple copies of this P.IIc-encoding gene might have been generated by both gene conversion and gene duplication.
Lan, Yi; Sun, Jin; Tian, Renmao; Bartlett, Douglas H; Li, Runsheng; Wong, Yue Him; Zhang, Weipeng; Qiu, Jian-Wen; Xu, Ting; He, Li-Sheng; Tabata, Harry G; Qian, Pei-Yuan
2017-07-01
The Challenger Deep in the Mariana Trench is the deepest point in the oceans of our planet. Understanding how animals adapt to this harsh environment characterized by high hydrostatic pressure, food limitation, dark and cold is of great scientific interest. Of the animals dwelling in the Challenger Deep, amphipods have been captured using baited traps. In this study, we sequenced the transcriptome of the amphipod Hirondellea gigas collected at a depth of 10,929 m from the East Pond of the Challenger Deep. Assembly of these sequences resulted in 133,041 contigs and 22,046 translated proteins. Functional annotation of these contigs was made using the go and kegg databases. Comparison of these translated proteins with those of four shallow-water amphipods revealed 10,731 gene families, of which 5659 were single-copy orthologs. Base substitution analysis on these single-copy orthologs showed that 62 genes are positively selected in H. gigas, including genes related to β-alanine biosynthesis, energy metabolism and genetic information processing. For multiple-copy orthologous genes, gene family expansion analysis revealed that cold-inducible proteins (i.e., transcription factors II A and transcription elongation factor 1) as well as zinc finger domains are expanded in H. gigas. Overall, our results indicate that genetic adaptation to the hadal environment by H. gigas may be mediated by both gene family expansion and amino acid substitutions of specific proteins. © 2017 John Wiley & Sons Ltd.
Zhang, Xiaofei; Liu, Dongcheng; Zhang, Jianghua; Jiang, Wei; Luo, Guangbin; Yang, Wenlong; Sun, Jiazhu; Tong, Yiping; Cui, Dangqun; Zhang, Aimin
2013-01-01
Low-molecular-weight glutenin subunits (LMW-GS), encoded by a complex multigene family, play an important role in the processing quality of wheat flour. Although members of this gene family have been identified in several wheat varieties, the allelic variation and composition of LMW-GS genes in common wheat are not well understood. In the present study, using the LMW-GS gene molecular marker system and the full-length gene cloning method, a comprehensive molecular analysis of LMW-GS genes was conducted in a representative population, the micro-core collections (MCC) of Chinese wheat germplasm. Generally, >15 LMW-GS genes were identified from individual MCC accessions, of which 4–6 were located at the Glu-A3 locus, 3–5 at the Glu-B3 locus, and eight at the Glu-D3 locus. LMW-GS genes at the Glu-A3 locus showed the highest allelic diversity, followed by the Glu-B3 genes, while the Glu-D3 genes were extremely conserved among MCC accessions. Expression and sequence analysis showed that 9–13 active LMW-GS genes were present in each accession. Sequence identity analysis showed that all i-type genes present at the Glu-A3 locus formed a single group, the s-type genes located at Glu-B3 and Glu-D3 loci comprised a unique group, while high-diversity m-type genes were classified into four groups and detected in all Glu-3 loci. These results contribute to the functional analysis of LMW-GS genes and facilitate improvement of bread-making quality by wheat molecular breeding programmes. PMID:23536608
Serebrova, V N; Trifonova, E A; Gabidulina, T V; Bukharina, I Yu; Agarkova, T A; Evtushenko, I D; Maksimova, N R; Stepanov, V A
2016-01-01
Regulatory single nucleotide polymorphisms (rSNPs) are the least-studied group of SNP; however, they play an essential role in the development of human pathology by altering the level of candidate genes expression. In this work, we analyzed 29 rSNPs in 17 new candidate genes associated with preeclampsia (PE) according to the analysis of the transcriptome in placental tissue. Three ethnic groups have been studied (yakut, russian, and buryat). We have detected significant associations of PE with eight rSNPs in six differentially expressed genes, i.e., rs10423795 in the LHB gene; rs3771787 in the HK2 gene; rs72959687 in the INHA gene; rs12678229, rs2227262, and rs3802252 in the NDRG1 gene; rs34845949 in the SASH1 gene; and rs66707428 in the PPP1R12C gene. We used a new approach to detecting genetic markers of multifactorial diseases in the case of PE based on a combination of genomic, transcriptomic, and bioinformatic approaches. This approach proved its efficiency and may be applied to detecting new potential genetic markers in genes involved in disease pathogenesis, which reduces missing heritability in multifactorial diseases.
Bao, Le; Gu, Hong; Dunn, Katherine A; Bielawski, Joseph P
2007-02-08
Models of codon evolution have proven useful for investigating the strength and direction of natural selection. In some cases, a priori biological knowledge has been used successfully to model heterogeneous evolutionary dynamics among codon sites. These are called fixed-effect models, and they require that all codon sites are assigned to one of several partitions which are permitted to have independent parameters for selection pressure, evolutionary rate, transition to transversion ratio or codon frequencies. For single gene analysis, partitions might be defined according to protein tertiary structure, and for multiple gene analysis partitions might be defined according to a gene's functional category. Given a set of related fixed-effect models, the task of selecting the model that best fits the data is not trivial. In this study, we implement a set of fixed-effect codon models which allow for different levels of heterogeneity among partitions in the substitution process. We describe strategies for selecting among these models by a backward elimination procedure, Akaike information criterion (AIC) or a corrected Akaike information criterion (AICc). We evaluate the performance of these model selection methods via a simulation study, and make several recommendations for real data analysis. Our simulation study indicates that the backward elimination procedure can provide a reliable method for model selection in this setting. We also demonstrate the utility of these models by application to a single-gene dataset partitioned according to tertiary structure (abalone sperm lysin), and a multi-gene dataset partitioned according to the functional category of the gene (flagellar-related proteins of Listeria). Fixed-effect models have advantages and disadvantages. Fixed-effect models are desirable when data partitions are known to exhibit significant heterogeneity or when a statistical test of such heterogeneity is desired. They have the disadvantage of requiring a priori knowledge for partitioning sites. We recommend: (i) selection of models by using backward elimination rather than AIC or AICc, (ii) use a stringent cut-off, e.g., p = 0.0001, and (iii) conduct sensitivity analysis of results. With thoughtful application, fixed-effect codon models should provide a useful tool for large scale multi-gene analyses.
Homology-dependent Gene Silencing in Paramecium
Ruiz, Françoise; Vayssié, Laurence; Klotz, Catherine; Sperling, Linda; Madeddu, Luisa
1998-01-01
Microinjection at high copy number of plasmids containing only the coding region of a gene into the Paramecium somatic macronucleus led to a marked reduction in the expression of the corresponding endogenous gene(s). The silencing effect, which is stably maintained throughout vegetative growth, has been observed for all Paramecium genes examined so far: a single-copy gene (ND7), as well as members of multigene families (centrin genes and trichocyst matrix protein genes) in which all closely related paralogous genes appeared to be affected. This phenomenon may be related to posttranscriptional gene silencing in transgenic plants and quelling in Neurospora and allows the efficient creation of specific mutant phenotypes thus providing a potentially powerful tool to study gene function in Paramecium. For the two multigene families that encode proteins that coassemble to build up complex subcellular structures the analysis presented herein provides the first experimental evidence that the members of these gene families are not functionally redundant. PMID:9529389
Yang, Congcong; Ding, Puyang; Liu, Yaxi; Qiao, Linyi; Chang, Zhijian; Geng, Hongwei; Wang, Penghao; Jiang, Qiantao; Wang, Jirui; Chen, Guoyue; Wei, Yuming; Zheng, Youliang; Lan, Xiujin
2017-01-01
The MADS-box genes encode transcription factors with key roles in plant growth and development. A comprehensive analysis of the MADS-box gene family in bread wheat (Triticum aestivum) has not yet been conducted, and our understanding of their roles in stress is rather limited. Here, we report the identification and characterization of the MADS-box gene family in wheat. A total of 180 MADS-box genes classified as 32 Mα, 5 Mγ, 5 Mδ, and 138 MIKC types were identified. Evolutionary analysis of the orthologs among T. urartu, Aegilops tauschii and wheat as well as homeologous sequences analysis among the three sub-genomes in wheat revealed that gene loss and chromosomal rearrangements occurred during and/or after the origin of bread wheat. Forty wheat MADS-box genes that were expressed throughout the investigated tissues and development stages were identified. The genes that were regulated in response to both abiotic stresses (i.e., phosphorus deficiency, drought, heat, and combined drought and heat) and biotic stresses (i.e., Fusarium graminearum, Septoria tritici, stripe rust and powdery mildew) were detected as well. A few notable MADS-box genes were specifically expressed in a single tissue and those showed relatively higher expression differences between the stress and control treatment. The expression patterns of considerable MADS-box genes differed from those of their orthologs in Brachypodium, rice, and Arabidopsis. Collectively, the present study provides new insights into the possible roles of MADS-box genes in response to stresses and will be valuable for further functional studies of important candidate MADS-box genes. PMID:28742823
Ma, Jian; Yang, Yujie; Luo, Wei; Yang, Congcong; Ding, Puyang; Liu, Yaxi; Qiao, Linyi; Chang, Zhijian; Geng, Hongwei; Wang, Penghao; Jiang, Qiantao; Wang, Jirui; Chen, Guoyue; Wei, Yuming; Zheng, Youliang; Lan, Xiujin
2017-01-01
The MADS-box genes encode transcription factors with key roles in plant growth and development. A comprehensive analysis of the MADS-box gene family in bread wheat (Triticum aestivum) has not yet been conducted, and our understanding of their roles in stress is rather limited. Here, we report the identification and characterization of the MADS-box gene family in wheat. A total of 180 MADS-box genes classified as 32 Mα, 5 Mγ, 5 Mδ, and 138 MIKC types were identified. Evolutionary analysis of the orthologs among T. urartu, Aegilops tauschii and wheat as well as homeologous sequences analysis among the three sub-genomes in wheat revealed that gene loss and chromosomal rearrangements occurred during and/or after the origin of bread wheat. Forty wheat MADS-box genes that were expressed throughout the investigated tissues and development stages were identified. The genes that were regulated in response to both abiotic stresses (i.e., phosphorus deficiency, drought, heat, and combined drought and heat) and biotic stresses (i.e., Fusarium graminearum, Septoria tritici, stripe rust and powdery mildew) were detected as well. A few notable MADS-box genes were specifically expressed in a single tissue and those showed relatively higher expression differences between the stress and control treatment. The expression patterns of considerable MADS-box genes differed from those of their orthologs in Brachypodium, rice, and Arabidopsis. Collectively, the present study provides new insights into the possible roles of MADS-box genes in response to stresses and will be valuable for further functional studies of important candidate MADS-box genes.
Trapnell, Cole; Roberts, Adam; Goff, Loyal; Pertea, Geo; Kim, Daehwan; Kelley, David R; Pimentel, Harold; Salzberg, Steven L; Rinn, John L; Pachter, Lior
2012-01-01
Recent advances in high-throughput cDNA sequencing (RNA-seq) can reveal new genes and splice variants and quantify expression genome-wide in a single assay. The volume and complexity of data from RNA-seq experiments necessitate scalable, fast and mathematically principled analysis software. TopHat and Cufflinks are free, open-source software tools for gene discovery and comprehensive expression analysis of high-throughput mRNA sequencing (RNA-seq) data. Together, they allow biologists to identify new genes and new splice variants of known ones, as well as compare gene and transcript expression under two or more conditions. This protocol describes in detail how to use TopHat and Cufflinks to perform such analyses. It also covers several accessory tools and utilities that aid in managing data, including CummeRbund, a tool for visualizing RNA-seq analysis results. Although the procedure assumes basic informatics skills, these tools assume little to no background with RNA-seq analysis and are meant for novices and experts alike. The protocol begins with raw sequencing reads and produces a transcriptome assembly, lists of differentially expressed and regulated genes and transcripts, and publication-quality visualizations of analysis results. The protocol's execution time depends on the volume of transcriptome sequencing data and available computing resources but takes less than 1 d of computer time for typical experiments and ~1 h of hands-on time. PMID:22383036
Zhao, M; Chen, M; Tan, A S C; Cheah, F S H; Mathew, J; Wong, P C; Chong, S S
2017-07-01
Essentials Preimplantation genetic diagnosis (PGD) of severe hemophilia A relies on linkage analysis. Simultaneous multi-marker screening can simplify selection of informative markers in a couple. We developed a single-tube tetradecaplex panel of polymorphic markers for hemophilia A PGD use. Informative markers can be used for linkage analysis alone or combined with mutation detection. Background It is currently not possible to perform single-cell preimplantation genetic diagnosis (PGD) to directly detect the common inversion mutations of the factor VIII (F8) gene responsible for severe hemophilia A (HEMA). As such, PGD for such inversion carriers relies on indirect analysis of linked polymorphic markers. Objectives To simplify linkage-based PGD of HEMA, we aimed to develop a panel of highly polymorphic microsatellite markers located near the F8 gene that could be simultaneously genotyped in a multiplex-PCR reaction. Methods We assessed the polymorphism of various microsatellite markers located ≤ 1 Mb from F8 in 177 female subjects. Highly polymorphic markers were selected for co-amplification with the AMELX/Y indel dimorphism in a single-tube reaction. Results Thirteen microsatellite markers located within 0.6 Mb of F8 were successfully co-amplified with AMELX/Y in a single-tube reaction. Observed heterozygosities of component markers ranged from 0.43 to 0.84, and ∼70-80% of individuals were heterozygous for ≥ 5 markers. The tetradecaplex panel successfully identified fully informative markers in a couple interested in PGD for HEMA because of an intragenic F8 point mutation, with haplotype phasing established through a carrier daughter. In-vitro fertilization (IVF)-PGD involved single-tube co-amplification of fully informative markers with AMELX/Y and the mutation-containing F8 amplicon, followed by microsatellite analysis and amplicon mutation-site minisequencing analysis. Conclusions The single-tube multiplex-PCR format of this highly polymorphic microsatellite marker panel simplifies identification and selection of informative markers for linkage-based PGD of HEMA. Informative markers can also be easily co-amplified with mutation-containing F8 amplicons for combined mutation detection and linkage analysis. © 2017 International Society on Thrombosis and Haemostasis.
A small indel mutation in an anthocyanin transporter causes variegated colouration of peach flowers.
Cheng, Jun; Liao, Liao; Zhou, Hui; Gu, Chao; Wang, Lu; Han, Yuepeng
2015-12-01
The ornamental peach cultivar 'Hongbaihuatao (HBH)' can simultaneously bear pink, red, and variegated flowers on a single tree. Anthocyanin content in pink flowers is extremely low, being only 10% that of a red flower. Surprisingly, the expression of anthocyanin structural and potential regulatory genes in white flowers was not significantly lower than that in both pink and red flowers. However, proteomic analysis revealed a GST encoded by a gene-regulator involved in anthocyanin transport (Riant)-which is expressed in the red flower, but almost undetectable in the variegated flower. The Riant gene contains an insertion-deletion (indel) polymorphism in exon 3. In white flowers, the Riant gene is interrupted by a 2-bp insertion in the last exon, which causes a frameshift and a premature stop codon. In contrast, both pink and red flowers that arise from bud sports are heterozygous for the Riant locus, with one functional allele due to the 2-bp deletion or a novel 1-bp insertion. Southern blot analysis indicated that the Riant gene occurs in a single copy in the peach genome and it is not interrupted by a transposon. The function of the Riant gene was confirmed by its ectopic expression in the Arabidopsis tt19 mutant, where it complements the anthocyanin phenotype, but not the proanthocyanidin pigmentation in seed coat. Collectively,these results indicate that a small indel mutation in the Riant gene, which is not the result of a transposon insertion or excision, causes variegated colouration of peach flowers. © The Author 2015. Published by Oxford University Press on behalf of the Society for Experimental Biology.
2012-01-01
Background The aim of this study was to evaluate the potential association between single nucleotide polymorphisms related response to radiotherapy injury, such as genes related to DNA repair or enzymes involved in anti-oxidative activities. The paper aims to identify marker genes able to predict an increased risk of late toxicity studying our group of patients who underwent a Single Shot 3D-CRT PBI (SSPBI) after BCS (breast conserving surgery). Methods A total of 57 breast cancer patients who underwent SSPBI were genotyped for SNPs (single nucleotide polymorphisms) in XRCC1, XRCC3, GST and RAD51 by Pyrosequencing technology. Univariate analysis (ORs and 95% CI) was performed to correlate SNPs with the risk of developing ≥ G2 fibrosis or fat necrosis. Results A higher significant risk of developing ≥ G2 fibrosis or fat necrosis in patients with: polymorphic variant GSTP1 (Ile105Val) (OR = 2.9; 95%CI, 0.88-10.14, p = 0.047). Conclusions The presence of some SNPs involved in DNA repair or response to oxidative stress seem to be able to predict late toxicity. Trial Registration ClinicalTrials.gov: NCT01316328 PMID:22272830
The Complete Chloroplast Genome of Wild Rice (Oryza minuta) and Its Comparison to Related Species.
Asaf, Sajjad; Waqas, Muhammad; Khan, Abdul L; Khan, Muhammad A; Kang, Sang-Mo; Imran, Qari M; Shahzad, Raheem; Bilal, Saqib; Yun, Byung-Wook; Lee, In-Jung
2017-01-01
Oryza minuta , a tetraploid wild relative of cultivated rice (family Poaceae), possesses a BBCC genome and contains genes that confer resistance to bacterial blight (BB) and white-backed (WBPH) and brown (BPH) plant hoppers. Based on the importance of this wild species, this study aimed to understand the phylogenetic relationships of O. minuta with other Oryza species through an in-depth analysis of the composition and diversity of the chloroplast (cp) genome. The analysis revealed a cp genome size of 135,094 bp with a typical quadripartite structure and consisting of a pair of inverted repeats separated by small and large single copies, 139 representative genes, and 419 randomly distributed microsatellites. The genomic organization, gene order, GC content and codon usage are similar to those of typical angiosperm cp genomes. Approximately 30 forward, 28 tandem and 20 palindromic repeats were detected in the O . minuta cp genome. Comparison of the complete O. minuta cp genome with another eleven Oryza species showed a high degree of sequence similarity and relatively high divergence of intergenic spacers. Phylogenetic analyses were conducted based on the complete genome sequence, 65 shared genes and matK gene showed same topologies and O. minuta forms a single clade with parental O. punctata . Thus, the complete O . minuta cp genome provides interesting insights and valuable information that can be used to identify related species and reconstruct its phylogeny.
DOE Office of Scientific and Technical Information (OSTI.GOV)
Willis, Leslie G.; Siepp, Robyn; Stewart, Taryn M.
2005-08-01
The genome of the Trichoplusia ni single nucleopolyhedrovirus (TnSNPV), a group II NPV which infects the cabbage looper (T. ni), has been completely sequenced and analyzed. The TnSNPV DNA genome consists of 134,394 bp and has an overall G + C content of 39%. Gene analysis predicted 144 open reading frames (ORFs) of 150 nucleotides or greater that showed minimal overlap. Comparisons with previously sequenced baculoviruses indicate that 119 TnSNPV ORFs were homologues of previously reported viral gene sequences. Ninety-four TnSNPV ORFs returned an Autographa californica multiple NPV (AcMNPV) homologue while 25 ORFs returned poor or no sequence matches withmore » the current databases. A putative photolyase gene was also identified that had highest amino acid identity to the photolyase genes of Chrysodeixis chalcites NPV (ChchNPV) (47%) and Danio rerio (zebrafish) (40%). In addition unlike all other baculoviruses no obvious homologous repeat (hr) sequences were identified. Comparison of the TnSNPV and AcMNPV genomes provides a unique opportunity to examine two baculoviruses that are highly virulent for a common insect host (T. ni) yet belong to diverse baculovirus taxonomic groups and possess distinct biological features. In vitro fusion assays demonstrated that the TnSNPV F protein induces membrane fusion and syncytia formation and were compared to syncytia formed by AcMNPV GP64.« less
Collins, Christine R; Das, Sujaan; Wong, Eleanor H; Andenmatten, Nicole; Stallmach, Robert; Hackett, Fiona; Herman, Jean-Paul; Müller, Sylke; Meissner, Markus; Blackman, Michael J
2013-05-01
Asexual blood stages of the malaria parasite, which cause all the pathology associated with malaria, can readily be genetically modified by homologous recombination, enabling the functional study of parasite genes that are not essential in this part of the life cycle. However, no widely applicable method for conditional mutagenesis of essential asexual blood-stage malarial genes is available, hindering their functional analysis. We report the application of the DiCre conditional recombinase system to Plasmodium falciparum, the causative agent of the most dangerous form of malaria. We show that DiCre can be used to obtain rapid, highly regulated site-specific recombination in P. falciparum, capable of excising loxP-flanked sequences from a genomic locus with close to 100% efficiency within the time-span of a single erythrocytic growth cycle. DiCre-mediated deletion of the SERA5 3' UTR failed to reduce expression of the gene due to the existence of alternative cryptic polyadenylation sites within the modified locus. However, we successfully used the system to recycle the most widely used drug resistance marker for P. falciparum, human dihydrofolate reductase, in the process producing constitutively DiCre-expressing P. falciparum clones that have broad utility for the functional analysis of essential asexual blood-stage parasite genes. © 2013 John Wiley & Sons Ltd.
Nucleotide sequence of the coat protein gene of Lettuce big-vein virus.
Sasaya, T; Ishikawa, K; Koganezawa, H
2001-06-01
A sequence of 1425 nt was established that included the complete coat protein (CP) gene of Lettuce big-vein virus (LBVV). The LBVV CP gene encodes a 397 amino acid protein with a predicted M(r) of 44486. Antisera raised against synthetic peptides corresponding to N-terminal or C-terminal parts of the LBVV CP reacted in Western blot analysis with a protein with an M(r) of about 48000. RNA extracted from purified particles of LBVV by using proteinase K, SDS and phenol migrated in gels as two single-stranded RNA species of approximately 7.3 kb (ss-1) and 6.6 kb (ss-2). After denaturation by heat and annealing at room temperature, the RNA migrated as four species, ss-1, ss-2 and two additional double-stranded RNAs (ds-1 and ds-2). The Northern blot hybridization analysis using riboprobes from a full-length clone of the LBVV CP gene indicated that ss-2 has a negative-sense nature and contains the LBVV CP gene. Moreover, ds-2 is a double-stranded form of ss-2. Database searches showed that the LBVV CP most resembled the nucleocapsid proteins of rhabdoviruses. These results indicate that it would be appropriate to classify LBVV as a negative-sense single-stranded RNA virus rather than as a double-stranded RNA virus.
Xie, Qing; Shen, Kang-Ning; Hao, Xiuying; Nam, Phan Nhut; Ngoc Hieu, Bui Thi; Chen, Ching-Hung; Zhu, Changqing; Lin, Yen-Chang; Hsiao, Chung-Der
2017-03-01
abtract We decoded the complete chloroplast DNA (cpDNA) sequence of the Tianshan Snow Lotus (Saussurea involucrata), a famous traditional Chinese medicinal plant of the family Asteraceae, by using next-generation sequencing technology. The genome consists of 152 490 bp containing a pair of inverted repeats (IRs) of 25 202 bp, which was separated by a large single-copy region and a small single-copy region of 83 446 bp and 18 639 bp, respectively. The genic regions account for 57.7% of whole cpDNA, and the GC content of the cpDNA was 37.7%. The S. involucrata cpDNA encodes 114 unigenes (82 protein-coding genes, 4 rRNA genes, and 28 tRNA genes). There are eight protein-coding genes (atpF, ndhA, ndhB, rpl2, rpoC1, rps16, clpP, and ycf3) and five tRNA genes (trnA-UGC, trnI-GAU, trnK-UUU, trnL-UAA, and trnV-UAC) containing introns. A phylogenetic analysis of the 11 complete cpDNA from Asteracease showed that S. involucrata is closely related to Centaurea diffusa (Diffuse Knapweed). The complete cpDNA of S. involucrata provides essential and important DNA molecular data for further phylogenetic and evolutionary analysis for Asteraceae.
Kulkarni, Krishnanand P; Patil, Gunvant; Valliyodan, Babu; Vuong, Tri D; Shannon, J Grover; Nguyen, Henry T; Lee, Jeong-Dong
2018-03-01
The objective of this study was to determine the genetic relationship between the oleic acid and protein content. The genotypes having high oleic acid and elevated protein (HOEP) content were crossed with five elite lines having normal oleic acid and average protein (NOAP) content. The selected accessions were grown at six environments in three different locations and phenotyped for protein, oil, and fatty acid components. The mean protein content of parents, HOEP, and NOAP lines was 34.6%, 38%, and 34.9%, respectively. The oleic acid concentration of parents, HOEP, and NOAP lines was 21.7%, 80.5%, and 20.8%, respectively. The HOEP plants carried both FAD2-1A (S117N) and FAD2-1B (P137R) mutant alleles contributing to the high oleic acid phenotype. Comparative genome analysis using whole-genome resequencing data identified six genes having single nucleotide polymorphism (SNP) significantly associated with the traits analyzed. A single SNP in the putative gene Glyma.10G275800 was associated with the elevated protein content, and palmitic, oleic, and linoleic acids. The genes from the marker intervals of previously identified QTL did not carry SNPs associated with protein content and fatty acid composition in the lines used in this study, indicating that all the genes except Glyma.10G278000 may be the new genes associated with the respective traits.
Blakely, Collin M.; Watkins, Thomas B.K.; Wu, Wei; Gini, Beatrice; Chabon, Jacob J.; McCoach, Caroline E.; McGranahan, Nicholas; Wilson, Gareth A.; Birkbak, Nicolai J.; Olivas, Victor R.; Rotow, Julia; Maynard, Ashley; Wang, Victoria; Gubens, Matthew A.; Banks, Kimberly C.; Lanman, Richard B.; Caulin, Aleah F.; John, John St.; Cordero, Anibal R.; Giannikopoulos, Petros; Simmons, Andrew D.; Mack, Philip C.; Gandara, David R.; Husain, Hatim; Doebele, Robert C.; Riess, Jonathan W.; Diehn, Maximilian; Swanton, Charles; Bivona, Trever G.
2017-01-01
A widespread approach to modern cancer therapy is to identify a single oncogenic driver gene and target its mutant protein product (e.g. EGFR inhibitor treatment in EGFR-mutant lung cancers). However, genetically-driven resistance to targeted therapy limits patient survival. Through genomic analysis of 1122 EGFR-mutant lung cancer cell-free DNA samples and whole exome analysis of seven longitudinally collected tumor samples from an EGFR-mutant lung cancer patient, we identify critical co-occurring oncogenic events present in most advanced-stage EGFR-mutant lung cancers. We define new pathways limiting EGFR inhibitor response, including WNT/β-catenin and cell cycle gene (e.g. CDK4, CDK6) alterations. Tumor genomic complexity increases with EGFR inhibitor treatment and co-occurring alterations in CTNNB1, and PIK3CA exhibit non-redundant functions that cooperatively promote tumor metastasis or limit EGFR inhibitor response. This study challenges the prevailing single-gene driver oncogene view and links clinical outcomes to co-occurring genetic alterations in advanced-stage EGFR-mutant lung cancer patients. PMID:29106415
Tettelin, Hervé; Masignani, Vega; Cieslewicz, Michael J.; Donati, Claudio; Medini, Duccio; Ward, Naomi L.; Angiuoli, Samuel V.; Crabtree, Jonathan; Jones, Amanda L.; Durkin, A. Scott; DeBoy, Robert T.; Davidsen, Tanja M.; Mora, Marirosa; Scarselli, Maria; Margarit y Ros, Immaculada; Peterson, Jeremy D.; Hauser, Christopher R.; Sundaram, Jaideep P.; Nelson, William C.; Madupu, Ramana; Brinkac, Lauren M.; Dodson, Robert J.; Rosovitz, Mary J.; Sullivan, Steven A.; Daugherty, Sean C.; Haft, Daniel H.; Selengut, Jeremy; Gwinn, Michelle L.; Zhou, Liwei; Zafar, Nikhat; Khouri, Hoda; Radune, Diana; Dimitrov, George; Watkins, Kisha; O'Connor, Kevin J. B.; Smith, Shannon; Utterback, Teresa R.; White, Owen; Rubens, Craig E.; Grandi, Guido; Madoff, Lawrence C.; Kasper, Dennis L.; Telford, John L.; Wessels, Michael R.; Rappuoli, Rino; Fraser, Claire M.
2005-01-01
The development of efficient and inexpensive genome sequencing methods has revolutionized the study of human bacterial pathogens and improved vaccine design. Unfortunately, the sequence of a single genome does not reflect how genetic variability drives pathogenesis within a bacterial species and also limits genome-wide screens for vaccine candidates or for antimicrobial targets. We have generated the genomic sequence of six strains representing the five major disease-causing serotypes of Streptococcus agalactiae, the main cause of neonatal infection in humans. Analysis of these genomes and those available in databases showed that the S. agalactiae species can be described by a pan-genome consisting of a core genome shared by all isolates, accounting for ≈80% of any single genome, plus a dispensable genome consisting of partially shared and strain-specific genes. Mathematical extrapolation of the data suggests that the gene reservoir available for inclusion in the S. agalactiae pan-genome is vast and that unique genes will continue to be identified even after sequencing hundreds of genomes. PMID:16172379
Characterization of a novel variant of Mycobacterium chimaera.
van Ingen, J; Hoefsloot, W; Buijtels, P C A M; Tortoli, E; Supply, P; Dekhuijzen, P N R; Boeree, M J; van Soolingen, D
2012-09-01
In this study, nonchromogenic mycobacteria were isolated from pulmonary samples of three patients in the Netherlands. All isolates had identical, unique 16S rRNA gene and 16S-23S ITS sequences, which were closely related to those of Mycobacterium chimaera and Mycobacterium marseillense. The biochemical features of the isolates differed slightly from those of M. chimaera, suggesting that the isolates may represent a possible separate species within the Mycobacterium avium complex (MAC). However, the cell-wall mycolic acid pattern, analysed by HPLC, and the partial sequences of the hsp65 and rpoB genes were identical to those of M. chimaera. We concluded that the isolates represent a novel variant of M. chimaera. The results of this analysis have led us to question the currently used methods of species definition for members of the genus Mycobacterium, which are based largely on 16S rRNA or rpoB gene sequencing. Definitions based on a single genetic target are likely to be insufficient. Genetic divergence, especially in the MAC, yields strains that cannot be confidently assigned to a specific species based on the analysis of a single genetic target.
Gao, Minghong; Liu, Jiwen; Qiao, Yanlu; Zhao, Meixun; Zhang, Xiao-Hua
2017-04-01
Investigating the environmental influence on the community composition and abundance of denitrifiers in marine sediment ecosystem is essential for understanding of the ecosystem-level controls on the biogeochemical process of denitrification. In the present study, nirK-harboring denitrifying communities in different mud deposit zones of eastern China marginal seas (ECMS) were investigated via clone library analysis. The abundance of three functional genes affiliated with denitrification (narG, nirK, nosZ) was assessed by fluorescent quantitative PCR. The nirK-harboring microbiota were dominated by a few operational taxonomic units (OTUs), which were widely distributed in different sites with each site harboring their unique phylotypes. The mean abundance of nirK was significantly higher than that of narG and nosZ genes, and the abundance of narG was higher than that of nosZ. The inconsistent abundance profile of different functional genes along the process of denitrification might indicate that nitrite reduction occurred independently of denitrification in the mud deposit zones of ECMS, and sedimentary denitrification was accomplished by cooperation of different denitrifying species rather than a single species. Such important information would be missed when targeting only a single denitrifying functional gene. Analysis of correlation between abundance ratios and environmental factors revealed that the response of denitrifiers to environmental factors was not invariable in different mud deposit zones. Our results suggested that a comprehensive analysis of different denitrifying functional genes may gain more information about the dynamics of denitrifying microbiota in marine sediments.
de Vega-Bartol, José J; Santos, Raquen Raissa; Simões, Marta; Miguel, Célia M
2013-05-01
Suitable internal control genes to normalize qPCR data from different stages of embryo development and germination were identified in two representative conifer species. Clonal propagation by somatic embryogenesis has a great application potentiality in conifers. Quantitative PCR (qPCR) is widely used for gene expression analysis during somatic embryogenesis and embryo germination. No single reference gene is universal, so a systematic characterization of endogenous genes for concrete conditions is fundamental for accuracy. We identified suitable internal control genes to normalize qPCR data obtained at different steps of somatic embryogenesis (embryonal mass proliferation, embryo maturation and germination) in two representative conifer species, Pinus pinaster and Picea abies. Candidate genes included endogenous genes commonly used in conifers, genes previously tested in model plants, and genes with a lower variation of the expression along embryo development according to genome-wide transcript profiling studies. Three different algorithms were used to evaluate expression stability. The geometric average of the expression values of elongation factor-1α, α-tubulin and histone 3 in P. pinaster, and elongation factor-1α, α-tubulin, adenosine kinase and CAC in P. abies were adequate for expression studies throughout somatic embryogenesis. However, improved accuracy was achieved when using other gene combinations in experiments with samples at a single developmental stage. The importance of studies selecting reference genes to use in different tissues or developmental stages within one or close species, and the instability of commonly used reference genes, is highlighted.
Selisana, S M; Yanoria, M J; Quime, B; Chaipanya, C; Lu, G; Opulencia, R; Wang, G-L; Mitchell, T; Correll, J; Talbot, N J; Leung, H; Zhou, B
2017-06-01
Avirulence (AVR) genes in Magnaporthe oryzae, the fungal pathogen that causes the devastating rice blast disease, have been documented to be major targets subject to mutations to avoid recognition by resistance (R) genes. In this study, an AVR-gene-based diagnosis tool for determining the virulence spectrum of a rice blast pathogen population was developed and validated. A set of 77 single-spore field isolates was subjected to pathotype analysis using differential lines, each containing a single R gene, and classified into 20 virulent pathotypes, except for 4 isolates that lost pathogenicity. In all, 10 differential lines showed low frequency (<24%) of resistance whereas 8 lines showed a high frequency (>95%), inferring the effectiveness of R genes present in the respective differential lines. In addition, the haplotypes of seven AVR genes were determined by polymerase chain reaction amplification and sequencing, if applicable. The calculated frequency of different AVR genes displayed significant variations in the population. AVRPiz-t and AVR-Pii were detected in 100 and 84.9% of the isolates, respectively. Five AVR genes such as AVR-Pik-D (20.5%) and AVR-Pik-E (1.4%), AVRPiz-t (2.7%), AVR-Pita (0%), AVR-Pia (0%), and AVR1-CO39 (0%) displayed low or even zero frequency. The frequency of AVR genes correlated almost perfectly with the resistance frequency of the cognate R genes in differential lines, except for International Rice Research Institute-bred blast-resistant lines IRBLzt-T, IRBLta-K1, and IRBLkp-K60. Both genetic analysis and molecular marker validation revealed an additional R gene, most likely Pi19 or its allele, in these three differential lines. This can explain the spuriously higher resistance frequency of each target R gene based on conventional pathotyping. This study demonstrates that AVR-gene-based diagnosis provides a precise, R-gene-specific, and differential line-free assessment method that can be used for determining the virulence spectrum of a rice blast pathogen population and for predicting the effectiveness of target R genes in rice varieties.
Isolation, sequence, and characterization of the Cercospora nicotianae phytoene dehydrogenase gene.
Ehrenshaft, M; Daub, M E
1994-01-01
We have cloned and sequenced the Cercospora nicotianae gene for the carotenoid biosynthetic enzyme phytoene dehydrogenase. Analysis of the derived amino acid sequence revealed it has greater than 50% identity with its counterpart in Neurospora crassa and approximately 30% identity with prokaryotic phytoene dehydrogenases and is related, but more distantly, to phytoene dehydrogenases from plants and cyanobacteria. Our analysis confirms that phytoene dehydrogenase proteins fall into two groups: those from plants and cyanobacteria and those from eukaryotic and noncyanobacter prokaryotic microbes. Southern analysis indicated that the C. nicotianae phytoene dehydrogenase gene is present in a single copy. Extraction of beta-carotene, the sole carotenoid accumulated by C. nicotianae, showed that both light- and dark-grown cultures synthesize carotenoids, but higher levels accumulate in the light. Northern (RNA) analysis of poly(A)+ RNA, however, showed no differential accumulation of phytoene dehydrogenase mRNA between light- and dark-grown fungal cultures. Images PMID:8085820
TSCAN: Pseudo-time reconstruction and evaluation in single-cell RNA-seq analysis
Ji, Zhicheng; Ji, Hongkai
2016-01-01
When analyzing single-cell RNA-seq data, constructing a pseudo-temporal path to order cells based on the gradual transition of their transcriptomes is a useful way to study gene expression dynamics in a heterogeneous cell population. Currently, a limited number of computational tools are available for this task, and quantitative methods for comparing different tools are lacking. Tools for Single Cell Analysis (TSCAN) is a software tool developed to better support in silico pseudo-Time reconstruction in Single-Cell RNA-seq ANalysis. TSCAN uses a cluster-based minimum spanning tree (MST) approach to order cells. Cells are first grouped into clusters and an MST is then constructed to connect cluster centers. Pseudo-time is obtained by projecting each cell onto the tree, and the ordered sequence of cells can be used to study dynamic changes of gene expression along the pseudo-time. Clustering cells before MST construction reduces the complexity of the tree space. This often leads to improved cell ordering. It also allows users to conveniently adjust the ordering based on prior knowledge. TSCAN has a graphical user interface (GUI) to support data visualization and user interaction. Furthermore, quantitative measures are developed to objectively evaluate and compare different pseudo-time reconstruction methods. TSCAN is available at https://github.com/zji90/TSCAN and as a Bioconductor package. PMID:27179027
TSCAN: Pseudo-time reconstruction and evaluation in single-cell RNA-seq analysis.
Ji, Zhicheng; Ji, Hongkai
2016-07-27
When analyzing single-cell RNA-seq data, constructing a pseudo-temporal path to order cells based on the gradual transition of their transcriptomes is a useful way to study gene expression dynamics in a heterogeneous cell population. Currently, a limited number of computational tools are available for this task, and quantitative methods for comparing different tools are lacking. Tools for Single Cell Analysis (TSCAN) is a software tool developed to better support in silico pseudo-Time reconstruction in Single-Cell RNA-seq ANalysis. TSCAN uses a cluster-based minimum spanning tree (MST) approach to order cells. Cells are first grouped into clusters and an MST is then constructed to connect cluster centers. Pseudo-time is obtained by projecting each cell onto the tree, and the ordered sequence of cells can be used to study dynamic changes of gene expression along the pseudo-time. Clustering cells before MST construction reduces the complexity of the tree space. This often leads to improved cell ordering. It also allows users to conveniently adjust the ordering based on prior knowledge. TSCAN has a graphical user interface (GUI) to support data visualization and user interaction. Furthermore, quantitative measures are developed to objectively evaluate and compare different pseudo-time reconstruction methods. TSCAN is available at https://github.com/zji90/TSCAN and as a Bioconductor package. © The Author(s) 2016. Published by Oxford University Press on behalf of Nucleic Acids Research.
Osborne, Peter W; Benoit, Gérard; Laudet, Vincent; Schubert, Michael; Ferrier, David E K
2009-03-01
The ParaHox cluster is the evolutionary sister to the Hox cluster. Like the Hox cluster, the ParaHox cluster displays spatial and temporal regulation of the component genes along the anterior/posterior axis in a manner that correlates with the gene positions within the cluster (a feature called collinearity). The ParaHox cluster is however a simpler system to study because it is composed of only three genes. We provide a detailed analysis of the amphioxus ParaHox cluster and, for the first time in a single species, examine the regulation of the cluster in response to a single developmental signalling molecule, retinoic acid (RA). Embryos treated with either RA or RA antagonist display altered ParaHox gene expression: AmphiGsx expression shifts in the neural tube, and the endodermal boundary between AmphiXlox and AmphiCdx shifts its anterior/posterior position. We identified several putative retinoic acid response elements and in vitro assays suggest some may participate in RA regulation of the ParaHox genes. By comparison to vertebrate ParaHox gene regulation we explore the evolutionary implications. This work highlights how insights into the regulation and evolution of more complex vertebrate arrangements can be obtained through studies of a simpler, unduplicated amphioxus gene cluster.
Severgnini, Marco; Bicciato, Silvio; Mangano, Eleonora; Scarlatti, Francesca; Mezzelani, Alessandra; Mattioli, Michela; Ghidoni, Riccardo; Peano, Clelia; Bonnal, Raoul; Viti, Federica; Milanesi, Luciano; De Bellis, Gianluca; Battaglia, Cristina
2006-06-01
Meta-analysis of microarray data is increasingly important, considering both the availability of multiple platforms using disparate technologies and the accumulation in public repositories of data sets from different laboratories. We addressed the issue of comparing gene expression profiles from two microarray platforms by devising a standardized investigative strategy. We tested this procedure by studying MDA-MB-231 cells, which undergo apoptosis on treatment with resveratrol. Gene expression profiles were obtained using high-density, short-oligonucleotide, single-color microarray platforms: GeneChip (Affymetrix) and CodeLink (Amersham). Interplatform analyses were carried out on 8414 common transcripts represented on both platforms, as identified by LocusLink ID, representing 70.8% and 88.6% of annotated GeneChip and CodeLink features, respectively. We identified 105 differentially expressed genes (DEGs) on CodeLink and 42 DEGs on GeneChip. Among them, only 9 DEGs were commonly identified by both platforms. Multiple analyses (BLAST alignment of probes with target sequences, gene ontology, literature mining, and quantitative real-time PCR) permitted us to investigate the factors contributing to the generation of platform-dependent results in single-color microarray experiments. An effective approach to cross-platform comparison involves microarrays of similar technologies, samples prepared by identical methods, and a standardized battery of bioinformatic and statistical analyses.
Watanabe, Mutsumi; Mochida, Keiichi; Kato, Tomohiko; Tabata, Satoshi; Yoshimoto, Naoko; Noji, Masaaki; Saito, Kazuki
2008-01-01
Ser acetyltransferase (SERAT), which catalyzes O-acetyl-Ser (OAS) formation, plays a key role in sulfur assimilation and Cys synthesis. Despite several studies on SERATs from various plant species, the in vivo function of multiple SERAT genes in plant cells remains unaddressed. Comparative genomics studies with the five genes of the SERAT gene family in Arabidopsis thaliana indicated that all three Arabidopsis SERAT subfamilies are conserved across five plant species with available genome sequences. Single and multiple knockout mutants of all Arabidopsis SERAT gene family members were analyzed. All five quadruple mutants with a single gene survived, with three mutants showing dwarfism. However, the quintuple mutant lacking all SERAT genes was embryo-lethal. Thus, all five isoforms show functional redundancy in vivo. The developmental and compartment-specific roles of each SERAT isoform were also demonstrated. Mitochondrial SERAT2;2 plays a predominant role in cellular OAS formation, while plastidic SERAT2;1 contributes less to OAS formation and subsequent Cys synthesis. Three cytosolic isoforms, SERAT1;1, SERAT3;1, and SERAT3;2, may play a major role during seed development. Thus, the evolutionally conserved SERAT gene family is essential in cellular processes, and the substrates and products of SERAT must be exchangeable between the cytosol and organelles. PMID:18776059
The complete chloroplast genome of North American ginseng, Panax quinquefolius.
Han, Zeng-Jie; Li, Wei; Liu, Yuan; Gao, Li-Zhi
2016-09-01
We report complete nucleotide sequence of the Panax quinquefolius chloroplast genome using next-generation sequencing technology. The genome size is 156 359 bp, including two inverted repeats (IRs) of 52 153 bp, separated by the large single-copy (LSC 86 184 bp) and small single-copy (SSC 18 081 bp) regions. This cp genome encodes 114 unigenes (80 protein-coding genes, four rRNA genes, and 30 tRNA genes), in which 18 are duplicated in the IR regions. Overall GC content of the genome is 38.08%. A phylogenomic analysis of the 10 complete chloroplast genomes from Araliaceae using Daucus carota from Apiaceae as outgroup showed that P. quinquefolius is closely related to the other two members of the genus Panax, P. ginseng and P. notoginseng.
Morphological Identification and Single-Cell Genomics of Marine Diplonemids.
Gawryluk, Ryan M R; Del Campo, Javier; Okamoto, Noriko; Strassert, Jürgen F H; Lukeš, Julius; Richards, Thomas A; Worden, Alexandra Z; Santoro, Alyson E; Keeling, Patrick J
2016-11-21
Recent global surveys of marine biodiversity have revealed that a group of organisms known as "marine diplonemids" constitutes one of the most abundant and diverse planktonic lineages [1]. Though discovered over a decade ago [2, 3], their potential importance was unrecognized, and our knowledge remains restricted to a single gene amplified from environmental DNA, the 18S rRNA gene (small subunit [SSU]). Here, we use single-cell genomics (SCG) and microscopy to characterize ten marine diplonemids, isolated from a range of depths in the eastern North Pacific Ocean. Phylogenetic analysis confirms that the isolates reflect the entire range of marine diplonemid diversity, and comparisons to environmental SSU surveys show that sequences from the isolates range from rare to superabundant, including the single most common marine diplonemid known. SCG generated a total of ∼915 Mbp of assembled sequence across all ten cells and ∼4,000 protein-coding genes with homologs in the Kyoto Encyclopedia of Genes and Genomes (KEGG) orthology database, distributed across categories expected for heterotrophic protists. Models of highly conserved genes indicate a high density of non-canonical introns, lacking conventional GT-AG splice sites. Mapping metagenomic datasets [4] to SCG assemblies reveals virtually no overlap, suggesting that nuclear genomic diversity is too great for representative SCG data to provide meaningful phylogenetic context to metagenomic datasets. This work provides an entry point to the future identification, isolation, and cultivation of these elusive yet ecologically important cells. The high density of nonconventional introns, however, also portends difficulty in generating accurate gene models and highlights the need for the establishment of stable cultures and transcriptomic analyses. Copyright © 2016 Elsevier Ltd. All rights reserved.
High-throughput full-length single-cell mRNA-seq of rare cells.
Ooi, Chin Chun; Mantalas, Gary L; Koh, Winston; Neff, Norma F; Fuchigami, Teruaki; Wong, Dawson J; Wilson, Robert J; Park, Seung-Min; Gambhir, Sanjiv S; Quake, Stephen R; Wang, Shan X
2017-01-01
Single-cell characterization techniques, such as mRNA-seq, have been applied to a diverse range of applications in cancer biology, yielding great insight into mechanisms leading to therapy resistance and tumor clonality. While single-cell techniques can yield a wealth of information, a common bottleneck is the lack of throughput, with many current processing methods being limited to the analysis of small volumes of single cell suspensions with cell densities on the order of 107 per mL. In this work, we present a high-throughput full-length mRNA-seq protocol incorporating a magnetic sifter and magnetic nanoparticle-antibody conjugates for rare cell enrichment, and Smart-seq2 chemistry for sequencing. We evaluate the efficiency and quality of this protocol with a simulated circulating tumor cell system, whereby non-small-cell lung cancer cell lines (NCI-H1650 and NCI-H1975) are spiked into whole blood, before being enriched for single-cell mRNA-seq by EpCAM-functionalized magnetic nanoparticles and the magnetic sifter. We obtain high efficiency (> 90%) capture and release of these simulated rare cells via the magnetic sifter, with reproducible transcriptome data. In addition, while mRNA-seq data is typically only used for gene expression analysis of transcriptomic data, we demonstrate the use of full-length mRNA-seq chemistries like Smart-seq2 to facilitate variant analysis of expressed genes. This enables the use of mRNA-seq data for differentiating cells in a heterogeneous population by both their phenotypic and variant profile. In a simulated heterogeneous mixture of circulating tumor cells in whole blood, we utilize this high-throughput protocol to differentiate these heterogeneous cells by both their phenotype (lung cancer versus white blood cells), and mutational profile (H1650 versus H1975 cells), in a single sequencing run. This high-throughput method can help facilitate single-cell analysis of rare cell populations, such as circulating tumor or endothelial cells, with demonstrably high-quality transcriptomic data.
Association of Ugrp2 gene polymorphisms with adenoid hypertrophy in the pediatric population.
Atilla, Mahmut Huntürk; Özdaş, Sibel; Özdaş, Talih; Baştimur, Sibel; Muz, Sami Engin; Öz, Işılay; Kurt, Kenan; İzbirak, Afife; Babademez, Mehmet Ali; Vatandaş, Nilgün
2017-08-01
Adenoid hypertrophy is a condition that presents itself as the chronic enlargement of adenoid tissues; it is frequently observed in the pediatric population. The Ugrp2 gene, a member of the secretoglobin superfamily, encodes a low-molecular weight protein that functions in the differentiation of upper airway epithelial cells. However, little is known about the association of Ugrp2 genetic variations with adenoid hypertrophy. The aim of this study is to investigate the association of single nucleotide polymorphisms in the Ugrp2 gene with adenoid hypertrophy and its related phenotypes. A total of 219 children, comprising 114 patients suffering from adenoid hypertrophy and 105 healthy patients without adenoid hypertrophy, were enrolled in this study. Genotypes of the Ugrp2 gene were determined by DNA sequencing. We identified four single nucleotide polymorphisms (IVS1-189G>A, IVS1-89T>G, c.201delC, and IVS2-15G>A) in the Ugrp2 gene. Our genotype analysis showed that the Ugrp2 (IVS1-89T>G) TG and (c.201delC) CdelC genotypes and their minor alleles were associated with a considerable increase in the risk of adenoid hypertrophy compared with the controls (p=0.012, p=0.009, p=0.013, and p=0.037, respectively). Furthermore, Ugrp2 (GTdelCG, GTdelCA) haplotypes were significantly associated with adenoid hypertrophy (four single nucleotide polymorphisms ordered from 5' to 3'; p=0.0001). Polymorfism-Polymorfism interaction analysis indicated a strong interaction between combined genotypes of the Ugrp2 gene contributing to adenoid hypertrophy, as well as an increased chance of its diagnosis (p<0.0001). In addition, diplotypes carrying the mutant Ugrp2 (c.201delC) allele were strongly associated with an increased risk of adenoid hypertrophy with asthma and adenoid hypertrophy with allergies (p=0.003 and p=0.0007, respectively). Some single nucleotide polymorphisms and their combinations in the Ugrp2 gene are associated with an increased risk of developing adenoid hypertrophy. Therefore, we tried to underline the importance of genetic factors associated with adenoid hypertrophy and adenoid hypertrophy-related clinical phenotypes. Copyright © 2017 Associação Brasileira de Otorrinolaringologia e Cirurgia Cérvico-Facial. Published by Elsevier Editora Ltda. All rights reserved.
Yang, Hong; Lin, Shan; Cui, Jingru
2014-02-10
Arsenic trioxide (ATO) is presently the most active single agent in the treatment of acute promyelocytic leukemia (APL). In order to explore the molecular mechanism of ATO in leukemia cells with time series, we adopted bioinformatics strategy to analyze expression changing patterns and changes in transcription regulation modules of time series genes filtered from Gene Expression Omnibus database (GSE24946). We totally screened out 1847 time series genes for subsequent analysis. The KEGG (Kyoto encyclopedia of genes and genomes) pathways enrichment analysis of these genes showed that oxidative phosphorylation and ribosome were the top 2 significantly enriched pathways. STEM software was employed to compare changing patterns of gene expression with assigned 50 expression patterns. We screened out 7 significantly enriched patterns and 4 tendency charts of time series genes. The result of Gene Ontology showed that functions of times series genes mainly distributed in profiles 41, 40, 39 and 38. Seven genes with positive regulation of cell adhesion function were enriched in profile 40, and presented the same first increased model then decreased model as profile 40. The transcription module analysis showed that they mainly involved in oxidative phosphorylation pathway and ribosome pathway. Overall, our data summarized the gene expression changes in ATO treated K562-r cell lines with time and suggested that time series genes mainly regulated cell adhesive. Furthermore, our result may provide theoretical basis of molecular biology in treating acute promyelocytic leukemia. Copyright © 2013 Elsevier B.V. All rights reserved.
2014-01-01
Background Hepatocellular carcinoma (HCC) is one of the major causes of cancer-related death especially among Asian and African populations. It is urgent that we identify carcinogenesis-related genes to establish an innovative treatment strategy for this disease. Methods Triple-combination array analysis was performed using one pair each of HCC and noncancerous liver samples from a 68-year-old woman. This analysis consists of expression array, single nucleotide polymorphism array and methylation array. The gene encoding collagen type 1 alpha 1 (COL1A1) was identified and verified using HCC cell lines and 48 tissues from patients with primary HCC. Results Expression array revealed that COL1A1 gene expression was markedly decreased in tumor tissues (log2 ratio –1.1). The single nucleotide polymorphism array showed no chromosomal deletion in the locus of COL1A1. Importantly, the methylation value in the tumor tissue was higher (0.557) than that of the adjacent liver tissue (0.008). We verified that expression of this gene was suppressed by promoter methylation. Reactivation of COL1A1 expression by 5-aza-2′-deoxycytidine treatment was seen in HCC cell lines, and sequence analysis identified methylated CpG sites in the COL1A1 promoter region. Among 48 pairs of surgical specimens, 13 (27.1%) showed decreased COL1A1 mRNA expression in tumor sites. Among these 13 cases, 10 had promoter methylation at the tumor site. The log-rank test indicated that mRNA down-regulated tumors were significantly correlated with a poor overall survival rate (P = 0.013). Conclusions Triple-combination array analysis successfully identified COL1A1 as a candidate survival-related gene in HCCs. Epigenetic down-regulation of COL1A1 mRNA expression might have a role as a prognostic biomarker of HCC. PMID:24552139
Pasion, S G; Hines, J C; Aebersold, R; Ray, D S
1992-01-01
A type II DNA topoisomerase, topoIImt, was shown previously to be associated with the kinetoplast DNA of the trypanosomatid Crithidia fasciculata. The gene encoding this kinetoplast-associated topoisomerase has been cloned by immunological screening of a Crithidia genomic expression library with monoclonal antibodies raised against the purified enzyme. The gene CfaTOP2 is a single copy gene and is expressed as a 4.8-kb polyadenylated transcript. The nucleotide sequence of CfaTOP2 has been determined and encodes a predicted polypeptide of 1239 amino acids with a molecular mass of 138,445. The identification of the cloned gene is supported by immunoblot analysis of the beta-galactosidase-CfaTOP2 fusion protein expressed in Escherichia coli and by analysis of tryptic peptide sequences derived from purified topoIImt. CfaTOP2 shares significant homology with nuclear type II DNA topoisomerases of other eukaryotes suggesting that in Crithidia both nuclear and mitochondrial forms of topoisomerase II are encoded by the same gene.
Ventura, Marco; Kenny, John G; Zhang, Ziding; Fitzgerald, Gerald F; van Sinderen, Douwe
2005-09-01
The so-called clp genes, which encode components of the Clp proteolytic complex, are widespread among bacteria. The Bifidobacterium breve UCC 2003 genome contains a clpB gene with significant homology to predicted clpB genes from other members of the Actinobacteridae group. The heat- and osmotic-inducibility of the B. breve UCC 2003 clpB homologue was verified by slot-blot analysis, while Northern blot and primer extension analyses showed that the clpB gene is transcribed as a monocistronic unit with a single promoter. The role of a hspR homologue, known to control the regulation of clpB and dnaK gene expression in other high G+C content bacteria was investigated by gel mobility shift assays. Moreover the predicted 3D structure of HspR provides further insight into the binding mode of this protein to the clpB promoter region, and highlights the key amino acid residues believed to be involved in the protein-DNA interaction.
Wang, Shi-Yuan; Zhang, Qi; Zhang, Xiang; Zhao, Pei-Quan
2016-01-01
AIM To make a comprehensive analysis of the potential pathogenic genes related with Leber congenital amaurosis (LCA) in Chinese. METHODS LCA subjects and their families were retrospectively collected from 2013 to 2015. Firstly, whole-exome sequencing was performed in patients who had underwent gene mutation screening with nothing found, and then homozygous sites was selected, candidate sites were annotated, and pathogenic analysis was conducted using softwares including Sorting Tolerant from Intolerant (SIFT), Polyphen-2, Mutation assessor, Condel, and Functional Analysis through Hidden Markov Models (FATHMM). Furthermore, Gene Ontology function and Kyoto Encyclopedia of Genes and Genomes pathway enrichment analyses of pathogenic genes were performed followed by co-segregation analysis using Fisher exact Test. Sanger sequencing was used to validate single-nucleotide variations (SNVs). Expanded verification was performed in the rest patients. RESULTS Totally 51 LCA families with 53 patients and 24 family members were recruited. A total of 104 SNVs (66 LCA-related genes and 15 co-segregated genes) were submitted for expand verification. The frequencies of homozygous mutation of KRT12 and CYP1A1 were simultaneously observed in 3 families. Enrichment analysis showed that the potential pathogenic genes were mainly enriched in functions related to cell adhesion, biological adhesion, retinoid metabolic process, and eye development biological adhesion. Additionally, WFS1 and STAU2 had the highest homozygous frequencies. CONCLUSION LCA is a highly heterogeneous disease. Mutations in KRT12, CYP1A1, WFS1, and STAU2 may be involved in the development of LCA. PMID:27672588
Ali, Muhammad Y; Pavasovic, Ana; Dammannagoda, Lalith K; Mather, Peter B; Prentis, Peter J
2017-01-01
Systemic acid-base balance and osmotic/ionic regulation in decapod crustaceans are in part maintained by a set of transport-related enzymes such as carbonic anhydrase (CA), Na + /K + -ATPase (NKA), H + -ATPase (HAT), Na + /K + /2Cl - cotransporter (NKCC), Na + /Cl - /HCO[Formula: see text] cotransporter (NBC), Na + /H + exchanger (NHE), Arginine kinase (AK), Sarcoplasmic Ca +2 -ATPase (SERCA) and Calreticulin (CRT). We carried out a comparative molecular analysis of these genes in three commercially important yet eco-physiologically distinct freshwater crayfish , Cherax quadricarinatus, C. destructor and C. cainii , with the aim to identify mutations in these genes and determine if observed patterns of mutations were consistent with the action of natural selection. We also conducted a tissue-specific expression analysis of these genes across seven different organs, including gills, hepatopancreas, heart, kidney, liver, nerve and testes using NGS transcriptome data. The molecular analysis of the candidate genes revealed a high level of sequence conservation across the three Cherax sp. Hyphy analysis revealed that all candidate genes showed patterns of molecular variation consistent with neutral evolution. The tissue-specific expression analysis showed that 46% of candidate genes were expressed in all tissue types examined, while approximately 10% of candidate genes were only expressed in a single tissue type. The largest number of genes was observed in nerve (84%) and gills (78%) and the lowest in testes (66%). The tissue-specific expression analysis also revealed that most of the master genes regulating pH and osmoregulation (CA, NKA, HAT, NKCC, NBC, NHE) were expressed in all tissue types indicating an important physiological role for these genes outside of osmoregulation in other tissue types. The high level of sequence conservation observed in the candidate genes may be explained by the important role of these genes as well as potentially having a number of other basic physiological functions in different tissue types.
Genetic polymorphisms associated with increased risk of developing chronic myelogenous leukemia
Bruzzoni-Giovanelli, Heriberto; González, Juan R.; Sigaux, François; Villoutreix, Bruno O.; Cayuela, Jean Michel; Guilhot, Joëlle; Preudhomme, Claude; Guilhot, François; Poyet, Jean-Luc; Rousselot, Philippe
2015-01-01
Little is known about inherited factors associated with the risk of developing chronic myelogenous leukemia (CML). We used a dedicated DNA chip containing 16 561 single nucleotide polymorphisms (SNPs) covering 1 916 candidate genes to analyze 437 CML patients and 1 144 healthy control individuals. Single SNP association analysis identified 139 SNPs that passed multiple comparisons (1% false discovery rate). The HDAC9, AVEN, SEMA3C, IKBKB, GSTA3, RIPK1 and FGF2 genes were each represented by three SNPs, the PSM family by four SNPs and the SLC15A1 gene by six. Haplotype analysis showed that certain combinations of rare alleles of these genes increased the risk of developing CML by more than two or three-fold. A classification tree model identified five SNPs belonging to the genes PSMB10, TNFRSF10D, PSMB2, PPARD and CYP26B1, which were associated with CML predisposition. A CML-risk-allele score was created using these five SNPs. This score was accurate for discriminating CML status (AUC: 0.61, 95%CI: 0.58–0.64). Interestingly, the score was associated with age at diagnosis and the average number of risk alleles was significantly higher in younger patients. The risk-allele score showed the same distribution in the general population (HapMap CEU samples) as in our control individuals and was associated with differential gene expression patterns of two genes (VAPA and TDRKH). In conclusion, we describe haplotypes and a genetic score that are significantly associated with a predisposition to develop CML. The SNPs identified will also serve to drive fundamental research on the putative role of these genes in CML development. PMID:26474455
Evolution of SUMO Function and Chain Formation in Insects.
Ureña, Enric; Pirone, Lucia; Chafino, Silvia; Pérez, Coralia; Sutherland, James D; Lang, Valérie; Rodriguez, Manuel S; Lopitz-Otsoa, Fernando; Blanco, Francisco J; Barrio, Rosa; Martín, David
2016-02-01
SUMOylation, the covalent binding of Small Ubiquitin-like Modifier (SUMO) to target proteins, is a posttranslational modification that regulates critical cellular processes in eukaryotes. In insects, SUMOylation has been studied in holometabolous species, particularly in the dipteran Drosophila melanogaster, which contains a single SUMO gene (smt3). This has led to the assumption that insects contain a single SUMO gene. However, the analysis of insect genomes shows that basal insects contain two SUMO genes, orthologous to vertebrate SUMO1 and SUMO2/3. Our phylogenetical analysis reveals that the SUMO gene has been duplicated giving rise to SUMO1 and SUMO2/3 families early in Metazoan evolution, and that later in insect evolution the SUMO1 gene has been lost after the Hymenoptera divergence. To explore the consequences of this loss, we have examined the characteristics and different biological functions of the two SUMO genes (SUMO1 and SUMO3) in the hemimetabolous cockroach Blattella germanica and compared them with those of Drosophila Smt3. Here, we show that the metamorphic role of the SUMO genes is evolutionary conserved in insects, although there has been a regulatory switch from SUMO1 in basal insects to SUMO3 in more derived ones. We also show that, unlike vertebrates, insect SUMO3 proteins cannot form polySUMO chains due to the loss of critical lysine residues within the N-terminal part of the protein. Furthermore, the formation of polySUMO chains by expression of ectopic human SUMO3 has a deleterious effect in Drosophila. These findings contribute to the understanding of the functional consequences of the evolution of SUMO genes. © The Author 2015. Published by Oxford University Press on behalf of the Society for Molecular Biology and Evolution.
Association of single-nucleotide polymorphisms of the tau gene with late-onset Parkinson disease.
Martin, E R; Scott, W K; Nance, M A; Watts, R L; Hubble, J P; Koller, W C; Lyons, K; Pahwa, R; Stern, M B; Colcher, A; Hiner, B C; Jankovic, J; Ondo, W G; Allen, F H; Goetz, C G; Small, G W; Masterman, D; Mastaglia, F; Laing, N G; Stajich, J M; Ribble, R C; Booze, M W; Rogala, A; Hauser, M A; Zhang, F; Gibson, R A; Middleton, L T; Roses, A D; Haines, J L; Scott, B L; Pericak-Vance, M A; Vance, J M
2001-11-14
The human tau gene, which promotes assembly of neuronal microtubules, has been associated with several rare neurologic diseases that clinically include parkinsonian features. We recently observed linkage in idiopathic Parkinson disease (PD) to a region on chromosome 17q21 that contains the tau gene. These factors make tau a good candidate for investigation as a susceptibility gene for idiopathic PD, the most common form of the disease. To investigate whether the tau gene is involved in idiopathic PD. Among a sample of 1056 individuals from 235 families selected from 13 clinical centers in the United States and Australia and from a family ascertainment core center, we tested 5 single-nucleotide polymorphisms (SNPs) within the tau gene for association with PD, using family-based tests of association. Both affected (n = 426) and unaffected (n = 579) family members were included; 51 individuals had unclear PD status. Analyses were conducted to test individual SNPs and SNP haplotypes within the tau gene. Family-based tests of association, calculated using asymptotic distributions. Analysis of association between the SNPs and PD yielded significant evidence of association for 3 of the 5 SNPs tested: SNP 3, P =.03; SNP 9i, P =.04; and SNP 11, P =.04. The 2 other SNPs did not show evidence of significant association (SNP 9ii, P =.11, and SNP 9iii, P =.87). Strong evidence of association was found with haplotype analysis, with a positive association with one haplotype (P =.009) and a negative association with another haplotype (P =.007). Substantial linkage disequilibrium (P<.001) was detected between 4 of the 5 SNPs (SNPs 3, 9i, 9ii, and 11). This integrated approach of genetic linkage and positional association analyses implicates tau as a susceptibility gene for idiopathic PD.
Natural killer cell receptor genes in the family Equidae: not only Ly49.
Futas, Jan; Horin, Petr
2013-01-01
Natural killer (NK) cells have important functions in immunity. NK recognition in mammals can be mediated through killer cell immunoglobulin-like receptors (KIR) and/or killer cell lectin-like Ly49 receptors. Genes encoding highly variable NK cell receptors (NKR) represent rapidly evolving genomic regions. No single conservative model of NKR genes was observed in mammals. Single-copy low polymorphic NKR genes present in one mammalian species may expand into highly polymorphic multigene families in other species. In contrast to other non-rodent mammals, multiple Ly49-like genes appear to exist in the horse, while no functional KIR genes were observed in this species. In this study, Ly49 and KIR were sought and their evolution was characterized in the entire family Equidae. Genomic sequences retrieved showed the presence of at least five highly conserved polymorphic Ly49 genes in horses, asses and zebras. These findings confirmed that the expansion of Ly49 occurred in the entire family. Several KIR-like sequences were also identified in the genome of Equids. Besides a previously identified non-functional KIR-Immunoglobulin-like transcript fusion gene (KIR-ILTA) and two putative pseudogenes, a KIR3DL-like sequence was analyzed. In contrast to previous observations made in the horse, the KIR3DL sequence, genomic organization and mRNA expression suggest that all Equids might produce a functional KIR receptor protein molecule with a single non-mutated immune tyrosine-based inhibition motif (ITIM) domain. No evidence for positive selection in the KIR3DL gene was found. Phylogenetic analysis including rhinoceros and tapir genomic DNA and deduced amino acid KIR-related sequences showed differences between families and even between species within the order Perissodactyla. The results suggest that the order Perissodactyla and its family Equidae with expanded Ly49 genes and with a potentially functional KIR gene may represent an interesting model for evolutionary biology of NKR genes.
Natural Killer Cell Receptor Genes in the Family Equidae: Not only Ly49
Futas, Jan; Horin, Petr
2013-01-01
Natural killer (NK) cells have important functions in immunity. NK recognition in mammals can be mediated through killer cell immunoglobulin-like receptors (KIR) and/or killer cell lectin-like Ly49 receptors. Genes encoding highly variable NK cell receptors (NKR) represent rapidly evolving genomic regions. No single conservative model of NKR genes was observed in mammals. Single-copy low polymorphic NKR genes present in one mammalian species may expand into highly polymorphic multigene families in other species. In contrast to other non-rodent mammals, multiple Ly49-like genes appear to exist in the horse, while no functional KIR genes were observed in this species. In this study, Ly49 and KIR were sought and their evolution was characterized in the entire family Equidae. Genomic sequences retrieved showed the presence of at least five highly conserved polymorphic Ly49 genes in horses, asses and zebras. These findings confirmed that the expansion of Ly49 occurred in the entire family. Several KIR-like sequences were also identified in the genome of Equids. Besides a previously identified non-functional KIR-Immunoglobulin-like transcript fusion gene (KIR-ILTA) and two putative pseudogenes, a KIR3DL-like sequence was analyzed. In contrast to previous observations made in the horse, the KIR3DL sequence, genomic organization and mRNA expression suggest that all Equids might produce a functional KIR receptor protein molecule with a single non-mutated immune tyrosine-based inhibition motif (ITIM) domain. No evidence for positive selection in the KIR3DL gene was found. Phylogenetic analysis including rhinoceros and tapir genomic DNA and deduced amino acid KIR-related sequences showed differences between families and even between species within the order Perissodactyla. The results suggest that the order Perissodactyla and its family Equidae with expanded Ly49 genes and with a potentially functional KIR gene may represent an interesting model for evolutionary biology of NKR genes. PMID:23724088
[Fluoroquinolone resistance mutations in topoisomerase genes of Salmonella typhimurium isolates].
Guo, Yunchang; Pei, Xiaoyan; Liu, Xiumei
2004-09-01
Mutations in topoisomerase genes were main cause of the resistence of Salmonella typhimurium to fluoroquinolone. The MICs of three Salmonella typhimurium isolates X2, X7, X11 to ciprofloxacin were above 32 microg/ml, 0.38 microg/ml and 0.023 microg/ml, respectively. The genetic alterations in four topoisomerase genes, gyrA, gyrB, parC, and parE were detected by multiplex PCR amplimer conformation analysis in these three strains. X2 isolate showed both gyrA mutations (Ser83-->Phe, Asp87-->Asn) and parC mutation (Ser80-->Arg). X7 isolate showed a single gyrA mutation (Ser83-->Phe) and X11 isolate had no changes in all of the four quinolone resistance genes, gyrA, gyrB, parC, and parE. X7 isolate with a single gyrA mutation was less resistant to ciprofloxacin than X2 with double gyrA mutations and an additional parC mutation. GyrA and parC genes play important role of the resistance of Salmonella typhimurium to ciprofloxacin.
Lundqvist, Mats L; Kohlberg, Kathleen E; Gefroh, Holly A; Arnaud, Philippe; Middleton, Darlene L; Romano, Tracy A; Warr, Gregory W
2002-07-01
Clones encoding the dolphin IgM heavy (micro) chain gene were isolated from a cDNA library of peripheral blood leukocytes. Genomic Southern blot analyses showed that the dolphin IGHM gene is most likely present in a single copy, and its sequence shows greatest similarity to those of the IGHM gene of the sheep, pig and cow, evolutionarily related artiodactyls. The transmembrane (TM) form of the IGHM chain was isolated by 3' RACE. While showing similarities to the TM regions of other mammalian IGHM chains, the highly conserved Ser residue of the CART motif is substituted with a Gly in the dolphin. In contrast to the pig and cow, which utilize only a single VH family, the dolphin expresses at least two distinct VH families, belonging to the mammalian VH clans I and III. At least two JH genes were identified in the dolphin. Some CDR3 regions of the dolphin VH are long (up to 21 amino acids), and contain multiple Cys residues, hypothesized to stabilize the CDR3 structure through disulfide bond formation.
Leal, Mariana Ferreira; Astur, Diego Costa; Debieux, Pedro; Arliani, Gustavo Gonçalves; Silveira Franciozi, Carlos Eduardo; Loyola, Leonor Casilla; Andreoli, Carlos Vicente; Smith, Marília Cardoso; Pochini, Alberto de Castro; Ejnisman, Benno; Cohen, Moises
2015-01-01
The anterior cruciate ligament (ACL) is one of the most frequently injured structures during high-impact sporting activities. Gene expression analysis may be a useful tool for understanding ACL tears and healing failure. Reverse transcription-quantitative polymerase chain reaction (RT-qPCR) has emerged as an effective method for such studies. However, this technique requires the use of suitable reference genes for data normalization. Here, we evaluated the suitability of six reference genes (18S, ACTB, B2M, GAPDH, HPRT1, and TBP) by using ACL samples of 39 individuals with ACL tears (20 with isolated ACL tears and 19 with ACL tear and combined meniscal injury) and of 13 controls. The stability of the candidate reference genes was determined by using the NormFinder, geNorm, BestKeeper DataAssist, and RefFinder software packages and the comparative ΔCt method. ACTB was the best single reference gene and ACTB+TBP was the best gene pair. The GenEx software showed that the accumulated standard deviation is reduced when a larger number of reference genes is used for gene expression normalization. However, the use of a single reference gene may not be suitable. To identify the optimal combination of reference genes, we evaluated the expression of FN1 and PLOD1. We observed that at least 3 reference genes should be used. ACTB+HPRT1+18S is the best trio for the analyses involving isolated ACL tears and controls. Conversely, ACTB+TBP+18S is the best trio for the analyses involving (1) injured ACL tears and controls, and (2) ACL tears of patients with meniscal tears and controls. Therefore, if the gene expression study aims to compare non-injured ACL, isolated ACL tears and ACL tears from patients with meniscal tear as three independent groups ACTB+TBP+18S+HPRT1 should be used. In conclusion, 3 or more genes should be used as reference genes for analysis of ACL samples of individuals with and without ACL tears.
Genes Responsive to Low-Intensity Pulsed Ultrasound in MC3T3-E1 Preosteoblast Cells
Tabuchi, Yoshiaki; Sugahara, Yuuki; Ikegame, Mika; Suzuki, Nobuo; Kitamura, Kei-ichiro; Kondo, Takashi
2013-01-01
Although low-intensity pulsed ultrasound (LIPUS) has been shown to enhance bone fracture healing, the underlying mechanism of LIPUS remains to be fully elucidated. Here, to better understand the molecular mechanism underlying cellular responses to LIPUS, we investigated gene expression profiles in mouse MC3T3-E1 preosteoblast cells exposed to LIPUS using high-density oligonucleotide microarrays and computational gene expression analysis tools. Although treatment of the cells with a single 20-min LIPUS (1.5 MHz, 30 mW/cm2) did not affect the cell growth or alkaline phosphatase activity, the treatment significantly increased the mRNA level of Bglap. Microarray analysis demonstrated that 38 genes were upregulated and 37 genes were downregulated by 1.5-fold or more in the cells at 24-h post-treatment. Ingenuity pathway analysis demonstrated that the gene network U (up) contained many upregulated genes that were mainly associated with bone morphology in the category of biological functions of skeletal and muscular system development and function. Moreover, the biological function of the gene network D (down), which contained downregulated genes, was associated with gene expression, the cell cycle and connective tissue development and function. These results should help to further clarify the molecular basis of the mechanisms of the LIPUS response in osteoblast cells. PMID:24252911
Integration of QTL and bioinformatic tools to identify candidate genes for triglycerides in mice[S
Leduc, Magalie S.; Hageman, Rachael S.; Verdugo, Ricardo A.; Tsaih, Shirng-Wern; Walsh, Kenneth; Churchill, Gary A.; Paigen, Beverly
2011-01-01
To identify genetic loci influencing lipid levels, we performed quantitative trait loci (QTL) analysis between inbred mouse strains MRL/MpJ and SM/J, measuring triglyceride levels at 8 weeks of age in F2 mice fed a chow diet. We identified one significant QTL on chromosome (Chr) 15 and three suggestive QTL on Chrs 2, 7, and 17. We also carried out microarray analysis on the livers of parental strains of 282 F2 mice and used these data to find cis-regulated expression QTL. We then narrowed the list of candidate genes under significant QTL using a “toolbox” of bioinformatic resources, including haplotype analysis; parental strain comparison for gene expression differences and nonsynonymous coding single nucleotide polymorphisms (SNP); cis-regulated eQTL in livers of F2 mice; correlation between gene expression and phenotype; and conditioning of expression on the phenotype. We suggest Slc25a7 as a candidate gene for the Chr 7 QTL and, based on expression differences, five genes (Polr3 h, Cyp2d22, Cyp2d26, Tspo, and Ttll12) as candidate genes for Chr 15 QTL. This study shows how bioinformatics can be used effectively to reduce candidate gene lists for QTL related to complex traits. PMID:21622629
MGAS: a powerful tool for multivariate gene-based genome-wide association analysis.
Van der Sluis, Sophie; Dolan, Conor V; Li, Jiang; Song, Youqiang; Sham, Pak; Posthuma, Danielle; Li, Miao-Xin
2015-04-01
Standard genome-wide association studies, testing the association between one phenotype and a large number of single nucleotide polymorphisms (SNPs), are limited in two ways: (i) traits are often multivariate, and analysis of composite scores entails loss in statistical power and (ii) gene-based analyses may be preferred, e.g. to decrease the multiple testing problem. Here we present a new method, multivariate gene-based association test by extended Simes procedure (MGAS), that allows gene-based testing of multivariate phenotypes in unrelated individuals. Through extensive simulation, we show that under most trait-generating genotype-phenotype models MGAS has superior statistical power to detect associated genes compared with gene-based analyses of univariate phenotypic composite scores (i.e. GATES, multiple regression), and multivariate analysis of variance (MANOVA). Re-analysis of metabolic data revealed 32 False Discovery Rate controlled genome-wide significant genes, and 12 regions harboring multiple genes; of these 44 regions, 30 were not reported in the original analysis. MGAS allows researchers to conduct their multivariate gene-based analyses efficiently, and without the loss of power that is often associated with an incorrectly specified genotype-phenotype models. MGAS is freely available in KGG v3.0 (http://statgenpro.psychiatry.hku.hk/limx/kgg/download.php). Access to the metabolic dataset can be requested at dbGaP (https://dbgap.ncbi.nlm.nih.gov/). The R-simulation code is available from http://ctglab.nl/people/sophie_van_der_sluis. Supplementary data are available at Bioinformatics online. © The Author 2014. Published by Oxford University Press.
Pseudomonas specific 16S rDNA PCR amplification and multiple enzyme restriction fragment length polymorphism (MERFLP) analysis using a single digestion mixture of Alu I, Hinf I, Rsa I, and Tru 9I distinguished 150 published sequences and reference strains of authentic Pseudomonas...
Ernst, J F; Stewart, J W; Sherman, F
1981-01-01
DNA sequence analysis of a cloned fragment directly established that the cyc1-11 mutation of iso-1-cytochrome c in the yeast Saccharomyces cerevisiae is a two-base-pair substitution that changes the CCA proline codon at amino acid position 76 to a UAA nonsense codon. Analysis of 11 revertant proteins and one cloned revertant gene showed that reversion of the cyc1-11 mutation can occur in three ways: a single base-pair substitution, which produces a serine replacement at position 76; recombination with the nonallelic CYC7 gene of iso-2-cytochrome c, which causes replacement of a segment in the cyc1-11 gene by the corresponding segment of the CYC7 gene; and either a two-base-pair substitution or recombination with the CYC7 gene, which causes the formation of the normal iso-1-cytochrome c sequence. These results demonstrate the occurrence of low frequencies of recombination between nonallelic genes having extensive but not complete homology. The formation of composite genes that share sequences from nonallelic genes may be an evolutionary mechanism for producing protein diversities and for maintaining identical sequences at different loci. Images PMID:6273865
PIGD: a database for intronless genes in the Poaceae.
Yan, Hanwei; Jiang, Cuiping; Li, Xiaoyu; Sheng, Lei; Dong, Qing; Peng, Xiaojian; Li, Qian; Zhao, Yang; Jiang, Haiyang; Cheng, Beijiu
2014-10-01
Intronless genes are a feature of prokaryotes; however, they are widespread and unequally distributed among eukaryotes and represent an important resource to study the evolution of gene architecture. Although many databases on exons and introns exist, there is currently no cohesive database that collects intronless genes in plants into a single database. In this study, we present the Poaceae Intronless Genes Database (PIGD), a user-friendly web interface to explore information on intronless genes from different plants. Five Poaceae species, Sorghum bicolor, Zea mays, Setaria italica, Panicum virgatum and Brachypodium distachyon, are included in the current release of PIGD. Gene annotations and sequence data were collected and integrated from different databases. The primary focus of this study was to provide gene descriptions and gene product records. In addition, functional annotations, subcellular localization prediction and taxonomic distribution are reported. PIGD allows users to readily browse, search and download data. BLAST and comparative analyses are also provided through this online database, which is available at http://pigd.ahau.edu.cn/. PIGD provides a solid platform for the collection, integration and analysis of intronless genes in the Poaceae. As such, this database will be useful for subsequent bio-computational analysis in comparative genomics and evolutionary studies.
Jia, Peilin; Zhao, Zhongming
2014-01-01
A major challenge in interpreting the large volume of mutation data identified by next-generation sequencing (NGS) is to distinguish driver mutations from neutral passenger mutations to facilitate the identification of targetable genes and new drugs. Current approaches are primarily based on mutation frequencies of single-genes, which lack the power to detect infrequently mutated driver genes and ignore functional interconnection and regulation among cancer genes. We propose a novel mutation network method, VarWalker, to prioritize driver genes in large scale cancer mutation data. VarWalker fits generalized additive models for each sample based on sample-specific mutation profiles and builds on the joint frequency of both mutation genes and their close interactors. These interactors are selected and optimized using the Random Walk with Restart algorithm in a protein-protein interaction network. We applied the method in >300 tumor genomes in two large-scale NGS benchmark datasets: 183 lung adenocarcinoma samples and 121 melanoma samples. In each cancer, we derived a consensus mutation subnetwork containing significantly enriched consensus cancer genes and cancer-related functional pathways. These cancer-specific mutation networks were then validated using independent datasets for each cancer. Importantly, VarWalker prioritizes well-known, infrequently mutated genes, which are shown to interact with highly recurrently mutated genes yet have been ignored by conventional single-gene-based approaches. Utilizing VarWalker, we demonstrated that network-assisted approaches can be effectively adapted to facilitate the detection of cancer driver genes in NGS data. PMID:24516372
Jia, Peilin; Zhao, Zhongming
2014-02-01
A major challenge in interpreting the large volume of mutation data identified by next-generation sequencing (NGS) is to distinguish driver mutations from neutral passenger mutations to facilitate the identification of targetable genes and new drugs. Current approaches are primarily based on mutation frequencies of single-genes, which lack the power to detect infrequently mutated driver genes and ignore functional interconnection and regulation among cancer genes. We propose a novel mutation network method, VarWalker, to prioritize driver genes in large scale cancer mutation data. VarWalker fits generalized additive models for each sample based on sample-specific mutation profiles and builds on the joint frequency of both mutation genes and their close interactors. These interactors are selected and optimized using the Random Walk with Restart algorithm in a protein-protein interaction network. We applied the method in >300 tumor genomes in two large-scale NGS benchmark datasets: 183 lung adenocarcinoma samples and 121 melanoma samples. In each cancer, we derived a consensus mutation subnetwork containing significantly enriched consensus cancer genes and cancer-related functional pathways. These cancer-specific mutation networks were then validated using independent datasets for each cancer. Importantly, VarWalker prioritizes well-known, infrequently mutated genes, which are shown to interact with highly recurrently mutated genes yet have been ignored by conventional single-gene-based approaches. Utilizing VarWalker, we demonstrated that network-assisted approaches can be effectively adapted to facilitate the detection of cancer driver genes in NGS data.
Correa-Rodríguez, María; Schmidt-RioValle, Jacqueline; González-Jiménez, Emilio; Rueda-Medina, Blanca
2017-06-01
Obesity is considered an increasingly serious health problem determined by multiple genetic and environmental factors. Estrogens have been found to play a major role in body weight and adiposity regulation through estrogen receptor 1 ( ESR1). The aim of this study was to determine whether genotype and haplotype frequencies of ESR1 polymorphisms are associated with body composition measures in a population of 572 young adults. A lack of significant association between genotypes of ESR1 gene polymorphisms and obesity phenotypes was seen after adjustment for confounding factors. Linkage disequilibrium (LD) analysis identified a single LD block for the ESR1 gene including PvuII and XbaI single-nucleotide polymorphisms (SNPs) (pairwise r 2 = .66). None of the haplotypes identified revealed statistically significant associations with any of the obesity phenotypes. Our results suggest that polymorphisms of the ESR1 gene do not contribute significantly to the genetic risk for obesity phenotypes in a population of young Caucasian adults.
Multiple-input multiple-output causal strategies for gene selection.
Bontempi, Gianluca; Haibe-Kains, Benjamin; Desmedt, Christine; Sotiriou, Christos; Quackenbush, John
2011-11-25
Traditional strategies for selecting variables in high dimensional classification problems aim to find sets of maximally relevant variables able to explain the target variations. If these techniques may be effective in generalization accuracy they often do not reveal direct causes. The latter is essentially related to the fact that high correlation (or relevance) does not imply causation. In this study, we show how to efficiently incorporate causal information into gene selection by moving from a single-input single-output to a multiple-input multiple-output setting. We show in synthetic case study that a better prioritization of causal variables can be obtained by considering a relevance score which incorporates a causal term. In addition we show, in a meta-analysis study of six publicly available breast cancer microarray datasets, that the improvement occurs also in terms of accuracy. The biological interpretation of the results confirms the potential of a causal approach to gene selection. Integrating causal information into gene selection algorithms is effective both in terms of prediction accuracy and biological interpretation.
Li, Qian-Nan; Guo, Lei; Hou, Yi; Ou, Xiang-Hong; Liu, Zhonghua; Sun, Qing-Yuan
2018-06-22
Polycystic ovary syndrome (PCOS), a familial aggregation disease that causes anovulation in women, has well-recognised characteristics, two of which are hyperinsulinaemia and hyperandrogenaemia. To determine whether the DNA methylation status is altered in oocytes by high insulin and androgen levels, we generated a mouse model with hyperinsulinaemia and hyperandrogenaemia by injection of insulin and human chorionic gonadotrophin and investigated DNA methylation changes through single-cell level whole genome bisulphite sequencing. Our results showed that hyperinsulinaemia and hyperandrogenaemia had no significant effects on the global DNA methylation profile and different functional regions of genes, but did alter methylation status of some genes, which were significantly enriched in 17 gene ontology (GO) terms (P<0.05) by GO analysis. Among differently methylated genes, some were related to the occurrence of PCOS. Based on our results, we suggest that hyperinsulinaemia and hyperandrogenaemia may cause changes in some DNA methylation loci in oocytes.
Wang, Shuo; Gao, Li-Zhi
2016-09-01
The complete chloroplast genome of green foxtail (Setaria viridis), a promising model system for C4 photosynthesis, is first reported in this study. The genome harbors a large single copy (LSC) region of 81 016 bp and a small single copy (SSC) region of 12 456 bp separated by a pair of inverted repeat (IRa and IRb) regions of 22 315 bp. GC content is 38.92%. The proportion of coding sequence is 57.97%, comprising of 111 (19 duplicated in IR regions) unique genes, 71 of which are protein-coding genes, four are rRNA genes, and 36 are tRNA genes. Phylogenetic analysis indicated that S. viridis was clustered with its cultivated species S. italica in the tribe Paniceae of the family Poaceae. This newly determined chloroplast genome will provide valuable genetic resources to assist future studies on C4 photosynthesis in grasses.
Morrone, A; Tylee, K.L.; Al-Sayed, M; Brusius-Facchin, A.C.; Caciotti, A.; Church, H.J.; Coll, M.J.; Davidson, K.; Fietz, M.J.; Gort, L.; Hegde, M.; Kubaski, F.; Lacerda, L.; Laranjeira, F.; Leistner-Segal, S.; Mooney, S.; Pajares, S.; Pollard, L.; Riberio, I.; Wang, R.Y.; Miller, N.
2014-01-01
Morquio A (Mucopolysaccharidosis IVA; MPS IVA) is an autosomal recessive lysosomal storage disorder caused by partial or total deficiency of the enzyme galactosamine-6-sulfate sulfatase (GALNS; also known as N-acetylgalactosamine-6-sulfate sulfatase) encoded by the GALNS gene. Patients who inherit two mutated GALNS gene alleles produce protein with decreased ability to degrade the glycosaminoglycans (GAGs) keratan sulfate and chondroitin 6-sulfate, thereby causing GAG accumulation within lysosomes and consequently pleiotropic disease. GALNS mutations occur throughout the gene and many mutations are identified only in single patients or families, causing difficulties both in mutation detection and interpretation. In this study, molecular analysis of 163 patients with Morquio A identified 99 unique mutations in the GALNS gene believed to negatively impact GALNS protein function, of which 39 are previously unpublished, together with 26 single-nucleotide polymorphisms. Recommendations for the molecular testing of patients, clear reporting of sequence findings, and interpretation of sequencing data are provided. PMID:24726177
Jia, Peilin; Wang, Lily; Fanous, Ayman H.; Pato, Carlos N.; Edwards, Todd L.; Zhao, Zhongming
2012-01-01
With the recent success of genome-wide association studies (GWAS), a wealth of association data has been accomplished for more than 200 complex diseases/traits, proposing a strong demand for data integration and interpretation. A combinatory analysis of multiple GWAS datasets, or an integrative analysis of GWAS data and other high-throughput data, has been particularly promising. In this study, we proposed an integrative analysis framework of multiple GWAS datasets by overlaying association signals onto the protein-protein interaction network, and demonstrated it using schizophrenia datasets. Building on a dense module search algorithm, we first searched for significantly enriched subnetworks for schizophrenia in each single GWAS dataset and then implemented a discovery-evaluation strategy to identify module genes with consistent association signals. We validated the module genes in an independent dataset, and also examined them through meta-analysis of the related SNPs using multiple GWAS datasets. As a result, we identified 205 module genes with a joint effect significantly associated with schizophrenia; these module genes included a number of well-studied candidate genes such as DISC1, GNA12, GNA13, GNAI1, GPR17, and GRIN2B. Further functional analysis suggested these genes are involved in neuronal related processes. Additionally, meta-analysis found that 18 SNPs in 9 module genes had P meta<1×10−4, including the gene HLA-DQA1 located in the MHC region on chromosome 6, which was reported in previous studies using the largest cohort of schizophrenia patients to date. These results demonstrated our bi-directional network-based strategy is efficient for identifying disease-associated genes with modest signals in GWAS datasets. This approach can be applied to any other complex diseases/traits where multiple GWAS datasets are available. PMID:22792057
Han, Xuelei; Jiang, Tengfei; Yang, Huawei; Zhang, Qingde; Wang, Weimin; Fan, Bin; Liu, Bang
2012-06-01
Meat quality traits are economically important traits of swine, and are controlled by multiple genes as complex quantitative traits. In the present study four genes, H-FABP (heart fatty acid-binding protein), MASTR (MEF2 activating motif and SAP domain containing transcriptional regulator), UCP3 (uncoupling protein 3) and MYOD1 (myogenic differentiation 1) were researched in Large White pigs. The polymorphisms H-FABP T/C of 5'UTR, MYOD1 g.257 A>C, UCP3 g.1406 G>A in exon 3 and MASTR c.187 C>T have been reported to be associated with meat quality traits in pigs. The aim of this study was to analyze the effect of single and multiple markers for single traits in Large White pigs. The single marker association analysis showed that the H-FABP and MASTR genes were associated with IMF (intramuscular fat content) (P < 0.05), and that the g.257 A>C of MYOD1 gene was most significantly related to muscle pH value (P < 0.01). The multiple markers for IMF were analyzed by combining the markers and quantitative trait modes into the linear regression. The results revealed that H-FABP and MASTR integrate gene networks for IMF. Thus, our study results suggested that H-FABP and MASTR polymorphisms could be used as genetic markers in the marker-assisted selection towards the improvement of IMF in Large White pigs.
Mathupala, S P; Lowe, S E; Podkovyrov, S M; Zeikus, J G
1993-08-05
The complete nucleotide sequence of the gene encoding the dual active amylopullulanase of Thermoanaerobacter ethanolicus 39E (formerly Clostridium thermohydrosulfuricum) was determined. The structural gene (apu) contained a single open reading frame 4443 base pairs in length, corresponding to 1481 amino acids, with an estimated molecular weight of 162,780. Analysis of the deduced sequence of apu with sequences of alpha-amylases and alpha-1,6 debranching enzymes enabled the identification of four conserved regions putatively involved in substrate binding and in catalysis. The conserved regions were localized within a 2.9-kilobase pair gene fragment, which encoded a M(r) 100,000 protein that maintained the dual activities and thermostability of the native enzyme. The catalytic residues of amylopullulanase were tentatively identified by using hydrophobic cluster analysis for comparison of amino acid sequences of amylopullulanase and other amylolytic enzymes. Asp597, Glu626, and Asp703 were individually modified to their respective amide form, or the alternate acid form, and in all cases both alpha-amylase and pullulanase activities were lost, suggesting the possible involvement of 3 residues in a catalytic triad, and the presence of a putative single catalytic site within the enzyme. These findings substantiate amylopullulanase as a new type of amylosaccharidase.
Recently, the landscape of single base mutations in diffuse large B-cell lymphoma (DLBCL) was described. Here we report the discovery of a gene fusion between TBL1XR1 and TP63, the only recurrent somatic novel gene fusion identified in our analysis of transcriptome data from 96 DLBCL cases. Based on this cohort and a further 157 DLBCL cases analyzed by FISH, the incidence in de novo germinal center B cell-like (GCB) DLBCL is 5% (6 of 115).
Gene set analysis of purine and pyrimidine antimetabolites cancer therapies.
Fridley, Brooke L; Batzler, Anthony; Li, Liang; Li, Fang; Matimba, Alice; Jenkins, Gregory D; Ji, Yuan; Wang, Liewei; Weinshilboum, Richard M
2011-11-01
Responses to therapies, either with regard to toxicities or efficacy, are expected to involve complex relationships of gene products within the same molecular pathway or functional gene set. Therefore, pathways or gene sets, as opposed to single genes, may better reflect the true underlying biology and may be more appropriate units for analysis of pharmacogenomic studies. Application of such methods to pharmacogenomic studies may enable the detection of more subtle effects of multiple genes in the same pathway that may be missed by assessing each gene individually. A gene set analysis of 3821 gene sets is presented assessing the association between basal messenger RNA expression and drug cytotoxicity using ethnically defined human lymphoblastoid cell lines for two classes of drugs: pyrimidines [gemcitabine (dFdC) and arabinoside] and purines [6-thioguanine and 6-mercaptopurine]. The gene set nucleoside-diphosphatase activity was found to be significantly associated with both dFdC and arabinoside, whereas gene set γ-aminobutyric acid catabolic process was associated with dFdC and 6-thioguanine. These gene sets were significantly associated with the phenotype even after adjusting for multiple testing. In addition, five associated gene sets were found in common between the pyrimidines and two gene sets for the purines (3',5'-cyclic-AMP phosphodiesterase activity and γ-aminobutyric acid catabolic process) with a P value of less than 0.0001. Functional validation was attempted with four genes each in gene sets for thiopurine and pyrimidine antimetabolites. All four genes selected from the pyrimidine gene sets (PSME3, CANT1, ENTPD6, ADRM1) were validated, but only one (PDE4D) was validated for the thiopurine gene sets. In summary, results from the gene set analysis of pyrimidine and purine therapies, used often in the treatment of various cancers, provide novel insight into the relationship between genomic variation and drug response.
A microarray analysis of potential genes underlying the neurosensitivity of mice to propofol.
Lowes, Damon A; Galley, Helen F; Lowe, Peter R; Rikke, Brad A; Johnson, Thomas E; Webster, Nigel R
2005-09-01
Establishing the mechanism of action of general anesthetics at the molecular level is difficult because of the multiple targets with which these drugs are associated. Inbred short sleep (ISS) and long sleep (ILS) mice are differentially sensitive in response to ethanol and other sedative hypnotics and contain a single quantitative trait locus (Lorp1) that accounts for the genetic variance of loss-of-righting reflex in response to propofol (LORP). In this study, we used high-density oligonucleotide microarrays to identify global gene expression and candidate genes differentially expressed within the Lorp1 region that may give insight into the molecular mechanism underlying LORP. Microarray analysis was performed using Affymetrix MG-U74Av2 Genechips and a selection of differentially expressed genes was confirmed by semiquantitative reverse transcription-polymerase chain reaction. Global expression in the brains of ILS and ISS mice revealed 3423 genes that were significantly expressed, of which 139 (4%) were differentially expressed. Analysis of genes located within the Lorp1 region showed that 26 genes were significantly expressed and that just 2 genes (7%) were differentially expressed. These genes encoded for the proteins AWP1 (associated with protein kinase 1) and "BTB (POZ) domain containing 1," whose functions are largely uncharacterized. Genes differentially expressed outside Lorp1 included seven genes with previously characterized neuronal functions and thus stand out as additional candidate genes that may be involved in mediating the neurosensitivity differences between ISS and ILS.
Lochlainn, Seosamh Ó; Amoah, Stephen; Graham, Neil S; Alamer, Khalid; Rios, Juan J; Kurup, Smita; Stoute, Andrew; Hammond, John P; Østergaard, Lars; King, Graham J; White, Phillip J; Broadley, Martin R
2011-12-08
Targeted Induced Loci Lesions IN Genomes (TILLING) is increasingly being used to generate and identify mutations in target genes of crop genomes. TILLING populations of several thousand lines have been generated in a number of crop species including Brassica rapa. Genetic analysis of mutants identified by TILLING requires an efficient, high-throughput and cost effective genotyping method to track the mutations through numerous generations. High resolution melt (HRM) analysis has been used in a number of systems to identify single nucleotide polymorphisms (SNPs) and insertion/deletions (IN/DELs) enabling the genotyping of different types of samples. HRM is ideally suited to high-throughput genotyping of multiple TILLING mutants in complex crop genomes. To date it has been used to identify mutants and genotype single mutations. The aim of this study was to determine if HRM can facilitate downstream analysis of multiple mutant lines identified by TILLING in order to characterise allelic series of EMS induced mutations in target genes across a number of generations in complex crop genomes. We demonstrate that HRM can be used to genotype allelic series of mutations in two genes, BraA.CAX1a and BraA.MET1.a in Brassica rapa. We analysed 12 mutations in BraA.CAX1.a and five in BraA.MET1.a over two generations including a back-cross to the wild-type. Using a commercially available HRM kit and the Lightscanner™ system we were able to detect mutations in heterozygous and homozygous states for both genes. Using HRM genotyping on TILLING derived mutants, it is possible to generate an allelic series of mutations within multiple target genes rapidly. Lines suitable for phenotypic analysis can be isolated approximately 8-9 months (3 generations) from receiving M3 seed of Brassica rapa from the RevGenUK TILLING service.
2011-01-01
Background Targeted Induced Loci Lesions IN Genomes (TILLING) is increasingly being used to generate and identify mutations in target genes of crop genomes. TILLING populations of several thousand lines have been generated in a number of crop species including Brassica rapa. Genetic analysis of mutants identified by TILLING requires an efficient, high-throughput and cost effective genotyping method to track the mutations through numerous generations. High resolution melt (HRM) analysis has been used in a number of systems to identify single nucleotide polymorphisms (SNPs) and insertion/deletions (IN/DELs) enabling the genotyping of different types of samples. HRM is ideally suited to high-throughput genotyping of multiple TILLING mutants in complex crop genomes. To date it has been used to identify mutants and genotype single mutations. The aim of this study was to determine if HRM can facilitate downstream analysis of multiple mutant lines identified by TILLING in order to characterise allelic series of EMS induced mutations in target genes across a number of generations in complex crop genomes. Results We demonstrate that HRM can be used to genotype allelic series of mutations in two genes, BraA.CAX1a and BraA.MET1.a in Brassica rapa. We analysed 12 mutations in BraA.CAX1.a and five in BraA.MET1.a over two generations including a back-cross to the wild-type. Using a commercially available HRM kit and the Lightscanner™ system we were able to detect mutations in heterozygous and homozygous states for both genes. Conclusions Using HRM genotyping on TILLING derived mutants, it is possible to generate an allelic series of mutations within multiple target genes rapidly. Lines suitable for phenotypic analysis can be isolated approximately 8-9 months (3 generations) from receiving M3 seed of Brassica rapa from the RevGenUK TILLING service. PMID:22152063
Hamblin, Angela; Wordsworth, Sarah; Fermont, Jilles M; Page, Suzanne; Kaur, Kulvinder; Camps, Carme; Kaisaki, Pamela; Gupta, Avinash; Talbot, Denis; Middleton, Mark; Henderson, Shirley; Cutts, Anthony; Vavoulis, Dimitrios V; Housby, Nick; Tomlinson, Ian; Taylor, Jenny C; Schuh, Anna
2017-02-01
Single gene tests to predict whether cancers respond to specific targeted therapies are performed increasingly often. Advances in sequencing technology, collectively referred to as next generation sequencing (NGS), mean the entire cancer genome or parts of it can now be sequenced at speed with increased depth and sensitivity. However, translation of NGS into routine cancer care has been slow. Healthcare stakeholders are unclear about the clinical utility of NGS and are concerned it could be an expensive addition to cancer diagnostics, rather than an affordable alternative to single gene testing. We validated a 46-gene hotspot cancer panel assay allowing multiple gene testing from small diagnostic biopsies. From 1 January 2013 to 31 December 2013, solid tumour samples (including non-small-cell lung carcinoma [NSCLC], colorectal carcinoma, and melanoma) were sequenced in the context of the UK National Health Service from 351 consecutively submitted prospective cases for which treating clinicians thought the patient had potential to benefit from more extensive genetic analysis. Following histological assessment, tumour-rich regions of formalin-fixed paraffin-embedded (FFPE) sections underwent macrodissection, DNA extraction, NGS, and analysis using a pipeline centred on Torrent Suite software. With a median turnaround time of seven working days, an integrated clinical report was produced indicating the variants detected, including those with potential diagnostic, prognostic, therapeutic, or clinical trial entry implications. Accompanying phenotypic data were collected, and a detailed cost analysis of the panel compared with single gene testing was undertaken to assess affordability for routine patient care. Panel sequencing was successful for 97% (342/351) of tumour samples in the prospective cohort and showed 100% concordance with known mutations (detected using cobas assays). At least one mutation was identified in 87% (296/342) of tumours. A locally actionable mutation (i.e., available targeted treatment or clinical trial) was identified in 122/351 patients (35%). Forty patients received targeted treatment, in 22/40 (55%) cases solely due to use of the panel. Examination of published data on the potential efficacy of targeted therapies showed theoretically actionable mutations (i.e., mutations for which targeted treatment was potentially appropriate) in 66% (71/107) and 39% (41/105) of melanoma and NSCLC patients, respectively. At a cost of £339 (US$449) per patient, the panel was less expensive locally than performing more than two or three single gene tests. Study limitations include the use of FFPE samples, which do not always provide high-quality DNA, and the use of "real world" data: submission of cases for sequencing did not always follow clinical guidelines, meaning that when mutations were detected, patients were not always eligible for targeted treatments on clinical grounds. This study demonstrates that more extensive tumour sequencing can identify mutations that could improve clinical decision-making in routine cancer care, potentially improving patient outcomes, at an affordable level for healthcare providers.
Using Public Data for Comparative Proteome Analysis in Precision Medicine Programs.
Hughes, Christopher S; Morin, Gregg B
2018-03-01
Maximizing the clinical utility of information obtained in longitudinal precision medicine programs would benefit from robust comparative analyses to known information to assess biological features of patient material toward identifying the underlying features driving their disease phenotype. Herein, the potential for utilizing publically deposited mass-spectrometry-based proteomics data to perform inter-study comparisons of cell-line or tumor-tissue materials is investigated. To investigate the robustness of comparison between MS-based proteomics studies carried out with different methodologies, deposited data representative of label-free (MS1) and isobaric tagging (MS2 and MS3 quantification) are utilized. In-depth quantitative proteomics data acquired from analysis of ovarian cancer cell lines revealed the robust recapitulation of observable gene expression dynamics between individual studies carried out using significantly different methodologies. The observed signatures enable robust inter-study clustering of cell line samples. In addition, the ability to classify and cluster tumor samples based on observed gene expression trends when using a single patient sample is established. With this analysis, relevant gene expression dynamics are obtained from a single patient tumor, in the context of a precision medicine analysis, by leveraging a large cohort of repository data as a comparator. Together, these data establish the potential for state-of-the-art MS-based proteomics data to serve as resources for robust comparative analyses in precision medicine applications. © 2017 WILEY-VCH Verlag GmbH & Co. KGaA, Weinheim.
Lira-Albarrán, Saúl; Durand, Marta; Barrera, David; Vega, Claudia; Becerra, Rocio García; Díaz, Lorenza; García-Quiroz, Janice; Rangel, Claudia; Larrea, Fernando
2018-04-27
In order to get further information on the effects of ulipristal acetate (UPA) upon the process of decidualization of endometrium, a functional analysis of the differentially expressed genes in endometrium (DEG) from UPA treated-versus control-cycles of normal ovulatory women was performed. A list of 1183 endometrial DEG, from a previously published study by our group, was submitted to gene ontology, gene enrichment and ingenuity pathway analyses (IPA). This functional analysis showed that decidualization was a biological process overrepresented. Gene set enrichment analysis identified LIF, PRL, IL15 and STAT3 among the most down-regulated genes within the JAK STAT canonical pathway. IPA showed that decidualization of uterus was a bio-function predicted as inhibited by UPA. The results demonstrated that this selective progesterone receptor modulator, when administered during the periovulatory phase of the menstrual cycle, may affect the molecular mechanisms leading to endometrial decidualization in response to progesterone during the period of maximum embryo receptivity. Copyright © 2018 Elsevier B.V. All rights reserved.
Gene analysis of steroid 5 alpha-reductase 1 in hyperandrogenic women.
Eminović, Izet; Komel, Radovan; Prezelj, Janez; Karamehić, Jasenko; Gavrankapetanović, Faris; Heljić, Becir
2005-08-01
To examine the gene encoding for 5alpha-reductase type 1 in hyperandrogenic women, and assess the association of its eventual mutations or polymorphisms with the development of the hyperandrogenic female pattern. Sixteen hyperandrogenic women were included in the study. Single-stranded conformation polymorphism analysis (SSCP) and DNA sequencing were performed after polymerase chain reaction amplification of each of the 5 exons of the SRD5A1 gene in both hyperandrogenic and control group (16 participants). Sequence analysis identified the existence of many polymorphisms; in codon 24 of exon 1, GGC (Gly) into GAC (Asp); in codon 30 of exon 1, CGG (Arg) into CGC (Arg); in exon 3 codon 169, ACA to ACG (both encoding for threonine); in exon 5, AGA to AGG (both encoding for arginine, codon 260); and T/C polymorphism in intron 2. Polymorphisms were found in both groups. Polymorphisms of SRD5A1 gene were the same in both hyperandrogenic and healthy women, indicating no significant associations of genetic polymorphisms/variations of SRD5A1 gene with clinical manifestations of hyperandrogenic disorders in women.
Li, Guotian; Jain, Rashmi; Chern, Mawsheng; Pham, Nikki T; Martin, Joel A; Wei, Tong; Schackwitz, Wendy S; Lipzen, Anna M; Duong, Phat Q; Jones, Kyle C; Jiang, Liangrong; Ruan, Deling; Bauer, Diane; Peng, Yi; Barry, Kerrie W; Schmutz, Jeremy; Ronald, Pamela C
2017-06-01
The availability of a whole-genome sequenced mutant population and the cataloging of mutations of each line at a single-nucleotide resolution facilitate functional genomic analysis. To this end, we generated and sequenced a fast-neutron-induced mutant population in the model rice cultivar Kitaake ( Oryza sativa ssp japonica ), which completes its life cycle in 9 weeks. We sequenced 1504 mutant lines at 45-fold coverage and identified 91,513 mutations affecting 32,307 genes, i.e., 58% of all rice genes. We detected an average of 61 mutations per line. Mutation types include single-base substitutions, deletions, insertions, inversions, translocations, and tandem duplications. We observed a high proportion of loss-of-function mutations. We identified an inversion affecting a single gene as the causative mutation for the short-grain phenotype in one mutant line. This result reveals the usefulness of the resource for efficient, cost-effective identification of genes conferring specific phenotypes. To facilitate public access to this genetic resource, we established an open access database called KitBase that provides access to sequence data and seed stocks. This population complements other available mutant collections and gene-editing technologies. This work demonstrates how inexpensive next-generation sequencing can be applied to generate a high-density catalog of mutations. © 2017 American Society of Plant Biologists. All rights reserved.
Li, Guotian; Jain, Rashmi; Chern, Mawsheng; ...
2017-06-02
The availability of a whole-genome sequenced mutant population and the cataloging of mutations of each line at a single-nucleotide resolution facilitate functional genomic analysis. To this end, we generated and sequenced a fast-neutron-induced mutant population in the model rice cultivar Kitaake (Oryza sativa ssp japonica), which completes its life cycle in 9 weeks. We sequenced 1504 mutant lines at 45-fold coverage and identified 91,513 mutations affecting 32,307 genes, i.e., 58% of all rice genes. We detected an average of 61 mutations per line. Mutation types include single-base substitutions, deletions, insertions, inversions, translocations, and tandem duplications. We observed a high proportionmore » of loss-of-function mutations. We identified an inversion affecting a single gene as the causative mutation for the short-grain phenotype in one mutant line. This result reveals the usefulness of the resource for efficient, cost-effective identification of genes conferring specific phenotypes. To facilitate public access to this genetic resource, we established an open access database called KitBase that provides access to sequence data and seed stocks. This population complements other available mutant collections and gene-editing technologies. In conclusion, this work demonstrates how inexpensive next-generation sequencing can be applied to generate a high-density catalog of mutations.« less
Zou, Shanmei; Fei, Cong; Wang, Chun; Gao, Zhan; Bao, Yachao; He, Meilin; Wang, Changhai
2016-01-01
Microalgae identification is extremely difficult. The efficiency of DNA barcoding in microalgae identification involves ideal gene markers and approaches employed, which however, is still under the way. Although Scenedesmus has obtained much research in producing lipids its identification is difficult. Here we present a comprehensive coalescent, distance and character-based DNA barcoding for 118 Scenedesmus strains based on rbcL, tufA, ITS and 16S. The four genes, and their combined data rbcL + tufA + ITS + 16S, rbcL + tufA and ITS + 16S were analyzed by all of GMYC, P ID, PTP, ABGD, and character-based barcoding respectively. It was apparent that the three combined gene data showed a higher proportion of resolution success than the single gene. In comparison, the GMYC and PTP analysis produced more taxonomic lineages. The ABGD generated various resolution in discrimination among the single and combined data. The character-based barcoding was proved to be the most effective approach for species discrimination in both single and combined data which produced consistent species identification. All the integrated results recovered 11 species, five out of which were revealed as potential cryptic species. We suggest that the character-based DNA barcoding together with other approaches based on multiple genes and their combined data could be more effective in microalgae diversity revelation. PMID:27827440
Zou, Shanmei; Fei, Cong; Wang, Chun; Gao, Zhan; Bao, Yachao; He, Meilin; Wang, Changhai
2016-11-09
Microalgae identification is extremely difficult. The efficiency of DNA barcoding in microalgae identification involves ideal gene markers and approaches employed, which however, is still under the way. Although Scenedesmus has obtained much research in producing lipids its identification is difficult. Here we present a comprehensive coalescent, distance and character-based DNA barcoding for 118 Scenedesmus strains based on rbcL, tufA, ITS and 16S. The four genes, and their combined data rbcL + tufA + ITS + 16S, rbcL + tufA and ITS + 16S were analyzed by all of GMYC, P ID, PTP, ABGD, and character-based barcoding respectively. It was apparent that the three combined gene data showed a higher proportion of resolution success than the single gene. In comparison, the GMYC and PTP analysis produced more taxonomic lineages. The ABGD generated various resolution in discrimination among the single and combined data. The character-based barcoding was proved to be the most effective approach for species discrimination in both single and combined data which produced consistent species identification. All the integrated results recovered 11 species, five out of which were revealed as potential cryptic species. We suggest that the character-based DNA barcoding together with other approaches based on multiple genes and their combined data could be more effective in microalgae diversity revelation.
Klingler, John; Creasy, Robert; Gao, Lingling; Nair, Ramakrishnan M.; Calix, Alonso Suazo; Jacob, Helen Spafford; Edwards, Owain R.; Singh, Karam B.
2005-01-01
Aphids and related insects feed from a single cell type in plants: the phloem sieve element. Genetic resistance to Acyrthosiphon kondoi Shinji (bluegreen aphid or blue alfalfa aphid) has been identified in Medicago truncatula Gaert. (barrel medic) and backcrossed into susceptible cultivars. The status of M. truncatula as a model legume allows an in-depth study of defense against this aphid at physiological, biochemical, and molecular levels. In this study, two closely related resistant and susceptible genotypes were used to characterize the aphid-resistance phenotype. Resistance conditions antixenosis since migratory aphids were deterred from settling on resistant plants within 6 h of release, preferring to settle on susceptible plants. Analysis of feeding behavior revealed the trait affects A. kondoi at the level of the phloem sieve element. Aphid reproduction on excised shoots demonstrated that resistance requires an intact plant. Antibiosis against A. kondoi is enhanced by prior infestation, indicating induction of this phloem-specific defense. Resistance segregates as a single dominant gene, AKR (Acyrthosiphon kondoi resistance), in two mapping populations, which have been used to map the locus to a region flanked by resistance gene analogs predicted to encode the CC-NBS-LRR subfamily of resistance proteins. This work provides the basis for future molecular analysis of defense against phloem parasitism in a plant model system. PMID:15778464
Roberts, Mark A; Schwartz, Tonia S; Karl, Stephen A
2004-01-01
We assessed the degree of population subdivision among global populations of green sea turtles, Chelonia mydas, using four microsatellite loci. Previously, a single-copy nuclear DNA study indicated significant male-mediated gene flow among populations alternately fixed for different mitochondrial DNA haplotypes and that genetic divergence between populations in the Atlantic and Pacific Oceans was more common than subdivisions among populations within ocean basins. Even so, overall levels of variation at single-copy loci were low and inferences were limited. Here, the markedly more variable microsatellite loci confirm the presence of male-mediated gene flow among populations within ocean basins. This analysis generally confirms the genetic divergence between the Atlantic and Pacific. As with the previous study, phylogenetic analyses of genetic distances based on the microsatellite loci indicate a close genetic relationship among eastern Atlantic and Indian Ocean populations. Unlike the single-copy study, however, the results here cannot be attributed to an artifact of general low variability and likely represent recent or ongoing migration between ocean basins. Sequence analyses of regions flanking the microsatellite repeat reveal considerable amounts of cryptic variation and homoplasy and significantly aid in our understanding of population connectivity. Assessment of the allele frequency distributions indicates that at least some of the loci may not be evolving by the stepwise mutation model. PMID:15126404
Comparative Analysis of the Complete Chloroplast Genome of Four Endangered Herbals of Notopterygium
Yang, Jiao; Yue, Ming; Niu, Chuan; Ma, Xiong-Feng; Li, Zhong-Hu
2017-01-01
Notopterygium H. de Boissieu (Apiaceae) is an endangered perennial herb endemic to China. A good knowledge of phylogenetic evolution and population genomics is conducive to the establishment of effective management and conservation strategies of the genus Notopterygium. In this study, the complete chloroplast (cp) genomes of four Notopterygium species (N. incisum C. C. Ting ex H. T. Chang, N. oviforme R. H. Shan, N. franchetii H. de Boissieu and N. forrestii H. Wolff) were assembled and characterized using next-generation sequencing. We investigated the gene organization, order, size and repeat sequences of the cp genome and constructed the phylogenetic relationships of Notopterygium species based on the chloroplast DNA and nuclear internal transcribed spacer (ITS) sequences. Comparative analysis of plastid genome showed that the cp DNA are the standard double-stranded molecule, ranging from 157,462 bp (N. oviforme) to 159,607 bp (N. forrestii) in length. The circular DNA each contained a large single-copy (LSC) region, a small single-copy (SSC) region, and a pair of inverted repeats (IRs). The cp DNA of four species contained 85 protein-coding genes, 37 transfer RNA (tRNA) genes and 8 ribosomal RNA (rRNA) genes, respectively. We determined the marked conservation of gene content and sequence evolutionary rate in the cp genome of four Notopterygium species. Three genes (psaI, psbI and rpoA) were possibly under positive selection among the four sampled species. Phylogenetic analysis showed that four Notopterygium species formed a monophyletic clade with high bootstrap support. However, the inconsistent interspecific relationships with the genus Notopterygium were identified between the cp DNA and ITS markers. The incomplete lineage sorting, convergence evolution or hybridization, gene infiltration and different sampling strategies among species may have caused the incongruence between the nuclear and cp DNA relationships. The present results suggested that Notopterygium species may have experienced a complex evolutionary history and speciation process. PMID:28422071
Image segmentation and dynamic lineage analysis in single-cell fluorescence microscopy.
Wang, Quanli; Niemi, Jarad; Tan, Chee-Meng; You, Lingchong; West, Mike
2010-01-01
An increasingly common component of studies in synthetic and systems biology is analysis of dynamics of gene expression at the single-cell level, a context that is heavily dependent on the use of time-lapse movies. Extracting quantitative data on the single-cell temporal dynamics from such movies remains a major challenge. Here, we describe novel methods for automating key steps in the analysis of single-cell, fluorescent images-segmentation and lineage reconstruction-to recognize and track individual cells over time. The automated analysis iteratively combines a set of extended morphological methods for segmentation, and uses a neighborhood-based scoring method for frame-to-frame lineage linking. Our studies with bacteria, budding yeast and human cells, demonstrate the portability and usability of these methods, whether using phase, bright field or fluorescent images. These examples also demonstrate the utility of our integrated approach in facilitating analyses of engineered and natural cellular networks in diverse settings. The automated methods are implemented in freely available, open-source software.
Pathway analyses and understanding disease associations
Liu, Yu; Chance, Mark R
2013-01-01
High throughput technologies have been applied to investigate the underlying mechanisms of complex diseases, identify disease-associations and help to improve treatment. However it is challenging to derive biological insight from conventional single gene based analysis of “omics” data from high throughput experiments due to sample and patient heterogeneity. To address these challenges, many novel pathway and network based approaches were developed to integrate various “omics” data, such as gene expression, copy number alteration, Genome Wide Association Studies, and interaction data. This review will cover recent methodological developments in pathway analysis for the detection of dysregulated interactions and disease-associated subnetworks, prioritization of candidate disease genes, and disease classifications. For each application, we will also discuss the associated challenges and potential future directions. PMID:24319650
Wang, Nan; Liu, Zhiyong; Zhang, Yun; Li, Chengyu; Feng, Hui
2018-03-01
Using bulked segregant analysis combined with next-generation sequencing, we delimited the Brnye1 gene responsible for the stay-green trait of nye in pakchoi. Sequence analysis identified Bra019346 as the candidate gene. "Stay-green" refers to a plant trait whereby leaves remain green during senescence. This trait is useful in the cultivation of pakchoi (Brassica campestris L. ssp. chinensis), which is marketed as a green leaf product. This study aimed to identify the gene responsible for the stay-green trait in pakchoi. We identified a stay-green mutant in pakchoi, which we termed "nye". Genetic analysis revealed that the stay-green trait is controlled by a single recessive gene, Brnye1. Using the BSA-seq method, a 3.0-Mb candidate region was mapped on chromosome A03, which helped us localize Brnye1 to an 81.01-kb interval between SSR markers SSRWN27 and SSRWN30 via linkage analysis in an F 2 population. We identified 12 genes in this region, 11 of which were annotated based on the Brassica rapa annotation database, and one was a functionally unknown gene. An orthologous gene of the Arabidopsis gene AtNYE1, Bra019346, was identified as the potential candidate for Brnye1. Sequence analysis revealed a 40-bp insertion in the second exon of Bra019346 in nye, which generated the TAA stop codon. A candidate gene-specific Indel marker in 1561 F 2 individuals showed perfect cosegregation with Brnye1 in the nye mutant. These results provide a foundation for uncovering the molecular mechanism of the stay-green trait in pakchoi.
A genome-wide association study of corneal astigmatism: The CREAM Consortium.
Shah, Rupal L; Li, Qing; Zhao, Wanting; Tedja, Milly S; Tideman, J Willem L; Khawaja, Anthony P; Fan, Qiao; Yazar, Seyhan; Williams, Katie M; Verhoeven, Virginie J M; Xie, Jing; Wang, Ya Xing; Hess, Moritz; Nickels, Stefan; Lackner, Karl J; Pärssinen, Olavi; Wedenoja, Juho; Biino, Ginevra; Concas, Maria Pina; Uitterlinden, André; Rivadeneira, Fernando; Jaddoe, Vincent W V; Hysi, Pirro G; Sim, Xueling; Tan, Nicholas; Tham, Yih-Chung; Sensaki, Sonoko; Hofman, Albert; Vingerling, Johannes R; Jonas, Jost B; Mitchell, Paul; Hammond, Christopher J; Höhn, René; Baird, Paul N; Wong, Tien-Yin; Cheng, Chinfsg-Yu; Teo, Yik Ying; Mackey, David A; Williams, Cathy; Saw, Seang-Mei; Klaver, Caroline C W; Guggenheim, Jeremy A; Bailey-Wilson, Joan E
2018-01-01
To identify genes and genetic markers associated with corneal astigmatism. A meta-analysis of genome-wide association studies (GWASs) of corneal astigmatism undertaken for 14 European ancestry (n=22,250) and 8 Asian ancestry (n=9,120) cohorts was performed by the Consortium for Refractive Error and Myopia. Cases were defined as having >0.75 diopters of corneal astigmatism. Subsequent gene-based and gene-set analyses of the meta-analyzed results of European ancestry cohorts were performed using VEGAS2 and MAGMA software. Additionally, estimates of single nucleotide polymorphism (SNP)-based heritability for corneal and refractive astigmatism and the spherical equivalent were calculated for Europeans using LD score regression. The meta-analysis of all cohorts identified a genome-wide significant locus near the platelet-derived growth factor receptor alpha ( PDGFRA ) gene: top SNP: rs7673984, odds ratio=1.12 (95% CI:1.08-1.16), p=5.55×10 -9 . No other genome-wide significant loci were identified in the combined analysis or European/Asian ancestry-specific analyses. Gene-based analysis identified three novel candidate genes for corneal astigmatism in Europeans-claudin-7 ( CLDN7 ), acid phosphatase 2, lysosomal ( ACP2 ), and TNF alpha-induced protein 8 like 3 ( TNFAIP8L3 ). In addition to replicating a previously identified genome-wide significant locus for corneal astigmatism near the PDGFRA gene, gene-based analysis identified three novel candidate genes, CLDN7 , ACP2 , and TNFAIP8L3 , that warrant further investigation to understand their role in the pathogenesis of corneal astigmatism. The much lower number of genetic variants and genes demonstrating an association with corneal astigmatism compared to published spherical equivalent GWAS analyses suggest a greater influence of rare genetic variants, non-additive genetic effects, or environmental factors in the development of astigmatism.
Tang, Kai; Dong, Chun-Juan; Liu, Jin-Yuan
2016-01-01
In this study, 40 phospholipase D (PLD) genes were identified from allotetraploid cotton Gossypium hirsutum, and 20 PLD genes were examined in diploid cotton Gossypium raimondii. Combining with 19 previously identified Gossypium arboreum PLD genes, a comparative analysis was performed among the PLD gene families among allotetraploid and two diploid cottons. Based on the orthologous relationships, we found that almost each G. hirsutum PLD had a corresponding homolog in the G. arboreum and G. raimondii genomes, except for GhPLDβ3A, whose homolog GaPLDβ3 may have been lost during the evolution of G. arboreum after the interspecific hybridization. Phylogenetic analysis showed that all of the cotton PLDs were unevenly classified into six numbered subgroups: α, β/γ, δ, ε, ζ and φ. An N-terminal C2 domain was found in the α, β/γ, δ and ε subgroups, while phox homology (PX) and pleckstrin homology (PH) domains were identified in the ζ subgroup. The subgroup φ possessed a single peptide instead of a functional domain. In each phylogenetic subgroup, the PLDs showed high conservation in gene structure and amino acid sequences in functional domains. The expansion of GhPLD and GrPLD gene families were mainly attributed to segmental duplication and partly attributed to tandem duplication. Furthermore, purifying selection played a critical role in the evolution of PLD genes in cotton. Quantitative RT-PCR documented that allotetraploid cotton PLD genes were broadly expressed and each had a unique spatial and developmental expression pattern, indicating their functional diversification in cotton growth and development. Further analysis of cis-regulatory elements elucidated transcriptional regulations and potential functions. Our comparative analysis provided valuable information for understanding the putative functions of the PLD genes in cotton fiber. PMID:27213891
Safa, Ahmad Hosseini; Harandi, Majid Fasihi; Tajaddini, Mohammadhasan; Rostami-Nejad, Mohammad; Mohtashami-Pour, Mehdi; Pestehchian, Nader
2016-07-22
High-resolution melting (HRM) is a reliable and sensitive scanning method to detect variation in DNA sequences. We used this method to better understand the epidemiology and transmission of Echinococcus granulosus. We tested the use of HRM to discriminate the genotypes of E. granulosus and E. canadensis. One hundred forty-one hydatid cysts were collected from slaughtered animals in different parts of Isfahan-Iran in 2013. After DNA extraction, the mitochondrial cytochrome c oxidase subunit 1 (cox1) gene was amplified using PCR coupled with the HRM curve. The result of HRM analysis using partial the sequences of cox1 gene revealed that 93, 35, and 2 isolates were identified as G1, G3, and G6 genotypes, respectively. A single nucleotide polymorphism (SNP) was found in locus 9867 of the cox1 gene. This is a critical locus for the differentiation between the G6 and G7 genotypes. In the phylogenic tree, the sample with a SNP was located between the G6 and G7 genotypes, which suggest that this isolate has a G6/G7 genotype. The HRM analysis developed in the present study provides a powerful technique for molecular and epidemiological studies on echinococcosis in humans and animals.
The oculocerebrorenal syndrome gene product is a 105-kD protein localized to the Golgi complex.
Olivos-Glander, I M; Jänne, P A; Nussbaum, R L
1995-01-01
The oculocerebrorenal syndrome of Lowe (OCRL) is a multisystem disorder affecting the lens, kidney, and CNS. The predicted amino acid sequence of the OCRL gene, OCRL-1, was used to develop antibodies against the OCRL-1 protein. Western blot analysis using affinity-purified serum against the amino terminus of the OCRL-1 gene product (ocrl-1) demonstrates a single protein of 105 kD in fibroblasts of a normal individual that is absent in fibroblasts of an OCRL patient who lacks OCRL-1 transcript. A single protein with the same electrophoretic mobility is found by western analysis in various human cultured cell lines, and approximately the same size protein is also found in all mouse tissues tested. Northern analysis of various human and mouse tissues demonstrate that OCRL-1 transcript is expressed in nearly all tissues examined. By immunofluorescence, the ocrl-1 antibody stains a juxtanuclear region in normal fibroblast cells, while no specific staining is evident in the OCRL patient who produces no transcript. Colocalization of the ocrl-1 protein to the Golgi complex was demonstrated using a known monoclonal antibody against a Golgi-specific coat protein, beta-COP (beta coatomer protein). Images Figure 1 Figure 2 Figure 3 Figure 4 Figure 5 Figure 6 PMID:7573041
Naoumkina, Marina; Bechere, Efrem; Fang, David D; Thyssen, Gregory N; Florane, Christopher B
2017-07-01
In this work we describe a chemically-induced short fiber mutant cotton line, Ligon-lintless-y (li y ), which is controlled by a single recessive locus and affects multiple traits, including height of the plant, and length and maturity of fiber. An RNAseq analysis was used to evaluate global transcriptional changes during cotton fiber development at 3, 8 and 16days post anthesis. We found that 613, 2629 and 3397 genes were significantly down-regulated, while 2700, 477 and 3260 were significantly up-regulated in li y at 3, 8 and 16 DPA. Gene set enrichment analysis revealed that many metabolic pathways, including carbohydrate, cell wall, hormone metabolism and transport were substantially altered in li y developing fibers. We discuss perturbed expression of genes involved in signal transduction and biosynthesis of phytohormones, such as auxin, abscisic acid, gibberellin and ethylene. The results of this study provide new insights into transcriptional regulation of cotton fiber development. Published by Elsevier Inc.