Sample records for single gene analyses

  1. Single Cell Gene Expression Profiling of Skeletal Muscle-Derived Cells.

    PubMed

    Gatto, Sole; Puri, Pier Lorenzo; Malecova, Barbora

    2017-01-01

    Single cell gene expression profiling is a fundamental tool for studying the heterogeneity of a cell population by addressing the phenotypic and functional characteristics of each cell. Technological advances that have coupled microfluidic technologies with high-throughput quantitative RT-PCR analyses have enabled detailed analyses of single cells in various biological contexts. In this chapter, we describe the procedure for isolating the skeletal muscle interstitial cells termed Fibro-Adipogenic Progenitors (FAPs ) and their gene expression profiling at the single cell level. Moreover, we accompany our bench protocol with bioinformatics analysis designed to process raw data as well as to visualize single cell gene expression data. Single cell gene expression profiling is therefore a useful tool in the investigation of FAPs heterogeneity and their contribution to muscle homeostasis.

  2. Non-biased and efficient global amplification of a single-cell cDNA library

    PubMed Central

    Huang, Huan; Goto, Mari; Tsunoda, Hiroyuki; Sun, Lizhou; Taniguchi, Kiyomi; Matsunaga, Hiroko; Kambara, Hideki

    2014-01-01

    Analysis of single-cell gene expression promises a more precise understanding of molecular mechanisms of a living system. Most techniques only allow studies of the expressions for limited numbers of gene species. When amplification of cDNA was carried out for analysing more genes, amplification biases were frequently reported. A non-biased and efficient global-amplification method, which uses a single-cell cDNA library immobilized on beads, was developed for analysing entire gene expressions for single cells. Every step in this analysis from reverse transcription to cDNA amplification was optimized. By removing degrading excess primers, the bias due to the digestion of cDNA was prevented. Since the residual reagents, which affect the efficiency of each subsequent reaction, could be removed by washing beads, the conditions for uniform and maximized amplification of cDNAs were achieved. The differences in the amplification rates for randomly selected eight genes were within 1.5-folds, which could be negligible for most of the applications of single-cell analysis. The global amplification gives a large amount of amplified cDNA (>100 μg) from a single cell (2-pg mRNA), and that amount is enough for downstream analysis. The proposed global-amplification method was used to analyse transcript ratios of multiple cDNA targets (from several copies to several thousand copies) quantitatively. PMID:24141095

  3. Obesity modulates inflammation and lipid metabolism oocyte gene expression: A single cell transcriptome perspective

    USDA-ARS?s Scientific Manuscript database

    This study aimed to compare oocyte gene expression profiles and follicular fluid (FF) content from overweight/obese (OW) women and normal weight (NW) women who were undergoing fertility treatments. Using single cell transcriptomic analyses, we investigated oocyte gene expression using RNA-seq. Serum...

  4. A highly sensitive and accurate gene expression analysis by sequencing ("bead-seq") for a single cell.

    PubMed

    Matsunaga, Hiroko; Goto, Mari; Arikawa, Koji; Shirai, Masataka; Tsunoda, Hiroyuki; Huang, Huan; Kambara, Hideki

    2015-02-15

    Analyses of gene expressions in single cells are important for understanding detailed biological phenomena. Here, a highly sensitive and accurate method by sequencing (called "bead-seq") to obtain a whole gene expression profile for a single cell is proposed. A key feature of the method is to use a complementary DNA (cDNA) library on magnetic beads, which enables adding washing steps to remove residual reagents in a sample preparation process. By adding the washing steps, the next steps can be carried out under the optimal conditions without losing cDNAs. Error sources were carefully evaluated to conclude that the first several steps were the key steps. It is demonstrated that bead-seq is superior to the conventional methods for single-cell gene expression analyses in terms of reproducibility, quantitative accuracy, and biases caused during sample preparation and sequencing processes. Copyright © 2014 Elsevier Inc. All rights reserved.

  5. Single-Copy Genes as Molecular Markers for Phylogenomic Studies in Seed Plants

    PubMed Central

    De La Torre, Amanda R.; Sterck, Lieven; Cánovas, Francisco M.; Avila, Concepción; Merino, Irene; Cabezas, José Antonio; Cervera, María Teresa; Ingvarsson, Pär K.

    2017-01-01

    Phylogenetic relationships among seed plant taxa, especially within the gymnosperms, remain contested. In contrast to angiosperms, for which several genomic, transcriptomic and phylogenetic resources are available, there are few, if any, molecular markers that allow broad comparisons among gymnosperm species. With few gymnosperm genomes available, recently obtained transcriptomes in gymnosperms are a great addition to identifying single-copy gene families as molecular markers for phylogenomic analysis in seed plants. Taking advantage of an increasing number of available genomes and transcriptomes, we identified single-copy genes in a broad collection of seed plants and used these to infer phylogenetic relationships between major seed plant taxa. This study aims at extending the current phylogenetic toolkit for seed plants, assessing its ability for resolving seed plant phylogeny, and discussing potential factors affecting phylogenetic reconstruction. In total, we identified 3,072 single-copy genes in 31 gymnosperms and 2,156 single-copy genes in 34 angiosperms. All studied seed plants shared 1,469 single-copy genes, which are generally involved in functions like DNA metabolism, cell cycle, and photosynthesis. A selected set of 106 single-copy genes provided good resolution for the seed plant phylogeny except for gnetophytes. Although some of our analyses support a sister relationship between gnetophytes and other gymnosperms, phylogenetic trees from concatenated alignments without 3rd codon positions and amino acid alignments under the CAT + GTR model, support gnetophytes as a sister group to Pinaceae. Our phylogenomic analyses demonstrate that, in general, single-copy genes can uncover both recent and deep divergences of seed plant phylogeny. PMID:28460034

  6. Contentious relationships in phylogenomic studies can be driven by a handful of genes

    PubMed Central

    Shen, Xing-Xing; Hittinger, Chris Todd; Rokas, Antonis

    2017-01-01

    Phylogenomic studies have resolved countless branches of the tree of life (ToL), but remain strongly contradictory on certain, contentious relationships. Here, we employ a maximum likelihood framework to quantify the distribution of phylogenetic signal among genes and sites for 17 contentious branches and 6 well-established control branches in plant, animal, and fungal phylogenomic data matrices. We find that resolution in some of these 17 branches rests on a single gene or a few sites, and that removal of a single gene in concatenation analyses or a single site from every gene in coalescence-based analyses diminishes support and can alter the inferred topology. These results suggest that tiny subsets of very large data matrices drive the resolution of specific internodes, providing a dissection of the distribution of support and observed incongruence in phylogenomic analyses. We submit that quantifying the distribution of phylogenetic signal in phylogenomic data is essential for evaluating whether branches, especially contentious ones, are truly resolved. Finally, we offer one detailed example of such an evaluation for the controversy regarding the earliest-branching metazoan phylum, where examination of the distributions of gene-wise and site-wise phylogenetic signal across 8 data matrices consistently supports ctenophores as sister group to all other metazoans. PMID:28812701

  7. Beta-Poisson model for single-cell RNA-seq data analyses.

    PubMed

    Vu, Trung Nghia; Wills, Quin F; Kalari, Krishna R; Niu, Nifang; Wang, Liewei; Rantalainen, Mattias; Pawitan, Yudi

    2016-07-15

    Single-cell RNA-sequencing technology allows detection of gene expression at the single-cell level. One typical feature of the data is a bimodality in the cellular distribution even for highly expressed genes, primarily caused by a proportion of non-expressing cells. The standard and the over-dispersed gamma-Poisson models that are commonly used in bulk-cell RNA-sequencing are not able to capture this property. We introduce a beta-Poisson mixture model that can capture the bimodality of the single-cell gene expression distribution. We further integrate the model into the generalized linear model framework in order to perform differential expression analyses. The whole analytical procedure is called BPSC. The results from several real single-cell RNA-seq datasets indicate that ∼90% of the transcripts are well characterized by the beta-Poisson model; the model-fit from BPSC is better than the fit of the standard gamma-Poisson model in > 80% of the transcripts. Moreover, in differential expression analyses of simulated and real datasets, BPSC performs well against edgeR, a conventional method widely used in bulk-cell RNA-sequencing data, and against scde and MAST, two recent methods specifically designed for single-cell RNA-seq data. An R package BPSC for model fitting and differential expression analyses of single-cell RNA-seq data is available under GPL-3 license at https://github.com/nghiavtr/BPSC CONTACT: yudi.pawitan@ki.se or mattias.rantalainen@ki.se Supplementary data are available at Bioinformatics online. © The Author 2016. Published by Oxford University Press. All rights reserved. For Permissions, please e-mail: journals.permissions@oup.com.

  8. Single-cell sequencing deciphers a convergent evolution of copy number alterations from primary to circulating tumor cells.

    PubMed

    Gao, Yan; Ni, Xiaohui; Guo, Hua; Su, Zhe; Ba, Yi; Tong, Zhongsheng; Guo, Zhi; Yao, Xin; Chen, Xixi; Yin, Jian; Yan, Zhao; Guo, Lin; Liu, Ying; Bai, Fan; Xie, X Sunney; Zhang, Ning

    2017-08-01

    Copy number alteration (CNA) is a major contributor to genome instability, a hallmark of cancer. Here, we studied genomic alterations in single primary tumor cells and circulating tumor cells (CTCs) from the same patient. Single-nucleotide variants (SNVs) in single cells from both samples occurred sporadically, whereas CNAs among primary tumor cells emerged accumulatively rather than abruptly, converging toward the CNA in CTCs. Focal CNAs affecting the MYC gene and the PTEN gene were observed only in a minor portion of primary tumor cells but were present in all CTCs, suggesting a strong selection toward metastasis. Single-cell structural variant (SV) analyses revealed a two-step mechanism, a complex rearrangement followed by gene amplification, for the simultaneous formation of anomalous CNAs in multiple chromosome regions. Integrative CNA analyses of 97 CTCs from 23 patients confirmed the convergence of CNAs and revealed single, concurrent, and mutually exclusive CNAs that could be the driving events in cancer metastasis. © 2017 Gao et al.; Published by Cold Spring Harbor Laboratory Press.

  9. Single-cell analyses of transcriptional heterogeneity during drug tolerance transition in cancer cells by RNA sequencing.

    PubMed

    Lee, Mei-Chong Wendy; Lopez-Diaz, Fernando J; Khan, Shahid Yar; Tariq, Muhammad Akram; Dayn, Yelena; Vaske, Charles Joseph; Radenbaugh, Amie J; Kim, Hyunsung John; Emerson, Beverly M; Pourmand, Nader

    2014-11-04

    The acute cellular response to stress generates a subpopulation of reversibly stress-tolerant cells under conditions that are lethal to the majority of the population. Stress tolerance is attributed to heterogeneity of gene expression within the population to ensure survival of a minority. We performed whole transcriptome sequencing analyses of metastatic human breast cancer cells subjected to the chemotherapeutic agent paclitaxel at the single-cell and population levels. Here we show that specific transcriptional programs are enacted within untreated, stressed, and drug-tolerant cell groups while generating high heterogeneity between single cells within and between groups. We further demonstrate that drug-tolerant cells contain specific RNA variants residing in genes involved in microtubule organization and stabilization, as well as cell adhesion and cell surface signaling. In addition, the gene expression profile of drug-tolerant cells is similar to that of untreated cells within a few doublings. Thus, single-cell analyses reveal the dynamics of the stress response in terms of cell-specific RNA variants driving heterogeneity, the survival of a minority population through generation of specific RNA variants, and the efficient reconversion of stress-tolerant cells back to normalcy.

  10. Single-cell analyses of transcriptional heterogeneity during drug tolerance transition in cancer cells by RNA sequencing

    PubMed Central

    Lee, Mei-Chong Wendy; Lopez-Diaz, Fernando J.; Khan, Shahid Yar; Tariq, Muhammad Akram; Dayn, Yelena; Vaske, Charles Joseph; Radenbaugh, Amie J.; Kim, Hyunsung John; Emerson, Beverly M.; Pourmand, Nader

    2014-01-01

    The acute cellular response to stress generates a subpopulation of reversibly stress-tolerant cells under conditions that are lethal to the majority of the population. Stress tolerance is attributed to heterogeneity of gene expression within the population to ensure survival of a minority. We performed whole transcriptome sequencing analyses of metastatic human breast cancer cells subjected to the chemotherapeutic agent paclitaxel at the single-cell and population levels. Here we show that specific transcriptional programs are enacted within untreated, stressed, and drug-tolerant cell groups while generating high heterogeneity between single cells within and between groups. We further demonstrate that drug-tolerant cells contain specific RNA variants residing in genes involved in microtubule organization and stabilization, as well as cell adhesion and cell surface signaling. In addition, the gene expression profile of drug-tolerant cells is similar to that of untreated cells within a few doublings. Thus, single-cell analyses reveal the dynamics of the stress response in terms of cell-specific RNA variants driving heterogeneity, the survival of a minority population through generation of specific RNA variants, and the efficient reconversion of stress-tolerant cells back to normalcy. PMID:25339441

  11. A multilocus perspective on the speciation history of a North American aridland toad (Anaxyrus punctatus).

    PubMed

    Bryson, Robert W; Jaeger, Jef R; Lemos-Espinal, Julio A; Lazcano, David

    2012-09-01

    Interpretations of phylogeographic patterns can change when analyses shift from single gene-tree to multilocus coalescent analyses. Using multilocus coalescent approaches, a species tree and divergence times can be estimated from a set of gene trees while accounting for gene-tree stochasticity. We utilized the conceptual strengths of a multilocus coalescent approach coupled with complete range-wide sampling to examine the speciation history of a broadly distributed, North American warm-desert toad, Anaxyrus punctatus. Phylogenetic analyses provided strong support for three major lineages within A. punctatus. Each lineage broadly corresponded to one of three desert regions. Early speciation in A. punctatus appeared linked to late Miocene-Pliocene development of the Baja California peninsula. This event was likely followed by a Pleistocene divergence associated with the separation of the Chihuahuan and Sonoran Deserts. Our multilocus coalescent-based reconstruction provides an informative contrast to previous single gene-tree estimates of the evolutionary history of A. punctatus. Copyright © 2012 Elsevier Inc. All rights reserved.

  12. Strategies for comparing gene expression profiles from different microarray platforms: application to a case-control experiment.

    PubMed

    Severgnini, Marco; Bicciato, Silvio; Mangano, Eleonora; Scarlatti, Francesca; Mezzelani, Alessandra; Mattioli, Michela; Ghidoni, Riccardo; Peano, Clelia; Bonnal, Raoul; Viti, Federica; Milanesi, Luciano; De Bellis, Gianluca; Battaglia, Cristina

    2006-06-01

    Meta-analysis of microarray data is increasingly important, considering both the availability of multiple platforms using disparate technologies and the accumulation in public repositories of data sets from different laboratories. We addressed the issue of comparing gene expression profiles from two microarray platforms by devising a standardized investigative strategy. We tested this procedure by studying MDA-MB-231 cells, which undergo apoptosis on treatment with resveratrol. Gene expression profiles were obtained using high-density, short-oligonucleotide, single-color microarray platforms: GeneChip (Affymetrix) and CodeLink (Amersham). Interplatform analyses were carried out on 8414 common transcripts represented on both platforms, as identified by LocusLink ID, representing 70.8% and 88.6% of annotated GeneChip and CodeLink features, respectively. We identified 105 differentially expressed genes (DEGs) on CodeLink and 42 DEGs on GeneChip. Among them, only 9 DEGs were commonly identified by both platforms. Multiple analyses (BLAST alignment of probes with target sequences, gene ontology, literature mining, and quantitative real-time PCR) permitted us to investigate the factors contributing to the generation of platform-dependent results in single-color microarray experiments. An effective approach to cross-platform comparison involves microarrays of similar technologies, samples prepared by identical methods, and a standardized battery of bioinformatic and statistical analyses.

  13. A single-gene explanation for the probability of having idiopathic talipes equinovarus

    DOE Office of Scientific and Technical Information (OSTI.GOV)

    Rebbeck, T.R.; Buetow, K.H.; Dietz, F.R.

    1993-11-01

    It has been hypothesized that the pathogenesis of idiopathic talipes equinovarus (ITEV, or clubfoot) is explained by genetic regulation of development and growth. The objective of the present study was to determine whether a single Mendelian gene explains the probability of having ITEV in a sample of 143 Caucasian pedigrees from Iowa. These pedigrees were ascertained through probands with ITEV. Complex segregation analyses were undertaken using a regressive logistic model. The results of these analyses strongly rejected the hypotheses that the probability of having ITEV in these pedigrees was explained by a non-Mendelian pattern of transmission with residual sibling correlation,more » a nontransmitted (environmental) factor with residual sibling correlation, or residual sibling correlation alone. These results were consistent with the hypothesis that the probability of having ITEV was explained by the Mendelian segregation of a single gene with two alleles plus the effects of some unmeasured factor(s) shared among siblings. The segregation of alleles at this single Mendelian gene indicated that the disease allele A was incompletely dominant to the nondisease allele B. The disease allele A, associated with ITEV affection, was estimated to occur in the population of inference with a frequency of .007. After adjusting for sex-specific population incidences of ITEV, the conditional probability (penetrance) of ITEV affection given the AA, AB, and BB genotypes was computed to be 1.0, 0.039, and .0006, respectively. Individual pedigrees in this sample that most strongly supported the single Mendelian gene hypothesis were identified. These pedigrees are candidates for genetic linkage analyses or DNA association studies. 35 refs., 2 figs., 7 tabs.« less

  14. Sampling strategies for improving tree accuracy and phylogenetic analyses: a case study in ciliate protists, with notes on the genus Paramecium.

    PubMed

    Yi, Zhenzhen; Strüder-Kypke, Michaela; Hu, Xiaozhong; Lin, Xiaofeng; Song, Weibo

    2014-02-01

    In order to assess how dataset-selection for multi-gene analyses affects the accuracy of inferred phylogenetic trees in ciliates, we chose five genes and the genus Paramecium, one of the most widely used model protist genera, and compared tree topologies of the single- and multi-gene analyses. Our empirical study shows that: (1) Using multiple genes improves phylogenetic accuracy, even when their one-gene topologies are in conflict with each other. (2) The impact of missing data on phylogenetic accuracy is ambiguous: resolution power and topological similarity, but not number of represented taxa, are the most important criteria of a dataset for inclusion in concatenated analyses. (3) As an example, we tested the three classification models of the genus Paramecium with a multi-gene based approach, and only the monophyly of the subgenus Paramecium is supported. Copyright © 2013 Elsevier Inc. All rights reserved.

  15. Gene and pathway level analyses of germline DNA-repair gene variants and prostate cancer susceptibility using the iCOGS-genotyping array.

    PubMed

    Saunders, Edward J; Dadaev, Tokhir; Leongamornlert, Daniel A; Al Olama, Ali Amin; Benlloch, Sara; Giles, Graham G; Wiklund, Fredrik; Gronberg, Henrik; Haiman, Christopher A; Schleutker, Johanna; Nordestgaard, Borge G; Travis, Ruth C; Neal, David; Pasayan, Nora; Khaw, Kay-Tee; Stanford, Janet L; Blot, William J; Thibodeau, Stephen N; Maier, Christiane; Kibel, Adam S; Cybulski, Cezary; Cannon-Albright, Lisa; Brenner, Hermann; Park, Jong Y; Kaneva, Radka; Batra, Jyotsna; Teixeira, Manuel R; Pandha, Hardev; Govindasami, Koveela; Muir, Ken; Easton, Douglas F; Eeles, Rosalind A; Kote-Jarai, Zsofia

    2016-04-12

    Germline mutations within DNA-repair genes are implicated in susceptibility to multiple forms of cancer. For prostate cancer (PrCa), rare mutations in BRCA2 and BRCA1 give rise to moderately elevated risk, whereas two of B100 common, low-penetrance PrCa susceptibility variants identified so far by genome-wide association studies implicate RAD51B and RAD23B. Genotype data from the iCOGS array were imputed to the 1000 genomes phase 3 reference panel for 21 780 PrCa cases and 21 727 controls from the Prostate Cancer Association Group to Investigate Cancer Associated Alterations in the Genome (PRACTICAL) consortium. We subsequently performed single variant, gene and pathway-level analyses using 81 303 SNPs within 20 Kb of a panel of 179 DNA-repair genes. Single SNP analyses identified only the previously reported association with RAD51B. Gene-level analyses using the SKAT-C test from the SNP-set (Sequence) Kernel Association Test (SKAT) identified a significant association with PrCa for MSH5. Pathway-level analyses suggested a possible role for the translesion synthesis pathway in PrCa risk and Homologous recombination/Fanconi Anaemia pathway for PrCa aggressiveness, even though after adjustment for multiple testing these did not remain significant. MSH5 is a novel candidate gene warranting additional follow-up as a prospective PrCa-risk locus. MSH5 has previously been reported as a pleiotropic susceptibility locus for lung, colorectal and serous ovarian cancers.

  16. Insertional Mutagenesis by CRISPR/Cas9 Ribonucleoprotein Gene Editing in Cells Targeted for Point Mutation Repair Directed by Short Single-Stranded DNA Oligonucleotides.

    PubMed

    Rivera-Torres, Natalia; Banas, Kelly; Bialk, Pawel; Bloh, Kevin M; Kmiec, Eric B

    2017-01-01

    CRISPR/Cas9 and single-stranded DNA oligonucleotides (ssODNs) have been used to direct the repair of a single base mutation in human genes. Here, we examine a method designed to increase the precision of RNA guided genome editing in human cells by utilizing a CRISPR/Cas9 ribonucleoprotein (RNP) complex to initiate DNA cleavage. The RNP is assembled in vitro and induces a double stranded break at a specific site surrounding the mutant base designated for correction by the ssODN. We use an integrated mutant eGFP gene, bearing a single base change rendering the expressed protein nonfunctional, as a single copy target in HCT 116 cells. We observe significant gene correction activity of the mutant base, promoted by the RNP and single-stranded DNA oligonucleotide with validation through genotypic and phenotypic readout. We demonstrate that all individual components must be present to obtain successful gene editing. Importantly, we examine the genotype of individually sorted corrected and uncorrected clonally expanded cell populations for the mutagenic footprint left by the action of these gene editing tools. While the DNA sequence of the corrected population is exact with no adjacent sequence modification, the uncorrected population exhibits heterogeneous mutagenicity with a wide variety of deletions and insertions surrounding the target site. We designate this type of DNA aberration as on-site mutagenicity. Analyses of two clonal populations bearing specific DNA insertions surrounding the target site, indicate that point mutation repair has occurred at the level of the gene. The phenotype, however, is not rescued because a section of the single-stranded oligonucleotide has been inserted altering the reading frame and generating truncated proteins. These data illustrate the importance of analysing mutagenicity in uncorrected cells. Our results also form the basis of a simple model for point mutation repair directed by a short single-stranded DNA oligonucleotides and CRISPR/Cas9 ribonucleoprotein complex.

  17. Insertional Mutagenesis by CRISPR/Cas9 Ribonucleoprotein Gene Editing in Cells Targeted for Point Mutation Repair Directed by Short Single-Stranded DNA Oligonucleotides

    PubMed Central

    Rivera-Torres, Natalia; Bialk, Pawel; Bloh, Kevin M.; Kmiec, Eric B.

    2017-01-01

    CRISPR/Cas9 and single-stranded DNA oligonucleotides (ssODNs) have been used to direct the repair of a single base mutation in human genes. Here, we examine a method designed to increase the precision of RNA guided genome editing in human cells by utilizing a CRISPR/Cas9 ribonucleoprotein (RNP) complex to initiate DNA cleavage. The RNP is assembled in vitro and induces a double stranded break at a specific site surrounding the mutant base designated for correction by the ssODN. We use an integrated mutant eGFP gene, bearing a single base change rendering the expressed protein nonfunctional, as a single copy target in HCT 116 cells. We observe significant gene correction activity of the mutant base, promoted by the RNP and single-stranded DNA oligonucleotide with validation through genotypic and phenotypic readout. We demonstrate that all individual components must be present to obtain successful gene editing. Importantly, we examine the genotype of individually sorted corrected and uncorrected clonally expanded cell populations for the mutagenic footprint left by the action of these gene editing tools. While the DNA sequence of the corrected population is exact with no adjacent sequence modification, the uncorrected population exhibits heterogeneous mutagenicity with a wide variety of deletions and insertions surrounding the target site. We designate this type of DNA aberration as on-site mutagenicity. Analyses of two clonal populations bearing specific DNA insertions surrounding the target site, indicate that point mutation repair has occurred at the level of the gene. The phenotype, however, is not rescued because a section of the single-stranded oligonucleotide has been inserted altering the reading frame and generating truncated proteins. These data illustrate the importance of analysing mutagenicity in uncorrected cells. Our results also form the basis of a simple model for point mutation repair directed by a short single-stranded DNA oligonucleotides and CRISPR/Cas9 ribonucleoprotein complex. PMID:28052104

  18. Population Distribution Analyses Reveal a Hierarchy of Molecular Players Underlying Parallel Endocytic Pathways

    PubMed Central

    Gupta, Gagan D.; Howes, Mark T.; Chandran, Ruma; Das, Anupam; Menon, Sindhu; Parton, Robert G.; Sowdhamini, R.; Thattai, Mukund; Mayor, Satyajit

    2014-01-01

    Single-cell-resolved measurements reveal heterogeneous distributions of clathrin-dependent (CD) and -independent (CLIC/GEEC: CG) endocytic activity in Drosophila cell populations. dsRNA-mediated knockdown of core versus peripheral endocytic machinery induces strong changes in the mean, or subtle changes in the shapes of these distributions, respectively. By quantifying these subtle shape changes for 27 single-cell features which report on endocytic activity and cell morphology, we organize 1072 Drosophila genes into a tree-like hierarchy. We find that tree nodes contain gene sets enriched in functional classes and protein complexes, providing a portrait of core and peripheral control of CD and CG endocytosis. For 470 genes we obtain additional features from separate assays and classify them into early- or late-acting genes of the endocytic pathways. Detailed analyses of specific genes at intermediate levels of the tree suggest that Vacuolar ATPase and lysosomal genes involved in vacuolar biogenesis play an evolutionarily conserved role in CG endocytosis. PMID:24971745

  19. Single-cell transcriptomics for microbial eukaryotes.

    PubMed

    Kolisko, Martin; Boscaro, Vittorio; Burki, Fabien; Lynn, Denis H; Keeling, Patrick J

    2014-11-17

    One of the greatest hindrances to a comprehensive understanding of microbial genomics, cell biology, ecology, and evolution is that most microbial life is not in culture. Solutions to this problem have mainly focused on whole-community surveys like metagenomics, but these analyses inevitably loose information and present particular challenges for eukaryotes, which are relatively rare and possess large, gene-sparse genomes. Single-cell analyses present an alternative solution that allows for specific species to be targeted, while retaining information on cellular identity, morphology, and partitioning of activities within microbial communities. Single-cell transcriptomics, pioneered in medical research, offers particular potential advantages for uncultivated eukaryotes, but the efficiency and biases have not been tested. Here we describe a simple and reproducible method for single-cell transcriptomics using manually isolated cells from five model ciliate species; we examine impacts of amplification bias and contamination, and compare the efficacy of gene discovery to traditional culture-based transcriptomics. Gene discovery using single-cell transcriptomes was found to be comparable to mass-culture methods, suggesting single-cell transcriptomics is an efficient entry point into genomic data from the vast majority of eukaryotic biodiversity. Copyright © 2014 Elsevier Ltd. All rights reserved.

  20. A hybrid approach of gene sets and single genes for the prediction of survival risks with gene expression data.

    PubMed

    Seok, Junhee; Davis, Ronald W; Xiao, Wenzhong

    2015-01-01

    Accumulated biological knowledge is often encoded as gene sets, collections of genes associated with similar biological functions or pathways. The use of gene sets in the analyses of high-throughput gene expression data has been intensively studied and applied in clinical research. However, the main interest remains in finding modules of biological knowledge, or corresponding gene sets, significantly associated with disease conditions. Risk prediction from censored survival times using gene sets hasn't been well studied. In this work, we propose a hybrid method that uses both single gene and gene set information together to predict patient survival risks from gene expression profiles. In the proposed method, gene sets provide context-level information that is poorly reflected by single genes. Complementarily, single genes help to supplement incomplete information of gene sets due to our imperfect biomedical knowledge. Through the tests over multiple data sets of cancer and trauma injury, the proposed method showed robust and improved performance compared with the conventional approaches with only single genes or gene sets solely. Additionally, we examined the prediction result in the trauma injury data, and showed that the modules of biological knowledge used in the prediction by the proposed method were highly interpretable in biology. A wide range of survival prediction problems in clinical genomics is expected to benefit from the use of biological knowledge.

  1. A Hybrid Approach of Gene Sets and Single Genes for the Prediction of Survival Risks with Gene Expression Data

    PubMed Central

    Seok, Junhee; Davis, Ronald W.; Xiao, Wenzhong

    2015-01-01

    Accumulated biological knowledge is often encoded as gene sets, collections of genes associated with similar biological functions or pathways. The use of gene sets in the analyses of high-throughput gene expression data has been intensively studied and applied in clinical research. However, the main interest remains in finding modules of biological knowledge, or corresponding gene sets, significantly associated with disease conditions. Risk prediction from censored survival times using gene sets hasn’t been well studied. In this work, we propose a hybrid method that uses both single gene and gene set information together to predict patient survival risks from gene expression profiles. In the proposed method, gene sets provide context-level information that is poorly reflected by single genes. Complementarily, single genes help to supplement incomplete information of gene sets due to our imperfect biomedical knowledge. Through the tests over multiple data sets of cancer and trauma injury, the proposed method showed robust and improved performance compared with the conventional approaches with only single genes or gene sets solely. Additionally, we examined the prediction result in the trauma injury data, and showed that the modules of biological knowledge used in the prediction by the proposed method were highly interpretable in biology. A wide range of survival prediction problems in clinical genomics is expected to benefit from the use of biological knowledge. PMID:25933378

  2. The role of renin-angiotensin-aldosterone system genes in the progression of chronic kidney disease: findings from the Chronic Renal Insufficiency Cohort (CRIC) study.

    PubMed

    Kelly, Tanika N; Raj, Dominic; Rahman, Mahboob; Kretzler, Matthias; Kallem, Radhakrishna R; Ricardo, Ana C; Rosas, Sylvia E; Tao, Kaixiang; Xie, Dawei; Hamm, Lotuce Lee; He, Jiang

    2015-10-01

    We conducted single-marker, gene- and pathway-based analyses to examine the association between renin-angiotensin-aldosterone system (RAAS) variants and chronic kidney disease (CKD) progression among Chronic Renal Insufficiency Cohort study participants. A total of 1523 white and 1490 black subjects were genotyped for 490 single nucleotide polymorphisms (SNPs) in 12 RAAS genes as part of the ITMAT-Broad-CARe array. CKD progression phenotypes included decline in estimated glomerular filtration rate (eGFR) over time and the occurrence of a renal disease event, defined as incident end-stage renal disease or halving of eGFR from baseline. Mixed-effects models were used to examine SNP associations with eGFR decline, while Cox proportional hazards models tested SNP associations with renal events. Gene- and pathway-based analyses were conducted using the truncated product method. All analyses were stratified by race, and a Bonferroni correction was applied to adjust for multiple testing. Among white and black participants, eGFR declined an average of 1.2 and 2.3 mL/min/1.73 m(2)/year, respectively, while renal events occurred in a respective 11.5 and 24.9% of participants. We identified strong gene- and pathway-based associations with CKD progression. The AGT and RENBP genes were consistently associated with risk of renal events in separate analyses of white and black participants (both P < 1.00 × 10(-6)). Driven by the significant gene-based findings, the entire RAAS pathway was also associated with renal events in both groups (both P < 1.00 × 10(-6)). No single-marker associations with CKD progression were observed. The current study provides strong evidence for a role of the RAAS in CKD progression. © The Author 2015. Published by Oxford University Press on behalf of ERA-EDTA. All rights reserved.

  3. The role of renin–angiotensin–aldosterone system genes in the progression of chronic kidney disease: findings from the Chronic Renal Insufficiency Cohort (CRIC) study

    PubMed Central

    Kelly, Tanika N.; Raj, Dominic; Rahman, Mahboob; Kretzler, Matthias; Kallem, Radhakrishna R.; Ricardo, Ana C.; Rosas, Sylvia E.; Tao, Kaixiang; Xie, Dawei; Hamm, Lotuce Lee; He, Jiang; Appel, J.; Feldman, Harold I.; Go, Alan S.; Kusek, John W.; Lash, James P.; Ojo, Akinlolu; Townsend, Raymond R.

    2015-01-01

    Background We conducted single-marker, gene- and pathway-based analyses to examine the association between renin–angiotensin–aldosterone system (RAAS) variants and chronic kidney disease (CKD) progression among Chronic Renal Insufficiency Cohort study participants. Methods A total of 1523 white and 1490 black subjects were genotyped for 490 single nucleotide polymorphisms (SNPs) in 12 RAAS genes as part of the ITMAT-Broad-CARe array. CKD progression phenotypes included decline in estimated glomerular filtration rate (eGFR) over time and the occurrence of a renal disease event, defined as incident end-stage renal disease or halving of eGFR from baseline. Mixed-effects models were used to examine SNP associations with eGFR decline, while Cox proportional hazards models tested SNP associations with renal events. Gene- and pathway-based analyses were conducted using the truncated product method. All analyses were stratified by race, and a Bonferroni correction was applied to adjust for multiple testing. Results Among white and black participants, eGFR declined an average of 1.2 and 2.3 mL/min/1.73 m2/year, respectively, while renal events occurred in a respective 11.5 and 24.9% of participants. We identified strong gene- and pathway-based associations with CKD progression. The AGT and RENBP genes were consistently associated with risk of renal events in separate analyses of white and black participants (both P < 1.00 × 10−6). Driven by the significant gene-based findings, the entire RAAS pathway was also associated with renal events in both groups (both P < 1.00 × 10−6). No single-marker associations with CKD progression were observed. Conclusions The current study provides strong evidence for a role of the RAAS in CKD progression. PMID:25906781

  4. A survey of the sorghum transcriptome using single-molecule long reads

    DOE PAGES

    Abdel-Ghany, Salah E.; Hamilton, Michael; Jacobi, Jennifer L.; ...

    2016-06-24

    Alternative splicing and alternative polyadenylation (APA) of pre-mRNAs greatly contribute to transcriptome diversity, coding capacity of a genome and gene regulatory mechanisms in eukaryotes. Second-generation sequencing technologies have been extensively used to analyse transcriptomes. However, a major limitation of short-read data is that it is difficult to accurately predict full-length splice isoforms. Here we sequenced the sorghum transcriptome using Pacific Biosciences single-molecule real-time long-read isoform sequencing and developed a pipeline called TAPIS (Transcriptome Analysis Pipeline for Isoform Sequencing) to identify full-length splice isoforms and APA sites. Our analysis reveals transcriptome-wide full-length isoforms at an unprecedented scale with over 11,000 novelmore » splice isoforms. Additionally, we uncover APA ofB11,000 expressed genes and more than 2,100 novel genes. Lastly, these results greatly enhance sorghum gene annotations and aid in studying gene regulation in this important bioenergy crop. The TAPIS pipeline will serve as a useful tool to analyse Iso-Seq data from any organism.« less

  5. A survey of the sorghum transcriptome using single-molecule long reads

    PubMed Central

    Abdel-Ghany, Salah E.; Hamilton, Michael; Jacobi, Jennifer L.; Ngam, Peter; Devitt, Nicholas; Schilkey, Faye; Ben-Hur, Asa; Reddy, Anireddy S. N.

    2016-01-01

    Alternative splicing and alternative polyadenylation (APA) of pre-mRNAs greatly contribute to transcriptome diversity, coding capacity of a genome and gene regulatory mechanisms in eukaryotes. Second-generation sequencing technologies have been extensively used to analyse transcriptomes. However, a major limitation of short-read data is that it is difficult to accurately predict full-length splice isoforms. Here we sequenced the sorghum transcriptome using Pacific Biosciences single-molecule real-time long-read isoform sequencing and developed a pipeline called TAPIS (Transcriptome Analysis Pipeline for Isoform Sequencing) to identify full-length splice isoforms and APA sites. Our analysis reveals transcriptome-wide full-length isoforms at an unprecedented scale with over 11,000 novel splice isoforms. Additionally, we uncover APA of ∼11,000 expressed genes and more than 2,100 novel genes. These results greatly enhance sorghum gene annotations and aid in studying gene regulation in this important bioenergy crop. The TAPIS pipeline will serve as a useful tool to analyse Iso-Seq data from any organism. PMID:27339290

  6. Genome-Wide Linkage and Positional Association Analyses Identify Associations of Novel AFF3 and NTM Genes with Triglycerides: The GenSalt Study

    PubMed Central

    Li, Changwei; Bazzano, Lydia A.L.; Rao, Dabeeru C.; Hixson, James E.; He, Jiang; Gu, Dongfeng; Gu, Charles C.; Shimmin, Lawrence C.; Jaquish, Cashell E.; Schwander, Karen; Liu, De-Pei; Huang, Jianfeng; Lu, Fanghong; Cao, Jie; Chong, Shen; Lu, Xiangfeng; Kelly, Tanika N.

    2016-01-01

    We conducted a genome-wide linkage scan and positional association study to identify genes and variants influencing blood lipid levels among participants of the Genetic Epidemiology Network of Salt-Sensitivity (GenSalt) study. The GenSalt study was conducted among 1906 participants from 633 Han Chinese families. Lipids were measured from overnight fasting blood samples using standard methods. Multipoint quantitative trait genome-wide linkage scans were performed on the high-density lipoprotein, low-density lipoprotein, and log-transformed triglyceride phenotypes. Using dense panels of single nucleotide polymorphisms (SNPs), single-marker and gene-based association analyses were conducted to follow-up on promising linkage signals. Additive associations between each SNP and lipid phenotypes were tested using mixed linear regression models. Gene-based analyses were performed by combining P-values from single-marker analyses within each gene using the truncated product method (TPM). Significant associations were assessed for replication among 777 Asian participants of the Multi-ethnic Study of Atherosclerosis (MESA). Bonferroni correction was used to adjust for multiple testing. In the GenSalt study, suggestive linkage signals were identified at 2p11.2–2q12.1 [maximum multipoint LOD score (MML) = 2.18 at 2q11.2] and 11q24.3–11q25 (MML = 2.29 at 11q25) for the log-transformed triglyceride phenotype. Follow-up analyses of these two regions revealed gene-based associations of charged multivesicular body protein 3 (CHMP3), ring finger protein 103 (RNF103), AF4/FMR2 family, member 3 (AFF3), and neurotrimin (NTM ) with triglycerides (P = 4 × 10−4, 1.00 × 10−5, 2.00 × 10−5, and 1.00 × 10−7, respectively). Both the AFF3 and NTM triglyceride associations were replicated among MESA study participants (P = 1.00 × 10−7 and 8.00 × 10−5, respectively). Furthermore, NTM explained the linkage signal on chromosome 11. In conclusion, we identified novel genes associated with lipid phenotypes in linkage regions on chromosomes 2 and 11. PMID:25819087

  7. Association analyses of vitamin D-binding protein gene with compression strength index variation in Caucasian nuclear families.

    PubMed

    Xu, X-H; Xiong, D-H; Liu, X-G; Guo, Y; Chen, Y; Zhao, J; Recker, R R; Deng, H-W

    2010-01-01

    This study was conducted to test whether there exists an association between vitamin D-binding protein (DBP) gene and compression strength index (CSI) phenotype. Candidate gene association analyses were conducted in total sample, male subgroup, and female subgroup, respectively. Two single-nucleotide polymorphisms (SNPs) with significant association results were found in males, suggesting the importance of DBP gene polymorphisms on the variation in CSI especially in Caucasian males. CSI of the femoral neck (FN) is a newly developed phenotype integrating information about bone size, body size, and bone mineral density. It is considered to have the potential to improve the performance of risk assessment for hip fractures because it is based on a combination of phenotypic traits influencing hip fractures rather than a single trait. CSI is under moderate genetic determination (with a heritability of approximately 44% found in this study), but the relevant genetic study is still rather scarce. Based on the known physiological role of DBP in bone biology and the relatively high heritability of CSI, we tested 12 SNPs of the DBP gene for association with CSI variation in 405 Caucasian nuclear families comprising 1,873 subjects from the Midwestern US. Association analyses were performed in the total sample, male and female subgroups, respectively. Significant associations with CSI were found with two SNPs (rs222029, P = 0.0019; rs222020, P = 0.0042) for the male subgroup. Haplotype-based association tests corroborated the single-SNP results. Our findings suggest that the DBP gene might be one of the genetic factors influencing CSI phenotype in Caucasians, especially in males.

  8. Developmental switching in Physarum polycephalum: Petri net analysis of single cell trajectories of gene expression indicates responsiveness and genetic plasticity of the Waddington quasipotential landscape

    NASA Astrophysics Data System (ADS)

    Werthmann, Britta; Marwan, Wolfgang

    2017-11-01

    The developmental switch to sporulation in Physarum polycephalum is a phytochrome-mediated far-red light-induced cell fate decision that synchronously encompasses the entire multinucleate plasmodial cell and is associated with extensive reprogramming of the transcriptome. By repeatedly taking samples of single cells after delivery of a light stimulus pulse, we analysed differential gene expression in two mutant strains and in a heterokaryon of the two strains all of which display a different propensity for making the cell fate decision. Multidimensional scaling of the gene expression data revealed individually different single cell trajectories eventually leading to sporulation. Characterization of the trajectories as walks through states of gene expression discretized by hierarchical clustering allowed the reconstruction of Petri nets that model and predict the observed behavior. Structural analyses of the Petri nets indicated stimulus- and genotype-dependence of both, single cell trajectories and of the quasipotential landscape through which these trajectories are taken. The Petri net-based approach to the analysis and decomposition of complex cellular responses and of complex mutant phenotypes may provide a scaffold for the data-driven reconstruction of causal molecular mechanisms that shape the topology of the quasipotential landscape.

  9. Multi-Scale Modeling to Improve Single-Molecule, Single-Cell Experiments

    NASA Astrophysics Data System (ADS)

    Munsky, Brian; Shepherd, Douglas

    2014-03-01

    Single-cell, single-molecule experiments are producing an unprecedented amount of data to capture the dynamics of biological systems. When integrated with computational models, observations of spatial, temporal and stochastic fluctuations can yield powerful quantitative insight. We concentrate on experiments that localize and count individual molecules of mRNA. These high precision experiments have large imaging and computational processing costs, and we explore how improved computational analyses can dramatically reduce overall data requirements. In particular, we show how analyses of spatial, temporal and stochastic fluctuations can significantly enhance parameter estimation results for small, noisy data sets. We also show how full probability distribution analyses can constrain parameters with far less data than bulk analyses or statistical moment closures. Finally, we discuss how a systematic modeling progression from simple to more complex analyses can reduce total computational costs by orders of magnitude. We illustrate our approach using single-molecule, spatial mRNA measurements of Interleukin 1-alpha mRNA induction in human THP1 cells following stimulation. Our approach could improve the effectiveness of single-molecule gene regulation analyses for many other process.

  10. Comparative Chloroplast Genomes of Photosynthetic Orchids: Insights into Evolution of the Orchidaceae and Development of Molecular Markers for Phylogenetic Applications

    PubMed Central

    Niu, Zhi-Tao; Liu, Wei; Xue, Qing-Yun; Ding, Xiao-Yu

    2014-01-01

    The orchid family Orchidaceae is one of the largest angiosperm families, including many species of important economic value. While chloroplast genomes are very informative for systematics and species identification, there is very limited information available on chloroplast genomes in the Orchidaceae. Here, we report the complete chloroplast genomes of the medicinal plant Dendrobium officinale and the ornamental orchid Cypripedium macranthos, demonstrating their gene content and order and potential RNA editing sites. The chloroplast genomes of the above two species and five known photosynthetic orchids showed similarities in structure as well as gene order and content, but differences in the organization of the inverted repeat/small single-copy junction and ndh genes. The organization of the inverted repeat/small single-copy junctions in the chloroplast genomes of these orchids was classified into four types; we propose that inverted repeats flanking the small single-copy region underwent expansion or contraction among Orchidaceae. The AT-rich regions of the ycf1 gene in orchids could be linked to the recombination of inverted repeat/small single-copy junctions. Relative species in orchids displayed similar patterns of variation in ndh gene contents. Furthermore, fifteen highly divergent protein-coding genes were identified, which are useful for phylogenetic analyses in orchids. To test the efficiency of these genes serving as markers in phylogenetic analyses, coding regions of four genes (accD, ccsA, matK, and ycf1) were used as a case study to construct phylogenetic trees in the subfamily Epidendroideae. High support was obtained for placement of previously unlocated subtribes Collabiinae and Dendrobiinae in the subfamily Epidendroideae. Our findings expand understanding of the diversity of orchid chloroplast genomes and provide a reference for study of the molecular systematics of this family. PMID:24911363

  11. Comparative chloroplast genomes of photosynthetic orchids: insights into evolution of the Orchidaceae and development of molecular markers for phylogenetic applications.

    PubMed

    Luo, Jing; Hou, Bei-Wei; Niu, Zhi-Tao; Liu, Wei; Xue, Qing-Yun; Ding, Xiao-Yu

    2014-01-01

    The orchid family Orchidaceae is one of the largest angiosperm families, including many species of important economic value. While chloroplast genomes are very informative for systematics and species identification, there is very limited information available on chloroplast genomes in the Orchidaceae. Here, we report the complete chloroplast genomes of the medicinal plant Dendrobium officinale and the ornamental orchid Cypripedium macranthos, demonstrating their gene content and order and potential RNA editing sites. The chloroplast genomes of the above two species and five known photosynthetic orchids showed similarities in structure as well as gene order and content, but differences in the organization of the inverted repeat/small single-copy junction and ndh genes. The organization of the inverted repeat/small single-copy junctions in the chloroplast genomes of these orchids was classified into four types; we propose that inverted repeats flanking the small single-copy region underwent expansion or contraction among Orchidaceae. The AT-rich regions of the ycf1 gene in orchids could be linked to the recombination of inverted repeat/small single-copy junctions. Relative species in orchids displayed similar patterns of variation in ndh gene contents. Furthermore, fifteen highly divergent protein-coding genes were identified, which are useful for phylogenetic analyses in orchids. To test the efficiency of these genes serving as markers in phylogenetic analyses, coding regions of four genes (accD, ccsA, matK, and ycf1) were used as a case study to construct phylogenetic trees in the subfamily Epidendroideae. High support was obtained for placement of previously unlocated subtribes Collabiinae and Dendrobiinae in the subfamily Epidendroideae. Our findings expand understanding of the diversity of orchid chloroplast genomes and provide a reference for study of the molecular systematics of this family.

  12. Feasibility of a workflow for the molecular characterization of single cells by next generation sequencing.

    PubMed

    Salvianti, Francesca; Rotunno, Giada; Galardi, Francesca; De Luca, Francesca; Pestrin, Marta; Vannucchi, Alessandro Maria; Di Leo, Angelo; Pazzagli, Mario; Pinzani, Pamela

    2015-09-01

    The purpose of the study was to explore the feasibility of a protocol for the isolation and molecular characterization of single circulating tumor cells (CTCs) from cancer patients using a single-cell next generation sequencing (NGS) approach. To reach this goal we used as a model an artificial sample obtained by spiking a breast cancer cell line (MDA-MB-231) into the blood of a healthy donor. Tumor cells were enriched and enumerated by CellSearch(®) and subsequently isolated by DEPArray™ to obtain single or pooled pure samples to be submitted to the analysis of the mutational status of multiple genes involved in cancer. Upon whole genome amplification, samples were analysed by NGS on the Ion Torrent PGM™ system (Life Technologies) using the Ion AmpliSeq™ Cancer Hotspot Panel v2 (Life Technologies), designed to investigate genomic "hot spot" regions of 50 oncogenes and tumor suppressor genes. We successfully sequenced five single cells, a pool of 5 cells and DNA from a cellular pellet of the same cell line with a mean depth of the sequencing reaction ranging from 1581 to 3479 reads. We found 27 sequence variants in 18 genes, 15 of which already reported in the COSMIC or dbSNP databases. We confirmed the presence of two somatic mutations, in the BRAF and TP53 gene, which had been already reported for this cells line, but also found new mutations and single nucleotide polymorphisms. Three variants were common to all the analysed samples, while 18 were present only in a single cell suggesting a high heterogeneity within the same cell line. This paper presents an optimized workflow for the molecular characterization of multiple genes in single cells by NGS. The described pipeline can be easily transferred to the study of single CTCs from oncologic patients.

  13. Understanding development and stem cells using single cell-based analyses of gene expression

    PubMed Central

    Kumar, Pavithra; Tan, Yuqi

    2017-01-01

    In recent years, genome-wide profiling approaches have begun to uncover the molecular programs that drive developmental processes. In particular, technical advances that enable genome-wide profiling of thousands of individual cells have provided the tantalizing prospect of cataloging cell type diversity and developmental dynamics in a quantitative and comprehensive manner. Here, we review how single-cell RNA sequencing has provided key insights into mammalian developmental and stem cell biology, emphasizing the analytical approaches that are specific to studying gene expression in single cells. PMID:28049689

  14. TRACING CO-REGULATORY NETWORK DYNAMICS IN NOISY, SINGLE-CELL TRANSCRIPTOME TRAJECTORIES.

    PubMed

    Cordero, Pablo; Stuart, Joshua M

    2017-01-01

    The availability of gene expression data at the single cell level makes it possible to probe the molecular underpinnings of complex biological processes such as differentiation and oncogenesis. Promising new methods have emerged for reconstructing a progression 'trajectory' from static single-cell transcriptome measurements. However, it remains unclear how to adequately model the appreciable level of noise in these data to elucidate gene regulatory network rewiring. Here, we present a framework called Single Cell Inference of MorphIng Trajectories and their Associated Regulation (SCIMITAR) that infers progressions from static single-cell transcriptomes by employing a continuous parametrization of Gaussian mixtures in high-dimensional curves. SCIMITAR yields rich models from the data that highlight genes with expression and co-expression patterns that are associated with the inferred progression. Further, SCIMITAR extracts regulatory states from the implicated trajectory-evolvingco-expression networks. We benchmark the method on simulated data to show that it yields accurate cell ordering and gene network inferences. Applied to the interpretation of a single-cell human fetal neuron dataset, SCIMITAR finds progression-associated genes in cornerstone neural differentiation pathways missed by standard differential expression tests. Finally, by leveraging the rewiring of gene-gene co-expression relations across the progression, the method reveals the rise and fall of co-regulatory states and trajectory-dependent gene modules. These analyses implicate new transcription factors in neural differentiation including putative co-factors for the multi-functional NFAT pathway.

  15. Association analyses of vitamin D-binding protein gene with compression strength index variation in Caucasian nuclear families

    PubMed Central

    Xu, X.-H.; Xiong, D.-H.; Liu, X.-G.; Guo, Y.; Chen, Y.; Zhao, J.; Recker, R. R.; Deng, H.-W.

    2010-01-01

    Summary This study was conducted to test whether there exists an association between vitamin D-binding protein (DBP) gene and compression strength index (CSI) phenotype. Candidate gene association analyses were conducted in total sample, male subgroup, and female subgroup, respectively. Two single-nucleotide polymorphisms (SNPs) with significant association results were found in males, suggesting the importance of DBP gene polymorphisms on the variation in CSI especially in Caucasian males. Introduction CSI of the femoral neck (FN) is a newly developed phenotype integrating information about bone size, body size, and bone mineral density. It is considered to have the potential to improve the performance of risk assessment for hip fractures because it is based on a combination of phenotypic traits influencing hip fractures rather than a single trait. CSI is under moderate genetic determination (with a heritability of ~44% found in this study), but the relevant genetic study is still rather scarce. Methods Based on the known physiological role of DBP in bone biology and the relatively high heritability of CSI, we tested 12 SNPs of the DBP gene for association with CSI variation in 405 Caucasian nuclear families comprising 1,873 subjects from the Midwestern US. Association analyses were performed in the total sample, male and female subgroups, respectively. Results Significant associations with CSI were found with two SNPs (rs222029, P=0.0019; rs222020, P=0.0042) for the male subgroup. Haplotype-based association tests corroborated the single-SNP results. Conclusions Our findings suggest that the DBP gene might be one of the genetic factors influencing CSI phenotype in Caucasians, especially in males. PMID:19543766

  16. The Phylogeny of Rickettsia Using Different Evolutionary Signatures: How Tree-Like is Bacterial Evolution?

    PubMed Central

    Murray, Gemma G. R.; Weinert, Lucy A.; Rhule, Emma L.; Welch, John J.

    2016-01-01

    Rickettsia is a genus of intracellular bacteria whose hosts and transmission strategies are both impressively diverse, and this is reflected in a highly dynamic genome. Some previous studies have described the evolutionary history of Rickettsia as non-tree-like, due to incongruity between phylogenetic reconstructions using different portions of the genome. Here, we reconstruct the Rickettsia phylogeny using whole-genome data, including two new genomes from previously unsampled host groups. We find that a single topology, which is supported by multiple sources of phylogenetic signal, well describes the evolutionary history of the core genome. We do observe extensive incongruence between individual gene trees, but analyses of simulations over a single topology and interspersed partitions of sites show that this is more plausibly attributed to systematic error than to horizontal gene transfer. Some conflicting placements also result from phylogenetic analyses of accessory genome content (i.e., gene presence/absence), but we argue that these are also due to systematic error, stemming from convergent genome reduction, which cannot be accommodated by existing phylogenetic methods. Our results show that, even within a single genus, tests for gene exchange based on phylogenetic incongruence may be susceptible to false positives. PMID:26559010

  17. Robust and Comprehensive Analysis of 20 Osteoporosis Candidate Genes by Very High-Density Single-Nucleotide Polymorphism Screen Among 405 White Nuclear Families Identified Significant Association and Gene–Gene Interaction

    PubMed Central

    Xiong, Dong-Hai; Shen, Hui; Zhao, Lan-Juan; Xiao, Peng; Yang, Tie-Lin; Guo, Yan; Wang, Wei; Guo, Yan-Fang; Liu, Yong-Jun; Recker, Robert R; Deng, Hong-Wen

    2007-01-01

    Many “novel” osteoporosis candidate genes have been proposed in recent years. To advance our knowledge of their roles in osteoporosis, we screened 20 such genes using a set of high-density SNPs in a large family-based study. Our efforts led to the prioritization of those osteoporosis genes and the detection of gene–gene interactions. Introduction We performed large-scale family-based association analyses of 20 novel osteoporosis candidate genes using 277 single nucleotide polymorphisms (SNPs) for the quantitative trait BMD variation and the qualitative trait osteoporosis (OP) at three clinically important skeletal sites: spine, hip, and ultradistal radius (UD). Materials and Methods One thousand eight hundred seventy-three subjects from 405 white nuclear families were genotyped and analyzed with an average density of one SNP per 4 kb across the 20 genes. We conducted association analyses by SNP- and haplotype-based family-based association test (FBAT) and performed gene–gene interaction analyses using multianalytic approaches such as multifactor-dimensionality reduction (MDR) and conditional logistic regression. Results and Conclusions We detected four genes (DBP, LRP5, CYP17, and RANK) that showed highly suggestive associations (10,000-permutation derived empirical global p ≤ 0.01) with spine BMD/OP; four genes (CYP19, RANK, RANKL, and CYP17) highly suggestive for hip BMD/OP; and four genes (CYP19, BMP2, RANK, and TNFR2) highly suggestive for UD BMD/OP. The associations between BMP2 with UD BMD and those between RANK with OP at the spine, hip, and UD also met the experiment-wide stringent criterion (empirical global p ≤ 0.0007). Sex-stratified analyses further showed that some of the significant associations in the total sample were driven by either male or female subjects. In addition, we identified and validated a two-locus gene–gene interaction model involving GCR and ESR2, for which prior biological evidence exists. Our results suggested the prioritization of osteoporosis candidate genes from among the many proposed in recent years and revealed the significant gene–gene interaction effects influencing osteoporosis risk. PMID:17002564

  18. Using the gene ontology to scan multilevel gene sets for associations in genome wide association studies.

    PubMed

    Schaid, Daniel J; Sinnwell, Jason P; Jenkins, Gregory D; McDonnell, Shannon K; Ingle, James N; Kubo, Michiaki; Goss, Paul E; Costantino, Joseph P; Wickerham, D Lawrence; Weinshilboum, Richard M

    2012-01-01

    Gene-set analyses have been widely used in gene expression studies, and some of the developed methods have been extended to genome wide association studies (GWAS). Yet, complications due to linkage disequilibrium (LD) among single nucleotide polymorphisms (SNPs), and variable numbers of SNPs per gene and genes per gene-set, have plagued current approaches, often leading to ad hoc "fixes." To overcome some of the current limitations, we developed a general approach to scan GWAS SNP data for both gene-level and gene-set analyses, building on score statistics for generalized linear models, and taking advantage of the directed acyclic graph structure of the gene ontology when creating gene-sets. However, other types of gene-set structures can be used, such as the popular Kyoto Encyclopedia of Genes and Genomes (KEGG). Our approach combines SNPs into genes, and genes into gene-sets, but assures that positive and negative effects of genes on a trait do not cancel. To control for multiple testing of many gene-sets, we use an efficient computational strategy that accounts for LD and provides accurate step-down adjusted P-values for each gene-set. Application of our methods to two different GWAS provide guidance on the potential strengths and weaknesses of our proposed gene-set analyses. © 2011 Wiley Periodicals, Inc.

  19. Evidence for a large expansion and subfunctionalisation of globin genes in sea anemones.

    PubMed

    Smith, Hayden L; Pavasovic, Ana; Surm, Joachim M; Phillips, Matthew J; Prentis, Peter J

    2018-06-27

    The globin gene superfamily has been well-characterised in vertebrates, however, there has been limited research in early-diverging lineages, such as phylum Cnidaria. This study aimed to identify globin genes in multiple cnidarian lineages, and use bioinformatic approaches to characterise the evolution, structure and expression of these genes. Phylogenetic analyses and in silico protein predictions showed that all cnidarians have undergone an expansion of globin genes, which likely have a hexacoordinate protein structure. Our protein modelling has also revealed the possibility of a single pentacoordinate globin lineage in anthozoan species. Some cnidarian globin genes displayed tissue and development specific expression with very few orthologous genes similarly expressed across species. Our phylogenetic analyses also revealed that eumetazoan globin genes form a polyphyletic relationship with vertebrate globin genes. Overall, our analyses suggest that a Ngb-like and GbX-like gene were most likely present in the globin gene repertoire for the last common ancestor of eumetazoans. The identification of a large-scale expansion and subfunctionalisation of globin genes in actiniarians provides an excellent starting point to further our understanding of the evolution and function of the globin gene superfamily in early-diverging lineages.

  20. Gene expression profiling of single cells on large-scale oligonucleotide arrays

    PubMed Central

    Hartmann, Claudia H.; Klein, Christoph A.

    2006-01-01

    Over the last decade, important insights into the regulation of cellular responses to various stimuli were gained by global gene expression analyses of cell populations. More recently, specific cell functions and underlying regulatory networks of rare cells isolated from their natural environment moved to the center of attention. However, low cell numbers still hinder gene expression profiling of rare ex vivo material in biomedical research. Therefore, we developed a robust method for gene expression profiling of single cells on high-density oligonucleotide arrays with excellent coverage of low abundance transcripts. The protocol was extensively tested with freshly isolated single cells of very low mRNA content including single epithelial, mature and immature dendritic cells and hematopoietic stem cells. Quantitative PCR confirmed that the PCR-based global amplification method did not change the relative ratios of transcript abundance and unsupervised hierarchical cluster analysis revealed that the histogenetic origin of an individual cell is correctly reflected by the gene expression profile. Moreover, the gene expression data from dendritic cells demonstrate that cellular differentiation and pathway activation can be monitored in individual cells. PMID:17071717

  1. Genetic mapping of the rice resistance-breaking gene of the brown planthopper Nilaparvata lugens

    PubMed Central

    Kobayashi, Tetsuya; Yamamoto, Kimiko; Suetsugu, Yoshitaka; Kuwazaki, Seigo; Hattori, Makoto; Jairin, Jirapong; Sanada-Morimura, Sachiyo; Matsumura, Masaya

    2014-01-01

    Host plant resistance has been widely used for controlling the major rice pest brown planthopper (BPH, Nilaparvata lugens). However, adaptation of the wild BPH population to resistance limits the effective use of resistant rice varieties. Quantitative trait locus (QTL) analysis was conducted to identify resistance-breaking genes against the anti-feeding mechanism mediated by the rice resistance gene Bph1. QTL analysis in iso-female BPH lines with single-nucleotide polymorphism (SNP) markers detected a single region on the 10th linkage group responsible for the virulence. The QTL explained from 57 to 84% of the total phenotypic variation. Bulked segregant analysis with next-generation sequencing in F2 progenies identified five SNPs genetically linked to the virulence. These analyses showed that virulence to Bph1 was controlled by a single recessive gene. In contrast to previous studies, the gene-for-gene relationship between the major resistance gene Bph1 and virulence gene of BPH was confirmed. Identified markers are available for map-based cloning of the major gene controlling BPH virulence to rice resistance. PMID:24870048

  2. Monitoring the Single-Cell Stress Response of the Diatom Thalassiosira pseudonana by Quantitative Real-Time Reverse Transcription-PCR

    PubMed Central

    Shi, Xu; Gao, Weimin; Chao, Shih-hui

    2013-01-01

    Directly monitoring the stress response of microbes to their environments could be one way to inspect the health of microorganisms themselves, as well as the environments in which the microorganisms live. The ultimate resolution for such an endeavor could be down to a single-cell level. In this study, using the diatom Thalassiosira pseudonana as a model species, we aimed to measure gene expression responses of this organism to various stresses at a single-cell level. We developed a single-cell quantitative real-time reverse transcription-PCR (RT-qPCR) protocol and applied it to determine the expression levels of multiple selected genes under nitrogen, phosphate, and iron depletion stress conditions. The results, for the first time, provided a quantitative measurement of gene expression at single-cell levels in T. pseudonana and demonstrated that significant gene expression heterogeneity was present within the cell population. In addition, different expression patterns between single-cell- and bulk-cell-based analyses were also observed for all genes assayed in this study, suggesting that cell response heterogeneity needs to be taken into consideration in order to obtain accurate information that indicates the environmental stress condition. PMID:23315741

  3. Monitoring the single-cell stress response of the diatom Thalassiosira pseudonana by quantitative real-time reverse transcription-PCR.

    PubMed

    Shi, Xu; Gao, Weimin; Chao, Shih-hui; Zhang, Weiwen; Meldrum, Deirdre R

    2013-03-01

    Directly monitoring the stress response of microbes to their environments could be one way to inspect the health of microorganisms themselves, as well as the environments in which the microorganisms live. The ultimate resolution for such an endeavor could be down to a single-cell level. In this study, using the diatom Thalassiosira pseudonana as a model species, we aimed to measure gene expression responses of this organism to various stresses at a single-cell level. We developed a single-cell quantitative real-time reverse transcription-PCR (RT-qPCR) protocol and applied it to determine the expression levels of multiple selected genes under nitrogen, phosphate, and iron depletion stress conditions. The results, for the first time, provided a quantitative measurement of gene expression at single-cell levels in T. pseudonana and demonstrated that significant gene expression heterogeneity was present within the cell population. In addition, different expression patterns between single-cell- and bulk-cell-based analyses were also observed for all genes assayed in this study, suggesting that cell response heterogeneity needs to be taken into consideration in order to obtain accurate information that indicates the environmental stress condition.

  4. Single-trait and multi-trait genome-wide association analyses identify novel loci for blood pressure in African-ancestry populations

    PubMed Central

    Liang, Jingjing; Le, Thu H.; Edwards, Digna R. Velez; Tayo, Bamidele O.; Gaulton, Kyle J.; Lu, Yingchang; Jensen, Richard A.; Chen, Guanjie; Schwander, Karen; McKenzie, Colin A.; Fox, Ervin; Nalls, Michael A.; Young, J. Hunter; Lane, Jacqueline M.; Zhou, Jie; Tang, Hua; Fornage, Myriam; Musani, Solomon K.; Wang, Heming; Forrester, Terrence; Chu, Pei-Lun; Evans, Michele K.; Morrison, Alanna C.; Martin, Lisa W.; Wiggins, Kerri L.; Hui, Qin; Zhao, Wei; Jackson, Rebecca D.; Faul, Jessica D.; Reiner, Alex P.; Bray, Michael; Denny, Joshua C.; Mosley, Thomas H.; Palmas, Walter; Guo, Xiuqing; Polak, Joseph F.; Taylor, Ken D.; Boerwinkle, Eric; Bottinger, Erwin P.; Liu, Kiang; Risch, Neil; Hunt, Steven C.; Kooperberg, Charles; Zonderman, Alan B.; Becker, Diane M.; Cai, Jianwen; Loos, Ruth J. F.; Psaty, Bruce M.; Weir, David R.; Kardia, Sharon L. R.; Arnett, Donna K.; Won, Sungho; Edwards, Todd L.; Redline, Susan; Cooper, Richard S.; Rao, D. C.; Rotimi, Charles; Levy, Daniel; Chakravarti, Aravinda

    2017-01-01

    Hypertension is a leading cause of global disease, mortality, and disability. While individuals of African descent suffer a disproportionate burden of hypertension and its complications, they have been underrepresented in genetic studies. To identify novel susceptibility loci for blood pressure and hypertension in people of African ancestry, we performed both single and multiple-trait genome-wide association analyses. We analyzed 21 genome-wide association studies comprised of 31,968 individuals of African ancestry, and validated our results with additional 54,395 individuals from multi-ethnic studies. These analyses identified nine loci with eleven independent variants which reached genome-wide significance (P < 1.25×10−8) for either systolic and diastolic blood pressure, hypertension, or for combined traits. Single-trait analyses identified two loci (TARID/TCF21 and LLPH/TMBIM4) and multiple-trait analyses identified one novel locus (FRMD3) for blood pressure. At these three loci, as well as at GRP20/CDH17, associated variants had alleles common only in African-ancestry populations. Functional annotation showed enrichment for genes expressed in immune and kidney cells, as well as in heart and vascular cells/tissues. Experiments driven by these findings and using angiotensin-II induced hypertension in mice showed altered kidney mRNA expression of six genes, suggesting their potential role in hypertension. Our study provides new evidence for genes related to hypertension susceptibility, and the need to study African-ancestry populations in order to identify biologic factors contributing to hypertension. PMID:28498854

  5. Rare Cell Detection by Single-Cell RNA Sequencing as Guided by Single-Molecule RNA FISH.

    PubMed

    Torre, Eduardo; Dueck, Hannah; Shaffer, Sydney; Gospocic, Janko; Gupte, Rohit; Bonasio, Roberto; Kim, Junhyong; Murray, John; Raj, Arjun

    2018-02-28

    Although single-cell RNA sequencing can reliably detect large-scale transcriptional programs, it is unclear whether it accurately captures the behavior of individual genes, especially those that express only in rare cells. Here, we use single-molecule RNA fluorescence in situ hybridization as a gold standard to assess trade-offs in single-cell RNA-sequencing data for detecting rare cell expression variability. We quantified the gene expression distribution for 26 genes that range from ubiquitous to rarely expressed and found that the correspondence between estimates across platforms improved with both transcriptome coverage and increased number of cells analyzed. Further, by characterizing the trade-off between transcriptome coverage and number of cells analyzed, we show that when the number of genes required to answer a given biological question is small, then greater transcriptome coverage is more important than analyzing large numbers of cells. More generally, our report provides guidelines for selecting quality thresholds for single-cell RNA-sequencing experiments aimed at rare cell analyses. Copyright © 2018 Elsevier Inc. All rights reserved.

  6. Ancient Duplications and Expression Divergence in the Globin Gene Superfamily of Vertebrates: Insights from the Elephant Shark Genome and Transcriptome

    PubMed Central

    Opazo, Juan C.; Toloza-Villalobos, Jessica; Burmester, Thorsten; Venkatesh, Byrappa; Storz, Jay F.

    2015-01-01

    Comparative analyses of vertebrate genomes continue to uncover a surprising diversity of genes in the globin gene superfamily, some of which have very restricted phyletic distributions despite their antiquity. Genomic analysis of the globin gene repertoire of cartilaginous fish (Chondrichthyes) should be especially informative about the duplicative origins and ancestral functions of vertebrate globins, as divergence between Chondrichthyes and bony vertebrates represents the most basal split within the jawed vertebrates. Here, we report a comparative genomic analysis of the vertebrate globin gene family that includes the complete globin gene repertoire of the elephant shark (Callorhinchus milii). Using genomic sequence data from representatives of all major vertebrate classes, integrated analyses of conserved synteny and phylogenetic relationships revealed that the last common ancestor of vertebrates possessed a repertoire of at least seven globin genes: single copies of androglobin and neuroglobin, four paralogous copies of globin X, and the single-copy progenitor of the entire set of vertebrate-specific globins. Combined with expression data, the genomic inventory of elephant shark globins yielded four especially surprising findings: 1) there is no trace of the neuroglobin gene (a highly conserved gene that is present in all other jawed vertebrates that have been examined to date), 2) myoglobin is highly expressed in heart, but not in skeletal muscle (reflecting a possible ancestral condition in vertebrates with single-circuit circulatory systems), 3) elephant shark possesses two highly divergent globin X paralogs, one of which is preferentially expressed in gonads, and 4) elephant shark possesses two structurally distinct α-globin paralogs, one of which is preferentially expressed in the brain. Expression profiles of elephant shark globin genes reveal distinct specializations of function relative to orthologs in bony vertebrates and suggest hypotheses about ancestral functions of vertebrate globins. PMID:25743544

  7. Beyond main effects of gene-sets: harsh parenting moderates the association between a dopamine gene-set and child externalizing behavior.

    PubMed

    Windhorst, Dafna A; Mileva-Seitz, Viara R; Rippe, Ralph C A; Tiemeier, Henning; Jaddoe, Vincent W V; Verhulst, Frank C; van IJzendoorn, Marinus H; Bakermans-Kranenburg, Marian J

    2016-08-01

    In a longitudinal cohort study, we investigated the interplay of harsh parenting and genetic variation across a set of functionally related dopamine genes, in association with children's externalizing behavior. This is one of the first studies to employ gene-based and gene-set approaches in tests of Gene by Environment (G × E) effects on complex behavior. This approach can offer an important alternative or complement to candidate gene and genome-wide environmental interaction (GWEI) studies in the search for genetic variation underlying individual differences in behavior. Genetic variants in 12 autosomal dopaminergic genes were available in an ethnically homogenous part of a population-based cohort. Harsh parenting was assessed with maternal (n = 1881) and paternal (n = 1710) reports at age 3. Externalizing behavior was assessed with the Child Behavior Checklist (CBCL) at age 5 (71 ± 3.7 months). We conducted gene-set analyses of the association between variation in dopaminergic genes and externalizing behavior, stratified for harsh parenting. The association was statistically significant or approached significance for children without harsh parenting experiences, but was absent in the group with harsh parenting. Similarly, significant associations between single genes and externalizing behavior were only found in the group without harsh parenting. Effect sizes in the groups with and without harsh parenting did not differ significantly. Gene-environment interaction tests were conducted for individual genetic variants, resulting in two significant interaction effects (rs1497023 and rs4922132) after correction for multiple testing. Our findings are suggestive of G × E interplay, with associations between dopamine genes and externalizing behavior present in children without harsh parenting, but not in children with harsh parenting experiences. Harsh parenting may overrule the role of genetic factors in externalizing behavior. Gene-based and gene-set analyses offer promising new alternatives to analyses focusing on single candidate polymorphisms when examining the interplay between genetic and environmental factors.

  8. An analysis of the sequence of the BAD gene among patients with maturity-onset diabetes of the young (MODY).

    PubMed

    Antosik, Karolina; Gnyś, Piotr; Jarosz-Chobot, Przemysława; Myśliwiec, Małgorzata; Szadkowska, Agnieszka; Małecki, Maciej; Młynarski, Wojciech; Borowiec, Maciej

    2017-01-01

    Monogenic diabetes is a rare disease caused by single gene mutations. Maturity onset diabetes of the young (MODY) is one of the major forms of monogenic diabetes recognised in the paediatric population. To date, 13 genes have been related to MODY development. The aim of the study was to analyse the sequence of the BCL2-associated agonist of cell death (BAD) gene in patients with clinical suspicion of GCK-MODY, but who were negative for glucokinase (GCK) gene mutations. A group of 122 diabetic patients were recruited from the "Polish Registry for Paediatric and Adolescent Diabetes - nationwide genetic screening for monogenic diabetes" project. The molecular testing was performed by Sanger sequencing. A total of 10 sequence variants of the BAD gene were identified in 122 analysed diabetic patients. Among the analysed patients suspected of MODY, one possible pathogenic variant was identified in one patient; however, further confirmation is required for a certain identification.

  9. Genetic polymorphisms in ESR1 and ESR2 genes, and risk of hypospadias in a multiethnic study population.

    PubMed

    Choudhry, Shweta; Baskin, Laurence S; Lammer, Edward J; Witte, John S; Dasgupta, Sudeshna; Ma, Chen; Surampalli, Abhilasha; Shen, Joel; Shaw, Gary M; Carmichael, Suzan L

    2015-05-01

    Estrogenic endocrine disruptors acting via estrogen receptors α (ESR1) and β (ESR2) have been implicated in the etiology of hypospadias, a common congenital malformation of the male external genitalia. We determined the association of single nucleotide polymorphisms in ESR1 and ESR2 genes with hypospadias in a racially/ethnically diverse study population of California births. We investigated the relationship between hypospadias and 108 ESR1 and 36 ESR2 single nucleotide polymorphisms in 647 cases and 877 population based nonmalformed controls among infants born in selected California counties from 1990 to 2003. Subgroup analyses were performed by race/ethnicity (nonHispanic white and Hispanic subjects) and by hypospadias severity (mild to moderate and severe). Odds ratios for 33 of the 108 ESR1 single nucleotide polymorphisms had p values less than 0.05 (p = 0.05 to 0.007) for risk of hypospadias. However, none of the 36 ESR2 single nucleotide polymorphisms was significantly associated. In stratified analyses the association results were consistent by disease severity but different sets of single nucleotide polymorphisms were significantly associated with hypospadias in nonHispanic white and Hispanic subjects. Due to high linkage disequilibrium across the single nucleotide polymorphisms, haplotype analyses were conducted and identified 6 haplotype blocks in ESR1 gene that had haplotypes significantly associated with an increased risk of hypospadias (OR 1.3 to 1.8, p = 0.04 to 0.00001). Similar to single nucleotide polymorphism analysis, different ESR1 haplotypes were associated with risk of hypospadias in nonHispanic white and Hispanic subjects. No significant haplotype association was observed for ESR2. The data provide evidence that ESR1 single nucleotide polymorphisms and haplotypes influence the risk of hypospadias in white and Hispanic subjects, and warrant further examination in other study populations. Copyright © 2015 American Urological Association Education and Research, Inc. Published by Elsevier Inc. All rights reserved.

  10. Insulin‐degrading enzyme is genetically associated with Alzheimer's disease in the Finnish population

    PubMed Central

    Vepsäläinen, Saila; Parkinson, Michele; Helisalmi, Seppo; Mannermaa, Arto; Soininen, Hilkka; Tanzi, Rudolph E; Bertram, Lars; Hiltunen, Mikko

    2007-01-01

    The gene for insulin‐degrading enzyme (IDE), which is located at chromosome 10q24, has been previously proposed as a candidate gene for late‐onset Alzheimer's disease (AD) based on its ability to degrade amyloid β‐protein. Genotyping of single nucleotide polymorphisms (SNPs) in the IDE gene in Finnish patients with AD and controls revealed SNPs rs4646953 and rs4646955 to be associated with AD, conferring an approximately two‐fold increased risk. Single locus findings were corroborated by the results obtained from haplotype analyses. This suggests that genetic alterations in or near the IDE gene may increase the risk for developing AD. PMID:17496198

  11. Understanding development and stem cells using single cell-based analyses of gene expression.

    PubMed

    Kumar, Pavithra; Tan, Yuqi; Cahan, Patrick

    2017-01-01

    In recent years, genome-wide profiling approaches have begun to uncover the molecular programs that drive developmental processes. In particular, technical advances that enable genome-wide profiling of thousands of individual cells have provided the tantalizing prospect of cataloging cell type diversity and developmental dynamics in a quantitative and comprehensive manner. Here, we review how single-cell RNA sequencing has provided key insights into mammalian developmental and stem cell biology, emphasizing the analytical approaches that are specific to studying gene expression in single cells. © 2017. Published by The Company of Biologists Ltd.

  12. Phylogenomic Reconstruction of the Oomycete Phylogeny Derived from 37 Genomes

    PubMed Central

    McCarthy, Charley G. P.

    2017-01-01

    ABSTRACT The oomycetes are a class of microscopic, filamentous eukaryotes within the Stramenopiles-Alveolata-Rhizaria (SAR) supergroup which includes ecologically significant animal and plant pathogens, most infamously the causative agent of potato blight Phytophthora infestans. Single-gene and concatenated phylogenetic studies both of individual oomycete genera and of members of the larger class have resulted in conflicting conclusions concerning species phylogenies within the oomycetes, particularly for the large Phytophthora genus. Genome-scale phylogenetic studies have successfully resolved many eukaryotic relationships by using supertree methods, which combine large numbers of potentially disparate trees to determine evolutionary relationships that cannot be inferred from individual phylogenies alone. With a sufficient amount of genomic data now available, we have undertaken the first whole-genome phylogenetic analysis of the oomycetes using data from 37 oomycete species and 6 SAR species. In our analysis, we used established supertree methods to generate phylogenies from 8,355 homologous oomycete and SAR gene families and have complemented those analyses with both phylogenomic network and concatenated supermatrix analyses. Our results show that a genome-scale approach to oomycete phylogeny resolves oomycete classes and individual clades within the problematic Phytophthora genus. Support for the resolution of the inferred relationships between individual Phytophthora clades varies depending on the methodology used. Our analysis represents an important first step in large-scale phylogenomic analysis of the oomycetes. IMPORTANCE The oomycetes are a class of eukaryotes and include ecologically significant animal and plant pathogens. Single-gene and multigene phylogenetic studies of individual oomycete genera and of members of the larger classes have resulted in conflicting conclusions concerning interspecies relationships among these species, particularly for the Phytophthora genus. The onset of next-generation sequencing techniques now means that a wealth of oomycete genomic data is available. For the first time, we have used genome-scale phylogenetic methods to resolve oomycete phylogenetic relationships. We used supertree methods to generate single-gene and multigene species phylogenies. Overall, our supertree analyses utilized phylogenetic data from 8,355 oomycete gene families. We have also complemented our analyses with superalignment phylogenies derived from 131 single-copy ubiquitous gene families. Our results show that a genome-scale approach to oomycete phylogeny resolves oomycete classes and clades. Our analysis represents an important first step in large-scale phylogenomic analysis of the oomycetes. PMID:28435885

  13. Complete Chloroplast Genome of Pinus massoniana (Pinaceae): Gene Rearrangements, Loss of ndh Genes, and Short Inverted Repeats Contraction, Expansion.

    PubMed

    Ni, ZhouXian; Ye, YouJu; Bai, Tiandao; Xu, Meng; Xu, Li-An

    2017-09-11

    The chloroplast genome (CPG) of Pinus massoniana belonging to the genus Pinus (Pinaceae), which is a primary source of turpentine, was sequenced and analyzed in terms of gene rearrangements, ndh genes loss, and the contraction and expansion of short inverted repeats (IRs). P. massoniana CPG has a typical quadripartite structure that includes large single copy (LSC) (65,563 bp), small single copy (SSC) (53,230 bp) and two IRs (IRa and IRb, 485 bp). The 108 unique genes were identified, including 73 protein-coding genes, 31 tRNAs, and 4 rRNAs. Most of the 81 simple sequence repeats (SSRs) identified in CPG were mononucleotides motifs of A/T types and located in non-coding regions. Comparisons with related species revealed an inversion (21,556 bp) in the LSC region; P. massoniana CPG lacks all 11 intact ndh genes (four ndh genes lost completely; the five remained truncated as pseudogenes; and the other two ndh genes remain as pseudogenes because of short insertions or deletions). A pair of short IRs was found instead of large IRs, and size variations among pine species were observed, which resulted from short insertions or deletions and non-synchronized variations between "IRa" and "IRb". The results of phylogenetic analyses based on whole CPG sequences of 16 conifers indicated that the whole CPG sequences could be used as a powerful tool in phylogenetic analyses.

  14. Discovering genetic variants in Crohn's disease by exploring genomic regions enriched of weak association signals.

    PubMed

    D'Addabbo, Annarita; Palmieri, Orazio; Maglietta, Rosalia; Latiano, Anna; Mukherjee, Sayan; Annese, Vito; Ancona, Nicola

    2011-08-01

    A meta-analysis has re-analysed previous genome-wide association scanning definitively confirming eleven genes and further identifying 21 new loci. However, the identified genes/loci still explain only the minority of genetic predisposition of Crohn's disease. To identify genes weakly involved in disease predisposition by analysing chromosomal regions enriched of single nucleotide polymorphisms with modest statistical association. We utilized the WTCCC data set evaluating 1748 CD and 2938 controls. The identification of candidate genes/loci was performed by a two-step procedure: first of all chromosomal regions enriched of weak association signals were localized; subsequently, weak signals clustered in gene regions were identified. The statistical significance was assessed by non parametric permutation tests. The cytoband enrichment analysis highlighted 44 regions (P≤0.05) enriched with single nucleotide polymorphisms significantly associated with the trait including 23 out of 31 previously confirmed and replicated genes. Importantly, we highlight further 20 novel chromosomal regions carrying approximately one hundred genes/loci with modest association. Amongst these we find compelling functional candidate genes such as MAPT, GRB2 and CREM, LCT, and IL12RB2. Our study suggests a different statistical perspective to discover genes weakly associated with a given trait, although further confirmatory functional studies are needed. Copyright © 2011 Editrice Gastroenterologica Italiana S.r.l. All rights reserved.

  15. Evidence for the involvement of genetic variation in the oxytocin receptor gene (OXTR) in the etiology of autistic disorders on high-functioning level.

    PubMed

    Wermter, Anne-Kathrin; Kamp-Becker, Inge; Hesse, Philipp; Schulte-Körne, Gerd; Strauch, Konstantin; Remschmidt, Helmut

    2010-03-05

    An increasing number of animal studies advert to a substantial role of the neuropeptide oxytocin in the regulation of social attachment and affiliation. Furthermore, animal studies showed anxiety and stress-reduced effects of oxytocin. First human studies confirm these findings in animal studies and implicate a crucial role of oxytocin in human social attachment behavior and in social interactions. Thus, the oxytocin system might be involved in the impairment of social interaction and attachment in autism spectrum disorders (ASD). The human oxytocin receptor gene (OXTR) represents a plausible candidate gene for the etiology of ASD. To analyze whether genetic variants in the OXTR gene are associated with ASD we performed family-based single-marker and haplotype association analyses with 22 single nucleotide polymorphisms (SNPs) in the OXTR and its 5' region in 100 families with autistic disorders on high-functioning level (Asperger syndrome (AS), high-functioning autism (HFA), and atypical autism (AA)). Single-marker and haplotype association analyses revealed nominally significant associations of one single SNP and one haplotype with autism, respectively. Furthermore, employing a "reverse phenotyping" approach, patients carrying the haplotype associated with autism showed nominally significant impairments in comparison to noncarriers of the haplotype in items of the Autism Diagnostic Interview-Revised algorithm describing aspects of social interaction and communication. In conclusion, our results implicate that genetic variation in the OXTR gene might be relevant in the etiology of autism on high-functioning level. (c) 2009 Wiley-Liss, Inc.

  16. SNPGenie: estimating evolutionary parameters to detect natural selection using pooled next-generation sequencing data.

    PubMed

    Nelson, Chase W; Moncla, Louise H; Hughes, Austin L

    2015-11-15

    New applications of next-generation sequencing technologies use pools of DNA from multiple individuals to estimate population genetic parameters. However, no publicly available tools exist to analyse single-nucleotide polymorphism (SNP) calling results directly for evolutionary parameters important in detecting natural selection, including nucleotide diversity and gene diversity. We have developed SNPGenie to fill this gap. The user submits a FASTA reference sequence(s), a Gene Transfer Format (.GTF) file with CDS information and a SNP report(s) in an increasing selection of formats. The program estimates nucleotide diversity, distance from the reference and gene diversity. Sites are flagged for multiple overlapping reading frames, and are categorized by polymorphism type: nonsynonymous, synonymous, or ambiguous. The results allow single nucleotide, single codon, sliding window, whole gene and whole genome/population analyses that aid in the detection of positive and purifying natural selection in the source population. SNPGenie version 1.2 is a Perl program with no additional dependencies. It is free, open-source, and available for download at https://github.com/hugheslab/snpgenie. nelsoncw@email.sc.edu or austin@biol.sc.edu Supplementary data are available at Bioinformatics online. © The Author 2015. Published by Oxford University Press. All rights reserved. For Permissions, please e-mail: journals.permissions@oup.com.

  17. Lessons learned from additional research analyses of unsolved clinical exome cases.

    PubMed

    Eldomery, Mohammad K; Coban-Akdemir, Zeynep; Harel, Tamar; Rosenfeld, Jill A; Gambin, Tomasz; Stray-Pedersen, Asbjørg; Küry, Sébastien; Mercier, Sandra; Lessel, Davor; Denecke, Jonas; Wiszniewski, Wojciech; Penney, Samantha; Liu, Pengfei; Bi, Weimin; Lalani, Seema R; Schaaf, Christian P; Wangler, Michael F; Bacino, Carlos A; Lewis, Richard Alan; Potocki, Lorraine; Graham, Brett H; Belmont, John W; Scaglia, Fernando; Orange, Jordan S; Jhangiani, Shalini N; Chiang, Theodore; Doddapaneni, Harsha; Hu, Jianhong; Muzny, Donna M; Xia, Fan; Beaudet, Arthur L; Boerwinkle, Eric; Eng, Christine M; Plon, Sharon E; Sutton, V Reid; Gibbs, Richard A; Posey, Jennifer E; Yang, Yaping; Lupski, James R

    2017-03-21

    Given the rarity of most single-gene Mendelian disorders, concerted efforts of data exchange between clinical and scientific communities are critical to optimize molecular diagnosis and novel disease gene discovery. We designed and implemented protocols for the study of cases for which a plausible molecular diagnosis was not achieved in a clinical genomics diagnostic laboratory (i.e. unsolved clinical exomes). Such cases were recruited to a research laboratory for further analyses, in order to potentially: (1) accelerate novel disease gene discovery; (2) increase the molecular diagnostic yield of whole exome sequencing (WES); and (3) gain insight into the genetic mechanisms of disease. Pilot project data included 74 families, consisting mostly of parent-offspring trios. Analyses performed on a research basis employed both WES from additional family members and complementary bioinformatics approaches and protocols. Analysis of all possible modes of Mendelian inheritance, focusing on both single nucleotide variants (SNV) and copy number variant (CNV) alleles, yielded a likely contributory variant in 36% (27/74) of cases. If one includes candidate genes with variants identified within a single family, a potential contributory variant was identified in a total of ~51% (38/74) of cases enrolled in this pilot study. The molecular diagnosis was achieved in 30/63 trios (47.6%). Besides this, the analysis workflow yielded evidence for pathogenic variants in disease-associated genes in 4/6 singleton cases (66.6%), 1/1 multiplex family involving three affected siblings, and 3/4 (75%) quartet families. Both the analytical pipeline and the collaborative efforts between the diagnostic and research laboratories provided insights that allowed recent disease gene discoveries (PURA, TANGO2, EMC1, GNB5, ATAD3A, and MIPEP) and increased the number of novel genes, defined in this study as genes identified in more than one family (DHX30 and EBF3). An efficient genomics pipeline in which clinical sequencing in a diagnostic laboratory is followed by the detailed reanalysis of unsolved cases in a research environment, supplemented with WES data from additional family members, and subject to adjuvant bioinformatics analyses including relaxed variant filtering parameters in informatics pipelines, can enhance the molecular diagnostic yield and provide mechanistic insights into Mendelian disorders. Implementing these approaches requires collaborative clinical molecular diagnostic and research efforts.

  18. Whole exome sequence-based association analyses of plasma amyloid-β in African and European Americans; the Atherosclerosis Risk in Communities-Neurocognitive Study.

    PubMed

    Simino, Jeannette; Wang, Zhiying; Bressler, Jan; Chouraki, Vincent; Yang, Qiong; Younkin, Steven G; Seshadri, Sudha; Fornage, Myriam; Boerwinkle, Eric; Mosley, Thomas H

    2017-01-01

    We performed single-variant and gene-based association analyses of plasma amyloid-β (aβ) concentrations using whole exome sequence from 1,414 African and European Americans. Our goal was to identify genes that influence plasma aβ42 concentrations and aβ42:aβ40 ratios in late middle age (mean = 59 years), old age (mean = 77 years), or change over time (mean = 18 years). Plasma aβ measures were linearly regressed onto age, gender, APOE ε4 carrier status, and time elapsed between visits (fold-changes only) separately by race. Following inverse normal transformation of the residuals, seqMeta was used to conduct race-specific single-variant and gene-based association tests while adjusting for population structure. Linear regression models were fit on autosomal variants with minor allele frequencies (MAF)≥1%. T5 burden and Sequence Kernel Association (SKAT) gene-based tests assessed functional variants with MAF≤5%. Cross-race fixed effects meta-analyses were Bonferroni-corrected for the number of variants or genes tested. Seven genes were associated with aβ in late middle age or change over time; no associations were identified in old age. Single variants in KLKB1 (rs3733402; p = 4.33x10-10) and F12 (rs1801020; p = 3.89x10-8) were significantly associated with midlife aβ42 levels through cross-race meta-analysis; the KLKB1 variant replicated internally using 1,014 additional participants with exome chip. ITPRIP, PLIN2, and TSPAN18 were associated with the midlife aβ42:aβ40 ratio via the T5 test; TSPAN18 was significant via the cross-race meta-analysis, whereas ITPRIP and PLIN2 were European American-specific. NCOA1 and NT5C3B were associated with the midlife aβ42:aβ40 ratio and the fold-change in aβ42, respectively, via SKAT in African Americans. No associations replicated externally (N = 725). We discovered age-dependent genetic effects, established associations between vascular-related genes (KLKB1, F12, PLIN2) and midlife plasma aβ levels, and identified a plausible Alzheimer's Disease candidate gene (ITPRIP) influencing cell death. Plasma aβ concentrations may have dynamic biological determinants across the lifespan; plasma aβ study designs or analyses must consider age.

  19. Phylogenetic Relationships within the Opisthokonta Based on Phylogenomic Analyses of Conserved Single-Copy Protein Domains

    PubMed Central

    Torruella, Guifré; Derelle, Romain; Paps, Jordi; Lang, B. Franz; Roger, Andrew J.; Shalchian-Tabrizi, Kamran; Ruiz-Trillo, Iñaki

    2012-01-01

    Many of the eukaryotic phylogenomic analyses published to date were based on alignments of hundreds to thousands of genes. Frequently, in such analyses, the most realistic evolutionary models currently available are often used to minimize the impact of systematic error. However, controversy remains over whether or not idiosyncratic gene family dynamics (i.e., gene duplications and losses) and incorrect orthology assignments are always appropriately taken into account. In this paper, we present an innovative strategy for overcoming orthology assignment problems. Rather than identifying and eliminating genes with paralogy problems, we have constructed a data set comprised exclusively of conserved single-copy protein domains that, unlike most of the commonly used phylogenomic data sets, should be less confounded by orthology miss-assignments. To evaluate the power of this approach, we performed maximum likelihood and Bayesian analyses to infer the evolutionary relationships within the opisthokonts (which includes Metazoa, Fungi, and related unicellular lineages). We used this approach to test 1) whether Filasterea and Ichthyosporea form a clade, 2) the interrelationships of early-branching metazoans, and 3) the relationships among early-branching fungi. We also assessed the impact of some methods that are known to minimize systematic error, including reducing the distance between the outgroup and ingroup taxa or using the CAT evolutionary model. Overall, our analyses support the Filozoa hypothesis in which Ichthyosporea are the first holozoan lineage to emerge followed by Filasterea, Choanoflagellata, and Metazoa. Blastocladiomycota appears as a lineage separate from Chytridiomycota, although this result is not strongly supported. These results represent independent tests of previous phylogenetic hypotheses, highlighting the importance of sophisticated approaches for orthology assignment in phylogenomic analyses. PMID:21771718

  20. Genetic mapping of the rice resistance-breaking gene of the brown planthopper Nilaparvata lugens.

    PubMed

    Kobayashi, Tetsuya; Yamamoto, Kimiko; Suetsugu, Yoshitaka; Kuwazaki, Seigo; Hattori, Makoto; Jairin, Jirapong; Sanada-Morimura, Sachiyo; Matsumura, Masaya

    2014-07-22

    Host plant resistance has been widely used for controlling the major rice pest brown planthopper (BPH, Nilaparvata lugens). However, adaptation of the wild BPH population to resistance limits the effective use of resistant rice varieties. Quantitative trait locus (QTL) analysis was conducted to identify resistance-breaking genes against the anti-feeding mechanism mediated by the rice resistance gene Bph1. QTL analysis in iso-female BPH lines with single-nucleotide polymorphism (SNP) markers detected a single region on the 10th linkage group responsible for the virulence. The QTL explained from 57 to 84% of the total phenotypic variation. Bulked segregant analysis with next-generation sequencing in F2 progenies identified five SNPs genetically linked to the virulence. These analyses showed that virulence to Bph1 was controlled by a single recessive gene. In contrast to previous studies, the gene-for-gene relationship between the major resistance gene Bph1 and virulence gene of BPH was confirmed. Identified markers are available for map-based cloning of the major gene controlling BPH virulence to rice resistance. © 2014 The Author(s) Published by the Royal Society. All rights reserved.

  1. A kernel regression approach to gene-gene interaction detection for case-control studies.

    PubMed

    Larson, Nicholas B; Schaid, Daniel J

    2013-11-01

    Gene-gene interactions are increasingly being addressed as a potentially important contributor to the variability of complex traits. Consequently, attentions have moved beyond single locus analysis of association to more complex genetic models. Although several single-marker approaches toward interaction analysis have been developed, such methods suffer from very high testing dimensionality and do not take advantage of existing information, notably the definition of genes as functional units. Here, we propose a comprehensive family of gene-level score tests for identifying genetic elements of disease risk, in particular pairwise gene-gene interactions. Using kernel machine methods, we devise score-based variance component tests under a generalized linear mixed model framework. We conducted simulations based upon coalescent genetic models to evaluate the performance of our approach under a variety of disease models. These simulations indicate that our methods are generally higher powered than alternative gene-level approaches and at worst competitive with exhaustive SNP-level (where SNP is single-nucleotide polymorphism) analyses. Furthermore, we observe that simulated epistatic effects resulted in significant marginal testing results for the involved genes regardless of whether or not true main effects were present. We detail the benefits of our methods and discuss potential genome-wide analysis strategies for gene-gene interaction analysis in a case-control study design. © 2013 WILEY PERIODICALS, INC.

  2. Regulation of Gene Editing Activity Directed by Single-Stranded Oligonucleotides and CRISPR/Cas9 Systems

    PubMed Central

    Bialk, Pawel; Rivera-Torres, Natalia; Strouse, Bryan; Kmiec, Eric B.

    2015-01-01

    Single-stranded DNA oligonucleotides (ssODNs) can direct the repair of a single base mutation in human genes. While the regulation of this gene editing reaction has been partially elucidated, the low frequency with which repair occurs has hampered development toward clinical application. In this work a CRISPR/Cas9 complex is employed to induce double strand DNA breakage at specific sites surrounding the nucleotide designated for exchange. The result is a significant elevation in ssODN-directed gene repair, validated by a phenotypic readout. By analysing reaction parameters, we have uncovered restrictions on gene editing activity involving CRISPR/Cas9 complexes. First, ssODNs that hybridize to the non-transcribed strand direct a higher level of gene repair than those that hybridize to the transcribed strand. Second, cleavage must be proximal to the targeted mutant base to enable higher levels of gene editing. Third, DNA cleavage enables a higher level of gene editing activity as compared to single-stranded DNA nicks, created by modified Cas9 (Nickases). Fourth, we calculated the hybridization potential and free energy levels of ssODNs that are complementary to the guide RNA sequences of CRISPRs used in this study. We find a correlation between free energy potential and the capacity of single-stranded oligonucleotides to inhibit specific DNA cleavage activity, thereby indirectly reducing gene editing activity. Our data provide novel information that might be taken into consideration in the design and usage of CRISPR/Cas9 systems with ssODNs for gene editing. PMID:26053390

  3. Regulation of Gene Editing Activity Directed by Single-Stranded Oligonucleotides and CRISPR/Cas9 Systems.

    PubMed

    Bialk, Pawel; Rivera-Torres, Natalia; Strouse, Bryan; Kmiec, Eric B

    2015-01-01

    Single-stranded DNA oligonucleotides (ssODNs) can direct the repair of a single base mutation in human genes. While the regulation of this gene editing reaction has been partially elucidated, the low frequency with which repair occurs has hampered development toward clinical application. In this work a CRISPR/Cas9 complex is employed to induce double strand DNA breakage at specific sites surrounding the nucleotide designated for exchange. The result is a significant elevation in ssODN-directed gene repair, validated by a phenotypic readout. By analysing reaction parameters, we have uncovered restrictions on gene editing activity involving CRISPR/Cas9 complexes. First, ssODNs that hybridize to the non-transcribed strand direct a higher level of gene repair than those that hybridize to the transcribed strand. Second, cleavage must be proximal to the targeted mutant base to enable higher levels of gene editing. Third, DNA cleavage enables a higher level of gene editing activity as compared to single-stranded DNA nicks, created by modified Cas9 (Nickases). Fourth, we calculated the hybridization potential and free energy levels of ssODNs that are complementary to the guide RNA sequences of CRISPRs used in this study. We find a correlation between free energy potential and the capacity of single-stranded oligonucleotides to inhibit specific DNA cleavage activity, thereby indirectly reducing gene editing activity. Our data provide novel information that might be taken into consideration in the design and usage of CRISPR/Cas9 systems with ssODNs for gene editing.

  4. Functional analysis of aromatic biosynthetic pathways in Pseudomonas putida KT2440

    PubMed Central

    Molina‐Henares, M. Antonia; García‐Salamanca, Adela; Molina‐Henares, A. Jesús; De La Torre, Jesús; Herrera, M. Carmen; Ramos, Juan L.; Duque, Estrella

    2009-01-01

    Summary Pseudomonas putida KT2440 is a non‐pathogenic prototrophic bacterium with high potential for biotechnological applications. Despite all that is known about this strain, the biosynthesis of essential chemicals has not been fully analysed and auxotroph mutants are scarce. We carried out massive mini‐Tn5 random mutagenesis and screened for auxotrophs that require aromatic amino acids. The biosynthesis of aromatic amino acids was analysed in detail including physical and transcriptional organization of genes, complementation assays and feeding experiments to establish pathway intermediates. There is a single pathway from chorismate leading to the biosynthesis of tryptophan, whereas the biosynthesis of phenylalanine and tyrosine is achieved through multiple convergent pathways. Genes for tryptophan biosynthesis are grouped in unlinked regions with the trpBA and trpGDE genes organized as operons and the trpI, trpE and trpF genes organized as single transcriptional units. The pheA and tyrA gene‐encoding multifunctional enzymes for phenylalanine and tyrosine biosynthesis are linked in the chromosome and form an operon with the serC gene involved in serine biosynthesis. The last step in the biosynthesis of these two amino acids requires an amino transferase activity for which multiple tyrB‐like genes are present in the host chromosome. PMID:21261884

  5. Quantitative high-resolution genomic analysis of single cancer cells.

    PubMed

    Hannemann, Juliane; Meyer-Staeckling, Sönke; Kemming, Dirk; Alpers, Iris; Joosse, Simon A; Pospisil, Heike; Kurtz, Stefan; Görndt, Jennifer; Püschel, Klaus; Riethdorf, Sabine; Pantel, Klaus; Brandt, Burkhard

    2011-01-01

    During cancer progression, specific genomic aberrations arise that can determine the scope of the disease and can be used as predictive or prognostic markers. The detection of specific gene amplifications or deletions in single blood-borne or disseminated tumour cells that may give rise to the development of metastases is of great clinical interest but technically challenging. In this study, we present a method for quantitative high-resolution genomic analysis of single cells. Cells were isolated under permanent microscopic control followed by high-fidelity whole genome amplification and subsequent analyses by fine tiling array-CGH and qPCR. The assay was applied to single breast cancer cells to analyze the chromosomal region centred by the therapeutical relevant EGFR gene. This method allows precise quantitative analysis of copy number variations in single cell diagnostics.

  6. A Continental-Wide Perspective: The Genepool of Nuclear Encoded Ribosomal DNA and Single-Copy Gene Sequences in North American Boechera (Brassicaceae)

    PubMed Central

    Kiefer, Christiane; Koch, Marcus A.

    2012-01-01

    74 of the currently accepted 111 taxa of the North American genus Boechera (Brassicaceae) were subject to pyhlogenetic reconstruction and network analysis. The dataset comprised 911 accessions for which ITS sequences were analyzed. Phylogenetic analyses yielded largely unresolved trees. Together with the network analysis confirming this result this can be interpreted as an indication for multiple, independent, and rapid diversification events. Network analyses were superimposed with datasets describing i) geographical distribution, ii) taxonomy, iii) reproductive mode, and iv) distribution history based on phylogeographic evidence. Our results provide first direct evidence for enormous reticulate evolution in the entire genus and give further insights into the evolutionary history of this complex genus on a continental scale. In addition two novel single-copy gene markers, orthologues of the Arabidopsis thaliana genes At2g25920 and At3g18900, were analyzed for subsets of taxa and confirmed the findings obtained through the ITS data. PMID:22606266

  7. Glutamatergic and GABAergic gene sets in attention-deficit/hyperactivity disorder: association to overlapping traits in ADHD and autism.

    PubMed

    Naaijen, J; Bralten, J; Poelmans, G; Glennon, J C; Franke, B; Buitelaar, J K

    2017-01-10

    Attention-deficit/hyperactivity disorder (ADHD) and autism spectrum disorders (ASD) often co-occur. Both are highly heritable; however, it has been difficult to discover genetic risk variants. Glutamate and GABA are main excitatory and inhibitory neurotransmitters in the brain; their balance is essential for proper brain development and functioning. In this study we investigated the role of glutamate and GABA genetics in ADHD severity, autism symptom severity and inhibitory performance, based on gene set analysis, an approach to investigate multiple genetic variants simultaneously. Common variants within glutamatergic and GABAergic genes were investigated using the MAGMA software in an ADHD case-only sample (n=931), in which we assessed ASD symptoms and response inhibition on a Stop task. Gene set analysis for ADHD symptom severity, divided into inattention and hyperactivity/impulsivity symptoms, autism symptom severity and inhibition were performed using principal component regression analyses. Subsequently, gene-wide association analyses were performed. The glutamate gene set showed an association with severity of hyperactivity/impulsivity (P=0.009), which was robust to correcting for genome-wide association levels. The GABA gene set showed nominally significant association with inhibition (P=0.04), but this did not survive correction for multiple comparisons. None of single gene or single variant associations was significant on their own. By analyzing multiple genetic variants within candidate gene sets together, we were able to find genetic associations supporting the involvement of excitatory and inhibitory neurotransmitter systems in ADHD and ASD symptom severity in ADHD.

  8. Single nucleotide variants in metastasis-related genes are associated with breast cancer risk, by lymph node involvement and estrogen receptor status, in women with European and African ancestry

    PubMed Central

    Roberts, Michelle R.; Sucheston-Campbell, Lara E.; Zirpoli, Gary R.; Higgins, Michael; Freudenheim, Jo L.; Bandera, Elisa V.; Ambrosone, Christine B.; Yao, Song

    2017-01-01

    Background Single nucleotide polymorphisms (SNPs) in pathways influencing lymph node (LN) metastasis and estrogen receptor (ER) status in breast cancer may partially explain inter-patient variability in prognosis. We examined 154 SNPs in 12 metastasis-related genes for associations with breast cancer risk, stratified by LN and ER status, in European-American (EA) and African-American (AA) women. Methods 2,671 women enrolled in the Women’s Circle of Health Study were genotyped. Pathway analyses were conducted using the adaptive rank truncated product (ARTP) method, with pARTP≤0.10 as significant. Multi-allelic risk scores were created for the ARTP-significant gene(s). Single-SNP and risk score associations were modeled using logistic regression, with false discovery rate (FDR) p-value adjustment. Results Although single-SNP associations were not significant at pFDR<0.05, several genes were significant in the ARTP analyses. In AA women, significant ARTP gene-level associations included CDH1 with LN+ (pARTP=0.10; multi-allelic OR=1.13, 95% CI 1.07–1.19, pFDR=0.0003) and SIPA1 with ER− breast cancer (pARTP=0.10; multi-allelic OR=1.16, 95% CI 1.02–1.31, pFDR=0.03). In EA women, MTA2 was associated with overall breast cancer risk (pARTP=0.004), regardless of ER status, and with LN− disease (pARTP=0.01). Also significant were SATB1 in ER− (pARTP=0.03; multi-allelic OR=1.12, 95% CI 1.05–1.20, pFDR=0.003) and KISS1 in LN− (pARTP=0.10; multi-allelic OR=1.18, 95% CI 1.08–1.29, pFDR=0.002) analyses. Among LN+ cases, significant ARTP associations were observed for SNAI1, CD82, NME1, and CTNNB1 (multi-allelic OR=1.09, 95% CI 1.04–1.14, pFDR=0.001). Conclusion Our findings suggest that variants in several metastasis genes may affect breast cancer risk by LN or ER status, although verification in larger studies is required. PMID:27597141

  9. Co-expression networks reveal the tissue-specific regulation of transcription and splicing

    PubMed Central

    Saha, Ashis; Kim, Yungil; Gewirtz, Ariel D.H.; Jo, Brian; Gao, Chuan; McDowell, Ian C.; Engelhardt, Barbara E.

    2017-01-01

    Gene co-expression networks capture biologically important patterns in gene expression data, enabling functional analyses of genes, discovery of biomarkers, and interpretation of genetic variants. Most network analyses to date have been limited to assessing correlation between total gene expression levels in a single tissue or small sets of tissues. Here, we built networks that additionally capture the regulation of relative isoform abundance and splicing, along with tissue-specific connections unique to each of a diverse set of tissues. We used the Genotype-Tissue Expression (GTEx) project v6 RNA sequencing data across 50 tissues and 449 individuals. First, we developed a framework called Transcriptome-Wide Networks (TWNs) for combining total expression and relative isoform levels into a single sparse network, capturing the interplay between the regulation of splicing and transcription. We built TWNs for 16 tissues and found that hubs in these networks were strongly enriched for splicing and RNA binding genes, demonstrating their utility in unraveling regulation of splicing in the human transcriptome. Next, we used a Bayesian biclustering model that identifies network edges unique to a single tissue to reconstruct Tissue-Specific Networks (TSNs) for 26 distinct tissues and 10 groups of related tissues. Finally, we found genetic variants associated with pairs of adjacent nodes in our networks, supporting the estimated network structures and identifying 20 genetic variants with distant regulatory impact on transcription and splicing. Our networks provide an improved understanding of the complex relationships of the human transcriptome across tissues. PMID:29021288

  10. Estimation of gene induction enables a relevance-based ranking of gene sets.

    PubMed

    Bartholomé, Kilian; Kreutz, Clemens; Timmer, Jens

    2009-07-01

    In order to handle and interpret the vast amounts of data produced by microarray experiments, the analysis of sets of genes with a common biological functionality has been shown to be advantageous compared to single gene analyses. Some statistical methods have been proposed to analyse the differential gene expression of gene sets in microarray experiments. However, most of these methods either require threshhold values to be chosen for the analysis, or they need some reference set for the determination of significance. We present a method that estimates the number of differentially expressed genes in a gene set without requiring a threshold value for significance of genes. The method is self-contained (i.e., it does not require a reference set for comparison). In contrast to other methods which are focused on significance, our approach emphasizes the relevance of the regulation of gene sets. The presented method measures the degree of regulation of a gene set and is a useful tool to compare the induction of different gene sets and place the results of microarray experiments into the biological context. An R-package is available.

  11. Systematic Integration of Brain eQTL and GWAS Identifies ZNF323 as a Novel Schizophrenia Risk Gene and Suggests Recent Positive Selection Based on Compensatory Advantage on Pulmonary Function

    PubMed Central

    Luo, Xiong-Jian; Mattheisen, Manuel; Li, Ming; Huang, Liang; Rietschel, Marcella; Børglum, Anders D.; Als, Thomas D.; van den Oord, Edwin J.; Aberg, Karolina A.; Mors, Ole; Mortensen, Preben Bo; Luo, Zhenwu; Degenhardt, Franziska; Cichon, Sven; Schulze, Thomas G.; Nöthen, Markus M.; Su, Bing; Zhao, Zhongming; Gan, Lin; Yao, Yong-Gang

    2015-01-01

    Genome-wide association studies have identified multiple risk variants and loci that show robust association with schizophrenia. Nevertheless, it remains unclear how these variants confer risk to schizophrenia. In addition, the driving force that maintains the schizophrenia risk variants in human gene pool is poorly understood. To investigate whether expression-associated genetic variants contribute to schizophrenia susceptibility, we systematically integrated brain expression quantitative trait loci and genome-wide association data of schizophrenia using Sherlock, a Bayesian statistical framework. Our analyses identified ZNF323 as a schizophrenia risk gene (P = 2.22×10–6). Subsequent analyses confirmed the association of the ZNF323 and its expression-associated single nucleotide polymorphism rs1150711 in independent samples (gene-expression: P = 1.40×10–6; single-marker meta-analysis in the combined discovery and replication sample comprising 44123 individuals: P = 6.85×10−10). We found that the ZNF323 was significantly downregulated in hippocampus and frontal cortex of schizophrenia patients (P = .0038 and P = .0233, respectively). Evidence for pleiotropic effects was detected (association of rs1150711 with lung function and gene expression of ZNF323 in lung: P = 6.62×10–5 and P = 9.00×10–5, respectively) with the risk allele (T allele) for schizophrenia acting as protective allele for lung function. Subsequent population genetics analyses suggest that the risk allele (T) of rs1150711 might have undergone recent positive selection in human population. Our findings suggest that the ZNF323 is a schizophrenia susceptibility gene whose expression may influence schizophrenia risk. Our study also illustrates a possible mechanism for maintaining schizophrenia risk variants in the human gene pool. PMID:25759474

  12. Ancient Duplications and Expression Divergence in the Globin Gene Superfamily of Vertebrates: Insights from the Elephant Shark Genome and Transcriptome.

    PubMed

    Opazo, Juan C; Lee, Alison P; Hoffmann, Federico G; Toloza-Villalobos, Jessica; Burmester, Thorsten; Venkatesh, Byrappa; Storz, Jay F

    2015-07-01

    Comparative analyses of vertebrate genomes continue to uncover a surprising diversity of genes in the globin gene superfamily, some of which have very restricted phyletic distributions despite their antiquity. Genomic analysis of the globin gene repertoire of cartilaginous fish (Chondrichthyes) should be especially informative about the duplicative origins and ancestral functions of vertebrate globins, as divergence between Chondrichthyes and bony vertebrates represents the most basal split within the jawed vertebrates. Here, we report a comparative genomic analysis of the vertebrate globin gene family that includes the complete globin gene repertoire of the elephant shark (Callorhinchus milii). Using genomic sequence data from representatives of all major vertebrate classes, integrated analyses of conserved synteny and phylogenetic relationships revealed that the last common ancestor of vertebrates possessed a repertoire of at least seven globin genes: single copies of androglobin and neuroglobin, four paralogous copies of globin X, and the single-copy progenitor of the entire set of vertebrate-specific globins. Combined with expression data, the genomic inventory of elephant shark globins yielded four especially surprising findings: 1) there is no trace of the neuroglobin gene (a highly conserved gene that is present in all other jawed vertebrates that have been examined to date), 2) myoglobin is highly expressed in heart, but not in skeletal muscle (reflecting a possible ancestral condition in vertebrates with single-circuit circulatory systems), 3) elephant shark possesses two highly divergent globin X paralogs, one of which is preferentially expressed in gonads, and 4) elephant shark possesses two structurally distinct α-globin paralogs, one of which is preferentially expressed in the brain. Expression profiles of elephant shark globin genes reveal distinct specializations of function relative to orthologs in bony vertebrates and suggest hypotheses about ancestral functions of vertebrate globins. © The Author 2015. Published by Oxford University Press on behalf of the Society for Molecular Biology and Evolution. All rights reserved. For permissions, please e-mail: journals.permissions@oup.com.

  13. High-resolution single-nucleotide polymorphism array-profiling in myeloproliferative neoplasms identifies novel genomic aberrations

    PubMed Central

    Stegelmann, Frank; Bullinger, Lars; Griesshammer, Martin; Holzmann, Karlheinz; Habdank, Marianne; Kuhn, Susanne; Maile, Carmen; Schauer, Stefanie; Döhner, Hartmut; Döhner, Konstanze

    2010-01-01

    Single-nucleotide polymorphism arrays allow for genome-wide profiling of copy-number alterations and copy-neutral runs of homozygosity at high resolution. To identify novel genetic lesions in myeloproliferative neoplasms, a large series of 151 clinically well characterized patients was analyzed in our study. Copy-number alterations were rare in essential thrombocythemia and polycythemia vera. In contrast, approximately one third of myelofibrosis patients exhibited small genomic losses (less than 5 Mb). In 2 secondary myelofibrosis cases the tumor suppressor gene NF1 in 17q11.2 was affected. Sequencing analyses revealed a mutation in the remaining NF1 allele of one patient. In terms of copy-neutral aberrations, no chromosomes other than 9p were recurrently affected. In conclusion, novel genomic aberrations were identified in our study, in particular in patients with myelofibrosis. Further analyses on single-gene level are necessary to uncover the mechanisms that are involved in the pathogenesis of myeloproliferative neoplasms. PMID:20015882

  14. Differential Network Analyses of Alzheimer’s Disease Identify Early Events in Alzheimer’s Disease Pathology

    DOE Office of Scientific and Technical Information (OSTI.GOV)

    Xia, Jing; Rocke, David M.; Perry, George

    In late-onset Alzheimer’s disease (AD), multiple brain regions are not affected simultaneously. Comparing the gene expression of the affected regions to identify the differences in the biological processes perturbed can lead to greater insight into AD pathogenesis and early characteristics. We identified differentially expressed (DE) genes from single cell microarray data of four AD affected brain regions: entorhinal cortex (EC), hippocampus (HIP), posterior cingulate cortex (PCC), and middle temporal gyrus (MTG). We organized the DE genes in the four brain regions into region-specific gene coexpression networks. Differential neighborhood analyses in the coexpression networks were performed to identify genes with lowmore » topological overlap (TO) of their direct neighbors. The low TO genes were used to characterize the biological differences between two regions. Our analyses show that increased oxidative stress, along with alterations in lipid metabolism in neurons, may be some of the very early events occurring in AD pathology. Cellular defense mechanisms try to intervene but fail, finally resulting in AD pathology as the disease progresses. Furthermore, disease annotation of the low TO genes in two independent protein interaction networks has resulted in association between cancer, diabetes, renal diseases, and cardiovascular diseases.« less

  15. Differential Network Analyses of Alzheimer’s Disease Identify Early Events in Alzheimer’s Disease Pathology

    DOE PAGES

    Xia, Jing; Rocke, David M.; Perry, George; ...

    2014-01-01

    In late-onset Alzheimer’s disease (AD), multiple brain regions are not affected simultaneously. Comparing the gene expression of the affected regions to identify the differences in the biological processes perturbed can lead to greater insight into AD pathogenesis and early characteristics. We identified differentially expressed (DE) genes from single cell microarray data of four AD affected brain regions: entorhinal cortex (EC), hippocampus (HIP), posterior cingulate cortex (PCC), and middle temporal gyrus (MTG). We organized the DE genes in the four brain regions into region-specific gene coexpression networks. Differential neighborhood analyses in the coexpression networks were performed to identify genes with lowmore » topological overlap (TO) of their direct neighbors. The low TO genes were used to characterize the biological differences between two regions. Our analyses show that increased oxidative stress, along with alterations in lipid metabolism in neurons, may be some of the very early events occurring in AD pathology. Cellular defense mechanisms try to intervene but fail, finally resulting in AD pathology as the disease progresses. Furthermore, disease annotation of the low TO genes in two independent protein interaction networks has resulted in association between cancer, diabetes, renal diseases, and cardiovascular diseases.« less

  16. BASiCS: Bayesian Analysis of Single-Cell Sequencing Data

    PubMed Central

    Vallejos, Catalina A.; Marioni, John C.; Richardson, Sylvia

    2015-01-01

    Single-cell mRNA sequencing can uncover novel cell-to-cell heterogeneity in gene expression levels in seemingly homogeneous populations of cells. However, these experiments are prone to high levels of unexplained technical noise, creating new challenges for identifying genes that show genuine heterogeneous expression within the population of cells under study. BASiCS (Bayesian Analysis of Single-Cell Sequencing data) is an integrated Bayesian hierarchical model where: (i) cell-specific normalisation constants are estimated as part of the model parameters, (ii) technical variability is quantified based on spike-in genes that are artificially introduced to each analysed cell’s lysate and (iii) the total variability of the expression counts is decomposed into technical and biological components. BASiCS also provides an intuitive detection criterion for highly (or lowly) variable genes within the population of cells under study. This is formalised by means of tail posterior probabilities associated to high (or low) biological cell-to-cell variance contributions, quantities that can be easily interpreted by users. We demonstrate our method using gene expression measurements from mouse Embryonic Stem Cells. Cross-validation and meaningful enrichment of gene ontology categories within genes classified as highly (or lowly) variable supports the efficacy of our approach. PMID:26107944

  17. BASiCS: Bayesian Analysis of Single-Cell Sequencing Data.

    PubMed

    Vallejos, Catalina A; Marioni, John C; Richardson, Sylvia

    2015-06-01

    Single-cell mRNA sequencing can uncover novel cell-to-cell heterogeneity in gene expression levels in seemingly homogeneous populations of cells. However, these experiments are prone to high levels of unexplained technical noise, creating new challenges for identifying genes that show genuine heterogeneous expression within the population of cells under study. BASiCS (Bayesian Analysis of Single-Cell Sequencing data) is an integrated Bayesian hierarchical model where: (i) cell-specific normalisation constants are estimated as part of the model parameters, (ii) technical variability is quantified based on spike-in genes that are artificially introduced to each analysed cell's lysate and (iii) the total variability of the expression counts is decomposed into technical and biological components. BASiCS also provides an intuitive detection criterion for highly (or lowly) variable genes within the population of cells under study. This is formalised by means of tail posterior probabilities associated to high (or low) biological cell-to-cell variance contributions, quantities that can be easily interpreted by users. We demonstrate our method using gene expression measurements from mouse Embryonic Stem Cells. Cross-validation and meaningful enrichment of gene ontology categories within genes classified as highly (or lowly) variable supports the efficacy of our approach.

  18. The complete chloroplast genome of the Dendrobium strongylanthum (Orchidaceae: Epidendroideae).

    PubMed

    Li, Jing; Chen, Chen; Wang, Zhe-Zhi

    2016-07-01

    Complete chloroplast genome sequence is very useful for studying the phylogenetic and evolution of species. In this study, the complete chloroplast genome of Dendrobium strongylanthum was constructed from whole-genome Illumina sequencing data. The chloroplast genome is 153 058 bp in length with 37.6% GC content and consists of two inverted repeats (IRs) of 26 316 bp. The IR regions are separated by large single-copy region (LSC, 85 836 bp) and small single-copy (SSC, 14 590 bp) region. A total of 130 chloroplast genes were successfully annotated, including 84 protein coding genes, 38 tRNA genes, and eight rRNA genes. Phylogenetic analyses showed that the chloroplast genome of Dendrobium strongylanthum is related to that of the Dendrobium officinal.

  19. Novel linkage disequilibrium clustering algorithm identifies new lupus genes on meta-analysis of GWAS datasets.

    PubMed

    Saeed, Mohammad

    2017-05-01

    Systemic lupus erythematosus (SLE) is a complex disorder. Genetic association studies of complex disorders suffer from the following three major issues: phenotypic heterogeneity, false positive (type I error), and false negative (type II error) results. Hence, genes with low to moderate effects are missed in standard analyses, especially after statistical corrections. OASIS is a novel linkage disequilibrium clustering algorithm that can potentially address false positives and negatives in genome-wide association studies (GWAS) of complex disorders such as SLE. OASIS was applied to two SLE dbGAP GWAS datasets (6077 subjects; ∼0.75 million single-nucleotide polymorphisms). OASIS identified three known SLE genes viz. IFIH1, TNIP1, and CD44, not previously reported using these GWAS datasets. In addition, 22 novel loci for SLE were identified and the 5 SLE genes previously reported using these datasets were verified. OASIS methodology was validated using single-variant replication and gene-based analysis with GATES. This led to the verification of 60% of OASIS loci. New SLE genes that OASIS identified and were further verified include TNFAIP6, DNAJB3, TTF1, GRIN2B, MON2, LATS2, SNX6, RBFOX1, NCOA3, and CHAF1B. This study presents the OASIS algorithm, software, and the meta-analyses of two publicly available SLE GWAS datasets along with the novel SLE genes. Hence, OASIS is a novel linkage disequilibrium clustering method that can be universally applied to existing GWAS datasets for the identification of new genes.

  20. Identification of gene regulation models from single-cell data

    NASA Astrophysics Data System (ADS)

    Weber, Lisa; Raymond, William; Munsky, Brian

    2018-09-01

    In quantitative analyses of biological processes, one may use many different scales of models (e.g. spatial or non-spatial, deterministic or stochastic, time-varying or at steady-state) or many different approaches to match models to experimental data (e.g. model fitting or parameter uncertainty/sloppiness quantification with different experiment designs). These different analyses can lead to surprisingly different results, even when applied to the same data and the same model. We use a simplified gene regulation model to illustrate many of these concerns, especially for ODE analyses of deterministic processes, chemical master equation and finite state projection analyses of heterogeneous processes, and stochastic simulations. For each analysis, we employ MATLAB and PYTHON software to consider a time-dependent input signal (e.g. a kinase nuclear translocation) and several model hypotheses, along with simulated single-cell data. We illustrate different approaches (e.g. deterministic and stochastic) to identify the mechanisms and parameters of the same model from the same simulated data. For each approach, we explore how uncertainty in parameter space varies with respect to the chosen analysis approach or specific experiment design. We conclude with a discussion of how our simulated results relate to the integration of experimental and computational investigations to explore signal-activated gene expression models in yeast (Neuert et al 2013 Science 339 584–7) and human cells (Senecal et al 2014 Cell Rep. 8 75–83)5.

  1. Quantitative High-Resolution Genomic Analysis of Single Cancer Cells

    PubMed Central

    Hannemann, Juliane; Meyer-Staeckling, Sönke; Kemming, Dirk; Alpers, Iris; Joosse, Simon A.; Pospisil, Heike; Kurtz, Stefan; Görndt, Jennifer; Püschel, Klaus; Riethdorf, Sabine; Pantel, Klaus; Brandt, Burkhard

    2011-01-01

    During cancer progression, specific genomic aberrations arise that can determine the scope of the disease and can be used as predictive or prognostic markers. The detection of specific gene amplifications or deletions in single blood-borne or disseminated tumour cells that may give rise to the development of metastases is of great clinical interest but technically challenging. In this study, we present a method for quantitative high-resolution genomic analysis of single cells. Cells were isolated under permanent microscopic control followed by high-fidelity whole genome amplification and subsequent analyses by fine tiling array-CGH and qPCR. The assay was applied to single breast cancer cells to analyze the chromosomal region centred by the therapeutical relevant EGFR gene. This method allows precise quantitative analysis of copy number variations in single cell diagnostics. PMID:22140428

  2. Analysis of a genome-wide set of gene deletions in the fission yeast Schizosaccharomyces pombe

    PubMed Central

    Duhig, Trevor; Nam, Miyoung; Palmer, Georgia; Han, Sangjo; Jeffery, Linda; Baek, Seung-Tae; Lee, Hyemi; Shim, Young Sam; Lee, Minho; Kim, Lila; Heo, Kyung-Sun; Noh, Eun Joo; Lee, Ah-Reum; Jang, Young-Joo; Chung, Kyung-Sook; Choi, Shin-Jung; Park, Jo-Young; Park, Youngwoo; Kim, Hwan Mook; Park, Song-Kyu; Park, Hae-Joon; Kang, Eun-Jung; Kim, Hyong Bai; Kang, Hyun-Sam; Park, Hee-Moon; Kim, Kyunghoon; Song, Kiwon; Song, Kyung Bin; Nurse, Paul; Hoe, Kwang-Lae

    2014-01-01

    SUMMARY We report the construction and analysis of 4,836 heterozygous diploid deletion mutants covering 98.4% of the fission yeast genome. This resource provides a powerful tool for biotechnological and eukaryotic cell biology research. Comprehensive gene dispensability comparisons with budding yeast, the first time such studies have been possible between two eukaryotes, revealed that 83% of single copy orthologues in the two yeasts had conserved dispensability. Gene dispensability differed for certain pathways between the two yeasts, including mitochondrial translation and cell cycle checkpoint control. We show that fission yeast has more essential genes than budding yeast and that essential genes are more likely than non-essential genes to be single copy, broadly conserved and to contain introns. Growth fitness analyses determined sets of haploinsufficient and haploproficient genes for fission yeast, and comparisons with budding yeast identified specific ribosomal proteins and RNA polymerase subunits, which may act more generally to regulate eukaryotic cell growth. PMID:20473289

  3. Sunflower domestication alleles support single domestication center in eastern North America

    PubMed Central

    Blackman, Benjamin K.; Scascitelli, Moira; Kane, Nolan C.; Luton, Harry H.; Rasmussen, David A.; Bye, Robert A.; Lentz, David L.; Rieseberg, Loren H.

    2011-01-01

    Phylogenetic analyses of genes with demonstrated involvement in evolutionary transitions can be an important means of resolving conflicting hypotheses about evolutionary history or process. In sunflower, two genes have previously been shown to have experienced selective sweeps during its early domestication. In the present study, we identified a third candidate early domestication gene and conducted haplotype analyses of all three genes to address a recent, controversial hypothesis about the origin of cultivated sunflower. Although the scientific consensus had long been that sunflower was domesticated once in eastern North America, the discovery of pre-Columbian sunflower remains at archaeological sites in Mexico led to the proposal of a second domestication center in southern Mexico. Previous molecular studies with neutral markers were consistent with the former hypothesis. However, only two indigenous Mexican cultivars were included in these studies, and their provenance and genetic purity have been questioned. Therefore, we sequenced regions of the three candidate domestication genes containing SNPs diagnostic for domestication from large, newly collected samples of Mexican sunflower landraces and Mexican wild populations from a broad geographic range. The new germplasm also was genotyped for 12 microsatellite loci. Our evidence from multiple evolutionarily important loci and from neutral markers supports a single domestication event for extant cultivated sunflower in eastern North America. PMID:21844335

  4. Beyond the usual suspects: a multidimensional genetic exploration of infant attachment disorganization and security.

    PubMed

    Pappa, Irene; Szekely, Eszter; Mileva-Seitz, Viara R; Luijk, Maartje P C M; Bakermans-Kranenburg, Marian J; van IJzendoorn, Marinus H; Tiemeier, Henning

    2015-01-01

    Although the environmental influences on infant attachment disorganization and security are well-studied, little is known about their heritability. Candidate gene studies have shown small, often non-replicable effects. In this study, we gathered the largest sample (N = 657) of ethnically homogenous, 14-month-old children with both observed attachment and genome-wide data. First, we used a Genome-Wide Association Study (GWAS) approach to identify single nucleotide polymorphisms (SNPs) associated with attachment disorganization and security. Second, we annotated them into genes (Versatile Gene-based Association Study) and functional pathways. Our analyses provide evidence of novel genes (HDAC1, ZNF675, BSCD1) and pathways (synaptic transmission, cation transport) associated with attachment disorganization. Similar analyses identified a novel gene (BECN1) but no distinct pathways associated with attachment security. The results of this first extensive, exploratory study on the molecular-genetic basis of infant attachment await replication in large, independent samples.

  5. Gene disruption in Trichoderma atroviride via Agrobacterium-mediated transformation.

    PubMed

    Zeilinger, Susanne

    2004-02-01

    A modified Agrobacterium-mediated transformation method for the efficient disruption of two genes encoding signaling compounds of the mycoparasite Trichoderma atroviride is described, using the hph gene of Escherichia coli as selection marker. The transformation vectors contained about 1 kb of 5' and 3' non-coding regions from the tmk1 (encoding a MAP kinase) or tga3 (encoding an alpha-subunit of a heterotrimeric G protein) target loci flanking a selection marker. Transformation of fungal conidia and selection on hygromycin-containing media applying an overlay-based procedure, which overcomes the lack of formation of distinct single colonies by the fungus, led to stable clones for both disruption constructs. Southern and PCR analyses proved gene disruption by single-copy homologous integration with a frequency of approximately 60% for both genes; and the loss of tmk1 and tga3 transcript formation in the disruptants was demonstrated by RT-PCR.

  6. The complete chloroplast genome sequences of Lychnis wilfordii and Silene capitata and comparative analyses with other Caryophyllaceae genomes.

    PubMed

    Kang, Jong-Soo; Lee, Byoung Yoon; Kwak, Myounghai

    2017-01-01

    The complete chloroplast genomes of Lychnis wilfordii and Silene capitata were determined and compared with ten previously reported Caryophyllaceae chloroplast genomes. The chloroplast genome sequences of L. wilfordii and S. capitata contain 152,320 bp and 150,224 bp, respectively. The gene contents and orders among 12 Caryophyllaceae species are consistent, but several microstructural changes have occurred. Expansion of the inverted repeat (IR) regions at the large single copy (LSC)/IRb and small single copy (SSC)/IR boundaries led to partial or entire gene duplications. Additionally, rearrangements of the LSC region were caused by gene inversions and/or transpositions. The 18 kb inversions, which occurred three times in different lineages of tribe Sileneae, were thought to be facilitated by the intermolecular duplicated sequences. Sequence analyses of the L. wilfordii and S. capitata genomes revealed 39 and 43 repeats, respectively, including forward, palindromic, and reverse repeats. In addition, a total of 67 and 56 simple sequence repeats were discovered in the L. wilfordii and S. capitata chloroplast genomes, respectively. Finally, we constructed phylogenetic trees of the 12 Caryophyllaceae species and two Amaranthaceae species based on 73 protein-coding genes using both maximum parsimony and likelihood methods.

  7. Multi-variant study of obesity risk genes in African Americans: The Jackson Heart Study.

    PubMed

    Liu, Shijian; Wilson, James G; Jiang, Fan; Griswold, Michael; Correa, Adolfo; Mei, Hao

    2016-11-30

    Genome-wide association study (GWAS) has been successful in identifying obesity risk genes by single-variant association analysis. For this study, we designed steps of analysis strategy and aimed to identify multi-variant effects on obesity risk among candidate genes. Our analyses were focused on 2137 African American participants with body mass index measured in the Jackson Heart Study and 657 common single nucleotide polymorphisms (SNPs) genotyped at 8 GWAS-identified obesity risk genes. Single-variant association test showed that no SNPs reached significance after multiple testing adjustment. The following gene-gene interaction analysis, which was focused on SNPs with unadjusted p-value<0.10, identified 6 significant multi-variant associations. Logistic regression showed that SNPs in these associations did not have significant linear interactions; examination of genetic risk score evidenced that 4 multi-variant associations had significant additive effects of risk SNPs; and haplotype association test presented that all multi-variant associations contained one or several combinations of particular alleles or haplotypes, associated with increased obesity risk. Our study evidenced that obesity risk genes generated multi-variant effects, which can be additive or non-linear interactions, and multi-variant study is an important supplement to existing GWAS for understanding genetic effects of obesity risk genes. Copyright © 2016 Elsevier B.V. All rights reserved.

  8. Probabilistic modeling of bifurcations in single-cell gene expression data using a Bayesian mixture of factor analyzers.

    PubMed

    Campbell, Kieran R; Yau, Christopher

    2017-03-15

    Modeling bifurcations in single-cell transcriptomics data has become an increasingly popular field of research. Several methods have been proposed to infer bifurcation structure from such data, but all rely on heuristic non-probabilistic inference. Here we propose the first generative, fully probabilistic model for such inference based on a Bayesian hierarchical mixture of factor analyzers. Our model exhibits competitive performance on large datasets despite implementing full Markov-Chain Monte Carlo sampling, and its unique hierarchical prior structure enables automatic determination of genes driving the bifurcation process. We additionally propose an Empirical-Bayes like extension that deals with the high levels of zero-inflation in single-cell RNA-seq data and quantify when such models are useful. We apply or model to both real and simulated single-cell gene expression data and compare the results to existing pseudotime methods. Finally, we discuss both the merits and weaknesses of such a unified, probabilistic approach in the context practical bioinformatics analyses.

  9. Genome Sequence and Analysis of Escherichia coli MRE600, a Colicinogenic, Nonmotile Strain that Lacks RNase I and the Type I Methyltransferase, EcoKI

    PubMed Central

    Kurylo, Chad M.; Alexander, Noah; Dass, Randall A.; Parks, Matthew M.; Altman, Roger A.; Vincent, C. Theresa; Mason, Christopher E.; Blanchard, Scott C.

    2016-01-01

    Escherichia coli strain MRE600 was originally identified for its low RNase I activity and has therefore been widely adopted by the biomedical research community as a preferred source for the expression and purification of transfer RNAs and ribosomes. Despite its widespread use, surprisingly little information about its genome or genetic content exists. Here, we present the first de novo assembly and description of the MRE600 genome and epigenome. To provide context to these studies of MRE600, we include comparative analyses with E. coli K-12 MG1655 (K12). Pacific Biosciences Single Molecule, Real-Time sequencing reads were assembled into one large chromosome (4.83 Mb) and three smaller plasmids (89.1, 56.9, and 7.1 kb). Interestingly, the 7.1-kb plasmid possesses genes encoding a colicin E1 protein and its associated immunity protein. The MRE600 genome has a G + C content of 50.8% and contains a total of 5,181 genes, including 4,913 protein-encoding genes and 268 RNA genes. We identified 41,469 modified DNA bases (0.83% of total) and found that MRE600 lacks the gene for type I methyltransferase, EcoKI. Phylogenetic, taxonomic, and genetic analyses demonstrate that MRE600 is a divergent E. coli strain that displays features of the closely related genus, Shigella. Nevertheless, comparative analyses between MRE600 and E. coli K12 show that these two strains exhibit nearly identical ribosomal proteins, ribosomal RNAs, and highly homologous tRNA species. Substantiating prior suggestions that MRE600 lacks RNase I activity, the RNase I-encoding gene, rna, contains a single premature stop codon early in its open-reading frame. PMID:26802429

  10. Glutamatergic and GABAergic gene sets in attention-deficit/hyperactivity disorder: association to overlapping traits in ADHD and autism

    PubMed Central

    Naaijen, J; Bralten, J; Poelmans, G; Faraone, Stephen; Asherson, Philip; Banaschewski, Tobias; Buitelaar, Jan; Franke, Barbara; P Ebstein, Richard; Gill, Michael; Miranda, Ana; D Oades, Robert; Roeyers, Herbert; Rothenberger, Aribert; Sergeant, Joseph; Sonuga-Barke, Edmund; Anney, Richard; Mulas, Fernando; Steinhausen, Hans-Christoph; Glennon, J C; Franke, B; Buitelaar, J K

    2017-01-01

    Attention-deficit/hyperactivity disorder (ADHD) and autism spectrum disorders (ASD) often co-occur. Both are highly heritable; however, it has been difficult to discover genetic risk variants. Glutamate and GABA are main excitatory and inhibitory neurotransmitters in the brain; their balance is essential for proper brain development and functioning. In this study we investigated the role of glutamate and GABA genetics in ADHD severity, autism symptom severity and inhibitory performance, based on gene set analysis, an approach to investigate multiple genetic variants simultaneously. Common variants within glutamatergic and GABAergic genes were investigated using the MAGMA software in an ADHD case-only sample (n=931), in which we assessed ASD symptoms and response inhibition on a Stop task. Gene set analysis for ADHD symptom severity, divided into inattention and hyperactivity/impulsivity symptoms, autism symptom severity and inhibition were performed using principal component regression analyses. Subsequently, gene-wide association analyses were performed. The glutamate gene set showed an association with severity of hyperactivity/impulsivity (P=0.009), which was robust to correcting for genome-wide association levels. The GABA gene set showed nominally significant association with inhibition (P=0.04), but this did not survive correction for multiple comparisons. None of single gene or single variant associations was significant on their own. By analyzing multiple genetic variants within candidate gene sets together, we were able to find genetic associations supporting the involvement of excitatory and inhibitory neurotransmitter systems in ADHD and ASD symptom severity in ADHD. PMID:28072412

  11. TARGET Researchers Identify Mutations in SIX1/2 and microRNA Processing Genes in Favorable Histology Wilms Tumor | Office of Cancer Genomics

    Cancer.gov

    TARGET researchers molecularly characterized favorable histology Wilms tumor (FHWT), a pediatric renal cancer. Comprehensive genome and transcript analyses revealed single-nucleotide substitution/deletion mutations in microRNA processing genes (15% of FHWT patients) and Sine Oculis Homeobox Homolog 1/2 (SIX1/2) genes (7% of FHWT patients). SIX1/2 genes play a critical role in renal development and were not previously associated with FHWT, thus presenting a novel role for SIX1/2 pathway aberrations in this disease.

  12. Transcription factors and stress response gene alterations in human keratinocytes following Solar Simulated Ultra Violet Radiation.

    PubMed

    Marais, Thomas L Des; Kluz, Thomas; Xu, Dazhong; Zhang, Xiaoru; Gesumaria, Lisa; Matsui, Mary S; Costa, Max; Sun, Hong

    2017-10-19

    Ultraviolet radiation (UVR) from sunlight is the major effector for skin aging and carcinogenesis. However, genes and pathways altered by solar-simulated UVR (ssUVR), a mixture of UVA and UVB, are not well characterized. Here we report global changes in gene expression as well as associated pathways and upstream transcription factors in human keratinocytes exposed to ssUVR. Human HaCaT keratinocytes were exposed to either a single dose or 5 repetitive doses of ssUVR. Comprehensive analyses of gene expression profiles as well as functional annotation were performed at 24 hours post irradiation. Our results revealed that ssUVR modulated genes with diverse cellular functions changed in a dose-dependent manner. Gene expression in cells exposed to a single dose of ssUVR differed significantly from those that underwent repetitive exposures. While single ssUVR caused a significant inhibition in genes involved in cell cycle progression, especially G2/M checkpoint and mitotic regulation, repetitive ssUVR led to extensive changes in genes related to cell signaling and metabolism. We have also identified a panel of ssUVR target genes that exhibited persistent changes in gene expression even at 1 week after irradiation. These results revealed a complex network of transcriptional regulators and pathways that orchestrate the cellular response to ssUVR.

  13. Co-expression networks reveal the tissue-specific regulation of transcription and splicing.

    PubMed

    Saha, Ashis; Kim, Yungil; Gewirtz, Ariel D H; Jo, Brian; Gao, Chuan; McDowell, Ian C; Engelhardt, Barbara E; Battle, Alexis

    2017-11-01

    Gene co-expression networks capture biologically important patterns in gene expression data, enabling functional analyses of genes, discovery of biomarkers, and interpretation of genetic variants. Most network analyses to date have been limited to assessing correlation between total gene expression levels in a single tissue or small sets of tissues. Here, we built networks that additionally capture the regulation of relative isoform abundance and splicing, along with tissue-specific connections unique to each of a diverse set of tissues. We used the Genotype-Tissue Expression (GTEx) project v6 RNA sequencing data across 50 tissues and 449 individuals. First, we developed a framework called Transcriptome-Wide Networks (TWNs) for combining total expression and relative isoform levels into a single sparse network, capturing the interplay between the regulation of splicing and transcription. We built TWNs for 16 tissues and found that hubs in these networks were strongly enriched for splicing and RNA binding genes, demonstrating their utility in unraveling regulation of splicing in the human transcriptome. Next, we used a Bayesian biclustering model that identifies network edges unique to a single tissue to reconstruct Tissue-Specific Networks (TSNs) for 26 distinct tissues and 10 groups of related tissues. Finally, we found genetic variants associated with pairs of adjacent nodes in our networks, supporting the estimated network structures and identifying 20 genetic variants with distant regulatory impact on transcription and splicing. Our networks provide an improved understanding of the complex relationships of the human transcriptome across tissues. © 2017 Saha et al.; Published by Cold Spring Harbor Laboratory Press.

  14. TTT and PIKK Complex Genes Reverted to Single Copy Following Polyploidization and Retain Function Despite Massive Retrotransposition in Maize.

    PubMed

    Garcia, Nelson; Messing, Joachim

    2017-01-01

    The TEL2, TTI1, and TTI2 proteins are co-chaperones for heat shock protein 90 (HSP90) to regulate the protein folding and maturation of phosphatidylinositol 3-kinase-related kinases (PIKKs). Referred to as the TTT complex, the genes that encode them are highly conserved from man to maize. TTT complex and PIKK genes exist mostly as single copy genes in organisms where they have been characterized. Members of this interacting protein network in maize were identified and synteny analyses were performed to study their evolution. Similar to other species, there is only one copy of each of these genes in maize which was due to a loss of the duplicated copy created by ancient allotetraploidy. Moreover, the retained copies of the TTT complex and the PIKK genes tolerated extensive retrotransposon insertion in their introns that resulted in increased gene lengths and gene body methylation, without apparent effect in normal gene expression and function. The results raise an interesting question on whether the reversion to single copy was due to selection against deleterious unbalanced gene duplications between members of the complex as predicted by the gene balance hypothesis, or due to neutral loss of extra copies. Uneven alteration of dosage either by adding extra copies or modulating gene expression of complex members is being proposed as a means to investigate whether the data supports the gene balance hypothesis or not.

  15. Improvement of experimental testing and network training conditions with genome-wide microarrays for more accurate predictions of drug gene targets

    PubMed Central

    2014-01-01

    Background Genome-wide microarrays have been useful for predicting chemical-genetic interactions at the gene level. However, interpreting genome-wide microarray results can be overwhelming due to the vast output of gene expression data combined with off-target transcriptional responses many times induced by a drug treatment. This study demonstrates how experimental and computational methods can interact with each other, to arrive at more accurate predictions of drug-induced perturbations. We present a two-stage strategy that links microarray experimental testing and network training conditions to predict gene perturbations for a drug with a known mechanism of action in a well-studied organism. Results S. cerevisiae cells were treated with the antifungal, fluconazole, and expression profiling was conducted under different biological conditions using Affymetrix genome-wide microarrays. Transcripts were filtered with a formal network-based method, sparse simultaneous equation models and Lasso regression (SSEM-Lasso), under different network training conditions. Gene expression results were evaluated using both gene set and single gene target analyses, and the drug’s transcriptional effects were narrowed first by pathway and then by individual genes. Variables included: (i) Testing conditions – exposure time and concentration and (ii) Network training conditions – training compendium modifications. Two analyses of SSEM-Lasso output – gene set and single gene – were conducted to gain a better understanding of how SSEM-Lasso predicts perturbation targets. Conclusions This study demonstrates that genome-wide microarrays can be optimized using a two-stage strategy for a more in-depth understanding of how a cell manifests biological reactions to a drug treatment at the transcription level. Additionally, a more detailed understanding of how the statistical model, SSEM-Lasso, propagates perturbations through a network of gene regulatory interactions is achieved. PMID:24444313

  16. Simultaneous enumeration of cancer and immune cell types from bulk tumor gene expression data.

    PubMed

    Racle, Julien; de Jonge, Kaat; Baumgaertner, Petra; Speiser, Daniel E; Gfeller, David

    2017-11-13

    Immune cells infiltrating tumors can have important impact on tumor progression and response to therapy. We present an efficient algorithm to simultaneously estimate the fraction of cancer and immune cell types from bulk tumor gene expression data. Our method integrates novel gene expression profiles from each major non-malignant cell type found in tumors, renormalization based on cell-type-specific mRNA content, and the ability to consider uncharacterized and possibly highly variable cell types. Feasibility is demonstrated by validation with flow cytometry, immunohistochemistry and single-cell RNA-Seq analyses of human melanoma and colorectal tumor specimens. Altogether, our work not only improves accuracy but also broadens the scope of absolute cell fraction predictions from tumor gene expression data, and provides a unique novel experimental benchmark for immunogenomics analyses in cancer research (http://epic.gfellerlab.org).

  17. Effect of misspecification of gene frequency on the two-point LOD score.

    PubMed

    Pal, D K; Durner, M; Greenberg, D A

    2001-11-01

    In this study, we used computer simulation of simple and complex models to ask: (1) What is the penalty in evidence for linkage when the assumed gene frequency is far from the true gene frequency? (2) If the assumed model for gene frequency and inheritance are misspecified in the analysis, can this lead to a higher maximum LOD score than that obtained under the true parameters? Linkage data simulated under simple dominant, recessive, dominant and recessive with reduced penetrance, and additive models, were analysed assuming a single locus with both the correct and incorrect dominance model and assuming a range of different gene frequencies. We found that misspecifying the analysis gene frequency led to little penalty in maximum LOD score in all models examined, especially if the assumed gene frequency was lower than the generating one. Analysing linkage data assuming a gene frequency of the order of 0.01 for a dominant gene, and 0.1 for a recessive gene, appears to be a reasonable tactic in the majority of realistic situations because underestimating the gene frequency, even when the true gene frequency is high, leads to little penalty in the LOD score.

  18. Comprehensive replication of the relationship between myopia-related genes and refractive errors in a large Japanese cohort.

    PubMed

    Yoshikawa, Munemitsu; Yamashiro, Kenji; Miyake, Masahiro; Oishi, Maho; Akagi-Kurashige, Yumiko; Kumagai, Kyoko; Nakata, Isao; Nakanishi, Hideo; Oishi, Akio; Gotoh, Norimoto; Yamada, Ryo; Matsuda, Fumihiko; Yoshimura, Nagahisa

    2014-10-21

    We investigated the association between refractive error in a Japanese population and myopia-related genes identified in two recent large-scale genome-wide association studies. Single-nucleotide polymorphisms (SNPs) in 51 genes that were reported by the Consortium for Refractive Error and Myopia and/or the 23andMe database were genotyped in 3712 healthy Japanese volunteers from the Nagahama Study using HumanHap610K Quad, HumanOmni2.5M, and/or HumanExome Arrays. To evaluate the association between refractive error and recently identified myopia-related genes, we used three approaches to perform quantitative trait locus analyses of mean refractive error in both eyes of the participants: per-SNP, gene-based top-SNP, and gene-based all-SNP analyses. Association plots of successfully replicated genes also were investigated. In our per-SNP analysis, eight myopia gene associations were replicated successfully: GJD2, RASGRF1, BICC1, KCNQ5, CD55, CYP26A1, LRRC4C, and B4GALNT2.Seven additional gene associations were replicated in our gene-based analyses: GRIA4, BMP2, QKI, BMP4, SFRP1, SH3GL2, and EHBP1L1. The signal strength of the reported SNPs and their tagging SNPs increased after considering different linkage disequilibrium patterns across ethnicities. Although two previous studies suggested strong associations between PRSS56, LAMA2, TOX, and RDH5 and myopia, we could not replicate these results. Our results confirmed the significance of the myopia-related genes reported previously and suggested that gene-based replication analyses are more effective than per-SNP analyses. Our comparison with two previous studies suggested that BMP3 SNPs cause myopia primarily in Caucasian populations, while they may exhibit protective effects in Asian populations. Copyright 2014 The Association for Research in Vision and Ophthalmology, Inc.

  19. You've gotta be lucky: Coverage and the elusive gene-gene interaction.

    PubMed

    Reimherr, Matthew; Nicolae, Dan L

    2011-01-01

    Genome-wide association studies (GWAS) have led to a large number of single-SNP association findings, but there has been, so far, no investigation resulting in the discovery of a replicable gene-gene interaction. In this paper, we examine some of the possible explanations for the lack of findings, and argue that coverage of causal variation not only has a large effect on the loss in power, but that the effect is larger than in the single-SNP analyses. We show that the product of linkage disequilibrium measures, r², between causal and tested SNPs offers a good approximation to the loss in efficiency as defined by the ratio of sample sizes that lead to similar power. We also demonstrate that, in addition to the huge search space, the loss in power due to coverage when using commercially available platforms makes the search for gene-gene interactions daunting. © 2010 The Authors Annals of Human Genetics © 2010 Blackwell Publishing Ltd/University College London.

  20. Spatially coordinated dynamic gene transcription in living pituitary tissue

    PubMed Central

    Featherstone, Karen; Hey, Kirsty; Momiji, Hiroshi; McNamara, Anne V; Patist, Amanda L; Woodburn, Joanna; Spiller, David G; Christian, Helen C; McNeilly, Alan S; Mullins, John J; Finkenstädt, Bärbel F; Rand, David A; White, Michael RH; Davis, Julian RE

    2016-01-01

    Transcription at individual genes in single cells is often pulsatile and stochastic. A key question emerges regarding how this behaviour contributes to tissue phenotype, but it has been a challenge to quantitatively analyse this in living cells over time, as opposed to studying snap-shots of gene expression state. We have used imaging of reporter gene expression to track transcription in living pituitary tissue. We integrated live-cell imaging data with statistical modelling for quantitative real-time estimation of the timing of switching between transcriptional states across a whole tissue. Multiple levels of transcription rate were identified, indicating that gene expression is not a simple binary ‘on-off’ process. Immature tissue displayed shorter durations of high-expressing states than the adult. In adult pituitary tissue, direct cell contacts involving gap junctions allowed local spatial coordination of prolactin gene expression. Our findings identify how heterogeneous transcriptional dynamics of single cells may contribute to overall tissue behaviour. DOI: http://dx.doi.org/10.7554/eLife.08494.001 PMID:26828110

  1. A Comparative Study on Multifactor Dimensionality Reduction Methods for Detecting Gene-Gene Interactions with the Survival Phenotype

    PubMed Central

    Lee, Seungyeoun; Kim, Yongkang; Kwon, Min-Seok; Park, Taesung

    2015-01-01

    Genome-wide association studies (GWAS) have extensively analyzed single SNP effects on a wide variety of common and complex diseases and found many genetic variants associated with diseases. However, there is still a large portion of the genetic variants left unexplained. This missing heritability problem might be due to the analytical strategy that limits analyses to only single SNPs. One of possible approaches to the missing heritability problem is to consider identifying multi-SNP effects or gene-gene interactions. The multifactor dimensionality reduction method has been widely used to detect gene-gene interactions based on the constructive induction by classifying high-dimensional genotype combinations into one-dimensional variable with two attributes of high risk and low risk for the case-control study. Many modifications of MDR have been proposed and also extended to the survival phenotype. In this study, we propose several extensions of MDR for the survival phenotype and compare the proposed extensions with earlier MDR through comprehensive simulation studies. PMID:26339630

  2. Global preamplification simplifies targeted mRNA quantification

    PubMed Central

    Kroneis, Thomas; Jonasson, Emma; Andersson, Daniel; Dolatabadi, Soheila; Ståhlberg, Anders

    2017-01-01

    The need to perform gene expression profiling using next generation sequencing and quantitative real-time PCR (qPCR) on small sample sizes and single cells is rapidly expanding. However, to analyse few molecules, preamplification is required. Here, we studied global and target-specific preamplification using 96 optimised qPCR assays. To evaluate the preamplification strategies, we monitored the reactions in real-time using SYBR Green I detection chemistry followed by melting curve analysis. Next, we compared yield and reproducibility of global preamplification to that of target-specific preamplification by qPCR using the same amount of total RNA. Global preamplification generated 9.3-fold lower yield and 1.6-fold lower reproducibility than target-specific preamplification. However, the performance of global preamplification is sufficient for most downstream applications and offers several advantages over target-specific preamplification. To demonstrate the potential of global preamplification we analysed the expression of 15 genes in 60 single cells. In conclusion, we show that global preamplification simplifies targeted gene expression profiling of small sample sizes by a flexible workflow. We outline the pros and cons for global preamplification compared to target-specific preamplification. PMID:28332609

  3. Validation and Interrogation of Differentially Expressed and Alternatively Spliced Genes in African American Prostate Cancer

    DTIC Science & Technology

    2016-10-01

    These analyses have led to two submitted manuscripts. The first manuscript, “Variants of stemness -related genes predicted to regulate RNA splicing...and Table 1-3 at the end of this progress report. The second manuscript, “Single nucleotide polymorphisms of stemness pathway genes predicted to...cancer and support a contribution of the stemness pathway to prostate cancer patient outcome. Please see Figure 5-7 and Table 4-6 at the end of this

  4. N-of-1-pathways MixEnrich: advancing precision medicine via single-subject analysis in discovering dynamic changes of transcriptomes.

    PubMed

    Li, Qike; Schissler, A Grant; Gardeux, Vincent; Achour, Ikbel; Kenost, Colleen; Berghout, Joanne; Li, Haiquan; Zhang, Hao Helen; Lussier, Yves A

    2017-05-24

    Transcriptome analytic tools are commonly used across patient cohorts to develop drugs and predict clinical outcomes. However, as precision medicine pursues more accurate and individualized treatment decisions, these methods are not designed to address single-patient transcriptome analyses. We previously developed and validated the N-of-1-pathways framework using two methods, Wilcoxon and Mahalanobis Distance (MD), for personal transcriptome analysis derived from a pair of samples of a single patient. Although, both methods uncover concordantly dysregulated pathways, they are not designed to detect dysregulated pathways with up- and down-regulated genes (bidirectional dysregulation) that are ubiquitous in biological systems. We developed N-of-1-pathways MixEnrich, a mixture model followed by a gene set enrichment test, to uncover bidirectional and concordantly dysregulated pathways one patient at a time. We assess its accuracy in a comprehensive simulation study and in a RNA-Seq data analysis of head and neck squamous cell carcinomas (HNSCCs). In presence of bidirectionally dysregulated genes in the pathway or in presence of high background noise, MixEnrich substantially outperforms previous single-subject transcriptome analysis methods, both in the simulation study and the HNSCCs data analysis (ROC Curves; higher true positive rates; lower false positive rates). Bidirectional and concordant dysregulated pathways uncovered by MixEnrich in each patient largely overlapped with the quasi-gold standard compared to other single-subject and cohort-based transcriptome analyses. The greater performance of MixEnrich presents an advantage over previous methods to meet the promise of providing accurate personal transcriptome analysis to support precision medicine at point of care.

  5. Quantification of multiple gene expression in individual cells.

    PubMed

    Peixoto, António; Monteiro, Marta; Rocha, Benedita; Veiga-Fernandes, Henrique

    2004-10-01

    Quantitative gene expression analysis aims to define the gene expression patterns determining cell behavior. So far, these assessments can only be performed at the population level. Therefore, they determine the average gene expression within a population, overlooking possible cell-to-cell heterogeneity that could lead to different cell behaviors/cell fates. Understanding individual cell behavior requires multiple gene expression analyses of single cells, and may be fundamental for the understanding of all types of biological events and/or differentiation processes. We here describe a new reverse transcription-polymerase chain reaction (RT-PCR) approach allowing the simultaneous quantification of the expression of 20 genes in the same single cell. This method has broad application, in different species and any type of gene combination. RT efficiency is evaluated. Uniform and maximized amplification conditions for all genes are provided. Abundance relationships are maintained, allowing the precise quantification of the absolute number of mRNA molecules per cell, ranging from 2 to 1.28 x 10(9) for each individual gene. We evaluated the impact of this approach on functional genetic read-outs by studying an apparently homogeneous population (monoclonal T cells recovered 4 d after antigen stimulation), using either this method or conventional real-time RT-PCR. Single-cell studies revealed considerable cell-to-cell variation: All T cells did not express all individual genes. Gene coexpression patterns were very heterogeneous. mRNA copy numbers varied between different transcripts and in different cells. As a consequence, this single-cell assay introduces new and fundamental information regarding functional genomic read-outs. By comparison, we also show that conventional quantitative assays determining population averages supply insufficient information, and may even be highly misleading.

  6. A single amino acid substitution in the Bombyx-specific mucin-like membrane protein causes resistance to Bombyx mori densovirus.

    PubMed

    Ito, Katsuhiko; Kidokoro, Kurako; Katsuma, Susumu; Sezutsu, Hideki; Uchino, Keiro; Kobayashi, Isao; Tamura, Toshiki; Yamamoto, Kimiko; Mita, Kazuei; Shimada, Toru; Kadono-Okuda, Keiko

    2018-05-09

    Bombyx mori densovirus type 1 (BmDV) is a pathogen that causes flacherie disease in the silkworm. The absolute nonsusceptibility to BmDV among certain silkworm strains is determined independently by two genes, nsd-1 and Nid-1. However, neither of these genes has been molecularly identified to date. Here, we isolated the nsd-1 gene by positional cloning and characterized the properties of its product, NSD-1. Sequence and biochemical analyses revealed that this gene encodes a Bombyx-specific mucin-like glycoprotein with a single transmembrane domain. The NSD-1 protein was specifically expressed in the larval midgut epithelium, the known infection site of BmDV. Sequence analysis of the nsd-1 gene from 13 resistant and 12 susceptible strains suggested that a specific arginine residue in the extracellular tail of the NSD-1 protein was common among susceptible strains. Germline transformation of the susceptible-type nsd-1 (with a single nucleotide substitution) conferred partial susceptibility to resistant larvae, indicating that the +  nsd-1 gene is required for the susceptibility of B. mori larvae to BmDV and the susceptibility is solely a result of the substitution of a single amino acid with arginine. Taken together, our results provide striking evidence that a novel membrane-bound mucin-like protein functions as a cell-surface receptor for a densovirus.

  7. A Rhizobium radiobacter Histidine Kinase Can Employ Both Boolean AND and OR Logic Gates to Initiate Pathogenesis.

    PubMed

    Fang, Fang; Lin, Yi-Han; Pierce, B Daniel; Lynn, David G

    2015-10-12

    The molecular logic gates that regulate gene circuits are necessarily intricate and highly regulated, particularly in the critical commitments necessary for pathogenesis. We now report simple AND and OR logic gates to be accessible within a single protein receptor. Pathogenesis by the bacterium Rhizobium radiobacter is mediated by a single histidine kinase, VirA, which processes multiple small molecule host signals (phenol and sugar). Mutagenesis analyses converged on a single signal integration node, and finer functional analyses revealed that a single residue could switch VirA from a functional AND logic gate to an OR gate where each of two signals activate independently. Host range preferences among natural strains of R. radiobacter correlate with these gate logic strategies. Although the precise mechanism for the signal integration node requires further analyses, long-range signal transmission through this histidine kinase can now be exploited for synthetic signaling circuits. © 2015 WILEY-VCH Verlag GmbH & Co. KGaA, Weinheim.

  8. Gene-gene interactions among genetic variants from obesity candidate genes for nonobese and obese populations in type 2 diabetes.

    PubMed

    Lin, Eugene; Pei, Dee; Huang, Yi-Jen; Hsieh, Chang-Hsun; Wu, Lawrence Shih-Hsin

    2009-08-01

    Recent studies indicate that obesity may play a key role in modulating genetic predispositions to type 2 diabetes (T2D). This study examines the main effects of both single-locus and multilocus interactions among genetic variants in Taiwanese obese and nonobese individuals to test the hypothesis that obesity-related genes may contribute to the etiology of T2D independently and/or through such complex interactions. We genotyped 11 single nucleotide polymorphisms for 10 obesity candidate genes including adrenergic beta-2-receptor surface, adrenergic beta-3-receptor surface, angiotensinogen, fat mass and obesity associated gene, guanine nucleotide binding protein beta polypeptide 3 (GNB3), interleukin 6 receptor, proprotein convertase subtilisin/kexin type 1 (PCSK1), uncoupling protein 1, uncoupling protein 2, and uncoupling protein 3. There were 389 patients diagnosed with T2D and 186 age- and sex-matched controls. Single-locus analyses showed significant main effects of the GNB3 and PCSK1 genes on the risk of T2D among the nonobese group (p = 0.002 and 0.047, respectively). Further, interactions involving GNB3 and PCSK1 were suggested among the nonobese population using the generalized multifactor dimensionality reduction method (p = 0.001). In addition, interactions among angiotensinogen, fat mass and obesity associated gene, GNB3, and uncoupling protein 3 genes were found in a significant four-locus generalized multifactor dimensionality reduction model among the obese population (p = 0.001). The results suggest that the single nucleotide polymorphisms from the obesity candidate genes may contribute to the risk of T2D independently and/or in an interactive manner according to the presence or absence of obesity.

  9. Tracing the temporal-spatial transcriptome landscapes of the human fetal digestive tract using single-cell RNA-sequencing.

    PubMed

    Gao, Shuai; Yan, Liying; Wang, Rui; Li, Jingyun; Yong, Jun; Zhou, Xin; Wei, Yuan; Wu, Xinglong; Wang, Xiaoye; Fan, Xiaoying; Yan, Jie; Zhi, Xu; Gao, Yun; Guo, Hongshan; Jin, Xiao; Wang, Wendong; Mao, Yunuo; Wang, Fengchao; Wen, Lu; Fu, Wei; Ge, Hao; Qiao, Jie; Tang, Fuchou

    2018-06-01

    The development of the digestive tract is critical for proper food digestion and nutrient absorption. Here, we analyse the main organs of the digestive tract, including the oesophagus, stomach, small intestine and large intestine, from human embryos between 6 and 25 weeks of gestation as well as the large intestine from adults using single-cell RNA-seq analyses. In total, 5,227 individual cells are analysed and 40 cell types clearly identified. Their crucial biological features, including developmental processes, signalling pathways, cell cycle, nutrient digestion and absorption metabolism, and transcription factor networks, are systematically revealed. Moreover, the differentiation and maturation processes of the large intestine are thoroughly investigated by comparing the corresponding transcriptome profiles between embryonic and adult stages. Our work offers a rich resource for investigating the gene regulation networks of the human fetal digestive tract and adult large intestine at single-cell resolution.

  10. Systematic Integration of Brain eQTL and GWAS Identifies ZNF323 as a Novel Schizophrenia Risk Gene and Suggests Recent Positive Selection Based on Compensatory Advantage on Pulmonary Function.

    PubMed

    Luo, Xiong-Jian; Mattheisen, Manuel; Li, Ming; Huang, Liang; Rietschel, Marcella; Børglum, Anders D; Als, Thomas D; van den Oord, Edwin J; Aberg, Karolina A; Mors, Ole; Mortensen, Preben Bo; Luo, Zhenwu; Degenhardt, Franziska; Cichon, Sven; Schulze, Thomas G; Nöthen, Markus M; Su, Bing; Zhao, Zhongming; Gan, Lin; Yao, Yong-Gang

    2015-11-01

    Genome-wide association studies have identified multiple risk variants and loci that show robust association with schizophrenia. Nevertheless, it remains unclear how these variants confer risk to schizophrenia. In addition, the driving force that maintains the schizophrenia risk variants in human gene pool is poorly understood. To investigate whether expression-associated genetic variants contribute to schizophrenia susceptibility, we systematically integrated brain expression quantitative trait loci and genome-wide association data of schizophrenia using Sherlock, a Bayesian statistical framework. Our analyses identified ZNF323 as a schizophrenia risk gene (P = 2.22×10(-6)). Subsequent analyses confirmed the association of the ZNF323 and its expression-associated single nucleotide polymorphism rs1150711 in independent samples (gene-expression: P = 1.40×10(-6); single-marker meta-analysis in the combined discovery and replication sample comprising 44123 individuals: P = 6.85×10(-10)). We found that the ZNF323 was significantly downregulated in hippocampus and frontal cortex of schizophrenia patients (P = .0038 and P = .0233, respectively). Evidence for pleiotropic effects was detected (association of rs1150711 with lung function and gene expression of ZNF323 in lung: P = 6.62×10(-5) and P = 9.00×10(-5), respectively) with the risk allele (T allele) for schizophrenia acting as protective allele for lung function. Subsequent population genetics analyses suggest that the risk allele (T) of rs1150711 might have undergone recent positive selection in human population. Our findings suggest that the ZNF323 is a schizophrenia susceptibility gene whose expression may influence schizophrenia risk. Our study also illustrates a possible mechanism for maintaining schizophrenia risk variants in the human gene pool. © The Author 2015. Published by Oxford University Press on behalf of the Maryland Psychiatric Research Center. All rights reserved. For permissions, please email: journals.permissions@oup.com.

  11. GOexpress: an R/Bioconductor package for the identification and visualisation of robust gene ontology signatures through supervised learning of gene expression data.

    PubMed

    Rue-Albrecht, Kévin; McGettigan, Paul A; Hernández, Belinda; Nalpas, Nicolas C; Magee, David A; Parnell, Andrew C; Gordon, Stephen V; MacHugh, David E

    2016-03-11

    Identification of gene expression profiles that differentiate experimental groups is critical for discovery and analysis of key molecular pathways and also for selection of robust diagnostic or prognostic biomarkers. While integration of differential expression statistics has been used to refine gene set enrichment analyses, such approaches are typically limited to single gene lists resulting from simple two-group comparisons or time-series analyses. In contrast, functional class scoring and machine learning approaches provide powerful alternative methods to leverage molecular measurements for pathway analyses, and to compare continuous and multi-level categorical factors. We introduce GOexpress, a software package for scoring and summarising the capacity of gene ontology features to simultaneously classify samples from multiple experimental groups. GOexpress integrates normalised gene expression data (e.g., from microarray and RNA-seq experiments) and phenotypic information of individual samples with gene ontology annotations to derive a ranking of genes and gene ontology terms using a supervised learning approach. The default random forest algorithm allows interactions between all experimental factors, and competitive scoring of expressed genes to evaluate their relative importance in classifying predefined groups of samples. GOexpress enables rapid identification and visualisation of ontology-related gene panels that robustly classify groups of samples and supports both categorical (e.g., infection status, treatment) and continuous (e.g., time-series, drug concentrations) experimental factors. The use of standard Bioconductor extension packages and publicly available gene ontology annotations facilitates straightforward integration of GOexpress within existing computational biology pipelines.

  12. Simultaneous enumeration of cancer and immune cell types from bulk tumor gene expression data

    PubMed Central

    Racle, Julien; de Jonge, Kaat; Baumgaertner, Petra; Speiser, Daniel E

    2017-01-01

    Immune cells infiltrating tumors can have important impact on tumor progression and response to therapy. We present an efficient algorithm to simultaneously estimate the fraction of cancer and immune cell types from bulk tumor gene expression data. Our method integrates novel gene expression profiles from each major non-malignant cell type found in tumors, renormalization based on cell-type-specific mRNA content, and the ability to consider uncharacterized and possibly highly variable cell types. Feasibility is demonstrated by validation with flow cytometry, immunohistochemistry and single-cell RNA-Seq analyses of human melanoma and colorectal tumor specimens. Altogether, our work not only improves accuracy but also broadens the scope of absolute cell fraction predictions from tumor gene expression data, and provides a unique novel experimental benchmark for immunogenomics analyses in cancer research (http://epic.gfellerlab.org). PMID:29130882

  13. Evidence from single nucleotide polymorphism analyses of ADVANCE study demonstrates EFNB3 as a hypertension risk gene.

    PubMed

    Tremblay, Johanne; Wang, Yujia; Raelson, John; Marois-Blanchet, Francois-Christophe; Wu, Zenghui; Luo, Hongyu; Bradley, Edward; Chalmers, John; Woodward, Mark; Harrap, Stephen; Hamet, Pavel; Wu, Jiangping

    2017-03-08

    EPH kinases and their ligands, ephrins (EFNs), have vital and diverse biological functions. We recently reported that Efnb3 gene deletion results in hypertension in female but not male mice. These data suggest that EFNB3 regulates blood pressure in a sex- and sex hormone-dependent way. In the present study, we conducted a human genetic study to assess the association of EFNB3 single nucleotide polymorphisms with human hypertension risks, using 3,448 patients with type 2 diabetes from the ADVANCE study (Action in Diabetes and Vascular Disease: Peterax and Diamicron MR Controlled Evaluation). We have observed significant association between 2 SNPs in the 3' untranslated region or within the adjacent region just 3' of the EFNB3 gene with hypertension, corroborating our findings from the mouse model. Thus, our investigation has shown that EFNB3 is a hypertension risk gene in certain individuals.

  14. Three WRKY transcription factors additively repress abscisic acid and gibberellin signaling in aleurone cells.

    PubMed

    Zhang, Liyuan; Gu, Lingkun; Ringler, Patricia; Smith, Stanley; Rushton, Paul J; Shen, Qingxi J

    2015-07-01

    Members of the WRKY transcription factor superfamily are essential for the regulation of many plant pathways. Functional redundancy due to duplications of WRKY transcription factors, however, complicates genetic analysis by allowing single-mutant plants to maintain wild-type phenotypes. Our analyses indicate that three group I WRKY genes, OsWRKY24, -53, and -70, act in a partially redundant manner. All three showed characteristics of typical WRKY transcription factors: each localized to nuclei and yeast one-hybrid assays indicated that they all bind to W-boxes, including those present in their own promoters. Quantitative real time-PCR (qRT-PCR) analyses indicated that the expression levels of the three WRKY genes varied in the different tissues tested. Particle bombardment-mediated transient expression analyses indicated that all three genes repress the GA and ABA signaling in a dosage-dependent manner. Combination of all three WRKY genes showed additive antagonism of ABA and GA signaling. These results suggest that these WRKY proteins function as negative transcriptional regulators of GA and ABA signaling. However, different combinations of these WRKY genes can lead to varied strengths in suppression of their targets. Copyright © 2015 Elsevier Ireland Ltd. All rights reserved.

  15. The mitochondrial genome of booklouse, Liposcelis sculptilis (Psocoptera: Liposcelididae) and the evolutionary timescale of Liposcelis

    PubMed Central

    Shi, Yan; Chu, Qing; Wei, Dan-Dan; Qiu, Yuan-Jian; Shang, Feng; Dou, Wei; Wang, Jin-Jun

    2016-01-01

    Bilateral animals are featured by an extremely compact mitochondrial (mt) genome with 37 genes on a single circular chromosome. To date, the complete mt genome has only been determined for four species of Liposcelis, a genus with economic importance, including L. entomophila, L. decolor, L. bostrychophila, and L. paeta. They belong to A, B, or D group of Liposcelis, respectively. Unlike most bilateral animals, L. bostrychophila, L. entomophila and L. paeta have a bitipartite mt genome with genes on two chromosomes. However, the mt genome of L. decolor has the typical mt chromosome of bilateral animals. Here, we sequenced the mt genome of L. sculptilis, and identified 35 genes, which were on a single chromosome. The mt genome fragmentation is not shared by the D group of Liposcelis and the single chromosome of L. sculptilis differed from those of booklice known in gene content and gene arrangement. We inferred that different evolutionary patterns and rate existed in Liposcelis. Further, we reconstructed the evolutionary history of 21 psocodean taxa with phylogenetic analyses, which suggested that Liposcelididae and Phthiraptera have evolved 134 Ma and the sucking lice diversified in the Late Cretaceous. PMID:27470659

  16. Genomic analysis reveals extensive gene duplication within the bovine TRB locus

    PubMed Central

    Connelley, Timothy; Aerts, Jan; Law, Andy; Morrison, W Ivan

    2009-01-01

    Background Diverse TR and IG repertoires are generated by V(D)J somatic recombination. Genomic studies have been pivotal in cataloguing the V, D, J and C genes present in the various TR/IG loci and describing how duplication events have expanded the number of these genes. Such studies have also provided insights into the evolution of these loci and the complex mechanisms that regulate TR/IG expression. In this study we analyze the sequence of the third bovine genome assembly to characterize the germline repertoire of bovine TRB genes and compare the organization, evolution and regulatory structure of the bovine TRB locus with that of humans and mice. Results The TRB locus in the third bovine genome assembly is distributed over 5 scaffolds, extending to ~730 Kb. The available sequence contains 134 TRBV genes, assigned to 24 subgroups, and 3 clusters of DJC genes, each comprising a single TRBD gene, 5–7 TRBJ genes and a single TRBC gene. Seventy-nine of the TRBV genes are predicted to be functional. Comparison with the human and murine TRB loci shows that the gene order, as well as the sequences of non-coding elements that regulate TRB expression, are highly conserved in the bovine. Dot-plot analyses demonstrate that expansion of the genomic TRBV repertoire has occurred via a complex and extensive series of duplications, predominantly involving DNA blocks containing multiple genes. These duplication events have resulted in massive expansion of several TRBV subgroups, most notably TRBV6, 9 and 21 which contain 40, 35 and 16 members respectively. Similarly, duplication has lead to the generation of a third DJC cluster. Analyses of cDNA data confirms the diversity of the TRBV genes and, in addition, identifies a substantial number of TRBV genes, predominantly from the larger subgroups, which are still absent from the genome assembly. The observed gene duplication within the bovine TRB locus has created a repertoire of phylogenetically diverse functional TRBV genes, which is substantially larger than that described for humans and mice. Conclusion The analyses completed in this study reveal that, although the gene content and organization of the bovine TRB locus are broadly similar to that of humans and mice, multiple duplication events have led to a marked expansion in the number of TRB genes. Similar expansions in other ruminant TR loci suggest strong evolutionary pressures in this lineage have selected for the development of enlarged sets of TR genes that can contribute to diverse TR repertoires. PMID:19393068

  17. Breast Cancer Clinical Trial of Chemotherapy and Trastuzumab: Potential Tool to Identify Cardiac Modifying Variants of Dilated Cardiomyopathy

    PubMed Central

    Serie, Daniel J.; Crook, Julia E.; Necela, Brian M.; Axenfeld, Bianca C.; Dockter, Travis J.; Colon-Otero, Gerardo; Perez, Edith A.; Thompson, E. Aubrey; Norton, Nadine

    2017-01-01

    Doxorubicin and the ERBB2 targeted therapy, trastuzumab, are routinely used in the treatment of HER2+ breast cancer. In mouse models, doxorubicin is known to cause cardiomyopathy and conditional cardiac knock out of Erbb2 results in dilated cardiomyopathy and increased sensitivity to doxorubicin-induced cell death. In humans, these drugs also result in cardiac phenotypes, but severity and reversibility is highly variable. We examined the association of decline in left ventricular ejection fraction (LVEF) at 15,204 single nucleotide polymorphisms (SNPs) spanning 72 cardiomyopathy genes, in 800 breast cancer patients who received doxorubicin and trastuzumab. For 7033 common SNPs (minor allele frequency (MAF) > 0.01) we performed single marker linear regression. For all SNPs, we performed gene-based testing with SNP-set (Sequence) Kernel Association Tests: SKAT, SKAT-O and SKAT-common/rare under rare variant non-burden; rare variant optimized burden and non-burden tests; and a combination of rare and common variants respectively. Single marker analyses identified seven missense variants in OBSCN (p = 0.0045–0.0009, MAF = 0.18–0.50) and two in TTN (both p = 0.04, MAF = 0.22). Gene-based rare variant analyses, SKAT and SKAT-O, performed very similarly (ILK, TCAP, DSC2, VCL, FXN, DSP and KCNQ1, p = 0.042–0.006). Gene-based tests of rare/common variants were significant at the nominal 5% level for OBSCN as well as TCAP, DSC2, VCL, NEXN, KCNJ2 and DMD (p = 0.044–0.008). Our results suggest that rare and common variants in OBSCN, as well as in other genes, could have modifying effects in cardiomyopathy. PMID:29367538

  18. Genetic heterogeneity in autism: From single gene to a pathway perspective.

    PubMed

    An, Joon Yong; Claudianos, Charles

    2016-09-01

    The extreme genetic heterogeneity of autism spectrum disorder (ASD) represents a major challenge. Recent advances in genetic screening and systems biology approaches have extended our knowledge of the genetic etiology of ASD. In this review, we discuss the paradigm shift from a single gene causation model to pathway perturbation model as a guide to better understand the pathophysiology of ASD. We discuss recent genetic findings obtained through next-generation sequencing (NGS) and examine various integrative analyses using systems biology and complex networks approaches that identify convergent patterns of genetic elements associated with ASD. Copyright © 2016 Elsevier Ltd. All rights reserved.

  19. The genetic landscape of paediatric de novo acute myeloid leukaemia as defined by single nucleotide polymorphism array and exon sequencing of 100 candidate genes.

    PubMed

    Olsson, Linda; Zettermark, Sofia; Biloglav, Andrea; Castor, Anders; Behrendtz, Mikael; Forestier, Erik; Paulsson, Kajsa; Johansson, Bertil

    2016-07-01

    Cytogenetic analyses of a consecutive series of 67 paediatric (median age 8 years; range 0-17) de novo acute myeloid leukaemia (AML) patients revealed aberrations in 55 (82%) cases. The most common subgroups were KMT2A rearrangement (29%), normal karyotype (15%), RUNX1-RUNX1T1 (10%), deletions of 5q, 7q and/or 17p (9%), myeloid leukaemia associated with Down syndrome (7%), PML-RARA (7%) and CBFB-MYH11 (5%). Single nucleotide polymorphism array (SNP-A) analysis and exon sequencing of 100 genes, performed in 52 and 40 cases, respectively (39 overlapping), revealed ≥1 aberration in 89%; when adding cytogenetic data, this frequency increased to 98%. Uniparental isodisomies (UPIDs) were detected in 13% and copy number aberrations (CNAs) in 63% (median 2/case); three UPIDs and 22 CNAs were recurrent. Twenty-two genes were targeted by focal CNAs, including AEBP2 and PHF6 deletions and genes involved in AML-associated gene fusions. Deep sequencing identified mutations in 65% of cases (median 1/case). In total, 60 mutations were found in 30 genes, primarily those encoding signalling proteins (47%), transcription factors (25%), or epigenetic modifiers (13%). Twelve genes (BCOR, CEBPA, FLT3, GATA1, KIT, KRAS, NOTCH1, NPM1, NRAS, PTPN11, SMC3 and TP53) were recurrently mutated. We conclude that SNP-A and deep sequencing analyses complement the cytogenetic diagnosis of paediatric AML. © 2016 John Wiley & Sons Ltd.

  20. COMT and MAO-A Polymorphisms and Obsessive-Compulsive Disorder: A Family-Based Association Study

    PubMed Central

    Sampaio, Aline Santos; Hounie, Ana Gabriela; Petribú, Kátia; Cappi, Carolina; Morais, Ivanil; Vallada, Homero; do Rosário, Maria Conceição; Stewart, S. Evelyn; Fargeness, Jesen; Mathews, Carol; Arnold, Paul; Hanna, Gregory L.; Richter, Margaret; Kennedy, James; Fontenelle, Leonardo; de Bragança Pereira, Carlos Alberto; Pauls, David L.; Miguel, Eurípedes Constantino

    2015-01-01

    Objective Obsessive-compulsive disorder (OCD) is a common and debilitating psychiatric illness. Although a genetic component contributes to its etiology, no single gene or mechanism has been identified to the OCD susceptibility. The catechol-O-methyltransferase (COMT) and monoamine oxidase A (MAO-A) genes have been investigated in previous OCD studies, but the results are still unclear. More recently, Taylor (2013) in a comprehensive meta-analysis of genetic association studies has identified COMT and MAO-A polymorphisms involved with OCD. In an effort to clarify the role of these two genes in OCD vulnerability, a family-based association investigation was performed as an alternative strategy to the classical case-control design. Methods Transmission disequilibrium analyses were performed after genotyping 13 single-nucleotide polymorphisms (eight in COMT and five in MAO-A) in 783 OCD trios (probands and their parents). Four different OCD phenotypes (from narrow to broad OCD definitions) and a SNP x SNP epistasis were also analyzed. Results OCD, broad and narrow phenotypes,were not associated with any of the investigated COMT and MAO-A polymorphisms. In addition, the analyses of gene-gene interaction did not show significant epistatic influences on phenotype between COMT and MAO-A. Conclusions The findings do not support an association between DSM-IV OCD and the variants of COMT or MAO-A. However, results from this study cannot exclude the contribution of these genes in the manifestation of OCD. The evaluation of broader spectrum phenotypes could help to understand the role of these and other genes in the pathophysiology of OCD and its spectrum disorders. PMID:25793616

  1. The ergot alkaloid gene cluster: functional analyses and evolutionary aspects.

    PubMed

    Lorenz, Nicole; Haarmann, Thomas; Pazoutová, Sylvie; Jung, Manfred; Tudzynski, Paul

    2009-01-01

    Ergot alkaloids and their derivatives have been traditionally used as therapeutic agents in migraine, blood pressure regulation and help in childbirth and abortion. Their production in submerse culture is a long established biotechnological process. Ergot alkaloids are produced mainly by members of the genus Claviceps, with Claviceps purpurea as best investigated species concerning the biochemistry of ergot alkaloid synthesis (EAS). Genes encoding enzymes involved in EAS have been shown to be clustered; functional analyses of EAS cluster genes have allowed to assign specific functions to several gene products. Various Claviceps species differ with respect to their host specificity and their alkaloid content; comparison of the ergot alkaloid clusters in these species (and of clavine alkaloid clusters in other genera) yields interesting insights into the evolution of cluster structure. This review focuses on recently published and also yet unpublished data on the structure and evolution of the EAS gene cluster and on the function and regulation of cluster genes. These analyses have also significant biotechnological implications: the characterization of non-ribosomal peptide synthetases (NRPS) involved in the synthesis of the peptide moiety of ergopeptines opened interesting perspectives for the synthesis of ergot alkaloids; on the other hand, defined mutants could be generated producing interesting intermediates or only single peptide alkaloids (instead of the alkaloid mixtures usually produced by industrial strains).

  2. Global population genetic structure and male-mediated gene flow in the green sea turtle (Chelonia mydas): analysis of microsatellite loci.

    PubMed Central

    Roberts, Mark A; Schwartz, Tonia S; Karl, Stephen A

    2004-01-01

    We assessed the degree of population subdivision among global populations of green sea turtles, Chelonia mydas, using four microsatellite loci. Previously, a single-copy nuclear DNA study indicated significant male-mediated gene flow among populations alternately fixed for different mitochondrial DNA haplotypes and that genetic divergence between populations in the Atlantic and Pacific Oceans was more common than subdivisions among populations within ocean basins. Even so, overall levels of variation at single-copy loci were low and inferences were limited. Here, the markedly more variable microsatellite loci confirm the presence of male-mediated gene flow among populations within ocean basins. This analysis generally confirms the genetic divergence between the Atlantic and Pacific. As with the previous study, phylogenetic analyses of genetic distances based on the microsatellite loci indicate a close genetic relationship among eastern Atlantic and Indian Ocean populations. Unlike the single-copy study, however, the results here cannot be attributed to an artifact of general low variability and likely represent recent or ongoing migration between ocean basins. Sequence analyses of regions flanking the microsatellite repeat reveal considerable amounts of cryptic variation and homoplasy and significantly aid in our understanding of population connectivity. Assessment of the allele frequency distributions indicates that at least some of the loci may not be evolving by the stepwise mutation model. PMID:15126404

  3. Parsing the genetic heterogeneity of chromosome 12q susceptibility genes for Alzheimer disease by family-based association analysis.

    PubMed

    Lin, Ping-I; Martin, Eden R; Browning-Large, Carrie A; Schmechel, Donald E; Welsh-Bohmer, Kathleen A; Doraiswamy, P Murali; Gilbert, John R; Haines, Jonathan L; Pericak-Vance, Margaret A

    2006-07-01

    Previous linkage studies have suggested that chromosome 12 may harbor susceptibility genes for late-onset Alzheimer disease (LOAD). No risk genes on chromosome 12 have been conclusively identified yet. We have reported that the linkage evidence for LOAD in a 12q region was significantly increased in autopsy-confirmed families particularly for those showing no linkage to alpha-T catenin gene, a LOAD candidate gene on chromosome 10 [LOD score increased from 0.1 in the autopsy-confirmed subset to 4.19 in the unlinked subset (optimal subset); p<0.0001 for the increase in LOD score], indicating a one-LOD support interval spanning 6 Mb. To further investigate this finding and to identify potential candidate LOAD risk genes for follow-up analysis, we analyzed 99 single nucleotide polymorphisms in this region, for the overall sample, the autopsy-confirmed subset, and the optimal subset, respectively, for comparison. We saw no significant association (p<0.01) in the overall sample. In the autopsy-confirmed subset, the best finding was obtained in the activation transcription factor 7 (ATF7) gene (single-locus association, p=0.002; haplotype association global, p=0.007). In the optimal subset, the best finding was obtained in the hypothetical protein FLJ20436 (FLJ20436) gene (single-locus association, p=0.0026). These results suggest that subset and covariate analyses may be one approach to help identify novel susceptibility genes on chromosome 12q for LOAD.

  4. Polyphenism in social insects: insights from a transcriptome-wide analysis of gene expression in the life stages of the key pollinator, Bombus terrestris

    PubMed Central

    2011-01-01

    Background Understanding polyphenism, the ability of a single genome to express multiple morphologically and behaviourally distinct phenotypes, is an important goal for evolutionary and developmental biology. Polyphenism has been key to the evolution of the Hymenoptera, and particularly the social Hymenoptera where the genome of a single species regulates distinct larval stages, sexual dimorphism and physical castes within the female sex. Transcriptomic analyses of social Hymenoptera will therefore provide unique insights into how changes in gene expression underlie such complexity. Here we describe gene expression in individual specimens of the pre-adult stages, sexes and castes of the key pollinator, the buff-tailed bumblebee Bombus terrestris. Results cDNA was prepared from mRNA from five life cycle stages (one larva, one pupa, one male, one gyne and two workers) and a total of 1,610,742 expressed sequence tags (ESTs) were generated using Roche 454 technology, substantially increasing the sequence data available for this important species. Overlapping ESTs were assembled into 36,354 B. terrestris putative transcripts, and functionally annotated. A preliminary assessment of differences in gene expression across non-replicated specimens from the pre-adult stages, castes and sexes was performed using R-STAT analysis. Individual samples from the life cycle stages of the bumblebee differed in the expression of a wide array of genes, including genes involved in amino acid storage, metabolism, immunity and olfaction. Conclusions Detailed analyses of immune and olfaction gene expression across phenotypes demonstrated how transcriptomic analyses can inform our understanding of processes central to the biology of B. terrestris and the social Hymenoptera in general. For example, examination of immunity-related genes identified high conservation of important immunity pathway components across individual specimens from the life cycle stages while olfactory-related genes exhibited differential expression with a wider repertoire of gene expression within adults, especially sexuals, in comparison to immature stages. As there is an absence of replication across the samples, the results of this study are preliminary but provide a number of candidate genes which may be related to distinct phenotypic stage expression. This comprehensive transcriptome catalogue will provide an important gene discovery resource for directed programmes in ecology, evolution and conservation of a key pollinator. PMID:22185240

  5. Association of single-nucleotide polymorphisms of the tau gene with late-onset Parkinson disease.

    PubMed

    Martin, E R; Scott, W K; Nance, M A; Watts, R L; Hubble, J P; Koller, W C; Lyons, K; Pahwa, R; Stern, M B; Colcher, A; Hiner, B C; Jankovic, J; Ondo, W G; Allen, F H; Goetz, C G; Small, G W; Masterman, D; Mastaglia, F; Laing, N G; Stajich, J M; Ribble, R C; Booze, M W; Rogala, A; Hauser, M A; Zhang, F; Gibson, R A; Middleton, L T; Roses, A D; Haines, J L; Scott, B L; Pericak-Vance, M A; Vance, J M

    2001-11-14

    The human tau gene, which promotes assembly of neuronal microtubules, has been associated with several rare neurologic diseases that clinically include parkinsonian features. We recently observed linkage in idiopathic Parkinson disease (PD) to a region on chromosome 17q21 that contains the tau gene. These factors make tau a good candidate for investigation as a susceptibility gene for idiopathic PD, the most common form of the disease. To investigate whether the tau gene is involved in idiopathic PD. Among a sample of 1056 individuals from 235 families selected from 13 clinical centers in the United States and Australia and from a family ascertainment core center, we tested 5 single-nucleotide polymorphisms (SNPs) within the tau gene for association with PD, using family-based tests of association. Both affected (n = 426) and unaffected (n = 579) family members were included; 51 individuals had unclear PD status. Analyses were conducted to test individual SNPs and SNP haplotypes within the tau gene. Family-based tests of association, calculated using asymptotic distributions. Analysis of association between the SNPs and PD yielded significant evidence of association for 3 of the 5 SNPs tested: SNP 3, P =.03; SNP 9i, P =.04; and SNP 11, P =.04. The 2 other SNPs did not show evidence of significant association (SNP 9ii, P =.11, and SNP 9iii, P =.87). Strong evidence of association was found with haplotype analysis, with a positive association with one haplotype (P =.009) and a negative association with another haplotype (P =.007). Substantial linkage disequilibrium (P<.001) was detected between 4 of the 5 SNPs (SNPs 3, 9i, 9ii, and 11). This integrated approach of genetic linkage and positional association analyses implicates tau as a susceptibility gene for idiopathic PD.

  6. Genome-wide Association Studies for Female Fertility Traits in Chinese and Nordic Holsteins.

    PubMed

    Liu, Aoxing; Wang, Yachun; Sahana, Goutam; Zhang, Qin; Liu, Lin; Lund, Mogens Sandø; Su, Guosheng

    2017-08-16

    Reduced female fertility could cause considerable economic loss and has become a worldwide problem in the modern dairy industry. The objective of this study was to detect quantitative trait loci (QTL) for female fertility traits in Chinese and Nordic Holsteins using various strategies. First, single-trait association analyses were performed for female fertility traits in Chinese and Nordic Holsteins. Second, the SNPs with P-value < 0.005 discovered in Chinese Holsteins were validated in Nordic Holsteins. Third, the summary statistics from single-trait association analyses were combined into meta-analyses to: (1) identify common QTL for multiple fertility traits within each Holstein population; (2) detect SNPs which were associated with a female fertility trait across two Holstein populations. A large numbers of QTL were discovered or confirmed for female fertility traits. The QTL segregating at 31.4~34.1 Mb on BTA13, 48.3~51.9 Mb on BTA23 and 34.0~37.6 Mb on BTA28 shared between Chinese and Nordic Holsteins were further ascertained using a validation approach and meta-analyses. Furthermore, multiple novel variants identified in Chinese Holsteins were validated with Nordic data as well as meta-analyses. The genes IL6R, SLC39A12, CACNB2, ZEB1, ZMIZ1 and FAM213A were concluded to be strong candidate genes for female fertility in Holsteins.

  7. Candidate gene association analyses for ketosis resistance in Holsteins.

    PubMed

    Kroezen, V; Schenkel, F S; Miglior, F; Baes, C F; Squires, E J

    2018-06-01

    High-yielding dairy cattle are susceptible to ketosis, a metabolic disease that negatively affects the health, fertility, and milk production of the cow. Interest in breeding for more robust dairy cattle with improved resistance to disease is global; however, genetic evaluations for ketosis would benefit from the additional information provided by genetic markers. Candidate genes that are proposed to have a biological role in the pathogenesis of ketosis were investigated in silico and a custom panel of 998 putative single nucleotide polymorphism (SNP) markers was developed. The objective of this study was to test the associations of these new markers with deregressed estimated breeding values (EBV) for ketosis. A sample of 653 Canadian Holstein cows that had been previously genotyped with a medium-density SNP chip were regenotyped with the custom panel. The EBV for ketosis in first and later lactations were obtained for each animal and deregressed for use as pseudo-phenotypes for association analyses. Results of the mixed inheritance model for single SNP association analyses suggested 15 markers in 6 unique candidate genes were associated with the studied trait. Genes encoding proteins involved in metabolic processes, including the synthesis and degradation of fatty acids and ketone bodies, gluconeogenesis, lipid mobilization, and the citric acid cycle, were identified to contain SNP associated with ketosis resistance. This work confirmed the presence of previously described quantitative trait loci for dairy cattle, suggested novel markers for ketosis-resistance, and provided insight into the underlying biology of this disease. Copyright © 2018 American Dairy Science Association. Published by Elsevier Inc. All rights reserved.

  8. Gene expression distribution deconvolution in single-cell RNA sequencing.

    PubMed

    Wang, Jingshu; Huang, Mo; Torre, Eduardo; Dueck, Hannah; Shaffer, Sydney; Murray, John; Raj, Arjun; Li, Mingyao; Zhang, Nancy R

    2018-06-26

    Single-cell RNA sequencing (scRNA-seq) enables the quantification of each gene's expression distribution across cells, thus allowing the assessment of the dispersion, nonzero fraction, and other aspects of its distribution beyond the mean. These statistical characterizations of the gene expression distribution are critical for understanding expression variation and for selecting marker genes for population heterogeneity. However, scRNA-seq data are noisy, with each cell typically sequenced at low coverage, thus making it difficult to infer properties of the gene expression distribution from raw counts. Based on a reexamination of nine public datasets, we propose a simple technical noise model for scRNA-seq data with unique molecular identifiers (UMI). We develop deconvolution of single-cell expression distribution (DESCEND), a method that deconvolves the true cross-cell gene expression distribution from observed scRNA-seq counts, leading to improved estimates of properties of the distribution such as dispersion and nonzero fraction. DESCEND can adjust for cell-level covariates such as cell size, cell cycle, and batch effects. DESCEND's noise model and estimation accuracy are further evaluated through comparisons to RNA FISH data, through data splitting and simulations and through its effectiveness in removing known batch effects. We demonstrate how DESCEND can clarify and improve downstream analyses such as finding differentially expressed genes, identifying cell types, and selecting differentiation markers. Copyright © 2018 the Author(s). Published by PNAS.

  9. Early Evolution of Vertebrate Mybs: An Integrative Perspective Combining Synteny, Phylogenetic, and Gene Expression Analyses

    PubMed Central

    Campanini, Emeline B.; Vandewege, Michael W.; Pillai, Nisha E.; Tay, Boon-Hui; Jones, Justin L.; Venkatesh, Byrappa; Hoffmann, Federico G.

    2015-01-01

    Abstract The genes in the Myb superfamily encode for three related transcription factors in most vertebrates, A-, B-, and c-Myb, with functionally distinct roles, whereas most invertebrates have a single Myb. B-Myb plays an essential role in cell division and cell cycle progression, c-Myb is involved in hematopoiesis, and A-Myb is involved in spermatogenesis and regulating expression of pachytene PIWI interacting RNAs, a class of small RNAs involved in posttranscriptional gene regulation and the maintenance of reproductive tissues. Comparisons between teleost fish and tetrapods suggest that the emergence and functional divergence of the Myb genes were linked to the two rounds of whole-genome duplication early in vertebrate evolution. We combined phylogenetic, synteny, structural, and gene expression analyses of the Myb paralogs from elephant shark and lampreys with data from 12 bony vertebrates to reconstruct the early evolution of vertebrate Mybs. Phylogenetic and synteny analyses suggest that the elephant shark and Japanese lamprey have copies of the A-, B-, and c-Myb genes, implying their origin could be traced back to the common ancestor of lampreys and gnathostomes. However, structural and gene expression analyses suggest that their functional roles diverged between gnathostomes and cyclostomes. In particular, we did not detect A-Myb expression in testis suggesting that the involvement of A-Myb in the pachytene PIWI interacting RNA pathway is probably a gnathostome-specific innovation. We speculate that the secondary loss of a central domain in lamprey A-Myb underlies the functional differences between the cyclostome and gnathostome A-Myb proteins. PMID:26475318

  10. A selection of reference genes and early-warning mRNA biomarkers for environmental monitoring using Mytilus spp. as sentinel species.

    PubMed

    Lacroix, C; Coquillé, V; Guyomarch, J; Auffret, M; Moraga, D

    2014-09-15

    mRNA biomarkers are promising tools for environmental health assessment and reference genes are needed to perform relevant qPCR analyses in tissue samples of sentinel species. In the present study, potential reference genes and mRNA biomarkers were tested in the gills and digestive glands of native and caged mussels (Mytilus spp.) exposed to harbor pollution. Results highlighted the difficulty to find stable reference genes in wild, non-model species and suggested the use of normalization indices instead of single genes as they exhibit a higher stability. Several target genes were found differentially expressed between mussel groups, especially in gills where cyp32, π-gst and CuZn-sod mRNA levels could be biomarker candidates. Multivariate analyses confirmed the ability of mRNA levels to highlight site-effects and suggested the use of several combined markers instead of individual ones. These findings support the use of qPCR technology and mRNA levels as early-warning biomarkers in marine monitoring programs. Copyright © 2014 Elsevier Ltd. All rights reserved.

  11. Novel Autism Subtype-Dependent Genetic Variants Are Revealed by Quantitative Trait and Subphenotype Association Analyses of Published GWAS Data

    PubMed Central

    Hu, Valerie W.; Addington, Anjene; Hyman, Alexander

    2011-01-01

    The heterogeneity of symptoms associated with autism spectrum disorders (ASDs) has presented a significant challenge to genetic analyses. Even when associations with genetic variants have been identified, it has been difficult to associate them with a specific trait or characteristic of autism. Here, we report that quantitative trait analyses of ASD symptoms combined with case-control association analyses using distinct ASD subphenotypes identified on the basis of symptomatic profiles result in the identification of highly significant associations with 18 novel single nucleotide polymorphisms (SNPs). The symptom categories included deficits in language usage, non-verbal communication, social development, and play skills, as well as insistence on sameness or ritualistic behaviors. Ten of the trait-associated SNPs, or quantitative trait loci (QTL), were associated with more than one subtype, providing partial replication of the identified QTL. Notably, none of the novel SNPs is located within an exonic region, suggesting that these hereditary components of ASDs are more likely related to gene regulatory processes (or gene expression) than to structural or functional changes in gene products. Seven of the QTL reside within intergenic chromosomal regions associated with rare copy number variants that have been previously reported in autistic samples. Pathway analyses of the genes associated with the QTL identified in this study implicate neurological functions and disorders associated with autism pathophysiology. This study underscores the advantage of incorporating both quantitative traits as well as subphenotypes into large-scale genome-wide analyses of complex disorders. PMID:21556359

  12. Unraveling the evolutionary radiation of the families of the Zingiberales using morphological and molecular evidence.

    PubMed

    Kress, W J; Prince, L M; Hahn, W J; Zimmer, E A

    2001-01-01

    The Zingiberales are a tropical group of monocotyledons that includes bananas, gingers, and their relatives. The phylogenetic relationships among the eight families currently recognized are investigated here by using parsimony and maximum likelihood analyses of four character sets: morphological features (1), and sequence data of the (2) chloroplast rbcL gene, (3) chloroplast atpB gene, and (4) nuclear 18S rDNA gene. Outgroups for the analyses include the closely related Commelinaceae + Philydraceae + Haemodoraceae + Pontederiaceae + Hanguanaceae as well as seven more distantly related monocots and paleoherbs. Only slightly different estimates of evolutionary relationships result from the analysis of each character set. The morphological data yield a single fully resolved most-parsimonious tree. None of the molecular datasets alone completely resolves interfamilial relationships. The analyses of the combined molecular dataset provide more resolution than do those of individual genes, and the addition of the morphological data provides a well-supported estimate of phylogenetic relationships: (Musaceae ((Strelitziaceae, Lowiaceae) (Heliconiaceae ((Zingiberaceae, Costaceae) (Cannaceae, Marantaceae))))). Evidence from branch lengths in the parsimony analyses and from the fossil record suggests that the Zingiberales originated in the Early Cretaceous and underwent a rapid radiation in the mid-Cretaceous, by which time most extant family lineages had diverged.

  13. A nuclear phylogenetic analysis: SNPs, indels and SSRs deliver new insights into the relationships in the ‘true citrus fruit trees’ group (Citrinae, Rutaceae) and the origin of cultivated species

    PubMed Central

    Garcia-Lor, Andres; Curk, Franck; Snoussi-Trifa, Hager; Morillon, Raphael; Ancillo, Gema; Luro, François; Navarro, Luis; Ollitrault, Patrick

    2013-01-01

    Background and Aims Despite differences in morphology, the genera representing ‘true citrus fruit trees’ are sexually compatible, and their phylogenetic relationships remain unclear. Most of the important commercial ‘species’ of Citrus are believed to be of interspecific origin. By studying polymorphisms of 27 nuclear genes, the average molecular differentiation between species was estimated and some phylogenetic relationships between ‘true citrus fruit trees’ were clarified. Methods Sanger sequencing of PCR-amplified fragments from 18 genes involved in metabolite biosynthesis pathways and nine putative genes for salt tolerance was performed for 45 genotypes of Citrus and relatives of Citrus to mine single nucleotide polymorphisms (SNPs) and indel polymorphisms. Fifty nuclear simple sequence repeats (SSRs) were also analysed. Key Results A total of 16 238 kb of DNA was sequenced for each genotype, and 1097 single nucleotide polymorphisms (SNPs) and 50 indels were identified. These polymorphisms were more valuable than SSRs for inter-taxon differentiation. Nuclear phylogenetic analysis revealed that Citrus reticulata and Fortunella form a cluster that is differentiated from the clade that includes three other basic taxa of cultivated citrus (C. maxima, C. medica and C. micrantha). These results confirm the taxonomic subdivision between the subgenera Metacitrus and Archicitrus. A few genes displayed positive selection patterns within or between species, but most of them displayed neutral patterns. The phylogenetic inheritance patterns of the analysed genes were inferred for commercial Citrus spp. Conclusions Numerous molecular polymorphisms (SNPs and indels), which are potentially useful for the analysis of interspecific genetic structures, have been identified. The nuclear phylogenetic network for Citrus and its sexually compatible relatives was consistent with the geographical origins of these genera. The positive selection observed for a few genes will help further works to analyse the molecular basis of the variability of the associated traits. This study presents new insights into the origin of C. sinensis. PMID:23104641

  14. A nuclear phylogenetic analysis: SNPs, indels and SSRs deliver new insights into the relationships in the 'true citrus fruit trees' group (Citrinae, Rutaceae) and the origin of cultivated species.

    PubMed

    Garcia-Lor, Andres; Curk, Franck; Snoussi-Trifa, Hager; Morillon, Raphael; Ancillo, Gema; Luro, François; Navarro, Luis; Ollitrault, Patrick

    2013-01-01

    Despite differences in morphology, the genera representing 'true citrus fruit trees' are sexually compatible, and their phylogenetic relationships remain unclear. Most of the important commercial 'species' of Citrus are believed to be of interspecific origin. By studying polymorphisms of 27 nuclear genes, the average molecular differentiation between species was estimated and some phylogenetic relationships between 'true citrus fruit trees' were clarified. Sanger sequencing of PCR-amplified fragments from 18 genes involved in metabolite biosynthesis pathways and nine putative genes for salt tolerance was performed for 45 genotypes of Citrus and relatives of Citrus to mine single nucleotide polymorphisms (SNPs) and indel polymorphisms. Fifty nuclear simple sequence repeats (SSRs) were also analysed. A total of 16 238 kb of DNA was sequenced for each genotype, and 1097 single nucleotide polymorphisms (SNPs) and 50 indels were identified. These polymorphisms were more valuable than SSRs for inter-taxon differentiation. Nuclear phylogenetic analysis revealed that Citrus reticulata and Fortunella form a cluster that is differentiated from the clade that includes three other basic taxa of cultivated citrus (C. maxima, C. medica and C. micrantha). These results confirm the taxonomic subdivision between the subgenera Metacitrus and Archicitrus. A few genes displayed positive selection patterns within or between species, but most of them displayed neutral patterns. The phylogenetic inheritance patterns of the analysed genes were inferred for commercial Citrus spp. Numerous molecular polymorphisms (SNPs and indels), which are potentially useful for the analysis of interspecific genetic structures, have been identified. The nuclear phylogenetic network for Citrus and its sexually compatible relatives was consistent with the geographical origins of these genera. The positive selection observed for a few genes will help further works to analyse the molecular basis of the variability of the associated traits. This study presents new insights into the origin of C. sinensis.

  15. Phylogenetic relationships and morphological evolution in Lentinus, Polyporellus and Neofavolus, emphasizing southeastern Asian taxa.

    PubMed

    Seelan, Jaya Seelan Sathiya; Justo, Alfredo; Nagy, Laszlo G; Grand, Edward A; Redhead, Scott A; Hibbett, David

    2015-01-01

    The genus Lentinus (Polyporaceae, Basidiomycota) is widely documented from tropical and temperate forests and is taxonomically controversial. Here we studied the relationships between Lentinus subg. Lentinus sensu Pegler (i.e. sections Lentinus, Tigrini, Dicholamellatae, Rigidi, Lentodiellum and Pleuroti and polypores that share similar morphological characters). We generated sequences of internal transcribed spacers (ITS) and partial 28S regions of nuc rDNA and genes encoding the largest subunit of RNA polymerase II (RPB1), focusing on Lentinus subg. Lentinus sensu Pegler and the Neofavolus group, combined these data with sequences from GenBank (including RPB2 gene sequences) and performed phylogenetic analyses with maximum likelihood and Bayesian methods. We also evaluated the transition in hymenophore morphology between Lentinus, Neofavolus and related polypores with ancestral state reconstruction. Single-gene phylogenies and phylogenies combining ITS and 28S with RPB1 and RPB2 genes all support existence of a Lentinus/Polyporellus clade and a separate Neofavolus clade. Polyporellus (represented by P. arcularius, P. ciliatus, P. brumalis) forms a clade with species representing Lentinus subg. Lentinus sensu Pegler (1983), excluding L. suavissimus. Lentinus tigrinus appears as the sister group of Polyporellus in the four-gene phylogeny, but this placement was weakly supported. All three multigene analyses and the single-gene analysis using ITS strongly supported Polyporus tricholoma as the sister group of the Lentinus/Polyporellus clade; only the 28S rRNA phylogeny failed to support this placement. Under parsimony the ancestral hymenophoral configuration for the Lentinus/Polyporellus clade is estimated to be circular pores, with independent transitions to angular pores and lamellae. The ancestral state for the Neofavolus clade is estimated to be angular pores, with a single transition to lamellae in L. suavissimus. We propose that Lentinus suavissimus (section Pleuroti) should be reclassified as Neofavolus suavissimus comb. nov. © 2015 by The Mycological Society of America.

  16. Exome Array Analysis of Nuclear Lens Opacity.

    PubMed

    Loomis, Stephanie J; Klein, Alison P; Lee, Kristine E; Chen, Fei; Bomotti, Samantha; Truitt, Barbara; Iyengar, Sudha K; Klein, Ronald; Klein, Barbara E K; Duggal, Priya

    2018-06-01

    Nuclear cataract is the most common subtype of age-related cataract, the leading cause of blindness worldwide. It results from advanced nuclear sclerosis, or opacity in the center of the optic lens, and is affected by both genetic and environmental risk factors, including smoking. We sought to understand the genetic factors associated with nuclear sclerosis through interrogation of rare and low frequency coding variants using exome array data. We analyzed Illumina Human Exome Array data for 1,488 participants of European ancestry in the Beaver Dam Eye Study who were without cataract surgery for association with nuclear sclerosis grade, controlling for age and sex. We performed single-variant regression analysis for 32,138 variants with minor allele frequency (MAF) ≥0.003. In addition, gene-based analysis of 11,844 genes containing at least two variants with MAF < 0.05 was performed using a gene-based unified burden and non-burden sequence kernel association test (SKAT-O). Additionally, both single-variant and gene-based analyses were analyzed stratified by smoking status. No single-variant test was statistically significant after Bonferroni correction (p < 1.6 × 10 -6 ; top single nucleotide polymorphism (SNP): rs144458991, p = 2.83 × 10 -5 ). Gene-based tests were suggestively associated with the gene RNF149 overall (p = 8.29 × 10 -6 ) and among never smokers (N = 790, p = 2.67 × 10 -6 ). This study did not find a significant genetic association with nuclear sclerosis, the possible association with the RNF149 gene highlights a potential candidate gene for future studies that aim to understand the genetic architecture of nuclear sclerosis.

  17. Genomics of the Effect of Spinal Cord Stimulation on an Animal Model of Neuropathic Pain.

    PubMed

    Vallejo, Ricardo; Tilley, Dana M; Cedeño, David L; Kelley, Courtney A; DeMaegd, Margaret; Benyamin, Ramsin

    2016-08-01

    Few studies have evaluated single-gene changes modulated by spinal cord stimulation (SCS), providing a narrow understanding of molecular changes. Genomics allows for a robust analysis of holistic gene changes in response to stimulation. Rats were randomized into six groups to determine the effect of continuous SCS in uninjured and spared-nerve injury (SNI) animals. After behavioral assessment, tissues from the dorsal quadrant of the spinal cord (SC) and dorsal root ganglion (DRG) underwent full-genome microarray analyses. Weighted Gene Correlation Network Analysis (WGCNA), and Gene Ontology (GO) analysis identified similar expression patterns, molecular functions and biological processes for significant genes. Microarray analyses reported 20,985 gene probes in SC and 19,104 in DRG. WGCNA sorted 7449 SC and 4275 DRG gene probes into 29 and 9 modules, respectively. WGCNA provided significant modules from paired comparisons of experimental groups. GO analyses reported significant biological processes influenced by injury, as well as the presence of an electric field. The genes Tlr2, Cxcl16, and Cd68 were used to further validate the microarray based on significant response to SCS in SNI animals. They were up-regulated in the SC while both Tlr2 and Cd68 were up-regulated in the DRG. The process described provides highly significant interconnected genes and pathways responsive to injury and/or electric field in the SC and DRG. Genes in the SC respond significantly to the SCS in both injured and uninjured animals, while those in the DRG significantly responded to injury, and SCS in injured animals. © 2016 International Neuromodulation Society.

  18. Common variants of the EPDR1 gene and the risk of Dupuytren’s disease.

    PubMed

    Dębniak, T; Żyluk, A; Puchalski, P; Serrano-Fernandez, P

    2013-10-01

    The object of this study was the investigation of 3 common variants of single nucleotide polymorphisms of the ependymin-related gene 1 and its association with the occurrence of Dupuytren's disease. DNA samples were obtained from the peripheral blood of 508 consecutive patients. The control group comprised 515 healthy adults who were age-matched with the Dupuytren's patients. 3 common variants were analysed using TaqMan® genotyping assays and sequencing. The differences in the frequencies of variants of single nucleotide polymorphisms in patients and the control group were statistically tested. Additionally, haplotype frequency and linkage disequilibrium were analysed for these variants. A statistically significant association was noted between rs16879765_CT, rs16879765_TT and rs13240429_AA variants and Dupuytren's disease. 2 haplotypes: rs2722280_C+rs13240429_A+rs16879765_C and rs2722280_C+rs13240429_G+rs16879765_T were found to be statistically significantly associated with Dupuytren's disease. Moreover, we found that rs13240429 and rs16879765 variants were in strong linkage disequilibrium, while rs2722280 was only in moderate linkage disequilibrium. No significant differences were found in the frequencies of the variants of the gene between the groups with a positive and negative familial history of Dupuytren's disease. In conclusion, results of this study suggest that EPDR1 gene can be added to a growing list of genes associated with Dupuytren's disease development. © Georg Thieme Verlag KG Stuttgart · New York.

  19. Replication Study Confirms the Association of the Common rs1800629 Variant of the TNFα Gene with Postmenopausal Osteoporosis Susceptibility in the Han Chinese Population.

    PubMed

    Jin, Xiaona; Zhou, Baozhen; Zhang, Dangfeng

    2018-04-01

    Previous studies have suggested that tumor necrosis factor α (TNF-α), encoded by the TNFα gene, can increase osteoclast formation, and that specific alleles of the TNFα gene are associated with postmenopausal osteoporosis susceptibility in some populations; however, the exact molecular mechanism remains unknown. To investigate the potential association of nineteen polymorphisms of the TNFα gene with postmenopausal osteoporosis and bone mineral density (BMD) traits in a sample of 1288 postmenopausal women from the Han Chinese population. A total of 437 postmenopausal osteoporosis patients and 851 unrelated age-matched healthy women were recruited to the study. Single marker and haplotype based analyses were conducted to evaluate the association of nineteen single nucleotide polymorphisms (SNPs) in both patient and control groups. The SNP rs1800629 was identified as being highly significantly associated with postmenopausal osteoporosis after accounting for age and body mass index (p = 0.000087). In addition, the GG genotype of this SNP was associated with significantly lower measures of femoral neck BMD and lumbar spine BMD. Moreover, haplotype based analyses suggested significant association signals between the haplotype block, including rs1800629 with postmenopausal osteoporosis (p < 0.001). We have shown that a TNFα gene polymorphism, rs1800629, is highly significantly associated with postmenopausal osteoporosis and BMD in the female Han Chinese population. Additional sequencing-based studies are needed to investigate the genetic architecture of this genomic region and its relationship with osteoporosis-related phenotypes.

  20. Identification of Suitable Reference Genes for Investigating Gene Expression in Anterior Cruciate Ligament Injury by Using Reverse Transcription-Quantitative PCR.

    PubMed

    Leal, Mariana Ferreira; Astur, Diego Costa; Debieux, Pedro; Arliani, Gustavo Gonçalves; Silveira Franciozi, Carlos Eduardo; Loyola, Leonor Casilla; Andreoli, Carlos Vicente; Smith, Marília Cardoso; Pochini, Alberto de Castro; Ejnisman, Benno; Cohen, Moises

    2015-01-01

    The anterior cruciate ligament (ACL) is one of the most frequently injured structures during high-impact sporting activities. Gene expression analysis may be a useful tool for understanding ACL tears and healing failure. Reverse transcription-quantitative polymerase chain reaction (RT-qPCR) has emerged as an effective method for such studies. However, this technique requires the use of suitable reference genes for data normalization. Here, we evaluated the suitability of six reference genes (18S, ACTB, B2M, GAPDH, HPRT1, and TBP) by using ACL samples of 39 individuals with ACL tears (20 with isolated ACL tears and 19 with ACL tear and combined meniscal injury) and of 13 controls. The stability of the candidate reference genes was determined by using the NormFinder, geNorm, BestKeeper DataAssist, and RefFinder software packages and the comparative ΔCt method. ACTB was the best single reference gene and ACTB+TBP was the best gene pair. The GenEx software showed that the accumulated standard deviation is reduced when a larger number of reference genes is used for gene expression normalization. However, the use of a single reference gene may not be suitable. To identify the optimal combination of reference genes, we evaluated the expression of FN1 and PLOD1. We observed that at least 3 reference genes should be used. ACTB+HPRT1+18S is the best trio for the analyses involving isolated ACL tears and controls. Conversely, ACTB+TBP+18S is the best trio for the analyses involving (1) injured ACL tears and controls, and (2) ACL tears of patients with meniscal tears and controls. Therefore, if the gene expression study aims to compare non-injured ACL, isolated ACL tears and ACL tears from patients with meniscal tear as three independent groups ACTB+TBP+18S+HPRT1 should be used. In conclusion, 3 or more genes should be used as reference genes for analysis of ACL samples of individuals with and without ACL tears.

  1. Cloning and characterization of cDNAs encoding human gastrin-releasing peptide.

    PubMed Central

    Spindel, E R; Chin, W W; Price, J; Rees, L H; Besser, G M; Habener, J F

    1984-01-01

    We have prepared and cloned cDNAs derived from poly(A)+ RNA from a human pulmonary carcinoid tumor rich in immunoreactivity to gastrin-releasing peptide, a peptide closely related in structure to amphibian bombesin. Mixtures of synthetic oligodeoxyribonucleotides corresponding to amphibian bombesin were used as hybridization probes to screen a cDNA library prepared from the tumor RNA. Sequencing of the recombinant plasmids shows that human gastrin-releasing peptide (hGRP) mRNA encodes a precursor of 148 amino acids containing a typical signal sequence, hGRP consisting of 27 or 28 amino acids, and a carboxyl-terminal extension peptide. hGRP is flanked at its carboxyl terminus by two basic amino acids, following a glycine used for amidation of the carboxyl-terminal methionine. RNA blot analyses of tumor RNA show a major mRNA of 900 bases and a minor mRNA of 850 bases. Blot hybridization analyses using human genomic DNA are consistent with a single hGRP-encoding gene. The presence of two mRNAs encoding the hGRP precursor protein in the face of a single hGRP gene raises the possibility of alternative processing of the single RNA transcript. Images PMID:6207529

  2. Evolution of Prdm Genes in Animals: Insights from Comparative Genomics

    PubMed Central

    Vervoort, Michel; Meulemeester, David; Béhague, Julien; Kerner, Pierre

    2016-01-01

    Prdm genes encode transcription factors with a subtype of SET domain known as the PRDF1-RIZ (PR) homology domain and a variable number of zinc finger motifs. These genes are involved in a wide variety of functions during animal development. As most Prdm genes have been studied in vertebrates, especially in mice, little is known about the evolution of this gene family. We searched for Prdm genes in the fully sequenced genomes of 93 different species representative of all the main metazoan lineages. A total of 976 Prdm genes were identified in these species. The number of Prdm genes per species ranges from 2 to 19. To better understand how the Prdm gene family has evolved in metazoans, we performed phylogenetic analyses using this large set of identified Prdm genes. These analyses allowed us to define 14 different subfamilies of Prdm genes and to establish, through ancestral state reconstruction, that 11 of them are ancestral to bilaterian animals. Three additional subfamilies were acquired during early vertebrate evolution (Prdm5, Prdm11, and Prdm17). Several gene duplication and gene loss events were identified and mapped onto the metazoan phylogenetic tree. By studying a large number of nonmetazoan genomes, we confirmed that Prdm genes likely constitute a metazoan-specific gene family. Our data also suggest that Prdm genes originated before the diversification of animals through the association of a single ancestral SET domain encoding gene with one or several zinc finger encoding genes. PMID:26560352

  3. Whole-genome relationships among Francisella bacteria of diverse origins define new species and provide specific regions for detection

    DOE PAGES

    Challacombe, Jean Faust; Petersen, Jeannine M.; Gallegos-Graves, La Verne A.; ...

    2016-11-23

    Francisella tularensis is a highly virulent zoonotic pathogen that causes tularemia and, because of weaponization efforts in past world wars, is considered a tier 1 biothreat agent. Detection and surveillance of F. tularensis may be confounded by the presence of uncharacterized, closely related organisms. Through DNA-based diagnostics and environmental surveys, novel clinical and environmental Francisella isolates have been obtained in recent years. Here we present 7 new Francisella genomes and a comparison of their characteristics to each other and to 24 publicly available genomes as well as a comparative analysis of 16S rRNA and sdhA genes from over 90 Francisellamore » strains. Delineation of new species in bacteria is challenging, especially when isolates having very close genomic characteristics exhibit different physiological features—for example, when some are virulent pathogens in humans and animals while others are nonpathogenic or are opportunistic pathogens. Species resolution within Francisella varies with analyses of single genes, multiple gene or protein sets, or whole-genome comparisons of nucleic acid and amino acid sequences. Analyses focusing on single genes (16S rRNA, sdhA), multiple gene sets (virulence genes, lipopolysaccharide [LPS] biosynthesis genes, pathogenicity island), and whole-genome comparisons (nucleotide and protein) gave congruent results, but with different levels of discrimination confidence. We designate four new species within the genus; Francisella opportunistica sp. nov. (MA06-7296), Francisella salina sp. nov. (TX07-7308), Francisella uliginis sp. nov. (TX07-7310), and Francisella frigiditurris sp. nov. (CA97-1460). Lastly, this study provides a robust comparative framework to discern species and virulence features of newly detected Francisella bacteria.« less

  4. Whole-genome relationships among Francisella bacteria of diverse origins define new species and provide specific regions for detection

    DOE Office of Scientific and Technical Information (OSTI.GOV)

    Challacombe, Jean Faust; Petersen, Jeannine M.; Gallegos-Graves, La Verne A.

    Francisella tularensis is a highly virulent zoonotic pathogen that causes tularemia and, because of weaponization efforts in past world wars, is considered a tier 1 biothreat agent. Detection and surveillance of F. tularensis may be confounded by the presence of uncharacterized, closely related organisms. Through DNA-based diagnostics and environmental surveys, novel clinical and environmental Francisella isolates have been obtained in recent years. Here we present 7 new Francisella genomes and a comparison of their characteristics to each other and to 24 publicly available genomes as well as a comparative analysis of 16S rRNA and sdhA genes from over 90 Francisellamore » strains. Delineation of new species in bacteria is challenging, especially when isolates having very close genomic characteristics exhibit different physiological features—for example, when some are virulent pathogens in humans and animals while others are nonpathogenic or are opportunistic pathogens. Species resolution within Francisella varies with analyses of single genes, multiple gene or protein sets, or whole-genome comparisons of nucleic acid and amino acid sequences. Analyses focusing on single genes (16S rRNA, sdhA), multiple gene sets (virulence genes, lipopolysaccharide [LPS] biosynthesis genes, pathogenicity island), and whole-genome comparisons (nucleotide and protein) gave congruent results, but with different levels of discrimination confidence. We designate four new species within the genus; Francisella opportunistica sp. nov. (MA06-7296), Francisella salina sp. nov. (TX07-7308), Francisella uliginis sp. nov. (TX07-7310), and Francisella frigiditurris sp. nov. (CA97-1460). Lastly, this study provides a robust comparative framework to discern species and virulence features of newly detected Francisella bacteria.« less

  5. A Fast Multiple-Kernel Method With Applications to Detect Gene-Environment Interaction.

    PubMed

    Marceau, Rachel; Lu, Wenbin; Holloway, Shannon; Sale, Michèle M; Worrall, Bradford B; Williams, Stephen R; Hsu, Fang-Chi; Tzeng, Jung-Ying

    2015-09-01

    Kernel machine (KM) models are a powerful tool for exploring associations between sets of genetic variants and complex traits. Although most KM methods use a single kernel function to assess the marginal effect of a variable set, KM analyses involving multiple kernels have become increasingly popular. Multikernel analysis allows researchers to study more complex problems, such as assessing gene-gene or gene-environment interactions, incorporating variance-component based methods for population substructure into rare-variant association testing, and assessing the conditional effects of a variable set adjusting for other variable sets. The KM framework is robust, powerful, and provides efficient dimension reduction for multifactor analyses, but requires the estimation of high dimensional nuisance parameters. Traditional estimation techniques, including regularization and the "expectation-maximization (EM)" algorithm, have a large computational cost and are not scalable to large sample sizes needed for rare variant analysis. Therefore, under the context of gene-environment interaction, we propose a computationally efficient and statistically rigorous "fastKM" algorithm for multikernel analysis that is based on a low-rank approximation to the nuisance effect kernel matrices. Our algorithm is applicable to various trait types (e.g., continuous, binary, and survival traits) and can be implemented using any existing single-kernel analysis software. Through extensive simulation studies, we show that our algorithm has similar performance to an EM-based KM approach for quantitative traits while running much faster. We also apply our method to the Vitamin Intervention for Stroke Prevention (VISP) clinical trial, examining gene-by-vitamin effects on recurrent stroke risk and gene-by-age effects on change in homocysteine level. © 2015 WILEY PERIODICALS, INC.

  6. Characterization of gonadal transcriptomes from the turbot (Scophthalmus maximus).

    PubMed

    Hu, Yulong; Huang, Meng; Wang, Weiji; Guan, Jiantao; Kong, Jie

    2016-01-01

    The mechanisms underlying sexual reproduction and sex ratio determination remains unclear in turbot, a flatfish of great commercial value. And there is limited information in the turbot database regarding genes related to the reproductive system. Here, we conducted high-throughput transcriptome profiling of turbot gonad tissues to better understand their reproductive functions and to supply essential gene sequence information for marker-assisted selection programs in the turbot industry. In this study, two gonad libraries representing sex differences in Scophthalmus maximus yielded 453 818 high-quality reads that were assembled into 24 611 contigs and 33 713 singletons by using 454 pyrosequencing, 13 936 contigs and singletons (CS) of which were annotated using BLASTx. GO (Gene Ontology) and KEGG (Kyoto Encyclopedia of Genes and Genomes) pathway analyses revealed that various biological functions and processes were associated with many of the annotated CS. Expression analyses showed that 510 genes were differentially expressed in males versus females; 80% of these genes were annotated. In addition, 6484 and 6036 single nucleotide polymorphisms (SNPs) were identified in male and female libraries, respectively. This transcriptome resource will serve as the foundation for cDNA or SNP microarray construction, gene expression characterization, and sex-specific linkage mapping in turbot.

  7. Single-cell transcriptome analysis of fish immune cells provides insight into the evolution of vertebrate immune cell types.

    PubMed

    Carmona, Santiago J; Teichmann, Sarah A; Ferreira, Lauren; Macaulay, Iain C; Stubbington, Michael J T; Cvejic, Ana; Gfeller, David

    2017-03-01

    The immune system of vertebrate species consists of many different cell types that have distinct functional roles and are subject to different evolutionary pressures. Here, we first analyzed conservation of genes specific for all major immune cell types in human and mouse. Our results revealed higher gene turnover and faster evolution of trans -membrane proteins in NK cells compared with other immune cell types, and especially T cells, but similar conservation of nuclear and cytoplasmic protein coding genes. To validate these findings in a distant vertebrate species, we used single-cell RNA sequencing of lck:GFP cells in zebrafish and obtained the first transcriptome of specific immune cell types in a nonmammalian species. Unsupervised clustering and single-cell TCR locus reconstruction identified three cell populations, T cells, a novel type of NK-like cells, and a smaller population of myeloid-like cells. Differential expression analysis uncovered new immune-cell-specific genes, including novel immunoglobulin-like receptors, and neofunctionalization of recently duplicated paralogs. Evolutionary analyses confirmed the higher gene turnover of trans -membrane proteins in NK cells compared with T cells in fish species, suggesting that this is a general property of immune cell types across all vertebrates. © 2017 Carmona et al.; Published by Cold Spring Harbor Laboratory Press.

  8. Single-cell transcriptome analysis of fish immune cells provides insight into the evolution of vertebrate immune cell types

    PubMed Central

    Ferreira, Lauren; Macaulay, Iain C.; Stubbington, Michael J.T.

    2017-01-01

    The immune system of vertebrate species consists of many different cell types that have distinct functional roles and are subject to different evolutionary pressures. Here, we first analyzed conservation of genes specific for all major immune cell types in human and mouse. Our results revealed higher gene turnover and faster evolution of trans-membrane proteins in NK cells compared with other immune cell types, and especially T cells, but similar conservation of nuclear and cytoplasmic protein coding genes. To validate these findings in a distant vertebrate species, we used single-cell RNA sequencing of lck:GFP cells in zebrafish and obtained the first transcriptome of specific immune cell types in a nonmammalian species. Unsupervised clustering and single-cell TCR locus reconstruction identified three cell populations, T cells, a novel type of NK-like cells, and a smaller population of myeloid-like cells. Differential expression analysis uncovered new immune-cell–specific genes, including novel immunoglobulin-like receptors, and neofunctionalization of recently duplicated paralogs. Evolutionary analyses confirmed the higher gene turnover of trans-membrane proteins in NK cells compared with T cells in fish species, suggesting that this is a general property of immune cell types across all vertebrates. PMID:28087841

  9. The single mitochondrial chromosome typical of animals has evolved into 18 minichromosomes in the human body louse, Pediculus humanus

    PubMed Central

    Shao, Renfu; Kirkness, Ewen F.; Barker, Stephen C.

    2009-01-01

    The mitochondrial (mt) genomes of animals typically consist of a single circular chromosome that is ∼16-kb long and has 37 genes. Our analyses of the sequence reads from the Human Body Louse Genome Project and the patterns of gel electrophoresis and Southern hybridization revealed a novel type of mt genome in the sucking louse, Pediculus humanus. Instead of having all mt genes on a single chromosome, the 37 mt genes of this louse are on 18 minicircular chromosomes. Each minicircular chromosome is 3–4 kb long and has one to three genes. Minicircular mt chromosomes are also present in the four other species of sucking lice that we investigated, but not in chewing lice nor in the Psocoptera, to which sucking lice are most closely related. We also report unequivocal evidence for recombination between minicircular mt chromosomes in P. humanus and for sequence variation in mt genes generated by recombination. The advantages of a fragmented mt genome, if any, are currently unknown. Fragmentation of mt genome, however, has coevolved with blood feeding in the sucking lice. It will be of interest to explore whether or not life history features are associated with the evolution of fragmented chromosomes. PMID:19336451

  10. The single mitochondrial chromosome typical of animals has evolved into 18 minichromosomes in the human body louse, Pediculus humanus.

    PubMed

    Shao, Renfu; Kirkness, Ewen F; Barker, Stephen C

    2009-05-01

    The mitochondrial (mt) genomes of animals typically consist of a single circular chromosome that is approximately 16-kb long and has 37 genes. Our analyses of the sequence reads from the Human Body Louse Genome Project and the patterns of gel electrophoresis and Southern hybridization revealed a novel type of mt genome in the sucking louse, Pediculus humanus. Instead of having all mt genes on a single chromosome, the 37 mt genes of this louse are on 18 minicircular chromosomes. Each minicircular chromosome is 3-4 kb long and has one to three genes. Minicircular mt chromosomes are also present in the four other species of sucking lice that we investigated, but not in chewing lice nor in the Psocoptera, to which sucking lice are most closely related. We also report unequivocal evidence for recombination between minicircular mt chromosomes in P. humanus and for sequence variation in mt genes generated by recombination. The advantages of a fragmented mt genome, if any, are currently unknown. Fragmentation of mt genome, however, has coevolved with blood feeding in the sucking lice. It will be of interest to explore whether or not life history features are associated with the evolution of fragmented chromosomes.

  11. Impact of sequencing depth and read length on single cell RNA sequencing data of T cells.

    PubMed

    Rizzetto, Simone; Eltahla, Auda A; Lin, Peijie; Bull, Rowena; Lloyd, Andrew R; Ho, Joshua W K; Venturi, Vanessa; Luciani, Fabio

    2017-10-06

    Single cell RNA sequencing (scRNA-seq) provides great potential in measuring the gene expression profiles of heterogeneous cell populations. In immunology, scRNA-seq allowed the characterisation of transcript sequence diversity of functionally relevant T cell subsets, and the identification of the full length T cell receptor (TCRαβ), which defines the specificity against cognate antigens. Several factors, e.g. RNA library capture, cell quality, and sequencing output affect the quality of scRNA-seq data. We studied the effects of read length and sequencing depth on the quality of gene expression profiles, cell type identification, and TCRαβ reconstruction, utilising 1,305 single cells from 8 publically available scRNA-seq datasets, and simulation-based analyses. Gene expression was characterised by an increased number of unique genes identified with short read lengths (<50 bp), but these featured higher technical variability compared to profiles from longer reads. Successful TCRαβ reconstruction was achieved for 6 datasets (81% - 100%) with at least 0.25 millions (PE) reads of length >50 bp, while it failed for datasets with <30 bp reads. Sufficient read length and sequencing depth can control technical noise to enable accurate identification of TCRαβ and gene expression profiles from scRNA-seq data of T cells.

  12. Single cell transcriptomic analysis of prostate cancer cells.

    PubMed

    Welty, Christopher J; Coleman, Ilsa; Coleman, Roger; Lakely, Bryce; Xia, Jing; Chen, Shu; Gulati, Roman; Larson, Sandy R; Lange, Paul H; Montgomery, Bruce; Nelson, Peter S; Vessella, Robert L; Morrissey, Colm

    2013-02-16

    The ability to interrogate circulating tumor cells (CTC) and disseminated tumor cells (DTC) is restricted by the small number detected and isolated (typically <10). To determine if a commercially available technology could provide a transcriptomic profile of a single prostate cancer (PCa) cell, we clonally selected and cultured a single passage of cell cycle synchronized C4-2B PCa cells. Ten sets of single, 5-, or 10-cells were isolated using a micromanipulator under direct visualization with an inverted microscope. Additionally, two groups of 10 individual DTC, each isolated from bone marrow of 2 patients with metastatic PCa were obtained. RNA was amplified using the WT-Ovation™ One-Direct Amplification System. The amplified material was hybridized on a 44K Whole Human Gene Expression Microarray. A high stringency threshold, a mean Alexa Fluor® 3 signal intensity above 300, was used for gene detection. Relative expression levels were validated for select genes using real-time PCR (RT-qPCR). Using this approach, 22,410, 20,423, and 17,009 probes were positive on the arrays from 10-cell pools, 5-cell pools, and single-cells, respectively. The sensitivity and specificity of gene detection on the single-cell analyses were 0.739 and 0.972 respectively when compared to 10-cell pools, and 0.814 and 0.979 respectively when compared to 5-cell pools, demonstrating a low false positive rate. Among 10,000 randomly selected pairs of genes, the Pearson correlation coefficient was 0.875 between the single-cell and 5-cell pools and 0.783 between the single-cell and 10-cell pools. As expected, abundant transcripts in the 5- and 10-cell samples were detected by RT-qPCR in the single-cell isolates, while lower abundance messages were not. Using the same stringency, 16,039 probes were positive on the patient single-cell arrays. Cluster analysis showed that all 10 DTC grouped together within each patient. A transcriptomic profile can be reliably obtained from a single cell using commercially available technology. As expected, fewer amplified genes are detected from a single-cell sample than from pooled-cell samples, however this method can be used to reliably obtain a transcriptomic profile from DTC isolated from the bone marrow of patients with PCa.

  13. Phylogenetic analysis of the cytochrome P450 3 (CYP3) gene family.

    PubMed

    McArthur, Andrew G; Hegelund, Tove; Cox, Rachel L; Stegeman, John J; Liljenberg, Mette; Olsson, Urban; Sundberg, Per; Celander, Malin C

    2003-08-01

    Cytochrome P450 genes (CYP) constitute a superfamily with members known from the Bacteria, Archaea, and Eukarya. The CYP3 gene family includes the CYP3A and CYP3B subfamilies. Members of the CYP3A subfamily represent the dominant CYP forms expressed in the digestive and respiratory tracts of vertebrates. The CYP3A enzymes metabolize a wide variety of chemically diverse lipophilic organic compounds. To understand vertebrate CYP3 diversity better, we determined the killifish (Fundulus heteroclitus) CYP3A30 and CYP3A56 and the ball python (Python regius) CYP3A42 sequences. We performed phylogenetic analyses of 45 vertebrate CYP3 amino acid sequences using a Bayesian approach. Our analyses indicate that teleost, diapsid, and mammalian CYP3A genes have undergone independent diversification and that the ancestral vertebrate genome contained a single CYP3A gene. Most CYP3A diversity is the product of recent gene duplication events. There is strong support for placement of the guinea pig CYP3A genes within the rodent CYP3A diversification. The rat, mouse, and hamster CYP3A genes are mixed among several rodent CYP3A subclades, indicative of a complex history involving speciation and gene duplication.

  14. Lupin nad9 and nad6 genes and their expression: 5' termini of the nad9 gene transcripts differentiate lupin species.

    PubMed

    Rurek, Michał; Nuc, Katarzyna; Raczyńska, Katarzyna Dorota; Augustyniak, Halina

    2003-10-02

    The mitochondrial nad9 and nad6 genes were analyzed in four lupin species: Lupinus luteus, Lupinus angustifolius, Lupinus albus and Lupinus mutabilis. The nucleotide sequence of these genes confirmed their high conservation, however, higher number of nucleotide substitution was observed in the L. albus genes. Southern hybridizations confirmed the presence of single copy number of these genes in L. luteus, L. albus and L. angustifolius. The expression of nad9 and nad6 genes was analyzed by Northern in different tissue types of analyzed lupin species. Transcription analyses of the two nad genes displayed single predominant mRNA species of about 0.6 kb in L. luteus and L. angustifolius. The L. albus transcripts were larger in size. The nad9 and nad6 transcripts were modified by RNA editing at 8 and 11 positions, in L. luteus and L. angustifolius, respectively. The gene order, rps3-rpl16-nad9, found in Arabidopsis thaliana is also conserved in L. luteus and L. angustifolius mitochondria. L. luteus and L. angustifolius showed some variability in the sequence of the nad9 promoter region. The last feature along with the differences observed in nad9 mRNA 5' termini of two lupins differentiate L. luteus and L. angustifolius species.

  15. Hepatic gene expression in rainbow trout (Oncorhynchus mykiss) exposed to different hydrocarbon mixtures.

    PubMed

    Hook, Sharon E; Lampi, Mark A; Febbo, Eric J; Ward, Jeff A; Parkerton, Thomas F

    2010-09-01

    Traditional biomarkers for hydrocarbon exposure are not induced by all petroleum substances. The objective of this study was to determine if exposure to a crude oil and different refined oils would generate a common hydrocarbon-specific response in gene expression profiles that could be used as generic biomarkers of hydrocarbon exposure. Juvenile rainbow trout (Oncorhynchus mykiss) were exposed to the water accommodated fraction (WAF) of either kerosene, gas oil, heavy fuel oil, or crude oil for 96 h. Tissue was collected for RNA extraction and microarray analysis. Exposure to each WAF resulted in a different list of differentially regulated genes, with few genes in common across treatments. Exposure to crude oil WAF changed the expression of genes including cytochrome P4501A (CYP1A) and glutathione-S-transferase (GST) with known roles in detoxification pathways. These gene expression profiles were compared to others from previous experiments that used a diverse suite of toxicants. Clustering algorithms successfully identified gene expression profiles resulting from hydrocarbon exposure. These preliminary analyses highlight the difficulties of using single genes as diagnostic of petroleum hydrocarbon exposures. Further work is needed to determine if multivariate transcriptomic-based biomarkers may be a more effective tool than single gene studies for exposure monitoring of different oils. Copyright 2010 SETAC.

  16. Inheritance of Virulence, Construction of a Linkage Map, and Mapping Dominant Virulence Genes in Puccinia striiformis f. sp. tritici Through Characterization of a Sexual Population with Genotyping-by-Sequencing.

    PubMed

    Yuan, Congying; Wang, Meinan; Skinner, Danniel Z; See, Deven R; Xia, Chongjing; Guo, Xinhong; Chen, Xianming

    2018-01-01

    Puccinia striiformis f. sp. tritici, the wheat stripe rust pathogen, is a dikaryotic, biotrophic, and macrocyclic fungus. Genetic study of P. striiformis f. sp. tritici virulence was not possible until the recent discovery of Berberis spp. and Mahonia spp. as alternate hosts. To determine inheritance of virulence and map virulence genes, a segregating population of 119 isolates was developed by self-fertilizing P. striiformis f. sp. tritici isolate 08-220 (race PSTv-11) on barberry leaves under controlled greenhouse conditions. The progeny isolates were phenotyped on a set of 29 wheat lines with single genes for race-specific resistance and genotyped with simple sequence repeat (SSR) markers, single nucleotide polymorphism (SNP) markers derived from secreted protein genes, and SNP markers from genotyping-by-sequencing (GBS). Using the GBS technique, 10,163 polymorphic GBS-SNP markers were identified. Clustering and principal component analysis grouped these markers into six genetic groups, and a genetic map, consisting of six linkage groups, was constructed with 805 markers. The six clusters or linkage groups resulting from these analyses indicated a haploid chromosome number of six in P. striiformis f. sp. tritici. Through virulence testing of the progeny isolates, the parental isolate was found to be homozygous for the avirulence loci corresponding to resistance genes Yr5, Yr10, Yr15, Yr24, Yr32, YrSP, YrTr1, Yr45, and Yr53 and homozygous for the virulence locus corresponding to resistance gene Yr41. Segregation was observed for virulence phenotypes in response to the remaining 19 single-gene lines. A single dominant gene or two dominant genes with different nonallelic gene interactions were identified for each of the segregating virulence phenotypes. Of 27 dominant virulence genes identified, 17 were mapped to two chromosomes. Markers tightly linked to some of the virulence loci may facilitate further studies to clone these genes. The virulence genes and their inheritance information are useful for understanding the host-pathogen interactions and for selecting effective resistance genes or gene combinations for developing stripe rust resistant wheat cultivars.

  17. MGAS: a powerful tool for multivariate gene-based genome-wide association analysis.

    PubMed

    Van der Sluis, Sophie; Dolan, Conor V; Li, Jiang; Song, Youqiang; Sham, Pak; Posthuma, Danielle; Li, Miao-Xin

    2015-04-01

    Standard genome-wide association studies, testing the association between one phenotype and a large number of single nucleotide polymorphisms (SNPs), are limited in two ways: (i) traits are often multivariate, and analysis of composite scores entails loss in statistical power and (ii) gene-based analyses may be preferred, e.g. to decrease the multiple testing problem. Here we present a new method, multivariate gene-based association test by extended Simes procedure (MGAS), that allows gene-based testing of multivariate phenotypes in unrelated individuals. Through extensive simulation, we show that under most trait-generating genotype-phenotype models MGAS has superior statistical power to detect associated genes compared with gene-based analyses of univariate phenotypic composite scores (i.e. GATES, multiple regression), and multivariate analysis of variance (MANOVA). Re-analysis of metabolic data revealed 32 False Discovery Rate controlled genome-wide significant genes, and 12 regions harboring multiple genes; of these 44 regions, 30 were not reported in the original analysis. MGAS allows researchers to conduct their multivariate gene-based analyses efficiently, and without the loss of power that is often associated with an incorrectly specified genotype-phenotype models. MGAS is freely available in KGG v3.0 (http://statgenpro.psychiatry.hku.hk/limx/kgg/download.php). Access to the metabolic dataset can be requested at dbGaP (https://dbgap.ncbi.nlm.nih.gov/). The R-simulation code is available from http://ctglab.nl/people/sophie_van_der_sluis. Supplementary data are available at Bioinformatics online. © The Author 2014. Published by Oxford University Press.

  18. Analyzing contentious relationships and outlier genes in phylogenomics.

    PubMed

    Walker, Joseph F; Brown, Joseph W; Smith, Stephen A

    2018-06-08

    Recent studies have demonstrated that conflict is common among gene trees in phylogenomic studies, and that less than one percent of genes may ultimately drive species tree inference in supermatrix analyses. Here, we examined two datasets where supermatrix and coalescent-based species trees conflict. We identified two highly influential "outlier" genes in each dataset. When removed from each dataset, the inferred supermatrix trees matched the topologies obtained from coalescent analyses. We also demonstrate that, while the outlier genes in the vertebrate dataset have been shown in a previous study to be the result of errors in orthology detection, the outlier genes from a plant dataset did not exhibit any obvious systematic error and therefore may be the result of some biological process yet to be determined. While topological comparisons among a small set of alternate topologies can be helpful in discovering outlier genes, they can be limited in several ways, such as assuming all genes share the same topology. Coalescent species tree methods relax this assumption but do not explicitly facilitate the examination of specific edges. Coalescent methods often also assume that conflict is the result of incomplete lineage sorting (ILS). Here we explored a framework that allows for quickly examining alternative edges and support for large phylogenomic datasets that does not assume a single topology for all genes. For both datasets, these analyses provided detailed results confirming the support for coalescent-based topologies. This framework suggests that we can improve our understanding of the underlying signal in phylogenomic datasets by asking more targeted edge-based questions.

  19. The genetics of feed conversion efficiency traits in a commercial broiler line

    PubMed Central

    Reyer, Henry; Hawken, Rachel; Murani, Eduard; Ponsuksili, Siriluck; Wimmers, Klaus

    2015-01-01

    Individual feed conversion efficiency (FCE) is a major trait that influences the usage of energy resources and the ecological footprint of livestock production. The underlying biological processes of FCE are complex and are influenced by factors as diverse as climate, feed properties, gut microbiota, and individual genetic predisposition. To gain an insight to the genetic relationships with FCE traits and to contribute to the improvement of FCE in commercial chicken lines, a genome-wide association study was conducted using a commercial broiler population (n = 859) tested for FCE and weight traits during the finisher period from 39 to 46 days of age. Both single-marker (generalized linear model) and multi-marker (Bayesian approach) analyses were applied to the dataset to detect genes associated with the variability in FCE. The separate analyses revealed 22 quantitative trait loci (QTL) regions on 13 different chromosomes; the integration of both approaches resulted in 7 overlapping QTL regions. The analyses pointed to acylglycerol kinase (AGK) and general transcription factor 2-I (GTF2I) as positional and functional candidate genes. Non-synonymous polymorphisms of both candidate genes revealed evidence for a functional importance of these genes by influencing different biological aspects of FCE. PMID:26552583

  20. A Family-Based Association Study of CYP11A1 and CYP11B1 Gene Polymorphisms With Autism in Chinese Trios.

    PubMed

    Deng, Hong-Zhu; You, Cong; Xing, Yu; Chen, Kai-Yun; Zou, Xiao-Bing

    2016-05-01

    Autism spectrum disorder is a group of neurodevelopmental disorders with the higher prevalence in males. Our previous studies have indicated lower progesterone levels in the children with autism spectrum disorder, suggesting involvement of the cytochrome P-450scc gene (CYP11A1) and cytochrome P-45011beta gene (CYP11B1) as candidate genes in autism spectrum disorder. The aim of this study was to investigate the family-based genetic association between single-nucleotide polymorphisms, rs2279357 in the CYP11A1 gene and rs4534 and rs4541 in the CYP11B1 gene and autism spectrum disorder in Chinese children, which were selected according to the location in the coding region and 5' and 3' regions and minor allele frequencies of greater than 0.05 in the Chinese populations. The transmission disequilibrium test and case-control association analyses were performed in 100 Chinese Han autism spectrum disorder family trios. The genotype and allele frequency of the 3 single-nucleotide polymorphisms had no statistical difference between the children with autism spectrum disorder and their parents (P> .05). Transmission disequilibrium test analysis showed transmission disequilibrium of CYP11A1 gene rs2279357 single-nucleotide polymorphisms (χ(2)= 5.038,P< .001). Our findings provide further support for the hypothesis that a susceptibility gene for autism spectrum disorder exists within or near the CYP11A1 gene in the Han Chinese population. © The Author(s) 2015.

  1. Analysis of Single-cell Gene Transcription by RNA Fluorescent In Situ Hybridization (FISH)

    PubMed Central

    Ronander, Elena; Bengtsson, Dominique C.; Joergensen, Louise; Jensen, Anja T. R.; Arnot, David E.

    2012-01-01

    Adhesion of Plasmodium falciparum infected erythrocytes (IE) to human endothelial receptors during malaria infections is mediated by expression of PfEMP1 protein variants encoded by the var genes. The haploid P. falciparum genome harbors approximately 60 different var genes of which only one has been believed to be transcribed per cell at a time during the blood stage of the infection. How such mutually exclusive regulation of var gene transcription is achieved is unclear, as is the identification of individual var genes or sub-groups of var genes associated with different receptors and the consequence of differential binding on the clinical outcome of P. falciparum infections. Recently, the mutually exclusive transcription paradigm has been called into doubt by transcription assays based on individual P. falciparum transcript identification in single infected erythrocytic cells using RNA fluorescent in situ hybridization (FISH) analysis of var gene transcription by the parasite in individual nuclei of P. falciparum IE1. Here, we present a detailed protocol for carrying out the RNA-FISH methodology for analysis of var gene transcription in single-nuclei of P. falciparum infected human erythrocytes. The method is based on the use of digoxigenin- and biotin- labeled antisense RNA probes using the TSA Plus Fluorescence Palette System2 (Perkin Elmer), microscopic analyses and freshly selected P. falciparum IE. The in situ hybridization method can be used to monitor transcription and regulation of a variety of genes expressed during the different stages of the P. falciparum life cycle and is adaptable to other malaria parasite species and other organisms and cell types. PMID:23070076

  2. Mammalian monogamy is not controlled by a single gene

    PubMed Central

    Fink, Sabine; Excoffier, Laurent; Heckel, Gerald

    2006-01-01

    Complex social behavior in Microtus voles and other mammals has been postulated to be under the direct genetic control of a single locus: the arginine vasopressin 1a receptor (avpr1a) gene. Using a phylogenetic approach, we show that a repetitive element in the promoter region of avpr1a, which reportedly causes social monogamy, is actually widespread in nonmonogamous Microtus and other rodents. There was no evidence for intraspecific polymorphism in regard to the presence or absence of the repetitive element. Among 25 rodent species studied, the element was absent in only two closely related nonmonogamous species, indicating that this absence is certainly the result of an evolutionarily recent loss. Our analyses further demonstrate that the repetitive structures upstream of the avpr1a gene in humans and primates, which have been associated with social bonding, are evolutionarily distinct from those in rodents. Our evolutionary approach reveals that monogamy in rodents is not controlled by a single polymorphism in the promoter region of the avpr1a gene. We thus resolve the contradiction between the claims for an evolutionarily conserved genetic programming of social behavior in mammals and the vast evidence for highly complex and flexible mating systems. PMID:16832060

  3. Association between Single Nucleotide Polymorphisms of the Major Histocompatibility Complex Class II Gene and Newcastle Disease Virus Titre and Body Weight in Leung Hang Khao Chickens

    PubMed Central

    Molee, A.; Kongroi, K.; Kuadsantia, P.; Poompramun, C.; Likitdecharote, B.

    2016-01-01

    The aim of the present study was to investigate the effect of single nucleotide polymorphisms in the major histocompatibility complex (MHC) class II gene on resistance to Newcastle disease virus and body weight of the Thai indigenous chicken, Leung Hang Khao (Gallus gallus domesticus). Blood samples were collected for single nucleotide polymorphism analysis from 485 chickens. Polymerase chain reaction sequencing was used to classify single nucleotide polymorphisms of class II MHC. Body weights were measured at the ages of 3, 4, 5, and 7 months. Titres of Newcastle disease virus at 2 weeks to 7 months were determined and the correlation between body weight and titre was analysed. The association between single nucleotide polymorphisms and body weight and titre were analysed by a generalized linear model. Seven single nucleotide polymorphisms were identified: C125T, A126T, C209G, C242T, A243T, C244T, and A254T. Significant correlations between log titre and body weight were found at 2 and 4 weeks. Associations between single nucleotide polymorphisms and titre were found for C209G and A254T, and between all single nucleotide polymorphisms (except A243T) and body weight. The results showed that class II MHC is associated with both titre of Newcastle disease virus and body weight in Leung Hang Khao chickens. This is of concern because improved growth traits are the main goal of breeding selection. Moreover, the results suggested that MHC has a pleiotropic effect on the titre and growth performance. This mechanism should be investigated in a future study. PMID:26732325

  4. Isotachophoresis for fractionation and recovery of cytoplasmic RNA and nucleus from single cells.

    PubMed

    Kuriyama, Kentaro; Shintaku, Hirofumi; Santiago, Juan G

    2015-07-01

    There is a substantial need for simultaneous analyses of RNA and DNA from individual single cells. Such analysis provides unique evidence of cell-to-cell differences and the correlation between gene expression and genomic mutation in highly heterogeneous cell populations. We present a novel microfluidic system that leverages isotachophoresis to fractionate and isolate cytoplasmic RNA and genomic DNA (gDNA) from single cells. The system uniquely enables independent, sequence-specific analyses of these critical markers. Our system uses a microfluidic chip with a simple geometry and four end-channel electrodes, and completes the entire process in <5 min, including lysis, purification, fractionation, and delivery to DNA and RNA output reservoirs, each containing high quality and purity aliquots with no measurable cross-contamination of cytoplasmic RNA versus gDNA. We demonstrate our system with simultaneous, sequence-specific quantitation using off-chip RT-qPCR and qPCR for simultaneous cytoplasmic RNA and gDNA analyses, respectively. © 2015 WILEY-VCH Verlag GmbH & Co. KGaA, Weinheim.

  5. A Single-Cell Roadmap of Lineage Bifurcation in Human ESC Models of Embryonic Brain Development.

    PubMed

    Yao, Zizhen; Mich, John K; Ku, Sherman; Menon, Vilas; Krostag, Anne-Rachel; Martinez, Refugio A; Furchtgott, Leon; Mulholland, Heather; Bort, Susan; Fuqua, Margaret A; Gregor, Ben W; Hodge, Rebecca D; Jayabalu, Anu; May, Ryan C; Melton, Samuel; Nelson, Angelique M; Ngo, N Kiet; Shapovalova, Nadiya V; Shehata, Soraya I; Smith, Michael W; Tait, Leah J; Thompson, Carol L; Thomsen, Elliot R; Ye, Chaoyang; Glass, Ian A; Kaykas, Ajamete; Yao, Shuyuan; Phillips, John W; Grimley, Joshua S; Levi, Boaz P; Wang, Yanling; Ramanathan, Sharad

    2017-01-05

    During human brain development, multiple signaling pathways generate diverse cell types with varied regional identities. Here, we integrate single-cell RNA sequencing and clonal analyses to reveal lineage trees and molecular signals underlying early forebrain and mid/hindbrain cell differentiation from human embryonic stem cells (hESCs). Clustering single-cell transcriptomic data identified 41 distinct populations of progenitor, neuronal, and non-neural cells across our differentiation time course. Comparisons with primary mouse and human gene expression data demonstrated rostral and caudal progenitor and neuronal identities from early brain development. Bayesian analyses inferred a unified cell-type lineage tree that bifurcates between cortical and mid/hindbrain cell types. Two methods of clonal analyses confirmed these findings and further revealed the importance of Wnt/β-catenin signaling in controlling this lineage decision. Together, these findings provide a rich transcriptome-based lineage map for studying human brain development and modeling developmental disorders. Copyright © 2017 Elsevier Inc. All rights reserved.

  6. Genome-wide association and network analysis of lung function in the Framingham Heart Study.

    PubMed

    Liao, Shu-Yi; Lin, Xihong; Christiani, David C

    2014-09-01

    Single nucleotide polymorphisms have been found to be associated with pulmonary function using genome-wide association studies. However, lung function is a complex trait that is likely to be influenced by multiple gene-gene interactions besides individual genes. Our goal is to build a cellular network to explore the relationship between pulmonary function and genotypes by combining SNP level and network analyses using longitudinal lung function data from the Framingham Heart Study. We analyzed 2,698 genotyped participants from the Offspring cohort that had an average of 3.35 spirometry measurements per person for a mean length of 13 years. Repeated forced expiratory volume in one second (FEV1 ) and the ratio of FEV1 to forced vital capacity (FVC) were used as outcomes. Data were analyzed using linear-mixed models for the association between lung function and alleles by accounting for the correlation among repeated measures over time within the same subject and within-family correlation. Network analyses were performed using dmGWAS and validated with data from the Third Generation cohort. Analyses identified SMAD3, TGFBR2, CD44, CTGF, VCAN, CTNNB1, SCGB1A1, PDE4D, NRG1, EPHB1, and LYN as contributors to pulmonary function. Most of these genes were novel that were not found previously using solely SNP-level analysis. These novel genes are involving the transforming growth factor beta (TGFB)-SMAD pathway, Wnt/beta-catenin pathway, etc. Therefore, combining SNP-level and network analyses using longitudinal lung function data is a useful alternative strategy to identify risk genes. © 2014 WILEY PERIODICALS, INC.

  7. Divergence and codon usage bias of Betanodavirus, a neurotropic pathogen in fish.

    PubMed

    He, Mei; Teng, Chun-Bo

    2015-02-01

    Betanodavirus is a small bipartite RNA virus of global economical significance that can cause severe neurological disorders to an increasing number of marine fish species. Herein, to further the understanding of the evolution of betanodavirus, Bayesian coalescent analyses were conducted to the time-stamped entire coding sequences of their RNA polymerase and coat protein genes. Similar moderate nucleotide substitution rates were then estimated for the two genes. According to age calculations, the divergence of the two genes into the four genotypes initiated nearly simultaneously at ∼700 years ago, despite the different scenarios, whereas the seven analyzed chimeric isolates might be the outcomes of a single genetic reassortment event taking place in the early 1980s in Southern Europe. Furthermore, codon usage bias analyses indicated that each gene had influences in addition to mutational bias and codon choice of betanodavirus was not completely complied with that of fish host. Copyright © 2014 Elsevier Inc. All rights reserved.

  8. Identification of a common cyanobacterial symbiont associated with Azolla spp. through molecular and morphological characterization of free-living and symbiotic cyanobacteria.

    PubMed Central

    Gebhardt, J S; Nierzwicki-Bauer, S A

    1991-01-01

    Symbiotically associated cyanobacteria from Azolla mexicana and Azolla pinnata were isolated and cultured in a free-living state. Morphological analyses revealed differences between the free-living isolates and their symbiotic counterparts, as did restriction fragment length polymorphism (RFLP) analyses with both single-copy glnA and rbcS gene probes and a multicopy psbA gene probe. RFLP analyses with Anabaena sp. strain PCC 7120 nifD excision element probes, including an xisA gene probe, detected homologous sequences in DNA extracted from the free-living isolates. Sequences homologous to these probes were not detected in DNA from the symbiotically associated cyanobacteria. These analyses indicated that the isolates were not identical to the major cyanobacterial symbiont species residing in leaf cavities of Azolla spp. Nevertheless, striking similarities between several free-living isolates were observed. In every instance, the isolate from A. pinnata displayed banding patterns virtually identical to those of free-living cultures previously isolated from Azolla caroliniana and Azolla filiculoides. These results suggest the ubiquitous presence of a culturable minor cyanobacterial symbiont in at least three species of Azolla. Images PMID:1685078

  9. Integrative modeling of gene and genome evolution roots the archaeal tree of life

    PubMed Central

    Szöllősi, Gergely J.; Spang, Anja; Foster, Peter G.; Heaps, Sarah E.; Boussau, Bastien; Ettema, Thijs J. G.; Embley, T. Martin

    2017-01-01

    A root for the archaeal tree is essential for reconstructing the metabolism and ecology of early cells and for testing hypotheses that propose that the eukaryotic nuclear lineage originated from within the Archaea; however, published studies based on outgroup rooting disagree regarding the position of the archaeal root. Here we constructed a consensus unrooted archaeal topology using protein concatenation and a multigene supertree method based on 3,242 single gene trees, and then rooted this tree using a recently developed model of genome evolution. This model uses evidence from gene duplications, horizontal transfers, and gene losses contained in 31,236 archaeal gene families to identify the most likely root for the tree. Our analyses support the monophyly of DPANN (Diapherotrites, Parvarchaeota, Aenigmarchaeota, Nanoarchaeota, Nanohaloarchaea), a recently discovered cosmopolitan and genetically diverse lineage, and, in contrast to previous work, place the tree root between DPANN and all other Archaea. The sister group to DPANN comprises the Euryarchaeota and the TACK Archaea, including Lokiarchaeum, which our analyses suggest are monophyletic sister lineages. Metabolic reconstructions on the rooted tree suggest that early Archaea were anaerobes that may have had the ability to reduce CO2 to acetate via the Wood–Ljungdahl pathway. In contrast to proposals suggesting that genome reduction has been the predominant mode of archaeal evolution, our analyses infer a relatively small-genomed archaeal ancestor that subsequently increased in complexity via gene duplication and horizontal gene transfer. PMID:28533395

  10. Integrative modeling of gene and genome evolution roots the archaeal tree of life.

    PubMed

    Williams, Tom A; Szöllősi, Gergely J; Spang, Anja; Foster, Peter G; Heaps, Sarah E; Boussau, Bastien; Ettema, Thijs J G; Embley, T Martin

    2017-06-06

    A root for the archaeal tree is essential for reconstructing the metabolism and ecology of early cells and for testing hypotheses that propose that the eukaryotic nuclear lineage originated from within the Archaea; however, published studies based on outgroup rooting disagree regarding the position of the archaeal root. Here we constructed a consensus unrooted archaeal topology using protein concatenation and a multigene supertree method based on 3,242 single gene trees, and then rooted this tree using a recently developed model of genome evolution. This model uses evidence from gene duplications, horizontal transfers, and gene losses contained in 31,236 archaeal gene families to identify the most likely root for the tree. Our analyses support the monophyly of DPANN (Diapherotrites, Parvarchaeota, Aenigmarchaeota, Nanoarchaeota, Nanohaloarchaea), a recently discovered cosmopolitan and genetically diverse lineage, and, in contrast to previous work, place the tree root between DPANN and all other Archaea. The sister group to DPANN comprises the Euryarchaeota and the TACK Archaea, including Lokiarchaeum , which our analyses suggest are monophyletic sister lineages. Metabolic reconstructions on the rooted tree suggest that early Archaea were anaerobes that may have had the ability to reduce CO 2 to acetate via the Wood-Ljungdahl pathway. In contrast to proposals suggesting that genome reduction has been the predominant mode of archaeal evolution, our analyses infer a relatively small-genomed archaeal ancestor that subsequently increased in complexity via gene duplication and horizontal gene transfer.

  11. Evaluating Fast Maximum Likelihood-Based Phylogenetic Programs Using Empirical Phylogenomic Data Sets

    PubMed Central

    Zhou, Xiaofan; Shen, Xing-Xing; Hittinger, Chris Todd

    2018-01-01

    Abstract The sizes of the data matrices assembled to resolve branches of the tree of life have increased dramatically, motivating the development of programs for fast, yet accurate, inference. For example, several different fast programs have been developed in the very popular maximum likelihood framework, including RAxML/ExaML, PhyML, IQ-TREE, and FastTree. Although these programs are widely used, a systematic evaluation and comparison of their performance using empirical genome-scale data matrices has so far been lacking. To address this question, we evaluated these four programs on 19 empirical phylogenomic data sets with hundreds to thousands of genes and up to 200 taxa with respect to likelihood maximization, tree topology, and computational speed. For single-gene tree inference, we found that the more exhaustive and slower strategies (ten searches per alignment) outperformed faster strategies (one tree search per alignment) using RAxML, PhyML, or IQ-TREE. Interestingly, single-gene trees inferred by the three programs yielded comparable coalescent-based species tree estimations. For concatenation-based species tree inference, IQ-TREE consistently achieved the best-observed likelihoods for all data sets, and RAxML/ExaML was a close second. In contrast, PhyML often failed to complete concatenation-based analyses, whereas FastTree was the fastest but generated lower likelihood values and more dissimilar tree topologies in both types of analyses. Finally, data matrix properties, such as the number of taxa and the strength of phylogenetic signal, sometimes substantially influenced the programs’ relative performance. Our results provide real-world gene and species tree phylogenetic inference benchmarks to inform the design and execution of large-scale phylogenomic data analyses. PMID:29177474

  12. SpidermiR: An R/Bioconductor Package for Integrative Analysis with miRNA Data.

    PubMed

    Cava, Claudia; Colaprico, Antonio; Bertoli, Gloria; Graudenzi, Alex; Silva, Tiago C; Olsen, Catharina; Noushmehr, Houtan; Bontempi, Gianluca; Mauri, Giancarlo; Castiglioni, Isabella

    2017-01-27

    Gene Regulatory Networks (GRNs) control many biological systems, but how such network coordination is shaped is still unknown. GRNs can be subdivided into basic connections that describe how the network members interact e.g., co-expression, physical interaction, co-localization, genetic influence, pathways, and shared protein domains. The important regulatory mechanisms of these networks involve miRNAs. We developed an R/Bioconductor package, namely SpidermiR, which offers an easy access to both GRNs and miRNAs to the end user, and integrates this information with differentially expressed genes obtained from The Cancer Genome Atlas. Specifically, SpidermiR allows the users to: (i) query and download GRNs and miRNAs from validated and predicted repositories; (ii) integrate miRNAs with GRNs in order to obtain miRNA-gene-gene and miRNA-protein-protein interactions, and to analyze miRNA GRNs in order to identify miRNA-gene communities; and (iii) graphically visualize the results of the analyses. These analyses can be performed through a single interface and without the need for any downloads. The full data sets are then rapidly integrated and processed locally.

  13. Genetic variations in the serotonergic system contribute to amygdala volume in humans.

    PubMed

    Li, Jin; Chen, Chunhui; Wu, Karen; Zhang, Mingxia; Zhu, Bi; Chen, Chuansheng; Moyzis, Robert K; Dong, Qi

    2015-01-01

    The amygdala plays a critical role in emotion processing and psychiatric disorders associated with emotion dysfunction. Accumulating evidence suggests that amygdala structure is modulated by serotonin-related genes. However, there is a gap between the small contributions of single loci (less than 1%) and the reported 63-65% heritability of amygdala structure. To understand the "missing heritability," we systematically explored the contribution of serotonin genes on amygdala structure at the gene set level. The present study of 417 healthy Chinese volunteers examined 129 representative polymorphisms in genes from multiple biological mechanisms in the regulation of serotonin neurotransmission. A system-level approach using multiple regression analyses identified that nine SNPs collectively accounted for approximately 8% of the variance in amygdala volume. Permutation analyses showed that the probability of obtaining these findings by chance was low (p = 0.043, permuted for 1000 times). Findings showed that serotonin genes contribute moderately to individual differences in amygdala volume in a healthy Chinese sample. These results indicate that the system-level approach can help us to understand the genetic basis of a complex trait such as amygdala structure.

  14. Genetic findings in anorexia and bulimia nervosa.

    PubMed

    Hinney, Anke; Scherag, Susann; Hebebrand, Johannes

    2010-01-01

    Anorexia nervosa (AN) and bulimia nervosa (BN) are complex disorders associated with disordered eating behavior. Heritability estimates derived from twin and family studies are high, so that substantial genetic influences on the etiology can be assumed for both. As the monoaminergic neurotransmitter systems are involved in eating disorders (EDs), candidate gene studies have centered on related genes; additionally, genes relevant for body weight regulation have been considered as candidates. Unfortunately, this approach has yielded very few positive results; confirmed associations or findings substantiated in meta-analyses are scant. None of these associations can be considered unequivocally validated. Systematic genome-wide approaches have been performed to identify genes with no a priori evidence for their relevance in EDs. Family-based scans revealed linkage peaks in single chromosomal regions for AN and BN. Analyses of candidate genes in one of these regions led to the identification of genetic variants associated with AN. Currently, an international consortium is conducting a genome-wide association study for AN, which will hopefully lead to the identification of the first genome-wide significant markers. Copyright © 2010 Elsevier Inc. All rights reserved.

  15. Cloning of the IgM heavy chain of the bottlenose dolphin (Tursiops truncatus), and initial analysis of VH gene usage.

    PubMed

    Lundqvist, Mats L; Kohlberg, Kathleen E; Gefroh, Holly A; Arnaud, Philippe; Middleton, Darlene L; Romano, Tracy A; Warr, Gregory W

    2002-07-01

    Clones encoding the dolphin IgM heavy (micro) chain gene were isolated from a cDNA library of peripheral blood leukocytes. Genomic Southern blot analyses showed that the dolphin IGHM gene is most likely present in a single copy, and its sequence shows greatest similarity to those of the IGHM gene of the sheep, pig and cow, evolutionarily related artiodactyls. The transmembrane (TM) form of the IGHM chain was isolated by 3' RACE. While showing similarities to the TM regions of other mammalian IGHM chains, the highly conserved Ser residue of the CART motif is substituted with a Gly in the dolphin. In contrast to the pig and cow, which utilize only a single VH family, the dolphin expresses at least two distinct VH families, belonging to the mammalian VH clans I and III. At least two JH genes were identified in the dolphin. Some CDR3 regions of the dolphin VH are long (up to 21 amino acids), and contain multiple Cys residues, hypothesized to stabilize the CDR3 structure through disulfide bond formation.

  16. The PHF21B gene is associated with major depression and modulates the stress response.

    PubMed

    Wong, M-L; Arcos-Burgos, M; Liu, S; Vélez, J I; Yu, C; Baune, B T; Jawahar, M C; Arolt, V; Dannlowski, U; Chuah, A; Huttley, G A; Fogarty, R; Lewis, M D; Bornstein, S R; Licinio, J

    2017-07-01

    Major depressive disorder (MDD) affects around 350 million people worldwide; however, the underlying genetic basis remains largely unknown. In this study, we took into account that MDD is a gene-environment disorder, in which stress is a critical component, and used whole-genome screening of functional variants to investigate the 'missing heritability' in MDD. Genome-wide association studies (GWAS) using single- and multi-locus linear mixed-effect models were performed in a Los Angeles Mexican-American cohort (196 controls, 203 MDD) and in a replication European-ancestry cohort (499 controls, 473 MDD). Our analyses took into consideration the stress levels in the control populations. The Mexican-American controls, comprised primarily of recent immigrants, had high levels of stress due to acculturation issues and the European-ancestry controls with high stress levels were given higher weights in our analysis. We identified 44 common and rare functional variants associated with mild to moderate MDD in the Mexican-American cohort (genome-wide false discovery rate, FDR, <0.05), and their pathway analysis revealed that the three top overrepresented Gene Ontology (GO) processes were innate immune response, glutamate receptor signaling and detection of chemical stimulus in smell sensory perception. Rare variant analysis replicated the association of the PHF21B gene in the ethnically unrelated European-ancestry cohort. The TRPM2 gene, previously implicated in mood disorders, may also be considered replicated by our analyses. Whole-genome sequencing analyses of a subset of the cohorts revealed that European-ancestry individuals have a significantly reduced (50%) number of single nucleotide variants compared with Mexican-American individuals, and for this reason the role of rare variants may vary across populations. PHF21b variants contribute significantly to differences in the levels of expression of this gene in several brain areas, including the hippocampus. Furthermore, using an animal model of stress, we found that Phf21b hippocampal gene expression is significantly decreased in animals resilient to chronic restraint stress when compared with non-chronically stressed animals. Together, our results reveal that including stress level data enables the identification of novel rare functional variants associated with MDD.

  17. Genome wide association analysis for seedling response traits to thermal stress in sorghum germplasm

    USDA-ARS?s Scientific Manuscript database

    The sorghum association panel exhibited extensive variation for seedling traits under cold and heat stress. Genome-wide analyses identified thirty single nucleotide polymorphisms (SNPs) that were strongly associated with traits measured at seedling stage under cold stress and tagged genes that act a...

  18. iCOSSY: An Online Tool for Context-Specific Subnetwork Discovery from Gene Expression Data

    PubMed Central

    Saha, Ashis; Jeon, Minji; Tan, Aik Choon; Kang, Jaewoo

    2015-01-01

    Pathway analyses help reveal underlying molecular mechanisms of complex biological phenotypes. Biologists tend to perform multiple pathway analyses on the same dataset, as there is no single answer. It is often inefficient for them to implement and/or install all the algorithms by themselves. Online tools can help the community in this regard. Here we present an online gene expression analytical tool called iCOSSY which implements a novel pathway-based COntext-specific Subnetwork discoverY (COSSY) algorithm. iCOSSY also includes a few modifications of COSSY to increase its reliability and interpretability. Users can upload their gene expression datasets, and discover important subnetworks of closely interacting molecules to differentiate between two phenotypes (context). They can also interactively visualize the resulting subnetworks. iCOSSY is a web server that finds subnetworks that are differentially expressed in two phenotypes. Users can visualize the subnetworks to understand the biology of the difference. PMID:26147457

  19. Localization of an Ataxia-Telangiectasia Gene to an −500-kb Interval on Chromosome 11q23.1: Linkage Analysis of 176 Families by an International Consortium

    PubMed Central

    Lange, Ethan; Borresen, Anna-Lise; Chen, Xiaoguang; Chessa, Luciana; Chiplunkar, Sujata; Concannon, Patrick; Dandekar, Sugandha; Gerken, Steven; Lange, Kenneth; Liang, Teresa; McConville, Carmel; Polakow, Jeff; Porras, Oscar; Rotman, Galit; Sanal, Ozden; Sheikhavandi, Sepideh; Shiloh, Yosef; Sobel, Eric; Taylor, Malcolm; Telatar, Milhan; Teraoka, Sharon; Tolun, Aslihan; Udar, Nitin; Uhrhammer, Nancy; Vanagaite, Lina; Wang, Zhijun; Wapelhorst, Beth; Wright, Jocyndra; Yang, Huan-Ming; Yang, Lan; Ziv, Yael; Gatti, Richard A.

    1995-01-01

    We describe a 20-point linkage analysis map of chromosome 11q22-23 that is based on genotyping 249 families (59 CEPH and 190 A-T). Monte Carlo linkage analyses of 176 ataxia-telangiectasia (A-T) families localizes the major A-T locus to the region between S1819(A4) and S1818(A2). When seven nonlinking families were excluded from subsequent analyses, a 2-lod support interval of ∼500 kb was identified between S1819(A4) and S1294. No recombinants were observed between A-T and markers S384, B7, S535, or S1294. Only 17 of the international consortium families have been assigned to complementation groups. The available evidence favors either a cluster of A-T genes on chromosome 11 or intragenic defects in a single gene. PMID:7611279

  20. Discovering causal signaling pathways through gene-expression patterns

    PubMed Central

    Parikh, Jignesh R.; Klinger, Bertram; Xia, Yu; Marto, Jarrod A.; Blüthgen, Nils

    2010-01-01

    High-throughput gene-expression studies result in lists of differentially expressed genes. Most current meta-analyses of these gene lists include searching for significant membership of the translated proteins in various signaling pathways. However, such membership enrichment algorithms do not provide insight into which pathways caused the genes to be differentially expressed in the first place. Here, we present an intuitive approach for discovering upstream signaling pathways responsible for regulating these differentially expressed genes. We identify consistently regulated signature genes specific for signal transduction pathways from a panel of single-pathway perturbation experiments. An algorithm that detects overrepresentation of these signature genes in a gene group of interest is used to infer the signaling pathway responsible for regulation. We expose our novel resource and algorithm through a web server called SPEED: Signaling Pathway Enrichment using Experimental Data sets. SPEED can be freely accessed at http://speed.sys-bio.net/. PMID:20494976

  1. Characterization and Regulation of Aquaporin Genes of Sorghum [Sorghum bicolor (L.) Moench] in Response to Waterlogging Stress

    DOE Office of Scientific and Technical Information (OSTI.GOV)

    Kadam, Suhas; Abril, Alejandra; Dhanapal, Arun P.

    Waterlogging is a significant environmental constraint to crop production, and a better understanding of plant responses is critical for the improvement of crop tolerance to waterlogged soils. Aquaporins (AQPs) are a class of channel-forming proteins that play an important role in water transport in plants. Our study aimed to examine the regulation of AQP genes under waterlogging stress and to characterize the genetic variability of AQP genes in sorghum (Sorghum bicolor). Transcriptional profiling of AQP genes in response to waterlogging stress in nodal root tips and nodal root basal regions of two tolerant and two sensitive sorghum genotypes at 18more » and 96 h after waterlogging stress imposition revealed significant gene-specific pattern with regard to genotype, root tissue sample, and time point. For some tissue sample and time point combinations, PIP2-6, PIP2-7, TIP2-2, TIP4-4, and TIP5-1 expression was differentially regulated in tolerant compared to sensitive genotypes. The differential response of these AQP genes suggests that they may play a tissue specific role in mitigating waterlogging stress. Genetic analysis of sorghum revealed that AQP genes were clustered into the same four subfamilies as in maize (Zea mays) and rice (Oryza sativa) and that residues determining the AQP channel specificity were largely conserved across species. Single nucleotide polymorphism (SNP) data from 50 sorghum accessions were used to build an AQP gene-based phylogeny of the haplotypes. Phylogenetic analysis based on single nucleotide polymorphisms of sorghum AQP genes placed the tolerant and sensitive genotypes used for the expression study in distinct groups. Expression analyses suggested that selected AQPs may play a pivotal role in sorghum tolerance to water logging stress. Furthermore experimentation is needed to verify their role and to leverage phylogenetic analyses and AQP expression data to improve water logging tolerance in sorghum.« less

  2. Characterization and Regulation of Aquaporin Genes of Sorghum [Sorghum bicolor (L.) Moench] in Response to Waterlogging Stress

    DOE PAGES

    Kadam, Suhas; Abril, Alejandra; Dhanapal, Arun P.; ...

    2017-05-30

    Waterlogging is a significant environmental constraint to crop production, and a better understanding of plant responses is critical for the improvement of crop tolerance to waterlogged soils. Aquaporins (AQPs) are a class of channel-forming proteins that play an important role in water transport in plants. Our study aimed to examine the regulation of AQP genes under waterlogging stress and to characterize the genetic variability of AQP genes in sorghum (Sorghum bicolor). Transcriptional profiling of AQP genes in response to waterlogging stress in nodal root tips and nodal root basal regions of two tolerant and two sensitive sorghum genotypes at 18more » and 96 h after waterlogging stress imposition revealed significant gene-specific pattern with regard to genotype, root tissue sample, and time point. For some tissue sample and time point combinations, PIP2-6, PIP2-7, TIP2-2, TIP4-4, and TIP5-1 expression was differentially regulated in tolerant compared to sensitive genotypes. The differential response of these AQP genes suggests that they may play a tissue specific role in mitigating waterlogging stress. Genetic analysis of sorghum revealed that AQP genes were clustered into the same four subfamilies as in maize (Zea mays) and rice (Oryza sativa) and that residues determining the AQP channel specificity were largely conserved across species. Single nucleotide polymorphism (SNP) data from 50 sorghum accessions were used to build an AQP gene-based phylogeny of the haplotypes. Phylogenetic analysis based on single nucleotide polymorphisms of sorghum AQP genes placed the tolerant and sensitive genotypes used for the expression study in distinct groups. Expression analyses suggested that selected AQPs may play a pivotal role in sorghum tolerance to water logging stress. Furthermore experimentation is needed to verify their role and to leverage phylogenetic analyses and AQP expression data to improve water logging tolerance in sorghum.« less

  3. Single cell genomics of uncultured marine alveolates shows paraphyly of basal dinoflagellates.

    PubMed

    Strassert, Jürgen F H; Karnkowska, Anna; Hehenberger, Elisabeth; Del Campo, Javier; Kolisko, Martin; Okamoto, Noriko; Burki, Fabien; Janouškovec, Jan; Poirier, Camille; Leonard, Guy; Hallam, Steven J; Richards, Thomas A; Worden, Alexandra Z; Santoro, Alyson E; Keeling, Patrick J

    2018-01-01

    Marine alveolates (MALVs) are diverse and widespread early-branching dinoflagellates, but most knowledge of the group comes from a few cultured species that are generally not abundant in natural samples, or from diversity analyses of PCR-based environmental SSU rRNA gene sequences. To more broadly examine MALV genomes, we generated single cell genome sequences from seven individually isolated cells. Genes expected of heterotrophic eukaryotes were found, with interesting exceptions like presence of proteorhodopsin and vacuolar H + -pyrophosphatase. Phylogenetic analysis of concatenated SSU and LSU rRNA gene sequences provided strong support for the paraphyly of MALV lineages. Dinoflagellate viral nucleoproteins were found only in MALV groups that branched as sister to dinokaryotes. Our findings indicate that multiple independent origins of several characteristics early in dinoflagellate evolution, such as a parasitic life style, underlie the environmental diversity of MALVs, and suggest they have more varied trophic modes than previously thought.

  4. Polymorphisms in the AOX2 gene are associated with the rooting ability of olive cuttings.

    PubMed

    Hedayati, Vahideh; Mousavi, Amir; Razavi, Khadijeh; Cultrera, Nicolò; Alagna, Fiammetta; Mariotti, Roberto; Hosseini-Mazinani, Mehdi; Baldoni, Luciana

    2015-07-01

    Different rooting ability candidate genes were tested on an olive cross progeny. Our results demonstrated that only the AOX2 gene was strongly induced. OeAOX2 was fully characterised and correlated to phenotypical traits. The formation of adventitious roots is a key step in the vegetative propagation of trees crop species, and this ability is under strict genetic control. While numerous studies have been carried out to identify genes controlling adventitious root formation, only a few loci have been characterised. In this work, candidate genes that were putatively involved in rooting ability were identified in olive (Olea europaea L.) by similarity with orthologs identified in other plant species. The mRNA levels of these genes were analysed by real-time PCR during root induction in high- (HR) and low-rooting (LR) individuals. Interestingly, alternative oxidase 2 (AOX2), which was previously reported to be a functional marker for rooting in olive cuttings, showed a strong induction in HR individuals. From the OeAOX2 full-length gene, alleles and effective polymorphisms were distinguished and analysed in the cross progeny, which were segregated based on rooting. The results revealed a possible correlation between two single nucleotide polymorphisms of OeAOX2 gene and rooting ability.

  5. A genome-wide association study of corneal astigmatism: The CREAM Consortium.

    PubMed

    Shah, Rupal L; Li, Qing; Zhao, Wanting; Tedja, Milly S; Tideman, J Willem L; Khawaja, Anthony P; Fan, Qiao; Yazar, Seyhan; Williams, Katie M; Verhoeven, Virginie J M; Xie, Jing; Wang, Ya Xing; Hess, Moritz; Nickels, Stefan; Lackner, Karl J; Pärssinen, Olavi; Wedenoja, Juho; Biino, Ginevra; Concas, Maria Pina; Uitterlinden, André; Rivadeneira, Fernando; Jaddoe, Vincent W V; Hysi, Pirro G; Sim, Xueling; Tan, Nicholas; Tham, Yih-Chung; Sensaki, Sonoko; Hofman, Albert; Vingerling, Johannes R; Jonas, Jost B; Mitchell, Paul; Hammond, Christopher J; Höhn, René; Baird, Paul N; Wong, Tien-Yin; Cheng, Chinfsg-Yu; Teo, Yik Ying; Mackey, David A; Williams, Cathy; Saw, Seang-Mei; Klaver, Caroline C W; Guggenheim, Jeremy A; Bailey-Wilson, Joan E

    2018-01-01

    To identify genes and genetic markers associated with corneal astigmatism. A meta-analysis of genome-wide association studies (GWASs) of corneal astigmatism undertaken for 14 European ancestry (n=22,250) and 8 Asian ancestry (n=9,120) cohorts was performed by the Consortium for Refractive Error and Myopia. Cases were defined as having >0.75 diopters of corneal astigmatism. Subsequent gene-based and gene-set analyses of the meta-analyzed results of European ancestry cohorts were performed using VEGAS2 and MAGMA software. Additionally, estimates of single nucleotide polymorphism (SNP)-based heritability for corneal and refractive astigmatism and the spherical equivalent were calculated for Europeans using LD score regression. The meta-analysis of all cohorts identified a genome-wide significant locus near the platelet-derived growth factor receptor alpha ( PDGFRA ) gene: top SNP: rs7673984, odds ratio=1.12 (95% CI:1.08-1.16), p=5.55×10 -9 . No other genome-wide significant loci were identified in the combined analysis or European/Asian ancestry-specific analyses. Gene-based analysis identified three novel candidate genes for corneal astigmatism in Europeans-claudin-7 ( CLDN7 ), acid phosphatase 2, lysosomal ( ACP2 ), and TNF alpha-induced protein 8 like 3 ( TNFAIP8L3 ). In addition to replicating a previously identified genome-wide significant locus for corneal astigmatism near the PDGFRA gene, gene-based analysis identified three novel candidate genes, CLDN7 , ACP2 , and TNFAIP8L3 , that warrant further investigation to understand their role in the pathogenesis of corneal astigmatism. The much lower number of genetic variants and genes demonstrating an association with corneal astigmatism compared to published spherical equivalent GWAS analyses suggest a greater influence of rare genetic variants, non-additive genetic effects, or environmental factors in the development of astigmatism.

  6. A multiplexed single-cell CRISPR screening platform enables systematic dissection of the unfolded protein response

    PubMed Central

    Adamson, Britt; Norman, Thomas M.; Jost, Marco; Cho, Min Y.; Nuñez, James K.; Chen, Yuwen; Villalta, Jacqueline E.; Gilbert, Luke A.; Horlbeck, Max A.; Hein, Marco Y.; Pak, Ryan A.; Gray, Andrew N.; Gross, Carol A.; Dixit, Atray; Parnas, Oren; Regev, Aviv; Weissman, Jonathan S.

    2016-01-01

    SUMMARY Functional genomics efforts face tradeoffs between number of perturbations examined and complexity of phenotypes measured. We bridge this gap with Perturb-seq, which combines droplet-based single-cell RNA-seq with a strategy for barcoding CRISPR-mediated perturbations, allowing many perturbations to be profiled in pooled format. We applied Perturb-seq to dissect the mammalian unfolded protein response (UPR) using single and combinatorial CRISPR perturbations. Two genome-scale CRISPR interference (CRISPRi) screens identified genes whose repression perturbs ER homeostasis. Subjecting ~100 hits to Perturb-seq enabled high-precision functional clustering of genes. Single-cell analyses decoupled the three UPR branches, revealed bifurcated UPR branch activation among cells subject to the same perturbation, and uncovered differential activation of the branches across hits, including an isolated feedback loop between the translocon and IRE1α. These studies provide insight into how the three sensors of ER homeostasis monitor distinct types of stress and highlight the ability of Perturb-seq to dissect complex cellular responses. PMID:27984733

  7. An Updated Collection of Sequence Barcoded Temperature-Sensitive Alleles of Yeast Essential Genes

    PubMed Central

    Kofoed, Megan; Milbury, Karissa L.; Chiang, Jennifer H.; Sinha, Sunita; Ben-Aroya, Shay; Giaever, Guri; Nislow, Corey; Hieter, Philip; Stirling, Peter C.

    2015-01-01

    Systematic analyses of essential gene function using mutant collections in Saccharomyces cerevisiae have been conducted using collections of heterozygous diploids, promoter shut-off alleles, through alleles with destabilized mRNA, destabilized protein, or bearing mutations that lead to a temperature-sensitive (ts) phenotype. We previously described a method for construction of barcoded ts alleles in a systematic fashion. Here we report the completion of this collection of alleles covering 600 essential yeast genes. This resource covers a larger gene repertoire than previous collections and provides a complementary set of strains suitable for single gene and genomic analyses. We use deep sequencing to characterize the amino acid changes leading to the ts phenotype in half of the alleles. We also use high-throughput approaches to describe the relative ts behavior of the alleles. Finally, we demonstrate the experimental usefulness of the collection in a high-content, functional genomic screen for ts alleles that increase spontaneous P-body formation. By increasing the number of alleles and improving the annotation, this ts collection will serve as a community resource for probing new aspects of biology for essential yeast genes. PMID:26175450

  8. Arthropod phylogeny based on eight molecular loci and morphology

    NASA Technical Reports Server (NTRS)

    Giribet, G.; Edgecombe, G. D.; Wheeler, W. C.

    2001-01-01

    The interrelationships of major clades within the Arthropoda remain one of the most contentious issues in systematics, which has traditionally been the domain of morphologists. A growing body of DNA sequences and other types of molecular data has revitalized study of arthropod phylogeny and has inspired new considerations of character evolution. Novel hypotheses such as a crustacean-hexapod affinity were based on analyses of single or few genes and limited taxon sampling, but have received recent support from mitochondrial gene order, and eye and brain ultrastructure and neurogenesis. Here we assess relationships within Arthropoda based on a synthesis of all well sampled molecular loci together with a comprehensive data set of morphological, developmental, ultrastructural and gene-order characters. The molecular data include sequences of three nuclear ribosomal genes, three nuclear protein-coding genes, and two mitochondrial genes (one protein coding, one ribosomal). We devised new optimization procedures and constructed a parallel computer cluster with 256 central processing units to analyse molecular data on a scale not previously possible. The optimal 'total evidence' cladogram supports the crustacean-hexapod clade, recognizes pycnogonids as sister to other euarthropods, and indicates monophyly of Myriapoda and Mandibulata.

  9. The low-abundance transcriptome reveals novel biomarkers, specific intracellular pathways and targetable genes associated with advanced gastric cancer.

    PubMed

    Bizama, Carolina; Benavente, Felipe; Salvatierra, Edgardo; Gutiérrez-Moraga, Ana; Espinoza, Jaime A; Fernández, Elmer A; Roa, Iván; Mazzolini, Guillermo; Sagredo, Eduardo A; Gidekel, Manuel; Podhajcer, Osvaldo L

    2014-02-15

    Studies on the low-abundance transcriptome are of paramount importance for identifying the intimate mechanisms of tumor progression that can lead to novel therapies. The aim of the present study was to identify novel markers and targetable genes and pathways in advanced human gastric cancer through analyses of the low-abundance transcriptome. The procedure involved an initial subtractive hybridization step, followed by global gene expression analysis using microarrays. We observed profound differences, both at the single gene and gene ontology levels, between the low-abundance transcriptome and the whole transcriptome. Analysis of the low-abundance transcriptome led to the identification and validation by tissue microarrays of novel biomarkers, such as LAMA3 and TTN; moreover, we identified cancer type-specific intracellular pathways and targetable genes, such as IRS2, IL17, IFNγ, VEGF-C, WISP1, FZD5 and CTBP1 that were not detectable by whole transcriptome analyses. We also demonstrated that knocking down the expression of CTBP1 sensitized gastric cancer cells to mainstay chemotherapeutic drugs. We conclude that the analysis of the low-abundance transcriptome provides useful insights into the molecular basis and treatment of cancer. © 2013 UICC.

  10. An Updated Collection of Sequence Barcoded Temperature-Sensitive Alleles of Yeast Essential Genes.

    PubMed

    Kofoed, Megan; Milbury, Karissa L; Chiang, Jennifer H; Sinha, Sunita; Ben-Aroya, Shay; Giaever, Guri; Nislow, Corey; Hieter, Philip; Stirling, Peter C

    2015-07-14

    Systematic analyses of essential gene function using mutant collections in Saccharomyces cerevisiae have been conducted using collections of heterozygous diploids, promoter shut-off alleles, through alleles with destabilized mRNA, destabilized protein, or bearing mutations that lead to a temperature-sensitive (ts) phenotype. We previously described a method for construction of barcoded ts alleles in a systematic fashion. Here we report the completion of this collection of alleles covering 600 essential yeast genes. This resource covers a larger gene repertoire than previous collections and provides a complementary set of strains suitable for single gene and genomic analyses. We use deep sequencing to characterize the amino acid changes leading to the ts phenotype in half of the alleles. We also use high-throughput approaches to describe the relative ts behavior of the alleles. Finally, we demonstrate the experimental usefulness of the collection in a high-content, functional genomic screen for ts alleles that increase spontaneous P-body formation. By increasing the number of alleles and improving the annotation, this ts collection will serve as a community resource for probing new aspects of biology for essential yeast genes. Copyright © 2015 Kofoed et al.

  11. Incorporating Single-nucleotide Polymorphisms Into the Lyman Model to Improve Prediction of Radiation Pneumonitis

    DOE Office of Scientific and Technical Information (OSTI.GOV)

    Tucker, Susan L., E-mail: sltucker@mdanderson.org; Li Minghuan; Xu Ting

    2013-01-01

    Purpose: To determine whether single-nucleotide polymorphisms (SNPs) in genes associated with DNA repair, cell cycle, transforming growth factor-{beta}, tumor necrosis factor and receptor, folic acid metabolism, and angiogenesis can significantly improve the fit of the Lyman-Kutcher-Burman (LKB) normal-tissue complication probability (NTCP) model of radiation pneumonitis (RP) risk among patients with non-small cell lung cancer (NSCLC). Methods and Materials: Sixteen SNPs from 10 different genes (XRCC1, XRCC3, APEX1, MDM2, TGF{beta}, TNF{alpha}, TNFR, MTHFR, MTRR, and VEGF) were genotyped in 141 NSCLC patients treated with definitive radiation therapy, with or without chemotherapy. The LKB model was used to estimate the risk ofmore » severe (grade {>=}3) RP as a function of mean lung dose (MLD), with SNPs and patient smoking status incorporated into the model as dose-modifying factors. Multivariate analyses were performed by adding significant factors to the MLD model in a forward stepwise procedure, with significance assessed using the likelihood-ratio test. Bootstrap analyses were used to assess the reproducibility of results under variations in the data. Results: Five SNPs were selected for inclusion in the multivariate NTCP model based on MLD alone. SNPs associated with an increased risk of severe RP were in genes for TGF{beta}, VEGF, TNF{alpha}, XRCC1 and APEX1. With smoking status included in the multivariate model, the SNPs significantly associated with increased risk of RP were in genes for TGF{beta}, VEGF, and XRCC3. Bootstrap analyses selected a median of 4 SNPs per model fit, with the 6 genes listed above selected most often. Conclusions: This study provides evidence that SNPs can significantly improve the predictive ability of the Lyman MLD model. With a small number of SNPs, it was possible to distinguish cohorts with >50% risk vs <10% risk of RP when they were exposed to high MLDs.« less

  12. Association study between kynurenine 3-monooxygenase gene and schizophrenia in the Japanese population.

    PubMed

    Aoyama, N; Takahashi, N; Saito, S; Maeno, N; Ishihara, R; Ji, X; Miura, H; Ikeda, M; Suzuki, T; Kitajima, T; Yamanouchi, Y; Kinoshita, Y; Yoshida, K; Iwata, N; Inada, T; Ozaki, N

    2006-06-01

    Several lines of evidence suggest that metabolic changes in the kynurenic acid (KYNA) pathway are related to the etiology of schizophrenia. The inhibitor of kynurenine 3-monooxygenase (KMO) is known to increase KYNA levels, and the KMO gene is located in the chromosome region associated with schizophrenia, 1q42-q44. Single-marker and haplotype analyses for 6-tag single nucleotide polymorphisms (SNPs) of KMO were performed (cases = 465, controls = 440). Significant association of rs2275163 with schizophrenia was observed by single-marker comparisons (P = 0.032) and haplotype analysis including this SNP (P = 0.0049). Significant association of rs2275163 and haplotype was not replicated using a second, independent set of samples (cases = 480, controls = 448) (P = 0.706 and P = 0.689, respectively). These results suggest that the KMO is unlikely to be related to the development of schizophrenia in Japanese.

  13. The contribution of individual and pairwise combinations of SNPs in the APOA1 and APOC3 genes to interindividual HDL-C variability.

    PubMed

    Brown, C M; Rea, T J; Hamon, S C; Hixson, J E; Boerwinkle, E; Clark, A G; Sing, C F

    2006-07-01

    Apolipoproteins (apo) A-I and C-III are components of high-density lipoprotein-cholesterol (HDL-C), a quantitative trait negatively correlated with risk of cardiovascular disease (CVD). We analyzed the contribution of individual and pairwise combinations of single nucleotide polymorphisms (SNPs) in the APOA1/APOC3 genes to HDL-C variability to evaluate (1) consistency of published single-SNP studies with our single-SNP analyses; (2) consistency of single-SNP and two-SNP phenotype-genotype relationships across race-, gender-, and geographical location-dependent contexts; and (3) the contribution of single SNPs and pairs of SNPs to variability beyond that explained by plasma apo A-I concentration. We analyzed 45 SNPs in 3,831 young African-American (N=1,858) and European-American (N=1,973) females and males ascertained by the Coronary Artery Risk Development in Young Adults (CARDIA) study. We found three SNPs that significantly impact HDL-C variability in both the literature and the CARDIA sample. Single-SNP analyses identified only one of five significant HDL-C SNP genotype relationships in the CARDIA study that was consistent across all race-, gender-, and geographical location-dependent contexts. The other four were consistent across geographical locations for a particular race-gender context. The portion of total phenotypic variance explained by single-SNP genotypes and genotypes defined by pairs of SNPs was less than 3%, an amount that is miniscule compared to the contribution explained by variability in plasma apo A-I concentration. Our findings illustrate the impact of context-dependence on SNP selection for prediction of CVD risk factor variability.

  14. An autonomous molecular computer for logical control of gene expression.

    PubMed

    Benenson, Yaakov; Gil, Binyamin; Ben-Dor, Uri; Adar, Rivka; Shapiro, Ehud

    2004-05-27

    Early biomolecular computer research focused on laboratory-scale, human-operated computers for complex computational problems. Recently, simple molecular-scale autonomous programmable computers were demonstrated allowing both input and output information to be in molecular form. Such computers, using biological molecules as input data and biologically active molecules as outputs, could produce a system for 'logical' control of biological processes. Here we describe an autonomous biomolecular computer that, at least in vitro, logically analyses the levels of messenger RNA species, and in response produces a molecule capable of affecting levels of gene expression. The computer operates at a concentration of close to a trillion computers per microlitre and consists of three programmable modules: a computation module, that is, a stochastic molecular automaton; an input module, by which specific mRNA levels or point mutations regulate software molecule concentrations, and hence automaton transition probabilities; and an output module, capable of controlled release of a short single-stranded DNA molecule. This approach might be applied in vivo to biochemical sensing, genetic engineering and even medical diagnosis and treatment. As a proof of principle we programmed the computer to identify and analyse mRNA of disease-related genes associated with models of small-cell lung cancer and prostate cancer, and to produce a single-stranded DNA molecule modelled after an anticancer drug.

  15. Cross-organism learning method to discover new gene functionalities.

    PubMed

    Domeniconi, Giacomo; Masseroli, Marco; Moro, Gianluca; Pinoli, Pietro

    2016-04-01

    Knowledge of gene and protein functions is paramount for the understanding of physiological and pathological biological processes, as well as in the development of new drugs and therapies. Analyses for biomedical knowledge discovery greatly benefit from the availability of gene and protein functional feature descriptions expressed through controlled terminologies and ontologies, i.e., of gene and protein biomedical controlled annotations. In the last years, several databases of such annotations have become available; yet, these valuable annotations are incomplete, include errors and only some of them represent highly reliable human curated information. Computational techniques able to reliably predict new gene or protein annotations with an associated likelihood value are thus paramount. Here, we propose a novel cross-organisms learning approach to reliably predict new functionalities for the genes of an organism based on the known controlled annotations of the genes of another, evolutionarily related and better studied, organism. We leverage a new representation of the annotation discovery problem and a random perturbation of the available controlled annotations to allow the application of supervised algorithms to predict with good accuracy unknown gene annotations. Taking advantage of the numerous gene annotations available for a well-studied organism, our cross-organisms learning method creates and trains better prediction models, which can then be applied to predict new gene annotations of a target organism. We tested and compared our method with the equivalent single organism approach on different gene annotation datasets of five evolutionarily related organisms (Homo sapiens, Mus musculus, Bos taurus, Gallus gallus and Dictyostelium discoideum). Results show both the usefulness of the perturbation method of available annotations for better prediction model training and a great improvement of the cross-organism models with respect to the single-organism ones, without influence of the evolutionary distance between the considered organisms. The generated ranked lists of reliably predicted annotations, which describe novel gene functionalities and have an associated likelihood value, are very valuable both to complement available annotations, for better coverage in biomedical knowledge discovery analyses, and to quicken the annotation curation process, by focusing it on the prioritized novel annotations predicted. Copyright © 2015 Elsevier Ireland Ltd. All rights reserved.

  16. Comparative Analyses of Single-Nucleotide Polymorphisms in the TNF Promoter Region Provide Further Validation for the Vervet Monkey Model of Obesity

    PubMed Central

    Gray, Stanton B; Howard, Timothy D; Langefeld, Carl D; Hawkins, Gregory A; Diallo, Abdoulaye F; Wagner, Janice D

    2009-01-01

    Tumor necrosis factor is a cytokine that plays critical roles in inflammation, the innate immune response, and a variety of other physiologic and pathophysiologic processes. In addition, TNF has recently been shown to mediate an intersection of chronic, low-grade inflammation and concurrent metabolic dysregulation associated with obesity and its comorbidities. As part of an ongoing initiative to further characterize vervet monkeys originating from St Kitts as an animal model of obesity and inflammation, we sequenced and genotyped the human ortholog vervet TNF gene and approximately 1 kb of the flanking 3′ and 5′ regions from 265 monkeys in a closed, pedigreed colony. This process revealed a total of 11 single-nucleotide polymorphisms (SNPs) and a single 4-bp insertion–deletion, with minor allele frequencies of 0.08 to 0.39. Many of these polymorphisms were in strong or complete linkage disequilibrium with each other, and all but 1 were contained within a single haplotype block, comprising 5 haplotypes with frequencies of 0.075 to 0.298. Using sequences from humans, chimpanzees, vervets, baboons, and rhesus macaques, phylogenetic shadowing of the TNF promoter region revealed that vervet SNPs, like the SNPs in related species, were clustered nonrandomly and nonuniformly around conserved transcription factor binding sites. These data, combined with previously defined heritable phenotypes, permit future association analyses in this nonhuman primate model and have great potential to help dissect the genetic and nongenetic contributions to complex diseases like obesity. More broadly, the sequence data and comparative analyses reported herein facilitates study of the evolution of regulatory sequences of inflammatory and immune-related genes. PMID:20034434

  17. Nonsyndromic cleft lip with or without cleft palate: Increased burden of rare variants within Gremlin-1, a component of the bone morphogenetic protein 4 pathway.

    PubMed

    Al Chawa, Taofik; Ludwig, Kerstin U; Fier, Heide; Pötzsch, Bernd; Reich, Rudolf H; Schmidt, Gül; Braumann, Bert; Daratsianos, Nikolaos; Böhmer, Anne C; Schuencke, Hannah; Alblas, Margrieta; Fricker, Nadine; Hoffmann, Per; Knapp, Michael; Lange, Christoph; Nöthen, Markus M; Mangold, Elisabeth

    2014-06-01

    The genes Gremlin-1 (GREM1) and Noggin (NOG) are components of the bone morphogenetic protein 4 pathway, which has been implicated in craniofacial development. Both genes map to recently identified susceptibility loci (chromosomal region 15q13, 17q22) for nonsyndromic cleft lip with or without cleft palate (nsCL/P). The aim of the present study was to determine whether rare variants in either gene are implicated in nsCL/P etiology. The complete coding regions, untranslated regions, and splice sites of GREM1 and NOG were sequenced in 96 nsCL/P patients and 96 controls of Central European ethnicity. Three burden and four nonburden tests were performed. Statistically significant results were followed up in a second case-control sample (n = 96, respectively). For rare variants observed in cases, segregation analyses were performed. In NOG, four rare sequence variants (minor allele frequency < 1%) were identified. Here, burden and nonburden analyses generated nonsignificant results. In GREM1, 33 variants were identified, 15 of which were rare. Of these, five were novel. Significant p-values were generated in three nonburden analyses. Segregation analyses revealed incomplete penetrance for all variants investigated. Our study did not provide support for NOG being the causal gene at 17q22. However, the observation of a significant excess of rare variants in GREM1 supports the hypothesis that this is the causal gene at chr. 15q13. Because no single causal variant was identified, future sequencing analyses of GREM1 should involve larger samples and the investigation of regulatory elements. © 2014 Wiley Periodicals, Inc.

  18. Evolution of gremlin 2 in cetartiodactyl mammals: gene loss coincides with lack of upper jaw incisors in ruminants.

    PubMed

    Opazo, Juan C; Zavala, Kattina; Krall, Paola; Arias, Rodrigo A

    2017-01-01

    Understanding the processes that give rise to genomic variability in extant species is an active area of research within evolutionary biology. With the availability of whole genome sequences, it is possible to quantify different forms of variability such as variation in gene copy number, which has been described as an important source of genetic variability and in consequence of phenotypic variability. Most of the research on this topic has been focused on understanding the biological significance of gene duplication, and less attention has been given to the evolutionary role of gene loss. Gremlin 2 is a member of the DAN gene family and plays a significant role in tooth development by blocking the ligand-signaling pathway of BMP2 and BMP4. The goal of this study was to investigate the evolutionary history of gremlin 2 in cetartiodactyl mammals, a group that possesses highly divergent teeth morphology. Results from our analyses indicate that gremlin 2 has experienced a mixture of gene loss, gene duplication, and rate acceleration. Although the last common ancestor of cetartiodactyls possessed a single gene copy, pigs and camels are the only cetartiodactyl groups that have retained gremlin 2. According to the phyletic distribution of this gene and synteny analyses, we propose that gremlin 2 was lost in the common ancestor of ruminants and cetaceans between 56.3 and 63.5 million years ago as a product of a chromosomal rearrangement. Our analyses also indicate that the rate of evolution of gremlin 2 has been accelerated in the two groups that have retained this gene. Additionally, the lack of this gene could explain the high diversity of teeth among cetartiodactyl mammals; specifically, the presence of this gene could act as a biological constraint. Thus, our results support the notions that gene loss is a way to increase phenotypic diversity and that gremlin 2 is a dispensable gene, at least in cetartiodactyl mammals.

  19. ICan: an integrated co-alteration network to identify ovarian cancer-related genes.

    PubMed

    Zhou, Yuanshuai; Liu, Yongjing; Li, Kening; Zhang, Rui; Qiu, Fujun; Zhao, Ning; Xu, Yan

    2015-01-01

    Over the last decade, an increasing number of integrative studies on cancer-related genes have been published. Integrative analyses aim to overcome the limitation of a single data type, and provide a more complete view of carcinogenesis. The vast majority of these studies used sample-matched data of gene expression and copy number to investigate the impact of copy number alteration on gene expression, and to predict and prioritize candidate oncogenes and tumor suppressor genes. However, correlations between genes were neglected in these studies. Our work aimed to evaluate the co-alteration of copy number, methylation and expression, allowing us to identify cancer-related genes and essential functional modules in cancer. We built the Integrated Co-alteration network (ICan) based on multi-omics data, and analyzed the network to uncover cancer-related genes. After comparison with random networks, we identified 155 ovarian cancer-related genes, including well-known (TP53, BRCA1, RB1 and PTEN) and also novel cancer-related genes, such as PDPN and EphA2. We compared the results with a conventional method: CNAmet, and obtained a significantly better area under the curve value (ICan: 0.8179, CNAmet: 0.5183). In this paper, we describe a framework to find cancer-related genes based on an Integrated Co-alteration network. Our results proved that ICan could precisely identify candidate cancer genes and provide increased mechanistic understanding of carcinogenesis. This work suggested a new research direction for biological network analyses involving multi-omics data.

  20. ICan: An Integrated Co-Alteration Network to Identify Ovarian Cancer-Related Genes

    PubMed Central

    Zhou, Yuanshuai; Liu, Yongjing; Li, Kening; Zhang, Rui; Qiu, Fujun; Zhao, Ning; Xu, Yan

    2015-01-01

    Background Over the last decade, an increasing number of integrative studies on cancer-related genes have been published. Integrative analyses aim to overcome the limitation of a single data type, and provide a more complete view of carcinogenesis. The vast majority of these studies used sample-matched data of gene expression and copy number to investigate the impact of copy number alteration on gene expression, and to predict and prioritize candidate oncogenes and tumor suppressor genes. However, correlations between genes were neglected in these studies. Our work aimed to evaluate the co-alteration of copy number, methylation and expression, allowing us to identify cancer-related genes and essential functional modules in cancer. Results We built the Integrated Co-alteration network (ICan) based on multi-omics data, and analyzed the network to uncover cancer-related genes. After comparison with random networks, we identified 155 ovarian cancer-related genes, including well-known (TP53, BRCA1, RB1 and PTEN) and also novel cancer-related genes, such as PDPN and EphA2. We compared the results with a conventional method: CNAmet, and obtained a significantly better area under the curve value (ICan: 0.8179, CNAmet: 0.5183). Conclusion In this paper, we describe a framework to find cancer-related genes based on an Integrated Co-alteration network. Our results proved that ICan could precisely identify candidate cancer genes and provide increased mechanistic understanding of carcinogenesis. This work suggested a new research direction for biological network analyses involving multi-omics data. PMID:25803614

  1. Efficient CRISPR/Cas9-mediated gene editing in Arabidopsis thaliana and inheritance of modified genes in the T2 and T3 generations.

    PubMed

    Jiang, WenZhi; Yang, Bing; Weeks, Donald P

    2014-01-01

    The newly developed CRISPR/Cas9 system for targeted gene knockout or editing has recently been shown to function in plants in both transient expression systems as well as in primary T1 transgenic plants. However, stable transmission of genes modified by the Cas9/single guide RNA (sgRNA) system to the T2 generation and beyond has not been demonstrated. Here we provide extensive data demonstrating the efficiency of Cas9/sgRNA in causing modification of a chromosomally integrated target reporter gene during early development of transgenic Arabidopsis plants and inheritance of the modified gene in T2 and T3 progeny. Efficient conversion of a nonfunctional, out-of-frame GFP gene to a functional GFP gene was confirmed in T1 plants by the observation of green fluorescent signals in leaf tissues as well as the presence of mutagenized DNA sequences at the sgRNA target site within the GFP gene. All GFP-positive T1 transgenic plants and nearly all GFP-negative plants examined contained mutagenized GFP genes. Analyses of 42 individual T2 generation plants derived from 6 different T1 progenitor plants showed that 50% of T2 plants inherited a single T-DNA insert. The efficiency of the Cas9/sgRNA system and stable inheritance of edited genes point to the promise of this system for facile editing of plant genes.

  2. Association of Single-Nucleotide Polymorphisms of the Tau Gene With Late-Onset Parkinson Disease

    PubMed Central

    Martin, Eden R.; Scott, William K.; Nance, Martha A.; Watts, Ray L.; Hubble, Jean P.; Koller, William C.; Lyons, Kelly; Pahwa, Rajesh; Stern, Matthew B.; Colcher, Amy; Hiner, Bradley C.; Jankovic, Joseph; Ondo, William G.; Allen, Fred H.; Goetz, Christopher G.; Small, Gary W.; Masterman, Donna; Mastaglia, Frank; Laing, Nigel G.; Stajich, Jeffrey M.; Ribble, Robert C.; Booze, Michael W.; Rogala, Allison; Hauser, Michael A.; Zhang, Fengyu; Gibson, Rachel A.; Middleton, Lefkos T.; Roses, Allen D.; Haines, Jonathan L.; Scott, Burton L.; Pericak-Vance, Margaret A.; Vance, Jeffery M.

    2013-01-01

    Context The human tau gene, which promotes assembly of neuronal microtubules, has been associated with several rare neurologic diseases that clinically include parkinsonian features. We recently observed linkage in idiopathic Parkinson disease (PD) to a region on chromosome 17q21 that contains the tau gene. These factors make tau a good candidate for investigation as a susceptibility gene for idiopathic PD, the most common form of the disease. Objective To investigate whether the tau gene is involved in idiopathic PD. Design, Setting, and Participants Among a sample of 1056 individuals from 235 families selected from 13 clinical centers in the United States and Australia and from a family ascertainment core center, we tested 5 single-nucleotide polymorphisms (SNPs) within the tau gene for association with PD, using family-based tests of association. Both affected (n = 426) and unaffected (n = 579) family members were included; 51 individuals had unclear PD status. Analyses were conducted to test individual SNPs and SNP haplotypes within the tau gene. Main Outcome Measure Family-based tests of association, calculated using asymptotic distributions. Results Analysis of association between the SNPs and PD yielded significant evidence of association for 3 of the 5 SNPs tested: SNP 3, P = .03; SNP 9i, P = .04; and SNP 11, P = .04. The 2 other SNPs did not show evidence of significant association (SNP 9ii, P = .11, and SNP 9iii, P = .87). Strong evidence of association was found with haplotype analysis, with a positive association with one haplotype (P = .009) and a negative association with another haplotype (P = .007). Substantial linkage disequilibrium (P<.001) was detected between 4 of the 5 SNPs (SNPs 3,9i, 9ii, and 11). Conclusions This integrated approach of genetic linkage and positional association analyses implicates tau as a susceptibility gene for idiopathic PD. PMID:11710889

  3. Systematic evaluation of RNA quality, microarray data reliability and pathway analysis in fresh, fresh frozen and formalin-fixed paraffin-embedded tissue samples.

    PubMed

    Wimmer, Isabella; Tröscher, Anna R; Brunner, Florian; Rubino, Stephen J; Bien, Christian G; Weiner, Howard L; Lassmann, Hans; Bauer, Jan

    2018-04-20

    Formalin-fixed paraffin-embedded (FFPE) tissues are valuable resources commonly used in pathology. However, formalin fixation modifies nucleic acids challenging the isolation of high-quality RNA for genetic profiling. Here, we assessed feasibility and reliability of microarray studies analysing transcriptome data from fresh, fresh-frozen (FF) and FFPE tissues. We show that reproducible microarray data can be generated from only 2 ng FFPE-derived RNA. For RNA quality assessment, fragment size distribution (DV200) and qPCR proved most suitable. During RNA isolation, extending tissue lysis time to 10 hours reduced high-molecular-weight species, while additional incubation at 70 °C markedly increased RNA yields. Since FF- and FFPE-derived microarrays constitute different data entities, we used indirect measures to investigate gene signal variation and relative gene expression. Whole-genome analyses revealed high concordance rates, while reviewing on single-genes basis showed higher data variation in FFPE than FF arrays. Using an experimental model, gene set enrichment analysis (GSEA) of FFPE-derived microarrays and fresh tissue-derived RNA-Seq datasets yielded similarly affected pathways confirming the applicability of FFPE tissue in global gene expression analysis. Our study provides a workflow comprising RNA isolation, quality assessment and microarray profiling using minimal RNA input, thus enabling hypothesis-generating pathway analyses from limited amounts of precious, pathologically significant FFPE tissues.

  4. Revealing the transcriptomic complexity of switchgrass by PacBio long-read sequencing.

    PubMed

    Zuo, Chunman; Blow, Matthew; Sreedasyam, Avinash; Kuo, Rita C; Ramamoorthy, Govindarajan Kunde; Torres-Jerez, Ivone; Li, Guifen; Wang, Mei; Dilworth, David; Barry, Kerrie; Udvardi, Michael; Schmutz, Jeremy; Tang, Yuhong; Xu, Ying

    2018-01-01

    Switchgrass ( Panicum virgatum L.) is an important bioenergy crop widely used for lignocellulosic research. While extensive transcriptomic analyses have been conducted on this species using short read-based sequencing techniques, very little has been reliably derived regarding alternatively spliced (AS) transcripts. We present an analysis of transcriptomes of six switchgrass tissue types pooled together, sequenced using Pacific Biosciences (PacBio) single-molecular long-read technology. Our analysis identified 105,419 unique transcripts covering 43,570 known genes and 8795 previously unknown genes. 45,168 are novel transcripts of known genes. A total of 60,096 AS transcripts are identified, 45,628 being novel. We have also predicted 1549 transcripts of genes involved in cell wall construction and remodeling, 639 being novel transcripts of known cell wall genes. Most of the predicted transcripts are validated against Illumina-based short reads. Specifically, 96% of the splice junction sites in all the unique transcripts are validated by at least five Illumina reads. Comparisons between genes derived from our identified transcripts and the current genome annotation revealed that among the gene set predicted by both analyses, 16,640 have different exon-intron structures. Overall, substantial amount of new information is derived from the PacBio RNA data regarding both the transcriptome and the genome of switchgrass.

  5. Association analysis of single nucleotide polymorphisms in candidate genes with root traits in maize (Zea mays L.) seedlings.

    PubMed

    Kumar, Bharath; Abdel-Ghani, Adel H; Pace, Jordon; Reyes-Matamoros, Jenaro; Hochholdinger, Frank; Lübberstedt, Thomas

    2014-07-01

    Several genes involved in maize root development have been isolated. Identification of SNPs associated with root traits would enable the selection of maize lines with better root architecture that might help to improve N uptake, and consequently plant growth particularly under N deficient conditions. In the present study, an association study (AS) panel consisting of 74 maize inbred lines was screened for seedling root traits in 6, 10, and 14-day-old seedlings. Allele re-sequencing of candidate root genes Rtcl, Rth3, Rum1, and Rul1 was also carried out in the same AS panel lines. All four candidate genes displayed different levels of nucleotide diversity, haplotype diversity and linkage disequilibrium. Gene based association analyses were carried out between individual polymorphisms in candidate genes, and root traits measured in 6, 10, and 14-day-old maize seedlings. Association analyses revealed several polymorphisms within the Rtcl, Rth3, Rum1, and Rul1 genes associated with seedling root traits. Several nucleotide polymorphisms in Rtcl, Rth3, Rum1, and Rul1 were significantly (P<0.05) associated with seedling root traits in maize suggesting that all four tested genes are involved in the maize root development. Thus considerable allelic variation present in these root genes can be exploited for improving maize root characteristics. Copyright © 2014 Elsevier Ireland Ltd. All rights reserved.

  6. A Protein Domain and Family Based Approach to Rare Variant Association Analysis.

    PubMed

    Richardson, Tom G; Shihab, Hashem A; Rivas, Manuel A; McCarthy, Mark I; Campbell, Colin; Timpson, Nicholas J; Gaunt, Tom R

    2016-01-01

    It has become common practice to analyse large scale sequencing data with statistical approaches based around the aggregation of rare variants within the same gene. We applied a novel approach to rare variant analysis by collapsing variants together using protein domain and family coordinates, regarded to be a more discrete definition of a biologically functional unit. Using Pfam definitions, we collapsed rare variants (Minor Allele Frequency ≤ 1%) together in three different ways 1) variants within single genomic regions which map to individual protein domains 2) variants within two individual protein domain regions which are predicted to be responsible for a protein-protein interaction 3) all variants within combined regions from multiple genes responsible for coding the same protein domain (i.e. protein families). A conventional collapsing analysis using gene coordinates was also undertaken for comparison. We used UK10K sequence data and investigated associations between regions of variants and lipid traits using the sequence kernel association test (SKAT). We observed no strong evidence of association between regions of variants based on Pfam domain definitions and lipid traits. Quantile-Quantile plots illustrated that the overall distributions of p-values from the protein domain analyses were comparable to that of a conventional gene-based approach. Deviations from this distribution suggested that collapsing by either protein domain or gene definitions may be favourable depending on the trait analysed. We have collapsed rare variants together using protein domain and family coordinates to present an alternative approach over collapsing across conventionally used gene-based regions. Although no strong evidence of association was detected in these analyses, future studies may still find value in adopting these approaches to detect previously unidentified association signals.

  7. Multi-gene panel testing in Korean patients with common genetic generalized epilepsy syndromes.

    PubMed

    Lee, Cha Gon; Lee, Jeehun; Lee, Munhyang

    2018-01-01

    Genetic heterogeneity of common genetic generalized epilepsy syndromes is frequently considered. The present study conducted a focused analysis of potential candidate or susceptibility genes for common genetic generalized epilepsy syndromes using multi-gene panel testing with next-generation sequencing. This study included patients with juvenile myoclonic epilepsy, juvenile absence epilepsy, and epilepsy with generalized tonic-clonic seizures alone. We identified pathogenic variants according to the American College of Medical Genetics and Genomics guidelines and identified susceptibility variants using case-control association analyses and family analyses for familial cases. A total of 57 patients were enrolled, including 51 sporadic cases and 6 familial cases. Twenty-two pathogenic and likely pathogenic variants of 16 different genes were identified. CACNA1H was the most frequently observed single gene. Variants of voltage-gated Ca2+ channel genes, including CACNA1A, CACNA1G, and CACNA1H were observed in 32% of variants (n = 7/22). Analyses to identify susceptibility variants using case-control association analysis indicated that KCNMA1 c.400G>C was associated with common genetic generalized epilepsy syndromes. Only 1 family (family A) exhibited a candidate pathogenic variant p.(Arg788His) on CACNA1H, as determined via family analyses. This study identified candidate genetic variants in about a quarter of patients (n = 16/57) and an average of 2.8 variants was identified in each patient. The results reinforced the polygenic disorder with very high locus and allelic heterogeneity of common GGE syndromes. Further, voltage-gated Ca2+ channels are suggested as important contributors to common genetic generalized epilepsy syndromes. This study extends our comprehensive understanding of common genetic generalized epilepsy syndromes.

  8. Comparative genomics of the mimicry switch in Papilio dardanus.

    PubMed

    Timmermans, Martijn J T N; Baxter, Simon W; Clark, Rebecca; Heckel, David G; Vogel, Heiko; Collins, Steve; Papanicolaou, Alexie; Fukova, Iva; Joron, Mathieu; Thompson, Martin J; Jiggins, Chris D; ffrench-Constant, Richard H; Vogler, Alfried P

    2014-07-22

    The African Mocker Swallowtail, Papilio dardanus, is a textbook example in evolutionary genetics. Classical breeding experiments have shown that wing pattern variation in this polymorphic Batesian mimic is determined by the polyallelic H locus that controls a set of distinct mimetic phenotypes. Using bacterial artificial chromosome (BAC) sequencing, recombination analyses and comparative genomics, we show that H co-segregates with an interval of less than 500 kb that is collinear with two other Lepidoptera genomes and contains 24 genes, including the transcription factor genes engrailed (en) and invected (inv). H is located in a region of conserved gene order, which argues against any role for genomic translocations in the evolution of a hypothesized multi-gene mimicry locus. Natural populations of P. dardanus show significant associations of specific morphs with single nucleotide polymorphisms (SNPs), centred on en. In addition, SNP variation in the H region reveals evidence of non-neutral molecular evolution in the en gene alone. We find evidence for a duplication potentially driving physical constraints on recombination in the lamborni morph. Absence of perfect linkage disequilibrium between different genes in the other morphs suggests that H is limited to nucleotide positions in the regulatory and coding regions of en. Our results therefore support the hypothesis that a single gene underlies wing pattern variation in P. dardanus.

  9. Molecular characterization of the vitamin D receptor (VDR) gene in Holstein cows.

    PubMed

    Ali, Mayar O; El-Adl, Mohamed A; Ibrahim, Hussam M M; Elseedy, Youssef Y; Rizk, Mohamed A; El-Khodery, Sabry A

    2018-06-01

    Vitamin D plays a vital role in calcium homeostasis, growth, and immunoregulation. Because little is known about the vitamin D receptor (VDR) gene in cattle, the aim of the present investigation was to present the molecular characterization of exons 5 and 6 of the VDR gene in Holstein cows. DNA extraction, genomic sequencing, phylogenetic analysis, synteny mapping and single nucleotide gene polymorphism analysis of the VDR gene were performed to assess blood samples collected from 50 clinically healthy Holstein cows. The results revealed the presence of a 450-base pair (bp) nucleotide sequence that resembled exons 5 and 6 with intron 5 enclosed between these exons. Sequence alignment and phylogenetic analysis revealed a close relationship between the sequenced VDR region and that found in Hereford cattle. A close association between this region and the corresponding region in small ruminants was also documented. Moreover, a single nucleotide polymorphism (SNP) that caused the replacement of a glutamate with an arginine in the deduced amino acid sequence was detected at position 7 of exon 5. In conclusion, Holstein and Hereford cattle differ with respect to exon 5 of the VDR gene. Phylogenetic analysis of the VDR gene based on nucleotide sequence produced different results from prior analyses based on amino acid sequence. Copyright © 2018 Elsevier Ltd. All rights reserved.

  10. Morphological Identification and Single-Cell Genomics of Marine Diplonemids.

    PubMed

    Gawryluk, Ryan M R; Del Campo, Javier; Okamoto, Noriko; Strassert, Jürgen F H; Lukeš, Julius; Richards, Thomas A; Worden, Alexandra Z; Santoro, Alyson E; Keeling, Patrick J

    2016-11-21

    Recent global surveys of marine biodiversity have revealed that a group of organisms known as "marine diplonemids" constitutes one of the most abundant and diverse planktonic lineages [1]. Though discovered over a decade ago [2, 3], their potential importance was unrecognized, and our knowledge remains restricted to a single gene amplified from environmental DNA, the 18S rRNA gene (small subunit [SSU]). Here, we use single-cell genomics (SCG) and microscopy to characterize ten marine diplonemids, isolated from a range of depths in the eastern North Pacific Ocean. Phylogenetic analysis confirms that the isolates reflect the entire range of marine diplonemid diversity, and comparisons to environmental SSU surveys show that sequences from the isolates range from rare to superabundant, including the single most common marine diplonemid known. SCG generated a total of ∼915 Mbp of assembled sequence across all ten cells and ∼4,000 protein-coding genes with homologs in the Kyoto Encyclopedia of Genes and Genomes (KEGG) orthology database, distributed across categories expected for heterotrophic protists. Models of highly conserved genes indicate a high density of non-canonical introns, lacking conventional GT-AG splice sites. Mapping metagenomic datasets [4] to SCG assemblies reveals virtually no overlap, suggesting that nuclear genomic diversity is too great for representative SCG data to provide meaningful phylogenetic context to metagenomic datasets. This work provides an entry point to the future identification, isolation, and cultivation of these elusive yet ecologically important cells. The high density of nonconventional introns, however, also portends difficulty in generating accurate gene models and highlights the need for the establishment of stable cultures and transcriptomic analyses. Copyright © 2016 Elsevier Ltd. All rights reserved.

  11. Single-cell analysis of population context advances RNAi screening at multiple levels

    PubMed Central

    Snijder, Berend; Sacher, Raphael; Rämö, Pauli; Liberali, Prisca; Mench, Karin; Wolfrum, Nina; Burleigh, Laura; Scott, Cameron C; Verheije, Monique H; Mercer, Jason; Moese, Stefan; Heger, Thomas; Theusner, Kristina; Jurgeit, Andreas; Lamparter, David; Balistreri, Giuseppe; Schelhaas, Mario; De Haan, Cornelis A M; Marjomäki, Varpu; Hyypiä, Timo; Rottier, Peter J M; Sodeik, Beate; Marsh, Mark; Gruenberg, Jean; Amara, Ali; Greber, Urs; Helenius, Ari; Pelkmans, Lucas

    2012-01-01

    Isogenic cells in culture show strong variability, which arises from dynamic adaptations to the microenvironment of individual cells. Here we study the influence of the cell population context, which determines a single cell's microenvironment, in image-based RNAi screens. We developed a comprehensive computational approach that employs Bayesian and multivariate methods at the single-cell level. We applied these methods to 45 RNA interference screens of various sizes, including 7 druggable genome and 2 genome-wide screens, analysing 17 different mammalian virus infections and four related cell physiological processes. Analysing cell-based screens at this depth reveals widespread RNAi-induced changes in the population context of individual cells leading to indirect RNAi effects, as well as perturbations of cell-to-cell variability regulators. We find that accounting for indirect effects improves the consistency between siRNAs targeted against the same gene, and between replicate RNAi screens performed in different cell lines, in different labs, and with different siRNA libraries. In an era where large-scale RNAi screens are increasingly performed to reach a systems-level understanding of cellular processes, we show that this is often improved by analyses that account for and incorporate the single-cell microenvironment. PMID:22531119

  12. The complete chloroplast genome of Gentiana straminea (Gentianaceae), an endemic species to the Sino-Himalayan subregion.

    PubMed

    Ni, Lianghong; Zhao, Zhili; Xu, Hongxi; Chen, Shilin; Dorje, Gaawe

    2016-02-15

    Endemic to the Sino-Himalayan subregion, the medicinal alpine plant Gentiana straminea is a threatened species. The genetic and molecular data about it is deficient. Here we report the complete chloroplast (cp) genome sequence of G. straminea, as the first sequenced member of the family Gentianaceae. The cp genome is 148,991bp in length, including a large single copy (LSC) region of 81,240bp, a small single copy (SSC) region of 17,085bp and a pair of inverted repeats (IRs) of 25,333bp. It contains 112 unique genes, including 78 protein-coding genes, 30 tRNAs and 4 rRNAs. The rps16 gene lacks exon2 between trnK-UUU and trnQ-UUG, which is the first rps16 pseudogene found in the nonparasitic plants of Asterids clade. Sequence analysis revealed the presence of 13 forward repeats, 13 palindrome repeats and 39 simple sequence repeats (SSRs). An entire cp genome comparison study of G. straminea and four other species in Gentianales was carried out. Phylogenetic analyses using maximum likelihood (ML) and maximum parsimony (MP) were performed based on 69 protein-coding genes from 36 species of Asterids. The results strongly supported the position of Gentianaceae as one member of the order Gentianales. The complete chloroplast genome sequence will provide intragenic information for its conservation and contribute to research on the genetic and phylogenetic analyses of Gentianales and Asterids. Copyright © 2015 Elsevier B.V. All rights reserved.

  13. Advances in the phylogenesis of Agaricales and its higher ranks and strategies for establishing phylogenetic hypotheses§

    PubMed Central

    Zhao, Rui-lin; Desjardin, Dennis E.; Soytong, Kasem; Hyde, Kevin D.

    2008-01-01

    We present an overview of previous research results on the molecular phylogenetic analyses in Agaricales and its higher ranks (Agaricomycetes/Agaricomycotina/Basidiomycota) along with the most recent treatments of taxonomic systems in these taxa. Establishing phylogenetic hypotheses using DNA sequences, from which an understanding of the natural evolutionary relationships amongst clades may be derived, requires a robust dataset. It has been recognized that single-gene phylogenies may not truly represent organismal phylogenies, but the concordant phylogenetic genealogies from multiple-gene datasets can resolve this problem. The genes commonly used in mushroom phylogenetic research are summarized. PMID:18837104

  14. Genes under weaker stabilizing selection increase network evolvability and rapid regulatory adaptation to an environmental shift.

    PubMed

    Laarits, T; Bordalo, P; Lemos, B

    2016-08-01

    Regulatory networks play a central role in the modulation of gene expression, the control of cellular differentiation, and the emergence of complex phenotypes. Regulatory networks could constrain or facilitate evolutionary adaptation in gene expression levels. Here, we model the adaptation of regulatory networks and gene expression levels to a shift in the environment that alters the optimal expression level of a single gene. Our analyses show signatures of natural selection on regulatory networks that both constrain and facilitate rapid evolution of gene expression level towards new optima. The analyses are interpreted from the standpoint of neutral expectations and illustrate the challenge to making inferences about network adaptation. Furthermore, we examine the consequence of variable stabilizing selection across genes on the strength and direction of interactions in regulatory networks and in their subsequent adaptation. We observe that directional selection on a highly constrained gene previously under strong stabilizing selection was more efficient when the gene was embedded within a network of partners under relaxed stabilizing selection pressure. The observation leads to the expectation that evolutionarily resilient regulatory networks will contain optimal ratios of genes whose expression is under weak and strong stabilizing selection. Altogether, our results suggest that the variable strengths of stabilizing selection across genes within regulatory networks might itself contribute to the long-term adaptation of complex phenotypes. © 2016 European Society For Evolutionary Biology. Journal of Evolutionary Biology © 2016 European Society For Evolutionary Biology.

  15. Molecular Markers Reveal Limited Population Genetic Structure in a North American Corvid, Clark’s Nutcracker (Nucifraga columbiana)

    PubMed Central

    Dohms, Kimberly M.; Burg, Theresa M.

    2013-01-01

    The genetic impact of barriers and Pleistocene glaciations on high latitude resident species has not been widely investigated. The Clark’s nutcracker is an endemic North American corvid closely associated with Pinus-dominated forests. The nutcracker’s encompasses known barriers to dispersal for other species, and glaciated and unglaciated areas. Clark’s nutcrackers also irruptively disperse long distances in search of pine seed crops, creating the potential for gene flow among populations. Using the highly variable mitochondrial DNA control region, seven microsatellite loci, and species distribution modeling, we examined the effects of glaciations and dispersal barriers on population genetic patterns and population structure of nutcrackers. We sequenced 900 bp of mitochondrial control region for 169 individuals from 15 populations and analysed seven polymorphic microsatellite loci for 13 populations across the Clark’s nutcracker range. We used species distribution modeling and a range of phylogeographic analyses to examine evolutionary history. Clark’s nutcracker populations are not highly differentiated throughout their range, suggesting high levels of gene flow among populations, though we did find some evidence of isolation by distance and peripheral isolation. Our analyses suggested expansion from a single refugium after the last glacial maximum, but patterns of genetic diversity and paleodistribution modeling of suitable habitat were inconclusive as to the location of this refugium. Potential barriers to dispersal (e.g. mountain ranges) do not appear to restrict gene flow in Clark’s nutcracker, and postglacial expansion likely occurred quickly from a single refugium located south of the ice sheets. PMID:24223982

  16. Reconsideration of systematic relationships within the order Euplotida (Protista, Ciliophora) using new sequences of the gene coding for small-subunit rRNA and testing the use of combined data sets to construct phylogenies of the Diophrys-complex.

    PubMed

    Yi, Zhenzhen; Song, Weibo; Clamp, John C; Chen, Zigui; Gao, Shan; Zhang, Qianqian

    2009-03-01

    Comprehensive molecular analyses of phylogenetic relationships within euplotid ciliates are relatively rare, and the relationships among some families remain questionable. We performed phylogenetic analyses of the order Euplotida based on new sequences of the gene coding for small-subunit RNA (SSrRNA) from a variety of taxa across the entire order as well as sequences from some of these taxa of other genes (ITS1-5.8S-ITS2 region and histone H4) that have not been included in previous analyses. Phylogenetic trees based on SSrRNA gene sequences constructed with four different methods had a consistent branching pattern that included the following features: (1) the "typical" euplotids comprised a paraphyletic assemblage composed of two divergent clades (family Uronychiidae and families Euplotidae-Certesiidae-Aspidiscidae-Gastrocirrhidae), (2) in the family Uronychiidae, the genera Uronychia and Paradiophrys formed a clearly outlined, well-supported clade that seemed to be rather divergent from Diophrys and Diophryopsis, suggesting that the Diophrys-complex may have had a longer and more separate evolutionary history than previously supposed, (3) inclusion of 12 new SSrRNA sequences in analyses of Euplotidae revealed two new clades of species within the family and cast additional doubt on the present classification of genera within the family, and (4) the intraspecific divergence among five species of Aspidisca was far greater than those of closely related genera. The ITS1-5.8S-ITS2 coding regions and partial histone H4 genes of six morphospecies in the Diophrys-complex were sequenced along with their SSrRNA genes and used to compare phylogenies constructed from single data sets to those constructed from combined sets. Results indicated that combined analyses could be used to construct more reliable, less ambiguous phylogenies of complex groups like the order Euplotida, because they provide a greater amount and diversity of information.

  17. Single-Cell RNA-Seq Reveals the Transcriptional Landscape and Heterogeneity of Aortic Macrophages in Murine Atherosclerosis.

    PubMed

    Cochain, Clément; Vafadarnejad, Ehsan; Arampatzi, Panagiota; Jaroslav, Pelisek; Winkels, Holger; Ley, Klaus; Wolf, Dennis; Saliba, Antoine-Emmanuel; Zernecke, Alma

    2018-03-15

    Rationale: It is assumed that atherosclerotic arteries contain several macrophage subsets endowed with specific functions. The precise identity of these subsets is poorly characterized as they ha ve been defined by the expression of a restricted number of markers. Objective: We have applied single-cell RNA-seq as an unbiased profiling strategy to interrogate and classify aortic macrophage heterogeneity at the single-cell level in atherosclerosis. Methods and Results: We performed single-cell RNA sequencing of total aortic CD45 + cells extracted from the non-diseased (chow fed) and atherosclerotic (11 weeks of high fat diet) aorta of Ldlr -/- mice. Unsupervised clustering singled out 13 distinct aortic cell clusters. Among the myeloid cell populations, Resident-like macrophages with a gene expression profile similar to aortic resident macrophages were found in healthy and diseased aortae, whereas monocytes, monocyte-derived dendritic cells (MoDC), and two populations of macrophages were almost exclusively detectable in atherosclerotic aortae, comprising Inflammatory macrophages showing enrichment in I l1b , and previously undescribed TREM2 hi macrophages. Differential gene expression and gene ontology enrichment analyses revealed specific gene expression patterns distinguishing these three macrophage subsets and MoDC, and uncovered putative functions of each cell type. Notably, TREM2 hi macrophages appeared to be endowed with specialized functions in lipid metabolism and catabolism, and presented a gene expression signature reminiscent of osteoclasts, suggesting a role in lesion calcification. TREM2 expression was moreover detected in human lesional macrophages. Importantly, these macrophage populations were present also in advanced atherosclerosis and in Apoe -/- aortae, indicating relevance of our findings in different stages of atherosclerosis and mouse models. Conclusions: These data unprecedentedly uncovered the transcriptional landscape and phenotypic heterogeneity of aortic macrophages and MoDCs in atherosclerotic and identified previously unrecognized macrophage populations and their gene expression signature, suggesting specialized functions. Our findings will open up novel opportunities to explore distinct myeloid cell populations and their functions in atherosclerosis.

  18. Mechanisms of Surface Antigenic Variation in the Human Pathogenic Fungus Pneumocystis jirovecii.

    PubMed

    Schmid-Siegert, Emanuel; Richard, Sophie; Luraschi, Amanda; Mühlethaler, Konrad; Pagni, Marco; Hauser, Philippe M

    2017-11-07

    Microbial pathogens commonly escape the human immune system by varying surface proteins. We investigated the mechanisms used for that purpose by Pneumocystis jirovecii This uncultivable fungus is an obligate pulmonary pathogen that in immunocompromised individuals causes pneumonia, a major life-threatening infection. Long-read PacBio sequencing was used to assemble a core of subtelomeres of a single P. jirovecii strain from a bronchoalveolar lavage fluid specimen from a single patient. A total of 113 genes encoding surface proteins were identified, including 28 pseudogenes. These genes formed a subtelomeric gene superfamily, which included five families encoding adhesive glycosylphosphatidylinositol (GPI)-anchored glycoproteins and one family encoding excreted glycoproteins. Numerical analyses suggested that diversification of the glycoproteins relies on mosaic genes created by ectopic recombination and occurs only within each family. DNA motifs suggested that all genes are expressed independently, except those of the family encoding the most abundant surface glycoproteins, which are subject to mutually exclusive expression. PCR analyses showed that exchange of the expressed gene of the latter family occurs frequently, possibly favored by the location of the genes proximal to the telomere because this allows concomitant telomere exchange. Our observations suggest that (i) the P. jirovecii cell surface is made of a complex mixture of different surface proteins, with a majority of a single isoform of the most abundant glycoprotein, (ii) genetic mosaicism within each family ensures variation of the glycoproteins, and (iii) the strategy of the fungus consists of the continuous production of new subpopulations composed of cells that are antigenically different. IMPORTANCE Pneumocystis jirovecii is a fungus causing severe pneumonia in immunocompromised individuals. It is the second most frequent life-threatening invasive fungal infection. We have studied the mechanisms of antigenic variation used by this pathogen to escape the human immune system, a strategy commonly used by pathogenic microorganisms. Using a new DNA sequencing technology generating long reads, we could characterize the highly repetitive gene families encoding the proteins that are present on the cellular surface of this pest. These gene families are localized in the regions close to the ends of all chromosomes, the subtelomeres. Such chromosomal localization was found to favor genetic recombinations between members of each gene family and to allow diversification of these proteins continuously over time. This pathogen seems to use a strategy of antigenic variation consisting of the continuous production of new subpopulations composed of cells that are antigenically different. Such a strategy is unique among human pathogens. Copyright © 2017 Schmid-Siegert et al.

  19. Targeted Enrichment of Large Gene Families for Phylogenetic Inference: Phylogeny and Molecular Evolution of Photosynthesis Genes in the Portullugo Clade (Caryophyllales).

    PubMed

    Moore, Abigail J; Vos, Jurriaan M De; Hancock, Lillian P; Goolsby, Eric; Edwards, Erika J

    2018-05-01

    Hybrid enrichment is an increasingly popular approach for obtaining hundreds of loci for phylogenetic analysis across many taxa quickly and cheaply. The genes targeted for sequencing are typically single-copy loci, which facilitate a more straightforward sequence assembly and homology assignment process. However, this approach limits the inclusion of most genes of functional interest, which often belong to multi-gene families. Here, we demonstrate the feasibility of including large gene families in hybrid enrichment protocols for phylogeny reconstruction and subsequent analyses of molecular evolution, using a new set of bait sequences designed for the "portullugo" (Caryophyllales), a moderately sized lineage of flowering plants (~ 2200 species) that includes the cacti and harbors many evolutionary transitions to C$_{\\mathrm{4}}$ and CAM photosynthesis. Including multi-gene families allowed us to simultaneously infer a robust phylogeny and construct a dense sampling of sequences for a major enzyme of C$_{\\mathrm{4}}$ and CAM photosynthesis, which revealed the accumulation of adaptive amino acid substitutions associated with C$_{\\mathrm{4}}$ and CAM origins in particular paralogs. Our final set of matrices for phylogenetic analyses included 75-218 loci across 74 taxa, with ~ 50% matrix completeness across data sets. Phylogenetic resolution was greatly improved across the tree, at both shallow and deep levels. Concatenation and coalescent-based approaches both resolve the sister lineage of the cacti with strong support: Anacampserotaceae $+$ Portulacaceae, two lineages of mostly diminutive succulent herbs of warm, arid regions. In spite of this congruence, BUCKy concordance analyses demonstrated strong and conflicting signals across gene trees. Our results add to the growing number of examples illustrating the complexity of phylogenetic signals in genomic-scale data.

  20. Identification and validation of single nucleotide polymorphisms in growth- and maturation-related candidate genes in sole (Solea solea L.).

    PubMed

    Diopere, Eveline; Hellemans, Bart; Volckaert, Filip A M; Maes, Gregory E

    2013-03-01

    Genomic methodologies applied in evolutionary and fisheries research have been of great benefit to understand the marine ecosystem and the management of natural resources. Although single nucleotide polymorphisms (SNPs) are attractive for the study of local adaptation, spatial stock management and traceability, and investigating the effects of fisheries-induced selection, they have rarely been exploited in non-model organisms. This is partly due to difficulties in finding and validating SNPs in species with limited or no genomic resources. Complementary to random genome-scan approaches, a targeted candidate gene approach has the potential to unveil pre-selected functional diversity and provides more in depth information on the action of selection at specific genes. For example genes can be under selective pressure due to climate change and sustained periods of heavy fishing pressure. In this study, we applied a candidate gene approach in sole (Solea solea L.), an important member of the demersal ecosystem. As consumption flatfish it is heavy exploited and has experienced associated life-history changes over the last 60years. To discover novel genetic polymorphisms in or around genes linked to important life history traits in sole, we screened a total of 76 candidate genes related to growth and maturation using a targeted resequencing approach. We identified in total 86 putative SNPs in 22 genes and validated 29 SNPs using a multiplex single-base extension genotyping assay. We found 22 informative SNPs, of which two represent non-synonymous mutations, potentially of functional relevance. These novel markers should be rapidly and broadly applicable in analyses of natural sole populations, as a measure of the evolutionary signature of overfishing and for initiatives on marker assisted selection. Copyright © 2012 Elsevier B.V. All rights reserved.

  1. Distribution of cytokine gene polymorphisms in five Malay subethnic groups in Peninsular Malaysia.

    PubMed

    Norhalifah, H K; Zafarina, Z; Sundararajulu, P; Norazmi, M N; Edinur, H A

    2015-06-01

    In this survey, we have successfully genotyped 22 single nucleotide polymorphisms in the 13 cytokine genes for five Malay subethnic groups (Kelantan, Acheh, Mandailing, Minangkabau and Patani Malays) using polymerase chain reaction-sequence-specific primer cytokine genotyping kit (Invitrogen, Carlsbad, CA, USA). Most of the cytokine genes showed similar pattern of allelic spectra with wild-type alleles (e.g. ILIa-889/C, ILIB+3962/C and IL6 nt565/G) that represent more than 80% in the studied Malay subethnic groups. These newly observed cytokine alleles and subsequent analyses clearly indicate genetic contribution from Asia in the studied Malay subethnic groups with evidence of admixture from neighbouring populations in Patani Malays. The cytokine data sets for the five Malay subethnic groups deposited in this report can also be used as reference standard for searching suitable donor for allograft transplant and diseases association study. This is particularly relevance as our analyses showed differences between the Malay subethnic groups and other populations screened for cytokine genes. © 2015 John Wiley & Sons Ltd.

  2. Compositions and methods for detecting gene rearrangements and translocations

    DOEpatents

    Rowley, Janet D.; Diaz, Manuel O.

    2000-01-01

    Disclosed is a series of nucleic acid probes for use in diagnosing and monitoring certain types of leukemia using, e.g., Southern and Northern blot analyses and fluorescence in situ hybridization (FISH). These probes detect rearrangements, such as translocations involving chromosome band 11q23 with other chromosomes bands, including 4q21, 6q27, 9p22, 19p13.3, in both dividing leukemic cells and interphase nuclei. The breakpoints in all such translocations are clustered within an 8.3 kb BamHI genomic region of the MLL gene. A novel 0.7 kb BamH1 cDNA fragment derived from this gene detects rearrangements on Southern blot analysis with a single BamHI restriction digest in all patients with the common 11q23 translocations and in patients with other 11q23 anomalies. Northern blot analyses are presented demonstrating that the MLL gene has multiple transcripts and that transcript size differentiates leukemic cells from normal cells. Also disclosed are MLL fusion proteins, MLL protein domains and anti-MLL antibodies.

  3. Comprehensive analysis of Arabidopsis expression level polymorphisms with simple inheritance

    PubMed Central

    Plantegenet, Stephanie; Weber, Johann; Goldstein, Darlene R; Zeller, Georg; Nussbaumer, Cindy; Thomas, Jérôme; Weigel, Detlef; Harshman, Keith; Hardtke, Christian S

    2009-01-01

    In Arabidopsis thaliana, gene expression level polymorphisms (ELPs) between natural accessions that exhibit simple, single locus inheritance are promising quantitative trait locus (QTL) candidates to explain phenotypic variability. It is assumed that such ELPs overwhelmingly represent regulatory element polymorphisms. However, comprehensive genome-wide analyses linking expression level, regulatory sequence and gene structure variation are missing, preventing definite verification of this assumption. Here, we analyzed ELPs observed between the Eil-0 and Lc-0 accessions. Compared with non-variable controls, 5′ regulatory sequence variation in the corresponding genes is indeed increased. However, ∼42% of all the ELP genes also carry major transcription unit deletions in one parent as revealed by genome tiling arrays, representing a >4-fold enrichment over controls. Within the subset of ELPs with simple inheritance, this proportion is even higher and deletions are generally more severe. Similar results were obtained from analyses of the Bay-0 and Sha accessions, using alternative technical approaches. Collectively, our results suggest that drastic structural changes are a major cause for ELPs with simple inheritance, corroborating experimentally observed indel preponderance in cloned Arabidopsis QTL. PMID:19225455

  4. Database for High Throughput Screening Hits (dHITS): a simple tool to retrieve gene specific phenotypes from systematic screens done in yeast.

    PubMed

    Chuartzman, Silvia G; Schuldiner, Maya

    2018-03-25

    In the last decade several collections of Saccharomyces cerevisiae yeast strains have been created. In these collections every gene is modified in a similar manner such as by a deletion or the addition of a protein tag. Such libraries have enabled a diversity of systematic screens, giving rise to large amounts of information regarding gene functions. However, often papers describing such screens focus on a single gene or a small set of genes and all other loci affecting the phenotype of choice ('hits') are only mentioned in tables that are provided as supplementary material and are often hard to retrieve or search. To help unify and make such data accessible, we have created a Database of High Throughput Screening Hits (dHITS). The dHITS database enables information to be obtained about screens in which genes of interest were found as well as the other genes that came up in that screen - all in a readily accessible and downloadable format. The ability to query large lists of genes at the same time provides a platform to easily analyse hits obtained from transcriptional analyses or other screens. We hope that this platform will serve as a tool to facilitate investigation of protein functions to the yeast community. © 2018 The Authors Yeast Published by John Wiley & Sons Ltd.

  5. Extensive tissue-specific transcriptomic plasticity in maize primary roots upon water deficit.

    PubMed

    Opitz, Nina; Marcon, Caroline; Paschold, Anja; Malik, Waqas Ahmed; Lithio, Andrew; Brandt, Ronny; Piepho, Hans-Peter; Nettleton, Dan; Hochholdinger, Frank

    2016-02-01

    Water deficit is the most important environmental constraint severely limiting global crop growth and productivity. This study investigated early transcriptome changes in maize (Zea mays L.) primary root tissues in response to moderate water deficit conditions by RNA-Sequencing. Differential gene expression analyses revealed a high degree of plasticity of the water deficit response. The activity status of genes (active/inactive) was determined by a Bayesian hierarchical model. In total, 70% of expressed genes were constitutively active in all tissues. In contrast, <3% (50 genes) of water deficit-responsive genes (1915) were consistently regulated in all tissues, while >75% (1501 genes) were specifically regulated in a single root tissue. Water deficit-responsive genes were most numerous in the cortex of the mature root zone and in the elongation zone. The most prominent functional categories among differentially expressed genes in all tissues were 'transcriptional regulation' and 'hormone metabolism', indicating global reprogramming of cellular metabolism as an adaptation to water deficit. Additionally, the most significant transcriptomic changes in the root tip were associated with cell wall reorganization, leading to continued root growth despite water deficit conditions. This study provides insight into tissue-specific water deficit responses and will be a resource for future genetic analyses and breeding strategies to develop more drought-tolerant maize cultivars. © The Author 2015. Published by Oxford University Press on behalf of the Society for Experimental Biology.

  6. Complete chloroplast genome of Tetragonia tetragonioides: Molecular phylogenetic relationships and evolution in Caryophyllales.

    PubMed

    Choi, Kyoung Su; Kwak, Myounghai; Lee, Byoungyoon; Park, SeonJoo

    2018-01-01

    The chloroplast genome of Tetragonia tetragonioides (Aizoaceae; Caryophyllales) was sequenced to provide information for studies on phylogeny and evolution within Caryophyllales. The chloroplast genome of Tetragonia tetragonioides is 149,506 bp in length and includes a pair of inverted repeats (IRs) of 24,769 bp that separate a large single copy (LSC) region of 82,780 bp and a small single copy (SSC) region of 17,188 bp. Comparative analysis of the chloroplast genome showed that Caryphyllales species have lost many genes. In particular, the rpl2 intron and infA gene were not found in T. tetragonioides, and core Caryophyllales lack the rpl2 intron. Phylogenetic analyses were conducted using 55 genes in 16 complete chloroplast genomes. Caryophyllales was found to divide into two clades; core Caryophyllales and noncore Caryophyllales. The genus Tetragonia is closely related to Mesembryanthemum. Comparisons of the synonymous (Ks), nonsynonymous (Ka), and Ka/Ks substitution rates revealed that nonsynonymous substitution rates were lower than synonymous substitution rates and that Ka/Ks rates were less than 1. The findings of the present study suggest that most genes are a purified selection.

  7. Genome-wide and gene-based association implicates FRMD6 in Alzheimer disease.

    PubMed

    Hong, Mun-Gwan; Reynolds, Chandra A; Feldman, Adina L; Kallin, Mikael; Lambert, Jean-Charles; Amouyel, Philippe; Ingelsson, Erik; Pedersen, Nancy L; Prince, Jonathan A

    2012-03-01

    Genome-wide association studies (GWAS) that allow for allelic heterogeneity may facilitate the discovery of novel genes not detectable by models that require replication of a single variant site. One strategy to accomplish this is to focus on genes rather than markers as units of association, and so potentially capture a spectrum of causal alleles that differ across populations. Here, we conducted a GWAS of Alzheimer disease (AD) in 2,586 Swedes and performed gene-based meta-analysis with three additional studies from France, Canada, and the United States, in total encompassing 4,259 cases and 8,284 controls. Implementing a newly designed gene-based algorithm, we identified two loci apart from the region around APOE that achieved study-wide significance in combined samples, the strongest finding being for FRMD6 on chromosome 14q (P = 2.6 × 10(-14)) and a weaker signal for NARS2 that is immediately adjacent to GAB2 on chromosome 11q (P = 7.8 × 10(-9)). Ontology-based pathway analyses revealed significant enrichment of genes involved in glycosylation. Results suggest that gene-based approaches that accommodate allelic heterogeneity in GWAS can provide a complementary avenue for gene discovery and may help to explain a portion of the missing heritability not detectable with single nucleotide polymorphisms (SNPs) derived from marker-specific meta-analysis. © 2011 Wiley Periodicals, Inc.

  8. Genetic variations in the serotonergic system contribute to amygdala volume in humans

    PubMed Central

    Li, Jin; Chen, Chunhui; Wu, Karen; Zhang, Mingxia; Zhu, Bi; Chen, Chuansheng; Moyzis, Robert K.; Dong, Qi

    2015-01-01

    The amygdala plays a critical role in emotion processing and psychiatric disorders associated with emotion dysfunction. Accumulating evidence suggests that amygdala structure is modulated by serotonin-related genes. However, there is a gap between the small contributions of single loci (less than 1%) and the reported 63–65% heritability of amygdala structure. To understand the “missing heritability,” we systematically explored the contribution of serotonin genes on amygdala structure at the gene set level. The present study of 417 healthy Chinese volunteers examined 129 representative polymorphisms in genes from multiple biological mechanisms in the regulation of serotonin neurotransmission. A system-level approach using multiple regression analyses identified that nine SNPs collectively accounted for approximately 8% of the variance in amygdala volume. Permutation analyses showed that the probability of obtaining these findings by chance was low (p = 0.043, permuted for 1000 times). Findings showed that serotonin genes contribute moderately to individual differences in amygdala volume in a healthy Chinese sample. These results indicate that the system-level approach can help us to understand the genetic basis of a complex trait such as amygdala structure. PMID:26500508

  9. Mitochondrial DNA variants can mediate methylation status of inflammation, angiogenesis and signaling genes

    PubMed Central

    Atilano, Shari R.; Malik, Deepika; Chwa, Marilyn; Cáceres-Del-Carpio, Javier; Nesburn, Anthony B.; Boyer, David S.; Kuppermann, Baruch D.; Jazwinski, S. Michal; Miceli, Michael V.; Wallace, Douglas C.; Udar, Nitin; Kenney, M. Cristina

    2015-01-01

    Mitochondrial (mt) DNA can be classified into haplogroups representing different geographic and/or racial origins of populations. The H haplogroup is protective against age-related macular degeneration (AMD), while the J haplogroup is high risk for AMD. In the present study, we performed comparison analyses of human retinal cell cybrids, which possess identical nuclei, but mtDNA from subjects with either the H or J haplogroups, and demonstrate differences in total global methylation, and expression patterns for two genes related to acetylation and five genes related to methylation. Analyses revealed that untreated-H and -J cybrids have different expression levels for nuclear genes (CFH, EFEMP1, VEGFA and NFkB2). However, expression levels for these genes become equivalent after treatment with a methylation inhibitor, 5-aza-2′-deoxycytidine. Moreover, sequencing of the entire mtDNA suggests that differences in epigenetic status found in cybrids are likely due to single nucleotide polymorphisms (SNPs) within the haplogroup profiles rather than rare variants or private SNPs. In conclusion, our findings indicate that mtDNA variants can mediate methylation profiles and transcription for inflammation, angiogenesis and various signaling pathways, which are important in several common diseases. PMID:25964427

  10. Integrative and conjugative elements and their hosts: composition, distribution and organization

    PubMed Central

    Touchon, Marie; Rocha, Eduardo P. C.

    2017-01-01

    Abstract Conjugation of single-stranded DNA drives horizontal gene transfer between bacteria and was widely studied in conjugative plasmids. The organization and function of integrative and conjugative elements (ICE), even if they are more abundant, was only studied in a few model systems. Comparative genomics of ICE has been precluded by the difficulty in finding and delimiting these elements. Here, we present the results of a method that circumvents these problems by requiring only the identification of the conjugation genes and the species’ pan-genome. We delimited 200 ICEs and this allowed the first large-scale characterization of these elements. We quantified the presence in ICEs of a wide set of functions associated with the biology of mobile genetic elements, including some that are typically associated with plasmids, such as partition and replication. Protein sequence similarity networks and phylogenetic analyses revealed that ICEs are structured in functional modules. Integrases and conjugation systems have different evolutionary histories, even if the gene repertoires of ICEs can be grouped in function of conjugation types. Our characterization of the composition and organization of ICEs paves the way for future functional and evolutionary analyses of their cargo genes, composed of a majority of unknown function genes. PMID:28911112

  11. Intra-isolate genome variation in arbuscular mycorrhizal fungi persists in the transcriptome.

    PubMed

    Boon, E; Zimmerman, E; Lang, B F; Hijri, M

    2010-07-01

    Arbuscular mycorrhizal fungi (AMF) are heterokaryotes with an unusual genetic makeup. Substantial genetic variation occurs among nuclei within a single mycelium or isolate. AMF reproduce through spores that contain varying fractions of this heterogeneous population of nuclei. It is not clear whether this genetic variation on the genome level actually contributes to the AMF phenotype. To investigate the extent to which polymorphisms in nuclear genes are transcribed, we analysed the intra-isolate genomic and cDNA sequence variation of two genes, the large subunit ribosomal RNA (LSU rDNA) of Glomus sp. DAOM-197198 (previously known as G. intraradices) and the POL1-like sequence (PLS) of Glomus etunicatum. For both genes, we find high sequence variation at the genome and transcriptome level. Reconstruction of LSU rDNA secondary structure shows that all variants are functional. Patterns of PLS sequence polymorphism indicate that there is one functional gene copy, PLS2, which is preferentially transcribed, and one gene copy, PLS1, which is a pseudogene. This is the first study that investigates AMF intra-isolate variation at the transcriptome level. In conclusion, it is possible that, in AMF, multiple nuclear genomes contribute to a single phenotype.

  12. Brief Report: Glutamate Transporter Gene (SLC1A1) Single Nucleotide Polymorphism (rs301430) and Repetitive Behaviors and Anxiety in Children with Autism Spectrum Disorder

    PubMed Central

    Gadow, Kenneth D.; Roohi, Jasmin; DeVincent, Carla J.; Kirsch, Sarah; Hatchwell, Eli

    2015-01-01

    Investigated association of single nucleotide polymorphism (SNP) rs301430 in glutamate transporter gene (SLC1A1) with severity of repetitive behaviors (obsessive–compulsive behaviors, tics) and anxiety in children with autism spectrum disorder (ASD). Mothers and/or teachers completed a validated DSM-IV-referenced rating scale for 67 children with autism spectrum disorder. Although analyses were not significant for repetitive behaviors, youths homozygous for the high expressing C allele had more severe anxiety than carriers of the T allele. Allelic variation in SLC1A1 may be a biomarker for or modifier of anxiety symptom severity in children with ASD, but study findings are best conceptualized as tentative pending replication with larger independent samples. PMID:20155310

  13. Glutamate transporter gene (SLC1A1) single nucleotide polymorphism (rs301430) and repetitive behaviors and anxiety in children with autism spectrum disorder.

    PubMed

    Gadow, Kenneth D; Roohi, Jasmin; DeVincent, Carla J; Kirsch, Sarah; Hatchwell, Eli

    2010-09-01

    Investigated association of single nucleotide polymorphism (SNP) rs301430 in glutamate transporter gene (SLC1A1) with severity of repetitive behaviors (obsessive-compulsive behaviors, tics) and anxiety in children with autism spectrum disorder (ASD). Mothers and/or teachers completed a validated DSM-IV-referenced rating scale for 67 children with autism spectrum disorder. Although analyses were not significant for repetitive behaviors, youths homozygous for the high expressing C allele had more severe anxiety than carriers of the T allele. Allelic variation in SLC1A1 may be a biomarker for or modifier of anxiety symptom severity in children with ASD, but study findings are best conceptualized as tentative pending replication with larger independent samples.

  14. Single-cell transcriptomes identify human islet cell signatures and reveal cell-type–specific expression changes in type 2 diabetes

    PubMed Central

    Bolisetty, Mohan; Kursawe, Romy; Sun, Lili; Sivakamasundari, V.; Kycia, Ina

    2017-01-01

    Blood glucose levels are tightly controlled by the coordinated action of at least four cell types constituting pancreatic islets. Changes in the proportion and/or function of these cells are associated with genetic and molecular pathophysiology of monogenic, type 1, and type 2 (T2D) diabetes. Cellular heterogeneity impedes precise understanding of the molecular components of each islet cell type that govern islet (dys)function, particularly the less abundant delta and gamma/pancreatic polypeptide (PP) cells. Here, we report single-cell transcriptomes for 638 cells from nondiabetic (ND) and T2D human islet samples. Analyses of ND single-cell transcriptomes identified distinct alpha, beta, delta, and PP/gamma cell-type signatures. Genes linked to rare and common forms of islet dysfunction and diabetes were expressed in the delta and PP/gamma cell types. Moreover, this study revealed that delta cells specifically express receptors that receive and coordinate systemic cues from the leptin, ghrelin, and dopamine signaling pathways implicating them as integrators of central and peripheral metabolic signals into the pancreatic islet. Finally, single-cell transcriptome profiling revealed genes differentially regulated between T2D and ND alpha, beta, and delta cells that were undetectable in paired whole islet analyses. This study thus identifies fundamental cell-type–specific features of pancreatic islet (dys)function and provides a critical resource for comprehensive understanding of islet biology and diabetes pathogenesis. PMID:27864352

  15. Multi-Tissue Omics Analyses Reveal Molecular Regulatory Networks for Puberty in Composite Beef Cattle

    PubMed Central

    Cánovas, Angela; Reverter, Antonio; DeAtley, Kasey L.; Ashley, Ryan L.; Colgrave, Michelle L.; Fortes, Marina R. S.; Islas-Trejo, Alma; Lehnert, Sigrid; Porto-Neto, Laercio; Rincón, Gonzalo; Silver, Gail A.; Snelling, Warren M.; Medrano, Juan F.; Thomas, Milton G.

    2014-01-01

    Puberty is a complex physiological event by which animals mature into an adult capable of sexual reproduction. In order to enhance our understanding of the genes and regulatory pathways and networks involved in puberty, we characterized the transcriptome of five reproductive tissues (i.e. hypothalamus, pituitary gland, ovary, uterus, and endometrium) as well as tissues known to be relevant to growth and metabolism needed to achieve puberty (i.e., longissimus dorsi muscle, adipose, and liver). These tissues were collected from pre- and post-pubertal Brangus heifers (3/8 Brahman; Bos indicus x 5/8 Angus; Bos taurus) derived from a population of cattle used to identify quantitative trait loci associated with fertility traits (i.e., age of first observed corpus luteum (ACL), first service conception (FSC), and heifer pregnancy (HPG)). In order to exploit the power of complementary omics analyses, pre- and post-puberty co-expression gene networks were constructed by combining the results from genome-wide association studies (GWAS), RNA-Seq, and bovine transcription factors. Eight tissues among pre-pubertal and post-pubertal Brangus heifers revealed 1,515 differentially expressed and 943 tissue-specific genes within the 17,832 genes confirmed by RNA-Seq analysis. The hypothalamus experienced the most notable up-regulation of genes via puberty (i.e., 204 out of 275 genes). Combining the results of GWAS and RNA-Seq, we identified 25 loci containing a single nucleotide polymorphism (SNP) associated with ACL, FSC, and (or) HPG. Seventeen of these SNP were within a gene and 13 of the genes were expressed in uterus or endometrium. Multi-tissue omics analyses revealed 2,450 co-expressed genes relative to puberty. The pre-pubertal network had 372,861 connections whereas the post-pubertal network had 328,357 connections. A sub-network from this process revealed key transcriptional regulators (i.e., PITX2, FOXA1, DACH2, PROP1, SIX6, etc.). Results from these multi-tissue omics analyses improve understanding of the number of genes and their complex interactions for puberty in cattle. PMID:25048735

  16. Mitochondrial gene sequences alone or combined with ITS region sequences provide firm molecular criteria for the classification of Lecanicillium species.

    PubMed

    Kouvelis, Vassili N; Sialakouma, Aphrodite; Typas, Milton A

    2008-07-01

    The recent revision of Verticillium sect. Prostrata led to the introduction of the genus Lecanicillium, which comprises the majority of the entomopathogenic strains. Sixty-five strains previously classified as Verticillium lecanii or Verticillium sp. from different geographical regions and hosts were examined and their phylogenetic relationships were determined using sequences from three mitochondrial (mt) genes [the small rRNA subunit (rns), the NADH dehydrogenase subunits 1 (nad1) and 3 (nad3)] and the ITS region. In general, single gene phylogenetic trees differentiated and placed the strains examined in well-supported (by BS analysis) groups of L. lecanii, L. longisporum, L. muscarium, and L. nodulosum, although in some cases a few uncertainties still remained. nad1 was the most informative single gene in phylogenetic analyses and was also found to contain group I introns with putative open reading frames (ORFs) encoding for GIY-YIG endonucleases. The combined use of mt gene sequences resolved taxonomic uncertainties arisen from ITS analysis and, alone or in combination with ITS sequences, helped in placing uncharacterised Verticillium lecanii and Verticillium sp. firmly into Lecanicillium species. Combined gene data from all the mt genes and all the mt genes and the ITS region together, were very similar. Furthermore, a relaxed correlation with host specificity -- at least for Homoptera -- was indicated for the rns and the combined mt gene sequences. Thus, the usefulness of mt gene sequences as a convenient molecular tool in phylogenetic studies of entomopathogenic fungi was demonstrated.

  17. Evolution of the Class IV HD-Zip Gene Family in Streptophytes

    PubMed Central

    Zalewski, Christopher S.; Floyd, Sandra K.; Furumizu, Chihiro; Sakakibara, Keiko; Stevenson, Dennis W.; Bowman, John L.

    2013-01-01

    Class IV homeodomain leucine zipper (C4HDZ) genes are plant-specific transcription factors that, based on phenotypes in Arabidopsis thaliana, play an important role in epidermal development. In this study, we sampled all major extant lineages and their closest algal relatives for C4HDZ homologs and phylogenetic analyses result in a gene tree that mirrors land plant evolution with evidence for gene duplications in many lineages, but minimal evidence for gene losses. Our analysis suggests an ancestral C4HDZ gene originated in an algal ancestor of land plants and a single ancestral gene was present in the last common ancestor of land plants. Independent gene duplications are evident within several lineages including mosses, lycophytes, euphyllophytes, seed plants, and, most notably, angiosperms. In recently evolved angiosperm paralogs, we find evidence of pseudogenization via mutations in both coding and regulatory sequences. The increasing complexity of the C4HDZ gene family through the diversification of land plants correlates to increasing complexity in epidermal characters. PMID:23894141

  18. Antigenic variation in malaria: in situ switching, relaxed and mutually exclusive transcription of var genes during intra-erythrocytic development in Plasmodium falciparum.

    PubMed Central

    Scherf, A; Hernandez-Rivas, R; Buffet, P; Bottius, E; Benatar, C; Pouvelle, B; Gysin, J; Lanzer, M

    1998-01-01

    Members of the Plasmodium falciparum var gene family encode clonally variant adhesins, which play an important role in the pathogenicity of tropical malaria. Here we employ a selective panning protocol to generate isogenic P.falciparum populations with defined adhesive phenotypes for CD36, ICAM-1 and CSA, expressing single and distinct var gene variants. This technique has established the framework for examining var gene expression, its regulation and switching. It was found that var gene switching occurs in situ. Ubiquitous transcription of all var gene variants appears to occur in early ring stages. However, var gene expression is tightly regulated in trophozoites and is exerted through a silencing mechanism. Transcriptional control is mutually exclusive in parasites that express defined adhesive phenotypes. In situ var gene switching is apparently mediated at the level of transcriptional initiation, as demonstrated by nuclear run-on analyses. Our results suggest that an epigenetic mechanism(s) is involved in var gene regulation. PMID:9736619

  19. Case-control approach application for finding a relationship between candidate genes and clinical mastitis in Holstein dairy cattle.

    PubMed

    Bagheri, Masoumeh; Moradi-Sharhrbabak, M; Miraie-Ashtiani, R; Safdari-Shahroudi, M; Abdollahi-Arpanahi, R

    2016-02-01

    Mastitis is a major source of economic loss in dairy herds. The objective of this research was to evaluate the association between genotypes within SLC11A1 and CXCR1 candidate genes and clinical mastitis in Holstein dairy cattle using the selective genotyping method. The data set contained clinical mastitis records of 3,823 Holstein cows from two Holstein dairy herds located in two different regions in Iran. Data included the number of cases of clinical mastitis per lactation. Selective genotyping was based on extreme values for clinical mastitis residuals (CMR) from mixed model analyses. Two extreme groups consisting of 135 cows were formed (as cases and controls), and genotyped for the two candidate genes, namely, SLC11A1 and CXCR1, using polymerase chain reaction-single strand conformation polymorphism (PCR-SSCP) and polymerase chain reaction-restriction fragment length polymorphism (PCR-RFLP), respectively. Associations between single nucleotide polymorphism (SNP) genotypes with CMR and breeding values for milk and protein yield were carried out by applying logistic regression analyses, i.e. estimating the probability of the heterogeneous genotype in the dependency of values for CMR and breeding values (BVs). The sequencing results revealed a novel mutation in 1139 bp of exon 11 of the SLC11A1 gene and this SNP had a significant association with CMR (P < 0.05). PCR-RFLP analysis leads to three banding patterns for CXCR1c.735C>G and these genotypes had significant relationships with CMR. Overall, the results showed that SLC11A1 and CXCR1 are valuable candidate genes for the improvement of mastitis resistance as well as production traits in dairy cattle populations.

  20. Genome-Wide Association Studies Identify CHRNA5/3 and HTR4 in the Development of Airflow Obstruction

    PubMed Central

    Shrine, Nick R. G.; Loehr, Laura R.; Zhao, Jing Hua; Manichaikul, Ani; Lopez, Lorna M.; Smith, Albert Vernon; Heckbert, Susan R.; Smolonska, Joanna; Tang, Wenbo; Loth, Daan W.; Curjuric, Ivan; Hui, Jennie; Latourelle, Jeanne C.; Henry, Amanda P.; Aldrich, Melinda; Bakke, Per; Beaty, Terri H.; Bentley, Amy R.; Borecki, Ingrid B.; Brusselle, Guy G.; Burkart, Kristin M.; Chen, Ting-hsu; Couper, David; Crapo, James D.; Davies, Gail; Dupuis, Josée; Franceschini, Nora; Gulsvik, Amund; Hancock, Dana B.; Harris, Tamara B.; Hofman, Albert; Imboden, Medea; James, Alan L.; Khaw, Kay-Tee; Lahousse, Lies; Launer, Lenore J.; Litonjua, Augusto; Liu, Yongmei; Lohman, Kurt K.; Lomas, David A.; Lumley, Thomas; Marciante, Kristin D.; McArdle, Wendy L.; Meibohm, Bernd; Morrison, Alanna C.; Musk, Arthur W.; Myers, Richard H.; North, Kari E.; Postma, Dirkje S.; Psaty, Bruce M.; Rich, Stephen S.; Rivadeneira, Fernando; Rochat, Thierry; Rotter, Jerome I.; Artigas, María Soler; Starr, John M.; Uitterlinden, André G.; Wareham, Nicholas J.; Wijmenga, Cisca; Zanen, Pieter; Province, Michael A.; Silverman, Edwin K.; Deary, Ian J.; Palmer, Lyle J.; Cassano, Patricia A.; Gudnason, Vilmundur; Barr, R. Graham; Loos, Ruth J. F.; Strachan, David P.; London, Stephanie J.; Boezen, H. Marike; Probst-Hensch, Nicole; Gharib, Sina A.; Hall, Ian P.; O’Connor, George T.; Tobin, Martin D.; Stricker, Bruno H.

    2012-01-01

    Rationale: Genome-wide association studies (GWAS) have identified loci influencing lung function, but fewer genes influencing chronic obstructive pulmonary disease (COPD) are known. Objectives: Perform meta-analyses of GWAS for airflow obstruction, a key pathophysiologic characteristic of COPD assessed by spirometry, in population-based cohorts examining all participants, ever smokers, never smokers, asthma-free participants, and more severe cases. Methods: Fifteen cohorts were studied for discovery (3,368 affected; 29,507 unaffected), and a population-based family study and a meta-analysis of case-control studies were used for replication and regional follow-up (3,837 cases; 4,479 control subjects). Airflow obstruction was defined as FEV1 and its ratio to FVC (FEV1/FVC) both less than their respective lower limits of normal as determined by published reference equations. Measurements and Main Results: The discovery meta-analyses identified one region on chromosome 15q25.1 meeting genome-wide significance in ever smokers that includes AGPHD1, IREB2, and CHRNA5/CHRNA3 genes. The region was also modestly associated among never smokers. Gene expression studies confirmed the presence of CHRNA5/3 in lung, airway smooth muscle, and bronchial epithelial cells. A single-nucleotide polymorphism in HTR4, a gene previously related to FEV1/FVC, achieved genome-wide statistical significance in combined meta-analysis. Top single-nucleotide polymorphisms in ADAM19, RARB, PPAP2B, and ADAMTS19 were nominally replicated in the COPD meta-analysis. Conclusions: These results suggest an important role for the CHRNA5/3 region as a genetic risk factor for airflow obstruction that may be independent of smoking and implicate the HTR4 gene in the etiology of airflow obstruction. PMID:22837378

  1. Polymorphisms within the prolactin and growth hormone/insulin-like growth factor-1 functional pathways associated with fertility traits in Holstein cows raised in a hot-humid climate.

    PubMed

    Leyva-Corona, Jose C; Reyna-Granados, Javier R; Zamorano-Algandar, Ricardo; Sanchez-Castro, Miguel A; Thomas, Milton G; Enns, R Mark; Speidel, Scott E; Medrano, Juan F; Rincon, Gonzalo; Luna-Nevarez, Pablo

    2018-06-20

    Prolactin (PRL), growth hormone (GH), and insulin-like growth factor-1 (IGF-1) are in hormone-response pathways involved in energy metabolism during thermoregulation processes in cattle. Objective herein was to study the association between single nucleotide polymorphisms (SNP) within genes of the PRL and GH/IGF-1 pathways with fertility traits such as services per conception (SPC) and days open (DO) in Holstein cattle lactating under a hot-humid climate. Ambient temperature and relative humidity were used to calculate the temperature-humidity index (THI) which revealed that the cows were exposed to heat stress conditions from June to November of 2012 in southern Sonora, Mexico. Individual blood samples from all cows were collected, spotted on FTA cards, and used to genotype a 179 tag SNP panel within 44 genes from the PRL and GH/IGF-1 pathways. The associative analyses among SNP genotypes and fertility traits were performed using mixed-effect models. Allele substitution effects were calculated using a regression model that included the genotype term as covariate. Single-SNP association analyses indicated that eight SNP within the genes IGF-1, IGF-1R, IGFBP5, PAPPA1, PMCH, PRLR, SOCS5, and SSTR2 were associated with SPC (P < 0.05), whereas four SNP in the genes GHR, PAPPA2, PRLR, and SOCS4 were associated with DO (P < 0.05). In conclusion, SNP within genes of the PRL and GH/IGF-1 pathways resulted as predictors of reproductive phenotypes in heat-stressed Holstein cows, and these SNP are proposed as candidates for a marker-assisted selection program intended to improve fertility of dairy cattle raised in warm climates.

  2. Detecting Antigen-Specific T Cell Responses: From Bulk Populations to Single Cells.

    PubMed

    Phetsouphanh, Chansavath; Zaunders, John James; Kelleher, Anthony Dominic

    2015-08-12

    A new generation of sensitive T cell-based assays facilitates the direct quantitation and characterization of antigen-specific T cell responses. Single-cell analyses have focused on measuring the quality and breadth of a response. Accumulating data from these studies demonstrate that there is considerable, previously-unrecognized, heterogeneity. Standard assays, such as the ICS, are often insufficient for characterization of rare subsets of cells. Enhanced flow cytometry with imaging capabilities enables the determination of cell morphology, as well as the spatial localization of the protein molecules within a single cell. Advances in both microfluidics and digital PCR have improved the efficiency of single-cell sorting and allowed multiplexed gene detection at the single-cell level. Delving further into the transcriptome of single-cells using RNA-seq is likely to reveal the fine-specificity of cellular events such as alternative splicing (i.e., splice variants) and allele-specific expression, and will also define the roles of new genes. Finally, detailed analysis of clonally related antigen-specific T cells using single-cell TCR RNA-seq will provide information on pathways of differentiation of memory T cells. With these state of the art technologies the transcriptomics and genomics of Ag-specific T cells can be more definitively elucidated.

  3. Detecting Antigen-Specific T Cell Responses: From Bulk Populations to Single Cells

    PubMed Central

    Phetsouphanh, Chansavath; Zaunders, John James; Kelleher, Anthony Dominic

    2015-01-01

    A new generation of sensitive T cell-based assays facilitates the direct quantitation and characterization of antigen-specific T cell responses. Single-cell analyses have focused on measuring the quality and breadth of a response. Accumulating data from these studies demonstrate that there is considerable, previously-unrecognized, heterogeneity. Standard assays, such as the ICS, are often insufficient for characterization of rare subsets of cells. Enhanced flow cytometry with imaging capabilities enables the determination of cell morphology, as well as the spatial localization of the protein molecules within a single cell. Advances in both microfluidics and digital PCR have improved the efficiency of single-cell sorting and allowed multiplexed gene detection at the single-cell level. Delving further into the transcriptome of single-cells using RNA-seq is likely to reveal the fine-specificity of cellular events such as alternative splicing (i.e., splice variants) and allele-specific expression, and will also define the roles of new genes. Finally, detailed analysis of clonally related antigen-specific T cells using single-cell TCR RNA-seq will provide information on pathways of differentiation of memory T cells. With these state of the art technologies the transcriptomics and genomics of Ag-specific T cells can be more definitively elucidated. PMID:26274954

  4. The complete chloroplast DNA sequence of the green alga Oltmannsiellopsis viridis reveals a distinctive quadripartite architecture in the chloroplast genome of early diverging ulvophytes

    PubMed Central

    Pombert, Jean-François; Lemieux, Claude; Turmel, Monique

    2006-01-01

    Background The phylum Chlorophyta contains the majority of the green algae and is divided into four classes. The basal position of the Prasinophyceae has been well documented, but the divergence order of the Ulvophyceae, Trebouxiophyceae and Chlorophyceae is currently debated. The four complete chloroplast DNA (cpDNA) sequences presently available for representatives of these classes have revealed extensive variability in overall structure, gene content, intron composition and gene order. The chloroplast genome of Pseudendoclonium (Ulvophyceae), in particular, is characterized by an atypical quadripartite architecture that deviates from the ancestral type by a large inverted repeat (IR) featuring an inverted rRNA operon and a small single-copy (SSC) region containing 14 genes normally found in the large single-copy (LSC) region. To gain insights into the nature of the events that led to the reorganization of the chloroplast genome in the Ulvophyceae, we have determined the complete cpDNA sequence of Oltmannsiellopsis viridis, a representative of a distinct, early diverging lineage. Results The 151,933 bp IR-containing genome of Oltmannsiellopsis differs considerably from Pseudendoclonium and other chlorophyte cpDNAs in intron content and gene order, but shares close similarities with its ulvophyte homologue at the levels of quadripartite architecture, gene content and gene density. Oltmannsiellopsis cpDNA encodes 105 genes, contains five group I introns, and features many short dispersed repeats. As in Pseudendoclonium cpDNA, the rRNA genes in the IR are transcribed toward the single copy region featuring the genes typically found in the ancestral LSC region, and the opposite single copy region harbours genes characteristic of both the ancestral SSC and LSC regions. The 52 genes that were transferred from the ancestral LSC to SSC region include 12 of those observed in Pseudendoclonium cpDNA. Surprisingly, the overall gene organization of Oltmannsiellopsis cpDNA more closely resembles that of Chlorella (Trebouxiophyceae) cpDNA. Conclusion The chloroplast genome of the last common ancestor of Oltmannsiellopsis and Pseudendoclonium contained a minimum of 108 genes, carried only a few group I introns, and featured a distinctive quadripartite architecture. Numerous changes were experienced by the chloroplast genome in the lineages leading to Oltmannsiellopsis and Pseudendoclonium. Our comparative analyses of chlorophyte cpDNAs support the notion that the Ulvophyceae is sister to the Chlorophyceae. PMID:16472375

  5. A recently transferred cluster of bacterial genes in Trichomonas vaginalis - lateral gene transfer and the fate of acquired genes

    PubMed Central

    2014-01-01

    Background Lateral Gene Transfer (LGT) has recently gained recognition as an important contributor to some eukaryote proteomes, but the mechanisms of acquisition and fixation in eukaryotic genomes are still uncertain. A previously defined norm for LGTs in microbial eukaryotes states that the majority are genes involved in metabolism, the LGTs are typically localized one by one, surrounded by vertically inherited genes on the chromosome, and phylogenetics shows that a broad collection of bacterial lineages have contributed to the transferome. Results A unique 34 kbp long fragment with 27 clustered genes (TvLF) of prokaryote origin was identified in the sequenced genome of the protozoan parasite Trichomonas vaginalis. Using a PCR based approach we confirmed the presence of the orthologous fragment in four additional T. vaginalis strains. Detailed sequence analyses unambiguously suggest that TvLF is the result of one single, recent LGT event. The proposed donor is a close relative to the firmicute bacterium Peptoniphilus harei. High nucleotide sequence similarity between T. vaginalis strains, as well as to P. harei, and the absence of homologs in other Trichomonas species, suggests that the transfer event took place after the radiation of the genus Trichomonas. Some genes have undergone pseudogenization and degradation, indicating that they may not be retained in the future. Functional annotations reveal that genes involved in informational processes are particularly prone to degradation. Conclusions We conclude that, although the majority of eukaryote LGTs are single gene occurrences, they may be acquired in clusters of several genes that are subsequently cleansed of evolutionarily less advantageous genes. PMID:24898731

  6. Streptococcus pneumoniae Supragenome Hybridization Arrays for Profiling of Genetic Content and Gene Expression.

    PubMed

    Kadam, Anagha; Janto, Benjamin; Eutsey, Rory; Earl, Joshua P; Powell, Evan; Dahlgren, Margaret E; Hu, Fen Z; Ehrlich, Garth D; Hiller, N Luisa

    2015-02-02

    There is extensive genomic diversity among Streptococcus pneumoniae isolates. Approximately half of the comprehensive set of genes in the species (the supragenome or pangenome) is present in all the isolates (core set), and the remaining is unevenly distributed among strains (distributed set). The Streptococcus pneumoniae Supragenome Hybridization (SpSGH) array provides coverage for an extensive set of genes and polymorphisms encountered within this species, capturing this genomic diversity. Further, the capture is quantitative. In this manner, the SpSGH array allows for both genomic and transcriptomic analyses of diverse S. pneumoniae isolates on a single platform. In this unit, we present the SpSGH array, and describe in detail its design and implementation for both genomic and transcriptomic analyses. The methodology can be applied to construction and modification of SpSGH array platforms, as well to other bacterial species as long as multiple whole-genome sequences are available that collectively capture the vast majority of the species supragenome. Copyright © 2015 John Wiley & Sons, Inc.

  7. Cassava genome from a wild ancestor to cultivated varieties

    PubMed Central

    Wang, Wenquan; Feng, Binxiao; Xiao, Jingfa; Xia, Zhiqiang; Zhou, Xincheng; Li, Pinghua; Zhang, Weixiong; Wang, Ying; Møller, Birger Lindberg; Zhang, Peng; Luo, Ming-Cheng; Xiao, Gong; Liu, Jingxing; Yang, Jun; Chen, Songbi; Rabinowicz, Pablo D.; Chen, Xin; Zhang, Hong-Bin; Ceballos, Henan; Lou, Qunfeng; Zou, Meiling; Carvalho, Luiz J.C.B.; Zeng, Changying; Xia, Jing; Sun, Shixiang; Fu, Yuhua; Wang, Haiyan; Lu, Cheng; Ruan, Mengbin; Zhou, Shuigeng; Wu, Zhicheng; Liu, Hui; Kannangara, Rubini Maya; Jørgensen, Kirsten; Neale, Rebecca Louise; Bonde, Maya; Heinz, Nanna; Zhu, Wenli; Wang, Shujuan; Zhang, Yang; Pan, Kun; Wen, Mingfu; Ma, Ping-An; Li, Zhengxu; Hu, Meizhen; Liao, Wenbin; Hu, Wenbin; Zhang, Shengkui; Pei, Jinli; Guo, Anping; Guo, Jianchun; Zhang, Jiaming; Zhang, Zhengwen; Ye, Jianqiu; Ou, Wenjun; Ma, Yaqin; Liu, Xinyue; Tallon, Luke J.; Galens, Kevin; Ott, Sandra; Huang, Jie; Xue, Jingjing; An, Feifei; Yao, Qingqun; Lu, Xiaojing; Fregene, Martin; López-Lavalle, L. Augusto Becerra; Wu, Jiajie; You, Frank M.; Chen, Meili; Hu, Songnian; Wu, Guojiang; Zhong, Silin; Ling, Peng; Chen, Yeyuan; Wang, Qinghuang; Liu, Guodao; Liu, Bin; Li, Kaimian; Peng, Ming

    2014-01-01

    Cassava is a major tropical food crop in the Euphorbiaceae family that has high carbohydrate production potential and adaptability to diverse environments. Here we present the draft genome sequences of a wild ancestor and a domesticated variety of cassava and comparative analyses with a partial inbred line. We identify 1,584 and 1,678 gene models specific to the wild and domesticated varieties, respectively, and discover high heterozygosity and millions of single-nucleotide variations. Our analyses reveal that genes involved in photosynthesis, starch accumulation and abiotic stresses have been positively selected, whereas those involved in cell wall biosynthesis and secondary metabolism, including cyanogenic glucoside formation, have been negatively selected in the cultivated varieties, reflecting the result of natural selection and domestication. Differences in microRNA genes and retrotransposon regulation could partly explain an increased carbon flux towards starch accumulation and reduced cyanogenic glucoside accumulation in domesticated cassava. These results may contribute to genetic improvement of cassava through better understanding of its biology. PMID:25300236

  8. Heritability and molecular-genetic basis of the P3 event-related brain potential: A genome-wide association study

    PubMed Central

    MALONE, STEPHEN M.; VAIDYANATHAN, UMA; BASU, SAONLI; MILLER, MICHAEL B.; MCGUE, MATT; IACONO, WILLIAM G.

    2014-01-01

    P3 amplitude is a candidate endophenotype for disinhibitory psychopathology, psychosis, and other disorders. The present study is a comprehensive analysis of the behavioral- and molecular-genetic basis of P3 amplitude and a P3 genetic factor score in a large community sample (N = 4,211) of adolescent twins and their parents, genotyped for 527,829 single nucleotide polymorphisms (SNPs). Biometric models indicated that as much as 65% of the variance in each measure was due to additive genes. All SNPs in aggregate accounted for approximately 40% to 50% of the heritable variance. However, analyses of individual SNPs did not yield any significant associations. Analyses of individual genes did not confirm previous associations between P3 amplitude and candidate genes but did yield a novel association with myelin expression factor 2 (MYEF2). Main effects of individual variants may be too small to be detected by GWAS without larger samples. PMID:25387705

  9. Does Marriage Moderate Genetic Effects on Delinquency and Violence?

    PubMed Central

    Li, Yi; Liu, Hexuan; Guo, Guang

    2015-01-01

    Using data from the National Longitudinal Study of Adolescent to Adult Health (N = 1,254), the authors investigated whether marriage can foster desistance from delinquency and violence by moderating genetic effects. In contrast to existing gene–environment research that typically focuses on one or a few genetic polymorphisms, they extended a recently developed mixed linear model to consider the collective influence of 580 single nucleotide polymorphisms in 64 genes related to aggression and risky behavior. The mixed linear model estimates the proportion of variance in the phenotype that is explained by the single nucleotide polymorphisms. The authors found that the proportion of variance in delinquency/violence explained was smaller among married individuals than unmarried individuals. Because selection, confounding, and heterogeneity may bias the estimate of the Gene × Marriage interaction, they conducted a series of analyses to address these issues. The findings suggest that the Gene × Marriage interaction results were not seriously affected by these issues. PMID:26549892

  10. Characterizing the “POAGome”: A bioinformatics-driven approach to primary open-angle glaucoma

    PubMed Central

    Danford, Ian D.; Verkuil, Lana D.; Choi, Daniel J.; Collins, David W.; Gudiseva, Harini V.; Uyhazi, Katherine E.; Lau, Marisa K.; Kanu, Levi N.; Grant, Gregory R.; Chavali, Venkata R.M.; O’Brien, Joan M.

    2017-01-01

    Primary open-angle glaucoma (POAG) is a genetically, physiologically, and phenotypically complex neurodegenerative disorder. This study addressed the expanding collection of genes associated with POAG, referred to as the “POAGome.” We used bioinformatics tools to perform an extensive, systematic literature search and compiled 542 genes with confirmed associations with POAG and its related phenotypes (normal tension glaucoma, ocular hypertension, juvenile open-angle glaucoma, and primary congenital glaucoma). The genes were classified according to their associated ocular tissues and phenotypes, and functional annotation and pathway analyses were subsequently performed. Our study reveals that no single molecular pathway can encompass the pathophysiology of POAG. The analyses suggested that inflammation and senescence may play pivotal roles in both the development and perpetuation of the retinal ganglion cell degeneration seen in POAG. The TGF-β signaling pathway was repeatedly implicated in our analyses, suggesting that it may be an important contributor to the manifestation of POAG in the anterior and posterior segments of the globe. We propose a molecular model of POAG revolving around TGF-β signaling, which incorporates the roles of inflammation and senescence in this disease. Finally, we highlight emerging molecular therapies that show promise for treating POAG. PMID:28223208

  11. Genetic diversity of Babesia bovis in virulent and attenuated strains.

    PubMed

    Mazuz, M L; Molad, T; Fish, L; Leibovitz, B; Wolkomirsky, R; Fleiderovitz, L; Shkap, V

    2012-03-01

    The aim of this study was to compare the genetic diversity of the single copy Bv80 gene sequences of Babesia bovis in populations of attenuated and virulent parasites. PCR/ RT-PCR followed by cloning and sequence analyses of 4 attenuated and 4 virulent strains were performed. Multiple fragments in the range of 420 to 744 bp were amplified by PCR or RT-PCR. Cloning of the PCR fragments and sequence analyses revealed the presence of mixed subpopulations in either virulent or attenuated parasites with a total of 19 variants with 12 different sequences that differed in number and type of tandem repeats. High levels of intra- and inter-strain diversity of the Bv80 gene, with the presence of mixed populations of parasites were found in both the virulent field isolates and the attenuated vaccine strains. In addition, during the attenuation process, sequence analyses showed changes in the pattern of the parasite subpopulations. Despite high polymorphism found by sequence analyses, the patterns observed and the number of repeats, order, or motifs found could not discriminate between virulent field isolates and attenuated vaccine strains of the parasite.

  12. Strain diversity and host specificity in bee gut symbionts revealed by deep sampling of single copy protein-coding sequences

    PubMed Central

    Powell, J. Elijah; Ratnayeke, Nalin; Moran, Nancy A.

    2017-01-01

    High throughput rRNA amplicon surveys of bacterial communities provide a rapid snapshot of taxonomic composition. But strains with nearly identical rRNA sequences often differ in gene repertoires and metabolic capabilities. To assess strain-level variation within Snodgrassella alvi, a gut symbiont of corbiculate bees, we performed deep sequencing on amplicons of a single copy coding gene (minD) as well as the 16S rDNA V4 region. We surveyed honey bees (Apis mellifera) sampled globally and 12 bumble bee species (Bombus) sampled from two regions of the USA. The minD analyses reveal that S. alvi contains far more strain diversity than is evident from 16S rDNA analysis. Many taxa inferred on the basis of 16S rDNA are shared between A. mellifera and Bombus species, but taxa inferred on the basis of minD are never shared and often are restricted to particular Bombus species. Clustering based on minD revealed that gut communities often reflect host species and geographic location. Both minD and 16S rDNA analyses indicate that strain diversity is higher in A. mellifera than in Bombus species. The minD locus flanks a 16S gene, enabling development of strain-specific 16S fluorescent probes to illuminate the spatial relationship of strains within the bee gut. PMID:27482856

  13. An autonomous molecular computer for logical control of gene expression

    PubMed Central

    Benenson, Yaakov; Gil, Binyamin; Ben-Dor, Uri; Adar, Rivka; Shapiro, Ehud

    2013-01-01

    Early biomolecular computer research focused on laboratory-scale, human-operated computers for complex computational problems1–7. Recently, simple molecular-scale autonomous programmable computers were demonstrated8–15 allowing both input and output information to be in molecular form. Such computers, using biological molecules as input data and biologically active molecules as outputs, could produce a system for ‘logical’ control of biological processes. Here we describe an autonomous biomolecular computer that, at least in vitro, logically analyses the levels of messenger RNA species, and in response produces a molecule capable of affecting levels of gene expression. The computer operates at a concentration of close to a trillion computers per microlitre and consists of three programmable modules: a computation module, that is, a stochastic molecular automaton12–17; an input module, by which specific mRNA levels or point mutations regulate software molecule concentrations, and hence automaton transition probabilities; and an output module, capable of controlled release of a short single-stranded DNA molecule. This approach might be applied in vivo to biochemical sensing, genetic engineering and even medical diagnosis and treatment. As a proof of principle we programmed the computer to identify and analyse mRNA of disease-related genes18–22 associated with models of small-cell lung cancer and prostate cancer, and to produce a single-stranded DNA molecule modelled after an anticancer drug. PMID:15116117

  14. Using Public Data for Comparative Proteome Analysis in Precision Medicine Programs.

    PubMed

    Hughes, Christopher S; Morin, Gregg B

    2018-03-01

    Maximizing the clinical utility of information obtained in longitudinal precision medicine programs would benefit from robust comparative analyses to known information to assess biological features of patient material toward identifying the underlying features driving their disease phenotype. Herein, the potential for utilizing publically deposited mass-spectrometry-based proteomics data to perform inter-study comparisons of cell-line or tumor-tissue materials is investigated. To investigate the robustness of comparison between MS-based proteomics studies carried out with different methodologies, deposited data representative of label-free (MS1) and isobaric tagging (MS2 and MS3 quantification) are utilized. In-depth quantitative proteomics data acquired from analysis of ovarian cancer cell lines revealed the robust recapitulation of observable gene expression dynamics between individual studies carried out using significantly different methodologies. The observed signatures enable robust inter-study clustering of cell line samples. In addition, the ability to classify and cluster tumor samples based on observed gene expression trends when using a single patient sample is established. With this analysis, relevant gene expression dynamics are obtained from a single patient tumor, in the context of a precision medicine analysis, by leveraging a large cohort of repository data as a comparator. Together, these data establish the potential for state-of-the-art MS-based proteomics data to serve as resources for robust comparative analyses in precision medicine applications. © 2017 WILEY-VCH Verlag GmbH & Co. KGaA, Weinheim.

  15. Association of gene polymorphisms in ABO blood group chromosomal regions and menstrual disorders

    PubMed Central

    SU, YONG; KONG, GUI-LIAN; SU, YA-LI; ZHOU, YAN; LV, LI-FANG; WANG, QIONG; HUANG, BAO-PING; ZHENG, RUI-ZHI; LI, QUAN-ZHONG; YUAN, HUI-JUAN; ZHAO, ZHI-GANG

    2015-01-01

    This study aimed to investigate whether single nucleotide polymorphisms (SNPs) located near the gene of the ABO blood group play an important role in the genetic aetiology of menstrual disorders (MDs). Polymerase chain reaction-ligase detection reaction technology was used to detect eight SNPs near the ABO gene location on the chromosomes in 250 cases of MD and 250 cases of normal menstruation. The differences in the distribution of each genotype, as well as the allele frequency in the normal and control groups, were analysed using Pearson's χ2 test to search for disease-associated loci. SHEsis software was used to analyse the linkage disequilibrium and haplotype frequencies and to inspect the correlation between haplotypes and the disease. Compared with the control group, the experimental group exhibited statistically significant differences in the genotype distribution frequencies of the rs657152 locus of the ABO blood group gene and the rs17250673 locus of the tumour necrosis factor cofactor 2 (TRAF2) gene, which is located downstream of the ABO gene. The allele distribution frequencies of rs657152 and rs495828 loci in the ABO blood group gene exhibited significant differences between the groups. Dominant and recessive genetic model analysis of each locus revealed that the experimental group exhibited statistically significant differences from the control group in the genotype distribution frequencies of rs657152 and rs495828 loci, respectively. These results indicate that the ABO blood group gene and TRAF2 gene may be a cause of MDs. PMID:26136981

  16. Genome-Wide Gene-Sodium Interaction Analyses on Blood Pressure: The Genetic Epidemiology Network of Salt-Sensitivity Study.

    PubMed

    Li, Changwei; He, Jiang; Chen, Jing; Zhao, Jinying; Gu, Dongfeng; Hixson, James E; Rao, Dabeeru C; Jaquish, Cashell E; Gu, Charles C; Chen, Jichun; Huang, Jianfeng; Chen, Shufeng; Kelly, Tanika N

    2016-08-01

    We performed genome-wide analyses to identify genomic loci that interact with sodium to influence blood pressure (BP) using single-marker-based (1 and 2 df joint tests) and gene-based tests among 1876 Chinese participants of the Genetic Epidemiology Network of Salt-Sensitivity (GenSalt) study. Among GenSalt participants, the average of 3 urine samples was used to estimate sodium excretion. Nine BP measurements were taken using a random zero sphygmomanometer. A total of 2.05 million single-nucleotide polymorphisms were imputed using Affymetrix 6.0 genotype data and the Chinese Han of Beijing and Japanese of Tokyo HapMap reference panel. Promising findings (P<1.00×10(-4)) from GenSalt were evaluated for replication among 775 Chinese participants of the Multi-Ethnic Study of Atherosclerosis (MESA). Single-nucleotide polymorphism and gene-based results were meta-analyzed across the GenSalt and MESA studies to determine genome-wide significance. The 1 df tests identified interactions for UST rs13211840 on diastolic BP (P=3.13×10(-9)). The 2 df tests additionally identified associations for CLGN rs2567241 (P=3.90×10(-12)) and LOC105369882 rs11104632 (P=4.51×10(-8)) with systolic BP. The CLGN variant rs2567241 was also associated with diastolic BP (P=3.11×10(-22)) and mean arterial pressure (P=2.86×10(-15)). Genome-wide gene-based analysis identified MKNK1 (P=6.70×10(-7)), C2orf80 (P<1.00×10(-12)), EPHA6 (P=2.88×10(-7)), SCOC-AS1 (P=4.35×10(-14)), SCOC (P=6.46×10(-11)), CLGN (P=3.68×10(-13)), MGAT4D (P=4.73×10(-11)), ARHGAP42 (P≤1.00×10(-12)), CASP4 (P=1.31×10(-8)), and LINC01478 (P=6.75×10(-10)) that were associated with at least 1 BP phenotype. In summary, we identified 8 novel and 1 previously reported BP loci through the examination of single-nucleotide polymorphism and gene-based interactions with sodium. © 2016 American Heart Association, Inc.

  17. Using a multi-gene approach to infer the complicated phylogeny and evolutionary history of lorises (Order Primates: Family Lorisidae).

    PubMed

    Munds, Rachel A; Titus, Chelsea L; Eggert, Lori S; Blomquist, Gregory E

    2018-05-25

    Extensive phylogenetic studies have found robust phylogenies are modeled by using a multi-gene approach and sampling from the majority of the taxa of interest. Yet, molecular studies focused on the lorises, a cryptic primate family, have often relied on one gene, or just mitochondrial DNA, and many were unable to include all four genera in the analyses, resulting in inconclusive phylogenies. Past phylogenetic loris studies resulted in lorises being monophyletic, paraphyletic, or an unresolvable trichotomy with the closely related galagos. The purpose of our study is to improve our understanding of loris phylogeny and evolutionary history by using a multi-gene approach. We used the mitochondrial genes cytochrome b, and cytochrome c oxidase subunit 1, along with a nuclear intron (recombination activating gene 2) and nuclear exon (the melanocortin 1 receptor). Maximum Likelihood and Bayesian phylogenetic analyses were conducted based on data from each locus, as well as on the concatenated sequences. The robust, concatenated results found lorises to be a monophyletic family (Lorisidae) (PP ≥ 0.99) with two distinct subfamilies: the African Perodictinae (PP ≥ 0.99) and the Asian Lorisinae (PP ≥ 0.99). Additionally, from these analyses all four genera were all recovered as monophyletic (PP ≥ 0.99). Some of our single-gene analyses recovered monophyly, but many had discordances, with some showing paraphyly or a deep-trichotomy. Bayesian partitioned analyses inferred the most recent common ancestors of lorises emerged ∼42 ± 6 million years ago (mya), the Asian Lorisinae separated ∼30 ± 9 mya, and Perodictinae arose ∼26 ± 10 mya. These times fit well with known historical tectonic shifts of the area, as well as with the sparse loris fossil record. Additionally, our results agree with previous multi-gene studies on Lorisidae which found lorises to be monophyletic and arising ∼40 mya (Perelman et al., 2011; Pozzi et al., 2014). By taking a multi-gene approach, we were able to recover a well-supported, monophyletic loris phylogeny and inferred the evolutionary history of this cryptic family. Copyright © 2018 Elsevier Inc. All rights reserved.

  18. Sequence variations of the human MPDZ gene and association with alcoholism in subjects with European ancestry.

    PubMed

    Karpyak, Victor M; Kim, Jeong-Hyun; Biernacka, Joanna M; Wieben, Eric D; Mrazek, David A; Black, John L; Choi, Doo-Sup

    2009-04-01

    Mpdz gene variations are known contributors of acute alcohol withdrawal severity and seizures in mice. To investigate the relevance of these findings for human alcoholism, we resequenced 46 exons, exon-intron boundaries, and 2 kilobases in the 5' region of the human MPDZ gene in 61 subjects with a history of alcohol withdrawal seizures (AWS), 59 subjects with a history of alcohol withdrawal without AWS, and 64 Coriell samples from self-reported nonalcoholic subjects [all European American (EA) ancestry] and compared with the Mpdz sequences of 3 mouse strains with different propensity to AWS. To explore potential associations of the human MPDZ gene with alcoholism and AWS, single SNP and haplotype analyses were performed using 13 common variants. Sixty-seven new, mostly rare variants were discovered in the human MPDZ gene. Sequence comparison revealed that the human gene does not have variations identical to those comprising Mpdz gene haplotype associated with AWS in mice. We also found no significant association between MPDZ haplotypes and AWS in humans. However, a global test of haplotype association revealed a significant difference in haplotype frequencies between alcohol-dependent subjects without AWS and Coriell controls (p = 0.015), suggesting a potential role of MPDZ in alcoholism and/or related phenotypes other than AWS. Haplotype-specific tests for the most common haplotypes (frequency > 0.05), revealed a specific high-risk haplotype (p = 0.006, maximum statistic p = 0.051), containing rs13297480G allele also found to be significantly more prevalent in alcoholics without AWS compared with nonalcoholic Coriell subjects (p = 0.019). Sequencing of MPDZ gene in individuals with EA ancestry revealed no variations in the sites identical to those associated with AWS in mice. Exploratory haplotype and single SNP association analyses suggest a possible association between the MPDZ gene and alcohol dependence but not AWS. Further functional genomic analysis of MPDZ variants and investigation of their association with a broader array of alcoholism-related phenotypes could reveal additional genetic markers of alcoholism.

  19. Phylogeny of the bears (Ursidae) based on nuclear and mitochondrial genes.

    PubMed

    Yu, Li; Li, Qing-wei; Ryder, O A; Zhang, Ya-ping

    2004-08-01

    The taxomic classification and phylogenetic relationships within the bear family remain argumentative subjects in recent years. Prior investigation has been concentrated on the application of different mitochondrial (mt) sequence data, herein we employ two nuclear single-copy gene segments, the partial exon 1 from gene encoding interphotoreceptor retinoid binding protein (IRBP) and the complete intron 1 from transthyretin (TTR) gene, in conjunction with previously published mt data, to clarify these enigmatic problems. The combined analyses of nuclear IRBP and TTR datasets not only corroborated prior hypotheses, positioning the spectacled bear most basally and grouping the brown and polar bear together but also provided new insights into the bear phylogeny, suggesting the sister-taxa association of sloth bear and sun bear with strong support. Analyses based on combination of nuclear and mt genes differed from nuclear analysis in recognizing the sloth bears as the earliest diverging species among the subfamily ursine representatives while the exact placement of the sun bear did not resolved. Asiatic and American black bears clustered as sister group in all analyses with moderate levels of bootstrap support and high posterior probabilities. Comparisons between the nuclear and mtDNA findings suggested that our combined nuclear dataset have the resolving power comparable to mtDNA dataset for the phylogenetic interpretation of the bear family. As can be seen from present study, the unanimous phylogeny for this recently derived family was still not produced and additional independent genetic markers were in need.

  20. Association study of 21 circadian genes with bipolar I disorder, schizoaffective disorder, and schizophrenia

    PubMed Central

    Mansour, Hader A; Talkowski, Michael E; Wood, Joel; Chowdari, Kodavali V; McClain, Lora; Prasad, Konasale; Montrose, Debra; Fagiolini, Andrea; Friedman, Edward S; Allen, Michael H; Bowden, Charles L; Calabrese, Joseph; El-Mallakh, Rif S; Escamilla, Michael; Faraone, Stephen V; Fossey, Mark D; Gyulai, Laszlo; Loftis, Jennifer M; Hauser, Peter; Ketter, Terence A; Marangell, Lauren B; Miklowitz, David J; Nierenberg, Andrew A; Patel, Jayendra; Sachs, Gary S; Sklar, Pamela; Smoller, Jordan W; Laird, Nan; Keshavan, Matcheri; Thase, Michael E; Axelson, David; Birmaher, Boris; Lewis, David; Monk, Tim; Frank, Ellen; Kupfer, David J; Devlin, Bernie; Nimgaonkar, Vishwajit L

    2012-01-01

    Objective Published studies suggest associations between circadian gene polymorphisms and bipolar I disorder (BPI), as well as schizoaffective disorder (SZA) and schizophrenia (SZ). The results are plausible, based on prior studies of circadian abnormalities. As replications have not been attempted uniformly, we evaluated representative, common polymorphisms in all three disorders. Methods We assayed 276 publicly available ‘tag’ single nucleotide polymorphisms (SNPs) at 21 circadian genes among 523 patients with BPI, 527 patients with SZ/SZA, and 477 screened adult controls. Detected associations were evaluated in relation to two published genome-wide association studies (GWAS). Results Using gene-based tests, suggestive associations were noted between EGR3 and BPI (p = 0.017), and between NPAS2 and SZ/SZA (p = 0.034). Three SNPs were associated with both sets of disorders (NPAS2: rs13025524 and rs11123857; RORB: rs10491929; p < 0.05). None of the associations remained significant following corrections for multiple comparisons. Approximately 15% of the analyzed SNPs overlapped with an independent study that conducted GWAS for BPI; suggestive overlap between the GWAS analyses and ours was noted at ARNTL. Conclusions Several suggestive, novel associations were detected with circadian genes and BPI and SZ/SZA, but the present analyses do not support associations with common polymorphisms that confer risk with odds ratios greater than 1.5. Additional analyses using adequately powered samples are warranted to further evaluate these results. PMID:19839995

  1. Rare copy number variants in patients with congenital conotruncal heart defects.

    PubMed

    Xie, Hongbo M; Werner, Petra; Stambolian, Dwight; Bailey-Wilson, Joan E; Hakonarson, Hakon; White, Peter S; Taylor, Deanne M; Goldmuntz, Elizabeth

    2017-03-01

    Previous studies using different cardiac phenotypes, technologies and designs suggest a burden of large, rare or de novo copy number variants (CNVs) in subjects with congenital heart defects. We sought to identify disease-related CNVs, candidate genes, and functional pathways in a large number of cases with conotruncal and related defects that carried no known genetic syndrome. Cases and control samples were divided into two cohorts and genotyped to assess each subject's CNV content. Analyses were performed to ascertain differences in overall CNV prevalence and to identify enrichment of specific genes and functional pathways in conotruncal cases relative to healthy controls. Only findings present in both cohorts are presented. From 973 total conotruncal cases, a burden of rare CNVs was detected in both cohorts. Candidate genes from rare CNVs found in both cohorts were identified based on their association with cardiac development or disease, and/or their reported disruption in published studies. Functional and pathway analyses revealed significant enrichment of terms involved in either heart or early embryonic development. Our study tested one of the largest cohorts specifically with cardiac conotruncal and related defects. These results confirm and extend previous findings that CNVs contribute to disease risk for congenital heart defects in general and conotruncal defects in particular. As disease heterogeneity renders identification of single recurrent genes or loci difficult, functional pathway and gene regulation network analyses appear to be more informative. Birth Defects Research 109:271-295, 2017. © 2017 Wiley Periodicals, Inc. © 2017 Wiley Periodicals, Inc.

  2. A genome-wide association study of corneal astigmatism: The CREAM Consortium

    PubMed Central

    Shah, Rupal L.; Li, Qing; Zhao, Wanting; Tedja, Milly S.; Tideman, J. Willem L.; Khawaja, Anthony P.; Fan, Qiao; Yazar, Seyhan; Williams, Katie M.; Verhoeven, Virginie J.M.; Xie, Jing; Wang, Ya Xing; Hess, Moritz; Nickels, Stefan; Lackner, Karl J.; Pärssinen, Olavi; Wedenoja, Juho; Biino, Ginevra; Concas, Maria Pina; Uitterlinden, André; Rivadeneira, Fernando; Jaddoe, Vincent W.V.; Hysi, Pirro G.; Sim, Xueling; Tan, Nicholas; Tham, Yih-Chung; Sensaki, Sonoko; Hofman, Albert; Vingerling, Johannes R.; Jonas, Jost B.; Mitchell, Paul; Hammond, Christopher J.; Höhn, René; Baird, Paul N.; Wong, Tien-Yin; Cheng, Chinfsg-Yu; Teo, Yik Ying; Mackey, David A.; Williams, Cathy; Saw, Seang-Mei; Klaver, Caroline C.W.; Bailey-Wilson, Joan E.

    2018-01-01

    Purpose To identify genes and genetic markers associated with corneal astigmatism. Methods A meta-analysis of genome-wide association studies (GWASs) of corneal astigmatism undertaken for 14 European ancestry (n=22,250) and 8 Asian ancestry (n=9,120) cohorts was performed by the Consortium for Refractive Error and Myopia. Cases were defined as having >0.75 diopters of corneal astigmatism. Subsequent gene-based and gene-set analyses of the meta-analyzed results of European ancestry cohorts were performed using VEGAS2 and MAGMA software. Additionally, estimates of single nucleotide polymorphism (SNP)-based heritability for corneal and refractive astigmatism and the spherical equivalent were calculated for Europeans using LD score regression. Results The meta-analysis of all cohorts identified a genome-wide significant locus near the platelet-derived growth factor receptor alpha (PDGFRA) gene: top SNP: rs7673984, odds ratio=1.12 (95% CI:1.08–1.16), p=5.55×10−9. No other genome-wide significant loci were identified in the combined analysis or European/Asian ancestry-specific analyses. Gene-based analysis identified three novel candidate genes for corneal astigmatism in Europeans—claudin-7 (CLDN7), acid phosphatase 2, lysosomal (ACP2), and TNF alpha-induced protein 8 like 3 (TNFAIP8L3). Conclusions In addition to replicating a previously identified genome-wide significant locus for corneal astigmatism near the PDGFRA gene, gene-based analysis identified three novel candidate genes, CLDN7, ACP2, and TNFAIP8L3, that warrant further investigation to understand their role in the pathogenesis of corneal astigmatism. The much lower number of genetic variants and genes demonstrating an association with corneal astigmatism compared to published spherical equivalent GWAS analyses suggest a greater influence of rare genetic variants, non-additive genetic effects, or environmental factors in the development of astigmatism. PMID:29422769

  3. Hypervirulent Chlamydia trachomatis Clinical Strain Is a Recombinant between Lymphogranuloma Venereum (L2) and D Lineages

    PubMed Central

    Somboonna, Naraporn; Wan, Raymond; Ojcius, David M.; Pettengill, Matthew A.; Joseph, Sandeep J.; Chang, Alexander; Hsu, Ray; Read, Timothy D.; Dean, Deborah

    2011-01-01

    ABSTRACT Chlamydia trachomatis is an obligate intracellular bacterium that causes a diversity of severe and debilitating diseases worldwide. Sporadic and ongoing outbreaks of lymphogranuloma venereum (LGV) strains among men who have sex with men (MSM) support the need for research on virulence factors associated with these organisms. Previous analyses have been limited to single genes or genomes of laboratory-adapted reference strain L2/434 and outbreak strain L2b/UCH-1/proctitis. We characterized an unusual LGV strain, termed L2c, isolated from an MSM with severe hemorrhagic proctitis. L2c developed nonfusing, grape-like inclusions and a cytotoxic phenotype in culture, unlike the LGV strains described to date. Deep genome sequencing revealed that L2c was a recombinant of L2 and D strains with conserved clustered regions of genetic exchange, including a 78-kb region and a partial, yet functional, toxin gene that was lost with prolonged culture. Indels (insertions/deletions) were discovered in an ftsK gene promoter and in the tarp and hctB genes, which encode key proteins involved in replication, inclusion formation, and histone H1-like protein activity, respectively. Analyses suggest that these indels affect gene and/or protein function, supporting the in vitro and disease phenotypes. While recombination has been known to occur for C. trachomatis based on gene sequence analyses, we provide the first whole-genome evidence for recombination between a virulent, invasive LGV strain and a noninvasive common urogenital strain. Given the lack of a genetic system for producing stable C. trachomatis mutants, identifying naturally occurring recombinants can clarify gene function and provide opportunities for discovering avenues for genomic manipulation. PMID:21540364

  4. Estrogen pathway polymorphisms in relation to primary open angle glaucoma: An analysis accounting for gender from the United States

    PubMed Central

    Loomis, Stephanie J.; Weinreb, Robert N.; Kang, Jae H.; Yaspan, Brian L.; Bailey, Jessica Cooke; Gaasterland, Douglas; Gaasterland, Terry; Lee, Richard K.; Scott, William K.; Lichter, Paul R.; Budenz, Donald L.; Liu, Yutao; Realini, Tony; Friedman, David S.; McCarty, Catherine A.; Moroi, Sayoko E.; Olson, Lana; Schuman, Joel S.; Singh, Kuldev; Vollrath, Douglas; Wollstein, Gadi; Zack, Donald J.; Brilliant, Murray; Sit, Arthur J.; Christen, William G.; Fingert, John; Kraft, Peter; Zhang, Kang; Allingham, R. Rand; Pericak-Vance, Margaret A.; Richards, Julia E.; Hauser, Michael A.; Haines, Jonathan L.; Wiggs, Janey L.

    2013-01-01

    Purpose Circulating estrogen levels are relevant in glaucoma phenotypic traits. We assessed the association between an estrogen metabolism single nucleotide polymorphism (SNP) panel in relation to primary open angle glaucoma (POAG), accounting for gender. Methods We included 3,108 POAG cases and 3,430 controls of both genders from the Glaucoma Genes and Environment (GLAUGEN) study and the National Eye Institute Glaucoma Human Genetics Collaboration (NEIGHBOR) consortium genotyped on the Illumina 660W-Quad platform. We assessed the relation between the SNP panels representative of estrogen metabolism and POAG using pathway- and gene-based approaches with the Pathway Analysis by Randomization Incorporating Structure (PARIS) software. PARIS executes a permutation algorithm to assess statistical significance relative to the pathways and genes of comparable genetic architecture. These analyses were performed using the meta-analyzed results from the GLAUGEN and NEIGHBOR data sets. We evaluated POAG overall as well as two subtypes of POAG defined as intraocular pressure (IOP) ≥22 mmHg (high-pressure glaucoma [HPG]) or IOP <22 mmHg (normal pressure glaucoma [NPG]) at diagnosis. We conducted these analyses for each gender separately and then jointly in men and women. Results Among women, the estrogen SNP pathway was associated with POAG overall (permuted p=0.006) and HPG (permuted p<0.001) but not NPG (permuted p=0.09). Interestingly, there was no relation between the estrogen SNP pathway and POAG when men were considered alone (permuted p>0.99). Among women, gene-based analyses revealed that the catechol-O-methyltransferase gene showed strong associations with HTG (permuted gene p≤0.001) and NPG (permuted gene p=0.01). Conclusions The estrogen SNP pathway was associated with POAG among women. PMID:23869166

  5. Molecular phylogenetic and scanning electron microscopical analyses places the Choanephoraceae and the Gilbertellaceae in a monophyletic group within the Mucorales (Zygomycetes, Fungi).

    PubMed

    Voigt, Kerstin; Olsson, L

    2008-09-01

    A multi-gene genealogy based on maximum parsimony and distance analyses of the exonic genes for actin (act) and translation elongation factor 1 alpha (tef), the nuclear genes for the small (18S) and large (28S) subunit ribosomal RNA (comprising 807, 1092, 1863, 389 characters, respectively) of all 50 genera of the Mucorales (Zygomycetes) suggests that the Choanephoraceae is a monophyletic group. The monotypic Gilbertellaceae appears in close phylogenetic relatedness to the Choanephoraceae. The monophyly of the Choanephoraceae has moderate to strong support (bootstrap proportions 67% and 96% in distance and maximum parsimony analyses, respectively), whereas the monophyly of the Choanephoraceae-Gilbertellaceae clade is supported by high bootstrap values (100% and 98%). This suggests that the two families can be joined into one family, which leads to the elimination of the Gilbertellaceae as a separate family. In order to test this hypothesis single-locus neighbor-joining analyses were performed on nuclear genes of the 18S, 5.8S, 28S and internal transcribed spacer (ITS) 1 ribosomal RNA and the translation elongation factor 1 alpha (tef) and beta tubulin (betatub) nucleotide sequences. The common monophyletic origin of the Choanephoraceae-Gilbertellaceae clade could be confirmed in all gene trees and by investigation of their ultrastructure. Sporangia with persistent, sutured walls splitting in half at maturity and ellipsoidal sporangiospores with striated ornamentations and polar ciliate appendages arising from spores in persistent sporangia and dehiscent sporangiola represent synapomorphic characters of this group. We discuss our data in the context of the historical development of their taxonomy and physiology and propose a reduction of the two families to one family, the Choanephoraceae sensu lato comprising species which are facultative plant pathogens and parasites, especially in subtropical to tropical regions.

  6. The moderating effect of ANKK1 on the association of family environment with longitudinal executive function following traumatic brain injury in early childhood: A preliminary study.

    PubMed

    Smith-Paine, Julia; Wade, Shari L; Treble-Barna, Amery; Zhang, Nanhua; Zang, Huaiyu; Martin, Lisa J; Yeates, Keith Owen; Taylor, H Gerry; Kurowski, Brad G

    2018-05-02

    This study examined whether the ankyrin repeat and kinase domain containing 1 gene (ANKK1) C/T single-nucleotide polymorphism (SNP) rs1800497 moderated the association of family environment with long-term executive function (EF) following traumatic injury in early childhood. Caregivers of children with traumatic brain injury (TBI) and children with orthopedic injury (OI) completed the Behavior Rating Inventory of Executive Function (BRIEF) at post injury visits. DNA was collected to identify the rs1800497 genotype in the ANKK1 gene. General linear models examined gene-environment interactions as moderators of the effects of TBI on EF at two times post injury (12 months and 7 years). At 12 months post injury, analyses revealed a significant 3-way interaction of genotype with level of permissive parenting and injury type. Post-hoc analyses showed genetic effects were more pronounced for children with TBI from more positive family environments, such that children with TBI who were carriers of the risk allele (T-allele) had significantly poorer EF compared to non-carriers only when they were from more advantaged environments. At 7 years post injury, analyses revealed a significant 2-way interaction of genotype with level of authoritarian parenting. Post-hoc analyses found that carriers of the risk allele had significantly poorer EF compared to non-carriers only when they were from more advantaged environments. These results suggest a gene-environment interaction involving the ANKK1 gene as a predictor of EF in a pediatric injury population. The findings highlight the importance of considering environmental influences in future genetic studies on recovery following TBI and other traumatic injuries in childhood.

  7. Vibrio cholerae biofilm growth program and architecture revealed by single-cell live imaging

    PubMed Central

    Yan, Jing; Sharo, Andrew G.; Stone, Howard A.; Wingreen, Ned S.; Bassler, Bonnie L.

    2016-01-01

    Biofilms are surface-associated bacterial communities that are crucial in nature and during infection. Despite extensive work to identify biofilm components and to discover how they are regulated, little is known about biofilm structure at the level of individual cells. Here, we use state-of-the-art microscopy techniques to enable live single-cell resolution imaging of a Vibrio cholerae biofilm as it develops from one single founder cell to a mature biofilm of 10,000 cells, and to discover the forces underpinning the architectural evolution. Mutagenesis, matrix labeling, and simulations demonstrate that surface adhesion-mediated compression causes V. cholerae biofilms to transition from a 2D branched morphology to a dense, ordered 3D cluster. We discover that directional proliferation of rod-shaped bacteria plays a dominant role in shaping the biofilm architecture in V. cholerae biofilms, and this growth pattern is controlled by a single gene, rbmA. Competition analyses reveal that the dense growth mode has the advantage of providing the biofilm with superior mechanical properties. Our single-cell technology can broadly link genes to biofilm fine structure and provides a route to assessing cell-to-cell heterogeneity in response to external stimuli. PMID:27555592

  8. Gene Duplication, Population Genomics, and Species-Level Differentiation within a Tropical Mountain Shrub

    PubMed Central

    Mastretta-Yanes, Alicia; Zamudio, Sergio; Jorgensen, Tove H.; Arrigo, Nils; Alvarez, Nadir; Piñero, Daniel; Emerson, Brent C.

    2014-01-01

    Gene duplication leads to paralogy, which complicates the de novo assembly of genotyping-by-sequencing (GBS) data. The issue of paralogous genes is exacerbated in plants, because they are particularly prone to gene duplication events. Paralogs are normally filtered from GBS data before undertaking population genomics or phylogenetic analyses. However, gene duplication plays an important role in the functional diversification of genes and it can also lead to the formation of postzygotic barriers. Using populations and closely related species of a tropical mountain shrub, we examine 1) the genomic differentiation produced by putative orthologs, and 2) the distribution of recent gene duplication among lineages and geography. We find high differentiation among populations from isolated mountain peaks and species-level differentiation within what is morphologically described as a single species. The inferred distribution of paralogs among populations is congruent with taxonomy and shows that GBS could be used to examine recent gene duplication as a source of genomic differentiation of nonmodel species. PMID:25223767

  9. Strain diversity and host specificity in a specialized gut symbiont of honeybees and bumblebees.

    PubMed

    Powell, Elijah; Ratnayeke, Nalin; Moran, Nancy A

    2016-09-01

    Host-restricted lineages of gut bacteria often include many closely related strains, but this fine-scale diversity is rarely investigated. The specialized gut symbiont Snodgrassella alvi has codiversified with honeybees (Apis mellifera) and bumblebees (Bombus) for millions of years. Snodgrassella alvi strains are nearly identical for 16S rRNA gene sequences but have distinct gene repertoires potentially affecting host biology and community interactions. We examined S. alvi strain diversity within and between hosts using deep sequencing both of a single-copy coding gene (minD) and of the V4 region of the 16S rRNA gene. We sampled workers from domestic and feral A. mellifera colonies and wild-caught Bombus representing 14 species. Conventional analyses of community profiles, based on the V4 region of the 16S rRNA gene, failed to expose most strain variation. In contrast, the minD analysis revealed extensive strain variation within and between host species and individuals. Snodgrassella alvi strain diversity is significantly higher in A. mellifera than in Bombus, supporting the hypothesis that colony founding by swarms of workers enables retention of more diversity than colony founding by a single queen. Most Bombus individuals (72%) are dominated by a single S. alvi strain, whereas most A. mellifera (86%) possess multiple strains. No S. alvi strains are shared between A. mellifera and Bombus, indicating some host specificity. Among Bombus-restricted strains, some are restricted to a single host species or subgenus, while others occur in multiple subgenera. Findings demonstrate that strains diversify both within and between host species and can be highly specific or relatively generalized in their host associations. © 2016 John Wiley & Sons Ltd.

  10. Genetic diagnosis of Duchenne and Becker muscular dystrophy using next-generation sequencing technology: comprehensive mutational search in a single platform.

    PubMed

    Lim, Byung Chan; Lee, Seungbok; Shin, Jong-Yeon; Kim, Jong-Il; Hwang, Hee; Kim, Ki Joong; Hwang, Yong Seung; Seo, Jeong-Sun; Chae, Jong Hee

    2011-11-01

    Duchenne muscular dystrophy or Becker muscular dystrophy might be a suitable candidate disease for application of next-generation sequencing in the genetic diagnosis because the complex mutational spectrum and the large size of the dystrophin gene require two or more analytical methods and have a high cost. The authors tested whether large deletions/duplications or small mutations, such as point mutations or short insertions/deletions of the dystrophin gene, could be predicted accurately in a single platform using next-generation sequencing technology. A custom solution-based target enrichment kit was designed to capture whole genomic regions of the dystrophin gene and other muscular-dystrophy-related genes. A multiplexing strategy, wherein four differently bar-coded samples were captured and sequenced together in a single lane of the Illumina Genome Analyser, was applied. The study subjects were 25 16 with deficient dystrophin expression without a large deletion/duplication and 9 with a known large deletion/duplication. Nearly 100% of the exonic region of the dystrophin gene was covered by at least eight reads with a mean read depth of 107. Pathogenic small mutations were identified in 15 of the 16 patients without a large deletion/duplication. Using these 16 patients as the standard, the authors' method accurately predicted the deleted or duplicated exons in the 9 patients with known mutations. Inclusion of non-coding regions and paired-end sequence analysis enabled accurate identification by increasing the read depth and providing information about the breakpoint junction. The current method has an advantage for the genetic diagnosis of Duchenne muscular dystrophy and Becker muscular dystrophy wherein a comprehensive mutational search may be feasible using a single platform.

  11. Complete chloroplast genome sequences of Drimys, Liriodendron, andPiper: Implications for the phylogeny of magnoliids and the evolution ofGC content

    DOE Office of Scientific and Technical Information (OSTI.GOV)

    Zhengqiu, C.; Penaflor, C.; Kuehl, J.V.

    2006-06-01

    The magnoliids represent the largest basal angiosperm clade with four orders, 19 families and 8,500 species. Although several recent angiosperm molecular phylogenies have supported the monophyly of magnoliids and suggested relationships among the orders, the limited number of genes examined resulted in only weak support, and these issues remain controversial. Furthermore, considerable incongruence has resulted in phylogenies supporting three different sets of relationships among magnoliids and the two large angiosperm clades, monocots and eudicots. This is one of the most important remaining issues concerning relationships among basal angiosperms. We sequenced the chloroplast genomes of three magnoliids, Drimys (Canellales), Liriodendron (Magnoliales),more » and Piper (Piperales), and used these data in combination with 32 other completed angiosperm chloroplast genomes to assess phylogenetic relationships among magnoliids. The Drimys and Piper chloroplast genomes are nearly identical in size at 160,606 and 160,624 bp, respectively. The genomes include a pair of inverted repeats of 26,649 bp (Drimys) and 27,039 (Piper), separated by a small single copy region of 18,621 (Drimys) and 18,878 (Piper) and a large single copy region of 88,685 bp (Drimys) and 87,666 bp (Piper). The gene order of both taxa is nearly identical to many other unrearranged angiosperm chloroplast genomes, including Calycanthus, the other published magnoliid genome. Comparisons of angiosperm chloroplast genomes indicate that GC content is not uniformly distributed across the genome. Overall GC content ranges from 34-39%, and coding regions have a substantially higher GC content than non-coding regions (both intergenic spacers and introns). Among protein-coding genes, GC content varies by codon position with 1st codon > 2nd codon > 3rd codon, and it varies by functional group with photosynthetic genes having the highest percentage and NADH genes the lowest. Across the genome, GC content is highest in the inverted repeat due to the presence of rRNA genes and lowest in the small single copy region where most NADH genes are located. Phylogenetic analyses using maximum parsimony and maximum likelihood methods were performed on DNA sequences of 61 protein-coding genes. Trees from both analyses provided strong support for the monophyly of magnoliids and two strongly supported groups were identified, the Canellales/Piperales and the Laurales/Magnoliales. The phylogenies also provided moderate to strong support for the basal position of Amborella, and a sister relationship of magnoliids to a clade that includes monocots and eudicots. The complete sequences of three magnoliid chloroplast genomes provide new data from the largest basal angiosperm clade. Evolutionary comparisons of these new genome sequences, combined with other published angiosperm genome, confirm that GC content is unevenly distributed across the genome by location, codon position, and functional group. Furthermore, phylogenetic analyses provide the strongest support so far for the hypothesis that the magnoliids are sister to a large clade that includes both monocots and eudicots.« less

  12. Rare Variant Association Test with Multiple Phenotypes

    PubMed Central

    Lee, Selyeong; Won, Sungho; Kim, Young Jin; Kim, Yongkang; Kim, Bong-Jo; Park, Taesung

    2016-01-01

    Although genome-wide association studies (GWAS) have now discovered thousands of genetic variants associated with common traits, such variants cannot explain the large degree of “missing heritability,” likely due to rare variants. The advent of next generation sequencing technology has allowed rare variant detection and association with common traits, often by investigating specific genomic regions for rare variant effects on a trait. Although multiply correlated phenotypes are often concurrently observed in GWAS, most studies analyze only single phenotypes, which may lessen statistical power. To increase power, multivariate analyses, which consider correlations between multiple phenotypes, can be used. However, few existing multi-variant analyses can identify rare variants for assessing multiple phenotypes. Here, we propose Multivariate Association Analysis using Score Statistics (MAAUSS), to identify rare variants associated with multiple phenotypes, based on the widely used Sequence Kernel Association Test (SKAT) for a single phenotype. We applied MAAUSS to Whole Exome Sequencing (WES) data from a Korean population of 1,058 subjects, to discover genes associated with multiple traits of liver function. We then assessed validation of those genes by a replication study, using an independent dataset of 3,445 individuals. Notably, we detected the gene ZNF620 among five significant genes. We then performed a simulation study to compare MAAUSS's performance with existing methods. Overall, MAAUSS successfully conserved type 1 error rates and in many cases, had a higher power than the existing methods. This study illustrates a feasible and straightforward approach for identifying rare variants correlated with multiple phenotypes, with likely relevance to missing heritability. PMID:28039885

  13. Cloning and expression of calmodulin gene in Scoparia dulcis.

    PubMed

    Saitoh, Daisuke; Asakura, Yuki; Nkembo, Marguerite Kasidimoko; Shite, Masato; Sugiyama, Ryuji; Lee, Jung-Bum; Hayashi, Toshimitsu; Kurosaki, Fumiya

    2007-06-01

    A homology-based cloning strategy yielded a cDNA clone, designated Sd-cam, encoding calmodulin protein from Scoparia dulcis. The restriction digests of genomic DNA of S. dulcis showed a single hybridized signal when probed with the fragment of this gene in Southern blot analyses, suggesting that Sd-cam occurs as a sole gene encoding calmodulin in the plant. The reverse-transcription polymerase chain reaction analysis revealed that Sd-cam was appreciably expressed in leaf, root and stem tissues. It appeared that transcription of this gene increased transiently when the leaf cultures of S. dulcis were treated with methyl jasmonate and calcium ionophore A23187. These results suggest that transcriptional activation of Sd-cam is one of the early cellular events of the methyl jasmonate-induced responses of S. dulcis.

  14. Circadian Enhancers Coordinate Multiple Phases of Rhythmic Gene Transcription In Vivo

    PubMed Central

    Fang, Bin; Everett, Logan J.; Jager, Jennifer; Briggs, Erika; Armour, Sean M.; Feng, Dan; Roy, Ankur; Gerhart-Hines, Zachary; Sun, Zheng; Lazar, Mitchell A.

    2014-01-01

    SUMMARY Mammalian transcriptomes display complex circadian rhythms with multiple phases of gene expression that cannot be accounted for by current models of the molecular clock. We have determined the underlying mechanisms by measuring nascent RNA transcription around the clock in mouse liver. Unbiased examination of eRNAs that cluster in specific circadian phases identified functional enhancers driven by distinct transcription factors (TFs). We further identify on a global scale the components of the TF cistromes that function to orchestrate circadian gene expression. Integrated genomic analyses also revealed novel mechanisms by which a single circadian factor controls opposing transcriptional phases. These findings shed new light on the diversity and specificity of TF function in the generation of multiple phases of circadian gene transcription in a mammalian organ. PMID:25416951

  15. T-DNA transfer and integration in the ectomycorrhizal fungus Suillus bovinus using hygromycin B as a selectable marker.

    PubMed

    Hanif, Mubashir; Pardo, Alejandro Guillermo; Gorfer, Markus; Raudaskoski, Marjatta

    2002-06-01

    The T-DNA of Agrobacterium tumefaciens can be transferred to plants, yeasts, fungi and human cells. Using this system, dikaryotic mycelium of the ectomycorrhizal fungus Suillus bovinus was transformed with recombinant hygromycin B phosphotransferase (hph)and enhanced green fluorescent protein (EGFP) genes fused with a heterologous fungal promoter and CaMV35S terminator. Transformation resulted in hygromycin B-resistant clones, which were mitotically stable. Putative transformants were analysed for the presence of hph and EGFP genes by PCR and Southern analysis. The latter analysis proved both multiple- and single-copy integrations of the genes in the S. bovinus genome. A. tumeficiens transformation should make possible the development of tagged mutagenesis and targeted gene disruption technology for S. bovinus.

  16. Circadian enhancers coordinate multiple phases of rhythmic gene transcription in vivo.

    PubMed

    Fang, Bin; Everett, Logan J; Jager, Jennifer; Briggs, Erika; Armour, Sean M; Feng, Dan; Roy, Ankur; Gerhart-Hines, Zachary; Sun, Zheng; Lazar, Mitchell A

    2014-11-20

    Mammalian transcriptomes display complex circadian rhythms with multiple phases of gene expression that cannot be accounted for by current models of the molecular clock. We have determined the underlying mechanisms by measuring nascent RNA transcription around the clock in mouse liver. Unbiased examination of enhancer RNAs (eRNAs) that cluster in specific circadian phases identified functional enhancers driven by distinct transcription factors (TFs). We further identify on a global scale the components of the TF cistromes that function to orchestrate circadian gene expression. Integrated genomic analyses also revealed mechanisms by which a single circadian factor controls opposing transcriptional phases. These findings shed light on the diversity and specificity of TF function in the generation of multiple phases of circadian gene transcription in a mammalian organ.

  17. Comprehensive analysis of alternative splicing and functionality in neuronal differentiation of P19 cells.

    PubMed

    Suzuki, Hitoshi; Osaki, Ken; Sano, Kaori; Alam, A H M Khurshid; Nakamura, Yuichiro; Ishigaki, Yasuhito; Kawahara, Kozo; Tsukahara, Toshifumi

    2011-02-18

    Alternative splicing, which produces multiple mRNAs from a single gene, occurs in most human genes and contributes to protein diversity. Many alternative isoforms are expressed in a spatio-temporal manner, and function in diverse processes, including in the neural system. The purpose of the present study was to comprehensively investigate neural-splicing using P19 cells. GeneChip Exon Array analysis was performed using total RNAs purified from cells during neuronal cell differentiation. To efficiently and readily extract the alternative exon candidates, 9 filtering conditions were prepared, yielding 262 candidate exons (236 genes). Semiquantitative RT-PCR results in 30 randomly selected candidates suggested that 87% of the candidates were differentially alternatively spliced in neuronal cells compared to undifferentiated cells. Gene ontology and pathway analyses suggested that many of the candidate genes were associated with neural events. Together with 66 genes whose functions in neural cells or organs were reported previously, 47 candidate genes were found to be linked to 189 events in the gene-level profile of neural differentiation. By text-mining for the alternative isoform, distinct functions of the isoforms of 9 candidate genes indicated by the result of Exon Array were confirmed. Alternative exons were successfully extracted. Results from the informatics analyses suggested that neural events were primarily governed by genes whose expression was increased and whose transcripts were differentially alternatively spliced in the neuronal cells. In addition to known functions in neural cells or organs, the uninvestigated alternative splicing events of 11 genes among 47 candidate genes suggested that cell cycle events are also potentially important. These genes may help researchers to differentiate the roles of alternative splicing in cell differentiation and cell proliferation.

  18. Pathway-driven gene stability selection of two rheumatoid arthritis GWAS identifies and validates new susceptibility genes in receptor mediated signalling pathways.

    PubMed

    Eleftherohorinou, Hariklia; Hoggart, Clive J; Wright, Victoria J; Levin, Michael; Coin, Lachlan J M

    2011-09-01

    Rheumatoid arthritis (RA) is the commonest chronic, systemic, inflammatory disorder affecting ∼1% of the world population. It has a strong genetic component and a growing number of associated genes have been discovered in genome-wide association studies (GWAS), which nevertheless only account for 23% of the total genetic risk. We aimed to identify additional susceptibility loci through the analysis of GWAS in the context of biological function. We bridge the gap between pathway and gene-oriented analyses of GWAS, by introducing a pathway-driven gene stability-selection methodology that identifies potential causal genes in the top-associated disease pathways that may be driving the pathway association signals. We analysed the WTCCC and the NARAC studies of ∼5000 and ∼2000 subjects, respectively. We examined 700 pathways comprising ∼8000 genes. Ranking pathways by significance revealed that the NARAC top-ranked ∼6% laid within the top 10% of WTCCC. Gene selection on those pathways identified 58 genes in WTCCC and 61 in NARAC; 21 of those were common (P(overlap)< 10(-21)), of which 16 were novel discoveries. Among the identified genes, we validated 10 known RA associations in WTCCC and 13 in NARAC, not discovered using single-SNP approaches on the same data. Gene ontology functional enrichment analysis on the identified genes showed significant over-representation of signalling activity (P< 10(-29)) in both studies. Our findings suggest a novel model of RA genetic predisposition, which involves cell-membrane receptors and genes in second messenger signalling systems, in addition to genes that regulate immune responses, which have been the focus of interest previously.

  19. Genetic variation may explain why females are less susceptible to dental erosion.

    PubMed

    Uhlen, Marte-Mari; Stenhagen, Kjersti R; Dizak, Piper M; Holme, Børge; Mulic, Aida; Tveit, Anne B; Vieira, Alexandre R

    2016-10-01

    Not all individuals at risk for dental erosion (DE) display erosive lesions. The prevalence of DE is higher among male subjects. The occurrence of DE may depend on more than just acidic challenge, with genetics possibly playing a role. The aim of this study was to investigate the association of enamel-formation genes with DE. One premolar and a saliva sample were collected from 90 individuals. Prepared teeth were immersed in 0.01 M HCl (pH 2.2), and enamel loss (μm) was measured using white light interferometry. DNA was extracted from saliva, and 15 single-nucleotide polymorphisms were analysed. Allele and genotype frequencies were related to the enamel loss of the specimens. Single-marker and haplotype analyses were performed using sex as a covariate. Mean enamel loss was higher for male donors than for female donors (P = 0.047). Significant associations were found between enamel loss and amelogenin, X-linked (AMELX), tuftelin 1 (TUFT1), and tuftelin-interacting protein 11 (TFIP11). Analyses showed significant associations between variation in enamel-formation genes and a lower susceptibility to DE in female subjects. The results indicate that susceptibility to DE is influenced by genetic variation, and may, in part, explain why some individuals are more susceptible than others to DE, including differences between female subjects and male subjects. © 2016 Eur J Oral Sci.

  20. Epiregulin (EREG) and human V-ATPase (TCIRG1): genetic variation, ethnicity and pulmonary tuberculosis susceptibility in Guinea-Bissau and The Gambia

    PubMed Central

    White, Marquitta J.; Tacconelli, Alessandra; Chen, Jane S.; Wejse, Christian; Hill, Philip C.; Gomez, Victor F; Velez-Edwards, Digna R.; Østergaard, Lars J.; Hu, Ting; Moore, Jason H.; Novelli, Giuseppe; Scott, William K.; Williams, Scott M.; Sirugo, Giorgio

    2017-01-01

    We analyzed two West African samples (Guinea-Bissau: n = 289 cases, 322 controls; The Gambia: n = 240 cases, 248 controls) to evaluate single nucleotide polymorphisms (SNPs) in Epiregulin (EREG) and V-ATPase (T cell immune regulator 1, TCIRG1) using single and multi-locus analyses to determine whether previously described associations with pulmonary tuberculosis (PTB) in Vietnamese and Italians would replicate in African populations. We did not detect any significant single locus or haplotype associations in either sample. We also performed exploratory pairwise interaction analyses using Visualization of Statistical Epistasis Networks (ViSEN), a novel method to detect only interactions among multiple variables, to elucidate possible interaction effects between SNPs and demographic factors. Although we found no strong evidence of marginal effects, there were several significant pairwise interactions that were identified in either the Guinea-Bissau or The Gambia samples, two of which replicated across populations. Our results indicate that the effects of EREG and TCIRG1 variants on PTB susceptibility, to the extent that they exist, are dependent on gene-gene interactions in West African populations as detected with ViSEN. In addition, epistatic effects are likely to be influenced by inter- and intra-population differences in genetic or environmental context and/or the mycobacterial lineages causing disease. PMID:24898387

  1. Multiple displacement amplification on single cell and possible PGD applications.

    PubMed

    Hellani, Ali; Coskun, Serdar; Benkhalifa, Moncef; Tbakhi, Abelghani; Sakati, Nadia; Al-Odaib, Ali; Ozand, Pinar

    2004-11-01

    Multiple displacement amplification (MDA) is a technique used in the amplification of very low amounts of DNA and reported to yield large quantities of high-quality DNA. We used MDA to amplify the whole genome directly from a single cell. The most common techniques used in PGD are PCR and fluorescent in-situ hybridization (FISH). There are many limitations to these techniques including, the number of chromosomes diagnosed for FISH or the quality of DNA issued from a single cell PCR. This report shows, for the first time, use of MDA for single cell whole genome amplification. A total of 16 short tandem repeats (STRs) were amplified successfully with a similar pattern to the genomic DNA. Furthermore, allelic drop out (ADO) derived from MDA was assessed in 40 single cells by analysing (i) heterozygosity for a known beta globin mutation (IVSI-5 C-G) and by studying (ii) the heterozygous loci present in the STRs. ADO turned out to be 10.25% for the beta globin gene sequencing and 5% for the fluorescent PCR analysis of STRs. Moreover, the amplification accuracy of MDA permitted the detection of trisomy 21 on a single cell using comparative genome hybridization-array. Altogether, these data suggest that MDA can be used for single cell molecular karyotyping and the diagnosis of any single gene disorder in PGD.

  2. Methods to increase reproducibility in differential gene expression via meta-analysis

    PubMed Central

    Sweeney, Timothy E.; Haynes, Winston A.; Vallania, Francesco; Ioannidis, John P.; Khatri, Purvesh

    2017-01-01

    Findings from clinical and biological studies are often not reproducible when tested in independent cohorts. Due to the testing of a large number of hypotheses and relatively small sample sizes, results from whole-genome expression studies in particular are often not reproducible. Compared to single-study analysis, gene expression meta-analysis can improve reproducibility by integrating data from multiple studies. However, there are multiple choices in designing and carrying out a meta-analysis. Yet, clear guidelines on best practices are scarce. Here, we hypothesized that studying subsets of very large meta-analyses would allow for systematic identification of best practices to improve reproducibility. We therefore constructed three very large gene expression meta-analyses from clinical samples, and then examined meta-analyses of subsets of the datasets (all combinations of datasets with up to N/2 samples and K/2 datasets) compared to a ‘silver standard’ of differentially expressed genes found in the entire cohort. We tested three random-effects meta-analysis models using this procedure. We showed relatively greater reproducibility with more-stringent effect size thresholds with relaxed significance thresholds; relatively lower reproducibility when imposing extraneous constraints on residual heterogeneity; and an underestimation of actual false positive rate by Benjamini–Hochberg correction. In addition, multivariate regression showed that the accuracy of a meta-analysis increased significantly with more included datasets even when controlling for sample size. PMID:27634930

  3. A cryptochrome-like protein is involved in the regulation of photosynthesis genes in Rhodobacter sphaeroides.

    PubMed

    Hendrischk, Anne-Kathrin; Frühwirth, Sebastian Walter; Moldt, Julia; Pokorny, Richard; Metz, Sebastian; Kaiser, Gebhard; Jäger, Andreas; Batschauer, Alfred; Klug, Gabriele

    2009-11-01

    Blue light receptors belonging to the cryptochrome/photolyase family are found in all kingdoms of life. The functions of photolyases in repair of UV-damaged DNA as well as of cryptochromes in the light-dependent regulation of photomorphogenetic processes and in the circadian clock in plants and animals are well analysed. In prokaryotes, the only role of members of this protein family that could be demonstrated is DNA repair. Recently, we identified a gene for a cryptochrome-like protein (CryB) in the alpha-proteobacterium Rhodobacter sphaeroides. The protein lacks the typical C-terminal extension of cryptochromes, and is not related to the Cry DASH family. Here we demonstrate that CryB binds flavin adenine dinucleotide that can be photoreduced by blue light. CryB binds single-stranded DNA with very high affinity (K(d) approximately 10(-8) M) but double-stranded DNA and single-stranded RNA with far lower affinity (K(d) approximately 10(-6) M). Despite of that, no in vitro repair activity for pyrimidine dimers in single-stranded DNA could be detected. However, we show that CryB clearly affects the expression of genes for pigment-binding proteins and consequently the amount of photosynthetic complexes in R. sphaeroides. Thus, for the first time a role of a bacterial cryptochrome in gene regulation together with a biological function is demonstrated.

  4. Fine mapping of a QTL on chromosome 13 for submaximal exercise capacity training response: the HERITAGE Family Study.

    PubMed

    Rice, Treva K; Sarzynski, Mark A; Sung, Yun Ju; Argyropoulos, George; Stütz, Adrian M; Teran-Garcia, Margarita; Rao, D C; Bouchard, Claude; Rankinen, Tuomo

    2012-08-01

    Although regular exercise improves submaximal aerobic capacity, there is large variability in its response to exercise training. While this variation is thought to be partly due to genetic differences, relatively little is known about the causal genes. Submaximal aerobic capacity traits in the current report include the responses of oxygen consumption (ΔVO(2)60), power output (ΔWORK60), and cardiac output (ΔQ60) at 60% of VO2max to a standardized 20-week endurance exercise training program. Genome-wide linkage analysis in 475 HERITAGE Family Study Caucasians identified a locus on chromosome 13q for ΔVO(2)60 (LOD = 3.11). Follow-up fine mapping involved a dense marker panel of over 1,800 single-nucleotide polymorphisms (SNPs) in a 7.9-Mb region (21.1-29.1 Mb from p-terminus). Single-SNP analyses found 14 SNPs moderately associated with both ΔVO(2)60 at P ≤ 0.005 and the correlated traits of ΔWORK60 and ΔQ60 at P < 0.05. Haplotype analyses provided several strong signals (P < 1.0 × 10(-5)) for ΔVO(2)60. Overall, association analyses narrowed the target region and included potential biological candidate genes (MIPEP and SGCG). Consistent with maximal heritability estimates of 23%, up to 20% of the phenotypic variance in ΔVO(2)60 was accounted for by these SNPs. These results implicate candidate genes on chromosome 13q12 for the ability to improve submaximal exercise capacity in response to regular exercise. Submaximal exercise at 60% of maximal capacity is an exercise intensity that falls well within the range recommended in the Physical Activity Guidelines for Americans and thus has potential public health relevance.

  5. Fine mapping of a QTL on chromosome 13 for submaximal exercise capacity training response: the HERITAGE Family Study

    PubMed Central

    Rice, Treva K.; Sarzynski, Mark A.; Sung, Yun Ju; Argyropoulos, George; Stütz, Adrian M.; Teran-Garcia, Margarita; Rao, D. C.; Bouchard, Claude

    2014-01-01

    Although regular exercise improves submaximal aerobic capacity, there is large variability in its response to exercise training. While this variation is thought to be partly due to genetic differences, relatively little is known about the causal genes. Submaximal aerobic capacity traits in the current report include the responses of oxygen consumption (ΔVO260), power output (ΔWORK60), and cardiac output (ΔQ60) at 60% of VO2max to a standardized 20-week endurance exercise training program. Genome-wide linkage analysis in 475 HERITAGE Family Study Caucasians identified a locus on chromosome 13q for ΔVO260 (LOD = 3.11). Follow-up fine mapping involved a dense marker panel of over 1,800 single-nucleotide polymorphisms (SNPs) in a 7.9-Mb region (21.1–29.1 Mb from p-terminus). Single-SNP analyses found 14 SNPs moderately associated with both ΔVO260 at P ≤ 0.005 and the correlated traits of ΔWORK60 and ΔQ60 at P < 0.05. Haplotype analyses provided several strong signals (P<1.0 × 10−5) for ΔVO260. Overall, association analyses narrowed the target region and included potential biological candidate genes (MIPEP and SGCG). Consistent with maximal heritability estimates of 23%, up to 20% of the phenotypic variance in ΔVO260 was accounted for by these SNPs. These results implicate candidate genes on chromosome 13q12 for the ability to improve submaximal exercise capacity in response to regular exercise. Submaximal exercise at 60% of maximal capacity is an exercise intensity that falls well within the range recommended in the Physical Activity Guidelines for Americans and thus has potential public health relevance. PMID:22170014

  6. The computational core and fixed point organization in Boolean networks

    NASA Astrophysics Data System (ADS)

    Correale, L.; Leone, M.; Pagnani, A.; Weigt, M.; Zecchina, R.

    2006-03-01

    In this paper, we analyse large random Boolean networks in terms of a constraint satisfaction problem. We first develop an algorithmic scheme which allows us to prune simple logical cascades and underdetermined variables, returning thereby the computational core of the network. Second, we apply the cavity method to analyse the number and organization of fixed points. We find in particular a phase transition between an easy and a complex regulatory phase, the latter being characterized by the existence of an exponential number of macroscopically separated fixed point clusters. The different techniques developed are reinterpreted as algorithms for the analysis of single Boolean networks, and they are applied in the analysis of and in silico experiments on the gene regulatory networks of baker's yeast (Saccharomyces cerevisiae) and the segment-polarity genes of the fruitfly Drosophila melanogaster.

  7. Genetic factors controlling wool shedding in a composite Easycare sheep flock.

    PubMed

    Matika, O; Bishop, S C; Pong-Wong, R; Riggio, V; Headon, D J

    2013-12-01

    Historically, sheep have been selectively bred for desirable traits including wool characteristics. However, recent moves towards extensive farming and reduced farm labour have seen a renewed interest in Easycare breeds. The aim of this study was to quantify the underlying genetic architecture of wool shedding in an Easycare flock. Wool shedding scores were collected from 565 pedigreed commercial Easycare sheep from 2002 to 2010. The wool scoring system was based on a 10-point (0-9) scale, with score 0 for animals retaining full fleece and 9 for those completely shedding. DNA was sampled from 200 animals of which 48 with extreme phenotypes were genotyped using a 50-k SNP chip. Three genetic analyses were performed: heritability analysis, complex segregation analysis to test for a major gene hypothesis and a genome-wide association study to map regions in the genome affecting the trait. Phenotypes were treated as a continuous or binary variable and categories. High estimates of heritability (0.80 when treated as a continuous, 0.65-0.75 as binary and 0.75 as categories) for shedding were obtained from linear mixed model analyses. Complex segregation analysis gave similar estimates (0.80 ± 0.06) to those above with additional evidence for a major gene with dominance effects. Mixed model association analyses identified four significant (P < 0.05) SNPs. Further analyses of these four SNPs in all 200 animals revealed that one of the SNPs displayed dominance effects similar to those obtained from the complex segregation analyses. In summary, we found strong genetic control for wool shedding, demonstrated the possibility of a single putative dominant gene controlling this trait and identified four SNPs that may be in partial linkage disequilibrium with gene(s) controlling shedding. © 2013 University of Edinburgh, Animal Genetics © 2013 Stichting International Foundation for Animal Genetics.

  8. Quantitative DNA Methylation Profiling in Cancer.

    PubMed

    Ammerpohl, Ole; Haake, Andrea; Kolarova, Julia; Siebert, Reiner

    2016-01-01

    Epigenetic mechanisms including DNA methylation are fundamental for the regulation of gene expression. Epigenetic alterations can lead to the development and the evolution of malignant tumors as well as the emergence of phenotypically different cancer cells or metastasis from one single tumor cell. Here we describe bisulfite pyrosequencing, a technology to perform quantitative DNA methylation analyses, to detect aberrant DNA methylation in malignant tumors.

  9. Genome-wide association studies and epistasis analyses of candidate genes related to age at menarche and age at natural menopause in a Korean population.

    PubMed

    Pyun, Jung-A; Kim, Sunshin; Cho, Nam H; Koh, InSong; Lee, Jong-Young; Shin, Chol; Kwack, KyuBum

    2014-05-01

    The aim of this study was to identify polymorphisms and gene-gene interactions that are significantly associated with age at menarche and age at menopause in a Korean population. A total of 3,452 and 1,827 women participated in studies of age at menarche and age at natural menopause, respectively. Linear regression analyses adjusted for residence area were used to perform genome-wide association studies (GWAS), candidate gene association studies, and interactions between the candidate genes for age at menarche and age at natural menopause. In GWAS, four single nucleotide polymorphisms (SNPs; rs7528241, rs1324329, rs11597068, and rs6495785) were strongly associated with age at natural menopause (lowest P = 9.66 × 10). However, GWAS of age at menarche did not reveal any strong associations. In candidate gene association studies, SNPs with P < 0.01 were selected to test their synergistic interactions. For age at natural menopause, there was a significant interaction between intronic SNPs on ADAM metallopeptidase with thrombospondin type I motif 9 (ADAMTS9) and SMAD family member 3 (SMAD3) genes (P = 9.52 × 10). For age at menarche, there were three significant interactions between three intronic SNPs on follicle-stimulating hormone receptor (FSHR) gene and one SNP located at the 3' flanking region of insulin-like growth factor 2 receptor (IGF2R) gene (lowest P = 1.95 × 10). Novel SNPs and synergistic interactions between candidate genes are significantly associated with age at menarche and age at natural menopause in a Korean population.

  10. Phylogenetic relationships and species circumscription in Trentepohlia and Printzina (Trentepohliales, Chlorophyta).

    PubMed

    Rindi, Fabio; Lam, Daryl W; López-Bautista, Juan M

    2009-08-01

    Subaerial green microalgae represent a polyphyletic complex of organisms, whose genetic diversity is much higher than their simple morphologies suggest. The order Trentepohliales is the only species-rich group of subaerial algae belonging to the class Ulvophyceae and represents an ideal model taxon to investigate evolutionary patterns of these organisms. We studied phylogenetic relationships in two common genera of Trentepohliales (Trentepohlia and Printzina) by separate and combined analyses of the rbcL and 18S rRNA genes. Trentepohlia and Printzina were not resolved as monophyletic groups. Three main clades were recovered in all analyses, but none corresponded to any trentepohlialean genus as defined based on morphological grounds. The rbcL and 18S rRNA datasets provided congruent phylogenetic signals and similar topologies were recovered in single-gene analyses. Analyses performed on the combined 2-gene dataset inferred generally higher nodal support. The results clarified several taxonomic problems and showed that the evolution of these algae has been characterized by considerable morphological convergence. Trentepohlia abietina and T. flava were shown to be separate species from T. aurea; Printzina lagenifera, T. arborum and T. umbrina were resolved as polyphyletic taxa, whose vegetative morphology appears to have evolved independently in separate lineages. Incongruence between phylogenetic relationships and traditional morphological classification was demonstrated, showing that the morphological characters commonly used in the taxonomy of the Trentepohliales are phylogenetically irrelevant.

  11. Genome-Wide Linkage and Association Analysis Identifies Major Gene Loci for Guttural Pouch Tympany in Arabian and German Warmblood Horses

    PubMed Central

    Metzger, Julia; Ohnesorge, Bernhard; Distl, Ottmar

    2012-01-01

    Equine guttural pouch tympany (GPT) is a hereditary condition affecting foals in their first months of life. Complex segregation analyses in Arabian and German warmblood horses showed the involvement of a major gene as very likely. Genome-wide linkage and association analyses including a high density marker set of single nucleotide polymorphisms (SNPs) were performed to map the genomic region harbouring the potential major gene for GPT. A total of 85 Arabian and 373 German warmblood horses were genotyped on the Illumina equine SNP50 beadchip. Non-parametric multipoint linkage analyses showed genome-wide significance on horse chromosomes (ECA) 3 for German warmblood at 16–26 Mb and 34–55 Mb and for Arabian on ECA15 at 64–65 Mb. Genome-wide association analyses confirmed the linked regions for both breeds. In Arabian, genome-wide association was detected at 64 Mb within the region with the highest linkage peak on ECA15. For German warmblood, signals for genome-wide association were close to the peak region of linkage at 52 Mb on ECA3. The odds ratio for the SNP with the highest genome-wide association was 0.12 for the Arabian. In conclusion, the refinement of the regions with the Illumina equine SNP50 beadchip is an important step to unravel the responsible mutations for GPT. PMID:22848553

  12. Three alpha-subunits of heterotrimeric G proteins and an adenylyl cyclase have distinct roles in fruiting body development in the homothallic fungus Sordaria macrospora.

    PubMed

    Kamerewerd, Jens; Jansson, Malin; Nowrousian, Minou; Pöggeler, Stefanie; Kück, Ulrich

    2008-09-01

    Sordaria macrospora, a self-fertile filamentous ascomycete, carries genes encoding three different alpha-subunits of heterotrimeric G proteins (gsa, G protein Sordaria alpha subunit). We generated knockout strains for all three gsa genes (Deltagsa1, Deltagsa2, and Deltagsa3) as well as all combinations of double mutants. Phenotypic analysis of single and double mutants showed that the genes for Galpha-subunits have distinct roles in the sexual life cycle. While single mutants show some reduction of fertility, double mutants Deltagsa1Deltagsa2 and Deltagsa1Deltagsa3 are completely sterile. To test whether the pheromone receptors PRE1 and PRE2 mediate signaling via distinct Galpha-subunits, two recently generated Deltapre strains were crossed with all Deltagsa strains. Analyses of the corresponding double mutants revealed that compared to GSA2, GSA1 is a more predominant regulator of a signal transduction cascade downstream of the pheromone receptors and that GSA3 is involved in another signaling pathway that also contributes to fruiting body development and fertility. We further isolated the gene encoding adenylyl cyclase (AC) (sac1) for construction of a knockout strain. Analyses of the three DeltagsaDeltasac1 double mutants and one Deltagsa2Deltagsa3Deltasac1 triple mutant indicate that SAC1 acts downstream of GSA3, parallel to a GSA1-GSA2-mediated signaling pathway. In addition, the function of STE12 and PRO41, two presumptive signaling components, was investigated in diverse double mutants lacking those developmental genes in combination with the gsa genes. This analysis was further completed by expression studies of the ste12 and pro41 transcripts in wild-type and mutant strains. From the sum of all our data, we propose a model for how different Galpha-subunits interact with pheromone receptors, adenylyl cyclase, and STE12 and thus cooperatively regulate sexual development in S. macrospora.

  13. Associations of Renin-Angiotensin-Aldosterone System Genes With Blood Pressure Changes and Hypertension Incidence.

    PubMed

    He, William J; Li, Changwei; Rao, Dabeeru C; Hixson, James E; Huang, Jianfeng; Cao, Jie; Rice, Treva K; Shimmin, Lawrence C; Gu, Dongfeng; Kelly, Tanika N

    2015-11-01

    The renin-angiotensin-aldosterone system (RAAS) plays an important role in blood pressure (BP) regulation. The current study uses single-marker and gene-based analyses to examine the association between RAAS genes and longitudinal BP phenotypes in a Han Chinese population. A total of 1,768 participants from the Genetic Epidemiology Network of Salt Sensitivity (GenSalt) follow-up study were included in the current study. Twenty-seven BP measurements were taken using random-zero sphygmomanometers at baseline and 2 follow-up visits. Mixed-effect models were used to assess the additive associations of 106 single-nucleotide polymorphisms (SNPs) in 10 RAAS genes with longitudinal BP changes and hypertension incidence. Gene-based analyses were conducted using the truncated product method. Attempts were made to replicate significant findings among Asian participants of the Multi-ethnic Study of Atherosclerosis (MESA). False discovery rate procedures were used to adjust for multiple testing. During an average of 7.2 years of follow-up, average systolic and diastolic BP increased, and 32.1% (512) of participants free from hypertension at baseline developed hypertension. NR3C2 SNPs rs7694064 and rs6856803 were significantly associated with longitudinal changes in systolic BP (P interaction = 6.9×10(-5) and 8.2×10(-4), respectively). Through gene-based analysis, NR3C2 was found to be significantly associated with longitudinal systolic BP change (P value of 1.00×10(-7)), even after removal of significant markers rs7694064 and rs6856803 from the analysis. The association between NR3C2 and longitudinal systolic BP change was replicated in Asian MESA participants (P value of 1.00×10(-4)). These findings indicate that NR3C2 may play an important role in BP progression and development of hypertension. © American Journal of Hypertension, Ltd 2015. All rights reserved. For Permissions, please email: journals.permissions@oup.com.

  14. Exception to the Rule: Genomic Characterization of Naturally Occurring Unusual Vibrio cholerae Strains with a Single Chromosome

    DOE PAGES

    Xie, Gary; Johnson, Shannon Lyn; Davenport, Karen Walston; ...

    2017-08-29

    Here, the genetic make-up of most bacteria is encoded in a single chromosome while about 10% have more than one chromosome. Among these, Vibrio cholerae, with two chromosomes, has served as a model system to study various aspects of chromosome maintenance, mainly replication, and faithful partitioning of multipartite genomes. Here, we describe the genomic characterization of strains that are an exception to the two chromosome rules: naturally occurring single-chromosome V. cholerae. Whole genome sequence analyses of NSCV1 and NSCV2 (natural single-chromosome vibrio) revealed that the Chr1 and Chr2 fusion junctions contain prophages, IS elements, and direct repeats, in addition tomore » large-scale chromosomal rearrangements such as inversions, insertions, and long tandem repeats elsewhere in the chromosome compared to prototypical two chromosome V. cholerae genomes. Many of the known cholera virulence factors are absent. The two origins of replication and associated genes are generally intact with synonymous mutations in some genes, as arerecAand mismatch repair (MMR) genes dam, mutH, and mutL; MutS function is probably impaired in NSCV2. These strains are ideal tools for studying mechanistic aspects of maintenance of chromosomes with multiple origins and other rearrangements and the biological, functional, and evolutionary significance of multipartite genome architecture in general.« less

  15. Exception to the Rule: Genomic Characterization of Naturally Occurring Unusual Vibrio cholerae Strains with a Single Chromosome

    DOE Office of Scientific and Technical Information (OSTI.GOV)

    Xie, Gary; Johnson, Shannon Lyn; Davenport, Karen Walston

    Here, the genetic make-up of most bacteria is encoded in a single chromosome while about 10% have more than one chromosome. Among these, Vibrio cholerae, with two chromosomes, has served as a model system to study various aspects of chromosome maintenance, mainly replication, and faithful partitioning of multipartite genomes. Here, we describe the genomic characterization of strains that are an exception to the two chromosome rules: naturally occurring single-chromosome V. cholerae. Whole genome sequence analyses of NSCV1 and NSCV2 (natural single-chromosome vibrio) revealed that the Chr1 and Chr2 fusion junctions contain prophages, IS elements, and direct repeats, in addition tomore » large-scale chromosomal rearrangements such as inversions, insertions, and long tandem repeats elsewhere in the chromosome compared to prototypical two chromosome V. cholerae genomes. Many of the known cholera virulence factors are absent. The two origins of replication and associated genes are generally intact with synonymous mutations in some genes, as arerecAand mismatch repair (MMR) genes dam, mutH, and mutL; MutS function is probably impaired in NSCV2. These strains are ideal tools for studying mechanistic aspects of maintenance of chromosomes with multiple origins and other rearrangements and the biological, functional, and evolutionary significance of multipartite genome architecture in general.« less

  16. Erwinia amylovora Expresses Fast and Simultaneously hrp/dsp Virulence Genes during Flower Infection on Apple Trees

    PubMed Central

    Pester, Doris; Milčevičová, Renáta; Schaffer, Johann; Wilhelm, Eva; Blümel, Sylvia

    2012-01-01

    Background Pathogen entry through host blossoms is the predominant infection pathway of the Gram-negative bacterium Erwinia amylovora leading to manifestation of the disease fire blight. Like in other economically important plant pathogens, E. amylovora pathogenicity depends on a type III secretion system encoded by hrp genes. However, timing and transcriptional order of hrp gene expression during flower infections are unknown. Methodology/Principal Findings Using quantitative real-time PCR analyses, we addressed the questions of how fast, strong and uniform key hrp virulence genes and the effector dspA/E are expressed when bacteria enter flowers provided with the full defense mechanism of the apple plant. In non-invasive bacterial inoculations of apple flowers still attached to the tree, E. amylovora activated expression of key type III secretion genes in a narrow time window, mounting in a single expression peak of all investigated hrp/dspA/E genes around 24–48 h post inoculation (hpi). This single expression peak coincided with a single depression in the plant PR-1 expression at 24 hpi indicating transient manipulation of the salicylic acid pathway as one target of E. amylovora type III effectors. Expression of hrp/dspA/E genes was highly correlated to expression of the regulator hrpL and relative transcript abundances followed the ratio: hrpA>hrpN>hrpL>dspA/E. Acidic conditions (pH 4) in flower infections led to reduced virulence/effector gene expression without the typical expression peak observed under natural conditions (pH 7). Conclusion/Significance The simultaneous expression of hrpL, hrpA, hrpN, and the effector dspA/E during early floral infection indicates that speed and immediate effector transmission is important for successful plant invasion. When this delicate balance is disturbed, e.g., by acidic pH during infection, virulence gene expression is reduced, thus partly explaining the efficacy of acidification in fire blight control on a molecular level. PMID:22412891

  17. Targeted Changes of the Cell Wall Proteome Influence Candida albicans Ability to Form Single- and Multi-strain Biofilms

    PubMed Central

    Walker, Louise A.; Martin-Yken, Hélène; Dague, Etienne; Legrand, Mélanie; Lee, Keunsook; Chauvel, Murielle; Firon, Arnaud; Rossignol, Tristan; Richard, Mathias L.; Munro, Carol A.; Bachellier-Bassi, Sophie; d'Enfert, Christophe

    2014-01-01

    Biofilm formation is an important virulence trait of the pathogenic yeast Candida albicans. We have combined gene overexpression, strain barcoding and microarray profiling to screen a library of 531 C. albicans conditional overexpression strains (∼10% of the genome) for genes affecting biofilm development in mixed-population experiments. The overexpression of 16 genes increased strain occupancy within a multi-strain biofilm, whereas overexpression of 4 genes decreased it. The set of 16 genes was significantly enriched for those encoding predicted glycosylphosphatidylinositol (GPI)-modified proteins, namely Ihd1/Pga36, Phr2, Pga15, Pga19, Pga22, Pga32, Pga37, Pga42 and Pga59; eight of which have been classified as pathogen-specific. Validation experiments using either individually- or competitively-grown overexpression strains revealed that the contribution of these genes to biofilm formation was variable and stage-specific. Deeper functional analysis of PGA59 and PGA22 at a single-cell resolution using atomic force microscopy showed that overexpression of either gene increased C. albicans ability to adhere to an abiotic substrate. However, unlike PGA59, PGA22 overexpression led to cell cluster formation that resulted in increased sensitivity to shear forces and decreased ability to form a single-strain biofilm. Within the multi-strain environment provided by the PGA22-non overexpressing cells, PGA22-overexpressing cells were protected from shear forces and fitter for biofilm development. Ultrastructural analysis, genome-wide transcript profiling and phenotypic analyses in a heterologous context suggested that PGA22 affects cell adherence through alteration of cell wall structure and/or function. Taken together, our findings reveal that several novel predicted GPI-modified proteins contribute to the cooperative behaviour between biofilm cells and are important participants during C. albicans biofilm formation. Moreover, they illustrate the power of using signature tagging in conjunction with gene overexpression for the identification of novel genes involved in processes pertaining to C. albicans virulence. PMID:25502890

  18. CheckM: assessing the quality of microbial genomes recovered from isolates, single cells, and metagenomes

    PubMed Central

    Parks, Donovan H.; Imelfort, Michael; Skennerton, Connor T.; Hugenholtz, Philip; Tyson, Gene W.

    2015-01-01

    Large-scale recovery of genomes from isolates, single cells, and metagenomic data has been made possible by advances in computational methods and substantial reductions in sequencing costs. Although this increasing breadth of draft genomes is providing key information regarding the evolutionary and functional diversity of microbial life, it has become impractical to finish all available reference genomes. Making robust biological inferences from draft genomes requires accurate estimates of their completeness and contamination. Current methods for assessing genome quality are ad hoc and generally make use of a limited number of “marker” genes conserved across all bacterial or archaeal genomes. Here we introduce CheckM, an automated method for assessing the quality of a genome using a broader set of marker genes specific to the position of a genome within a reference genome tree and information about the collocation of these genes. We demonstrate the effectiveness of CheckM using synthetic data and a wide range of isolate-, single-cell-, and metagenome-derived genomes. CheckM is shown to provide accurate estimates of genome completeness and contamination and to outperform existing approaches. Using CheckM, we identify a diverse range of errors currently impacting publicly available isolate genomes and demonstrate that genomes obtained from single cells and metagenomic data vary substantially in quality. In order to facilitate the use of draft genomes, we propose an objective measure of genome quality that can be used to select genomes suitable for specific gene- and genome-centric analyses of microbial communities. PMID:25977477

  19. CheckM: assessing the quality of microbial genomes recovered from isolates, single cells, and metagenomes.

    PubMed

    Parks, Donovan H; Imelfort, Michael; Skennerton, Connor T; Hugenholtz, Philip; Tyson, Gene W

    2015-07-01

    Large-scale recovery of genomes from isolates, single cells, and metagenomic data has been made possible by advances in computational methods and substantial reductions in sequencing costs. Although this increasing breadth of draft genomes is providing key information regarding the evolutionary and functional diversity of microbial life, it has become impractical to finish all available reference genomes. Making robust biological inferences from draft genomes requires accurate estimates of their completeness and contamination. Current methods for assessing genome quality are ad hoc and generally make use of a limited number of "marker" genes conserved across all bacterial or archaeal genomes. Here we introduce CheckM, an automated method for assessing the quality of a genome using a broader set of marker genes specific to the position of a genome within a reference genome tree and information about the collocation of these genes. We demonstrate the effectiveness of CheckM using synthetic data and a wide range of isolate-, single-cell-, and metagenome-derived genomes. CheckM is shown to provide accurate estimates of genome completeness and contamination and to outperform existing approaches. Using CheckM, we identify a diverse range of errors currently impacting publicly available isolate genomes and demonstrate that genomes obtained from single cells and metagenomic data vary substantially in quality. In order to facilitate the use of draft genomes, we propose an objective measure of genome quality that can be used to select genomes suitable for specific gene- and genome-centric analyses of microbial communities. © 2015 Parks et al.; Published by Cold Spring Harbor Laboratory Press.

  20. Pathological mechanisms underlying single large‐scale mitochondrial DNA deletions

    PubMed Central

    Rocha, Mariana C.; Rosa, Hannah S.; Grady, John P.; Blakely, Emma L.; He, Langping; Romain, Nadine; Haller, Ronald G.; Newman, Jane; McFarland, Robert; Ng, Yi Shiau; Gorman, Grainne S.; Schaefer, Andrew M.; Tuppen, Helen A.; Taylor, Robert W.

    2018-01-01

    Objective Single, large‐scale deletions in mitochondrial DNA (mtDNA) are a common cause of mitochondrial disease. This study aimed to investigate the relationship between the genetic defect and molecular phenotype to improve understanding of pathogenic mechanisms associated with single, large‐scale mtDNA deletions in skeletal muscle. Methods We investigated 23 muscle biopsies taken from adult patients (6 males/17 females with a mean age of 43 years) with characterized single, large‐scale mtDNA deletions. Mitochondrial respiratory chain deficiency in skeletal muscle biopsies was quantified by immunoreactivity levels for complex I and complex IV proteins. Single muscle fibers with varying degrees of deficiency were selected from 6 patient biopsies for determination of mtDNA deletion level and copy number by quantitative polymerase chain reaction. Results We have defined 3 “classes” of single, large‐scale deletion with distinct patterns of mitochondrial deficiency, determined by the size and location of the deletion. Single fiber analyses showed that fibers with greater respiratory chain deficiency harbored higher levels of mtDNA deletion with an increase in total mtDNA copy number. For the first time, we have demonstrated that threshold levels for complex I and complex IV deficiency differ based on deletion class. Interpretation Combining genetic and immunofluorescent assays, we conclude that thresholds for complex I and complex IV deficiency are modulated by the deletion of complex‐specific protein‐encoding genes. Furthermore, removal of mt‐tRNA genes impacts specific complexes only at high deletion levels, when complex‐specific protein‐encoding genes remain. These novel findings provide valuable insight into the pathogenic mechanisms associated with these mutations. Ann Neurol 2018;83:115–130 PMID:29283441

  1. Chromatin structure and methylation of rat rRNA genes studied by formaldehyde fixation and psoralen cross-linking.

    PubMed Central

    Stancheva, I; Lucchini, R; Koller, T; Sogo, J M

    1997-01-01

    By using formaldehyde cross-linking of histones to DNA and gel retardation assays we show that formaldehyde fixation, similar to previously established psoralen photocross-linking, discriminates between nucleosome- packed (inactive) and nucleosome-free (active) fractions of ribosomal RNA genes. By both cross-linking techniques we were able to purify fragments from agarose gels, corresponding to coding, enhancer and promoter sequences of rRNA genes, which were further investigated with respect to DNA methylation. This approach allows us to analyse independently and in detail methylation patterns of active and inactive rRNA gene copies by the combination of Hpa II and Msp I restriction enzymes. We found CpG methylation mainly present in enhancer and promoter regions of inactive rRNA gene copies. The methylation of one single Hpa II site, located in the promoter region, showed particularly strong correlation with the transcriptional activity. PMID:9108154

  2. Ascorbate peroxidase-related (APx-R) is not a duplicable gene.

    PubMed

    Dunand, Christophe; Mathé, Catherine; Lazzarotto, Fernanda; Margis, Rogério; Margis-Pinheiro, Marcia

    2011-12-01

    Phylogenetic, genomic and functional analyses have allowed the identification of a new class of putative heme peroxidases, so called APx-R (APx-Related). These new class, mainly present in the green lineage (including green algae and land plants), can also be detected in other unicellular chloroplastic organisms. Except for recent polyploid organisms, only single-copy of APx-R gene was detected in each genome, suggesting that the majority of the APx-R extra-copies were lost after chromosomal or segmental duplications. In a similar way, most APx-R co-expressed genes in Arabidopsis genome do not have conserved extra-copies after chromosomal duplications and are predicted to be localized in organelles, as are the APx-R. The member of this gene network can be considered as unique gene, well conserved through the evolution due to a strong negative selection pressure and a low evolution rate. © 2011 Landes Bioscience

  3. Multilocus phylogeographic assessment of the California Mountain Kingsnake (Lampropeltis zonata) suggests alternative patterns of diversification for the California Floristic Province.

    PubMed

    Myers, E A; Rodríguez-Robles, J A; Denardo, D F; Staub, R E; Stropoli, A; Ruane, S; Burbrink, F T

    2013-11-01

    Phylogeographic inference can determine the timing of population divergence, historical demographic processes, patterns of migration, and when extended to multiple species, the history of communities. Single-locus analyses can mislead interpretations of the evolutionary history of taxa and comparative analyses. It is therefore important to revisit previous single-locus phylogeographic studies, particularly those that have been used to propose general patterns for regional biotas and the processes responsible for generating inferred patterns. Here, we employ a multilocus statistical approach to re-examine the phylogeography of Lampropeltis zonata. Using nonparametic and Bayesian species delimitation, we determined that there are two well-supported species within L. zonata. Ecological niche modelling supports the delimitation of these taxa, suggesting that the two species inhabit distinct climatic environments. Gene flow between the two taxa is low and appears to occur unidirectionally. Further, our data suggest that gene flow was mediated by females, a rare pattern in snakes. In contrast to previous analyses, we determined that the divergence between the two lineages occurred in the late Pliocene (c. 2.07 Ma). Spatially and temporally, the divergence of these lineages is associated with the inundation of central California by the Monterey Bay. The effective population sizes of the two species appear to have been unaffected by Pleistocene glaciation. Our increased sampling of loci for L. zonata, combined with previously published multilocus analyses of other sympatric species, suggests that previous conclusions reached by comparative phylogeographic studies conducted within the California Floristic Province should be reassessed. © 2013 John Wiley & Sons Ltd.

  4. Identification of Putative Transmembrane Proteins Involved in Salinity Tolerance in Chenopodium quinoa by Integrating Physiological Data, RNAseq, and SNP Analyses

    PubMed Central

    Schmöckel, Sandra M.; Lightfoot, Damien J.; Razali, Rozaimi; Tester, Mark; Jarvis, David E.

    2017-01-01

    Chenopodium quinoa (quinoa) is an emerging crop that produces nutritious grains with the potential to contribute to global food security. Quinoa can also grow on marginal lands, such as soils affected by high salinity. To identify candidate salt tolerance genes in the recently sequenced quinoa genome, we used a multifaceted approach integrating RNAseq analyses with comparative genomics and topology prediction. We identified 219 candidate genes by selecting those that were differentially expressed in response to salinity, were specific to or overrepresented in quinoa relative to other Amaranthaceae species, and had more than one predicted transmembrane domain. To determine whether these genes might underlie variation in salinity tolerance in quinoa and its close relatives, we compared the response to salinity stress in a panel of 21 Chenopodium accessions (14 C. quinoa, 5 C. berlandieri, and 2 C. hircinum). We found large variation in salinity tolerance, with one C. hircinum displaying the highest salinity tolerance. Using genome re-sequencing data from these accessions, we investigated single nucleotide polymorphisms and copy number variation (CNV) in the 219 candidate genes in accessions of contrasting salinity tolerance, and identified 15 genes that could contribute to the differences in salinity tolerance of these Chenopodium accessions. PMID:28680429

  5. Integrative and conjugative elements and their hosts: composition, distribution and organization.

    PubMed

    Cury, Jean; Touchon, Marie; Rocha, Eduardo P C

    2017-09-06

    Conjugation of single-stranded DNA drives horizontal gene transfer between bacteria and was widely studied in conjugative plasmids. The organization and function of integrative and conjugative elements (ICE), even if they are more abundant, was only studied in a few model systems. Comparative genomics of ICE has been precluded by the difficulty in finding and delimiting these elements. Here, we present the results of a method that circumvents these problems by requiring only the identification of the conjugation genes and the species' pan-genome. We delimited 200 ICEs and this allowed the first large-scale characterization of these elements. We quantified the presence in ICEs of a wide set of functions associated with the biology of mobile genetic elements, including some that are typically associated with plasmids, such as partition and replication. Protein sequence similarity networks and phylogenetic analyses revealed that ICEs are structured in functional modules. Integrases and conjugation systems have different evolutionary histories, even if the gene repertoires of ICEs can be grouped in function of conjugation types. Our characterization of the composition and organization of ICEs paves the way for future functional and evolutionary analyses of their cargo genes, composed of a majority of unknown function genes. © The Author(s) 2017. Published by Oxford University Press on behalf of Nucleic Acids Research.

  6. [First attempts of detecting fetal cells in the maternal circulation].

    PubMed

    Nagy, Gyula Richárd; Bán, Zoltán; Sipos, Ferenc; Fent, János; Oroszné Nagy, Judit; Beke, Artúr; Furész, József; Papp, Zoltán

    2004-10-31

    In prenatal diagnosis there is great interest for noninvasive diagnostic methods. Authors report their first results in detecting fetal cells in the maternal circulation during pregnancy. The aim of the study was to detect fetal gender from maternal peripheral blood samples during pregnancy. Authors have analysed fetal nucleated red blood cells. In 12 cases after a double density Percoll gradient separation they labelled the surface antigens of the cells with anti-glycophorin-A and anti-CD45 fluorescent antibodies, did an intracellular staining of the epsilon haemoglobin chain, and analysed the cells with flow cytometry. The CD45 negative/glycophorin-A positive/epsilon-haemoglobin chain positive cells were considered as fetal cells. Having the results, in another 13 cases magnetic activated cell sorting with CD71 antibody were used as an enrichment step. Authors made an intracellular staining of the epsilon haemoglobin chain, the positive cells were isolated by micromanipulation, and analysed by single cell fluorescent polymerase chain reaction. Primers for the amelogenin gene were used to detect fetal gender. Only the Percoll enrichment step itself is not enough for using the samples for diagnostic molecular-biologic examinations, a following enrichment step is needed. For this the authors used magnetic activated cell sorting with CD71 antibody. With the help of this enrichment step, after the intracellular staining of the epsilon haemoglobin chain the direct micromanipulator isolation of the epsilon haemoglobin chain positive cells could be done. After analysing single cells by fluorescent polymerase chain reaction, in 8 out of the 11 comparable cases the results were similar to those, what was found during the genetic amniocentesis. In 2 cases from this 8, genetic amniocentesis proved Klinefelter syndrome, which they could also confirm with the examination of fetal cells in the maternal circulation. The results of the study suggest that the method described above can be useful in prenatal genetic diagnosis, and improving it could be useful to detect other genetic abnormalities (chromosomal abnormalities, single gene disorders) as well.

  7. Phylogeographic reconstruction of a bacterial species with high levels of lateral gene transfer

    USGS Publications Warehouse

    Pearson, T.; Giffard, P.; Beckstrom-Sternberg, S.; Auerbach, R.; Hornstra, H.; Tuanyok, A.; Price, E.P.; Glass, M.B.; Leadem, B.; Beckstrom-Sternberg, J. S.; Allan, G.J.; Foster, J.T.; Wagner, D.M.; Okinaka, R.T.; Sim, S.H.; Pearson, O.; Wu, Z.; Chang, J.; Kaul, R.; Hoffmaster, A.R.; Brettin, T.S.; Robison, R.A.; Mayo, M.; Gee, J.E.; Tan, P.; Currie, B.J.; Keim, P.

    2009-01-01

    Background: Phylogeographic reconstruction of some bacterial populations is hindered by low diversity coupled with high levels of lateral gene transfer. A comparison of recombination levels and diversity at seven housekeeping genes for eleven bacterial species, most of which are commonly cited as having high levels of lateral gene transfer shows that the relative contributions of homologous recombination versus mutation for Burkholderia pseudomallei is over two times higher than for Streptococcus pneumoniae and is thus the highest value yet reported in bacteria. Despite the potential for homologous recombination to increase diversity, B. pseudomallei exhibits a relative lack of diversity at these loci. In these situations, whole genome genotyping of orthologous shared single nucleotide polymorphism loci, discovered using next generation sequencing technologies, can provide very large data sets capable of estimating core phylogenetic relationships. We compared and searched 43 whole genome sequences of B. pseudomallei and its closest relatives for single nucleotide polymorphisms in orthologous shared regions to use in phylogenetic reconstruction. Results: Bayesian phylogenetic analyses of >14,000 single nucleotide polymorphisms yielded completely resolved trees for these 43 strains with high levels of statistical support. These results enable a better understanding of a separate analysis of population differentiation among >1,700 B. pseudomallei isolates as defined by sequence data from seven housekeeping genes. We analyzed this larger data set for population structure and allele sharing that can be attributed to lateral gene transfer. Our results suggest that despite an almost panmictic population, we can detect two distinct populations of B. pseudomallei that conform to biogeographic patterns found in many plant and animal species. That is, separation along Wallace's Line, a biogeographic boundary between Southeast Asia and Australia. Conclusion: We describe an Australian origin for B. pseudomallei, characterized by a single introduction event into Southeast Asia during a recent glacial period, and variable levels of lateral gene transfer within populations. These patterns provide insights into mechanisms of genetic diversification in B. pseudomallei and its closest relatives, and provide a framework for integrating the traditionally separate fields of population genetics and phylogenetics for other bacterial species with high levels of lateral gene transfer. ?? 2009 Pearson et al; licensee BioMed Central Ltd.

  8. Whole-exome sequencing of 228 patients with sporadic Parkinson's disease.

    PubMed

    Sandor, Cynthia; Honti, Frantisek; Haerty, Wilfried; Szewczyk-Krolikowski, Konrad; Tomlinson, Paul; Evetts, Sam; Millin, Stephanie; Keane, Thomas; McCarthy, Shane A; Durbin, Richard; Talbot, Kevin; Hu, Michele; Webber, Caleb; Ponting, Chris P; Wade-Martins, Richard

    2017-01-24

    Parkinson's disease (PD) is the most common neurodegenerative movement disorder, affecting 1% of the population over 65 years characterized clinically by both motor and non-motor symptoms accompanied by the preferential loss of dopamine neurons in the substantia nigra pars compacta. Here, we sequenced the exomes of 244 Parkinson's patients selected from the Oxford Parkinson's Disease Centre Discovery Cohort and, after quality control, 228 exomes were available for analyses. The PD patient exomes were compared to 884 control exomes selected from the UK10K datasets. No single non-synonymous (NS) single nucleotide variant (SNV) nor any gene carrying a higher burden of NS SNVs was significantly associated with PD status after multiple-testing correction. However, significant enrichments of genes whose proteins have roles in the extracellular matrix were amongst the top 300 genes with the most significantly associated NS SNVs, while regions associated with PD by a recent Genome Wide Association (GWA) study were enriched in genes containing PD-associated NS SNVs. By examining genes within GWA regions possessing rare PD-associated SNVs, we identified RAD51B. The protein-product of RAD51B interacts with that of its paralogue RAD51, which is associated with congenital mirror movements phenotypes, a phenotype also comorbid with PD.

  9. Cloning and expression analysis of the ornithine decarboxylase gene (PbrODC) of the pathogenic fungus Paracoccidioides brasiliensis.

    PubMed

    Niño-Vega, Gustavo A; Sorais, Françoise; Calcagno, Ana-María; Ruiz-Herrera, José; Martínez-Espinoza, Alfredo D; San-Blas, Gioconda

    2004-02-01

    We describe the isolation and sequencing of PbrODC, the gene encoding ornithine decarboxylase (ODC) in Paracoccidioides brasiliensis. The gene contains a single open reading frame made of 1413 bp with a single intron (72 bp), and encodes a 447 amino acid polypeptide with a predicted molecular weight of 50.0 kDa, an isoelectric point of 4.9 and a high similarity to other fungal ornithine decarboxylases. Functionality of the gene was demonstrated by transformation into a Saccharomyces cerevisiae odc null mutant. A phylogenetic tree generated with several fungal ODCs provided additional evidence to favour a taxonomic position for P. brasiliensis as an ascomycetous fungus, belonging to the order Onygenales. Expression of the PbrODC gene was determined by Northern analyses during growth of the mycelial and yeast forms, and through the temperature-regulated dimorphic transition between these two extreme phases. Expression of PbrODC remained constant at all stages of the fungal growth, and did not correlate with a previously observed increase in the activity of ornithine decarboxylase at the onset of the budding process in both yeast growth and mycelium-to-yeast transition. Accordingly, post-transcriptional regulation for the product of PbrODC is suggested. Copyright 2004 John Wiley & Sons, Ltd.

  10. Single-cell and metagenomic analyses indicate a fermentative and saccharolytic lifestyle for members of the OP9 lineage

    PubMed Central

    Dodsworth, Jeremy A.; Blainey, Paul C.; Murugapiran, Senthil K.; Swingley, Wesley D.; Ross, Christian A.; Tringe, Susannah G.; Chain, Patrick S. G.; Scholz, Matthew B.; Lo, Chien-Chi; Raymond, Jason; Quake, Stephen R.; Hedlund, Brian P.

    2013-01-01

    OP9 is a yet-uncultivated bacterial lineage found in geothermal systems, petroleum reservoirs, anaerobic digesters, and wastewater treatment facilities. Here we use single-cell and metagenome sequencing to obtain two distinct, nearly-complete OP9 genomes, one constructed from single cells sorted from hot spring sediments and the other derived from binned metagenomic contigs from an in situ-enriched cellulolytic, thermophilic community. Phylogenomic analyses support the designation of OP9 as a candidate phylum for which we propose the name ‘Atribacteria’. Although a plurality of predicted proteins is most similar to those from Firmicutes, the presence of key genes suggests a diderm cell envelope. Metabolic reconstruction from the core genome suggests an anaerobic lifestyle based on sugar fermentation by Embden-Meyerhof glycolysis with production of hydrogen, acetate, and ethanol. Putative glycohydrolases and an endoglucanase may enable catabolism of (hemi)cellulose in thermal environments. This study lays a foundation for understanding the physiology and ecological role of the ‘Atribacteria’. PMID:23673639

  11. Identification and environmental distribution of dcpA encoding the 1,2-dichloropropane-to-propene reductive dehalogenase in organohalide-respiring Chloroflexi

    DOE Office of Scientific and Technical Information (OSTI.GOV)

    Padilla-Crespo, Elizabeth; Yan, Jun; Swift, Cynthia M

    2014-01-01

    Dehalococcoides mccartyi (Dhc) strains KS and RC grow with 1,2-dichloropropane (1,2-D) as an electron acceptor in enrichment cultures derived from hydrocarbon-contaminated and pristine river sediments, respectively. Transcription, expression, enzymatic and PCR analyses implicated the reductive dehalogenase gene dcpA in 1,2-D dichloroelimination to propene and inorganic chloride. Quantitative real-time PCR (qPCR) analyses demonstrated Dhc cell increase during growth with 1,2-D and suggested that both Dhc strains carried a single dcpA gene copy per genome. Dhc strain RC and strain KS produced 1.8 0.1 x 107 and 1.4 0.5 x 107 cells per mole of propene formed, respectively. The dcpA gene wasmore » identified in 1,2-D-to-propene-dechlorinating microcosms established with sediment samples collected from different geographical locations in Europe and North and South America. Clone library analysis revealed two distinct dcpA phylogenetic clusters, both of which the dcpA gene-targeted qPCR assay captured, suggesting the qPCR assay is useful for site assessment and bioremediation monitoring at 1,2-D-contaminated sites.« less

  12. Genomic Biomarkers for Breast Cancer Risk

    PubMed Central

    Walsh, Michael F.; Nathanson, Katherine L.; Couch, Fergus J.

    2016-01-01

    Clinical risk assessment for cancer predisposition includes a three-generation pedigree and physical examination to identify inherited syndromes. Additionally genetic and genomic biomarkers may identify individuals with a constitutional basis for their disease that may not be evident clinically. Genomic biomarker testing may detect molecular variations in single genes, panels of genes, or entire genomes. The strength of evidence for the association of a genomic biomarker with disease risk may be weak or strong. The factors contributing to clinical validity and utility of genomic biomarkers include functional laboratory analyses and genetic epidemiologic evidence. Genomic biomarkers may be further classified as low, moderate or highly penetrant based on the likelihood of disease. Genomic biomarkers for breast cancer are comprised of rare highly penetrant mutations of genes such as BRCA1 or BRCA2, moderately penetrant mutations of genes such as CHEK2, as well as more common genomic variants, including single nucleotide polymorphisms, associated with modest effect sizes. When applied in the context of appropriate counseling and interpretation, identification of genomic biomarkers of inherited risk for breast cancer may decrease morbidity and mortality, allow for definitive prevention through assisted reproduction, and serve as a guide to targeted therapy. PMID:26987529

  13. Genetic variations in the beta-tubulin gene and the internal transcribed spacer 2 region of Trichuris species from man and baboons.

    PubMed

    Hansen, Tina V A; Thamsborg, Stig M; Olsen, Annette; Prichard, Roger K; Nejsum, Peter

    2013-08-12

    The whipworm Trichuris trichiura has been estimated to infect 604 - 795 million people worldwide. The current control strategy against trichuriasis using the benzimidazoles (BZs) albendazole (400 mg) or mebendazole (500 mg) as single-dose treatment is not satisfactory. The occurrence of single nucleotide polymorphisms (SNPs) in codons 167, 198 or 200 of the beta-tubulin gene has been reported to convey BZ-resistance in intestinal nematodes of veterinary importance. It was hypothesised that the low susceptibility of T. trichiura to BZ could be due to a natural occurrence of such SNPs. The aim of this study was to investigate whether these SNPs were present in the beta-tubulin gene of Trichuris spp. from humans and baboons. As a secondary objective, the degree of identity between T. trichiura from humans and Trichuris spp. from baboons was evaluated based on the beta-tubulin gene and the internal transcribed spacer 2 region (ITS2). Nucleotide sequences of the beta-tubulin gene were generated by PCR using degenerate primers, specific primers and DNA from worms and eggs of T. trichiura and worms of Trichuris spp. from baboons. The ITS2 region was amplified using adult Trichuris spp. from baboons. PCR products were sequenced and analysed. The beta-tubulin fragments were studied for SNPs in codons 167, 198 or 200 and the ITS2 amplicons were compared with GenBank records of T. trichiura. No SNPs in codons 167, 198 or 200 were identified in any of the analysed Trichuris spp. from humans and baboons. Based on the ITS2 region, the similarity between Trichuris spp. from baboons and GenBank records of T. trichiura was found to be 98 - 99%. Single nucleotide polymorphisms in codon 167, 198 and 200, known to confer BZ-resistance in other nematodes, were absent in the studied material. This study does not provide data that could explain previous reports of poor BZ treatment efficacy in terms of polymorphism in these codons of beta-tubulin. Based on a fragment of the beta-tubulin gene and the ITS2 region sequenced, it was found that T. trichiura from humans and Trichuris spp. isolated from baboons are closely related and may be the same species.

  14. Genetic variations in the beta-tubulin gene and the internal transcribed spacer 2 region of Trichuris species from man and baboons

    PubMed Central

    2013-01-01

    Background The whipworm Trichuris trichiura has been estimated to infect 604 – 795 million people worldwide. The current control strategy against trichuriasis using the benzimidazoles (BZs) albendazole (400 mg) or mebendazole (500 mg) as single-dose treatment is not satisfactory. The occurrence of single nucleotide polymorphisms (SNPs) in codons 167, 198 or 200 of the beta-tubulin gene has been reported to convey BZ-resistance in intestinal nematodes of veterinary importance. It was hypothesised that the low susceptibility of T. trichiura to BZ could be due to a natural occurrence of such SNPs. The aim of this study was to investigate whether these SNPs were present in the beta-tubulin gene of Trichuris spp. from humans and baboons. As a secondary objective, the degree of identity between T. trichiura from humans and Trichuris spp. from baboons was evaluated based on the beta-tubulin gene and the internal transcribed spacer 2 region (ITS2). Methods Nucleotide sequences of the beta-tubulin gene were generated by PCR using degenerate primers, specific primers and DNA from worms and eggs of T. trichiura and worms of Trichuris spp. from baboons. The ITS2 region was amplified using adult Trichuris spp. from baboons. PCR products were sequenced and analysed. The beta-tubulin fragments were studied for SNPs in codons 167, 198 or 200 and the ITS2 amplicons were compared with GenBank records of T. trichiura. Results No SNPs in codons 167, 198 or 200 were identified in any of the analysed Trichuris spp. from humans and baboons. Based on the ITS2 region, the similarity between Trichuris spp. from baboons and GenBank records of T. trichiura was found to be 98 – 99%. Conclusions Single nucleotide polymorphisms in codon 167, 198 and 200, known to confer BZ-resistance in other nematodes, were absent in the studied material. This study does not provide data that could explain previous reports of poor BZ treatment efficacy in terms of polymorphism in these codons of beta-tubulin. Based on a fragment of the beta-tubulin gene and the ITS2 region sequenced, it was found that T. trichiura from humans and Trichuris spp. isolated from baboons are closely related and may be the same species. PMID:23938038

  15. Effects of simulated microgravity on gene expression and biological phenotypes of a single generation Caenorhabditis elegans cultured on 2 different media

    NASA Astrophysics Data System (ADS)

    Tee, Ling Fei; Neoh, Hui-min; Then, Sue Mian; Murad, Nor Azian; Asillam, Mohd Fairos; Hashim, Mohd Helmy; Nathan, Sheila; Jamal, Rahman

    2017-11-01

    Studies of multigenerational Caenorhabditis elegans exposed to long-term spaceflight have revealed expression changes of genes involved in longevity, DNA repair, and locomotion. However, results from spaceflight experiments are difficult to reproduce as space missions are costly and opportunities are rather limited for researchers. In addition, multigenerational cultures of C. elegans used in previous studies contribute to mixture of gene expression profiles from both larvae and adult worms, which were recently reported to be different. Usage of different culture media during microgravity simulation experiments might also give rise to differences in the gene expression and biological phenotypes of the worms. In this study, we investigated the effects of simulated microgravity on the gene expression and biological phenotype profiles of a single generation of C. elegans worms cultured on 2 different culture media. A desktop Random Positioning Machine (RPM) was used to simulate microgravity on the worms for approximately 52 to 54 h. Gene expression profile was analysed using the Affymetrix GeneChip® C. elegans 1.0 ST Array. Only one gene (R01H2.2) was found to be downregulated in nematode growth medium (NGM)-cultured worms exposed to simulated microgravity. On the other hand, eight genes were differentially expressed for C. elegans Maintenance Medium (CeMM)-cultured worms in microgravity; six were upregulated, while two were downregulated. Five of the upregulated genes (C07E3.15, C34H3.21, C32D5.16, F35H8.9 and C34F11.17) encode non-coding RNAs. In terms of biological phenotype, we observed that microgravity-simulated worms experienced minimal changes in terms of lifespan, locomotion and reproductive capabilities in comparison with the ground controls. Taking it all together, simulated microgravity on a single generation of C. elegans did not confer major changes to their gene expression and biological phenotype. Nevertheless, exposure of the worms to microgravity lead to higher expression of non-coding RNA genes, which may play an epigenetic role in the worms during longer terms of microgravity exposure.

  16. Associations Between Genetic Variants of NADPH Oxidase-Related Genes and Blood Pressure Responses to Dietary Sodium Intervention: The GenSalt Study.

    PubMed

    Han, Xikun; Hu, Zunsong; Chen, Jing; Huang, Jianfeng; Huang, Chen; Liu, Fangchao; Gu, Charles; Yang, Xueli; Hixson, James E; Lu, Xiangfeng; Wang, Laiyuan; Liu, De-Pei; He, Jiang; Chen, Shufeng; Gu, Dongfeng

    2017-04-01

    The aim of this study was to comprehensively test the associations of genetic variants of nicotinamide adenine dinucleotide phosphate (NADPH) oxidase-related genes with blood pressure (BP) responses to dietary sodium intervention in a Chinese population. We conducted a 7-day low-sodium intervention followed by a 7-day high-sodium intervention among 1,906 participants in rural China. BP measurements were obtained at baseline and each dietary intervention using a random-zero sphygmomanometer. Linear mixed-effect models were used to assess the additive associations of 63 tag single-nucleotide polymorphisms in 11 NADPH oxidase-related genes with BP responses to dietary sodium intervention. Gene-based analyses were conducted using the truncated product method. The Bonferroni method was used to adjust for multiple testing in all analyses. Systolic BP (SBP) response to high-sodium intervention significantly decreased with the number of minor T allele of marker rs6967221 in RAC1 (P = 4.51 × 10-4). SBP responses (95% confidence interval) for genotypes CC, CT, and TT were 5.03 (4.71, 5.36), 4.20 (3.54, 4.85), and 0.56 (-1.08, 2.20) mm Hg, respectively, during the high-sodium intervention. Gene-based analyses revealed that RAC1 was significantly associated with SBP response to high-sodium intervention (P = 1.00 × 10-6) and diastolic BP response to low-sodium intervention (P = 9.80 × 10-4). These findings suggested that genetic variants of NADPH oxidase-related genes may contribute to the variation of BP responses to sodium intervention in Chinese population. Further replication of these findings is warranted. © American Journal of Hypertension, Ltd 2017. All rights reserved. For Permissions, please email: journals.permissions@oup.com

  17. Genetic variations in vitamin D-related pathways and breast cancer risk in African American women in the AMBER consortium

    PubMed Central

    Yao, Song; Haddad, Stephen A.; Hu, Qiang; Liu, Song; Lunetta, Kathryn L.; Ruiz-Narvaez, Edward A.; Hong, Chi-Chen; Zhu, Qianqian; Sucheston-Campbell, Lara; Cheng, Ting-Yuan David; Bensen, Jeannette T.; Johnson, Candace S.; Trump, Donald L.; Haiman, Christopher A.; Olshan, Andrew F.; Palmer, Julie R.; Ambrosone, Christine B.

    2016-01-01

    Studies of genetic variations in vitamin D-related pathways and breast cancer risk have been conducted mostly in populations of European ancestry, and only sparsely in African Americans (AA), who are known for a high prevalence of vitamin D deficiency. We analyzed 24,445 germline variants in 63 genes from vitamin D-related pathways in the African American Breast Cancer Epidemiology and Risk (AMBER) consortium, including 3,663 breast cancer cases and 4,687 controls. Odds ratios (OR) were derived from logistic regression models for overall breast cancer, by estrogen receptor (ER) status (1,983 ER positive and 1,098 ER negative), and for case-only analyses of ER status. None of the three vitamin D-related pathways were associated with breast cancer risk overall or by ER status. Gene-level analyses identified associations with risk for several genes at a nominal p ≤ 0.05, particularly for ER− breast cancer, including rs4647707 in DDB2. In case-only analyses, vitamin D metabolism and signaling pathways were associated with ER− cancer (pathway-level p = 0.02), driven by a single gene CASR (gene-level p = 0.001). The top SNP in CASR was rs112594756 (p = 7 × 10−5, gene-wide corrected p = 0.01), followed by a second signal from a nearby SNP rs6799828 (p = 1 × 10−4, corrected p = 0.03). In summary, several variants in vitamin D pathways were associated with breast cancer risk in AA women. In addition, CASR may be related to tumor ER status, supporting a role of vitamin D or calcium in modifying breast cancer phenotypes. PMID:26650177

  18. Mitogenome Phylogenetics: The Impact of Using Single Regions and Partitioning Schemes on Topology, Substitution Rate and Divergence Time Estimation

    PubMed Central

    Duchêne, Sebastián; Archer, Frederick I.; Vilstrup, Julia; Caballero, Susana; Morin, Phillip A.

    2011-01-01

    The availability of mitochondrial genome sequences is growing as a result of recent technological advances in molecular biology. In phylogenetic analyses, the complete mitogenome is increasingly becoming the marker of choice, usually providing better phylogenetic resolution and precision relative to traditional markers such as cytochrome b (CYTB) and the control region (CR). In some cases, the differences in phylogenetic estimates between mitogenomic and single-gene markers have yielded incongruent conclusions. By comparing phylogenetic estimates made from different genes, we identified the most informative mitochondrial regions and evaluated the minimum amount of data necessary to reproduce the same results as the mitogenome. We compared results among individual genes and the mitogenome for recently published complete mitogenome datasets of selected delphinids (Delphinidae) and killer whales (genus Orcinus). Using Bayesian phylogenetic methods, we investigated differences in estimation of topologies, divergence dates, and clock-like behavior among genes for both datasets. Although the most informative regions were not the same for each taxonomic group (COX1, CYTB, ND3 and ATP6 for Orcinus, and ND1, COX1 and ND4 for Delphinidae), in both cases they were equivalent to less than a quarter of the complete mitogenome. This suggests that gene information content can vary among groups, but can be adequately represented by a portion of the complete sequence. Although our results indicate that complete mitogenomes provide the highest phylogenetic resolution and most precise date estimates, a minimum amount of data can be selected using our approach when the complete sequence is unavailable. Studies based on single genes can benefit from the addition of a few more mitochondrial markers, producing topologies and date estimates similar to those obtained using the entire mitogenome. PMID:22073275

  19. Mitochondrial phylogeny, divergence history and high-altitude adaptation of grassland caterpillars (Lepidoptera: Lymantriinae: Gynaephora) inhabiting the Tibetan Plateau.

    PubMed

    Yuan, Ming-Long; Zhang, Qi-Lin; Zhang, Li; Jia, Cheng-Lin; Li, Xiao-Peng; Yang, Xing-Zhuo; Feng, Run-Qiu

    2018-05-01

    Grassland caterpillars (Lepidoptera: Lymantriinae: Gynaephora) are the most important pests in alpine meadows of the Tibetan Plateau (TP) and have well adapted to high-altitude environments. To further understand the evolutionary history and their adaptation to the TP, we newly determined seven complete TP Gynaephora mitogenomes. Compared to single genes, whole mitogenomes provided the best phylogenetic signals and obtained robust results, supporting the monophyly of the TP Gynaephora species and a phylogeny of Arctiinae + (Aganainae + Lymantriinae). Incongruent phylogenetic signals were found among single mitochondrial genes, none of which recovered the same phylogeny as the whole mitogenome. We identified six best-performing single genes using Shimodaira-Hasegawa tests and found that the combinations of rrnS and either cox1 or cox3 generated the same phylogeny as the whole mitogenome, indicating the phylogenetic potential of these three genes for future evolutionary studies of Gynaephora. The TP Gynaephora species were estimated to radiate on the TP during the Pliocene and Quaternary, supporting an association of the diversification and speciation of the TP Gynaephora species with the TP uplifts and associated climate changes during this time. Selection analyses revealed accelerated evolutionary rates of the mitochondrial protein-coding genes in the TP Gynaephora species, suggesting that they accumulated more nonsynonymous substitutions that may benefit their adaptation to high altitudes. Furthermore, signals of positive selection were detected in nad5 of two Gynaephora species with the highest altitude-distributions, indicating that this gene may contribute to Gynaephora's adaptation to divergent altitudes. This study adds to the understanding of the TP Gynaephora evolutionary relationships and suggests a link between mitogenome evolution and ecological adaptation to high-altitude environments in grassland caterpillars. Copyright © 2018 Elsevier Inc. All rights reserved.

  20. Association between polymorphisms in prostanoid receptor genes and aspirin-intolerant asthma.

    PubMed

    Kim, Sang-Heon; Kim, Yoon-Keun; Park, Heung-Woo; Jee, Young-Koo; Kim, Sang-Hoon; Bahn, Joon-Woo; Chang, Yoon-Seok; Kim, Seung-Hyun; Ye, Young-Min; Shin, Eun-Soon; Lee, Jong-Eun; Park, Hae-Sim; Min, Kyung-Up

    2007-04-01

    Genetic predisposition is linked to the pathogenesis of aspirin-intolerant asthma. Most candidate gene approaches have focused on leukotriene-related pathways, whereas there have been relatively few studies evaluating the effects of polymorphisms in prostanoid receptor genes on the development of aspirin-intolerant asthma. Therefore, we investigated the potential association between prostanoid receptor gene polymorphisms and the aspirin-intolerant asthma phenotype. We screened for genetic variations in the prostanoid receptor genes PTGER1, PTGER2, PTGER3, PTGER4, PTGDR, PTGIR, PTGFR, and TBXA2R using direct sequencing, and selected 32 tagging single nucleotide polymorphisms among the 77 polymorphisms with frequencies >0.02 based on linkage disequilibrium for genotyping. We compared the genotype distributions and allele frequencies of three participant groups (108 patients with aspirin-intolerant asthma, 93 patients with aspirin-tolerant asthma, and 140 normal controls). Through association analyses studies of the 32 single nucleotide polymorphisms, the following single nucleotide polymorphisms were found to have significant associations with the aspirin-intolerant asthma phenotype: -616C>G (P=0.038) and -166G>A (P=0.023) in PTGER2; -1709T>A (P=0.043) in PTGER3; -1254A>G (P=0.018) in PTGER4; 1915T>C (P=0.015) in PTGIR; and -4684C>T (P=0.027), and 795T>C (P=0.032) in TBXA2R. In the haplotype analysis of each gene, the frequency of PTGIR ht3[G-G-C-C], which includes 1915T>C, differed significantly between the aspirin-intolerant asthma patients and aspirin-tolerant asthma patients (P=0.015). These findings suggest that genetic polymorphisms in PTGER2, PTGER3, PTGER4, PTGIR, and TBXA2R play important roles in the pathogenesis of aspirin-intolerant asthma.

  1. Dancing together and separate again: gymnosperms exhibit frequent changes of fundamental 5S and 35S rRNA gene (rDNA) organisation

    PubMed Central

    Garcia, S; Kovařík, A

    2013-01-01

    In higher eukaryotes, the 5S rRNA genes occur in tandem units and are arranged either separately (S-type arrangement) or linked to other repeated genes, in most cases to rDNA locus encoding 18S–5.8S–26S genes (L-type arrangement). Here we used Southern blot hybridisation, PCR and sequencing approaches to analyse genomic organisation of rRNA genes in all large gymnosperm groups, including Coniferales, Ginkgoales, Gnetales and Cycadales. The data are provided for 27 species (21 genera). The 5S units linked to the 35S rDNA units occur in some but not all Gnetales, Coniferales and in Ginkgo (∼30% of the species analysed), while the remaining exhibit separate organisation. The linked 5S rRNA genes may occur as single-copy insertions or as short tandems embedded in the 26S–18S rDNA intergenic spacer (IGS). The 5S transcript may be encoded by the same (Ginkgo, Ephedra) or opposite (Podocarpus) DNA strand as the 18S–5.8S–26S genes. In addition, pseudogenised 5S copies were also found in some IGS types. Both L- and S-type units have been largely homogenised across the genomes. Phylogenetic relationships based on the comparison of 5S coding sequences suggest that the 5S genes independently inserted IGS at least three times in the course of gymnosperm evolution. Frequent transpositions and rearrangements of basic units indicate relatively relaxed selection pressures imposed on genomic organisation of 5S genes in plants. PMID:23512008

  2. Dancing together and separate again: gymnosperms exhibit frequent changes of fundamental 5S and 35S rRNA gene (rDNA) organisation.

    PubMed

    Garcia, S; Kovařík, A

    2013-07-01

    In higher eukaryotes, the 5S rRNA genes occur in tandem units and are arranged either separately (S-type arrangement) or linked to other repeated genes, in most cases to rDNA locus encoding 18S-5.8S-26S genes (L-type arrangement). Here we used Southern blot hybridisation, PCR and sequencing approaches to analyse genomic organisation of rRNA genes in all large gymnosperm groups, including Coniferales, Ginkgoales, Gnetales and Cycadales. The data are provided for 27 species (21 genera). The 5S units linked to the 35S rDNA units occur in some but not all Gnetales, Coniferales and in Ginkgo (∼30% of the species analysed), while the remaining exhibit separate organisation. The linked 5S rRNA genes may occur as single-copy insertions or as short tandems embedded in the 26S-18S rDNA intergenic spacer (IGS). The 5S transcript may be encoded by the same (Ginkgo, Ephedra) or opposite (Podocarpus) DNA strand as the 18S-5.8S-26S genes. In addition, pseudogenised 5S copies were also found in some IGS types. Both L- and S-type units have been largely homogenised across the genomes. Phylogenetic relationships based on the comparison of 5S coding sequences suggest that the 5S genes independently inserted IGS at least three times in the course of gymnosperm evolution. Frequent transpositions and rearrangements of basic units indicate relatively relaxed selection pressures imposed on genomic organisation of 5S genes in plants.

  3. Recent events dominate interdomain lateral gene transfers between prokaryotes and eukaryotes and, with the exception of endosymbiotic gene transfers, few ancient transfer events persist

    PubMed Central

    Katz, Laura A.

    2015-01-01

    While there is compelling evidence for the impact of endosymbiotic gene transfer (EGT; transfer from either mitochondrion or chloroplast to the nucleus) on genome evolution in eukaryotes, the role of interdomain transfer from bacteria and/or archaea (i.e. prokaryotes) is less clear. Lateral gene transfers (LGTs) have been argued to be potential sources of phylogenetic information, particularly for reconstructing deep nodes that are difficult to recover with traditional phylogenetic methods. We sought to identify interdomain LGTs by using a phylogenomic pipeline that generated 13 465 single gene trees and included up to 487 eukaryotes, 303 bacteria and 118 archaea. Our goals include searching for LGTs that unite major eukaryotic clades, and describing the relative contributions of LGT and EGT across the eukaryotic tree of life. Given the difficulties in interpreting single gene trees that aim to capture the approximately 1.8 billion years of eukaryotic evolution, we focus on presence–absence data to identify interdomain transfer events. Specifically, we identify 1138 genes found only in prokaryotes and representatives of three or fewer major clades of eukaryotes (e.g. Amoebozoa, Archaeplastida, Excavata, Opisthokonta, SAR and orphan lineages). The majority of these genes have phylogenetic patterns that are consistent with recent interdomain LGTs and, with the notable exception of EGTs involving photosynthetic eukaryotes, we detect few ancient interdomain LGTs. These analyses suggest that LGTs have probably occurred throughout the history of eukaryotes, but that ancient events are not maintained unless they are associated with endosymbiotic gene transfer among photosynthetic lineages. PMID:26323756

  4. Pathway analyses and understanding disease associations

    PubMed Central

    Liu, Yu; Chance, Mark R

    2013-01-01

    High throughput technologies have been applied to investigate the underlying mechanisms of complex diseases, identify disease-associations and help to improve treatment. However it is challenging to derive biological insight from conventional single gene based analysis of “omics” data from high throughput experiments due to sample and patient heterogeneity. To address these challenges, many novel pathway and network based approaches were developed to integrate various “omics” data, such as gene expression, copy number alteration, Genome Wide Association Studies, and interaction data. This review will cover recent methodological developments in pathway analysis for the detection of dysregulated interactions and disease-associated subnetworks, prioritization of candidate disease genes, and disease classifications. For each application, we will also discuss the associated challenges and potential future directions. PMID:24319650

  5. Failure to find linkage between a functional polymorphism in the dopamine D4 receptor gene and schizophrenia

    DOE Office of Scientific and Technical Information (OSTI.GOV)

    Shaikh, S.; Gill, M.; Collier, D.A.

    1994-03-15

    We report the results of a linkage study in 24 families multiply affected with schizophrenia using a polymorphic DNA sequence encoding the third cytoplasmic loop of the dopamine D4 receptor. Two-point LOD score analyses with a range of single gene models ranging from near dominant to near recessive revealed no evidence for linkage. In addition, we examined the data by non-parametric sib-pair analysis and found no excess sharing of alleles between affected sib-pairs. We therefore conclude that mutations within the dopamine D4 receptor gene do not have a major aetiological role in schizophrenia in our collection of pedigrees. 20 refs.,more » 2 tabs.« less

  6. Complete chloroplast genome sequence and comparative analysis of loblolly pine (Pinus taeda L.) with related species

    PubMed Central

    Khan, Abdul Latif; Khan, Muhammad Aaqil; Shahzad, Raheem; Lubna; Kang, Sang Mo; Al-Harrasi, Ahmed; Al-Rawahi, Ahmed; Lee, In-Jung

    2018-01-01

    Pinaceae, the largest family of conifers, has a diversified organization of chloroplast (cp) genomes with two typical highly reduced inverted repeats (IRs). In the current study, we determined the complete sequence of the cp genome of an economically and ecologically important conifer tree, the loblolly pine (Pinus taeda L.), using Illumina paired-end sequencing and compared the sequence with those of other pine species. The results revealed a genome size of 121,531 base pairs (bp) containing a pair of 830-bp IR regions, distinguished by a small single copy (42,258 bp) and large single copy (77,614 bp) region. The chloroplast genome of P. taeda encodes 120 genes, comprising 81 protein-coding genes, four ribosomal RNA genes, and 35 tRNA genes, with 151 randomly distributed microsatellites. Approximately 6 palindromic, 34 forward, and 22 tandem repeats were found in the P. taeda cp genome. Whole cp genome comparison with those of other Pinus species exhibited an overall high degree of sequence similarity, with some divergence in intergenic spacers. Higher and lower numbers of indels and single-nucleotide polymorphism substitutions were observed relative to P. contorta and P. monophylla, respectively. Phylogenomic analyses based on the complete genome sequence revealed that 60 shared genes generated trees with the same topologies, and P. taeda was closely related to P. contorta in the subgenus Pinus. Thus, the complete P. taeda genome provided valuable resources for population and evolutionary studies of gymnosperms and can be used to identify related species. PMID:29596414

  7. Identification of Suitable Reference Genes for Gene Expression Normalization in qRT-PCR Analysis in Watermelon

    PubMed Central

    Gao, Lingyun; Zhao, Shuang; Jiang, Wei; Huang, Yuan; Bie, Zhilong

    2014-01-01

    Watermelon is one of the major Cucurbitaceae crops and the recent availability of genome sequence greatly facilitates the fundamental researches on it. Quantitative real-time reverse transcriptase PCR (qRT–PCR) is the preferred method for gene expression analyses, and using validated reference genes for normalization is crucial to ensure the accuracy of this method. However, a systematic validation of reference genes has not been conducted on watermelon. In this study, transcripts of 15 candidate reference genes were quantified in watermelon using qRT–PCR, and the stability of these genes was compared using geNorm and NormFinder. geNorm identified ClTUA and ClACT, ClEF1α and ClACT, and ClCAC and ClTUA as the best pairs of reference genes in watermelon organs and tissues under normal growth conditions, abiotic stress, and biotic stress, respectively. NormFinder identified ClYLS8, ClUBCP, and ClCAC as the best single reference genes under the above experimental conditions, respectively. ClYLS8 and ClPP2A were identified as the best reference genes across all samples. Two to nine reference genes were required for more reliable normalization depending on the experimental conditions. The widely used watermelon reference gene 18SrRNA was less stable than the other reference genes under the experimental conditions. Catalase family genes were identified in watermelon genome, and used to validate the reliability of the identified reference genes. ClCAT1and ClCAT2 were induced and upregulated in the first 24 h, whereas ClCAT3 was downregulated in the leaves under low temperature stress. However, the expression levels of these genes were significantly overestimated and misinterpreted when 18SrRNA was used as a reference gene. These results provide a good starting point for reference gene selection in qRT–PCR analyses involving watermelon. PMID:24587403

  8. Identification of suitable reference genes for gene expression normalization in qRT-PCR analysis in watermelon.

    PubMed

    Kong, Qiusheng; Yuan, Jingxian; Gao, Lingyun; Zhao, Shuang; Jiang, Wei; Huang, Yuan; Bie, Zhilong

    2014-01-01

    Watermelon is one of the major Cucurbitaceae crops and the recent availability of genome sequence greatly facilitates the fundamental researches on it. Quantitative real-time reverse transcriptase PCR (qRT-PCR) is the preferred method for gene expression analyses, and using validated reference genes for normalization is crucial to ensure the accuracy of this method. However, a systematic validation of reference genes has not been conducted on watermelon. In this study, transcripts of 15 candidate reference genes were quantified in watermelon using qRT-PCR, and the stability of these genes was compared using geNorm and NormFinder. geNorm identified ClTUA and ClACT, ClEF1α and ClACT, and ClCAC and ClTUA as the best pairs of reference genes in watermelon organs and tissues under normal growth conditions, abiotic stress, and biotic stress, respectively. NormFinder identified ClYLS8, ClUBCP, and ClCAC as the best single reference genes under the above experimental conditions, respectively. ClYLS8 and ClPP2A were identified as the best reference genes across all samples. Two to nine reference genes were required for more reliable normalization depending on the experimental conditions. The widely used watermelon reference gene 18SrRNA was less stable than the other reference genes under the experimental conditions. Catalase family genes were identified in watermelon genome, and used to validate the reliability of the identified reference genes. ClCAT1and ClCAT2 were induced and upregulated in the first 24 h, whereas ClCAT3 was downregulated in the leaves under low temperature stress. However, the expression levels of these genes were significantly overestimated and misinterpreted when 18SrRNA was used as a reference gene. These results provide a good starting point for reference gene selection in qRT-PCR analyses involving watermelon.

  9. Phylogeny and temporal diversification of darters (Percidae: Etheostomatinae).

    PubMed

    Near, Thomas J; Bossu, Christen M; Bradburd, Gideon S; Carlson, Rose L; Harrington, Richard C; Hollingsworth, Phillip R; Keck, Benjamin P; Etnier, David A

    2011-10-01

    Discussions aimed at resolution of the Tree of Life are most often focused on the interrelationships of major organismal lineages. In this study, we focus on the resolution of some of the most apical branches in the Tree of Life through exploration of the phylogenetic relationships of darters, a species-rich clade of North American freshwater fishes. With a near-complete taxon sampling of close to 250 species, we aim to investigate strategies for efficient multilocus data sampling and the estimation of divergence times using relaxed-clock methods when a clade lacks a fossil record. Our phylogenetic data set comprises a single mitochondrial DNA (mtDNA) gene and two nuclear genes sampled from 245 of the 248 darter species. This dense sampling allows us to determine if a modest amount of nuclear DNA sequence data can resolve relationships among closely related animal species. Darters lack a fossil record to provide age calibration priors in relaxed-clock analyses. Therefore, we use a near-complete species-sampled phylogeny of the perciform clade Centrarchidae, which has a rich fossil record, to assess two distinct strategies of external calibration in relaxed-clock divergence time estimates of darters: using ages inferred from the fossil record and molecular evolutionary rate estimates. Comparison of Bayesian phylogenies inferred from mtDNA and nuclear genes reveals that heterospecific mtDNA is present in approximately 12.5% of all darter species. We identify three patterns of mtDNA introgression in darters: proximal mtDNA transfer, which involves the transfer of mtDNA among extant and sympatric darter species, indeterminate introgression, which involves the transfer of mtDNA from a lineage that cannot be confidently identified because the introgressed haplotypes are not clearly referable to mtDNA haplotypes in any recognized species, and deep introgression, which is characterized by species diversification within a recipient clade subsequent to the transfer of heterospecific mtDNA. The results of our analyses indicate that DNA sequences sampled from single-copy nuclear genes can provide appreciable phylogenetic resolution for closely related animal species. A well-resolved near-complete species-sampled phylogeny of darters was estimated with Bayesian methods using a concatenated mtDNA and nuclear gene data set with all identified heterospecific mtDNA haplotypes treated as missing data. The relaxed-clock analyses resulted in very similar posterior age estimates across the three sampled genes and methods of calibration and therefore offer a viable strategy for estimating divergence times for clades that lack a fossil record. In addition, an informative rank-free clade-based classification of darters that preserves the rich history of nomenclature in the group and provides formal taxonomic communication of darter clades was constructed using the mtDNA and nuclear gene phylogeny. On the whole, the appeal of mtDNA for phylogeny inference among closely related animal species is diminished by the observations of extensive mtDNA introgression and by finding appreciable phylogenetic signal in a modest sampling of nuclear genes in our phylogenetic analyses of darters.

  10. Human cognitive ability is influenced by genetic variation in components of postsynaptic signalling complexes assembled by NMDA receptors and MAGUK proteins

    PubMed Central

    Hill, W D; Davies, G; van de Lagemaat, L N; Christoforou, A; Marioni, R E; Fernandes, C P D; Liewald, D C; Croning, M D R; Payton, A; Craig, L C A; Whalley, L J; Horan, M; Ollier, W; Hansell, N K; Wright, M J; Martin, N G; Montgomery, G W; Steen, V M; Le Hellard, S; Espeseth, T; Lundervold, A J; Reinvang, I; Starr, J M; Pendleton, N; Grant, S G N; Bates, T C; Deary, I J

    2014-01-01

    Differences in general cognitive ability (intelligence) account for approximately half of the variation in any large battery of cognitive tests and are predictive of important life events including health. Genome-wide analyses of common single-nucleotide polymorphisms indicate that they jointly tag between a quarter and a half of the variance in intelligence. However, no single polymorphism has been reliably associated with variation in intelligence. It remains possible that these many small effects might be aggregated in networks of functionally linked genes. Here, we tested a network of 1461 genes in the postsynaptic density and associated complexes for an enriched association with intelligence. These were ascertained in 3511 individuals (the Cognitive Ageing Genetics in England and Scotland (CAGES) consortium) phenotyped for general cognitive ability, fluid cognitive ability, crystallised cognitive ability, memory and speed of processing. By analysing the results of a genome wide association study (GWAS) using Gene Set Enrichment Analysis, a significant enrichment was found for fluid cognitive ability for the proteins found in the complexes of N-methyl-D-aspartate receptor complex; P=0.002. Replication was sought in two additional cohorts (N=670 and 2062). A meta-analytic P-value of 0.003 was found when these were combined with the CAGES consortium. The results suggest that genetic variation in the macromolecular machines formed by membrane-associated guanylate kinase (MAGUK) scaffold proteins and their interaction partners contributes to variation in intelligence. PMID:24399044

  11. Analysis of resistance genes of clinical Pannonibacter phragmitetus strain 31801 by complete genome sequencing.

    PubMed

    Ming, De-Song; Chen, Qing-Qing; Chen, Xiao-Tin

    2018-05-14

    To clarify the resistance mechanisms of Pannonibacter phragmitetus 31801, isolated from the blood of a liver abscess patient, at the genomic level, we performed whole genomic sequencing using a PacBio RS II single-molecule real-time long-read sequencer. Bioinformatic analysis of the resulting sequence was then carried out to identify any possible resistance genes. Analyses included Basic Local Alignment Search Tool searches against the Antibiotic Resistance Genes Database, ResFinder analysis of the genome sequence, and Resistance Gene Identifier analysis within the Comprehensive Antibiotic Resistance Database. Prophages, clustered regularly interspaced short palindromic repeats (CRISPR), and other putative virulence factors were also identified using PHAST, CRISPRfinder, and the Virulence Factors Database, respectively. The circular chromosome and single plasmid of P. phragmitetus 31801 contained multiple antibiotic resistance genes, including those coding for three different types of β-lactamase [NPS β-lactamase (EC 3.5.2.6), β-lactamase class C, and a metal-dependent hydrolase of β-lactamase superfamily I]. In addition, genes coding for subunits of several multidrug-resistance efflux pumps were identified, including those targeting macrolides (adeJ, cmeB), tetracycline (acrB, adeAB), fluoroquinolones (acrF, ceoB), and aminoglycosides (acrD, amrB, ceoB, mexY, smeB). However, apart from the tripartite macrolide efflux pump macAB-tolC, the genome did not appear to contain the complete complement of subunit genes required for production of most of the major multidrug-resistance efflux pumps.

  12. Phylogenomics Controlling for Base Compositional Bias Reveals a Single Origin of Eusociality in Corbiculate Bees.

    PubMed

    Romiguier, Jonathan; Cameron, Sydney A; Woodard, S Hollis; Fischman, Brielle J; Keller, Laurent; Praz, Christophe J

    2016-03-01

    As increasingly large molecular data sets are collected for phylogenomics, the conflicting phylogenetic signal among gene trees poses challenges to resolve some difficult nodes of the Tree of Life. Among these nodes, the phylogenetic position of the honey bees (Apini) within the corbiculate bee group remains controversial, despite its considerable importance for understanding the emergence and maintenance of eusociality. Here, we show that this controversy stems in part from pervasive phylogenetic conflicts among GC-rich gene trees. GC-rich genes typically have a high nucleotidic heterogeneity among species, which can induce topological conflicts among gene trees. When retaining only the most GC-homogeneous genes or using a nonhomogeneous model of sequence evolution, our analyses reveal a monophyletic group of the three lineages with a eusocial lifestyle (honey bees, bumble bees, and stingless bees). These phylogenetic relationships strongly suggest a single origin of eusociality in the corbiculate bees, with no reversal to solitary living in this group. To accurately reconstruct other important evolutionary steps across the Tree of Life, we suggest removing GC-rich and GC-heterogeneous genes from large phylogenomic data sets. Interpreted as a consequence of genome-wide variations in recombination rates, this GC effect can affect all taxa featuring GC-biased gene conversion, which is common in eukaryotes. © The Author 2015. Published by Oxford University Press on behalf of the Society for Molecular Biology and Evolution. All rights reserved. For permissions, please e-mail: journals.permissions@oup.com.

  13. Molecular and genetic ecotoxicologic approaches to aquatic environmental bioreporting.

    PubMed Central

    Beaty, B J; Black, W C; Carlson, J O; Clements, W H; DuTeau, N; Harrahy, E; Nuckols, J; Kenneth, E; Olson, K E; Rayms-Keller, A

    1998-01-01

    Molecular and population genetic ecotoxicologic approaches are being developed for the utilization of arthropods as bioreporters of heavy metal mixtures in the environment. The explosion of knowledge in molecular biology, molecular genetics, and biotechnology provides an unparalleled opportunity to use arthropods as bioreporter organisms. Interspecific differences in aquatic arthropod populations have been previously demonstrated in response to heavy metal insult in the Arkansas River (AR) California Gulch Superfund site (CGSS). Population genetic analyses were conducted on the mayfly Baetis tricaudatus. Genetic polymorphisms were detected in polymerase chain reaction amplified 16S mitochondrial rDNA (a selectively neutral gene) of B tricaudatus using single-strand conformation polymorphism analysis. Genetic differences may have resulted from impediments to gene flow in the population caused by mortality arising from exposure to heavy metal mixture pollution. In laboratory studies a candidate metal-responsive mucinlike gene, which is metal and dose specific, has been identified in Chironomus tentans and other potential AR-CGSS bioreporter species. Population genetic analyses using the mucinlike gene may provide insight into the role of this selectable gene in determining the breeding structure of B. tricaudatus in the AR-CGSS and may provide mechanistic insight into determinants of aquatic arthropod response to heavy metal insult. Metal-responsive (MR) genes and regulatory sequences are being isolated, characterized, and assayed for differential gene expression in response to heavy metal mixture pollution in the AR-CGSS. Identified promoter sequences can then be engineered into previously developed MR constructs to provide sensitive in vitro assays for environmental bioreporting of heavy metal mixtures. The results of the population genetic studies are being entered into an AR geographic information system that contains substantial biological, chemical, and geophysical information. Integrated spatial, structural, and temporal analyses of these parameters will provide invaluable information concerning environmental determinants that restrict or promote gene flow in bioreporter populations. Images Figure 3 Figure 4 Figure 5 Figure 6 Figure 7 PMID:9860898

  14. EUPAN enables pan-genome studies of a large number of eukaryotic genomes.

    PubMed

    Hu, Zhiqiang; Sun, Chen; Lu, Kuang-Chen; Chu, Xixia; Zhao, Yue; Lu, Jinyuan; Shi, Jianxin; Wei, Chaochun

    2017-08-01

    Pan-genome analyses are routinely carried out for bacteria to interpret the within-species gene presence/absence variations (PAVs). However, pan-genome analyses are rare for eukaryotes due to the large sizes and higher complexities of their genomes. Here we proposed EUPAN, a eukaryotic pan-genome analysis toolkit, enabling automatic large-scale eukaryotic pan-genome analyses and detection of gene PAVs at a relatively low sequencing depth. In the previous studies, we demonstrated the effectiveness and high accuracy of EUPAN in the pan-genome analysis of 453 rice genomes, in which we also revealed widespread gene PAVs among individual rice genomes. Moreover, EUPAN can be directly applied to the current re-sequencing projects primarily focusing on single nucleotide polymorphisms. EUPAN is implemented in Perl, R and C ++. It is supported under Linux and preferred for a computer cluster with LSF and SLURM job scheduling system. EUPAN together with its standard operating procedure (SOP) is freely available for non-commercial use (CC BY-NC 4.0) at http://cgm.sjtu.edu.cn/eupan/index.html . ccwei@sjtu.edu.cn or jianxin.shi@sjtu.edu.cn. Supplementary data are available at Bioinformatics online. © The Author (2017). Published by Oxford University Press. All rights reserved. For Permissions, please email: journals.permissions@oup.com

  15. HomozygosityMapper2012--bridging the gap between homozygosity mapping and deep sequencing.

    PubMed

    Seelow, Dominik; Schuelke, Markus

    2012-07-01

    Homozygosity mapping is a common method to map recessive traits in consanguineous families. To facilitate these analyses, we have developed HomozygosityMapper, a web-based approach to homozygosity mapping. HomozygosityMapper allows researchers to directly upload the genotype files produced by the major genotyping platforms as well as deep sequencing data. It detects stretches of homozygosity shared by the affected individuals and displays them graphically. Users can interactively inspect the underlying genotypes, manually refine these regions and eventually submit them to our candidate gene search engine GeneDistiller to identify the most promising candidate genes. Here, we present the new version of HomozygosityMapper. The most striking new feature is the support of Next Generation Sequencing *.vcf files as input. Upon users' requests, we have implemented the analysis of common experimental rodents as well as of important farm animals. Furthermore, we have extended the options for single families and loss of heterozygosity studies. Another new feature is the export of *.bed files for targeted enrichment of the potential disease regions for deep sequencing strategies. HomozygosityMapper also generates files for conventional linkage analyses which are already restricted to the possible disease regions, hence superseding CPU-intensive genome-wide analyses. HomozygosityMapper is freely available at http://www.homozygositymapper.org/.

  16. Analysis of alkaptonuria (AKU) mutations and polymorphisms reveals that the CCC sequence motif is a mutational hot spot in the homogentisate 1,2 dioxygenase gene (HGO).

    PubMed Central

    Beltrán-Valero de Bernabé, D; Jimenez, F J; Aquaron, R; Rodríguez de Córdoba, S

    1999-01-01

    We recently showed that alkaptonuria (AKU) is caused by loss-of-function mutations in the homogentisate 1,2 dioxygenase gene (HGO). Herein we describe haplotype and mutational analyses of HGO in seven new AKU pedigrees. These analyses identified two novel single-nucleotide polymorphisms (INV4+31A-->G and INV11+18A-->G) and six novel AKU mutations (INV1-1G-->A, W60G, Y62C, A122D, P230T, and D291E), which further illustrates the remarkable allelic heterogeneity found in AKU. Reexamination of all 29 mutations and polymorphisms thus far described in HGO shows that these nucleotide changes are not randomly distributed; the CCC sequence motif and its inverted complement, GGG, are preferentially mutated. These analyses also demonstrated that the nucleotide substitutions in HGO do not involve CpG dinucleotides, which illustrates important differences between HGO and other genes for the occurrence of mutation at specific short-sequence motifs. Because the CCC sequence motifs comprise a significant proportion (34.5%) of all mutated bases that have been observed in HGO, we conclude that the CCC triplet is a mutational hot spot in HGO. PMID:10205262

  17. Single nucleotide polymorphisms in the growth hormone - insulin like growth factor axis in straight bred and crossbred Angus, Brahman, and Romosinuano heifers: population genetic analyses and association of genotypes

    USDA-ARS?s Scientific Manuscript database

    The growth endocrine axis influences reproduction. Objectives of this study were to evaluate population genetic characteristics of SNP genotypes within genes of the GH and IGF axis in straightbred and diallel-crossed Angus, Brahman and Romosinuano heifers (n = 650) and to test the associations of th...

  18. Comparative analyses of plastid genomes from fourteen Cornales species: inferences for phylogenetic relationships and genome evolution.

    PubMed

    Fu, Chao-Nan; Li, Hong-Tao; Milne, Richard; Zhang, Ting; Ma, Peng-Fei; Yang, Jing; Li, De-Zhu; Gao, Lian-Ming

    2017-12-08

    The Cornales is the basal lineage of the asterids, the largest angiosperm clade. Phylogenetic relationships within the order were previously not fully resolved. Fifteen plastid genomes representing 14 species, ten genera and seven families of Cornales were newly sequenced for comparative analyses of genome features, evolution, and phylogenomics based on different partitioning schemes and filtering strategies. All plastomes of the 14 Cornales species had the typical quadripartite structure with a genome size ranging from 156,567 bp to 158,715 bp, which included two inverted repeats (25,859-26,451 bp) separated by a large single-copy region (86,089-87,835 bp) and a small single-copy region (18,250-18,856 bp) region. These plastomes encoded the same set of 114 unique genes including 31 transfer RNA, 4 ribosomal RNA and 79 coding genes, with an identical gene order across all examined Cornales species. Two genes (rpl22 and ycf15) contained premature stop codons in seven and five species respectively. The phylogenetic relationships among all sampled species were fully resolved with maximum support. Different filtering strategies (none, light and strict) of sequence alignment did not have an effect on these relationships. The topology recovered from coding and noncoding data sets was the same as for the whole plastome, regardless of filtering strategy. Moreover, mutational hotspots and highly informative regions were identified. Phylogenetic relationships among families and intergeneric relationships within family of Cornales were well resolved. Different filtering strategies and partitioning schemes do not influence the relationships. Plastid genomes have great potential to resolve deep phylogenetic relationships of plants.

  19. Three gangliogliomas: results of GTG-banding, SKY, genome-wide high resolution SNP-array, gene expression and review of the literature.

    PubMed

    Xu, Li-Xin; Holland, Heidrun; Kirsten, Holger; Ahnert, Peter; Krupp, Wolfgang; Bauer, Manfred; Schober, Ralf; Mueller, Wolf; Fritzsch, Dominik; Meixensberger, Jürgen; Koschny, Ronald

    2015-04-01

    According to the World Health Organization gangliogliomas are classified as well-differentiated and slowly growing neuroepithelial tumors, composed of neoplastic mature ganglion and glial cells. It is the most frequent tumor entity observed in patients with long-term epilepsy. Comprehensive cytogenetic and molecular cytogenetic data including high-resolution genomic profiling (single nucleotide polymorphism (SNP)-array) of gangliogliomas are scarce but necessary for a better oncological understanding of this tumor entity. For a detailed characterization at the single cell and cell population levels, we analyzed genomic alterations of three gangliogliomas using trypsin-Giemsa banding (GTG-banding) and by spectral karyotyping (SKY) in combination with SNP-array and gene expression array experiments. By GTG and SKY, we could confirm frequently detected chromosomal aberrations (losses within chromosomes 10, 13 and 22; gains within chromosomes 5, 7, 8 and 12), and identify so far unknown genetic aberrations like the unbalanced non-reciprocal translocation t(1;18)(q21;q21). Interestingly, we report on the second so far detected ganglioglioma with ring chromosome 1. Analyses of SNP-array data from two of the tumors and respective germline DNA (peripheral blood) identified few small gains and losses and a number of copy-neutral regions with loss of heterozygosity (LOH) in germline and in tumor tissue. In comparison to germline DNA, tumor tissues did not show substantial regions with significant loss or gain or with newly developed LOH. Gene expression analyses of tumor-specific genes revealed similarities in the profile of the analyzed samples regarding different relevant pathways. Taken together, we describe overlapping but also distinct and novel genetic aberrations of three gangliogliomas. © 2014 Japanese Society of Neuropathology.

  20. Optimizing Hybrid de Novo Transcriptome Assembly and Extending Genomic Resources for Giant Freshwater Prawns (Macrobrachium rosenbergii): The Identification of Genes and Markers Associated with Reproduction.

    PubMed

    Jung, Hyungtaek; Yoon, Byung-Ha; Kim, Woo-Jin; Kim, Dong-Wook; Hurwood, David A; Lyons, Russell E; Salin, Krishna R; Kim, Heui-Soo; Baek, Ilseon; Chand, Vincent; Mather, Peter B

    2016-05-07

    The giant freshwater prawn, Macrobrachium rosenbergii, a sexually dimorphic decapod crustacean is currently the world's most economically important cultured freshwater crustacean species. Despite its economic importance, there is currently a lack of genomic resources available for this species, and this has limited exploration of the molecular mechanisms that control the M. rosenbergii sex-differentiation system more widely in freshwater prawns. Here, we present the first hybrid transcriptome from M. rosenbergii applying RNA-Seq technologies directed at identifying genes that have potential functional roles in reproductive-related traits. A total of 13,733,210 combined raw reads (1720 Mbp) were obtained from Ion-Torrent PGM and 454 FLX. Bioinformatic analyses based on three state-of-the-art assemblers, the CLC Genomic Workbench, Trans-ABySS, and Trinity, that use single and multiple k-mer methods respectively, were used to analyse the data. The influence of multiple k-mers on assembly performance was assessed to gain insight into transcriptome assembly from short reads. After optimisation, de novo assembly resulted in 44,407 contigs with a mean length of 437 bp, and the assembled transcripts were further functionally annotated to detect single nucleotide polymorphisms and simple sequence repeat motifs. Gene expression analysis was also used to compare expression patterns from ovary and testis tissue libraries to identify genes with potential roles in reproduction and sex differentiation. The large transcript set assembled here represents the most comprehensive set of transcriptomic resources ever developed for reproduction traits in M. rosenbergii, and the large number of genetic markers predicted should constitute an invaluable resource for future genetic research studies on M. rosenbergii and can be applied more widely on other freshwater prawn species in the genus Macrobrachium.

  1. Optimizing Hybrid de Novo Transcriptome Assembly and Extending Genomic Resources for Giant Freshwater Prawns (Macrobrachium rosenbergii): The Identification of Genes and Markers Associated with Reproduction

    PubMed Central

    Jung, Hyungtaek; Yoon, Byung-Ha; Kim, Woo-Jin; Kim, Dong-Wook; Hurwood, David A.; Lyons, Russell E.; Salin, Krishna R.; Kim, Heui-Soo; Baek, Ilseon; Chand, Vincent; Mather, Peter B.

    2016-01-01

    The giant freshwater prawn, Macrobrachium rosenbergii, a sexually dimorphic decapod crustacean is currently the world’s most economically important cultured freshwater crustacean species. Despite its economic importance, there is currently a lack of genomic resources available for this species, and this has limited exploration of the molecular mechanisms that control the M. rosenbergii sex-differentiation system more widely in freshwater prawns. Here, we present the first hybrid transcriptome from M. rosenbergii applying RNA-Seq technologies directed at identifying genes that have potential functional roles in reproductive-related traits. A total of 13,733,210 combined raw reads (1720 Mbp) were obtained from Ion-Torrent PGM and 454 FLX. Bioinformatic analyses based on three state-of-the-art assemblers, the CLC Genomic Workbench, Trans-ABySS, and Trinity, that use single and multiple k-mer methods respectively, were used to analyse the data. The influence of multiple k-mers on assembly performance was assessed to gain insight into transcriptome assembly from short reads. After optimisation, de novo assembly resulted in 44,407 contigs with a mean length of 437 bp, and the assembled transcripts were further functionally annotated to detect single nucleotide polymorphisms and simple sequence repeat motifs. Gene expression analysis was also used to compare expression patterns from ovary and testis tissue libraries to identify genes with potential roles in reproduction and sex differentiation. The large transcript set assembled here represents the most comprehensive set of transcriptomic resources ever developed for reproduction traits in M. rosenbergii, and the large number of genetic markers predicted should constitute an invaluable resource for future genetic research studies on M. rosenbergii and can be applied more widely on other freshwater prawn species in the genus Macrobrachium. PMID:27164098

  2. An efficient transgenic system by TA cloning vectors and RNAi for C. elegans

    DOE Office of Scientific and Technical Information (OSTI.GOV)

    Gengyo-Ando, Keiko; CREST, JST, 4-1-8 Hon-cho, Kawaguchi, Saitama 332-0012; Yoshina, Sawako

    2006-11-03

    In the nematode, transgenic analyses have been performed by microinjection of DNA from various sources into the syncytium gonad. To expedite these transgenic analyses, we solved two potential problems in this work. First, we constructed an efficient TA-cloning vector system which is useful for any promoter. By amplifying the genomic DNA fragments which contain regulatory sequences with or without the coding region, we could easily construct plasmids expressing fluorescent protein fusion without considering restriction sites. We could dissect motor neurons with three colors in a single animal. Second, we used feeding RNAi to isolate transgenic strains which express lag-2::venus fusionmore » gene. We found that the fusion protein is toxic when ectopically expressed in embryos but is functional to rescue a loss of function mutant in the lag-2 gene. Thus, the transgenic system described here should be useful to examine the protein function in the nematode.« less

  3. Pathway-based analyses.

    PubMed

    Kent, Jack W

    2016-02-03

    New technologies for acquisition of genomic data, while offering unprecedented opportunities for genetic discovery, also impose severe burdens of interpretation and penalties for multiple testing. The Pathway-based Analyses Group of the Genetic Analysis Workshop 19 (GAW19) sought reduction of multiple-testing burden through various approaches to aggregation of highdimensional data in pathways informed by prior biological knowledge. Experimental methods testedincluded the use of "synthetic pathways" (random sets of genes) to estimate power and false-positive error rate of methods applied to simulated data; data reduction via independent components analysis, single-nucleotide polymorphism (SNP)-SNP interaction, and use of gene sets to estimate genetic similarity; and general assessment of the efficacy of prior biological knowledge to reduce the dimensionality of complex genomic data. The work of this group explored several promising approaches to managing high-dimensional data, with the caveat that these methods are necessarily constrained by the quality of external bioinformatic annotation.

  4. Simultaneous genomic identification and profiling of a single cell using semiconductor-based next generation sequencing.

    PubMed

    Watanabe, Manabu; Kusano, Junko; Ohtaki, Shinsaku; Ishikura, Takashi; Katayama, Jin; Koguchi, Akira; Paumen, Michael; Hayashi, Yoshiharu

    2014-09-01

    Combining single-cell methods and next-generation sequencing should provide a powerful means to understand single-cell biology and obviate the effects of sample heterogeneity. Here we report a single-cell identification method and seamless cancer gene profiling using semiconductor-based massively parallel sequencing. A549 cells (adenocarcinomic human alveolar basal epithelial cell line) were used as a model. Single-cell capture was performed using laser capture microdissection (LCM) with an Arcturus® XT system, and a captured single cell and a bulk population of A549 cells (≈ 10(6) cells) were subjected to whole genome amplification (WGA). For cell identification, a multiplex PCR method (AmpliSeq™ SNP HID panel) was used to enrich 136 highly discriminatory SNPs with a genotype concordance probability of 10(31-35). For cancer gene profiling, we used mutation profiling that was performed in parallel using a hotspot panel for 50 cancer-related genes. Sequencing was performed using a semiconductor-based bench top sequencer. The distribution of sequence reads for both HID and Cancer panel amplicons was consistent across these samples. For the bulk population of cells, the percentages of sequence covered at coverage of more than 100 × were 99.04% for the HID panel and 98.83% for the Cancer panel, while for the single cell percentages of sequence covered at coverage of more than 100 × were 55.93% for the HID panel and 65.96% for the Cancer panel. Partial amplification failure or randomly distributed non-amplified regions across samples from single cells during the WGA procedures or random allele drop out probably caused these differences. However, comparative analyses showed that this method successfully discriminated a single A549 cancer cell from a bulk population of A549 cells. Thus, our approach provides a powerful means to overcome tumor sample heterogeneity when searching for somatic mutations.

  5. Genomic investigation of porcine periweaning failure to thrive syndrome (PFTS).

    PubMed

    Bertolini, Francesca; Yang, Tianfu; Huang, Yanyun; Harding, John C S; Plastow, Graham S; Rothschild, Max F

    2018-04-25

    Porcine periweaning failure to thrive syndrome (PFTS) can be defined by anorexia, lethargy, progressive debilitation and compulsive behaviours that occur in seemingly healthy pigs within two to threeweeks of weaning in the absence of any known infectious, nutritional, management or environmental factors. A genetic component has been hypothesised for this syndrome. In the present study, 119 commercial pigs (80 cases and 39 controls) were genotyped with the porcine 80K single nucleotide polymorphism-chip and were analysed with logistic regression and two Fixation Index-based approaches. The analyses revealed several regions on chromosomes 1, 3, 6 and 11 with moderate divergence between cases and controls, particularly three haplotypes on SSC3 and 11. The gene-based analyses of the candidate regions revealed the presence of genes that have been reported to be associated with phenotypes like PFST including depression ( PDE10A ) and intestinal villous atrophy ( CUL4A ). It is important to increase the effort of collecting more samples to improve the power of these analyses. © British Veterinary Association (unless otherwise stated in the text of the article) 2018. All rights reserved. No commercial use is permitted unless otherwise expressly granted.

  6. The clpB gene of Bifidobacterium breve UCC 2003: transcriptional analysis and first insights into stress induction.

    PubMed

    Ventura, Marco; Kenny, John G; Zhang, Ziding; Fitzgerald, Gerald F; van Sinderen, Douwe

    2005-09-01

    The so-called clp genes, which encode components of the Clp proteolytic complex, are widespread among bacteria. The Bifidobacterium breve UCC 2003 genome contains a clpB gene with significant homology to predicted clpB genes from other members of the Actinobacteridae group. The heat- and osmotic-inducibility of the B. breve UCC 2003 clpB homologue was verified by slot-blot analysis, while Northern blot and primer extension analyses showed that the clpB gene is transcribed as a monocistronic unit with a single promoter. The role of a hspR homologue, known to control the regulation of clpB and dnaK gene expression in other high G+C content bacteria was investigated by gel mobility shift assays. Moreover the predicted 3D structure of HspR provides further insight into the binding mode of this protein to the clpB promoter region, and highlights the key amino acid residues believed to be involved in the protein-DNA interaction.

  7. Characterization of a novel variant of Mycobacterium chimaera.

    PubMed

    van Ingen, J; Hoefsloot, W; Buijtels, P C A M; Tortoli, E; Supply, P; Dekhuijzen, P N R; Boeree, M J; van Soolingen, D

    2012-09-01

    In this study, nonchromogenic mycobacteria were isolated from pulmonary samples of three patients in the Netherlands. All isolates had identical, unique 16S rRNA gene and 16S-23S ITS sequences, which were closely related to those of Mycobacterium chimaera and Mycobacterium marseillense. The biochemical features of the isolates differed slightly from those of M. chimaera, suggesting that the isolates may represent a possible separate species within the Mycobacterium avium complex (MAC). However, the cell-wall mycolic acid pattern, analysed by HPLC, and the partial sequences of the hsp65 and rpoB genes were identical to those of M. chimaera. We concluded that the isolates represent a novel variant of M. chimaera. The results of this analysis have led us to question the currently used methods of species definition for members of the genus Mycobacterium, which are based largely on 16S rRNA or rpoB gene sequencing. Definitions based on a single genetic target are likely to be insufficient. Genetic divergence, especially in the MAC, yields strains that cannot be confidently assigned to a specific species based on the analysis of a single genetic target.

  8. n-CoDeR concept: unique types of antibodies for diagnostic use and therapy.

    PubMed

    Carlsson, R; Söderlind, E

    2001-05-01

    The n-CoDeR recombinant antibody gene libraries are built on a single master framework, into which diverse in vivo-formed complementarity determining regions (CDRs) are allowed to recombine. These CDRs are sampled from in vivo-processed and proof-read gene sequences, thus ensuring an optimal level of correctly folded and functional molecules. By the modularized assembly process, up to six CDRs can be varied at the same time, providing a possibility for the creation of a hitherto undescribed genetic and functional variation. The n-CoDeR antibody gene libraries can be used to select highly specific, human antibody fragments with specificities to virtually any antigen, including carbohydrates and human self-proteins and with affinities down into the subnanomolar range. Furthermore, combining CDRs sampled from in vivo-processed sequences into a single framework result in molecules exhibiting a lower immunogenicity compared to normal human immunoglobulins, as determined by computer analyses. The distinguished features of the n-CoDeR libraries in the therapeutic and diagnostic areas are discussed.

  9. Expression and phylogenetic analyses reveal paralogous lineages of putatively classical and non-classical MHC-I genes in three sparrow species (Passer).

    PubMed

    Drews, Anna; Strandh, Maria; Råberg, Lars; Westerdahl, Helena

    2017-06-26

    The Major Histocompatibility Complex (MHC) plays a central role in immunity and has been given considerable attention by evolutionary ecologists due to its associations with fitness-related traits. Songbirds have unusually high numbers of MHC class I (MHC-I) genes, but it is not known whether all are expressed and equally important for immune function. Classical MHC-I genes are highly expressed, polymorphic and present peptides to T-cells whereas non-classical MHC-I genes have lower expression, are more monomorphic and do not present peptides to T-cells. To get a better understanding of the highly duplicated MHC genes in songbirds, we studied gene expression in a phylogenetic framework in three species of sparrows (house sparrow, tree sparrow and Spanish sparrow), using high-throughput sequencing. We hypothesize that sparrows could have classical and non-classical genes, as previously indicated though never tested using gene expression. The phylogenetic analyses reveal two distinct types of MHC-I alleles among the three sparrow species, one with high and one with low level of polymorphism, thus resembling classical and non-classical genes, respectively. All individuals had both types of alleles, but there was copy number variation both within and among the sparrow species. However, the number of highly polymorphic alleles that were expressed did not vary between species, suggesting that the structural genomic variation is counterbalanced by conserved gene expression. Overall, 50% of the MHC-I alleles were expressed in sparrows. Expression of the highly polymorphic alleles was very variable, whereas the alleles with low polymorphism had uniformly low expression. Interestingly, within an individual only one or two alleles from the polymorphic genes were highly expressed, indicating that only a single copy of these is highly expressed. Taken together, the phylogenetic reconstruction and the analyses of expression suggest that sparrows have both classical and non-classical MHC-I genes, and that the evolutionary origin of these genes predate the split of the three investigated sparrow species 7 million years ago. Because only the classical MHC-I genes are involved in antigen presentation, the function of different MHC-I genes should be considered in future ecological and evolutionary studies of MHC-I in sparrows and other songbirds.

  10. Comparative Genomics and Phylogenomics of East Asian Tulips (Amana, Liliaceae)

    PubMed Central

    Li, Pan; Lu, Rui-Sen; Xu, Wu-Qin; Ohi-Toma, Tetsuo; Cai, Min-Qi; Qiu, Ying-Xiong; Cameron, Kenneth M.; Fu, Cheng-Xin

    2017-01-01

    The genus Amana Honda (Liliaceae), when it is treated as separate from Tulipa, comprises six perennial herbaceous species that are restricted to China, Japan and the Korean Peninsula. Although all six Amana species have important medicinal and horticultural uses, studies focused on species identification and molecular phylogenetics are few. Here we report the nucleotide sequences of six complete Amana chloroplast (cp) genomes. The cp genomes of Amana range from 150,613 bp to 151,136 bp in length, all including a pair of inverted repeats (25,629–25,859 bp) separated by the large single-copy (81,482–82,218 bp) and small single-copy (17,366–17,465 bp) regions. Each cp genome equivalently contains 112 unique genes consisting of 30 transfer RNA genes, four ribosomal RNA genes, and 78 protein coding genes. Gene content, gene order, AT content, and IR/SC boundary structure are nearly identical among all Amana cp genomes. However, the relative contraction and expansion of the IR/SC borders among the six Amana cp genomes results in length variation among them. Simple sequence repeat (SSR) analyses of these Amana cp genomes indicate that the richest SSRs are A/T mononucleotides. The number of repeats among the six Amana species varies from 54 (A. anhuiensis) to 69 (Amana kuocangshanica) with palindromic (28–35) and forward repeats (23–30) as the most common types. Phylogenomic analyses based on these complete cp genomes and 74 common protein-coding genes strongly support the monophyly of the genus, and a sister relationship between Amana and Erythronium, rather than a shared common ancestor with Tulipa. Nine DNA markers (rps15–ycf1, accD–psaI, petA–psbJ, rpl32–trnL, atpH–atpI, petD–rpoA, trnS–trnG, psbM–trnD, and ycf4–cemA) with number of variable sites greater than 0.9% were identified, and these may be useful for future population genetic and phylogeographic studies of Amana species. PMID:28421090

  11. Gene duplication, population genomics, and species-level differentiation within a tropical mountain shrub.

    PubMed

    Mastretta-Yanes, Alicia; Zamudio, Sergio; Jorgensen, Tove H; Arrigo, Nils; Alvarez, Nadir; Piñero, Daniel; Emerson, Brent C

    2014-09-14

    Gene duplication leads to paralogy, which complicates the de novo assembly of genotyping-by-sequencing (GBS) data. The issue of paralogous genes is exacerbated in plants, because they are particularly prone to gene duplication events. Paralogs are normally filtered from GBS data before undertaking population genomics or phylogenetic analyses. However, gene duplication plays an important role in the functional diversification of genes and it can also lead to the formation of postzygotic barriers. Using populations and closely related species of a tropical mountain shrub, we examine 1) the genomic differentiation produced by putative orthologs, and 2) the distribution of recent gene duplication among lineages and geography. We find high differentiation among populations from isolated mountain peaks and species-level differentiation within what is morphologically described as a single species. The inferred distribution of paralogs among populations is congruent with taxonomy and shows that GBS could be used to examine recent gene duplication as a source of genomic differentiation of nonmodel species. © The Author(s) 2014. Published by Oxford University Press on behalf of the Society for Molecular Biology and Evolution.

  12. Single prokaryotic cell isolation and total transcript amplification protocol for transcriptomic analysis.

    PubMed

    Kang, Yun; McMillan, Ian; Norris, Michael H; Hoang, Tung T

    2015-07-01

    Until recently, transcriptome analyses of single cells have been confined to eukaryotes. The information obtained from single-cell transcripts can provide detailed insight into spatiotemporal gene expression, and it could be even more valuable if expanded to prokaryotic cells. Transcriptome analysis of single prokaryotic cells is a recently developed and powerful tool. Here we describe a procedure that allows amplification of the total transcript of a single prokaryotic cell for in-depth analysis. This is performed by using a laser-capture microdissection instrument for single-cell isolation, followed by reverse transcription via Moloney murine leukemia virus, degradation of chromosomal DNA with McrBC and DpnI restriction enzymes, single-stranded cDNA (ss-cDNA) ligation using T4 polynucleotide kinase and CircLigase, and polymerization of ss-cDNA to double-stranded cDNA (ds-cDNA) by Φ29 polymerase. This procedure takes ∼5 d, and sufficient amounts of ds-cDNA can be obtained from single-cell RNA template for further microarray analysis.

  13. Differentially co-expressed interacting protein pairs discriminate samples under distinct stages of HIV type 1 infection.

    PubMed

    Yoon, Dukyong; Kim, Hyosil; Suh-Kim, Haeyoung; Park, Rae Woong; Lee, KiYoung

    2011-01-01

    Microarray analyses based on differentially expressed genes (DEGs) have been widely used to distinguish samples across different cellular conditions. However, studies based on DEGs have not been able to clearly determine significant differences between samples of pathophysiologically similar HIV-1 stages, e.g., between acute and chronic progressive (or AIDS) or between uninfected and clinically latent stages. We here suggest a novel approach to allow such discrimination based on stage-specific genetic features of HIV-1 infection. Our approach is based on co-expression changes of genes known to interact. The method can identify a genetic signature for a single sample as contrasted with existing protein-protein-based analyses with correlational designs. Our approach distinguishes each sample using differentially co-expressed interacting protein pairs (DEPs) based on co-expression scores of individual interacting pairs within a sample. The co-expression score has positive value if two genes in a sample are simultaneously up-regulated or down-regulated. And the score has higher absolute value if expression-changing ratios are similar between the two genes. We compared characteristics of DEPs with that of DEGs by evaluating their usefulness in separation of HIV-1 stage. And we identified DEP-based network-modules and their gene-ontology enrichment to find out the HIV-1 stage-specific gene signature. Based on the DEP approach, we observed clear separation among samples from distinct HIV-1 stages using clustering and principal component analyses. Moreover, the discrimination power of DEPs on the samples (70-100% accuracy) was much higher than that of DEGs (35-45%) using several well-known classifiers. DEP-based network analysis also revealed the HIV-1 stage-specific network modules; the main biological processes were related to "translation," "RNA splicing," "mRNA, RNA, and nucleic acid transport," and "DNA metabolism." Through the HIV-1 stage-related modules, changing stage-specific patterns of protein interactions could be observed. DEP-based method discriminated the HIV-1 infection stages clearly, and revealed a HIV-1 stage-specific gene signature. The proposed DEP-based method might complement existing DEG-based approaches in various microarray expression analyses.

  14. Scanning the genome for gene single nucleotide polymorphisms involved in adaptive population differentiation in white spruce

    PubMed Central

    Namroud, Marie-Claire; Beaulieu, Jean; Juge, Nicolas; Laroche, Jérôme; Bousquet, Jean

    2008-01-01

    Conifers are characterized by a large genome size and a rapid decay of linkage disequilibrium, most often within gene limits. Genome scans based on noncoding markers are less likely to detect molecular adaptation linked to genes in these species. In this study, we assessed the effectiveness of a genome-wide single nucleotide polymorphism (SNP) scan focused on expressed genes in detecting local adaptation in a conifer species. Samples were collected from six natural populations of white spruce (Picea glauca) moderately differentiated for several quantitative characters. A total of 534 SNPs representing 345 expressed genes were analysed. Genes potentially under natural selection were identified by estimating the differentiation in SNP frequencies among populations (FST) and identifying outliers, and by estimating local differentiation using a Bayesian approach. Both average expected heterozygosity and population differentiation estimates (HE = 0.270 and FST = 0.006) were comparable to those obtained with other genetic markers. Of all genes, 5.5% were identified as outliers with FST at the 95% confidence level, while 14% were identified as candidates for local adaptation with the Bayesian method. There was some overlap between the two gene sets. More than half of the candidate genes for local adaptation were specific to the warmest population, about 20% to the most arid population, and 15% to the coldest and most humid higher altitude population. These adaptive trends were consistent with the genes’ putative functions and the divergence in quantitative traits noted among the populations. The results suggest that an approach separating the locus and population effects is useful to identify genes potentially under selection. These candidates are worth exploring in more details at the physiological and ecological levels. PMID:18662225

  15. DNA sequence polymorphisms in a panel of eight candidate bovine imprinted genes and their association with performance traits in Irish Holstein-Friesian cattle.

    PubMed

    Magee, David A; Sikora, Klaudia M; Berkowicz, Erik W; Berry, Donagh P; Howard, Dawn J; Mullen, Michael P; Evans, Ross D; Spillane, Charles; MacHugh, David E

    2010-10-13

    Studies in mice and humans have shown that imprinted genes, whereby expression from one of the two parentally inherited alleles is attenuated or completely silenced, have a major effect on mammalian growth, metabolism and physiology. More recently, investigations in livestock species indicate that genes subject to this type of epigenetic regulation contribute to, or are associated with, several performance traits, most notably muscle mass and fat deposition. In the present study, a candidate gene approach was adopted to assess 17 validated single nucleotide polymorphisms (SNPs) and their association with a range of performance traits in 848 progeny-tested Irish Holstein-Friesian artificial insemination sires. These SNPs are located proximal to, or within, the bovine orthologs of eight genes (CALCR, GRB10, PEG3, PHLDA2, RASGRF1, TSPAN32, ZIM2 and ZNF215) that have been shown to be imprinted in cattle or in at least one other mammalian species (i.e. human/mouse/pig/sheep). Heterozygosities for all SNPs analysed ranged from 0.09 to 0.46 and significant deviations from Hardy-Weinberg proportions (P ≤ 0.01) were observed at four loci. Phenotypic associations (P ≤ 0.05) were observed between nine SNPs proximal to, or within, six of the eight analysed genes and a number of performance traits evaluated, including milk protein percentage, somatic cell count, culled cow and progeny carcass weight, angularity, body conditioning score, progeny carcass conformation, body depth, rump angle, rump width, animal stature, calving difficulty, gestation length and calf perinatal mortality. Notably, SNPs within the imprinted paternally expressed gene 3 (PEG3) gene cluster were associated (P ≤ 0.05) with calving, calf performance and fertility traits, while a single SNP in the zinc finger protein 215 gene (ZNF215) was associated with milk protein percentage (P ≤ 0.05), progeny carcass weight (P ≤ 0.05), culled cow carcass weight (P ≤ 0.01), angularity (P ≤ 0.01), body depth (P ≤ 0.01), rump width (P ≤ 0.01) and animal stature (P ≤ 0.01). Of the eight candidate bovine imprinted genes assessed, DNA sequence polymorphisms in six of these genes (CALCR, GRB10, PEG3, RASGRF1, ZIM2 and ZNF215) displayed associations with several of the phenotypes included for analyses. The genotype-phenotype associations detected here are further supported by the biological function of these six genes, each of which plays important roles in mammalian growth, development and physiology. The associations between SNPs within the imprinted PEG3 gene cluster and traits related to calving, calf performance and gestation length suggest that this domain on chromosome 18 may play a role regulating pre-natal growth and development and fertility. SNPs within the bovine ZNF215 gene were associated with bovine growth and body conformation traits and studies in humans have revealed that the human ZNF215 ortholog belongs to the imprinted gene cluster associated with Beckwith-Wiedemann syndrome--a genetic disorder characterised by growth abnormalities. Similarly, the data presented here suggest that the ZNF215 gene may have an important role in regulating bovine growth. Collectively, our results support previous work showing that (candidate) imprinted genes/loci contribute to heritable variation in bovine performance traits and suggest that DNA sequence polymorphisms within these genes/loci represents an important reservoir of genomic markers for future genetic improvement of dairy and beef cattle populations.

  16. DNA sequence polymorphisms in a panel of eight candidate bovine imprinted genes and their association with performance traits in Irish Holstein-Friesian cattle

    PubMed Central

    2010-01-01

    Background Studies in mice and humans have shown that imprinted genes, whereby expression from one of the two parentally inherited alleles is attenuated or completely silenced, have a major effect on mammalian growth, metabolism and physiology. More recently, investigations in livestock species indicate that genes subject to this type of epigenetic regulation contribute to, or are associated with, several performance traits, most notably muscle mass and fat deposition. In the present study, a candidate gene approach was adopted to assess 17 validated single nucleotide polymorphisms (SNPs) and their association with a range of performance traits in 848 progeny-tested Irish Holstein-Friesian artificial insemination sires. These SNPs are located proximal to, or within, the bovine orthologs of eight genes (CALCR, GRB10, PEG3, PHLDA2, RASGRF1, TSPAN32, ZIM2 and ZNF215) that have been shown to be imprinted in cattle or in at least one other mammalian species (i.e. human/mouse/pig/sheep). Results Heterozygosities for all SNPs analysed ranged from 0.09 to 0.46 and significant deviations from Hardy-Weinberg proportions (P ≤ 0.01) were observed at four loci. Phenotypic associations (P ≤ 0.05) were observed between nine SNPs proximal to, or within, six of the eight analysed genes and a number of performance traits evaluated, including milk protein percentage, somatic cell count, culled cow and progeny carcass weight, angularity, body conditioning score, progeny carcass conformation, body depth, rump angle, rump width, animal stature, calving difficulty, gestation length and calf perinatal mortality. Notably, SNPs within the imprinted paternally expressed gene 3 (PEG3) gene cluster were associated (P ≤ 0.05) with calving, calf performance and fertility traits, while a single SNP in the zinc finger protein 215 gene (ZNF215) was associated with milk protein percentage (P ≤ 0.05), progeny carcass weight (P ≤ 0.05), culled cow carcass weight (P ≤ 0.01), angularity (P ≤ 0.01), body depth (P ≤ 0.01), rump width (P ≤ 0.01) and animal stature (P ≤ 0.01). Conclusions Of the eight candidate bovine imprinted genes assessed, DNA sequence polymorphisms in six of these genes (CALCR, GRB10, PEG3, RASGRF1, ZIM2 and ZNF215) displayed associations with several of the phenotypes included for analyses. The genotype-phenotype associations detected here are further supported by the biological function of these six genes, each of which plays important roles in mammalian growth, development and physiology. The associations between SNPs within the imprinted PEG3 gene cluster and traits related to calving, calf performance and gestation length suggest that this domain on chromosome 18 may play a role regulating pre-natal growth and development and fertility. SNPs within the bovine ZNF215 gene were associated with bovine growth and body conformation traits and studies in humans have revealed that the human ZNF215 ortholog belongs to the imprinted gene cluster associated with Beckwith-Wiedemann syndrome--a genetic disorder characterised by growth abnormalities. Similarly, the data presented here suggest that the ZNF215 gene may have an important role in regulating bovine growth. Collectively, our results support previous work showing that (candidate) imprinted genes/loci contribute to heritable variation in bovine performance traits and suggest that DNA sequence polymorphisms within these genes/loci represents an important reservoir of genomic markers for future genetic improvement of dairy and beef cattle populations. PMID:20942903

  17. Performance and Scalability of Discriminative Metrics for Comparative Gene Identification in 12 Drosophila Genomes

    PubMed Central

    Lin, Michael F.; Deoras, Ameya N.; Rasmussen, Matthew D.; Kellis, Manolis

    2008-01-01

    Comparative genomics of multiple related species is a powerful methodology for the discovery of functional genomic elements, and its power should increase with the number of species compared. Here, we use 12 Drosophila genomes to study the power of comparative genomics metrics to distinguish between protein-coding and non-coding regions. First, we study the relative power of different comparative metrics and their relationship to single-species metrics. We find that even relatively simple multi-species metrics robustly outperform advanced single-species metrics, especially for shorter exons (≤240 nt), which are common in animal genomes. Moreover, the two capture largely independent features of protein-coding genes, with different sensitivity/specificity trade-offs, such that their combinations lead to even greater discriminatory power. In addition, we study how discovery power scales with the number and phylogenetic distance of the genomes compared. We find that species at a broad range of distances are comparably effective informants for pairwise comparative gene identification, but that these are surpassed by multi-species comparisons at similar evolutionary divergence. In particular, while pairwise discovery power plateaued at larger distances and never outperformed the most advanced single-species metrics, multi-species comparisons continued to benefit even from the most distant species with no apparent saturation. Last, we find that genes in functional categories typically considered fast-evolving can nonetheless be recovered at very high rates using comparative methods. Our results have implications for comparative genomics analyses in any species, including the human. PMID:18421375

  18. Single cell gene expression profiling in Alzheimer's disease.

    PubMed

    Ginsberg, Stephen D; Che, Shaoli; Counts, Scott E; Mufson, Elliott J

    2006-07-01

    Development and implementation of microarray techniques to quantify expression levels of dozens to hundreds to thousands of transcripts simultaneously within select tissue samples from normal control subjects and neurodegenerative diseased brains has enabled scientists to create molecular fingerprints of vulnerable neuronal populations in Alzheimer's disease (AD) and related disorders. A goal is to sample gene expression from homogeneous cell types within a defined region without potential contamination by expression profiles of adjacent neuronal subpopulations and nonneuronal cells. The precise resolution afforded by single cell and population cell RNA analysis in combination with microarrays and real-time quantitative polymerase chain reaction (qPCR)-based analyses allows for relative gene expression level comparisons across cell types under different experimental conditions and disease progression. The ability to analyze single cells is an important distinction from global and regional assessments of mRNA expression and can be applied to optimally prepared tissues from animal models of neurodegeneration as well as postmortem human brain tissues. Gene expression analysis in postmortem AD brain regions including the hippocampal formation and neocortex reveals selectively vulnerable cell types share putative pathogenetic alterations in common classes of transcripts, for example, markers of glutamatergic neurotransmission, synaptic-related markers, protein phosphatases and kinases, and neurotrophins/neurotrophin receptors. Expression profiles of vulnerable regions and neurons may reveal important clues toward the understanding of the molecular pathogenesis of various neurological diseases and aid in identifying rational targets toward pharmacotherapeutic interventions for progressive, late-onset neurodegenerative disorders such as mild cognitive impairment (MCI) and AD.

  19. Mutations in the von Hippel-Lindau (VHL) tumor suppressor gene and VHL-haplotype analysis in patients with presumable congenital erythrocytosis.

    PubMed

    Cario, Holger; Schwarz, Klaus; Jorch, Norbert; Kyank, Ulrike; Petrides, Petro E; Schneider, Dominik T; Uhle, Renate; Debatin, Klaus-Michael; Kohne, Elisabeth

    2005-01-01

    Congenital erythrocytoses or polycythemias are rare and heterogeneous. A homozygous mutation (C598T->Arg200Trp) in the von Hippel-Lindau (VHL) gene was originally identified as the cause of the endemic Chuvash polycythemia. Subsequently this and other mutations in the VHL gene were also detected in several patients of different ethnic origin. Haplotype analyses of the VHL gene suggested a common origin for the Chuvash-type mutation. Thirty-four patients with presumable congenital erythrocytosis due to an unknown underlying disorder were examined for VHL gene mutations and VHL region haplotypes. Four patients were homozygous and one patient heterozygous for the Chuvash-type mutation. One additional patient presented a previously not described heterozygous mutation G311->T VHL in exon 1. The haplotype analyses were in agreement with recently published data for three of the four patients with homozygous mutations as well as for the patient with a heterozygous Chuvash-type mutation. One patient of Turkish origin with homozygous Chuvash-type mutation had a haplotype not previously found in individuals with Chuvash-type mutation. These results confirm that mutations in the VHL gene are responsible for a substantial proportion of patients with congenital erythrocytoses. Erythrocytoses due to a C598->T mutation of the VHL gene are not geographically restricted. The majority of patients with Chuvash polycythemia share a common VHL gene haplotype. The different haplotype in one of the patients with Chuvash-type mutation indicates that this mutation was not spread only from a single founder but developed independently in other individuals.

  20. Evolutionary origins of a novel host plant detoxification gene in butterflies.

    PubMed

    Fischer, Hanna M; Wheat, Christopher W; Heckel, David G; Vogel, Heiko

    2008-05-01

    Chemical interactions between plants and their insect herbivores provide an excellent opportunity to study the evolution of species interactions on a molecular level. Here, we investigate the molecular evolutionary events that gave rise to a novel detoxifying enzyme (nitrile-specifier protein [NSP]) in the butterfly family Pieridae, previously identified as a coevolutionary key innovation. By generating and sequencing expressed sequence tags, genomic libraries, and screening databases we found NSP to be a member of an insect-specific gene family, which we characterized and named the NSP-like gene family. Members consist of variable tandem repeats, are gut expressed, and are found across Insecta evolving in a dynamic, ongoing birth-death process. In the Lepidoptera, multiple copies of single-domain major allergen genes are present and originate via tandem duplications. Multiple domain genes are found solely within the brassicaceous-feeding Pieridae butterflies, one of them being NSP and another called major allergen (MA). Analyses suggest that NSP and its paralog MA have a unique single-domain evolutionary origin, being formed by intragenic domain duplication followed by tandem whole-gene duplication. Duplicates subsequently experienced a period of relaxed constraint followed by an increase in constraint, perhaps after neofunctionalization. NSP and its ortholog MA are still experiencing high rates of change, reflecting a dynamic evolution consistent with the known role of NSP in plant-insect interactions. Our results provide direct evidence to the hypothesis that gene duplication is one of the driving forces for speciation and adaptation, showing that both within- and whole-gene tandem duplications are a powerful force underlying evolutionary adaptation.

  1. The First Complete Chloroplast Genome Sequences in Actinidiaceae: Genome Structure and Comparative Analysis.

    PubMed

    Yao, Xiaohong; Tang, Ping; Li, Zuozhou; Li, Dawei; Liu, Yifei; Huang, Hongwen

    2015-01-01

    Actinidia chinensis is an important economic plant belonging to the basal lineage of the asterids. Availability of a complete Actinidia chloroplast genome sequence is crucial to understanding phylogenetic relationships among major lineages of angiosperms and facilitates kiwifruit genetic improvement. We report here the complete nucleotide sequences of the chloroplast genomes for Actinidia chinensis and A. chinensis var deliciosa obtained through de novo assembly of Illumina paired-end reads produced by total DNA sequencing. The total genome size ranges from 155,446 to 157,557 bp, with an inverted repeat (IR) of 24,013 to 24,391 bp, a large single copy region (LSC) of 87,984 to 88,337 bp and a small single copy region (SSC) of 20,332 to 20,336 bp. The genome encodes 113 different genes, including 79 unique protein-coding genes, 30 tRNA genes and 4 ribosomal RNA genes, with 16 duplicated in the inverted repeats, and a tRNA gene (trnfM-CAU) duplicated once in the LSC region. Comparisons of IR boundaries among four asterid species showed that IR/LSC borders were extended into the 5' portion of the psbA gene and IR contraction occurred in Actinidia. The clap gene has been lost from the chloroplast genome in Actinidia, and may have been transferred to the nucleus during chloroplast evolution. Twenty-seven polymorphic simple sequence repeat (SSR) loci were identified in the Actinidia chloroplast genome. Maximum parsimony analyses of a 72-gene, 16 taxa angiosperm dataset strongly support the placement of Actinidiaceae in Ericales within the basal asterids.

  2. Enhanced Gene Expression Rather than Natural Polymorphism in Coding Sequence of the OsbZIP23 Determines Drought Tolerance and Yield Improvement in Rice Genotypes

    PubMed Central

    Dey, Avishek; Samanta, Milan Kumar; Gayen, Srimonta; Sen, Soumitra K.; Maiti, Mrinal K.

    2016-01-01

    Drought is one of the major limiting factors for productivity of crops including rice (Oryza sativa L.). Understanding the role of allelic variations of key regulatory genes involved in stress-tolerance is essential for developing an effective strategy to combat drought. The bZIP transcription factors play a crucial role in abiotic-stress adaptation in plants via abscisic acid (ABA) signaling pathway. The present study aimed to search for allelic polymorphism in the OsbZIP23 gene across selected drought-tolerant and drought-sensitive rice genotypes, and to characterize the new allele through overexpression (OE) and gene-silencing (RNAi). Analyses of the coding DNA sequence (CDS) of the cloned OsbZIP23 gene revealed single nucleotide polymorphism at four places and a 15-nucleotide deletion at one place. The single-copy OsbZIP23 gene is expressed at relatively higher level in leaf tissues of drought-tolerant genotypes, and its abundance is more in reproductive stage. Cloning and sequence analyses of the OsbZIP23-promoter from drought-tolerant O. rufipogon and drought-sensitive IR20 cultivar showed variation in the number of stress-responsive cis-elements and a 35-nucleotide deletion at 5’-UTR in IR20. Analysis of the GFP reporter gene function revealed that the promoter activity of O. rufipogon is comparatively higher than that of IR20. The overexpression of any of the two polymorphic forms (1083 bp and 1068 bp CDS) of OsbZIP23 improved drought tolerance and yield-related traits significantly by retaining higher content of cellular water, soluble sugar and proline; and exhibited decrease in membrane lipid peroxidation in comparison to RNAi lines and non-transgenic plants. The OE lines showed higher expression of target genes-OsRab16B, OsRab21 and OsLEA3-1 and increased ABA sensitivity; indicating that OsbZIP23 is a positive transcriptional-regulator of the ABA-signaling pathway. Taken together, the present study concludes that the enhanced gene expression rather than natural polymorphism in coding sequence of OsbZIP23 is accountable for improved drought tolerance and yield performance in rice genotypes. PMID:26959651

  3. Common genetic variants in the 9p21 region and their associations with multiple tumours.

    PubMed

    Gu, F; Pfeiffer, R M; Bhattacharjee, S; Han, S S; Taylor, P R; Berndt, S; Yang, H; Sigurdson, A J; Toro, J; Mirabello, L; Greene, M H; Freedman, N D; Abnet, C C; Dawsey, S M; Hu, N; Qiao, Y-L; Ding, T; Brenner, A V; Garcia-Closas, M; Hayes, R; Brinton, L A; Lissowska, J; Wentzensen, N; Kratz, C; Moore, L E; Ziegler, R G; Chow, W-H; Savage, S A; Burdette, L; Yeager, M; Chanock, S J; Chatterjee, N; Tucker, M A; Goldstein, A M; Yang, X R

    2013-04-02

    The chromosome 9p21.3 region has been implicated in the pathogenesis of multiple cancers. We systematically examined up to 203 tagging SNPs of 22 genes on 9p21.3 (19.9-32.8 Mb) in eight case-control studies: thyroid cancer, endometrial cancer (EC), renal cell carcinoma, colorectal cancer (CRC), colorectal adenoma (CA), oesophageal squamous cell carcinoma (ESCC), gastric cardia adenocarcinoma and osteosarcoma (OS). We used logistic regression to perform single SNP analyses for each study separately, adjusting for study-specific covariates. We combined SNP results across studies by fixed-effect meta-analyses and a newly developed subset-based statistical approach (ASSET). Gene-based P-values were obtained by the minP method using the Adaptive Rank Truncated Product program. We adjusted for multiple comparisons by Bonferroni correction. Rs3731239 in cyclin-dependent kinase inhibitors 2A (CDKN2A) was significantly associated with ESCC (P=7 × 10(-6)). The CDKN2A-ESCC association was further supported by gene-based analyses (Pgene=0.0001). In the meta-analyses by ASSET, four SNPs (rs3731239 in CDKN2A, rs615552 and rs573687 in CDKN2B and rs564398 in CDKN2BAS) showed significant associations with ESCC and EC (P<2.46 × 10(-4)). One SNP in MTAP (methylthioadenosine phosphorylase) (rs7023329) that was previously associated with melanoma and nevi in multiple genome-wide association studies was associated with CRC, CA and OS by ASSET (P=0.007). Our data indicate that genetic variants in CDKN2A, and possibly nearby genes, may be associated with ESCC and several other tumours, further highlighting the importance of 9p21.3 genetic variants in carcinogenesis.

  4. Association of α-, β-, and γ-Synuclein With Diffuse Lewy Body Disease

    PubMed Central

    Nishioka, Kenya; Wider, Christian; Vilariño-Güell, Carles; Soto-Ortolaza, Alexandra I.; Lincoln, Sarah J.; Kachergus, Jennifer M.; Jasinska-Myga, Barbara; Ross, Owen A.; Rajput, Alex; Robinson, Christopher A.; Ferman, Tanis J.; Wszolek, Zbigniew K.; Dickson, Dennis W.; Farrer, Matthew J.

    2016-01-01

    Objective To determine the association of the genes that encode α-, β-, and γ-synuclein (SNCA, SNCB, and SNCG, respectively) with diffuse Lewy body disease (DLBD). Design Case-control study. Subjects A total of 172 patients with DLBD consistent with a clinical diagnosis of Parkinson disease dementia/dementia with Lewy bodies and 350 clinically and 97 pathologically normal controls. Interventions Sequencing of SNCA, SNCB, and SNCG and genotyping of single-nucleotide polymorphisms performed on an Applied Biosystems capillary sequencer and a Sequenom MassArray pLEX platform, respectively. Associations were determined using χ2 or Fisher exact tests. Results Initial sequencing studies of the coding regions of each gene in 89 patients with DLBD did not detect any pathogenic substitutions. Nevertheless, genotyping of known polymorphic variability in sequence-conserved regions detected several single-nucleotide polymorphisms in the SNCA and SNCG genes that were significantly associated with disease (P=.05 to <.001). Significant association was also observed for 3 single-nucleotide polymorphisms located in SNCB when comparing DLBD cases and pathologically confirmed normal controls (P=.03-.01); however, this association was not significant for the clinical controls alone or the combined clinical and pathological controls (P>.05). After correction for multiple testing, only 1 single-nucleotide polymorphism in SNCG (rs3750823) remained significant in all of the analyses (P=.05-.009). Conclusion These findings suggest that variants in all 3 members of the synuclein gene family, particularly SNCA and SNCG, affect the risk of developing DLBD and warrant further investigation in larger, pathologically defined data sets as well as clinically diagnosed Parkinson disease/dementia with Lewy bodies case-control series. PMID:20697047

  5. Association of SNPs in dopamine and serotonin pathway genes and their interacting genes with temperament traits in Charolais cows.

    PubMed

    Garza-Brenner, E; Sifuentes-Rincón, A M; Randel, R D; Paredes-Sánchez, F A; Parra-Bracamonte, G M; Arellano Vera, W; Rodríguez Almeida, F A; Segura Cabrera, A

    2017-08-01

    Cattle temperament is a complex trait, and molecular studies aimed at defining this trait are scarce. We used an interaction networks approach to identify new genes (interacting genes) and to estimate their effects and those of 19 dopamine- and serotonin-related genes on the temperament traits of Charolais cattle. The genes proopiomelanocortin (POMC), neuropeptide Y (NPY), solute carrier family 18, member 2 (SLC18A2) and FBJ murine osteosarcoma viral oncogene homologue (FOSFBJ) were identified as new candidates. Their potential to be associated with temperament was estimated according to their reported biological activities, which included interactions with neural activity, receptor function, targeting or synthesis of neurotransmitters and association with behaviour. Pen score (PS) and exit velocity (EV) measures were determined from 412 Charolais cows to calculate their temperament score (TS). Based on the TS, calm (n = 55; TS, 1.09 ± 0.33) and temperamental (n = 58; TS, 2.27 ± 0.639) cows were selected and genotyped using a 248 single-nucleotide variation (SNV) panel. Of the 248 variations in the panel, only 151 were confirmed to be polymorphic (single-nucleotide polymorphisms; SNPs) in the tested population. Single-marker association analyses between genotypes and temperament measures (EV, PS and/or TS) indicated significant associations of six SNPs from four candidate genes. The markers rs109576799 and rs43696138, located in the DRD3 and HTR2A genes, respectively, were significantly associated with both EV and TS traits. Four markers, rs110365063 and rs137756569 from the POMC gene and rs110365063 and rs135155082 located in SLC18A2 and DRD2, respectively, were associated with PS. The variant rs110365063 located in bovine SLC18A2 causes a change in the amino acid sequence from Ala to Thr. Further studies are needed to confirm the association of genetic profile with cattle temperament; however, our study represents important progress in understanding the regulation of cattle temperament by different genes with divergent functions.

  6. Functional Regression Models for Epistasis Analysis of Multiple Quantitative Traits.

    PubMed

    Zhang, Futao; Xie, Dan; Liang, Meimei; Xiong, Momiao

    2016-04-01

    To date, most genetic analyses of phenotypes have focused on analyzing single traits or analyzing each phenotype independently. However, joint epistasis analysis of multiple complementary traits will increase statistical power and improve our understanding of the complicated genetic structure of the complex diseases. Despite their importance in uncovering the genetic structure of complex traits, the statistical methods for identifying epistasis in multiple phenotypes remains fundamentally unexplored. To fill this gap, we formulate a test for interaction between two genes in multiple quantitative trait analysis as a multiple functional regression (MFRG) in which the genotype functions (genetic variant profiles) are defined as a function of the genomic position of the genetic variants. We use large-scale simulations to calculate Type I error rates for testing interaction between two genes with multiple phenotypes and to compare the power with multivariate pairwise interaction analysis and single trait interaction analysis by a single variate functional regression model. To further evaluate performance, the MFRG for epistasis analysis is applied to five phenotypes of exome sequence data from the NHLBI's Exome Sequencing Project (ESP) to detect pleiotropic epistasis. A total of 267 pairs of genes that formed a genetic interaction network showed significant evidence of epistasis influencing five traits. The results demonstrate that the joint interaction analysis of multiple phenotypes has a much higher power to detect interaction than the interaction analysis of a single trait and may open a new direction to fully uncovering the genetic structure of multiple phenotypes.

  7. Tests of linkage and/or association of the LEPR gene polymorphisms with obesity phenotypes in Caucasian nuclear families.

    PubMed

    Liu, Yong-Jun; Rocha-Sanchez, Sonia M S; Liu, Peng-Yuan; Long, Ji-Rong; Lu, Yan; Elze, Leo; Recker, Robert R; Deng, Hong-Wen

    2004-04-13

    Genetic variations in the leptin receptor (LEPR) gene have been conceived to affect body weight in general populations. In this study, using the tests implemented in the statistical package QTDT, we evaluated association and/or linkage of the LEPR gene with obesity phenotypes in a large sample comprising 1,873 subjects from 405 Caucasian nuclear families. Obesity phenotypes tested include body mass index (BMI), fat mass, percentage fat mass (PFM), and lean mass, with the latter three measured by dual-energy X-ray absorptiometry (DXA). Three single nucleotide polymorphisms (SNPs), namely Lys109Arg (A/G), Lys656Asn (G/C), Pro1019Pro (G/A), in the LEPR gene were analyzed. Significant linkage disequilibrium (0.394 < or = |D'| < or = 0.688, P < 0.001) was observed between pairs of the three SNPs. No significant population stratification was found for any SNP/phenotype. In single-locus analyses, evidence of association was observed for Lys656Asn with lean mass (P = 0.002) and fat mass (P = 0.015). The contribution of this polymorphism to the phenotypic variation of lean mass and fat mass was 2.63% and 1.15%, respectively. Subjects carrying allele G at the Lys656Asn site had, on average, 3.16% higher lean mass and 2.71% higher fat mass than those without it. In the analyses for haplotypes defined by the three SNPs, significant associations were detected between haplotype GCA (P = 0.005) and lean mass. In addition, marginally significant evidence of association was observed for this haplotype with fat mass (P = 0.012). No statistically significant linkage was found, largely due to the limited power of the linkage approach to detect small genetic effects in our data sets. Our results suggest that the LEPR gene polymorphisms contribute to variation in obesity phenotypes.

  8. Using Movies to Analyse Gene Circuit Dynamics in Single Cells

    PubMed Central

    Locke, James CW; Elowitz, Michael B

    2010-01-01

    Preface Many bacterial systems rely on dynamic genetic circuits to control critical processes. A major goal of systems biology is to understand these behaviours in terms of individual genes and their interactions. However, traditional techniques based on population averages wash out critical dynamics that are either unsynchronized between cells or driven by fluctuations, or ‘noise,’ in cellular components. Recently, the combination of time-lapse microscopy, quantitative image analysis, and fluorescent protein reporters has enabled direct observation of multiple cellular components over time in individual cells. In conjunction with mathematical modelling, these techniques are now providing powerful insights into genetic circuit behaviour in diverse microbial systems. PMID:19369953

  9. A Protocol for Using Gene Set Enrichment Analysis to Identify the Appropriate Animal Model for Translational Research.

    PubMed

    Weidner, Christopher; Steinfath, Matthias; Wistorf, Elisa; Oelgeschläger, Michael; Schneider, Marlon R; Schönfelder, Gilbert

    2017-08-16

    Recent studies that compared transcriptomic datasets of human diseases with datasets from mouse models using traditional gene-to-gene comparison techniques resulted in contradictory conclusions regarding the relevance of animal models for translational research. A major reason for the discrepancies between different gene expression analyses is the arbitrary filtering of differentially expressed genes. Furthermore, the comparison of single genes between different species and platforms often is limited by technical variance, leading to misinterpretation of the con/discordance between data from human and animal models. Thus, standardized approaches for systematic data analysis are needed. To overcome subjective gene filtering and ineffective gene-to-gene comparisons, we recently demonstrated that gene set enrichment analysis (GSEA) has the potential to avoid these problems. Therefore, we developed a standardized protocol for the use of GSEA to distinguish between appropriate and inappropriate animal models for translational research. This protocol is not suitable to predict how to design new model systems a-priori, as it requires existing experimental omics data. However, the protocol describes how to interpret existing data in a standardized manner in order to select the most suitable animal model, thus avoiding unnecessary animal experiments and misleading translational studies.

  10. PIGD: a database for intronless genes in the Poaceae.

    PubMed

    Yan, Hanwei; Jiang, Cuiping; Li, Xiaoyu; Sheng, Lei; Dong, Qing; Peng, Xiaojian; Li, Qian; Zhao, Yang; Jiang, Haiyang; Cheng, Beijiu

    2014-10-01

    Intronless genes are a feature of prokaryotes; however, they are widespread and unequally distributed among eukaryotes and represent an important resource to study the evolution of gene architecture. Although many databases on exons and introns exist, there is currently no cohesive database that collects intronless genes in plants into a single database. In this study, we present the Poaceae Intronless Genes Database (PIGD), a user-friendly web interface to explore information on intronless genes from different plants. Five Poaceae species, Sorghum bicolor, Zea mays, Setaria italica, Panicum virgatum and Brachypodium distachyon, are included in the current release of PIGD. Gene annotations and sequence data were collected and integrated from different databases. The primary focus of this study was to provide gene descriptions and gene product records. In addition, functional annotations, subcellular localization prediction and taxonomic distribution are reported. PIGD allows users to readily browse, search and download data. BLAST and comparative analyses are also provided through this online database, which is available at http://pigd.ahau.edu.cn/. PIGD provides a solid platform for the collection, integration and analysis of intronless genes in the Poaceae. As such, this database will be useful for subsequent bio-computational analysis in comparative genomics and evolutionary studies.

  11. New encoded single-indicator sequences based on physico-chemical parameters for efficient exon identification.

    PubMed

    Meher, J K; Meher, P K; Dash, G N; Raval, M K

    2012-01-01

    The first step in gene identification problem based on genomic signal processing is to convert character strings into numerical sequences. These numerical sequences are then analysed spectrally or using digital filtering techniques for the period-3 peaks, which are present in exons (coding areas) and absent in introns (non-coding areas). In this paper, we have shown that single-indicator sequences can be generated by encoding schemes based on physico-chemical properties. Two new methods are proposed for generating single-indicator sequences based on hydration energy and dipole moments. The proposed methods produce high peak at exon locations and effectively suppress false exons (intron regions having greater peak than exon regions) resulting in high discriminating factor, sensitivity and specificity.

  12. Genome-wide bisulfite sensitivity profiling of yeast suggests bisulfite inhibits transcription.

    PubMed

    Segovia, Romulo; Mathew, Veena; Tam, Annie S; Stirling, Peter C

    2017-09-01

    Bisulfite, in the form of sodium bisulfite or metabisulfite, is used commercially as a food preservative. Bisulfite is used in the laboratory as a single-stranded DNA mutagen in epigenomic analyses of DNA methylation. Recently it has also been used on whole yeast cells to induce mutations in exposed single-stranded regions in vivo. To understand the effects of bisulfite on live cells we conducted a genome-wide screen for bisulfite sensitive mutants in yeast. Screening the deletion mutant array, and collections of essential gene mutants we define a genetic network of bisulfite sensitive mutants. Validation of screen hits revealed hyper-sensitivity of transcription and RNA processing mutants, rather than DNA repair pathways and follow-up analyses support a role in perturbation of RNA transactions. We propose a model in which bisulfite-modified nucleotides may interfere with transcription or RNA metabolism when used in vivo. Copyright © 2017 Elsevier B.V. All rights reserved.

  13. Fluid Mechanics, Arterial Disease, and Gene Expression.

    PubMed

    Tarbell, John M; Shi, Zhong-Dong; Dunn, Jessilyn; Jo, Hanjoong

    2014-01-01

    This review places modern research developments in vascular mechanobiology in the context of hemodynamic phenomena in the cardiovascular system and the discrete localization of vascular disease. The modern origins of this field are traced, beginning in the 1960s when associations between flow characteristics, particularly blood flow-induced wall shear stress, and the localization of atherosclerotic plaques were uncovered, and continuing to fluid shear stress effects on the vascular lining endothelial) cells (ECs), including their effects on EC morphology, biochemical production, and gene expression. The earliest single-gene studies and genome-wide analyses are considered. The final section moves from the ECs lining the vessel wall to the smooth muscle cells and fibroblasts within the wall that are fluid me chanically activated by interstitial flow that imposes shear stresses on their surfaces comparable with those of flowing blood on EC surfaces. Interstitial flow stimulates biochemical production and gene expression, much like blood flow on ECs.

  14. Biological annotation of genetic loci associated with intelligence in a meta-analysis of 87,740 individuals.

    PubMed

    Coleman, Jonathan R I; Bryois, Julien; Gaspar, Héléna A; Jansen, Philip R; Savage, Jeanne E; Skene, Nathan; Plomin, Robert; Muñoz-Manchado, Ana B; Linnarsson, Sten; Crawford, Greg; Hjerling-Leffler, Jens; Sullivan, Patrick F; Posthuma, Danielle; Breen, Gerome

    2018-03-08

    Variance in IQ is associated with a wide range of health outcomes, and 1% of the population are affected by intellectual disability. Despite a century of research, the fundamental neural underpinnings of intelligence remain unclear. We integrate results from genome-wide association studies (GWAS) of intelligence with brain tissue and single cell gene expression data to identify tissues and cell types associated with intelligence. GWAS data for IQ (N = 78,308) were meta-analyzed with a study comparing 1247 individuals with mean IQ ~170 to 8185 controls. Genes associated with intelligence implicate pyramidal neurons of the somatosensory cortex and CA1 region of the hippocampus, and midbrain embryonic GABAergic neurons. Tissue-specific analyses find the most significant enrichment for frontal cortex brain expressed genes. These results suggest specific neuronal cell types and genes may be involved in intelligence and provide new hypotheses for neuroscience experiments using model systems.

  15. Identification of embryonic pancreatic genes using Xenopus DNA microarrays.

    PubMed

    Hayata, Tadayoshi; Blitz, Ira L; Iwata, Nahoko; Cho, Ken W Y

    2009-06-01

    The pancreas is both an exocrine and endocrine endodermal organ involved in digestion and glucose homeostasis. During embryogenesis, the anlagen of the pancreas arise from dorsal and ventral evaginations of the foregut that later fuse to form a single organ. To better understand the molecular genetics of early pancreas development, we sought to isolate markers that are uniquely expressed in this tissue. Microarray analysis was performed comparing dissected pancreatic buds, liver buds, and the stomach region of tadpole stage Xenopus embryos. A total of 912 genes were found to be differentially expressed between these organs during early stages of organogenesis. K-means clustering analysis predicted 120 of these genes to be specifically enriched in the pancreas. Of these, we report on the novel expression patterns of 24 genes. Our analyses implicate the involvement of previously unsuspected signaling pathways during early pancreas development. Developmental Dynamics 238:1455-1466, 2009. (c) 2009 Wiley-Liss, Inc.

  16. 3D FISH to analyse gene domain-specific chromatin re-modeling in human cancer cell lines.

    PubMed

    Kocanova, Silvia; Goiffon, Isabelle; Bystricky, Kerstin

    2018-06-01

    Fluorescence in situ hybridization (FISH) is a common technique used to label DNA and/or RNA for detection of a genomic region of interest. However, the technique can be challenging, in particular when applied to single genes in human cancer cells. Here, we provide a step-by-step protocol for analysis of short (35 kb-300 kb) genomic regions in three dimensions (3D). We discuss the experimental design and provide practical considerations for 3D imaging and data analysis to determine chromatin folding. We demonstrate that 3D FISH using BACs (Bacterial Artificial Chromosomes) or fosmids can provide detailed information of the architecture of gene domains. More specifically, we show that mapping of specific chromatin landscapes informs on changes associated with estrogen stimulated gene activity in human breast cancer cell lines. Copyright © 2018 Elsevier Inc. All rights reserved.

  17. Single-cell gene expression analysis reveals diversity among human spermatogonia.

    PubMed

    Neuhaus, N; Yoon, J; Terwort, N; Kliesch, S; Seggewiss, J; Huge, A; Voss, R; Schlatt, S; Grindberg, R V; Schöler, H R

    2017-02-10

    Is the molecular profile of human spermatogonia homogeneous or heterogeneous when analysed at the single-cell level? Heterogeneous expression profiles may be a key characteristic of human spermatogonia, supporting the existence of a heterogeneous stem cell population. Despite the fact that many studies have sought to identify specific markers for human spermatogonia, the molecular fingerprint of these cells remains hitherto unknown. Testicular tissues from patients with spermatogonial arrest (arrest, n = 1) and with qualitatively normal spermatogenesis (normal, n = 7) were selected from a pool of 179 consecutively obtained biopsies. Gene expression analyses of cell populations and single-cells (n = 105) were performed. Two OCT4-positive individual cells were selected for global transcriptional capture using shallow RNA-seq. Finally, expression of four candidate markers was assessed by immunohistochemistry. Histological analysis and blood hormone measurements for LH, FSH and testosterone were performed prior to testicular sample selection. Following enzymatic digestion of testicular tissues, differential plating and subsequent micromanipulation of individual cells was employed to enrich and isolate human spermatogonia, respectively. Endpoint analyses were qPCR analysis of cell populations and individual cells, shallow RNA-seq and immunohistochemical analyses. Unexpectedly, single-cell expression data from the arrest patient (20 cells) showed heterogeneous expression profiles. Also, from patients with normal spermatogenesis, heterogeneous expression patterns of undifferentiated (OCT4, UTF1 and MAGE A4) and differentiated marker genes (BOLL and PRM2) were obtained within each spermatogonia cluster (13 clusters with 85 cells). Shallow RNA-seq analysis of individual human spermatogonia was validated, and a spermatogonia-specific heterogeneous protein expression of selected candidate markers (DDX5, TSPY1, EEF1A1 and NGN3) was demonstrated. The heterogeneity of human spermatogonia at the RNA and protein levels is a snapshot. To further assess the functional meaning of this heterogeneity and the dynamics of stem cell populations, approaches need to be developed to facilitate the repeated analysis of individual cells. Our data suggest that heterogeneous expression profiles may be a key characteristic of human spermatogonia, supporting the model of a heterogeneous stem cell population. Future studies will assess the dynamics of spermatogonial populations in fertile and infertile patients. RNA-seq data is published in the GEO database: GSE91063. This work was supported by the Max Planck Society and the Deutsche Forschungsgemeinschaft DFG-Research Unit FOR 1041 Germ Cell Potential (grant numbers SCHO 340/7-1, SCHL394/11-2). The authors declare that there is no conflict of interest. © The Author 2017. Published by Oxford University Press on behalf of the European Society of Human Reproduction and Embryology. All rights reserved. For Permissions, please email: journals.permissions@oup.com

  18. Polymorphisms in an obesity-related gene (PCSK1) are associated with fat deposition and production traits in Italian heavy pigs.

    PubMed

    Fontanesi, L; Bertolini, F; Scotti, E; Trevisi, P; Buttazzoni, L; Dall'olio, S; Davoli, R; Bosi, P; Russo, V

    2012-12-01

    The proprotein convertase subtilisin/kexin type 1 (PCSK1) gene encodes the prohormone convertase 1/3 enzyme that processes prohormones into functional hormones that, in turn, regulate central and peripheral energy metabolism. Mutations in the human PCSK1 gene cause severe monogenic obesity or confer risk of obesity. We herein investigated the porcine PCSK1 gene with the aim of identifying polymorphisms associated with fat deposition and production traits in Italian heavy pigs. By re-sequencing about 5.1 kb of this gene in 21 pigs of different breeds, we discovered 14 polymorphisms that were organized in nine haplotypes, clearly distributed in two clades of putative European and Asian origin. Then we re-mapped this gene on porcine chromosome 2 and analysed its expression in several tissues including gastric oxyntic mucosa of weanling pigs in which PCSK1 processes the pre-pro-ghrelin into ghrelin, which in turn is involved in the control of feed intake and energy metabolism. Association analyses between PCSK1 single-nucleotide polymorphisms (SNPs) and production, carcass and several other traits were conducted on five groups of pigs from three different experimental designs, for a total of 1221 animals. Results indicated that the analysed SNPs were associated (P < 0.01 or P < 0.05) with several traits including backfat thickness and visible intermuscular fat in Italian Duroc (ID) and growth performances in Italian Large White (ILW) and in ILW × Italian Landrace pigs. However, the effects estimated in the ILW were opposite to the effects reported in the ID pigs. Suggestive association (P < 0.10) was observed with muscle cathepsin B activity, opening, if confirmed, potential applications to reduce the excessive softness defect of the green hams that is of particular concern for the processing industry. The results obtained supported the need to further investigate the PCSK1 gene to fully exploit the value of its variability and apply this information in pig breeding programmes.

  19. Genome-wide association study of alcohol consumption and genetic overlap with other health-related traits in UK Biobank (N=112 117)

    PubMed Central

    Clarke, T-K; Adams, M J; Davies, G; Howard, D M; Hall, L S; Padmanabhan, S; Murray, A D; Smith, B H; Campbell, A; Hayward, C; Porteous, D J; Deary, I J; McIntosh, A M

    2017-01-01

    Alcohol consumption has been linked to over 200 diseases and is responsible for over 5% of the global disease burden. Well-known genetic variants in alcohol metabolizing genes, for example, ALDH2 and ADH1B, are strongly associated with alcohol consumption but have limited impact in European populations where they are found at low frequency. We performed a genome-wide association study (GWAS) of self-reported alcohol consumption in 112 117 individuals in the UK Biobank (UKB) sample of white British individuals. We report significant genome-wide associations at 14 loci. These include single-nucleotide polymorphisms (SNPs) in alcohol metabolizing genes (ADH1B/ADH1C/ADH5) and two loci in KLB, a gene recently associated with alcohol consumption. We also identify SNPs at novel loci including GCKR, CADM2 and FAM69C. Gene-based analyses found significant associations with genes implicated in the neurobiology of substance use (DRD2, PDE4B). GCTA analyses found a significant SNP-based heritability of self-reported alcohol consumption of 13% (se=0.01). Sex-specific analyses found largely overlapping GWAS loci and the genetic correlation (rG) between male and female alcohol consumption was 0.90 (s.e.=0.09, P-value=7.16 × 10−23). Using LD score regression, genetic overlap was found between alcohol consumption and years of schooling (rG=0.18, s.e.=0.03), high-density lipoprotein cholesterol (rG=0.28, s.e.=0.05), smoking (rG=0.40, s.e.=0.06) and various anthropometric traits (for example, overweight, rG=−0.19, s.e.=0.05). This study replicates the association between alcohol consumption and alcohol metabolizing genes and KLB, and identifies novel gene associations that should be the focus of future studies investigating the neurobiology of alcohol consumption. PMID:28937693

  20. A single regulatory gene is sufficient to alter Vibrio aestuarianus pathogenicity in oysters.

    PubMed

    Goudenège, David; Travers, Marie Agnès; Lemire, Astrid; Petton, Bruno; Haffner, Philippe; Labreuche, Yannick; Tourbiez, Delphine; Mangenot, Sophie; Calteau, Alexandra; Mazel, Didier; Nicolas, Jean Louis; Jacq, Annick; Le roux, Frédérique

    2015-11-01

    Oyster diseases caused by pathogenic vibrios pose a major challenge to the sustainability of oyster farming. In France, since 2012 a disease affecting specifically adult oysters has been associated with the presence of Vibrio aestuarianus. Here, by combining genome comparison, phylogenetic analyses and high-throughput infections of strains isolated before or during the recent outbreaks, we show that virulent strains cluster into two V. aestuarianus lineages independently of the sampling dates. The bacterial lethal dose was not different between strains isolated before or after 2012. Hence, the emergence of a new highly virulent clonal strain is unlikely. Each lineage comprises nearly identical strains, the majority of them being virulent, suggesting that within these phylogenetically coherent virulent lineages a few strains have lost their pathogenicity. Comparative genomics allowed the identification of a single frameshift in a non-virulent strain. This mutation affects the varS gene that codes for a signal transduction histidine-protein kinase. Genetic analyses confirmed that varS is necessary for infection of oysters and for a secreted metalloprotease expression. For the first time in a Vibrio species, we show here that VarS is a key factor of pathogenicity. © 2014 Society for Applied Microbiology and John Wiley & Sons Ltd.

  1. Analysis of the 227 bp short interspersed nuclear element (SINE) insertion of the promoter of the myostatin (MSTN) gene in different horse breeds.

    PubMed

    Dall'Olio, Stefania; Scotti, Emilio; Fontanesi, Luca; Tassinari, Marco

    2014-01-01

    The myostatin (MSTN) gene encodes a protein known to be a negative regulator of muscle mass in mammalian species. Different polymorphisms of the horse (Equus caballus) MSTN gene have been identified, including single nucleotide polymorphisms and a short interspersed nuclear element (SINE) insertion of 227 bp within the promoter of the gene. The SINE insertion has been associated with performance traits in Thoroughbred racehorses and it was proposed as a predictor of optimum racing distance. The aims of this study were to perform in silico analysis to identify putative gains or abrogation of transcription-factor binding sites (TFBSs) generated by the SINE allele of the promoter and to analyse the frequency of the SINE insertion in horses used for racing (gallop and trot) and other purposes. The SINE insertion was genotyped in 227 horses from 10 breeds belonging to different morphological types (brachimorphic, mesomorphic, meso-dolichomorphic and dolichomorphic). The presence of the insertion was confirmed in the Quarter Horse (SINE allele frequency of 0.81) and in the Thoroughbred (0.51), whereas the SINE allele did not segregate in any of the other analysed breeds. As the SINE MSTN gene polymorphism may be population or breed specific, it is not a useful marker for association studies in all breeds.

  2. Isolation, phylogeny and evolution of the SymRK gene in the legume genus Lupinus L.

    PubMed

    Mahé, Frédéric; Markova, Dragomira; Pasquet, Rémy; Misset, Marie-Thérèse; Aïnouche, Abdelkader

    2011-07-01

    SymRK is one of the key genes involved in initial steps of legume symbiotic association with fungi (mycorrhization) and nitrogen-fixing bacteria (nodulation). A large portion of the sequence encoding the extracellular domain of SYMRK was obtained for 38 lupine accessions and 2 outgroups in order to characterize this region, to evaluate its phylogenetic utility, and to examine whether its molecular evolutionary pattern is correlated with rhizobial diversity and specificity in Lupinus. The data suggested that, in Lupinus, SymRK is a single copy gene that shows good phylogenetic potential. Accordingly, SymRK provided additional support to previous molecular phylogenies, and shed additional light on relationships within the Old World group of Lupinus, especially among the African species. Similar to results of other studies, analyses of SymRK sequences were unable to resolve placement of the Florida unifoliolate lineage, whose relationship was weakly supported to either the Old or the New World lupines. Our data are consistent with strong purifying selection operating on SymRK in Lupinus, preserving rather than diversifying its function. Thus, although SymRK was demonstrated to be a vital gene in the early stages of the root-bacterial symbiotic associations, no evidence from present analyses indicate that this gene is involved in changes in rhizobial specificity in Lupinus. Copyright © 2011 Elsevier Inc. All rights reserved.

  3. Genome-environment association study suggests local adaptation to climate at the regional scale in Fagus sylvatica.

    PubMed

    Pluess, Andrea R; Frank, Aline; Heiri, Caroline; Lalagüe, Hadrien; Vendramin, Giovanni G; Oddou-Muratorio, Sylvie

    2016-04-01

    The evolutionary potential of long-lived species, such as forest trees, is fundamental for their local persistence under climate change (CC). Genome-environment association (GEA) analyses reveal if species in heterogeneous environments at the regional scale are under differential selection resulting in populations with potential preadaptation to CC within this area. In 79 natural Fagus sylvatica populations, neutral genetic patterns were characterized using 12 simple sequence repeat (SSR) markers, and genomic variation (144 single nucleotide polymorphisms (SNPs) out of 52 candidate genes) was related to 87 environmental predictors in the latent factor mixed model, logistic regressions and isolation by distance/environmental (IBD/IBE) tests. SSR diversity revealed relatedness at up to 150 m intertree distance but an absence of large-scale spatial genetic structure and IBE. In the GEA analyses, 16 SNPs in 10 genes responded to one or several environmental predictors and IBE, corrected for IBD, was confirmed. The GEA often reflected the proposed gene functions, including indications for adaptation to water availability and temperature. Genomic divergence and the lack of large-scale neutral genetic patterns suggest that gene flow allows the spread of advantageous alleles in adaptive genes. Thereby, adaptation processes are likely to take place in species occurring in heterogeneous environments, which might reduce their regional extinction risk under CC. © 2016 The Authors. New Phytologist © 2016 New Phytologist Trust.

  4. Evolution and population genomics of the Lyme borreliosis pathogen, Borrelia burgdorferi.

    PubMed

    Seifert, Stephanie N; Khatchikian, Camilo E; Zhou, Wei; Brisson, Dustin

    2015-04-01

    Population genomic studies have the potential to address many unresolved questions about microbial pathogens by facilitating the identification of genes underlying ecologically important traits, such as novel virulence factors and adaptations to humans or other host species. Additionally, this framework improves estimations of population demography and evolutionary history to accurately reconstruct recent epidemics and identify the molecular and environmental factors that resulted in the outbreak. The Lyme disease bacterium, Borrelia burgdorferi, exemplifies the power and promise of the application of population genomics to microbial pathogens. We discuss here the future of evolutionary studies in B. burgdorferi, focusing on the primary evolutionary forces of horizontal gene transfer, natural selection, and migration, as investigations transition from analyses of single genes to genomes. Copyright © 2015 Elsevier Ltd. All rights reserved.

  5. The Complete Chloroplast Genome of Wild Rice (Oryza minuta) and Its Comparison to Related Species.

    PubMed

    Asaf, Sajjad; Waqas, Muhammad; Khan, Abdul L; Khan, Muhammad A; Kang, Sang-Mo; Imran, Qari M; Shahzad, Raheem; Bilal, Saqib; Yun, Byung-Wook; Lee, In-Jung

    2017-01-01

    Oryza minuta , a tetraploid wild relative of cultivated rice (family Poaceae), possesses a BBCC genome and contains genes that confer resistance to bacterial blight (BB) and white-backed (WBPH) and brown (BPH) plant hoppers. Based on the importance of this wild species, this study aimed to understand the phylogenetic relationships of O. minuta with other Oryza species through an in-depth analysis of the composition and diversity of the chloroplast (cp) genome. The analysis revealed a cp genome size of 135,094 bp with a typical quadripartite structure and consisting of a pair of inverted repeats separated by small and large single copies, 139 representative genes, and 419 randomly distributed microsatellites. The genomic organization, gene order, GC content and codon usage are similar to those of typical angiosperm cp genomes. Approximately 30 forward, 28 tandem and 20 palindromic repeats were detected in the O . minuta cp genome. Comparison of the complete O. minuta cp genome with another eleven Oryza species showed a high degree of sequence similarity and relatively high divergence of intergenic spacers. Phylogenetic analyses were conducted based on the complete genome sequence, 65 shared genes and matK gene showed same topologies and O. minuta forms a single clade with parental O. punctata . Thus, the complete O . minuta cp genome provides interesting insights and valuable information that can be used to identify related species and reconstruct its phylogeny.

  6. Single Cells within the Puerto Rico Trench Suggest Hadal Adaptation of Microbial Lineages

    PubMed Central

    León-Zayas, Rosa; Novotny, Mark; Podell, Sheila; Shepard, Charles M.; Berkenpas, Eric; Nikolenko, Sergey; Pevzner, Pavel; Lasken, Roger S.

    2015-01-01

    Hadal ecosystems are found at a depth of 6,000 m below sea level and below, occupying less than 1% of the total area of the ocean. The microbial communities and metabolic potential in these ecosystems are largely uncharacterized. Here, we present four single amplified genomes (SAGs) obtained from 8,219 m below the sea surface within the hadal ecosystem of the Puerto Rico Trench (PRT). These SAGs are derived from members of deep-sea clades, including the Thaumarchaeota and SAR11 clade, and two are related to previously isolated piezophilic (high-pressure-adapted) microorganisms. In order to identify genes that might play a role in adaptation to deep-sea environments, comparative analyses were performed with genomes from closely related shallow-water microbes. The archaeal SAG possesses genes associated with mixotrophy, including lipoylation and the glycine cleavage pathway. The SAR11 SAG encodes glycolytic enzymes previously reported to be missing from this abundant and cosmopolitan group. The other SAGs, which are related to piezophilic isolates, possess genes that may supplement energy demands through the oxidation of hydrogen or the reduction of nitrous oxide. We found evidence for potential trench-specific gene distributions, as several SAG genes were observed only in a PRT metagenome and not in shallower deep-sea metagenomes. These results illustrate new ecotype features that might perform important roles in the adaptation of microorganisms to life in hadal environments. PMID:26386059

  7. A method for release and multiple strand amplification of small quantities of DNA from endospores of the fastidious bacterium Pasteuria penetrans.

    PubMed

    Mauchline, T H; Mohan, S; Davies, K G; Schaff, J E; Opperman, C H; Kerry, B R; Hirsch, P R

    2010-05-01

    To establish a reliable protocol to extract DNA from Pasteuria penetrans endospores for use as template in multiple strand amplification, thus providing sufficient material for genetic analyses. To develop a highly sensitive PCR-based diagnostic tool for P. penetrans. An optimized method to decontaminate endospores, release and purify DNA enabled multiple strand amplification. DNA purity was assessed by cloning and sequencing gyrB and 16S rRNA gene fragments obtained from PCR using generic primers. Samples indicated to be 100%P. penetrans by the gyrB assay were estimated at 46% using the 16S rRNA gene. No bias was detected on cloning and sequencing 12 housekeeping and sporulation gene fragments from amplified DNA. The detection limit by PCR with Pasteuria-specific 16S rRNA gene primers following multiple strand amplification of DNA extracted using the method was a single endospore. Generation of large quantities DNA will facilitate genomic sequencing of P. penetrans. Apparent differences in sample purity are explained by variations in 16S rRNA gene copy number in Eubacteria leading to exaggerated estimations of sample contamination. Detection of single endospores will facilitate investigations of P. penetrans molecular ecology. These methods will advance studies on P. penetrans and facilitate research on other obligate and fastidious micro-organisms where it is currently impractical to obtain DNA in sufficient quantity and quality.

  8. Comparative proteomics of a tor inducible Aspergillus fumigatus mutant reveals involvement of the Tor kinase in iron regulation.

    PubMed

    Baldin, Clara; Valiante, Vito; Krüger, Thomas; Schafferer, Lukas; Haas, Hubertus; Kniemeyer, Olaf; Brakhage, Axel A

    2015-07-01

    The Tor (target of rapamycin) kinase is one of the major regulatory nodes in eukaryotes. Here, we analyzed the Tor kinase in Aspergillus fumigatus, which is the most important airborne fungal pathogen of humans. Because deletion of the single tor gene was apparently lethal, we generated a conditional lethal tor mutant by replacing the endogenous tor gene by the inducible xylp-tor gene cassette. By both 2DE and gel-free LC-MS/MS, we found that Tor controls a variety of proteins involved in nutrient sensing, stress response, cell cycle progression, protein biosynthesis and degradation, but also processes in mitochondria, such as respiration and ornithine metabolism, which is required for siderophore formation. qRT-PCR analyses indicated that mRNA levels of ornithine biosynthesis genes were increased under iron limitation. When tor was repressed, iron regulation was lost. In a deletion mutant of the iron regulator HapX also carrying the xylp-tor cassette, the regulation upon iron deprivation was similar to that of the single tor inducible mutant strain. In line, hapX expression was significantly reduced when tor was repressed. Thus, Tor acts either upstream of HapX or independently of HapX as a repressor of the ornithine biosynthesis genes and thereby regulates the production of siderophores. © 2015 WILEY-VCH Verlag GmbH & Co. KGaA, Weinheim.

  9. Hormone-Related Pathways and Risk of Breast Cancer Subtypes in African American Women

    PubMed Central

    Haddad, Stephen A.; Lunetta, Kathryn L.; Ruiz-Narváez, Edward A.; Bensen, Jeannette T.; Hong, Chi-Chen; Sucheston-Campbell, Lara E.; Yao, Song; Bandera, Elisa V.; Rosenberg, Lynn; Haiman, Christopher A.; Troester, Melissa A.; Ambrosone, Christine B.; Palmer, Julie R.

    2016-01-01

    Purpose We sought to investigate genetic variation in hormone pathways in relation to risk of overall and subtype-specific breast cancer in women of African ancestry (AA). Methods Genotyping and imputation yielded data on 143,934 SNPs in 308 hormone-related genes for 3663 breast cancer cases (1098 ER-, 1983 ER+, 582 ER unknown) and 4687 controls from the African American Breast Cancer Epidemiology and Risk (AMBER) Consortium. AMBER includes data from four large studies of AA women: the Carolina Breast Cancer Study, the Women's Circle of Health Study, the Black Women's Health Study, and the Multiethnic Cohort Study. Pathway- and gene-based analyses were conducted, and single SNP tests were run for the top genes. Results There were no strong associations at the pathway level. The most significantly associated genes were GHRH, CALM2, CETP, and AKR1C1 for overall breast cancer (gene-based nominal p ≤0.01); NR0B1, IGF2R, CALM2, CYP1B1, and GRB2 for ER+ breast cancer (p ≤0.02); and PGR, MAPK3, MAP3K1, and LHCGR for ER- disease (p ≤0.02). Single-SNP tests for SNPs with pairwise linkage disequilibrium r2 <0.8 in the top genes identified 12 common SNPs (in CALM2, CETP, NR0B1, IGF2R, CYP1B1, PGR, MAPK3, and MAP3K1) associated with overall or subtype-specific breast cancer after gene-level correction for multiple testing. Rs11571215 in PGR (progesterone receptor) was the SNP most strongly associated with ER- disease. Conclusion We identified eight genes in hormone pathways that contain common variants associated with breast cancer in AA women after gene-level correction for multiple testing. PMID:26458823

  10. How immunogenetically different are domestic pigs from wild boars: a perspective from single-nucleotide polymorphisms of 19 immunity-related candidate genes.

    PubMed

    Chen, Shanyuan; Gomes, Rui; Costa, Vânia; Santos, Pedro; Charneca, Rui; Zhang, Ya-ping; Liu, Xue-hong; Wang, Shao-qing; Bento, Pedro; Nunes, Jose-Luis; Buzgó, József; Varga, Gyula; Anton, István; Zsolnai, Attila; Beja-Pereira, Albano

    2013-10-01

    The coexistence of wild boars and domestic pigs across Eurasia makes it feasible to conduct comparative genetic or genomic analyses for addressing how genetically different a domestic species is from its wild ancestor. To test whether there are differences in patterns of genetic variability between wild and domestic pigs at immunity-related genes and to detect outlier loci putatively under selection that may underlie differences in immune responses, here we analyzed 54 single-nucleotide polymorphisms (SNPs) of 19 immunity-related candidate genes on 11 autosomes in three pairs of wild boar and domestic pig populations from China, Iberian Peninsula, and Hungary. Our results showed no statistically significant differences in allele frequency and heterozygosity across SNPs between three pairs of wild and domestic populations. This observation was more likely due to the widespread and long-lasting gene flow between wild boars and domestic pigs across Eurasia. In addition, we detected eight coding SNPs from six genes as outliers being under selection consistently by three outlier tests (BayeScan2.1, FDIST2, and Arlequin3.5). Among four non-synonymous outlier SNPs, one from TLR4 gene was identified as being subject to positive (diversifying) selection and three each from CD36, IFNW1, and IL1B genes were suggested as under balancing selection. All of these four non-synonymous variants were predicted as being benign by PolyPhen-2. Our results were supported by other independent lines of evidence for positive selection or balancing selection acting on these four immune genes (CD36, IFNW1, IL1B, and TLR4). Our study showed an example applying a candidate gene approach to identify functionally important mutations (i.e., outlier loci) in wild and domestic pigs for subsequent functional experiments.

  11. Genetic assessment and folate receptor autoantibodies in infantile-onset cerebral folate deficiency (CFD) syndrome.

    PubMed

    Ramaekers, V Th; Segers, K; Sequeira, J M; Koenig, M; Van Maldergem, L; Bours, V; Kornak, U; Quadros, E V

    2018-05-01

    Cerebral folate deficiency (CFD) syndromes are defined as neuro-psychiatric conditions with low CSF folate and attributed to different causes such as autoantibodies against the folate receptor-alpha (FR) protein that can block folate transport across the choroid plexus, FOLR1 gene mutations or mitochondrial disorders. High-dose folinic acid treatment restores many neurologic deficits. Among 36 patients from 33 families the infantile-onset CFD syndrome was diagnosed based on typical clinical features and low CSF folate. All parents were healthy. Three families had 2 affected siblings, while parents from 4 families were first cousins. We analysed serum FR autoantibodies and the FOLR1 and FOLR2 genes. Among three consanguineous families homozygosity mapping attempted to identify a monogenetic cause. Whole exome sequencing (WES) was performed in the fourth consanguineous family, where two siblings also suffered from polyneuropathy as an atypical finding. Boys (72%) outnumbered girls (28%). Most patients (89%) had serum FR autoantibodies fluctuating over 5-6 weeks. Two children had a genetic FOLR1 variant without pathological significance. Homozygosity mapping failed to detect a single autosomal recessive gene. WES revealed an autosomal recessive polynucleotide kinase 3´phosphatase (PNKP) gene abnormality in the siblings with polyneuropathy. Infantile-onset CFD was characterized by serum FR autoantibodies as its predominant pathology whereas pathogenic FOLR1 gene mutations were absent. Homozygosity mapping excluded autosomal recessive inheritance of any single responsible gene. WES in one consanguineous family identified a PNKP gene abnormality that explained the polyneuropathy and also its contribution to the infantile CFD syndrome because the PNKP gene plays a dual role in both neurodevelopment and immune-regulatory function. Further research for candidate genes predisposing to FRα-autoimmunity is suggested to include X-chromosomal and non-coding DNA regions. Copyright © 2018 Elsevier Inc. All rights reserved.

  12. Multiplexed pyrosequencing of nine sea anemone (Cnidaria: Anthozoa: Hexacorallia: Actiniaria) mitochondrial genomes.

    PubMed

    Foox, Jonathan; Brugler, Mercer; Siddall, Mark Edward; Rodríguez, Estefanía

    2016-07-01

    Six complete and three partial actiniarian mitochondrial genomes were amplified in two semi-circles using long-range PCR and pyrosequenced in a single run on a 454 GS Junior, doubling the number of complete mitogenomes available within the order. Typical metazoan mtDNA features included circularity, 13 protein-coding genes, 2 ribosomal RNA genes, and length ranging from 17,498 to 19,727 bp. Several typical anthozoan mitochondrial genome features were also observed including the presence of only two transfer RNA genes, elevated A + T richness ranging from 54.9 to 62.4%, large intergenic regions, and group 1 introns interrupting NADH dehydrogenase subunit 5 and cytochrome c oxidase subunit I, the latter of which possesses a homing endonuclease gene. Within the sea anemone Alicia sansibarensis, we report the first mitochondrial gene order rearrangement within the Actiniaria, as well as putative novel non-canonical protein-coding genes. Phylogenetic analyses of all 13 protein-coding and 2 ribosomal genes largely corroborated current hypotheses of sea anemone interrelatedness, with a few lower-level differences.

  13. Selection and environmental adaptation along a path to speciation in the Tibetan frog Nanorana parkeri.

    PubMed

    Wang, Guo-Dong; Zhang, Bao-Lin; Zhou, Wei-Wei; Li, Yong-Xin; Jin, Jie-Qiong; Shao, Yong; Yang, He-Chuan; Liu, Yan-Hu; Yan, Fang; Chen, Hong-Man; Jin, Li; Gao, Feng; Zhang, Yaoguang; Li, Haipeng; Mao, Bingyu; Murphy, Robert W; Wake, David B; Zhang, Ya-Ping; Che, Jing

    2018-05-29

    Tibetan frogs, Nanorana parkeri , are differentiated genetically but not morphologically along geographical and elevational gradients in a challenging environment, presenting a unique opportunity to investigate processes leading to speciation. Analyses of whole genomes of 63 frogs reveal population structuring and historical demography, characterized by highly restricted gene flow in a narrow geographic zone lying between matrilines West (W) and East (E). A population found only along a single tributary of the Yalu Zangbu River has the mitogenome only of E, whereas nuclear genes of W comprise 89-95% of the nuclear genome. Selection accounts for 579 broadly scattered, highly divergent regions (HDRs) of the genome, which involve 365 genes. These genes fall into 51 gene ontology (GO) functional classes, 14 of which are likely to be important in driving reproductive isolation. GO enrichment analyses of E reveal many overrepresented functional categories associated with adaptation to high elevations, including blood circulation, response to hypoxia, and UV radiation. Four genes, including DNAJC8 in the brain, TNNC1 and ADORA1 in the heart, and LAMB3 in the lung, differ in levels of expression between low- and high-elevation populations. High-altitude adaptation plays an important role in maintaining and driving continuing divergence and reproductive isolation. Use of total genomes enabled recognition of selection and adaptation in and between populations, as well as documentation of evolution along a stepped cline toward speciation. Copyright © 2018 the Author(s). Published by PNAS.

  14. Association of the mu-opioid receptor gene with type 2 diabetes mellitus in an African American population.

    PubMed

    Gallagher, Carla J; Gordon, Candace J; Langefeld, Carl D; Mychaleckyj, Josyf C; Freedman, Barry I; Rich, Stephen S; Bowden, Donald W; Sale, Michèle M

    2006-01-01

    African Americans (AA) are at increased risk for developing type 2 diabetes mellitus (T2DM) relative to European Americans. We previously detected linkage of T2DM to 6q24-q27 (LOD 2.26) at 163.5 cM, closest to marker D6S1035, in a genome-wide scan of AA families. The mu-opioid receptor gene (OPRM1) is located within the LOD-1 support interval of this linkage peak. OPRM1 is an attractive positional candidate gene for T2DM susceptibility since agonists of OPRM1 affect glucose-induced insulin release and OPRM1 knockout mice have a more rapid induction of insulin resistance than wild-type. Twenty-two SNPs in this gene, at an average spacing of 3.9 kb, were genotyped in 380 AA T2DM cases and 276 AA controls. In single SNP association analyses, rs648007 demonstrated significant evidence of association with T2DM (P=0.013). Four blocks of high linkage disequilibrium were detected across the OPRM1 gene. Association analyses of haplotypes in each of these blocks revealed two haplotype blocks with significant overall P values (P=0.007 and 0.046). Significant, but rare, risk and protective haplotypes were identified as driving these associations with T2DM (P=0.034-0.047). These associations suggest that the OPRM1 gene plays a role in T2DM susceptibility in African Americans.

  15. Diversity and function in microbial mats from the Lucky Strike hydrothermal vent field.

    PubMed

    Crépeau, Valentin; Cambon Bonavita, Marie-Anne; Lesongeur, Françoise; Randrianalivelo, Henintsoa; Sarradin, Pierre-Marie; Sarrazin, Jozée; Godfroy, Anne

    2011-06-01

    Diversity and function in microbial mats from the Lucky Strike hydrothermal vent field (Mid-Atlantic Ridge) were investigated using molecular approaches. DNA and RNA were extracted from mat samples overlaying hydrothermal deposits and Bathymodiolus azoricus mussel assemblages. We constructed and analyzed libraries of 16S rRNA gene sequences and sequences of functional genes involved in autotrophic carbon fixation [forms I and II RuBisCO (cbbL/M), ATP-citrate lyase B (aclB)]; methane oxidation [particulate methane monooxygenase (pmoA)] and sulfur oxidation [adenosine-5'-phosphosulfate reductase (aprA) and soxB]. To gain new insights into the relationships between mats and mussels, we also used new domain-specific 16S rRNA gene primers targeting Bathymodiolus sp. symbionts. All identified archaeal sequences were affiliated with a single group: the marine group 1 Thaumarchaeota. In contrast, analyses of bacterial sequences revealed much higher diversity, although two phyla Proteobacteria and Bacteroidetes were largely dominant. The 16S rRNA gene sequence library revealed that species affiliated to Beggiatoa Gammaproteobacteria were the dominant active population. Analyses of DNA and RNA functional gene libraries revealed a diverse and active chemolithoautotrophic population. Most of these sequences were affiliated with Gammaproteobacteria, including hydrothermal fauna symbionts, Thiotrichales and Methylococcales. PCR and reverse transcription-PCR using 16S rRNA gene primers targeted to Bathymodiolus sp. symbionts revealed sequences affiliated with both methanotrophic and thiotrophic endosymbionts. © 2011 Federation of European Microbiological Societies. Published by Blackwell Publishing Ltd. All rights reserved.

  16. Combining Genome Wide Association Study and lung eQTL analysis provides evidence for novel genes associated with asthma

    PubMed Central

    Nieuwenhuis, Maartje A.; Siedlinski, Matteusz; van den Berge, Maarten; Granell, Raquel; Li, Xingnan; Niens, Marijke; van der Vlies, Pieter; Altmüller, Janine; Nürnberg, Peter; Kerkhof, Marjan; van Schayck, Onno C.; Riemersma, Ronald A.; van der Molen, Thys; de Monchy, Jan G.; Bossé, Yohan; Sandford, Andrew; Bruijnzeel-Koomen, Carla A.; van Wijk, Roy G.; ten Hacken, Nick H.; Timens, Wim; Boezen, H. Marike; Henderson, John; Kabesch, Michael; Vonk, Judith M.; Postma, Dirkje S.; Koppelman, Gerard H.

    2016-01-01

    Background Genome wide association studies (GWAS) of asthma have identified single nucleotide polymorphisms (SNPs) that modestly increase the risk for asthma. This could be due to phenotypic heterogeneity of asthma. Bronchial hyperresponsiveness (BHR) is a phenotypic hallmark of asthma. We aim to identify susceptibility genes for asthma combined with BHR and analyse the presence of cis-eQTLs among replicated SNPs. Secondly, we compare the genetic association of SNPs previously associated with (doctor diagnosed) asthma to our GWAS of asthma with BHR. Methods A GWAS was performed in 920 asthmatics with BHR and 980 controls. Top SNPs of our GWAS were analysed in four replication cohorts and lung cis-eQTL analysis was performed on replicated SNPs. We investigated association of SNPs previously associated with asthma in our data. Results 368 SNPs were followed up for replication. Six SNPs in genes encoding ABI3BP, NAF1, MICA and the 17q21 locus replicated in one or more cohorts, with one locus (17q21) achieving genome wide significance after meta-analysis. Five out of 6 replicated SNPs regulated 35 gene transcripts in whole lung. Eight of 20 asthma associated SNPs from previous GWAS were significantly associated with asthma and BHR. Three SNPs, in IL-33 and GSDMB, showed larger effect sizes in our data compared to published literature. Conclusions Combining GWAS with subsequent lung eQTL analysis revealed disease associated SNPs regulating lung mRNA expression levels of potential new asthma genes. Adding BHR to the asthma definition does not lead to an overall larger genetic effect size than analysing (doctor’s diagnosed) asthma. PMID:27439200

  17. Investigation of exomic variants associated with overall survival in ovarian cancer

    PubMed Central

    Ann Chen, Yian; Larson, Melissa C; Fogarty, Zachary C; Earp, Madalene A; Anton-Culver, Hoda; Bandera, Elisa V; Cramer, Daniel; Doherty, Jennifer A; Goodman, Marc T; Gronwald, Jacek; Karlan, Beth Y; Kjaer, Susanne K; Levine, Douglas A; Menon, Usha; Ness, Roberta B; Pearce, Celeste L; Pejovic, Tanja; Rossing, Mary Anne; Wentzensen, Nicolas; Bean, Yukie T; Bisogna, Maria; Brinton, Louise A; Carney, Michael E; Cunningham, Julie M; Cybulski, Cezary; deFazio, Anna; Dicks, Ed M; Edwards, Robert P; Gayther, Simon A; Gentry-Maharaj, Aleksandra; Gore, Martin; Iversen, Edwin S; Jensen, Allan; Johnatty, Sharon E; Lester, Jenny; Lin, Hui-Yi; Lissowska, Jolanta; Lubinski, Jan; Menkiszak, Janusz; Modugno, Francesmary; Moysich, Kirsten B; Orlow, Irene; Pike, Malcolm C; Ramus, Susan J; Song, Honglin; Terry, Kathryn L; Thompson, Pamela J; Tyrer, Jonathan P; van den Berg, David J; Vierkant, Robert A; Vitonis, Allison F; Walsh, Christine; Wilkens, Lynne R; Wu, Anna H; Yang, Hannah; Ziogas, Argyrios; Berchuck, Andrew; Chenevix-Trench, Georgia; Schildkraut, Joellen M; Permuth-Wey, Jennifer; Phelan, Catherine M; Pharoah, Paul D P; Fridley, Brooke L

    2016-01-01

    Background While numerous susceptibility loci for epithelial ovarian cancer (EOC) have been identified, few associations have been reported with overall survival. In the absence of common prognostic genetic markers, we hypothesize that rare coding variants may be associated with overall EOC survival and assessed their contribution in two exome-based genotyping projects of the Ovarian Cancer Association Consortium (OCAC). Methods The primary patient set (Set 1) included 14 independent EOC studies (4293 patients) and 227,892 variants, and a secondary patient set (Set 2) included six additional EOC studies (1744 patients) and 114,620 variants. Because power to detect rare variants individually is reduced, gene-level tests were conducted. Sets were analyzed separately at individual variants and by gene, and then combined with meta-analyses (73,203 variants and 13,163 genes overlapped). Results No individual variant reached genome-wide statistical significance. A SNP previously implicated to be associated with EOC risk and, to a lesser extent, survival, rs8170, showed the strongest evidence of association with survival and similar effect size estimates across sets (Pmeta=1.1E-6, HRSet1=1.17, HRSet2=1.14). Rare variants in ATG2B, an autophagy gene important for apoptosis, were significantly associated with survival after multiple testing correction (Pmeta=1.1E-6; Pcorrected=0.01). Conclusions Common variant rs8170 and rare variants in ATG2B may be associated with EOC overall survival, although further study is needed. Impact This study represents the first exome-wide association study of EOC survival to include rare variant analyses, and suggests that complementary single variant and gene-level analyses in large studies are needed to identify rare variants that warrant follow-up study. PMID:26747452

  18. Truncated Photosystem Chlorophyll Antenna Size in the Green Microalga Chlamydomonas reinhardtii upon Deletion of the TLA3-CpSRP43 Gene1[C][W][OA

    PubMed Central

    Kirst, Henning; Garcia-Cerdan, Jose Gines; Zurbriggen, Andreas; Ruehle, Thilo; Melis, Anastasios

    2012-01-01

    The truncated light-harvesting antenna size3 (tla3) DNA insertional transformant of Chlamydomonas reinhardtii is a chlorophyll-deficient mutant with a lighter green phenotype, a lower chlorophyll (Chl) per cell content, and higher Chl a/b ratio than corresponding wild-type strains. Functional analyses revealed a higher intensity for the saturation of photosynthesis and greater light-saturated photosynthetic activity in the tla3 mutant than in the wild type and a Chl antenna size of the photosystems that was only about 40% of that in the wild type. Sodium dodecyl sulfate-polyacrylamide gel electrophoresis and western-blot analyses showed that the tla3 strain was deficient in the Chl a/b light-harvesting complex. Molecular and genetic analyses revealed a single plasmid insertion in chromosome 4 of the tla3 nuclear genome, causing deletion of predicted gene g5047 and plasmid insertion within the fourth intron of downstream-predicted gene g5046. Complementation studies defined that gene g5047 alone was necessary and sufficient to rescue the tla3 mutation. Gene g5047 encodes a C. reinhardtii homolog of the chloroplast-localized SRP43 signal recognition particle, whose occurrence and function in green microalgae has not hitherto been investigated. Biochemical analysis showed that the nucleus-encoded and chloroplast-localized CrCpSRP43 protein specifically operates in the assembly of the peripheral components of the Chl a/b light-harvesting antenna. This work demonstrates that cpsrp43 deletion in green microalgae can be employed to generate tla mutants with a substantially diminished Chl antenna size. The latter exhibit improved solar energy conversion efficiency and photosynthetic productivity under mass culture and bright sunlight conditions. PMID:23043081

  19. The first complete chloroplast genome sequence of a lycophyte,Huperzia lucidula (Lycopodiaceae)

    DOE Office of Scientific and Technical Information (OSTI.GOV)

    Wolf, Paul G.; Karol, Kenneth G.; Mandoli, Dina F.

    2005-02-01

    We used a unique combination of techniques to sequence the first complete chloroplast genome of a lycophyte, Huperzia lucidula. This plant belongs to a significant clade hypothesized to represent the sister group to all other vascular plants. We used fluorescence-activated cell sorting (FACS) to isolate the organelles, rolling circle amplification (RCA) to amplify the genome, and shotgun sequencing to 8x depth coverage to obtain the complete chloroplast genome sequence. The genome is 154,373bp, containing inverted repeats of 15,314 bp each, a large single-copy region of 104,088 bp, and a small single-copy region of 19,671 bp. Gene order is more similarmore » to those of mosses, liverworts, and hornworts than to gene order for other vascular plants. For example, the Huperziachloroplast genome possesses the bryophyte gene order for a previously characterized 30 kb inversion, thus supporting the hypothesis that lycophytes are sister to all other extant vascular plants. The lycophytechloroplast genome data also enable a better reconstruction of the basaltracheophyte genome, which is useful for inferring relationships among bryophyte lineages. Several unique characters are observed in Huperzia, such as movement of the gene ndhF from the small single copy region into the inverted repeat. We present several analyses of evolutionary relationships among land plants by using nucleotide data, amino acid sequences, and by comparing gene arrangements from chloroplast genomes. The results, while still tentative pending the large number of chloroplast genomes from other key lineages that are soon to be sequenced, are intriguing in themselves, and contribute to a growing comparative database of genomic and morphological data across the green plants.« less

  20. Ex vivo gene editing of the dystrophin gene in muscle stem cells mediated by peptide nucleic acid single stranded oligodeoxynucleotides induces stable expression of dystrophin in a mouse model for Duchenne muscular dystrophy.

    PubMed

    Nik-Ahd, Farnoosh; Bertoni, Carmen

    2014-07-01

    Duchenne muscular dystrophy (DMD) is a fatal disease caused by mutations in the dystrophin gene, which result in the complete absence of dystrophin protein throughout the body. Gene correction strategies hold promise to treating DMD. Our laboratory has previously demonstrated the ability of peptide nucleic acid single-stranded oligodeoxynucleotides (PNA-ssODNs) to permanently correct single-point mutations at the genomic level. In this study, we show that PNA-ssODNs can target and correct muscle satellite cells (SCs), a population of stem cells capable of self-renewing and differentiating into muscle fibers. When transplanted into skeletal muscles, SCs transfected with correcting PNA-ssODNs were able to engraft and to restore dystrophin expression. The number of dystrophin-positive fibers was shown to significantly increase over time. Expression was confirmed to be the result of the activation of a subpopulation of SCs that had undergone repair as demonstrated by immunofluorescence analyses of engrafted muscles using antibodies specific to full-length dystrophin transcripts and by genomic DNA analysis of dystrophin-positive fibers. Furthermore, the increase in dystrophin expression detected over time resulted in a significant improvement in muscle morphology. The ability of transplanted cells to return into quiescence and to activate upon demand was confirmed in all engrafted muscles following injury. These results demonstrate the feasibility of using gene editing strategies to target and correct SCs and further establish the therapeutic potential of this approach to permanently restore dystrophin expression into muscle of DMD patients. © 2014 AlphaMed Press.

  1. Variable association of reactive intermediate genes with systemic lupus erythematosus in populations with different African ancestry.

    PubMed

    Ramos, Paula S; Oates, James C; Kamen, Diane L; Williams, Adrienne H; Gaffney, Patrick M; Kelly, Jennifer A; Kaufman, Kenneth M; Kimberly, Robert P; Niewold, Timothy B; Jacob, Chaim O; Tsao, Betty P; Alarcón, Graciela S; Brown, Elizabeth E; Edberg, Jeffrey C; Petri, Michelle A; Ramsey-Goldman, Rosalind; Reveille, John D; Vilá, Luis M; James, Judith A; Guthridge, Joel M; Merrill, Joan T; Boackle, Susan A; Freedman, Barry I; Scofield, R Hal; Stevens, Anne M; Vyse, Timothy J; Criswell, Lindsey A; Moser, Kathy L; Alarcón-Riquelme, Marta E; Langefeld, Carl D; Harley, John B; Gilkeson, Gary S

    2013-06-01

    Little is known about the genetic etiology of systemic lupus erythematosus (SLE) in individuals of African ancestry, despite its higher prevalence and greater disease severity. Overproduction of nitric oxide (NO) and reactive oxygen species are implicated in the pathogenesis and severity of SLE, making NO synthases and other reactive intermediate-related genes biological candidates for disease susceptibility. We analyzed variation in reactive intermediate genes for association with SLE in 2 populations with African ancestry. A total of 244 single-nucleotide polymorphisms (SNP) from 53 regions were analyzed in non-Gullah African Americans (AA; 1432 cases and 1687 controls) and the genetically more homogeneous Gullah of the Sea Islands of South Carolina (133 cases and 112 controls). Single-marker, haplotype, and 2-locus interaction tests were computed for these populations. The glutathione reductase gene GSR (rs2253409; p = 0.0014, OR 1.26, 95% CI 1.09-1.44) was the most significant single SNP association in AA. In the Gullah, the NADH dehydrogenase NDUFS4 (rs381575; p = 0.0065, OR 2.10, 95% CI 1.23-3.59) and NO synthase gene NOS1 (rs561712; p = 0.0072, OR 0.62, 95% CI 0.44-0.88) were most strongly associated with SLE. When both populations were analyzed together, GSR remained the most significant effect (rs2253409; p = 0.00072, OR 1.26, 95% CI 1.10-1.44). Haplotype and 2-locus interaction analyses also uncovered different loci in each population. These results suggest distinct patterns of association with SLE in African-derived populations; specific loci may be more strongly associated within select population groups.

  2. Entropy Based Genetic Association Tests and Gene-Gene Interaction Tests

    PubMed Central

    de Andrade, Mariza; Wang, Xin

    2011-01-01

    In the past few years, several entropy-based tests have been proposed for testing either single SNP association or gene-gene interaction. These tests are mainly based on Shannon entropy and have higher statistical power when compared to standard χ2 tests. In this paper, we extend some of these tests using a more generalized entropy definition, Rényi entropy, where Shannon entropy is a special case of order 1. The order λ (>0) of Rényi entropy weights the events (genotype/haplotype) according to their probabilities (frequencies). Higher λ places more emphasis on higher probability events while smaller λ (close to 0) tends to assign weights more equally. Thus, by properly choosing the λ, one can potentially increase the power of the tests or the p-value level of significance. We conducted simulation as well as real data analyses to assess the impact of the order λ and the performance of these generalized tests. The results showed that for dominant model the order 2 test was more powerful and for multiplicative model the order 1 or 2 had similar power. The analyses indicate that the choice of λ depends on the underlying genetic model and Shannon entropy is not necessarily the most powerful entropy measure for constructing genetic association or interaction tests. PMID:23089811

  3. The differentiation of tuna (family: Scombridae) products through the PCR-based analysis of the cytochrome b gene and parvalbumin introns.

    PubMed

    Abdullah, Asadatun; Rehbein, Hartmut

    2016-01-30

    In spite of the many studies performed over the years, there are still problems in the authentication of closely related tuna species, not only for canned fish but also for raw products. With the aim of providing screening methods to identify different tuna species and related scombrids, segments of mitochondrial cytochrome b (cyt b) and nuclear parvalbumin genes were amplified and sequenced or subjected to single-strand conformation polymorphism (SSCP) and restriction fragment length polymorphism (RFLP) analyses. The nucleotide diagnostic sites in the cyt b gene of five tuna species from Indonesia were determined in this study and used to construct a phylogenetic tree. In addition, the suitability of the nuclear gene that encodes parvalbumin for the differentiation of tuna species was determined by SSCP and RFLP analyses of an intron segment. RFLP differentiated Thunnus albacares and from T. obesus, and fish species in the Thunnus genus could be distinguished from bullet tuna (Auxis rochei) by SSCP. Parvalbumin-based polymerase chain reaction systems could serve as an additional tool in the detection and identification of tuna and other Scombridae fish species for routine seafood control. This reaction can be performed in addition to the cyt b analysis as previously described. © 2015 Society of Chemical Industry.

  4. Frequent heteroplasmy and recombination in the mitochondrial genomes of the basidiomycete mushroom Thelephora ganbajun.

    PubMed

    Wang, Pengfei; Sha, Tao; Zhang, Yunrun; Cao, Yang; Mi, Fei; Liu, Cunli; Yang, Dan; Tang, Xiaozhao; He, Xiaoxia; Dong, Jianyong; Wu, Jinyan; Yoell, Shanze; Yoell, Liam; Zhang, Ke-Qin; Zhang, Ying; Xu, Jianping

    2017-05-09

    In the majority of sexual eukaryotes, the mitochondrial genomes are inherited uniparentally. As a result, individual organisms are homoplasmic, containing mitochondrial DNA (mtDNA) from a single parent. Here we analyzed the mitochondrial genotypes in Clade I of the gourmet mushroom Thelephora ganbajun from its broad geographic distribution range. A total of 299 isolates from 28 geographic locations were sequenced at three mitochondrial loci: the mitochondrial small ribosomal RNA gene, and the cytochrome c oxidase subunits I (COX1) and III (COX3) genes. Quantitative PCR analyses showed that the strains had about 60-160 copies of mitochondrial genomes per cell. Interestingly, while no evidence of heteroplasmy was found at the 12S rRNA gene, 262 of the 299 isolates had clear evidence of heterogeneity at either the COX1 (261 isolates) or COX3 (12 isolates) gene fragments. The COX1 heteroplasmy was characterized by two types of introns residing at different sites of the same region and at different frequencies among the isolates. Allelic association analyses of the observed mitochondrial polymorphic nucleotide sites suggest that mtDNA recombination is common in natural populations of this fungus. Our results contrast the prevailing view that heteroplasmy, if exists, is only transient in basidiomycete fungi.

  5. Genome-wide Association Study of a Quantitative Disordered Gambling Trait

    PubMed Central

    Lind, Penelope A.; Zhu, Gu; Montgomery, Grant W; Madden, Pamela A.F.; Heath, Andrew C.; Martin, Nicholas G.; Slutske, Wendy S.

    2012-01-01

    Disordered gambling is a moderately heritable trait, but the underlying genetic basis is largely unknown. We performed a genome-wide association study (GWAS) for disordered gambling using a quantitative factor score in 1,312 twins from 894 Australian families. Association was conducted for 2,381,914 single nucleotide polymorphisms (SNPs) using the family-based association test in Merlin followed by gene and pathway enrichment analyses. Although no SNP reached genome-wide significance, six achieved P-values < 1 × 10−5 with variants in three genes (MT1X, ATXN1 and VLDLR) implicated in disordered gambling. Secondary case-control analyses found two SNPs on chromosome 9 (rs1106076 and rs12305135 near VLDLR) and rs10812227 near FZD10 on chromosome 12 to be significantly associated with lifetime DSM-IV pathological gambling and SOGS classified probable pathological gambling status. Furthermore, several addiction-related pathways were enriched for SNPs associated with disordered gambling. Finally, gene-based analysis of 24 candidate genes for dopamine agonist induced gambling in individuals with Parkinson’s disease suggested an enrichment of SNPs associated with disordered gambling. We report the first GWAS of disordered gambling. While further replication is required, the identification of susceptibility loci and biological pathways will be important in characterizing the biological mechanisms that underpin disordered gambling. PMID:22780124

  6. Exploring the roles of DNA methylation in the metal-reducing bacterium Shewanella oneidensis MR-1

    DOE Office of Scientific and Technical Information (OSTI.GOV)

    Bendall, Matthew L.; Luong, Khai; Wetmore, Kelly M.

    2013-08-30

    We performed whole genome analyses of DNA methylation in Shewanella 17 oneidensis MR-1 to examine its possible role in regulating gene expression and 18 other cellular processes. Single-Molecule Real Time (SMRT) sequencing 19 revealed extensive methylation of adenine (N6mA) throughout the 20 genome. These methylated bases were located in five sequence motifs, 21 including three novel targets for Type I restriction/modification enzymes. The 22 sequence motifs targeted by putative methyltranferases were determined via 23 SMRT sequencing of gene knockout mutants. In addition, we found S. 24 oneidensis MR-1 cultures grown under various culture conditions displayed 25 different DNA methylation patterns.more » However, the small number of differentially 26 methylated sites could not be directly linked to the much larger number of 27 differentially expressed genes in these conditions, suggesting DNA methylation is 28 not a major regulator of gene expression in S. oneidensis MR-1. The enrichment 29 of methylated GATC motifs in the origin of replication indicate DNA methylation 30 may regulate genome replication in a manner similar to that seen in Escherichia 31 coli. Furthermore, comparative analyses suggest that many 32 Gammaproteobacteria, including all members of the Shewanellaceae family, may 33 also utilize DNA methylation to regulate genome replication.« less

  7. Single Nucleotide Polymorphisms of Stemness Genes Predicted to Regulate RNA Splicing, microRNA and Oncogenic Signaling are Associated with Prostate Cancer Survival.

    PubMed

    Freedman, Jennifer A; Wang, Yanru; Li, Xuechan; Liu, Hongliang; Moorman, Patricia G; George, Daniel J; Lee, Norman H; Hyslop, Terry; Wei, Qingyi; Patierno, Steven R

    2018-05-03

    Prostate cancer is a clinically and molecularly heterogeneous disease, with variation in outcomes only partially predicted by grade and stage. Additional tools to distinguish indolent from aggressive disease are needed. Phenotypic characteristics of stemness correlate with poor cancer prognosis. Given this correlation, we identified single nucleotide polymorphisms (SNPs) of stemness-related genes and examined their associations with prostate cancer survival. SNPs within stemness-related genes were analyzed for association with overall survival of prostate cancer in the Prostate, Lung, Colorectal and Ovarian Cancer Screening Trial. Significant SNPs predicted to be functional were selected for linkage disequilibrium analysis and combined and stratified analyses. Identified SNPs were evaluated for association with gene expression. SNPs of CD44 (rs9666607), ABCC1 (rs35605 and rs212091) and GDF15 (rs1058587) were associated with prostate cancer survival and predicted to be functional. A role for rs9666607 of CD44 and rs35605 of ABCC1 in RNA splicing regulation, rs212091 of ABCC1 in miRNA binding site activity and rs1058587 of GDF15 in causing an amino acid change was predicted. These SNPs represent potential novel prognostic markers for overall survival of prostate cancer and support a contribution of the stemness pathway to prostate cancer patient outcome.

  8. Single and multiple phenotype QTL analyses of downy mildew resistance in interspecific grapevines.

    PubMed

    Divilov, Konstantin; Barba, Paola; Cadle-Davidson, Lance; Reisch, Bruce I

    2018-05-01

    Downy mildew resistance across days post-inoculation, experiments, and years in two interspecific grapevine F 1 families was investigated using linear mixed models and Bayesian networks, and five new QTL were identified. Breeding grapevines for downy mildew disease resistance has traditionally relied on qualitative gene resistance, which can be overcome by pathogen evolution. Analyzing two interspecific F 1 families, both having ancestry derived from Vitis vinifera and wild North American Vitis species, across 2 years and multiple experiments, we found multiple loci associated with downy mildew sporulation and hypersensitive response in both families using a single phenotype model. The loci explained between 7 and 17% of the variance for either phenotype, suggesting a complex genetic architecture for these traits in the two families studied. For two loci, we used RNA-Seq to detect differentially transcribed genes and found that the candidate genes at these loci were likely not NBS-LRR genes. Additionally, using a multiple phenotype Bayesian network analysis, we found effects between the leaf trichome density, hypersensitive response, and sporulation phenotypes. Moderate-high heritabilities were found for all three phenotypes, suggesting that selection for downy mildew resistance is an achievable goal by breeding for either physical- or non-physical-based resistance mechanisms, with the combination of the two possibly providing durable resistance.

  9. Genetic regulation of gene expression in the lung identifies CST3 and CD22 as potential causal genes for airflow obstruction.

    PubMed

    Lamontagne, Maxime; Timens, Wim; Hao, Ke; Bossé, Yohan; Laviolette, Michel; Steiling, Katrina; Campbell, Joshua D; Couture, Christian; Conti, Massimo; Sherwood, Karen; Hogg, James C; Brandsma, Corry-Anke; van den Berge, Maarten; Sandford, Andrew; Lam, Stephen; Lenburg, Marc E; Spira, Avrum; Paré, Peter D; Nickle, David; Sin, Don D; Postma, Dirkje S

    2014-11-01

    COPD is a complex chronic disease with poorly understood pathogenesis. Integrative genomic approaches have the potential to elucidate the biological networks underlying COPD and lung function. We recently combined genome-wide genotyping and gene expression in 1111 human lung specimens to map expression quantitative trait loci (eQTL). To determine causal associations between COPD and lung function-associated single nucleotide polymorphisms (SNPs) and lung tissue gene expression changes in our lung eQTL dataset. We evaluated causality between SNPs and gene expression for three COPD phenotypes: FEV(1)% predicted, FEV(1)/FVC and COPD as a categorical variable. Different models were assessed in the three cohorts independently and in a meta-analysis. SNPs associated with a COPD phenotype and gene expression were subjected to causal pathway modelling and manual curation. In silico analyses evaluated functional enrichment of biological pathways among newly identified causal genes. Biologically relevant causal genes were validated in two separate gene expression datasets of lung tissues and bronchial airway brushings. High reliability causal relations were found in SNP-mRNA-phenotype triplets for FEV(1)% predicted (n=169) and FEV(1)/FVC (n=80). Several genes of potential biological relevance for COPD were revealed. eQTL-SNPs upregulating cystatin C (CST3) and CD22 were associated with worse lung function. Signalling pathways enriched with causal genes included xenobiotic metabolism, apoptosis, protease-antiprotease and oxidant-antioxidant balance. By using integrative genomics and analysing the relationships of COPD phenotypes with SNPs and gene expression in lung tissue, we identified CST3 and CD22 as potential causal genes for airflow obstruction. This study also augmented the understanding of previously described COPD pathways. Published by the BMJ Publishing Group Limited. For permission to use (where not already granted under a licence) please go to http://group.bmj.com/group/rights-licensing/permissions.

  10. Monitoring transcription initiation activities in rat and dog.

    PubMed

    Lizio, Marina; Mukarram, Abdul Kadir; Ohno, Mizuho; Watanabe, Shoko; Itoh, Masayoshi; Hasegawa, Akira; Lassmann, Timo; Severin, Jessica; Harshbarger, Jayson; Abugessaisa, Imad; Kasukawa, Takeya; Hon, Chung Chau; Carninci, Piero; Hayashizaki, Yoshihide; Forrest, Alistair R R; Kawaji, Hideya

    2017-11-28

    The promoter landscape of several non-human model organisms is far from complete. As a part of FANTOM5 data collection, we generated 13 profiles of transcription initiation activities in dog and rat aortic smooth muscle cells, mesenchymal stem cells and hepatocytes by employing CAGE (Cap Analysis of Gene Expression) technology combined with single molecule sequencing. Our analyses show that the CAGE profiles recapitulate known transcription start sites (TSSs) consistently, in addition to uncover novel TSSs. Our dataset can be thus used with high confidence to support gene annotation in dog and rat species. We identified 28,497 and 23,147 CAGE peaks, or promoter regions, for rat and dog respectively, and associated them to known genes. This approach could be seen as a standard method for improvement of existing gene models, as well as discovery of novel genes. Given that the FANTOM5 data collection includes dog and rat matched cell types in human and mouse as well, this data would also be useful for cross-species studies.

  11. Profiling the genome-wide DNA methylation pattern of porcine ovaries using reduced representation bisulfite sequencing.

    PubMed

    Yuan, Xiao-Long; Gao, Ning; Xing, Yan; Zhang, Hai-Bin; Zhang, Ai-Ling; Liu, Jing; He, Jin-Long; Xu, Yuan; Lin, Wen-Mian; Chen, Zan-Mou; Zhang, Hao; Zhang, Zhe; Li, Jia-Qi

    2016-02-25

    Substantial evidence has shown that DNA methylation regulates the initiation of ovarian and sexual maturation. Here, we investigated the genome-wide profile of DNA methylation in porcine ovaries at single-base resolution using reduced representation bisulfite sequencing. The biological variation was minimal among the three ovarian replicates. We found hypermethylation frequently occurred in regions with low gene abundance, while hypomethylation in regions with high gene abundance. The DNA methylation around transcriptional start sites was negatively correlated with their own CpG content. Additionally, the methylation level in the bodies of genes was higher than that in their 5' and 3' flanking regions. The DNA methylation pattern of the low CpG content promoter genes differed obviously from that of the high CpG content promoter genes. The DNA methylation level of the porcine ovary was higher than that of the porcine intestine. Analyses of the genome-wide DNA methylation in porcine ovaries would advance the knowledge and understanding of the porcine ovarian methylome.

  12. Molecular evolution of the plastid genome during diversification of the cotton genus.

    PubMed

    Chen, Zhiwen; Grover, Corrinne E; Li, Pengbo; Wang, Yumei; Nie, Hushuai; Zhao, Yanpeng; Wang, Meiyan; Liu, Fang; Zhou, Zhongli; Wang, Xingxing; Cai, Xiaoyan; Wang, Kunbo; Wendel, Jonathan F; Hua, Jinping

    2017-07-01

    Cotton (Gossypium spp.) is commonly grouped into eight diploid genomic groups, designated A-G and K, and one tetraploid genomic group, namely AD. To gain insight into the phylogeny of Gossypium and molecular evolution of the chloroplast genome duringdiversification, chloroplast genomes (cpDNA) from 6 D-genome and 2 G-genome species of Gossypium (G. armourianum D 2-1 , G. harknessii D 2-2 , G. davidsonii D 3-d , G. klotzschianum D 3-k , G. aridum D 4 , G. trilobum D 8 , and G. australe G 2 , G. nelsonii G 3 ) were newly reported here. In combination with the 26 previously released cpDNA sequences, we performed comparative phylogenetic analyses of 34 Gossypium chloroplast genomes that collectively represent most of the diversity in the genus. Gossypium chloroplasts span a small range in size that is mostly attributable to indels that occur in the large single copy (LSC) region of the genome. Phylogenetic analysis using a concatenation of all genes provides robust support for six major Gossypium clades, largely supporting earlier inferences but also revealing new information on intrageneric relationships. Using Theobroma cacao as an outgroup, diversification of the genus was dated, yielding results that are in accord with previous estimates of divergence times, but also offering new perspectives on the basal, early radiation of all major clades within the genus as well as gaps in the record indicative of extinctions. Like most higher-plant chloroplast genomes, all cotton species exhibit a conserved quadripartite structure, i.e., two large inverted repeats (IR) containing most of the ribosomal RNA genes, and two unique regions, LSC (large single sequence) and SSC (small single sequence). Within Gossypium, the IR-single copy region junctions are both variable and homoplasious among species. Two genes, accD and psaJ, exhibited greater rates of synonymous and non-synonymous substitutions than did other genes. Most genes exhibited Ka/Ks ratios suggestive of neutral evolution, with 8 exceptions distributed among one to several species. This research provides an overview of the molecular evolution of a single, large non-recombining molecular during the diversification of this important genus. Copyright © 2017 Elsevier Inc. All rights reserved.

  13. TGFβ Receptor 1: An Immune Susceptibility Gene in HPV-Associated Cancer

    PubMed Central

    Levovitz, Chaya; Chen, Dan; Ivansson, Emma; Gyllensten, Ulf; Finnigan, John P.; Alshawish, Sara; Zhang, Weijia; Schadt, Eric E.; Posner, Marshal R.; Genden, Eric M.; Boffetta, Paolo; Sikora, Andrew G.

    2015-01-01

    Only a minority of those exposed to human papillomavirus (HPV) develop HPV-related cervical and oropharyngeal cancer. Because host immunity affects infection and progression to cancer, we tested the hypothesis that genetic variation in immune-related genes is a determinant of susceptibility to oropharyngeal cancer and other HPV-associated cancers by performing a multitier integrative computational analysis with oropharyngeal cancer data from a head and neck cancer genome-wide association study (GWAS). Independent analyses, including single-gene, gene-interconnectivity, protein–protein interaction, gene expression, and pathway analysis, identified immune genes and pathways significantly associated with oropharyngeal cancer. TGFβR1, which intersected all tiers of analysis and thus selected for validation, replicated significantly in the head and neck cancer GWAS limited to HPV-seropositive cases and an independent cervical cancer GWAS. The TGFβR1 containing p38–MAPK pathway was significantly associated with oropharyngeal cancer and cervical cancer, and TGFβR1 was overexpressed in oropharyngeal cancer, cervical cancer, and HPV+ head and neck cancer tumors. These concordant analyses implicate TGFβR1 signaling as a process dysregulated across HPV-related cancers. This study demonstrates that genetic variation in immune-related genes is associated with susceptibility to oropharyngeal cancer and implicates TGFβR1/TGFβ signaling in the development of both oropharyngeal cancer and cervical cancer. Better understanding of the immunogenetic basis of susceptibility to HPV-associated cancers may provide insight into host/virus interactions and immune processes dysregulated in the minority of HPV-exposed individuals who progress to cancer. PMID:25273091

  14. The Hairless Stem Phenotype of Cotton (Gossypium barbadense) Is Linked to a Copia-Like Retrotransposon Insertion in a Homeodomain-Leucine Zipper Gene (HD1)

    PubMed Central

    Ding, Mingquan; Ye, Wuwei; Lin, Lifeng; He, Shae; Du, Xiongming; Chen, Aiqun; Cao, Yuefen; Qin, Yuan; Yang, Fen; Jiang, Yurong; Zhang, Hua; Wang, Xiyin; Paterson, Andrew H.; Rong, Junkang

    2015-01-01

    Cotton (Gossypium) stem trichomes are mostly single cells that arise from stem epidermal cells. In this study, a homeodomain-leucine zipper gene (HD1) was found to cosegregate with the dominant trichome locus previously designated as T1 and mapped to chromosome 6. Characterization of HD1 orthologs revealed that the absence of stem trichomes in modern Gossypium barbadense varieties is linked to a large retrotransposon insertion in the ninth exon, 2565 bp downstream from the initial codon in the At subgenome HD1 gene (At-GbHD1). In both the At and Dt subgenomes, reduced transcription of GbHD1 genes is caused by this insertion. The disruption of At-HD1 further affects the expression of downstream GbMYB25 and GbHOX3 genes. Analyses of primitive cultivated accessions identified another retrotransposon insertion event in the sixth exon of At-GbHD1 that might predate the previously identified retrotransposon in modern varieties. Although both retrotransposon insertions results in similar phenotypic changes, the timing of these two retrotransposon insertion events fits well with our current understanding of the history of cotton speciation and dispersal. Taken together, the results of genetics mapping, gene expression and association analyses suggest that GbHD1 is an important component that controls stem trichome development and is a promising candidate gene for the T1 locus. The interspecific phenotypic difference in stem trichome traits also may be attributable to HD1 inactivation associated with retrotransposon insertion. PMID:26133897

  15. Identification of T1D susceptibility genes within the MHC region by combining protein interaction networks and SNP genotyping data

    PubMed Central

    Brorsson, C.; Hansen, N. T.; Lage, K.; Bergholdt, R.; Brunak, S.; Pociot, F.

    2009-01-01

    Aim To develop novel methods for identifying new genes that contribute to the risk of developing type 1 diabetes within the Major Histocompatibility Complex (MHC) region on chromosome 6, independently of the known linkage disequilibrium (LD) between human leucocyte antigen (HLA)-DRB1, -DQA1, -DQB1 genes. Methods We have developed a novel method that combines single nucleotide polymorphism (SNP) genotyping data with protein–protein interaction (ppi) networks to identify disease-associated network modules enriched for proteins encoded from the MHC region. Approximately 2500 SNPs located in the 4 Mb MHC region were analysed in 1000 affected offspring trios generated by the Type 1 Diabetes Genetics Consortium (T1DGC). The most associated SNP in each gene was chosen and genes were mapped to ppi networks for identification of interaction partners. The association testing and resulting interacting protein modules were statistically evaluated using permutation. Results A total of 151 genes could be mapped to nodes within the protein interaction network and their interaction partners were identified. Five protein interaction modules reached statistical significance using this approach. The identified proteins are well known in the pathogenesis of T1D, but the modules also contain additional candidates that have been implicated in β-cell development and diabetic complications. Conclusions The extensive LD within the MHC region makes it important to develop new methods for analysing genotyping data for identification of additional risk genes for T1D. Combining genetic data with knowledge about functional pathways provides new insight into mechanisms underlying T1D. PMID:19143816

  16. Chromosome map of the thermophilic archaebacterium Thermococcus celer

    NASA Technical Reports Server (NTRS)

    Noll, K. M.; Woese, C. R. (Principal Investigator)

    1989-01-01

    A physical map for the chromosome of the thermophilic archaebacterium Thermococcus celer Vu13 has been constructed. Thirty-four restriction endonucleases were tested for their ability to generate large restriction fragments from the chromosome of T. celer. Of these, the enzymes NheI, SpeI, and XbaI yielded the fewest fragments when analyzed by pulsed-field electrophoresis. NheI and SpeI each gave 5 fragments, while XbaI gave 12. The size of the T. celer chromosome was determined from the sum of the apparent sizes of restriction fragments derived from single and double digests by using these enzymes and was found to be 1,890 +/- 27 kilobase pairs. Partial and complete digests allowed the order of all but three small (less than 15 kilobase pairs) fragments to be deduced. These three fragments were assigned positions by using hybridization probes derived from these restriction fragments. The positions of the other fragments were confirmed by using hybridization probes derived in the same manner. The positions of the 5S, 16S, and 23S rRNA genes as well as the 7S RNA gene were located on this map by using cloned portions of these genes as hybridization probes. The 5S rRNA gene was localized 48 to 196 kilobases from the 5' end of the 16S gene. The 7S RNA gene was localized 190 to 504 kilobases from the 3' end of the 23S gene. These analyses demonstrated that the chromosome of T. celer is a single, circular DNA molecule. This is the first such demonstration of the structure of an archaebacterial chromosome.

  17. Gene flow connects coastal populations of a habitat specialist, the Clapper Rail Rallus crepitans

    USGS Publications Warehouse

    Coster, Stephanie S.; Welsh, Amy B.; Costanzo, Gary R.; Harding, Sergio R.; Anderson, James T.; Katzner, Todd

    2018-01-01

    Examining population genetic structure can reveal patterns of reproductive isolation or population mixing and inform conservation management. Some avian species are predicted to exhibit minimal genetic differentiation among populations as a result of the species high mobility, with habitat specialists tending to show greater fine‐scale genetic structure. To explore the relationship between habitat specialization and gene flow, we investigated the genetic structure of a saltmarsh specialist with high potential mobility across a wide geographic range of fragmented habitat. Little variation among mitochondrial sequences (620 bp from ND2) was observed among 149 individual Clapper Rails Rallus crepitans sampled along the Atlantic coast of North America, with the majority of individuals at all sampling sites sharing a single haplotype. Genotyping of nine microsatellite loci across 136 individuals revealed moderate genetic diversity, no evidence of bottlenecks, and a weak pattern of genetic differentiation that increased with geographic distance. Multivariate analyses, Bayesian clustering and an AMOVA all suggested a lack of genetic structuring across the North American Atlantic coast, with all individuals grouped into a single interbreeding population. Spatial autocorrelation analyses showed evidence of weak female philopatry and a lack of male philopatry. We conclude that high gene flow connecting populations of this habitat specialist may result from the interaction of ecological and behavioral factors that promote dispersal and limit natal philopatry and breeding‐site fidelity. As climate change threatens saltmarshes, the genetic diversity and population connectivity of Clapper Rails may promote resilience of their populations. This finding helps inform about potential fates of other similarly behaving saltmarsh specialists on the Atlantic coast.

  18. The prevalence of terraced treescapes in analyses of phylogenetic data sets.

    PubMed

    Dobrin, Barbara H; Zwickl, Derrick J; Sanderson, Michael J

    2018-04-04

    The pattern of data availability in a phylogenetic data set may lead to the formation of terraces, collections of equally optimal trees. Terraces can arise in tree space if trees are scored with parsimony or with partitioned, edge-unlinked maximum likelihood. Theory predicts that terraces can be large, but their prevalence in contemporary data sets has never been surveyed. We selected 26 data sets and phylogenetic trees reported in recent literature and investigated the terraces to which the trees would belong, under a common set of inference assumptions. We examined terrace size as a function of the sampling properties of the data sets, including taxon coverage density (the proportion of taxon-by-gene positions with any data present) and a measure of gene sampling "sufficiency". We evaluated each data set in relation to the theoretical minimum gene sampling depth needed to reduce terrace size to a single tree, and explored the impact of the terraces found in replicate trees in bootstrap methods. Terraces were identified in nearly all data sets with taxon coverage densities < 0.90. They were not found, however, in high-coverage-density (i.e., ≥ 0.94) transcriptomic and genomic data sets. The terraces could be very large, and size varied inversely with taxon coverage density and with gene sampling sufficiency. Few data sets achieved a theoretical minimum gene sampling depth needed to reduce terrace size to a single tree. Terraces found during bootstrap resampling reduced overall support. If certain inference assumptions apply, trees estimated from empirical data sets often belong to large terraces of equally optimal trees. Terrace size correlates to data set sampling properties. Data sets seldom include enough genes to reduce terrace size to one tree. When bootstrap replicate trees lie on a terrace, statistical support for phylogenetic hypotheses may be reduced. Although some of the published analyses surveyed were conducted with edge-linked inference models (which do not induce terraces), unlinked models have been used and advocated. The present study describes the potential impact of that inference assumption on phylogenetic inference in the context of the kinds of multigene data sets now widely assembled for large-scale tree construction.

  19. Three α-Subunits of Heterotrimeric G Proteins and an Adenylyl Cyclase Have Distinct Roles in Fruiting Body Development in the Homothallic Fungus Sordaria macrospora

    PubMed Central

    Kamerewerd, Jens; Jansson, Malin; Nowrousian, Minou; Pöggeler, Stefanie; Kück, Ulrich

    2008-01-01

    Sordaria macrospora, a self-fertile filamentous ascomycete, carries genes encoding three different α-subunits of heterotrimeric G proteins (gsa, G protein Sordaria alpha subunit). We generated knockout strains for all three gsa genes (Δgsa1, Δgsa2, and Δgsa3) as well as all combinations of double mutants. Phenotypic analysis of single and double mutants showed that the genes for Gα-subunits have distinct roles in the sexual life cycle. While single mutants show some reduction of fertility, double mutants Δgsa1Δgsa2 and Δgsa1Δgsa3 are completely sterile. To test whether the pheromone receptors PRE1 and PRE2 mediate signaling via distinct Gα-subunits, two recently generated Δpre strains were crossed with all Δgsa strains. Analyses of the corresponding double mutants revealed that compared to GSA2, GSA1 is a more predominant regulator of a signal transduction cascade downstream of the pheromone receptors and that GSA3 is involved in another signaling pathway that also contributes to fruiting body development and fertility. We further isolated the gene encoding adenylyl cyclase (AC) (sac1) for construction of a knockout strain. Analyses of the three ΔgsaΔsac1 double mutants and one Δgsa2Δgsa3Δsac1 triple mutant indicate that SAC1 acts downstream of GSA3, parallel to a GSA1–GSA2-mediated signaling pathway. In addition, the function of STE12 and PRO41, two presumptive signaling components, was investigated in diverse double mutants lacking those developmental genes in combination with the gsa genes. This analysis was further completed by expression studies of the ste12 and pro41 transcripts in wild-type and mutant strains. From the sum of all our data, we propose a model for how different Gα-subunits interact with pheromone receptors, adenylyl cyclase, and STE12 and thus cooperatively regulate sexual development in S. macrospora. PMID:18723884

  20. Identification of essential genes and synthetic lethal gene combinations in Escherichia coli K-12.

    PubMed

    Mori, Hirotada; Baba, Tomoya; Yokoyama, Katsushi; Takeuchi, Rikiya; Nomura, Wataru; Makishi, Kazuichi; Otsuka, Yuta; Dose, Hitomi; Wanner, Barry L

    2015-01-01

    Here we describe the systematic identification of single genes and gene pairs, whose knockout causes lethality in Escherichia coli K-12. During construction of precise single-gene knockout library of E. coli K-12, we identified 328 essential gene candidates for growth in complex (LB) medium. Upon establishment of the Keio single-gene deletion library, we undertook the development of the ASKA single-gene deletion library carrying a different antibiotic resistance. In addition, we developed tools for identification of synthetic lethal gene combinations by systematic construction of double-gene knockout mutants. We introduce these methods herein.

  1. With Reference to Reference Genes: A Systematic Review of Endogenous Controls in Gene Expression Studies.

    PubMed

    Chapman, Joanne R; Waldenström, Jonas

    2015-01-01

    The choice of reference genes that are stably expressed amongst treatment groups is a crucial step in real-time quantitative PCR gene expression studies. Recent guidelines have specified that a minimum of two validated reference genes should be used for normalisation. However, a quantitative review of the literature showed that the average number of reference genes used across all studies was 1.2. Thus, the vast majority of studies continue to use a single gene, with β-actin (ACTB) and/or glyceraldehyde 3-phosphate dehydrogenase (GAPDH) being commonly selected in studies of vertebrate gene expression. Few studies (15%) tested a panel of potential reference genes for stability of expression before using them to normalise data. Amongst studies specifically testing reference gene stability, few found ACTB or GAPDH to be optimal, whereby these genes were significantly less likely to be chosen when larger panels of potential reference genes were screened. Fewer reference genes were tested for stability in non-model organisms, presumably owing to a dearth of available primers in less well characterised species. Furthermore, the experimental conditions under which real-time quantitative PCR analyses were conducted had a large influence on the choice of reference genes, whereby different studies of rat brain tissue showed different reference genes to be the most stable. These results highlight the importance of validating the choice of normalising reference genes before conducting gene expression studies.

  2. DNA sequence polymorphisms within the bovine guanine nucleotide-binding protein Gs subunit alpha (Gsα)-encoding (GNAS) genomic imprinting domain are associated with performance traits.

    PubMed

    Sikora, Klaudia M; Magee, David A; Berkowicz, Erik W; Berry, Donagh P; Howard, Dawn J; Mullen, Michael P; Evans, Ross D; Machugh, David E; Spillane, Charles

    2011-01-07

    Genes which are epigenetically regulated via genomic imprinting can be potential targets for artificial selection during animal breeding. Indeed, imprinted loci have been shown to underlie some important quantitative traits in domestic mammals, most notably muscle mass and fat deposition. In this candidate gene study, we have identified novel associations between six validated single nucleotide polymorphisms (SNPs) spanning a 97.6 kb region within the bovine guanine nucleotide-binding protein Gs subunit alpha gene (GNAS) domain on bovine chromosome 13 and genetic merit for a range of performance traits in 848 progeny-tested Holstein-Friesian sires. The mammalian GNAS domain consists of a number of reciprocally-imprinted, alternatively-spliced genes which can play a major role in growth, development and disease in mice and humans. Based on the current annotation of the bovine GNAS domain, four of the SNPs analysed (rs43101491, rs43101493, rs43101485 and rs43101486) were located upstream of the GNAS gene, while one SNP (rs41694646) was located in the second intron of the GNAS gene. The final SNP (rs41694656) was located in the first exon of transcripts encoding the putative bovine neuroendocrine-specific protein NESP55, resulting in an aspartic acid-to-asparagine amino acid substitution at amino acid position 192. SNP genotype-phenotype association analyses indicate that the single intronic GNAS SNP (rs41694646) is associated (P ≤ 0.05) with a range of performance traits including milk yield, milk protein yield, the content of fat and protein in milk, culled cow carcass weight and progeny carcass conformation, measures of animal body size, direct calving difficulty (i.e. difficulty in calving due to the size of the calf) and gestation length. Association (P ≤ 0.01) with direct calving difficulty (i.e. due to calf size) and maternal calving difficulty (i.e. due to the maternal pelvic width size) was also observed at the rs43101491 SNP. Following adjustment for multiple-testing, significant association (q ≤ 0.05) remained between the rs41694646 SNP and four traits (animal stature, body depth, direct calving difficulty and milk yield) only. Notably, the single SNP in the bovine NESP55 gene (rs41694656) was associated (P ≤ 0.01) with somatic cell count--an often-cited indicator of resistance to mastitis and overall health status of the mammary system--and previous studies have demonstrated that the chromosomal region to where the GNAS domain maps underlies an important quantitative trait locus for this trait. This association, however, was not significant after adjustment for multiple testing. The three remaining SNPs assayed were not associated with any of the performance traits analysed in this study. Analysis of all pairwise linkage disequilibrium (r2) values suggests that most allele substitution effects for the assayed SNPs observed are independent. Finally, the polymorphic coding SNP in the putative bovine NESP55 gene was used to test the imprinting status of this gene across a range of foetal bovine tissues. Previous studies in other mammalian species have shown that DNA sequence variation within the imprinted GNAS gene cluster contributes to several physiological and metabolic disorders, including obesity in humans and mice. Similarly, the results presented here indicate an important role for the imprinted GNAS cluster in underlying complex performance traits in cattle such as animal growth, calving, fertility and health. These findings suggest that GNAS domain-associated polymorphisms may serve as important genetic markers for future livestock breeding programs and support previous studies that candidate imprinted loci may act as molecular targets for the genetic improvement of agricultural populations. In addition, we present new evidence that the bovine NESP55 gene is epigenetically regulated as a maternally expressed imprinted gene in placental and intestinal tissues from 8-10 week old bovine foetuses.

  3. DNA sequence polymorphisms within the bovine guanine nucleotide-binding protein Gs subunit alpha (Gsα)-encoding (GNAS) genomic imprinting domain are associated with performance traits

    PubMed Central

    2011-01-01

    Background Genes which are epigenetically regulated via genomic imprinting can be potential targets for artificial selection during animal breeding. Indeed, imprinted loci have been shown to underlie some important quantitative traits in domestic mammals, most notably muscle mass and fat deposition. In this candidate gene study, we have identified novel associations between six validated single nucleotide polymorphisms (SNPs) spanning a 97.6 kb region within the bovine guanine nucleotide-binding protein Gs subunit alpha gene (GNAS) domain on bovine chromosome 13 and genetic merit for a range of performance traits in 848 progeny-tested Holstein-Friesian sires. The mammalian GNAS domain consists of a number of reciprocally-imprinted, alternatively-spliced genes which can play a major role in growth, development and disease in mice and humans. Based on the current annotation of the bovine GNAS domain, four of the SNPs analysed (rs43101491, rs43101493, rs43101485 and rs43101486) were located upstream of the GNAS gene, while one SNP (rs41694646) was located in the second intron of the GNAS gene. The final SNP (rs41694656) was located in the first exon of transcripts encoding the putative bovine neuroendocrine-specific protein NESP55, resulting in an aspartic acid-to-asparagine amino acid substitution at amino acid position 192. Results SNP genotype-phenotype association analyses indicate that the single intronic GNAS SNP (rs41694646) is associated (P ≤ 0.05) with a range of performance traits including milk yield, milk protein yield, the content of fat and protein in milk, culled cow carcass weight and progeny carcass conformation, measures of animal body size, direct calving difficulty (i.e. difficulty in calving due to the size of the calf) and gestation length. Association (P ≤ 0.01) with direct calving difficulty (i.e. due to calf size) and maternal calving difficulty (i.e. due to the maternal pelvic width size) was also observed at the rs43101491 SNP. Following adjustment for multiple-testing, significant association (q ≤ 0.05) remained between the rs41694646 SNP and four traits (animal stature, body depth, direct calving difficulty and milk yield) only. Notably, the single SNP in the bovine NESP55 gene (rs41694656) was associated (P ≤ 0.01) with somatic cell count--an often-cited indicator of resistance to mastitis and overall health status of the mammary system--and previous studies have demonstrated that the chromosomal region to where the GNAS domain maps underlies an important quantitative trait locus for this trait. This association, however, was not significant after adjustment for multiple testing. The three remaining SNPs assayed were not associated with any of the performance traits analysed in this study. Analysis of all pairwise linkage disequilibrium (r2) values suggests that most allele substitution effects for the assayed SNPs observed are independent. Finally, the polymorphic coding SNP in the putative bovine NESP55 gene was used to test the imprinting status of this gene across a range of foetal bovine tissues. Conclusions Previous studies in other mammalian species have shown that DNA sequence variation within the imprinted GNAS gene cluster contributes to several physiological and metabolic disorders, including obesity in humans and mice. Similarly, the results presented here indicate an important role for the imprinted GNAS cluster in underlying complex performance traits in cattle such as animal growth, calving, fertility and health. These findings suggest that GNAS domain-associated polymorphisms may serve as important genetic markers for future livestock breeding programs and support previous studies that candidate imprinted loci may act as molecular targets for the genetic improvement of agricultural populations. In addition, we present new evidence that the bovine NESP55 gene is epigenetically regulated as a maternally expressed imprinted gene in placental and intestinal tissues from 8-10 week old bovine foetuses. PMID:21214909

  4. Genome-wide signatures of flowering adaptation to climate temperature: Regional analyses in a highly diverse native range of Arabidopsis thaliana.

    PubMed

    Tabas-Madrid, Daniel; Méndez-Vigo, Belén; Arteaga, Noelia; Marcer, Arnald; Pascual-Montano, Alberto; Weigel, Detlef; Xavier Picó, F; Alonso-Blanco, Carlos

    2018-03-08

    Current global change is fueling an interest to understand the genetic and molecular mechanisms of plant adaptation to climate. In particular, altered flowering time is a common strategy for escape from unfavourable climate temperature. In order to determine the genomic bases underlying flowering time adaptation to this climatic factor, we have systematically analysed a collection of 174 highly diverse Arabidopsis thaliana accessions from the Iberian Peninsula. Analyses of 1.88 million single nucleotide polymorphisms provide evidence for a spatially heterogeneous contribution of demographic and adaptive processes to geographic patterns of genetic variation. Mountains appear to be allele dispersal barriers, whereas the relationship between flowering time and temperature depended on the precise temperature range. Environmental genome-wide associations supported an overall genome adaptation to temperature, with 9.4% of the genes showing significant associations. Furthermore, phenotypic genome-wide associations provided a catalogue of candidate genes underlying flowering time variation. Finally, comparison of environmental and phenotypic genome-wide associations identified known (Twin Sister of FT, FRIGIDA-like 1, and Casein Kinase II Beta chain 1) and new (Epithiospecifer Modifier 1 and Voltage-Dependent Anion Channel 5) genes as candidates for adaptation to climate temperature by altered flowering time. Thus, this regional collection provides an excellent resource to address the spatial complexity of climate adaptation in annual plants. © 2018 John Wiley & Sons Ltd.

  5. Phylogenomics, Diversification Dynamics, and Comparative Transcriptomics across the Spider Tree of Life.

    PubMed

    Fernández, Rosa; Kallal, Robert J; Dimitrov, Dimitar; Ballesteros, Jesús A; Arnedo, Miquel A; Giribet, Gonzalo; Hormiga, Gustavo

    2018-05-07

    Dating back to almost 400 mya, spiders are among the most diverse terrestrial predators [1]. However, despite considerable effort [1-9], their phylogenetic relationships and diversification dynamics remain poorly understood. Here, we use a synergistic approach to study spider evolution through phylogenomics, comparative transcriptomics, and lineage diversification analyses. Our analyses, based on ca. 2,500 genes from 159 spider species, reject a single origin of the orb web (the "ancient orb-web hypothesis") and suggest that orb webs evolved multiple times since the late Triassic-Jurassic. We find no significant association between the loss of foraging webs and increases in diversification rates, suggesting that other factors (e.g., habitat heterogeneity or biotic interactions) potentially played a key role in spider diversification. Finally, we report notable genomic differences in the main spider lineages: while araneoids (ecribellate orb-weavers and their allies) reveal an enrichment in genes related to behavior and sensory reception, the retrolateral tibial apophysis (RTA) clade-the most diverse araneomorph spider lineage-shows enrichment in genes related to immune responses and polyphenic determination. This study, one of the largest invertebrate phylogenomic analyses to date, highlights the usefulness of transcriptomic data not only to build a robust backbone for the Spider Tree of Life, but also to address the genetic basis of diversification in the spider evolutionary chronicle. Copyright © 2018 Elsevier Ltd. All rights reserved.

  6. Structural and transcriptional analysis of plant genes encoding the bifunctional lysine ketoglutarate reductase saccharopine dehydrogenase enzyme.

    PubMed

    Anderson, Olin D; Coleman-Derr, Devin; Gu, Yong Q; Heath, Sekou

    2010-06-16

    Among the dietary essential amino acids, the most severely limiting in the cereals is lysine. Since cereals make up half of the human diet, lysine limitation has quality/nutritional consequences. The breakdown of lysine is controlled mainly by the catabolic bifunctional enzyme lysine ketoglutarate reductase - saccharopine dehydrogenase (LKR/SDH). The LKR/SDH gene has been reported to produce transcripts for the bifunctional enzyme and separate monofunctional transcripts. In addition to lysine metabolism, this gene has been implicated in a number of metabolic and developmental pathways, which along with its production of multiple transcript types and complex exon/intron structure suggest an important node in plant metabolism. Understanding more about the LKR/SDH gene is thus interesting both from applied standpoint and for basic plant metabolism. The current report describes a wheat genomic fragment containing an LKR/SDH gene and adjacent genes. The wheat LKR/SDH genomic segment was found to originate from the A-genome of wheat, and EST analysis indicates all three LKR/SDH genes in hexaploid wheat are transcriptionally active. A comparison of a set of plant LKR/SDH genes suggests regions of greater sequence conservation likely related to critical enzymatic functions and metabolic controls. Although most plants contain only a single LKR/SDH gene per genome, poplar contains at least two functional bifunctional genes in addition to a monofunctional LKR gene. Analysis of ESTs finds evidence for monofunctional LKR transcripts in switchgrass, and monofunctional SDH transcripts in wheat, Brachypodium, and poplar. The analysis of a wheat LKR/SDH gene and comparative structural and functional analyses among available plant genes provides new information on this important gene. Both the structure of the LKR/SDH gene and the immediately adjacent genes show lineage-specific differences between monocots and dicots, and findings suggest variation in activity of LKR/SDH genes among plants. Although most plant genomes seem to contain a single conserved LKR/SDH gene per genome, poplar possesses multiple contiguous genes. A preponderance of SDH transcripts suggests the LKR region may be more rate-limiting. Only switchgrass has EST evidence for LKR monofunctional transcripts. Evidence for monofunctional SDH transcripts shows a novel intron in wheat, Brachypodium, and poplar.

  7. Identification of a novel susceptibility locus at 13q34 and refinement of the 20p12.2 region as a multi-signal locus associated with bladder cancer risk in individuals of European ancestry

    PubMed Central

    Figueroa, Jonine D.; Middlebrooks, Candace D.; Banday, A. Rouf; Ye, Yuanqing; Garcia-Closas, Montserrat; Chatterjee, Nilanjan; Koutros, Stella; Kiemeney, Lambertus A.; Rafnar, Thorunn; Bishop, Timothy; Furberg, Helena; Matullo, Giuseppe; Golka, Klaus; Gago-Dominguez, Manuela; Taylor, Jack A.; Fletcher, Tony; Siddiq, Afshan; Cortessis, Victoria K.; Kooperberg, Charles; Cussenot, Olivier; Benhamou, Simone; Prescott, Jennifer; Porru, Stefano; Dinney, Colin P.; Malats, Núria; Baris, Dalsu; Purdue, Mark P.; Jacobs, Eric J.; Albanes, Demetrius; Wang, Zhaoming; Chung, Charles C.; Vermeulen, Sita H.; Aben, Katja K.; Galesloot, Tessel E.; Thorleifsson, Gudmar; Sulem, Patrick; Stefansson, Kari; Kiltie, Anne E.; Harland, Mark; Teo, Mark; Offit, Kenneth; Vijai, Joseph; Bajorin, Dean; Kopp, Ryan; Fiorito, Giovanni; Guarrera, Simonetta; Sacerdote, Carlotta; Selinski, Silvia; Hengstler, Jan G.; Gerullis, Holger; Ovsiannikov, Daniel; Blaszkewicz, Meinolf; Castelao, Jose Esteban; Calaza, Manuel; Martinez, Maria Elena; Cordeiro, Patricia; Xu, Zongli; Panduri, Vijayalakshmi; Kumar, Rajiv; Gurzau, Eugene; Koppova, Kvetoslava; Bueno-De-Mesquita, H. Bas; Ljungberg, Börje; Clavel-Chapelon, Françoise; Weiderpass, Elisabete; Krogh, Vittorio; Dorronsoro, Miren; Travis, Ruth C.; Tjønneland, Anne; Brennan, Paul; Chang-Claude, Jenny; Riboli, Elio; Conti, David; Stern, Marianna C.; Pike, Malcolm C.; Van Den Berg, David; Yuan, Jian-Min; Hohensee, Chancellor; Jeppson, Rebecca P.; Cancel-Tassin, Geraldine; Roupret, Morgan; Comperat, Eva; Turman, Constance; De Vivo, Immaculata; Giovannucci, Edward; Hunter, David J.; Kraft, Peter; Lindstrom, Sara; Carta, Angela; Pavanello, Sofia; Arici, Cecilia; Mastrangelo, Giuseppe; Kamat, Ashish M.; Zhang, Liren; Gong, Yilei; Pu, Xia; Hutchinson, Amy; Burdett, Laurie; Wheeler, William A.; Karagas, Margaret R.; Johnson, Alison; Schned, Alan; Monawar Hosain, G. M.; Schwenn, Molly; Kogevinas, Manolis; Tardón, Adonina; Serra, Consol; Carrato, Alfredo; García-Closas, Reina; Lloreta, Josep; Andriole, Gerald; Grubb, Robert; Black, Amanda; Diver, W. Ryan; Gapstur, Susan M.; Weinstein, Stephanie; Virtamo, Jarmo; Haiman, Christopher A.; Landi, Maria Teresa; Caporaso, Neil E.; Fraumeni, Joseph F.; Vineis, Paolo; Wu, Xifeng; Chanock, Stephen J.; Silverman, Debra T.; Prokunina-Olsson, Ludmila; Rothman, Nathaniel

    2016-01-01

    Candidate gene and genome-wide association studies (GWAS) have identified 15 independent genomic regions associated with bladder cancer risk. In search for additional susceptibility variants, we followed up on four promising single-nucleotide polymorphisms (SNPs) that had not achieved genome-wide significance in 6911 cases and 11 814 controls (rs6104690, rs4510656, rs5003154 and rs4907479, P < 1 × 10−6), using additional data from existing GWAS datasets and targeted genotyping for studies that did not have GWAS data. In a combined analysis, which included data on up to 15 058 cases and 286 270 controls, two SNPs achieved genome-wide statistical significance: rs6104690 in a gene desert at 20p12.2 (P = 2.19 × 10−11) and rs4907479 within the MCF2L gene at 13q34 (P = 3.3 × 10−10). Imputation and fine-mapping analyses were performed in these two regions for a subset of 5551 bladder cancer cases and 10 242 controls. Analyses at the 13q34 region suggest a single signal marked by rs4907479. In contrast, we detected two signals in the 20p12.2 region—the first signal is marked by rs6104690, and the second signal is marked by two moderately correlated SNPs (r2 = 0.53), rs6108803 and the previously reported rs62185668. The second 20p12.2 signal is more strongly associated with the risk of muscle-invasive (T2-T4 stage) compared with non-muscle-invasive (Ta, T1 stage) bladder cancer (case–case P ≤ 0.02 for both rs62185668 and rs6108803). Functional analyses are needed to explore the biological mechanisms underlying these novel genetic associations with risk for bladder cancer. PMID:26732427

  8. Fine-mapping of the HNF1B multicancer locus identifies candidate variants that mediate endometrial cancer risk.

    PubMed

    Painter, Jodie N; O'Mara, Tracy A; Batra, Jyotsna; Cheng, Timothy; Lose, Felicity A; Dennis, Joe; Michailidou, Kyriaki; Tyrer, Jonathan P; Ahmed, Shahana; Ferguson, Kaltin; Healey, Catherine S; Kaufmann, Susanne; Hillman, Kristine M; Walpole, Carina; Moya, Leire; Pollock, Pamela; Jones, Angela; Howarth, Kimberley; Martin, Lynn; Gorman, Maggie; Hodgson, Shirley; De Polanco, Ma Magdalena Echeverry; Sans, Monica; Carracedo, Angel; Castellvi-Bel, Sergi; Rojas-Martinez, Augusto; Santos, Erika; Teixeira, Manuel R; Carvajal-Carmona, Luis; Shu, Xiao-Ou; Long, Jirong; Zheng, Wei; Xiang, Yong-Bing; Montgomery, Grant W; Webb, Penelope M; Scott, Rodney J; McEvoy, Mark; Attia, John; Holliday, Elizabeth; Martin, Nicholas G; Nyholt, Dale R; Henders, Anjali K; Fasching, Peter A; Hein, Alexander; Beckmann, Matthias W; Renner, Stefan P; Dörk, Thilo; Hillemanns, Peter; Dürst, Matthias; Runnebaum, Ingo; Lambrechts, Diether; Coenegrachts, Lieve; Schrauwen, Stefanie; Amant, Frederic; Winterhoff, Boris; Dowdy, Sean C; Goode, Ellen L; Teoman, Attila; Salvesen, Helga B; Trovik, Jone; Njolstad, Tormund S; Werner, Henrica M J; Ashton, Katie; Proietto, Tony; Otton, Geoffrey; Tzortzatos, Gerasimos; Mints, Miriam; Tham, Emma; Hall, Per; Czene, Kamila; Liu, Jianjun; Li, Jingmei; Hopper, John L; Southey, Melissa C; Ekici, Arif B; Ruebner, Matthias; Johnson, Nicola; Peto, Julian; Burwinkel, Barbara; Marme, Frederik; Brenner, Hermann; Dieffenbach, Aida K; Meindl, Alfons; Brauch, Hiltrud; Lindblom, Annika; Depreeuw, Jeroen; Moisse, Matthieu; Chang-Claude, Jenny; Rudolph, Anja; Couch, Fergus J; Olson, Janet E; Giles, Graham G; Bruinsma, Fiona; Cunningham, Julie M; Fridley, Brooke L; Børresen-Dale, Anne-Lise; Kristensen, Vessela N; Cox, Angela; Swerdlow, Anthony J; Orr, Nicholas; Bolla, Manjeet K; Wang, Qin; Weber, Rachel Palmieri; Chen, Zhihua; Shah, Mitul; French, Juliet D; Pharoah, Paul D P; Dunning, Alison M; Tomlinson, Ian; Easton, Douglas F; Edwards, Stacey L; Thompson, Deborah J; Spurdle, Amanda B

    2015-03-01

    Common variants in the hepatocyte nuclear factor 1 homeobox B (HNF1B) gene are associated with the risk of Type II diabetes and multiple cancers. Evidence to date indicates that cancer risk may be mediated via genetic or epigenetic effects on HNF1B gene expression. We previously found single-nucleotide polymorphisms (SNPs) at the HNF1B locus to be associated with endometrial cancer, and now report extensive fine-mapping and in silico and laboratory analyses of this locus. Analysis of 1184 genotyped and imputed SNPs in 6608 Caucasian cases and 37 925 controls, and 895 Asian cases and 1968 controls, revealed the best signal of association for SNP rs11263763 (P = 8.4 × 10(-14), odds ratio = 0.86, 95% confidence interval = 0.82-0.89), located within HNF1B intron 1. Haplotype analysis and conditional analyses provide no evidence of further independent endometrial cancer risk variants at this locus. SNP rs11263763 genotype was associated with HNF1B mRNA expression but not with HNF1B methylation in endometrial tumor samples from The Cancer Genome Atlas. Genetic analyses prioritized rs11263763 and four other SNPs in high-to-moderate linkage disequilibrium as the most likely causal SNPs. Three of these SNPs map to the extended HNF1B promoter based on chromatin marks extending from the minimal promoter region. Reporter assays demonstrated that this extended region reduces activity in combination with the minimal HNF1B promoter, and that the minor alleles of rs11263763 or rs8064454 are associated with decreased HNF1B promoter activity. Our findings provide evidence for a single signal associated with endometrial cancer risk at the HNF1B locus, and that risk is likely mediated via altered HNF1B gene expression. © The Author 2014. Published by Oxford University Press. All rights reserved. For Permissions, please email: journals.permissions@oup.com.

  9. Fine-mapping of the HNF1B multicancer locus identifies candidate variants that mediate endometrial cancer risk

    PubMed Central

    Painter, Jodie N.; O'Mara, Tracy A.; Batra, Jyotsna; Cheng, Timothy; Lose, Felicity A.; Dennis, Joe; Michailidou, Kyriaki; Tyrer, Jonathan P.; Ahmed, Shahana; Ferguson, Kaltin; Healey, Catherine S.; Kaufmann, Susanne; Hillman, Kristine M.; Walpole, Carina; Moya, Leire; Pollock, Pamela; Jones, Angela; Howarth, Kimberley; Martin, Lynn; Gorman, Maggie; Hodgson, Shirley; De Polanco, Ma. Magdalena Echeverry; Sans, Monica; Carracedo, Angel; Castellvi-Bel, Sergi; Rojas-Martinez, Augusto; Santos, Erika; Teixeira, Manuel R.; Carvajal-Carmona, Luis; Shu, Xiao-Ou; Long, Jirong; Zheng, Wei; Xiang, Yong-Bing; Montgomery, Grant W.; Webb, Penelope M.; Scott, Rodney J.; McEvoy, Mark; Attia, John; Holliday, Elizabeth; Martin, Nicholas G.; Nyholt, Dale R.; Henders, Anjali K.; Fasching, Peter A.; Hein, Alexander; Beckmann, Matthias W.; Renner, Stefan P.; Dörk, Thilo; Hillemanns, Peter; Dürst, Matthias; Runnebaum, Ingo; Lambrechts, Diether; Coenegrachts, Lieve; Schrauwen, Stefanie; Amant, Frederic; Winterhoff, Boris; Dowdy, Sean C.; Goode, Ellen L.; Teoman, Attila; Salvesen, Helga B.; Trovik, Jone; Njolstad, Tormund S.; Werner, Henrica M.J.; Ashton, Katie; Proietto, Tony; Otton, Geoffrey; Tzortzatos, Gerasimos; Mints, Miriam; Tham, Emma; Hall, Per; Czene, Kamila; Liu, Jianjun; Li, Jingmei; Hopper, John L.; Southey, Melissa C.; Ekici, Arif B.; Ruebner, Matthias; Johnson, Nicola; Peto, Julian; Burwinkel, Barbara; Marme, Frederik; Brenner, Hermann; Dieffenbach, Aida K.; Meindl, Alfons; Brauch, Hiltrud; Lindblom, Annika; Depreeuw, Jeroen; Moisse, Matthieu; Chang-Claude, Jenny; Rudolph, Anja; Couch, Fergus J.; Olson, Janet E.; Giles, Graham G.; Bruinsma, Fiona; Cunningham, Julie M.; Fridley, Brooke L.; Børresen-Dale, Anne-Lise; Kristensen, Vessela N.; Cox, Angela; Swerdlow, Anthony J.; Orr, Nicholas; Bolla, Manjeet K.; Wang, Qin; Weber, Rachel Palmieri; Chen, Zhihua; Shah, Mitul; French, Juliet D.; Pharoah, Paul D.P.; Dunning, Alison M.; Tomlinson, Ian; Easton, Douglas F.; Edwards, Stacey L.; Thompson, Deborah J.; Spurdle, Amanda B.

    2015-01-01

    Common variants in the hepatocyte nuclear factor 1 homeobox B (HNF1B) gene are associated with the risk of Type II diabetes and multiple cancers. Evidence to date indicates that cancer risk may be mediated via genetic or epigenetic effects on HNF1B gene expression. We previously found single-nucleotide polymorphisms (SNPs) at the HNF1B locus to be associated with endometrial cancer, and now report extensive fine-mapping and in silico and laboratory analyses of this locus. Analysis of 1184 genotyped and imputed SNPs in 6608 Caucasian cases and 37 925 controls, and 895 Asian cases and 1968 controls, revealed the best signal of association for SNP rs11263763 (P = 8.4 × 10−14, odds ratio = 0.86, 95% confidence interval = 0.82–0.89), located within HNF1B intron 1. Haplotype analysis and conditional analyses provide no evidence of further independent endometrial cancer risk variants at this locus. SNP rs11263763 genotype was associated with HNF1B mRNA expression but not with HNF1B methylation in endometrial tumor samples from The Cancer Genome Atlas. Genetic analyses prioritized rs11263763 and four other SNPs in high-to-moderate linkage disequilibrium as the most likely causal SNPs. Three of these SNPs map to the extended HNF1B promoter based on chromatin marks extending from the minimal promoter region. Reporter assays demonstrated that this extended region reduces activity in combination with the minimal HNF1B promoter, and that the minor alleles of rs11263763 or rs8064454 are associated with decreased HNF1B promoter activity. Our findings provide evidence for a single signal associated with endometrial cancer risk at the HNF1B locus, and that risk is likely mediated via altered HNF1B gene expression. PMID:25378557

  10. Platypus globin genes and flanking loci suggest a new insertional model for beta-globin evolution in birds and mammals.

    PubMed

    Patel, Vidushi S; Cooper, Steven J B; Deakin, Janine E; Fulton, Bob; Graves, Tina; Warren, Wesley C; Wilson, Richard K; Graves, Jennifer A M

    2008-07-25

    Vertebrate alpha (alpha)- and beta (beta)-globin gene families exemplify the way in which genomes evolve to produce functional complexity. From tandem duplication of a single globin locus, the alpha- and beta-globin clusters expanded, and then were separated onto different chromosomes. The previous finding of a fossil beta-globin gene (omega) in the marsupial alpha-cluster, however, suggested that duplication of the alpha-beta cluster onto two chromosomes, followed by lineage-specific gene loss and duplication, produced paralogous alpha- and beta-globin clusters in birds and mammals. Here we analyse genomic data from an egg-laying monotreme mammal, the platypus (Ornithorhynchus anatinus), to explore haemoglobin evolution at the stem of the mammalian radiation. The platypus alpha-globin cluster (chromosome 21) contains embryonic and adult alpha- globin genes, a beta-like omega-globin gene, and the GBY globin gene with homology to cytoglobin, arranged as 5'-zeta-zeta'-alphaD-alpha3-alpha2-alpha1-omega-GBY-3'. The platypus beta-globin cluster (chromosome 2) contains single embryonic and adult globin genes arranged as 5'-epsilon-beta-3'. Surprisingly, all of these globin genes were expressed in some adult tissues. Comparison of flanking sequences revealed that all jawed vertebrate alpha-globin clusters are flanked by MPG-C16orf35 and LUC7L, whereas all bird and mammal beta-globin clusters are embedded in olfactory genes. Thus, the mammalian alpha- and beta-globin clusters are orthologous to the bird alpha- and beta-globin clusters respectively. We propose that alpha- and beta-globin clusters evolved from an ancient MPG-C16orf35-alpha-beta-GBY-LUC7L arrangement 410 million years ago. A copy of the original beta (represented by omega in marsupials and monotremes) was inserted into an array of olfactory genes before the amniote radiation (>315 million years ago), then duplicated and diverged to form orthologous clusters of beta-globin genes with different expression profiles in different lineages.

  11. The Fanconi anemia/BRCA gene network in zebrafish: embryonic expression and comparative genomics.

    PubMed

    Titus, Tom A; Yan, Yi-Lin; Wilson, Catherine; Starks, Amber M; Frohnmayer, Jonathan D; Bremiller, Ruth A; Cañestro, Cristian; Rodriguez-Mari, Adriana; He, Xinjun; Postlethwait, John H

    2009-07-31

    Fanconi anemia (FA) is a genetic disease resulting in bone marrow failure, high cancer risks, and infertility, and developmental anomalies including microphthalmia, microcephaly, hypoplastic radius and thumb. Here we present cDNA sequences, genetic mapping, and genomic analyses for the four previously undescribed zebrafish FA genes (fanci, fancj, fancm, and fancn), and show that they reverted to single copy after the teleost genome duplication. We tested the hypothesis that FA genes are expressed during embryonic development in tissues that are disrupted in human patients by investigating fanc gene expression patterns. We found fanc gene maternal message, which can provide Fanc proteins to repair DNA damage encountered in rapid cleavage divisions. Zygotic expression was broad but especially strong in eyes, central nervous system and hematopoietic tissues. In the pectoral fin bud at hatching, fanc genes were expressed specifically in the apical ectodermal ridge, a signaling center for fin/limb development that may be relevant to the radius/thumb anomaly of FA patients. Hatching embryos expressed fanc genes strongly in the oral epithelium, a site of squamous cell carcinomas in FA patients. Larval and adult zebrafish expressed fanc genes in proliferative regions of the brain, which may be related to microcephaly in FA. Mature ovaries and testes expressed fanc genes in specific stages of oocyte and spermatocyte development, which may be related to DNA repair during homologous recombination in meiosis and to infertility in human patients. The intestine strongly expressed some fanc genes specifically in proliferative zones. Our results show that zebrafish has a complete complement of fanc genes in single copy and that these genes are expressed in zebrafish embryos and adults in proliferative tissues that are often affected in FA patients. These results support the notion that zebrafish offers an attractive experimental system to help unravel mechanisms relevant not only to FA, but also to breast cancer, given the involvement of fancj (brip1), fancn (palb2) and fancd1 (brca2) in both conditions.

  12. The Fanconi anemia/BRCA gene network in zebrafish: Embryonic expression and comparative genomics

    PubMed Central

    Titus, Tom A.; Yan, Yi-Lin; Wilson, Catherine; Starks, Amber M.; Frohnmayer, Jonathan D.; Canestro, Cristian; Rodriguez-Mari, Adriana; He, Xinjun; Postlethwait, John H.

    2008-01-01

    Fanconi anemia (FA) is a genic disease resulting in bone marrow failure, high cancer risks, and infertility, and developmental anomalies including microphthalmia, microcephaly, hypoplastic radius and thumb. Here we present cDNA sequences, genetic mapping, and genomic analyses for the four previously undescribed zebrafish FA genes (fanci, fancj, fancm, and fancn, and show that they reverted to single copy after the teleost genome duplication. We tested the hypothesis that FA genes are expressed during embryonic development in tissues that are disrupted in human patients by investigating fanc gene expression patterns. We found fanc gene maternal message, which can provide Fanc proteins to repair DNA damage encountered in rapid cleavage divisions. Zygotic expression was broad but especially strong in eyes, central nervous system and hematopoietic tissues. In the pectoral fin bud at hatching, fanc genes were expressed specifically in the apical ectodermal ridge, a signaling center for fin/limb development that may be relevant to the radius/thumb anomaly of FA patients. Hatching embryos expressed fanc genes strongly in the oral epithelium, a site of squamous cell carcinomas in FA patients. Larval and adult zebrafish expressed fanc genes in proliferative regions of the brain, which may be related to microcephaly in FA. Mature ovaries and testes expressed fanc genes in specific stages of oocyte and spermatocyte development, which may be related to DNA repair during homologous recombination in meiosis and to infertility in human patients. The intestine strongly expressed some fanc genes specifically in proliferative zones. Our results show that zebrafish has a complete complement of fanc genes in single copy and that these genes are expressed in zebrafish embryos and adults in proliferative tissues that are often affected in FA patients. These results support the notion that zebrafish offers an attractive experimental system to help unravel mechanisms relevant not only to FA, but also to breast cancer, given the involvement of fancj (brip1), fancn (palb2) and fancd1 (brca2) in both conditions. PMID:19101574

  13. iGWAS: Integrative Genome-Wide Association Studies of Genetic and Genomic Data for Disease Susceptibility Using Mediation Analysis.

    PubMed

    Huang, Yen-Tsung; Liang, Liming; Moffatt, Miriam F; Cookson, William O C M; Lin, Xihong

    2015-07-01

    Genome-wide association studies (GWAS) have been a standard practice in identifying single nucleotide polymorphisms (SNPs) for disease susceptibility. We propose a new approach, termed integrative GWAS (iGWAS) that exploits the information of gene expressions to investigate the mechanisms of the association of SNPs with a disease phenotype, and to incorporate the family-based design for genetic association studies. Specifically, the relations among SNPs, gene expression, and disease are modeled within the mediation analysis framework, which allows us to disentangle the genetic effect on a disease phenotype into two parts: an effect mediated through a gene expression (mediation effect, ME) and an effect through other biological mechanisms or environment-mediated mechanisms (alternative effect, AE). We develop omnibus tests for the ME and AE that are robust to underlying true disease models. Numerical studies show that the iGWAS approach is able to facilitate discovering genetic association mechanisms, and outperforms the SNP-only method for testing genetic associations. We conduct a family-based iGWAS of childhood asthma that integrates genetic and genomic data. The iGWAS approach identifies six novel susceptibility genes (MANEA, MRPL53, LYCAT, ST8SIA4, NDFIP1, and PTCH1) using the omnibus test with false discovery rate less than 1%, whereas no gene using SNP-only analyses survives with the same cut-off. The iGWAS analyses further characterize that genetic effects of these genes are mostly mediated through their gene expressions. In summary, the iGWAS approach provides a new analytic framework to investigate the mechanism of genetic etiology, and identifies novel susceptibility genes of childhood asthma that were biologically meaningful. © 2015 WILEY PERIODICALS, INC.

  14. The complete chloroplast DNA sequence of the green alga Nephroselmis olivacea: Insights into the architecture of ancestral chloroplast genomes

    PubMed Central

    Turmel, Monique; Otis, Christian; Lemieux, Claude

    1999-01-01

    Green plants seem to form two sister lineages: Chlorophyta, comprising the green algal classes Prasinophyceae, Ulvophyceae, Trebouxiophyceae, and Chlorophyceae, and Streptophyta, comprising the Charophyceae and land plants. We have determined the complete chloroplast DNA (cpDNA) sequence (200,799 bp) of Nephroselmis olivacea, a member of the class (Prasinophyceae) thought to include descendants of the earliest-diverging green algae. The 127 genes identified in this genome represent the largest gene repertoire among the green algal and land plant cpDNAs completely sequenced to date. Of the Nephroselmis genes, 2 (ycf81 and ftsI, a gene involved in peptidoglycan synthesis) have not been identified in any previously investigated cpDNA; 5 genes [ftsW, rnE, ycf62, rnpB, and trnS(cga)] have been found only in cpDNAs of nongreen algae; and 10 others (ndh genes) have been described only in land plant cpDNAs. Nephroselmis and land plant cpDNAs share the same quadripartite structure—which is characterized by the presence of a large rRNA-encoding inverted repeat and two unequal single-copy regions—and very similar sets of genes in corresponding genomic regions. Given that our phylogenetic analyses place Nephroselmis within the Chlorophyta, these structural characteristics were most likely present in the cpDNA of the common ancestor of chlorophytes and streptophytes. Comparative analyses of chloroplast genomes indicate that the typical quadripartite architecture and gene-partitioning pattern of land plant cpDNAs are ancient features that may have been derived from the genome of the cyanobacterial progenitor of chloroplasts. Our phylogenetic data also offer insight into the chlorophyte ancestor of euglenophyte chloroplasts. PMID:10468594

  15. Germline Variation in Complement Genes and Event-Free Survival in Follicular and Diffuse Large B-Cell Lymphoma

    PubMed Central

    Charbonneau, Bridget; Maurer, Matthew J.; Fredericksen, Zachary S.; Zent, Clive S.; Link, Brian K.; Novak, Anne J.; Ansell, Stephen M.; Weiner, George J.; Wang, Alice H.; Witzig, Thomas E.; Dogan, Ahmet; Slager, Susan L.; Habermann, Thomas M.; Cerhan, James R.

    2013-01-01

    The complement pathway plays a central role in innate immunity, and also functions as a regulator of the overall immune response. We evaluated whether polymorphisms in complement genes are associated with event-free survival (EFS) in follicular (FL) and diffuse large B-cell (DLBCL) lymphoma. We genotyped 167 single nucleotide polymorphisms (SNPs) from 30 complement pathway genes in a prospective cohort study of newly diagnosed FL (N=107) and DLBCL (N=82) patients enrolled at the Mayo Clinic from 2002–2005. Cox regression was used to estimate Hazard Ratios (HRs) for individual SNPs with EFS, adjusting for FLIPI or IPI and treatment. For gene-level analyses, we used a principal components based gene-level test. In gene-level analyses for FL EFS, CFH (p=0.009), CD55 (p=0.006), CFHR5 (p=0.01), C9 (p=0.02), CFHR1 (p=0.03), and CD46 (p=0.03) were significant at p<0.05, and these genes remained noteworthy after accounting for multiple testing (q<0.15). SNPs in CFH, CFHR1, and CFHR5 showed stronger associations among patients receiving any rituximab, while SNPs from CD55 and CD46 showed stronger associations among patients who were observed. For DLBCL, only CLU (p=0.001) and C7 (p=0.03) were associated with EFS, but did not remain noteworthy after accounting for multiple testing (q>0.15). Genes from the Regulators of Complement Activation (CFH, CD55, CFHR1, CFHR5, CD46) at 1q32-q32.1, along with C9, were associated with FL EFS after adjusting for clinical variables, and if replicated, these findings add further support for the role of host innate immunity in FL prognosis. PMID:22718493

  16. Blood lead levels, iron metabolism gene polymorphisms and homocysteine: a gene-environment interaction study.

    PubMed

    Kim, Kyoung-Nam; Lee, Mee-Ri; Lim, Youn-Hee; Hong, Yun-Chul

    2017-12-01

    Homocysteine has been causally associated with various adverse health outcomes. Evidence supporting the relationship between lead and homocysteine levels has been accumulating, but most prior studies have not focused on the interaction with genetic polymorphisms. From a community-based prospective cohort, we analysed 386 participants (aged 41-71 years) with information regarding blood lead and plasma homocysteine levels. Blood lead levels were measured between 2001 and 2003, and plasma homocysteine levels were measured in 2007. Interactions of lead levels with 42 genotyped single-nucleotide polymorphisms (SNPs) in five genes ( TF , HFE , CBS , BHMT and MTR ) were assessed via a 2-degree of freedom (df) joint test and a 1-df interaction test. In secondary analyses using imputation, we further assessed 58 imputed SNPs in the TF and MTHFR genes. Blood lead concentrations were positively associated with plasma homocysteine levels (p=0.0276). Six SNPs in the TF and MTR genes were screened using the 2-df joint test, and among them, three SNPs in the TF gene showed interactions with lead with respect to homocysteine levels through the 1-df interaction test (p<0.0083). Seven SNPs in the MTHFR gene were associated with homocysteine levels at an α-level of 0.05, but the associations did not persist after Bonferroni correction. These SNPs did not show interactions with lead levels. Blood lead levels were positively associated with plasma homocysteine levels measured 4-6 years later, and three SNPs in the TF gene modified the association. © Article author(s) (or their employer(s) unless otherwise stated in the text of the article) 2017. All rights reserved. No commercial use is permitted unless otherwise expressly granted.

  17. The vestigial olfactory receptor subgenome of odontocete whales: phylogenetic congruence between gene-tree reconciliation and supermatrix methods.

    PubMed

    McGowen, Michael R; Clark, Clay; Gatesy, John

    2008-08-01

    The macroevolutionary transition of whales (cetaceans) from a terrestrial quadruped to an obligate aquatic form involved major changes in sensory abilities. Compared to terrestrial mammals, the olfactory system of baleen whales is dramatically reduced, and in toothed whales is completely absent. We sampled the olfactory receptor (OR) subgenomes of eight cetacean species from four families. A multigene tree of 115 newly characterized OR sequences from these eight species and published data for Bos taurus revealed a diverse array of class II OR paralogues in Cetacea. Evolution of the OR gene superfamily in toothed whales (Odontoceti) featured a multitude of independent pseudogenization events, supporting anatomical evidence that odontocetes have lost their olfactory sense. We explored the phylogenetic utility of OR pseudogenes in Cetacea, concentrating on delphinids (oceanic dolphins), the product of a rapid evolutionary radiation that has been difficult to resolve in previous studies of mitochondrial DNA sequences. Phylogenetic analyses of OR pseudogenes using both gene-tree reconciliation and supermatrix methods yielded fully resolved, consistently supported relationships among members of four delphinid subfamilies. Alternative minimizations of gene duplications, gene duplications plus gene losses, deep coalescence events, and nucleotide substitutions plus indels returned highly congruent phylogenetic hypotheses. Novel DNA sequence data for six single-copy nuclear loci and three mitochondrial genes (> 5000 aligned nucleotides) provided an independent test of the OR trees. Nucleotide substitutions and indels in OR pseudogenes showed a very low degree of homoplasy in comparison to mitochondrial DNA and, on average, provided more variation than single-copy nuclear DNA. Our results suggest that phylogenetic analysis of the large OR superfamily will be effective for resolving relationships within Cetacea whether supermatrix or gene-tree reconciliation procedures are used.

  18. Progranulin gene variation affects serum progranulin levels differently in Danish bipolar individuals compared with healthy controls.

    PubMed

    Buttenschøn, Henriette N; Nielsen, Marit N; Thotakura, Gangadaar; Lee, Chris W; Nykjær, Anders; Mors, Ole; Glerup, Simon

    2017-06-01

    The identification of peripheral biomarkers for bipolar disorder is of great importance and has the potential to improve diagnosis, treatment and prognosis. Recent studies have reported lower plasma progranulin levels in bipolar individuals compared with controls and association with single nucleotide polymorphisms (SNPs) within the progranulin gene (GRN). In the present study, we investigated the effect of GRN and sortilin (SORT1) gene variation on serum progranulin levels in bipolar individuals and controls. In a Danish cohort of individuals with bipolar disorder and controls, we analysed the serum progranulin level (nbipolar=80, ncontrols=76) and five SNPs located within GRN and two SNPs near the SORT1 gene encoding sortilin, a progranulin scavenger receptor known to affect circulating progranulin levels (nbipolar=166, ncontrols=186). We observed no significant difference in the serum progranulin level between cases and controls and none of the analysed SNPs located within GRN or close to SORT1 were associated with bipolar disorder. Crude and adjusted (adjusted for case-control status, sex and age) linear regression analyses showed no effect of any SNPs on the serum progranulin level. However, we observed that the mean serum progranulin level in cases and controls is affected differently depending on the genotypes of two SNPs within GRN (rs2879096 and rs4792938). The sample size is relatively small and detailed information on medication and polarity of the disorder is not available. No correction for multiple testing was performed. Our study suggests that the potential of progranulin as a biomarker for bipolar disorder is genotype dependent.

  19. Frameshift mutations of TAF1C gene, a core component for transcription by RNA polymerase I, and its regional heterogeneity in gastric and colorectal cancers.

    PubMed

    Oh, Hye Rim; An, Chang Hyeok; Yoo, Nam Jin; Lee, Sug Hyung

    2015-02-01

    Initiation of transcription for ribosomal RNA (rRNA) by RNA polymerase I requires TATA-binding protein (TBP) and TBP-associated factors (TAF1A, TAF1B and TAF1C). p53 tumour suppressor inhibits rRNA transcription by blocking TAF1C-UBF interaction, but alterations of TAF1C itself in tumorigenesis remain unknown. The aim of this study was to explore whether TAF1C gene was mutated in gastric (GC) and colorectal cancers (CRC).In a public database, we found that TAF1C gene had a mononucleotide repeat (C8) in the coding sequences that might be a mutation target in the cancers with microsatellite instability (MSI). We analysed 79 GC and 124 CRC by single-strand conformation polymorphism and DNA sequencing analyses. In this study, we found TAF1C frameshift mutations (8.8% of GC and 10.1% of CRC with MSI-H), which were not found in stable MSI/low MSI (MSS/MSI-L) (0/90). In addition, we analysed intratumoural heterogeneity (ITH) of TAF1C frameshift mutations in 16 CRC and found that three CRC (18.8%) harboured regional ITH of the TAF1C frameshift mutations. Our results indicate that TAF1C gene harboured not only somatic frameshift mutations but also the mutational ITH, which together might play a role in tumourigenesis of GC and CRC. Our data also suggest that multi-regional mutation analysis is needed for a better evaluation of the mutation status in CRC.

  20. Gene inactivation in the plant pathogen Glomerella cingulata: three strategies for the disruption of the pectin lyase gene pnlA.

    PubMed

    Bowen, J K; Templeton, M D; Sharrock, K R; Crowhurst, R N; Rikkerink, E H

    1995-01-20

    The feasibility of performing routine transformation-mediated mutagenesis in Glomerella cingulata was analysed by adopting three one-step gene disruption strategies targeted at the pectin lyase gene pnlA. The efficiencies of disruption following transformation with gene replacement- or gene truncation-disruption vectors were compared. To effect replacement-disruption, G. cingulata was transformed with a vector carrying DNA from the pnlA locus in which the majority of the coding sequence had been replaced by the gene for hygromycin B resistance. Two of the five transformants investigated contained an inactivated pnlA gene (pnlA-); both also contained ectopically integrated vector sequences. The efficacy of gene disruption by transformation with two gene truncation-disruption vectors was also assessed. Both vectors carried at 5' and 3' truncated copy of the pnlA coding sequence, adjacent to the gene for hygromycin B resistance. The promoter sequences controlling the selectable marker differed in the two vectors. In one vector the homologous G. cingulata gpdA promoter controlled hygromycin B phosphotransferase expression (homologous truncation vector), whereas in the second vector promoter elements were from the Aspergillus nidulans gpdA gene (heterologous truncation vector). Following transformation with the homologous truncation vector, nine transformants were analysed by Southern hybridisation; no transformants contained a disrupted pnlA gene. Of nineteen heterologous truncation vector transformants, three contained a disrupted pnlA gene; Southern analysis revealed single integrations of vector sequence at pnlA in two of these transformants. pnlA mRNA was not detected by Northern hybridisation in pnlA- transformants. pnlA- transformants failed to produce a PNLA protein with a pI identical to one normally detected in wild-type isolates by silver and activity staining of isoelectric focussing gels. Pathogenesis on Capsicum and apple was unaffected by disruption of the pnlA gene, indicating that the corresponding gene product, PNLA, is not essential for pathogenicity. Gene disruption is a feasible method for selectively mutating defined loci in G. cingulata for functional analysis of the corresponding gene products.

  1. Live births after simultaneous avoidance of monogenic diseases and chromosome abnormality by next-generation sequencing with linkage analyses.

    PubMed

    Yan, Liying; Huang, Lei; Xu, Liya; Huang, Jin; Ma, Fei; Zhu, Xiaohui; Tang, Yaqiong; Liu, Mingshan; Lian, Ying; Liu, Ping; Li, Rong; Lu, Sijia; Tang, Fuchou; Qiao, Jie; Xie, X Sunney

    2015-12-29

    In vitro fertilization (IVF), preimplantation genetic diagnosis (PGD), and preimplantation genetic screening (PGS) help patients to select embryos free of monogenic diseases and aneuploidy (chromosome abnormality). Next-generation sequencing (NGS) methods, while experiencing a rapid cost reduction, have improved the precision of PGD/PGS. However, the precision of PGD has been limited by the false-positive and false-negative single-nucleotide variations (SNVs), which are not acceptable in IVF and can be circumvented by linkage analyses, such as short tandem repeats or karyomapping. It is noteworthy that existing methods of detecting SNV/copy number variation (CNV) and linkage analysis often require separate procedures for the same embryo. Here we report an NGS-based PGD/PGS procedure that can simultaneously detect a single-gene disorder and aneuploidy and is capable of linkage analysis in a cost-effective way. This method, called "mutated allele revealed by sequencing with aneuploidy and linkage analyses" (MARSALA), involves multiple annealing and looping-based amplification cycles (MALBAC) for single-cell whole-genome amplification. Aneuploidy is determined by CNVs, whereas SNVs associated with the monogenic diseases are detected by PCR amplification of the MALBAC product. The false-positive and -negative SNVs are avoided by an NGS-based linkage analysis. Two healthy babies, free of the monogenic diseases of their parents, were born after such embryo selection. The monogenic diseases originated from a single base mutation on the autosome and the X-chromosome of the disease-carrying father and mother, respectively.

  2. Divergent gene copies in the asexual class Bdelloidea (Rotifera) separated before the bdelloid radiation or within bdelloid families.

    PubMed

    Mark Welch, David B; Cummings, Michael P; Hillis, David M; Meselson, Matthew

    2004-02-10

    Rotifers of the asexual class Bdelloidea are unusual in possessing two or more divergent copies of every gene that has been examined. Phylogenetic analysis of the heat-shock gene hsp82 and the TATA-box-binding protein gene tbp in multiple bdelloid species suggested that for each gene, each copy belonged to one of two lineages that began to diverge before the bdelloid radiation. Such gene trees are consistent with the two lineages having descended from former alleles that began to diverge after meiotic segregation ceased or from subgenomes of an alloploid ancestor of the bdelloids. However, the original analyses of bdelloid gene-copy divergence used only a single outgroup species and were based on parsimony and neighbor joining. We have now used maximum likelihood and Bayesian inference methods and, for hsp82, multiple outgroups in an attempt to produce more robust gene trees. Here we report that the available data do not unambiguously discriminate between gene trees that root the origin of hsp82 and tbp copy divergence before the bdelloid radiation and those which indicate that the gene copies began to diverge within bdelloid families. The remarkable presence of multiple diverged gene copies in individual genomes is nevertheless consistent with the loss of sex in an ancient ancestor of bdelloids.

  3. Influence of ghrelin gene polymorphisms on hypertension and atherosclerotic disease.

    PubMed

    Berthold, Heiner K; Giannakidou, Eleni; Krone, Wilhelm; Trégouët, David-Alexandre; Gouni-Berthold, Ioanna

    2010-02-01

    Ghrelin is involved in several metabolic and cardiovascular processes. Recent evidence suggests its involvement in blood pressure regulation and hypertension. The aim of the study was to determine associations of single-nucleotide polymorphisms (SNPs) and haplotypes of the ghrelin gene (GHRL) with hypertension and atherosclerotic disease. Six GHRL SNPs (rs27647, rs26802, rs34911341, rs696217, rs4684677 and a -473G/A (with no assigned rsID)) were investigated in a sample of 1143 hypertensive subjects and 1489 controls of Caucasian origin. Both single-locus and haplotype association analyses were performed. In single-locus analyses, only the non-synonymous rs34911341 was associated with hypertension (odds ratio (OR)=1.95 (95% confidence interval (CI): 1.26-3.02), P=0.003). Six common haplotypes with frequency >1% were inferred from the studied GHRL SNPs, and their frequency distribution was significantly different between hypertensive subjects and controls (chi(2)=12.96 with 5 d.f. (degree of freedom), P=0.024). The effect of rs26802 was found to be significantly (P=0.017) modulated by other GHRL SNPs, as its C allele conferred either an increased risk (OR=1.30 (1.08-1.57), P=0.005) or a decreased risk (OR=0.50 (0.23-1.06), P=0.07) of hypertension according to the two different haplotypes on which it can be found. No association of GHRL SNPs or haplotypes with atherosclerotic disease was observed. In conclusion, we observed statistical evidence for association between GHRL SNPs and risk of hypertension.

  4. Quantitative expression analysis of selected transcription factors in pavement, basal and trichome cells of mature leaves from Arabidopsis thaliana

    PubMed Central

    Schliep, Martin; Ebert, Berit; Simon-Rosin, Ulrike; Zoeller, Daniela

    2010-01-01

    Gene expression levels of several transcription factors from Arabidopsis thaliana that were described previously to be involved in leaf development and trichome formation were analysed in trichome, basal and pavement cells of mature leaves. Single cell samples of these three cells types were collected by glass micro-capillaries. Real-time reverse transcription (RT)-PCR was used to analyse expression patterns of the following transcription factors: MYB23, MYB55, AtHB1, FILAMENTOUS FLOWER (FIL)/YABBY1 (YAB1), TRIPTYCHON (TRY) and CAPRICE (CPC). A difference in the expression patterns of TRY and CPC was revealed. Contrary to the CPC expression pattern, no transcripts of TRY could be detected in pavement cells. FIL/YAB1 was exclusively expressed in trichome cells. AtHB1 was highly expressed throughout all three cell types. MYB55 was higher expressed in basal cells than in trichome and pavement cells. MYB23 showed a pattern of low expression in pavement cells, medium in basal cells and high expression in trichomes. Expression patterns obtained by single cell sampling and real-time RT-PCR were compared to promoter GUS fusions of the selected transcription factors. Therefore, we regenerated two transgenic Arabidopsis lines that expressed the GUS reporter gene under control of the promoters of MYB55 and YAB1. In conclusion, despite their function in leaf morphogenesis, all six transcription factors were detected in mature leaves. Furthermore, single cell sampling and promoter GUS staining patterns demonstrated the predominant presence of MYB55 in basal cells as compared to pavement cells and trichomes. PMID:20101514

  5. Quantitative expression analysis of selected transcription factors in pavement, basal and trichome cells of mature leaves from Arabidopsis thaliana.

    PubMed

    Schliep, Martin; Ebert, Berit; Simon-Rosin, Ulrike; Zoeller, Daniela; Fisahn, Joachim

    2010-05-01

    Gene expression levels of several transcription factors from Arabidopsis thaliana that were described previously to be involved in leaf development and trichome formation were analysed in trichome, basal and pavement cells of mature leaves. Single cell samples of these three cells types were collected by glass micro-capillaries. Real-time reverse transcription (RT)-PCR was used to analyse expression patterns of the following transcription factors: MYB23, MYB55, AtHB1, FILAMENTOUS FLOWER (FIL)/YABBY1 (YAB1), TRIPTYCHON (TRY) and CAPRICE (CPC). A difference in the expression patterns of TRY and CPC was revealed. Contrary to the CPC expression pattern, no transcripts of TRY could be detected in pavement cells. FIL/YAB1 was exclusively expressed in trichome cells. AtHB1 was highly expressed throughout all three cell types. MYB55 was higher expressed in basal cells than in trichome and pavement cells. MYB23 showed a pattern of low expression in pavement cells, medium in basal cells and high expression in trichomes. Expression patterns obtained by single cell sampling and real-time RT-PCR were compared to promoter GUS fusions of the selected transcription factors. Therefore, we regenerated two transgenic Arabidopsis lines that expressed the GUS reporter gene under control of the promoters of MYB55 and YAB1. In conclusion, despite their function in leaf morphogenesis, all six transcription factors were detected in mature leaves. Furthermore, single cell sampling and promoter GUS staining patterns demonstrated the predominant presence of MYB55 in basal cells as compared to pavement cells and trichomes.

  6. Somatic mutations affect key pathways in lung adenocarcinoma

    PubMed Central

    Ding, Li; Getz, Gad; Wheeler, David A.; Mardis, Elaine R.; McLellan, Michael D.; Cibulskis, Kristian; Sougnez, Carrie; Greulich, Heidi; Muzny, Donna M.; Morgan, Margaret B.; Fulton, Lucinda; Fulton, Robert S.; Zhang, Qunyuan; Wendl, Michael C.; Lawrence, Michael S.; Larson, David E.; Chen, Ken; Dooling, David J.; Sabo, Aniko; Hawes, Alicia C.; Shen, Hua; Jhangiani, Shalini N.; Lewis, Lora R.; Hall, Otis; Zhu, Yiming; Mathew, Tittu; Ren, Yanru; Yao, Jiqiang; Scherer, Steven E.; Clerc, Kerstin; Metcalf, Ginger A.; Ng, Brian; Milosavljevic, Aleksandar; Gonzalez-Garay, Manuel L.; Osborne, John R.; Meyer, Rick; Shi, Xiaoqi; Tang, Yuzhu; Koboldt, Daniel C.; Lin, Ling; Abbott, Rachel; Miner, Tracie L.; Pohl, Craig; Fewell, Ginger; Haipek, Carrie; Schmidt, Heather; Dunford-Shore, Brian H.; Kraja, Aldi; Crosby, Seth D.; Sawyer, Christopher S.; Vickery, Tammi; Sander, Sacha; Robinson, Jody; Winckler, Wendy; Baldwin, Jennifer; Chirieac, Lucian R.; Dutt, Amit; Fennell, Tim; Hanna, Megan; Johnson, Bruce E.; Onofrio, Robert C.; Thomas, Roman K.; Tonon, Giovanni; Weir, Barbara A.; Zhao, Xiaojun; Ziaugra, Liuda; Zody, Michael C.; Giordano, Thomas; Orringer, Mark B.; Roth, Jack A.; Spitz, Margaret R.; Wistuba, Ignacio I.; Ozenberger, Bradley; Good, Peter J.; Chang, Andrew C.; Beer, David G.; Watson, Mark A.; Ladanyi, Marc; Broderick, Stephen; Yoshizawa, Akihiko; Travis, William D.; Pao, William; Province, Michael A.; Weinstock, George M.; Varmus, Harold E.; Gabriel, Stacey B.; Lander, Eric S.; Gibbs, Richard A.; Meyerson, Matthew; Wilson, Richard K.

    2009-01-01

    Determining the genetic basis of cancer requires comprehensive analyses of large collections of histopathologically well-classified primary tumours. Here we report the results of a collaborative study to discover somatic mutations in 188 human lung adenocarcinomas. DNA sequencing of 623 genes with known or potential relationships to cancer revealed more than 1,000 somatic mutations across the samples. Our analysis identified 26 genes that are mutated at significantly high frequencies and thus are probably involved in carcinogenesis. The frequently mutated genes include tyrosine kinases, among them the EGFR homologue ERBB4; multiple ephrin receptor genes, notably EPHA3; vascular endothelial growth factor receptor KDR; and NTRK genes. These data provide evidence of somatic mutations in primary lung adenocarcinoma for several tumour suppressor genes involved in other cancers—including NF1, APC, RB1 and ATM—and for sequence changes in PTPRD as well as the frequently deleted gene LRP1B. The observed mutational profiles correlate with clinical features, smoking status and DNA repair defects. These results are reinforced by data integration including single nucleotide polymorphism array and gene expression array. Our findings shed further light on several important signalling pathways involved in lung adenocarcinoma, and suggest new molecular targets for treatment. PMID:18948947

  7. Polymorphisms in Inflammatory Genes are Associated with Term Small for Gestational Age and Preeclampsia

    PubMed Central

    Harmon, Quaker E.; Engel, Stephanie M.; Wu, Michael C.; Moran, Thomas M.; Luo, Jingchun; Stuebe, Alison M.; Avery, Christy L.; Olshan, Andrew F.

    2014-01-01

    Problem Inflammatory biomarkers are associated with preeclampsia (PE) and poor fetal growth; however, genetic epidemiologic studies have been limited by reduced gene coverage and the exclusion of African American mothers. Method of study Cases and controls (N = 1646) from a pregnancy cohort were genotyped for 503 tagSNPs in 40 genes related to inflammation. Gene-set analyses were stratified by race and were followed by a single SNP analysis within significant gene sets. Results Gene-level associations were found for IL6 and KLRD1 for term small for gestational age (SGA) among African Americans. LTA/TNF and TBX21 were associated with PE among European Americans. The strongest association was for PE among European Americans for an upstream regulator of TNF with RR = 1.8 (95% CI 1.1–2.7). Conclusion Although previous studies have suggested null associations, increased tagging and stratification by genetic ancestry suggests important associations between IL6 and term SGA for African Americans, and a TNF regulator and PE among European Americans (N = 149). PMID:24702779

  8. Polymorphisms in inflammatory genes are associated with term small for gestational age and preeclampsia.

    PubMed

    Harmon, Quaker E; Engel, Stephanie M; Wu, Michael C; Moran, Thomas M; Luo, Jingchun; Stuebe, Alison M; Avery, Christy L; Olshan, Andrew F

    2014-05-01

    Inflammatory biomarkers are associated with preeclampsia (PE) and poor fetal growth; however, genetic epidemiologic studies have been limited by reduced gene coverage and the exclusion of African American mothers. Cases and controls (N = 1646) from a pregnancy cohort were genotyped for 503 tagSNPs in 40 genes related to inflammation. Gene-set analyses were stratified by race and were followed by a single SNP analysis within significant gene sets. Gene-level associations were found for IL6 and KLRD1 for term small for gestational age (SGA) among African Americans. LTA/TNF and TBX21 were associated with PE among European Americans. The strongest association was for PE among European Americans for an upstream regulator of TNF with RR = 1.8 (95% CI 1.1-2.7). Although previous studies have suggested null associations, increased tagging and stratification by genetic ancestry suggests important associations between IL6 and term SGA for African Americans, and a TNF regulator and PE among European Americans (N = 149). © 2014 John Wiley & Sons A/S. Published by John Wiley & Sons Ltd.

  9. Pleiotropic Genes Affecting Carcass Traits in Bos indicus (Nellore) Cattle Are Modulators of Growth

    PubMed Central

    Milanesi, Marco; Torrecilha, Rafaela B. P.; Carmo, Adriana S.; Neves, Haroldo H. R.; Carvalheiro, Roberto; Ajmone-Marsan, Paolo; Sonstegard, Tad S.; Sölkner, Johann; Contreras-Castillo, Carmen J.; Garcia, José F.

    2016-01-01

    Two complementary methods, namely Multi-Trait Meta-Analysis and Versatile Gene-Based Test for Genome-wide Association Studies (VEGAS), were used to identify putative pleiotropic genes affecting carcass traits in Bos indicus (Nellore) cattle. The genotypic data comprised over 777,000 single-nucleotide polymorphism markers scored in 995 bulls, and the phenotypic data included deregressed breeding values (dEBV) for weight measurements at birth, weaning and yearling, as well visual scores taken at weaning and yearling for carcass finishing precocity, conformation and muscling. Both analyses pointed to the pleomorphic adenoma gene 1 (PLAG1) as a major pleiotropic gene. VEGAS analysis revealed 224 additional candidates. From these, 57 participated, together with PLAG1, in a network involved in the modulation of the function and expression of IGF1 (insulin like growth factor 1), IGF2 (insulin like growth factor 2), GH1 (growth hormone 1), IGF1R (insulin like growth factor 1 receptor) and GHR (growth hormone receptor), suggesting that those pleiotropic genes operate as satellite regulators of the growth pathway. PMID:27410030

  10. Transgenic rice expressing Allium sativum leaf lectin with enhanced resistance against sap-sucking insect pests.

    PubMed

    Saha, Prasenjit; Majumder, Pralay; Dutta, Indrajit; Ray, Tui; Roy, S C; Das, Sampa

    2006-05-01

    Mannose binding Allium sativum leaf agglutinin (ASAL) has been shown to be antifeedant and insecticidal against sap-sucking insects. In the present investigation, ASAL coding sequence was expressed under the control of CaMV35S promoter in a chimeric gene cassette containing plant selection marker, hpt and gusA reporter gene of pCAMBIA1301 binary vector in an elite indica rice cv. IR64. Many fertile transgenic plants were generated using scutellar calli as initial explants through Agrobacterium-mediated transformation technology. GUS activity was observed in selected calli and in mature plants. Transformation frequency was calculated to be approximately 12.1%+/-0.351 (mean +/- SE). Southern blot analyses revealed the integration of ASAL gene into rice genome with a predominant single copy insertion. Transgene localization was detected on chromosomes of transformed plants using PRINS and C-PRINS techniques. Northern and western blot analyses determined the expression of transgene in transformed lines. ELISA analyses estimated ASAL expression up to 0.72 and 0.67% of total soluble protein in T0 and T1 plants, respectively. Survival and fecundity of brown planthopper and green leafhopper were reduced to 36% (P < 0.01), 32% (P < 0.05) and 40.5, 29.5% (P < 0.001), respectively, when tested on selected plants in comparison to control plants. Specific binding of expressed ASAL to receptor proteins of insect gut was analysed. Analysis of T1 progenies confirmed the inheritance of the transgenes. Thus, ASAL promises to be a potential component in insect resistance rice breeding programme.

  11. Polymorphisms in the Myostatin-1 gene and their association with growth traits in Ancherythroculter nigrocauda

    NASA Astrophysics Data System (ADS)

    Sun, Yanhong; Li, Qing; Wang, Guiying; Zhu, Dongmei; Chen, Jian; Li, Pei; Tong, Jingou

    2017-05-01

    Myostatin ( MSTN) is a member of the transforming growth factor-β gene superfamily that negatively regulates skeletal muscle development and growth. In the present study, partial genomic fragments of Myostatin-1 ( MSTN-1) in two commercial hatchery populations of Ancherythroculter nigrocauda, an economically important freshwater fish, were screened for single nucleotide polymorphisms (SNPs) and then genotyped by direct sequencing of PCR products. Five SNPs were identified in intron 1 and exon 2, including a non-synonymous mutation causing an amino acid change (Val to Ile) at position 180. Association analyses based on 300 individuals revealed that the g.1129T>C SNP locus was significantly associated with total length (TL), body length (BL), body height (BH) and body weight (BW) in 6- and 18-month-old populations, while the g.1289G>A locus was significantly associated with BH and BW in the 6-month-old population. Haplotype analyses revealed that fish with the genotype combinations TC/TC or TC/GA showed better growth performance. Our results suggest that g.1129T>C and g.1289G>A have positive effects on growth traits and may be candidate gene markers for marker-assisted selection in A. nigrocauda.

  12. Isolation driven divergence: speciation in a widespread North American songbird (Aves: Certhiidae)

    PubMed Central

    Manthey, Joseph D.; Klicka, John; Spellman, Garth M.

    2011-01-01

    Lineage, or true “species,” trees may differ from gene trees because of stochastic processes in molecular evolution leading to gene-tree heterogeneity. Problems with inferring species trees due to excessive incomplete lineage sorting may be exacerbated in lineages with rapid diversification or recent divergences necessitating the use of multiple loci and individuals. Many recent multilocus studies that investigate divergence times identify lineage splitting to be more recent than single locus studies, forcing the revision of biogeographic scenarios driving divergence. Here we use 21 nuclear loci from regional populations to reevaluate hypotheses identified in an mtDNA phylogeographic study of the Brown Creeper (Certhia americana), as well as identify processes driving divergence. Nuclear phylogeographic analyses identified hierarchical genetic structure, supporting a basal split at roughly 32°N latitude, splitting northern and southern populations, with mixed patterns of genealogical concordance and discordance between datasets within the major lineages. Coalescent-based analyses identify isolation, with little to no gene flow, as the primary driver of divergence between lineages. Recent isolation appears to have caused genetic bottlenecks in populations in the Sierra Madre Oriental and coastal mountain ranges of California, which may be targets for conservation concerns. PMID:21933295

  13. Genealogical analyses of multiple loci of litostomatean ciliates (Protista, Ciliophora, Litostomatea)

    PubMed Central

    Vd’ačný, Peter; Bourland, William A.; Orsi, William; Epstein, Slava S.; Foissner, Wilhelm

    2012-01-01

    The class Litostomatea is a highly diverse ciliate taxon comprising hundreds of free-living and endocommensal species. However, their traditional morphology-based classification conflicts with 18S rRNA gene phylogenies indicating (1) a deep bifurcation of the Litostomatea into Rhynchostomatia and Haptoria + Trichostomatia, and (2) body polarization and simplification of the oral apparatus as main evolutionary trends in the Litostomatea. To test whether 18S rRNA molecules provide a suitable proxy for litostomatean evolutionary history, we used eighteen new ITS1-5.8S rRNA-ITS2 region sequences from various free-living litostomatean orders. These single- and multiple-locus analyses are in agreement with previous 18S rRNA gene phylogenies, supporting that both 18S rRNA gene and ITS region sequences are effective tools for resolving phylogenetic relationships among the litostomateans. Despite insertions, deletions and mutational saturations in the ITS region, the present study shows that ITS1 and ITS2 molecules can be used to infer phylogenetic relationships not only at species level but also at higher taxonomic ranks when their secondary structure information is utilized to aid alignment. PMID:22789763

  14. Genealogical analyses of multiple loci of litostomatean ciliates (Protista, Ciliophora, Litostomatea).

    PubMed

    Vd'ačný, Peter; Bourland, William A; Orsi, William; Epstein, Slava S; Foissner, Wilhelm

    2012-11-01

    The class Litostomatea is a highly diverse ciliate taxon comprising hundreds of free-living and endocommensal species. However, their traditional morphology-based classification conflicts with 18S rRNA gene phylogenies indicating (1) a deep bifurcation of the Litostomatea into Rhynchostomatia and Haptoria+Trichostomatia, and (2) body polarization and simplification of the oral apparatus as main evolutionary trends in the Litostomatea. To test whether 18S rRNA molecules provide a suitable proxy for litostomatean evolutionary history, we used eighteen new ITS1-5.8S rRNA-ITS2 region sequences from various free-living litostomatean orders. These single- and multiple-locus analyses are in agreement with previous 18S rRNA gene phylogenies, supporting that both 18S rRNA gene and ITS region sequences are effective tools for resolving phylogenetic relationships among the litostomateans. Despite insertions, deletions and mutational saturations in the ITS region, the present study shows that ITS1 and ITS2 molecules can be used to infer phylogenetic relationships not only at species level but also at higher taxonomic ranks when their secondary structure information is utilized to aid alignment. Copyright © 2012 Elsevier Inc. All rights reserved.

  15. A non-stop S-antigen gene mutation is associated with late onset hereditary retinal degeneration in dogs

    PubMed Central

    Jordan, Julie Ann; Aguirre, Gustavo D.; Acland, Gregory M.

    2013-01-01

    Purpose To identify the causative mutation of canine progressive retinal atrophy (PRA) segregating as an adult onset autosomal recessive disorder in the Basenji breed of dog. Methods Basenji dogs were ascertained for the PRA phenotype by clinical ophthalmoscopic examination. Blood samples from six affected cases and three nonaffected controls were collected, and DNA extraction was used for a genome-wide association study using the canine HD Illumina single nucleotide polymorphism (SNP) array and PLINK. Positional candidate genes identified within the peak association signal region were evaluated. Results The highest -Log10(P) value of 4.65 was obtained for 12 single nucleotide polymorphisms on three chromosomes. Homozygosity and linkage disequilibrium analyses favored one chromosome, CFA25, and screening of the S-antigen (SAG) gene identified a non-stop mutation (c.1216T>C), which would result in the addition of 25 amino acids (p.*405Rext*25). Conclusions Identification of this non-stop SAG mutation in dogs affected with retinal degeneration establishes this canine disease as orthologous to Oguchi disease and SAG-associated retinitis pigmentosa in humans, and offers opportunities for genetic therapeutic intervention. PMID:24019744

  16. Congenital Hypogonadotropic Hypogonadism during Childhood: Presentation and Genetic Analyses in 46 Boys

    PubMed Central

    Vizeneux, Audrey; Hilfiger, Aude; Bouligand, Jérôme; Pouillot, Monique; Brailly-Tabard, Sylvie; Bashamboo, Anu; McElreavey, Ken; Brauner, Raja

    2013-01-01

    Background The majority of the patients reported with mutations in isolated hypogonadotropic hypogonadism (HH) are adults. We analysed the presentation and the plasma inhibin B and anti-müllerian hormone (AMH) concentrations during childhood and adolescence, and compared them to the genetic results. Methods This was a retrospective, single-center study of 46 boys with HH. Results Fourteen (30.4%) had Kallmann syndrome (KS), 4 (8.7%) had CHARGE syndrome and 28 (60.9%) had HH without olfaction deficit nor olfactive bulb hypoplasia. Eighteen (39%) had an associated malformation or syndromes. At diagnosis, 22 (47.8%) boys were aged

  17. Mitochondrial comparative genomics and phylogenetic signal assessment of mtDNA among arbuscular mycorrhizal fungi.

    PubMed

    Nadimi, Maryam; Daubois, Laurence; Hijri, Mohamed

    2016-05-01

    Mitochondrial (mt) genes, such as cytochrome C oxidase genes (cox), have been widely used for barcoding in many groups of organisms, although this approach has been less powerful in the fungal kingdom due to the rapid evolution of their mt genomes. The use of mt genes in phylogenetic studies of Dikarya has been met with success, while early diverging fungal lineages remain less studied, particularly the arbuscular mycorrhizal fungi (AMF). Advances in next-generation sequencing have substantially increased the number of publically available mtDNA sequences for the Glomeromycota. As a result, comparison of mtDNA across key AMF taxa can now be applied to assess the phylogenetic signal of individual mt coding genes, as well as concatenated subsets of coding genes. Here we show comparative analyses of publically available mt genomes of Glomeromycota, augmented with two mtDNA genomes that were newly sequenced for this study (Rhizophagus irregularis DAOM240159 and Glomus aggregatum DAOM240163), resulting in 16 complete mtDNA datasets. R. irregularis isolate DAOM240159 and G. aggregatum isolate DAOM240163 showed mt genomes measuring 72,293bp and 69,505bp with G+C contents of 37.1% and 37.3%, respectively. We assessed the phylogenies inferred from single mt genes and complete sets of coding genes, which are referred to as "supergenes" (16 concatenated coding genes), using Shimodaira-Hasegawa tests, in order to identify genes that best described AMF phylogeny. We found that rnl, nad5, cox1, and nad2 genes, as well as concatenated subset of these genes, provided phylogenies that were similar to the supergene set. This mitochondrial genomic analysis was also combined with principal coordinate and partitioning analyses, which helped to unravel certain evolutionary relationships in the Rhizophagus genus and for G. aggregatum within the Glomeromycota. We showed evidence to support the position of G. aggregatum within the R. irregularis 'species complex'. Copyright © 2016 Elsevier Inc. All rights reserved.

  18. Comparative transcriptomics of elasmobranchs and teleosts highlight important processes in adaptive immunity and regional endothermy.

    PubMed

    Marra, Nicholas J; Richards, Vincent P; Early, Angela; Bogdanowicz, Steve M; Pavinski Bitar, Paulina D; Stanhope, Michael J; Shivji, Mahmood S

    2017-01-30

    Comparative genomic and/or transcriptomic analyses involving elasmobranchs remain limited, with genome level comparisons of the elasmobranch immune system to that of higher vertebrates, non-existent. This paper reports a comparative RNA-seq analysis of heart tissue from seven species, including four elasmobranchs and three teleosts, focusing on immunity, but concomitantly seeking to identify genetic similarities shared by the two lamnid sharks and the single billfish in our study, which could be linked to convergent evolution of regional endothermy. Across seven species, we identified an average of 10,877 Swiss-Prot annotated genes from an average of 32,474 open reading frames within each species' heart transcriptome. About half of these genes were shared between all species while the remainder included functional differences between our groups of interest (elasmobranch vs. teleost and endotherms vs. ectotherms) as revealed by Gene Ontology (GO) and selection analyses. A repeatedly represented functional category, in both the uniquely expressed elasmobranch genes (total of 259) and the elasmobranch GO enrichment results, involved antibody-mediated immunity, either in the recruitment of immune cells (Fc receptors) or in antigen presentation, including such terms as "antigen processing and presentation of exogenous peptide antigen via MHC class II", and such genes as MHC class II, HLA-DPB1. Molecular adaptation analyses identified three genes in elasmobranchs with a history of positive selection, including legumain (LGMN), a gene with roles in both innate and adaptive immunity including producing antigens for presentation by MHC class II. Comparisons between the endothermic and ectothermic species revealed an enrichment of GO terms associated with cardiac muscle contraction in endotherms, with 19 genes expressed solely in endotherms, several of which have significant roles in lipid and fat metabolism. This collective comparative evidence provides the first multi-taxa transcriptomic-based perspective on differences between elasmobranchs and teleosts, and suggests various unique features associated with the adaptive immune system of elasmobranchs, pointing in particular to the potential importance of MHC Class II. This in turn suggests that expanded comparative work involving additional tissues, as well as genome sequencing of multiple elasmobranch species would be productive in elucidating the regulatory and genome architectural hallmarks of elasmobranchs.

  19. Calcium-activated potassium (BK) channels are encoded by duplicate slo1 genes in teleost fishes.

    PubMed

    Rohmann, Kevin N; Deitcher, David L; Bass, Andrew H

    2009-07-01

    Calcium-activated, large conductance potassium (BK) channels in tetrapods are encoded by a single slo1 gene, which undergoes extensive alternative splicing. Alternative splicing generates a high level of functional diversity in BK channels that contributes to the wide range of frequencies electrically tuned by the inner ear hair cells of many tetrapods. To date, the role of BK channels in hearing among teleost fishes has not been investigated at the molecular level, although teleosts account for approximately half of all extant vertebrate species. We identified slo1 genes in teleost and nonteleost fishes using polymerase chain reaction and genetic sequence databases. In contrast to tetrapods, all teleosts examined were found to express duplicate slo1 genes in the central nervous system, whereas nonteleosts that diverged prior to the teleost whole-genome duplication event express a single slo1 gene. Phylogenetic analyses further revealed that whereas other slo1 duplicates were the result of a single duplication event, an independent duplication occurred in a basal teleost (Anguilla rostrata) following the slo1 duplication in teleosts. A third, independent slo1 duplication (autotetraploidization) occurred in salmonids. Comparison of teleost slo1 genomic sequences to their tetrapod orthologue revealed a reduced number of alternative splice sites in both slo1 co-orthologues. For the teleost Porichthys notatus, a focal study species that vocalizes with maximal spectral energy in the range electrically tuned by BK channels in the inner ear, peripheral tissues show the expression of either one (e.g., vocal muscle) or both (e.g., inner ear) slo1 paralogues with important implications for both auditory and vocal physiology. Additional loss of expression of one slo1 paralogue in nonneural tissues in P. notatus suggests that slo1 duplicates were retained via subfunctionalization. Together, the results predict that teleost fish achieve a diversity of BK channel subfunction via gene duplication, rather than increased alternative splicing as witnessed for the tetrapod and invertebrate orthologue.

  20. SNPing Away at Complex Diseases: Analysis of Single-Nucleotide Polymorphisms around APOE in Alzheimer Disease

    PubMed Central

    Martin, Eden R.; Lai, Eric H.; Gilbert, John R.; Rogala, Allison R.; Afshari, A. J.; Riley, John; Finch, K. L.; Stevens, J. F.; Livak, K. J.; Slotterbeck, Brandon D.; Slifer, Susan H.; Warren, Liling L.; Conneally, P. Michael; Schmechel, Donald E.; Purvis, Ian; Pericak-Vance, Margaret A.; Roses, Allen D.; Vance, Jeffery M.

    2000-01-01

    There has been great interest in the prospects of using single-nucleotide polymorphisms (SNPs) in the search for complex disease genes, and several initiatives devoted to the identification and mapping of SNPs throughout the human genome are currently underway. However, actual data investigating the use of SNPs for identification of complex disease genes are scarce. To begin to look at issues surrounding the use of SNPs in complex disease studies, we have initiated a collaborative SNP mapping study around APOE, the well-established susceptibility gene for late-onset Alzheimer disease (AD). Sixty SNPs in a 1.5-Mb region surrounding APOE were genotyped in samples of unrelated cases of AD, in controls, and in families with AD. Standard tests were conducted to look for association of SNP alleles with AD, in cases and controls. We also used family-based association analyses, including recently developed methods to look for haplotype association. Evidence of association (P⩽.05) was identified for 7 of 13 SNPs, including the APOE-4 polymorphism, spanning 40 kb on either side of APOE. As expected, very strong evidence for association with AD was seen for the APOE-4 polymorphism, as well as for two other SNPs that lie <16 kb from APOE. Haplotype analysis using family data increased significance over that seen in single-locus tests for some of the markers, and, for these data, improved localization of the gene. Our results demonstrate that associations can be detected at SNPs near a complex disease gene. We found that a high density of markers will be necessary in order to have a good chance of including SNPs with detectable levels of allelic association with the disease mutation, and statistical analysis based on haplotypes can provide additional information with respect to tests of significance and fine localization of complex disease genes. PMID:10869235

  1. Calcium-Activated Potassium (BK) Channels Are Encoded by Duplicate slo1 Genes in Teleost Fishes

    PubMed Central

    Deitcher, David L.; Bass, Andrew H.

    2009-01-01

    Calcium-activated, large conductance potassium (BK) channels in tetrapods are encoded by a single slo1 gene, which undergoes extensive alternative splicing. Alternative splicing generates a high level of functional diversity in BK channels that contributes to the wide range of frequencies electrically tuned by the inner ear hair cells of many tetrapods. To date, the role of BK channels in hearing among teleost fishes has not been investigated at the molecular level, although teleosts account for approximately half of all extant vertebrate species. We identified slo1 genes in teleost and nonteleost fishes using polymerase chain reaction and genetic sequence databases. In contrast to tetrapods, all teleosts examined were found to express duplicate slo1 genes in the central nervous system, whereas nonteleosts that diverged prior to the teleost whole-genome duplication event express a single slo1 gene. Phylogenetic analyses further revealed that whereas other slo1 duplicates were the result of a single duplication event, an independent duplication occurred in a basal teleost (Anguilla rostrata) following the slo1 duplication in teleosts. A third, independent slo1 duplication (autotetraploidization) occurred in salmonids. Comparison of teleost slo1 genomic sequences to their tetrapod orthologue revealed a reduced number of alternative splice sites in both slo1 co-orthologues. For the teleost Porichthys notatus, a focal study species that vocalizes with maximal spectral energy in the range electrically tuned by BK channels in the inner ear, peripheral tissues show the expression of either one (e.g., vocal muscle) or both (e.g., inner ear) slo1 paralogues with important implications for both auditory and vocal physiology. Additional loss of expression of one slo1 paralogue in nonneural tissues in P. notatus suggests that slo1 duplicates were retained via subfunctionalization. Together, the results predict that teleost fish achieve a diversity of BK channel subfunction via gene duplication, rather than increased alternative splicing as witnessed for the tetrapod and invertebrate orthologue. PMID:19321796

  2. SNPing away at complex diseases: analysis of single-nucleotide polymorphisms around APOE in Alzheimer disease.

    PubMed

    Martin, E R; Lai, E H; Gilbert, J R; Rogala, A R; Afshari, A J; Riley, J; Finch, K L; Stevens, J F; Livak, K J; Slotterbeck, B D; Slifer, S H; Warren, L L; Conneally, P M; Schmechel, D E; Purvis, I; Pericak-Vance, M A; Roses, A D; Vance, J M

    2000-08-01

    There has been great interest in the prospects of using single-nucleotide polymorphisms (SNPs) in the search for complex disease genes, and several initiatives devoted to the identification and mapping of SNPs throughout the human genome are currently underway. However, actual data investigating the use of SNPs for identification of complex disease genes are scarce. To begin to look at issues surrounding the use of SNPs in complex disease studies, we have initiated a collaborative SNP mapping study around APOE, the well-established susceptibility gene for late-onset Alzheimer disease (AD). Sixty SNPs in a 1.5-Mb region surrounding APOE were genotyped in samples of unrelated cases of AD, in controls, and in families with AD. Standard tests were conducted to look for association of SNP alleles with AD, in cases and controls. We also used family-based association analyses, including recently developed methods to look for haplotype association. Evidence of association (P

  3. Allelic variation contributes to bacterial host specificity

    DOE PAGES

    Yue, Min; Han, Xiangan; Masi, Leon De; ...

    2015-10-30

    Understanding the molecular parameters that regulate cross-species transmission and host adaptation of potential pathogens is crucial to control emerging infectious disease. Although microbial pathotype diversity is conventionally associated with gene gain or loss, the role of pathoadaptive nonsynonymous single-nucleotide polymorphisms (nsSNPs) has not been systematically evaluated. Here, our genome-wide analysis of core genes within Salmonella enterica serovar Typhimurium genomes reveals a high degree of allelic variation in surface-exposed molecules, including adhesins that promote host colonization. Subsequent multinomial logistic regression, MultiPhen and Random Forest analyses of known/suspected adhesins from 580 independent Typhimurium isolates identifies distinct host-specific nsSNP signatures. Moreover, population andmore » functional analyses of host-associated nsSNPs for FimH, the type 1 fimbrial adhesin, highlights the role of key allelic residues in host-specific adherence in vitro. In conclusion, together, our data provide the first concrete evidence that functional differences between allelic variants of bacterial proteins likely contribute to pathoadaption to diverse hosts.« less

  4. Fine-scale mapping of a locus for severe bipolar mood disorder on chromosome 18p11.3 in the Costa Rican population

    PubMed Central

    McInnes, L. Alison; Service, Susan K.; Reus, Victor I.; Barnes, Glenn; Charlat, Olga; Jawahar, Satya; Lewitzky, Steve; Yang, Qing; Duong, Quyen; Spesny, Mitzi; Araya, Carmen; Araya, Xinia; Gallegos, Alvaro; Meza, Luis; Molina, Julio; Ramirez, Rolando; Mendez, Roxana; Silva, Sandra; Fournier, Eduardo; Batki, Steven L.; Mathews, Carol A.; Neylan, Thomas; Glatt, Charles E.; Escamilla, Michael A.; Luo, David; Gajiwala, Paresh; Song, Terry; Crook, Stephen; Nguyen, Jasmine B.; Roche, Erin; Meyer, Joanne M.; Leon, Pedro; Sandkuijl, Lodewijk A.; Freimer, Nelson B.; Chen, Hong

    2001-01-01

    We have searched for genes predisposing to bipolar disorder (BP) by studying individuals with the most extreme form of the affected phenotype, BP-I, ascertained from the genetically isolated population of the Central Valley of Costa Rica (CVCR). The results of a previous linkage analysis on two extended CVCR BP-I pedigrees, CR001 and CR004, and of linkage disequilibrium (LD) analyses of a CVCR population sample of BP-I patients implicated a candidate region on 18p11.3. We further investigated this region by creating a physical map and developing 4 new microsatellite and 26 single-nucleotide polymorphism markers for typing in the pedigree and population samples. We report the results of fine-scale association analyses in the population sample, as well as evaluation of haplotypes in pedigree CR001. Our results suggest a candidate region containing six genes but also highlight the complexities of LD mapping of common disorders. PMID:11572994

  5. Embryonic domains of the aorta derived from diverse origins exhibit distinct properties that converge into a common phenotype in the adult

    PubMed Central

    Pfaltzgraff, Elise R.; Shelton, Elaine L.; Galindo, Cristi L.; Nelms, Brian L.; Hooper, Christopher W.; Poole, Stanley D.; Labosky, Patricia A.; Bader, David M.; Reese, Jeff

    2014-01-01

    Vascular smooth muscle cells (VSMCs) are derived from distinct embryonic origins. Vessels originating from differing smooth muscle cell populations have distinct vascular and pathological properties involving calcification, atherosclerosis, and structural defects such as aneurysm and coarctation. We hypothesized that domains within a single vessel, such as the aorta, vary in phenotype based on embryonic origin. Gene profiling and myographic analyses demonstrated that embryonic ascending and descending aortic domains exhibited distinct phenotypes. In vitro analyses demonstrated that VSMCs from each region were dissimilar in terms of cytoskeletal and migratory properties, and retention of different gene expression patterns. Using the same analysis, we found that these same two domains are indistinguishable in the adult vessel. Our data demonstrate that VSMCs from different embryonic origins are functionally distinct in the embryonic mouse, but converge to assume a common phenotype in the aorta of healthy adults. These findings have fundamental implications for aortic development, function and disease progression. PMID:24508561

  6. The whole-genome landscape of medulloblastoma subtypes.

    PubMed

    Northcott, Paul A; Buchhalter, Ivo; Morrissy, A Sorana; Hovestadt, Volker; Weischenfeldt, Joachim; Ehrenberger, Tobias; Gröbner, Susanne; Segura-Wang, Maia; Zichner, Thomas; Rudneva, Vasilisa A; Warnatz, Hans-Jörg; Sidiropoulos, Nikos; Phillips, Aaron H; Schumacher, Steven; Kleinheinz, Kortine; Waszak, Sebastian M; Erkek, Serap; Jones, David T W; Worst, Barbara C; Kool, Marcel; Zapatka, Marc; Jäger, Natalie; Chavez, Lukas; Hutter, Barbara; Bieg, Matthias; Paramasivam, Nagarajan; Heinold, Michael; Gu, Zuguang; Ishaque, Naveed; Jäger-Schmidt, Christina; Imbusch, Charles D; Jugold, Alke; Hübschmann, Daniel; Risch, Thomas; Amstislavskiy, Vyacheslav; Gonzalez, Francisco German Rodriguez; Weber, Ursula D; Wolf, Stephan; Robinson, Giles W; Zhou, Xin; Wu, Gang; Finkelstein, David; Liu, Yanling; Cavalli, Florence M G; Luu, Betty; Ramaswamy, Vijay; Wu, Xiaochong; Koster, Jan; Ryzhova, Marina; Cho, Yoon-Jae; Pomeroy, Scott L; Herold-Mende, Christel; Schuhmann, Martin; Ebinger, Martin; Liau, Linda M; Mora, Jaume; McLendon, Roger E; Jabado, Nada; Kumabe, Toshihiro; Chuah, Eric; Ma, Yussanne; Moore, Richard A; Mungall, Andrew J; Mungall, Karen L; Thiessen, Nina; Tse, Kane; Wong, Tina; Jones, Steven J M; Witt, Olaf; Milde, Till; Von Deimling, Andreas; Capper, David; Korshunov, Andrey; Yaspo, Marie-Laure; Kriwacki, Richard; Gajjar, Amar; Zhang, Jinghui; Beroukhim, Rameen; Fraenkel, Ernest; Korbel, Jan O; Brors, Benedikt; Schlesner, Matthias; Eils, Roland; Marra, Marco A; Pfister, Stefan M; Taylor, Michael D; Lichter, Peter

    2017-07-19

    Current therapies for medulloblastoma, a highly malignant childhood brain tumour, impose debilitating effects on the developing child, and highlight the need for molecularly targeted treatments with reduced toxicity. Previous studies have been unable to identify the full spectrum of driver genes and molecular processes that operate in medulloblastoma subgroups. Here we analyse the somatic landscape across 491 sequenced medulloblastoma samples and the molecular heterogeneity among 1,256 epigenetically analysed cases, and identify subgroup-specific driver alterations that include previously undiscovered actionable targets. Driver mutations were confidently assigned to most patients belonging to Group 3 and Group 4 medulloblastoma subgroups, greatly enhancing previous knowledge. New molecular subtypes were differentially enriched for specific driver events, including hotspot in-frame insertions that target KBTBD4 and 'enhancer hijacking' events that activate PRDM6. Thus, the application of integrative genomics to an extensive cohort of clinical samples derived from a single childhood cancer entity revealed a series of cancer genes and biologically relevant subtype diversity that represent attractive therapeutic targets for the treatment of patients with medulloblastoma.

  7. BacillOndex: an integrated data resource for systems and synthetic biology.

    PubMed

    Misirli, Goksel; Wipat, Anil; Mullen, Joseph; James, Katherine; Pocock, Matthew; Smith, Wendy; Allenby, Nick; Hallinan, Jennifer S

    2013-04-10

    BacillOndex is an extension of the Ondex data integration system, providing a semantically annotated, integrated knowledge base for the model Gram-positive bacterium Bacillus subtilis. This application allows a user to mine a variety of B. subtilis data sources, and analyse the resulting integrated dataset, which contains data about genes, gene products and their interactions. The data can be analysed either manually, by browsing using Ondex, or computationally via a Web services interface. We describe the process of creating a BacillOndex instance, and describe the use of the system for the analysis of single nucleotide polymorphisms in B. subtilis Marburg. The Marburg strain is the progenitor of the widely-used laboratory strain B. subtilis 168. We identified 27 SNPs with predictable phenotypic effects, including genetic traits for known phenotypes. We conclude that BacillOndex is a valuable tool for the systems-level investigation of, and hypothesis generation about, this important biotechnology workhorse. Such understanding contributes to our ability to construct synthetic genetic circuits in this organism.

  8. BacillOndex: An Integrated Data Resource for Systems and Synthetic Biology.

    PubMed

    Misirli, Goksel; Wipat, Anil; Mullen, Joseph; James, Katherine; Pocock, Matthew; Smith, Wendy; Allenby, Nick; Hallinan, Jennifer S

    2013-06-01

    BacillOndex is an extension of the Ondex data integration system, providing a semantically annotated, integrated knowledge base for the model Gram-positive bacterium Bacillus subtilis. This application allows a user to mine a variety of B. subtilis data sources, and analyse the resulting integrated dataset, which contains data about genes, gene products and their interactions. The data can be analysed either manually, by browsing using Ondex, or computationally via a Web services interface. We describe the process of creating a BacillOndex instance, and describe the use of the system for the analysis of single nucleotide polymorphisms in B. subtilis Marburg. The Marburg strain is the progenitor of the widely-used laboratory strain B. subtilis 168. We identified 27 SNPs with predictable phenotypic effects, including genetic traits for known phenotypes. We conclude that BacillOndex is a valuable tool for the systems-level investigation of, and hypothesis generation about, this important biotechnology workhorse. Such understanding contributes to our ability to construct synthetic genetic circuits in this organism.

  9. Genetic diversity in Treponema pallidum: implications for pathogenesis, evolution and molecular diagnostics of syphilis and yaws

    PubMed Central

    Šmajs, David; Norris, Steven J.; Weinstock, George M.

    2013-01-01

    Pathogenic uncultivable treponemes, similar to syphilis-causing Treponema pallidum subspecies pallidum, include T. pallidum ssp. pertenue, T. pallidum ssp. endemicum and Treponema carateum, which cause yaws, bejel and pinta, respectively. Genetic analyses of these pathogens revealed striking similarity among these bacteria and also a high degree of similarity to the rabbit pathogen, T. paraluiscuniculi, a treponeme not infectious to humans. Genome comparisons between pallidum and non-pallidum treponemes revealed genes with potential involvement in human infectivity, whereas comparisons between pallidum and pertenue treponemes identified genes possibly involved in the high invasivity of syphilis treponemes. Genetic variability within syphilis strains is considered as the basis of syphilis molecular epidemiology with potential to detect more virulent strains, whereas genetic variability within a single strain is related to its ability to elude the immune system of the host. Genome analyses also shed light on treponemal evolution and on chromosomal targets for molecular diagnostics of treponemal infections. PMID:22198325

  10. Noncoding Genomics in Gastric Cancer and the Gastric Precancerous Cascade: Pathogenesis and Biomarkers

    PubMed Central

    Garcia-Bloj, Benjamin; Fry, Jacqueline; Wichmann, Ignacio

    2015-01-01

    Gastric cancer is the fifth most common cancer and the third leading cause of cancer-related death, whose patterns vary among geographical regions and ethnicities. It is a multifactorial disease, and its development depends on infection by Helicobacter pylori (H. pylori) and Epstein-Barr virus (EBV), host genetic factors, and environmental factors. The heterogeneity of the disease has begun to be unraveled by a comprehensive mutational evaluation of primary tumors. The low-abundance of mutations suggests that other mechanisms participate in the evolution of the disease, such as those found through analyses of noncoding genomics. Noncoding genomics includes single nucleotide polymorphisms (SNPs), regulation of gene expression through DNA methylation of promoter sites, miRNAs, other noncoding RNAs in regulatory regions, and other topics. These processes and molecules ultimately control gene expression. Potential biomarkers are appearing from analyses of noncoding genomics. This review focuses on noncoding genomics and potential biomarkers in the context of gastric cancer and the gastric precancerous cascade. PMID:26379360

  11. Haplotype diversity of the myostatin gene among beef cattle breeds

    PubMed Central

    Dunner, Susana; Miranda, M Eugenia; Amigues, Yves; Cañón, Javier; Georges, Michel; Hanset, Roger; Williams, John; Ménissier, François

    2003-01-01

    A total of 678 individuals from 28 European bovine breeds were both phenotyped and analysed at the myostatin locus by the Single Strand Conformation Polymorphism (SSCP) method. Seven new mutations were identified which contribute to the high polymorphism (1 SNP every 100 bp) present in this small gene; twenty haplotypes were described and a genotyping method was set up using the Oligonucleotide Ligation Assay (OLA) method. Some haplotypes appeared to be exclusive to a particular breed; this was the case for 5 in the Charolaise (involving mutation Q204X) and 7 in the Maine-Anjou (involving mutation E226X). The relationships between the different haplotypes were studied, thus allowing to test the earlier hypothesis on the origin of muscular hypertrophy in Europe: muscular hypertrophy (namely nt821(del11)) was mainly spread in different waves from northern Europe milk purpose populations in most breeds; however, other mutations (mostly disruptive) arose in a single breed, were highly selected and have since scarcely evolved to other populations. PMID:12605853

  12. Single cell dual adherent-suspension co-culture micro-environment for studying tumor-stromal interactions with functionally selected cancer stem-like cells.

    PubMed

    Chen, Yu-Chih; Zhang, Zhixiong; Fouladdel, Shamileh; Deol, Yadwinder; Ingram, Patrick N; McDermott, Sean P; Azizi, Ebrahim; Wicha, Max S; Yoon, Euisik

    2016-08-07

    Considerable evidence suggests that cancer stem-like cells (CSCs) are critical in tumor pathogenesis, but their rarity and transience has led to much controversy about their exact nature. Although CSCs can be functionally identified using dish-based tumorsphere assays, it is difficult to handle and monitor single cells in dish-based approaches; single cell-based microfluidic approaches offer better control and reliable single cell derived sphere formation. However, like normal stem cells, CSCs are heavily regulated by their microenvironment, requiring tumor-stromal interactions for tumorigenic and proliferative behaviors. To enable single cell derived tumorsphere formation within a stromal microenvironment, we present a dual adherent/suspension co-culture device, which combines a suspension environment for single-cell tumorsphere assays and an adherent environment for co-culturing stromal cells in close proximity by selectively patterning polyHEMA in indented microwells. By minimizing dead volume and improving cell capture efficiency, the presented platform allows for the use of small numbers of cells (<100 cells). As a proof of concept, we co-cultured single T47D (breast cancer) cells and primary cancer associated fibroblasts (CAF) on-chip for 14 days to monitor sphere formation and growth. Compared to mono-culture, co-cultured T47D have higher tumorigenic potential (sphere formation rate) and proliferation rates (larger sphere size). Furthermore, 96-multiplexed single-cell transcriptome analyses were performed to compare the gene expression of co-cultured and mono-cultured T47D cells. Phenotypic changes observed in co-culture correlated with expression changes in genes associated with proliferation, apoptotic suppression, tumorigenicity and even epithelial-to-mesechymal transition. Combining the presented platform with single cell transcriptome analysis, we successfully identified functional CSCs and investigated the phenotypic and transcriptome effects induced by tumor-stromal interactions.

  13. Huntington's Disease and its therapeutic target genes: a global functional profile based on the HD Research Crossroads database

    PubMed Central

    2012-01-01

    Background Huntington’s disease (HD) is a fatal progressive neurodegenerative disorder caused by the expansion of the polyglutamine repeat region in the huntingtin gene. Although the disease is triggered by the mutation of a single gene, intensive research has linked numerous other genes to its pathogenesis. To obtain a systematic overview of these genes, which may serve as therapeutic targets, CHDI Foundation has recently established the HD Research Crossroads database. With currently over 800 cataloged genes, this web-based resource constitutes the most extensive curation of genes relevant to HD. It provides us with an unprecedented opportunity to survey molecular mechanisms involved in HD in a holistic manner. Methods To gain a synoptic view of therapeutic targets for HD, we have carried out a variety of bioinformatical and statistical analyses to scrutinize the functional association of genes curated in the HD Research Crossroads database. In particular, enrichment analyses were performed with respect to Gene Ontology categories, KEGG signaling pathways, and Pfam protein families. For selected processes, we also analyzed differential expression, using published microarray data. Additionally, we generated a candidate set of novel genetic modifiers of HD by combining information from the HD Research Crossroads database with previous genome-wide linkage studies. Results Our analyses led to a comprehensive identification of molecular mechanisms associated with HD. Remarkably, we not only recovered processes and pathways, which have frequently been linked to HD (such as cytotoxicity, apoptosis, and calcium signaling), but also found strong indications for other potentially disease-relevant mechanisms that have been less intensively studied in the context of HD (such as the cell cycle and RNA splicing, as well as Wnt and ErbB signaling). For follow-up studies, we provide a regularly updated compendium of molecular mechanism, that are associated with HD, at http://hdtt.sysbiolab.eu Additionally, we derived a candidate set of 24 novel genetic modifiers, including histone deacetylase 3 (HDAC3), metabotropic glutamate receptor 1 (GRM1), CDK5 regulatory subunit 2 (CDK5R2), and coactivator 1ß of the peroxisome proliferator-activated receptor gamma (PPARGC1B). Conclusions The results of our study give us an intriguing picture of the molecular complexity of HD. Our analyses can be seen as a first step towards a comprehensive list of biological processes, molecular functions, and pathways involved in HD, and may provide a basis for the development of more holistic disease models and new therapeutics. PMID:22741533

  14. Transcriptional maturation of the mouse auditory forebrain.

    PubMed

    Hackett, Troy A; Guo, Yan; Clause, Amanda; Hackett, Nicholas J; Garbett, Krassimira; Zhang, Pan; Polley, Daniel B; Mirnics, Karoly

    2015-08-14

    The maturation of the brain involves the coordinated expression of thousands of genes, proteins and regulatory elements over time. In sensory pathways, gene expression profiles are modified by age and sensory experience in a manner that differs between brain regions and cell types. In the auditory system of altricial animals, neuronal activity increases markedly after the opening of the ear canals, initiating events that culminate in the maturation of auditory circuitry in the brain. This window provides a unique opportunity to study how gene expression patterns are modified by the onset of sensory experience through maturity. As a tool for capturing these features, next-generation sequencing of total RNA (RNAseq) has tremendous utility, because the entire transcriptome can be screened to index expression of any gene. To date, whole transcriptome profiles have not been generated for any central auditory structure in any species at any age. In the present study, RNAseq was used to profile two regions of the mouse auditory forebrain (A1, primary auditory cortex; MG, medial geniculate) at key stages of postnatal development (P7, P14, P21, adult) before and after the onset of hearing (~P12). Hierarchical clustering, differential expression, and functional geneset enrichment analyses (GSEA) were used to profile the expression patterns of all genes. Selected genesets related to neurotransmission, developmental plasticity, critical periods and brain structure were highlighted. An accessible repository of the entire dataset was also constructed that permits extraction and screening of all data from the global through single-gene levels. To our knowledge, this is the first whole transcriptome sequencing study of the forebrain of any mammalian sensory system. Although the data are most relevant for the auditory system, they are generally applicable to forebrain structures in the visual and somatosensory systems, as well. The main findings were: (1) Global gene expression patterns were tightly clustered by postnatal age and brain region; (2) comparing A1 and MG, the total numbers of differentially expressed genes were comparable from P7 to P21, then dropped to nearly half by adulthood; (3) comparing successive age groups, the greatest numbers of differentially expressed genes were found between P7 and P14 in both regions, followed by a steady decline in numbers with age; (4) maturational trajectories in expression levels varied at the single gene level (increasing, decreasing, static, other); (5) between regions, the profiles of single genes were often asymmetric; (6) GSEA revealed that genesets related to neural activity and plasticity were typically upregulated from P7 to adult, while those related to structure tended to be downregulated; (7) GSEA and pathways analysis of selected functional networks were not predictive of expression patterns in the auditory forebrain for all genes, reflecting regional specificity at the single gene level. Gene expression in the auditory forebrain during postnatal development is in constant flux and becomes increasingly stable with age. Maturational changes are evident at the global through single gene levels. Transcriptome profiles in A1 and MG are distinct at all ages, and differ from other brain regions. The database generated by this study provides a rich foundation for the identification of novel developmental biomarkers, functional gene pathways, and targeted studies of postnatal maturation in the auditory forebrain.

  15. Integration of multi-omics data for integrative gene regulatory network inference.

    PubMed

    Zarayeneh, Neda; Ko, Euiseong; Oh, Jung Hun; Suh, Sang; Liu, Chunyu; Gao, Jean; Kim, Donghyun; Kang, Mingon

    2017-01-01

    Gene regulatory networks provide comprehensive insights and indepth understanding of complex biological processes. The molecular interactions of gene regulatory networks are inferred from a single type of genomic data, e.g., gene expression data in most research. However, gene expression is a product of sequential interactions of multiple biological processes, such as DNA sequence variations, copy number variations, histone modifications, transcription factors, and DNA methylations. The recent rapid advances of high-throughput omics technologies enable one to measure multiple types of omics data, called 'multi-omics data', that represent the various biological processes. In this paper, we propose an Integrative Gene Regulatory Network inference method (iGRN) that incorporates multi-omics data and their interactions in gene regulatory networks. In addition to gene expressions, copy number variations and DNA methylations were considered for multi-omics data in this paper. The intensive experiments were carried out with simulation data, where iGRN's capability that infers the integrative gene regulatory network is assessed. Through the experiments, iGRN shows its better performance on model representation and interpretation than other integrative methods in gene regulatory network inference. iGRN was also applied to a human brain dataset of psychiatric disorders, and the biological network of psychiatric disorders was analysed.

  16. Epigenetic regulation of serotype expression antagonizes transcriptome dynamics in Paramecium tetraurelia

    PubMed Central

    Cheaib, Miriam; Dehghani Amirabad, Azim; Nordström, Karl J. V.; Schulz, Marcel H.; Simon, Martin

    2015-01-01

    Phenotypic variation of a single genotype is achieved by alterations in gene expression patterns. Regulation of such alterations depends on their time scale, where short-time adaptations differ from permanently established gene expression patterns maintained by epigenetic mechanisms. In the ciliate Paramecium, serotypes were described for an epigenetically controlled gene expression pattern of an individual multigene family. Paradoxically, individual serotypes can be triggered in Paramecium by alternating environments but are then stabilized by epigenetic mechanisms, thus raising the question to which extend their expression follows environmental stimuli. To characterize environmental adaptation in the context of epigenetically controlled serotype expression, we used RNA-seq to characterize transcriptomes of serotype pure cultures. The resulting vegetative transcriptome resource is first analysed for genes involved in the adaptive response to the altered environment. Secondly, we identified groups of genes that do not follow the adaptive response but show co-regulation with the epigenetically controlled serotype system, suggesting that their gene expression pattern becomes manifested by similar mechanisms. In our experimental set-up, serotype expression and the entire group of co-regulated genes were stable among environmental changes and only heat-shock genes altered expression of these gene groups. The data suggest that the maintenance of these gene expression patterns in a lineage represents epigenetically controlled robustness counteracting short-time adaptation processes. PMID:26231545

  17. High-Throughput Sequencing of Arabidopsis microRNAs: Evidence for Frequent Birth and Death of MIRNA Genes

    PubMed Central

    Fahlgren, Noah; Howell, Miya D.; Kasschau, Kristin D.; Chapman, Elisabeth J.; Sullivan, Christopher M.; Cumbie, Jason S.; Givan, Scott A.; Law, Theresa F.; Grant, Sarah R.; Dangl, Jeffery L.; Carrington, James C.

    2007-01-01

    In plants, microRNAs (miRNAs) comprise one of two classes of small RNAs that function primarily as negative regulators at the posttranscriptional level. Several MIRNA genes in the plant kingdom are ancient, with conservation extending between angiosperms and the mosses, whereas many others are more recently evolved. Here, we use deep sequencing and computational methods to identify, profile and analyze non-conserved MIRNA genes in Arabidopsis thaliana. 48 non-conserved MIRNA families, nearly all of which were represented by single genes, were identified. Sequence similarity analyses of miRNA precursor foldback arms revealed evidence for recent evolutionary origin of 16 MIRNA loci through inverted duplication events from protein-coding gene sequences. Interestingly, these recently evolved MIRNA genes have taken distinct paths. Whereas some non-conserved miRNAs interact with and regulate target transcripts from gene families that donated parental sequences, others have drifted to the point of non-interaction with parental gene family transcripts. Some young MIRNA loci clearly originated from one gene family but form miRNAs that target transcripts in another family. We suggest that MIRNA genes are undergoing relatively frequent birth and death, with only a subset being stabilized by integration into regulatory networks. PMID:17299599

  18. Integration of multi-omics data for integrative gene regulatory network inference

    PubMed Central

    Zarayeneh, Neda; Ko, Euiseong; Oh, Jung Hun; Suh, Sang; Liu, Chunyu; Gao, Jean; Kim, Donghyun

    2017-01-01

    Gene regulatory networks provide comprehensive insights and indepth understanding of complex biological processes. The molecular interactions of gene regulatory networks are inferred from a single type of genomic data, e.g., gene expression data in most research. However, gene expression is a product of sequential interactions of multiple biological processes, such as DNA sequence variations, copy number variations, histone modifications, transcription factors, and DNA methylations. The recent rapid advances of high-throughput omics technologies enable one to measure multiple types of omics data, called ‘multi-omics data’, that represent the various biological processes. In this paper, we propose an Integrative Gene Regulatory Network inference method (iGRN) that incorporates multi-omics data and their interactions in gene regulatory networks. In addition to gene expressions, copy number variations and DNA methylations were considered for multi-omics data in this paper. The intensive experiments were carried out with simulation data, where iGRN’s capability that infers the integrative gene regulatory network is assessed. Through the experiments, iGRN shows its better performance on model representation and interpretation than other integrative methods in gene regulatory network inference. iGRN was also applied to a human brain dataset of psychiatric disorders, and the biological network of psychiatric disorders was analysed. PMID:29354189

  19. Validation of miRNA genes suitable as reference genes in qPCR analyses of miRNA gene expression in Atlantic salmon (Salmo salar).

    PubMed

    Johansen, Ilona; Andreassen, Rune

    2014-12-23

    MicroRNAs (miRNAs) are an abundant class of endogenous small RNA molecules that downregulate gene expression at the post-transcriptional level. They play important roles by regulating genes that control multiple biological processes, and recent years there has been an increased interest in studying miRNA genes and miRNA gene expression. The most common method applied to study gene expression of single genes is quantitative PCR (qPCR). However, before expression of mature miRNAs can be studied robust qPCR methods (miRNA-qPCR) must be developed. This includes identification and validation of suitable reference genes. We are particularly interested in Atlantic salmon (Salmo salar). This is an economically important aquaculture species, but no reference genes dedicated for use in miRNA-qPCR methods has been validated for this species. Our aim was, therefore, to identify suitable reference genes for miRNA-qPCR methods in Salmo salar. We used a systematic approach where we utilized similar studies in other species, some biological criteria, results from deep sequencing of small RNAs and, finally, experimental validation of candidate reference genes by qPCR to identify the most suitable reference genes. Ssa-miR-25-3p was identified as most suitable single reference gene. The best combinations of two reference genes were ssa-miR-25-3p and ssa-miR-455-5p. These two genes were constitutively and stably expressed across many different tissues. Furthermore, infectious salmon anaemia did not seem to affect their expression levels. These genes were amplified with high specificity, good efficiency and the qPCR assays showed a good linearity when applying a simple cybergreen miRNA-PCR method using miRNA gene specific forward primers. We have identified suitable reference genes for miRNA-qPCR in Atlantic salmon. These results will greatly facilitate further studies on miRNA genes in this species. The reference genes identified are conserved genes that are identical in their mature sequence in many aquaculture species. Therefore, they may also be suitable as reference genes in other teleosts. Finally, the systematic approach used in our study successfully identified suitable reference genes, suggesting that this may be a useful strategy to apply in similar validation studies in other aquaculture species.

  20. A systems biology approach to investigate the effect of pH-induced gene regulation on solvent production by Clostridium acetobutylicum in continuous culture.

    PubMed

    Haus, Sylvia; Jabbari, Sara; Millat, Thomas; Janssen, Holger; Fischer, Ralf-Jörg; Bahl, Hubert; King, John R; Wolkenhauer, Olaf

    2011-01-19

    Clostridium acetobutylicum is an anaerobic bacterium which is known for its solvent-producing capabilities, namely regarding the bulk chemicals acetone and butanol, the latter being a highly efficient biofuel. For butanol production by C. acetobutylicum to be optimized and exploited on an industrial scale, the effect of pH-induced gene regulation on solvent production by C. acetobutylicum in continuous culture must be understood as fully as possible. We present an ordinary differential equation model combining the metabolic network governing solvent production with regulation at the genetic level of the enzymes required for this process. Parameterizing the model with experimental data from continuous culture, we demonstrate the influence of pH upon fermentation products: at high pH (pH 5.7) acids are the dominant product while at low pH (pH 4.5) this switches to solvents. Through steady-state analyses of the model we focus our investigations on how alteration in gene expression of C. acetobutylicum could be exploited to increase butanol yield in a continuous culture fermentation. Incorporating gene regulation into the model of solvent production by C. acetobutylicum enables an accurate representation of the pH-induced switch to solvent production to be obtained and theoretical investigations of possible synthetic-biology approaches to be pursued. Steady-state analyses suggest that, to increase butanol yield, alterations in the expression of single solvent-associated genes are insufficient; a more complex approach targeting two or more genes is required.

  1. Novel genetic associations for blood pressure identified via gene-alcohol interaction in up to 570K individuals across multiple ancestries

    PubMed Central

    Guo, Xiuqing; Franceschini, Nora; Cheng, Ching-Yu; Sim, Xueling; Vojinovic, Dina; Marten, Jonathan; Musani, Solomon K.; Li, Changwei; Schwander, Karen; Richard, Melissa A.; Noordam, Raymond; Aschard, Hugues; Bartz, Traci M.; Bielak, Lawrence F.; Dorajoo, Rajkumar; Fisher, Virginia; Hartwig, Fernando P.; Horimoto, Andrea R. V. R.; Lohman, Kurt K.; Manning, Alisa K.; Rankinen, Tuomo; Smith, Albert V.; Wojczynski, Mary K.; Alver, Maris; Boissel, Mathilde; Cai, Qiuyin; Divers, Jasmin; Gao, Chuan; Goel, Anuj; Harris, Sarah E.; He, Meian; Hsu, Fang-Chi; Jackson, Anne U.; Kähönen, Mika; Kasturiratne, Anuradhani; Komulainen, Pirjo; Kühnel, Brigitte; Laguzzi, Federica; Luan, Jian'an; Nolte, Ilja M.; Padmanabhan, Sandosh; Robino, Antonietta; Scott, Robert A.; Sofer, Tamar; Stančáková, Alena; Takeuchi, Fumihiko; Tayo, Bamidele O.; Varga, Tibor V.; Vitart, Veronique; Wang, Yajuan; Warren, Helen R.; Wen, Wanqing; Yanek, Lisa R.; Zhang, Weihua; Zhao, Jing Hua; Afaq, Saima; Amin, Najaf; Arking, Dan E.; Aung, Tin; Boerwinkle, Eric; Borecki, Ingrid; Broeckel, Ulrich; Brown, Morris; Brumat, Marco; Burke, Gregory L.; Chakravarti, Aravinda; Charumathi, Sabanayagam; Ida Chen, Yii-Der; Connell, John M.; Correa, Adolfo; de las Fuentes, Lisa; de Mutsert, Renée; de Silva, H. Janaka; Deng, Xuan; Ding, Jingzhong; Duan, Qing; Eaton, Charles B.; Ehret, Georg; Eppinga, Ruben N.; Faul, Jessica D.; Felix, Stephan B.; Forouhi, Nita G.; Forrester, Terrence; Franco, Oscar H.; Friedlander, Yechiel; Gandin, Ilaria; Gao, He; Ghanbari, Mohsen; Gigante, Bruna; Gu, C. Charles; Gu, Dongfeng; Hagenaars, Saskia P.; Hallmans, Göran; Harris, Tamara B.; He, Jiang; Heng, Chew-Kiat; Hirata, Makoto; Howard, Barbara V.; Ikram, M. Arfan; John, Ulrich; Katsuya, Tomohiro; Khor, Chiea Chuen; Kilpeläinen, Tuomas O.; Koh, Woon-Puay; Krieger, José E.; Kritchevsky, Stephen B.; Kubo, Michiaki; Kuusisto, Johanna; Lakka, Timo A.; Langefeld, Carl D.; Langenberg, Claudia; Launer, Lenore J.; Lehne, Benjamin; Lewis, Cora E.; Li, Yize; Lin, Shiow; Liu, Jianjun; Liu, Jingmin; Loh, Marie; Louie, Tin; Mägi, Reedik; McKenzie, Colin A.; Meitinger, Thomas; Milaneschi, Yuri; Milani, Lili; Mohlke, Karen L.; Momozawa, Yukihide; Nalls, Mike A.; Nelson, Christopher P.; Sotoodehnia, Nona; Norris, Jill M.; O'Connell, Jeff R.; Palmer, Nicholette D.; Perls, Thomas; Pedersen, Nancy L.; Peters, Annette; Peyser, Patricia A.; Poulter, Neil; Raffel, Leslie J.; Raitakari, Olli T.; Roll, Kathryn; Rose, Lynda M.; Rosendaal, Frits R.; Rotter, Jerome I.; Schmidt, Carsten O.; Schreiner, Pamela J.; Schupf, Nicole; Scott, William R.; Shi, Yuan; Sidney, Stephen; Sims, Mario; Sitlani, Colleen M.; Smith, Jennifer A.; Snieder, Harold; Starr, John M.; Strauch, Konstantin; Stringham, Heather M.; Tan, Nicholas Y. Q.; Tang, Hua; Taylor, Kent D.; Teo, Yik Ying; Tham, Yih Chung; Turner, Stephen T.; Uitterlinden, André G.; Vollenweider, Peter; Waldenberger, Melanie; Wang, Lihua; Wang, Ya Xing; Wei, Wen Bin; Williams, Christine; Yao, Jie; Yu, Caizheng; Yuan, Jian-Min; Zhao, Wei; Zonderman, Alan B.; Becker, Diane M.; Boehnke, Michael; Bowden, Donald W.; Chambers, John C.; Deary, Ian J.; Esko, Tõnu; Farrall, Martin; Franks, Paul W.; Freedman, Barry I.; Froguel, Philippe; Gasparini, Paolo; Gieger, Christian; Kamatani, Yoichiro; Kato, Norihiro; Kooner, Jaspal S.; Kutalik, Zoltán; Laakso, Markku; Laurie, Cathy C.; Leander, Karin; Lehtimäki, Terho; Study, Lifelines Cohort; Magnusson, Patrik K. E.; Oldehinkel, Albertine J.; Penninx, Brenda W. J. H.; Polasek, Ozren; Porteous, David J.; Rauramaa, Rainer; Samani, Nilesh J.; Scott, James; Shu, Xiao-Ou; van der Harst, Pim; Wagenknecht, Lynne E.; Watkins, Hugh; Weir, David R.; Wickremasinghe, Ananda R.; Wu, Tangchun; Zheng, Wei; Bouchard, Claude; Christensen, Kaare; Evans, Michele K.; Gudnason, Vilmundur; Horta, Bernardo L.; Kardia, Sharon L. R.; Liu, Yongmei; Pereira, Alexandre C.; Psaty, Bruce M.; Ridker, Paul M.; van Dam, Rob M.; Gauderman, W. James; Zhu, Xiaofeng; Mook-Kanamori, Dennis O.; Fornage, Myriam; Rotimi, Charles N.; Cupples, L. Adrienne; Kelly, Tanika N.; Fox, Ervin R.; Hayward, Caroline; van Duijn, Cornelia M.; Tai, E Shyong; Wong, Tien Yin; Kooperberg, Charles; Palmas, Walter; Morrison, Alanna C.; Caulfield, Mark J.; Munroe, Patricia B.; Rao, Dabeeru C.; Province, Michael A.; Levy, Daniel

    2018-01-01

    Heavy alcohol consumption is an established risk factor for hypertension; the mechanism by which alcohol consumption impact blood pressure (BP) regulation remains unknown. We hypothesized that a genome-wide association study accounting for gene-alcohol consumption interaction for BP might identify additional BP loci and contribute to the understanding of alcohol-related BP regulation. We conducted a large two-stage investigation incorporating joint testing of main genetic effects and single nucleotide variant (SNV)-alcohol consumption interactions. In Stage 1, genome-wide discovery meta-analyses in ≈131K individuals across several ancestry groups yielded 3,514 SNVs (245 loci) with suggestive evidence of association (P < 1.0 x 10−5). In Stage 2, these SNVs were tested for independent external replication in ≈440K individuals across multiple ancestries. We identified and replicated (at Bonferroni correction threshold) five novel BP loci (380 SNVs in 21 genes) and 49 previously reported BP loci (2,159 SNVs in 109 genes) in European ancestry, and in multi-ancestry meta-analyses (P < 5.0 x 10−8). For African ancestry samples, we detected 18 potentially novel BP loci (P < 5.0 x 10−8) in Stage 1 that warrant further replication. Additionally, correlated meta-analysis identified eight novel BP loci (11 genes). Several genes in these loci (e.g., PINX1, GATA4, BLK, FTO and GABBR2) have been previously reported to be associated with alcohol consumption. These findings provide insights into the role of alcohol consumption in the genetic architecture of hypertension. PMID:29912962

  2. A systems biology approach to investigate the effect of pH-induced gene regulation on solvent production by Clostridium acetobutylicum in continuous culture

    PubMed Central

    2011-01-01

    Background Clostridium acetobutylicum is an anaerobic bacterium which is known for its solvent-producing capabilities, namely regarding the bulk chemicals acetone and butanol, the latter being a highly efficient biofuel. For butanol production by C. acetobutylicum to be optimized and exploited on an industrial scale, the effect of pH-induced gene regulation on solvent production by C. acetobutylicum in continuous culture must be understood as fully as possible. Results We present an ordinary differential equation model combining the metabolic network governing solvent production with regulation at the genetic level of the enzymes required for this process. Parameterizing the model with experimental data from continuous culture, we demonstrate the influence of pH upon fermentation products: at high pH (pH 5.7) acids are the dominant product while at low pH (pH 4.5) this switches to solvents. Through steady-state analyses of the model we focus our investigations on how alteration in gene expression of C. acetobutylicum could be exploited to increase butanol yield in a continuous culture fermentation. Conclusions Incorporating gene regulation into the model of solvent production by C. acetobutylicum enables an accurate representation of the pH-induced switch to solvent production to be obtained and theoretical investigations of possible synthetic-biology approaches to be pursued. Steady-state analyses suggest that, to increase butanol yield, alterations in the expression of single solvent-associated genes are insufficient; a more complex approach targeting two or more genes is required. PMID:21247470

  3. Mutations of maturity-onset diabetes of the young (MODY) genes in Thais with early-onset type 2 diabetes mellitus.

    PubMed

    Plengvidhya, Nattachet; Boonyasrisawat, Watip; Chongjaroen, Nalinee; Jungtrakoon, Prapaporn; Sriussadaporn, Sutin; Vannaseang, Sathit; Banchuin, Napatawn; Yenchitsomanus, Pa-thai

    2009-06-01

    Six known genes responsible for maturity-onset diabetes of the young (MODY) were analysed to evaluate the prevalence of their mutations in Thai patients with MODY and early-onset type 2 diabetes. Fifty-one unrelated probands with early-onset type 2 diabetes, 21 of them fitted into classic MODY criteria, were analysed for nucleotide variations in promoters, exons, and exon-intron boundaries of six known MODY genes, including HNF-4alpha, GCK, HNF-1alpha, IPF-1, HNF-1beta, and NeuroD1/beta2, by the polymerase chain reaction-single strand conformation polymorphism (PCR-SSCP) method followed by direct DNA sequencing. Missense mutations or mutations located in regulatory region, which were absent in 130 chromosomes of non-diabetic controls, were classified as potentially pathogenic mutations. We found that mutations of the six known MODY genes account for a small proportion of classic MODY (19%) and early-onset type 2 diabetes (10%) in Thais. Five of these mutations are novel including GCK R327H, HNF-1alpha P475L, HNF-1alphaG554fsX556, NeuroD1-1972 G > A and NeuroD1 A322N. Mutations of IPF-1 and HNF-1beta were not identified in the studied probands. Mutations of the six known MODY genes may not be a major cause of MODY and early-onset type 2 diabetes in Thais. Therefore, unidentified genes await discovery in a majority of Thai patients with MODY and early-onset type 2 diabetes.

  4. Novel genetic associations for blood pressure identified via gene-alcohol interaction in up to 570K individuals across multiple ancestries.

    PubMed

    Feitosa, Mary F; Kraja, Aldi T; Chasman, Daniel I; Sung, Yun J; Winkler, Thomas W; Ntalla, Ioanna; Guo, Xiuqing; Franceschini, Nora; Cheng, Ching-Yu; Sim, Xueling; Vojinovic, Dina; Marten, Jonathan; Musani, Solomon K; Li, Changwei; Bentley, Amy R; Brown, Michael R; Schwander, Karen; Richard, Melissa A; Noordam, Raymond; Aschard, Hugues; Bartz, Traci M; Bielak, Lawrence F; Dorajoo, Rajkumar; Fisher, Virginia; Hartwig, Fernando P; Horimoto, Andrea R V R; Lohman, Kurt K; Manning, Alisa K; Rankinen, Tuomo; Smith, Albert V; Tajuddin, Salman M; Wojczynski, Mary K; Alver, Maris; Boissel, Mathilde; Cai, Qiuyin; Campbell, Archie; Chai, Jin Fang; Chen, Xu; Divers, Jasmin; Gao, Chuan; Goel, Anuj; Hagemeijer, Yanick; Harris, Sarah E; He, Meian; Hsu, Fang-Chi; Jackson, Anne U; Kähönen, Mika; Kasturiratne, Anuradhani; Komulainen, Pirjo; Kühnel, Brigitte; Laguzzi, Federica; Luan, Jian'an; Matoba, Nana; Nolte, Ilja M; Padmanabhan, Sandosh; Riaz, Muhammad; Rueedi, Rico; Robino, Antonietta; Said, M Abdullah; Scott, Robert A; Sofer, Tamar; Stančáková, Alena; Takeuchi, Fumihiko; Tayo, Bamidele O; van der Most, Peter J; Varga, Tibor V; Vitart, Veronique; Wang, Yajuan; Ware, Erin B; Warren, Helen R; Weiss, Stefan; Wen, Wanqing; Yanek, Lisa R; Zhang, Weihua; Zhao, Jing Hua; Afaq, Saima; Amin, Najaf; Amini, Marzyeh; Arking, Dan E; Aung, Tin; Boerwinkle, Eric; Borecki, Ingrid; Broeckel, Ulrich; Brown, Morris; Brumat, Marco; Burke, Gregory L; Canouil, Mickaël; Chakravarti, Aravinda; Charumathi, Sabanayagam; Ida Chen, Yii-Der; Connell, John M; Correa, Adolfo; de Las Fuentes, Lisa; de Mutsert, Renée; de Silva, H Janaka; Deng, Xuan; Ding, Jingzhong; Duan, Qing; Eaton, Charles B; Ehret, Georg; Eppinga, Ruben N; Evangelou, Evangelos; Faul, Jessica D; Felix, Stephan B; Forouhi, Nita G; Forrester, Terrence; Franco, Oscar H; Friedlander, Yechiel; Gandin, Ilaria; Gao, He; Ghanbari, Mohsen; Gigante, Bruna; Gu, C Charles; Gu, Dongfeng; Hagenaars, Saskia P; Hallmans, Göran; Harris, Tamara B; He, Jiang; Heikkinen, Sami; Heng, Chew-Kiat; Hirata, Makoto; Howard, Barbara V; Ikram, M Arfan; John, Ulrich; Katsuya, Tomohiro; Khor, Chiea Chuen; Kilpeläinen, Tuomas O; Koh, Woon-Puay; Krieger, José E; Kritchevsky, Stephen B; Kubo, Michiaki; Kuusisto, Johanna; Lakka, Timo A; Langefeld, Carl D; Langenberg, Claudia; Launer, Lenore J; Lehne, Benjamin; Lewis, Cora E; Li, Yize; Lin, Shiow; Liu, Jianjun; Liu, Jingmin; Loh, Marie; Louie, Tin; Mägi, Reedik; McKenzie, Colin A; Meitinger, Thomas; Metspalu, Andres; Milaneschi, Yuri; Milani, Lili; Mohlke, Karen L; Momozawa, Yukihide; Nalls, Mike A; Nelson, Christopher P; Sotoodehnia, Nona; Norris, Jill M; O'Connell, Jeff R; Palmer, Nicholette D; Perls, Thomas; Pedersen, Nancy L; Peters, Annette; Peyser, Patricia A; Poulter, Neil; Raffel, Leslie J; Raitakari, Olli T; Roll, Kathryn; Rose, Lynda M; Rosendaal, Frits R; Rotter, Jerome I; Schmidt, Carsten O; Schreiner, Pamela J; Schupf, Nicole; Scott, William R; Sever, Peter S; Shi, Yuan; Sidney, Stephen; Sims, Mario; Sitlani, Colleen M; Smith, Jennifer A; Snieder, Harold; Starr, John M; Strauch, Konstantin; Stringham, Heather M; Tan, Nicholas Y Q; Tang, Hua; Taylor, Kent D; Teo, Yik Ying; Tham, Yih Chung; Turner, Stephen T; Uitterlinden, André G; Vollenweider, Peter; Waldenberger, Melanie; Wang, Lihua; Wang, Ya Xing; Wei, Wen Bin; Williams, Christine; Yao, Jie; Yu, Caizheng; Yuan, Jian-Min; Zhao, Wei; Zonderman, Alan B; Becker, Diane M; Boehnke, Michael; Bowden, Donald W; Chambers, John C; Deary, Ian J; Esko, Tõnu; Farrall, Martin; Franks, Paul W; Freedman, Barry I; Froguel, Philippe; Gasparini, Paolo; Gieger, Christian; Jonas, Jost Bruno; Kamatani, Yoichiro; Kato, Norihiro; Kooner, Jaspal S; Kutalik, Zoltán; Laakso, Markku; Laurie, Cathy C; Leander, Karin; Lehtimäki, Terho; Study, Lifelines Cohort; Magnusson, Patrik K E; Oldehinkel, Albertine J; Penninx, Brenda W J H; Polasek, Ozren; Porteous, David J; Rauramaa, Rainer; Samani, Nilesh J; Scott, James; Shu, Xiao-Ou; van der Harst, Pim; Wagenknecht, Lynne E; Wareham, Nicholas J; Watkins, Hugh; Weir, David R; Wickremasinghe, Ananda R; Wu, Tangchun; Zheng, Wei; Bouchard, Claude; Christensen, Kaare; Evans, Michele K; Gudnason, Vilmundur; Horta, Bernardo L; Kardia, Sharon L R; Liu, Yongmei; Pereira, Alexandre C; Psaty, Bruce M; Ridker, Paul M; van Dam, Rob M; Gauderman, W James; Zhu, Xiaofeng; Mook-Kanamori, Dennis O; Fornage, Myriam; Rotimi, Charles N; Cupples, L Adrienne; Kelly, Tanika N; Fox, Ervin R; Hayward, Caroline; van Duijn, Cornelia M; Tai, E Shyong; Wong, Tien Yin; Kooperberg, Charles; Palmas, Walter; Rice, Kenneth; Morrison, Alanna C; Elliott, Paul; Caulfield, Mark J; Munroe, Patricia B; Rao, Dabeeru C; Province, Michael A; Levy, Daniel

    2018-01-01

    Heavy alcohol consumption is an established risk factor for hypertension; the mechanism by which alcohol consumption impact blood pressure (BP) regulation remains unknown. We hypothesized that a genome-wide association study accounting for gene-alcohol consumption interaction for BP might identify additional BP loci and contribute to the understanding of alcohol-related BP regulation. We conducted a large two-stage investigation incorporating joint testing of main genetic effects and single nucleotide variant (SNV)-alcohol consumption interactions. In Stage 1, genome-wide discovery meta-analyses in ≈131K individuals across several ancestry groups yielded 3,514 SNVs (245 loci) with suggestive evidence of association (P < 1.0 x 10-5). In Stage 2, these SNVs were tested for independent external replication in ≈440K individuals across multiple ancestries. We identified and replicated (at Bonferroni correction threshold) five novel BP loci (380 SNVs in 21 genes) and 49 previously reported BP loci (2,159 SNVs in 109 genes) in European ancestry, and in multi-ancestry meta-analyses (P < 5.0 x 10-8). For African ancestry samples, we detected 18 potentially novel BP loci (P < 5.0 x 10-8) in Stage 1 that warrant further replication. Additionally, correlated meta-analysis identified eight novel BP loci (11 genes). Several genes in these loci (e.g., PINX1, GATA4, BLK, FTO and GABBR2) have been previously reported to be associated with alcohol consumption. These findings provide insights into the role of alcohol consumption in the genetic architecture of hypertension.

  5. Normal uniform mixture differential gene expression detection for cDNA microarrays

    PubMed Central

    Dean, Nema; Raftery, Adrian E

    2005-01-01

    Background One of the primary tasks in analysing gene expression data is finding genes that are differentially expressed in different samples. Multiple testing issues due to the thousands of tests run make some of the more popular methods for doing this problematic. Results We propose a simple method, Normal Uniform Differential Gene Expression (NUDGE) detection for finding differentially expressed genes in cDNA microarrays. The method uses a simple univariate normal-uniform mixture model, in combination with new normalization methods for spread as well as mean that extend the lowess normalization of Dudoit, Yang, Callow and Speed (2002) [1]. It takes account of multiple testing, and gives probabilities of differential expression as part of its output. It can be applied to either single-slide or replicated experiments, and it is very fast. Three datasets are analyzed using NUDGE, and the results are compared to those given by other popular methods: unadjusted and Bonferroni-adjusted t tests, Significance Analysis of Microarrays (SAM), and Empirical Bayes for microarrays (EBarrays) with both Gamma-Gamma and Lognormal-Normal models. Conclusion The method gives a high probability of differential expression to genes known/suspected a priori to be differentially expressed and a low probability to the others. In terms of known false positives and false negatives, the method outperforms all multiple-replicate methods except for the Gamma-Gamma EBarrays method to which it offers comparable results with the added advantages of greater simplicity, speed, fewer assumptions and applicability to the single replicate case. An R package called nudge to implement the methods in this paper will be made available soon at . PMID:16011807

  6. Association of tumour necrosis factor-alpha G/A -238 and G/A -308 single nucleotide polymorphisms with juvenile idiopathic arthritis.

    PubMed

    Maddah, M; Harsini, S; Ziaee, V; Moradinejad, M H; Rezaei, A; Zoghi, S; Sadr, M; Aghighi, Y; Rezaei, N

    2016-12-01

    Juvenile idiopathic arthritis (JIA) is a heterogeneous autoimmune disorder of unknown origin. As proinflammatory cytokines are known to contribute towards the pathogenesis of JIA, this case-control study was performed to examine the associations of certain single nucleotide polymorphisms (SNPs) of tumour necrosis factor-α (TNF-α) gene. Fifty-three patients with JIA participated in this study as patients group and compared with 137 healthy unrelated controls. Genotyping was performed for TNF-α gene at positions -308 and -238, using polymerase chain reaction with sequence-specific primers method. Results of the analysed data revealed a significant positive association for TNF-α gene at positions -308 and -238 for A allele in patients group compared with controls (P < 0.01). At the genotypic level, the frequency of TNF-α gene at positions -308 and -238 for GG genotype was discovered to be higher in the patients with JIA compared to the healthy controls (P < 0.01), while GA genotype at the same positions was observed to be less frequent in the case group than the controls (P < 0.01). At the haplotypic level, a significant positive association for TNF-α GG haplotype (positions -308, -238) together with a notable negative association for TNF-α AG and GA haplotypes at the same positions were detected in the patients group in comparison with the healthy individuals (P < 0.01). Cytokine gene polymorphisms might affect the development of JIA. Particular TNF-α gene variants could render individuals more susceptible to JIA.. © 2016 John Wiley & Sons Ltd.

  7. Genetic dissection and validation of candidate genes for flag leaf size in rice (Oryza sativa L.).

    PubMed

    Tang, Xinxin; Gong, Rong; Sun, Wenqiang; Zhang, Chaopu; Yu, Sibin

    2018-04-01

    Two major loci with functional candidate genes were identified and validated affecting flag leaf size, which offer desirable genes to improve leaf architecture and photosynthetic capacity in rice. Leaf size is a major determinant of plant architecture and yield potential in crops. However, the genetic and molecular mechanisms regulating leaf size remain largely elusive. In this study, quantitative trait loci (QTLs) for flag leaf length and flag leaf width in rice were detected with high-density single nucleotide polymorphism genotyping of a chromosomal segment substitution line (CSSL) population, in which each line carries one or a few chromosomal segments from the japonica cultivar Nipponbare in a common background of the indica variety Zhenshan 97. In total, 14 QTLs for flag leaf length and nine QTLs for flag leaf width were identified in the CSSL population. Among them, qFW4-2 for flag leaf width was mapped to a 37-kb interval, with the most likely candidate gene being the previously characterized NAL1. Another major QTL for both flag leaf width and length was delimited by substitution mapping to a small region of 13.5 kb that contains a single gene, Ghd7.1. Mutants of Ghd7.1 generated using CRISPR/CAS9 approach showed reduced leaf size. Allelic variation analyses also validated Ghd7.1 as a functional candidate gene for leaf size, photosynthetic capacity and other yield-related traits. These results provide useful genetic information for the improvement of leaf size and yield in rice breeding programs.

  8. Landscape genomic analysis of candidate genes for climate adaptation in a California endemic oak, Quercus lobata.

    PubMed

    Sork, Victoria L; Squire, Kevin; Gugger, Paul F; Steele, Stephanie E; Levy, Eric D; Eckert, Andrew J

    2016-01-01

    The ability of California tree populations to survive anthropogenic climate change will be shaped by the geographic structure of adaptive genetic variation. Our goal is to test whether climate-associated candidate genes show evidence of spatially divergent selection in natural populations of valley oak, Quercus lobata, as preliminary indication of local adaptation. Using DNA from 45 individuals from 13 localities across the species' range, we sequenced portions of 40 candidate genes related to budburst/flowering, growth, osmotic stress, and temperature stress. Using 195 single nucleotide polymorphisms (SNPs), we estimated genetic differentiation across populations and correlated allele frequencies with climate gradients using single-locus and multivariate models. The top 5% of FST estimates ranged from 0.25 to 0.68, yielding loci potentially under spatially divergent selection. Environmental analyses of SNP frequencies with climate gradients revealed three significantly correlated SNPs within budburst/flowering genes and two SNPs within temperature stress genes with mean annual precipitation, after controlling for multiple testing. A redundancy model showed a significant association between SNPs and climate variables and revealed a similar set of SNPs with high loadings on the first axis. In the RDA, climate accounted for 67% of the explained variation, when holding climate constant, in contrast to a putatively neutral SSR data set where climate accounted for only 33%. Population differentiation and geographic gradients of allele frequencies in climate-associated functional genes in Q. lobata provide initial evidence of adaptive genetic variation and background for predicting population response to climate change. © 2016 Botanical Society of America.

  9. Brain Transcriptomic Response to Social Eavesdropping in Zebrafish (Danio rerio)

    PubMed Central

    Oliveira, Rui F.

    2015-01-01

    Public information is widely available at low cost to animals living in social groups. For instance, bystanders may eavesdrop on signaling interactions between conspecifics and use it to adapt their subsequent behavior towards the observed individuals. This social eavesdropping ability is expected to require specialized mechanisms such as social attention, which selects social information available for learning. To begin exploring the genetic basis of social eavesdropping, we used a previously established attention paradigm in the lab to study the brain gene expression profile of male zebrafish (Danio rerio) in relation to the attention they paid towards conspecifics involved or not involved in agonistic interactions. Microarray gene chips were used to characterize their brain transcriptomes based on differential expression of single genes and gene sets. These analyses were complemented by promoter region-based techniques. Using data from both approaches, we further drafted protein interaction networks. Our results suggest that attentiveness towards conspecifics, whether interacting or not, activates pathways linked to neuronal plasticity and memory formation. The network analyses suggested that fos and jun are key players on this response, and that npas4a, nr4a1 and egr4 may also play an important role. Furthermore, specifically observing fighting interactions further triggered pathways associated to a change in the alertness status (dnajb5) and to other genes related to memory formation (btg2, npas4b), which suggests that the acquisition of eavesdropped information about social relationships activates specific processes on top of those already activated just by observing conspecifics. PMID:26713440

  10. Large Deletions of TSPAN12 Cause Familial Exudative Vitreoretinopathy (FEVR).

    PubMed

    Seo, Soo Hyun; Kim, Man Jin; Park, Sung Wook; Kim, Jeong Hun; Yu, Young Suk; Song, Ji Yun; Cho, Sung Im; Ahn, Joo Hyun; Oh, Yeon Hee; Lee, Jee-Soo; Lee, Seungjun; Seong, Moon-Woo; Park, Sung Sup; Kim, Ji Yeon

    2016-12-01

    Familial exudative vitreoretinopathy (FEVR) is a rare, hereditary visual disorder. The gene TSPAN12 is associated with autosomal dominant inheritance of FEVR. The prevalence and impact of large deletions/duplications of TSPAN12 on FEVR patients is unknown. To glean better insight of TSPAN12 on FEVR pathology, herein, we describe three FEVR patients with TSPAN12 deletions. Thirty-three Korean FEVR patients, who previously screened negative for TSPAN12 mutations, mutations in other FEVR-associated genes such as NDP, FZD4, LRP5, and large deletions and duplications of NDP, FZD4, and LRP5, were selected for TSPAN12 large deletion and duplication analyses. Semiquantitative multiplex PCR for TSPAN12 gene dosage analyses were performed, followed by droplet digital PCR (ddPCR) for validation. Among the 33 patients, three patients were confirmed to carry large TSPAN12 deletions. Two of them had whole-gene deletions of TSPAN12, and the other patient possessed a deletion of TSPAN12 in exon 4. FEVR severity detected in these patients was not more severe than in a patient with TSPAN12 point mutation. Regarding previously reported proportions of FEVR-associated genes contributing to the disorder's autosomal dominant inheritance pattern in Korea, we determined that patients with TSPAN12 large deletions were more common than patients with single nucleotide variants in TSPAN12. Evaluating TSPAN12 large deletions and duplications should be considered in FEVR screening and diagnosis as well as in routine genetic workups for FEVR patients.

  11. Genome sequence of M6, a diploid inbred clone of the high-glycoalkaloid-producing tuber-bearing potato species Solanum chacoense, reveals residual heterozygosity.

    PubMed

    Leisner, Courtney P; Hamilton, John P; Crisovan, Emily; Manrique-Carpintero, Norma C; Marand, Alexandre P; Newton, Linsey; Pham, Gina M; Jiang, Jiming; Douches, David S; Jansky, Shelley H; Buell, C Robin

    2018-05-01

    Cultivated potato (Solanum tuberosum L.) is a highly heterozygous autotetraploid that presents challenges in genome analyses and breeding. Wild potato species serve as a resource for the introgression of important agronomic traits into cultivated potato. One key species is Solanum chacoense and the diploid, inbred clone M6, which is self-compatible and has desirable tuber market quality and disease resistance traits. Sequencing and assembly of the genome of the M6 clone of S. chacoense generated an assembly of 825 767 562 bp in 8260 scaffolds with an N50 scaffold size of 713 602 bp. Pseudomolecule construction anchored 508 Mb of the genome assembly into 12 chromosomes. Genome annotation yielded 49 124 high-confidence gene models representing 37 740 genes. Comparative analyses of the M6 genome with six other Solanaceae species revealed a core set of 158 367 Solanaceae genes and 1897 genes unique to three potato species. Analysis of single nucleotide polymorphisms across the M6 genome revealed enhanced residual heterozygosity on chromosomes 4, 8 and 9 relative to the other chromosomes. Access to the M6 genome provides a resource for identification of key genes for important agronomic traits and aids in genome-enabled development of inbred diploid potatoes with the potential to accelerate potato breeding. © 2018 The Authors The Plant Journal © 2018 John Wiley & Sons Ltd.

  12. Characterization of the definitive classical calpain family of vertebrates using phylogenetic, evolutionary and expression analyses.

    PubMed

    Macqueen, Daniel J; Wilcox, Alexander H

    2014-04-09

    The calpains are a superfamily of proteases with extensive relevance to human health and welfare. Vast research attention is given to the vertebrate 'classical' subfamily, making it surprising that the evolutionary origins, distribution and relationships of these genes is poorly characterized. Consequently, there exists uncertainty about the conservation of gene family structure, function and expression that has been principally defined from work with mammals. Here, more than 200 vertebrate classical calpains were incorporated in phylogenetic analyses spanning an unprecedented range of taxa, including jawless and cartilaginous fish. We demonstrate that the common vertebrate ancestor had at least six classical calpains, including a single gene that gave rise to CAPN11, 1, 2 and 8 in the early jawed fish lineage, plus CAPN3, 9, 12, 13 and a novel calpain gene, hereafter named CAPN17. We reveal that while all vertebrate classical calpains have been subject to persistent purifying selection during evolution, the degree and nature of selective pressure has often been lineage-dependent. The tissue expression of the complete classic calpain family was assessed in representative teleost fish, amphibians, reptiles and mammals. This highlighted systematic divergence in expression across vertebrate taxa, with most classic calpain genes from fish and amphibians having more extensive tissue distribution than in amniotes. Our data suggest that classical calpain functions have frequently diverged during vertebrate evolution and challenge the ongoing value of the established system of classifying calpains by expression.

  13. Characterization of the definitive classical calpain family of vertebrates using phylogenetic, evolutionary and expression analyses

    PubMed Central

    Macqueen, Daniel J.; Wilcox, Alexander H.

    2014-01-01

    The calpains are a superfamily of proteases with extensive relevance to human health and welfare. Vast research attention is given to the vertebrate ‘classical’ subfamily, making it surprising that the evolutionary origins, distribution and relationships of these genes is poorly characterized. Consequently, there exists uncertainty about the conservation of gene family structure, function and expression that has been principally defined from work with mammals. Here, more than 200 vertebrate classical calpains were incorporated in phylogenetic analyses spanning an unprecedented range of taxa, including jawless and cartilaginous fish. We demonstrate that the common vertebrate ancestor had at least six classical calpains, including a single gene that gave rise to CAPN11, 1, 2 and 8 in the early jawed fish lineage, plus CAPN3, 9, 12, 13 and a novel calpain gene, hereafter named CAPN17. We reveal that while all vertebrate classical calpains have been subject to persistent purifying selection during evolution, the degree and nature of selective pressure has often been lineage-dependent. The tissue expression of the complete classic calpain family was assessed in representative teleost fish, amphibians, reptiles and mammals. This highlighted systematic divergence in expression across vertebrate taxa, with most classic calpain genes from fish and amphibians having more extensive tissue distribution than in amniotes. Our data suggest that classical calpain functions have frequently diverged during vertebrate evolution and challenge the ongoing value of the established system of classifying calpains by expression. PMID:24718597

  14. SCOUP: a probabilistic model based on the Ornstein-Uhlenbeck process to analyze single-cell expression data during differentiation.

    PubMed

    Matsumoto, Hirotaka; Kiryu, Hisanori

    2016-06-08

    Single-cell technologies make it possible to quantify the comprehensive states of individual cells, and have the power to shed light on cellular differentiation in particular. Although several methods have been developed to fully analyze the single-cell expression data, there is still room for improvement in the analysis of differentiation. In this paper, we propose a novel method SCOUP to elucidate differentiation process. Unlike previous dimension reduction-based approaches, SCOUP describes the dynamics of gene expression throughout differentiation directly, including the degree of differentiation of a cell (in pseudo-time) and cell fate. SCOUP is superior to previous methods with respect to pseudo-time estimation, especially for single-cell RNA-seq. SCOUP also successfully estimates cell lineage more accurately than previous method, especially for cells at an early stage of bifurcation. In addition, SCOUP can be applied to various downstream analyses. As an example, we propose a novel correlation calculation method for elucidating regulatory relationships among genes. We apply this method to a single-cell RNA-seq data and detect a candidate of key regulator for differentiation and clusters in a correlation network which are not detected with conventional correlation analysis. We develop a stochastic process-based method SCOUP to analyze single-cell expression data throughout differentiation. SCOUP can estimate pseudo-time and cell lineage more accurately than previous methods. We also propose a novel correlation calculation method based on SCOUP. SCOUP is a promising approach for further single-cell analysis and available at https://github.com/hmatsu1226/SCOUP.

  15. Single cells within the Puerto Rico trench suggest hadal adaptation of microbial lineages.

    PubMed

    León-Zayas, Rosa; Novotny, Mark; Podell, Sheila; Shepard, Charles M; Berkenpas, Eric; Nikolenko, Sergey; Pevzner, Pavel; Lasken, Roger S; Bartlett, Douglas H

    2015-12-01

    Hadal ecosystems are found at a depth of 6,000 m below sea level and below, occupying less than 1% of the total area of the ocean. The microbial communities and metabolic potential in these ecosystems are largely uncharacterized. Here, we present four single amplified genomes (SAGs) obtained from 8,219 m below the sea surface within the hadal ecosystem of the Puerto Rico Trench (PRT). These SAGs are derived from members of deep-sea clades, including the Thaumarchaeota and SAR11 clade, and two are related to previously isolated piezophilic (high-pressure-adapted) microorganisms. In order to identify genes that might play a role in adaptation to deep-sea environments, comparative analyses were performed with genomes from closely related shallow-water microbes. The archaeal SAG possesses genes associated with mixotrophy, including lipoylation and the glycine cleavage pathway. The SAR11 SAG encodes glycolytic enzymes previously reported to be missing from this abundant and cosmopolitan group. The other SAGs, which are related to piezophilic isolates, possess genes that may supplement energy demands through the oxidation of hydrogen or the reduction of nitrous oxide. We found evidence for potential trench-specific gene distributions, as several SAG genes were observed only in a PRT metagenome and not in shallower deep-sea metagenomes. These results illustrate new ecotype features that might perform important roles in the adaptation of microorganisms to life in hadal environments. Copyright © 2015, American Society for Microbiology. All Rights Reserved.

  16. Genetic Variants in SDC3 Gene are Significantly Associated with Growth Traits in Two Chinese Beef Cattle Breeds.

    PubMed

    Huang, Yong-Zhen; Wang, Qin; Zhang, Chun-Lei; Fang, Xing-Tang; Song, En-Liang; Chen, Hong

    2016-01-01

    Identification of the genes and polymorphisms underlying quantitative traits, and understanding these genes and polymorphisms affect economic growth traits, are important for successful marker-assisted selection and more efficient management strategies in commercial cattle (Bos taurus) population. Syndecan-3 (SDC3), a member of the syndecan family of type I transmembrane heparan sulfate proteoglycans is a novel regulator of feeding behavior and body weight. The aim of this study is to examine the association of the SDC3 polymorphism with growth traits in Chinese Jiaxian and Qinchuan cattle breeds (). Four single nucleotide polymorphisms (SNPs: 1-4) were detected in 555 cows from three Chinese native cattle breeds by means of sequencing pooled DNA samples and polymerase chain reaction-single stranded conformational polymorphism (PCR-SSCP) methods. We found one SNP (g.28362A > G) in intron and three SNPs (g.30742T > G, g.30821C > T and 33418 A > G) in exons. The statistical analyses indicated that these SNPs of SDC3 gene were associated with bovine body height, body length, chest circumference, and circumference of cannon bone (P < 0.05). The mutant-type variant was superior for growth traits; the heterozygote was associated with higher growth traits compared to wild-type homozygote. Our result confirms the polymorphisms in the SDC3 gene are associated with growth traits that may be used for marker-assisted selection in beef cattle breeding programs.

  17. SOS2 and ACP1 Loci Identified through Large-Scale Exome Chip Analysis Regulate Kidney Development and Function.

    PubMed

    Li, Man; Li, Yong; Weeks, Olivia; Mijatovic, Vladan; Teumer, Alexander; Huffman, Jennifer E; Tromp, Gerard; Fuchsberger, Christian; Gorski, Mathias; Lyytikäinen, Leo-Pekka; Nutile, Teresa; Sedaghat, Sanaz; Sorice, Rossella; Tin, Adrienne; Yang, Qiong; Ahluwalia, Tarunveer S; Arking, Dan E; Bihlmeyer, Nathan A; Böger, Carsten A; Carroll, Robert J; Chasman, Daniel I; Cornelis, Marilyn C; Dehghan, Abbas; Faul, Jessica D; Feitosa, Mary F; Gambaro, Giovanni; Gasparini, Paolo; Giulianini, Franco; Heid, Iris; Huang, Jinyan; Imboden, Medea; Jackson, Anne U; Jeff, Janina; Jhun, Min A; Katz, Ronit; Kifley, Annette; Kilpeläinen, Tuomas O; Kumar, Ashish; Laakso, Markku; Li-Gao, Ruifang; Lohman, Kurt; Lu, Yingchang; Mägi, Reedik; Malerba, Giovanni; Mihailov, Evelin; Mohlke, Karen L; Mook-Kanamori, Dennis O; Robino, Antonietta; Ruderfer, Douglas; Salvi, Erika; Schick, Ursula M; Schulz, Christina-Alexandra; Smith, Albert V; Smith, Jennifer A; Traglia, Michela; Yerges-Armstrong, Laura M; Zhao, Wei; Goodarzi, Mark O; Kraja, Aldi T; Liu, Chunyu; Wessel, Jennifer; Boerwinkle, Eric; Borecki, Ingrid B; Bork-Jensen, Jette; Bottinger, Erwin P; Braga, Daniele; Brandslund, Ivan; Brody, Jennifer A; Campbell, Archie; Carey, David J; Christensen, Cramer; Coresh, Josef; Crook, Errol; Curhan, Gary C; Cusi, Daniele; de Boer, Ian H; de Vries, Aiko P J; Denny, Joshua C; Devuyst, Olivier; Dreisbach, Albert W; Endlich, Karlhans; Esko, Tõnu; Franco, Oscar H; Fulop, Tibor; Gerhard, Glenn S; Glümer, Charlotte; Gottesman, Omri; Grarup, Niels; Gudnason, Vilmundur; Hansen, Torben; Harris, Tamara B; Hayward, Caroline; Hocking, Lynne; Hofman, Albert; Hu, Frank B; Husemoen, Lise Lotte N; Jackson, Rebecca D; Jørgensen, Torben; Jørgensen, Marit E; Kähönen, Mika; Kardia, Sharon L R; König, Wolfgang; Kooperberg, Charles; Kriebel, Jennifer; Launer, Lenore J; Lauritzen, Torsten; Lehtimäki, Terho; Levy, Daniel; Linksted, Pamela; Linneberg, Allan; Liu, Yongmei; Loos, Ruth J F; Lupo, Antonio; Meisinger, Christine; Melander, Olle; Metspalu, Andres; Mitchell, Paul; Nauck, Matthias; Nürnberg, Peter; Orho-Melander, Marju; Parsa, Afshin; Pedersen, Oluf; Peters, Annette; Peters, Ulrike; Polasek, Ozren; Porteous, David; Probst-Hensch, Nicole M; Psaty, Bruce M; Qi, Lu; Raitakari, Olli T; Reiner, Alex P; Rettig, Rainer; Ridker, Paul M; Rivadeneira, Fernando; Rossouw, Jacques E; Schmidt, Frank; Siscovick, David; Soranzo, Nicole; Strauch, Konstantin; Toniolo, Daniela; Turner, Stephen T; Uitterlinden, André G; Ulivi, Sheila; Velayutham, Dinesh; Völker, Uwe; Völzke, Henry; Waldenberger, Melanie; Wang, Jie Jin; Weir, David R; Witte, Daniel; Kuivaniemi, Helena; Fox, Caroline S; Franceschini, Nora; Goessling, Wolfram; Köttgen, Anna; Chu, Audrey Y

    2017-03-01

    Genome-wide association studies have identified >50 common variants associated with kidney function, but these variants do not fully explain the variation in eGFR. We performed a two-stage meta-analysis of associations between genotypes from the Illumina exome array and eGFR on the basis of serum creatinine (eGFRcrea) among participants of European ancestry from the CKDGen Consortium ( n Stage1 : 111,666; n Stage2 : 48,343). In single-variant analyses, we identified single nucleotide polymorphisms at seven new loci associated with eGFRcrea ( PPM1J , EDEM3, ACP1, SPEG, EYA4, CYP1A1 , and ATXN2L ; P Stage1 <3.7×10 -7 ), of which most were common and annotated as nonsynonymous variants. Gene-based analysis identified associations of functional rare variants in three genes with eGFRcrea, including a novel association with the SOS Ras/Rho guanine nucleotide exchange factor 2 gene, SOS2 ( P =5.4×10 -8 by sequence kernel association test). Experimental follow-up in zebrafish embryos revealed changes in glomerular gene expression and renal tubule morphology in the embryonic kidney of acp1- and sos2 -knockdowns. These developmental abnormalities associated with altered blood clearance rate and heightened prevalence of edema. This study expands the number of loci associated with kidney function and identifies novel genes with potential roles in kidney formation. Copyright © 2017 by the American Society of Nephrology.

  18. Bipartite Community Structure of eQTLs.

    PubMed

    Platig, John; Castaldi, Peter J; DeMeo, Dawn; Quackenbush, John

    2016-09-01

    Genome Wide Association Studies (GWAS) and expression quantitative trait locus (eQTL) analyses have identified genetic associations with a wide range of human phenotypes. However, many of these variants have weak effects and understanding their combined effect remains a challenge. One hypothesis is that multiple SNPs interact in complex networks to influence functional processes that ultimately lead to complex phenotypes, including disease states. Here we present CONDOR, a method that represents both cis- and trans-acting SNPs and the genes with which they are associated as a bipartite graph and then uses the modular structure of that graph to place SNPs into a functional context. In applying CONDOR to eQTLs in chronic obstructive pulmonary disease (COPD), we found the global network "hub" SNPs were devoid of disease associations through GWAS. However, the network was organized into 52 communities of SNPs and genes, many of which were enriched for genes in specific functional classes. We identified local hubs within each community ("core SNPs") and these were enriched for GWAS SNPs for COPD and many other diseases. These results speak to our intuition: rather than single SNPs influencing single genes, we see groups of SNPs associated with the expression of families of functionally related genes and that disease SNPs are associated with the perturbation of those functions. These methods are not limited in their application to COPD and can be used in the analysis of a wide variety of disease processes and other phenotypic traits.

  19. A linear concatenation strategy to construct 5'-enriched amplified cDNA libraries using multiple displacement amplification.

    PubMed

    Gadkar, Vijay J; Filion, Martin

    2013-06-01

    In various experimental systems, limiting available amounts of RNA may prevent a researcher from performing large-scale analyses of gene transcripts. One way to circumvent this is to 'pre-amplify' the starting RNA/cDNA, so that sufficient amounts are available for any downstream analysis. In the present study, we report the development of a novel protocol for constructing amplified cDNA libraries using the Phi29 DNA polymerase based multiple displacement amplification (MDA) system. Using as little as 200 ng of total RNA, we developed a linear concatenation strategy to make the single-stranded cDNA template amenable for MDA. The concatenation, made possible by the template switching property of the reverse transcriptase enzyme, resulted in the amplified cDNA library with intact 5' ends. MDA generated micrograms of template, allowing large-scale polymerase chain reaction analyses or other large-scale downstream applications. As the amplified cDNA library contains intact 5' ends, it is also compatible with 5' RACE analyses of specific gene transcripts. Empirical validation of this protocol is demonstrated on a highly characterized (tomato) and an uncharacterized (corn gromwell) experimental system.

  20. Sybil--efficient constraint-based modelling in R.

    PubMed

    Gelius-Dietrich, Gabriel; Desouki, Abdelmoneim Amer; Fritzemeier, Claus Jonathan; Lercher, Martin J

    2013-11-13

    Constraint-based analyses of metabolic networks are widely used to simulate the properties of genome-scale metabolic networks. Publicly available implementations tend to be slow, impeding large scale analyses such as the genome-wide computation of pairwise gene knock-outs, or the automated search for model improvements. Furthermore, available implementations cannot easily be extended or adapted by users. Here, we present sybil, an open source software library for constraint-based analyses in R; R is a free, platform-independent environment for statistical computing and graphics that is widely used in bioinformatics. Among other functions, sybil currently provides efficient methods for flux-balance analysis (FBA), MOMA, and ROOM that are about ten times faster than previous implementations when calculating the effect of whole-genome single gene deletions in silico on a complete E. coli metabolic model. Due to the object-oriented architecture of sybil, users can easily build analysis pipelines in R or even implement their own constraint-based algorithms. Based on its highly efficient communication with different mathematical optimisation programs, sybil facilitates the exploration of high-dimensional optimisation problems on small time scales. Sybil and all its dependencies are open source. Sybil and its documentation are available for download from the comprehensive R archive network (CRAN).

Top