Uchiyama, Ikuo
2008-10-31
Identifying the set of intrinsically conserved genes, or the genomic core, among related genomes is crucial for understanding prokaryotic genomes where horizontal gene transfers are common. Although core genome identification appears to be obvious among very closely related genomes, it becomes more difficult when more distantly related genomes are compared. Here, we consider the core structure as a set of sufficiently long segments in which gene orders are conserved so that they are likely to have been inherited mainly through vertical transfer, and developed a method for identifying the core structure by finding the order of pre-identified orthologous groups (OGs) that maximally retains the conserved gene orders. The method was applied to genome comparisons of two well-characterized families, Bacillaceae and Enterobacteriaceae, and identified their core structures comprising 1438 and 2125 OGs, respectively. The core sets contained most of the essential genes and their related genes, which were primarily included in the intersection of the two core sets comprising around 700 OGs. The definition of the genomic core based on gene order conservation was demonstrated to be more robust than the simpler approach based only on gene conservation. We also investigated the core structures in terms of G+C content homogeneity and phylogenetic congruence, and found that the core genes primarily exhibited the expected characteristic, i.e., being indigenous and sharing the same history, more than the non-core genes. The results demonstrate that our strategy of genome alignment based on gene order conservation can provide an effective approach to identify the genomic core among moderately related microbial genomes.
Ping, Yanyan; Deng, Yulan; Wang, Li; Zhang, Hongyi; Zhang, Yong; Xu, Chaohan; Zhao, Hongying; Fan, Huihui; Yu, Fulong; Xiao, Yun; Li, Xia
2015-01-01
The driver genetic aberrations collectively regulate core cellular processes underlying cancer development. However, identifying the modules of driver genetic alterations and characterizing their functional mechanisms are still major challenges for cancer studies. Here, we developed an integrative multi-omics method CMDD to identify the driver modules and their affecting dysregulated genes through characterizing genetic alteration-induced dysregulated networks. Applied to glioblastoma (GBM), the CMDD identified a core gene module of 17 genes, including seven known GBM drivers, and their dysregulated genes. The module showed significant association with shorter survival of GBM. When classifying driver genes in the module into two gene sets according to their genetic alteration patterns, we found that one gene set directly participated in the glioma pathway, while the other indirectly regulated the glioma pathway, mostly, via their dysregulated genes. Both of the two gene sets were significant contributors to survival and helpful for classifying GBM subtypes, suggesting their critical roles in GBM pathogenesis. Also, by applying the CMDD to other six cancers, we identified some novel core modules associated with overall survival of patients. Together, these results demonstrate integrative multi-omics data can identify driver modules and uncover their dysregulated genes, which is useful for interpreting cancer genome. PMID:25653168
Freytag, Virginie; Probst, Sabine; Hadziselimovic, Nils; Boglari, Csaba; Hauser, Yannick; Peter, Fabian; Gabor Fenyves, Bank; Milnik, Annette; Demougin, Philippe; Vukojevic, Vanja; de Quervain, Dominique J-F; Papassotiropoulos, Andreas; Stetak, Attila
2017-07-12
The identification of genes related to encoding, storage, and retrieval of memories is a major interest in neuroscience. In the current study, we analyzed the temporal gene expression changes in a neuronal mRNA pool during an olfactory long-term associative memory (LTAM) in Caenorhabditis elegans hermaphrodites. Here, we identified a core set of 712 (538 upregulated and 174 downregulated) genes that follows three distinct temporal peaks demonstrating multiple gene regulation waves in LTAM. Compared with the previously published positive LTAM gene set (Lakhina et al., 2015), 50% of the identified upregulated genes here overlap with the previous dataset, possibly representing stimulus-independent memory-related genes. On the other hand, the remaining genes were not previously identified in positive associative memory and may specifically regulate aversive LTAM. Our results suggest a multistep gene activation process during the formation and retrieval of long-term memory and define general memory-implicated genes as well as conditioning-type-dependent gene sets. SIGNIFICANCE STATEMENT The identification of genes regulating different steps of memory is of major interest in neuroscience. Identification of common memory genes across different learning paradigms and the temporal activation of the genes are poorly studied. Here, we investigated the temporal aspects of Caenorhabditis elegans gene expression changes using aversive olfactory associative long-term memory (LTAM) and identified three major gene activation waves. Like in previous studies, aversive LTAM is also CREB dependent, and CREB activity is necessary immediately after training. Finally, we define a list of memory paradigm-independent core gene sets as well as conditioning-dependent genes. Copyright © 2017 the authors 0270-6474/17/376661-12$15.00/0.
Hester, Susan D; Nesnow, Stephen
2008-03-15
Conazoles are azole-containing fungicides that are used in agriculture and medicine. Conazoles can induce follicular cell adenomas of the thyroid in rats after chronic bioassay. The goal of this study was to identify pathways and networks of genes that were associated with thyroid tumorigenesis through transcriptional analyses. To this end, we compared transcriptional profiles from tissues of rats treated with a tumorigenic and a non-tumorigenic conazole. Triadimefon, a rat thyroid tumorigen, and myclobutanil, which was not tumorigenic in rats after a 2-year bioassay, were administered in the feed to male Wistar/Han rats for 30 or 90 days similar to the treatment conditions previously used in their chronic bioassays. Thyroid gene expression was determined using high density Affymetrix GeneChips (Rat 230_2). Gene expression was analyzed by the Gene Set Expression Analyses method which clearly separated the tumorigenic treatments (tumorigenic response group (TRG)) from the non-tumorigenic treatments (non-tumorigenic response group (NRG)). Core genes from these gene sets were mapped to canonical, metabolic, and GeneGo processes and these processes compared across group and treatment time. Extensive analyses were performed on the 30-day gene sets as they represented the major perturbations. Gene sets in the 30-day TRG group had over representation of fatty acid metabolism, oxidation, and degradation processes (including PPARgamma and CYP involvement), and of cell proliferation responses. Core genes from these gene sets were combined into networks and found to possess signaling interactions. In addition, the core genes in each gene set were compared with genes known to be associated with human thyroid cancer. Among the genes that appeared in both rat and human data sets were: Acaca, Asns, Cebpg, Crem, Ddit3, Gja1, Grn, Jun, Junb, and Vegf. These genes were major contributors in the previously developed network from triadimefon-treated rat thyroids. It is postulated that triadimefon induces oxidative response genes and activates the nuclear receptor, Ppargamma, initiating transcription of gene products and signaling to a series of genes involved in cell proliferation.
Comparative Bacterial Proteomics: Analysis of the Core Genome Concept
Callister, Stephen J.; McCue, Lee Ann; Turse, Joshua E.; Monroe, Matthew E.; Auberry, Kenneth J.; Smith, Richard D.; Adkins, Joshua N.; Lipton, Mary S.
2008-01-01
While comparative bacterial genomic studies commonly predict a set of genes indicative of common ancestry, experimental validation of the existence of this core genome requires extensive measurement and is typically not undertaken. Enabled by an extensive proteome database developed over six years, we have experimentally verified the expression of proteins predicted from genomic ortholog comparisons among 17 environmental and pathogenic bacteria. More exclusive relationships were observed among the expressed protein content of phenotypically related bacteria, which is indicative of the specific lifestyles associated with these organisms. Although genomic studies can establish relative orthologous relationships among a set of bacteria and propose a set of ancestral genes, our proteomics study establishes expressed lifestyle differences among conserved genes and proposes a set of expressed ancestral traits. PMID:18253490
2010-01-01
Background The genus Neisseria contains two important yet very different pathogens, N. meningitidis and N. gonorrhoeae, in addition to non-pathogenic species, of which N. lactamica is the best characterized. Genomic comparisons of these three bacteria will provide insights into the mechanisms and evolution of pathogenesis in this group of organisms, which are applicable to understanding these processes more generally. Results Non-pathogenic N. lactamica exhibits very similar population structure and levels of diversity to the meningococcus, whilst gonococci are essentially recent descendents of a single clone. All three species share a common core gene set estimated to comprise around 1190 CDSs, corresponding to about 60% of the genome. However, some of the nucleotide sequence diversity within this core genome is particular to each group, indicating that cross-species recombination is rare in this shared core gene set. Other than the meningococcal cps region, which encodes the polysaccharide capsule, relatively few members of the large accessory gene pool are exclusive to one species group, and cross-species recombination within this accessory genome is frequent. Conclusion The three Neisseria species groups represent coherent biological and genetic groupings which appear to be maintained by low rates of inter-species horizontal genetic exchange within the core genome. There is extensive evidence for exchange among positively selected genes and the accessory genome and some evidence of hitch-hiking of housekeeping genes with other loci. It is not possible to define a 'pathogenome' for this group of organisms and the disease causing phenotypes are therefore likely to be complex, polygenic, and different among the various disease-associated phenotypes observed. PMID:21092259
Yang, Laurence; Tan, Justin; O'Brien, Edward J; Monk, Jonathan M; Kim, Donghyuk; Li, Howard J; Charusanti, Pep; Ebrahim, Ali; Lloyd, Colton J; Yurkovich, James T; Du, Bin; Dräger, Andreas; Thomas, Alex; Sun, Yuekai; Saunders, Michael A; Palsson, Bernhard O
2015-08-25
Finding the minimal set of gene functions needed to sustain life is of both fundamental and practical importance. Minimal gene lists have been proposed by using comparative genomics-based core proteome definitions. A definition of a core proteome that is supported by empirical data, is understood at the systems-level, and provides a basis for computing essential cell functions is lacking. Here, we use a systems biology-based genome-scale model of metabolism and expression to define a functional core proteome consisting of 356 gene products, accounting for 44% of the Escherichia coli proteome by mass based on proteomics data. This systems biology core proteome includes 212 genes not found in previous comparative genomics-based core proteome definitions, accounts for 65% of known essential genes in E. coli, and has 78% gene function overlap with minimal genomes (Buchnera aphidicola and Mycoplasma genitalium). Based on transcriptomics data across environmental and genetic backgrounds, the systems biology core proteome is significantly enriched in nondifferentially expressed genes and depleted in differentially expressed genes. Compared with the noncore, core gene expression levels are also similar across genetic backgrounds (two times higher Spearman rank correlation) and exhibit significantly more complex transcriptional and posttranscriptional regulatory features (40% more transcription start sites per gene, 22% longer 5'UTR). Thus, genome-scale systems biology approaches rigorously identify a functional core proteome needed to support growth. This framework, validated by using high-throughput datasets, facilitates a mechanistic understanding of systems-level core proteome function through in silico models; it de facto defines a paleome.
Computing prokaryotic gene ubiquity: rescuing the core from extinction.
Charlebois, Robert L; Doolittle, W Ford
2004-12-01
The genomic core concept has found several uses in comparative and evolutionary genomics. Defined as the set of all genes common to (ubiquitous among) all genomes in a phylogenetically coherent group, core size decreases as the number and phylogenetic diversity of the relevant group increases. Here, we focus on methods for defining the size and composition of the core of all genes shared by sequenced genomes of prokaryotes (Bacteria and Archaea). There are few (almost certainly less than 50) genes shared by all of the 147 genomes compared, surely insufficient to conduct all essential functions. Sequencing and annotation errors are responsible for the apparent absence of some genes, while very limited but genuine disappearances (from just one or a few genomes) can account for several others. Core size will continue to decrease as more genome sequences appear, unless the requirement for ubiquity is relaxed. Such relaxation seems consistent with any reasonable biological purpose for seeking a core, but it renders the problem of definition more problematic. We propose an alternative approach (the phylogenetically balanced core), which preserves some of the biological utility of the core concept. Cores, however delimited, preferentially contain informational rather than operational genes; we present a new hypothesis for why this might be so.
Identification of the Core Set of Carbon-Associated Genes in a Bioenergy Grassland Soil
Howe, Adina; Yang, Fan; Williams, Ryan J.; ...
2016-11-17
Despite the central role of soil microbial communities in global carbon (C) cycling, little is known about soil microbial community structure and even less about their metabolic pathways. Efforts to characterize soil communities often focus on identifying differences in gene content across environmental gradients, but an alternative question is what genes are similar in soils. These genes may indicate critical species or potential functions that are required in all soils. Here we identified the “core” set of C cycling sequences widely present in multiple soil metagenomes from a fertilized prairie (FP). Of 226,887 sequences associated with known enzymes involved inmore » the synthesis, metabolism, and transport of carbohydrates, 843 were identified to be consistently prevalent across four replicate soil metagenomes. This core metagenome was functionally and taxonomically diverse, representing five enzyme classes and 99 enzyme families within the CAZy database. Though it only comprised 0.4% of all CAZy-associated genes identified in FP metagenomes, the core was found to be comprised of functions similar to those within cumulative soils. The FP CAZy-associated core sequences were present in multiple publicly available soil metagenomes and most similar to soils sharing geographic proximity. As a result, in soil ecosystems, where high diversity remains a key challenge for metagenomic investigations, these core genes represent a subset of critical functions necessary for carbohydrate metabolism, which can be targeted to evaluate important C fluxes in these and other similar soils.« less
Dominguez, Daniel; Tsai, Yi-Hsuan; Gomez, Nicholas; Jha, Deepak Kumar; Davis, Ian; Wang, Zefeng
2016-01-01
Progression through the cell cycle is largely dependent on waves of periodic gene expression, and the regulatory networks for these transcriptome dynamics have emerged as critical points of vulnerability in various aspects of tumor biology. Through RNA-sequencing of human cells during two continuous cell cycles (>2.3 billion paired reads), we identified over 1 000 mRNAs, non-coding RNAs and pseudogenes with periodic expression. Periodic transcripts are enriched in functions related to DNA metabolism, mitosis, and DNA damage response, indicating these genes likely represent putative cell cycle regulators. Using our set of periodic genes, we developed a new approach termed “mitotic trait” that can classify primary tumors and normal tissues by their transcriptome similarity to different cell cycle stages. By analyzing >4 000 tumor samples in The Cancer Genome Atlas (TCGA) and other expression data sets, we found that mitotic trait significantly correlates with genetic alterations, tumor subtype and, notably, patient survival. We further defined a core set of 67 genes with robust periodic expression in multiple cell types. Proteins encoded by these genes function as major hubs of protein-protein interaction and are mostly required for cell cycle progression. The core genes also have unique chromatin features including increased levels of CTCF/RAD21 binding and H3K36me3. Loss of these features in uterine and kidney cancers is associated with altered expression of the core 67 genes. Our study suggests new chromatin-associated mechanisms for periodic gene regulation and offers a predictor of cancer patient outcomes. PMID:27364684
Drury, Suzanne; Salter, Janine; Baehner, Frederick L; Shak, Steven; Dowsett, Mitch
2010-06-01
To determine whether 0.6 mm cores of formalin-fixed paraffin-embedded (FFPE) tissue, as commonly used to construct immunohistochemical tissue microarrays, may be a valid alternative to tissue sections as source material for quantitative real-time PCR-based transcriptional profiling of breast cancer. Four matched 0.6 mm cores of invasive breast tumour and two 10 microm whole sections were taken from eight FFPE blocks. RNA was extracted and reverse transcribed, and TaqMan assays were performed on the 21 genes of the Oncotype DX Breast Cancer assay. Expression of the 16 recurrence-related genes was normalised to the set of five reference genes, and the recurrence score (RS) was calculated. RNA yield was lower from 0.6 mm cores than from 10 microm whole sections, but was still more than sufficient to perform the assay. RS and single gene data from cores were highly comparable with those from whole sections (RS p=0.005). Greater variability was seen between cores than between sections. FFPE sections are preferable to 0.6 mm cores for RNA profiling in order to maximise RNA yield and to allow for standard histopathological assessment. However, 0.6 mm cores are sufficient and would be appropriate to use for large cohort studies.
Analysis of co-evolving genes in campylobacter jejuni and C. coli
USDA-ARS?s Scientific Manuscript database
Background: The population structure of Campylobacter has been frequently studied by MLST, for which fragments of housekeeping genes are compared. We wished to determine if the used MLST genes are representative of the complete genome. Methods: A set of 1029 core gene families (CGF) was identifie...
Essentiality, conservation, evolutionary pressure and codon bias in bacterial genomes.
Dilucca, Maddalena; Cimini, Giulio; Giansanti, Andrea
2018-07-15
Essential genes constitute the core of genes which cannot be mutated too much nor lost along the evolutionary history of a species. Natural selection is expected to be stricter on essential genes and on conserved (highly shared) genes, than on genes that are either nonessential or peculiar to a single or a few species. In order to further assess this expectation, we study here how essentiality of a gene is connected with its degree of conservation among several unrelated bacterial species, each one characterised by its own codon usage bias. Confirming previous results on E. coli, we show the existence of a universal exponential relation between gene essentiality and conservation in bacteria. Moreover, we show that, within each bacterial genome, there are at least two groups of functionally distinct genes, characterised by different levels of conservation and codon bias: i) a core of essential genes, mainly related to cellular information processing; ii) a set of less conserved nonessential genes with prevalent functions related to metabolism. In particular, the genes in the first group are more retained among species, are subject to a stronger purifying conservative selection and display a more limited repertoire of synonymous codons. The core of essential genes is close to the minimal bacterial genome, which is in the focus of recent studies in synthetic biology, though we confirm that orthologs of genes that are essential in one species are not necessarily essential in other species. We also list a set of highly shared genes which, reasonably, could constitute a reservoir of targets for new anti-microbial drugs. Copyright © 2018 Elsevier B.V. All rights reserved.
Hestand, Matthew S; van Galen, Michiel; Villerius, Michel P; van Ommen, Gert-Jan B; den Dunnen, Johan T; 't Hoen, Peter AC
2008-01-01
Background The identification of transcription factor binding sites is difficult since they are only a small number of nucleotides in size, resulting in large numbers of false positives and false negatives in current approaches. Computational methods to reduce false positives are to look for over-representation of transcription factor binding sites in a set of similarly regulated promoters or to look for conservation in orthologous promoter alignments. Results We have developed a novel tool, "CORE_TF" (Conserved and Over-REpresented Transcription Factor binding sites) that identifies common transcription factor binding sites in promoters of co-regulated genes. To improve upon existing binding site predictions, the tool searches for position weight matrices from the TRANSFACR database that are over-represented in an experimental set compared to a random set of promoters and identifies cross-species conservation of the predicted transcription factor binding sites. The algorithm has been evaluated with expression and chromatin-immunoprecipitation on microarray data. We also implement and demonstrate the importance of matching the random set of promoters to the experimental promoters by GC content, which is a unique feature of our tool. Conclusion The program CORE_TF is accessible in a user friendly web interface at . It provides a table of over-represented transcription factor binding sites in the users input genes' promoters and a graphical view of evolutionary conserved transcription factor binding sites. In our test data sets it successfully predicts target transcription factors and their binding sites. PMID:19036135
Anderson, Ashley K.; Ohler, Uwe; Wassarman, David A.
2012-01-01
To investigate the importance of core promoter elements for tissue-specific transcription of RNA polymerase II genes, we examined testis-specific transcription in Drosophila melanogaster. Bioinformatic analyses of core promoter sequences from 190 genes that are specifically expressed in testes identified a 10 bp A/T-rich motif that is identical to the translational control element (TCE). The TCE functions in the 5′ untranslated region of Mst(3)CGP mRNAs to repress translation, and it also functions in a heterologous gene to regulate transcription. We found that among genes with focused initiation patterns, the TCE is significantly enriched in core promoters of genes that are specifically expressed in testes but not in core promoters of genes that are specifically expressed in other tissues. The TCE is variably located in core promoters and is conserved in melanogaster subgroup species, but conservation dramatically drops in more distant species. In transgenic flies, short (300–400 bp) genomic regions containing a TCE directed testis-specific transcription of a reporter gene. Mutation of the TCE significantly reduced but did not abolish reporter gene transcription indicating that the TCE is important but not essential for transcription activation. Finally, mutation of testis-specific TFIID (tTFIID) subunits significantly reduced the transcription of a subset of endogenous TCE-containing but not TCE-lacking genes, suggesting that tTFIID activity is limited to TCE-containing genes but that tTFIID is not an obligatory regulator of TCE-containing genes. Thus, the TCE is a core promoter element in a subset of genes that are specifically expressed in testes. Furthermore, the TCE regulates transcription in the context of short genomic regions, from variable locations in the core promoter, and both dependently and independently of tTFIID. These findings set the stage for determining the mechanism by which the TCE regulates testis-specific transcription and understanding the dual role of the TCE in translational and transcriptional regulation. PMID:22984601
Katzenberger, Rebeccah J; Rach, Elizabeth A; Anderson, Ashley K; Ohler, Uwe; Wassarman, David A
2012-01-01
To investigate the importance of core promoter elements for tissue-specific transcription of RNA polymerase II genes, we examined testis-specific transcription in Drosophila melanogaster. Bioinformatic analyses of core promoter sequences from 190 genes that are specifically expressed in testes identified a 10 bp A/T-rich motif that is identical to the translational control element (TCE). The TCE functions in the 5' untranslated region of Mst(3)CGP mRNAs to repress translation, and it also functions in a heterologous gene to regulate transcription. We found that among genes with focused initiation patterns, the TCE is significantly enriched in core promoters of genes that are specifically expressed in testes but not in core promoters of genes that are specifically expressed in other tissues. The TCE is variably located in core promoters and is conserved in melanogaster subgroup species, but conservation dramatically drops in more distant species. In transgenic flies, short (300-400 bp) genomic regions containing a TCE directed testis-specific transcription of a reporter gene. Mutation of the TCE significantly reduced but did not abolish reporter gene transcription indicating that the TCE is important but not essential for transcription activation. Finally, mutation of testis-specific TFIID (tTFIID) subunits significantly reduced the transcription of a subset of endogenous TCE-containing but not TCE-lacking genes, suggesting that tTFIID activity is limited to TCE-containing genes but that tTFIID is not an obligatory regulator of TCE-containing genes. Thus, the TCE is a core promoter element in a subset of genes that are specifically expressed in testes. Furthermore, the TCE regulates transcription in the context of short genomic regions, from variable locations in the core promoter, and both dependently and independently of tTFIID. These findings set the stage for determining the mechanism by which the TCE regulates testis-specific transcription and understanding the dual role of the TCE in translational and transcriptional regulation.
Citerne, Hélène L.; Le Guilloux, Martine; Sannier, Julie; Nadot, Sophie; Damerval, Catherine
2013-01-01
TCP ECE genes encode transcription factors which have received much attention for their repeated recruitment in the control of floral symmetry in core eudicots, and more recently in monocots. Major duplications of TCP ECE genes have been described in core eudicots, but the evolutionary history of this gene family is unknown in basal eudicots. Reconstructing the phylogeny of ECE genes in basal eudicots will help set a framework for understanding the functional evolution of these genes. TCP ECE genes were sequenced in all major lineages of basal eudicots and Gunnera which belongs to the sister clade to all other core eudicots. We show that in these lineages they have a complex evolutionary history with repeated duplications. We estimate the timing of the two major duplications already identified in the core eudicots within a timeframe before the divergence of Gunnera and after the divergence of Proteales. We also use a synteny-based approach to examine the extent to which the expansion of TCP ECE genes in diverse eudicot lineages may be due to genome-wide duplications. The three major core-eudicot specific clades share a number of collinear genes, and their common evolutionary history may have originated at the γ event. Genomic comparisons in Arabidopsis thaliana and Solanum lycopersicum highlight their separate polyploid origin, with syntenic fragments with and without TCP ECE genes showing differential gene loss and genomic rearrangements. Comparison between recently available genomes from two basal eudicots Aquilegia coerulea and Nelumbo nucifera suggests that the two TCP ECE paralogs in these species are also derived from large-scale duplications. TCP ECE loci from basal eudicots share many features with the three main core eudicot loci, and allow us to infer the makeup of the ancestral eudicot locus. PMID:24019982
PanCoreGen - Profiling, detecting, annotating protein-coding genes in microbial genomes.
Paul, Sandip; Bhardwaj, Archana; Bag, Sumit K; Sokurenko, Evgeni V; Chattopadhyay, Sujay
2015-12-01
A large amount of genomic data, especially from multiple isolates of a single species, has opened new vistas for microbial genomics analysis. Analyzing the pan-genome (i.e. the sum of genetic repertoire) of microbial species is crucial in understanding the dynamics of molecular evolution, where virulence evolution is of major interest. Here we present PanCoreGen - a standalone application for pan- and core-genomic profiling of microbial protein-coding genes. PanCoreGen overcomes key limitations of the existing pan-genomic analysis tools, and develops an integrated annotation-structure for a species-specific pan-genomic profile. It provides important new features for annotating draft genomes/contigs and detecting unidentified genes in annotated genomes. It also generates user-defined group-specific datasets within the pan-genome. Interestingly, analyzing an example-set of Salmonella genomes, we detect potential footprints of adaptive convergence of horizontally transferred genes in two human-restricted pathogenic serovars - Typhi and Paratyphi A. Overall, PanCoreGen represents a state-of-the-art tool for microbial phylogenomics and pathogenomics study. Copyright © 2015 Elsevier Inc. All rights reserved.
Evaluation and Design of Genome-Wide CRISPR/SpCas9 Knockout Screens
Hart, Traver; Tong, Amy Hin Yan; Chan, Katie; Van Leeuwen, Jolanda; Seetharaman, Ashwin; Aregger, Michael; Chandrashekhar, Megha; Hustedt, Nicole; Seth, Sahil; Noonan, Avery; Habsid, Andrea; Sizova, Olga; Nedyalkova, Lyudmila; Climie, Ryan; Tworzyanski, Leanne; Lawson, Keith; Sartori, Maria Augusta; Alibeh, Sabriyeh; Tieu, David; Masud, Sanna; Mero, Patricia; Weiss, Alexander; Brown, Kevin R.; Usaj, Matej; Billmann, Maximilian; Rahman, Mahfuzur; Costanzo, Michael; Myers, Chad L.; Andrews, Brenda J.; Boone, Charles; Durocher, Daniel; Moffat, Jason
2017-01-01
The adaptation of CRISPR/SpCas9 technology to mammalian cell lines is transforming the study of human functional genomics. Pooled libraries of CRISPR guide RNAs (gRNAs) targeting human protein-coding genes and encoded in viral vectors have been used to systematically create gene knockouts in a variety of human cancer and immortalized cell lines, in an effort to identify whether these knockouts cause cellular fitness defects. Previous work has shown that CRISPR screens are more sensitive and specific than pooled-library shRNA screens in similar assays, but currently there exists significant variability across CRISPR library designs and experimental protocols. In this study, we reanalyze 17 genome-scale knockout screens in human cell lines from three research groups, using three different genome-scale gRNA libraries. Using the Bayesian Analysis of Gene Essentiality algorithm to identify essential genes, we refine and expand our previously defined set of human core essential genes from 360 to 684 genes. We use this expanded set of reference core essential genes, CEG2, plus empirical data from six CRISPR knockout screens to guide the design of a sequence-optimized gRNA library, the Toronto KnockOut version 3.0 (TKOv3) library. We then demonstrate the high effectiveness of the library relative to reference sets of essential and nonessential genes, as well as other screens using similar approaches. The optimized TKOv3 library, combined with the CEG2 reference set, provide an efficient, highly optimized platform for performing and assessing gene knockout screens in human cell lines. PMID:28655737
Identification of a core set of rhizobial infection genes using data from single cell-types.
Chen, Da-Song; Liu, Cheng-Wu; Roy, Sonali; Cousins, Donna; Stacey, Nicola; Murray, Jeremy D
2015-01-01
Genome-wide expression studies on nodulation have varied in their scale from entire root systems to dissected nodules or root sections containing nodule primordia (NP). More recently efforts have focused on developing methods for isolation of root hairs from infected plants and the application of laser-capture microdissection technology to nodules. Here we analyze two published data sets to identify a core set of infection genes that are expressed in the nodule and in root hairs during infection. Among the genes identified were those encoding phenylpropanoid biosynthesis enzymes including Chalcone-O-Methyltransferase which is required for the production of the potent Nod gene inducer 4',4-dihydroxy-2-methoxychalcone. A promoter-GUS analysis in transgenic hairy roots for two genes encoding Chalcone-O-Methyltransferase isoforms revealed their expression in rhizobially infected root hairs and the nodule infection zone but not in the nitrogen fixation zone. We also describe a group of Rhizobially Induced Peroxidases whose expression overlaps with the production of superoxide in rhizobially infected root hairs and in nodules and roots. Finally, we identify a cohort of co-regulated transcription factors as candidate regulators of these processes.
DOE Office of Scientific and Technical Information (OSTI.GOV)
Huang, Tingting; Chang, Chin -Yuan; Lohman, Jeremy R.
Comparative analysis of the enediyne biosynthetic gene clusters revealed sets of conserved genes serving as outstanding candidates for the enediyne core. Here we report the crystal structures of SgcJ and its homologue NCS-Orf16, together with gene inactivation and site-directed mutagenesis studies, to gain insight into enediyne core biosynthesis. Gene inactivation in vivo establishes that SgcJ is required for C-1027 production in Streptomyces globisporus. SgcJ and NCS-Orf16 share a common structure with the nuclear transport factor 2-like superfamily of proteins, featuring a putative substrate binding or catalytic active site. Site-directed mutagenesis of the conserved residues lining this site allowed us tomore » propose that SgcJ and its homologues may play a catalytic role in transforming the linear polyene intermediate, along with other enediyne polyketide synthase-associated enzymes, into an enzyme-sequestered enediyne core intermediate. In conclusion, these findings will help formulate hypotheses and design experiments to ascertain the function of SgcJ and its homologues in nine-membered enediyne core biosynthesis.« less
Genome variations associated with viral susceptibility and calcification in Emiliania huxleyi.
Kegel, Jessica U; John, Uwe; Valentin, Klaus; Frickenhaus, Stephan
2013-01-01
Emiliania huxleyi, a key player in the global carbon cycle is one of the best studied coccolithophores with respect to biogeochemical cycles, climatology, and host-virus interactions. Strains of E. huxleyi show phenotypic plasticity regarding growth behaviour, light-response, calcification, acidification, and virus susceptibility. This phenomenon is likely a consequence of genomic differences, or transcriptomic responses, to environmental conditions or threats such as viral infections. We used an E. huxleyi genome microarray based on the sequenced strain CCMP1516 (reference strain) to perform comparative genomic hybridizations (CGH) of 16 E. huxleyi strains of different geographic origin. We investigated the genomic diversity and plasticity and focused on the identification of genes related to virus susceptibility and coccolith production (calcification). Among the tested 31940 gene models a core genome of 14628 genes was identified by hybridization among 16 E. huxleyi strains. 224 probes were characterized as specific for the reference strain CCMP1516. Compared to the sequenced E. huxleyi strain CCMP1516 variation in gene content of up to 30 percent among strains was observed. Comparison of core and non-core transcripts sets in terms of annotated functions reveals a broad, almost equal functional coverage over all KOG-categories of both transcript sets within the whole annotated genome. Within the variable (non-core) genome we identified genes associated with virus susceptibility and calcification. Genes associated with virus susceptibility include a Bax inhibitor-1 protein, three LRR receptor-like protein kinases, and mitogen-activated protein kinase. Our list of transcripts associated with coccolith production will stimulate further research, e.g. by genetic manipulation. In particular, the V-type proton ATPase 16 kDa proteolipid subunit is proposed to be a plausible target gene for further calcification studies.
Genome Variations Associated with Viral Susceptibility and Calcification in Emiliania huxleyi
Kegel, Jessica U.; John, Uwe; Valentin, Klaus; Frickenhaus, Stephan
2013-01-01
Emiliania huxleyi, a key player in the global carbon cycle is one of the best studied coccolithophores with respect to biogeochemical cycles, climatology, and host-virus interactions. Strains of E. huxleyi show phenotypic plasticity regarding growth behaviour, light-response, calcification, acidification, and virus susceptibility. This phenomenon is likely a consequence of genomic differences, or transcriptomic responses, to environmental conditions or threats such as viral infections. We used an E. huxleyi genome microarray based on the sequenced strain CCMP1516 (reference strain) to perform comparative genomic hybridizations (CGH) of 16 E. huxleyi strains of different geographic origin. We investigated the genomic diversity and plasticity and focused on the identification of genes related to virus susceptibility and coccolith production (calcification). Among the tested 31940 gene models a core genome of 14628 genes was identified by hybridization among 16 E. huxleyi strains. 224 probes were characterized as specific for the reference strain CCMP1516. Compared to the sequenced E. huxleyi strain CCMP1516 variation in gene content of up to 30 percent among strains was observed. Comparison of core and non-core transcripts sets in terms of annotated functions reveals a broad, almost equal functional coverage over all KOG-categories of both transcript sets within the whole annotated genome. Within the variable (non-core) genome we identified genes associated with virus susceptibility and calcification. Genes associated with virus susceptibility include a Bax inhibitor-1 protein, three LRR receptor-like protein kinases, and mitogen-activated protein kinase. Our list of transcripts associated with coccolith production will stimulate further research, e.g. by genetic manipulation. In particular, the V-type proton ATPase 16 kDa proteolipid subunit is proposed to be a plausible target gene for further calcification studies. PMID:24260453
Determination of the Core of a Minimal Bacterial Gene Set†
Gil, Rosario; Silva, Francisco J.; Peretó, Juli; Moya, Andrés
2004-01-01
The availability of a large number of complete genome sequences raises the question of how many genes are essential for cellular life. Trying to reconstruct the core of the protein-coding gene set for a hypothetical minimal bacterial cell, we have performed a computational comparative analysis of eight bacterial genomes. Six of the analyzed genomes are very small due to a dramatic genome size reduction process, while the other two, corresponding to free-living relatives, are larger. The available data from several systematic experimental approaches to define all the essential genes in some completely sequenced bacterial genomes were also considered, and a reconstruction of a minimal metabolic machinery necessary to sustain life was carried out. The proposed minimal genome contains 206 protein-coding genes with all the genetic information necessary for self-maintenance and reproduction in the presence of a full complement of essential nutrients and in the absence of environmental stress. The main features of such a minimal gene set, as well as the metabolic functions that must be present in the hypothetical minimal cell, are discussed. PMID:15353568
Novel Phylogenetic Approaches to Problems in Microbial Genomics
2010-09-01
Transfer Loss Birth Chromalveolata Plantae Fungi Metazoa Crenarcheota Euryarchaeota Nanoarchaeota a a 0.5 Ga A B Alm, Figure 1 Event Rate A r c h a e a...loss scenario is invoked by the mini-reconciler. Other more complex scenarios exist (e.g., s4:sD), but these can be ignored without affecting overall...identical HGTs affected all of the genes in the core set. However, reconstructions of organismal histories using core genes have been criticized for ignoring
Wei, Lei; Wang, Jianmin; Lampert, Erika; Schlanger, Simon; DePriest, Adam D.; Hu, Qiang; Gomez, Eduardo Cortes; Murakam, Mitsuko; Glenn, Sean T.; Conroy, Jeffrey; Morrison, Carl; Azabdaftari, Gissou; Mohler, James L.; Liu, Song; Heemers, Hannelore V.
2018-01-01
Background Next-generation sequencing is revealing genomic heterogeneity in localized prostate cancer (CaP). Incomplete sampling of CaP multiclonality has limited the implications for molecular subtyping, stratification, and systemic treatment. Objective To determine the impact of genomic and transcriptomic diversity within and among intraprostatic CaP foci on CaP molecular taxonomy, predictors of progression, and actionable therapeutic targets. Design, setting, and participants Four consecutive patients with clinically localized National Comprehensive Cancer Network intermediate- or high-risk CaP who did not receive neoadjuvant therapy underwent radical prostatectomy at Roswell Park Cancer Institute in June–July 2014. Presurgical information on CaP content and a customized tissue procurement procedure were used to isolate nonmicroscopic and noncontiguous CaP foci in radical prostatectomy specimens. Three cores were obtained from the index lesion and one core from smaller lesions. RNA and DNA were extracted simultaneously from 26 cores with ≥90% CaP content and analyzed using whole-exome sequencing, single-nucleotide polymorphism arrays, and RNA sequencing. Outcome measurements and statistical analysis Somatic mutations, copy number alternations, gene expression, gene fusions, and phylogeny were defined. The impact of genomic alterations on CaP molecular classification, gene sets measured in Oncotype DX, Prolaris, and Decipher assays, and androgen receptor activity among CaP cores was determined. Results and limitations There was considerable variability in genomic alterations among CaP cores, and between RNA- and DNA-based platforms. Heterogeneity was found in molecular grouping of individual CaP foci and the activity of gene sets underlying the assays for risk stratification and androgen receptor activity, and was validated in independent genomic data sets. Determination of the implications for clinical decision-making requires follow-up studies. Conclusions Genomic make-up varies widely among CaP foci, so care should be taken when making treatment decisions based on a single biopsy or index lesions. Patient summary We examined the molecular composition of individual cancers in a patient’s prostate. We found a lot of genetic diversity among these cancers, and concluded that information from a single cancer biopsy is not sufficient to guide treatment decisions. PMID:27451135
Zanotto, Paolo Marinho de Andrade; Krakauer, David C.
2008-01-01
We consider the concerted evolution of viral genomes in four families of DNA viruses. Given the high rate of horizontal gene transfer among viruses and their hosts, it is an open question as to how representative particular genes are of the evolutionary history of the complete genome. To address the concerted evolution of viral genes, we compared genomic evolution across four distinct, extant viral families. For all four viral families we constructed DNA-dependent DNA polymerase-based (DdDp) phylogenies and in addition, whole genome sequence, as quantitative descriptions of inter-genome relationships. We found that the history of the polymerase gene was highly predictive of the history of the genome as a whole, which we explain in terms of repeated, co-divergence events of the core DdDp gene accompanied by a number of satellite, accessory genetic loci. We also found that the rate of gene gain in baculovirus and poxviruses proceeds significantly more quickly than the rate of gene loss and that there is convergent acquisition of satellite functions promoting contextual adaptation when distinct viral families infect related hosts. The congruence of the genome and polymerase trees suggests that a large set of viral genes, including polymerase, derive from a phylogenetically conserved core of genes of host origin, secondarily reinforced by gene acquisition from common hosts or co-infecting viruses within the host. A single viral genome can be thought of as a mutualistic network, with the core genes acting as an effective host and the satellite genes as effective symbionts. Larger virus genomes show a greater departure from linkage equilibrium between core and satellites functions. PMID:18941535
Cancer cell redirection biomarker discovery using a mutual information approach.
Roche, Kimberly; Feltus, F Alex; Park, Jang Pyo; Coissieux, Marie-May; Chang, Chenyan; Chan, Vera B S; Bentires-Alj, Mohamed; Booth, Brian W
2017-01-01
Introducing tumor-derived cells into normal mammary stem cell niches at a sufficiently high ratio of normal to tumorous cells causes those tumor cells to undergo a change to normal mammary phenotype and yield normal mammary progeny. This phenomenon has been termed cancer cell redirection. We have developed an in vitro model that mimics in vivo redirection of cancer cells by the normal mammary microenvironment. Using the RNA profiling data from this cellular model, we examined high-level characteristics of the normal, redirected, and tumor transcriptomes and found the global expression profiles clearly distinguish the three expression states. To identify potential redirection biomarkers that cause the redirected state to shift toward the normal expression pattern, we used mutual information relationships between normal, redirected, and tumor cell groups. Mutual information relationship analysis reduced a dataset of over 35,000 gene expression measurements spread over 13,000 curated gene sets to a set of 20 significant molecular signatures totaling 906 unique loci. Several of these molecular signatures are hallmark drivers of the tumor state. Using differential expression as a guide, we further refined the gene set to 120 core redirection biomarker genes. The expression levels of these core biomarkers are sufficient to make the normal and redirected gene expression states indistinguishable from each other but radically different from the tumor state.
Cancer cell redirection biomarker discovery using a mutual information approach
Roche, Kimberly; Feltus, F. Alex; Park, Jang Pyo; Coissieux, Marie-May; Chang, Chenyan; Chan, Vera B. S.; Bentires-Alj, Mohamed
2017-01-01
Introducing tumor-derived cells into normal mammary stem cell niches at a sufficiently high ratio of normal to tumorous cells causes those tumor cells to undergo a change to normal mammary phenotype and yield normal mammary progeny. This phenomenon has been termed cancer cell redirection. We have developed an in vitro model that mimics in vivo redirection of cancer cells by the normal mammary microenvironment. Using the RNA profiling data from this cellular model, we examined high-level characteristics of the normal, redirected, and tumor transcriptomes and found the global expression profiles clearly distinguish the three expression states. To identify potential redirection biomarkers that cause the redirected state to shift toward the normal expression pattern, we used mutual information relationships between normal, redirected, and tumor cell groups. Mutual information relationship analysis reduced a dataset of over 35,000 gene expression measurements spread over 13,000 curated gene sets to a set of 20 significant molecular signatures totaling 906 unique loci. Several of these molecular signatures are hallmark drivers of the tumor state. Using differential expression as a guide, we further refined the gene set to 120 core redirection biomarker genes. The expression levels of these core biomarkers are sufficient to make the normal and redirected gene expression states indistinguishable from each other but radically different from the tumor state. PMID:28594912
Huang, Tingting; Chang, Chin -Yuan; Lohman, Jeremy R.; ...
2016-10-01
Comparative analysis of the enediyne biosynthetic gene clusters revealed sets of conserved genes serving as outstanding candidates for the enediyne core. Here we report the crystal structures of SgcJ and its homologue NCS-Orf16, together with gene inactivation and site-directed mutagenesis studies, to gain insight into enediyne core biosynthesis. Gene inactivation in vivo establishes that SgcJ is required for C-1027 production in Streptomyces globisporus. SgcJ and NCS-Orf16 share a common structure with the nuclear transport factor 2-like superfamily of proteins, featuring a putative substrate binding or catalytic active site. Site-directed mutagenesis of the conserved residues lining this site allowed us tomore » propose that SgcJ and its homologues may play a catalytic role in transforming the linear polyene intermediate, along with other enediyne polyketide synthase-associated enzymes, into an enzyme-sequestered enediyne core intermediate. In conclusion, these findings will help formulate hypotheses and design experiments to ascertain the function of SgcJ and its homologues in nine-membered enediyne core biosynthesis.« less
Analysis of evolutionary patterns of genes in campylobacter jejuni and C. coli
USDA-ARS?s Scientific Manuscript database
Background: In order to investigate the population genetics structure of thermophilic Campylobacter spp., we extracted a set of 1029 core gene families (CGF) from 25 sequenced genomes of C. jejuni, C. coli and C. lari. Based on these CGFs we employed different approaches to reveal the evolutionary ...
Auvergne, Romane M; Sim, Fraser J; Wang, Su; Chandler-Militello, Devin; Burch, Jaclyn; Al Fanek, Yazan; Davis, Danielle; Benraiss, Abdellatif; Walter, Kevin; Achanta, Pragathi; Johnson, Mahlon; Quinones-Hinojosa, Alfredo; Natesan, Sridaran; Ford, Heide L; Goldman, Steven A
2013-06-27
Glial progenitor cells (GPCs) are a potential source of malignant gliomas. We used A2B5-based sorting to extract tumorigenic GPCs from human gliomas spanning World Health Organization grades II-IV. Messenger RNA profiling identified a cohort of genes that distinguished A2B5+ glioma tumor progenitor cells (TPCs) from A2B5+ GPCs isolated from normal white matter. A core set of genes and pathways was substantially dysregulated in A2B5+ TPCs, which included the transcription factor SIX1 and its principal cofactors, EYA1 and DACH2. Small hairpin RNAi silencing of SIX1 inhibited the expansion of glioma TPCs in vitro and in vivo, suggesting a critical and unrecognized role of the SIX1-EYA1-DACH2 system in glioma genesis or progression. By comparing the expression patterns of glioma TPCs with those of normal GPCs, we have identified a discrete set of pathways by which glial tumorigenesis may be better understood and more specifically targeted. Copyright © 2013 The Authors. Published by Elsevier Inc. All rights reserved.
Flotte, Terence R; Daniels, Eric; Benson, Janet; Bevett-Rose, Jeneé M; Cornetta, Kenneth; Diggins, Margaret; Johnston, Julie; Sepelak, Susan; van der Loo, Johannes C M; Wilson, James M; McDonald, Cheryl L
2017-12-01
Over a 10-year period, the Gene Therapy Resource Program (GTRP) of the National Heart Lung and Blood Institute has provided a set of core services to investigators to facilitate the clinical translation of gene therapy. These services have included a preclinical (research-grade) vector production core; current Good Manufacturing Practice clinical-grade vector cores for recombinant adeno-associated virus and lentivirus vectors; a pharmacology and toxicology core; and a coordinating center to manage program logistics and to provide regulatory and financial support to early-phase clinical trials. In addition, the GTRP has utilized a Steering Committee and a Scientific Review Board to guide overall progress and effectiveness and to evaluate individual proposals. These resources have been deployed to assist 82 investigators with 172 approved service proposals. These efforts have assisted in clinical trial implementation across a wide range of genetic, cardiac, pulmonary, and blood diseases. Program outcomes and potential future directions of the program are discussed.
Determination of performance characteristics of scientific applications on IBM Blue Gene/Q
DOE Office of Scientific and Technical Information (OSTI.GOV)
Evangelinos, C.; Walkup, R. E.; Sachdeva, V.
The IBM Blue Gene®/Q platform presents scientists and engineers with a rich set of hardware features such as 16 cores per chip sharing a Level 2 cache, a wide SIMD (single-instruction, multiple-data) unit, a five-dimensional torus network, and hardware support for collective operations. Especially important is the feature related to cores that have four “hardware threads,” which makes it possible to hide latencies and obtain a high fraction of the peak issue rate from each core. All of these hardware resources present unique performance-tuning opportunities on Blue Gene/Q. We provide an overview of several important applications and solvers and studymore » them on Blue Gene/Q using performance counters and Message Passing Interface profiles. We also discuss how Blue Gene/Q tools help us understand the interaction of the application with the hardware and software layers and provide guidance for optimization. Furthermore, on the basis of our analysis, we discuss code improvement strategies targeting Blue Gene/Q. Information about how these algorithms map to the Blue Gene® architecture is expected to have an impact on future system design as we move to the exascale era.« less
Kang, Yu; Gu, Chaohao; Yuan, Lina; Wang, Yue; Zhu, Yanmin; Li, Xinna; Luo, Qibin; Xiao, Jingfa; Jiang, Daquan; Qian, Minping; Ahmed Khan, Aftab; Chen, Fei; Zhang, Zhang; Yu, Jun
2014-11-25
The prokaryotic pangenome partitions genes into core and dispensable genes. The order of core genes, albeit assumed to be stable under selection in general, is frequently interrupted by horizontal gene transfer and rearrangement, but how a core-gene-defined genome maintains its stability or flexibility remains to be investigated. Based on data from 30 species, including 425 genomes from six phyla, we grouped core genes into syntenic blocks in the context of a pangenome according to their stability across multiple isolates. A subset of the core genes, often species specific and lineage associated, formed a core-gene-defined genome organizational framework (cGOF). Such cGOFs are either single segmental (one-third of the species analyzed) or multisegmental (the rest). Multisegment cGOFs were further classified into symmetric or asymmetric according to segment orientations toward the origin-terminus axis. The cGOFs in Gram-positive species are exclusively symmetric and often reversible in orientation, as opposed to those of the Gram-negative bacteria, which are all asymmetric and irreversible. Meanwhile, all species showing strong strand-biased gene distribution contain symmetric cGOFs and often specific DnaE (α subunit of DNA polymerase III) isoforms. Furthermore, functional evaluations revealed that cGOF genes are hub associated with regard to cellular activities, and the stability of cGOF provides efficient indexes for scaffold orientation as demonstrated by assembling virtual and empirical genome drafts. cGOFs show species specificity, and the symmetry of multisegmental cGOFs is conserved among taxa and constrained by DNA polymerase-centric strand-biased gene distribution. The definition of species-specific cGOFs provides powerful guidance for genome assembly and other structure-based analysis. Prokaryotic genomes are frequently interrupted by horizontal gene transfer (HGT) and rearrangement. To know whether there is a set of genes not only conserved in position among isolates but also functionally essential for a given species and to further evaluate the stability or flexibility of such genome structures across lineages are of importance. Based on a large number of multi-isolate pangenomic data, our analysis reveals that a subset of core genes is organized into a core-gene-defined genome organizational framework, or cGOF. Furthermore, the lineage-associated cGOFs among Gram-positive and Gram-negative bacteria behave differently: the former, composed of 2 to 4 segments, have their fragments symmetrically rearranged around the origin-terminus axis, whereas the latter show more complex segmentation and are partitioned asymmetrically into chromosomal structures. The definition of cGOFs provides new insights into prokaryotic genome organization and efficient guidance for genome assembly and analysis. Copyright © 2014 Kang et al.
Gene regulation is governed by a core network in hepatocellular carcinoma.
Gu, Zuguang; Zhang, Chenyu; Wang, Jin
2012-05-01
Hepatocellular carcinoma (HCC) is one of the most lethal cancers worldwide, and the mechanisms that lead to the disease are still relatively unclear. However, with the development of high-throughput technologies it is possible to gain a systematic view of biological systems to enhance the understanding of the roles of genes associated with HCC. Thus, analysis of the mechanism of molecule interactions in the context of gene regulatory networks can reveal specific sub-networks that lead to the development of HCC. In this study, we aimed to identify the most important gene regulations that are dysfunctional in HCC generation. Our method for constructing gene regulatory network is based on predicted target interactions, experimentally-supported interactions, and co-expression model. Regulators in the network included both transcription factors and microRNAs to provide a complete view of gene regulation. Analysis of gene regulatory network revealed that gene regulation in HCC is highly modular, in which different sets of regulators take charge of specific biological processes. We found that microRNAs mainly control biological functions related to mitochondria and oxidative reduction, while transcription factors control immune responses, extracellular activity and the cell cycle. On the higher level of gene regulation, there exists a core network that organizes regulations between different modules and maintains the robustness of the whole network. There is direct experimental evidence for most of the regulators in the core gene regulatory network relating to HCC. We infer it is the central controller of gene regulation. Finally, we explored the influence of the core gene regulatory network on biological pathways. Our analysis provides insights into the mechanism of transcriptional and post-transcriptional control in HCC. In particular, we highlight the importance of the core gene regulatory network; we propose that it is highly related to HCC and we believe further experimental validation is worthwhile.
APPRIS 2017: principal isoforms for multiple gene sets
Rodriguez-Rivas, Juan; Di Domenico, Tomás; Vázquez, Jesús; Valencia, Alfonso
2018-01-01
Abstract The APPRIS database (http://appris-tools.org) uses protein structural and functional features and information from cross-species conservation to annotate splice isoforms in protein-coding genes. APPRIS selects a single protein isoform, the ‘principal’ isoform, as the reference for each gene based on these annotations. A single main splice isoform reflects the biological reality for most protein coding genes and APPRIS principal isoforms are the best predictors of these main proteins isoforms. Here, we present the updates to the database, new developments that include the addition of three new species (chimpanzee, Drosophila melangaster and Caenorhabditis elegans), the expansion of APPRIS to cover the RefSeq gene set and the UniProtKB proteome for six species and refinements in the core methods that make up the annotation pipeline. In addition APPRIS now provides a measure of reliability for individual principal isoforms and updates with each release of the GENCODE/Ensembl and RefSeq reference sets. The individual GENCODE/Ensembl, RefSeq and UniProtKB reference gene sets for six organisms have been merged to produce common sets of splice variants. PMID:29069475
Welcome to pandoraviruses at the ‘Fourth TRUC’ club
Sharma, Vikas; Colson, Philippe; Chabrol, Olivier; Scheid, Patrick; Pontarotti, Pierre; Raoult, Didier
2015-01-01
Nucleocytoplasmic large DNA viruses, or representatives of the proposed order Megavirales, belong to families of giant viruses that infect a broad range of eukaryotic hosts. Megaviruses have been previously described to comprise a fourth monophylogenetic TRUC (things resisting uncompleted classification) together with cellular domains in the universal tree of life. Recently described pandoraviruses have large (1.9–2.5 MB) and highly divergent genomes. In the present study, we updated the classification of pandoraviruses and other reported giant viruses. Phylogenetic trees were constructed based on six informational genes. Hierarchical clustering was performed based on a set of informational genes from Megavirales members and cellular organisms. Homologous sequences were selected from cellular organisms using TimeTree software, comprising comprehensive, and representative sets of members from Bacteria, Archaea, and Eukarya. Phylogenetic analyses based on three conserved core genes clustered pandoraviruses with phycodnaviruses, exhibiting their close relatedness. Additionally, hierarchical clustering analyses based on informational genes grouped pandoraviruses with Megavirales members as a super group distinct from cellular organisms. Thus, the analyses based on core conserved genes revealed that pandoraviruses are new genuine members of the ‘Fourth TRUC’ club, encompassing distinct life forms compared with cellular organisms. PMID:26042093
Welcome to pandoraviruses at the 'Fourth TRUC' club.
Sharma, Vikas; Colson, Philippe; Chabrol, Olivier; Scheid, Patrick; Pontarotti, Pierre; Raoult, Didier
2015-01-01
Nucleocytoplasmic large DNA viruses, or representatives of the proposed order Megavirales, belong to families of giant viruses that infect a broad range of eukaryotic hosts. Megaviruses have been previously described to comprise a fourth monophylogenetic TRUC (things resisting uncompleted classification) together with cellular domains in the universal tree of life. Recently described pandoraviruses have large (1.9-2.5 MB) and highly divergent genomes. In the present study, we updated the classification of pandoraviruses and other reported giant viruses. Phylogenetic trees were constructed based on six informational genes. Hierarchical clustering was performed based on a set of informational genes from Megavirales members and cellular organisms. Homologous sequences were selected from cellular organisms using TimeTree software, comprising comprehensive, and representative sets of members from Bacteria, Archaea, and Eukarya. Phylogenetic analyses based on three conserved core genes clustered pandoraviruses with phycodnaviruses, exhibiting their close relatedness. Additionally, hierarchical clustering analyses based on informational genes grouped pandoraviruses with Megavirales members as a super group distinct from cellular organisms. Thus, the analyses based on core conserved genes revealed that pandoraviruses are new genuine members of the 'Fourth TRUC' club, encompassing distinct life forms compared with cellular organisms.
Orgeur, Mickael; Martens, Marvin; Leonte, Georgeta; Nassari, Sonya; Bonnin, Marie-Ange; Börno, Stefan T; Timmermann, Bernd; Hecht, Jochen; Duprez, Delphine; Stricker, Sigmar
2018-03-29
Connective tissues support organs and play crucial roles in development, homeostasis and fibrosis, yet our understanding of their formation is still limited. To gain insight into the molecular mechanisms of connective tissue specification, we selected five zinc-finger transcription factors - OSR1, OSR2, EGR1, KLF2 and KLF4 - based on their expression patterns and/or known involvement in connective tissue subtype differentiation. RNA-seq and ChIP-seq profiling of chick limb micromass cultures revealed a set of common genes regulated by all five transcription factors, which we describe as a connective tissue core expression set. This common core was enriched with genes associated with axon guidance and myofibroblast signature, including fibrosis-related genes. In addition, each transcription factor regulated a specific set of signalling molecules and extracellular matrix components. This suggests a concept whereby local molecular niches can be created by the expression of specific transcription factors impinging on the specification of local microenvironments. The regulatory network established here identifies common and distinct molecular signatures of limb connective tissue subtypes, provides novel insight into the signalling pathways governing connective tissue specification, and serves as a resource for connective tissue development. © 2018. Published by The Company of Biologists Ltd.
Chikkagoudar, Satish; Wang, Kai; Li, Mingyao
2011-05-26
Gene-gene interaction in genetic association studies is computationally intensive when a large number of SNPs are involved. Most of the latest Central Processing Units (CPUs) have multiple cores, whereas Graphics Processing Units (GPUs) also have hundreds of cores and have been recently used to implement faster scientific software. However, currently there are no genetic analysis software packages that allow users to fully utilize the computing power of these multi-core devices for genetic interaction analysis for binary traits. Here we present a novel software package GENIE, which utilizes the power of multiple GPU or CPU processor cores to parallelize the interaction analysis. GENIE reads an entire genetic association study dataset into memory and partitions the dataset into fragments with non-overlapping sets of SNPs. For each fragment, GENIE analyzes: 1) the interaction of SNPs within it in parallel, and 2) the interaction between the SNPs of the current fragment and other fragments in parallel. We tested GENIE on a large-scale candidate gene study on high-density lipoprotein cholesterol. Using an NVIDIA Tesla C1060 graphics card, the GPU mode of GENIE achieves a speedup of 27 times over its single-core CPU mode run. GENIE is open-source, economical, user-friendly, and scalable. Since the computing power and memory capacity of graphics cards are increasing rapidly while their cost is going down, we anticipate that GENIE will achieve greater speedups with faster GPU cards. Documentation, source code, and precompiled binaries can be downloaded from http://www.cceb.upenn.edu/~mli/software/GENIE/.
2011-01-01
Background Gene-gene interaction in genetic association studies is computationally intensive when a large number of SNPs are involved. Most of the latest Central Processing Units (CPUs) have multiple cores, whereas Graphics Processing Units (GPUs) also have hundreds of cores and have been recently used to implement faster scientific software. However, currently there are no genetic analysis software packages that allow users to fully utilize the computing power of these multi-core devices for genetic interaction analysis for binary traits. Findings Here we present a novel software package GENIE, which utilizes the power of multiple GPU or CPU processor cores to parallelize the interaction analysis. GENIE reads an entire genetic association study dataset into memory and partitions the dataset into fragments with non-overlapping sets of SNPs. For each fragment, GENIE analyzes: 1) the interaction of SNPs within it in parallel, and 2) the interaction between the SNPs of the current fragment and other fragments in parallel. We tested GENIE on a large-scale candidate gene study on high-density lipoprotein cholesterol. Using an NVIDIA Tesla C1060 graphics card, the GPU mode of GENIE achieves a speedup of 27 times over its single-core CPU mode run. Conclusions GENIE is open-source, economical, user-friendly, and scalable. Since the computing power and memory capacity of graphics cards are increasing rapidly while their cost is going down, we anticipate that GENIE will achieve greater speedups with faster GPU cards. Documentation, source code, and precompiled binaries can be downloaded from http://www.cceb.upenn.edu/~mli/software/GENIE/. PMID:21615923
Turmel, Monique; de Cambiaire, Jean-Charles; Otis, Christian; Lemieux, Claude
2016-01-01
The Chlorodendrophyceae is a small class of green algae belonging to the core Chlorophyta, an assemblage that also comprises the Pedinophyceae, Trebouxiophyceae, Ulvophyceae and Chlorophyceae. Here we describe for the first time the chloroplast genomes of chlorodendrophycean algae (Scherffelia dubia, 137,161 bp; Tetraselmis sp. CCMP 881, 100,264 bp). Characterized by a very small single-copy (SSC) region devoid of any gene and an unusually large inverted repeat (IR), the quadripartite structures of the Scherffelia and Tetraselmis genomes are unique among all core chlorophytes examined thus far. The lack of genes in the SSC region is offset by the rich and atypical gene complement of the IR, which includes genes from the SSC and large single-copy regions of prasinophyte and streptophyte chloroplast genomes having retained an ancestral quadripartite structure. Remarkably, seven of the atypical IR-encoded genes have also been observed in the IRs of pedinophycean and trebouxiophycean chloroplast genomes, suggesting that they were already present in the IR of the common ancestor of all core chlorophytes. Considering that the relationships among the main lineages of the core Chlorophyta are still unresolved, we evaluated the impact of including the Chlorodendrophyceae in chloroplast phylogenomic analyses. The trees we inferred using data sets of 79 and 108 genes from 71 chlorophytes indicate that the Chlorodendrophyceae is a deep-diverging lineage of the core Chlorophyta, although the placement of this class relative to the Pedinophyceae remains ambiguous. Interestingly, some of our phylogenomic trees together with our comparative analysis of gene order data support the monophyly of the Trebouxiophyceae, thus offering further evidence that the previously observed affiliation between the Chlorellales and Pedinophyceae is the result of systematic errors in phylogenetic reconstruction.
Rogic, Sanja; Wong, Albertina; Pavlidis, Paul
2017-01-01
Background Prenatal alcohol exposure (PAE) can result in an array of morphological, behavioural and neurobiological deficits that can range in their severity. Despite extensive research in the field and a significant progress made, especially in understanding the range of possible malformations and neurobehavioral abnormalities, the molecular mechanisms of alcohol responses in development are still not well understood. There have been multiple transcriptomic studies looking at the changes in gene expression after PAE in animal models, however there is a limited apparent consensus among the reported findings. In an effort to address this issue, we performed a comprehensive re-analysis and meta-analysis of all suitable, publically available expression data sets. Methods We assembled ten microarray data sets of gene expression after PAE in mouse and rat models consisting of samples from a total of 63 ethanol-exposed and 80 control animals. We re-analyzed each data set for differential expression and then used the results to perform meta-analyses considering all data sets together or grouping them by time or duration of exposure (pre- and post-natal, acute and chronic, respectively). We performed network and Gene Ontology enrichment analysis to further characterize the identified signatures. Results For each sub-analysis we identified signatures of differential expressed genes that show support from multiple studies. Overall, the changes in gene expression were more extensive after acute ethanol treatment during prenatal development than in other models. Considering the analysis of all the data together, we identified a robust core signature of 104 genes down-regulated after PAE, with no up-regulated genes. Functional analysis reveals over-representation of genes involved in protein synthesis, mRNA splicing and chromatin organization. Conclusions Our meta-analysis shows that existing studies, despite superficial dissimilarity in findings, share features that allow us to identify a common core signature set of transcriptome changes in PAE. This is an important step to identifying the biological processes that underlie the etiology of FASD. PMID:26996386
Nonell, Lara; Puigdecanet, Eulàlia; Astier, Laura; Solé, Francesc; Bayes-Genis, Antoni
2013-01-01
Molecular mechanisms associated with pathophysiological changes in ventricular remodelling due to myocardial infarction (MI) remain poorly understood. We analyzed changes in gene expression by microarray technology in porcine myocardial tissue at 1, 4, and 6 weeks post-MI. MI was induced by coronary artery ligation in 9 female pigs (30–40 kg). Animals were randomly sacrificed at 1, 4, or 6 weeks post-MI (n = 3 per group) and 3 healthy animals were also included as control group. Total RNA from myocardial samples was hybridized to GeneChip® Porcine Genome Arrays. Functional analysis was obtained with the Ingenuity Pathway Analysis (IPA) online tool. Validation of microarray data was performed by quantitative real-time PCR (qRT-PCR). More than 8,000 different probe sets showed altered expression in the remodelling myocardium at 1, 4, or 6 weeks post-MI. Ninety-seven percent of altered transcripts were detected in the infarct core and 255 probe sets were differentially expressed in the remote myocardium. Functional analysis revealed 28 genes de-regulated in the remote myocardial region in at least one of the three temporal analyzed stages, including genes associated with heart failure (HF), systemic sclerosis and coronary artery disease. In the infarct core tissue, eight major time-dependent gene expression patterns were recognized among 4,221 probe sets commonly altered over time. Altered gene expression of ACVR2B, BID, BMP2, BMPR1A, LMNA, NFKBIA, SMAD1, TGFB3, TNFRSF1A, and TP53 were further validated. The clustering of similar expression patterns for gene products with related function revealed molecular footprints, some of them described for the first time, which elucidate changes in biological processes at different stages after MI. PMID:23372767
van der Does, H. Charlotte; Schmidt, Sarah M.; Langereis, Léon; Hughes, Timothy R.
2016-01-01
Proteins secreted by pathogens during host colonization largely determine the outcome of pathogen-host interactions and are commonly called ‘effectors’. In fungal plant pathogens, coordinated transcriptional up-regulation of effector genes is a key feature of pathogenesis and effectors are often encoded in genomic regions with distinct repeat content, histone code and rate of evolution. In the tomato pathogen Fusarium oxysporum f. sp. lycopersici (Fol), effector genes reside on one of four accessory chromosomes, known as the ‘pathogenicity’ chromosome, which can be exchanged between strains through horizontal transfer. The three other accessory chromosomes in the Fol reference strain may also be important for virulence towards tomato. Expression of effector genes in Fol is highly up-regulated upon infection and requires Sge1, a transcription factor encoded on the core genome. Interestingly, the pathogenicity chromosome itself contains 13 predicted transcription factor genes and for all except one, there is a homolog on the core genome. We determined DNA binding specificity for nine transcription factors using oligonucleotide arrays. The binding sites for homologous transcription factors were highly similar, suggesting that extensive neofunctionalization of DNA binding specificity has not occurred. Several DNA binding sites are enriched on accessory chromosomes, and expression of FTF1, its core homolog FTF2 and SGE1 from a constitutive promoter can induce expression of effector genes. The DNA binding sites of only these three transcription factors are enriched among genes up-regulated during infection. We further show that Ftf1, Ftf2 and Sge1 can activate transcription from their binding sites in yeast. RNAseq analysis revealed that in strains with constitutive expression of FTF1, FTF2 or SGE1, expression of a similar set of plant-responsive genes on the pathogenicity chromosome is induced, including most effector genes. We conclude that the Fol pathogenicity chromosome may be partially transcriptionally autonomous, but there are also extensive transcriptional connections between core and accessory chromosomes. PMID:27855160
Challenges of the information age: the impact of false discovery on pathway identification.
Rog, Colin J; Chekuri, Srinivasa C; Edgerton, Mary E
2012-11-21
Pathways with members that have known relevance to a disease are used to support hypotheses generated from analyses of gene expression and proteomic studies. Using cancer as an example, the pitfalls of searching pathways databases as support for genes and proteins that could represent false discoveries are explored. The frequency with which networks could be generated from 100 instances each of randomly selected five and ten genes sets as input to MetaCore, a commercial pathways database, was measured. A PubMed search enumerated cancer-related literature published for any gene in the networks. Using three, two, and one maximum intervening step between input genes to populate the network, networks were generated with frequencies of 97%, 77%, and 7% using ten gene sets and 73%, 27%, and 1% using five gene sets. PubMed reported an average of 4225 cancer-related articles per network gene. This can be attributed to the richly populated pathways databases and the interest in the molecular basis of cancer. As information sources become enriched, they are more likely to generate plausible mechanisms for false discoveries.
DOE Office of Scientific and Technical Information (OSTI.GOV)
Howe, Adina; Yang, Fan; Williams, Ryan J.
Despite the central role of soil microbial communities in global carbon (C) cycling, little is known about soil microbial community structure and even less about their metabolic pathways. Efforts to characterize soil communities often focus on identifying differences in gene content across environmental gradients, but an alternative question is what genes are similar in soils. These genes may indicate critical species or potential functions that are required in all soils. Here we identified the “core” set of C cycling sequences widely present in multiple soil metagenomes from a fertilized prairie (FP). Of 226,887 sequences associated with known enzymes involved inmore » the synthesis, metabolism, and transport of carbohydrates, 843 were identified to be consistently prevalent across four replicate soil metagenomes. This core metagenome was functionally and taxonomically diverse, representing five enzyme classes and 99 enzyme families within the CAZy database. Though it only comprised 0.4% of all CAZy-associated genes identified in FP metagenomes, the core was found to be comprised of functions similar to those within cumulative soils. The FP CAZy-associated core sequences were present in multiple publicly available soil metagenomes and most similar to soils sharing geographic proximity. As a result, in soil ecosystems, where high diversity remains a key challenge for metagenomic investigations, these core genes represent a subset of critical functions necessary for carbohydrate metabolism, which can be targeted to evaluate important C fluxes in these and other similar soils.« less
Gupta, Gagan D.; Howes, Mark T.; Chandran, Ruma; Das, Anupam; Menon, Sindhu; Parton, Robert G.; Sowdhamini, R.; Thattai, Mukund; Mayor, Satyajit
2014-01-01
Single-cell-resolved measurements reveal heterogeneous distributions of clathrin-dependent (CD) and -independent (CLIC/GEEC: CG) endocytic activity in Drosophila cell populations. dsRNA-mediated knockdown of core versus peripheral endocytic machinery induces strong changes in the mean, or subtle changes in the shapes of these distributions, respectively. By quantifying these subtle shape changes for 27 single-cell features which report on endocytic activity and cell morphology, we organize 1072 Drosophila genes into a tree-like hierarchy. We find that tree nodes contain gene sets enriched in functional classes and protein complexes, providing a portrait of core and peripheral control of CD and CG endocytosis. For 470 genes we obtain additional features from separate assays and classify them into early- or late-acting genes of the endocytic pathways. Detailed analyses of specific genes at intermediate levels of the tree suggest that Vacuolar ATPase and lysosomal genes involved in vacuolar biogenesis play an evolutionarily conserved role in CG endocytosis. PMID:24971745
2010-01-01
Background Molecular genetic studies of floral development have concentrated on several core eudicots and grasses (monocots), which have canalized floral forms. Basal eudicots possess a wider range of floral morphologies than the core eudicots and grasses and can serve as an evolutionary link between core eudicots and monocots, and provide a reference for studies of other basal angiosperms. Recent advances in genomics have enabled researchers to profile gene activities during floral development, primarily in the eudicot Arabidopsis thaliana and the monocots rice and maize. However, our understanding of floral developmental processes among the basal eudicots remains limited. Results Using a recently generated expressed sequence tag (EST) set, we have designed an oligonucleotide microarray for the basal eudicot Eschscholzia californica (California poppy). We performed microarray experiments with an interwoven-loop design in order to characterize the E. californica floral transcriptome and to identify differentially expressed genes in flower buds with pre-meiotic and meiotic cells, four floral organs at pre-anthesis stages (sepals, petals, stamens and carpels), developing fruits, and leaves. Conclusions Our results provide a foundation for comparative gene expression studies between eudicots and basal angiosperms. We identified whorl-specific gene expression patterns in E. californica and examined the floral expression of several gene families. Interestingly, most E. californica homologs of Arabidopsis genes important for flower development, except for genes encoding MADS-box transcription factors, show different expression patterns between the two species. Our comparative transcriptomics study highlights the unique evolutionary position of E. californica compared with basal angiosperms and core eudicots. PMID:20950453
Deschamps, Philippe; Zivanovic, Yvan; Moreira, David; Rodriguez-Valera, Francisco; López-García, Purificación
2014-06-12
Horizontal gene transfer (HGT) is an important force in evolution, which may lead, among other things, to the adaptation to new environments by the import of new metabolic functions. Recent studies based on phylogenetic analyses of a few genome fragments containing archaeal 16S rRNA genes and fosmid-end sequences from deep-sea metagenomic libraries have suggested that marine planktonic archaea could be affected by high HGT frequency. Likewise, a composite genome of an uncultured marine euryarchaeote showed high levels of gene sequence similarity to bacterial genes. In this work, we ask whether HGT is frequent and widespread in genomes of these marine archaea, and whether HGT is an ancient and/or recurrent phenomenon. To answer these questions, we sequenced 997 fosmid archaeal clones from metagenomic libraries of deep-Mediterranean waters (1,000 and 3,000 m depth) and built comprehensive pangenomes for planktonic Thaumarchaeota (Group I archaea) and Euryarchaeota belonging to the uncultured Groups II and III Euryarchaeota (GII/III-Euryarchaeota). Comparison with available reference genomes of Thaumarchaeota and a composite marine surface euryarchaeote genome allowed us to define sets of core, lineage-specific core, and shell gene ortholog clusters for the two archaeal lineages. Molecular phylogenetic analyses of all gene clusters showed that 23.9% of marine Thaumarchaeota genes and 29.7% of GII/III-Euryarchaeota genes had been horizontally acquired from bacteria. HGT is not only extensive and directional but also ongoing, with high HGT levels in lineage-specific core (ancient transfers) and shell (recent transfers) genes. Many of the acquired genes are related to metabolism and membrane biogenesis, suggesting an adaptive value for life in cold, oligotrophic oceans. We hypothesize that the acquisition of an important amount of foreign genes by the ancestors of these archaeal groups significantly contributed to their divergence and ecological success. © The Author(s) 2014. Published by Oxford University Press on behalf of the Society for Molecular Biology and Evolution.
Han, Junwei; Shang, Desi; Zhang, Yunpeng; Zhang, Wei; Yao, Qianlan; Han, Lei; Xu, Yanjun; Yan, Wei; Bao, Zhaoshi; You, Gan; Jiang, Tao; Kang, Chunsheng; Li, Xia
2014-01-01
The prognosis of glioma patients is usually poor, especially in patients with glioblastoma (World Health Organization (WHO) grade IV). The regulatory functions of microRNA (miRNA) on genes have important implications in glioma cell survival. However, there are not many studies that have investigated glioma survival by integrating miRNAs and genes while also considering pathway structure. In this study, we performed sample-matched miRNA and mRNA expression profilings to systematically analyze glioma patient survival. During this analytical process, we developed pathway-based random walk to identify a glioma core miRNA-gene module, simultaneously considering pathway structure information and multi-level involvement of miRNAs and genes. The core miRNA-gene module we identified was comprised of four apparent sub-modules; all four sub-modules displayed a significant correlation with patient survival in the testing set (P-values≤0.001). Notably, one sub-module that consisted of 6 miRNAs and 26 genes also correlated with survival time in the high-grade subgroup (WHO grade III and IV), P-value = 0.0062. Furthermore, the 26-gene expression signature from this sub-module had robust predictive power in four independent, publicly available glioma datasets. Our findings suggested that the expression signatures, which were identified by integration of miRNA and gene level, were closely associated with overall survival among the glioma patients with various grades. PMID:24809850
Suzuki, Masaharu; Ketterling, Matthew G; McCarty, Donald R
2005-09-01
We have developed a simple quantitative computational approach for objective analysis of cis-regulatory sequences in promoters of coregulated genes. The program, designated MotifFinder, identifies oligo sequences that are overrepresented in promoters of coregulated genes. We used this approach to analyze promoter sequences of Viviparous1 (VP1)/abscisic acid (ABA)-regulated genes and cold-regulated genes, respectively, of Arabidopsis (Arabidopsis thaliana). We detected significantly enriched sequences in up-regulated genes but not in down-regulated genes. This result suggests that gene activation but not repression is mediated by specific and common sequence elements in promoters. The enriched motifs include several known cis-regulatory sequences as well as previously unidentified motifs. With respect to known cis-elements, we dissected the flanking nucleotides of the core sequences of Sph element, ABA response elements (ABREs), and the C repeat/dehydration-responsive element. This analysis identified the motif variants that may correlate with qualitative and quantitative differences in gene expression. While both VP1 and cold responses are mediated in part by ABA signaling via ABREs, these responses correlate with unique ABRE variants distinguished by nucleotides flanking the ACGT core. ABRE and Sph motifs are tightly associated uniquely in the coregulated set of genes showing a strict dependence on VP1 and ABA signaling. Finally, analysis of distribution of the enriched sequences revealed a striking concentration of enriched motifs in a proximal 200-base region of VP1/ABA and cold-regulated promoters. Overall, each class of coregulated genes possesses a discrete set of the enriched motifs with unique distributions in their promoters that may account for the specificity of gene regulation.
Zhu, Xinyu; Ma, Hong; Chen, Zhiduan
2011-03-09
Plants contain numerous Su(var)3-9 homologues (SUVH) and related (SUVR) genes, some of which await functional characterization. Although there have been studies on the evolution of plant Su(var)3-9 SET genes, a systematic evolutionary study including major land plant groups has not been reported. Large-scale phylogenetic and evolutionary analyses can help to elucidate the underlying molecular mechanisms and contribute to improve genome annotation. Putative orthologs of plant Su(var)3-9 SET protein sequences were retrieved from major representatives of land plants. A novel clustering that included most members analyzed, henceforth referred to as core Su(var)3-9 homologues and related (cSUVHR) gene clade, was identified as well as all orthologous groups previously identified. Our analysis showed that plant Su(var)3-9 SET proteins possessed a variety of domain organizations, and can be classified into five types and ten subtypes. Plant Su(var)3-9 SET genes also exhibit a wide range of gene structures among different paralogs within a family, even in the regions encoding conserved PreSET and SET domains. We also found that the majority of SUVH members were intronless and formed three subclades within the SUVH clade. A detailed phylogenetic analysis of the plant Su(var)3-9 SET genes was performed. A novel deep phylogenetic relationship including most plant Su(var)3-9 SET genes was identified. Additional domains such as SAR, ZnF_C2H2 and WIYLD were early integrated into primordial PreSET/SET/PostSET domain organization. At least three classes of gene structures had been formed before the divergence of Physcomitrella patens (moss) from other land plants. One or multiple retroposition events might have occurred among SUVH genes with the donor genes leading to the V-2 orthologous group. The structural differences among evolutionary groups of plant Su(var)3-9 SET genes with different functions were described, contributing to the design of further experimental studies.
Pathways Involved in Sasang Constitution from Genome-Wide Analysis in a Korean Population
Yu, Sung-Gon; Kim, Jong-Yeol; Song, Kwang Hoon
2012-01-01
Abstract Objective Sasang constitution (SC) medicine, a branch of Korean traditional medicine, classifies the individual into one of four constitutional types (Taeum, TE; Soeum, SE; Soyang, SY; and Taeyang, TY) based on physiologic characteristics. The authors of the current article recently reported individual genetic elements associated with SC types via genome-wide association (GWA) analysis. However, to understand the biologic mechanisms underlying constitution, a comprehensive approach that combines individual genetic effects was applied. Design Genotypes of 1222 subjects of defined constitution types were measured for 341,998 genetic loci across the entire genome. The biologic pathways associated with SC types were identified via GWA analysis using three different algorithms—namely, the Z-static method, a restandardized gene set assay, and a gene set enrichment assay. Results Distinct pathways were associated (p<0.05) with each constitution type. The TE type was significantly associated with cytoskeleton-related pathways. The SE type was significantly associated with cardio- and amino-acid metabolism–related pathways. The SY type was associated with enriched melanoma-related pathways. TY subjects were excluded because of the small size of that sample. Among these functionally related pathways, core-node genes regulating multiple pathways were identified. TJP1, PTK2, and SRC were selected as core-nodes for TE; RHOA, and MAOA/MAOB for SE; and GNAO1 for SY (p<0.05), respectively. Conclusions The current authors systematically identified the biologic pathways and core-node genes associated with SC types from the GWA study; this information should provide insights regarding the molecular mechanisms inherent in constitutional pathophysiology. PMID:22889377
Ghosh, Sujoy; Vivar, Juan; Nelson, Christopher P; Willenborg, Christina; Segrè, Ayellet V; Mäkinen, Ville-Petteri; Nikpay, Majid; Erdmann, Jeannette; Blankenberg, Stefan; O'Donnell, Christopher; März, Winfried; Laaksonen, Reijo; Stewart, Alexandre FR; Epstein, Stephen E; Shah, Svati H; Granger, Christopher B; Hazen, Stanley L; Kathiresan, Sekar; Reilly, Muredach P; Yang, Xia; Quertermous, Thomas; Samani, Nilesh J; Schunkert, Heribert; Assimes, Themistocles L; McPherson, Ruth
2016-01-01
Objective Genome-wide association (GWA) studies have identified multiple genetic variants affecting the risk of coronary artery disease (CAD). However, individually these explain only a small fraction of the heritability of CAD and for most, the causal biological mechanisms remain unclear. We sought to obtain further insights into potential causal processes of CAD by integrating large-scale GWA data with expertly curated databases of core human pathways and functional networks. Approaches and Results Employing pathways (gene sets) from Reactome, we carried out a two-stage gene set enrichment analysis strategy. From a meta-analyzed discovery cohort of 7 CADGWAS data sets (9,889 cases/11,089 controls), nominally significant gene-sets were tested for replication in a meta-analysis of 9 additional studies (15,502 cases/55,730 controls) from the CARDIoGRAM Consortium. A total of 32 of 639 Reactome pathways tested showed convincing association with CAD (replication p<0.05). These pathways resided in 9 of 21 core biological processes represented in Reactome, and included pathways relevant to extracellular matrix integrity, innate immunity, axon guidance, and signaling by PDRF, NOTCH, and the TGF-β/SMAD receptor complex. Many of these pathways had strengths of association comparable to those observed in lipid transport pathways. Network analysis of unique genes within the replicated pathways further revealed several interconnected functional and topologically interacting modules representing novel associations (e.g. semaphorin regulated axonal guidance pathway) besides confirming known processes (lipid metabolism). The connectivity in the observed networks was statistically significant compared to random networks (p<0.001). Network centrality analysis (‘degree’ and ‘betweenness’) further identified genes (e.g. NCAM1, FYN, FURIN etc.) likely to play critical roles in the maintenance and functioning of several of the replicated pathways. Conclusions These findings provide novel insights into how genetic variation, interpreted in the context of biological processes and functional interactions among genes, may help define the genetic architecture of CAD. PMID:25977570
Genomic Prediction of Gene Bank Wheat Landraces.
Crossa, José; Jarquín, Diego; Franco, Jorge; Pérez-Rodríguez, Paulino; Burgueño, Juan; Saint-Pierre, Carolina; Vikram, Prashant; Sansaloni, Carolina; Petroli, Cesar; Akdemir, Deniz; Sneller, Clay; Reynolds, Matthew; Tattaris, Maria; Payne, Thomas; Guzman, Carlos; Peña, Roberto J; Wenzl, Peter; Singh, Sukhwinder
2016-07-07
This study examines genomic prediction within 8416 Mexican landrace accessions and 2403 Iranian landrace accessions stored in gene banks. The Mexican and Iranian collections were evaluated in separate field trials, including an optimum environment for several traits, and in two separate environments (drought, D and heat, H) for the highly heritable traits, days to heading (DTH), and days to maturity (DTM). Analyses accounting and not accounting for population structure were performed. Genomic prediction models include genotype × environment interaction (G × E). Two alternative prediction strategies were studied: (1) random cross-validation of the data in 20% training (TRN) and 80% testing (TST) (TRN20-TST80) sets, and (2) two types of core sets, "diversity" and "prediction", including 10% and 20%, respectively, of the total collections. Accounting for population structure decreased prediction accuracy by 15-20% as compared to prediction accuracy obtained when not accounting for population structure. Accounting for population structure gave prediction accuracies for traits evaluated in one environment for TRN20-TST80 that ranged from 0.407 to 0.677 for Mexican landraces, and from 0.166 to 0.662 for Iranian landraces. Prediction accuracy of the 20% diversity core set was similar to accuracies obtained for TRN20-TST80, ranging from 0.412 to 0.654 for Mexican landraces, and from 0.182 to 0.647 for Iranian landraces. The predictive core set gave similar prediction accuracy as the diversity core set for Mexican collections, but slightly lower for Iranian collections. Prediction accuracy when incorporating G × E for DTH and DTM for Mexican landraces for TRN20-TST80 was around 0.60, which is greater than without the G × E term. For Iranian landraces, accuracies were 0.55 for the G × E model with TRN20-TST80. Results show promising prediction accuracies for potential use in germplasm enhancement and rapid introgression of exotic germplasm into elite materials. Copyright © 2016 Crossa et al.
Genomic Prediction of Gene Bank Wheat Landraces
Crossa, José; Jarquín, Diego; Franco, Jorge; Pérez-Rodríguez, Paulino; Burgueño, Juan; Saint-Pierre, Carolina; Vikram, Prashant; Sansaloni, Carolina; Petroli, Cesar; Akdemir, Deniz; Sneller, Clay; Reynolds, Matthew; Tattaris, Maria; Payne, Thomas; Guzman, Carlos; Peña, Roberto J.; Wenzl, Peter; Singh, Sukhwinder
2016-01-01
This study examines genomic prediction within 8416 Mexican landrace accessions and 2403 Iranian landrace accessions stored in gene banks. The Mexican and Iranian collections were evaluated in separate field trials, including an optimum environment for several traits, and in two separate environments (drought, D and heat, H) for the highly heritable traits, days to heading (DTH), and days to maturity (DTM). Analyses accounting and not accounting for population structure were performed. Genomic prediction models include genotype × environment interaction (G × E). Two alternative prediction strategies were studied: (1) random cross-validation of the data in 20% training (TRN) and 80% testing (TST) (TRN20-TST80) sets, and (2) two types of core sets, “diversity” and “prediction”, including 10% and 20%, respectively, of the total collections. Accounting for population structure decreased prediction accuracy by 15–20% as compared to prediction accuracy obtained when not accounting for population structure. Accounting for population structure gave prediction accuracies for traits evaluated in one environment for TRN20-TST80 that ranged from 0.407 to 0.677 for Mexican landraces, and from 0.166 to 0.662 for Iranian landraces. Prediction accuracy of the 20% diversity core set was similar to accuracies obtained for TRN20-TST80, ranging from 0.412 to 0.654 for Mexican landraces, and from 0.182 to 0.647 for Iranian landraces. The predictive core set gave similar prediction accuracy as the diversity core set for Mexican collections, but slightly lower for Iranian collections. Prediction accuracy when incorporating G × E for DTH and DTM for Mexican landraces for TRN20-TST80 was around 0.60, which is greater than without the G × E term. For Iranian landraces, accuracies were 0.55 for the G × E model with TRN20-TST80. Results show promising prediction accuracies for potential use in germplasm enhancement and rapid introgression of exotic germplasm into elite materials. PMID:27172218
Evolution of a reassortant North American gull influenza virus lineage: drift, shift and stability
Hall, Jeffrey S.; TeSlaa, Joshua L.; Nashold, Sean W.; Halpin, Rebecca A.; Stockwell, Timothy; Wentworth, David E.; Dugan, Vivien; Ip, Hon S.
2013-01-01
Background: The role of gulls in the ecology of avian influenza (AI) is different than that of waterfowl. Different constellations of subtypes circulate within the two groups of birds and AI viruses isolated from North American gulls frequently possess reassortant genomes with genetic elements from both North America and Eurasian lineages. A 2008 isolate from a Newfoundland Great Black-backed Gull contained a mix of North American waterfowl, North American gull and Eurasian lineage genes. Methods: We isolated, sequenced and phylogenetically compared avian influenza viruses from 2009 Canadian wild birds. Results: We analyzed six 2009 virus isolates from Canada and found the same phylogenetic lineage had persisted over a larger geographic area, with an expanded host range that included dabbling and diving ducks as well as gulls. All of the 2009 virus isolates contained an internal protein coding set of genes of the same Eurasian lineage genes except PB1 that was from a North American lineage, and these genes continued to evolve by genetic drift. We show evidence that the 2008 Great Black-backed Gull virus was derived from this lineage with a reassortment of a North American PA gene into the more stable core set of internal protein coding genes that has circulated in avian populations for at least 2 years. From this core, the surface glycoprotein genes have switched several times creating H13N6, H13N2, and H16N3 subtypes. These gene segments were from North American lineages except for the H16 and N3 vRNAs. Conclusions: This process appears similar to genetic shifts seen with swine influenza where a stable "triple reassortant internal gene" core has circulated in swine populations with genetic shifts occurring with hemaggluttinin and neuraminidase proteins getting periodically switched. Thus gulls may serve as genetic mixing vessels for different lineages of avian influenza, similar to the role of swine with regards to human influenza. These findings illustrate the need for continued surveillance in gull and waterfowl populations, both on the Pacific and especially Atlantic coasts of North America, to document virus intercontinental movement and the role of gull species in the evolution and epidemiology of AI.
Novel transcriptome resources for three scleractinian coral species from the Indo-Pacific
Kenkel, Carly D.; Bay, Line K
2017-01-01
Abstract Transcriptomic resources for coral species can provide insight into coral evolutionary history and stress-response physiology. Goniopora columna, Galaxea astreata, and Galaxea acrhelia are scleractinian corals of the Indo-Pacific, representing a diversity of morphologies and life-history traits. G. columna and G. astreata are common and cosmopolitan, while G. acrhelia is largely restricted to the coral triangle and Great Barrier Reef. Reference transcriptomes for these species were assembled from replicate colony fragments exposed to elevated (31°C) and ambient (27°C) temperatures. Trinity was used to create de novo assemblies for each species from 92–102 million raw Illumina Hiseq 2 × 150 bp reads. Host-specific assemblies contained 65 460–72 405 contigs, representing 26 693–37 894 isogroups (∼genes) with an average N50 of 2254. Gene name and/or gene ontology annotations were possible for 58% of isogroups on average. Transcriptomes contained 93.1–94.3% of EuKaryotic Orthologous Groups comprising the core eukaryotic gene set, and 89.98–91.92% of the single-copy metazoan core gene set orthologs were complete, indicating fairly comprehensive assemblies. This work expands the complement of transcriptomic resources available for scleractinian coral species, including the first reference for a representative of Goniopora spp. as well as species with novel morphology. PMID:28938722
Novel transcriptome resources for three scleractinian coral species from the Indo-Pacific.
Kenkel, Carly D; Bay, Line K
2017-09-01
Transcriptomic resources for coral species can provide insight into coral evolutionary history and stress-response physiology. Goniopora columna, Galaxea astreata, and Galaxea acrhelia are scleractinian corals of the Indo-Pacific, representing a diversity of morphologies and life-history traits. G. columna and G. astreata are common and cosmopolitan, while G. acrhelia is largely restricted to the coral triangle and Great Barrier Reef. Reference transcriptomes for these species were assembled from replicate colony fragments exposed to elevated (31°C) and ambient (27°C) temperatures. Trinity was used to create de novo assemblies for each species from 92-102 million raw Illumina Hiseq 2 × 150 bp reads. Host-specific assemblies contained 65 460-72 405 contigs, representing 26 693-37 894 isogroups (∼genes) with an average N50 of 2254. Gene name and/or gene ontology annotations were possible for 58% of isogroups on average. Transcriptomes contained 93.1-94.3% of EuKaryotic Orthologous Groups comprising the core eukaryotic gene set, and 89.98-91.92% of the single-copy metazoan core gene set orthologs were complete, indicating fairly comprehensive assemblies. This work expands the complement of transcriptomic resources available for scleractinian coral species, including the first reference for a representative of Goniopora spp. as well as species with novel morphology. © The Authors 2017. Published by Oxford University Press.
Ohtani, Shin; Ushiyama, Akira; Maeda, Machiko; Hattori, Kenji; Kunugita, Naoki; Wang, Jianqing; Ishii, Kazuyuki
2016-01-01
We investigated the thermal effects of radiofrequency electromagnetic fields (RF-EMFs) on the variation in core temperature and gene expression of some stress markers in rats. Sprague-Dawley rats were exposed to 2.14 GHz wideband code division multiple access (W-CDMA) RF signals at a whole-body averaged specific absorption rate (WBA-SAR) of 4 W/kg, which causes behavioral disruption in laboratory animals, and 0.4 W/kg, which is the limit for the occupational exposure set by the International Commission on Non-Ionizing Radiation Protection guideline. It is important to understand the possible in vivo effects derived from RF-EMF exposures at these intensities. Because of inadequate data on real-time core temperature analyses using free-moving animal and the association between stress and thermal effects of RF-EMF exposure, we analyzed the core body temperature under nonanesthetic condition during RF-EMF exposure. The results revealed that the core temperature increased by approximately 1.5°C compared with the baseline and reached a plateau till the end of RF-EMF exposure. Furthermore, we analyzed the gene expression of heat-shock proteins (Hsp) and heat-shock transcription factors (Hsf) family after RF-EMF exposure. At WBA-SAR of 4 W/kg, some Hsp and Hsf gene expression levels were significantly upregulated in the cerebral cortex and cerebellum following exposure for 6 hr/day but were not upregulated after exposure for 3 hr/day. On the other hand, there was no significant change in the core temperature and gene expression at WBA-SAR of 0.4 W/kg. Thus, 2.14-GHz RF-EMF exposure at WBA-SAR of 4 W/kg induced increases in the core temperature and upregulation of some stress markers, particularly in the cerebellum.
Gene repertoire of amoeba-associated giant viruses.
Colson, Philippe; Raoult, Didier
2010-01-01
Acanthamoeba polyphaga mimivirus, Marseillevirus, and Sputnik, a virophage, are intra-amoebal viruses that have been isolated from water collected in cooling towers. They have provided fascinating data and have raised exciting questions about viruses definition and evolution. Mimivirus and Marseillevirus have been classified in the nucleo-cytoplasmic large DNA viruses (NCLDVs) class. Their genomes are the largest and fifth largest viral genomes sequenced so far. The gene repertoire of these amoeba-associated viruses can be divided into four groups: the core genome, genes acquired by lateral gene transfer, duplicated genes, and ORFans. Open reading frames (ORFs) that have homologs in the NCLDVs core gene set represent 2.9 and 6.1% of the Mimivirus and Marseillevirus gene contents, respectively. A substantial proportion of the Mimivirus, Marseillevirus and Sputnik ORFs exhibit sequence similarities to homologs found in bacteria, archaea, eukaryotes or viruses. The large amount of chimeric genes in these viral genomes might have resulted from acquisitions by lateral gene transfers, implicating sympatric bacteria and viruses with an intra-amoebal lifestyle. In addition, lineage-specific gene expansion may have played a major role in the genome shaping. Altogether, the data so far accumulated on amoeba-associated giant viruses are a powerful incentive to isolate and study additional strains to gain better understanding of their pangenome. Copyright 2010 S. Karger AG, Basel.
Jenkins, Catherine E; Gusscott, Samuel; Wong, Rachel J; Shevchuk, Olena O; Rana, Gurneet; Giambra, Vincenzo; Tyshchenko, Kateryna; Islam, Rashedul; Hirst, Martin; Weng, Andrew P
2018-05-04
RUNX1 is frequently mutated in T-cell acute lymphoblastic leukemia (T-ALL). The spectrum of RUNX1 mutations has led to the notion that it acts as a tumor suppressor in this context; however, other studies have placed RUNX1 along with transcription factors TAL1 and NOTCH1 as core drivers of an oncogenic transcriptional program. To reconcile these divergent roles, we knocked down RUNX1 in human T-ALL cell lines and deleted Runx1 or Cbfb in primary mouse T-cell leukemias. RUNX1 depletion consistently resulted in reduced cell proliferation and increased apoptosis. RUNX1 upregulated variable sets of target genes in each cell line, but consistently included a core set of oncogenic effectors including IGF1R and NRAS. Our results support the conclusion that RUNX1 has a net positive effect on cell growth in the context of established T-ALL. Copyright © 2018. Published by Elsevier Inc.
Evaluation of soybean germplasm conserved in NIAS genebank and development of mini core collections
Kaga, Akito; Shimizu, Takehiko; Watanabe, Satoshi; Tsubokura, Yasutaka; Katayose, Yuichi; Harada, Kyuya; Vaughan, Duncan A.; Tomooka, Norihiko
2012-01-01
Genetic variation and population structure among 1603 soybean accessions, consisted of 832 Japanese landraces, 109 old and 57 recent Japanese varieties, 341 landrace from 16 Asian countries and 264 wild soybean accessions, were characterized using 191 SNP markers. Although gene diversity of Japanese soybean germplasm was slight lower than that of exotic soybean germplasm, population differentiation and clustering analyses indicated clear genetic differentiation among Japanese cultivated soybeans, exotic cultivated soybeans and wild soybeans. Nine hundred ninety eight Japanese accessions were separated to a certain extent into groups corresponding to their agro-morphologic characteristics such as photosensitivity and seed characteristics rather than their geographical origin. Based on the assessment of the SNP markers and several agro-morphologic traits, accessions that retain gene diversity of the whole collection were selected to develop several soybean sets of different sizes using an heuristic approach; a minimum of 12 accessions can represent the observed gene diversity; a mini-core collection of 96 accession can represent a major proportion of both geographic origin and agro-morphologic trait variation. These selected sets of germplasm will provide an effective platform for enhancing soybean diversity studies and assist in finding novel traits for crop improvement. PMID:23136496
Chaillou, Thomas; Jackson, Janna R; England, Jonathan H; Kirby, Tyler J; Richards-White, Jena; Esser, Karyn A; Dupont-Versteegden, Esther E; McCarthy, John J
2015-01-01
The purpose of this study was to compare the gene expression profile of mouse skeletal muscle undergoing two forms of growth (hypertrophy and regrowth) with the goal of identifying a conserved set of differentially expressed genes. Expression profiling by microarray was performed on the plantaris muscle subjected to 1, 3, 5, 7, 10, and 14 days of hypertrophy or regrowth following 2 wk of hind-limb suspension. We identified 97 differentially expressed genes (≥2-fold increase or ≥50% decrease compared with control muscle) that were conserved during the two forms of muscle growth. The vast majority (∼90%) of the differentially expressed genes was upregulated and occurred at a single time point (64 out of 86 genes), which most often was on the first day of the time course. Microarray analysis from the conserved upregulated genes showed a set of genes related to contractile apparatus and stress response at day 1, including three genes involved in mechanotransduction and four genes encoding heat shock proteins. Our analysis further identified three cell cycle-related genes at day and several genes associated with extracellular matrix (ECM) at both days 3 and 10. In conclusion, we have identified a core set of genes commonly upregulated in two forms of muscle growth that could play a role in the maintenance of sarcomere stability, ECM remodeling, cell proliferation, fast-to-slow fiber type transition, and the regulation of skeletal muscle growth. These findings suggest conserved regulatory mechanisms involved in the adaptation of skeletal muscle to increased mechanical loading. Copyright © 2015 the American Physiological Society.
Chaillou, Thomas; Jackson, Janna R.; England, Jonathan H.; Kirby, Tyler J.; Richards-White, Jena; Esser, Karyn A.; Dupont-Versteegden, Esther E.
2014-01-01
The purpose of this study was to compare the gene expression profile of mouse skeletal muscle undergoing two forms of growth (hypertrophy and regrowth) with the goal of identifying a conserved set of differentially expressed genes. Expression profiling by microarray was performed on the plantaris muscle subjected to 1, 3, 5, 7, 10, and 14 days of hypertrophy or regrowth following 2 wk of hind-limb suspension. We identified 97 differentially expressed genes (≥2-fold increase or ≥50% decrease compared with control muscle) that were conserved during the two forms of muscle growth. The vast majority (∼90%) of the differentially expressed genes was upregulated and occurred at a single time point (64 out of 86 genes), which most often was on the first day of the time course. Microarray analysis from the conserved upregulated genes showed a set of genes related to contractile apparatus and stress response at day 1, including three genes involved in mechanotransduction and four genes encoding heat shock proteins. Our analysis further identified three cell cycle-related genes at day and several genes associated with extracellular matrix (ECM) at both days 3 and 10. In conclusion, we have identified a core set of genes commonly upregulated in two forms of muscle growth that could play a role in the maintenance of sarcomere stability, ECM remodeling, cell proliferation, fast-to-slow fiber type transition, and the regulation of skeletal muscle growth. These findings suggest conserved regulatory mechanisms involved in the adaptation of skeletal muscle to increased mechanical loading. PMID:25554798
Ferguson, A L; Hughes, A D; Tufail, U; Baumann, C G; Scott, D J; Hoggett, J G
2000-09-22
The interaction between the core form of bacterial RNA polymerases and sigma factors is essential for specific promoter recognition, and for coordinating the expression of different sets of genes in response to varying cellular needs. The interaction between Escherichia coli core RNA polymerase and sigma 70 has been investigated by surface plasmon resonance. The His-tagged form of sigma 70 factor was immobilised on a Ni2+-NTA chip for monitoring its interaction with core polymerase. The binding constant for the interaction was found to be 1.9x10(-7) M, and the dissociation rate constant for release of sigma from core, in the absence of DNA or transcription, was 4x10(-3) s(-1), corresponding to a half-life of about 200 s.
Kadam, Anagha; Janto, Benjamin; Eutsey, Rory; Earl, Joshua P; Powell, Evan; Dahlgren, Margaret E; Hu, Fen Z; Ehrlich, Garth D; Hiller, N Luisa
2015-02-02
There is extensive genomic diversity among Streptococcus pneumoniae isolates. Approximately half of the comprehensive set of genes in the species (the supragenome or pangenome) is present in all the isolates (core set), and the remaining is unevenly distributed among strains (distributed set). The Streptococcus pneumoniae Supragenome Hybridization (SpSGH) array provides coverage for an extensive set of genes and polymorphisms encountered within this species, capturing this genomic diversity. Further, the capture is quantitative. In this manner, the SpSGH array allows for both genomic and transcriptomic analyses of diverse S. pneumoniae isolates on a single platform. In this unit, we present the SpSGH array, and describe in detail its design and implementation for both genomic and transcriptomic analyses. The methodology can be applied to construction and modification of SpSGH array platforms, as well to other bacterial species as long as multiple whole-genome sequences are available that collectively capture the vast majority of the species supragenome. Copyright © 2015 John Wiley & Sons, Inc.
Promoting gene expression in plants by permissive histone lysine methylation
Millar, Tony; Finnegan, E Jean
2009-01-01
Plants utilize sophisticated epigenetic regulatory mechanisms to coordinate changes in gene expression during development and in response to environmental stimuli. Epigenetics refers to the modification of DNA and chromatin associated proteins, which affect gene expression and cell function, without changing the DNA sequence. Such modifications are inherited through mitosis, and in rare instances through meiosis, although it can be reversible and thus regulatory. Epigenetic modifications are controlled by groups of proteins, such as the family of histone lysine methytransferases (HKMTs). The catalytic core known as the SET domain encodes HKMT activity and either promotes or represses gene expression. A large family of SET domain proteins is present in Arabidopsis where there is growing evidence that two classes of these genes are involved in promoting gene expression in a diverse range of developmental processes. This review will focus on the function of these two classes and the processes that they control, highlighting the huge potential this regulatory mechanism has in plants. PMID:19816124
Design and verification of a pangenome microarray oligonucleotide probe set for Dehalococcoides spp.
Hug, Laura A; Salehi, Maryam; Nuin, Paulo; Tillier, Elisabeth R; Edwards, Elizabeth A
2011-08-01
Dehalococcoides spp. are an industrially relevant group of Chloroflexi bacteria capable of reductively dechlorinating contaminants in groundwater environments. Existing Dehalococcoides genomes revealed a high level of sequence identity within this group, including 98 to 100% 16S rRNA sequence identity between strains with diverse substrate specificities. Common molecular techniques for identification of microbial populations are often not applicable for distinguishing Dehalococcoides strains. Here we describe an oligonucleotide microarray probe set designed based on clustered Dehalococcoides genes from five different sources (strain DET195, CBDB1, BAV1, and VS genomes and the KB-1 metagenome). This "pangenome" probe set provides coverage of core Dehalococcoides genes as well as strain-specific genes while optimizing the potential for hybridization to closely related, previously unknown Dehalococcoides strains. The pangenome probe set was compared to probe sets designed independently for each of the five Dehalococcoides strains. The pangenome probe set demonstrated better predictability and higher detection of Dehalococcoides genes than strain-specific probe sets on nontarget strains with <99% average nucleotide identity. An in silico analysis of the expected probe hybridization against the recently released Dehalococcoides strain GT genome and additional KB-1 metagenome sequence data indicated that the pangenome probe set performs more robustly than the combined strain-specific probe sets in the detection of genes not included in the original design. The pangenome probe set represents a highly specific, universal tool for the detection and characterization of Dehalococcoides from contaminated sites. It has the potential to become a common platform for Dehalococcoides-focused research, allowing meaningful comparisons between microarray experiments regardless of the strain examined.
Yang, Xinan Holly; Li, Meiyi; Wang, Bin; Zhu, Wanqi; Desgardin, Aurelie; Onel, Kenan; de Jong, Jill; Chen, Jianjun; Chen, Luonan; Cunningham, John M
2015-03-24
Genes that regulate stem cell function are suspected to exert adverse effects on prognosis in malignancy. However, diverse cancer stem cell signatures are difficult for physicians to interpret and apply clinically. To connect the transcriptome and stem cell biology, with potential clinical applications, we propose a novel computational "gene-to-function, snapshot-to-dynamics, and biology-to-clinic" framework to uncover core functional gene-sets signatures. This framework incorporates three function-centric gene-set analysis strategies: a meta-analysis of both microarray and RNA-seq data, novel dynamic network mechanism (DNM) identification, and a personalized prognostic indicator analysis. This work uses complex disease acute myeloid leukemia (AML) as a research platform. We introduced an adjustable "soft threshold" to a functional gene-set algorithm and found that two different analysis methods identified distinct gene-set signatures from the same samples. We identified a 30-gene cluster that characterizes leukemic stem cell (LSC)-depleted cells and a 25-gene cluster that characterizes LSC-enriched cells in parallel; both mark favorable-prognosis in AML. Genes within each signature significantly share common biological processes and/or molecular functions (empirical p = 6e-5 and 0.03 respectively). The 25-gene signature reflects the abnormal development of stem cells in AML, such as AURKA over-expression. We subsequently determined that the clinical relevance of both signatures is independent of known clinical risk classifications in 214 patients with cytogenetically normal AML. We successfully validated the prognosis of both signatures in two independent cohorts of 91 and 242 patients respectively (log-rank p < 0.0015 and 0.05; empirical p < 0.015 and 0.08). The proposed algorithms and computational framework will harness systems biology research because they efficiently translate gene-sets (rather than single genes) into biological discoveries about AML and other complex diseases.
Schürch, A C; Arredondo-Alonso, S; Willems, R J L; Goering, R V
2018-04-01
Whole genome sequence (WGS)-based strain typing finds increasing use in the epidemiologic analysis of bacterial pathogens in both public health as well as more localized infection control settings. This minireview describes methodologic approaches that have been explored for WGS-based epidemiologic analysis and considers the challenges and pitfalls of data interpretation. Personal collection of relevant publications. When applying WGS to study the molecular epidemiology of bacterial pathogens, genomic variability between strains is translated into measures of distance by determining single nucleotide polymorphisms in core genome alignments or by indexing allelic variation in hundreds to thousands of core genes, assigning types to unique allelic profiles. Interpreting isolate relatedness from these distances is highly organism specific, and attempts to establish species-specific cutoffs are unlikely to be generally applicable. In cases where single nucleotide polymorphism or core gene typing do not provide the resolution necessary for accurate assessment of the epidemiology of bacterial pathogens, inclusion of accessory gene or plasmid sequences may provide the additional required discrimination. As with all epidemiologic analysis, realizing the full potential of the revolutionary advances in WGS-based approaches requires understanding and dealing with issues related to the fundamental steps of data generation and interpretation. Copyright © 2018 The Authors. Published by Elsevier Ltd.. All rights reserved.
Ackley, Brian D
2014-08-01
During the development of the nervous system, neurons encounter signals that inform their outgrowth and polarization. Understanding how these signals combinatorially function to pattern the nervous system is of considerable interest to developmental neurobiologists. The Wnt ligands and their receptors have been well characterized in polarizing cells during asymmetric cell division. The planar cell polarity (PCP) pathway is also critical for cell polarization in the plane of an epithelium. The core set of PCP genes include members of the conserved Wnt-signaling pathway, such as Frizzled and Disheveled, but also the cadherin-domain protein Flamingo. In Drosophila, the Fat and Dachsous cadherins also function in PCP, but in parallel to the core PCP components. C. elegans also have two Fat-like and one Dachsous-like cadherins, at least one of which, cdh-4, contributes to neural development. In C. elegans Wnt ligands and the conserved PCP genes have been shown to regulate a number of different events, including embryonic cell polarity, vulval morphogenesis, and cell migration. As is also observed in vertebrates, the Wnt and PCP genes appear to function to primarily provide information about the anterior to posterior axis of development. Here, we review the recent work describing how mutations in the Wnt and core PCP genes affect axon guidance and synaptogenesis in C. elegans. © 2013 Wiley Periodicals, Inc.
Using Ontology Fingerprints to disambiguate gene name entities in the biomedical literature
Chen, Guocai; Zhao, Jieyi; Cohen, Trevor; Tao, Cui; Sun, Jingchun; Xu, Hua; Bernstam, Elmer V.; Lawson, Andrew; Zeng, Jia; Johnson, Amber M.; Holla, Vijaykumar; Bailey, Ann M.; Lara-Guerra, Humberto; Litzenburger, Beate; Meric-Bernstam, Funda; Jim Zheng, W.
2015-01-01
Ambiguous gene names in the biomedical literature are a barrier to accurate information extraction. To overcome this hurdle, we generated Ontology Fingerprints for selected genes that are relevant for personalized cancer therapy. These Ontology Fingerprints were used to evaluate the association between genes and biomedical literature to disambiguate gene names. We obtained 93.6% precision for the test gene set and 80.4% for the area under a receiver-operating characteristics curve for gene and article association. The core algorithm was implemented using a graphics processing unit-based MapReduce framework to handle big data and to improve performance. We conclude that Ontology Fingerprints can help disambiguate gene names mentioned in text and analyse the association between genes and articles. Database URL: http://www.ontologyfingerprint.org PMID:25858285
Core body temperature in obesity.
Heikens, Marc J; Gorbach, Alexander M; Eden, Henry S; Savastano, David M; Chen, Kong Y; Skarulis, Monica C; Yanovski, Jack A
2011-05-01
A lower core body temperature set point has been suggested to be a factor that could potentially predispose humans to develop obesity. We tested the hypothesis that obese individuals have lower core temperatures than those in normal-weight individuals. In study 1, nonobese [body mass index (BMI; in kg/m(2)) <30] and obese (BMI ≥30) adults swallowed wireless core temperature-sensing capsules, and we measured core temperatures continuously for 24 h. In study 2, normal-weight (BMI of 18-25) and obese subjects swallowed temperature-sensing capsules to measure core temperatures continuously for ≥48 h and kept activity logs. We constructed daily, 24-h core temperature profiles for analysis. Mean (±SE) daily core body temperature did not differ significantly between the 35 nonobese and 46 obese subjects (36.92 ± 0.03°C compared with 36.89 ± 0.03°C; P = 0.44). Core temperature 24-h profiles did not differ significantly between 11 normal-weight and 19 obese subjects (P = 0.274). Women had a mean core body temperature ≈0.23°C greater than that of men (36.99 ± 0.03°C compared with 36.76 ± 0.03°C; P < 0.0001). Obesity is not generally associated with a reduced core body temperature. It may be necessary to study individuals with function-altering mutations in core temperature-regulating genes to determine whether differences in the core body temperature set point affect the regulation of human body weight. These trials were registered at clinicaltrials.gov as NCT00428987 and NCT00266500.
Core body temperature in obesity123
Heikens, Marc J; Gorbach, Alexander M; Eden, Henry S; Savastano, David M; Chen, Kong Y; Skarulis, Monica C
2011-01-01
Background: A lower core body temperature set point has been suggested to be a factor that could potentially predispose humans to develop obesity. Objective: We tested the hypothesis that obese individuals have lower core temperatures than those in normal-weight individuals. Design: In study 1, nonobese [body mass index (BMI; in kg/m2) <30] and obese (BMI ≥30) adults swallowed wireless core temperature–sensing capsules, and we measured core temperatures continuously for 24 h. In study 2, normal-weight (BMI of 18–25) and obese subjects swallowed temperature-sensing capsules to measure core temperatures continuously for ≥48 h and kept activity logs. We constructed daily, 24-h core temperature profiles for analysis. Results: Mean (±SE) daily core body temperature did not differ significantly between the 35 nonobese and 46 obese subjects (36.92 ± 0.03°C compared with 36.89 ± 0.03°C; P = 0.44). Core temperature 24-h profiles did not differ significantly between 11 normal-weight and 19 obese subjects (P = 0.274). Women had a mean core body temperature ≈0.23°C greater than that of men (36.99 ± 0.03°C compared with 36.76 ± 0.03°C; P < 0.0001). Conclusions: Obesity is not generally associated with a reduced core body temperature. It may be necessary to study individuals with function-altering mutations in core temperature–regulating genes to determine whether differences in the core body temperature set point affect the regulation of human body weight. These trials were registered at clinicaltrials.gov as NCT00428987 and NCT00266500. PMID:21367952
Orthopoxvirus Genome Evolution: The Role of Gene Loss
Hendrickson, Robert Curtis; Wang, Chunlin; Hatcher, Eneida L.; Lefkowitz, Elliot J.
2010-01-01
Poxviruses are highly successful pathogens, known to infect a variety of hosts. The family Poxviridae includes Variola virus, the causative agent of smallpox, which has been eradicated as a public health threat but could potentially reemerge as a bioterrorist threat. The risk scenario includes other animal poxviruses and genetically engineered manipulations of poxviruses. Studies of orthologous gene sets have established the evolutionary relationships of members within the Poxviridae family. It is not clear, however, how variations between family members arose in the past, an important issue in understanding how these viruses may vary and possibly produce future threats. Using a newly developed poxvirus-specific tool, we predicted accurate gene sets for viruses with completely sequenced genomes in the genus Orthopoxvirus. Employing sensitive sequence comparison techniques together with comparison of syntenic gene maps, we established the relationships between all viral gene sets. These techniques allowed us to unambiguously identify the gene loss/gain events that have occurred over the course of orthopoxvirus evolution. It is clear that for all existing Orthopoxvirus species, no individual species has acquired protein-coding genes unique to that species. All existing species contain genes that are all present in members of the species Cowpox virus and that cowpox virus strains contain every gene present in any other orthopoxvirus strain. These results support a theory of reductive evolution in which the reduction in size of the core gene set of a putative ancestral virus played a critical role in speciation and confining any newly emerging virus species to a particular environmental (host or tissue) niche. PMID:21994715
A functional portrait of Med7 and the mediator complex in Candida albicans.
Tebbji, Faiza; Chen, Yaolin; Richard Albert, Julien; Gunsalus, Kearney T W; Kumamoto, Carol A; Nantel, André; Sellam, Adnane; Whiteway, Malcolm
2014-11-01
Mediator is a multi-subunit protein complex that regulates gene expression in eukaryotes by integrating physiological and developmental signals and transmitting them to the general RNA polymerase II machinery. We examined, in the fungal pathogen Candida albicans, a set of conditional alleles of genes encoding Mediator subunits of the head, middle, and tail modules that were found to be essential in the related ascomycete Saccharomyces cerevisiae. Intriguingly, while the Med4, 8, 10, 11, 14, 17, 21 and 22 subunits were essential in both fungi, the structurally highly conserved Med7 subunit was apparently non-essential in C. albicans. While loss of CaMed7 did not lead to loss of viability under normal growth conditions, it dramatically influenced the pathogen's ability to grow in different carbon sources, to form hyphae and biofilms, and to colonize the gastrointestinal tracts of mice. We used epitope tagging and location profiling of the Med7 subunit to examine the distribution of the DNA sites bound by Mediator during growth in either the yeast or the hyphal form, two distinct morphologies characterized by different transcription profiles. We observed a core set of 200 genes bound by Med7 under both conditions; this core set is expanded moderately during yeast growth, but is expanded considerably during hyphal growth, supporting the idea that Mediator binding correlates with changes in transcriptional activity and that this binding is condition specific. Med7 bound not only in the promoter regions of active genes but also within coding regions and at the 3' ends of genes. By combining genome-wide location profiling, expression analyses and phenotyping, we have identified different Med7p-influenced regulons including genes related to glycolysis and the Filamentous Growth Regulator family. In the absence of Med7, the ribosomal regulon is de-repressed, suggesting Med7 is involved in central aspects of growth control.
A Functional Portrait of Med7 and the Mediator Complex in Candida albicans
Tebbji, Faiza; Chen, Yaolin; Richard Albert, Julien; Gunsalus, Kearney T. W.; Kumamoto, Carol A.; Nantel, André; Sellam, Adnane; Whiteway, Malcolm
2014-01-01
Mediator is a multi-subunit protein complex that regulates gene expression in eukaryotes by integrating physiological and developmental signals and transmitting them to the general RNA polymerase II machinery. We examined, in the fungal pathogen Candida albicans, a set of conditional alleles of genes encoding Mediator subunits of the head, middle, and tail modules that were found to be essential in the related ascomycete Saccharomyces cerevisiae. Intriguingly, while the Med4, 8, 10, 11, 14, 17, 21 and 22 subunits were essential in both fungi, the structurally highly conserved Med7 subunit was apparently non-essential in C. albicans. While loss of CaMed7 did not lead to loss of viability under normal growth conditions, it dramatically influenced the pathogen's ability to grow in different carbon sources, to form hyphae and biofilms, and to colonize the gastrointestinal tracts of mice. We used epitope tagging and location profiling of the Med7 subunit to examine the distribution of the DNA sites bound by Mediator during growth in either the yeast or the hyphal form, two distinct morphologies characterized by different transcription profiles. We observed a core set of 200 genes bound by Med7 under both conditions; this core set is expanded moderately during yeast growth, but is expanded considerably during hyphal growth, supporting the idea that Mediator binding correlates with changes in transcriptional activity and that this binding is condition specific. Med7 bound not only in the promoter regions of active genes but also within coding regions and at the 3′ ends of genes. By combining genome-wide location profiling, expression analyses and phenotyping, we have identified different Med7p-influenced regulons including genes related to glycolysis and the Filamentous Growth Regulator family. In the absence of Med7, the ribosomal regulon is de-repressed, suggesting Med7 is involved in central aspects of growth control. PMID:25375174
Ghosh, Sujoy; Vivar, Juan; Nelson, Christopher P; Willenborg, Christina; Segrè, Ayellet V; Mäkinen, Ville-Petteri; Nikpay, Majid; Erdmann, Jeannette; Blankenberg, Stefan; O'Donnell, Christopher; März, Winfried; Laaksonen, Reijo; Stewart, Alexandre F R; Epstein, Stephen E; Shah, Svati H; Granger, Christopher B; Hazen, Stanley L; Kathiresan, Sekar; Reilly, Muredach P; Yang, Xia; Quertermous, Thomas; Samani, Nilesh J; Schunkert, Heribert; Assimes, Themistocles L; McPherson, Ruth
2015-07-01
Genome-wide association studies have identified multiple genetic variants affecting the risk of coronary artery disease (CAD). However, individually these explain only a small fraction of the heritability of CAD and for most, the causal biological mechanisms remain unclear. We sought to obtain further insights into potential causal processes of CAD by integrating large-scale GWA data with expertly curated databases of core human pathways and functional networks. Using pathways (gene sets) from Reactome, we carried out a 2-stage gene set enrichment analysis strategy. From a meta-analyzed discovery cohort of 7 CAD genome-wide association study data sets (9889 cases/11 089 controls), nominally significant gene sets were tested for replication in a meta-analysis of 9 additional studies (15 502 cases/55 730 controls) from the Coronary ARtery DIsease Genome wide Replication and Meta-analysis (CARDIoGRAM) Consortium. A total of 32 of 639 Reactome pathways tested showed convincing association with CAD (replication P<0.05). These pathways resided in 9 of 21 core biological processes represented in Reactome, and included pathways relevant to extracellular matrix (ECM) integrity, innate immunity, axon guidance, and signaling by PDRF (platelet-derived growth factor), NOTCH, and the transforming growth factor-β/SMAD receptor complex. Many of these pathways had strengths of association comparable to those observed in lipid transport pathways. Network analysis of unique genes within the replicated pathways further revealed several interconnected functional and topologically interacting modules representing novel associations (eg, semaphoring-regulated axonal guidance pathway) besides confirming known processes (lipid metabolism). The connectivity in the observed networks was statistically significant compared with random networks (P<0.001). Network centrality analysis (degree and betweenness) further identified genes (eg, NCAM1, FYN, FURIN, etc) likely to play critical roles in the maintenance and functioning of several of the replicated pathways. These findings provide novel insights into how genetic variation, interpreted in the context of biological processes and functional interactions among genes, may help define the genetic architecture of CAD. © 2015 American Heart Association, Inc.
GERICOS: A Generic Framework for the Development of On-Board Software
NASA Astrophysics Data System (ADS)
Plasson, P.; Cuomo, C.; Gabriel, G.; Gauthier, N.; Gueguen, L.; Malac-Allain, L.
2016-08-01
This paper presents an overview of the GERICOS framework (GEneRIC Onboard Software), its architecture, its various layers and its future evolutions. The GERICOS framework, developed and qualified by LESIA, offers a set of generic, reusable and customizable software components for the rapid development of payload flight software. The GERICOS framework has a layered structure. The first layer (GERICOS::CORE) implements the concept of active objects and forms an abstraction layer over the top of real-time kernels. The second layer (GERICOS::BLOCKS) offers a set of reusable software components for building flight software based on generic solutions to recurrent functionalities. The third layer (GERICOS::DRIVERS) implements software drivers for several COTS IP cores of the LEON processor ecosystem.
Let them fall where they may: congruence analysis in massive phylogenetically messy data sets.
Leigh, Jessica W; Schliep, Klaus; Lopez, Philippe; Bapteste, Eric
2011-10-01
Interest in congruence in phylogenetic data has largely focused on issues affecting multicellular organisms, and animals in particular, in which the level of incongruence is expected to be relatively low. In addition, assessment methods developed in the past have been designed for reasonably small numbers of loci and scale poorly for larger data sets. However, there are currently over a thousand complete genome sequences available and of interest to evolutionary biologists, and these sequences are predominantly from microbial organisms, whose molecular evolution is much less frequently tree-like than that of multicellular life forms. As such, the level of incongruence in these data is expected to be high. We present a congruence method that accommodates both very large numbers of genes and high degrees of incongruence. Our method uses clustering algorithms to identify subsets of genes based on similarity of phylogenetic signal. It involves only a single phylogenetic analysis per gene, and therefore, computation time scales nearly linearly with the number of genes in the data set. We show that our method performs very well with sets of sequence alignments simulated under a wide variety of conditions. In addition, we present an analysis of core genes of prokaryotes, often assumed to have been largely vertically inherited, in which we identify two highly incongruent classes of genes. This result is consistent with the complexity hypothesis.
Lenka, Sangram K; Lohia, Bikash; Kumar, Abhay; Chinnusamy, Viswanathan; Bansal, Kailash C
2009-02-01
Abscisic acid (ABA), the popular plant stress hormone, plays a key role in regulation of sub-set of stress responsive genes. These genes respond to ABA through specific transcription factors which bind to cis-regulatory elements present in their promoters. We discovered the ABA Responsive Element (ABRE) core (ACGT) containing CGMCACGTGB motif as over-represented motif among the promoters of ABA responsive co-expressed genes in rice. Targeted gene prediction strategy using this motif led to the identification of 402 protein coding genes potentially regulated by ABA-dependent molecular genetic network. RT-PCR analysis of arbitrarily chosen 45 genes from the predicted 402 genes confirmed 80% accuracy of our prediction. Plant Gene Ontology (GO) analysis of ABA responsive genes showed enrichment of signal transduction and stress related genes among diverse functional categories.
Using Ontology Fingerprints to disambiguate gene name entities in the biomedical literature.
Chen, Guocai; Zhao, Jieyi; Cohen, Trevor; Tao, Cui; Sun, Jingchun; Xu, Hua; Bernstam, Elmer V; Lawson, Andrew; Zeng, Jia; Johnson, Amber M; Holla, Vijaykumar; Bailey, Ann M; Lara-Guerra, Humberto; Litzenburger, Beate; Meric-Bernstam, Funda; Jim Zheng, W
2015-01-01
Ambiguous gene names in the biomedical literature are a barrier to accurate information extraction. To overcome this hurdle, we generated Ontology Fingerprints for selected genes that are relevant for personalized cancer therapy. These Ontology Fingerprints were used to evaluate the association between genes and biomedical literature to disambiguate gene names. We obtained 93.6% precision for the test gene set and 80.4% for the area under a receiver-operating characteristics curve for gene and article association. The core algorithm was implemented using a graphics processing unit-based MapReduce framework to handle big data and to improve performance. We conclude that Ontology Fingerprints can help disambiguate gene names mentioned in text and analyse the association between genes and articles. Database URL: http://www.ontologyfingerprint.org © The Author(s) 2015. Published by Oxford University Press.
Diversification of Root Hair Development Genes in Vascular Plants.
Huang, Ling; Shi, Xinhui; Wang, Wenjia; Ryu, Kook Hui; Schiefelbein, John
2017-07-01
The molecular genetic program for root hair development has been studied intensively in Arabidopsis ( Arabidopsis thaliana ). To understand the extent to which this program might operate in other plants, we conducted a large-scale comparative analysis of root hair development genes from diverse vascular plants, including eudicots, monocots, and a lycophyte. Combining phylogenetics and transcriptomics, we discovered conservation of a core set of root hair genes across all vascular plants, which may derive from an ancient program for unidirectional cell growth coopted for root hair development during vascular plant evolution. Interestingly, we also discovered preferential diversification in the structure and expression of root hair development genes, relative to other root hair- and root-expressed genes, among these species. These differences enabled the definition of sets of genes and gene functions that were acquired or lost in specific lineages during vascular plant evolution. In particular, we found substantial divergence in the structure and expression of genes used for root hair patterning, suggesting that the Arabidopsis transcriptional regulatory mechanism is not shared by other species. To our knowledge, this study provides the first comprehensive view of gene expression in a single plant cell type across multiple species. © 2017 American Society of Plant Biologists. All Rights Reserved.
Diversification of Root Hair Development Genes in Vascular Plants1[OPEN
Shi, Xinhui; Wang, Wenjia; Ryu, Kook Hui
2017-01-01
The molecular genetic program for root hair development has been studied intensively in Arabidopsis (Arabidopsis thaliana). To understand the extent to which this program might operate in other plants, we conducted a large-scale comparative analysis of root hair development genes from diverse vascular plants, including eudicots, monocots, and a lycophyte. Combining phylogenetics and transcriptomics, we discovered conservation of a core set of root hair genes across all vascular plants, which may derive from an ancient program for unidirectional cell growth coopted for root hair development during vascular plant evolution. Interestingly, we also discovered preferential diversification in the structure and expression of root hair development genes, relative to other root hair- and root-expressed genes, among these species. These differences enabled the definition of sets of genes and gene functions that were acquired or lost in specific lineages during vascular plant evolution. In particular, we found substantial divergence in the structure and expression of genes used for root hair patterning, suggesting that the Arabidopsis transcriptional regulatory mechanism is not shared by other species. To our knowledge, this study provides the first comprehensive view of gene expression in a single plant cell type across multiple species. PMID:28487476
The transcriptomic fingerprint of glucoamylase over-expression in Aspergillus niger
2012-01-01
Background Filamentous fungi such as Aspergillus niger are well known for their exceptionally high capacity for secretion of proteins, organic acids, and secondary metabolites and they are therefore used in biotechnology as versatile microbial production platforms. However, system-wide insights into their metabolic and secretory capacities are sparse and rational strain improvement approaches are therefore limited. In order to gain a genome-wide view on the transcriptional regulation of the protein secretory pathway of A. niger, we investigated the transcriptome of A. niger when it was forced to overexpression the glaA gene (encoding glucoamylase, GlaA) and secrete GlaA to high level. Results An A. niger wild-type strain and a GlaA over-expressing strain, containing multiple copies of the glaA gene, were cultivated under maltose-limited chemostat conditions (specific growth rate 0.1 h-1). Elevated glaA mRNA and extracellular GlaA levels in the over-expressing strain were accompanied by elevated transcript levels from 772 genes and lowered transcript levels from 815 genes when compared to the wild-type strain. Using GO term enrichment analysis, four higher-order categories were identified in the up-regulated gene set: i) endoplasmic reticulum (ER) membrane translocation, ii) protein glycosylation, iii) vesicle transport, and iv) ion homeostasis. Among these, about 130 genes had predicted functions for the passage of proteins through the ER and those genes included target genes of the HacA transcription factor that mediates the unfolded protein response (UPR), e.g. bipA, clxA, prpA, tigA and pdiA. In order to identify those genes that are important for high-level secretion of proteins by A. niger, we compared the transcriptome of the GlaA overexpression strain of A. niger with six other relevant transcriptomes of A. niger. Overall, 40 genes were found to have either elevated (from 36 genes) or lowered (from 4 genes) transcript levels under all conditions that were examined, thus defining the core set of genes important for ensuring high protein traffic through the secretory pathway. Conclusion We have defined the A. niger genes that respond to elevated secretion of GlaA and, furthermore, we have defined a core set of genes that appear to be involved more generally in the intensified traffic of proteins through the secretory pathway of A. niger. The consistent up-regulation of a gene encoding the acetyl-coenzyme A transporter suggests a possible role for transient acetylation to ensure correct folding of secreted proteins. PMID:23237452
Core Promoter Functions in the Regulation of Gene Expression of Drosophila Dorsal Target Genes*
Zehavi, Yonathan; Kuznetsov, Olga; Ovadia-Shochat, Avital; Juven-Gershon, Tamar
2014-01-01
Developmental processes are highly dependent on transcriptional regulation by RNA polymerase II. The RNA polymerase II core promoter is the ultimate target of a multitude of transcription factors that control transcription initiation. Core promoters consist of core promoter motifs, e.g. the initiator, TATA box, and the downstream core promoter element (DPE), which confer specific properties to the core promoter. Here, we explored the importance of core promoter functions in the dorsal-ventral developmental gene regulatory network. This network includes multiple genes that are activated by different nuclear concentrations of Dorsal, an NFκB homolog transcription factor, along the dorsal-ventral axis. We show that over two-thirds of Dorsal target genes contain DPE sequence motifs, which is significantly higher than the proportion of DPE-containing promoters in Drosophila genes. We demonstrate that multiple Dorsal target genes are evolutionarily conserved and functionally dependent on the DPE. Furthermore, we have analyzed the activation of key Dorsal target genes by Dorsal, as well as by another Rel family transcription factor, Relish, and the dependence of their activation on the DPE motif. Using hybrid enhancer-promoter constructs in Drosophila cells and embryo extracts, we have demonstrated that the core promoter composition is an important determinant of transcriptional activity of Dorsal target genes. Taken together, our results provide evidence for the importance of core promoter composition in the regulation of Dorsal target genes. PMID:24634215
The essential gene set of a photosynthetic organism
Rubin, Benjamin E.; Wetmore, Kelly M.; Price, Morgan N.; ...
2015-10-27
Synechococcus elongatus PCC 7942 is a model organism used for studying photosynthesis and the circadian clock, and it is being developed for the production of fuel, industrial chemicals, and pharmaceuticals. To identify a comprehensive set of genes and intergenic regions that impacts fitness in S. elongatus, we created a pooled library of ~250,000 transposon mutants and used sequencing to identify the insertion locations. By analyzing the distribution and survival of these mutants, we identified 718 of the organism's 2,723 genes as essential for survival under laboratory conditions. The validity of the essential gene set is supported by its tight overlapmore » with wellconserved genes and its enrichment for core biological processes. The differences noted between our dataset and these predictors of essentiality, however, have led to surprising biological insights. One such finding is that genes in a large portion of the TCA cycle are dispensable, suggesting that S. elongatus does not require a cyclic TCA process. Furthermore, the density of the transposon mutant library enabled individual and global statements about the essentiality of noncoding RNAs, regulatory elements, and other intergenic regions. In this way, a group I intron located in tRNA Leu , which has been used extensively for phylogenetic studies, was shown here to be essential for the survival of S. elongatus. Our survey of essentiality for every locus in the S. elongatus genome serves as a powerful resource for understanding the organism's physiology and defines the essential gene set required for the growth of a photosynthetic organism.« less
Ede, Christopher; Chen, Ximin; Lin, Meng-Yin; Chen, Yvonne Y
2016-05-20
Inducible transcription systems play a crucial role in a wide array of synthetic biology circuits. However, the majority of inducible promoters are constructed from a limited set of tried-and-true promoter parts, which are susceptible to common shortcomings such as high basal expression levels (i.e., leakiness). To expand the toolbox for regulated mammalian gene expression and facilitate the construction of mammalian genetic circuits with precise functionality, we quantitatively characterized a panel of eight core promoters, including sequences with mammalian, viral, and synthetic origins. We demonstrate that this selection of core promoters can provide a wide range of basal gene expression levels and achieve a gradient of fold-inductions spanning 2 orders of magnitude. Furthermore, commonly used parts such as minimal CMV and minimal SV40 promoters were shown to achieve robust gene expression upon induction, but also suffer from high levels of leakiness. In contrast, a synthetic promoter, YB_TATA, was shown to combine low basal expression with high transcription rate in the induced state to achieve significantly higher fold-induction ratios compared to all other promoters tested. These behaviors remain consistent when the promoters are coupled to different genetic outputs and different response elements, as well as across different host-cell types and DNA copy numbers. We apply this quantitative understanding of core promoter properties to the successful engineering of human T cells that respond to antigen stimulation via chimeric antigen receptor signaling specifically under hypoxic environments. Results presented in this study can facilitate the design and calibration of future mammalian synthetic biology systems capable of precisely programmed functionality.
Batty, Elizabeth M; Chaemchuen, Suwittra; Blacksell, Stuart; Richards, Allen L; Paris, Daniel; Bowden, Rory; Chan, Caroline; Lachumanan, Ramkumar; Day, Nicholas; Donnelly, Peter; Chen, Swaine; Salje, Jeanne
2018-06-01
Orientia tsutsugamushi is a clinically important but neglected obligate intracellular bacterial pathogen of the Rickettsiaceae family that causes the potentially life-threatening human disease scrub typhus. In contrast to the genome reduction seen in many obligate intracellular bacteria, early genetic studies of Orientia have revealed one of the most repetitive bacterial genomes sequenced to date. The dramatic expansion of mobile elements has hampered efforts to generate complete genome sequences using short read sequencing methodologies, and consequently there have been few studies of the comparative genomics of this neglected species. We report new high-quality genomes of O. tsutsugamushi, generated using PacBio single molecule long read sequencing, for six strains: Karp, Kato, Gilliam, TA686, UT76 and UT176. In comparative genomics analyses of these strains together with existing reference genomes from Ikeda and Boryong strains, we identify a relatively small core genome of 657 genes, grouped into core gene islands and separated by repeat regions, and use the core genes to infer the first whole-genome phylogeny of Orientia. Complete assemblies of multiple Orientia genomes verify initial suggestions that these are remarkable organisms. They have larger genomes compared with most other Rickettsiaceae, with widespread amplification of repeat elements and massive chromosomal rearrangements between strains. At the gene level, Orientia has a relatively small set of universally conserved genes, similar to other obligate intracellular bacteria, and the relative expansion in genome size can be accounted for by gene duplication and repeat amplification. Our study demonstrates the utility of long read sequencing to investigate complex bacterial genomes and characterise genomic variation.
A global interaction network maps a wiring diagram of cellular function
Costanzo, Michael; VanderSluis, Benjamin; Koch, Elizabeth N.; Baryshnikova, Anastasia; Pons, Carles; Tan, Guihong; Wang, Wen; Usaj, Matej; Hanchard, Julia; Lee, Susan D.; Pelechano, Vicent; Styles, Erin B.; Billmann, Maximilian; van Leeuwen, Jolanda; van Dyk, Nydia; Lin, Zhen-Yuan; Kuzmin, Elena; Nelson, Justin; Piotrowski, Jeff S.; Srikumar, Tharan; Bahr, Sondra; Chen, Yiqun; Deshpande, Raamesh; Kurat, Christoph F.; Li, Sheena C.; Li, Zhijian; Usaj, Mojca Mattiazzi; Okada, Hiroki; Pascoe, Natasha; Luis, Bryan-Joseph San; Sharifpoor, Sara; Shuteriqi, Emira; Simpkins, Scott W.; Snider, Jamie; Suresh, Harsha Garadi; Tan, Yizhao; Zhu, Hongwei; Malod-Dognin, Noel; Janjic, Vuk; Przulj, Natasa; Troyanskaya, Olga G.; Stagljar, Igor; Xia, Tian; Ohya, Yoshikazu; Gingras, Anne-Claude; Raught, Brian; Boutros, Michael; Steinmetz, Lars M.; Moore, Claire L.; Rosebrock, Adam P.; Caudy, Amy A.; Myers, Chad L.; Andrews, Brenda; Boone, Charles
2017-01-01
We generated a global genetic interaction network for Saccharomyces cerevisiae, constructing over 23 million double mutants, identifying ~550,000 negative and ~350,000 positive genetic interactions. This comprehensive network maps genetic interactions for essential gene pairs, highlighting essential genes as densely connected hubs. Genetic interaction profiles enabled assembly of a hierarchical model of cell function, including modules corresponding to protein complexes and pathways, biological processes, and cellular compartments. Negative interactions connected functionally related genes, mapped core bioprocesses, and identified pleiotropic genes, whereas positive interactions often mapped general regulatory connections among gene pairs, rather than shared functionality. The global network illustrates how coherent sets of genetic interactions connect protein complex and pathway modules to map a functional wiring diagram of the cell. PMID:27708008
Generation of oscillating gene regulatory network motifs
NASA Astrophysics Data System (ADS)
van Dorp, M.; Lannoo, B.; Carlon, E.
2013-07-01
Using an improved version of an evolutionary algorithm originally proposed by François and Hakim [Proc. Natl. Acad. Sci. USAPNASA60027-842410.1073/pnas.0304532101 101, 580 (2004)], we generated small gene regulatory networks in which the concentration of a target protein oscillates in time. These networks may serve as candidates for oscillatory modules to be found in larger regulatory networks and protein interaction networks. The algorithm was run for 105 times to produce a large set of oscillating modules, which were systematically classified and analyzed. The robustness of the oscillations against variations of the kinetic rates was also determined, to filter out the least robust cases. Furthermore, we show that the set of evolved networks can serve as a database of models whose behavior can be compared to experimentally observed oscillations. The algorithm found three smallest (core) oscillators in which nonlinearities and number of components are minimal. Two of those are two-gene modules: the mixed feedback loop, already discussed in the literature, and an autorepressed gene coupled with a heterodimer. The third one is a single gene module which is competitively regulated by a monomer and a dimer. The evolutionary algorithm also generated larger oscillating networks, which are in part extensions of the three core modules and in part genuinely new modules. The latter includes oscillators which do not rely on feedback induced by transcription factors, but are purely of post-transcriptional type. Analysis of post-transcriptional mechanisms of oscillation may provide useful information for circadian clock research, as recent experiments showed that circadian rhythms are maintained even in the absence of transcription.
Berenger, Byron M; Berry, Chrystal; Peterson, Trevor; Fach, Patrick; Delannoy, Sabine; Li, Vincent; Tschetter, Lorelee; Nadon, Celine; Honish, Lance; Louie, Marie; Chui, Linda
2015-01-01
A standardised method for determining Escherichia coli O157:H7 strain relatedness using whole genome sequencing or virulence gene profiling is not yet established. We sought to assess the capacity of either high-throughput polymerase chain reaction (PCR) of 49 virulence genes, core-genome single nt variants (SNVs) or k-mer clustering to discriminate between outbreak-associated and sporadic E. coli O157:H7 isolates. Three outbreaks and multiple sporadic isolates from the province of Alberta, Canada were included in the study. Two of the outbreaks occurred concurrently in 2014 and one occurred in 2012. Pulsed-field gel electrophoresis (PFGE) and multilocus variable-number tandem repeat analysis (MLVA) were employed as comparator typing methods. The virulence gene profiles of isolates from the 2012 and 2014 Alberta outbreak events and contemporary sporadic isolates were mostly identical; therefore the set of virulence genes chosen in this study were not discriminatory enough to distinguish between outbreak clusters. Concordant with PFGE and MLVA results, core genome SNV and k-mer phylogenies clustered isolates from the 2012 and 2014 outbreaks as distinct events. k-mer phylogenies demonstrated increased discriminatory power compared with core SNV phylogenies. Prior to the widespread implementation of whole genome sequencing for routine public health use, issues surrounding cost, technical expertise, software standardisation, and data sharing/comparisons must be addressed.
The phenotypic manifestations of rare CNVs in schizophrenia.
Merikangas, Alison K; Segurado, Ricardo; Cormican, Paul; Heron, Elizabeth A; Anney, Richard J L; Moore, Susan; Kelleher, Eric; Hargreaves, April; Anderson-Schmidt, Heike; Gill, Michael; Gallagher, Louise; Corvin, Aiden
2014-09-01
There is compelling evidence for the role of copy number variants (CNVs) in schizophrenia susceptibility, and it has been estimated that up to 2-3% of schizophrenia cases may carry rare CNVs. Despite evidence that these events are associated with an increased risk across categorical neurodevelopmental disorders, there is limited understanding of the impact of CNVs on the core features of disorders like schizophrenia. Our objective was to evaluate associations between rare CNVs in differentially brain expressed (BE) genes and the core features and clinical correlates of schizophrenia. The sample included 386 cases of Irish ancestry with a diagnosis of schizophrenia, at least one rare CNV impacting any gene, and a core set of phenotypic measures. Statistically significant associations between deletions in differentially BE genes were found for family history of mental illness (decreased prevalence of all CNVs and deletions, unadjusted and adjusted) and for paternal age (increase in deletions only, unadjusted, among those with later ages at birth of patient). The strong effect of a lack of a family history on BE genes suggests that CNVs may comprise one pathway to schizophrenia, whereas a positive family history could index other genetic mechanisms that increase schizophrenia vulnerability. To our knowledge, this is the first investigation of the association between genome-wide CNVs and risk factors and sub-phenotypic features of schizophrenia beyond cognitive function. Copyright © 2014 Elsevier B.V. All rights reserved.
Choudhary, Neeraj; Bawa, Vanya; Paliwal, Rajneesh; Singh, Bikram; Bhat, Mohd. Ashraf; Mir, Javid Iqbal; Gupta, Moni; Sofi, Parvaze A.; Thudi, Mahendar; Varshney, Rajeev K.
2018-01-01
Common bean (Phaseolus vulgaris L.) is one of the most important grain legume crops in the world. The beans grown in north-western Himalayas possess huge diversity for seed color, shape and size but are mostly susceptible to Anthracnose disease caused by seed born fungus Colletotrichum lindemuthianum. Dozens of QTLs/genes have been already identified for this disease in common bean world-wide. However, this is the first report of gene/QTL discovery for Anthracnose using bean germplasm from north-western Himalayas of state Jammu & Kashmir, India. A core set of 96 bean lines comprising 54 indigenous local landraces from 11 hot-spots and 42 exotic lines from 10 different countries were phenotyped at two locations (SKUAST-Jammu and Bhaderwah, Jammu) for Anthracnose resistance. The core set was also genotyped with genome-wide (91) random and trait linked SSR markers. The study of marker-trait associations (MTAs) led to the identification of 10 QTLs/genes for Anthracnose resistance. Among the 10 QTLs/genes identified, two MTAs are stable (BM45 & BM211), two MTAs (PVctt1 & BM211) are major explaining more than 20% phenotypic variation for Anthracnose and one MTA (BM211) is both stable and major. Six (06) genomic regions are reported for the first time, while as four (04) genomic regions validated the already known QTL/gene regions/clusters for Anthracnose. The major, stable and validated markers reported during the present study associated with Anthracnose resistance will prove useful in common bean molecular breeding programs aimed at enhancing Anthracnose resistance of local bean landraces grown in north-western Himalayas of state Jammu and Kashmir. PMID:29389971
Choudhary, Neeraj; Bawa, Vanya; Paliwal, Rajneesh; Singh, Bikram; Bhat, Mohd Ashraf; Mir, Javid Iqbal; Gupta, Moni; Sofi, Parvaze A; Thudi, Mahendar; Varshney, Rajeev K; Mir, Reyazul Rouf
2018-01-01
Common bean (Phaseolus vulgaris L.) is one of the most important grain legume crops in the world. The beans grown in north-western Himalayas possess huge diversity for seed color, shape and size but are mostly susceptible to Anthracnose disease caused by seed born fungus Colletotrichum lindemuthianum. Dozens of QTLs/genes have been already identified for this disease in common bean world-wide. However, this is the first report of gene/QTL discovery for Anthracnose using bean germplasm from north-western Himalayas of state Jammu & Kashmir, India. A core set of 96 bean lines comprising 54 indigenous local landraces from 11 hot-spots and 42 exotic lines from 10 different countries were phenotyped at two locations (SKUAST-Jammu and Bhaderwah, Jammu) for Anthracnose resistance. The core set was also genotyped with genome-wide (91) random and trait linked SSR markers. The study of marker-trait associations (MTAs) led to the identification of 10 QTLs/genes for Anthracnose resistance. Among the 10 QTLs/genes identified, two MTAs are stable (BM45 & BM211), two MTAs (PVctt1 & BM211) are major explaining more than 20% phenotypic variation for Anthracnose and one MTA (BM211) is both stable and major. Six (06) genomic regions are reported for the first time, while as four (04) genomic regions validated the already known QTL/gene regions/clusters for Anthracnose. The major, stable and validated markers reported during the present study associated with Anthracnose resistance will prove useful in common bean molecular breeding programs aimed at enhancing Anthracnose resistance of local bean landraces grown in north-western Himalayas of state Jammu and Kashmir.
McDonald, Jacqueline U.; Kaforou, Myrsini; Clare, Simon; Hale, Christine; Ivanova, Maria; Huntley, Derek; Dorner, Marcus; Wright, Victoria J.; Levin, Michael; Martinon-Torres, Federico; Herberg, Jethro A.
2016-01-01
ABSTRACT Greater understanding of the functions of host gene products in response to infection is required. While many of these genes enable pathogen clearance, some enhance pathogen growth or contribute to disease symptoms. Many studies have profiled transcriptomic and proteomic responses to infection, generating large data sets, but selecting targets for further study is challenging. Here we propose a novel data-mining approach combining multiple heterogeneous data sets to prioritize genes for further study by using respiratory syncytial virus (RSV) infection as a model pathogen with a significant health care impact. The assumption was that the more frequently a gene is detected across multiple studies, the more important its role is. A literature search was performed to find data sets of genes and proteins that change after RSV infection. The data sets were standardized, collated into a single database, and then panned to determine which genes occurred in multiple data sets, generating a candidate gene list. This candidate gene list was validated by using both a clinical cohort and in vitro screening. We identified several genes that were frequently expressed following RSV infection with no assigned function in RSV control, including IFI27, IFIT3, IFI44L, GBP1, OAS3, IFI44, and IRF7. Drilling down into the function of these genes, we demonstrate a role in disease for the gene for interferon regulatory factor 7, which was highly ranked on the list, but not for IRF1, which was not. Thus, we have developed and validated an approach for collating published data sets into a manageable list of candidates, identifying novel targets for future analysis. IMPORTANCE Making the most of “big data” is one of the core challenges of current biology. There is a large array of heterogeneous data sets of host gene responses to infection, but these data sets do not inform us about gene function and require specialized skill sets and training for their utilization. Here we describe an approach that combines and simplifies these data sets, distilling this information into a single list of genes commonly upregulated in response to infection with RSV as a model pathogen. Many of the genes on the list have unknown functions in RSV disease. We validated the gene list with new clinical, in vitro, and in vivo data. This approach allows the rapid selection of genes of interest for further, more-detailed studies, thus reducing time and costs. Furthermore, the approach is simple to use and widely applicable to a range of diseases. PMID:27822537
The Genome Sequence of Taurine Cattle: A Window to Ruminant Biology and Evolution
USDA-ARS?s Scientific Manuscript database
As a major step toward understanding the biology and evolution of ruminants, the cattle genome was sequenced to ~7x coverage using a combined whole genome shotgun and BAC skim approach. The cattle genome contains a minimum of 22,000 genes, with a core set of 14,345 orthologs found in seven mammalian...
Genome plasticity and systems evolution in Streptomyces
2012-01-01
Background Streptomycetes are filamentous soil-dwelling bacteria. They are best known as the producers of a great variety of natural products such as antibiotics, antifungals, antiparasitics, and anticancer agents and the decomposers of organic substances for carbon recycling. They are also model organisms for the studies of gene regulatory networks, morphological differentiation, and stress response. The availability of sets of genomes from closely related Streptomyces strains makes it possible to assess the mechanisms underlying genome plasticity and systems adaptation. Results We present the results of a comprehensive analysis of the genomes of five Streptomyces species with distinct phenotypes. These streptomycetes have a pan-genome comprised of 17,362 orthologous families which includes 3,096 components in the core genome, 5,066 components in the dispensable genome, and 9,200 components that are uniquely present in only one species. The core genome makes up about 33%-45% of each genome repertoire. It contains important genes for Streptomyces biology including those involved in gene regulation, secretion, secondary metabolism and morphological differentiation. Abundant duplicate genes have been identified, with 4%-11% of the whole genomes composed of lineage-specific expansions (LSEs), suggesting that frequent gene duplication or lateral gene transfer events play a role in shaping the genome diversification within this genus. Two patterns of expansion, single gene expansion and chromosome block expansion are observed, representing different scales of duplication. Conclusions Our results provide a catalog of genome components and their potential functional roles in gene regulatory networks and metabolic networks. The core genome components reveal the minimum requirement for streptomycetes to sustain a successful lifecycle in the soil environment, reflecting the effects of both genome evolution and environmental stress acting upon the expressed phenotypes. A better understanding of the LSE gene families will, on the other hand, bring a wealth of new insights into the mechanisms underlying strain-specific phenotypes, such as the production of novel antibiotics, pathogenesis, and adaptive response to environmental challenges. PMID:22759432
2016-01-01
Herpesviridae family is one of the significant viral families which comprises major pathogens of a wide range of hosts. This family includes at least eight species of viruses which are known to infect humans. This family has evolved 180–220 million years ago and the present study highlights that it is still evolving and more genes can be added to the repertoire of this family. In addition, its core-genome includes important viral proteins including glycoprotein B and helicase. Most of the infections caused by human herpesviruses have no definitive cure; thus, search for new therapeutic strategies is necessary. The present study finds core-genome of human herpesviruses that differs from that of Herpesviridae family and nonhuman herpes strains of this family and might be a putative target for vaccine development. The phylogenetic reconstruction based upon the protein sequences of core gene set of Herpesviridae family reveals the sharp splits of its different subfamilies and supports the hypothesis of coevolution of viruses with their hosts. In addition, data mining for cis-elements in the genomes of human herpesviruses results in the prediction of numerous regulatory elements which can be used for regulating the expression of viral based vectors implicated in gene therapies. PMID:27314006
Leroy, Thierry; De Bellis, Fabien; Legnate, Hyacinthe; Musoli, Pascal; Kalonji, Adrien; Loor Solórzano, Rey Gastón; Cubry, Philippe
2014-06-01
The management of diversity for conservation and breeding is of great importance for all plant species and is particularly true in perennial species, such as the coffee Coffea canephora. This species exhibits a large genetic and phenotypic diversity with six different diversity groups. Large field collections are available in the Ivory Coast, Uganda and other Asian, American and African countries but are very expensive and time consuming to establish and maintain in large areas. We propose to improve coffee germplasm management through the construction of genetic core collections derived from a set of 565 accessions that are characterized with 13 microsatellite markers. Core collections of 12, 24 and 48 accessions were defined using two methods aimed to maximize the allelic diversity (Maximization strategy) or genetic distance (Maximum-Length Sub-Tree method). A composite core collection of 77 accessions is proposed for both objectives of an optimal management of diversity and breeding. This core collection presents a gene diversity value of 0.8 and exhibits the totality of the major alleles (i.e., 184) that are present in the initial set. The seven proposed core collections constitute a valuable tool for diversity management and a foundation for breeding programs. The use of these collections for collection management in research centers and breeding perspectives for coffee improvement are discussed.
Phylogeny Reconstruction with Alignment-Free Method That Corrects for Horizontal Gene Transfer.
Bromberg, Raquel; Grishin, Nick V; Otwinowski, Zbyszek
2016-06-01
Advances in sequencing have generated a large number of complete genomes. Traditionally, phylogenetic analysis relies on alignments of orthologs, but defining orthologs and separating them from paralogs is a complex task that may not always be suited to the large datasets of the future. An alternative to traditional, alignment-based approaches are whole-genome, alignment-free methods. These methods are scalable and require minimal manual intervention. We developed SlopeTree, a new alignment-free method that estimates evolutionary distances by measuring the decay of exact substring matches as a function of match length. SlopeTree corrects for horizontal gene transfer, for composition variation and low complexity sequences, and for branch-length nonlinearity caused by multiple mutations at the same site. We tested SlopeTree on 495 bacteria, 73 archaea, and 72 strains of Escherichia coli and Shigella. We compared our trees to the NCBI taxonomy, to trees based on concatenated alignments, and to trees produced by other alignment-free methods. The results were consistent with current knowledge about prokaryotic evolution. We assessed differences in tree topology over different methods and settings and found that the majority of bacteria and archaea have a core set of proteins that evolves by descent. In trees built from complete genomes rather than sets of core genes, we observed some grouping by phenotype rather than phylogeny, for instance with a cluster of sulfur-reducing thermophilic bacteria coming together irrespective of their phyla. The source-code for SlopeTree is available at: http://prodata.swmed.edu/download/pub/slopetree_v1/slopetree.tar.gz.
Phylogeny Reconstruction with Alignment-Free Method That Corrects for Horizontal Gene Transfer
Grishin, Nick V.; Otwinowski, Zbyszek
2016-01-01
Advances in sequencing have generated a large number of complete genomes. Traditionally, phylogenetic analysis relies on alignments of orthologs, but defining orthologs and separating them from paralogs is a complex task that may not always be suited to the large datasets of the future. An alternative to traditional, alignment-based approaches are whole-genome, alignment-free methods. These methods are scalable and require minimal manual intervention. We developed SlopeTree, a new alignment-free method that estimates evolutionary distances by measuring the decay of exact substring matches as a function of match length. SlopeTree corrects for horizontal gene transfer, for composition variation and low complexity sequences, and for branch-length nonlinearity caused by multiple mutations at the same site. We tested SlopeTree on 495 bacteria, 73 archaea, and 72 strains of Escherichia coli and Shigella. We compared our trees to the NCBI taxonomy, to trees based on concatenated alignments, and to trees produced by other alignment-free methods. The results were consistent with current knowledge about prokaryotic evolution. We assessed differences in tree topology over different methods and settings and found that the majority of bacteria and archaea have a core set of proteins that evolves by descent. In trees built from complete genomes rather than sets of core genes, we observed some grouping by phenotype rather than phylogeny, for instance with a cluster of sulfur-reducing thermophilic bacteria coming together irrespective of their phyla. The source-code for SlopeTree is available at: http://prodata.swmed.edu/download/pub/slopetree_v1/slopetree.tar.gz. PMID:27336403
A highly efficient multi-core algorithm for clustering extremely large datasets
2010-01-01
Background In recent years, the demand for computational power in computational biology has increased due to rapidly growing data sets from microarray and other high-throughput technologies. This demand is likely to increase. Standard algorithms for analyzing data, such as cluster algorithms, need to be parallelized for fast processing. Unfortunately, most approaches for parallelizing algorithms largely rely on network communication protocols connecting and requiring multiple computers. One answer to this problem is to utilize the intrinsic capabilities in current multi-core hardware to distribute the tasks among the different cores of one computer. Results We introduce a multi-core parallelization of the k-means and k-modes cluster algorithms based on the design principles of transactional memory for clustering gene expression microarray type data and categorial SNP data. Our new shared memory parallel algorithms show to be highly efficient. We demonstrate their computational power and show their utility in cluster stability and sensitivity analysis employing repeated runs with slightly changed parameters. Computation speed of our Java based algorithm was increased by a factor of 10 for large data sets while preserving computational accuracy compared to single-core implementations and a recently published network based parallelization. Conclusions Most desktop computers and even notebooks provide at least dual-core processors. Our multi-core algorithms show that using modern algorithmic concepts, parallelization makes it possible to perform even such laborious tasks as cluster sensitivity and cluster number estimation on the laboratory computer. PMID:20370922
Blood Gene Expression Profiling of Breast Cancer Survivors Experiencing Fibrosis
DOE Office of Scientific and Technical Information (OSTI.GOV)
Landmark-Hoyvik, Hege, E-mail: hblandma@rr-research.n; Institute for Clinical Medicine, University of Oslo, Oslo; Dumeaux, Vanessa
2011-03-01
Purpose: To extend knowledge on the mechanisms and pathways involved in maintenance of radiation-induced fibrosis (RIF) by performing gene expression profiling of whole blood from breast cancer (BC) survivors with and without fibrosis 3-7 years after end of radiotherapy treatment. Methods and Materials: Gene expression profiles from blood were obtained for 254 BC survivors derived from a cohort of survivors, treated with adjuvant radiotherapy for breast cancer 3-7 years earlier. Analyses of transcriptional differences in blood gene expression between BC survivors with fibrosis (n = 31) and BC survivors without fibrosis (n = 223) were performed using R version 2.8.0more » and tools from the Bioconductor project. Gene sets extracted through a literature search on fibrosis and breast cancer were subsequently used in gene set enrichment analysis. Results: Substantial differences in blood gene expression between BC survivors with and without fibrosis were observed, and 87 differentially expressed genes were identified through linear analysis. Transforming growth factor-{beta}1 signaling was identified as the most significant gene set, showing a down-regulation of most of the core genes, together with up-regulation of a transcriptional activator of the inhibitor of fibrinolysis, Plasminogen activator inhibitor 1 in the BC survivors with fibrosis. Conclusion: Transforming growth factor-{beta}1 signaling was found down-regulated during the maintenance phase of fibrosis as opposed to the up-regulation reported during the early, initiating phase of fibrosis. Hence, once the fibrotic tissue has developed, the maintenance phase might rather involve a deregulation of fibrinolysis and altered degradation of extracellular matrix components.« less
Kahlke, Tim; Goesmann, Alexander; Hjerde, Erik; Willassen, Nils Peder; Haugen, Peik
2012-05-10
The criteria for defining bacterial species and even the concept of bacterial species itself are under debate, and the discussion is apparently intensifying as more genome sequence data is becoming available. However, it is still unclear how the new advances in genomics should be used most efficiently to address this question. In this study we identify genes that are common to any group of genomes in our dataset, to determine whether genes specific to a particular taxon exist and to investigate their potential role in adaptation of bacteria to their specific niche. These genes were named unique core genes. Additionally, we investigate the existence and importance of unique core genes that are found in isolates of phylogenetically non-coherent groups. These groups of isolates, that share a genetic feature without sharing a closest common ancestor, are termed genophyletic groups. The bacterial family Vibrionaceae was used as the model, and we compiled and compared genome sequences of 64 different isolates. Using the software orthoMCL we determined clusters of homologous genes among the investigated genome sequences. We used multilocus sequence analysis to build a host phylogeny and mapped the numbers of unique core genes of all distinct groups of isolates onto the tree. The results show that unique core genes are more likely to be found in monophyletic groups of isolates. Genophyletic groups of isolates, in contrast, are less common especially for large groups of isolate. The subsequent annotation of unique core genes that are present in genophyletic groups indicate a high degree of horizontally transferred genes. Finally, the annotation of the unique core genes of Vibrio cholerae revealed genes involved in aerotaxis and biosynthesis of the iron-chelator vibriobactin. The presented work indicates that genes specific for any taxon inside the bacterial family Vibrionaceae exist. These unique core genes encode conserved metabolic functions that can shed light on the adaptation of a species to its ecological niche. Additionally, our study suggests that unique core genes can be used to aid classification of bacteria and contribute to a bacterial species definition on a genomic level. Furthermore, these genes may be of importance in clinical diagnostics and drug development.
On the Concept of Cis-regulatory Information: From Sequence Motifs to Logic Functions
NASA Astrophysics Data System (ADS)
Tarpine, Ryan; Istrail, Sorin
The regulatory genome is about the “system level organization of the core genomic regulatory apparatus, and how this is the locus of causality underlying the twin phenomena of animal development and animal evolution” (E.H. Davidson. The Regulatory Genome: Gene Regulatory Networks in Development and Evolution, Academic Press, 2006). Information processing in the regulatory genome is done through regulatory states, defined as sets of transcription factors (sequence-specific DNA binding proteins which determine gene expression) that are expressed and active at the same time. The core information processing machinery consists of modular DNA sequence elements, called cis-modules, that interact with transcription factors. The cis-modules “read” the information contained in the regulatory state of the cell through transcription factor binding, “process” it, and directly or indirectly communicate with the basal transcription apparatus to determine gene expression. This endowment of each gene with the information-receiving capacity through their cis-regulatory modules is essential for the response to every possible regulatory state to which it might be exposed during all phases of the life cycle and in all cell types. We present here a set of challenges addressed by our CYRENE research project aimed at studying the cis-regulatory code of the regulatory genome. The CYRENE Project is devoted to (1) the construction of a database, the cis-Lexicon, containing comprehensive information across species about experimentally validated cis-regulatory modules; and (2) the software development of a next-generation genome browser, the cis-Browser, specialized for the regulatory genome. The presentation is anchored on three main computational challenges: the Gene Naming Problem, the Consensus Sequence Bottleneck Problem, and the Logic Function Inference Problem.
Ay, Ahmet; Holland, Jack; Sperlea, Adriana; Devakanmalai, Gnanapackiam Sheela; Knierer, Stephan; Sangervasi, Sebastian; Stevenson, Angel; Özbudak, Ertuğrul M.
2014-01-01
The vertebrate segmentation clock is a gene expression oscillator controlling rhythmic segmentation of the vertebral column during embryonic development. The period of oscillations becomes longer as cells are displaced along the posterior to anterior axis, which results in traveling waves of clock gene expression sweeping in the unsegmented tissue. Although various hypotheses necessitating the inclusion of additional regulatory genes into the core clock network at different spatial locations have been proposed, the mechanism underlying traveling waves has remained elusive. Here, we combined molecular-level computational modeling and quantitative experimentation to solve this puzzle. Our model predicts the existence of an increasing gradient of gene expression time delays along the posterior to anterior direction to recapitulate spatiotemporal profiles of the traveling segmentation clock waves in different genetic backgrounds in zebrafish. We validated this prediction by measuring an increased time delay of oscillatory Her1 protein production along the unsegmented tissue. Our results refuted the need for spatial expansion of the core feedback loop to explain the occurrence of traveling waves. Spatial regulation of gene expression time delays is a novel way of creating dynamic patterns; this is the first report demonstrating such a control mechanism in any tissue and future investigations will explore the presence of analogous examples in other biological systems. PMID:25336742
Phylogenetic Origin and Diversification of RNAi Pathway Genes in Insects.
Dowling, Daniel; Pauli, Thomas; Donath, Alexander; Meusemann, Karen; Podsiadlowski, Lars; Petersen, Malte; Peters, Ralph S; Mayer, Christoph; Liu, Shanlin; Zhou, Xin; Misof, Bernhard; Niehuis, Oliver
2016-12-01
RNA interference (RNAi) refers to the set of molecular processes found in eukaryotic organisms in which small RNA molecules mediate the silencing or down-regulation of target genes. In insects, RNAi serves a number of functions, including regulation of endogenous genes, anti-viral defense, and defense against transposable elements. Despite being well studied in model organisms, such as Drosophila, the distribution of core RNAi pathway genes and their evolution in insects is not well understood. Here we present the most comprehensive overview of the distribution and diversity of core RNAi pathway genes across 100 insect species, encompassing all currently recognized insect orders. We inferred the phylogenetic origin of insect-specific RNAi pathway genes and also identified several hitherto unrecorded gene expansions using whole-body transcriptome data from the international 1KITE (1000 Insect Transcriptome Evolution) project as well as other resources such as i5K (5000 Insect Genome Project). Specifically, we traced the origin of the double stranded RNA binding protein R2D2 to the last common ancestor of winged insects (Pterygota), the loss of Sid-1/Tag-130 orthologs in Antliophora (fleas, flies and relatives, and scorpionflies in a broad sense), and confirm previous evidence for the splitting of the Argonaute proteins Aubergine and Piwi in Brachyceran flies (Diptera, Brachycera). Our study offers new reference points for future experimental research on RNAi-related pathway genes in insects. © The Author(s) 2017. Published by Oxford University Press on behalf of the Society for Molecular Biology and Evolution.
Galperin, Michael Y; Mekhedov, Sergei L; Puigbo, Pere; Smirnov, Sergey; Wolf, Yuri I; Rigden, Daniel J
2012-01-01
Three classes of low-G+C Gram-positive bacteria (Firmicutes), Bacilli, Clostridia and Negativicutes, include numerous members that are capable of producing heat-resistant endospores. Spore-forming firmicutes include many environmentally important organisms, such as insect pathogens and cellulose-degrading industrial strains, as well as human pathogens responsible for such diseases as anthrax, botulism, gas gangrene and tetanus. In the best-studied model organism Bacillus subtilis, sporulation involves over 500 genes, many of which are conserved among other bacilli and clostridia. This work aimed to define the genomic requirements for sporulation through an analysis of the presence of sporulation genes in various firmicutes, including those with smaller genomes than B. subtilis. Cultivable spore-formers were found to have genomes larger than 2300 kb and encompass over 2150 protein-coding genes of which 60 are orthologues of genes that are apparently essential for sporulation in B. subtilis. Clostridial spore-formers lack, among others, spoIIB, sda, spoVID and safA genes and have non-orthologous displacements of spoIIQ and spoIVFA, suggesting substantial differences between bacilli and clostridia in the engulfment and spore coat formation steps. Many B. subtilis sporulation genes, particularly those encoding small acid-soluble spore proteins and spore coat proteins, were found only in the family Bacillaceae, or even in a subset of Bacillus spp. Phylogenetic profiles of sporulation genes, compiled in this work, confirm the presence of a common sporulation gene core, but also illuminate the diversity of the sporulation processes within various lineages. These profiles should help further experimental studies of uncharacterized widespread sporulation genes, which would ultimately allow delineation of the minimal set(s) of sporulation-specific genes in Bacilli and Clostridia. PMID:22882546
Analysis of pan-genome to identify the core genes and essential genes of Brucella spp.
Yang, Xiaowen; Li, Yajie; Zang, Juan; Li, Yexia; Bie, Pengfei; Lu, Yanli; Wu, Qingmin
2016-04-01
Brucella spp. are facultative intracellular pathogens, that cause a contagious zoonotic disease, that can result in such outcomes as abortion or sterility in susceptible animal hosts and grave, debilitating illness in humans. For deciphering the survival mechanism of Brucella spp. in vivo, 42 Brucella complete genomes from NCBI were analyzed for the pan-genome and core genome by identification of their composition and function of Brucella genomes. The results showed that the total 132,143 protein-coding genes in these genomes were divided into 5369 clusters. Among these, 1710 clusters were associated with the core genome, 1182 clusters with strain-specific genes and 2477 clusters with dispensable genomes. COG analysis indicated that 44 % of the core genes were devoted to metabolism, which were mainly responsible for energy production and conversion (COG category C), and amino acid transport and metabolism (COG category E). Meanwhile, approximately 35 % of the core genes were in positive selection. In addition, 1252 potential essential genes were predicted in the core genome by comparison with a prokaryote database of essential genes. The results suggested that the core genes in Brucella genomes are relatively conservation, and the energy and amino acid metabolism play a more important role in the process of growth and reproduction in Brucella spp. This study might help us to better understand the mechanisms of Brucella persistent infection and provide some clues for further exploring the gene modules of the intracellular survival in Brucella spp.
A role for clock genes in sleep homeostasis.
Franken, Paul
2013-10-01
The timing and quality of both sleep and wakefulness are thought to be regulated by the interaction of two processes. One of these two processes keeps track of the prior sleep-wake history and controls the homeostatic need for sleep while the other sets the time-of-day that sleep preferably occurs. The molecular pathways underlying the latter, circadian process have been studied in detail and their key role in physiological time-keeping has been well established. Analyses of sleep in mice and flies lacking core circadian clock gene proteins have demonstrated, however, that besides disrupting circadian rhythms, also sleep homeostatic processes were affected. Subsequent studies revealed that sleep loss alters both the mRNA levels and the specific DNA-binding of the key circadian transcriptional regulators to their target sequences in the mouse brain. The fact that sleep loss impinges on the very core of the molecular circadian circuitry might explain why both inadequate sleep and disrupted circadian rhythms can similarly lead to metabolic pathology. The evidence for a role for clock genes in sleep homeostasis will be reviewed here. Copyright © 2013 Elsevier Ltd. All rights reserved.
Arabidopsis G-protein interactome reveals connections to cell wall carbohydrates and morphogenesis
DOE Office of Scientific and Technical Information (OSTI.GOV)
Klopffleisch, Karsten; Phan, Nguyen; Chen, Jay
2011-01-01
The heterotrimeric G-protein complex is minimally composed of G{alpha}, G{beta}, and G{gamma} subunits. In the classic scenario, the G-protein complex is the nexus in signaling from the plasma membrane, where the heterotrimeric G-protein associates with heptahelical G-protein-coupled receptors (GPCRs), to cytoplasmic target proteins called effectors. Although a number of effectors are known in metazoans and fungi, none of these are predicted to exist in their canonical forms in plants. To identify ab initio plant G-protein effectors and scaffold proteins, we screened a set of proteins from the G-protein complex using two-hybrid complementation in yeast. After deep and exhaustive interrogation, wemore » detected 544 interactions between 434 proteins, of which 68 highly interconnected proteins form the core G-protein interactome. Within this core, over half of the interactions comprising two-thirds of the nodes were retested and validated as genuine in planta. Co-expression analysis in combination with phenotyping of loss-of-function mutations in a set of core interactome genes revealed a novel role for G-proteins in regulating cell wall modification.« less
Arabidopsis G-protein interactome reveals connections to cell wall carbohydrates and morphogenesis.
Klopffleisch, Karsten; Phan, Nguyen; Augustin, Kelsey; Bayne, Robert S; Booker, Katherine S; Botella, Jose R; Carpita, Nicholas C; Carr, Tyrell; Chen, Jin-Gui; Cooke, Thomas Ryan; Frick-Cheng, Arwen; Friedman, Erin J; Fulk, Brandon; Hahn, Michael G; Jiang, Kun; Jorda, Lucia; Kruppe, Lydia; Liu, Chenggang; Lorek, Justine; McCann, Maureen C; Molina, Antonio; Moriyama, Etsuko N; Mukhtar, M Shahid; Mudgil, Yashwanti; Pattathil, Sivakumar; Schwarz, John; Seta, Steven; Tan, Matthew; Temp, Ulrike; Trusov, Yuri; Urano, Daisuke; Welter, Bastian; Yang, Jing; Panstruga, Ralph; Uhrig, Joachim F; Jones, Alan M
2011-09-27
The heterotrimeric G-protein complex is minimally composed of Gα, Gβ, and Gγ subunits. In the classic scenario, the G-protein complex is the nexus in signaling from the plasma membrane, where the heterotrimeric G-protein associates with heptahelical G-protein-coupled receptors (GPCRs), to cytoplasmic target proteins called effectors. Although a number of effectors are known in metazoans and fungi, none of these are predicted to exist in their canonical forms in plants. To identify ab initio plant G-protein effectors and scaffold proteins, we screened a set of proteins from the G-protein complex using two-hybrid complementation in yeast. After deep and exhaustive interrogation, we detected 544 interactions between 434 proteins, of which 68 highly interconnected proteins form the core G-protein interactome. Within this core, over half of the interactions comprising two-thirds of the nodes were retested and validated as genuine in planta. Co-expression analysis in combination with phenotyping of loss-of-function mutations in a set of core interactome genes revealed a novel role for G-proteins in regulating cell wall modification.
Arabidopsis G-protein interactome reveals connections to cell wall carbohydrates and morphogenesis
Klopffleisch, Karsten; Phan, Nguyen; Augustin, Kelsey; Bayne, Robert S; Booker, Katherine S; Botella, Jose R; Carpita, Nicholas C; Carr, Tyrell; Chen, Jin-Gui; Cooke, Thomas Ryan; Frick-Cheng, Arwen; Friedman, Erin J; Fulk, Brandon; Hahn, Michael G; Jiang, Kun; Jorda, Lucia; Kruppe, Lydia; Liu, Chenggang; Lorek, Justine; McCann, Maureen C; Molina, Antonio; Moriyama, Etsuko N; Mukhtar, M Shahid; Mudgil, Yashwanti; Pattathil, Sivakumar; Schwarz, John; Seta, Steven; Tan, Matthew; Temp, Ulrike; Trusov, Yuri; Urano, Daisuke; Welter, Bastian; Yang, Jing; Panstruga, Ralph; Uhrig, Joachim F; Jones, Alan M
2011-01-01
The heterotrimeric G-protein complex is minimally composed of Gα, Gβ, and Gγ subunits. In the classic scenario, the G-protein complex is the nexus in signaling from the plasma membrane, where the heterotrimeric G-protein associates with heptahelical G-protein-coupled receptors (GPCRs), to cytoplasmic target proteins called effectors. Although a number of effectors are known in metazoans and fungi, none of these are predicted to exist in their canonical forms in plants. To identify ab initio plant G-protein effectors and scaffold proteins, we screened a set of proteins from the G-protein complex using two-hybrid complementation in yeast. After deep and exhaustive interrogation, we detected 544 interactions between 434 proteins, of which 68 highly interconnected proteins form the core G-protein interactome. Within this core, over half of the interactions comprising two-thirds of the nodes were retested and validated as genuine in planta. Co-expression analysis in combination with phenotyping of loss-of-function mutations in a set of core interactome genes revealed a novel role for G-proteins in regulating cell wall modification. PMID:21952135
The genome sequence of taurine cattle: a window to ruminant biology and evolution.
Elsik, Christine G; Tellam, Ross L; Worley, Kim C; Gibbs, Richard A; Muzny, Donna M; Weinstock, George M; Adelson, David L; Eichler, Evan E; Elnitski, Laura; Guigó, Roderic; Hamernik, Debora L; Kappes, Steve M; Lewin, Harris A; Lynn, David J; Nicholas, Frank W; Reymond, Alexandre; Rijnkels, Monique; Skow, Loren C; Zdobnov, Evgeny M; Schook, Lawrence; Womack, James; Alioto, Tyler; Antonarakis, Stylianos E; Astashyn, Alex; Chapple, Charles E; Chen, Hsiu-Chuan; Chrast, Jacqueline; Câmara, Francisco; Ermolaeva, Olga; Henrichsen, Charlotte N; Hlavina, Wratko; Kapustin, Yuri; Kiryutin, Boris; Kitts, Paul; Kokocinski, Felix; Landrum, Melissa; Maglott, Donna; Pruitt, Kim; Sapojnikov, Victor; Searle, Stephen M; Solovyev, Victor; Souvorov, Alexandre; Ucla, Catherine; Wyss, Carine; Anzola, Juan M; Gerlach, Daniel; Elhaik, Eran; Graur, Dan; Reese, Justin T; Edgar, Robert C; McEwan, John C; Payne, Gemma M; Raison, Joy M; Junier, Thomas; Kriventseva, Evgenia V; Eyras, Eduardo; Plass, Mireya; Donthu, Ravikiran; Larkin, Denis M; Reecy, James; Yang, Mary Q; Chen, Lin; Cheng, Ze; Chitko-McKown, Carol G; Liu, George E; Matukumalli, Lakshmi K; Song, Jiuzhou; Zhu, Bin; Bradley, Daniel G; Brinkman, Fiona S L; Lau, Lilian P L; Whiteside, Matthew D; Walker, Angela; Wheeler, Thomas T; Casey, Theresa; German, J Bruce; Lemay, Danielle G; Maqbool, Nauman J; Molenaar, Adrian J; Seo, Seongwon; Stothard, Paul; Baldwin, Cynthia L; Baxter, Rebecca; Brinkmeyer-Langford, Candice L; Brown, Wendy C; Childers, Christopher P; Connelley, Timothy; Ellis, Shirley A; Fritz, Krista; Glass, Elizabeth J; Herzig, Carolyn T A; Iivanainen, Antti; Lahmers, Kevin K; Bennett, Anna K; Dickens, C Michael; Gilbert, James G R; Hagen, Darren E; Salih, Hanni; Aerts, Jan; Caetano, Alexandre R; Dalrymple, Brian; Garcia, Jose Fernando; Gill, Clare A; Hiendleder, Stefan G; Memili, Erdogan; Spurlock, Diane; Williams, John L; Alexander, Lee; Brownstein, Michael J; Guan, Leluo; Holt, Robert A; Jones, Steven J M; Marra, Marco A; Moore, Richard; Moore, Stephen S; Roberts, Andy; Taniguchi, Masaaki; Waterman, Richard C; Chacko, Joseph; Chandrabose, Mimi M; Cree, Andy; Dao, Marvin Diep; Dinh, Huyen H; Gabisi, Ramatu Ayiesha; Hines, Sandra; Hume, Jennifer; Jhangiani, Shalini N; Joshi, Vandita; Kovar, Christie L; Lewis, Lora R; Liu, Yih-Shin; Lopez, John; Morgan, Margaret B; Nguyen, Ngoc Bich; Okwuonu, Geoffrey O; Ruiz, San Juana; Santibanez, Jireh; Wright, Rita A; Buhay, Christian; Ding, Yan; Dugan-Rocha, Shannon; Herdandez, Judith; Holder, Michael; Sabo, Aniko; Egan, Amy; Goodell, Jason; Wilczek-Boney, Katarzyna; Fowler, Gerald R; Hitchens, Matthew Edward; Lozado, Ryan J; Moen, Charles; Steffen, David; Warren, James T; Zhang, Jingkun; Chiu, Readman; Schein, Jacqueline E; Durbin, K James; Havlak, Paul; Jiang, Huaiyang; Liu, Yue; Qin, Xiang; Ren, Yanru; Shen, Yufeng; Song, Henry; Bell, Stephanie Nicole; Davis, Clay; Johnson, Angela Jolivet; Lee, Sandra; Nazareth, Lynne V; Patel, Bella Mayurkumar; Pu, Ling-Ling; Vattathil, Selina; Williams, Rex Lee; Curry, Stacey; Hamilton, Cerissa; Sodergren, Erica; Wheeler, David A; Barris, Wes; Bennett, Gary L; Eggen, André; Green, Ronnie D; Harhay, Gregory P; Hobbs, Matthew; Jann, Oliver; Keele, John W; Kent, Matthew P; Lien, Sigbjørn; McKay, Stephanie D; McWilliam, Sean; Ratnakumar, Abhirami; Schnabel, Robert D; Smith, Timothy; Snelling, Warren M; Sonstegard, Tad S; Stone, Roger T; Sugimoto, Yoshikazu; Takasuga, Akiko; Taylor, Jeremy F; Van Tassell, Curtis P; Macneil, Michael D; Abatepaulo, Antonio R R; Abbey, Colette A; Ahola, Virpi; Almeida, Iassudara G; Amadio, Ariel F; Anatriello, Elen; Bahadue, Suria M; Biase, Fernando H; Boldt, Clayton R; Carroll, Jeffery A; Carvalho, Wanessa A; Cervelatti, Eliane P; Chacko, Elsa; Chapin, Jennifer E; Cheng, Ye; Choi, Jungwoo; Colley, Adam J; de Campos, Tatiana A; De Donato, Marcos; Santos, Isabel K F de Miranda; de Oliveira, Carlo J F; Deobald, Heather; Devinoy, Eve; Donohue, Kaitlin E; Dovc, Peter; Eberlein, Annett; Fitzsimmons, Carolyn J; Franzin, Alessandra M; Garcia, Gustavo R; Genini, Sem; Gladney, Cody J; Grant, Jason R; Greaser, Marion L; Green, Jonathan A; Hadsell, Darryl L; Hakimov, Hatam A; Halgren, Rob; Harrow, Jennifer L; Hart, Elizabeth A; Hastings, Nicola; Hernandez, Marta; Hu, Zhi-Liang; Ingham, Aaron; Iso-Touru, Terhi; Jamis, Catherine; Jensen, Kirsty; Kapetis, Dimos; Kerr, Tovah; Khalil, Sari S; Khatib, Hasan; Kolbehdari, Davood; Kumar, Charu G; Kumar, Dinesh; Leach, Richard; Lee, Justin C-M; Li, Changxi; Logan, Krystin M; Malinverni, Roberto; Marques, Elisa; Martin, William F; Martins, Natalia F; Maruyama, Sandra R; Mazza, Raffaele; McLean, Kim L; Medrano, Juan F; Moreno, Barbara T; Moré, Daniela D; Muntean, Carl T; Nandakumar, Hari P; Nogueira, Marcelo F G; Olsaker, Ingrid; Pant, Sameer D; Panzitta, Francesca; Pastor, Rosemeire C P; Poli, Mario A; Poslusny, Nathan; Rachagani, Satyanarayana; Ranganathan, Shoba; Razpet, Andrej; Riggs, Penny K; Rincon, Gonzalo; Rodriguez-Osorio, Nelida; Rodriguez-Zas, Sandra L; Romero, Natasha E; Rosenwald, Anne; Sando, Lillian; Schmutz, Sheila M; Shen, Libing; Sherman, Laura; Southey, Bruce R; Lutzow, Ylva Strandberg; Sweedler, Jonathan V; Tammen, Imke; Telugu, Bhanu Prakash V L; Urbanski, Jennifer M; Utsunomiya, Yuri T; Verschoor, Chris P; Waardenberg, Ashley J; Wang, Zhiquan; Ward, Robert; Weikard, Rosemarie; Welsh, Thomas H; White, Stephen N; Wilming, Laurens G; Wunderlich, Kris R; Yang, Jianqi; Zhao, Feng-Qi
2009-04-24
To understand the biology and evolution of ruminants, the cattle genome was sequenced to about sevenfold coverage. The cattle genome contains a minimum of 22,000 genes, with a core set of 14,345 orthologs shared among seven mammalian species of which 1217 are absent or undetected in noneutherian (marsupial or monotreme) genomes. Cattle-specific evolutionary breakpoint regions in chromosomes have a higher density of segmental duplications, enrichment of repetitive elements, and species-specific variations in genes associated with lactation and immune responsiveness. Genes involved in metabolism are generally highly conserved, although five metabolic genes are deleted or extensively diverged from their human orthologs. The cattle genome sequence thus provides a resource for understanding mammalian evolution and accelerating livestock genetic improvement for milk and meat production.
Comparative genomics of Lactobacillus
Kant, Ravi; Blom, Jochen; Palva, Airi; Siezen, Roland J.; de Vos, Willem M.
2011-01-01
Summary The genus Lactobacillus includes a diverse group of bacteria consisting of many species that are associated with fermentations of plants, meat or milk. In addition, various lactobacilli are natural inhabitants of the intestinal tract of humans and other animals. Finally, several Lactobacillus strains are marketed as probiotics as their consumption can confer a health benefit to host. Presently, 154 Lactobacillus species are known and a growing fraction of these are subject to draft genome sequencing. However, complete genome sequences are needed to provide a platform for detailed genomic comparisons. Therefore, we selected a total of 20 genomes of various Lactobacillus strains for which complete genomic sequences have been reported. These genomes had sizes varying from 1.8 to 3.3 Mb and other characteristic features, such as G+C content that ranged from 33% to 51%. The Lactobacillus pan genome was found to consist of approximately 14 000 protein‐encoding genes while all 20 genomes shared a total of 383 sets of orthologous genes that defined the Lactobacillus core genome (LCG). Based on advanced phylogeny of the proteins encoded by this LCG, we grouped the 20 strains into three main groups and defined core group genes present in all genomes of a single group, signature group genes shared in all genomes of one group but absent in all other Lactobacillus genomes, and Group‐specific ORFans present in core group genes of one group and absent in all other complete genomes. The latter are of specific value in defining the different groups of genomes. The study provides a platform for present individual comparisons as well as future analysis of new Lactobacillus genomes. PMID:21375712
Construction of a minimal genome as a chassis for synthetic biology.
Sung, Bong Hyun; Choe, Donghui; Kim, Sun Chang; Cho, Byung-Kwan
2016-11-30
Microbial diversity and complexity pose challenges in understanding the voluminous genetic information produced from whole-genome sequences, bioinformatics and high-throughput '-omics' research. These challenges can be overcome by a core blueprint of a genome drawn with a minimal gene set, which is essential for life. Systems biology and large-scale gene inactivation studies have estimated the number of essential genes to be ∼300-500 in many microbial genomes. On the basis of the essential gene set information, minimal-genome strains have been generated using sophisticated genome engineering techniques, such as genome reduction and chemical genome synthesis. Current size-reduced genomes are not perfect minimal genomes, but chemically synthesized genomes have just been constructed. Some minimal genomes provide various desirable functions for bioindustry, such as improved genome stability, increased transformation efficacy and improved production of biomaterials. The minimal genome as a chassis genome for synthetic biology can be used to construct custom-designed genomes for various practical and industrial applications. © 2016 The Author(s). published by Portland Press Limited on behalf of the Biochemical Society.
Genetic and epigenetic variation in the lineage specification of regulatory T cells
Arvey, Aaron; van der Veeken, Joris; Plitas, George; Rich, Stephen S; Concannon, Patrick; Rudensky, Alexander Y
2015-01-01
Regulatory T (Treg) cells, which suppress autoimmunity and other inflammatory states, are characterized by a distinct set of genetic elements controlling their gene expression. However, the extent of genetic and associated epigenetic variation in the Treg cell lineage and its possible relation to disease states in humans remain unknown. We explored evolutionary conservation of regulatory elements and natural human inter-individual epigenetic variation in Treg cells to identify the core transcriptional control program of lineage specification. Analysis of single nucleotide polymorphisms in core lineage-specific enhancers revealed disease associations, which were further corroborated by high-resolution genotyping to fine map causal polymorphisms in lineage-specific enhancers. Our findings suggest that a small set of regulatory elements specify the Treg lineage and that genetic variation in Treg cell-specific enhancers may alter Treg cell function contributing to polygenic disease. DOI: http://dx.doi.org/10.7554/eLife.07571.001 PMID:26510014
Exploiting the functional and taxonomic structure of genomic data by probabilistic topic modeling.
Chen, Xin; Hu, Xiaohua; Lim, Tze Y; Shen, Xiajiong; Park, E K; Rosen, Gail L
2012-01-01
In this paper, we present a method that enable both homology-based approach and composition-based approach to further study the functional core (i.e., microbial core and gene core, correspondingly). In the proposed method, the identification of major functionality groups is achieved by generative topic modeling, which is able to extract useful information from unlabeled data. We first show that generative topic model can be used to model the taxon abundance information obtained by homology-based approach and study the microbial core. The model considers each sample as a “document,” which has a mixture of functional groups, while each functional group (also known as a “latent topic”) is a weight mixture of species. Therefore, estimating the generative topic model for taxon abundance data will uncover the distribution over latent functions (latent topic) in each sample. Second, we show that, generative topic model can also be used to study the genome-level composition of “N-mer” features (DNA subreads obtained by composition-based approaches). The model consider each genome as a mixture of latten genetic patterns (latent topics), while each functional pattern is a weighted mixture of the “N-mer” features, thus the existence of core genomes can be indicated by a set of common N-mer features. After studying the mutual information between latent topics and gene regions, we provide an explanation of the functional roles of uncovered latten genetic patterns. The experimental results demonstrate the effectiveness of proposed method.
DOE Office of Scientific and Technical Information (OSTI.GOV)
Rubin, Benjamin E.; Wetmore, Kelly M.; Price, Morgan N.
Synechococcus elongatus PCC 7942 is a model organism used for studying photosynthesis and the circadian clock, and it is being developed for the production of fuel, industrial chemicals, and pharmaceuticals. To identify a comprehensive set of genes and intergenic regions that impacts fitness in S. elongatus, we created a pooled library of ~250,000 transposon mutants and used sequencing to identify the insertion locations. By analyzing the distribution and survival of these mutants, we identified 718 of the organism's 2,723 genes as essential for survival under laboratory conditions. The validity of the essential gene set is supported by its tight overlapmore » with wellconserved genes and its enrichment for core biological processes. The differences noted between our dataset and these predictors of essentiality, however, have led to surprising biological insights. One such finding is that genes in a large portion of the TCA cycle are dispensable, suggesting that S. elongatus does not require a cyclic TCA process. Furthermore, the density of the transposon mutant library enabled individual and global statements about the essentiality of noncoding RNAs, regulatory elements, and other intergenic regions. In this way, a group I intron located in tRNA Leu , which has been used extensively for phylogenetic studies, was shown here to be essential for the survival of S. elongatus. Our survey of essentiality for every locus in the S. elongatus genome serves as a powerful resource for understanding the organism's physiology and defines the essential gene set required for the growth of a photosynthetic organism.« less
Mor, Avishai; Koh, Eugene; Weiner, Lev; Rosenwasser, Shilo; Sibony-Benyamini, Hadas; Fluhr, Robert
2014-05-01
The production of singlet oxygen is typically associated with inefficient dissipation of photosynthetic energy or can arise from light reactions as a result of accumulation of chlorophyll precursors as observed in fluorescent (flu)-like mutants. Such photodynamic production of singlet oxygen is thought to be involved in stress signaling and programmed cell death. Here we show that transcriptomes of multiple stresses, whether from light or dark treatments, were correlated with the transcriptome of the flu mutant. A core gene set of 118 genes, common to singlet oxygen, biotic and abiotic stresses was defined and confirmed to be activated photodynamically by the photosensitizer Rose Bengal. In addition, induction of the core gene set by abiotic and biotic selected stresses was shown to occur in the dark and in nonphotosynthetic tissue. Furthermore, when subjected to various biotic and abiotic stresses in the dark, the singlet oxygen-specific probe Singlet Oxygen Sensor Green detected rapid production of singlet oxygen in the Arabidopsis (Arabidopsis thaliana) root. Subcellular localization of Singlet Oxygen Sensor Green fluorescence showed its accumulation in mitochondria, peroxisomes, and the nucleus, suggesting several compartments as the possible origins or targets for singlet oxygen. Collectively, the results show that singlet oxygen can be produced by multiple stress pathways and can emanate from compartments other than the chloroplast in a light-independent manner. The results imply that the role of singlet oxygen in plant stress regulation and response is more ubiquitous than previously thought.
Mor, Avishai; Koh, Eugene; Weiner, Lev; Rosenwasser, Shilo; Sibony-Benyamini, Hadas; Fluhr, Robert
2014-01-01
The production of singlet oxygen is typically associated with inefficient dissipation of photosynthetic energy or can arise from light reactions as a result of accumulation of chlorophyll precursors as observed in fluorescent (flu)-like mutants. Such photodynamic production of singlet oxygen is thought to be involved in stress signaling and programmed cell death. Here we show that transcriptomes of multiple stresses, whether from light or dark treatments, were correlated with the transcriptome of the flu mutant. A core gene set of 118 genes, common to singlet oxygen, biotic and abiotic stresses was defined and confirmed to be activated photodynamically by the photosensitizer Rose Bengal. In addition, induction of the core gene set by abiotic and biotic selected stresses was shown to occur in the dark and in nonphotosynthetic tissue. Furthermore, when subjected to various biotic and abiotic stresses in the dark, the singlet oxygen-specific probe Singlet Oxygen Sensor Green detected rapid production of singlet oxygen in the Arabidopsis (Arabidopsis thaliana) root. Subcellular localization of Singlet Oxygen Sensor Green fluorescence showed its accumulation in mitochondria, peroxisomes, and the nucleus, suggesting several compartments as the possible origins or targets for singlet oxygen. Collectively, the results show that singlet oxygen can be produced by multiple stress pathways and can emanate from compartments other than the chloroplast in a light-independent manner. The results imply that the role of singlet oxygen in plant stress regulation and response is more ubiquitous than previously thought. PMID:24599491
Using the epigenetic field defect to detect prostate cancer in biopsy negative patients.
Truong, Matthew; Yang, Bing; Livermore, Andrew; Wagner, Jennifer; Weeratunga, Puspha; Huang, Wei; Dhir, Rajiv; Nelson, Joel; Lin, Daniel W; Jarrard, David F
2013-06-01
We determined whether a novel combination of field defect DNA methylation markers could predict the presence of prostate cancer using histologically normal transrectal ultrasound guided biopsy cores. Methylation was assessed using quantitative Pyrosequencing® in a training set consisting of 65 nontumor and tumor associated prostate tissues from University of Wisconsin. A multiplex model was generated using multivariate logistic regression and externally validated in blinded fashion in a set of 47 nontumor and tumor associated biopsy specimens from University of Washington. We observed robust methylation differences in all genes at all CpGs assayed (p <0.0001). Regression models incorporating individual genes (EVX1, CAV1 and FGF1) and a gene combination (EVX1 and FGF1) discriminated nontumor from tumor associated tissues in the original training set (AUC 0.796-0.898, p <0.001). On external validation uniplex models incorporating EVX1, CAV1 or FGF1 discriminated tumor from nontumor associated biopsy negative specimens (AUC 0.702, 0.696 and 0.658, respectively, p <0.05). A multiplex model (EVX1 and FGF1) identified patients with prostate cancer (AUC 0.774, p = 0.001) and had a negative predictive value of 0.909. Comparison between 2 separate cores in patients in this validation set revealed similar methylation defects, indicating detection of a widespread field defect. A widespread epigenetic field defect can be used to detect prostate cancer in patients with histologically negative biopsies. To our knowledge this assay is unique, in that it detects alterations in nontumor cells. With further validation this marker combination (EVX1 and FGF1) has the potential to decrease the need for repeat prostate biopsies, a procedure associated with cost and complications. Copyright © 2013 American Urological Association Education and Research, Inc. Published by Elsevier Inc. All rights reserved.
Comparative Metagenomics Revealed Commonly Enriched Gene Sets in Human Gut Microbiomes
Kurokawa, Ken; Itoh, Takehiko; Kuwahara, Tomomi; Oshima, Kenshiro; Toh, Hidehiro; Toyoda, Atsushi; Takami, Hideto; Morita, Hidetoshi; Sharma, Vineet K.; Srivastava, Tulika P.; Taylor, Todd D.; Noguchi, Hideki; Mori, Hiroshi; Ogura, Yoshitoshi; Ehrlich, Dusko S.; Itoh, Kikuji; Takagi, Toshihisa; Sakaki, Yoshiyuki; Hayashi, Tetsuya; Hattori, Masahira
2007-01-01
Numerous microbes inhabit the human intestine, many of which are uncharacterized or uncultivable. They form a complex microbial community that deeply affects human physiology. To identify the genomic features common to all human gut microbiomes as well as those variable among them, we performed a large-scale comparative metagenomic analysis of fecal samples from 13 healthy individuals of various ages, including unweaned infants. We found that, while the gut microbiota from unweaned infants were simple and showed a high inter-individual variation in taxonomic and gene composition, those from adults and weaned children were more complex but showed a high functional uniformity regardless of age or sex. In searching for the genes over-represented in gut microbiomes, we identified 237 gene families commonly enriched in adult-type and 136 families in infant-type microbiomes, with a small overlap. An analysis of their predicted functions revealed various strategies employed by each type of microbiota to adapt to its intestinal environment, suggesting that these gene sets encode the core functions of adult and infant-type gut microbiota. By analysing the orphan genes, 647 new gene families were identified to be exclusively present in human intestinal microbiomes. In addition, we discovered a conjugative transposon family explosively amplified in human gut microbiomes, which strongly suggests that the intestine is a ‘hot spot’ for horizontal gene transfer between microbes. PMID:17916580
Interrogating the topological robustness of gene regulatory circuits by randomization
Levine, Herbert; Onuchic, Jose N.
2017-01-01
One of the most important roles of cells is performing their cellular tasks properly for survival. Cells usually achieve robust functionality, for example, cell-fate decision-making and signal transduction, through multiple layers of regulation involving many genes. Despite the combinatorial complexity of gene regulation, its quantitative behavior has been typically studied on the basis of experimentally verified core gene regulatory circuitry, composed of a small set of important elements. It is still unclear how such a core circuit operates in the presence of many other regulatory molecules and in a crowded and noisy cellular environment. Here we report a new computational method, named random circuit perturbation (RACIPE), for interrogating the robust dynamical behavior of a gene regulatory circuit even without accurate measurements of circuit kinetic parameters. RACIPE generates an ensemble of random kinetic models corresponding to a fixed circuit topology, and utilizes statistical tools to identify generic properties of the circuit. By applying RACIPE to simple toggle-switch-like motifs, we observed that the stable states of all models converge to experimentally observed gene state clusters even when the parameters are strongly perturbed. RACIPE was further applied to a proposed 22-gene network of the Epithelial-to-Mesenchymal Transition (EMT), from which we identified four experimentally observed gene states, including the states that are associated with two different types of hybrid Epithelial/Mesenchymal phenotypes. Our results suggest that dynamics of a gene circuit is mainly determined by its topology, not by detailed circuit parameters. Our work provides a theoretical foundation for circuit-based systems biology modeling. We anticipate RACIPE to be a powerful tool to predict and decode circuit design principles in an unbiased manner, and to quantitatively evaluate the robustness and heterogeneity of gene expression. PMID:28362798
Schröder, R; Maassen, A; Lippoldt, A; Börner, T; von Baehr, R; Dobrowolski, P
1991-08-01
Using the broad-host-range promoter probe vector pRS201 for cloning of phage Acm1 promoters, we established a convenient vector system for expression of heterologous genes in different Gram-negative bacteria. The usefulness of this system was demonstrated by expression of the HBV core gene in Acetobacter methanolicus. Plasmids carrying the HBV core gene downstream of different Acm1-phage promoters were transferred to A. methanolicus, a new potential host for recombinant DNA expression. Using enzyme immunoassay and immunoblot techniques, the amount and composition of core antigen produced in A. methanolicus were compared with that derived from Escherichia coli. The expression of immunoreactive core antigen in A. methanolicus exceeds by sevenfold that in E. coli using an expression system with tandemly arranged promoters. Morphological observations by electron microscopy show that the HBV core gene products isolated from both hosts are assembled into regular spherical particles with a diameter of about 28 nm that are comparable to original viral nucleocapsids.
The carnegie protein trap library: a versatile tool for Drosophila developmental studies.
Buszczak, Michael; Paterno, Shelley; Lighthouse, Daniel; Bachman, Julia; Planck, Jamie; Owen, Stephenie; Skora, Andrew D; Nystul, Todd G; Ohlstein, Benjamin; Allen, Anna; Wilhelm, James E; Murphy, Terence D; Levis, Robert W; Matunis, Erika; Srivali, Nahathai; Hoskins, Roger A; Spradling, Allan C
2007-03-01
Metazoan physiology depends on intricate patterns of gene expression that remain poorly known. Using transposon mutagenesis in Drosophila, we constructed a library of 7404 protein trap and enhancer trap lines, the Carnegie collection, to facilitate gene expression mapping at single-cell resolution. By sequencing the genomic insertion sites, determining splicing patterns downstream of the enhanced green fluorescent protein (EGFP) exon, and analyzing expression patterns in the ovary and salivary gland, we found that 600-900 different genes are trapped in our collection. A core set of 244 lines trapped different identifiable protein isoforms, while insertions likely to act as GFP-enhancer traps were found in 256 additional genes. At least 8 novel genes were also identified. Our results demonstrate that the Carnegie collection will be useful as a discovery tool in diverse areas of cell and developmental biology and suggest new strategies for greatly increasing the coverage of the Drosophila proteome with protein trap insertions.
Lin, Wen-Hsien; Liu, Wei-Chung; Hwang, Ming-Jing
2009-03-11
Human cells of various tissue types differ greatly in morphology despite having the same set of genetic information. Some genes are expressed in all cell types to perform house-keeping functions, while some are selectively expressed to perform tissue-specific functions. In this study, we wished to elucidate how proteins encoded by human house-keeping genes and tissue-specific genes are organized in human protein-protein interaction networks. We constructed protein-protein interaction networks for different tissue types using two gene expression datasets and one protein-protein interaction database. We then calculated three network indices of topological importance, the degree, closeness, and betweenness centralities, to measure the network position of proteins encoded by house-keeping and tissue-specific genes, and quantified their local connectivity structure. Compared to a random selection of proteins, house-keeping gene-encoded proteins tended to have a greater number of directly interacting neighbors and occupy network positions in several shortest paths of interaction between protein pairs, whereas tissue-specific gene-encoded proteins did not. In addition, house-keeping gene-encoded proteins tended to connect with other house-keeping gene-encoded proteins in all tissue types, whereas tissue-specific gene-encoded proteins also tended to connect with other tissue-specific gene-encoded proteins, but only in approximately half of the tissue types examined. Our analysis showed that house-keeping gene-encoded proteins tend to occupy important network positions, while those encoded by tissue-specific genes do not. The biological implications of our findings were discussed and we proposed a hypothesis regarding how cells organize their protein tools in protein-protein interaction networks. Our results led us to speculate that house-keeping gene-encoded proteins might form a core in human protein-protein interaction networks, while clusters of tissue-specific gene-encoded proteins are attached to the core at more peripheral positions of the networks.
Hara, Yuichiro; Tatsumi, Kaori; Yoshida, Michio; Kajikawa, Eriko; Kiyonari, Hiroshi; Kuraku, Shigehiro
2015-11-18
RNA-seq enables gene expression profiling in selected spatiotemporal windows and yields massive sequence information with relatively low cost and time investment, even for non-model species. However, there remains a large room for optimizing its workflow, in order to take full advantage of continuously developing sequencing capacity. Transcriptome sequencing for three embryonic stages of Madagascar ground gecko (Paroedura picta) was performed with the Illumina platform. The output reads were assembled de novo for reconstructing transcript sequences. In order to evaluate the completeness of transcriptome assemblies, we prepared a reference gene set consisting of vertebrate one-to-one orthologs. To take advantage of increased read length of >150 nt, we demonstrated shortened RNA fragmentation time, which resulted in a dramatic shift of insert size distribution. To evaluate products of multiple de novo assembly runs incorporating reads with different RNA sources, read lengths, and insert sizes, we introduce a new reference gene set, core vertebrate genes (CVG), consisting of 233 genes that are shared as one-to-one orthologs by all vertebrate genomes examined (29 species)., The completeness assessment performed by the computational pipelines CEGMA and BUSCO referring to CVG, demonstrated higher accuracy and resolution than with the gene set previously established for this purpose. As a result of the assessment with CVG, we have derived the most comprehensive transcript sequence set of the Madagascar ground gecko by means of assembling individual libraries followed by clustering the assembled sequences based on their overall similarities. Our results provide several insights into optimizing de novo RNA-seq workflow, including the coordination between library insert size and read length, which manifested in improved connectivity of assemblies. The approach and assembly assessment with CVG demonstrated here would be applicable to transcriptome analysis of other species as well as whole genome analyses.
Benchmarking of Methods for Genomic Taxonomy
Larsen, Mette V.; Cosentino, Salvatore; Lukjancenko, Oksana; ...
2014-02-26
One of the first issues that emerges when a prokaryotic organism of interest is encountered is the question of what it is—that is, which species it is. The 16S rRNA gene formed the basis of the first method for sequence-based taxonomy and has had a tremendous impact on the field of microbiology. Nevertheless, the method has been found to have a number of shortcomings. In this paper, we trained and benchmarked five methods for whole-genome sequence-based prokaryotic species identification on a common data set of complete genomes: (i) SpeciesFinder, which is based on the complete 16S rRNA gene; (ii) Reads2Typemore » that searches for species-specific 50-mers in either the 16S rRNA gene or the gyrB gene (for the Enterobacteraceae family); (iii) the ribosomal multilocus sequence typing (rMLST) method that samples up to 53 ribosomal genes; (iv) TaxonomyFinder, which is based on species-specific functional protein domain profiles; and finally (v) KmerFinder, which examines the number of cooccurring k-mers (substrings of k nucleotides in DNA sequence data). The performances of the methods were subsequently evaluated on three data sets of short sequence reads or draft genomes from public databases. In total, the evaluation sets constituted sequence data from more than 11,000 isolates covering 159 genera and 243 species. Our results indicate that methods that sample only chromosomal, core genes have difficulties in distinguishing closely related species which only recently diverged. Finally, the KmerFinder method had the overall highest accuracy and correctly identified from 93% to 97% of the isolates in the evaluations sets.« less
Zheng, Chunfang; Santos Muñoz, Daniella; Albert, Victor A; Sankoff, David
2015-01-01
Following whole genome duplication (WGD), there is a compact distribution of gene similarities within the genome reflecting duplicate pairs of all the genes in the genome. With time, the distribution broadens and loses volume due to variable decay of duplicate gene similarity and to the process of duplicate gene loss. If there are two WGD, the older one becomes so reduced and broad that it merges with the tail of the distributions resulting from more recent events, and it becomes difficult to distinguish them. The goal of this paper is to advance statistical methods of identifying, or at least counting, the WGD events in the lineage of a given genome. For a set of 15 angiosperm genomes, we analyze all 15 × 14 = 210 ordered pairs of target genome versus reference genome, using SynMap to find syntenic blocks. We consider all sets of B ≥ 2 syntenic blocks in the target genome that overlap in the reference genome as evidence of WGD activity in the target, whether it be one event or several. We hypothesize that in fitting an exponential function to the tail of the empirical distribution f (B) of block multiplicities, the size of the exponent will reflect the amount of WGD in the history of the target genome. By amalgamating the results from all reference genomes, a range of values of SynMap parameters, and alternative cutoff points for the tail, we find a clear pattern whereby multiple-WGD core eudicots have the smallest (negative) exponents, followed by core eudicots with only the single "γ" triplication in their history, followed by a non-core eudicot with a single WGD, followed by the monocots, with a basal angiosperm, the WGD-free Amborella having the largest exponent. The hypothesis that the exponent of the fit to the tail of the multiplicity distribution is a signature of the amount of WGD is verified, but there is also a clear complicating factor in the monocot clade, where a history of multiple WGD is not reflected in a small exponent.
Tailoring the dendrimer core for efficient gene delivery.
Hu, Jingjing; Hu, Ke; Cheng, Yiyun
2016-04-15
Dendrimers have been widely used as non-viral gene vectors due to well-defined chemical structures, high density of cationic charges and ease of surface modification. Although a large number of studies have reported the important roles of dendrimer architecture, component, generation and surface functionality in gene delivery, the effect of dendrimer core on this issue still remains unclear. Recent literatures suggest that a slight alternation in dendrimer core has a profound effect in the transfection efficacy and biocompatibility. In this review, we will discuss the transfection mechanism of dendrimers with different types of cores in respect of flexibility, hydrophobicity and functionality. We hope to open a possibility of designing efficient dendrimers for gene delivery by choosing a proper dendrimer core. As a branch of researches on dendrimers and dendritic polymers, the design of biocompatible and high efficient polymeric gene carriers has attracted increasing attentions during these years. Although the effect of dendrimer generation, species, architecture and surface functionality on gene delivery have been widely reported, the effect of dendrimer core on this issue still remains unclear. Recent literatures suggest that a minor variation on the dendrimer core has a profound effect in the transfection efficacy and biocompatibility. This critical review summarized the dendrimers with different types of cores and discussed the transfection mechanism with particular focus on the flexibility, hydrophobicity, and functionality. It is hoped to provide a new insight to design efficient and safe dendrimer-based gene vectors by choosing a proper core. To the best of our knowledge, this is the first review on the effect of dendrimer core on gene delivery. The findings obtained in this filed are of central importance in the design of efficient polymeric gene vectors. This article will appeal a wide readership such as physical chemist, dendrimer chemist, biological chemist, pharmaceutical scientist, and biomaterial researchers. We hope that this review article can be published by Acta Biomaterialia, a top journal that publishes important reviews in the field of biomaterials science. Copyright © 2016 Acta Materialia Inc. Published by Elsevier Ltd. All rights reserved.
Defining the Estimated Core Genome of Bacterial Populations Using a Bayesian Decision Model
van Tonder, Andries J.; Mistry, Shilan; Bray, James E.; Hill, Dorothea M. C.; Cody, Alison J.; Farmer, Chris L.; Klugman, Keith P.; von Gottberg, Anne; Bentley, Stephen D.; Parkhill, Julian; Jolley, Keith A.; Maiden, Martin C. J.; Brueggemann, Angela B.
2014-01-01
The bacterial core genome is of intense interest and the volume of whole genome sequence data in the public domain available to investigate it has increased dramatically. The aim of our study was to develop a model to estimate the bacterial core genome from next-generation whole genome sequencing data and use this model to identify novel genes associated with important biological functions. Five bacterial datasets were analysed, comprising 2096 genomes in total. We developed a Bayesian decision model to estimate the number of core genes, calculated pairwise evolutionary distances (p-distances) based on nucleotide sequence diversity, and plotted the median p-distance for each core gene relative to its genome location. We designed visually-informative genome diagrams to depict areas of interest in genomes. Case studies demonstrated how the model could identify areas for further study, e.g. 25% of the core genes with higher sequence diversity in the Campylobacter jejuni and Neisseria meningitidis genomes encoded hypothetical proteins. The core gene with the highest p-distance value in C. jejuni was annotated in the reference genome as a putative hydrolase, but further work revealed that it shared sequence homology with beta-lactamase/metallo-beta-lactamases (enzymes that provide resistance to a range of broad-spectrum antibiotics) and thioredoxin reductase genes (which reduce oxidative stress and are essential for DNA replication) in other C. jejuni genomes. Our Bayesian model of estimating the core genome is principled, easy to use and can be applied to large genome datasets. This study also highlighted the lack of knowledge currently available for many core genes in bacterial genomes of significant global public health importance. PMID:25144616
Density-based cluster algorithms for the identification of core sets
NASA Astrophysics Data System (ADS)
Lemke, Oliver; Keller, Bettina G.
2016-10-01
The core-set approach is a discretization method for Markov state models of complex molecular dynamics. Core sets are disjoint metastable regions in the conformational space, which need to be known prior to the construction of the core-set model. We propose to use density-based cluster algorithms to identify the cores. We compare three different density-based cluster algorithms: the CNN, the DBSCAN, and the Jarvis-Patrick algorithm. While the core-set models based on the CNN and DBSCAN clustering are well-converged, constructing core-set models based on the Jarvis-Patrick clustering cannot be recommended. In a well-converged core-set model, the number of core sets is up to an order of magnitude smaller than the number of states in a conventional Markov state model with comparable approximation error. Moreover, using the density-based clustering one can extend the core-set method to systems which are not strongly metastable. This is important for the practical application of the core-set method because most biologically interesting systems are only marginally metastable. The key point is to perform a hierarchical density-based clustering while monitoring the structure of the metric matrix which appears in the core-set method. We test this approach on a molecular-dynamics simulation of a highly flexible 14-residue peptide. The resulting core-set models have a high spatial resolution and can distinguish between conformationally similar yet chemically different structures, such as register-shifted hairpin structures.
The shape of the human language-ready brain
Boeckx, Cedric; Benítez-Burraco, Antonio
2014-01-01
Our core hypothesis is that the emergence of our species-specific language-ready brain ought to be understood in light of the developmental changes expressed at the levels of brain morphology and neural connectivity that occurred in our species after the split from Neanderthals–Denisovans and that gave us a more globular braincase configuration. In addition to changes at the cortical level, we hypothesize that the anatomical shift that led to globularity also entailed significant changes at the subcortical level. We claim that the functional consequences of such changes must also be taken into account to gain a fuller understanding of our linguistic capacity. Here we focus on the thalamus, which we argue is central to language and human cognition, as it modulates fronto-parietal activity. With this new neurobiological perspective in place, we examine its possible molecular basis. We construct a candidate gene set whose members are involved in the development and connectivity of the thalamus, in the evolution of the human head, and are known to give rise to language-associated cognitive disorders. We submit that the new gene candidate set opens up new windows into our understanding of the genetic basis of our linguistic capacity. Thus, our hypothesis aims at generating new testing grounds concerning core aspects of language ontogeny and phylogeny. PMID:24772099
The Genome Sequence of Taurine Cattle: A window to ruminant biology and evolution
Elsik, Christine G.; Tellam, Ross L.; Worley, Kim C.
2010-01-01
To understand the biology and evolution of ruminants, the cattle genome was sequenced to ∼7× coverage. The cattle genome contains a minimum of 22,000 genes, with a core set of 14,345 orthologs shared among seven mammalian species of which 1,217 are absent or undetected in non-eutherian (marsupial or monotreme) genomes. Cattle-specific evolutionary breakpoint regions in chromosomes have a higher density of segmental duplications, enrichment of repetitive elements, and species-specific variations in genes associated with lactation and immune responsiveness. Genes involved in metabolism are generally highly conserved, although five metabolic genes are deleted or extensively diverged from their human orthologs. The cattle genome sequence thus provides an enabling resource for understanding mammalian evolution and accelerating livestock genetic improvement for milk and meat production. PMID:19390049
Salazar-Jaramillo, Laura; Jalvingh, Kirsten M; de Haan, Ammerins; Kraaijeveld, Ken; Buermans, Henk; Wertheim, Bregje
2017-04-27
Parasitoid resistance in Drosophila varies considerably, among and within species. An immune response, lamellocyte-mediated encapsulation, evolved in a subclade of Drosophila and was subsequently lost in at least one species within this subclade. While the mechanisms of resistance are fairly well documented in D. melanogaster, much less is known for closely related species. Here, we studied the inter- and intra-species variation in gene expression after parasitoid attack in Drosophila. We used RNA-seq after parasitization of four closely related Drosophila species of the melanogaster subgroup and replicated lines of D. melanogaster experimentally selected for increased resistance to gain insights into short- and long-term evolutionary changes. We found a core set of genes that are consistently up-regulated after parasitoid attack in the species and lines tested, regardless of their level of resistance. Another set of genes showed no up-regulation or expression in D. sechellia, the species unable to raise an immune response against parasitoids. This set consists largely of genes that are lineage-restricted to the melanogaster subgroup. Artificially selected lines did not show significant differences in gene expression with respect to non-selected lines in their responses to parasitoid attack, but several genes showed differential exon usage. We showed substantial similarities, but also notable differences, in the transcriptional responses to parasitoid attack among four closely related Drosophila species. In contrast, within D. melanogaster, the responses were remarkably similar. We confirmed that in the short-term, selection does not act on a pre-activation of the immune response. Instead it may target alternative mechanisms such as differential exon usage. In the long-term, we found support for the hypothesis that the ability to immunologically resist parasitoid attack is contingent on new genes that are restricted to the melanogaster subgroup.
Woo, Sangsoon; Gao, Hong; Henderson, David; Zacharias, Wolfgang; Liu, Gang; Tran, Quynh T; Prasad, G L
2017-05-03
Smoking has been established as a major risk factor for developing oral squamous cell carcinoma (OSCC), but less attention has been paid to the effects of smokeless tobacco products. Our objective is to identify potential biomarkers to distinguish the biological effects of combustible tobacco products from those of non-combustible ones using oral cell lines. Normal human gingival epithelial cells (HGEC), non-metastatic (101A) and metastatic (101B) OSCC cell lines were exposed to different tobacco product preparations (TPPs) including cigarette smoke total particulate matter (TPM), whole-smoke conditioned media (WS-CM), smokeless tobacco extract in complete artificial saliva (STE), or nicotine (NIC) alone. We performed microarray-based gene expression profiling and found 3456 probe sets from 101A, 1432 probe sets from 101B, and 2717 probe sets from HGEC to be differentially expressed. Gene Set Enrichment Analysis (GSEA) revealed xenobiotic metabolism and steroid biosynthesis were the top two pathways that were upregulated by combustible but not by non-combustible TPPs. Notably, aldo-keto reductase genes, AKR1C1 and AKR1C2 , were the core genes in the top enriched pathways and were statistically upregulated more than eight-fold by combustible TPPs. Quantitative real time polymerase chain reaction (qRT-PCR) results statistically support AKR1C1 as a potential biomarker for differentiating the biological effects of combustible from non-combustible tobacco products.
Woo, Sangsoon; Gao, Hong; Henderson, David; Zacharias, Wolfgang; Liu, Gang; Tran, Quynh T.; Prasad, G.L.
2017-01-01
Smoking has been established as a major risk factor for developing oral squamous cell carcinoma (OSCC), but less attention has been paid to the effects of smokeless tobacco products. Our objective is to identify potential biomarkers to distinguish the biological effects of combustible tobacco products from those of non-combustible ones using oral cell lines. Normal human gingival epithelial cells (HGEC), non-metastatic (101A) and metastatic (101B) OSCC cell lines were exposed to different tobacco product preparations (TPPs) including cigarette smoke total particulate matter (TPM), whole-smoke conditioned media (WS-CM), smokeless tobacco extract in complete artificial saliva (STE), or nicotine (NIC) alone. We performed microarray-based gene expression profiling and found 3456 probe sets from 101A, 1432 probe sets from 101B, and 2717 probe sets from HGEC to be differentially expressed. Gene Set Enrichment Analysis (GSEA) revealed xenobiotic metabolism and steroid biosynthesis were the top two pathways that were upregulated by combustible but not by non-combustible TPPs. Notably, aldo-keto reductase genes, AKR1C1 and AKR1C2, were the core genes in the top enriched pathways and were statistically upregulated more than eight-fold by combustible TPPs. Quantitative real time polymerase chain reaction (qRT-PCR) results statistically support AKR1C1 as a potential biomarker for differentiating the biological effects of combustible from non-combustible tobacco products. PMID:28467356
Role of HCV Core gene of genotype 1a and 3a and host gene Cox-2 in HCV-induced pathogenesis
2011-01-01
Background Hepatitis C virus (HCV) Core protein is thought to trigger activation of multiple signaling pathways and play a significant role in the alteration of cellular gene expression responsible for HCV pathogenesis leading to hepatocellular carcinoma (HCC). However, the exact molecular mechanism of HCV genome specific pathogenesis remains unclear. We examined the in vitro effects of HCV Core protein of HCV genotype 3a and 1a on the cellular genes involved in oxidative stress and angiogenesis. We also studied the ability of HCV Core and Cox-2 siRNA either alone or in combination to inhibit viral replication and cell proliferation in HCV serum infected Huh-7 cells. Results Over expression of Core gene of HCV 3a genotype showed stronger effect in regulating RNA and protein levels of Cox-2, iNOS, VEGF, p-Akt as compared to HCV-1a Core in hepatocellular carcinoma cell line Huh-7 accompanied by enhanced PGE2 release and cell proliferation. We also observed higher expression levels of above genes in HCV 3a patient's blood and biopsy samples. Interestingly, the Core and Cox-2-specific siRNAs down regulated the Core 3a-enhanced expression of Cox-2, iNOS, VEGF, p-Akt. Furthermore, the combined siRNA treatment also showed a dramatic reduction in viral titer and expression of these genes in HCV serum-infected Huh-7 cells. Taken together, these results demonstrated a differential response by HCV 3a genotype in HCV-induced pathogenesis, which may be due to Core and host factor Cox-2 individually or in combination. Conclusions Collectively, these studies not only suggest a genotype-specific interaction between key players of HCV pathogenesis but also may represent combined viral and host gene silencing as a potential therapeutic strategy. PMID:21457561
Schmid, Michael; Muri, Jonathan; Melidis, Damianos; Varadarajan, Adithi R; Somerville, Vincent; Wicki, Adrian; Moser, Aline; Bourqui, Marc; Wenzel, Claudia; Eugster-Meier, Elisabeth; Frey, Juerg E; Irmler, Stefan; Ahrens, Christian H
2018-01-01
Although complete genome sequences hold particular value for an accurate description of core genomes, the identification of strain-specific genes, and as the optimal basis for functional genomics studies, they are still largely underrepresented in public repositories. Based on an assessment of the genome assembly complexity for all lactobacilli, we used Pacific Biosciences' long read technology to sequence and de novo assemble the genomes of three Lactobacillus helveticus starter strains, raising the number of completely sequenced strains to 12. The first comparative genomics study for L. helveticus -to our knowledge-identified a core genome of 988 genes and sets of unique, strain-specific genes ranging from about 30 to more than 200 genes. Importantly, the comparison of MiSeq- and PacBio-based assemblies uncovered that not only accessory but also core genes can be missed in incomplete genome assemblies based on short reads. Analysis of the three genomes revealed that a large number of pseudogenes were enriched for functional Gene Ontology categories such as amino acid transmembrane transport and carbohydrate metabolism, which is in line with a reductive genome evolution in the rich natural habitat of L. helveticus . Notably, the functional Clusters of Orthologous Groups of proteins categories "cell wall/membrane biogenesis" and "defense mechanisms" were found to be enriched among the strain-specific genes. A genome mining effort uncovered examples where an experimentally observed phenotype could be linked to the underlying genotype, such as for cell envelope proteinase PrtH3 of strain FAM8627. Another possible link identified for peptidoglycan hydrolases will require further experiments. Of note, strain FAM22155 did not harbor a CRISPR/Cas system; its loss was also observed in other L. helveticus strains and lactobacillus species, thus questioning the value of the CRISPR/Cas system for diagnostic purposes. Importantly, the complete genome sequences proved to be very useful for the analysis of natural whey starter cultures with metagenomics, as a larger percentage of the sequenced reads of these complex mixtures could be unambiguously assigned down to the strain level.
Schmid, Michael; Muri, Jonathan; Melidis, Damianos; Varadarajan, Adithi R.; Somerville, Vincent; Wicki, Adrian; Moser, Aline; Bourqui, Marc; Wenzel, Claudia; Eugster-Meier, Elisabeth; Frey, Juerg E.; Irmler, Stefan; Ahrens, Christian H.
2018-01-01
Although complete genome sequences hold particular value for an accurate description of core genomes, the identification of strain-specific genes, and as the optimal basis for functional genomics studies, they are still largely underrepresented in public repositories. Based on an assessment of the genome assembly complexity for all lactobacilli, we used Pacific Biosciences' long read technology to sequence and de novo assemble the genomes of three Lactobacillus helveticus starter strains, raising the number of completely sequenced strains to 12. The first comparative genomics study for L. helveticus—to our knowledge—identified a core genome of 988 genes and sets of unique, strain-specific genes ranging from about 30 to more than 200 genes. Importantly, the comparison of MiSeq- and PacBio-based assemblies uncovered that not only accessory but also core genes can be missed in incomplete genome assemblies based on short reads. Analysis of the three genomes revealed that a large number of pseudogenes were enriched for functional Gene Ontology categories such as amino acid transmembrane transport and carbohydrate metabolism, which is in line with a reductive genome evolution in the rich natural habitat of L. helveticus. Notably, the functional Clusters of Orthologous Groups of proteins categories “cell wall/membrane biogenesis” and “defense mechanisms” were found to be enriched among the strain-specific genes. A genome mining effort uncovered examples where an experimentally observed phenotype could be linked to the underlying genotype, such as for cell envelope proteinase PrtH3 of strain FAM8627. Another possible link identified for peptidoglycan hydrolases will require further experiments. Of note, strain FAM22155 did not harbor a CRISPR/Cas system; its loss was also observed in other L. helveticus strains and lactobacillus species, thus questioning the value of the CRISPR/Cas system for diagnostic purposes. Importantly, the complete genome sequences proved to be very useful for the analysis of natural whey starter cultures with metagenomics, as a larger percentage of the sequenced reads of these complex mixtures could be unambiguously assigned down to the strain level. PMID:29441050
Multiple origins of interdependent endosymbiotic complexes in a genus of cicadas.
Łukasik, Piotr; Nazario, Katherine; Van Leuven, James T; Campbell, Matthew A; Meyer, Mariah; Michalik, Anna; Pessacq, Pablo; Simon, Chris; Veloso, Claudio; McCutcheon, John P
2018-01-09
Bacterial endosymbionts that provide nutrients to hosts often have genomes that are extremely stable in structure and gene content. In contrast, the genome of the endosymbiont Hodgkinia cicadicola has fractured into multiple distinct lineages in some species of the cicada genus Tettigades To better understand the frequency, timing, and outcomes of Hodgkinia lineage splitting throughout this cicada genus, we sampled cicadas over three field seasons in Chile and performed genomics and microscopy on representative samples. We found that a single ancestral Hodgkinia lineage has split at least six independent times in Tettigades over the last 4 million years, resulting in complexes of between two and six distinct Hodgkinia lineages per host. Individual genomes in these symbiotic complexes differ dramatically in relative abundance, genome size, organization, and gene content. Each Hodgkinia lineage retains a small set of core genes involved in genetic information processing, but the high level of gene loss experienced by all genomes suggests that extensive sharing of gene products among symbiont cells must occur. In total, Hodgkinia complexes that consist of multiple lineages encode nearly complete sets of genes present on the ancestral single lineage and presumably perform the same functions as symbionts that have not undergone splitting. However, differences in the timing of the splits, along with dissimilar gene loss patterns on the resulting genomes, have led to very different outcomes of lineage splitting in extant cicadas.
paraGSEA: a scalable approach for large-scale gene expression profiling
Peng, Shaoliang; Yang, Shunyun
2017-01-01
Abstract More studies have been conducted using gene expression similarity to identify functional connections among genes, diseases and drugs. Gene Set Enrichment Analysis (GSEA) is a powerful analytical method for interpreting gene expression data. However, due to its enormous computational overhead in the estimation of significance level step and multiple hypothesis testing step, the computation scalability and efficiency are poor on large-scale datasets. We proposed paraGSEA for efficient large-scale transcriptome data analysis. By optimization, the overall time complexity of paraGSEA is reduced from O(mn) to O(m+n), where m is the length of the gene sets and n is the length of the gene expression profiles, which contributes more than 100-fold increase in performance compared with other popular GSEA implementations such as GSEA-P, SAM-GS and GSEA2. By further parallelization, a near-linear speed-up is gained on both workstations and clusters in an efficient manner with high scalability and performance on large-scale datasets. The analysis time of whole LINCS phase I dataset (GSE92742) was reduced to nearly half hour on a 1000 node cluster on Tianhe-2, or within 120 hours on a 96-core workstation. The source code of paraGSEA is licensed under the GPLv3 and available at http://github.com/ysycloud/paraGSEA. PMID:28973463
Evolutionary trends and functional anatomy of the human expanded autophagy network
Till, Andreas; Saito, Rintaro; Merkurjev, Daria; Liu, Jing-Jing; Syed, Gulam Hussain; Kolnik, Martin; Siddiqui, Aleem; Glas, Martin; Scheffler, Björn; Ideker, Trey; Subramani, Suresh
2015-01-01
All eukaryotic cells utilize autophagy for protein and organelle turnover, thus assuring subcellular quality control, homeostasis, and survival. In order to address recent advances in identification of human autophagy associated genes, and to describe autophagy on a system-wide level, we established an autophagy-centered gene interaction network by merging various primary data sets and by retrieving respective interaction data. The resulting network (‘AXAN’) was analyzed with respect to subnetworks, e.g. the prime gene subnetwork (including the core machinery, signaling pathways and autophagy receptors) and the transcription subnetwork. To describe aspects of evolution within this network, we assessed the presence of protein orthologs across 99 eukaryotic model organisms. We visualized evolutionary trends for prime gene categories and evolutionary tracks for selected AXAN genes. This analysis confirms the eukaryotic origin of autophagy core genes while it points to a diverse evolutionary history of autophagy receptors. Next, we used module identification to describe the functional anatomy of the network at the level of pathway modules. In addition to obvious pathways (e.g., lysosomal degradation, insulin signaling) our data unveil the existence of context-related modules such as Rho GTPase signaling. Last, we used a tripartite, image-based RNAi – screen to test candidate genes predicted to play a role in regulation of autophagy. We verified the Rho GTPase, CDC42, as a novel regulator of autophagy-related signaling. This study emphasizes the applicability of system-wide approaches to gain novel insights into a complex biological process and to describe the human autophagy pathway at a hitherto unprecedented level of detail. PMID:26103419
Core outcome sets in women's and newborn health: a systematic review.
Duffy, Jmn; Rolph, R; Gale, C; Hirsch, M; Khan, K S; Ziebland, S; McManus, R J
2017-09-01
Variation in outcome collection and reporting is a serious hindrance to progress in our specialty; therefore, over 80 journals have come together to support the development, dissemination, and implementation of core outcome sets. This study systematically reviewed and characterised registered, progressing, or completed core outcome sets relevant to women's and newborn health. Systematic search using the Core Outcome Measures in Effectiveness Trial initiative and the Core Outcomes in Women's and Newborn Health initiative databases. Registry entries, protocols, systematic reviews, and core outcome sets. Descriptive statistics to describe characteristics and results. There were 49 core outcome sets registered in maternal and newborn health, with the majority registered in 2015 (n = 22; 48%) or 2016 (n = 16; 32%). Benign gynaecology (n = 8; 16%) and newborn health (n = 3; 6%) are currently under-represented. Twenty-four (52%) core outcome sets were funded by international (n = 1; <1%), national (n = 18; 38%), and regional (n = 4; 8%) bodies. Seven protocols were published. Twenty systematic reviews have characterised the inconsistency in outcome reporting across a broad range of relevant healthcare conditions. Four core outcome sets were completed: reconstructive breast surgery (11 outcomes), preterm birth (13 outcomes), epilepsy in pregnancy (29 outcomes), and maternity care (48 outcomes). The quantitative, qualitative, and consensus methods used to develop core outcome sets varied considerably. Core outcome sets are currently being developed across women's and newborn health, although coverage of topics is variable. Development of further infrastructure to develop, disseminate, and implement core outcome sets is urgently required. Forty-nine women's and newborn core outcome sets registered. 50% funded. 7 protocols, 20 systematic reviews, and 4 core outcome sets published. @coreoutcomes @jamesmnduffy. © 2017 Royal College of Obstetricians and Gynaecologists.
Mina, Lida; Soule, Sharon E; Badve, Sunil; Baehner, Fredrick L; Baker, Joffre; Cronin, Maureen; Watson, Drew; Liu, Mei-Lan; Sledge, George W; Shak, Steve; Miller, Kathy D
2007-06-01
Primary chemotherapy provides an ideal opportunity to correlate gene expression with response to treatment. We used paraffin-embedded core biopsies from a completed phase II trial to identify genes that correlate with response to primary chemotherapy. Patients with newly diagnosed stage II or III breast cancer were treated with sequential doxorubicin 75 mg/M2 q2 wks x 3 and docetaxel 40 mg/M2 weekly x 6; treatment order was randomly assigned. Pretreatment core biopsy samples were interrogated for genes that might correlate with pathologic complete response (pCR). In addition to the individual genes, the correlation of the Oncotype DX Recurrence Score with pCR was examined. Of 70 patients enrolled in the parent trial, core biopsies samples with sufficient RNA for gene analyses were available from 45 patients; 9 (20%) had inflammatory breast cancer (IBC). Six (14%) patients achieved a pCR. Twenty-two of the 274 candidate genes assessed correlated with pCR (p < 0.05). Genes correlating with pCR could be grouped into three large clusters: angiogenesis-related genes, proliferation related genes, and invasion-related genes. Expression of estrogen receptor (ER)-related genes and Recurrence Score did not correlate with pCR. In an exploratory analysis we compared gene expression in IBC to non-inflammatory breast cancer; twenty-four (9%) of the genes were differentially expressed (p < 0.05), 5 were upregulated and 19 were downregulated in IBC. Gene expression analysis on core biopsy samples is feasible and identifies candidate genes that correlate with pCR to primary chemotherapy. Gene expression in IBC differs significantly from noninflammatory breast cancer.
Erives, Albert J
2017-11-28
While the genomes of eukaryotes and Archaea both encode the histone-fold domain, only eukaryotes encode the core histone paralogs H2A, H2B, H3, and H4. With DNA, these core histones assemble into the nucleosomal octamer underlying eukaryotic chromatin. Importantly, core histones for H2A and H3 are maintained as neofunctionalized paralogs adapted for general bulk chromatin (canonical H2 and H3) or specialized chromatin (H2A.Z enriched at gene promoters and cenH3s enriched at centromeres). In this context, the identification of core histone-like "doublets" in the cytoplasmic replication factories of the Marseilleviridae (MV) is a novel finding with possible relevance to understanding the origin of eukaryotic chromatin. Here, we analyze and compare the core histone doublet genes from all known MV genomes as well as other MV genes relevant to the origin of the eukaryotic replisome. Using different phylogenetic approaches, we show that MV histone domains encode obligate H2B-H2A and H4-H3 dimers of possible proto-eukaryotic origin. MV core histone moieties form sister clades to each of the four eukaryotic clades of canonical and variant core histones. This suggests that MV core histone moieties diverged prior to eukaryotic neofunctionalizations associated with paired linear chromosomes and variant histone octamer assembly. We also show that MV genomes encode a proto-eukaryotic DNA topoisomerase II enzyme that forms a sister clade to eukaryotes. This is a relevant finding given that DNA topo II influences histone deposition and chromatin compaction and is the second most abundant nuclear protein after histones. The combined domain architecture and phylogenomic analyses presented here suggest that a primitive origin for MV histone genes is a more parsimonious explanation than horizontal gene transfers + gene fusions + sufficient divergence to eliminate relatedness to eukaryotic neofunctionalizations within the H2A and H3 clades without loss of relatedness to each of the four core histone clades. We thus suggest MV histone doublet genes and their DNA topo II gene possibly were acquired from an organism with a chromatinized replisome that diverged prior to the origin of eukaryotic core histone variants for H2/H2A.Z and H3/cenH3. These results also imply that core histones were utilized ancestrally in viral DNA compaction and/or protection from host endonucleases.
Wang, Yupeng; Ficklin, Stephen P; Wang, Xiyin; Feltus, F Alex; Paterson, Andrew H
2016-01-01
Different modes of gene duplication including whole-genome duplication (WGD), and tandem, proximal and dispersed duplications are widespread in angiosperm genomes. Small-scale, stochastic gene relocations and transposed gene duplications are widely accepted to be the primary mechanisms for the creation of dispersed duplicates. However, here we show that most surviving ancient dispersed duplicates in core eudicots originated from large-scale gene relocations within a narrow window of time following a genome triplication (γ) event that occurred in the stem lineage of core eudicots. We name these surviving ancient dispersed duplicates as relocated γ duplicates. In Arabidopsis thaliana, relocated γ, WGD and single-gene duplicates have distinct features with regard to gene functions, essentiality, and protein interactions. Relative to γ duplicates, relocated γ duplicates have higher non-synonymous substitution rates, but comparable levels of expression and regulation divergence. Thus, relocated γ duplicates should be distinguished from WGD and single-gene duplicates for evolutionary investigations. Our results suggest large-scale gene relocations following the γ event were associated with the diversification of core eudicots.
Wang, Yupeng; Ficklin, Stephen P.; Wang, Xiyin; Feltus, F. Alex; Paterson, Andrew H.
2016-01-01
Different modes of gene duplication including whole-genome duplication (WGD), and tandem, proximal and dispersed duplications are widespread in angiosperm genomes. Small-scale, stochastic gene relocations and transposed gene duplications are widely accepted to be the primary mechanisms for the creation of dispersed duplicates. However, here we show that most surviving ancient dispersed duplicates in core eudicots originated from large-scale gene relocations within a narrow window of time following a genome triplication (γ) event that occurred in the stem lineage of core eudicots. We name these surviving ancient dispersed duplicates as relocated γ duplicates. In Arabidopsis thaliana, relocated γ, WGD and single-gene duplicates have distinct features with regard to gene functions, essentiality, and protein interactions. Relative to γ duplicates, relocated γ duplicates have higher non-synonymous substitution rates, but comparable levels of expression and regulation divergence. Thus, relocated γ duplicates should be distinguished from WGD and single-gene duplicates for evolutionary investigations. Our results suggest large-scale gene relocations following the γ event were associated with the diversification of core eudicots. PMID:27195960
Galperin, Michael Y; Mekhedov, Sergei L; Puigbo, Pere; Smirnov, Sergey; Wolf, Yuri I; Rigden, Daniel J
2012-11-01
Three classes of low-G+C Gram-positive bacteria (Firmicutes), Bacilli, Clostridia and Negativicutes, include numerous members that are capable of producing heat-resistant endospores. Spore-forming firmicutes include many environmentally important organisms, such as insect pathogens and cellulose-degrading industrial strains, as well as human pathogens responsible for such diseases as anthrax, botulism, gas gangrene and tetanus. In the best-studied model organism Bacillus subtilis, sporulation involves over 500 genes, many of which are conserved among other bacilli and clostridia. This work aimed to define the genomic requirements for sporulation through an analysis of the presence of sporulation genes in various firmicutes, including those with smaller genomes than B. subtilis. Cultivable spore-formers were found to have genomes larger than 2300 kb and encompass over 2150 protein-coding genes of which 60 are orthologues of genes that are apparently essential for sporulation in B. subtilis. Clostridial spore-formers lack, among others, spoIIB, sda, spoVID and safA genes and have non-orthologous displacements of spoIIQ and spoIVFA, suggesting substantial differences between bacilli and clostridia in the engulfment and spore coat formation steps. Many B. subtilis sporulation genes, particularly those encoding small acid-soluble spore proteins and spore coat proteins, were found only in the family Bacillaceae, or even in a subset of Bacillus spp. Phylogenetic profiles of sporulation genes, compiled in this work, confirm the presence of a common sporulation gene core, but also illuminate the diversity of the sporulation processes within various lineages. These profiles should help further experimental studies of uncharacterized widespread sporulation genes, which would ultimately allow delineation of the minimal set(s) of sporulation-specific genes in Bacilli and Clostridia. Published 2012. This article is a U.S. Government work and is in the public domain in the USA.
Study of hepatitis B virus gene mutations with enzymatic colorimetry-based DNA microarray.
Mao, Hailei; Wang, Huimin; Zhang, Donglei; Mao, Hongju; Zhao, Jianlong; Shi, Jian; Cui, Zhichu
2006-01-01
To establish a modified microarray method for detecting HBV gene mutations in the clinic. Site-specific oligonucleotide probes were immobilized to microarray slides and hybridized to biotin-labeled HBV gene fragments amplified from two-step PCR. Hybridized targets were transferred to nitrocellulose membranes, followed by intensity measurement using BCIP/NBT colorimetry. HBV genes from 99 Hepatitis B patients and 40 healthy blood donors were analyzed. Mutation frequencies of HBV pre-core/core and basic core promoter (BCP) regions were found to be significantly higher in the patient group (42%, 40% versus 2.5%, 5%, P < 0.01). Compared with a traditional fluorescence method, the colorimetry method exhibited the same level of sensitivity and reproducibility. An enzymatic colorimetry-based DNA microarray assay was successfully established to monitor HBV mutations. Pre-core/core and BCP mutations of HBV genes could be major causes of HBV infection in HBeAg-negative patients and could also be relevant to chronicity and aggravation of hepatitis B.
Reduced Set of Virulence Genes Allows High Accuracy Prediction of Bacterial Pathogenicity in Humans
Iraola, Gregorio; Vazquez, Gustavo; Spangenberg, Lucía; Naya, Hugo
2012-01-01
Although there have been great advances in understanding bacterial pathogenesis, there is still a lack of integrative information about what makes a bacterium a human pathogen. The advent of high-throughput sequencing technologies has dramatically increased the amount of completed bacterial genomes, for both known human pathogenic and non-pathogenic strains; this information is now available to investigate genetic features that determine pathogenic phenotypes in bacteria. In this work we determined presence/absence patterns of different virulence-related genes among more than finished bacterial genomes from both human pathogenic and non-pathogenic strains, belonging to different taxonomic groups (i.e: Actinobacteria, Gammaproteobacteria, Firmicutes, etc.). An accuracy of 95% using a cross-fold validation scheme with in-fold feature selection is obtained when classifying human pathogens and non-pathogens. A reduced subset of highly informative genes () is presented and applied to an external validation set. The statistical model was implemented in the BacFier v1.0 software (freely available at ), that displays not only the prediction (pathogen/non-pathogen) and an associated probability for pathogenicity, but also the presence/absence vector for the analyzed genes, so it is possible to decipher the subset of virulence genes responsible for the classification on the analyzed genome. Furthermore, we discuss the biological relevance for bacterial pathogenesis of the core set of genes, corresponding to eight functional categories, all with evident and documented association with the phenotypes of interest. Also, we analyze which functional categories of virulence genes were more distinctive for pathogenicity in each taxonomic group, which seems to be a completely new kind of information and could lead to important evolutionary conclusions. PMID:22916122
The phenotypic manifestations of rare genic CNVs in autism spectrum disorder
Merikangas, A K; Segurado, R; Heron, E A; Anney, R J L; Paterson, A D; Cook, E H; Pinto, D; Scherer, S W; Szatmari, P; Gill, M; Corvin, A P; Gallagher, L
2015-01-01
Significant evidence exists for the association between copy number variants (CNVs) and Autism Spectrum Disorder (ASD); however, most of this work has focused solely on the diagnosis of ASD. There is limited understanding of the impact of CNVs on the ‘sub-phenotypes' of ASD. The objective of this paper is to evaluate associations between CNVs in differentially brain expressed (DBE) genes or genes previously implicated in ASD/intellectual disability (ASD/ID) and specific sub-phenotypes of ASD. The sample consisted of 1590 cases of European ancestry from the Autism Genome Project (AGP) with a diagnosis of an ASD and at least one rare CNV impacting any gene and a core set of phenotypic measures, including symptom severity, language impairments, seizures, gait disturbances, intelligence quotient (IQ) and adaptive function, as well as paternal and maternal age. Classification analyses using a non-parametric recursive partitioning method (random forests) were employed to define sets of phenotypic characteristics that best classify the CNV-defined groups. There was substantial variation in the classification accuracy of the two sets of genes. The best variables for classification were verbal IQ for the ASD/ID genes, paternal age at birth for the DBE genes and adaptive function for de novo CNVs. CNVs in the ASD/ID list were primarily associated with communication and language domains, whereas CNVs in DBE genes were related to broader manifestations of adaptive function. To our knowledge, this is the first study to examine the associations between sub-phenotypes and CNVs genome-wide in ASD. This work highlights the importance of examining the diverse sub-phenotypic manifestations of CNVs in ASD, including the specific features, comorbid conditions and clinical correlates of ASD that comprise underlying characteristics of the disorder. PMID:25421404
The phenotypic manifestations of rare genic CNVs in autism spectrum disorder.
Merikangas, A K; Segurado, R; Heron, E A; Anney, R J L; Paterson, A D; Cook, E H; Pinto, D; Scherer, S W; Szatmari, P; Gill, M; Corvin, A P; Gallagher, L
2015-11-01
Significant evidence exists for the association between copy number variants (CNVs) and Autism Spectrum Disorder (ASD); however, most of this work has focused solely on the diagnosis of ASD. There is limited understanding of the impact of CNVs on the 'sub-phenotypes' of ASD. The objective of this paper is to evaluate associations between CNVs in differentially brain expressed (DBE) genes or genes previously implicated in ASD/intellectual disability (ASD/ID) and specific sub-phenotypes of ASD. The sample consisted of 1590 cases of European ancestry from the Autism Genome Project (AGP) with a diagnosis of an ASD and at least one rare CNV impacting any gene and a core set of phenotypic measures, including symptom severity, language impairments, seizures, gait disturbances, intelligence quotient (IQ) and adaptive function, as well as paternal and maternal age. Classification analyses using a non-parametric recursive partitioning method (random forests) were employed to define sets of phenotypic characteristics that best classify the CNV-defined groups. There was substantial variation in the classification accuracy of the two sets of genes. The best variables for classification were verbal IQ for the ASD/ID genes, paternal age at birth for the DBE genes and adaptive function for de novo CNVs. CNVs in the ASD/ID list were primarily associated with communication and language domains, whereas CNVs in DBE genes were related to broader manifestations of adaptive function. To our knowledge, this is the first study to examine the associations between sub-phenotypes and CNVs genome-wide in ASD. This work highlights the importance of examining the diverse sub-phenotypic manifestations of CNVs in ASD, including the specific features, comorbid conditions and clinical correlates of ASD that comprise underlying characteristics of the disorder.
Westermann, Frank; Muth, Daniel; Benner, Axel; Bauer, Tobias; Henrich, Kai-Oliver; Oberthuer, André; Brors, Benedikt; Beissbarth, Tim; Vandesompele, Jo; Pattyn, Filip; Hero, Barbara; König, Rainer; Fischer, Matthias; Schwab, Manfred
2008-01-01
Background Amplified MYCN oncogene resulting in deregulated MYCN transcriptional activity is observed in 20% of neuroblastomas and identifies a highly aggressive subtype. In MYCN single-copy neuroblastomas, elevated MYCN mRNA and protein levels are paradoxically associated with a more favorable clinical phenotype, including disseminated tumors that subsequently regress spontaneously (stage 4s-non-amplified). In this study, we asked whether distinct transcriptional MYCN or c-MYC activities are associated with specific neuroblastoma phenotypes. Results We defined a core set of direct MYCN/c-MYC target genes by applying gene expression profiling and chromatin immunoprecipitation (ChIP, ChIP-chip) in neuroblastoma cells that allow conditional regulation of MYCN and c-MYC. Their transcript levels were analyzed in 251 primary neuroblastomas. Compared to localized-non-amplified neuroblastomas, MYCN/c-MYC target gene expression gradually increases from stage 4s-non-amplified through stage 4-non-amplified to MYCN amplified tumors. This was associated with MYCN activation in stage 4s-non-amplified and predominantly c-MYC activation in stage 4-non-amplified tumors. A defined set of MYCN/c-MYC target genes was induced in stage 4-non-amplified but not in stage 4s-non-amplified neuroblastomas. In line with this, high expression of a subset of MYCN/c-MYC target genes identifies a patient subtype with poor overall survival independent of the established risk markers amplified MYCN, disease stage, and age at diagnosis. Conclusions High MYCN/c-MYC target gene expression is a hallmark of malignant neuroblastoma progression, which is predominantly driven by c-MYC in stage 4-non-amplified tumors. In contrast, moderate MYCN function gain in stage 4s-non-amplified tumors induces only a restricted set of target genes that is still compatible with spontaneous regression. PMID:18851746
Genome expansion and gene loss in powdery mildew fungi reveal tradeoffs in extreme parasitism.
Spanu, Pietro D; Abbott, James C; Amselem, Joelle; Burgis, Timothy A; Soanes, Darren M; Stüber, Kurt; Ver Loren van Themaat, Emiel; Brown, James K M; Butcher, Sarah A; Gurr, Sarah J; Lebrun, Marc-Henri; Ridout, Christopher J; Schulze-Lefert, Paul; Talbot, Nicholas J; Ahmadinejad, Nahal; Ametz, Christian; Barton, Geraint R; Benjdia, Mariam; Bidzinski, Przemyslaw; Bindschedler, Laurence V; Both, Maike; Brewer, Marin T; Cadle-Davidson, Lance; Cadle-Davidson, Molly M; Collemare, Jerome; Cramer, Rainer; Frenkel, Omer; Godfrey, Dale; Harriman, James; Hoede, Claire; King, Brian C; Klages, Sven; Kleemann, Jochen; Knoll, Daniela; Koti, Prasanna S; Kreplak, Jonathan; López-Ruiz, Francisco J; Lu, Xunli; Maekawa, Takaki; Mahanil, Siraprapa; Micali, Cristina; Milgroom, Michael G; Montana, Giovanni; Noir, Sandra; O'Connell, Richard J; Oberhaensli, Simone; Parlange, Francis; Pedersen, Carsten; Quesneville, Hadi; Reinhardt, Richard; Rott, Matthias; Sacristán, Soledad; Schmidt, Sarah M; Schön, Moritz; Skamnioti, Pari; Sommer, Hans; Stephens, Amber; Takahara, Hiroyuki; Thordal-Christensen, Hans; Vigouroux, Marielle; Wessling, Ralf; Wicker, Thomas; Panstruga, Ralph
2010-12-10
Powdery mildews are phytopathogens whose growth and reproduction are entirely dependent on living plant cells. The molecular basis of this life-style, obligate biotrophy, remains unknown. We present the genome analysis of barley powdery mildew, Blumeria graminis f.sp. hordei (Blumeria), as well as a comparison with the analysis of two powdery mildews pathogenic on dicotyledonous plants. These genomes display massive retrotransposon proliferation, genome-size expansion, and gene losses. The missing genes encode enzymes of primary and secondary metabolism, carbohydrate-active enzymes, and transporters, probably reflecting their redundancy in an exclusively biotrophic life-style. Among the 248 candidate effectors of pathogenesis identified in the Blumeria genome, very few (less than 10) define a core set conserved in all three mildews, suggesting that most effectors represent species-specific adaptations.
Schiariti, Verónica; Mahdi, Soheil; Bölte, Sven
2018-05-30
Capturing functional information is crucial in childhood disability. The International Classification of Functioning, Disability and Health (ICF) Core Sets promote assessments of functional abilities and disabilities in clinical practice regarding circumscribed diagnoses. However, the specificity of ICF Core Sets for childhood-onset disabilities has been doubted. This study aimed to identify content commonalities and differences among the ICF Core Sets for cerebral palsy (CP), and the newly developed Core Sets for autism spectrum disorder (ASD) and attention-deficit-hyperactivity disorder (ADHD). The categories within each Core Set were aggregated at the ICF component and chapter levels. Content comparison was conducted using descriptive analyses. The activities and participation component of the ICF was the most covered across all Core Sets. Main differences included representation of ICF components and coverage of ICF chapters within each component. CP included all ICF components, while ADHD and ASD predominantly focused on activities and participation. Environmental factors were highly represented in the ADHD Core Sets (40.5%) compared to the ASD (28%) and CP (27%) Core Sets. International Classification of Functioning, Disability and Health Core Sets for CP, ASD, and ADHD capture both common but also unique functional information, showing the importance of creating condition-specific, ICF-based tools to build functional profiles of individuals with childhood-onset disabilities. The International Classification of Functioning, Disability and Health (ICF) Core Sets for cerebral palsy (CP), autism spectrum disorder (ASD), and attention-deficit-hyperactivity disorder (ADHD) include unique functional information. The ICF-based tools for CP, ASD, and ADHD differ in terms of representation and coverage of ICF components and ICF chapters. Representation of environmental factors uniquely influences functioning and disability across ICF Core Sets for CP, ASD and ADHD. © 2018 Mac Keith Press.
Evolution and comparative genomics of pAQU-like conjugative plasmids in Vibrio species.
Li, Ruichao; Ye, Lianwei; Wong, Marcus Ho Yin; Zheng, Zhiwei; Chan, Edward Wai Chi; Chen, Sheng
2017-09-01
To investigate a set of MDR conjugative plasmids found in Vibrio species and characterize the underlying evolution process. pAQU-type plasmids from Vibrio species were sequenced using both Illumina and PacBio platforms. Bioinformatics tools were utilized to analyse the typical MDR regions and core genes in the plasmids. The nine pAQU-type plasmids ranged from ∼160 to 206 kb in size and were found to harbour as many as 111 core genes encoding conjugative, replication and maintenance functions. Eight plasmids were found to carry a typical MDR region, which contained various accessory and resistance genes, including ISCR1-blaPER-1-bearing complex class 1 integrons, ISCR2-floR, ISCR2-tet(D)-tetR-ISCR2, qnrVC6, a Tn10-like structure and others associated with mobile elements. Comparison between a plasmid without resistance genes and different MDR plasmids showed that integration of different mobile elements, such as IS26, ISCR1, ISCR2, IS10 and IS6100, into the plasmid backbone was the key mechanism by which foreign resistance genes were acquired during the evolution process. This study identified pAQU-type plasmids as emerging MDR conjugative plasmids among important pathogens from different origins in Asia. These findings suggest that aquatic bacteria constitute a major reservoir of resistance genes, which may be transmissible to other human pathogens during food production and processing. © The Author 2017. Published by Oxford University Press on behalf of the British Society for Antimicrobial Chemotherapy. All rights reserved. For Permissions, please email: journals.permissions@oup.com.
Chang, Xingzhi; Jin, Yiwen; Zhao, Haijuan; Huang, Qionghui; Wang, Jingmin; Yuan, Yun; Han, Ying; Qin, Jiong
2013-03-01
Central core disease is a rare inherited neuromuscular disorder caused by mutations in ryanodine receptor type 1 gene. The clinical phenotype of the disease is highly variable. We report a Chinese pedigree with central core disease confirmed by the gene sequencing. All 3 patients in the family presented with mild proximal limb weakness. The serum level of creatine kinase was normal, and electromyography suggested myogenic changes. The histologic analysis of muscle biopsy showed identical central core lesions in almost all of the muscle fibers in the index case. Exon 90-106 in the C-terminal domain of the ryanodine receptor type 1 gene was amplified using polymerase chain reaction. One heterozygous missense mutation G14678A (Arg4893Gln) in exon 102 was identified in all 3 patients. This is the first report of a familial case of central core disease confirmed by molecular study in mainland China.
Sinha, Amit; Sommer, Ralf J; Dieterich, Christoph
2012-06-19
An organism can respond to changing environmental conditions by adjusting gene regulation and by forming alternative phenotypes. In nematodes, these mechanisms are coupled because many species will form dauer larvae, a stress-resistant and non-aging developmental stage, when exposed to unfavorable environmental conditions, and execute gene expression programs that have been selected for the survival of the animal in the wild. These dauer larvae represent an environmentally induced, homologous developmental stage across many nematode species, sharing conserved morphological and physiological properties. Hence it can be expected that some core components of the associated transcriptional program would be conserved across species, while others might diverge over the course of evolution. However, transcriptional and metabolic analysis of dauer development has been largely restricted to Caenorhabditis elegans. Here, we use a transcriptomic approach to compare the dauer stage in the evolutionary model system Pristionchus pacificus with the dauer stage in C. elegans. We have employed Agilent microarrays, which represent 20,446 P. pacificus and 20,143 C. elegans genes to show an unexpected divergence in the expression profiles of these two nematodes in dauer and dauer exit samples. P. pacificus and C. elegans differ in the dynamics and function of genes that are differentially expressed. We find that only a small number of orthologous gene pairs show similar expression pattern in the dauers of the two species, while the non-orthologous fraction of genes is a major contributor to the active transcriptome in dauers. Interestingly, many of the genes acquired by horizontal gene transfer and orphan genes in P. pacificus, are differentially expressed suggesting that these genes are of evolutionary and functional importance. Our data set provides a catalog for future functional investigations and indicates novel insight into evolutionary mechanisms. We discuss the limited conservation of core developmental and transcriptional programs as a common aspect of animal evolution.
2012-01-01
Background An organism can respond to changing environmental conditions by adjusting gene regulation and by forming alternative phenotypes. In nematodes, these mechanisms are coupled because many species will form dauer larvae, a stress-resistant and non-aging developmental stage, when exposed to unfavorable environmental conditions, and execute gene expression programs that have been selected for the survival of the animal in the wild. These dauer larvae represent an environmentally induced, homologous developmental stage across many nematode species, sharing conserved morphological and physiological properties. Hence it can be expected that some core components of the associated transcriptional program would be conserved across species, while others might diverge over the course of evolution. However, transcriptional and metabolic analysis of dauer development has been largely restricted to Caenorhabditis elegans. Here, we use a transcriptomic approach to compare the dauer stage in the evolutionary model system Pristionchus pacificus with the dauer stage in C. elegans. Results We have employed Agilent microarrays, which represent 20,446 P. pacificus and 20,143 C. elegans genes to show an unexpected divergence in the expression profiles of these two nematodes in dauer and dauer exit samples. P. pacificus and C. elegans differ in the dynamics and function of genes that are differentially expressed. We find that only a small number of orthologous gene pairs show similar expression pattern in the dauers of the two species, while the non-orthologous fraction of genes is a major contributor to the active transcriptome in dauers. Interestingly, many of the genes acquired by horizontal gene transfer and orphan genes in P. pacificus, are differentially expressed suggesting that these genes are of evolutionary and functional importance. Conclusion Our data set provides a catalog for future functional investigations and indicates novel insight into evolutionary mechanisms. We discuss the limited conservation of core developmental and transcriptional programs as a common aspect of animal evolution. PMID:22712530
Janich, Peggy; Arpat, Alaaddin Bulak; Castelo-Szekely, Violeta; Lopes, Maykel; Gatfield, David
2015-01-01
Mammalian gene expression displays widespread circadian oscillations. Rhythmic transcription underlies the core clock mechanism, but it cannot explain numerous observations made at the level of protein rhythmicity. We have used ribosome profiling in mouse liver to measure the translation of mRNAs into protein around the clock and at high temporal and nucleotide resolution. We discovered, transcriptome-wide, extensive rhythms in ribosome occupancy and identified a core set of approximately 150 mRNAs subject to particularly robust daily changes in translation efficiency. Cycling proteins produced from nonoscillating transcripts revealed thus-far-unknown rhythmic regulation associated with specific pathways (notably in iron metabolism, through the rhythmic translation of transcripts containing iron responsive elements), and indicated feedback to the rhythmic transcriptome through novel rhythmic transcription factors. Moreover, estimates of relative levels of core clock protein biosynthesis that we deduced from the data explained known features of the circadian clock better than did mRNA expression alone. Finally, we identified uORF translation as a novel regulatory mechanism within the clock circuitry. Consistent with the occurrence of translated uORFs in several core clock transcripts, loss-of-function of Denr, a known regulator of reinitiation after uORF usage and of ribosome recycling, led to circadian period shortening in cells. In summary, our data offer a framework for understanding the dynamics of translational regulation, circadian gene expression, and metabolic control in a solid mammalian organ. PMID:26486724
Biogenesis of the yeast cytochrome bc1 complex.
Zara, Vincenzo; Conte, Laura; Trumpower, Bernard L
2009-01-01
The mitochondrial respiratory chain is composed of four different protein complexes that cooperate in electron transfer and proton pumping across the inner mitochondrial membrane. The cytochrome bc1 complex, or complex III, is a component of the mitochondrial respiratory chain. This review will focus on the biogenesis of the bc1 complex in the mitochondria of the yeast Saccharomyces cerevisiae. In wild type yeast mitochondrial membranes the major part of the cytochrome bc1 complex was found in association with one or two copies of the cytochrome c oxidase complex. The analysis of several yeast mutant strains in which single genes or pairs of genes encoding bc1 subunits had been deleted revealed the presence of a common set of bc1 sub-complexes. These sub-complexes are represented by the central core of the bc1 complex, consisting of cytochrome b bound to subunit 7 and subunit 8, by the two core proteins associated with each other, by the Rieske protein associated with subunit 9, and by those deriving from the unexpected interaction of each of the two core proteins with cytochrome c1. Furthermore, a higher molecular mass sub-complex is that composed of cytochrome b, cytochrome c1, core protein 1 and 2, subunit 6, subunit 7 and subunit 8. The identification and characterization of all these sub-complexes may help in defining the steps and the molecular events leading to bc1 assembly in yeast mitochondria.
Two-component signal transduction systems of Xanthomonas spp.: a lesson from genomics.
Qian, Wei; Han, Zhong-Ji; He, Chaozu
2008-02-01
The two-component signal transduction systems (TCSTSs), consisting of a histidine kinase sensor (HK) and a response regulator (RR), are the dominant molecular mechanisms by which prokaryotes sense and respond to environmental stimuli. Genomes of Xanthomonas generally contain a large repertoire of TCSTS genes (approximately 92 to 121 for each genome), which encode diverse structural groups of HKs and RRs. Among them, although a core set of 70 TCSTS genes (about two-thirds in total) which accumulates point mutations with a slow rate are shared by these genomes, the other genes, especially hybrid HKs, experienced extensive genetic recombination, including genomic rearrangement, gene duplication, addition or deletion, and fusion or fission. The recombinations potentially promote the efficiency and complexity of TCSTSs in regulating gene expression. In addition, our analysis suggests that a co-evolutionary model, rather than a selfish operon model, is the major mechanism for the maintenance and microevolution of TCSTS genes in the genomes of Xanthomonas. Genomic annotation, secondary protein structure prediction, and comparative genomic analyses of TCSTS genes reviewed here provide insights into our understanding of signal networks in these important phytopathogenic bacteria.
2012-01-01
Background Epinotia aporema (Lepidoptera: Tortricidae) is an important pest of legume crops in South America. Epinotia aporema granulovirus (EpapGV) is a baculovirus that causes a polyorganotropic infection in the host larva. Its high pathogenicity and host specificity make EpapGV an excellent candidate to be used as a biological control agent. Results The genome of Epinotia aporema granulovirus (EpapGV) was sequenced and analyzed. Its circular double-stranded DNA genome is 119,082 bp in length and codes for 133 putative genes. It contains the 31 baculovirus core genes and a set of 19 genes that are GV exclusive. Seventeen ORFs were unique to EpapGV in comparison with other baculoviruses. Of these, 16 found no homologues in GenBank, and one encoded a thymidylate kinase. Analysis of nucleotide sequence repeats revealed the presence of 16 homologous regions (hrs) interspersed throughout the genome. Each hr was characterized by the presence of 1 to 3 clustered imperfect palindromes which are similar to previously described palindromes of tortricid-specific GVs. Also, one of the hrs (hr4) has flanking sequences suggestive of a putative non-hr ori. Interestingly, two more complex hrs were found in opposite loci, dividing the circular dsDNA genome in two halves. Gene synteny maps showed the great colinearity of sequenced GVs, being EpapGV the most dissimilar as it has a 20 kb-long gene block inversion. Phylogenetic study performed with 31 core genes of 58 baculoviral genomes suggests that EpapGV is the baculovirus isolate closest to the putative common ancestor of tortricid specific betabaculoviruses. Conclusions This study, along with previous characterization of EpapGV infection, is useful for the better understanding of the pathology caused by this virus and its potential utilization as a bioinsecticide. PMID:23051685
Hofberger, Johannes A.; Ramirez, Aldana M.; van den Bergh, Erik; Zhu, Xinguang; Bouwmeester, Harro J.; Schuurink, Robert C.; Schranz, M. Eric
2015-01-01
An important component of plant evolution is the plethora of pathways producing more than 200,000 biochemically diverse specialized metabolites with pharmacological, nutritional and ecological significance. To unravel dynamics underlying metabolic diversification, it is critical to determine lineage-specific gene family expansion in a phylogenomics framework. However, robust functional annotation is often only available for core enzymes catalyzing committed reaction steps within few model systems. In a genome informatics approach, we extracted information from early-draft gene-space assemblies and non-redundant transcriptomes to identify protein families involved in isoprenoid biosynthesis. Isoprenoids comprise terpenoids with various roles in plant-environment interaction, such as pollinator attraction or pathogen defense. Combining lines of evidence provided by synteny, sequence homology and Hidden-Markov-Modelling, we screened 17 genomes including 12 major crops and found evidence for 1,904 proteins associated with terpenoid biosynthesis. Our terpenoid genes set contains evidence for 840 core terpene-synthases and 338 triterpene-specific synthases. We further identified 190 prenyltransferases, 39 isopentenyl-diphosphate isomerases as well as 278 and 219 proteins involved in mevalonate and methylerithrol pathways, respectively. Assessing the impact of gene and genome duplication to lineage-specific terpenoid pathway expansion, we illustrated key events underlying terpenoid metabolic diversification within 250 million years of flowering plant radiation. By quantifying Angiosperm-wide versatility and phylogenetic relationships of pleiotropic gene families in terpenoid modular pathways, our analysis offers significant insight into evolutionary dynamics underlying diversification of plant secondary metabolism. Furthermore, our data provide a blueprint for future efforts to identify and more rapidly clone terpenoid biosynthetic genes from any plant species. PMID:26046541
The Carnegie Protein Trap Library: A Versatile Tool for Drosophila Developmental Studies
Buszczak, Michael; Paterno, Shelley; Lighthouse, Daniel; Bachman, Julia; Planck, Jamie; Owen, Stephenie; Skora, Andrew D.; Nystul, Todd G.; Ohlstein, Benjamin; Allen, Anna; Wilhelm, James E.; Murphy, Terence D.; Levis, Robert W.; Matunis, Erika; Srivali, Nahathai; Hoskins, Roger A.; Spradling, Allan C.
2007-01-01
Metazoan physiology depends on intricate patterns of gene expression that remain poorly known. Using transposon mutagenesis in Drosophila, we constructed a library of 7404 protein trap and enhancer trap lines, the Carnegie collection, to facilitate gene expression mapping at single-cell resolution. By sequencing the genomic insertion sites, determining splicing patterns downstream of the enhanced green fluorescent protein (EGFP) exon, and analyzing expression patterns in the ovary and salivary gland, we found that 600–900 different genes are trapped in our collection. A core set of 244 lines trapped different identifiable protein isoforms, while insertions likely to act as GFP-enhancer traps were found in 256 additional genes. At least 8 novel genes were also identified. Our results demonstrate that the Carnegie collection will be useful as a discovery tool in diverse areas of cell and developmental biology and suggest new strategies for greatly increasing the coverage of the Drosophila proteome with protein trap insertions. PMID:17194782
Ceapa, Corina; Davids, Mark; Ritari, Jarmo; Lambert, Jolanda; Wels, Michiel; Douillard, François P.; Smokvina, Tamara; de Vos, Willem M.; Knol, Jan; Kleerebezem, Michiel
2016-01-01
Lactobacillus rhamnosus is a diverse Gram-positive species with strains isolated from different ecological niches. Here, we report the genome sequence analysis of 40 diverse strains of L. rhamnosus and their genomic comparison, with a focus on the variable genome. Genomic comparison of 40 L. rhamnosus strains discriminated the conserved genes (core genome) and regions of plasticity involving frequent rearrangements and horizontal transfer (variome). The L. rhamnosus core genome encompasses 2,164 genes, out of 4,711 genes in total (the pan-genome). The accessory genome is dominated by genes encoding carbohydrate transport and metabolism, extracellular polysaccharides (EPS) biosynthesis, bacteriocin production, pili production, the cas system, and the associated clustered regularly interspaced short palindromic repeat (CRISPR) loci, and more than 100 transporter functions and mobile genetic elements like phages, plasmid genes, and transposons. A clade distribution based on amino acid differences between core (shared) proteins matched with the clade distribution obtained from the presence–absence of variable genes. The phylogenetic and variome tree overlap indicated that frequent events of gene acquisition and loss dominated the evolutionary segregation of the strains within this species, which is paralleled by evolutionary diversification of core gene functions. The CRISPR-Cas system could have contributed to this evolutionary segregation. Lactobacillus rhamnosus strains contain the genetic and metabolic machinery with strain-specific gene functions required to adapt to a large range of environments. A remarkable congruency of the evolutionary relatedness of the strains’ core and variome functions, possibly favoring interspecies genetic exchanges, underlines the importance of gene-acquisition and loss within the L. rhamnosus strain diversification. PMID:27358423
Ahmed, Rina; Chang, Zisong; Younis, Abuelhassan Elshazly; Langnick, Claudia; Li, Na; Chen, Wei; Brattig, Norbert; Dieterich, Christoph
2013-01-01
Animal development is complex yet surprisingly robust. Animals may develop alternative phenotypes conditional on environmental changes. Under unfavorable conditions, Caenorhabditis elegans larvae enter the dauer stage, a developmentally arrested, long-lived, and stress-resistant state. Dauer larvae of free-living nematodes and infective larvae of parasitic nematodes share many traits including a conserved endocrine signaling module (DA/DAF-12), which is essential for the formation of dauer and infective larvae. We speculated that conserved post-transcriptional regulatory mechanism might also be involved in executing the dauer and infective larvae fate. We used an unbiased sequencing strategy to characterize the microRNA (miRNA) gene complement in C. elegans, Pristionchus pacificus, and Strongyloides ratti. Our study raised the number of described miRNA genes to 257 for C. elegans, tripled the known gene set for P. pacificus to 362 miRNAs, and is the first to describe miRNAs in a Strongyloides parasite. Moreover, we found a limited core set of 24 conserved miRNA families in all three species. Interestingly, our estimated expression fold changes between dauer versus nondauer stages and infective larvae versus free-living stages reveal that despite the speed of miRNA gene set evolution in nematodes, homologous gene families with conserved “dauer-infective” expression signatures are present. These findings suggest that common post-transcriptional regulatory mechanisms are at work and that the same miRNA families play important roles in developmental arrest and long-term survival in free-living and parasitic nematodes. PMID:23729632
Van den Bussche, Karen; De Meyer, Dorien; Van Damme, Nele; Kottner, Jan; Beeckman, Dimitri
2017-10-01
This study protocol describes the methodology for the development of a core set of outcomes and a core set of measurements for incontinence-associated dermatitis. Incontinence is a widespread disorder with an important impact on quality of life. One of the most common complications is incontinence-associated dermatitis, resulting from chemical and physical irritation of the skin barrier, triggering inflammation and skin damage. Managing incontinence-associated dermatitis is an important challenge for nurses. Several interventions have been assessed in clinical trials, but heterogeneity in study outcomes complicates the comparability and standardization. To overcome this challenge, the development of a core outcome set, a minimum set of outcomes and measurements to be assessed in clinical research, is needed. A project team, International Steering Committee and panelists will be involved to guide the development of the core outcome set. The framework of the Harmonizing Outcomes Measures for Eczema roadmap endorsed by Cochrane Skin Group Core Outcomes Set Initiative, is used to inform the project design. A systematic literature review, interviews to integrate the patients' perspective and a consensus study with healthcare researchers and providers using the Delphi procedure will be performed. The project was approved by the Ethics review Committee (April 2016). This is the first project that will identify a core outcome set of outcomes and measurements for incontinence-associated dermatitis research. A core outcome set will reduce possible reporting bias, allow results comparisons and statistical pooling across trials and strengthen evidence-based practice and decision-making. This project has been registered in the Core Outcome Measures in Effectiveness Trials (COMET) database and is part of the Cochrane Skin Group Core Outcomes Set Initiative (CSG-COUSIN). © 2016 John Wiley & Sons Ltd.
HCV core protein induces hepatic lipid accumulation by activating SREBP1 and PPAR{gamma}
DOE Office of Scientific and Technical Information (OSTI.GOV)
Kim, Kook Hwan; Hong, Sung Pyo; Kim, KyeongJin
2007-04-20
Hepatic steatosis is a common feature in patients with chronic hepatitis C virus (HCV) infection. HCV core protein plays an important role in the development of hepatic steatosis in HCV infection. Because SREBP1 (sterol regulatory element binding protein 1) and PPAR{gamma} (peroxisome proliferators-activated receptor {gamma}) are involved in the regulation of lipid metabolism of hepatocyte, we sought to determine whether HCV core protein may impair the expression and activity of SREBP1 and PPAR{gamma}. In this study, it was demonstrated that HCV core protein increases the gene expression of SREBP1 not only in Chang liver, Huh7, and HepG2 cells transiently transfectedmore » with HCV core protein expression plasmid, but also in Chang liver-core stable cells. Furthermore, HCV core protein enhanced the transcriptional activity of SREBP1. In addition, HCV core protein elevated PPAR{gamma} transcriptional activity. However, HCV core protein had no effect on PPAR{gamma} gene expression. Finally, we showed that HCV core protein stimulates the genes expression of lipogenic enzyme and fatty acid uptake associated protein. Therefore, our finding provides a new insight into the mechanism of hepatic steatosis by HCV infection.« less
Intrafocal heterogeneity of ERG protein expression and gene fusion pattern in prostate cancer.
Suh, Ja Hee; Park, Jeong Hwan; Lee, Cheol; Moon, Kyung Chul
2017-10-01
Prostate cancer is considered to be highly heterogeneous, with various morphologic features and biologic behaviors. The TMPRSS2-ERG gene fusion is the most frequently observed genetic aberration in prostate cancer. The aim of this study was to elucidate the intrafocal heterogeneity of ERG gene fusion status. ERG immunohistochemistry (IHC) was performed in samples from 168 prostate cancer patients who had undergone radical prostatectomy, and 40 cases showing ERG-positive IHC staining were selected for tissue microarray (TMA) construction. Two to six representative cores were selected from each tumor focus. In the cases with heterogeneous ERG IHC staining intensity, the areas showing different intensities were separately selected. Using the TMA blocks, IHC and fluorescence in situ hybridization (FISH) were conducted to evaluate the heterogeneity of ERG protein expression and ERG fusion gene patterns, respectively, in a single tumor focus. Heterogeneity of ERG IHC staining was defined as the simultaneous presence of negative and positive cores in the same tumor focus. Heterogeneity of ERG FISH was defined by the presence of cores with positive and negative FISH signals or cores with break-apart and interstitial deletion FISH signals in the same tumor focus. A total of 202 TMA cores were isolated from 40 ERG-positive cases. Of the 202 total cores, 19 were negative for ERG IHC staining, and 46 showed 1+, 52 showed 2+, and 85 showed 3+ ERG staining intensity. Eleven cores were negative for ERG FISH signal, 119 cores showed ERG break-apart FISH signals, and the remaining 72 cores revealed interstitial deletion. Intrafocal heterogeneity of ERG IHC staining was found in 20% (8/40) of cases, and intrafocal heterogeneity of ERG gene fusion pattern was found in 32.5% (13/40) of cases. In summary, this study showed significantly frequent intrafocal heterogeneity of ERG protein expression, gene fusion status and fusion pattern. This heterogeneity can be caused by the development of subclones during cancer progression or the intermingling of different tumors. © 2017 Wiley Periodicals, Inc.
bcgTree: automatized phylogenetic tree building from bacterial core genomes.
Ankenbrand, Markus J; Keller, Alexander
2016-10-01
The need for multi-gene analyses in scientific fields such as phylogenetics and DNA barcoding has increased in recent years. In particular, these approaches are increasingly important for differentiating bacterial species, where reliance on the standard 16S rDNA marker can result in poor resolution. Additionally, the assembly of bacterial genomes has become a standard task due to advances in next-generation sequencing technologies. We created a bioinformatic pipeline, bcgTree, which uses assembled bacterial genomes either from databases or own sequencing results from the user to reconstruct their phylogenetic history. The pipeline automatically extracts 107 essential single-copy core genes, found in a majority of bacteria, using hidden Markov models and performs a partitioned maximum-likelihood analysis. Here, we describe the workflow of bcgTree and, as a proof-of-concept, its usefulness in resolving the phylogeny of 293 publically available bacterial strains of the genus Lactobacillus. We also evaluate its performance in both low- and high-level taxonomy test sets. The tool is freely available at github ( https://github.com/iimog/bcgTree ) and our institutional homepage ( http://www.dna-analytics.biozentrum.uni-wuerzburg.de ).
GeneSCF: a real-time based functional enrichment tool with support for multiple organisms.
Subhash, Santhilal; Kanduri, Chandrasekhar
2016-09-13
High-throughput technologies such as ChIP-sequencing, RNA-sequencing, DNA sequencing and quantitative metabolomics generate a huge volume of data. Researchers often rely on functional enrichment tools to interpret the biological significance of the affected genes from these high-throughput studies. However, currently available functional enrichment tools need to be updated frequently to adapt to new entries from the functional database repositories. Hence there is a need for a simplified tool that can perform functional enrichment analysis by using updated information directly from the source databases such as KEGG, Reactome or Gene Ontology etc. In this study, we focused on designing a command-line tool called GeneSCF (Gene Set Clustering based on Functional annotations), that can predict the functionally relevant biological information for a set of genes in a real-time updated manner. It is designed to handle information from more than 4000 organisms from freely available prominent functional databases like KEGG, Reactome and Gene Ontology. We successfully employed our tool on two of published datasets to predict the biologically relevant functional information. The core features of this tool were tested on Linux machines without the need for installation of more dependencies. GeneSCF is more reliable compared to other enrichment tools because of its ability to use reference functional databases in real-time to perform enrichment analysis. It is an easy-to-integrate tool with other pipelines available for downstream analysis of high-throughput data. More importantly, GeneSCF can run multiple gene lists simultaneously on different organisms thereby saving time for the users. Since the tool is designed to be ready-to-use, there is no need for any complex compilation and installation procedures.
Nguyen, Quan; Lukowski, Samuel; Chiu, Han; Senabouth, Anne; Bruxner, Timothy; Christ, Angelika; Palpant, Nathan; Powell, Joseph
2018-05-11
Heterogeneity of cell states represented in pluripotent cultures have not been described at the transcriptional level. Since gene expression is highly heterogeneous between cells, single-cell RNA sequencing can be used to identify how individual pluripotent cells function. Here, we present results from the analysis of single-cell RNA sequencing data from 18,787 individual WTC CRISPRi human induced pluripotent stem cells. We developed an unsupervised clustering method, and through this identified four subpopulations distinguishable on the basis of their pluripotent state including: a core pluripotent population (48.3%), proliferative (47.8%), early-primed for differentiation (2.8%) and late-primed for differentiation (1.1%). For each subpopulation we were able to identify the genes and pathways that define differences in pluripotent cell states. Our method identified four discrete predictor gene sets comprised of 165 unique genes that denote the specific pluripotency states; and using these sets, we developed a multigenic machine learning prediction method to accurately classify single cells into each of the subpopulations. Compared against a set of established pluripotency markers, our method increases prediction accuracy by 10%, specificity by 20%, and explains a substantially larger proportion of deviance (up to 3-fold) from the prediction model. Finally, we developed an innovative method to predict cells transitioning between subpopulations, and support our conclusions with results from two orthogonal pseudotime trajectory methods. Published by Cold Spring Harbor Laboratory Press.
Core-core and core-valence correlation
NASA Technical Reports Server (NTRS)
Bauschlicher, Charles W., Jr.; Langhoff, Stephen R.; Taylor, Peter R.
1988-01-01
The effect of (1s) core correlation on properties and energy separations was analyzed using full configuration-interaction (FCI) calculations. The Be 1 S - 1 P, the C 3 P - 5 S and CH+ 1 Sigma + or - 1 Pi separations, and CH+ spectroscopic constants, dipole moment and 1 Sigma + - 1 Pi transition dipole moment were studied. The results of the FCI calculations are compared to those obtained using approximate methods. In addition, the generation of atomic natural orbital (ANO) basis sets, as a method for contracting a primitive basis set for both valence and core correlation, is discussed. When both core-core and core-valence correlation are included in the calculation, no suitable truncated CI approach consistently reproduces the FCI, and contraction of the basis set is very difficult. If the (nearly constant) core-core correlation is eliminated, and only the core-valence correlation is included, CASSCF/MRCI approached reproduce the FCI results and basis set contraction is significantly easier.
2018-01-01
Host responses to infection encompass many processes in addition to activation of the immune system, including metabolic adaptations, stress responses, tissue repair, and other reactions. The response to bacterial infection in Drosophila melanogaster has been classically described in studies that focused on the immune response elicited by a small set of largely avirulent microbes. Thus, we have surprisingly limited knowledge of responses to infection that are outside the canonical immune response, of how the response to pathogenic infection differs from that to avirulent bacteria, or even of how generic the response to various microbes is and what regulates that core response. In this study, we addressed these questions by profiling the D. melanogaster transcriptomic response to 10 bacteria that span the spectrum of virulence. We found that each bacterium triggers a unique transcriptional response, with distinct genes making up to one third of the response elicited by highly virulent bacteria. We also identified a core set of 252 genes that are differentially expressed in response to the majority of bacteria tested. Among these, we determined that the transcription factor CrebA is a novel regulator of infection tolerance. Knock-down of CrebA significantly increased mortality from microbial infection without any concomitant change in bacterial number. Upon infection, CrebA is upregulated by both the Toll and Imd pathways in the fat body, where it is required to induce the expression of secretory pathway genes. Loss of CrebA during infection triggered endoplasmic reticulum (ER) stress and activated the unfolded protein response (UPR), which contributed to infection-induced mortality. Altogether, our study reveals essential features of the response to bacterial infection and elucidates the function of a novel regulator of infection tolerance. PMID:29394281
Zhao, Dehua; Liu, Xiaomeng; Zhang, Bo; Xie, Jianbo; Hong, Yuanyuan; Li, Pengfei; Chen, Sanfeng; Dixon, Ray; Li, Jilun
2013-01-01
Most biological nitrogen fixation is catalyzed by molybdenum-dependent nitrogenase, an enzyme complex comprising two component proteins that contains three different metalloclusters. Diazotrophs contain a common core of nitrogen fixation nif genes that encode the structural subunits of the enzyme and components required to synthesize the metalloclusters. However, the complement of nif genes required to enable diazotrophic growth varies significantly amongst nitrogen fixing bacteria and archaea. In this study, we identified a minimal nif gene cluster consisting of nine nif genes in the genome of Paenibacillus sp. WLY78, a gram-positive, facultative anaerobe isolated from the rhizosphere of bamboo. We demonstrate that the nif genes in this organism are organized as an operon comprising nifB, nifH, nifD, nifK, nifE, nifN, nifX, hesA and nifV and that the nif cluster is under the control of a σ70 (σA)-dependent promoter located upstream of nifB. To investigate genetic requirements for diazotrophy, we transferred the Paenibacillus nif cluster to Escherichia coli. The minimal nif gene cluster enables synthesis of catalytically active nitrogenase in this host, when expressed either from the native nifB promoter or from the T7 promoter. Deletion analysis indicates that in addition to the core nif genes, hesA plays an important role in nitrogen fixation and is responsive to the availability of molybdenum. Whereas nif transcription in Paenibacillus is regulated in response to nitrogen availability and by the external oxygen concentration, transcription from the nifB promoter is constitutive in E. coli, indicating that negative regulation of nif transcription is bypassed in the heterologous host. This study demonstrates the potential for engineering nitrogen fixation in a non-nitrogen fixing organism with a minimum set of nine nif genes. PMID:24146630
Analysis of the core genome and pangenome of Pseudomonas putida.
Udaondo, Zulema; Molina, Lázaro; Segura, Ana; Duque, Estrella; Ramos, Juan L
2016-10-01
Pseudomonas putida are strict aerobes that proliferate in a range of temperate niches and are of interest for environmental applications due to their capacity to degrade pollutants and ability to promote plant growth. Furthermore solvent-tolerant strains are useful for biosynthesis of added-value chemicals. We present a comprehensive comparative analysis of nine strains and the first characterization of the Pseudomonas putida pangenome. The core genome of P. putida comprises approximately 3386 genes. The most abundant genes within the core genome are those that encode nutrient transporters. Other conserved genes include those for central carbon metabolism through the Entner-Doudoroff pathway, the pentose phosphate cycle, arginine and proline metabolism, and pathways for degradation of aromatic chemicals. Genes that encode transporters, enzymes and regulators for amino acid metabolism (synthesis and degradation) are all part of the core genome, as well as various electron transporters, which enable aerobic metabolism under different oxygen regimes. Within the core genome are 30 genes for flagella biosynthesis and 12 key genes for biofilm formation. Pseudomonas putida strains share 85% of the coding regions with Pseudomonas aeruginosa; however, in P. putida, virulence factors such as exotoxins and type III secretion systems are absent. © 2015 Society for Applied Microbiology and John Wiley & Sons Ltd.
Structure of large dsDNA viruses
Klose, Thomas; Rossmann, Michael G.
2015-01-01
Nucleocytoplasmic large dsDNA viruses (NCLDVs) encompass an ever-increasing group of large eukaryotic viruses, infecting a wide variety of organisms. The set of core genes shared by all these viruses includes a major capsid protein with a double jelly-roll fold forming an icosahedral capsid, which surrounds a double layer membrane that contains the viral genome. Furthermore, some of these viruses, such as the members of the Mimiviridae and Phycodnaviridae have a unique vertex that is used during infection to transport DNA into the host. PMID:25003382
The Comparative Toxicogenomics Database: update 2017.
Davis, Allan Peter; Grondin, Cynthia J; Johnson, Robin J; Sciaky, Daniela; King, Benjamin L; McMorran, Roy; Wiegers, Jolene; Wiegers, Thomas C; Mattingly, Carolyn J
2017-01-04
The Comparative Toxicogenomics Database (CTD; http://ctdbase.org/) provides information about interactions between chemicals and gene products, and their relationships to diseases. Core CTD content (chemical-gene, chemical-disease and gene-disease interactions manually curated from the literature) are integrated with each other as well as with select external datasets to generate expanded networks and predict novel associations. Today, core CTD includes more than 30.5 million toxicogenomic connections relating chemicals/drugs, genes/proteins, diseases, taxa, Gene Ontology (GO) annotations, pathways, and gene interaction modules. In this update, we report a 33% increase in our core data content since 2015, describe our new exposure module (that harmonizes exposure science information with core toxicogenomic data) and introduce a novel dataset of GO-disease inferences (that identify common molecular underpinnings for seemingly unrelated pathologies). These advancements centralize and contextualize real-world chemical exposures with molecular pathways to help scientists generate testable hypotheses in an effort to understand the etiology and mechanisms underlying environmentally influenced diseases. © The Author(s) 2016. Published by Oxford University Press on behalf of Nucleic Acids Research.
A Common Set of Core Values - The Foundation for a More Effective Joint Force
2015-05-18
these codes stopped short of codifying a set of core values and instead focused on right and wrong behaviors. This adherence to sets of rules and...Armed Forces independently recognized the limitations of compliance-based rules and the criticality of establishing a strong foundation with core...institutional values vice core values? The knee -jerk reaction of the 1990s and a subsequent lack of a formal effort to institute a single set of core
Core histone genes of Giardia intestinalis: genomic organization, promoter structure, and expression
Yee, Janet; Tang, Anita; Lau, Wei-Ling; Ritter, Heather; Delport, Dewald; Page, Melissa; Adam, Rodney D; Müller, Miklós; Wu, Gang
2007-01-01
Background Giardia intestinalis is a protist found in freshwaters worldwide, and is the most common cause of parasitic diarrhea in humans. The phylogenetic position of this parasite is still much debated. Histones are small, highly conserved proteins that associate tightly with DNA to form chromatin within the nucleus. There are two classes of core histone genes in higher eukaryotes: DNA replication-independent histones and DNA replication-dependent ones. Results We identified two copies each of the core histone H2a, H2b and H3 genes, and three copies of the H4 gene, at separate locations on chromosomes 3, 4 and 5 within the genome of Giardia intestinalis, but no gene encoding a H1 linker histone could be recognized. The copies of each gene share extensive DNA sequence identities throughout their coding and 5' noncoding regions, which suggests these copies have arisen from relatively recent gene duplications or gene conversions. The transcription start sites are at triplet A sequences 1–27 nucleotides upstream of the translation start codon for each gene. We determined that a 50 bp region upstream from the start of the histone H4 coding region is the minimal promoter, and a highly conserved 15 bp sequence called the histone motif (him) is essential for its activity. The Giardia core histone genes are constitutively expressed at approximately equivalent levels and their mRNAs are polyadenylated. Competition gel-shift experiments suggest that a factor within the protein complex that binds him may also be a part of the protein complexes that bind other promoter elements described previously in Giardia. Conclusion In contrast to other eukaryotes, the Giardia genome has only a single class of core histone genes that encode replication-independent histones. Our inability to locate a gene encoding the linker histone H1 leads us to speculate that the H1 protein may not be required for the compaction of Giardia's small and gene-rich genome. PMID:17425802
Gesing, Stefan; Schindler, Daniel; Nowrousian, Minou
2013-09-01
Ascomycetes differentiate four major morphological types of fruiting bodies (apothecia, perithecia, pseudothecia and cleistothecia) that are derived from an ancestral fruiting body. Thus, fruiting body differentiation is most likely controlled by a set of common core genes. One way to identify such genes is to search for genes with evolutionary conserved expression patterns. Using suppression subtractive hybridization (SSH), we selected differentially expressed transcripts in Pyronema confluens (Pezizales) by comparing two cDNA libraries specific for sexual and for vegetative development, respectively. The expression patterns of selected genes from both libraries were verified by quantitative real time PCR. Expression of several corresponding homologous genes was found to be conserved in two members of the Sordariales (Sordaria macrospora and Neurospora crassa), a derived group of ascomycetes that is only distantly related to the Pezizales. Knockout studies with N. crassa orthologues of differentially regulated genes revealed a functional role during fruiting body development for the gene NCU05079, encoding a putative MFS peptide transporter. These data indicate conserved gene expression patterns and a functional role of the corresponding genes during fruiting body development; such genes are candidates of choice for further functional analysis. © 2013 WILEY-VCH Verlag GmbH & Co. KGaA, Weinheim.
Antonysamy, Stephen; Condon, Bradley; Druzina, Zhanna; Bonanno, Jeffrey B.; Gheyi, Tarun; Zhang, Feiyu; MacEwan, Iain; Zhang, Aiping; Ashok, Sheela; Rodgers, Logan; Russell, Marijane; Gately Luz, John
2013-01-01
The enhancer-of-zeste homolog 2 (EZH2) gene product is an 87 kDa polycomb group (PcG) protein containing a C-terminal methyltransferase SET domain. EZH2, along with binding partners, i.e., EED and SUZ12, upon which it is dependent for activity forms the core of the polycomb repressive complex 2 (PRC2). PRC2 regulates gene silencing by catalyzing the methylation of histone H3 at lysine 27. Both overexpression and mutation of EZH2 are associated with the incidence and aggressiveness of various cancers. The novel crystal structure of the SET domain was determined in order to understand disease-associated EZH2 mutations and derive an explanation for its inactivity independent of complex formation. The 2.00 Å crystal structure reveals that, in its uncomplexed form, the EZH2 C-terminus folds back into the active site blocking engagement with substrate. Furthermore, the S-adenosyl-L-methionine (SAM) binding pocket observed in the crystal structure of homologous SET domains is notably absent. This suggests that a conformational change in the EZH2 SET domain, dependent upon complex formation, must take place for cofactor and substrate binding activities to be recapitulated. In addition, the data provide a structural context for clinically significant mutations found in the EZH2 SET domain. PMID:24367637
Molecular epidemiology of Epizootic haematopoietic necrosis virus (EHNV).
Hick, Paul M; Subramaniam, Kuttichantran; Thompson, Patrick M; Waltzek, Thomas B; Becker, Joy A; Whittington, Richard J
2017-11-01
Low genetic diversity of Epizootic haematopoietic necrosis virus (EHNV) was determined for the complete genome of 16 isolates spanning the natural range of hosts, geography and time since the first outbreaks of disease. Genomes ranged from 125,591-127,487 nucleotides with 97.47% pairwise identity and 106-109 genes. All isolates shared 101 core genes with 121 potential genes predicted within the pan-genome of this collection. There was high conservation within 90,181 nucleotides of the core genes with isolates separated by average genetic distance of 3.43 × 10 -4 substitutions per site. Evolutionary analysis of the core genome strongly supported historical epidemiological evidence of iatrogenic spread of EHNV to naïve hosts and establishment of endemic status in discrete ecological niches. There was no evidence of structural genome reorganization, however, the complement of non-core genes and variation in repeat elements enabled fine scale molecular epidemiological investigation of this unpredictable pathogen of fish. Copyright © 2017 Elsevier Inc. All rights reserved.
Redefinition and unification of the SXT/R391 family of integrative and conjugative elements.
Bioteau, Audrey; Durand, Romain; Burrus, Vincent
2018-04-13
Integrative and conjugative elements (ICEs) of the SXT/R391 family are key drivers of the spread of antibiotic resistance in Vibrio cholerae , the infectious agent of cholera, and other pathogenic bacteria. The SXT/R391 family of ICEs was defined based on the conservation of a core set of 52 genes and site-specific integration into the 5' end of the chromosomal gene prfC Hence, the integrase gene int has been intensively used as a marker to detect SXT/R391 ICEs in clinical isolates. ICEs sharing most core genes but differing by their integration site and integrase gene have been recently reported and excluded from the SXT/R391 family. Here we explored the prevalence and diversity of atypical ICEs in Genbank databases and their relationship with typical SXT/R391 ICEs. We found atypical ICEs in V. cholerae isolates that predate the emergence and expansion of typical SXT/R391 ICEs in the mid-1980s in seventh pandemic toxigenic V. cholerae O1 and O139 strains. Our analyses revealed that while atypical ICEs are not associated with antibiotic resistance genes, they often carry cation efflux pumps suggesting heavy metal resistance. Atypical ICEs constitute a polyphyletic group likely because of occasional recombination events with typical ICEs. Furthermore, we show that the alternative integration and excision genes of atypical ICEs remain under the control of SetCD, the main activator of the conjugative functions of SXT/R391 ICEs. Together these observations indicate that substitution of the integration/excision module and change of specificity of integration do not preclude atypical ICEs from inclusion into the SXT/R391 family. Importance Vibrio cholerae is the causative agent of cholera, an acute intestinal infection that remains to this day a world public health threat. Integrative and conjugative elements (ICEs) of the SXT/R391 family have played a major role in spreading antimicrobial resistance in seventh pandemic V. cholerae but also in several species of Enterobacteriaceae Most epidemiological surveys use the integrase gene as a marker to screen for SXT/R391 ICEs in clinical or environmental strains. With the recent reports of closely related elements that encode an alternative integrase gene, it became urgent to investigate whether ICEs that have been left out of the family are a liability for the accuracy of such screenings. In this study based on comparative genomics, we broaden the SXT/R391 family of ICEs to include atypical ICEs that are often associated with heavy metal resistance. Copyright © 2018 American Society for Microbiology.
Huang, Shih W; Lin, Li F; Chou, Lin C; Wu, Mei J; Liao, Chun D; Liou, Tsan H
2016-04-01
Previously, we reported the use of an International Classification of Functioning (ICF) core set that can provide a holistic framework for evaluating the risk factors of falls; however, data on the feasibility of applying this core set are lacking. To investigate the feasibility of applying the fall-related ICF risk-factor core set in the case of patients in an acute-rehabilitation setting. A cross-sectional and descriptive correlational design. Acute-rehabilitation ward. A total of 273 patients who experienced fall at acute-rehabilitation ward. The data on falls were collected from the hospital's Nursing Information System (NIS) and the fall-reporting system (Adverse Event Reporting System, AERS) between 2010 and 2013. The relationship of both systems to the fall-related ICF core set was analyzed to assess the feasibility of their clinical application. We evaluated the feasibility of using the fall-related ICF risk-factor core set by using the frequency and the percentage of the fall patients in of the listed categories. The fall-related ICF risk-factor core set category b735 (muscle tone functions) exhibited a high feasibility (85.95%) for clinical application, and the category b730 (muscle power functions) covered 77.11% of the patients. The feasibility of application of the category d410 (change basic body position) was also high in the case of all fall patients (81.69%). In the acute-rehabilitation setting, the feasibility of application of the fall-related ICF risk-factor core set is high. The fall-related ICF risk-factor core set can help multidisciplinary teams develop fall-prevention strategies in acute rehabilitation wards.
Olvera-Carrillo, Yadira; Van Bel, Michiel; Van Hautegem, Tom; Fendrych, Matyáš; Huysmans, Marlies; Simaskova, Maria; van Durme, Matthias; Buscaill, Pierre; Rivas, Susana; Coll, Nuria S.; Coppens, Frederik; Maere, Steven; Nowack, Moritz K.
2015-12-01
A plethora of diverse programmed cell death (PCD) processes has been described in living organisms. In animals and plants, different forms of PCD play crucial roles in development, immunity, and responses to the environment. While the molecular control of some animal PCD forms such as apoptosis is known in great detail, we still know comparatively little about the regulation of the diverse types of plant PCD. In part, this deficiency in molecular understanding is caused by the lack of reliable reporters to detect PCD processes. Here, we addressed this issue by using a combination of bioinformatics approaches to identify commonly regulated genes during diverse plant PCD processes in Arabidopsis (Arabidopsis thaliana). Our results indicate that the transcriptional signatures of developmentally controlled cell death are largely distinct from the ones associated with environmentally induced cell death. Moreover, different cases of developmental PCD share a set of cell death-associated genes. Most of these genes are evolutionary conserved within the green plant lineage, arguing for an evolutionary conserved core machinery of developmental PCD. Based on this information, we established an array of specific promoter-reporter lines for developmental PCD in Arabidopsis. These PCD indicators represent a powerful resource that can be used in addition to established morphological and biochemical methods to detect and analyze PCD processes in vivo and in planta. © 2015 American Society of Plant Biologists. All Rights Reserved.
PanACEA: a bioinformatics tool for the exploration and visualization of bacterial pan-chromosomes.
Clarke, Thomas H; Brinkac, Lauren M; Inman, Jason M; Sutton, Granger; Fouts, Derrick E
2018-06-27
Bacterial pan-genomes, comprised of conserved and variable genes across multiple sequenced bacterial genomes, allow for identification of genomic regions that are phylogenetically discriminating or functionally important. Pan-genomes consist of large amounts of data, which can restrict researchers ability to locate and analyze these regions. Multiple software packages are available to visualize pan-genomes, but currently their ability to address these concerns are limited by using only pre-computed data sets, prioritizing core over variable gene clusters, or by not accounting for pan-chromosome positioning in the viewer. We introduce PanACEA (Pan-genome Atlas with Chromosome Explorer and Analyzer), which utilizes locally-computed interactive web-pages to view ordered pan-genome data. It consists of multi-tiered, hierarchical display pages that extend from pan-chromosomes to both core and variable regions to single genes. Regions and genes are functionally annotated to allow for rapid searching and visual identification of regions of interest with the option that user-supplied genomic phylogenies and metadata can be incorporated. PanACEA's memory and time requirements are within the capacities of standard laptops. The capability of PanACEA as a research tool is demonstrated by highlighting a variable region important in differentiating strains of Enterobacter hormaechei. PanACEA can rapidly translate the results of pan-chromosome programs into an intuitive and interactive visual representation. It will empower researchers to visually explore and identify regions of the pan-chromosome that are most biologically interesting, and to obtain publication quality images of these regions.
Jacobs, Edwin H; de Vries, Taco J; Smit, August B; Schoffelmeer, Anton N M
2004-01-01
Long-term drug-induced alterations in neurotransmission within the nucleus accumbens (NAc) shell and core may underlie relapse to drug-seeking behavior and drug-taking upon re-exposure to drugs and drug-associated stimuli (cues) during abstinence. Using an open screening strategy, we recently identified 25 gene transcripts, encoding for proteins involved in neuronal functioning and structure that are down-regulated in rat NAc shell after contingent (active), but not after non-contingent (passive), heroin administration. Studying the expression of the same transcripts in the NAc core by means of quantitative PCR, we now demonstrate that most of these transcripts are up-regulated in that NAc subregion long (3 weeks) after heroin self-administration in rats. A similar up-regulation in gene expression was also apparent in the NAc core of animals with a history of non-contingent heroin administration (yoked controls). These data indicate that heroin self-administration differentially regulates genes in the NAc core as compared with the shell. Moreover, whereas cognitive processes involved in active drug self-administration (e.g., instrumental learning) seems to direct gene expression in the NAc shell, neuroplasticity in the NAc core may be due to the pharmacological effects of heroin (including Pavlovian conditioning), as expressed in rats upon contingent as well as non-contingent administration of heroin.
Identification of Reference Genes for RT-qPCR Data Normalization in Cannabis sativa Stem Tissues.
Mangeot-Peter, Lauralie; Legay, Sylvain; Hausman, Jean-Francois; Esposito, Sergio; Guerriero, Gea
2016-09-15
Gene expression profiling via quantitative real-time PCR is a robust technique widely used in the life sciences to compare gene expression patterns in, e.g., different tissues, growth conditions, or after specific treatments. In the field of plant science, real-time PCR is the gold standard to study the dynamics of gene expression and is used to validate the results generated with high throughput techniques, e.g., RNA-Seq. An accurate relative quantification of gene expression relies on the identification of appropriate reference genes, that need to be determined for each experimental set-up used and plant tissue studied. Here, we identify suitable reference genes for expression profiling in stems of textile hemp (Cannabis sativa L.), whose tissues (isolated bast fibres and core) are characterized by remarkable differences in cell wall composition. We additionally validate the reference genes by analysing the expression of putative candidates involved in the non-oxidative phase of the pentose phosphate pathway and in the first step of the shikimate pathway. The goal is to describe the possible regulation pattern of some genes involved in the provision of the precursors needed for lignin biosynthesis in the different hemp stem tissues. The results here shown are useful to design future studies focused on gene expression analyses in hemp.
Cooney, Marese; Galvin, Rose; Connolly, Elizabeth; Stokes, Emma
2013-05-01
The ICF Core Set for breast cancer was generated by international experts for women who have had surgery and radiation but it has not yet been validated. The objective of the study was to validate the ICF Core Set from the perspective of women with breast cancer. A qualitative focus group methodology was used. The sessions were transcribed verbatim. Meaning units were identified by two independent researchers. The agreed list was subsequently linked to ICF categories by two independent researchers according to pre-defined linking rules. Data saturation determined the number of focus groups conducted. Quality of the data analyses was assured by multiple coding and peer review. Thirty-four women participated in seven focus groups. A total of 1621 meaning units were identified which were linked to 74 of the existing 80 Core Set categories. Additional ICF categories not currently included in the Core Set were identified by the women. The validity of the Core Set was largely supported. However, some categories currently not covered by the ICF Core Set for Breast Cancer will need to be considered for inclusion if the Core Set is to reflect all women who have had treatment for breast cancer
Chromatin Insulators: A Role in Nuclear Organization and Gene Expression
Yang, Jingping; Corces, Victor G.
2011-01-01
Chromatin insulators are DNA-protein complexes with broad functions in nuclear biology. Based on the ability of insulator proteins to interact with each other, it was originally thought that insulators form loops that could constitute functional domains of co-regulated gene expression. Nevertheless, data from genome-wide localization studies indicate that insulator proteins can be present in intergenic regions as well as at the 5′, introns or 3′ of genes, suggesting a broader role in chromosome biology. Cells have developed mechanisms to control insulator activity by recruiting specialized proteins or by covalent modification of core components. Recent results suggest that insulators mediate intra- and inter-chromosomal interactions to affect transcription, imprinting and recombination. It is possible that these interactions set up cell-specific blueprints of nuclear organization that may contribute to the establishment of different patterns of gene expression during cell differentiation. As a consequence, disruption of insulator activity could result in the development of cancer or other disease states. PMID:21704228
Zhang, Z; Cavalier-Smith, T; Green, B R
2001-08-01
Chloroplast genes of several dinoflagellate species are located on unigenic DNA minicircular chromosomes. We have now completely sequenced five aberrant minicircular chromosomes from the dinoflagellate Heterocapsa triquetra. These probably nonfunctional DNA circles lack complete genes, with each being composed of several short fragments of two or three different chloroplast genes and a common conserved region with a tripartite 9G-9A-9G core like the putative replicon origin of functional single-gene circular chloroplast chromosomes. Their sequences imply that all five circles evolved by differential deletions and duplications from common ancestral circles bearing fragments of four genes: psbA, psbC, 16S rRNA, and 23S rRNA. It appears that recombination between separate unigenic chromosomes initially gave intermediate heterodimers, which were subsequently stabilized by deletions that included part or all of one putative replicon origin. We suggest that homologous recombination at the 9G-9A-9G core regions produced a psbA/psbC heterodimer which generated two distinct chimeric circles by differential deletions and duplications. A 23S/16S rRNA heterodimer more likely formed by illegitimate recombination between 16S and 23S rRNA genes. Homologous recombination between the 9G-9A-9G core regions of both heterodimers and additional differential deletions and duplications could then have yielded the other three circles. Near identity of the gene fragments and 9G-9A-9G cores, despite diverging adjacent regions, may be maintained by gene conversion. The conserved organization of the 9G-9A-9G cores alone favors the idea that they are replicon origins and suggests that they may enable the aberrant minicircles to parasitize the chloroplast's replication machinery as selfish circles.
Thakur, Nishant; Arguel, Marie-Jeanne; Polanowska, Jolanta; Henrissat, Bernard; Record, Eric; Magdelenat, Ghislaine; Barbe, Valérie; Raffaele, Sylvain; Barbry, Pascal
2016-01-01
Drechmeria coniospora is an obligate fungal pathogen that infects nematodes via the adhesion of specialized spores to the host cuticle. D. coniospora is frequently found associated with Caenorhabditis elegans in environmental samples. It is used in the study of the nematode’s response to fungal infection. Full understanding of this bi-partite interaction requires knowledge of the pathogen’s genome, analysis of its gene expression program and a capacity for genetic engineering. The acquisition of all three is reported here. A phylogenetic analysis placed D. coniospora close to the truffle parasite Tolypocladium ophioglossoides, and Hirsutella minnesotensis, another nematophagous fungus. Ascomycete nematopathogenicity is polyphyletic; D. coniospora represents a branch that has not been molecularly characterized. A detailed in silico functional analysis, comparing D. coniospora to 11 fungal species, revealed genes and gene families potentially involved in virulence and showed it to be a highly specialized pathogen. A targeted comparison with nematophagous fungi highlighted D. coniospora-specific genes and a core set of genes associated with nematode parasitism. A comparative gene expression analysis of samples from fungal spores and mycelia, and infected C. elegans, gave a molecular view of the different stages of the D. coniospora lifecycle. Transformation of D. coniospora allowed targeted gene knock-out and the production of fungus that expresses fluorescent reporter genes. It also permitted the initial characterisation of a potential fungal counter-defensive strategy, involving interference with a host antimicrobial mechanism. This high-quality annotated genome for D. coniospora gives insights into the evolution and virulence of nematode-destroying fungi. Coupled with genetic transformation, it opens the way for molecular dissection of D. coniospora physiology, and will allow both sides of the interaction between D. coniospora and C. elegans, as well as the evolutionary arms race that exists between pathogen and host, to be studied. PMID:27153332
Epigenetic regulation of female puberty.
Lomniczi, Alejandro; Wright, Hollis; Ojeda, Sergio R
2015-01-01
Substantial progress has been made in recent years toward deciphering the molecular and genetic underpinnings of the pubertal process. The availability of powerful new methods to interrogate the human genome has led to the identification of genes that are essential for puberty to occur. Evidence has also emerged suggesting that the initiation of puberty requires the coordinated activity of gene sets organized into functional networks. At a cellular level, it is currently thought that loss of transsynaptic inhibition, accompanied by an increase in excitatory inputs, results in the pubertal activation of GnRH release. This concept notwithstanding, a mechanism of epigenetic repression targeting genes required for the pubertal activation of GnRH neurons was recently identified as a core component of the molecular machinery underlying the central restraint of puberty. In this chapter we will discuss the potential contribution of various mechanisms of epigenetic regulation to the hypothalamic control of female puberty. Copyright © 2014 Elsevier Inc. All rights reserved.
Haack, Frederike S.; Poehlein, Anja; Kröger, Cathrin; Voigt, Christian A.; Piepenbring, Meike; Bode, Helge B.; Daniel, Rolf; Schäfer, Wilhelm; Streit, Wolfgang R.
2016-01-01
Janthinobacterium and Duganella are well-known for their antifungal effects. Surprisingly, almost nothing is known on molecular aspects involved in the close bacterium-fungus interaction. To better understand this interaction, we established the genomes of 11 Janthinobacterium and Duganella isolates in combination with phylogenetic and functional analyses of all publicly available genomes. Thereby, we identified a core and pan genome of 1058 and 23,628 genes. All strains encoded secondary metabolite gene clusters and chitinases, both possibly involved in fungal growth suppression. All but one strain carried a single gene cluster involved in the biosynthesis of alpha-hydroxyketone-like autoinducer molecules, designated JAI-1. Genome-wide RNA-seq studies employing the background of two isolates and the corresponding JAI-1 deficient strains identified a set of 45 QS-regulated genes in both isolates. Most regulated genes are characterized by a conserved sequence motif within the promoter region. Among the most strongly regulated genes were secondary metabolite and type VI secretion system gene clusters. Most intriguing, co-incubation studies of J. sp. HH102 or its corresponding JAI-1 synthase deletion mutant with the plant pathogen Fusarium graminearum provided first evidence of a QS-dependent interaction with this pathogen. PMID:27833590
Sperschneider, Jana; Gardiner, Donald M.; Thatcher, Louise F.; Lyons, Rebecca; Singh, Karam B.; Manners, John M.; Taylor, Jennifer M.
2015-01-01
Pathogens and hosts are in an ongoing arms race and genes involved in host–pathogen interactions are likely to undergo diversifying selection. Fusarium plant pathogens have evolved diverse infection strategies, but how they interact with their hosts in the biotrophic infection stage remains puzzling. To address this, we analyzed the genomes of three Fusarium plant pathogens for genes that are under diversifying selection. We found a two-speed genome structure both on the chromosome and gene group level. Diversifying selection acts strongly on the dispensable chromosomes in Fusarium oxysporum f. sp. lycopersici and on distinct core chromosome regions in Fusarium graminearum, all of which have associations with virulence. Members of two gene groups evolve rapidly, namely those that encode proteins with an N-terminal [SG]-P-C-[KR]-P sequence motif and proteins that are conserved predominantly in pathogens. Specifically, 29 F. graminearum genes are rapidly evolving, in planta induced and encode secreted proteins, strongly pointing toward effector function. In summary, diversifying selection in Fusarium is strongly reflected as genomic footprints and can be used to predict a small gene set likely to be involved in host–pathogen interactions for experimental verification. PMID:25994930
PLGA/polymeric liposome for targeted drug and gene co-delivery.
Wang, Hanjie; Zhao, Peiqi; Su, Wenya; Wang, Sheng; Liao, Zhenyu; Niu, Ruifang; Chang, Jin
2010-11-01
Chemotherapy is one of the most effective approaches to treat cancers in the clinic, but the problems, such as multidrug resistance (MDR), low bioavailability and toxicity, severely constrain the further application of chemotherapy. Our group recently reported that cationic PLGA/folate coated PEGlated polymeric liposome core-shell nanoparticles (PLGA/FPL NPs). It was self-assembled from a hydrophobic PLGA core and a hydrophilic folate coated PEGlated lipid shell for targeting co-delivery of drug and gene. Hydrophobic drugs can be incorporated into the core and the cationic shell of the drug-loaded nanoparticles can be used to bind DNA. The drug-loaded PLGA/FPL NPs/DNA complexes offer advantages to overcome these problems mentioned above, such as co-delivery of drugs and DNA to improving the chemosensitivity of cancer cells at a gene level, and targeting delivery of drug to the cancer tissue that enhance the bioavailability and reduce the toxicity. The experiment showed that nanoparticles have core-shell structure with nanosize, sustained drug release profile and good DNA-binding ability. Importantly, the core-shell nanoparticles achieve the possibility of co-delivering drugs and genes to the same cells with high gene transfection and drug delivery efficiency. Our data suggest that the PLGA/FPL NPs may be a useful drug and gene co-delivery system. Copyright © 2010 Elsevier Ltd. All rights reserved.
Structures and Boolean Dynamics in Gene Regulatory Networks
NASA Astrophysics Data System (ADS)
Szedlak, Anthony
This dissertation discusses the topological and dynamical properties of GRNs in cancer, and is divided into four main chapters. First, the basic tools of modern complex network theory are introduced. These traditional tools as well as those developed by myself (set efficiency, interset efficiency, and nested communities) are crucial for understanding the intricate topological properties of GRNs, and later chapters recall these concepts. Second, the biology of gene regulation is discussed, and a method for disease-specific GRN reconstruction developed by our collaboration is presented. This complements the traditional exhaustive experimental approach of building GRNs edge-by-edge by quickly inferring the existence of as of yet undiscovered edges using correlations across sets of gene expression data. This method also provides insight into the distribution of common mutations across GRNs. Third, I demonstrate that the structures present in these reconstructed networks are strongly related to the evolutionary histories of their constituent genes. Investigation of how the forces of evolution shaped the topology of GRNs in multicellular organisms by growing outward from a core of ancient, conserved genes can shed light upon the ''reverse evolution'' of normal cells into unicellular-like cancer states. Next, I simulate the dynamics of the GRNs of cancer cells using the Hopfield model, an infinite range spin-glass model designed with the ability to encode Boolean data as attractor states. This attractor-driven approach facilitates the integration of gene expression data into predictive mathematical models. Perturbations representing therapeutic interventions are applied to sets of genes, and the resulting deviations from their attractor states are recorded, suggesting new potential drug targets for experimentation. Finally, I extend the Hopfield model to modular networks, cyclic attractors, and complex attractors, and apply these concepts to simulations of the cell cycle process. Futher development of these and other theoretical and computational tools is necessary to analyze the deluge of experimental data produced by modern and future biological high throughput methods. (Abstract shortened by ProQuest.).
Three-dimensional coil inductor
Bernhardt, Anthony F.; Malba, Vincent
2002-01-01
A three-dimensional coil inductor is disclosed. The inductor includes a substrate; a set of lower electrically conductive traces positioned on the substrate; a core placed over the lower traces; a set of side electrically conductive traces laid on the core and the lower traces; and a set of upper electrically conductive traces attached to the side traces so as to form the inductor. Fabrication of the inductor includes the steps of forming a set of lower traces on a substrate; positioning a core over the lower traces; forming a set of side traces on the core; connecting the side traces to the lower traces; forming a set of upper traces on the core; and connecting the upper traces to the side traces so as to form a coil structure.
Progress on core outcome sets for critical care research.
Blackwood, Bronagh; Marshall, John; Rose, Louise
2015-10-01
Appropriate selection and definition of outcome measures are essential for clinical trials to be maximally informative. Core outcome sets (an agreed, standardized collection of outcomes measured and reported in all trials for a specific clinical area) were developed due to established inconsistencies in trial outcome selection. This review discusses the rationale for, and methods of, core outcome set development, as well as current initiatives in critical care. Recent systematic reviews of reported outcomes and measurement instruments relevant to the critically ill highlight inconsistencies in outcome selection, definition, and measurement, thus establishing the need for core outcome sets. Current critical care initiatives include development of core outcome sets for trials aimed at reducing mechanical ventilation duration; rehabilitation following critical illness; long-term outcomes in acute respiratory failure; and epidemic and pandemic studies of severe acute respiratory infection. Development and utilization of core outcome sets for studies relevant to the critically ill is in its infancy compared to other specialties. Notwithstanding, core outcome set development frameworks and guidelines are available, several sets are in various stages of development, and there is strong support from international investigator-led collaborations including the International Forum for Acute Care Trialists.
Jiang, Qiao; Zhao, Li; Dai, Junbiao; Wu, Qingyu
2012-01-01
Background Microalgae, with the ability to mitigate CO2 emission and produce carbohydrates and lipids, are considered one of the most promising resources for producing bioenergy. Recently, we discovered that autophagy plays a critical role in the metabolism of photosynthetic system and lipids production. So far, more than 30-autophagy related (ATG) genes in all subtypes of autophagy have been identified. However, compared with yeast and mammals, in silico and experimental research of autophagy pathways in microalgae remained limited and fragmentary. Principal Findings In this article, we performed a genome-wide analysis of ATG genes in 7 microalgae species and explored their distributions, domain structures and evolution. Eighteen “core autophagy machinery” proteins, four mammalian-specific ATG proteins and more than 30 additional proteins (including “receptor-adaptor” complexes) in all subtypes of autophagy were analyzed. Data revealed that receptor proteins in cytoplasm-to-vacuole targeting and mitophagy seem to be absent in microalgae. However, most of the “core autophagy machinery” and mammalian-specific proteins are conserved among microalgae, except for the ATG9-cycling system in Chlamydomonas reinhardtii and the second ubiquitin-like protein conjugation complex in several algal species. The catalytic and binding residues in ATG3, ATG5, ATG7, ATG8, ATG10 and ATG12 are also conserved and the phylogenetic tree of ATG8 coincides well with the phylogenies. Chlorella contains the entire set of the core autophagy machinery. In addition, RT-PCR analysis verified that all crucial ATG genes tested are expressed during autophagy in both Chlorella and Chlamydomonas reinhardtii. Finally, we discovered that addition of 3-Methyladenine (a PI3K specific inhibitor) could suppress the formation of autophagic vacuoles in Chlorella. Conclusions Taken together, Chlorella may represent a potential model organism to investigate autophagy pathways in photosynthetic eukaryotes. The study will not only promote understanding of the general features of autophagic pathways, but also benefit the production of Chlorella-derived biofuel with future commercial applications. PMID:22848622
A Novel Protective Vaccine Antigen from the Core Escherichia coli Genome
Moriel, Danilo G.; Tan, Lendl; Goh, Kelvin G. K.; Ipe, Deepak S.; Lo, Alvin W.; Peters, Kate M.
2016-01-01
ABSTRACT Escherichia coli is a versatile pathogen capable of causing intestinal and extraintestinal infections that result in a huge burden of global human disease. The diversity of E. coli is reflected by its multiple different pathotypes and mosaic genome composition. E. coli strains are also a major driver of antibiotic resistance, emphasizing the urgent need for new treatment and prevention measures. Here, we used a large data set comprising 1,700 draft and complete genomes to define the core and accessory genome of E. coli and demonstrated the overlapping relationship between strains from different pathotypes. In combination with proteomic investigation, this analysis revealed core genes that encode surface-exposed or secreted proteins that represent potential broad-coverage vaccine antigens. One of these antigens, YncE, was characterized as a conserved immunogenic antigen able to protect against acute systemic infection in mice after vaccination. Overall, this work provides a genomic blueprint for future analyses of conserved and accessory E. coli genes. The work also identified YncE as a novel antigen that could be exploited in the development of a vaccine against all pathogenic E. coli strains—an important direction given the high global incidence of infections caused by multidrug-resistant strains for which there are few effective antibiotics. IMPORTANCE E. coli is a multifaceted pathogen of major significance to global human health and an important contributor to increasing antibiotic resistance. Given the paucity of therapies still effective against multidrug-resistant pathogenic E. coli strains, novel treatment and prevention strategies are urgently required. In this study, we defined the core and accessory components of the E. coli genome by examining a large collection of draft and completely sequenced strains available from public databases. This data set was mined by employing a reverse-vaccinology approach in combination with proteomics to identify putative broadly protective vaccine antigens. One such antigen was identified that was highly immunogenic and induced protection in a mouse model of bacteremia. Overall, our study provides a genomic and proteomic framework for the selection of novel vaccine antigens that could mediate broad protection against pathogenic E. coli. PMID:27904885
Wolf, Yuri I; Makarova, Kira S; Yutin, Natalya; Koonin, Eugene V
2012-12-14
Collections of Clusters of Orthologous Genes (COGs) provide indispensable tools for comparative genomic analysis, evolutionary reconstruction and functional annotation of new genomes. Initially, COGs were made for all complete genomes of cellular life forms that were available at the time. However, with the accumulation of thousands of complete genomes, construction of a comprehensive COG set has become extremely computationally demanding and prone to error propagation, necessitating the switch to taxon-specific COG collections. Previously, we reported the collection of COGs for 41 genomes of Archaea (arCOGs). Here we present a major update of the arCOGs and describe evolutionary reconstructions to reveal general trends in the evolution of Archaea. The updated version of the arCOG database incorporates 91% of the pangenome of 120 archaea (251,032 protein-coding genes altogether) into 10,335 arCOGs. Using this new set of arCOGs, we performed maximum likelihood reconstruction of the genome content of archaeal ancestral forms and gene gain and loss events in archaeal evolution. This reconstruction shows that the last Common Ancestor of the extant Archaea was an organism of greater complexity than most of the extant archaea, probably with over 2,500 protein-coding genes. The subsequent evolution of almost all archaeal lineages was apparently dominated by gene loss resulting in genome streamlining. Overall, in the evolution of Archaea as well as a representative set of bacteria that was similarly analyzed for comparison, gene losses are estimated to outnumber gene gains at least 4 to 1. Analysis of specific patterns of gene gain in Archaea shows that, although some groups, in particular Halobacteria, acquire substantially more genes than others, on the whole, gene exchange between major groups of Archaea appears to be largely random, with no major 'highways' of horizontal gene transfer. The updated collection of arCOGs is expected to become a key resource for comparative genomics, evolutionary reconstruction and functional annotation of new archaeal genomes. Given that, in spite of the major increase in the number of genomes, the conserved core of archaeal genes appears to be stabilizing, the major evolutionary trends revealed here have a chance to stand the test of time. This article was reviewed by (for complete reviews see the Reviewers' Reports section): Dr. PLG, Prof. PF, Dr. PL (nominated by Prof. JPG).
Shimada, Nao; Maruo, Toshinari; Maeda, Mineko; Urushihara, Hideko; Kawata, Takefumi
2005-02-01
Dd-STATa, a Dictyostelium homolog of the metazoan STAT (signal transducers and activators of transcription) proteins, is necessary in the slug for correct entry into culmination. Dd-STATa-null mutant fails to culminate and its phenotype correlates with the loss of a funnel-shaped core region, the pstAB core region, which expresses both the ecmA and ecmB genes. To understand how the differentiation of pstAB core cells is regulated, we identified an EST that is expressed in the core cells of normal slugs but down-regulated in the Dd-STATa-null mutant. This EST, SSK348, encodes a close homolog of the Dictyostelium acetyl-CoA synthetase (ACS). A promoter fragment of the cognate gene, aslA (acetyl-CoA synthetase-like A), was fused to a lacZ reporter and the expression pattern determined. As expected from the behavior of the endogenous aslA gene, the aslA::lacZ fusion gene is not expressed in Dd-STATa-null slugs. In parental cells, the aslA promoter is first activated in the funnel-shaped core cells located at the slug anterior, the "pstAB core." During culmination, the pstAB core cells move down, through the prespore cells, to form the inner part of the basal disc. As the spore mass climbs the stalk, the aslA gene comes to be expressed in cells of the upper and lower cups, structures that cradle the spore head. Deletion and point mutation analyses of the promoter identified an AT-rich sequence that is necessary for expression in the pstAB core. This acts in combination with repressor regions that prevent ectopic aslA expression in the pre-stalk regions of slugs and the stalks of culminants. Thus, this study confirms that Dd-STATa is necessary for the differentiation of pstAB core cells, by showing that it is needed for the activation of the aslA gene. It also identifies aslA promoter elements that are likely to be regulated, directly or indirectly, by Dd-STATa.
Chen, Ming; Henry, Nathan; Almsaeed, Abdullah; Zhou, Xiao; Wegrzyn, Jill; Ficklin, Stephen
2017-01-01
Abstract Tripal is an open source software package for developing biological databases with a focus on genetic and genomic data. It consists of a set of core modules that deliver essential functions for loading and displaying data records and associated attributes including organisms, sequence features and genetic markers. Beyond the core modules, community members are encouraged to contribute extension modules to build on the Tripal core and to customize Tripal for individual community needs. To expand the utility of the Tripal software system, particularly for RNASeq data, we developed two new extension modules. Tripal Elasticsearch enables fast, scalable searching of the entire content of a Tripal site as well as the construction of customized advanced searches of specific data types. We demonstrate the use of this module for searching assembled transcripts by functional annotation. A second module, Tripal Analysis Expression, houses and displays records from gene expression assays such as RNA sequencing. This includes biological source materials (biomaterials), gene expression values and protocols used to generate the data. In the case of an RNASeq experiment, this would reflect the individual organisms and tissues used to produce sequencing libraries, the normalized gene expression values derived from the RNASeq data analysis and a description of the software or code used to generate the expression values. The module will load data from common flat file formats including standard NCBI Biosample XML. Data loading, display options and other configurations can be controlled by authorized users in the Drupal administrative backend. Both modules are open source, include usage documentation, and can be found in the Tripal organization’s GitHub repository. Database URL: Tripal Elasticsearch module: https://github.com/tripal/tripal_elasticsearch Tripal Analysis Expression module: https://github.com/tripal/tripal_analysis_expression PMID:29220446
Molecular Evolution of Phosphoprotein Phosphatases in Drosophila
Miskei, Márton; Ádám, Csaba; Kovács, László; Karányi, Zsolt; Dombrádi, Viktor
2011-01-01
Phosphoprotein phosphatases (PPP), these ancient and important regulatory enzymes are present in all eukaryotic organisms. Based on the genome sequences of 12 Drosophila species we traced the evolution of the PPP catalytic subunits and noted a substantial expansion of the gene family. We concluded that the 18–22 PPP genes of Drosophilidae were generated from a core set of 8 indispensable phosphatases that are present in most of the insects. Retropositons followed by tandem gene duplications extended the phosphatase repertoire, and sporadic gene losses contributed to the species specific variations in the PPP complement. During the course of these studies we identified 5, up till now uncharacterized phosphatase retrogenes: PpY+, PpD5+, PpD6+, Pp4+, and Pp6+ which are found only in some ancient Drosophila. We demonstrated that all of these new PPP genes exhibit a distinct male specific expression. In addition to the changes in gene numbers, the intron-exon structure and the chromosomal localization of several PPP genes was also altered during evolution. The G−C content of the coding regions decreased when a gene moved into the heterochromatic region of chromosome Y. Thus the PPP enzymes exemplify the various types of dynamic rearrangements that accompany the molecular evolution of a gene family in Drosophilidae. PMID:21789237
Sun, Zhengda; Wang, Chih-Yang; Lawson, Devon A; Kwek, Serena; Velozo, Hugo Gonzalez; Owyong, Mark; Lai, Ming-Derg; Fong, Lawrence; Wilson, Mark; Su, Hua; Werb, Zena; Cooke, Daniel L
2018-02-16
Tumor endothelial cells (TEC) play an indispensible role in tumor growth and metastasis although much of the detailed mechanism still remains elusive. In this study we characterized and compared the global gene expression profiles of TECs and control ECs isolated from human breast cancerous tissues and reduction mammoplasty tissues respectively by single cell RNA sequencing (scRNA-seq). Based on the qualified scRNA-seq libraries that we made, we found that 1302 genes were differentially expressed between these two EC phenotypes. Both principal component analysis (PCA) and heat map-based hierarchical clustering separated the cancerous versus control ECs as two distinctive clusters, and MetaCore disease biomarker analysis indicated that these differentially expressed genes are highly correlated with breast neoplasm diseases. Gene Set Enrichment Analysis software (GSEA) enriched these genes to extracellular matrix (ECM) signal pathways and highlighted 127 ECM-associated genes. External validation verified some of these ECM-associated genes are not only generally overexpressed in various cancer tissues but also specifically overexpressed in colorectal cancer ECs and lymphoma ECs. In conclusion, our data demonstrated that ECM-associated genes play pivotal roles in breast cancer EC biology and some of them could serve as potential TEC biomarkers for various cancers.
Szijártó, Valéria; Pal, Tibor; Nagy, Gabor; Nagy, Eszter; Ghazawi, Akela; al-Haj, Mohammed; El Kurdi, Sylvia; Sonnevend, Agnes
2012-07-01
The clone Escherichia coli O25 ST131, typically producing extended-spectrum beta-lactamases (ESBLs), has spread globally and became the dominant type among extraintestinal isolates at many parts of the world. However, the reasons behind the emergence and success of this clone are only partially understood. We compared the core type genes by PCR of ESBL-producing and ESBL-nonproducing strains isolated from urinary tract infections in the United Arab Emirates and found a surprisingly high frequency of the K-12 core type (44.6%) among members of the former group, while in the latter one, it was as low (3.7%), as reported earlier. The high figure was almost entirely attributable to the presence of members of the clone O25 ST131 among ESBL producers. Strains from the same clone isolated in Europe also carried the K-12 core type genes. Sequencing the entire core operon of an O25 ST131 isolate revealed a high level of similarity to known K-12 core gene sequences and an almost complete identity with a recently sequenced non-O25 ST131 fecal isolate. The exact chemical structure and whether and how this unusual core type contributed to the sudden emergence of ST131 require further investigations. © 2012 Federation of European Microbiological Societies. Published by Blackwell Publishing Ltd. All rights reserved.
Comparative Genomics of Bacteriophage of the Genus Seuratvirus
Sazinas, Pavelas; Redgwell, Tamsin; Rihtman, Branko; Grigonyte, Aurelija; Michniewski, Slawomir; Scanlan, David J; Hobman, Jon
2018-01-01
Abstract Despite being more abundant and having smaller genomes than their bacterial host, relatively few bacteriophages have had their genomes sequenced. Here, we isolated 14 bacteriophages from cattle slurry and performed de novo genome sequencing, assembly, and annotation. The commonly used marker genes polB and terL showed these bacteriophages to be closely related to members of the genus Seuratvirus. We performed a core-gene analysis using the 14 new and four closely related genomes. A total of 58 core genes were identified, the majority of which has no known function. These genes were used to construct a core-gene phylogeny, the results of which confirmed the new isolates to be part of the genus Seuratvirus and expanded the number of species within this genus to four. All bacteriophages within the genus contained the genes queCDE encoding enzymes involved in queuosine biosynthesis. We suggest these genes are carried as a mechanism to modify DNA in order to protect these bacteriophages against host endonucleases. PMID:29272407
Conserved Curvature of RNA Polymerase I Core Promoter Beyond rRNA Genes: The Case of the Tritryps
Smircich, Pablo; Duhagon, María Ana; Garat, Beatriz
2015-01-01
In trypanosomatids, the RNA polymerase I (RNAPI)-dependent promoters controlling the ribosomal RNA (rRNA) genes have been well identified. Although the RNAPI transcription machinery recognizes the DNA conformation instead of the DNA sequence of promoters, no conformational study has been reported for these promoters. Here we present the in silico analysis of the intrinsic DNA curvature of the rRNA gene core promoters in Trypanosoma brucei, Trypanosoma cruzi, and Leishmania major. We found that, in spite of the absence of sequence conservation, these promoters hold conformational properties similar to other eukaryotic rRNA promoters. Our results also indicated that the intrinsic DNA curvature pattern is conserved within the Leishmania genus and also among strains of T. cruzi and T. brucei. Furthermore, we analyzed the impact of point mutations on the intrinsic curvature and their impact on the promoter activity. Furthermore, we found that the core promoters of protein-coding genes transcribed by RNAPI in T. brucei show the same conserved conformational characteristics. Overall, our results indicate that DNA intrinsic curvature of the rRNA gene core promoters is conserved in these ancient eukaryotes and such conserved curvature might be a requirement of RNAPI machinery for transcription of not only rRNA genes but also protein-coding genes. PMID:26718450
Lin, Y-N; Chang, K-H; Lin, C-Y; Hsu, M-I; Chen, H-C; Chen, H-H; Liou, T-H
2014-04-01
The International Classification of Functioning, Disability, and Health (ICF) provides a framework for measuring functioning and disability based on a biopsychosocial model. The aim of this study was to develop comprehensive and brief ICF core sets for morbid obesity for disability assessment in Taiwan. Observational Other Twenty-nine multidisciplinary experts of ICF METHODS: The questionnaire contained 112 obesity-relevant and second-level ICF categories. Using a 5-point Likert scale, the participants rated the significance of the effects of each category on the heath status of people with obesity. Correlation between an individual's score and the average score of the group indicated consensus. The categories were selected for the comprehensive core set for obesity if more than 50% of the experts rated them as "important" in the third round of the Delphi exercise, and for the brief core set if more than 80% of the experts rated them "very important." Twenty-nine experts participated in the study. These included 18 physicians, 4 dieticians, 3 physical therapists, 2 nurses, and 2 ICF experts. The comprehensive core set for morbid obesity contained 61 categories. Of these, 26 categories were from the component body function, 8 were from body structure, 18 were from activities and participation, and 9 were from environmental factors. The brief core set for obesity disability contained 29 categories. Of these, 19 categories were from the component body function, 3 were from body structure, 6 were from activities and participation, and one was from environmental factors. The comprehensive and brief ICF core sets provide comprehensive information on the health effects of morbid obesity and concise information for clinical practice. Comprehensive and brief core sets were created after three rounds of Delphi technique. Further validation study of these core sets by applying to patients with morbid obesity is needed. The comprehensive ICF core set for morbid obesity provides comprehensive information on the health effects of morbid obesity; the brief core set can provide concise information for clinical practice.
Kirschneck, M; Legner, R; Armbrust, W; Nowak, D; Cieza, A
2015-04-01
Social-medical expert reports from the German statutory pension insurance are essential for the German statutory pension regulatory authority to decide whether to grant services regarding participation as well as retirement pensions due to incapacity to work.The objective of this investigation is to determine whether the ICF Core Sets and other international approaches, such as the EUMASS Core Sets or ICF Core Set for vocational rehabilitation cover the content of the social-medical expert reports as well as to propose an approach how the ICF can be economically used by the social medicine practitioner when writing a social-medical expert report. A retrospective quantitative study design was used to translate a total of 294 social-medical expert reports from patients with low back pain (LBP) or chronic widespread pain (CWP) into the language of the ICF (linking) by 2 independent health professionals and compare the results with the ICF Core Sets for specific health conditions and other international approaches. The content of social-medical expert reports was largely reflected by the condition specific brief ICF Core Sets, brief ICF Core Sets for vocational rehabilitation and EUMASS Core Sets. The weighted Kappa statistic for the agreement between the 2 health professionals who translated the expert reports were in CWP 0.69 with a bootstrapped confidence interval of 0.67-0.71 and in LBP 0.73 (0.71-0.74). The analyses show that the content of social-medical expert reports varies enormously. A combination of a condition specific brief ICF Core Set as well as vocational rehabilitation and EUMASS ICF Core Sets as well as all ICF-categories from the expert reports that were named at least in 50% of it can largely provide a basis for preparing expert reports. Within the scope of implementation the need for a specific ICF Core Set for expert reports of the German statutory pension insurance should be further analyzed and discussed. © Georg Thieme Verlag KG Stuttgart · New York.
Opanasopit, Praneet; Leksantikul, Lalita; Niyomtham, Nattisa; Rojanarata, Theerasak; Ngawhirunpat, Tanasait; Yingyongnarongkul, Boon-Ek
2017-05-01
Cationic niosomes formulated from Span 20, cholesterol (Chol) and novel spermine-based cationic lipids of multiple central core structures (di(oxyethyl)amino, di(oxyethyl)amino carboxy, 3-amino-1,2-dioxypropyl and 2-amino-1,3-dioxypropyl) were successfully prepared for improving transfection efficiency in vitro. The niosomes composed of spermine cationic lipid with central core structure of di(oxyethyl)amino revealed the highest gene transfection efficiency. To investigate the factors affecting gene transfection and cell viability including differences in the central core structures of cationic lipids, the composition of vesicles, molar ratio of cationic lipids in formulations and the weight ratio of niosomes to DNA. Cationic niosomes composed of nonionic surfactants (Span20), cholesterol and spermine-based cationic lipids of multiple central core structures were formulated. Gene transfection and cell viability were evaluated on a human cervical carcinoma cell line (HeLa cells) using pDNA encoding green fluorescent protein (pEGFP-C2). The morphology, size and charge were also characterized. High transfection efficiency was obtained from cationic niosomes composed of Span20:Chol:cationic lipid at the molar ratio of 2.5:2.5:0.5 mM. Cationic lipids with di(oxyethyl)amino as a central core structure exhibited highest transfection efficiency. In addition, there was also no serum effect on transfection efficiency. These novel cationic niosomes may constitute a good alternative carrier for gene transfection.
Markett, Sebastian; de Reus, Marcel A; Reuter, Martin; Montag, Christian; Weber, Bernd; Schoene-Bake, Jan-Christoph; van den Heuvel, Martijn P
2017-03-01
The rich club comprises a densely mutually connected set of hub regions in the brain, thought to serve as a processing and integration core. We assessed the impact of normal variation of the tryptophane hydroxylase 2 gene's promotor region (TPH2 rs4570625) on structural connectivity of the rich club pathways by means of a candidate gene association design. Tryptophane hydroxylase 2 (TPH2) is a rate-limiting enzyme in the biosynthesis of serotonin and is known to inhibit, in addition to its role as a trans-synaptic messenger, axonal and dendritic growth. The TPH2 T-variant has been associated with reduced mRNA expression and reduced serotonin levels, which may particularly influence the development of macroscale anatomical connectivity. Here, we show larger mean connectivity in the rich club in carriers of the T-variant, suggesting potential effects of upregulation of neural connectivity growth in this central core system. In addition, by edge-removal statistics, we show that the TPH2-associated higher levels of rich club connectivity are of importance for the functioning of the total structural network. The observed association is speculated to result from an effect of serotonin levels on brain development, potentially leading to stronger structural connectivity in heavily interconnected hubs. © The Author 2016. Published by Oxford University Press. All rights reserved. For Permissions, please e-mail: journals.permissions@oup.com.
Sved, J A; Yu, H; Dominiak, B; Gilchrist, A S
2003-02-01
Long-range dispersal of a species may involve either a single long-distance movement from a core population or spreading via unobserved intermediate populations. Where the new populations originate as small propagules, genetic drift may be extreme and gene frequency or assignment methods may not prove useful in determining the relation between the core population and outbreak samples. We describe computationally simple resampling methods for use in this situation to distinguish between the different modes of dispersal. First, estimates of heterozygosity can be used to test for direct sampling from the core population and to estimate the effective size of intermediate populations. Second, a test of sharing of alleles, particularly rare alleles, can show whether outbreaks are related to each other rather than arriving as independent samples from the core population. The shared-allele statistic also serves as a genetic distance measure that is appropriate for small samples. These methods were applied to data on a fruit fly pest species, Bactrocera tryoni, which is quarantined from some horticultural areas in Australia. We concluded that the outbreaks in the quarantine zone came from a heterogeneous set of genetically differentiated populations, possibly ones that overwinter in the vicinity of the quarantine zone.
Core signaling pathways in human pancreatic cancers revealed by global genomic analyses.
Jones, Siân; Zhang, Xiaosong; Parsons, D Williams; Lin, Jimmy Cheng-Ho; Leary, Rebecca J; Angenendt, Philipp; Mankoo, Parminder; Carter, Hannah; Kamiyama, Hirohiko; Jimeno, Antonio; Hong, Seung-Mo; Fu, Baojin; Lin, Ming-Tseh; Calhoun, Eric S; Kamiyama, Mihoko; Walter, Kimberly; Nikolskaya, Tatiana; Nikolsky, Yuri; Hartigan, James; Smith, Douglas R; Hidalgo, Manuel; Leach, Steven D; Klein, Alison P; Jaffee, Elizabeth M; Goggins, Michael; Maitra, Anirban; Iacobuzio-Donahue, Christine; Eshleman, James R; Kern, Scott E; Hruban, Ralph H; Karchin, Rachel; Papadopoulos, Nickolas; Parmigiani, Giovanni; Vogelstein, Bert; Velculescu, Victor E; Kinzler, Kenneth W
2008-09-26
There are currently few therapeutic options for patients with pancreatic cancer, and new insights into the pathogenesis of this lethal disease are urgently needed. Toward this end, we performed a comprehensive genetic analysis of 24 pancreatic cancers. We first determined the sequences of 23,219 transcripts, representing 20,661 protein-coding genes, in these samples. Then, we searched for homozygous deletions and amplifications in the tumor DNA by using microarrays containing probes for approximately 10(6) single-nucleotide polymorphisms. We found that pancreatic cancers contain an average of 63 genetic alterations, the majority of which are point mutations. These alterations defined a core set of 12 cellular signaling pathways and processes that were each genetically altered in 67 to 100% of the tumors. Analysis of these tumors' transcriptomes with next-generation sequencing-by-synthesis technologies provided independent evidence for the importance of these pathways and processes. Our data indicate that genetically altered core pathways and regulatory processes only become evident once the coding regions of the genome are analyzed in depth. Dysregulation of these core pathways and processes through mutation can explain the major features of pancreatic tumorigenesis.
Gu, Liqiang; Yu, Jun; Wang, Qing; Xu, Bin; Ji, Liechen; Yu, Lin; Zhang, Xipeng; Cai, Hui
2018-05-03
The present study aimed to investigate potential prognostic long noncoding RNAs (lncRNAs) associated with colorectal cancer (CRC). An mRNA‑seq dataset obtained from The Cancer Genome Atlas was employed to identify the differentially expressed lncRNAs (DELs) between CRC patients with good and poor prognoses. Subsequently, univariate and multivariate Cox regression analyses were conducted to analyze the prognosis‑associated lncRNAs among all DELs. In addition, a risk scoring system was developed according to the expression levels of the prognostic lncRNAs, which was then applied to a training set and an independent testing set. Furthermore, the co‑expressed genes of prognostic lncRNAs were screened using a Multi‑Experiment Matrix online tool for construction of lncRNA‑gene networks. Finally, Kyoto Encyclopedia of Genes and Genomes pathway and Gene Ontology (GO) function enrichment analyses were performed on genes in the lncRNA‑gene networks using KOBAS, GOATOOLS and ClusterProfiler. The present study identified 82 DELs, of which long intergenic nonprotein coding RNA 2159, RP11‑452L6.6, RP11‑894P9.1 and RP11‑69M1.6, and whey acidic protein four‑disulfide core domain 21 (WFDC21P) were reported to be independently associated with the prognosis of patients with CRC. A 5‑lncRNA signature‑based risk scoring system was developed, which may be used to classify patients into low‑ and high‑risk groups with significantly different recurrence‑free survival times in the training and testing sets (P<0.05). Co‑expressed genes of WFDC21P or RP11‑69M1.6 were utilized to construct the lncRNA‑gene networks. Genes in the networks were significantly enriched in 'tight junction', 'focal adhesion' and 'regulation of actin cytoskeleton' pathways, and numerous GO terms associated with 'reactive oxygen species metabolism' and 'nitric oxide metabolism'. The present study proposed a 5‑lncRNA signature‑based risk scoring system for predicting the prognosis of patients with CRC, and revealed the associated signaling pathways and biological processes. The results of the present study may help improve prognostic evaluation in clinical practice.
Zhong, Jinshun; Kellogg, Elizabeth A
2015-08-01
• CYCLOIDEA2 (CYC2)-like and RADIALIS (RAD)-like genes are needed for the normal development of corolla bilateral symmetry in Antirrhinum majus L. (snapdragon, Plantaginaceae, Lamiales). However, if and how changes in expression of CYC2-like and RAD-like genes correlate with the origin of corolla bilateral symmetry early in Lamiales remains largely unknown. The asymmetrical expression of CYC2-like and/or RAD-like genes during floral meristem development could be ancestral or derived in Plantaginaceae.• We used in situ RNA localization to examine the expression of CYC2-like and RAD-like genes in two early-diverging Lamiales.• CYC2-like and RAD-like genes are expressed broadly in the floral meristems in early-diverging Lamiales with radially symmetrical corollas, in contrast to their restricted expression in adaxial/lateral regions in core Lamiales. The expression pattern of CYC2-like genes has evolved in stepwise fashion, in that CYC2-like genes are likely expressed briefly in the floral meristem during flower development in sampled Oleaceae; prolonged expression of CYC2-like genes in petals originated in the common ancestor of Tetrachondraceae and core Lamiales, and asymmetrical expression in adaxial/lateral petals appeared later, in the common ancestor of the core Lamiales. Likewise, expression of RAD-like genes in petals appeared in early-diverging Lamiales or earlier; asymmetrical expression in adaxial/lateral petals then appeared in core Lamiales.• These data plus published reports of CYC2-like and RAD-like genes show that asymmetrical expression of these two genes is likely derived and correlates with the origins of corolla bilateral symmetry. © 2015 Botanical Society of America, Inc.
Cai, Xiaojun; Jin, Rongrong; Wang, Jiali; Yue, Dong; Jiang, Qian; Wu, Yao; Gu, Zhongwei
2016-03-09
Polymeric vectors have shown great promise in the development of safe and efficient gene delivery systems; however, only a few have been developed in clinical settings due to poor transport across multiple physiological barriers. To address this issue and promote clinical translocation of polymeric vectors, a new type of polymeric vector, bioreducible fluorinated peptide dendrimers (BFPDs), was designed and synthesized by reversible cross-linking of fluorinated low generation peptide dendrimers. Through masterly integration all of the features of reversible cross-linking, fluorination, and polyhedral oligomeric silsesquioxane (POSS) core-based peptide dendrimers, this novel vector exhibited lots of unique features, including (i) inactive surface to resist protein interactions; (ii) virus-mimicking surface topography to augment cellular uptake; (iii) fluorination-mediated efficient cellular uptake, endosome escape, cytoplasm trafficking, and nuclear entry, and (iv) disulfide-cleavage-mediated polyplex disassembly and DNA release that allows efficient DNA transcription. Noteworthy, all of these features are functionally important and can synergistically facilitate DNA transport from solution to the nucleus. As a consequences, BFPDs showed excellent gene transfection efficiency in several cell lines (∼95% in HEK293 cells) and superior biocompatibility compared with polyethylenimine (PEI). Meanwhile BFPDs provided excellent serum resistance in gene delivery. More importantly, BFPDs offer considerable in vivo gene transfection efficiency (in muscular tissues and in HepG2 tumor xenografts), which was approximately 77-fold higher than that of PEI in luciferase activity. These results suggest bioreducible fluorinated peptide dendrimers are a new class of highly efficient and safe gene delivery vectors and should be used in clinical settings.
Zhang, Sheng-Jia; Zou, Ming; Lu, Li; Lau, David; Ditzel, Désirée A. W.; Delucinge-Vivier, Celine; Aso, Yoshinori; Descombes, Patrick; Bading, Hilmar
2009-01-01
Synaptic activity can boost neuroprotection through a mechanism that requires synapse-to-nucleus communication and calcium signals in the cell nucleus. Here we show that in hippocampal neurons nuclear calcium is one of the most potent signals in neuronal gene expression. The induction or repression of 185 neuronal activity-regulated genes is dependent upon nuclear calcium signaling. The nuclear calcium-regulated gene pool contains a genomic program that mediates synaptic activity-induced, acquired neuroprotection. The core set of neuroprotective genes consists of 9 principal components, termed Activity-regulated Inhibitor of Death (AID) genes, and includes Atf3, Btg2, GADD45β, GADD45γ, Inhibin β-A, Interferon activated gene 202B, Npas4, Nr4a1, and Serpinb2, which strongly promote survival of cultured hippocampal neurons. Several AID genes provide neuroprotection through a common process that renders mitochondria more resistant to cellular stress and toxic insults. Stereotaxic delivery of AID gene-expressing recombinant adeno-associated viruses to the hippocampus confers protection in vivo against seizure-induced brain damage. Thus, treatments that enhance nuclear calcium signaling or supplement AID genes represent novel therapies to combat neurodegenerative conditions and neuronal cell loss caused by synaptic dysfunction, which may be accompanied by a deregulation of calcium signal initiation and/or propagation to the cell nucleus. PMID:19680447
Adaptation to climate through flowering phenology: a case study in Medicago truncatula.
Burgarella, Concetta; Chantret, Nathalie; Gay, Laurène; Prosperi, Jean-Marie; Bonhomme, Maxime; Tiffin, Peter; Young, Nevin D; Ronfort, Joelle
2016-07-01
Local climatic conditions likely constitute an important selective pressure on genes underlying important fitness-related traits such as flowering time, and in many species, flowering phenology and climatic gradients strongly covary. To test whether climate shapes the genetic variation on flowering time genes and to identify candidate flowering genes involved in the adaptation to environmental heterogeneity, we used a large Medicago truncatula core collection to examine the association between nucleotide polymorphisms at 224 candidate genes and both climate variables and flowering phenotypes. Unlike genome-wide studies, candidate gene approaches are expected to enrich for the number of meaningful trait associations because they specifically target genes that are known to affect the trait of interest. We found that flowering time mediates adaptation to climatic conditions mainly by variation at genes located upstream in the flowering pathways, close to the environmental stimuli. Variables related to the annual precipitation regime reflected selective constraints on flowering time genes better than the other variables tested (temperature, altitude, latitude or longitude). By comparing phenotype and climate associations, we identified 12 flowering genes as the most promising candidates responsible for phenological adaptation to climate. Four of these genes were located in the known flowering time QTL region on chromosome 7. However, climate and flowering associations also highlighted largely distinct gene sets, suggesting different genetic architectures for adaptation to climate and flowering onset. © 2016 John Wiley & Sons Ltd.
Developing core outcome measurement sets for clinical trials: OMERACT filter 2.0.
Boers, Maarten; Kirwan, John R; Wells, George; Beaton, Dorcas; Gossec, Laure; d'Agostino, Maria-Antonietta; Conaghan, Philip G; Bingham, Clifton O; Brooks, Peter; Landewé, Robert; March, Lyn; Simon, Lee S; Singh, Jasvinder A; Strand, Vibeke; Tugwell, Peter
2014-07-01
Lack of standardization of outcome measures limits the usefulness of clinical trial evidence to inform health care decisions. This can be addressed by agreeing on a minimum core set of outcome measures per health condition, containing measures relevant to patients and decision makers. Since 1992, the Outcome Measures in Rheumatology (OMERACT) consensus initiative has successfully developed core sets for many rheumatologic conditions, actively involving patients since 2002. Its expanding scope required an explicit formulation of its underlying conceptual framework and process. Literature searches and iterative consensus process (surveys and group meetings) of stakeholders including patients, health professionals, and methodologists within and outside rheumatology. To comprehensively sample patient-centered and intervention-specific outcomes, a framework emerged that comprises three core "Areas," namely Death, Life Impact, and Pathophysiological Manifestations; and one strongly recommended Resource Use. Through literature review and consensus process, core set development for any specific health condition starts by identifying at least one core "Domain" within each of the Areas to formulate the "Core Domain Set." Next, at least one applicable measurement instrument for each core Domain is identified to formulate a "Core Outcome Measurement Set." Each instrument must prove to be truthful (valid), discriminative, and feasible. In 2012, 96% of the voting participants (n=125) at the OMERACT 11 consensus conference endorsed this model and process. The OMERACT Filter 2.0 explicitly describes a comprehensive conceptual framework and a recommended process to develop core outcome measurement sets for rheumatology likely to be useful as a template in other areas of health care. Copyright © 2014 The Authors. Published by Elsevier Inc. All rights reserved.
Woznica, Arielle; Haeussler, Maximilian; Starobinska, Ella; Jemmett, Jessica; Li, Younan; Mount, David; Davidson, Brad
2012-08-01
The complex, partially redundant gene regulatory architecture underlying vertebrate heart formation has been difficult to characterize. Here, we dissect the primary cardiac gene regulatory network in the invertebrate chordate, Ciona intestinalis. The Ciona heart progenitor lineage is first specified by Fibroblast Growth Factor/Map Kinase (FGF/MapK) activation of the transcription factor Ets1/2 (Ets). Through microarray analysis of sorted heart progenitor cells, we identified the complete set of primary genes upregulated by FGF/Ets shortly after heart progenitor emergence. Combinatorial sequence analysis of these co-regulated genes generated a hypothetical regulatory code consisting of Ets binding sites associated with a specific co-motif, ATTA. Through extensive reporter analysis, we confirmed the functional importance of the ATTA co-motif in primary heart progenitor gene regulation. We then used the Ets/ATTA combination motif to successfully predict a number of additional heart progenitor gene regulatory elements, including an intronic element driving expression of the core conserved cardiac transcription factor, GATAa. This work significantly advances our understanding of the Ciona heart gene network. Furthermore, this work has begun to elucidate the precise regulatory architecture underlying the conserved, primary role of FGF/Ets in chordate heart lineage specification. Copyright © 2012 Elsevier Inc. All rights reserved.
Microarray Analysis of Differential Gene Expression Profile Between Human Fetal and Adult Heart.
Geng, Zhimin; Wang, Jue; Pan, Lulu; Li, Ming; Zhang, Jitai; Cai, Xueli; Chu, Maoping
2017-04-01
Although many changes have been discovered during heart maturation, the genetic mechanisms involved in the changes between immature and mature myocardium have only been partially elucidated. Here, gene expression profile changed between the human fetal and adult heart was characterized. A human microarray was applied to define the gene expression signatures of the fetal (13-17 weeks of gestation, n = 4) and adult hearts (30-40 years old, n = 4). Gene ontology analyses, pathway analyses, gene set enrichment analyses, and signal transduction network were performed to predict the function of the differentially expressed genes. Ten mRNAs were confirmed by quantificational real-time polymerase chain reaction. 5547 mRNAs were found to be significantly differentially expressed. "Cell cycle" was the most enriched pathway in the down-regulated genes. EFGR, IGF1R, and ITGB1 play a central role in the regulation of heart development. EGFR, IGF1R, and FGFR2 were the core genes regulating cardiac cell proliferation. The quantificational real-time polymerase chain reaction results were concordant with the microarray data. Our data identified the transcriptional regulation of heart development in the second trimester and the potential regulators that play a prominent role in the regulation of heart development and cardiac cells proliferation.
Reverse engineering and analysis of large genome-scale gene networks
Aluru, Maneesha; Zola, Jaroslaw; Nettleton, Dan; Aluru, Srinivas
2013-01-01
Reverse engineering the whole-genome networks of complex multicellular organisms continues to remain a challenge. While simpler models easily scale to large number of genes and gene expression datasets, more accurate models are compute intensive limiting their scale of applicability. To enable fast and accurate reconstruction of large networks, we developed Tool for Inferring Network of Genes (TINGe), a parallel mutual information (MI)-based program. The novel features of our approach include: (i) B-spline-based formulation for linear-time computation of MI, (ii) a novel algorithm for direct permutation testing and (iii) development of parallel algorithms to reduce run-time and facilitate construction of large networks. We assess the quality of our method by comparison with ARACNe (Algorithm for the Reconstruction of Accurate Cellular Networks) and GeneNet and demonstrate its unique capability by reverse engineering the whole-genome network of Arabidopsis thaliana from 3137 Affymetrix ATH1 GeneChips in just 9 min on a 1024-core cluster. We further report on the development of a new software Gene Network Analyzer (GeNA) for extracting context-specific subnetworks from a given set of seed genes. Using TINGe and GeNA, we performed analysis of 241 Arabidopsis AraCyc 8.0 pathways, and the results are made available through the web. PMID:23042249
Rodriguez-R, Luis M; Gunturu, Santosh; Harvey, William T; Rosselló-Mora, Ramon; Tiedje, James M; Cole, James R; Konstantinidis, Konstantinos T
2018-06-14
The small subunit ribosomal RNA gene (16S rRNA) has been successfully used to catalogue and study the diversity of prokaryotic species and communities but it offers limited resolution at the species and finer levels, and cannot represent the whole-genome diversity and fluidity. To overcome these limitations, we introduced the Microbial Genomes Atlas (MiGA), a webserver that allows the classification of an unknown query genomic sequence, complete or partial, against all taxonomically classified taxa with available genome sequences, as well as comparisons to other related genomes including uncultivated ones, based on the genome-aggregate Average Nucleotide and Amino Acid Identity (ANI/AAI) concepts. MiGA integrates best practices in sequence quality trimming and assembly and allows input to be raw reads or assemblies from isolate genomes, single-cell sequences, and metagenome-assembled genomes (MAGs). Further, MiGA can take as input hundreds of closely related genomes of the same or closely related species (a so-called 'Clade Project') to assess their gene content diversity and evolutionary relationships, and calculate important clade properties such as the pangenome and core gene sets. Therefore, MiGA is expected to facilitate a range of genome-based taxonomic and diversity studies, and quality assessment across environmental and clinical settings. MiGA is available at http://microbial-genomes.org/.
Candidate gene database and transcript map for peach, a model species for fruit trees.
Horn, Renate; Lecouls, Anne-Claire; Callahan, Ann; Dandekar, Abhaya; Garay, Lilibeth; McCord, Per; Howad, Werner; Chan, Helen; Verde, Ignazio; Main, Doreen; Jung, Sook; Georgi, Laura; Forrest, Sam; Mook, Jennifer; Zhebentyayeva, Tatyana; Yu, Yeisoo; Kim, Hye Ran; Jesudurai, Christopher; Sosinski, Bryon; Arús, Pere; Baird, Vance; Parfitt, Dan; Reighard, Gregory; Scorza, Ralph; Tomkins, Jeffrey; Wing, Rod; Abbott, Albert Glenn
2005-05-01
Peach (Prunus persica) is a model species for the Rosaceae, which includes a number of economically important fruit tree species. To develop an extensive Prunus expressed sequence tag (EST) database for identifying and cloning the genes important to fruit and tree development, we generated 9,984 high-quality ESTs from a peach cDNA library of developing fruit mesocarp. After assembly and annotation, a putative peach unigene set consisting of 3,842 ESTs was defined. Gene ontology (GO) classification was assigned based on the annotation of the single "best hit" match against the Swiss-Prot database. No significant homology could be found in the GenBank nr databases for 24.3% of the sequences. Using core markers from the general Prunus genetic map, we anchored bacterial artificial chromosome (BAC) clones on the genetic map, thereby providing a framework for the construction of a physical and transcript map. A transcript map was developed by hybridizing 1,236 ESTs from the putative peach unigene set and an additional 68 peach cDNA clones against the peach BAC library. Hybridizing ESTs to genetically anchored BACs immediately localized 11.2% of the ESTs on the genetic map. ESTs showed a clustering of expressed genes in defined regions of the linkage groups. [The data were built into a regularly updated Genome Database for Rosaceae (GDR), available at (http://www.genome.clemson.edu/gdr/).].
NASA Astrophysics Data System (ADS)
Stolzenburg, U.; Lux, T.
2011-12-01
Processes of social opinion formation might be dominated by a set of closely connected agents who constitute the cohesive `core' of a network and have a higher influence on the overall outcome of the process than those agents in the more sparsely connected `periphery'. Here we explore whether such a perspective could shed light on the dynamics of a well known economic sentiment index. To this end, we hypothesize that the respondents of the survey under investigation form a core-periphery network, and we identify those agents that define the core (in a discrete setting) or the proximity of each agent to the core (in a continuous setting). As it turns out, there is significant correlation between the so identified cores of different survey questions. Both the discrete and the continuous cores allow an almost perfect replication of the original series with a reduced data set of core members or weighted entries according to core proximity. Using a monthly time series on industrial production in Germany, we also compared experts' predictions with the real economic development. The core members identified in the discrete setting showed significantly better prediction capabilities than those agents assigned to the periphery of the network.
Sabree, Zakee L; Hansen, Allison K; Moran, Nancy A
2012-01-01
Starting in 2003, numerous studies using culture-independent methodologies to characterize the gut microbiota of honey bees have retrieved a consistent and distinctive set of eight bacterial species, based on near identity of the 16S rRNA gene sequences. A recent study [Mattila HR, Rios D, Walker-Sperling VE, Roeselers G, Newton ILG (2012) Characterization of the active microbiotas associated with honey bees reveals healthier and broader communities when colonies are genetically diverse. PLoS ONE 7(3): e32962], using pyrosequencing of the V1-V2 hypervariable region of the 16S rRNA gene, reported finding entirely novel bacterial species in honey bee guts, and used taxonomic assignments from these reads to predict metabolic activities based on known metabolisms of cultivable species. To better understand this discrepancy, we analyzed the Mattila et al. pyrotag dataset. In contrast to the conclusions of Mattila et al., we found that the large majority of pyrotag sequences belonged to clusters for which representative sequences were identical to sequences from previously identified core species of the bee microbiota. On average, they represent 95% of the bacteria in each worker bee in the Mattila et al. dataset, a slightly lower value than that found in other studies. Some colonies contain small proportions of other bacteria, mostly species of Enterobacteriaceae. Reanalysis of the Mattila et al. dataset also did not support a relationship between abundances of Bifidobacterium and of putative pathogens or a significant difference in gut communities between colonies from queens that were singly or multiply mated. Additionally, consistent with previous studies, the dataset supports the occurrence of considerable strain variation within core species, even within single colonies. The roles of these bacteria within bees, or the implications of the strain variation, are not yet clear.
Takeda, Itaru; Umemura, Myco; Koike, Hideaki; Asai, Kiyoshi; Machida, Masayuki
2014-08-01
Despite their biological importance, a significant number of genes for secondary metabolite biosynthesis (SMB) remain undetected due largely to the fact that they are highly diverse and are not expressed under a variety of cultivation conditions. Several software tools including SMURF and antiSMASH have been developed to predict fungal SMB gene clusters by finding core genes encoding polyketide synthase, nonribosomal peptide synthetase and dimethylallyltryptophan synthase as well as several others typically present in the cluster. In this work, we have devised a novel comparative genomics method to identify SMB gene clusters that is independent of motif information of the known SMB genes. The method detects SMB gene clusters by searching for a similar order of genes and their presence in nonsyntenic blocks. With this method, we were able to identify many known SMB gene clusters with the core genes in the genomic sequences of 10 filamentous fungi. Furthermore, we have also detected SMB gene clusters without core genes, including the kojic acid biosynthesis gene cluster of Aspergillus oryzae. By varying the detection parameters of the method, a significant difference in the sequence characteristics was detected between the genes residing inside the clusters and those outside the clusters. © The Author 2014. Published by Oxford University Press on behalf of Kazusa DNA Research Institute.
Standardised assessment of functioning in ADHD: consensus on the ICF Core Sets for ADHD.
Bölte, Sven; Mahdi, Soheil; Coghill, David; Gau, Susan Shur-Fen; Granlund, Mats; Holtmann, Martin; Karande, Sunil; Levy, Florence; Rohde, Luis A; Segerer, Wolfgang; de Vries, Petrus J; Selb, Melissa
2018-02-12
Attention-deficit/hyperactivity disorder (ADHD) is associated with significant impairments in social, educational, and occupational functioning, as well as specific strengths. Currently, there is no internationally accepted standard to assess the functioning of individuals with ADHD. WHO's International Classification of Functioning, Disability and Health-child and youth version (ICF) can serve as a conceptual basis for such a standard. The objective of this study is to develop a comprehensive, a common brief, and three age-appropriate brief ICF Core Sets for ADHD. Using a standardised methodology, four international preparatory studies generated 132 second-level ICF candidate categories that served as the basis for developing ADHD Core Sets. Using these categories and following an iterative consensus process, 20 ADHD experts from nine professional disciplines and representing all six WHO regions selected the most relevant categories to constitute the ADHD Core Sets. The consensus process resulted in 72 second-level ICF categories forming the comprehensive ICF Core Set-these represented 8 body functions, 35 activities and participation, and 29 environmental categories. A Common Brief Core Set that included 38 categories was also defined. Age-specific brief Core Sets included a 47 category preschool version for 0-5 years old, a 55 category school-age version for 6-16 years old, and a 52 category version for older adolescents and adults 17 years old and above. The ICF Core Sets for ADHD mark a milestone toward an internationally standardised functional assessment of ADHD across the lifespan, and across educational, administrative, clinical, and research settings.
Cadman, Cassandra S C; Toorop, Peter E; Hilhorst, Henk W M; Finch-Savage, William E
2006-06-01
Physiologically dormant seeds, like those of Arabidopsis, will cycle through dormant states as seasons change until the environment is favourable for seedling establishment. This phenomenon is widespread in the plant kingdom, but has not been studied at the molecular level. Full-genome microarrays were used for a global transcript analysis of Arabidopsis thaliana (accession Cvi) seeds in a range of dormant and dry after-ripened states during cycling. Principal component analysis of the expression patterns observed showed that they differed in newly imbibed primary dormant seeds, as commonly used in experimental studies, compared with those in the maintained primary and secondary dormant states that exist during cycling. Dormant and after-ripened seeds appear to have equally active although distinct gene expression programmes, dormant seeds having greatly reduced gene expression associated with protein synthesis, potentially controlling the completion of germination. A core set of 442 genes were identified that had higher expression in all dormant states compared with after-ripened states. Abscisic acid (ABA) responsive elements were significantly over-represented in this set of genes the expression of which was enhanced when multiple copies of the elements were present. ABA regulation of dormancy was further supported by expression patterns of key genes in ABA synthesis/catabolism, and dormancy loss in the presence of fluridone. The data support an ABA-gibberelic acid hormone balance mechanism controlling cycling through dormant states that depends on synthetic and catabolic pathways of both hormones. Many of the most highly expressed genes in dormant states were stress-related even in the absence of abiotic stress, indicating that ABA, stress and dormancy responses overlap significantly at the transcriptome level.
The reduced genomes of Parcubacteria (OD1) contain signatures of a symbiotic lifestyle
Nelson, William C.; Stegen, James C.
2015-01-01
Candidate phylum OD1 bacteria (also referred to as Parcubacteria) have been identified in a broad range of anoxic environments through community survey analysis. Although none of these species have been isolated in the laboratory, several genome sequences have been reconstructed from metagenomic sequence data and single-cell sequencing. The organisms have small (generally <1 Mb) genomes with severely reduced metabolic capabilities. We have reconstructed 8 partial to near-complete OD1 genomes from oxic groundwater samples, and compared them against existing genomic data. The conserved core gene set comprises 202 genes, or ~28% of the genomic complement. “Housekeeping” genes and genes for biosynthesis of peptidoglycan and Type IV pilus production are conserved. Gene sets for biosynthesis of cofactors, amino acids, nucleotides, and fatty acids are absent entirely or greatly reduced. The only aspects of energy metabolism conserved are the non-oxidative branch of the pentose-phosphate shunt and central glycolysis. These organisms also lack some activities conserved in almost all other known bacterial genomes, including signal recognition particle, pseudouridine synthase A, and FAD synthase. Pan-genome analysis indicates a broad genotypic diversity and perhaps a highly fluid gene complement, indicating historical adaptation to a wide range of growth environments and a high degree of specialization. The genomes were examined for signatures suggesting either a free-living, streamlined lifestyle, or a symbiotic lifestyle. The lack of biosynthetic capabilities and DNA repair, along with the presence of potential attachment and adhesion proteins suggest that the Parcubacteria are ectosymbionts or parasites of other organisms. The wide diversity of genes that potentially mediate cell-cell contact suggests a broad range of partner/prey organisms across the phylum. PMID:26257709
The reduced genomes of Parcubacteria (OD1) contain signatures of a symbiotic lifestyle
Nelson, William C.; Stegen, James C.
2015-07-21
Candidate phylum OD1 bacteria (also referred to as Parcubacteria) have been identified in a broad range of anoxic environments through community survey analysis. Although none of these species have been isolated in the laboratory, several genome sequences have been reconstructed from metagenomic sequence data and single-cell sequencing. The organisms have small (generally <1 Mb) genomes with severely reduced metabolic capabilities. We have reconstructed 8 partial to near-complete OD1 genomes from oxic groundwater samples, and compared them against existing genomic data. The conserved core gene set comprises 202 genes, or ~28% of the genomic complement. “Housekeeping” genes and genes for biosynthesismore » of peptidoglycan and Type IV pilus production are conserved. Gene sets for biosynthesis of cofactors, amino acids, nucleotides, and fatty acids are absent entirely or greatly reduced. The only aspects of energy metabolism conserved are the non-oxidative branch of the pentose-phosphate shunt and central glycolysis. These organisms also lack some activities conserved in almost all other known bacterial genomes, including signal recognition particle, pseudouridine synthase A, and FAD synthase. Pan-genome analysis indicates a broad genotypic diversity and perhaps a highly fluid gene complement, indicating historical adaptation to a wide range of growth environments and a high degree of specialization. The genomes were examined for signatures suggesting either a free-living, streamlined lifestyle, or a symbiotic lifestyle. The lack of biosynthetic capabilities and DNA repair, along with the presence of potential attachment and adhesion proteins suggest that the Parcubacteria are ectosymbionts or parasites of other organisms. The wide diversity of genes that potentially mediate cell-cell contact suggests a broad range of partner/prey organisms across the phylum.« less
The reduced genomes of Parcubacteria (OD1) contain signatures of a symbiotic lifestyle
DOE Office of Scientific and Technical Information (OSTI.GOV)
Nelson, William C.; Stegen, James C.
2015-07-21
Candidate phylum OD1 bacteria (also referred to as Parcubacteria) have been identified in broad range of anoxic environments through community survey analysis. Although none of these species have been isolated in the laboratory, several genome sequences have been reconstructed from metagenomic sequence data and single-cell sequencing. The organisms have small (generally <1 Mb) genomes with severely reduced metabolic capabilities. We have reconstructed 8 partial to near-complete OD1 genomes from oxic groundwater samples, and compared them against existing genomic data. The conserved core gene set comprises 202 genes, or ~28% of the genomic complement. ‘Housekeeping’ genes and genes for biosynthesis ofmore » peptidoglycan and Type IV pilus production are conserved. Gene sets for biosynthesis of cofactors, amino acids, nucleotides and fatty acids are absent entirely or greatly reduced. The only aspects of energy metabolism conserved are the non-oxidative branch of the pentose-phosphate shunt and central glycolysis. These organisms also lack some activities conserved in almost all other known bacterial genomes, including signal recognition particle, pseudouridine synthase A, and FAD synthase. Pan-genome analysis indicates a broad genotypic diversity and perhaps a highly fluid gene complement, indicating historical adaptation to a wide range of growth environments and a high degree of specialization. The genomes were examined for signatures suggesting either a free-living, streamlined lifestyle or a symbiotic lifestyle. The lack of biosynthetic capabilities and DNA repair, along with the presence of potential attachment and adhesion proteins suggest the Parcubacteria are ectosymbionts or parasites of other organisms. The wide diversity of genes that potentially mediate cell-cell contact suggests a broad range of partner/prey organisms across the phylum.« less
Mrkusich, Eli M; Flanagan, Dustin J; Whitington, Paul M
2011-10-01
The atypical cadherin Drosophila protein Flamingo and its vertebrate homologues play widespread roles in the regulation of both dendrite and axon growth. However, little is understood about the molecular mechanisms that underpin these functions. Whereas flamingo interacts with a well-defined group of genes in regulating planar cell polarity, previous studies have uncovered little evidence that the other core planar cell polarity genes are involved in regulation of neurite growth. We present data in this study showing that the planar cell polarity gene prickle interacts with flamingo in regulating sensory axon advance at a key choice point - the transition between the peripheral nervous system and the central nervous system. The cytoplasmic tail of the Flamingo protein is not required for this interaction. Overexpression of another core planar cell polarity gene dishevelled produces a similar phenotype to prickle mutants, suggesting that this gene may also play a role in regulation of sensory axon advance. Crown Copyright © 2011. Published by Elsevier Inc. All rights reserved.
Singh, Jasvinder A; Dowsey, Michelle M; Dohm, Michael; Goodman, Susan M; Leong, Amye L; Scholte Voshaar, Marieke M J H; Choong, Peter F
2017-11-01
Discussion and endorsement of the OMERACT total joint replacement (TJR) core domain set for total hip replacement (THR) and total knee replacement (TKR) for endstage arthritis; and next steps for selection of instruments. The OMERACT TJR working group met at the 2016 meeting at Whistler, British Columbia, Canada. We summarized the previous systematic reviews, the preliminary OMERACT TJR core domain set and results from previous surveys. We discussed preliminary core domains for TJR clinical trials, made modifications, and identified challenges with domain measurement. Working group participants (n = 26) reviewed, clarified, and endorsed each of the inner and middle circle domains and added a range of motion domain to the research agenda. TJR were limited to THR and TKR but included all endstage hip and knee arthritis refractory to medical treatment. Participants overwhelmingly endorsed identification and evaluation of top instruments mapping to the core domains (100%) and use of subscales of validated multidimensional instruments to measure core domains for the TJR clinical trial core measurement set (92%). An OMERACT core domain set for hip/knee TJR trials has been defined and we are selecting instruments to develop the TJR clinical trial core measurement set to serve as a common foundation for harmonizing measures in TJR clinical trials.
Xu, Zejun; He, Bicheng; Shen, Jie; Yang, Wantai; Yin, Meizhen
2013-05-07
Different generations of perylenediimide-cored dendrimers with peripheral amine groups were synthesized. All these water-soluble dendrimers could rapidly internalize into live cells with high efficacy of gene transfection and low cytotoxicity. Increasing dendrimer generation increased their ability for gene transfection.
Microarray gene expression profiling using core biopsies of renal neoplasia.
Rogers, Craig G; Ditlev, Jonathon A; Tan, Min-Han; Sugimura, Jun; Qian, Chao-Nan; Cooper, Jeff; Lane, Brian; Jewett, Michael A; Kahnoski, Richard J; Kort, Eric J; Teh, Bin T
2009-01-01
We investigate the feasibility of using microarray gene expression profiling technology to analyze core biopsies of renal tumors for classification of tumor histology. Core biopsies were obtained ex-vivo from 7 renal tumors-comprised of four histological subtypes-following radical nephrectomy using 18-gauge biopsy needles. RNA was isolated from these samples and, in the case of biopsy samples, amplified by in vitro transcription. Microarray analysis was then used to quantify the mRNA expression patterns in these samples relative to non-diseased renal tissue mRNA. Genes with significant variation across all non-biopsy tumor samples were identified, and the relationship between tumor and biopsy samples in terms of expression levels of these genes was then quantified in terms of Euclidean distance, and visualized by complete linkage clustering. Final pathologic assessment of kidney tumors demonstrated clear cell renal cell carcinoma (4), oncocytoma (1), angiomyolipoma (1) and adrenalcortical carcinoma (1). Five of the seven biopsy samples were most similar in terms of gene expression to the resected tumors from which they were derived in terms of Euclidean distance. All seven biopsies were assigned to the correct histological class by hierarchical clustering. We demonstrate the feasibility of gene expression profiling of core biopsies of renal tumors to classify tumor histology.
Microarray gene expression profiling using core biopsies of renal neoplasia
Rogers, Craig G.; Ditlev, Jonathon A.; Tan, Min-Han; Sugimura, Jun; Qian, Chao-Nan; Cooper, Jeff; Lane, Brian; Jewett, Michael A.; Kahnoski, Richard J.; Kort, Eric J.; Teh, Bin T.
2009-01-01
We investigate the feasibility of using microarray gene expression profiling technology to analyze core biopsies of renal tumors for classification of tumor histology. Core biopsies were obtained ex-vivo from 7 renal tumors—comprised of four histological subtypes—following radical nephrectomy using 18-gauge biopsy needles. RNA was isolated from these samples and, in the case of biopsy samples, amplified by in vitro transcription. Microarray analysis was then used to quantify the mRNA expression patterns in these samples relative to non-diseased renal tissue mRNA. Genes with significant variation across all non-biopsy tumor samples were identified, and the relationship between tumor and biopsy samples in terms of expression levels of these genes was then quantified in terms of Euclidean distance, and visualized by complete linkage clustering. Final pathologic assessment of kidney tumors demonstrated clear cell renal cell carcinoma (4), oncocytoma (1), angiomyolipoma (1) and adrenalcortical carcinoma (1). Five of the seven biopsy samples were most similar in terms of gene expression to the resected tumors from which they were derived in terms of Euclidean distance. All seven biopsies were assigned to the correct histological class by hierarchical clustering. We demonstrate the feasibility of gene expression profiling of core biopsies of renal tumors to classify tumor histology. PMID:19966938
How to apply the ICF and ICF core sets for low back pain.
Stier-Jarmer, Marita; Cieza, Alarcos; Borchers, Michael; Stucki, Gerold
2009-01-01
To introduce the International Classification of Functioning, Disability and Health (ICF) as conceptual model and classification and the ICF Core Sets as a way to specify functioning for a specific health condition such as Low Back Pain, and to illustrate the application of the ICF and ICF Core Sets in the context of clinical practice, the planning and reporting of studies and the comparison of health status measures. A decision-making and consensus process was performed to develop the ICF Core Sets for Low Back Pain, the linking procedure was applied as basis for the content comparison of health-status measures and the Rehab-Cycle was used to exemplify the application of the ICE and ICF Core Sets in clinical practice. Two different ICF Core Sets, namely, a comprehensive and a brief, are presented, three different health-status measures were linked to the ICF and compared and a case example of a patient with Low back Pain was described based on the Rehab-Cycle. The ICF is a promising new framework and classification to assess the impact of Low Back Pain. The ICF and practical tools, such as the ICF Core Sets for Low Back Pain, are useful for clinical practice, outcome and rehabilitation research, education, health statistics, and regulation.
Marques, Alda; Jácome, Cristina; Gonçalves, Ana; Silva, Sara; Lucas, Carla; Cruz, Joana; Gabriel, Raquel
2014-06-01
This study aimed to validate the Comprehensive International Classification of Functioning, Disability and Health (ICF) Core Set for obstructive pulmonary diseases (OPDs) from the perspective of patients with chronic obstructive pulmonary disease. A cross-sectional qualitative study was carried out with outpatients with chronic obstructive pulmonary disease using focus groups with an ICF-based approach. Qualitative data were analysed using the meaning condensation procedure by two researchers with expertise in the ICF. Thirty-two participants (37.5% women; 63.8 ± 11.3 years old) were included in six focus groups. A total of 61 (86%) ICF categories of the Comprehensive ICF Core Set for OPD were confirmed. Thirty-nine additional second-level categories not included in the Core Set were identified: 15 from the body functions component, four from the body structures, nine from the activities and participation and 11 from the environmental factors. The majority of the categories included in the Comprehensive ICF Core Set for OPD were confirmed from the patients' perspective. However, additional categories, not included in the Core Set, were also reported. The categories included in the Core Set were not confirmed and the additional categories need to be investigated further to develop an instrument tailored to patients' needs. This will promote patient-centred assessments and rehabilitation interventions.
Gómez-Benito, Juana; Guilera, Georgina; Barrios, Maite; Rojo, Emilio; Pino, Oscar; Gorostiaga, Arantxa; Balluerka, Nekane; Hidalgo, María Dolores; Padilla, José Luis; Benítez, Isabel; Selb, Melissa
2017-07-30
Based on the International Classification of Functioning, Disability and Health (ICF), this paper presents the results of the process to develop the Comprehensive and Brief Core Sets for schizophrenia that allow to comprehensively describe functioning in persons with schizophrenia. Twenty health professionals from diverse backgrounds participated in a formal and iterative decision-making process during an international consensus conference to develop these Core Sets. The conference was carried out based on evidence gathered from four preparatory studies (systematic literature review, qualitative study, expert survey, and empirical study). The first step of this decision-making and consensus process comprised of discussions and voting in working groups and plenary sessions to develop the comprehensive version. The categories of the Comprehensive ICF Core Set for schizophrenia served as the basis for the second step -a ranking and cutoff procedure to decide on the brief version. Of the 184 candidate categories identified in the preparatory studies, 97 categories were included in the Comprehensive Core Set for schizophrenia. A total of 25 categories were selected to constitute the Brief Core Set. The formal decision-making and consensus process integrating evidence from four preparatory studies and expert opinion led to the first version of the Core Sets for schizophrenia. Comprehensive and Brief Core Sets for schizophrenia may provide a common language among different health professionals and researchers, and a basic international standard of what to measure, report, and assess the functioning of persons with schizophrenia. Implications for rehabilitation Schizophrenia is a chronic mental disorder that has a tremendous impact on functioning and daily life of persons living with the disorder. The International Classification of Functioning, Disability and Health (ICF) offers an internationally recognized standard for describing the functioning status of these individuals. The Core Sets for schizophrenia have potential use in supporting rehabilitation practice such as for planning mental health services and other interventions or defining rehabilitation goals, and documenting patient care. The Core Sets for schizophrenia may also be used to promote interdisciplinary coordination and facilitate communication between members of a multidisciplinary rehabilitation team. Rehabilitation research is another potential area of application of the Core Sets for schizophrenia. This is valuable, since rehabilitation research provides crucial evidence for optimizing rehabilitation practice.
MiR-3613-3p affects cell proliferation and cell cycle in hepatocellular carcinoma
Zhang, Donghui; Liu, Enqin; Kang, Jian; Yang, Xin; Liu, Hong
2017-01-01
Hepatocellular carcinoma (HCC) is one of the most common types of malignant tumors with poor sensitivity to chemotherapy drugs and poor prognosis among patients. In the present study, we downloaded the original data from the Gene Expression Omnibus and compared gene expression profiles of liver cancer cells in patients with HCC with those of colon epithelial cells of healthy controls to identify differentially expressed genes (DEGs). After filtering target microRNAs (miRNA) from core DEGs, we cultured HepG2 cells in vitro, knocked down the miRNA and core mRNAs, and analyzed the effects. We found 228 differentially expressed genes between liver cancer tissue and healthy control tissue. We also integrated the protein-proteininteraction network and module analysis to screen 13 core genes, consisting of 12 up-regulated genes and 1 down-regulated gene. Five core genes were regulated hsa-miR-3613-3p, therefor we hypothesized that hsa-miR-3613-3p was a critical miRNA. After the transfection procedure, we found that changes in hsa-miR-3613-3p were the most obvious. Therefore, we speculated that hsa-miR-3613-3p was a main target miRNA. In addition, we transfected with si (BIRC5, CDK1, NUF2, ZWINT and SPC24), to target genes that can be targeted by miR-3613-3p. Our data shows that BIRC5, NUF2, and SPC24 may be promising liver cancer biomarkers that may not only predict disease occurrence but also potential personalized treatment options. PMID:29190974
Ayuso-Mateos, José L; Avila, Carolina C; Anaya, Celia; Cieza, Alarcos; Vieta, Eduard
2013-01-01
The International Classification of Functioning, Disability and Health (ICF) is a tool of the World Health Organization (WHO) designed to be a guide to identify and classify relevant domains of human experience affected by health conditions. The purpose of this article is to describe the process for the development of two Core Sets for bipolar disorder (BD) in the framework of the ICF. The Comprehensive ICF Core Set for BD intends to be a guide for multidisciplinary assessment of patients diagnosed with this condition, while the Brief ICF Core Set for BD will be useful when rating aspects of patient's experience for clinical practice or epidemiological studies. An international consensus conference involving a sample of experts with different professional backgrounds was performed using the nominal group technique. Various preparatory studies identified a set of 743 potential ICF categories to be included in the Core Sets. A total of 38 ICF categories were selected to be included in the Comprehensive Core Set for BD. A total of 19 ICF categories from the Comprehensive Core Set were chosen as the most significant to constitute the Brief Core Set for BD. The formal consensus process integrating evidence and expert opinion on the ICF led to the formal adoption of the ICF Core Sets for BD. The most important categories included are representative of the characteristics usually associated with BD. The next phase of this ICF project is to conduct a formal validation process to establish its applicability in clinical settings. Implications for Rehabilitation Bipolar disorder (BD) is a prevalent condition that has a great impact on people who suffer it, not only in health but also in daily functioning and quality of life. No standard has been defined so far regarding the problems in functioning of persons with BDs. The process described in this article defines the set of areas of functioning to be addressed in clinical assessments of persons with BD and establish the starting point for the development of condition-specific outcome measures.
Evans, Melissa; Hocking, Clare; Kersten, Paula
2017-12-01
This study aim was to evaluate whether the Extended International Classification of Functioning, Disability and Health Core Set for Stroke captured the interventions of a community stroke rehabilitation team situated in a large city in New Zealand. It was proposed that the results would identify the contribution of each discipline, and the gaps and differences in service provision to Māori and non-Māori. Applying the Extended International Classification of Functioning, Disability and Health Core Set for Stroke in this way would also inform whether this core set should be adopted in New Zealand. Interventions were retrospectively extracted from 18 medical records and linked to the International Classification of Functioning, Disability and Health and the Extended International Classification of Functioning, Disability and Health Core Set for Stroke. The frequencies of linked interventions and the health discipline providing the intervention were calculated. Analysis revealed that 98.8% of interventions provided by the rehabilitation team could be linked to the Extended International Classification of Functioning, Disability and Health Core Set for Stroke, with more interventions for body function and structure than for activities and participation; no interventions for emotional concerns; and limited interventions for community, social and civic life. Results support previous recommendations for additions to the EICSS. The results support the use of the Extended International Classification of Functioning, Disability and Health Core Set for Stroke in New Zealand and demonstrates its use as a quality assurance tool that can evaluate the scope and practice of a rehabilitation service. Implications for Rehabilitation The Extended International Classification of Functioning Disability and Health Core Set for Stroke appears to represent the stroke interventions of a community stroke rehabilitation team in New Zealand. As a result, researchers and clinicians may have increased confidence to use this core set in research and clinical practice. The Extended International Classification of Functioning Disability and Health Core Set for Stroke can be used as a quality assurance tool to establish whether a community stroke rehabilitation team is meeting the functional needs of its stroke population.
Global transcriptome analysis of eukaryotic genes affected by gromwell extract.
Bang, Soohyun; Lee, Dohyun; Kim, Hanhe; Park, Jiyong; Bahn, Yong-Sun
2014-02-01
Gromwell is known to have diverse pharmacological, cosmetic and nutritional benefits for humans. Nevertheless, the biological influence of gromwell extract (GE) on the general physiology of eukaryotic cells remains unknown. In this study a global transcriptome analysis was performed to identify genes affected by the addition of GE with Cryptococcus neoformans as the model system. In response to GE treatment, genes involved in signal transduction were immediately regulated, and the evolutionarily conserved sets of genes involved in the core cellular functions, including DNA replication, RNA transcription/processing and protein translation/processing, were generally up-regulated. In contrast, a number of genes involved in carbohydrate metabolism and transport, inorganic ion transport and metabolism, post-translational modification/protein turnover/chaperone functions and signal transduction were down-regulated. Among the GE-responsive genes that are also evolutionarily conserved in the human genome, the expression patterns of YSA1, TPO2, CFO1 and PZF1 were confirmed by northern blot analysis. Based on the functional characterization of some GE-responsive genes, it was found that GE treatment may promote cellular tolerance against a variety of environmental stresses in eukaryotes. GE treatment affects the expression levels of a significant portion of the Cryptococcus genome, implying that GE significantly affects the general physiology of eukaryotic cells. © 2013 Society of Chemical Industry.
A community effort to assess and improve drug sensitivity prediction algorithms
Costello, James C; Heiser, Laura M; Georgii, Elisabeth; Gönen, Mehmet; Menden, Michael P; Wang, Nicholas J; Bansal, Mukesh; Ammad-ud-din, Muhammad; Hintsanen, Petteri; Khan, Suleiman A; Mpindi, John-Patrick; Kallioniemi, Olli; Honkela, Antti; Aittokallio, Tero; Wennerberg, Krister; Collins, James J; Gallahan, Dan; Singer, Dinah; Saez-Rodriguez, Julio; Kaski, Samuel; Gray, Joe W; Stolovitzky, Gustavo
2015-01-01
Predicting the best treatment strategy from genomic information is a core goal of precision medicine. Here we focus on predicting drug response based on a cohort of genomic, epigenomic and proteomic profiling data sets measured in human breast cancer cell lines. Through a collaborative effort between the National Cancer Institute (NCI) and the Dialogue on Reverse Engineering Assessment and Methods (DREAM) project, we analyzed a total of 44 drug sensitivity prediction algorithms. The top-performing approaches modeled nonlinear relationships and incorporated biological pathway information. We found that gene expression microarrays consistently provided the best predictive power of the individual profiling data sets; however, performance was increased by including multiple, independent data sets. We discuss the innovations underlying the top-performing methodology, Bayesian multitask MKL, and we provide detailed descriptions of all methods. This study establishes benchmarks for drug sensitivity prediction and identifies approaches that can be leveraged for the development of new methods. PMID:24880487
A community effort to assess and improve drug sensitivity prediction algorithms.
Costello, James C; Heiser, Laura M; Georgii, Elisabeth; Gönen, Mehmet; Menden, Michael P; Wang, Nicholas J; Bansal, Mukesh; Ammad-ud-din, Muhammad; Hintsanen, Petteri; Khan, Suleiman A; Mpindi, John-Patrick; Kallioniemi, Olli; Honkela, Antti; Aittokallio, Tero; Wennerberg, Krister; Collins, James J; Gallahan, Dan; Singer, Dinah; Saez-Rodriguez, Julio; Kaski, Samuel; Gray, Joe W; Stolovitzky, Gustavo
2014-12-01
Predicting the best treatment strategy from genomic information is a core goal of precision medicine. Here we focus on predicting drug response based on a cohort of genomic, epigenomic and proteomic profiling data sets measured in human breast cancer cell lines. Through a collaborative effort between the National Cancer Institute (NCI) and the Dialogue on Reverse Engineering Assessment and Methods (DREAM) project, we analyzed a total of 44 drug sensitivity prediction algorithms. The top-performing approaches modeled nonlinear relationships and incorporated biological pathway information. We found that gene expression microarrays consistently provided the best predictive power of the individual profiling data sets; however, performance was increased by including multiple, independent data sets. We discuss the innovations underlying the top-performing methodology, Bayesian multitask MKL, and we provide detailed descriptions of all methods. This study establishes benchmarks for drug sensitivity prediction and identifies approaches that can be leveraged for the development of new methods.
Developing, implementing and disseminating a core outcome set for neonatal medicine.
Webbe, James; Brunton, Ginny; Ali, Shohaib; Duffy, James Mn; Modi, Neena; Gale, Chris
2017-01-01
In high resource settings, 1 in 10 newborn babies require admission to a neonatal unit. Research evaluating neonatal care involves recording and reporting many different outcomes and outcome measures. Such variation limits the usefulness of research as studies cannot be compared or combined. To address these limitations, we aim to develop, disseminate and implement a core outcome set for neonatal medicine. A steering group that includes parents and former patients, healthcare professionals and researchers has been formed to guide the development of the core outcome set. We will review neonatal trials systematically to identify previously reported outcomes. Additionally, we will specifically identify outcomes of importance to parents, former patients and healthcare professionals through a systematic review of qualitative studies. Outcomes identified will be entered into an international, multi-perspective eDelphi survey. All key stakeholders will be invited to participate. The Delphi method will encourage individual and group stakeholder consensus to identify a core outcome set. The core outcome set will be mapped to existing, routinely recorded data where these exist. Use of a core set will ensure outcomes of importance to key stakeholders, including former patients and parents, are recorded and reported in a standard fashion in future research. Embedding the core outcome set within future clinical studies will extend the usefulness of research to inform practice, enhance patient care and ultimately improve outcomes. Using routinely recorded electronic data will facilitate implementation with minimal addition burden. Core Outcome Measures in Effectiveness Trials (COMET) database: 842 (www.comet-initiative.org/studies/details/842).
Glässel, A; Coenen, M; Kollerits, B; Cieza, A
2014-06-01
The extended ICF Core Set for stroke is an application of the International Classification of Functioning, Disability and Health (ICF) of the World Health Organisation (WHO) with the purpose to represent the typical spectrum of functioning of persons with stroke. The objective of the study is to add evidence to the content validity of the extended ICF Core Set for stroke from persons after stroke taking into account gender perspective. A qualitative study design was conducted by using individual interviews with women and men after stroke in an in- and outpatient rehabilitation setting. The sampling followed the maximum variation strategy. Sample size was determined by saturation. Concepts from qualitative data analysis were linked to ICF categories and compared to the extended ICF Core Set for stroke. Twelve women and 12 men participated in 24 individual interviews. In total, 143 out of 166 ICF categories included in the extended ICF Core Set for stroke were confirmed (women: N.=13; men: N.=17; both genders: N.=113). Thirty-eight additional categories that are not yet included in the extended ICF Core Set for stroke were raised by women and men. This study confirms that the experience of functioning and disability after stroke shows communalities and differences for women and men. The validity of the extended ICF Core Set for stroke could be mostly confirmed, since it does not only include those areas of functioning and disability relevant to both genders but also those exclusively relevant to either women or men. Further research is needed on ICF categories not yet included in the extended ICF Core Set for stroke.
Epigenetic Regulation of Bovine Spermatogenic Cell-Specific Gene Boule
Luo, Hua; Xu, Hongtao; Pan, Zengxiang; Xie, Zhuang; Li, Qifa
2015-01-01
Non-primate mammals have two deleted azoospermia (DAZ) family genes, DAZL and Boule; genes in this family encode RNA-binding proteins essential for male fertility in diverse animals. Testicular DAZL transcription is regulated by epigenetic factors such as DNA methylation. However, nothing is known about the epigenetic regulation of Boule. Here, we explored the role of DNA methylation in the regulation of the bovine Boule (bBoule) gene. We found that a long CpG island (CGI) in the bBoule promoter was hypermethylated in the testes of cattle-yak hybrids with low bBoule expression, whereas cattle had relatively low methylation levels (P < 0.01), and there was no difference in the methylation level in the short CGI of the gene body between cattle and cattle-yak hybrids (P > 0.05). We identified a 107 bp proximal core promoter region of bBoule. Intriguingly, the differences in the methylation level between cattle and cattle-yak hybrids were larger in the core promoter than outside the core promoter. An in vitro methylation assay showed that the core promoter activity of bBoule decreased significantly after M.SssI methylase treatment (P < 0.01). We also observed dramatically increased bBoule transcription in bovine mammary epithelial cells (BMECs) after treatment with the methyltransferase inhibitor 5-Aza-dC. Taken together, our results establish that methylation status of the core promoter might be involved in testicular bBoule transcription, and may provide new insight into the epigenetic regulation of DAZ family genes and clinical insights regarding male infertility. PMID:26030766
Leung, Chi K.; Wang, Ying; Deonarine, Andrew; Tang, Lanlan; Prasse, Stephanie
2013-01-01
Negative-feedback loops between transcription factors and repressors in responses to xenobiotics, oxidants, heat, hypoxia, DNA damage, and infection have been described. Although common, the function of feedback is largely unstudied. Here, we define a negative-feedback loop between the Caenorhabditis elegans detoxification/antioxidant response factor SKN-1/Nrf and its repressor wdr-23 and investigate its function in vivo. Although SKN-1 promotes stress resistance and longevity, we find that tight regulation by WDR-23 is essential for growth and reproduction. By disabling SKN-1 transactivation of wdr-23, we reveal that feedback is required to set the balance between growth/reproduction and stress resistance/longevity. We also find that feedback is required to set the sensitivity of a core SKN-1 target gene to an electrophile. Interestingly, the effect of feedback on target gene induction is greatly reduced when the stress response is strongly activated, presumably to ensure maximum activation of cytoprotective genes during potentially fatal conditions. Our work provides a framework for understanding the function of negative feedback in inducible stress responses and demonstrates that manipulation of feedback alone can shift the balance of competing animal processes toward cell protection, health, and longevity. PMID:23836880
Chacon, Diego; Beck, Dominik; Perera, Dilmi; Wong, Jason W H; Pimanda, John E
2014-01-01
The BloodChIP database (http://www.med.unsw.edu.au/CRCWeb.nsf/page/BloodChIP) supports exploration and visualization of combinatorial transcription factor (TF) binding at a particular locus in human CD34-positive and other normal and leukaemic cells or retrieval of target gene sets for user-defined combinations of TFs across one or more cell types. Increasing numbers of genome-wide TF binding profiles are being added to public repositories, and this trend is likely to continue. For the power of these data sets to be fully harnessed by experimental scientists, there is a need for these data to be placed in context and easily accessible for downstream applications. To this end, we have built a user-friendly database that has at its core the genome-wide binding profiles of seven key haematopoietic TFs in human stem/progenitor cells. These binding profiles are compared with binding profiles in normal differentiated and leukaemic cells. We have integrated these TF binding profiles with chromatin marks and expression data in normal and leukaemic cell fractions. All queries can be exported into external sites to construct TF-gene and protein-protein networks and to evaluate the association of genes with cellular processes and tissue expression.
Buchbinder, Rachelle; Page, Matthew J; Huang, Hsiaomin; Verhagen, Arianne P; Beaton, Dorcas; Kopkow, Christian; Lenza, Mario; Jain, Nitin B; Richards, Bethan; Richards, Pamela; Voshaar, Marieke; van der Windt, Danielle; Gagnier, Joel J
2017-12-01
The Outcome Measures in Rheumatology (OMERACT) Shoulder Core Outcome Set Special Interest Group (SIG) was established to develop a core outcome set (COS) for clinical trials of shoulder disorders. In preparation for OMERACT 2016, we systematically examined all outcome domains and measurement instruments reported in 409 randomized trials of interventions for shoulder disorders published between 1954 and 2015. Informed by these data, we conducted an international Delphi consensus study including shoulder trial experts, clinicians, and patients to identify key domains that should be included in a shoulder disorder COS. Findings were discussed at a stakeholder premeeting of OMERACT. At OMERACT 2016, we sought consensus on a preliminary core domain set and input into next steps. There were 13 and 15 participants at the premeeting and the OMERACT 2016 SIG meeting, respectively (9 attended both meetings). Consensus was reached on a preliminary core domain set consisting of an inner core of 4 domains: pain, physical function/activity, global perceived effect, and adverse events including death. A middle core consisted of 3 domains: emotional well-being, sleep, and participation (recreation and work). An outer core of research required to inform the final COS was also formulated. Our next steps are to (1) analyze whether participation (recreation and work) should be in the inner core, (2) conduct a third Delphi round to finalize definitions and wording of domains and reach final endorsement for the domains, and (3) determine which instruments fulfill the OMERACT criteria for measuring each domain.
Maughan, Michele N; Dougherty, Lorna S; Preskenis, Lauren A; Ladman, Brian S; Gelb, Jack; Spackman, Erica V; Keeler, Calvin L
2013-03-23
Wild waterfowl, including ducks, represent the classic reservoir for low pathogenicity avian influenza (LPAI) viruses and play a major role in the worldwide dissemination of AIV. AIVs belonging to the hemagglutinin (H) 7 subtype are of epidemiological and economic importance due to their potential to mutate into a highly pathogenic form of the virus. Thus far, however, relatively little work has been conducted on elucidating the host-pathogen interactions of ducks and H7 LPAIVs. In the current study, three H7 LPAIVs isolated from either chicken, duck, or turkey avian species were evaluated for their comparative effect on the transcriptional innate immune response of ducks. Three H7 LPAIV isolates, chicken-origin (A/chicken/Maryland/MinhMa/2004), duck-origin (A/pintail/Minnesota/423/1999), and turkey-origin (A/turkey/Virginia/SEP-67/2002) were used to infect Pekin ducks. At 3 days post-infection, RNA from spleen tissue was used for transcriptional analysis using the Avian Innate Immune Microarray (AIIM) and quantitative real-time RT-PCR (qRT-PCR). Microarray analysis revealed that a core set of 61 genes was differentially regulated in response to all three LPAIVs. Furthermore, we observed 101, 135, and 628 differentially expressed genes unique to infection with the chicken-, duck-, or turkey-origin LPAIV isolates, respectively. qRT-PCR results revealed significant (p<0.05) induction of IL-1β, IL-2, and IFNγ transcription, with the greatest induction observed upon infection with the chicken-origin isolate. Several key innate immune pathways were activated in response to LPAIV infection including the toll-like receptor and RIG-I-like receptor pathways. Pekin ducks elicit a unique innate immune response to different species-of-origin H7 LPAIV isolates. However, twelve identifiable genes and their associated cell signaling pathways (RIG-I, NOD, TLR) are differentially expressed regardless of isolate origin. This core set of genes are critical to the duck immune response to AI. These data provide insight into the potential mechanisms employed by ducks to tolerate AI viral infection.
Hori, Motohide; Shibato, Junko; Nakamachi, Tomoya; Rakwal, Randeep; Ogawa, Tetsuo; Shioda, Seiji; Numazawa, Satoshi
2015-01-01
Toward twin goals of identifying molecular factors in brain injured by ischemic stroke, and the effects of neuropeptide pituitary adenylate-cyclase activating polypeptide (PACAP) on the ischemic brain, we have established the permanent middle cerebral artery occlusion (PMCAO) mouse model and utilized the Agilent mouse whole genome 4 × 44 K DNA chip. PACAP38 (1 pmol) injection was given intracerebroventrically in comparison to a control saline (0.9% NaCl) injection, to screen genes responsive to PACAP38. Two sets of tissues were prepared, whole hemispheres (ischemic and non-ischemic) and infract core and penumbra regions at 6 and 24 h. In this study, we have detailed the experimental design and protocol used therein and explained the quality controls for the use of total RNA in the downstream DNA microarray experiment utilizing a two-color dye-swap approach for stringent and confident gene identification published in a series of papers by Hori and coworkers (Hori et al., 2012–2015). PMID:26484166
Identification of core pathways based on attractor and crosstalk in ischemic stroke.
Diao, Xiufang; Liu, Aijuan
2018-02-01
Ischemic stroke is a leading cause of mortality and disability around the world. It is an important task to identify dysregulated pathways which infer molecular and functional insights existing in high-throughput experimental data. Gene expression profile of E-GEOD-16561 was collected. Pathways were obtained from the database of Kyoto Encyclopedia of Genes and Genomes and Retrieval of Interacting Genes was used to download protein-protein interaction sets. Attractor and crosstalk approaches were applied to screen dysregulated pathways. A total of 20 differentially expressed genes were identified in ischemic stroke. Thirty-nine significant differential pathways were identified according to P<0.01 and 28 pathways were identified with RP<0.01 and 17 pathways were identified with impact factor >250. On the basis of the three criteria, 11 significant dysfunctional pathways were identified. Among them, Epstein-Barr virus infection was the most significant differential pathway. In conclusion, with the method based on attractor and crosstalk, significantly dysfunctional pathways were identified. These pathways are expected to provide molecular mechanism of ischemic stroke and represents a novel potential therapeutic target for ischemic stroke treatment.
Price, Stephen J.
2015-01-01
Recent research on genome evolution of large DNA viruses has highlighted a number of incredibly dynamic processes that can facilitate rapid adaptation. The genomes of amphibian-like ranaviruses – double-stranded DNA viruses infecting amphibians, reptiles, and fish (family Iridoviridae) – were examined to assess variation in genome content and evolutionary processes. The viruses studied were closely related, but their genome content varied considerably, with 29 genes identified that were not present in all of the major clades. Twenty-one genes had evidence of recombination, while a virus isolated from a captive reptile appeared to be a mosaic of two divergent parents. Positive selection was also found to be acting on more than a quarter of Ranavirus genes and was found most frequently in the Spanish common midwife toad virus, which has had a severe impact on amphibian host communities. Efforts to resolve the root of this group by inclusion of an outgroup were inconclusive, but a set of core genes were identified, which recovered a well-supported species tree. PMID:27812275
Rudolf, Klaus-Dieter; Kus, Sandra; Chung, Kevin C; Johnston, Marie; LeBlanc, Monique; Cieza, Alarcos
2012-01-01
A formal decision-making and consensus process was applied to develop the first version of the International Classification on Functioning, Disability and Health (ICF) Core Sets for Hand Conditions. To convene an international panel to develop the ICF Core Sets for Hand Conditions (HC), preparatory studies were conducted, which included an expert survey, a systematic literature review, a qualitative study and an empirical data collection process involving persons with hand conditions. A consensus conference was convened in Switzerland in May 2009 that was attended by 23 healthcare professionals, who treat hand conditions, representing 22 countries. The preparatory studies identified a set of 743 ICF categories at the second, third or fourth hierarchical level. Altogether, 117 chapter-, second-, or third-level categories were included in the comprehensive ICF Core Set for HC. The brief ICF Core Set for HC included a total of 23 chapter- and second-level categories. A formal consensus process integrating evidence and expert opinion based on the ICF led to the formal adoption of the ICF Core Sets for Hand Conditions. The next phase of this ICF project is to conduct a formal validation process to establish its applicability in clinical settings.
DOE Office of Scientific and Technical Information (OSTI.GOV)
Boutros, Paul C.; Yao, Cindy Q.; Watson, John D.
2011-03-01
The dioxin congener 2,3,7,8-tetrachlorodibenzo-p-dioxin (TCDD) causes a wide range of toxic effects in rodent species, all of which are mediated by a ligand-dependent transcription-factor, the aryl hydrocarbon receptor (AHR). The Han/Wistar (Kuopio) (H/W) strain shows exceptional resistance to many TCDD-induced toxicities; the LD{sub 50} of > 9600 {mu}g/kg for H/W rats is higher than for any other wild-type mammal known. We previously showed that this resistance primarily results from H/W rats expressing a variant AHR isoform that has a substantial portion of the AHR transactivation domain deleted. Despite this large deletion, H/W rats are not entirely refractory to the effectsmore » of TCDD; the variant AHR in these animals remains fully competent to up-regulate well-known dioxin-inducible genes. TCDD-sensitive (Long-Evans, L-E) and resistant (H/W) rats were treated with either corn-oil (with or without feed-restriction) or 100 {mu}g/kg TCDD for either four or ten days. Hepatic transcriptional profiling was done using microarrays, and was validated by RT-PCR analysis of 41 genes. A core set of genes was altered in both strains at all time points tested, including CYP1A1, CYP1A2, CYP1B1, Nqo1, Aldh3a1, Tiparp, Exoc3, and Inmt. Outside this core, the strains differed significantly in the breadth of response: three-fold more genes were altered in L-E than H/W rats. At ten days almost all expressed genes were dysregulated in L-E rats, likely reflecting emerging toxic responses. Far fewer genes were affected by feed-restriction, suggesting that only a minority of the TCDD-induced changes are secondary to the wasting syndrome.« less
First principles of Hamiltonian medicine.
Crespi, Bernard; Foster, Kevin; Úbeda, Francisco
2014-05-19
We introduce the field of Hamiltonian medicine, which centres on the roles of genetic relatedness in human health and disease. Hamiltonian medicine represents the application of basic social-evolution theory, for interactions involving kinship, to core issues in medicine such as pathogens, cancer, optimal growth and mental illness. It encompasses three domains, which involve conflict and cooperation between: (i) microbes or cancer cells, within humans, (ii) genes expressed in humans, (iii) human individuals. A set of six core principles, based on these domains and their interfaces, serves to conceptually organize the field, and contextualize illustrative examples. The primary usefulness of Hamiltonian medicine is that, like Darwinian medicine more generally, it provides novel insights into what data will be productive to collect, to address important clinical and public health problems. Our synthesis of this nascent field is intended predominantly for evolutionary and behavioural biologists who aspire to address questions directly relevant to human health and disease.
Spent Fuel Test-Climax: core logging for site investigation and instrumentation
DOE Office of Scientific and Technical Information (OSTI.GOV)
Wilder, D.G.; Yow, J.L. Jr.; Thorpe, R.K.
1982-05-28
As an integral part of the Spent Fuel Test-Climax 5150 ft (1570 m) of granite core was obtained. This core was diamond drilled in various sizes, mainly 38-mm and 76-mm diameters. The core was teken with single tube core barrels and was unoriented. Techniques used to drill and log this core are discussed, as well as techniques to orient the core. Of the 5150 ft (1570 m) of core more than 3645 ft (1111 m) was retained and logged in some detail. As a result of the core logging, geologic discontinuities were identified, joint frequency and spacing characterized. Discontinuities identifiedmore » included several joint sets, shear zones and faults. Correlations based on coring along were generally found to be impossible, even for the more prominent features. The only feature properly correlated from the exploratory drilling was the fault system at the end of the facility, but it was not identified from the exploratory core as a fault. Identification of discontinuities was later helped by underground mapping that identified several different joint sets with different characteristics. It was found that joint frequency varied from 0.3 to 1.1 joint per foot of core for open fractures and from 0.3 to 3.3/ft for closed or healed fractures. Histograms of fracture spacing indicate that there is likely a random distribution of spacing superimposed upon uniformly spaced fractures. It was found that a low angle joint set had a persistent mean orientation. These joints were healed and had pervasive wall rock alteration which made identification of joints in this set possible. The recognition of a joint set with known attitude allowed orientation of much of the core. This orientation technique was found to be effective. 10 references, 25 figures, 4 tables.« less
Patel, Anamika; Vought, Valarie E; Dharmarajan, Venkatasubramanian; Cosgrove, Michael S
2011-02-04
Gene expression within the context of eukaryotic chromatin is regulated by enzymes that catalyze histone lysine methylation. Histone lysine methyltransferases that have been identified to date possess the evolutionarily conserved SET or Dot1-like domains. We previously reported the identification of a new multi-subunit histone H3 lysine 4 methyltransferase lacking homology to the SET or Dot1 family of histone lysine methyltransferases. This enzymatic activity requires a complex that includes WRAD (WDR5, RbBP5, Ash2L, and DPY-30), a complex that is part of the MLL1 (mixed lineage leukemia protein-1) core complex but that also exists independently of MLL1 in the cell. Here, we report that the minimal complex required for WRAD enzymatic activity includes WDR5, RbBP5, and Ash2L and that DPY-30, although not required for enzymatic activity, increases the histone substrate specificity of the WRAD complex. We also show that WRAD requires zinc for catalytic activity, displays Michaelis-Menten kinetics, and is inhibited by S-adenosyl-homocysteine. In addition, we demonstrate that WRAD preferentially methylates lysine 4 of histone H3 within the context of the H3/H4 tetramer but does not methylate nucleosomal histone H3 on its own. In contrast, we find that MLL1 and WRAD are required for nucleosomal histone H3 methylation, and we provide evidence suggesting that each plays distinct structural and catalytic roles in the recognition and methylation of a nucleosome substrate. Our results indicate that WRAD is a new H3K4 methyltransferase with functions that include regulating the substrate and product specificities of the MLL1 core complex.
NASA Astrophysics Data System (ADS)
Zamora, Genesis; Wang, Frederick; Sun, Chung-Ho; Trinidad, Anthony; Kwon, Young Jik; Cho, Soo Kyung; Berg, Kristian; Madsen, Steen J.; Hirschberg, Henry
2014-10-01
The overall objective of the research was to investigate the utility of photochemical internalization (PCI) for the enhanced nonviral transfection of genes into glioma cells. The PCI-mediated introduction of the tumor suppressor gene phosphatase and tensin homolog (PTEN) or the cytosine deaminase (CD) pro-drug activating gene into U87 or U251 glioma cell monolayers and multicell tumor spheroids were evaluated. In the study reported here, polyamine-DNA gene polyplexes were encapsulated in a nanoparticle (NP) with an acid degradable polyketal outer shell. These NP synthetically mimic the roles of viral capsid and envelope, which transport and release the gene, respectively. The effects of PCI-mediated suppressor and suicide genes transfection efficiency employing either "naked" polyplex cores alone or as NP-shelled cores were compared. PCI was performed with the photosensitizer AlPcS2a and λ=670-nm laser irradiance. The results clearly demonstrated that the PCI can enhance the delivery of both the PTEN or CD genes in human glioma cell monolayers and multicell tumor spheroids. The transfection efficiency, as measured by cell survival and inhibition of spheroid growth, was found to be significantly greater at suboptimal light and DNA levels for shelled NPs compared with polyamine-DNA polyplexes alone.
Discrimination learning and attentional set formation in a mouse model of Fragile X.
Casten, Kimberly S; Gray, Annette C; Burwell, Rebecca D
2011-06-01
Fragile X Syndrome is the most prevalent genetic cause of mental retardation. Selective deficits in executive function, including inhibitory control and attention, are core features of the disorder. In humans, Fragile X results from a trinucleotide repeat in the Fmr1 gene that renders it functionally silent and has been modeled in mice by targeted deletion of the Fmr1 gene. Fmr1 knockout (KO) mice recapitulate many features of Fragile X syndrome, but evidence for deficits in executive function is inconsistent. To address this issue, we trained wild-type and Fmr1 KO mice on an experimental paradigm that assesses attentional set-shifting. Mice learned to discriminate between stimuli differing in two of three perceptual dimensions. Successful discrimination required attending only to the relevant dimension, while ignoring irrelevant dimensions. Mice were trained on three discriminations in the same perceptual dimension, each followed by a reversal. This procedure normally results in the formation of an attentional set to the relevant dimension. Mice were then required to shift attention and discriminate based on a previously irrelevant perceptual dimension. Wild-type mice exhibited the increase in trials to criterion expected when shifting attention from one perceptual dimension to another. In contrast, the Fmr1 KO group failed to show the expected increase, suggesting impairment in forming an attentional set. Fmr1 KO mice also exhibited a general impairment in learning discriminations and reversals. This is the first demonstration that Fmr1 KO mice show a deficit in attentional set formation.
Papaleo, Maria Cristiana; Russo, Edda; Fondi, Marco; Emiliani, Giovanni; Frandi, Antonio; Brilli, Matteo; Pastorelli, Roberta; Fani, Renato
2009-12-01
In this work a detailed analysis of the structure, the expression and the organization of his genes belonging to the core of histidine biosynthesis (hisBHAF) in 40 newly determined and 13 available sequences of Burkholderia strains was carried out. Data obtained revealed a strong conservation of the structure and organization of these genes through the entire genus. The phylogenetic analysis showed the monophyletic origin of this gene cluster and indicated that it did not undergo horizontal gene transfer events. The analysis of the intergenic regions, based on the substitution rate, entropy plot and bendability suggested the existence of a putative transcription promoter upstream of hisB, that was supported by the genetic analysis that showed that this cluster was able to complement Escherichia colihisA, hisB, and hisF mutations. Moreover, a preliminary transcriptional analysis and the analysis of microarray data revealed that the expression of the his core was constitutive. These findings are in agreement with the fact that the entire Burkholderiahis operon is heterogeneous, in that it contains "alien" genes apparently not involved in histidine biosynthesis. Besides, they also support the idea that the proteobacterial his operon was piece-wisely assembled, i.e. through accretion of smaller units containing only some of the genes (eventually together with their own promoters) involved in this biosynthetic route. The correlation existing between the structure, organization and regulation of his "core" genes and the function(s) they perform in cellular metabolism is discussed.
Pandey, Manmohan; Kumar, Ravindra; Srivastava, Prachi; Agarwal, Suyash; Srivastava, Shreya; Nagpure, Naresh S; Jena, Joy K; Kushwaha, Basdeo
2018-03-16
Mining and characterization of Simple Sequence Repeat (SSR) markers from whole genomes provide valuable information about biological significance of SSR distribution and also facilitate development of markers for genetic analysis. Whole genome sequencing (WGS)-SSR Annotation Tool (WGSSAT) is a graphical user interface pipeline developed using Java Netbeans and Perl scripts which facilitates in simplifying the process of SSR mining and characterization. WGSSAT takes input in FASTA format and automates the prediction of genes, noncoding RNA (ncRNA), core genes, repeats and SSRs from whole genomes followed by mapping of the predicted SSRs onto a genome (classified according to genes, ncRNA, repeats, exonic, intronic, and core gene region) along with primer identification and mining of cross-species markers. The program also generates a detailed statistical report along with visualization of mapped SSRs, genes, core genes, and RNAs. The features of WGSSAT were demonstrated using Takifugu rubripes data. This yielded a total of 139 057 SSR, out of which 113 703 SSR primer pairs were uniquely amplified in silico onto a T. rubripes (fugu) genome. Out of 113 703 mined SSRs, 81 463 were from coding region (including 4286 exonic and 77 177 intronic), 7 from RNA, 267 from core genes of fugu, whereas 105 641 SSR and 601 SSR primer pairs were uniquely mapped onto the medaka genome. WGSSAT is tested under Ubuntu Linux. The source code, documentation, user manual, example dataset and scripts are available online at https://sourceforge.net/projects/wgssat-nbfgr.
Kim, Tae-Sung; He, Qiang; Kim, Kyu-Won; Yoon, Min-Young; Ra, Won-Hee; Li, Feng Peng; Tong, Wei; Yu, Jie; Oo, Win Htet; Choi, Buung; Heo, Eun-Beom; Yun, Byoung-Kook; Kwon, Soon-Jae; Kwon, Soon-Wook; Cho, Yoo-Hyun; Lee, Chang-Yong; Park, Beom-Seok; Park, Yong-Jin
2016-05-26
Rice germplasm collections continue to grow in number and size around the world. Since maintaining and screening such massive resources remains challenging, it is important to establish practical methods to manage them. A core collection, by definition, refers to a subset of the entire population that preserves the majority of genetic diversity, enhancing the efficiency of germplasm utilization. Here, we report whole-genome resequencing of the 137 rice mini core collection or Korean rice core set (KRICE_CORE) that represents 25,604 rice germplasms deposited in the Korean genebank of the Rural Development Administration (RDA). We implemented the Illumina HiSeq 2000 and 2500 platform to produce short reads and then assembled those with 9.8 depths using Nipponbare as a reference. Comparisons of the sequences with the reference genome yielded more than 15 million (M) single nucleotide polymorphisms (SNPs) and 1.3 M INDELs. Phylogenetic and population analyses using 2,046,529 high-quality SNPs successfully assigned rice accessions to the relevant rice subgroups, suggesting that these SNPs capture evolutionary signatures that have accumulated in rice subpopulations. Furthermore, genome-wide association studies (GWAS) for four exemplary agronomic traits in the KRIC_CORE manifest the utility of KRICE_CORE; that is, identifying previously defined genes or novel genetic factors that potentially regulate important phenotypes. This study provides strong evidence that the size of KRICE_CORE is small but contains high genetic and functional diversity across the genome. Thus, our resequencing results will be useful for future breeding, as well as functional and evolutionary studies, in the post-genomic era.
Insights into the 1.59-Mbp largest plasmid of Azospirillum brasilense CBG497.
Acosta-Cruz, Erika; Wisniewski-Dyé, Florence; Rouy, Zoé; Barbe, Valérie; Valdés, María; Mavingui, Patrick
2012-09-01
The plant growth-promoting proteobacterium Azospirillum brasilense enhances growth of many economically important crops, such as wheat, maize, and rice. The sequencing and annotation of the 1.59-Mbp replicon of A. brasilense CBG497, a strain isolated from a maize rhizosphere grown on an alkaline soil in the northeast of Mexico, revealed a GC content of 68.7 % and the presence of 1,430 potential protein-encoding genes, 1,147 of them classified into clusters of orthologous groups categories, and 16 tRNA genes representing 11 tRNA species. The presence of sixty-two genes representatives of the minimal gene set and chromid core genes suggests its importance in bacterial survival. The phaAB → G operon, reported as involved in the bacterial adaptation to alkaline pH in the presence of K(+), was also found on this replicon and detected in several Azospirillum strains. Phylogenetic analysis suggests that it was laterally acquired. We were not able to show its inference on the adaptation to basic pH, giving a hint about the presence of an alternative system for adaptation to alkaline pH.
Basis sets for the calculation of core-electron binding energies
NASA Astrophysics Data System (ADS)
Hanson-Heine, Magnus W. D.; George, Michael W.; Besley, Nicholas A.
2018-05-01
Core-electron binding energies (CEBEs) computed within a Δ self-consistent field approach require large basis sets to achieve convergence with respect to the basis set limit. It is shown that supplementing a basis set with basis functions from the corresponding basis set for the element with the next highest nuclear charge (Z + 1) provides basis sets that give CEBEs close to the basis set limit. This simple procedure provides relatively small basis sets that are well suited for calculations where the description of a core-ionised state is important, such as time-dependent density functional theory calculations of X-ray emission spectroscopy.
Thiry, M; Scheer, U; Goessens, G
1991-01-01
Nucleoli are the morphological expression of the activity of a defined set of chromosomal segments bearing rRNA genes. The topological distribution and composition of the intranucleolar chromatin as well as the definition of nucleolar structures in which enzymes of the rDNA transcription machinery reside have been investigated in mammalian cells by various immunogold labelling approaches at the ultrastructural level. The precise intranucleolar location of rRNA genes has been further specified by electron microscopic in situ hybridization with a non-autoradiographic procedure. Our results indicate that the fibrillar centers are the sole nucleolar structures where rDNA, core histones, RNA polymerase I and DNA topoisomerase I are located together. Taking into account the potential value and limitations of immunoelectron microscopic techniques, we propose that transcription of the rRNA genes takes place within the confines of the fibrillar centers, probably close to the boundary regions to the surrounding dense fibrillar component.
An intersectional gene regulatory strategy defines subclass diversity of C. elegans motor neurons.
Kratsios, Paschalis; Kerk, Sze Yen; Catela, Catarina; Liang, Joseph; Vidal, Berta; Bayer, Emily A; Feng, Weidong; De La Cruz, Estanisla Daniel; Croci, Laura; Consalez, G Giacomo; Mizumoto, Kota; Hobert, Oliver
2017-07-05
A core principle of nervous system organization is the diversification of neuron classes into subclasses that share large sets of features but differ in select traits. We describe here a molecular mechanism necessary for motor neurons to acquire subclass-specific traits in the nematode Caenorhabditis elegans . Cholinergic motor neuron classes of the ventral nerve cord can be subdivided into subclasses along the anterior-posterior (A-P) axis based on synaptic connectivity patterns and molecular features. The conserved COE-type terminal selector UNC-3 not only controls the expression of traits shared by all members of a neuron class, but is also required for subclass-specific traits expressed along the A-P axis. UNC-3, which is not regionally restricted, requires region-specific cofactors in the form of Hox proteins to co-activate subclass-specific effector genes in post-mitotic motor neurons. This intersectional gene regulatory principle for neuronal subclass diversification may be conserved from nematodes to mice.
Regulatory Response to Carbon Starvation in Caulobacter crescentus
Britos, Leticia; Abeliuk, Eduardo; Taverner, Thomas; Lipton, Mary; McAdams, Harley; Shapiro, Lucy
2011-01-01
Bacteria adapt to shifts from rapid to slow growth, and have developed strategies for long-term survival during prolonged starvation and stress conditions. We report the regulatory response of C. crescentus to carbon starvation, based on combined high-throughput proteome and transcriptome analyses. Our results identify cell cycle changes in gene expression in response to carbon starvation that involve the prominent role of the FixK FNR/CAP family transcription factor and the CtrA cell cycle regulator. Notably, the SigT ECF sigma factor mediates the carbon starvation-induced degradation of CtrA, while activating a core set of general starvation-stress genes that respond to carbon starvation, osmotic stress, and exposure to heavy metals. Comparison of the response of swarmer cells and stalked cells to carbon starvation revealed four groups of genes that exhibit different expression profiles. Also, cell pole morphogenesis and initiation of chromosome replication normally occurring at the swarmer-to-stalked cell transition are uncoupled in carbon-starved cells. PMID:21494595
Being Aquifex aeolicus: Untangling a Hyperthermophile’s Checkered Past
Eveleigh, Robert J.M.; Meehan, Conor J.; Archibald, John M.; Beiko, Robert G.
2013-01-01
Lateral gene transfer (LGT) is an important factor contributing to the evolution of prokaryotic genomes. The Aquificae are a hyperthermophilic bacterial group whose genes show affiliations to many other lineages, including the hyperthermophilic Thermotogae, the Proteobacteria, and the Archaea. Previous phylogenomic analyses focused on Aquifex aeolicus identified Thermotogae and Aquificae either as successive early branches or sisters in a rooted bacterial phylogeny, but many phylogenies and cellular traits have suggested a stronger affiliation with the Epsilonproteobacteria. Different scenarios for the evolution of the Aquificae yield different phylogenetic predictions. Here, we outline these scenarios and consider the fit of the available data, including three sequenced Aquificae genomes, to different sets of predictions. Evidence from phylogenetic profiles and trees suggests that the Epsilonproteobacteria have the strongest affinities with the three Aquificae analyzed. However, this pattern is shown by only a minority of encoded proteins, and the Archaea, many lineages of thermophilic bacteria, and members of genus Clostridium and class Deltaproteobacteria also show strong connections to the Aquificae. The phylogenetic affiliations of different functional subsystems showed strong biases: Most but not all genes implicated in the core translational apparatus tended to group Aquificae with Thermotogae, whereas a wide range of metabolic and cellular processes strongly supported the link between Aquificae and Epsilonproteobacteria. Depending on which sets of genes are privileged, either Thermotogae or Epsilonproteobacteria is the most plausible adjacent lineage to the Aquificae. Both scenarios require massive sharing of genes to explain the history of this enigmatic group, whose history is further complicated by specific affinities of different members of Aquificae to different partner lineages. PMID:24281050
Being Aquifex aeolicus: Untangling a hyperthermophile's checkered past.
Eveleigh, Robert J M; Meehan, Conor J; Archibald, John M; Beiko, Robert G
2013-01-01
Lateral gene transfer (LGT) is an important factor contributing to the evolution of prokaryotic genomes. The Aquificae are a hyperthermophilic bacterial group whose genes show affiliations to many other lineages, including the hyperthermophilic Thermotogae, the Proteobacteria, and the Archaea. Previous phylogenomic analyses focused on Aquifex aeolicus identified Thermotogae and Aquificae either as successive early branches or sisters in a rooted bacterial phylogeny, but many phylogenies and cellular traits have suggested a stronger affiliation with the Epsilonproteobacteria. Different scenarios for the evolution of the Aquificae yield different phylogenetic predictions. Here, we outline these scenarios and consider the fit of the available data, including three sequenced Aquificae genomes, to different sets of predictions. Evidence from phylogenetic profiles and trees suggests that the Epsilonproteobacteria have the strongest affinities with the three Aquificae analyzed. However, this pattern is shown by only a minority of encoded proteins, and the Archaea, many lineages of thermophilic bacteria, and members of genus Clostridium and class Deltaproteobacteria also show strong connections to the Aquificae. The phylogenetic affiliations of different functional subsystems showed strong biases: Most but not all genes implicated in the core translational apparatus tended to group Aquificae with Thermotogae, whereas a wide range of metabolic and cellular processes strongly supported the link between Aquificae and Epsilonproteobacteria. Depending on which sets of genes are privileged, either Thermotogae or Epsilonproteobacteria is the most plausible adjacent lineage to the Aquificae. Both scenarios require massive sharing of genes to explain the history of this enigmatic group, whose history is further complicated by specific affinities of different members of Aquificae to different partner lineages.
The most conserved genome segments for life detection on Earth and other planets.
Isenbarger, Thomas A; Carr, Christopher E; Johnson, Sarah Stewart; Finney, Michael; Church, George M; Gilbert, Walter; Zuber, Maria T; Ruvkun, Gary
2008-12-01
On Earth, very simple but powerful methods to detect and classify broad taxa of life by the polymerase chain reaction (PCR) are now standard practice. Using DNA primers corresponding to the 16S ribosomal RNA gene, one can survey a sample from any environment for its microbial inhabitants. Due to massive meteoritic exchange between Earth and Mars (as well as other planets), a reasonable case can be made for life on Mars or other planets to be related to life on Earth. In this case, the supremely sensitive technologies used to study life on Earth, including in extreme environments, can be applied to the search for life on other planets. Though the 16S gene has become the standard for life detection on Earth, no genome comparisons have established that the ribosomal genes are, in fact, the most conserved DNA segments across the kingdoms of life. We present here a computational comparison of full genomes from 13 diverse organisms from the Archaea, Bacteria, and Eucarya to identify genetic sequences conserved across the widest divisions of life. Our results identify the 16S and 23S ribosomal RNA genes as well as other universally conserved nucleotide sequences in genes encoding particular classes of transfer RNAs and within the nucleotide binding domains of ABC transporters as the most conserved DNA sequence segments across phylogeny. This set of sequences defines a core set of DNA regions that have changed the least over billions of years of evolution and provides a means to identify and classify divergent life, including ancestrally related life on other planets.
Fission control system for nuclear reactor
Conley, G.H.; Estes, G.P.
Control system for nuclear reactor comprises a first set of reactivity modifying rods fixed in a reactor core with their upper ends stepped in height across the core, and a second set of reactivity modifying rods movable vertically within the reactor core and having their lower ends stepped to correspond with the stepped arrangement of the first set of rods, pairs of the rods of the first and second sets being in coaxial alignment.
Morgan, Esi M; Riebschleger, Meredith P; Horonjeff, Jennifer; Consolaro, Alessandro; Munro, Jane E; Thornhill, Susan; Beukelman, Timothy; Brunner, Hermine I; Creek, Emily L; Harris, Julia G; Horton, Daniel B; Lovell, Daniel J; Mannion, Melissa L; Olson, Judyann C; Rahimi, Homaira; Gallo, Maria Chiara; Calandra, Serena; Ravelli, Angelo; Ringold, Sarah; Shenoi, Susan; Stinson, Jennifer; Toupin-April, Karine; Strand, Vibeke; Bingham, Clifton O
2017-12-01
The current Juvenile Idiopathic Arthritis (JIA) Core Set was developed in 1997 to identify the outcome measures to be used in JIA clinical trials using statistical and consensus-based techniques, but without patient involvement. The importance of patient/parent input into the research process has increasingly been recognized over the years. An Outcome Measures in Rheumatology (OMERACT) JIA Core Set Working Group was formed to determine whether the outcome domains of the current core set are relevant to those involved or whether the core set domains should be revised. Twenty-four people from the United States, Canada, Australia, and Europe, including patient partners, formed the working group. Guided by the OMERACT Filter 2.0 process, we performed (1) a systematic literature review of outcome domains, (2) a Web-based survey (142 patients, 343 parents), (3) an idea-generation study (120 parents), (4) 4 online discussion boards (24 patients, 20 parents), and (5) a Special Interest Group (SIG) activity at the OMERACT 13 (2016) meeting. A MEDLINE search of outcome domains used in studies of JIA yielded 5956 citations, of which 729 citations underwent full-text review, and identified additional domains to those included in the current JIA Core Set. Qualitative studies on the effect of JIA identified multiple additional domains, including pain and participation. Twenty-one participants in the SIG achieved consensus on the need to revise the entire JIA Core Set. The results of qualitative studies and literature review support the need to expand the JIA Core Set, considering, among other things, additional patient/parent-centered outcomes, clinical data, and imaging data.
Makarova, Kira S; Sorokin, Alexander V; Novichkov, Pavel S; Wolf, Yuri I; Koonin, Eugene V
2007-11-27
An evolutionary classification of genes from sequenced genomes that distinguishes between orthologs and paralogs is indispensable for genome annotation and evolutionary reconstruction. Shortly after multiple genome sequences of bacteria, archaea, and unicellular eukaryotes became available, an attempt on such a classification was implemented in Clusters of Orthologous Groups of proteins (COGs). Rapid accumulation of genome sequences creates opportunities for refining COGs but also represents a challenge because of error amplification. One of the practical strategies involves construction of refined COGs for phylogenetically compact subsets of genomes. New Archaeal Clusters of Orthologous Genes (arCOGs) were constructed for 41 archaeal genomes (13 Crenarchaeota, 27 Euryarchaeota and one Nanoarchaeon) using an improved procedure that employs a similarity tree between smaller, group-specific clusters, semi-automatically partitions orthology domains in multidomain proteins, and uses profile searches for identification of remote orthologs. The annotation of arCOGs is a consensus between three assignments based on the COGs, the CDD database, and the annotations of homologs in the NR database. The 7538 arCOGs, on average, cover approximately 88% of the genes in a genome compared to a approximately 76% coverage in COGs. The finer granularity of ortholog identification in the arCOGs is apparent from the fact that 4538 arCOGs correspond to 2362 COGs; approximately 40% of the arCOGs are new. The archaeal gene core (protein-coding genes found in all 41 genome) consists of 166 arCOGs. The arCOGs were used to reconstruct gene loss and gene gain events during archaeal evolution and gene sets of ancestral forms. The Last Archaeal Common Ancestor (LACA) is conservatively estimated to possess 996 genes compared to 1245 and 1335 genes for the last common ancestors of Crenarchaeota and Euryarchaeota, respectively. It is inferred that LACA was a chemoautotrophic hyperthermophile that, in addition to the core archaeal functions, encoded more idiosyncratic systems, e.g., the CASS systems of antivirus defense and some toxin-antitoxin systems. The arCOGs provide a convenient, flexible framework for functional annotation of archaeal genomes, comparative genomics and evolutionary reconstructions. Genomic reconstructions suggest that the last common ancestor of archaea might have been (nearly) as advanced as the modern archaeal hyperthermophiles. ArCOGs and related information are available at: ftp://ftp.ncbi.nih.gov/pub/koonin/arCOGs/.
Inglin, Raffael C; Meile, Leo; Stevens, Marc J A
2018-04-24
Bacterial taxonomy aims to classify bacteria based on true evolutionary events and relies on a polyphasic approach that includes phenotypic, genotypic and chemotaxonomic analyses. Until now, complete genomes are largely ignored in taxonomy. The genus Lactobacillus consists of 173 species and many genomes are available to study taxonomy and evolutionary events. We analyzed and clustered 98 completely sequenced genomes of the genus Lactobacillus and 234 draft genomes of 5 different Lactobacillus species, i.e. L. reuteri, L. delbrueckii, L. plantarum, L. rhamnosus and L. helveticus. The core-genome of the genus Lactobacillus contains 266 genes and the pan-genome 20'800 genes. Clustering of the Lactobacillus pan- and core-genome resulted in two highly similar trees. This shows that evolutionary history is traceable in the core-genome and that clustering of the core-genome is sufficient to explore relationships. Clustering of core- and pan-genomes at species' level resulted in similar trees as well. Detailed analyses of the core-genomes showed that the functional class "genetic information processing" is conserved in the core-genome but that "signaling and cellular processes" is not. The latter class encodes functions that are involved in environmental interactions. Evolution of lactobacilli seems therefore directed by the environment. The type species L. delbrueckii was analyzed in detail and its pan-genome based tree contained two major clades whose members contained different genes yet identical functions. In addition, evidence for horizontal gene transfer between strains of L. delbrueckii, L. plantarum, and L. rhamnosus, and between species of the genus Lactobacillus is presented. Our data provide evidence for evolution of some lactobacilli according to a parapatric-like model for species differentiation. Core-genome trees are useful to detect evolutionary relationships in lactobacilli and might be useful in taxonomic analyses. Lactobacillus' evolution is directed by the environment and HGT.
Sved, J A; Yu, H; Dominiak, B; Gilchrist, A S
2003-01-01
Long-range dispersal of a species may involve either a single long-distance movement from a core population or spreading via unobserved intermediate populations. Where the new populations originate as small propagules, genetic drift may be extreme and gene frequency or assignment methods may not prove useful in determining the relation between the core population and outbreak samples. We describe computationally simple resampling methods for use in this situation to distinguish between the different modes of dispersal. First, estimates of heterozygosity can be used to test for direct sampling from the core population and to estimate the effective size of intermediate populations. Second, a test of sharing of alleles, particularly rare alleles, can show whether outbreaks are related to each other rather than arriving as independent samples from the core population. The shared-allele statistic also serves as a genetic distance measure that is appropriate for small samples. These methods were applied to data on a fruit fly pest species, Bactrocera tryoni, which is quarantined from some horticultural areas in Australia. We concluded that the outbreaks in the quarantine zone came from a heterogeneous set of genetically differentiated populations, possibly ones that overwinter in the vicinity of the quarantine zone. PMID:12618417
Idzerda, Leanne; Rader, Tamara; Tugwell, Peter; Boers, Maarten
2014-05-01
The usefulness of randomized control trials to advance clinical care depends upon the outcomes reported, but disagreement on the choice of outcome measures has resulted in inconsistency and the potential for reporting bias. One solution to this problem is the development of a core outcome set: a minimum set of outcome measures deemed critical for clinical decision making. Within rheumatology the Outcome Measures in Rheumatology (OMERACT) initiative has pioneered the development of core outcome sets since 1992. As the number of diseases addressed by OMERACT has increased and its experience in formulating core sets has grown, clarification and update of the conceptual framework and formulation of a more explicit process of area/domain core set development has become necessary. As part of the update process of the OMERACT Filter criteria to version 2, a literature review was undertaken to compare and contrast the OMERACT conceptual framework with others within and outside rheumatology. A scoping search was undertaken to examine the extent, range, and nature of conceptual frameworks for core set outcome selection in health. We searched the following resources: Cochrane Library Methods Group Register; Medline; Embase; PsycInfo; Environmental Studies and Policy Collection; and ABI/INFORM Global. We also conducted a targeted Google search. Five conceptual frameworks were identified: the WHO tripartite definition of health; the 5 Ds (discomfort, disability, drug toxicity, dollar cost, and death); the International Classification of Functioning (ICF); PROMIS (Patient-Reported Outcomes Measurement System); and the Outcomes Hierarchy. Of these, only the 5 Ds and ICF frameworks have been systematically applied in core set development. Outside the area of rheumatology, several core sets were identified; these had been developed through a limited range of consensus-based methods with varying degrees of methodological rigor. None applied a framework to ensure content validity of the end product. This scoping review reinforced the need for clear methods and standards for core set development. Based on these findings, OMERACT will make its own conceptual framework and working process more explicit. Proposals for how to achieve this were discussed at the OMERACT 11 conference.
Ferreira, Ari J S; Siam, Rania; Setubal, João C; Moustafa, Ahmed; Sayed, Ahmed; Chambergo, Felipe S; Dawe, Adam S; Ghazy, Mohamed A; Sharaf, Hazem; Ouf, Amged; Alam, Intikhab; Abdel-Haleem, Alyaa M; Lehvaslaiho, Heikki; Ramadan, Eman; Antunes, André; Stingl, Ulrich; Archer, John A C; Jankovic, Boris R; Sogin, Mitchell; Bajic, Vladimir B; El-Dorry, Hamza
2014-01-01
Metagenomics-based functional profiling analysis is an effective means of gaining deeper insight into the composition of marine microbial populations and developing a better understanding of the interplay between the functional genome content of microbial communities and abiotic factors. Here we present a comprehensive analysis of 24 datasets covering surface and depth-related environments at 11 sites around the world's oceans. The complete datasets comprises approximately 12 million sequences, totaling 5,358 Mb. Based on profiling patterns of Clusters of Orthologous Groups (COGs) of proteins, a core set of reference photic and aphotic depth-related COGs, and a collection of COGs that are associated with extreme oxygen limitation were defined. Their inferred functions were utilized as indicators to characterize the distribution of light- and oxygen-related biological activities in marine environments. The results reveal that, while light level in the water column is a major determinant of phenotypic adaptation in marine microorganisms, oxygen concentration in the aphotic zone has a significant impact only in extremely hypoxic waters. Phylogenetic profiling of the reference photic/aphotic gene sets revealed a greater variety of source organisms in the aphotic zone, although the majority of individual photic and aphotic depth-related COGs are assigned to the same taxa across the different sites. This increase in phylogenetic and functional diversity of the core aphotic related COGs most probably reflects selection for the utilization of a broad range of alternate energy sources in the absence of light.
Ferreira, Ari J. S.; Siam, Rania; Setubal, João C.; Moustafa, Ahmed; Sayed, Ahmed; Chambergo, Felipe S.; Dawe, Adam S.; Ghazy, Mohamed A.; Sharaf, Hazem; Ouf, Amged; Alam, Intikhab; Abdel-Haleem, Alyaa M.; Lehvaslaiho, Heikki; Ramadan, Eman; Antunes, André; Stingl, Ulrich; Archer, John A. C.; Jankovic, Boris R.; Sogin, Mitchell; Bajic, Vladimir B.; El-Dorry, Hamza
2014-01-01
Metagenomics-based functional profiling analysis is an effective means of gaining deeper insight into the composition of marine microbial populations and developing a better understanding of the interplay between the functional genome content of microbial communities and abiotic factors. Here we present a comprehensive analysis of 24 datasets covering surface and depth-related environments at 11 sites around the world's oceans. The complete datasets comprises approximately 12 million sequences, totaling 5,358 Mb. Based on profiling patterns of Clusters of Orthologous Groups (COGs) of proteins, a core set of reference photic and aphotic depth-related COGs, and a collection of COGs that are associated with extreme oxygen limitation were defined. Their inferred functions were utilized as indicators to characterize the distribution of light- and oxygen-related biological activities in marine environments. The results reveal that, while light level in the water column is a major determinant of phenotypic adaptation in marine microorganisms, oxygen concentration in the aphotic zone has a significant impact only in extremely hypoxic waters. Phylogenetic profiling of the reference photic/aphotic gene sets revealed a greater variety of source organisms in the aphotic zone, although the majority of individual photic and aphotic depth-related COGs are assigned to the same taxa across the different sites. This increase in phylogenetic and functional diversity of the core aphotic related COGs most probably reflects selection for the utilization of a broad range of alternate energy sources in the absence of light. PMID:24921648
Izquierdo, Luis; Coderch, Núria; Piqué, Nuria; Bedini, Emiliano; Michela Corsaro, Maria; Merino, Susana; Fresno, Sandra; Tomás, Juan M.; Regué, Miguel
2003-01-01
To determine the function of the wabG gene in the biosynthesis of the core lipopolysaccharide (LPS) of Klebsiella pneumoniae, we constructed wabG nonpolar mutants. Data obtained from the comparative chemical and structural analysis of LPS samples obtained from the wild type, the mutant strain, and the complemented mutant demonstrated that the wabG gene is involved in attachment to α-l-glycero-d-manno-heptopyranose II (l,d-HeppII) at the O-3 position of an α-d-galactopyranosyluronic acid (α-d-GalAp) residue. K. pneumoniae nonpolar wabG mutants were devoid of the cell-attached capsular polysaccharide but were still able to produce capsular polysaccharide. Similar results were obtained with K. pneumoniae nonpolar waaC and waaF mutants, which produce shorter LPS core molecules than do wabG mutants. Other outer core K. pneumoniae nonpolar mutants in the waa gene cluster were encapsulated. K. pneumoniae waaC, waaF, and wabG mutants were avirulent when tested in different animal models. Furthermore, these mutants were more sensitive to some hydrophobic compounds than the wild-type strains. All these characteristics were rescued by reintroduction of the waaC, waaF, and wabG genes from K. pneumoniae. PMID:14645282
[Hot spot mutation screening of RYR1 gene in diagnosis of congenital myopathies].
Chang, Xing-zhi; Jin, Yi-wen; Wang, Jing-min; Yuan, Yun; Xiong, Hui; Wang, Shuang; Qin, Jiong
2014-10-18
To detect hot spot mutation of RYR1 gene in 15 cases of congenital myopathy with different subtypes, and to discuss the value of RYR1 gene hot spot mutation detection in the diagnosis of the disease. Clinical data were collected in all the patients, including clinical manifestations and signs, serum creatine kinase, electromyography. Fourteen of the patients accepted the muscle biopsy. Hot spot mutation in the C-terminal of RYR1 gene (extron 96-106) had been detected in all the 15 patients. All the patients presented with motor development delay, and they could walk at the age of 1 to 3.5 years,but were always easy to fall and could not run or jump. There were no progressive deteriorations. Physical examination showed different degrees of muscle weakness and hypotonia.High arched palates were noted in 3 patients. The serum levels of creatine kinase were mildly elevated in 3 cases, and normal in 12 cases. Electromyography showed "myogenic" features in 11 patients, being normal in the other 4 patients. Muscle biopsy pathologic diagnosis was the central core disease in 3 patients, the central nuclei in 2 patients, the congenital fiber type disproportion in 2 patients, the nameline myopathy in 3 patient, the multiminicore disease in 1 patient, and nonspecific minimal changes in the other 3 patients; one patient was diagnosed with central core disease according to positive family history and gene mutation. In the family case (Patient 2) of central core disease, the c.14678G>A (p.Arg4893Gln) mutation in 102 extron of RYR1 was identified in three members of the family, which had been reported to be a pathogenic mutation. The c.14596A>G(p.Lys4866Gln) mutation in 101 extron was found in one patient with central core disease(Patient 1), and the c.14719G>A(p.Gly4907Ser) mutation in 102 extron was found in another case of the central core disease(Patient 3).The same novel mutation was verified in one of the patients' (Patient 3) asymptomatic father. Congenital myopathies in the different subtype have the similar clinical manifestations, signs, enzyme detection and electromyography changes. Muscle biopsy plays an important role in the selection of genes to be detected. Hot spot mutation in C-terminal of the RYR1 gene can only be identified in patients with central core disease, so we suggest this hot spot gene mutation screening apply to the suspicious patient with central core disease only.
Morris, Christopher; Dunkley, Colin; Gibbon, Frances M; Currier, Janet; Roberts, Deborah; Rogers, Morwenna; Crudgington, Holly; Bray, Lucy; Carter, Bernie; Hughes, Dyfrig; Tudur Smith, Catrin; Williamson, Paula R; Gringras, Paul; Pal, Deb K
2017-11-28
There is increasing recognition that establishing a core set of outcomes to be evaluated and reported in trials of interventions for particular conditions will improve the usefulness of health research. There is no established core outcome set for childhood epilepsy. The aim of this work is to select a core outcome set to be used in evaluative research of interventions for children with rolandic epilepsy, as an exemplar of common childhood epilepsy syndromes. First we will identify what outcomes should be measured; then we will decide how to measure those outcomes. We will engage relevant UK charities and health professional societies as partners, and convene advisory panels for young people with epilepsy and parents of children with epilepsy. We will identify candidate outcomes from a search for trials of interventions for childhood epilepsy, statutory guidance and consultation with our advisory panels. Families, charities and health, education and neuropsychology professionals will be invited to participate in a Delphi survey following recommended practices in the development of core outcome sets. Participants will be able to recommend additional outcome domains. Over three rounds of Delphi survey participants will rate the importance of candidate outcome domains and state the rationale for their decisions. Over the three rounds we will seek consensus across and between families and health professionals on the more important outcomes. A face-to-face meeting will be convened to ratify the core outcome set. We will then review and recommend ways to measure the shortlisted outcomes using clinical assessment and/or patient-reported outcome measures. Our methodology is a proportionate and pragmatic approach to expediently produce a core outcome set for evaluative research of interventions aiming to improve the health of children with epilepsy. A number of decisions have to be made when designing a study to develop a core outcome set including defining the scope, choosing which stakeholders to engage, most effective ways to elicit their views, especially children and a potential role for qualitative research.
Ruaro, João A; Ruaro, Marinêz B; Guerra, Ricardo O
2014-01-01
To facilitate a systematic, comprehensive description of functioning and to enable the use of the International Classification of Functioning, Disability and Health (ICF) in clinical practice and research, core sets have been developed. The aim of this study was to propose a version of the ICF core set to classify the physical health of older adults. The proposition of the ICF core set was based on the Delphi technique. The panel of experts included 8 Brazilian researchers (physical therapists, medical doctors, nurses, and physical educators). The communication was wholly electronic. In total, there were 5 rounds of interactivity between the participants to arrive at the final version of the construct. The ICF core set presented 30 categories (14 on body functions, 4 on body structures, 9 on activities or participation, and 3 on environmental factors) and had a Cronbach α of 0.964. The presented core set is a secure, fast, and accurate instrument for assessing the physical health and engagement of older adults. It defines points related to functioning and health that are relevant when evaluating this population, as well as when reevaluating it and monitoring changes.
Mapping of a standard documentation template to the ICF core sets for arthritis and low back pain.
Escorpizo, Reuben; Davis, Kandace; Stumbo, Teri
2010-12-01
To identify the contents of a documentation template in The Guide to Physical Therapist Practice using the International Classification of Functioning, Disability, and Health (ICF) Core Sets for rheumatoid arthritis, osteoarthritis, and low back pain (LBP) as reference. Concepts were identified from items of an outpatient documentation template and mapped to the ICF using established linking rules. The ICF categories that were linked were compared with existing arthritis and LBP Core Sets. Based on the ICF, the template had the highest number (29%) of linked categories under Activities and participation while Body structures had the least (17%). ICF categories in the arthritis and LBP Core Sets had a 37-55% match with the ICF categories found in the template. We found 164 concepts that were not classified or not defined and 37 as personal factors. The arthritis and LBP Core Sets were reflected in the contents of the template. ICF categories in the Core Sets were reflected in the template (demonstrating up to 55% match). Potential integration of ICF in documentation templates could be explored and examined in the future to enhance clinical encounters and multidisciplinary communication. Copyright © 2010 John Wiley & Sons, Ltd.
Bölte, Sven; de Schipper, Elles; Holtmann, Martin; Karande, Sunil; de Vries, Petrus J; Selb, Melissa; Tannock, Rosemary
2014-12-01
In the study of health and quality of life in attention deficit/hyperactivity disorder (ADHD), it is of paramount importance to include assessment of functioning. The International Classification of Functioning, Disability and Health (ICF) provides a comprehensive, universally accepted framework for the description of functioning in relation to health conditions. In this paper, the authors outline the process to develop ICF Core Sets for ADHD. ICF Core Sets are subgroups of ICF categories selected to capture the aspects of functioning that are most likely to be affected in specific disorders. The ICF categories that will be included in the ICF Core Sets for ADHD will be determined at an ICF Core Set Consensus Conference, wherein evidence from four preliminary studies (a systematic review, an expert survey, a patient and caregiver qualitative study, and a clinical cross-sectional study) will be integrated. Comprehensive and Brief ICF Core Sets for ADHD will be developed with the goal of providing useful standards for research and clinical practice, and to generate a common language for the description of functioning in ADHD in different areas of life and across the lifespan.
The impact of the EUSCLE Core Set Questionnaire for the assessment of cutaneous lupus erythematosus.
Kuhn, A; Patsinakidis, N; Bonsmann, G
2010-08-01
Epidemiological data and standard European guidelines for the diagnosis and treatment of cutaneous lupus erythematosus (CLE) are lacking in the current literature. In order to provide a standardized tool for an extensive consistent data collection, a study group of the European Society of Cutaneous Lupus Erythematosus (EUSCLE) recently developed a Core Set Questionnaire for the assessment of patients with different subtypes of CLE. The EUSCLE Core Set Questionnaire includes six sections on patient data, diagnosis, skin involvement, activity and damage of disease, laboratory analysis, and treatment. An instrument like the EUSCLE Core Set Questionnaire is essential to gain a broad and comparable data collection of patients with CLE from different European centres and to achieve consensus concerning clinical standards for the disease. The data will also be important for further characterization of the different CLE subtypes and the evaluation of therapeutic strategies; moreover, the EUSCLE Core Set Questionnaire might also be useful for the comparison of data in clinical trials. In this review, the impact of the EUSCLE Core Set Questionnaire is discussed in detail with regard to clinical and serological features as well as therapeutic modalities in CLE.
Diversity captured in the USDA-ARS National Plant Germplasm System apple core collection
USDA-ARS?s Scientific Manuscript database
Core collections have been used widely in genetic resources to provide a representative and compact sample to use in breeding evaluation. In the 1990s a core set was developed by the USDA-ARS Plant Genetic Resources Unit (PGRU) in Geneva, NY. Using data available at the time, a core set was develo...
Blackwood, Bronagh; Ringrow, Suzanne; Clarke, Mike; Marshall, John; Rose, Louise; Williamson, Paula; McAuley, Danny
2015-08-20
Among clinical trials of interventions that aim to modify time spent on mechanical ventilation for critically ill patients there is considerable inconsistency in chosen outcomes and how they are measured. The Core Outcomes in Ventilation Trials (COVenT) study aims to develop a set of core outcomes for use in future ventilation trials in mechanically ventilated adults and children. We will use a mixed methods approach that incorporates a randomised trial nested within a Delphi study and a consensus meeting. Additionally, we will conduct an observational cohort study to evaluate uptake of the core outcome set in published studies at 5 and 10 years following core outcome set publication. The three-round online Delphi study will use a list of outcomes that have been reported previously in a review of ventilation trials. The Delphi panel will include a range of stakeholder groups including patient support groups. The panel will be randomised to one of three feedback methods to assess the impact of the feedback mechanism on subsequent ranking of outcomes. A final consensus meeting will be held with stakeholder representatives to review outcomes. The COVenT study aims to develop a core outcome set for ventilation trials in critical care, explore the best Delphi feedback mechanism for achieving consensus and determine if participation increases use of the core outcome set in the long term.
Rusz, Orsolya; Papp, Orsolya; Vízkeleti, Laura; Molnár, Béla Ákos; Bende, Kristóf Csaba; Lotz, Gábor; Ács, Balázs; Kahán, Zsuzsanna; Székely, Tamás; Báthori, Ágnes; Szundi, Csilla; Kulka, Janina; Szállási, Zoltán; Tőkés, Anna-Mária
2018-05-16
To determine the associations between lysosomal-associated transmembrane protein 4b (LAPTM4B) gene copy number and response to different chemotherapy regimens in hormone receptor negative (HR-) primary breast carcinomas. Two cohorts were analyzed: (1) 69 core biopsies from HR-breast carcinomas treated with neoadjuvant chemotherapy (anthracycline based in 72.5% of patients and non-anthracycline based in 27.5% of patients). (2) Tissue microarray (TMA) of 74 HR-breast carcinomas treated with adjuvant therapy (77.0% of the patients received anthracycline, 17.6% of the patients non-anthracycline-based therapy, and in 5.4% of the cases, no treatment data are available). Interphase FISH technique was applied on pretreatment core biopsies (cohort I) and on TMAs (cohort II) using custom-made dual-labelled FISH probes (LAPTM4B/CEN8q FISH probe Abnova Corp.). In the neoadjuvant cohort in the anthracycline-treated group, we observed a significant difference (p = 0.029) of average LAPTM4B copy number between the non-responder and pathological complete responder groups (4.1 ± 1.1 vs. 2.6 ± 0.1). In the adjuvant setting, the anthracycline-treated group of metastatic breast carcinomas was characterized by higher LAPTM4B copy number comparing to the non-metastatic ones (p = 0.046). In contrast, in the non-anthracycline-treated group of patients, we did not find any LAPTM4B gene copy number differences between responder vs. non-responder groups or between metastatic vs. non-metastatic groups. Our results confirm the possible role of the LAPTM4B gene in anthracycline resistance in HR- breast cancer. Analyzing LAPTM4B copy number pattern may support future treatment decision.
Feretzaki, Marianna; Billmyre, R Blake; Clancey, Shelly Applen; Wang, Xuying; Heitman, Joseph
2016-03-01
RNAi is a ubiquitous pathway that serves central functions throughout eukaryotes, including maintenance of genome stability and repression of transposon expression and movement. However, a number of organisms have lost their RNAi pathways, including the model yeast Saccharomyces cerevisiae, the maize pathogen Ustilago maydis, the human pathogen Cryptococcus deuterogattii, and some human parasite pathogens, suggesting there may be adaptive benefits associated with both retention and loss of RNAi. By comparing the RNAi-deficient genome of the Pacific Northwest Outbreak C. deuterogattii strain R265 with the RNAi-proficient genomes of the Cryptococcus pathogenic species complex, we identified a set of conserved genes that were lost in R265 and all other C. deuterogattii isolates examined. Genetic and molecular analyses reveal several of these lost genes play roles in RNAi pathways. Four novel components were examined further. Znf3 (a zinc finger protein) and Qip1 (a homolog of N. crassa Qip) were found to be essential for RNAi, while Cpr2 (a constitutive pheromone receptor) and Fzc28 (a transcription factor) are involved in sex-induced but not mitosis-induced silencing. Our results demonstrate that the mitotic and sex-induced RNAi pathways rely on the same core components, but sex-induced silencing may be a more specific, highly induced variant that involves additional specialized or regulatory components. Our studies further illustrate how gene network polymorphisms involving known components of key cellular pathways can inform identification of novel elements and suggest that RNAi loss may have been a core event in the speciation of C. deuterogattii and possibly contributed to its pathogenic trajectory.
Tormey, Duncan; Colbourne, John K; Mockaitis, Keithanne; Choi, Jeong-Hyeon; Lopez, Jacqueline; Burkhart, Joshua; Bradshaw, William; Holzapfel, Christina
2015-10-06
Internal circadian (circa, about; dies, day) clocks enable organisms to maintain adaptive timing of their daily behavioral activities and physiological functions. Eukaryotic clocks consist of core transcription-translation feedback loops that generate a cycle and post-translational modifiers that maintain that cycle at about 24 h. We use the pitcher-plant mosquito, Wyeomyia smithii (subfamily Culicini, tribe Sabethini), to test whether evolutionary divergence of the circadian clock genes in this species, relative to other insects, has involved primarily genes in the core feedback loops or the post-translational modifiers. Heretofore, there is no reference transcriptome or genome sequence for any mosquito in the tribe Sabethini, which includes over 375 mainly circumtropical species. We sequenced, assembled and annotated the transcriptome of W. smithii containing nearly 95 % of conserved single-copy orthologs in animal genomes. We used the translated contigs and singletons to determine the average rates of circadian clock-gene divergence in W. smithii relative to three other mosquito genera, to Drosophila, to the butterfly, Danaus, and to the wasp, Nasonia. Over 1.08 million cDNA sequence reads were obtained consisting of 432.5 million nucleotides. Their assembly produced 25,904 contigs and 54,418 singletons of which 62 % and 28 % are annotated as protein-coding genes, respectively, sharing homology with other animal proteomes. The W. smithii transcriptome includes all nine circadian transcription-translation feedback-loop genes and all eight post-translational modifier genes we sought to identify (Fig. 1). After aligning translated W. smithii contigs and singletons from this transcriptome with other insects, we determined that there was no significant difference in the average divergence of W. smithii from the six other taxa between the core feedback-loop genes and post-translational modifiers. The characterized transcriptome is sufficiently complete and of sufficient quality to have uncovered all of the insect circadian clock genes we sought to identify (Fig. 1). Relative divergence does not differ between core feedback-loop genes and post-translational modifiers of those genes in a Sabethine species (W. smithii) that has experienced a continual northward dispersal into temperate regions of progressively longer summer day lengths as compared with six other insect taxa. An associated microarray platform derived from this work will enable the investigation of functional genomics of circadian rhythmicity, photoperiodic time measurement, and diapause along a photic and seasonal geographic gradient.
2014-01-01
Background Although serotype O157:H7 is the predominant enterohemorrhagic Escherichia coli (EHEC), outbreaks of non-O157 EHEC that cause severe foodborne illness, including hemolytic uremic syndrome have increased worldwide. In fact, non-O157 serotypes are now estimated to cause over half of all the Shiga toxin-producing Escherichia coli (STEC) cases, and outbreaks of non-O157 EHEC infections are frequently associated with serotypes O26, O45, O103, O111, O121, and O145. Currently, there are no complete genomes for O145 in public databases. Results We determined the complete genome sequences of two O145 strains (EcO145), one linked to a US lettuce-associated outbreak (RM13514) and one to a Belgium ice-cream-associated outbreak (RM13516). Both strains contain one chromosome and two large plasmids, with genome sizes of 5,737,294 bp for RM13514 and 5,559,008 bp for RM13516. Comparative analysis of the two EcO145 genomes revealed a large core (5,173 genes) and a considerable amount of strain-specific genes. Additionally, the two EcO145 genomes display distinct chromosomal architecture, virulence gene profile, phylogenetic origin of Stx2a prophage, and methylation profile (methylome). Comparative analysis of EcO145 genomes to other completely sequenced STEC and other E. coli and Shigella genomes revealed that, unlike any other known non-O157 EHEC strain, EcO145 ascended from a common lineage with EcO157/EcO55. This evolutionary relationship was further supported by the pangenome analysis of the 10 EHEC str ains. Of the 4,192 EHEC core genes, EcO145 shares more genes with EcO157 than with the any other non-O157 EHEC strains. Conclusions Our data provide evidence that EcO145 and EcO157 evolved from a common lineage, but ultimately each serotype evolves via a lineage-independent nature to EHEC by acquisition of the core set of EHEC virulence factors, including the genes encoding Shiga toxin and the large virulence plasmid. The large variation between the two EcO145 genomes suggests a distinctive evolutionary path between the two outbreak strains. The distinct methylome between the two EcO145 strains is likely due to the presence of a BsuBI/PstI methyltransferase gene cassette in the Stx2a prophage of the strain RM13514, suggesting a role of horizontal gene transfer-mediated epigenetic alteration in the evolution of individual EHEC strains. PMID:24410921
Inagaki, Fumio; Tsunogai, Urumu; Suzuki, Masae; Kosaka, Ayako; Machiyama, Hideaki; Takai, Ken; Nunoura, Takuro; Nealson, Kenneth H.; Horikoshi, Koki
2004-01-01
Samples from three submerged sites (MC, a core obtained in the methane seep area; MR, a reference core obtained at a distance from the methane seep; and HC, a gas-bubbling carbonate sample) at the Kuroshima Knoll in the southern Ryuku arc were analyzed to gain insight into the organisms present and the processes involved in this oxic-anoxic methane seep environment. 16S rRNA gene analyses by quantitative real-time PCR and clone library sequencing revealed that the MC core sediments contained abundant archaea (∼34% of the total prokaryotes), including both mesophilic methanogens related to the genus Methanolobus and ANME-2 members of the Methanosarcinales, as well as members of the δ-Proteobacteria, suggesting that both anaerobic methane oxidation and methanogenesis occurred at this site. In addition, several functional genes connected with methane metabolism were analyzed by quantitative competitive-PCR, including the genes encoding particulate methane monooxygenase (pmoA), soluble methane monooxygenase (mmoX), methanol dehydrogenese (mxaF), and methyl coenzyme M reductase (mcrA). In the MC core sediments, the most abundant gene was mcrA (2.5 × 106 copies/g [wet weight]), while the pmoA gene of the type I methanotrophs (5.9 × 106 copies/g [wet weight]) was most abundant at the surface of the MC core. These results indicate that there is a very complex environment in which methane production, anaerobic methane oxidation, and aerobic methane oxidation all occur in close proximity. The HC carbonate site was rich in γ-Proteobacteria and had a high copy number of mxaF (7.1 × 106 copies/g [wet weight]) and a much lower copy number of the pmoA gene (3.2 × 102 copies/g [wet weight]). The mmoX gene was never detected. In contrast, the reference core contained familiar sequences of marine sedimentary archaeal and bacterial groups but not groups specific to C1 metabolism. Geochemical characterization of the amounts and isotopic composition of pore water methane and sulfate strongly supported the notion that in this zone both aerobic methane oxidation and anaerobic methane oxidation, as well as methanogenesis, occur. PMID:15574947
Yu, Xiaoming; Jiang, Lili; Wu, Rui; Meng, Xinchao; Zhang, Ai; Li, Ning; Xia, Qiong; Qi, Xin; Pang, Jinsong; Xu, Zheng-Yi; Liu, Bao
2016-12-05
ATP-dependent chromatin remodeling complexes play essential roles in the regulation of diverse biological processes by formulating a DNA template that is accessible to the general transcription apparatus. Although the function of chromatin remodelers in plant development has been studied in A. thaliana, how it affects growth and development of major crops (e.g., maize) remains uninvestigated. Combining genetic, genomic and bioinformatic analyses, we show here that the maize core subunit of chromatin remodeling complex, ZmCHB101, plays essential roles in growth and development of maize at both vegetative and reproductive stages. Independent ZmCHB101 RNA interference plant lines displayed abaxially curling leaf phenotype due to increase of bulliform cell numbers, and showed impaired development of tassel and cob. RNA-seq-based transcriptome profiling revealed that ZmCHB101 dictated transcriptional reprogramming of a significant set of genes involved in plant development, photosynthesis, metabolic regulation, stress response and gene expressional regulation. Intriguingly, we found that ZmCHB101 was required for maintaining normal nucleosome density and 45 S rDNA compaction. Our findings suggest that the SWI3 protein, ZmCHB101, plays pivotal roles in maize normal growth and development via regulation of chromatin structure.
Jiménez, Natalia; Senchenkova, Sofya N; Knirel, Yuriy A; Pieretti, Giuseppina; Corsaro, Maria M; Aquilini, Eleonora; Regué, Miguel; Merino, Susana; Tomás, Juan M
2012-07-01
The presence of cell-bound K1 capsule and K1 polysaccharide in culture supernatants was determined in a series of in-frame nonpolar core biosynthetic mutants from Escherichia coli KT1094 (K1, R1 core lipopolysaccharide [LPS] type) for which the major core oligosaccharide structures were determined. Cell-bound K1 capsule was absent from mutants devoid of phosphoryl modifications on L-glycero-D-manno-heptose residues (HepI and HepII) of the inner-core LPS and reduced in mutants devoid of phosphoryl modification on HepII or devoid of HepIII. In contrast, in all of the mutants, K1 polysaccharide was found in culture supernatants. These results were confirmed by using a mutant with a deletion spanning from the hldD to waaQ genes of the waa gene cluster to which individual genes were reintroduced. A nuclear magnetic resonance (NMR) analysis of core LPS from HepIII-deficient mutants showed an alteration in the pattern of phosphoryl modifications. A cell extract containing both K1 capsule polysaccharide and LPS obtained from an O-antigen-deficient mutant could be resolved into K1 polysaccharide and core LPS by column chromatography only when EDTA and deoxycholate (DOC) buffer were used. These results suggest that the K1 polysaccharide remains cell associated by ionically interacting with the phosphate-negative charges of the core LPS.
Jiménez, Natalia; Senchenkova, Sofya N.; Knirel, Yuriy A.; Pieretti, Giuseppina; Corsaro, Maria M.; Aquilini, Eleonora; Regué, Miguel; Merino, Susana
2012-01-01
The presence of cell-bound K1 capsule and K1 polysaccharide in culture supernatants was determined in a series of in-frame nonpolar core biosynthetic mutants from Escherichia coli KT1094 (K1, R1 core lipopolysaccharide [LPS] type) for which the major core oligosaccharide structures were determined. Cell-bound K1 capsule was absent from mutants devoid of phosphoryl modifications on l-glycero-d-manno-heptose residues (HepI and HepII) of the inner-core LPS and reduced in mutants devoid of phosphoryl modification on HepII or devoid of HepIII. In contrast, in all of the mutants, K1 polysaccharide was found in culture supernatants. These results were confirmed by using a mutant with a deletion spanning from the hldD to waaQ genes of the waa gene cluster to which individual genes were reintroduced. A nuclear magnetic resonance (NMR) analysis of core LPS from HepIII-deficient mutants showed an alteration in the pattern of phosphoryl modifications. A cell extract containing both K1 capsule polysaccharide and LPS obtained from an O-antigen-deficient mutant could be resolved into K1 polysaccharide and core LPS by column chromatography only when EDTA and deoxycholate (DOC) buffer were used. These results suggest that the K1 polysaccharide remains cell associated by ionically interacting with the phosphate-negative charges of the core LPS. PMID:22522903
Bosi, Emanuele; Monk, Jonathan M.; Aziz, Ramy K.; Fondi, Marco; Nizet, Victor; Palsson, Bernhard Ø.
2016-01-01
Staphylococcus aureus is a preeminent bacterial pathogen capable of colonizing diverse ecological niches within its human host. We describe here the pangenome of S. aureus based on analysis of genome sequences from 64 strains of S. aureus spanning a range of ecological niches, host types, and antibiotic resistance profiles. Based on this set, S. aureus is expected to have an open pangenome composed of 7,411 genes and a core genome composed of 1,441 genes. Metabolism was highly conserved in this core genome; however, differences were identified in amino acid and nucleotide biosynthesis pathways between the strains. Genome-scale models (GEMs) of metabolism were constructed for the 64 strains of S. aureus. These GEMs enabled a systems approach to characterizing the core metabolic and panmetabolic capabilities of the S. aureus species. All models were predicted to be auxotrophic for the vitamins niacin (vitamin B3) and thiamin (vitamin B1), whereas strain-specific auxotrophies were predicted for riboflavin (vitamin B2), guanosine, leucine, methionine, and cysteine, among others. GEMs were used to systematically analyze growth capabilities in more than 300 different growth-supporting environments. The results identified metabolic capabilities linked to pathogenic traits and virulence acquisitions. Such traits can be used to differentiate strains responsible for mild vs. severe infections and preference for hosts (e.g., animals vs. humans). Genome-scale analysis of multiple strains of a species can thus be used to identify metabolic determinants of virulence and increase our understanding of why certain strains of this deadly pathogen have spread rapidly throughout the world. PMID:27286824
Singh, Jasvinder A; Dowsey, Michelle; Choong, Peter F
2017-03-15
A patient- and surgeon-Delphi-derived Outcome Measures in Rheumatology (OMERACT) draft core domain set for total joint arthroplasty (TJR) trials was recently developed. Our objective was to obtain further patient stakeholder endorsement of draft core domain set for TJR clinical trials. We surveyed two patient groups: (1) OMERACT patient partners; and (2) patients who had undergone hip or knee TJR. Patients received an introductory email with explanations about the core domain set and instructions to rate the core domains, i.e., important aspects, of OMERACT TJR clinical trial draft core domain set. Rating was on a nominal scale, where 1-3 indicated a domain of limited importance, 4-6 an important, but not critical domain, and 7-9 a critical domain. We used Mann-Whitney test (a non-parametric test) to compare the distribution of ratings between the two groups. Thirty one survey participants from the OMERACT patient partner group and 118 knee/hip TJR patients responded with response rates of 66 and 80%, respectively. Majority of the survey respondents were female, 87 vs. 53%, and were 55 years or older, 57 vs. 94%. Median (interquartile range [IQR]) scores for six core domains by OMERACT and knee/hip TJR patient groups were, respectively: pain, 8 [8, 9] and 9 [8, 9]; function, 9 [8, 9] and 9 [8, 9]; patient satisfaction, 8 [8, 9] and 8 [7, 9]; revision surgery, 7 [7, 8] and 7 [5, 9]; adverse events, 8 [7, 9] and 8 [6, 9]; and death, 9 [6, 9] and 9 [4, 9]. No statistically significant differences in rating were noted for any of the six core domains between the two groups (p ≥ 0.31). Among the additional domains, ratings for patient participation did not differ by group (p = 0.98), but ratings for cost were significantly different (p = 0.005). Patients provided qualitative feedback regarding core domains, and did not propose any modifications to the draft core domain set. Two separate patient stakeholder groups endorsed the OMERACT TJR draft core domain set for TJR trials.
A protocol for developing, disseminating, and implementing a core outcome set for pre-eclampsia.
Duffy, James M N; van 't Hooft, Janneke; Gale, Chris; Brown, Mark; Grobman, William; Fitzpatrick, Ray; Karumanchi, S Ananth; Lucas, Nuala; Magee, Laura; Mol, Ben; Stark, Michael; Thangaratinam, Shakila; Wilson, Mathew; von Dadelszen, Peter; Williamson, Paula; Khan, Khalid S; Ziebland, Sue; McManus, Richard J
2016-10-01
Pre-eclampsia is a serious complication of pregnancy and contributes to maternal and offspring mortality and morbidity. Randomised controlled trials evaluating therapeutic interventions for pre-eclampsia have reported many different outcomes and outcome measures. Such variation contributes to an inability to compare, contrast, and combine individual studies, limiting the usefulness of research to inform clinical practice. The development and use of a core outcome set would help to address these issues ensuring outcomes important to all stakeholders, including patients, will be collected and reported in a standardised fashion. An international steering group including healthcare professionals, researchers, and patients, has been formed to guide the development of this core outcome set. Potential outcomes will be identified through a comprehensive literature review and semi-structured interviews with patients. Potential core outcomes will be entered into an international, multi-perspective online Delphi survey. All key stakeholders, including healthcare professionals, researchers, and patients will be invited to participate. The modified Delphi method encourages whole and stakeholder group convergence towards consensus 'core' outcomes. Once core outcomes have been agreed upon it is important to determine how they should be measured. The truth, discrimination, and feasibility assessment framework will assess the quality of potential outcome measures. High quality outcome measures will be associated with core outcomes. Mechanisms exist to disseminate and implement the resulting core outcome set within an international context. Embedding the core outcome set within future clinical trials, systematic reviews, and clinical practice guidelines could make a profound contribution to advancing the usefulness of research to inform clinical practice, enhance patient care, and improve maternal and offspring outcomes. The infrastructure created by developing a core outcome set for pre-eclampsia could be leveraged in other settings, for example selecting research priorities and clinical practice guideline development. PROSPECTIVE REGISTRATION: [1] Core Outcome Measures in Effectiveness Trials (COMET) registration number: 588. [2] International Prospective Register of Systematic Reviews (PROSPERO) registration number: CRD42015015529. Copyright © 2016 International Society for the Study of Hypertension in Pregnancy. Published by Elsevier B.V. All rights reserved.
Metagenomics of urban sewage identifies an extensively shared antibiotic resistome in China.
Su, Jian-Qiang; An, Xin-Li; Li, Bing; Chen, Qing-Lin; Gillings, Michael R; Chen, Hong; Zhang, Tong; Zhu, Yong-Guan
2017-07-19
Antibiotic-resistant pathogens are challenging treatment of infections worldwide. Urban sewage is potentially a major conduit for dissemination of antibiotic resistance genes into various environmental compartments. However, the diversity and abundance of such genes in wastewater are not well known. Here, seasonal and geographical distributions of antibiotic resistance genes and their host bacterial communities from Chinese urban sewage were characterized, using metagenomic analyses and 16S rRNA gene-based Illumina sequencing, respectively. In total, 381 different resistance genes were detected, and these genes were extensively shared across China, with no geographical clustering. Seasonal variation in abundance of resistance genes was observed, with average concentrations of 3.27 × 10 11 and 1.79 × 10 12 copies/L in summer and winter, respectively. Bacterial communities did not exhibit geographical clusters, but did show a significant distance-decay relationship (P < 0.01). The core, shared resistome accounted for 57.7% of the total resistance genes, and was significantly associated with the core microbial community (P < 0.01). The core human gut microbiota was also strongly associated with the shared resistome, demonstrating the potential contribution of human gut microbiota to the dissemination of resistance elements via sewage disposal. This study provides a baseline for investigating environmental dissemination of resistance elements and raises the possibility of using the abundance of resistance genes in sewage as a tool for antibiotic stewardship.
Evolutionary Dynamics of Small RNAs in 27 Escherichia coli and Shigella Genomes
Skippington, Elizabeth; Ragan, Mark A.
2012-01-01
Small RNAs (sRNAs) are widespread in bacteria and play critical roles in regulating physiological processes. They are best characterized in Escherichia coli K-12 MG1655, where 83 sRNAs constitute nearly 2% of the gene complement. Most sRNAs act by base pairing with a target mRNA, modulating its translation and/or stability; many of these RNAs share only limited complementarity to their mRNA target, and require the chaperone Hfq to facilitate base pairing. Little is known about the evolutionary dynamics of bacterial sRNAs. Here, we apply phylogenetic and network analyses to investigate the evolutionary processes and principles that govern sRNA gene distribution in 27 E. coli and Shigella genomes. We identify core (encoded in all 27 genomes) and variable sRNAs; more than two-thirds of the E. coli K-12 MG1655 sRNAs are core, whereas the others show patterns of presence and absence that are principally due to genetic loss, not duplication or lateral genetic transfer. We present evidence that variable sRNAs are less tightly integrated into cellular genetic regulatory networks than are the core sRNAs, and that Hfq facilitates posttranscriptional cross talk between the E. coli–Shigella core and variable genomes. Finally, we present evidence that more than 80% of genes targeted by Hfq-associated core sRNAs have been transferred within the E. coli–Shigella clade, and that most of these genes have been transferred intact. These results suggest that Hfq and sRNAs help integrate laterally acquired genes into established regulatory networks. PMID:22223756
González, Carolina; Lazcano, Marcelo; Valdés, Jorge; Holmes, David S.
2016-01-01
Using phylogenomic and gene compositional analyses, five highly conserved gene families have been detected in the core genome of the phylogenetically coherent genus Acidithiobacillus of the class Acidithiobacillia. These core gene families are absent in the closest extant genus Thermithiobacillus tepidarius that subtends the Acidithiobacillus genus and roots the deepest in this class. The predicted proteins encoded by these core gene families are not detected by a BLAST search in the NCBI non-redundant database of more than 90 million proteins using a relaxed cut-off of 1.0e−5. None of the five families has a clear functional prediction. However, bioinformatic scrutiny, using pI prediction, motif/domain searches, cellular location predictions, genomic context analyses, and chromosome topology studies together with previously published transcriptomic and proteomic data, suggests that some may have functions associated with membrane remodeling during cell division perhaps in response to pH stress. Despite the high level of amino acid sequence conservation within each family, there is sufficient nucleotide variation of the respective genes to permit the use of the DNA sequences to distinguish different species of Acidithiobacillus, making them useful additions to the armamentarium of tools for phylogenetic analysis. Since the protein families are unique to the Acidithiobacillus genus, they can also be leveraged as probes to detect the genus in environmental metagenomes and metatranscriptomes, including industrial biomining operations, and acid mine drainage (AMD). PMID:28082953
González, Carolina; Lazcano, Marcelo; Valdés, Jorge; Holmes, David S
2016-01-01
Using phylogenomic and gene compositional analyses, five highly conserved gene families have been detected in the core genome of the phylogenetically coherent genus Acidithiobacillus of the class Acidithiobacillia . These core gene families are absent in the closest extant genus Thermithiobacillus tepidarius that subtends the Acidithiobacillus genus and roots the deepest in this class. The predicted proteins encoded by these core gene families are not detected by a BLAST search in the NCBI non-redundant database of more than 90 million proteins using a relaxed cut-off of 1.0e -5 . None of the five families has a clear functional prediction. However, bioinformatic scrutiny, using pI prediction, motif/domain searches, cellular location predictions, genomic context analyses, and chromosome topology studies together with previously published transcriptomic and proteomic data, suggests that some may have functions associated with membrane remodeling during cell division perhaps in response to pH stress. Despite the high level of amino acid sequence conservation within each family, there is sufficient nucleotide variation of the respective genes to permit the use of the DNA sequences to distinguish different species of Acidithiobacillus , making them useful additions to the armamentarium of tools for phylogenetic analysis. Since the protein families are unique to the Acidithiobacillus genus, they can also be leveraged as probes to detect the genus in environmental metagenomes and metatranscriptomes, including industrial biomining operations, and acid mine drainage (AMD).
Random forests-based differential analysis of gene sets for gene expression data.
Hsueh, Huey-Miin; Zhou, Da-Wei; Tsai, Chen-An
2013-04-10
In DNA microarray studies, gene-set analysis (GSA) has become the focus of gene expression data analysis. GSA utilizes the gene expression profiles of functionally related gene sets in Gene Ontology (GO) categories or priori-defined biological classes to assess the significance of gene sets associated with clinical outcomes or phenotypes. Many statistical approaches have been proposed to determine whether such functionally related gene sets express differentially (enrichment and/or deletion) in variations of phenotypes. However, little attention has been given to the discriminatory power of gene sets and classification of patients. In this study, we propose a method of gene set analysis, in which gene sets are used to develop classifications of patients based on the Random Forest (RF) algorithm. The corresponding empirical p-value of an observed out-of-bag (OOB) error rate of the classifier is introduced to identify differentially expressed gene sets using an adequate resampling method. In addition, we discuss the impacts and correlations of genes within each gene set based on the measures of variable importance in the RF algorithm. Significant classifications are reported and visualized together with the underlying gene sets and their contribution to the phenotypes of interest. Numerical studies using both synthesized data and a series of publicly available gene expression data sets are conducted to evaluate the performance of the proposed methods. Compared with other hypothesis testing approaches, our proposed methods are reliable and successful in identifying enriched gene sets and in discovering the contributions of genes within a gene set. The classification results of identified gene sets can provide an valuable alternative to gene set testing to reveal the unknown, biologically relevant classes of samples or patients. In summary, our proposed method allows one to simultaneously assess the discriminatory ability of gene sets and the importance of genes for interpretation of data in complex biological systems. The classifications of biologically defined gene sets can reveal the underlying interactions of gene sets associated with the phenotypes, and provide an insightful complement to conventional gene set analyses. Copyright © 2012 Elsevier B.V. All rights reserved.
ITEP: an integrated toolkit for exploration of microbial pan-genomes.
Benedict, Matthew N; Henriksen, James R; Metcalf, William W; Whitaker, Rachel J; Price, Nathan D
2014-01-03
Comparative genomics is a powerful approach for studying variation in physiological traits as well as the evolution and ecology of microorganisms. Recent technological advances have enabled sequencing large numbers of related genomes in a single project, requiring computational tools for their integrated analysis. In particular, accurate annotations and identification of gene presence and absence are critical for understanding and modeling the cellular physiology of newly sequenced genomes. Although many tools are available to compare the gene contents of related genomes, new tools are necessary to enable close examination and curation of protein families from large numbers of closely related organisms, to integrate curation with the analysis of gain and loss, and to generate metabolic networks linking the annotations to observed phenotypes. We have developed ITEP, an Integrated Toolkit for Exploration of microbial Pan-genomes, to curate protein families, compute similarities to externally-defined domains, analyze gene gain and loss, and generate draft metabolic networks from one or more curated reference network reconstructions in groups of related microbial species among which the combination of core and variable genes constitute the their "pan-genomes". The ITEP toolkit consists of: (1) a series of modular command-line scripts for identification, comparison, curation, and analysis of protein families and their distribution across many genomes; (2) a set of Python libraries for programmatic access to the same data; and (3) pre-packaged scripts to perform common analysis workflows on a collection of genomes. ITEP's capabilities include de novo protein family prediction, ortholog detection, analysis of functional domains, identification of core and variable genes and gene regions, sequence alignments and tree generation, annotation curation, and the integration of cross-genome analysis and metabolic networks for study of metabolic network evolution. ITEP is a powerful, flexible toolkit for generation and curation of protein families. ITEP's modular design allows for straightforward extension as analysis methods and tools evolve. By integrating comparative genomics with the development of draft metabolic networks, ITEP harnesses the power of comparative genomics to build confidence in links between genotype and phenotype and helps disambiguate gene annotations when they are evaluated in both evolutionary and metabolic network contexts.
Dutta, B; Pusztai, L; Qi, Y; André, F; Lazar, V; Bianchini, G; Ueno, N; Agarwal, R; Wang, B; Shiang, C Y; Hortobagyi, G N; Mills, G B; Symmans, W F; Balázsi, G
2012-01-01
Background: The rapid collection of diverse genome-scale data raises the urgent need to integrate and utilise these resources for biological discovery or biomedical applications. For example, diverse transcriptomic and gene copy number variation data are currently collected for various cancers, but relatively few current methods are capable to utilise the emerging information. Methods: We developed and tested a data-integration method to identify gene networks that drive the biology of breast cancer clinical subtypes. The method simultaneously overlays gene expression and gene copy number data on protein–protein interaction, transcriptional-regulatory and signalling networks by identifying coincident genomic and transcriptional disturbances in local network neighborhoods. Results: We identified distinct driver-networks for each of the three common clinical breast cancer subtypes: oestrogen receptor (ER)+, human epidermal growth factor receptor 2 (HER2)+, and triple receptor-negative breast cancers (TNBC) from patient and cell line data sets. Driver-networks inferred from independent datasets were significantly reproducible. We also confirmed the functional relevance of a subset of randomly selected driver-network members for TNBC in gene knockdown experiments in vitro. We found that TNBC driver-network members genes have increased functional specificity to TNBC cell lines and higher functional sensitivity compared with genes selected by differential expression alone. Conclusion: Clinical subtype-specific driver-networks identified through data integration are reproducible and functionally important. PMID:22343619
Molinier, Cécile; Reisser, Céline M.O.; Fields, Peter; Ségard, Adeline; Galimov, Yan; Haag, Christoph R.
2018-01-01
Daphnia reproduce by cyclic-parthenogenesis, where phases of asexual reproduction are intermitted by sexual production of diapause stages. This life cycle, together with environmental sex determination, allow the comparison of gene expression between genetically identical males and females. We investigated gene expression differences between males and females in four genotypes of Daphnia magna and compared the results with published data on sex-biased gene expression in two other Daphnia species, each representing one of the major phylogenetic clades within the genus. We found that 42% of all annotated genes showed sex-biased expression in D. magna. This proportion is similar both to estimates from other Daphnia species as well as from species with genetic sex determination, suggesting that sex-biased expression is not reduced under environmental sex determination. Among 7453 single copy, one-to-one orthologs in the three Daphnia species, 707 consistently showed sex-biased expression and 675 were biased in the same direction in all three species. Hence these genes represent a core-set of genes with consistent sex-differential expression in the genus. A functional analysis identified that several of them are involved in known sex determination pathways. Moreover, 75% were overexpressed in females rather than males, a pattern that appears to be a general feature of sex-biased gene expression in Daphnia. PMID:29535148
Molinier, Cécile; Reisser, Céline M O; Fields, Peter; Ségard, Adeline; Galimov, Yan; Haag, Christoph R
2018-05-04
Daphnia reproduce by cyclic-parthenogenesis, where phases of asexual reproduction are intermitted by sexual production of diapause stages. This life cycle, together with environmental sex determination, allow the comparison of gene expression between genetically identical males and females. We investigated gene expression differences between males and females in four genotypes of Daphnia magna and compared the results with published data on sex-biased gene expression in two other Daphnia species, each representing one of the major phylogenetic clades within the genus. We found that 42% of all annotated genes showed sex-biased expression in D. magna This proportion is similar both to estimates from other Daphnia species as well as from species with genetic sex determination, suggesting that sex-biased expression is not reduced under environmental sex determination. Among 7453 single copy, one-to-one orthologs in the three Daphnia species, 707 consistently showed sex-biased expression and 675 were biased in the same direction in all three species. Hence these genes represent a core-set of genes with consistent sex-differential expression in the genus. A functional analysis identified that several of them are involved in known sex determination pathways. Moreover, 75% were overexpressed in females rather than males, a pattern that appears to be a general feature of sex-biased gene expression in Daphnia . Copyright © 2018 Molinier et al.
Richards, Stephen; Liu, Yue; Bettencourt, Brian R.; Hradecky, Pavel; Letovsky, Stan; Nielsen, Rasmus; Thornton, Kevin; Hubisz, Melissa J.; Chen, Rui; Meisel, Richard P.; Couronne, Olivier; Hua, Sujun; Smith, Mark A.; Zhang, Peili; Liu, Jing; Bussemaker, Harmen J.; van Batenburg, Marinus F.; Howells, Sally L.; Scherer, Steven E.; Sodergren, Erica; Matthews, Beverly B.; Crosby, Madeline A.; Schroeder, Andrew J.; Ortiz-Barrientos, Daniel; Rives, Catharine M.; Metzker, Michael L.; Muzny, Donna M.; Scott, Graham; Steffen, David; Wheeler, David A.; Worley, Kim C.; Havlak, Paul; Durbin, K. James; Egan, Amy; Gill, Rachel; Hume, Jennifer; Morgan, Margaret B.; Miner, George; Hamilton, Cerissa; Huang, Yanmei; Waldron, Lenée; Verduzco, Daniel; Clerc-Blankenburg, Kerstin P.; Dubchak, Inna; Noor, Mohamed A.F.; Anderson, Wyatt; White, Kevin P.; Clark, Andrew G.; Schaeffer, Stephen W.; Gelbart, William; Weinstock, George M.; Gibbs, Richard A.
2005-01-01
We have sequenced the genome of a second Drosophila species, Drosophila pseudoobscura, and compared this to the genome sequence of Drosophila melanogaster, a primary model organism. Throughout evolution the vast majority of Drosophila genes have remained on the same chromosome arm, but within each arm gene order has been extensively reshuffled, leading to a minimum of 921 syntenic blocks shared between the species. A repetitive sequence is found in the D. pseudoobscura genome at many junctions between adjacent syntenic blocks. Analysis of this novel repetitive element family suggests that recombination between offset elements may have given rise to many paracentric inversions, thereby contributing to the shuffling of gene order in the D. pseudoobscura lineage. Based on sequence similarity and synteny, 10,516 putative orthologs have been identified as a core gene set conserved over 25–55 million years (Myr) since the pseudoobscura/melanogaster divergence. Genes expressed in the testes had higher amino acid sequence divergence than the genome-wide average, consistent with the rapid evolution of sex-specific proteins. Cis-regulatory sequences are more conserved than random and nearby sequences between the species—but the difference is slight, suggesting that the evolution of cis-regulatory elements is flexible. Overall, a pattern of repeat-mediated chromosomal rearrangement, and high coadaptation of both male genes and cis-regulatory sequences emerges as important themes of genome divergence between these species of Drosophila. PMID:15632085
Widespread Enhancer Activity from Core Promoters.
Medina-Rivera, Alejandra; Santiago-Algarra, David; Puthier, Denis; Spicuglia, Salvatore
2018-06-01
Gene expression in higher eukaryotes is precisely regulated in time and space through the interplay between promoters and gene-distal regulatory regions, known as enhancers. The original definition of enhancers implies the ability to activate gene expression remotely, while promoters entail the capability to locally induce gene expression. Despite the conventional distinction between them, promoters and enhancers share many genomic and epigenomic features. One intriguing finding in the gene regulation field comes from the observation that many core promoter regions display enhancer activity. Recent high-throughput reporter assays along with clustered regularly interspaced short palindromic repeats (CRISPR)/Cas9-related approaches have indicated that this phenomenon is common and might have a strong impact on our global understanding of genome organisation and gene expression regulation. Copyright © 2018 Elsevier Ltd. All rights reserved.
Comparing Patterns of Natural Selection across Species Using Selective Signatures
Shapiro, B. Jesse; Alm, Eric J
2008-01-01
Comparing gene expression profiles over many different conditions has led to insights that were not obvious from single experiments. In the same way, comparing patterns of natural selection across a set of ecologically distinct species may extend what can be learned from individual genome-wide surveys. Toward this end, we show how variation in protein evolutionary rates, after correcting for genome-wide effects such as mutation rate and demographic factors, can be used to estimate the level and types of natural selection acting on genes across different species. We identify unusually rapidly and slowly evolving genes, relative to empirically derived genome-wide and gene family-specific background rates for 744 core protein families in 30 γ-proteobacterial species. We describe the pattern of fast or slow evolution across species as the “selective signature” of a gene. Selective signatures represent a profile of selection across species that is predictive of gene function: pairs of genes with correlated selective signatures are more likely to share the same cellular function, and genes in the same pathway can evolve in concert. For example, glycolysis and phenylalanine metabolism genes evolve rapidly in Idiomarina loihiensis, mirroring an ecological shift in carbon source from sugars to amino acids. In a broader context, our results suggest that the genomic landscape is organized into functional modules even at the level of natural selection, and thus it may be easier than expected to understand the complex evolutionary pressures on a cell. PMID:18266472
Remiche, Gauthier; Kadhim, Hazim; Abramowicz, Marc; Mavroudakis, Nicolas; Monnier, Nicole; Lunardi, Joël
2015-05-01
We report a novel and particularly unusual type of mutation, namely, large deletion in the RYR1 gene, in a Belgian family with myopathy: Patients were found to be compound heterozygous and presented a clinico-pathological phenotype characterized by late-onset and recessive myopathy with cores. We depict the clinical, electrophysiological, pathological and molecular genetic characteristics of family members. To date, large deletions in the RYR1 gene have been reported in only two cases. Both involved different mutations and, in sharp contrast to our cases, presented with a very early-onset, neonatal, and a very severe or lethal phenotype. Overview of reported clinico-pathologic phenotypes, also highlights the rarity of combined late-onset/recessive co-occurrence in this group of myopathies with cores. Finally, this report underlines the broadening spectrum in this group of myopathologic disorders and highlights the concept of 'RYR1-associated/related core myopathies'. Copyright © 2015 Elsevier B.V. All rights reserved.
Kaech Moll, Veronika M; Escorpizo, Reuben; Portmann Bergamaschi, Ruth; Finger, Monika E
2016-08-01
The Comprehensive ICF Core Set for vocational rehabilitation (VR) is a list of essential categories on functioning based on the World Health Organization (WHO) International Classification of Functioning, Disability and Health (ICF), which describes a standard for interdisciplinary assessment, documentation, and communication in VR. The aim of this study was to examine the content validity of the Comprehensive ICF Core Set for VR from the perspective of physical therapists. A 3-round email survey was performed using the Delphi method. A convenience sample of international physical therapists working in VR with work experience of ≥2 years were asked to identify aspects they consider as relevant when evaluating or treating clients in VR. Responses were linked to the ICF categories and compared with the Comprehensive ICF Core Set for VR. Sixty-two physical therapists from all 6 WHO world regions responded with 3,917 statements that were subsequently linked to 338 ICF categories. Fifteen (17%) of the 90 categories in the Comprehensive ICF Core Set for VR were confirmed by the physical therapists in the sample. Twenty-two additional ICF categories were identified that were not included in the Comprehensive ICF Core Set for VR. Vocational rehabilitation in physical therapy is not well defined in every country and might have resulted in the small sample size. Therefore, the results cannot be generalized to all physical therapists practicing in VR. The content validity of the ICF Core Set for VR is insufficient from solely a physical therapist perspective. The results of this study could be used to define a physical therapy-specific set of ICF categories to develop and guide physical therapist clinical practice in VR. © 2016 American Physical Therapy Association.
Renom, Marta; Conrad, Andrea; Bascuñana, Helena; Cieza, Alarcos; Galán, Ingrid; Kesselring, Jürg; Coenen, Michaela
2014-11-01
The Comprehensive International Classification of Functioning, Disability and Health (ICF) Core Set for Multiple Sclerosis (MS) is a comprehensive framework to structure the information obtained in multidisciplinary clinical settings according to the biopsychosocial perspective of the International Classification of Functioning, Disability and Health (ICF) and to guide the treatment and rehabilitation process accordingly. It is now undergoing validation from the user perspective for which it has been developed in the first place. To validate the content of the Comprehensive ICF Core Set for MS from the perspective of speech and language therapists (SLTs) involved in the treatment of persons with MS (PwMS). Within a three-round e-mail-based Delphi Study 34 SLTs were asked about PwMS' problems, resources and aspects of the environment treated by SLTs. Responses were linked to ICF categories. Identified ICF categories were compared with those included in the Comprehensive ICF Core Set for MS to examine its content validity. Thirty-four SLTs named 524 problems and resources, as well as aspects of environment. Statements were linked to 129 ICF categories (60 Body-functions categories, two Body-structures categories, 42 Activities-&-participation categories, and 25 Environmental-factors categories). SLTs confirmed 46 categories in the Comprehensive ICF Core Set. Twenty-one ICF categories were identified as not-yet-included categories. This study contributes to the content validity of the Comprehensive ICF Core Set for MS from the perspective of SLTs. Study participants agreed on a few not-yet-included categories that should be further discussed for inclusion in a revised version of the Comprehensive ICF Core Set to strengthen SLTs' perspective in PwMS' neurorehabilitation. © 2014 Royal College of Speech and Language Therapists.
McIntyre, Anne; Tempest, Stephanie
2007-09-30
The International Classification of Functioning, Disability and Health (ICF) has been received favourably by health care professionals, disability rights organizations and proponents of the social model of disability. The success of the ICF largely depends on its uptake in practice and is considered unwieldy in its full format. To enhance the application of the ICF in practice, disease and site-specific core sets have been developed. The objective of this paper is to stimulate thought and discussion about the place of the ICF core sets in rehabilitation practice. The authors' review of the literature uses the ICF core sets (especially stroke), to debate if the ICF is at risk of taking two steps forward, one step back in its holistic portrayal of health. ICF disease specific core sets could be seen as taking two steps forward to enhance the user friendliness of the ICF and evidence-based practice in rehabilitation. However, there is a danger of taking one step back in reverting to a disease-specific classification. It is too early to conclude the efficacy of the disease-specific core sets, but there is an opportunity to debate where the next steps may lead.
An Independent Filter for Gene Set Testing Based on Spectral Enrichment.
Frost, H Robert; Li, Zhigang; Asselbergs, Folkert W; Moore, Jason H
2015-01-01
Gene set testing has become an indispensable tool for the analysis of high-dimensional genomic data. An important motivation for testing gene sets, rather than individual genomic variables, is to improve statistical power by reducing the number of tested hypotheses. Given the dramatic growth in common gene set collections, however, testing is often performed with nearly as many gene sets as underlying genomic variables. To address the challenge to statistical power posed by large gene set collections, we have developed spectral gene set filtering (SGSF), a novel technique for independent filtering of gene set collections prior to gene set testing. The SGSF method uses as a filter statistic the p-value measuring the statistical significance of the association between each gene set and the sample principal components (PCs), taking into account the significance of the associated eigenvalues. Because this filter statistic is independent of standard gene set test statistics under the null hypothesis but dependent under the alternative, the proportion of enriched gene sets is increased without impacting the type I error rate. As shown using simulated and real gene expression data, the SGSF algorithm accurately filters gene sets unrelated to the experimental outcome resulting in significantly increased gene set testing power.
Fackrell, Kathryn; Smith, Harriet; Colley, Veronica; Thacker, Brian; Horobin, Adele; Haider, Haúla F; Londero, Alain; Mazurek, Birgit; Hall, Deborah A
2017-08-23
The reporting of outcomes in clinical trials of subjective tinnitus indicates that many different tinnitus-related complaints are of interest to investigators, from perceptual attributes of the sound (e.g. loudness) to psychosocial impacts (e.g. quality of life). Even when considering one type of intervention strategy for subjective tinnitus, there is no agreement about what is critically important for deciding whether a treatment is effective. The main purpose of this observational study is, therefore to, develop Core Outcome Domain Sets for the three different intervention strategies (sound, psychological, and pharmacological) for adults with chronic subjective tinnitus that should be measured and reported in every clinical trial of these interventions. Secondary objectives are to identify the strengths and limitations of our study design for recruiting and reducing attrition of participants, and to explore uptake of the core outcomes. The 'Core Outcome Measures in Tinnitus: International Delphi' (COMIT'ID) study will use a mixed-methods approach that incorporates input from health care users at the pre-Delphi stage, a modified three-round Delphi survey and final consensus meetings (one for each intervention). The meetings will generate recommendations by stakeholder representatives on agreed Core Outcome Domain Sets specific to each intervention. A subsequent step will establish a common cross-cutting Core Outcome Domain Set by identifying the common outcome domains included in all three intervention-specific Core Outcome Domain Sets. To address the secondary objectives, we will gather feedback from participants about their experience of taking part in the Delphi process. We aspire to conduct an observational cohort study to evaluate uptake of the core outcomes in published studies at 7 years following Core Outcome Set publication. The COMIT'ID study aims to develop a Core Outcome Domain Set that is agreed as critically important for deciding whether a treatment for subjective tinnitus is effective. Such a recommendation would help to standardise future clinical trials worldwide and so we will determine if participation increases use of the Core Outcome Set in the long term. This project has been registered (November 2014) in the database of the Core Outcome Measures in Effectiveness Trials (COMET) initiative.
van der Stap, Djamilla K.D.; Rider, Lisa G.; Alexanderson, Helene; Huber, Adam M.; Gualano, Bruno; Gordon, Patrick; van der Net, Janjaap; Mathiesen, Pernille; Johnson, Liam G.; Ernste, Floranne C.; Feldman, Brian M.; Houghton, Kristin M.; Singh-Grewal, Davinder; Kutzbach, Abraham Garcia; Munters, Li Alemo; Takken, Tim
2015-01-01
OBJECTIVES Currently there are no evidence-based recommendations regarding which fitness and strength tests to use for patients with childhood or adult idiopathic inflammatory myopathies (IIM). This hinders clinicians and researchers in choosing the appropriate fitness- or muscle strength-related outcome measures for these patients. Through a Delphi survey, we aimed to identify a candidate core-set of fitness and strength tests for children and adults with IIM. METHODS Fifteen experts participated in a Delphi survey that consisted of five stages to achieve a consensus. Using an extensive search of published literature and through the expertise of the experts, a candidate core-set based on expert opinion and clinimetric properties was developed. Members of the International Myositis Assessment and Clinical Studies Group (IMACS) were invited to review this candidate core-set during the final stage, which led to a final candidate core-set. RESULTS A core-set of fitness- and strength-related outcome measures was identified for children and adults with IIM. For both children and adults, different tests were identified and selected for maximal aerobic fitness, submaximal aerobic fitness, anaerobic fitness, muscle strength tests and muscle function tests. CONCLUSIONS The core-set of fitness and strength-related outcome measures provided by this expert consensus process will assist practitioners and researchers in deciding which tests to use in IIM patients. This will improve the uniformity of fitness and strength tests across studies, thereby facilitating the comparison of study results and therapeutic exercise program outcomes among patients with IIM. PMID:26568594
Hirasaki, Masataka; Hiraki-Kamon, Keiko; Kamon, Masayoshi; Suzuki, Ayumu; Katano, Miyuki; Nishimoto, Masazumi; Okuda, Akihiko
2013-01-01
Predominant transcriptional subnetworks called Core, Myc, and PRC modules have been shown to participate in preservation of the pluripotency and self-renewality of embryonic stem cells (ESCs). Epiblast stem cells (EpiSCs) are another cell type that possesses pluripotency and self-renewality. However, the roles of these modules in EpiSCs have not been systematically examined to date. Here, we compared the average expression levels of Core, Myc, and PRC module genes between ESCs and EpiSCs. EpiSCs showed substantially higher and lower expression levels of PRC and Core module genes, respectively, compared with those in ESCs, while Myc module members showed almost equivalent levels of average gene expression. Subsequent analyses revealed that the similarity in gene expression levels of the Myc module between these two cell types was not just overall, but striking similarities were evident even when comparing the expression of individual genes. We also observed equivalent levels of similarity in the expression of individual Myc module genes between induced pluripotent stem cells (iPSCs) and partial iPSCs that are an unwanted byproduct generated during iPSC induction. Moreover, our data demonstrate that partial iPSCs depend on a high level of c-Myc expression for their self-renewal properties. PMID:24386274
Frédéric, Melissa Y; Lundin, Victor F; Whiteside, Matthew D; Cueva, Juan G; Tu, Domena K; Kang, S Y Catherine; Singh, Hansmeet; Baillie, David L; Hutter, Harald; Goodman, Miriam B; Brinkman, Fiona S L; Leroux, Michel R
2013-01-01
The evolution of metazoans from their choanoflagellate-like unicellular ancestor coincided with the acquisition of novel biological functions to support a multicellular lifestyle, and eventually, the unique cellular and physiological demands of differentiated cell types such as those forming the nervous, muscle and immune systems. In an effort to understand the molecular underpinnings of such metazoan innovations, we carried out a comparative genomics analysis for genes found exclusively in, and widely conserved across, metazoans. Using this approach, we identified a set of 526 core metazoan-specific genes (the 'metazoanome'), approximately 10% of which are largely uncharacterized, 16% of which are associated with known human disease, and 66% of which are conserved in Trichoplax adhaerens, a basal metazoan lacking neurons and other specialized cell types. Global analyses of previously-characterized core metazoan genes suggest a prevalent property, namely that they act as partially redundant modifiers of ancient eukaryotic pathways. Our data also highlights the importance of exaptation of pre-existing genetic tools during metazoan evolution. Expression studies in C. elegans revealed that many metazoan-specific genes, including tubulin folding cofactor E-like (TBCEL/coel-1), are expressed in neurons. We used C. elegans COEL-1 as a representative to experimentally validate the metazoan-specific character of our dataset. We show that coel-1 disruption results in developmental hypersensitivity to the microtubule drug paclitaxel/taxol, and that overexpression of coel-1 has broad effects during embryonic development and perturbs specialized microtubules in the touch receptor neurons (TRNs). In addition, coel-1 influences the migration, neurite outgrowth and mechanosensory function of the TRNs, and functionally interacts with components of the tubulin acetylation/deacetylation pathway. Together, our findings unveil a conserved molecular toolbox fundamental to metazoan biology that contains a number of neuronally expressed and disease-related genes, and reveal a key role for TBCEL/coel-1 in regulating microtubule function during metazoan development and neuronal differentiation.
Yutin, Natalya; Raoult, Didier; Koonin, Eugene V
2013-05-23
Recent advances of genomics and metagenomics reveal remarkable diversity of viruses and other selfish genetic elements. In particular, giant viruses have been shown to possess their own mobilomes that include virophages, small viruses that parasitize on giant viruses of the Mimiviridae family, and transpovirons, distinct linear plasmids. One of the virophages known as the Mavirus, a parasite of the giant Cafeteria roenbergensis virus, shares several genes with large eukaryotic self-replicating transposon of the Polinton (Maverick) family, and it has been proposed that the polintons evolved from a Mavirus-like ancestor. We performed a comprehensive phylogenomic analysis of the available genomes of virophages and traced the evolutionary connections between the virophages and other selfish genetic elements. The comparison of the gene composition and genome organization of the virophages reveals 6 conserved, core genes that are organized in partially conserved arrays. Phylogenetic analysis of those core virophage genes, for which a sufficient diversity of homologs outside the virophages was detected, including the maturation protease and the packaging ATPase, supports the monophyly of the virophages. The results of this analysis appear incompatible with the origin of polintons from a Mavirus-like agent but rather suggest that Mavirus evolved through recombination between a polinton and an unknown virus. Altogether, virophages, polintons, a distinct Tetrahymena transposable element Tlr1, transpovirons, adenoviruses, and some bacteriophages form a network of evolutionary relationships that is held together by overlapping sets of shared genes and appears to represent a distinct module in the vast total network of viruses and mobile elements. The results of the phylogenomic analysis of the virophages and related genetic elements are compatible with the concept of network-like evolution of the virus world and emphasize multiple evolutionary connections between bona fide viruses and other classes of capsid-less mobile elements.
2013-01-01
Background Recent advances of genomics and metagenomics reveal remarkable diversity of viruses and other selfish genetic elements. In particular, giant viruses have been shown to possess their own mobilomes that include virophages, small viruses that parasitize on giant viruses of the Mimiviridae family, and transpovirons, distinct linear plasmids. One of the virophages known as the Mavirus, a parasite of the giant Cafeteria roenbergensis virus, shares several genes with large eukaryotic self-replicating transposon of the Polinton (Maverick) family, and it has been proposed that the polintons evolved from a Mavirus-like ancestor. Results We performed a comprehensive phylogenomic analysis of the available genomes of virophages and traced the evolutionary connections between the virophages and other selfish genetic elements. The comparison of the gene composition and genome organization of the virophages reveals 6 conserved, core genes that are organized in partially conserved arrays. Phylogenetic analysis of those core virophage genes, for which a sufficient diversity of homologs outside the virophages was detected, including the maturation protease and the packaging ATPase, supports the monophyly of the virophages. The results of this analysis appear incompatible with the origin of polintons from a Mavirus-like agent but rather suggest that Mavirus evolved through recombination between a polinton and an unknownvirus. Altogether, virophages, polintons, a distinct Tetrahymena transposable element Tlr1, transpovirons, adenoviruses, and some bacteriophages form a network of evolutionary relationships that is held together by overlapping sets of shared genes and appears to represent a distinct module in the vast total network of viruses and mobile elements. Conclusions The results of the phylogenomic analysis of the virophages and related genetic elements are compatible with the concept of network-like evolution of the virus world and emphasize multiple evolutionary connections between bona fide viruses and other classes of capsid-less mobile elements. PMID:23701946
NASA Astrophysics Data System (ADS)
Martin, Jan M. L.; Sundermann, Andreas
2001-02-01
We propose large-core correlation-consistent (cc) pseudopotential basis sets for the heavy p-block elements Ga-Kr and In-Xe. The basis sets are of cc-pVTZ and cc-pVQZ quality, and have been optimized for use with the large-core (valence-electrons only) Stuttgart-Dresden-Bonn (SDB) relativistic pseudopotentials. Validation calculations on a variety of third-row and fourth-row diatomics suggest them to be comparable in quality to the all-electron cc-pVTZ and cc-pVQZ basis sets for lighter elements. Especially the SDB-cc-pVQZ basis set in conjunction with a core polarization potential (CPP) yields excellent agreement with experiment for compounds of the later heavy p-block elements. For accurate calculations on Ga (and, to a lesser extent, Ge) compounds, explicit treatment of 13 valence electrons appears to be desirable, while it seems inevitable for In compounds. For Ga and Ge, we propose correlation consistent basis sets extended for (3d) correlation. For accurate calculations on organometallic complexes of interest to homogenous catalysis, we recommend a combination of the standard cc-pVTZ basis set for first- and second-row elements, the presently derived SDB-cc-pVTZ basis set for heavier p-block elements, and for transition metals, the small-core [6s5p3d] Stuttgart-Dresden basis set-relativistic effective core potential combination supplemented by (2f1g) functions with exponents given in the Appendix to the present paper.
Huang, Chun-Kai; Sie, Yi-Syuan; Chen, Yu-Fu; Huang, Tian-Sheng; Lu, Chung-An
2016-04-12
The exon junction complex (EJC), which contains four core components, eukaryotic initiation factor 4AIII (eIF4AIII), MAGO/NASHI (MAGO), Y14/Tsunagi/RNA-binding protein 8A, and Barentsz/Metastatic lymph node 51, is formed in both nucleus and cytoplasm, and plays important roles in gene expression. Genes encoding core EJC components have been found in plants, including rice. Currently, the functional characterizations of MAGO and Y14 homologs have been demonstrated in rice. However, it is still unknown whether eIF4AIII is essential for the functional EJC in rice. This study investigated two DEAD box RNA helicases, OsRH2 and OsRH34, which are homologous to eIF4AIII, in rice. Amino acid sequence analysis indicated that OsRH2 and OsRH34 had 99 % identity and 100 % similarity, and their gene expression patterns were similar in various rice tissues, but the level of OsRH2 mRNA was about 58-fold higher than that of OsRH34 mRNA in seedlings. From bimolecular fluorescence complementation results, OsRH2 and OsRH34 interacted physically with OsMAGO1 and OsY14b, respectively, which indicated that both of OsRH2 and OsRH34 were core components of the EJC in rice. To study the biological roles of OsRH2 and OsRH34 in rice, transgenic rice plants were generated by RNA interference. The phenotypes of three independent OsRH2 and OsRH34 double-knockdown transgenic lines included dwarfism, a short internode distance, reproductive delay, defective embryonic development, and a low seed setting rate. These phenotypes resembled those of mutants with gibberellin-related developmental defects. In addition, the OsRH2 and OsRH34 double-knockdown transgenic lines exhibited the accumulation of unspliced rice UNDEVELOPED TAPETUM 1 mRNA. Rice contains two eIF4AIII paralogous genes, OsRH2 and OsRH34. The abundance of OsRH2 mRNA was about 58-fold higher than that of OsRH34 mRNA in seedlings, suggesting that the OsRH2 is major eIF4AIII in rice. Both OsRH2 and OsRH34 are core components of the EJC, and participate in regulating of plant height, pollen, and seed development in rice.
Down-weighting overlapping genes improves gene set analysis
2012-01-01
Background The identification of gene sets that are significantly impacted in a given condition based on microarray data is a crucial step in current life science research. Most gene set analysis methods treat genes equally, regardless how specific they are to a given gene set. Results In this work we propose a new gene set analysis method that computes a gene set score as the mean of absolute values of weighted moderated gene t-scores. The gene weights are designed to emphasize the genes appearing in few gene sets, versus genes that appear in many gene sets. We demonstrate the usefulness of the method when analyzing gene sets that correspond to the KEGG pathways, and hence we called our method Pathway Analysis with Down-weighting of Overlapping Genes (PADOG). Unlike most gene set analysis methods which are validated through the analysis of 2-3 data sets followed by a human interpretation of the results, the validation employed here uses 24 different data sets and a completely objective assessment scheme that makes minimal assumptions and eliminates the need for possibly biased human assessments of the analysis results. Conclusions PADOG significantly improves gene set ranking and boosts sensitivity of analysis using information already available in the gene expression profiles and the collection of gene sets to be analyzed. The advantages of PADOG over other existing approaches are shown to be stable to changes in the database of gene sets to be analyzed. PADOG was implemented as an R package available at: http://bioinformaticsprb.med.wayne.edu/PADOG/or http://www.bioconductor.org. PMID:22713124
Wong, Alex W K; Lau, Stephen C L; Fong, Mandy W M; Cella, David; Lai, Jin-Shei; Heinemann, Allen W
2018-04-03
To determine the extent to which the content of the Quality of Life in Neurological Disorders (Neuro-QoL) covers the International Classification of Functioning, Disability and Health (ICF) Core Sets for multiple sclerosis (MS), stroke, spinal cord injury (SCI), and traumatic brain injury (TBI) using summary linkage indicators. Content analysis by linking content of the Neuro-QoL to corresponding ICF codes of each Core Set for MS, stroke, SCI, and TBI. Three academic centers. None. None. Four summary linkage indicators proposed by MacDermid et al were estimated to compare the content coverage between Neuro-QoL and the ICF codes of Core Sets for MS, stroke, MS, and TBI. Neuro-QoL represented 20% to 30% Core Set codes for different conditions in which more codes in Core Sets for MS (29%), stroke (28%), and TBI (28%) were covered than those for SCI in the long-term (20%) and early postacute (19%) contexts. Neuro-QoL represented nearly half of the unique Activity and Participation codes (43%-49%) and less than one third of the unique Body Function codes (12%-32%). It represented fewer Environmental Factors codes (2%-6%) and no Body Structures codes. Absolute linkage indicators found that at least 60% of Neuro-QoL items were linked to Core Set codes (63%-95%), but many items covered the same codes as revealed by unique linkage indicators (7%-13%), suggesting high concept redundancy among items. The Neuro-QoL links more closely to ICF Core Sets for stroke, MS, and TBI than to those for SCI, and primarily covers activity and participation ICF domains. Other instruments are needed to address concepts not measured by the Neuro-QoL when a comprehensive health assessment is needed. Copyright © 2018 American Congress of Rehabilitation Medicine. Published by Elsevier Inc. All rights reserved.
Low back pain in 17 countries, a Rasch analysis of the ICF core set for low back pain.
Røe, Cecilie; Bautz-Holter, Erik; Cieza, Alarcos
2013-03-01
Previous studies indicate that a worldwide measurement tool may be developed based on the International Classification of Functioning Disability and Health (ICF) Core Sets for chronic conditions. The aim of the present study was to explore the possibility of constructing a cross-cultural measurement of functioning for patients with low back pain (LBP) on the basis of the Comprehensive ICF Core Set for LBP and to evaluate the properties of the ICF Core Set. The Comprehensive ICF Core Set for LBP was scored by health professionals for 972 patients with LBP from 17 countries. Qualifier levels of the categories, invariance across age, sex and countries, construct validity and the ordering of the categories in the components of body function, body structure, activities and participation were explored by Rasch analysis. The item-trait χ2-statistics showed that the 53 categories in the ICF Core Set for LBP did not fit the Rasch model (P<0.001). The main challenge was the invariance in the responses according to country. Analysis of the four countries with the largest sample sizes indicated that the data from Germany fit the Rasch model, and the data from Norway, Serbia and Kuwait in terms of the components of body functions and activities and participation also fit the model. The component of body functions and activity and participation had a negative mean location, -2.19 (SD 1.19) and -2.98 (SD 1.07), respectively. The negative location indicates that the ICF Core Set reflects patients with a lower level of function than the present patient sample. The present results indicate that it may be possible to construct a clinical measure of function on the basis of the Comprehensive ICF Core Set for LBP by calculating country-specific scores before pooling the data.
Novel gene sets improve set-level classification of prokaryotic gene expression data.
Holec, Matěj; Kuželka, Ondřej; Železný, Filip
2015-10-28
Set-level classification of gene expression data has received significant attention recently. In this setting, high-dimensional vectors of features corresponding to genes are converted into lower-dimensional vectors of features corresponding to biologically interpretable gene sets. The dimensionality reduction brings the promise of a decreased risk of overfitting, potentially resulting in improved accuracy of the learned classifiers. However, recent empirical research has not confirmed this expectation. Here we hypothesize that the reported unfavorable classification results in the set-level framework were due to the adoption of unsuitable gene sets defined typically on the basis of the Gene ontology and the KEGG database of metabolic networks. We explore an alternative approach to defining gene sets, based on regulatory interactions, which we expect to collect genes with more correlated expression. We hypothesize that such more correlated gene sets will enable to learn more accurate classifiers. We define two families of gene sets using information on regulatory interactions, and evaluate them on phenotype-classification tasks using public prokaryotic gene expression data sets. From each of the two gene-set families, we first select the best-performing subtype. The two selected subtypes are then evaluated on independent (testing) data sets against state-of-the-art gene sets and against the conventional gene-level approach. The novel gene sets are indeed more correlated than the conventional ones, and lead to significantly more accurate classifiers. The novel gene sets are indeed more correlated than the conventional ones, and lead to significantly more accurate classifiers. Novel gene sets defined on the basis of regulatory interactions improve set-level classification of gene expression data. The experimental scripts and other material needed to reproduce the experiments are available at http://ida.felk.cvut.cz/novelgenesets.tar.gz.
Aziz, Ramy K.; Dwivedi, Bhakti; Akhter, Sajia; Breitbart, Mya; Edwards, Robert A.
2015-01-01
Phages are the most abundant biological entities on Earth and play major ecological roles, yet the current sequenced phage genomes do not adequately represent their diversity, and little is known about the abundance and distribution of these sequenced genomes in nature. Although the study of phage ecology has benefited tremendously from the emergence of metagenomic sequencing, a systematic survey of phage genes and genomes in various ecosystems is still lacking, and fundamental questions about phage biology, lifestyle, and ecology remain unanswered. To address these questions and improve comparative analysis of phages in different metagenomes, we screened a core set of publicly available metagenomic samples for sequences related to completely sequenced phages using the web tool, Phage Eco-Locator. We then adopted and deployed an array of mathematical and statistical metrics for a multidimensional estimation of the abundance and distribution of phage genes and genomes in various ecosystems. Experiments using those metrics individually showed their usefulness in emphasizing the pervasive, yet uneven, distribution of known phage sequences in environmental metagenomes. Using these metrics in combination allowed us to resolve phage genomes into clusters that correlated with their genotypes and taxonomic classes as well as their ecological properties. We propose adding this set of metrics to current metaviromic analysis pipelines, where they can provide insight regarding phage mosaicism, habitat specificity, and evolution. PMID:26005436
Aziz, Ramy K.; Dwivedi, Bhakti; Akhter, Sajia; ...
2015-05-08
Phages are the most abundant biological entities on Earth and play major ecological roles, yet the current sequenced phage genomes do not adequately represent their diversity, and little is known about the abundance and distribution of these sequenced genomes in nature. Although the study of phage ecology has benefited tremendously from the emergence of metagenomic sequencing, a systematic survey of phage genes and genomes in various ecosystems is still lacking, and fundamental questions about phage biology, lifestyle, and ecology remain unanswered. To address these questions and improve comparative analysis of phages in different metagenomes, we screened a core set ofmore » publicly available metagenomic samples for sequences related to completely sequenced phages using the web tool, Phage Eco-Locator. We then adopted and deployed an array of mathematical and statistical metrics for a multidimensional estimation of the abundance and distribution of phage genes and genomes in various ecosystems. Experiments using those metrics individually showed their usefulness in emphasizing the pervasive, yet uneven, distribution of known phage sequences in environmental metagenomes. Using these metrics in combination allowed us to resolve phage genomes into clusters that correlated with their genotypes and taxonomic classes as well as their ecological properties. By adding this set of metrics to current metaviromic analysis pipelines, where they can provide insight regarding phage mosaicism, habitat specificity, and evolution.« less
DOE Office of Scientific and Technical Information (OSTI.GOV)
Aziz, Ramy K.; Dwivedi, Bhakti; Akhter, Sajia
Phages are the most abundant biological entities on Earth and play major ecological roles, yet the current sequenced phage genomes do not adequately represent their diversity, and little is known about the abundance and distribution of these sequenced genomes in nature. Although the study of phage ecology has benefited tremendously from the emergence of metagenomic sequencing, a systematic survey of phage genes and genomes in various ecosystems is still lacking, and fundamental questions about phage biology, lifestyle, and ecology remain unanswered. To address these questions and improve comparative analysis of phages in different metagenomes, we screened a core set ofmore » publicly available metagenomic samples for sequences related to completely sequenced phages using the web tool, Phage Eco-Locator. We then adopted and deployed an array of mathematical and statistical metrics for a multidimensional estimation of the abundance and distribution of phage genes and genomes in various ecosystems. Experiments using those metrics individually showed their usefulness in emphasizing the pervasive, yet uneven, distribution of known phage sequences in environmental metagenomes. Using these metrics in combination allowed us to resolve phage genomes into clusters that correlated with their genotypes and taxonomic classes as well as their ecological properties. By adding this set of metrics to current metaviromic analysis pipelines, where they can provide insight regarding phage mosaicism, habitat specificity, and evolution.« less
Narusaka, Yoshihiro; Nakashima, Kazuo; Shinwari, Zabta K; Sakuma, Yoh; Furihata, Takashi; Abe, Hiroshi; Narusaka, Mari; Shinozaki, Kazuo; Yamaguchi-Shinozaki, Kazuko
2003-04-01
Many abiotic stress-inducible genes contain two cis-acting elements, namely a dehydration-responsive element (DRE; TACCGACAT) and an ABA-responsive element (ABRE; ACGTGG/TC), in their promoter regions. We precisely analyzed the 120 bp promoter region (-174 to -55) of the Arabidopsis rd29A gene whose expression is induced by dehydration, high-salinity, low-temperature, and abscisic acid (ABA) treatments and whose 120 bp promoter region contains the DRE, DRE/CRT-core motif (A/GCCGAC), and ABRE sequences. Deletion and base substitution analyses of this region showed that the DRE-core motif functions as DRE and that the DRE/DRE-core motif could be a coupling element of ABRE. Gel mobility shift assays revealed that DRE-binding proteins (DREB1s/CBFs and DREB2s) bind to both DRE and the DRE-core motif and that ABRE-binding proteins (AREBs/ABFs) bind to ABRE in the 120 bp promoter region. In addition, transactivation experiments using Arabidopsis leaf protoplasts showed that DREBs and AREBs cumulatively transactivate the expression of a GUS reporter gene fused to the 120 bp promoter region of rd29A. These results indicate that DRE and ABRE are interdependent in the ABA-responsive expression of the rd29A gene in response to ABA in Arabidopsis.
Brookes, Sara T; Macefield, Rhiannon C; Williamson, Paula R; McNair, Angus G; Potter, Shelley; Blencowe, Natalie S; Strong, Sean; Blazeby, Jane M
2016-08-17
Methods for developing a core outcome or information set require involvement of key stakeholders to prioritise many items and achieve agreement as to the core set. The Delphi technique requires participants to rate the importance of items in sequential questionnaires (or rounds) with feedback provided in each subsequent round such that participants are able to consider the views of others. This study examines the impact of receiving feedback from different stakeholder groups, on the subsequent rating of items and the level of agreement between stakeholders. Randomized controlled trials were nested within the development of three core sets each including a Delphi process with two rounds of questionnaires, completed by patients and health professionals. Participants rated items from 1 (not essential) to 9 (absolutely essential). For round 2, participants were randomized to receive feedback from their peer stakeholder group only (peer) or both stakeholder groups separately (multiple). Decisions as to which items to retain following each round were determined by pre-specified criteria. Whilst type of feedback did not impact on the percentage of items for which a participant subsequently changed their rating, or the magnitude of change, it did impact on items retained at the end of round 2. Each core set contained discordant items retained by one feedback group but not the other (3-22 % discordant items). Consensus between patients and professionals in items to retain was greater amongst those receiving multiple group feedback in each core set (65-82 % agreement for peer-only feedback versus 74-94 % for multiple feedback). In addition, differences in round 2 scores were smaller between stakeholder groups receiving multiple feedback than between those receiving peer group feedback only. Variability in item scores across stakeholders was reduced following any feedback but this reduction was consistently greater amongst the multiple feedback group. In the development of a core outcome or information set, providing feedback within Delphi questionnaires from all stakeholder groups separately may influence the final core set and improve consensus between the groups. Further work is needed to better understand how participants rate and re-rate items within a Delphi process. The three randomized controlled trials reported here were each nested within the development of a core information or outcome set to investigate processes in core outcome and information set development. Outcomes were not health-related and therefore trial registration was not applicable.
Willcocks, Samuel J; Stabler, Richard A; Atkins, Helen S; Oyston, Petra F; Wren, Brendan W
2018-05-31
Yersinia pseudotuberculosis is a zoonotic pathogen, causing mild gastrointestinal infection in humans. From this comparatively benign pathogenic species emerged the highly virulent plague bacillus, Yersinia pestis, which has experienced significant genetic divergence in a relatively short time span. Much of our knowledge of Yersinia spp. evolution stems from genomic comparison and gene expression studies. Here we apply transposon-directed insertion site sequencing (TraDIS) to describe the essential gene set of Y. pseudotuberculosis IP32953 in optimised in vitro growth conditions, and contrast these with the published essential genes of Y. pestis. The essential genes of an organism are the core genetic elements required for basic survival processes in a given growth condition, and are therefore attractive targets for antimicrobials. One such gene we identified is yptb3665, which encodes a peptide deformylase, and here we report for the first time, the sensitivity of Y. pseudotuberculosis to actinonin, a deformylase inhibitor. Comparison of the essential genes of Y. pseudotuberculosis with those of Y. pestis revealed the genes whose importance are shared by both species, as well as genes that were differentially required for growth. In particular, we find that the two species uniquely rely upon different iron acquisition and respiratory metabolic pathways under similar in vitro conditions. The discovery of uniquely essential genes between the closely related Yersinia spp. represent some of the fundamental, species-defining points of divergence that arose during the evolution of Y. pestis from its ancestor. Furthermore, the shared essential genes represent ideal candidates for the development of novel antimicrobials against both species.
Approach to numerical safety guidelines based on a core melt criterion. [PWR; BWR
DOE Office of Scientific and Technical Information (OSTI.GOV)
Azarm, M.A.; Hall, R.E.
1982-01-01
A plausible approach is proposed for translating a single level criterion to a set of numerical guidelines. The criterion for core melt probability is used to set numerical guidelines for various core melt sequences, systems and component unavailabilities. These guidelines can be used as a means for making decisions regarding the necessity for replacing a component or improving part of a safety system. This approach is applied to estimate a set of numerical guidelines for various sequences of core melts that are analyzed in Reactor Safety Study for the Peach Bottom Nuclear Power Plant.
Gao, Lei; Wang, Bo; Wang, Zhi-Wei; Zhou, Yuan; Su, Ying-Juan; Wang, Ting
2013-01-01
Previous studies have shown that core leptosporangiates, the most species-rich group of extant ferns (monilophytes), have a distinct plastid genome (plastome) organization pattern from basal fern lineages. However, the details of genome structure transformation from ancestral ferns to core leptosporangiates remain unclear because of limited plastome data available. Here, we have determined the complete chloroplast genome sequences of Lygodium japonicum (Lygodiaceae), a member of schizaeoid ferns (Schizaeales), and Marsilea crenata (Marsileaceae), a representative of heterosporous ferns (Salviniales). The two species represent the sister and the basal lineages of core leptosporangiates, respectively, for which the plastome sequences are currently unavailable. Comparative genomic analysis of all sequenced fern plastomes reveals that the gene order of L. japonicum plastome occupies an intermediate position between that of basal ferns and core leptosporangiates. The two exons of the fern ndhB gene have a unique pattern of intragenic copy number variances. Specifically, the substitution rate heterogeneity between the two exons is congruent with their copy number changes, confirming the constraint role that inverted repeats may play on the substitution rate of chloroplast gene sequences. PMID:23821521
The NF-YC–RGL2 module integrates GA and ABA signalling to regulate seed germination in Arabidopsis
Liu, Xu; Hu, Pengwei; Huang, Mingkun; Tang, Yang; Li, Yuge; Li, Ling; Hou, Xingliang
2016-01-01
The antagonistic crosstalk between gibberellic acid (GA) and abscisic acid (ABA) plays a pivotal role in the modulation of seed germination. However, the molecular mechanism of such phytohormone interaction remains largely elusive. Here we show that three Arabidopsis NUCLEAR FACTOR-Y C (NF-YC) homologues NF-YC3, NF-YC4 and NF-YC9 redundantly modulate GA- and ABA-mediated seed germination. These NF-YCs interact with the DELLA protein RGL2, a key repressor of GA signalling. The NF-YC–RGL2 module targets ABI5, a gene encoding a core component of ABA signalling, via specific CCAAT elements and collectively regulates a set of GA- and ABA-responsive genes, thus controlling germination. These results suggest that the NF-YC–RGL2–ABI5 module integrates GA and ABA signalling pathways during seed germination. PMID:27624486
Engineering a Functional Small RNA Negative Autoregulation Network with Model-Guided Design.
Hu, Chelsea Y; Takahashi, Melissa K; Zhang, Yan; Lucks, Julius B
2018-05-22
RNA regulators are powerful components of the synthetic biology toolbox. Here, we expand the repertoire of synthetic gene networks built from these regulators by constructing a transcriptional negative autoregulation (NAR) network out of small RNAs (sRNAs). NAR network motifs are core motifs of natural genetic networks, and are known for reducing network response time and steady state signal. Here we use cell-free transcription-translation (TX-TL) reactions and a computational model to design and prototype sRNA NAR constructs. Using parameter sensitivity analysis, we design a simple set of experiments that allow us to accurately predict NAR function in TX-TL. We transfer successful network designs into Escherichia coli and show that our sRNA transcriptional network reduces both network response time and steady-state gene expression. This work broadens our ability to construct increasingly sophisticated RNA genetic networks with predictable function.
Differential carbohydrate utilization and organic acid production by honey bee symbionts.
Lee, Fredrick J; Miller, Kayla I; McKinlay, James B; Newton, Irene L G
2018-06-06
The honey bee worker gut hosts a community of bacteria that comprises 8-10 core bacterial species, along with a set of more transient environmental microbes. Collectively, these microbes break down and ferment saccharides present in the host's diet, based on analyses of metagenomes, and metatranscriptomes from this environment. As part of this metabolism, the bacteria produce short-chain fatty acids that may serve as a food source for the host bee, stimulating biological processes that may contribute to host weight gain. To identify metabolic contributions of symbionts within the honey bee gut, we utilized a combination of molecular and biochemical approaches. We show significant variation in the metabolic capabilities of honey bee associated taxa, highlighting the fact that honey bee gut microbiota members of the same clade are highly variable in their ability to use specific carbohydrates and produce organic acids. Finally, we confirm that the honey bee core microbes are active in vivo, expressing key enzymatic genes critical for utilizing plant-derived molecules and producing organic acids (i.e. acetate and lactate). These results suggest that core taxa may contribute significantly to weight gain in the honey bee, specifically through the production of organic acids.
Identification and analysis of pig chimeric mRNAs using RNA sequencing data
2012-01-01
Background Gene fusion is ubiquitous over the course of evolution. It is expected to increase the diversity and complexity of transcriptomes and proteomes through chimeric sequence segments or altered regulation. However, chimeric mRNAs in pigs remain unclear. Here we identified some chimeric mRNAs in pigs and analyzed the expression of them across individuals and breeds using RNA-sequencing data. Results The present study identified 669 putative chimeric mRNAs in pigs, of which 251 chimeric candidates were detected in a set of RNA-sequencing data. The 618 candidates had clear trans-splicing sites, 537 of which obeyed the canonical GU-AG splice rule. Only two putative pig chimera variants whose fusion junction was overlapped with that of a known human chimeric mRNA were found. A set of unique chimeric events were considered middle variances in the expression across individuals and breeds, and revealed non-significant variance between sexes. Furthermore, the genomic region of the 5′ partner gene shares a similar DNA sequence with that of the 3′ partner gene for 458 putative chimeric mRNAs. The 81 of those shared DNA sequences significantly matched the known DNA-binding motifs in the JASPAR CORE database. Four DNA motifs shared in parental genomic regions had significant similarity with known human CTCF binding sites. Conclusions The present study provided detailed information on some pig chimeric mRNAs. We proposed a model that trans-acting factors, such as CTCF, induced the spatial organisation of parental genes to the same transcriptional factory so that parental genes were coordinatively transcribed to give birth to chimeric mRNAs. PMID:22925561
Lefébure, Tristan; Stanhope, Michael J
2007-01-01
Background The genus Streptococcus is one of the most diverse and important human and agricultural pathogens. This study employs comparative evolutionary analyses of 26 Streptococcus genomes to yield an improved understanding of the relative roles of recombination and positive selection in pathogen adaptation to their hosts. Results Streptococcus genomes exhibit extreme levels of evolutionary plasticity, with high levels of gene gain and loss during species and strain evolution. S. agalactiae has a large pan-genome, with little recombination in its core-genome, while S. pyogenes has a smaller pan-genome and much more recombination of its core-genome, perhaps reflecting the greater habitat, and gene pool, diversity for S. agalactiae compared to S. pyogenes. Core-genome recombination was evident in all lineages (18% to 37% of the core-genome judged to be recombinant), while positive selection was mainly observed during species differentiation (from 11% to 34% of the core-genome). Positive selection pressure was unevenly distributed across lineages and biochemical main role categories. S. suis was the lineage with the greatest level of positive selection pressure, the largest number of unique loci selected, and the largest amount of gene gain and loss. Conclusion Recombination is an important evolutionary force in shaping Streptococcus genomes, not only in the acquisition of significant portions of the genome as lineage specific loci, but also in facilitating rapid evolution of the core-genome. Positive selection, although undoubtedly a slower process, has nonetheless played an important role in adaptation of the core-genome of different Streptococcus species to different hosts. PMID:17475002
Comparison of neuronal spike exchange methods on a Blue Gene/P supercomputer.
Hines, Michael; Kumar, Sameer; Schürmann, Felix
2011-01-01
For neural network simulations on parallel machines, interprocessor spike communication can be a significant portion of the total simulation time. The performance of several spike exchange methods using a Blue Gene/P (BG/P) supercomputer has been tested with 8-128 K cores using randomly connected networks of up to 32 M cells with 1 k connections per cell and 4 M cells with 10 k connections per cell, i.e., on the order of 4·10(10) connections (K is 1024, M is 1024(2), and k is 1000). The spike exchange methods used are the standard Message Passing Interface (MPI) collective, MPI_Allgather, and several variants of the non-blocking Multisend method either implemented via non-blocking MPI_Isend, or exploiting the possibility of very low overhead direct memory access (DMA) communication available on the BG/P. In all cases, the worst performing method was that using MPI_Isend due to the high overhead of initiating a spike communication. The two best performing methods-the persistent Multisend method using the Record-Replay feature of the Deep Computing Messaging Framework DCMF_Multicast; and a two-phase multisend in which a DCMF_Multicast is used to first send to a subset of phase one destination cores, which then pass it on to their subset of phase two destination cores-had similar performance with very low overhead for the initiation of spike communication. Departure from ideal scaling for the Multisend methods is almost completely due to load imbalance caused by the large variation in number of cells that fire on each processor in the interval between synchronization. Spike exchange time itself is negligible since transmission overlaps with computation and is handled by a DMA controller. We conclude that ideal performance scaling will be ultimately limited by imbalance between incoming processor spikes between synchronization intervals. Thus, counterintuitively, maximization of load balance requires that the distribution of cells on processors should not reflect neural net architecture but be randomly distributed so that sets of cells which are burst firing together should be on different processors with their targets on as large a set of processors as possible.
2013-01-01
Background Austism spectrum disorder (ASD) is a heterogeneous behavioral disorder or condition characterized by severe impairment of social engagement and the presence of repetitive activities. The molecular etiology of ASD is still largely unknown despite a strong genetic component. Part of the difficulty in turning genetics into disease mechanisms and potentially new therapeutics is the sheer number and diversity of the genes that have been associated with ASD and ASD symptoms. The goal of this work is to use shRNA-generated models of genetic defects proposed as causative for ASD to identify the common pathways that might explain how they produce a core clinical disability. Methods Transcript levels of Mecp2, Mef2a, Mef2d, Fmr1, Nlgn1, Nlgn3, Pten, and Shank3 were knocked-down in mouse primary neuron cultures using shRNA constructs. Whole genome expression analysis was conducted for each of the knockdown cultures as well as a mock-transduced culture and a culture exposed to a lentivirus expressing an anti-luciferase shRNA. Gene set enrichment and a causal reasoning engine was employed to identify pathway level perturbations generated by the transcript knockdown. Results Quantification of the shRNA targets confirmed the successful knockdown at the transcript and protein levels of at least 75% for each of the genes. After subtracting out potential artifacts caused by viral infection, gene set enrichment and causal reasoning engine analysis showed that a significant number of gene expression changes mapped to pathways associated with neurogenesis, long-term potentiation, and synaptic activity. Conclusions This work demonstrates that despite the complex genetic nature of ASD, there are common molecular mechanisms that connect many of the best established autism candidate genes. By identifying the key regulatory checkpoints in the interlinking transcriptional networks underlying autism, we are better able to discover the ideal points of intervention that provide the broadest efficacy across the diverse population of autism patients. PMID:24238429
Lanz, Thomas A; Guilmette, Edward; Gosink, Mark M; Fischer, James E; Fitzgerald, Lawrence W; Stephenson, Diane T; Pletcher, Mathew T
2013-11-15
Austism spectrum disorder (ASD) is a heterogeneous behavioral disorder or condition characterized by severe impairment of social engagement and the presence of repetitive activities. The molecular etiology of ASD is still largely unknown despite a strong genetic component. Part of the difficulty in turning genetics into disease mechanisms and potentially new therapeutics is the sheer number and diversity of the genes that have been associated with ASD and ASD symptoms. The goal of this work is to use shRNA-generated models of genetic defects proposed as causative for ASD to identify the common pathways that might explain how they produce a core clinical disability. Transcript levels of Mecp2, Mef2a, Mef2d, Fmr1, Nlgn1, Nlgn3, Pten, and Shank3 were knocked-down in mouse primary neuron cultures using shRNA constructs. Whole genome expression analysis was conducted for each of the knockdown cultures as well as a mock-transduced culture and a culture exposed to a lentivirus expressing an anti-luciferase shRNA. Gene set enrichment and a causal reasoning engine was employed to identify pathway level perturbations generated by the transcript knockdown. Quantification of the shRNA targets confirmed the successful knockdown at the transcript and protein levels of at least 75% for each of the genes. After subtracting out potential artifacts caused by viral infection, gene set enrichment and causal reasoning engine analysis showed that a significant number of gene expression changes mapped to pathways associated with neurogenesis, long-term potentiation, and synaptic activity. This work demonstrates that despite the complex genetic nature of ASD, there are common molecular mechanisms that connect many of the best established autism candidate genes. By identifying the key regulatory checkpoints in the interlinking transcriptional networks underlying autism, we are better able to discover the ideal points of intervention that provide the broadest efficacy across the diverse population of autism patients.
Greber, Boris; Siatkowski, Marcin; Paudel, Yogesh; Warsow, Gregor; Cap, Clemens; Schöler, Hans; Fuellen, Georg
2010-01-01
Background Analysis of the mechanisms underlying pluripotency and reprogramming would benefit substantially from easy access to an electronic network of genes, proteins and mechanisms. Moreover, interpreting gene expression data needs to move beyond just the identification of the up-/downregulation of key genes and of overrepresented processes and pathways, towards clarifying the essential effects of the experiment in molecular terms. Methodology/Principal Findings We have assembled a network of 574 molecular interactions, stimulations and inhibitions, based on a collection of research data from 177 publications until June 2010, involving 274 mouse genes/proteins, all in a standard electronic format, enabling analyses by readily available software such as Cytoscape and its plugins. The network includes the core circuit of Oct4 (Pou5f1), Sox2 and Nanog, its periphery (such as Stat3, Klf4, Esrrb, and c-Myc), connections to upstream signaling pathways (such as Activin, WNT, FGF, BMP, Insulin, Notch and LIF), and epigenetic regulators as well as some other relevant genes/proteins, such as proteins involved in nuclear import/export. We describe the general properties of the network, as well as a Gene Ontology analysis of the genes included. We use several expression data sets to condense the network to a set of network links that are affected in the course of an experiment, yielding hypotheses about the underlying mechanisms. Conclusions/Significance We have initiated an electronic data repository that will be useful to understand pluripotency and to facilitate the interpretation of high-throughput data. To keep up with the growth of knowledge on the fundamental processes of pluripotency and reprogramming, we suggest to combine Wiki and social networking software towards a community curation system that is easy to use and flexible, and tailored to provide a benefit for the scientist, and to improve communication and exchange of research results. A PluriNetWork tutorial is available at http://www.ibima.med.uni-rostock.de/IBIMA/PluriNetWork/. PMID:21179244
Parallel evolution of storage roots in morning glories (Convolvulaceae).
Eserman, Lauren A; Jarret, Robert L; Leebens-Mack, James H
2018-05-29
Storage roots are an ecologically and agriculturally important plant trait that have evolved numerous times in angiosperms. Storage roots primarily function to store carbohydrates underground as reserves for perennial species. In morning glories, storage roots are well characterized in the crop species sweetpotato, where starch accumulates in storage roots. This starch-storage tissue proliferates, and roots thicken to accommodate the additional tissue. In morning glories, storage roots have evolved numerous times. The primary goal of this study is to understand whether this was through parallel evolution, where species use a common genetic mechanism to achieve storage root formation, or through convergent evolution, where storage roots in distantly related species are formed using a different set of genes. Pairs of species where one forms storage roots and the other does not were sampled from two tribes in the morning glory family, the Ipomoeeae and Merremieae. Root anatomy in storage roots and fine roots was examined. Furthermore, we sequenced total mRNA from storage roots and fine roots in these species and analyzed differential gene expression. Anatomical results reveal that storage roots of species in the Ipomoeeae tribe, such as sweetpotato, accumulate starch similar to species in the Merremieae tribe but differ in vascular tissue organization. In both storage root forming species, more genes were found to be upregulated in storage roots compared to fine roots. Further, we find that fifty-seven orthologous genes were differentially expressed between storage roots and fine roots in both storage root forming species. These genes are primarily involved in starch biosynthesis, regulation of starch biosynthesis, and transcription factor activity. Taken together, these results demonstrate that storage roots of species from both morning glory tribes are anatomically different but utilize a common core set of genes in storage root formation. This is consistent with a pattern of parallel evolution, thus highlighting the importance of examining anatomy together with gene expression to understand the evolutionary origins of ecologically and economically important plant traits.
The Extent of Genome Flux and Its Role in the Differentiation of Bacterial Lineages
Nowell, Reuben W.; Green, Sarah; Laue, Bridget E.; Sharp, Paul M.
2014-01-01
Horizontal gene transfer (HGT) and gene loss are key processes in bacterial evolution. However, the role of gene gain and loss in the emergence and maintenance of ecologically differentiated bacterial populations remains an open question. Here, we use whole-genome sequence data to quantify gene gain and loss for 27 lineages of the plant-associated bacterium Pseudomonas syringae. We apply an extensive error-control procedure that accounts for errors in draft genome data and greatly improves the accuracy of patterns of gene occurrence among these genomes. We demonstrate a history of extensive genome fluctuation for this species and show that individual lineages could have acquired thousands of genes in the same period in which a 1% amino acid divergence accrues in the core genome. Elucidating the dynamics of genome fluctuation reveals the rapid turnover of gained genes, such that the majority of recently gained genes are quickly lost. Despite high observed rates of fluctuation, a phylogeny inferred from patterns of gene occurrence is similar to a phylogeny based on amino acid replacements within the core genome. Furthermore, the core genome phylogeny suggests that P. syringae should be considered a number of distinct species, with levels of divergence at least equivalent to those between recognized bacterial species. Gained genes are transferred from a variety of sources, reflecting the depth and diversity of the potential gene pool available via HGT. Overall, our results provide further insights into the evolutionary dynamics of genome fluctuation and implicate HGT as a major factor contributing to the diversification of P. syringae lineages. PMID:24923323
Limited dissemination of the wastewater treatment plant core resistome.
Munck, Christian; Albertsen, Mads; Telke, Amar; Ellabaan, Mostafa; Nielsen, Per Halkjær; Sommer, Morten O A
2015-09-30
Horizontal gene transfer is a major contributor to the evolution of bacterial genomes and can facilitate the dissemination of antibiotic resistance genes between environmental reservoirs and potential pathogens. Wastewater treatment plants (WWTPs) are believed to play a central role in the dissemination of antibiotic resistance genes. However, the contribution of the dominant members of the WWTP resistome to resistance in human pathogens remains poorly understood. Here we use a combination of metagenomic functional selections and comprehensive metagenomic sequencing to uncover the dominant genes of the WWTP resistome. We find that this core resistome is unique to the WWTP environment, with <10% of the resistance genes found outside the WWTP environment. Our data highlight that, despite an abundance of functional resistance genes within WWTPs, only few genes are found in other environments, suggesting that the overall dissemination of the WWTP resistome is comparable to that of the soil resistome.
Limited dissemination of the wastewater treatment plant core resistome
Munck, Christian; Albertsen, Mads; Telke, Amar; Ellabaan, Mostafa; Nielsen, Per Halkjær; Sommer, Morten O. A.
2015-01-01
Horizontal gene transfer is a major contributor to the evolution of bacterial genomes and can facilitate the dissemination of antibiotic resistance genes between environmental reservoirs and potential pathogens. Wastewater treatment plants (WWTPs) are believed to play a central role in the dissemination of antibiotic resistance genes. However, the contribution of the dominant members of the WWTP resistome to resistance in human pathogens remains poorly understood. Here we use a combination of metagenomic functional selections and comprehensive metagenomic sequencing to uncover the dominant genes of the WWTP resistome. We find that this core resistome is unique to the WWTP environment, with <10% of the resistance genes found outside the WWTP environment. Our data highlight that, despite an abundance of functional resistance genes within WWTPs, only few genes are found in other environments, suggesting that the overall dissemination of the WWTP resistome is comparable to that of the soil resistome. PMID:26419330
The chromosomal organization of horizontal gene transfer in bacteria.
Oliveira, Pedro H; Touchon, Marie; Cury, Jean; Rocha, Eduardo P C
2017-10-10
Bacterial adaptation is accelerated by the acquisition of novel traits through horizontal gene transfer, but the integration of these genes affects genome organization. We found that transferred genes are concentrated in only ~1% of the chromosomal regions (hotspots) in 80 bacterial species. This concentration increases with genome size and with the rate of transfer. Hotspots diversify by rapid gene turnover; their chromosomal distribution depends on local contexts (neighboring core genes), and content in mobile genetic elements. Hotspots concentrate most changes in gene repertoires, reduce the trade-off between genome diversification and organization, and should be treasure troves of strain-specific adaptive genes. Most mobile genetic elements and antibiotic resistance genes are in hotspots, but many hotspots lack recognizable mobile genetic elements and exhibit frequent homologous recombination at flanking core genes. Overrepresentation of hotspots with fewer mobile genetic elements in naturally transformable bacteria suggests that homologous recombination and horizontal gene transfer are tightly linked in genome evolution.Horizontal gene transfer (HGT) is an important mechanism for genome evolution and adaptation in bacteria. Here, Oliveira and colleagues find HGT hotspots comprising ~ 1% of the chromosomal regions in 80 bacterial species.
Evolution and Expression Patterns of TCP Genes in Asparagales
Madrigal, Yesenia; Alzate, Juan F.; Pabón-Mora, Natalia
2017-01-01
CYCLOIDEA-like genes are involved in the symmetry gene network, limiting cell proliferation in the dorsal regions of bilateral flowers in core eudicots. CYC-like and closely related TCP genes (acronym for TEOSINTE BRANCHED1, CYCLOIDEA, and PROLIFERATION CELL FACTOR) have been poorly studied in Asparagales, the largest order of monocots that includes both bilateral flowers in Orchidaceae (ca. 25.000 spp) and radially symmetrical flowers in Hypoxidaceae (ca. 200 spp). With the aim of assessing TCP gene evolution in the Asparagales, we isolated TCP-like genes from publicly available databases and our own transcriptomes of Cattleya trianae (Orchidaceae) and Hypoxis decumbens (Hypoxidaceae). Our matrix contains 452 sequences representing the three major clades of TCP genes. Besides the previously identified CYC specific core eudicot duplications, our ML phylogenetic analyses recovered an early CIN-like duplication predating all angiosperms, two CIN-like Asparagales-specific duplications and a duplication prior to the diversification of Orchidoideae and Epidendroideae. In addition, we provide evidence of at least three duplications of PCF-like genes in Asparagales. While CIN-like and PCF-like genes have multiplied in Asparagales, likely enhancing the genetic network for cell proliferation, CYC-like genes remain as single, shorter copies with low expression. Homogeneous expression of CYC-like genes in the labellum as well as the lateral petals suggests little contribution to the bilateral perianth in C. trianae. CIN-like and PCF-like gene expression suggests conserved roles in cell proliferation in leaves, sepals and petals, carpels, ovules and fruits in Asparagales by comparison with previously reported functions in core eudicots and monocots. This is the first large scale analysis of TCP-like genes in Asparagales that will serve as a platform for in-depth functional studies in emerging model monocots. PMID:28144250
Cohen, Seth D.; Tarara, Julie M.; Gambetta, Greg A.; Matthews, Mark A.; Kennedy, James A.
2012-01-01
Little is known about the impact of temperature on proanthocyanidin (PA) accumulation in grape skins, despite its significance in berry composition and wine quality. Field-grown grapes (cv. Merlot) were cooled during the day or heated at night by +/–8 °C, from fruit set to véraison in three seasons, to determine the effect of temperature on PA accumulation. Total PA content per berry varied only in one year, when PA content was highest in heated berries (1.46 mg berry−1) and lowest in cooled berries (0.97 mg berry−1). In two years, cooling berries resulted in a significant increase in the proportion of (–)-epigallocatechin as an extension subunit. In the third year, rates of berry development, PA accumulation, and the expression levels of several genes involved in flavonoid biosynthesis were assessed. Heating and cooling berries altered the initial rates of PA accumulation, which was correlated strongly with the expression of core genes in the flavonoid pathway. Both heating and cooling altered the rate of berry growth and coloration, and the expression of several structural genes within the flavonoid pathway. PMID:22268158
Fanconi Anemia Core Complex Gene Promoters Harbor Conserved Transcription Regulatory Elements
Meier, Daniel; Schindler, Detlev
2011-01-01
The Fanconi anemia (FA) gene family is a recent addition to the complex network of proteins that respond to and repair certain types of DNA damage in the human genome. Since little is known about the regulation of this novel group of genes at the DNA level, we characterized the promoters of the eight genes (FANCA, B, C, E, F, G, L and M) that compose the FA core complex. The promoters of these genes show the characteristic attributes of housekeeping genes, such as a high GC content and CpG islands, a lack of TATA boxes and a low conservation. The promoters functioned in a monodirectional way and were, in their most active regions, comparable in strength to the SV40 promoter in our reporter plasmids. They were also marked by a distinctive transcriptional start site (TSS). In the 5′ region of each promoter, we identified a region that was able to negatively regulate the promoter activity in HeLa and HEK 293 cells in isolation. The central and 3′ regions of the promoter sequences harbor binding sites for several common and rare transcription factors, including STAT, SMAD, E2F, AP1 and YY1, which indicates that there may be cross-connections to several established regulatory pathways. Electrophoretic mobility shift assays and siRNA experiments confirmed the shared regulatory responses between the prominent members of the TGF-β and JAK/STAT pathways and members of the FA core complex. Although the promoters are not well conserved, they share region and sequence specific regulatory motifs and transcription factor binding sites (TBFs), and we identified a bi-partite nature to these promoters. These results support a hypothesis based on the co-evolution of the FA core complex genes that was expanded to include their promoters. PMID:21826217
Fanconi anemia core complex gene promoters harbor conserved transcription regulatory elements.
Meier, Daniel; Schindler, Detlev
2011-01-01
The Fanconi anemia (FA) gene family is a recent addition to the complex network of proteins that respond to and repair certain types of DNA damage in the human genome. Since little is known about the regulation of this novel group of genes at the DNA level, we characterized the promoters of the eight genes (FANCA, B, C, E, F, G, L and M) that compose the FA core complex. The promoters of these genes show the characteristic attributes of housekeeping genes, such as a high GC content and CpG islands, a lack of TATA boxes and a low conservation. The promoters functioned in a monodirectional way and were, in their most active regions, comparable in strength to the SV40 promoter in our reporter plasmids. They were also marked by a distinctive transcriptional start site (TSS). In the 5' region of each promoter, we identified a region that was able to negatively regulate the promoter activity in HeLa and HEK 293 cells in isolation. The central and 3' regions of the promoter sequences harbor binding sites for several common and rare transcription factors, including STAT, SMAD, E2F, AP1 and YY1, which indicates that there may be cross-connections to several established regulatory pathways. Electrophoretic mobility shift assays and siRNA experiments confirmed the shared regulatory responses between the prominent members of the TGF-β and JAK/STAT pathways and members of the FA core complex. Although the promoters are not well conserved, they share region and sequence specific regulatory motifs and transcription factor binding sites (TBFs), and we identified a bi-partite nature to these promoters. These results support a hypothesis based on the co-evolution of the FA core complex genes that was expanded to include their promoters.
Versatile types of polysaccharide-based supramolecular polycation/pDNA nanoplexes for gene delivery
NASA Astrophysics Data System (ADS)
Hu, Yang; Zhao, Nana; Yu, Bingran; Liu, Fusheng; Xu, Fu-Jian
2014-06-01
Different polysaccharide-based supramolecular polycations were readily synthesized by assembling multiple β-cyclodextrin-cored star polycations with an adamantane-functionalized dextran via host-guest interaction in the absence or presence of bioreducible linkages. Compared with nanoplexes of the starting star polycation and pDNA, the supramolecular polycation/pDNA nanoplexes exhibited similarly low cytotoxicity, improved cellular internalization and significantly higher gene transfection efficiencies. The incorporation of disulfide linkages imparted the supramolecular polycation/pDNA nanoplexes with the advantage of intracellular bioreducibility, resulting in better gene delivery properties. In addition, the antitumor properties of supramolecular polycation/pDNA nanoplexes were also investigated using a suicide gene therapy system. The present study demonstrates that the proper assembly of cyclodextrin-cored polycations with adamantane-functionalized polysaccharides is an effective strategy for the production of new nanoplex delivery systems.Different polysaccharide-based supramolecular polycations were readily synthesized by assembling multiple β-cyclodextrin-cored star polycations with an adamantane-functionalized dextran via host-guest interaction in the absence or presence of bioreducible linkages. Compared with nanoplexes of the starting star polycation and pDNA, the supramolecular polycation/pDNA nanoplexes exhibited similarly low cytotoxicity, improved cellular internalization and significantly higher gene transfection efficiencies. The incorporation of disulfide linkages imparted the supramolecular polycation/pDNA nanoplexes with the advantage of intracellular bioreducibility, resulting in better gene delivery properties. In addition, the antitumor properties of supramolecular polycation/pDNA nanoplexes were also investigated using a suicide gene therapy system. The present study demonstrates that the proper assembly of cyclodextrin-cored polycations with adamantane-functionalized polysaccharides is an effective strategy for the production of new nanoplex delivery systems. Electronic supplementary information (ESI) available: 1H NMR assay and synthetic route of Dex-Ad and Dex-SS-Ad. See DOI: 10.1039/c4nr01590h
2012-01-01
Background The potential contribution of upstream sequence variation to the unique features of orthologous genes is just beginning to be unraveled. A core subset of stress-associated bZIP transcription factors from rice (Oryza sativa) formed ten clusters of orthologous groups (COG) with genes from the monocot sorghum (Sorghum bicolor) and dicot Arabidopsis (Arabidopsis thaliana). The total cis-regulatory information content of each stress-associated COG was examined by phylogenetic footprinting to reveal ortholog-specific, lineage-specific and species-specific conservation patterns. Results The most apparent pattern observed was the occurrence of spatially conserved ‘core modules’ among the COGs but not among paralogs. These core modules are comprised of various combinations of two to four putative transcription factor binding site (TFBS) classes associated with either developmental or stress-related functions. Outside the core modules are specific stress (ABA, oxidative, abiotic, biotic) or organ-associated signals, which may be functioning as ‘regulatory fine-tuners’ and further define lineage-specific and species-specific cis-regulatory signatures. Orthologous monocot and dicot promoters have distinct TFBS classes involved in disease and oxidative-regulated expression, while the orthologous rice and sorghum promoters have distinct combinations of root-specific signals, a pattern that is not particularly conserved in Arabidopsis. Conclusions Patterns of cis-regulatory conservation imply that each ortholog has distinct signatures, further suggesting that they are potentially unique in a regulatory context despite the presumed conservation of broad biological function during speciation. Based on the observed patterns of conservation, we postulate that core modules are likely primary determinants of basal developmental programming, which may be integrated with and further elaborated by additional intrinsic or extrinsic signals in conjunction with lineage-specific or species-specific regulatory fine-tuners. This synergy may be critical for finer-scale spatio-temporal regulation, hence unique expression profiles of homologous transcription factors from different species with distinct zones of ecological adaptation such as rice, sorghum and Arabidopsis. The patterns revealed from these comparisons set the stage for further empirical validation by functional genomics. PMID:22992304
Hettne, Kristina M; Boorsma, André; van Dartel, Dorien A M; Goeman, Jelle J; de Jong, Esther; Piersma, Aldert H; Stierum, Rob H; Kleinjans, Jos C; Kors, Jan A
2013-01-29
Availability of chemical response-specific lists of genes (gene sets) for pharmacological and/or toxic effect prediction for compounds is limited. We hypothesize that more gene sets can be created by next-generation text mining (next-gen TM), and that these can be used with gene set analysis (GSA) methods for chemical treatment identification, for pharmacological mechanism elucidation, and for comparing compound toxicity profiles. We created 30,211 chemical response-specific gene sets for human and mouse by next-gen TM, and derived 1,189 (human) and 588 (mouse) gene sets from the Comparative Toxicogenomics Database (CTD). We tested for significant differential expression (SDE) (false discovery rate -corrected p-values < 0.05) of the next-gen TM-derived gene sets and the CTD-derived gene sets in gene expression (GE) data sets of five chemicals (from experimental models). We tested for SDE of gene sets for six fibrates in a peroxisome proliferator-activated receptor alpha (PPARA) knock-out GE dataset and compared to results from the Connectivity Map. We tested for SDE of 319 next-gen TM-derived gene sets for environmental toxicants in three GE data sets of triazoles, and tested for SDE of 442 gene sets associated with embryonic structures. We compared the gene sets to triazole effects seen in the Whole Embryo Culture (WEC), and used principal component analysis (PCA) to discriminate triazoles from other chemicals. Next-gen TM-derived gene sets matching the chemical treatment were significantly altered in three GE data sets, and the corresponding CTD-derived gene sets were significantly altered in five GE data sets. Six next-gen TM-derived and four CTD-derived fibrate gene sets were significantly altered in the PPARA knock-out GE dataset. None of the fibrate signatures in cMap scored significant against the PPARA GE signature. 33 environmental toxicant gene sets were significantly altered in the triazole GE data sets. 21 of these toxicants had a similar toxicity pattern as the triazoles. We confirmed embryotoxic effects, and discriminated triazoles from other chemicals. Gene set analysis with next-gen TM-derived chemical response-specific gene sets is a scalable method for identifying similarities in gene responses to other chemicals, from which one may infer potential mode of action and/or toxic effect.
2013-01-01
Background Availability of chemical response-specific lists of genes (gene sets) for pharmacological and/or toxic effect prediction for compounds is limited. We hypothesize that more gene sets can be created by next-generation text mining (next-gen TM), and that these can be used with gene set analysis (GSA) methods for chemical treatment identification, for pharmacological mechanism elucidation, and for comparing compound toxicity profiles. Methods We created 30,211 chemical response-specific gene sets for human and mouse by next-gen TM, and derived 1,189 (human) and 588 (mouse) gene sets from the Comparative Toxicogenomics Database (CTD). We tested for significant differential expression (SDE) (false discovery rate -corrected p-values < 0.05) of the next-gen TM-derived gene sets and the CTD-derived gene sets in gene expression (GE) data sets of five chemicals (from experimental models). We tested for SDE of gene sets for six fibrates in a peroxisome proliferator-activated receptor alpha (PPARA) knock-out GE dataset and compared to results from the Connectivity Map. We tested for SDE of 319 next-gen TM-derived gene sets for environmental toxicants in three GE data sets of triazoles, and tested for SDE of 442 gene sets associated with embryonic structures. We compared the gene sets to triazole effects seen in the Whole Embryo Culture (WEC), and used principal component analysis (PCA) to discriminate triazoles from other chemicals. Results Next-gen TM-derived gene sets matching the chemical treatment were significantly altered in three GE data sets, and the corresponding CTD-derived gene sets were significantly altered in five GE data sets. Six next-gen TM-derived and four CTD-derived fibrate gene sets were significantly altered in the PPARA knock-out GE dataset. None of the fibrate signatures in cMap scored significant against the PPARA GE signature. 33 environmental toxicant gene sets were significantly altered in the triazole GE data sets. 21 of these toxicants had a similar toxicity pattern as the triazoles. We confirmed embryotoxic effects, and discriminated triazoles from other chemicals. Conclusions Gene set analysis with next-gen TM-derived chemical response-specific gene sets is a scalable method for identifying similarities in gene responses to other chemicals, from which one may infer potential mode of action and/or toxic effect. PMID:23356878
Nicholson, Matthew J.; Eaton, Carla J.; Stärkel, Cornelia; Tapper, Brian A.; Cox, Murray P.; Scott, Barry
2015-01-01
The penitremane and janthitremane families of indole-diterpenes are abundant natural products synthesized by Penicillium crustosum and P. janthinellum. Using a combination of PCR, cosmid library screening, and Illumina sequencing we have identified gene clusters encoding enzymes for the synthesis of these compounds. Targeted deletion of penP in P. crustosum abolished the synthesis of penitrems A, B, D, E, and F, and led to accumulation of paspaline, a key intermediate for paxilline biosynthesis in P. paxilli. Similarly, deletion of janP and janD in P. janthinellum abolished the synthesis of prenyl-elaborated indole-diterpenes, and led to accumulation in the latter of 13-desoxypaxilline, a key intermediate for the synthesis of the structurally related aflatremanes synthesized by Aspergillus flavus. This study helps resolve the genetic basis for the complexity of indole-diterpene natural products found within the Penicillium and Aspergillus species. All indole-diterpene gene clusters identified to date have a core set of genes for the synthesis of paspaline and a suite of genes encoding multi-functional cytochrome P450 monooxygenases, FAD dependent monooxygenases, and prenyl transferases that catalyse various regio- and stereo- specific oxidations that give rise to the diversity of indole-diterpene products synthesized by this group of fungi. PMID:26213965
Lundgren, Benjamin R.; Thornton, William; Dornan, Mark H.; Villegas-Peñaranda, Luis Roberto; Boddy, Christopher N.
2013-01-01
Many pseudomonads produce redox active compounds called phenazines that function in a variety of biological processes. Phenazines are well known for their toxicity against non-phenazine-producing organisms, which allows them to serve as crucial biocontrol agents and virulence factors during infection. As for other secondary metabolites, conditions of nutritional stress or limitation stimulate the production of phenazines, but little is known of the molecular details underlying this phenomenon. Using a combination of microarray and metabolite analyses, we demonstrate that the assimilation of glycine as a carbon source and the biosynthesis of pyocyanin in Pseudomonas aeruginosa PAO1 are both dependent on the PA2449 gene. The inactivation of the PA2449 gene was found to influence the transcription of a core set of genes encoding a glycine cleavage system, serine hydroxymethyltransferase, and serine dehydratase. PA2449 also affected the transcription of several genes that are integral in cell signaling and pyocyanin biosynthesis in P. aeruginosa PAO1. This study sheds light on the unexpected relationship between the utilization of an unfavorable carbon source and the production of pyocyanin. PA2449 is conserved among pseudomonads and might be universally involved in the assimilation of glycine among this metabolically diverse group of bacteria. PMID:23457254
Remote reprogramming of hepatic circadian transcriptome by breast cancer.
Hojo, Hiroaki; Enya, Sora; Arai, Miki; Suzuki, Yutaka; Nojiri, Takashi; Kangawa, Kenji; Koyama, Shinsuke; Kawaoka, Shinpei
2017-05-23
Cancers adversely affect organismal physiology. To date, the genes within a patient responsible for systemically spreading cancer-induced physiological disruption remain elusive. To identify host genes responsible for transmitting disruptive, cancer-driven signals, we thoroughly analyzed the transcriptome of a suite of host organs from mice bearing 4T1 breast cancer, and discovered complexly rewired patterns of circadian gene expression in the liver. Our data revealed that 7 core clock transcription factors, represented by Rev-erba and Rorg, exhibited abnormal daily expression rhythm in the liver of 4T1-bearing mice. Accordingly, expression patterns of specific set of downstream circadian genes were compromised. Osgin1, a marker for oxidative stress, was an example. Specific downstream genes, including E2f8, a transcriptional repressor that controls cellular polyploidy, displayed a striking pattern of disruption, "day-night reversal." Meanwhile, we found that the liver of 4T1-bearing mice suffered from increased oxidative stress. The tetraploid hepatocytes population was concomitantly increased in 4T1-bearing mice, which has not been previously appreciated as a cancer-induced phenotype. In summary, the current study provides a comprehensive characterization of the 4T1-affected hepatic circadian transcriptome that possibly underlies cancer-induced physiological alteration in the liver.
Key enzymes and proteins of crop insects as candidate for RNAi based gene silencing
Kola, Vijaya Sudhakara Rao; Renuka, P.; Madhav, Maganti Sheshu; Mangrauthia, Satendra K.
2015-01-01
RNA interference (RNAi) is a mechanism of homology dependent gene silencing present in plants and animals. It operates through 21–24 nucleotides small RNAs which are processed through a set of core enzymatic machinery that involves Dicer and Argonaute proteins. In recent past, the technology has been well appreciated toward the control of plant pathogens and insects through suppression of key genes/proteins of infecting organisms. The genes encoding key enzymes/proteins with the great potential for developing an effective insect control by RNAi approach are actylcholinesterase, cytochrome P450 enzymes, amino peptidase N, allatostatin, allatotropin, tryptophan oxygenase, arginine kinase, vacuolar ATPase, chitin synthase, glutathione-S-transferase, catalase, trehalose phosphate synthase, vitellogenin, hydroxy-3-methylglutaryl coenzyme A reductase, and hormone receptor genes. Through various studies, it is demonstrated that RNAi is a reliable molecular tool which offers great promises in meeting the challenges imposed by crop insects with careful selection of key enzymes/proteins. Utilization of RNAi tool to target some of these key proteins of crop insects through various approaches is described here. The major challenges of RNAi based insect control such as identifying potential targets, delivery methods of silencing trigger, off target effects, and complexity of insect biology are very well illustrated. Further, required efforts to address these challenges are also discussed. PMID:25954206
Andrew, Audra L; Perry, Blair W; Card, Daren C; Schield, Drew R; Ruggiero, Robert P; McGaugh, Suzanne E; Choudhary, Amit; Secor, Stephen M; Castoe, Todd A
2017-05-02
Previous studies examining post-feeding organ regeneration in the Burmese python (Python molurus bivittatus) have identified thousands of genes that are significantly differentially regulated during this process. However, substantial gaps remain in our understanding of coherent mechanisms and specific growth pathways that underlie these rapid and extensive shifts in organ form and function. Here we addressed these gaps by comparing gene expression in the Burmese python heart, liver, kidney, and small intestine across pre- and post-feeding time points (fasted, one day post-feeding, and four days post-feeding), and by conducting detailed analyses of molecular pathways and predictions of upstream regulatory molecules across these organ systems. Identified enriched canonical pathways and upstream regulators indicate that while downstream transcriptional responses are fairly tissue specific, a suite of core pathways and upstream regulator molecules are shared among responsive tissues. Pathways such as mTOR signaling, PPAR/LXR/RXR signaling, and NRF2-mediated oxidative stress response are significantly differentially regulated in multiple tissues, indicative of cell growth and proliferation along with coordinated cell-protective stress responses. Upstream regulatory molecule analyses identify multiple growth factors, kinase receptors, and transmembrane receptors, both within individual organs and across separate tissues. Downstream transcription factors MYC and SREBF are induced in all tissues. These results suggest that largely divergent patterns of post-feeding gene regulation across tissues are mediated by a core set of higher-level signaling molecules. Consistent enrichment of the NRF2-mediated oxidative stress response indicates this pathway may be particularly important in mediating cellular stress during such extreme regenerative growth.
Modeling coding-sequence evolution within the context of residue solvent accessibility.
Scherrer, Michael P; Meyer, Austin G; Wilke, Claus O
2012-09-12
Protein structure mediates site-specific patterns of sequence divergence. In particular, residues in the core of a protein (solvent-inaccessible residues) tend to be more evolutionarily conserved than residues on the surface (solvent-accessible residues). Here, we present a model of sequence evolution that explicitly accounts for the relative solvent accessibility of each residue in a protein. Our model is a variant of the Goldman-Yang 1994 (GY94) model in which all model parameters can be functions of the relative solvent accessibility (RSA) of a residue. We apply this model to a data set comprised of nearly 600 yeast genes, and find that an evolutionary-rate ratio ω that varies linearly with RSA provides a better model fit than an RSA-independent ω or an ω that is estimated separately in individual RSA bins. We further show that the branch length t and the transition-transverion ratio κ also vary with RSA. The RSA-dependent GY94 model performs better than an RSA-dependent Muse-Gaut 1994 (MG94) model in which the synonymous and non-synonymous rates individually are linear functions of RSA. Finally, protein core size affects the slope of the linear relationship between ω and RSA, and gene expression level affects both the intercept and the slope. Structure-aware models of sequence evolution provide a significantly better fit than traditional models that neglect structure. The linear relationship between ω and RSA implies that genes are better characterized by their ω slope and intercept than by just their mean ω.
Solving the Problem: Genome Annotation Standards before the Data Deluge.
Klimke, William; O'Donovan, Claire; White, Owen; Brister, J Rodney; Clark, Karen; Fedorov, Boris; Mizrachi, Ilene; Pruitt, Kim D; Tatusova, Tatiana
2011-10-15
The promise of genome sequencing was that the vast undiscovered country would be mapped out by comparison of the multitude of sequences available and would aid researchers in deciphering the role of each gene in every organism. Researchers recognize that there is a need for high quality data. However, different annotation procedures, numerous databases, and a diminishing percentage of experimentally determined gene functions have resulted in a spectrum of annotation quality. NCBI in collaboration with sequencing centers, archival databases, and researchers, has developed the first international annotation standards, a fundamental step in ensuring that high quality complete prokaryotic genomes are available as gold standard references. Highlights include the development of annotation assessment tools, community acceptance of protein naming standards, comparison of annotation resources to provide consistent annotation, and improved tracking of the evidence used to generate a particular annotation. The development of a set of minimal standards, including the requirement for annotated complete prokaryotic genomes to contain a full set of ribosomal RNAs, transfer RNAs, and proteins encoding core conserved functions, is an historic milestone. The use of these standards in existing genomes and future submissions will increase the quality of databases, enabling researchers to make accurate biological discoveries.
Solving the Problem: Genome Annotation Standards before the Data Deluge
Klimke, William; O'Donovan, Claire; White, Owen; Brister, J. Rodney; Clark, Karen; Fedorov, Boris; Mizrachi, Ilene; Pruitt, Kim D.; Tatusova, Tatiana
2011-01-01
The promise of genome sequencing was that the vast undiscovered country would be mapped out by comparison of the multitude of sequences available and would aid researchers in deciphering the role of each gene in every organism. Researchers recognize that there is a need for high quality data. However, different annotation procedures, numerous databases, and a diminishing percentage of experimentally determined gene functions have resulted in a spectrum of annotation quality. NCBI in collaboration with sequencing centers, archival databases, and researchers, has developed the first international annotation standards, a fundamental step in ensuring that high quality complete prokaryotic genomes are available as gold standard references. Highlights include the development of annotation assessment tools, community acceptance of protein naming standards, comparison of annotation resources to provide consistent annotation, and improved tracking of the evidence used to generate a particular annotation. The development of a set of minimal standards, including the requirement for annotated complete prokaryotic genomes to contain a full set of ribosomal RNAs, transfer RNAs, and proteins encoding core conserved functions, is an historic milestone. The use of these standards in existing genomes and future submissions will increase the quality of databases, enabling researchers to make accurate biological discoveries. PMID:22180819
Kirkham, Jamie J; Clarke, Mike; Williamson, Paula R
2017-05-17
Objective To assess the uptake of the rheumatoid arthritis core outcome set using a new assessment method of calculating uptake from data in clinical trial registry entries. Design Review of randomised trials. Setting ClinicalTrials.gov. Subjects 273 randomised trials of drug interventions for the treatment of rheumatoid arthritis and registered in ClinicalTrials.gov between 2002 and 2016. Full publications were identified for completed studies from information in the trial registry or from an internet search using Google and the citation database Web of Science. Main outcome measure The percentage of trials reporting or planning to measure the rheumatoid arthritis core outcome set calculated from the information presented in the trial registry and compared with the percentage reporting the rheumatoid arthritis core outcome set in the resulting trial publications. Results The full rheumatoid arthritis core outcome set was reported in 81% (116/143) of trials identified on the registry as completed (or terminated) for which results were found in either the published literature or the registry. For trials identified on the registry as completed (or terminated), using information only available in the registry gives an estimate for uptake of 77% (145/189). Conclusions The uptake of the rheumatoid arthritis core outcome set in clinical trials has continued to increase over time. Using the information on outcomes listed for completed or terminated studies in a trial registry provides a reasonable estimate of the uptake of a core outcome set and is a more efficient and up-to-date approach than examining the outcomes in published trial reports. The method proposed may provide an efficient approach for an up-to-date assessment of the uptake of the 300 core outcome sets already published. Published by the BMJ Publishing Group Limited. For permission to use (where not already granted under a licence) please go to http://group.bmj.com/group/rights-licensing/permissions.
Dengjel, Jörn; Høyer-Hansen, Maria; Nielsen, Maria O.; Eisenberg, Tobias; Harder, Lea M.; Schandorff, Søren; Farkas, Thomas; Kirkegaard, Thomas; Becker, Andrea C.; Schroeder, Sabrina; Vanselow, Katja; Lundberg, Emma; Nielsen, Mogens M.; Kristensen, Anders R.; Akimov, Vyacheslav; Bunkenborg, Jakob; Madeo, Frank; Jäättelä, Marja; Andersen, Jens S.
2012-01-01
Autophagy is one of the major intracellular catabolic pathways, but little is known about the composition of autophagosomes. To study the associated proteins, we isolated autophagosomes from human breast cancer cells using two different biochemical methods and three stimulus types: amino acid deprivation or rapamycin or concanamycin A treatment. The autophagosome-associated proteins were dependent on stimulus, but a core set of proteins was stimulus-independent. Remarkably, proteasomal proteins were abundant among the stimulus-independent common autophagosome-associated proteins, and the activation of autophagy significantly decreased the cellular proteasome level and activity supporting interplay between the two degradation pathways. A screen of yeast strains defective in the orthologs of the human genes encoding for a common set of autophagosome-associated proteins revealed several regulators of autophagy, including subunits of the retromer complex. The combined spatiotemporal proteomic and genetic data sets presented here provide a basis for further characterization of autophagosome biogenesis and cargo selection. PMID:22311637
Children's everyday exposure to food marketing: an objective analysis using wearable cameras.
Signal, L N; Stanley, J; Smith, M; Barr, M B; Chambers, T J; Zhou, J; Duane, A; Gurrin, C; Smeaton, A F; McKerchar, C; Pearson, A L; Hoek, J; Jenkin, G L S; Ni Mhurchu, C
2017-10-08
Over the past three decades the global prevalence of childhood overweight and obesity has increased by 47%. Marketing of energy-dense nutrient-poor foods and beverages contributes to this worldwide increase. Previous research on food marketing to children largely uses self-report, reporting by parents, or third-party observation of children's environments, with the focus mostly on single settings and/or media. This paper reports on innovative research, Kids'Cam, in which children wore cameras to examine the frequency and nature of everyday exposure to food marketing across multiple media and settings. Kids'Cam was a cross-sectional study of 168 children (mean age 12.6 years, SD = 0.5) in Wellington, New Zealand. Each child wore a wearable camera on four consecutive days, capturing images automatically every seven seconds. Images were manually coded as either recommended (core) or not recommended (non-core) to be marketed to children by setting, marketing medium, and product category. Images in convenience stores and supermarkets were excluded as marketing examples were considered too numerous to count. On average, children were exposed to non-core food marketing 27.3 times a day (95% CI 24.8, 30.1) across all settings. This was more than twice their average exposure to core food marketing (12.3 per day, 95% CI 8.7, 17.4). Most non-core exposures occurred at home (33%), in public spaces (30%) and at school (19%). Food packaging was the predominant marketing medium (74% and 64% for core and non-core foods) followed by signs (21% and 28% for core and non-core). Sugary drinks, fast food, confectionary and snack foods were the most commonly encountered non-core foods marketed. Rates were calculated using Poisson regression. Children in this study were frequently exposed, across multiple settings, to marketing of non-core foods not recommended to be marketed to children. The study provides further evidence of the need for urgent action to reduce children's exposure to marketing of unhealthy foods, and suggests the settings and media in which to act. Such action is necessary if the Commission on Ending Childhood Obesity's vision is to be achieved.
Snyder, David A; Montelione, Gaetano T
2005-06-01
An important open question in the field of NMR-based biomolecular structure determination is how best to characterize the precision of the resulting ensemble of structures. Typically, the RMSD, as minimized in superimposing the ensemble of structures, is the preferred measure of precision. However, the presence of poorly determined atomic coordinates and multiple "RMSD-stable domains"--locally well-defined regions that are not aligned in global superimpositions--complicate RMSD calculations. In this paper, we present a method, based on a novel, structurally defined order parameter, for identifying a set of core atoms to use in determining superimpositions for RMSD calculations. In addition we present a method for deciding whether to partition that core atom set into "RMSD-stable domains" and, if so, how to determine partitioning of the core atom set. We demonstrate our algorithm and its application in calculating statistically sound RMSD values by applying it to a set of NMR-derived structural ensembles, superimposing each RMSD-stable domain (or the entire core atom set, where appropriate) found in each protein structure under consideration. A parameter calculated by our algorithm using a novel, kurtosis-based criterion, the epsilon-value, is a measure of precision of the superimposition that complements the RMSD. In addition, we compare our algorithm with previously described algorithms for determining core atom sets. The methods presented in this paper for biomolecular structure superimposition are quite general, and have application in many areas of structural bioinformatics and structural biology.
Wen, Zhensong; Sertil, Odeniel; Cheng, Yongxin; Zhang, Shanshan; Liu, Xue; Wang, Wen-Ching
2015-01-01
Streptococcus pneumoniae is a major bacterial pathogen in humans. Its polysaccharide capsule is a key virulence factor that promotes bacterial evasion of human phagocytic killing. While S. pneumoniae produces at least 94 antigenically different types of capsule, the genes for biosynthesis of almost all capsular types are arranged in the same locus. The transcription of the capsular polysaccharide (cps) locus is not well understood. This study determined the transcriptional features of the cps locus in the type 2 virulent strain D39. The initial analysis revealed that the cps genes are cotranscribed from a major transcription start site at the −25 nucleotide (G) upstream of cps2A, the first gene in the locus. Using unmarked chromosomal truncations and a luciferase-based transcriptional reporter, we showed that the full transcription of the cps genes not only depends on the core promoter immediately upstream of cps2A, but also requires additional elements upstream of the core promoter, particularly a 59-bp sequence immediately upstream of the core promoter. Unmarked deletions of these promoter elements in the D39 genome also led to significant reduction in CPS production and virulence in mice. Lastly, common cps gene (cps2ABCD) mutants did not show significant abnormality in cps transcription, although they produced significantly less CPS, indicating that the CpsABCD proteins are involved in the encapsulation of S. pneumoniae in a posttranscriptional manner. This study has yielded important information on the transcriptional characteristics of the cps locus in S. pneumoniae. PMID:25733517
Six, Christophe; Thomas, Jean-Claude; Garczarek, Laurence; Ostrowski, Martin; Dufresne, Alexis; Blot, Nicolas; Scanlan, David J; Partensky, Frédéric
2007-01-01
Marine Synechococcus owe their specific vivid color (ranging from blue-green to orange) to their large extrinsic antenna complexes called phycobilisomes, comprising a central allophycocyanin core and rods of variable phycobiliprotein composition. Three major pigment types can be defined depending on the major phycobiliprotein found in the rods (phycocyanin, phycoerythrin I or phycoerythrin II). Among strains containing both phycoerythrins I and II, four subtypes can be distinguished based on the ratio of the two chromophores bound to these phycobiliproteins. Genomes of eleven marine Synechococcus strains recently became available with one to four strains per pigment type or subtype, allowing an unprecedented comparative genomics study of genes involved in phycobilisome metabolism. By carefully comparing the Synechococcus genomes, we have retrieved candidate genes potentially required for the synthesis of phycobiliproteins in each pigment type. This includes linker polypeptides, phycobilin lyases and a number of novel genes of uncharacterized function. Interestingly, strains belonging to a given pigment type have similar phycobilisome gene complements and organization, independent of the core genome phylogeny (as assessed using concatenated ribosomal proteins). While phylogenetic trees based on concatenated allophycocyanin protein sequences are congruent with the latter, those based on phycocyanin and phycoerythrin notably differ and match the Synechococcus pigment types. We conclude that the phycobilisome core has likely evolved together with the core genome, while rods must have evolved independently, possibly by lateral transfer of phycobilisome rod genes or gene clusters between Synechococcus strains, either via viruses or by natural transformation, allowing rapid adaptation to a variety of light niches.
Schaid, Daniel J; Sinnwell, Jason P; Jenkins, Gregory D; McDonnell, Shannon K; Ingle, James N; Kubo, Michiaki; Goss, Paul E; Costantino, Joseph P; Wickerham, D Lawrence; Weinshilboum, Richard M
2012-01-01
Gene-set analyses have been widely used in gene expression studies, and some of the developed methods have been extended to genome wide association studies (GWAS). Yet, complications due to linkage disequilibrium (LD) among single nucleotide polymorphisms (SNPs), and variable numbers of SNPs per gene and genes per gene-set, have plagued current approaches, often leading to ad hoc "fixes." To overcome some of the current limitations, we developed a general approach to scan GWAS SNP data for both gene-level and gene-set analyses, building on score statistics for generalized linear models, and taking advantage of the directed acyclic graph structure of the gene ontology when creating gene-sets. However, other types of gene-set structures can be used, such as the popular Kyoto Encyclopedia of Genes and Genomes (KEGG). Our approach combines SNPs into genes, and genes into gene-sets, but assures that positive and negative effects of genes on a trait do not cancel. To control for multiple testing of many gene-sets, we use an efficient computational strategy that accounts for LD and provides accurate step-down adjusted P-values for each gene-set. Application of our methods to two different GWAS provide guidance on the potential strengths and weaknesses of our proposed gene-set analyses. © 2011 Wiley Periodicals, Inc.
Nathaniel, Thomas I; Otukonyong, Effiong; Abdellatif, Ahmed; Soyinka, Julius O
2012-10-01
Recent investigations of hypoxia physiology in the naked mole rat have opened up an interesting line of research into the basic physiological and genomic alterations that accompany hypoxia survival. The extent to which such findings connect the effect of hypoxia to metabolic rate (O₂ consumption), core body temperature (Tb), and transcripts encoding the immediate early gene product (such as c-fos) under a constant ambient temperature (Ta) is not well known. We investigated this issue in the current study. Our first sets of experiments measured Tb and metabolic rates during exposure of naked mole rats to hypoxia over a constant Ta. Hypoxia significantly decreased metabolic rates in the naked mole rat. Although core Tb also decreased during hypoxia, the effect of hypoxia in suppressing core Tb was not significant. The second series of experiments revealed that c-fos protein and mRNA expression in the hippocampus neurons (CA1) increased in naked mole rats that were repeatedly exposed to 3% O₂ for 60 min per day for 5 days when compared to normoxia. Our findings provide evidence for the up-regulation of c-fos and suppression of metabolic rate in hypoxia tolerating naked mole rats under constant ambient temperature. Metabolic suppression and c-fos upregulation constitute part of the physiological complex associated with adaptation to hypoxia. Published by Elsevier Ltd.
Targeted interactomics reveals a complex core cell cycle machinery in Arabidopsis thaliana.
Van Leene, Jelle; Hollunder, Jens; Eeckhout, Dominique; Persiau, Geert; Van De Slijke, Eveline; Stals, Hilde; Van Isterdael, Gert; Verkest, Aurine; Neirynck, Sandy; Buffel, Yelle; De Bodt, Stefanie; Maere, Steven; Laukens, Kris; Pharazyn, Anne; Ferreira, Paulo C G; Eloy, Nubia; Renne, Charlotte; Meyer, Christian; Faure, Jean-Denis; Steinbrenner, Jens; Beynon, Jim; Larkin, John C; Van de Peer, Yves; Hilson, Pierre; Kuiper, Martin; De Veylder, Lieven; Van Onckelen, Harry; Inzé, Dirk; Witters, Erwin; De Jaeger, Geert
2010-08-10
Cell proliferation is the main driving force for plant growth. Although genome sequence analysis revealed a high number of cell cycle genes in plants, little is known about the molecular complexes steering cell division. In a targeted proteomics approach, we mapped the core complex machinery at the heart of the Arabidopsis thaliana cell cycle control. Besides a central regulatory network of core complexes, we distinguished a peripheral network that links the core machinery to up- and downstream pathways. Over 100 new candidate cell cycle proteins were predicted and an in-depth biological interpretation demonstrated the hypothesis-generating power of the interaction data. The data set provided a comprehensive view on heterodimeric cyclin-dependent kinase (CDK)-cyclin complexes in plants. For the first time, inhibitory proteins of plant-specific B-type CDKs were discovered and the anaphase-promoting complex was characterized and extended. Important conclusions were that mitotic A- and B-type cyclins form complexes with the plant-specific B-type CDKs and not with CDKA;1, and that D-type cyclins and S-phase-specific A-type cyclins seem to be associated exclusively with CDKA;1. Furthermore, we could show that plants have evolved a combinatorial toolkit consisting of at least 92 different CDK-cyclin complex variants, which strongly underscores the functional diversification among the large family of cyclins and reflects the pivotal role of cell cycle regulation in the developmental plasticity of plants.
Hori, Motohide; Nakamachi, Tomoya; Shibato, Junko; Rakwal, Randeep; Tsuchida, Masachi; Shioda, Seiji; Numazawa, Satoshi
2014-01-01
Pituitary adenylate-cyclase activating polypeptide (PACAP) has neuroprotective and axonal guidance functions, but the mechanisms behind such actions remain unclear. Previously we examined effects of PACAP (PACAP38, 1 pmol) injection intracerebroventrically in a mouse model of permanent middle cerebral artery occlusion (PMCAO) along with control saline (0.9% NaCl) injection. Transcriptomic and proteomic approaches using ischemic (ipsilateral) brain hemisphere revealed differentially regulated genes and proteins by PACAP38 at 6 and 24 h post-treatment. However, as the ischemic hemisphere consisted of infarct core, penumbra, and non-ischemic regions, specificity of expression and localization of these identified molecular factors remained incomplete. This led us to devise a new experimental strategy wherein, ischemic core and penumbra were carefully sampled and compared to the corresponding contralateral (healthy) core and penumbra regions at 6 and 24 h post PACAP38 or saline injections. Both reverse transcription-polymerase chain reaction (RT-PCR) and Western blotting were used to examine targeted gene expressions and the collapsin response mediator protein 2 (CRMP2) protein profiles, respectively. Clear differences in expression of genes and CRMP2 protein abundance and degradation product/short isoform was observed between ischemic core and penumbra and also compared to the contralateral healthy tissues after PACAP38 or saline treatment. Results indicate the importance of region-specific analyses to further identify, localize and functionally analyse target molecular factors for clarifying the neuroprotective function of PACAP38. PMID:25257527
No3CoGP: non-conserved and conserved coexpressed gene pairs.
Mal, Chittabrata; Aftabuddin, Md; Kundu, Sudip
2014-12-08
Analyzing the microarray data of different conditions, one can identify the conserved and condition-specific genes and gene modules, and thus can infer the underlying cellular activities. All the available tools based on Bioconductor and R packages differ in how they extract differential coexpression and at what level they study. There is a need for a user-friendly, flexible tool which can start analysis using raw or preprocessed microarray data and can report different levels of useful information. We present a GUI software, No3CoGP: Non-Conserved and Conserved Coexpressed Gene Pairs which takes Affymetrix microarray data (.CEL files or log2 normalized.txt files) along with annotation file (.csv file), Chip Definition File (CDF file) and probe file as inputs, utilizes the concept of network density cut-off and Fisher's z-test to extract biologically relevant information. It can identify four possible types of gene pairs based on their coexpression relationships. These are (i) gene pair showing coexpression in one condition but not in the other, (ii) gene pair which is positively coexpressed in one condition but negatively coexpressed in the other condition, (iii) positively and (iv) negatively coexpressed in both the conditions. Further, it can generate modules of coexpressed genes. Easy-to-use GUI interface enables researchers without knowledge in R language to use No3CoGP. Utilization of one or more CPU cores, depending on the availability, speeds up the program. The output files stored in the respective directories under the user-defined project offer the researchers to unravel condition-specific functionalities of gene, gene sets or modules.
Orbai, Ana-Maria; de Wit, Maarten; Mease, Philip J; Callis Duffin, Kristina; Elmamoun, Musaab; Tillett, William; Campbell, Willemina; FitzGerald, Oliver; Gladman, Dafna D; Goel, Niti; Gossec, Laure; Hoejgaard, Pil; Leung, Ying Ying; Lindsay, Chris; Strand, Vibeke; van der Heijde, Désirée M; Shea, Bev; Christensen, Robin; Coates, Laura; Eder, Lihi; McHugh, Neil; Kalyoncu, Umut; Steinkoenig, Ingrid; Ogdie, Alexis
2017-10-01
To include the patient perspective in accordance with the Outcome Measures in Rheumatology (OMERACT) Filter 2.0 in the updated Psoriatic Arthritis (PsA) Core Domain Set for randomized controlled trials (RCT) and longitudinal observational studies (LOS). At OMERACT 2016, research conducted to update the PsA Core Domain Set was presented and discussed in breakout groups. The updated PsA Core Domain Set was voted on and endorsed by OMERACT participants. We conducted a systematic literature review of domains measured in PsA RCT and LOS, and identified 24 domains. We conducted 24 focus groups with 130 patients from 7 countries representing 5 continents to identify patient domains. We achieved consensus through 2 rounds of separate surveys with 50 patients and 75 physicians, and a nominal group technique meeting with 12 patients and 12 physicians. We conducted a workshop and breakout groups at OMERACT 2016 in which findings were presented and discussed. The updated PsA Core Domain Set endorsed with 90% agreement by OMERACT 2016 participants included musculoskeletal disease activity, skin disease activity, fatigue, pain, patient's global assessment, physical function, health-related quality of life, and systemic inflammation, which were recommended for all RCT and LOS. These were important, but not required in all RCT and LOS: economic cost, emotional well-being, participation, and structural damage. Independence, sleep, stiffness, and treatment burden were on the research agenda. The updated PsA Core Domain Set was endorsed at OMERACT 2016. Next steps for the PsA working group include evaluation of PsA outcome measures and development of a PsA Core Outcome Measurement Set.
Teng, S; Thomson, P A; McCarthy, S; Kramer, M; Muller, S; Lihm, J; Morris, S; Soares, D C; Hennah, W; Harris, S; Camargo, L M; Malkov, V; McIntosh, A M; Millar, J K; Blackwood, D H; Evans, K L; Deary, I J; Porteous, D J; McCombie, W R
2018-05-01
Schizophrenia (SCZ), bipolar disorder (BD) and recurrent major depressive disorder (rMDD) are common psychiatric illnesses. All have been associated with lower cognitive ability, and show evidence of genetic overlap and substantial evidence of pleiotropy with cognitive function and neuroticism. Disrupted in schizophrenia 1 (DISC1) protein directly interacts with a large set of proteins (DISC1 Interactome) that are involved in brain development and signaling. Modulation of DISC1 expression alters the expression of a circumscribed set of genes (DISC1 Regulome) that are also implicated in brain biology and disorder. Here we report targeted sequencing of 59 DISC1 Interactome genes and 154 Regulome genes in 654 psychiatric patients and 889 cognitively-phenotyped control subjects, on whom we previously reported evidence for trait association from complete sequencing of the DISC1 locus. Burden analyses of rare and singleton variants predicted to be damaging were performed for psychiatric disorders, cognitive variables and personality traits. The DISC1 Interactome and Regulome showed differential association across the phenotypes tested. After family-wise error correction across all traits (FWER across ), an increased burden of singleton disruptive variants in the Regulome was associated with SCZ (FWER across P=0.0339). The burden of singleton disruptive variants in the DISC1 Interactome was associated with low cognitive ability at age 11 (FWER across P=0.0043). These results identify altered regulation of schizophrenia candidate genes by DISC1 and its core Interactome as an alternate pathway for schizophrenia risk, consistent with the emerging effects of rare copy number variants associated with intellectual disability.
Boĭko, A G; Labas, Iu A; Gordeeva, A V
2009-01-01
Natural selection is just one of the factors determining genome evolution of Metazoa. But it's not a domineering one along with non-adaptive processes: horizontal gene transfer and input of egoistic genetic elements. That's why in phylogenesis (1) there are more genes of first Metazoa lost than new ones acquired; (2) the appearance of new genes among Metazoa branches is a very rare occasion; (3) genetically Metazoa is a homogeneous group of species with similar set of cellular mechanisms which was established in the course of evolution. The genome of first Metazoa turned to be so successful that evolution connected with organism amplification didn't demand radical changes in the genetic repertoire but it demanded changes in DNA sites regulating genes work. These facts along with the fact of existence of species of Metazoa with negligible aging overturn the core theories of aging biology which consider this or that cellular mechanism to be the initiating factor of organism's aging as sets of cellular mechanisms in aging and non-aging Metazoa forms are practically identical. That's why the basis of aging biology is in essence a collection of theories and dogmas that have never been proved but which are still in use and which have since long ago turned into a dangerous myth standing in the way of progress. If we are interested in progress of biogerontology a number of domineering pseudo scientific dogmas must be revisited. The matter of conservatism in this issue is inappropriate.
Praz, Coraline R; Menardo, Fabrizio; Robinson, Mark D; Müller, Marion C; Wicker, Thomas; Bourras, Salim; Keller, Beat
2018-01-01
Powdery mildew is an important disease of cereals. It is caused by one species, Blumeria graminis , which is divided into formae speciales each of which is highly specialized to one host. Recently, a new form capable of growing on triticale ( B.g. triticale ) has emerged through hybridization between wheat and rye mildews ( B.g. tritici and B.g. secalis , respectively). In this work, we used RNA sequencing to study the molecular basis of host adaptation in B.g. triticale . We analyzed gene expression in three B.g. tritici isolates, two B.g. secalis isolates and two B.g. triticale isolates and identified a core set of putative effector genes that are highly expressed in all formae speciales . We also found that the genes differentially expressed between isolates of the same form as well as between different formae speciales were enriched in putative effectors. Their coding genes belong to several families including some which contain known members of mildew avirulence ( Avr ) and suppressor ( Svr ) genes. Based on these findings we propose that effectors play an important role in host adaptation that is mechanistically based on Avr-Resistance gene-Svr interactions. We also found that gene expression in the B.g. triticale hybrid is mostly conserved with the parent-of-origin, but some genes inherited from B.g. tritici showed a B.g. secalis -like expression. Finally, we identified 11 unambiguous cases of putative effector genes with hybrid-specific, non-parent of origin gene expression, and we propose that they are possible determinants of host specialization in triticale mildew. These data suggest that altered expression of multiple effector genes, in particular Avr and Svr related factors, might play a role in mildew host adaptation based on hybridization.
Biogeography of serpentinite-hosted microbial ecosystems
NASA Astrophysics Data System (ADS)
Brazelton, W.; Cardace, D.; Fruh-Green, G.; Lang, S. Q.; Lilley, M. D.; Morrill, P. L.; Szponar, N.; Twing, K. I.; Schrenk, M. O.
2012-12-01
Ultramafic rocks in the Earth's mantle represent a tremendous reservoir of carbon and reducing power. Upon tectonic uplift and exposure to fluid flow, serpentinization of these materials generates copious energy, sustains abiogenic synthesis of organic molecules, and releases hydrogen gas (H2). To date, however, the "serpentinite microbiome" is poorly constrained- almost nothing is known about the microbial diversity endemic to rocks actively undergoing serpentinization. Through the Census of Deep Life, we have obtained 16S rRNA gene pyrotag sequences from fluids and rocks from serpentinizing ophiolites in California, Canada, and Italy. The samples include high pH serpentinite springs, presumably representative of deeper environments within the ophiolite complex, wells which directly access subsurface aquifers, and rocks obtained from drill cores into serpentinites. These data represent a unique opportunity to examine biogeographic patterns among a restricted set of microbial taxa that are adapted to similar environmental conditions and are inhabiting sites with related geological histories. In general, our results point to potentially H2-utilizing Betaproteobacteria thriving in shallow, oxic-anoxic transition zones and anaerobic Clostridia thriving in anoxic, deep subsurface habitats. These general taxonomic and biogeochemical trends were also observed in seafloor Lost City hydrothermal chimneys, indicating that we are beginning to identify a core serpentinite microbial community that spans marine and continental settings.
[The true story and advantages of the famous Hepatitis B virus core particles: Outlook 2016].
Pumpens, P; Grens, E
2016-01-01
This review article is a continuation of the paper "Hepatitis B core particles as a universal display model: a structure-function basis for development" written by Pumpens P. and Grens E., ordered by Professor Lev Kisselev and published in FEBS Letters, 1999, 442, 1-6. The past 17 years have strengthened the paper's finding that the human hepatitis B virus core protein, along with other Hepadnaviridae family member core proteins, is a mysterious, multifunctional protein. The core gene of the Hepadnaviridae genome encodes five partially collinear proteins. The most important of these is the HBV core protein p21, or HBc. It can self-assemble by forming viral HBc particles, but also plays a crucial role in the regulation of viral replication. Since 1986, the HBc protein has been one of the first and the most successful tools of the virus-like particle (VLP) technology. Later, the woodchuck hepatitis virus core protein (WHc) was also used as a VLP carrier. The Hepadnaviridae core proteins remain favourite VLP candidates for the knowledge-based design of future vaccines, gene therapy vectors, specifically targeted nanocontainers, and other modern nanotechnological tools for prospective medical use.
Spoorenberg, Sophie L W; Reijneveld, Sijmen A; Middel, Berrie; Uittenbroek, Ronald J; Kremer, Hubertus P H; Wynia, Klaske
2015-01-01
The aim of the present study was to develop a valid Geriatric ICF Core Set reflecting relevant health-related problems of community-living older adults without dementia. A Delphi study was performed in order to reach consensus (≥70% agreement) on second-level categories from the International Classification of Functioning, Disability and Health (ICF). The Delphi panel comprised 41 older adults, medical and non-medical experts. Content validity of the set was tested in a cross-sectional study including 267 older adults identified as frail or having complex care needs. Consensus was reached for 30 ICF categories in the Delphi study (fourteen Body functions, ten Activities and Participation and six Environmental Factors categories). Content validity of the set was high: the prevalence of all the problems was >10%, except for d530 Toileting. The most frequently reported problems were b710 Mobility of joint functions (70%), b152 Emotional functions (65%) and b455 Exercise tolerance functions (62%). No categories had missing values. The final Geriatric ICF Core Set is a comprehensive and valid set of 29 ICF categories, reflecting the most relevant health-related problems among community-living older adults without dementia. This Core Set may contribute to optimal care provision and support of the older population. Implications for Rehabilitation The Geriatric ICF Core Set may provide a practical tool for gaining an understanding of the relevant health-related problems of community-living older adults without dementia. The Geriatric ICF Core Set may be used in primary care practice as an assessment tool in order to tailor care and support to the needs of older adults. The Geriatric ICF Core Set may be suitable for use in multidisciplinary teams in integrated care settings, since it is based on a broad range of problems in functioning. Professionals should pay special attention to health problems related to mobility and emotional functioning since these are the most prevalent problems in community-living older adults.
Chung, Pearl; Yun, Sarah Jin; Khan, Fary
2014-02-01
To compare the contents of participation outcome measures in traumatic brain injury with the International Classification of Functioning, Disability and Health (ICF) Core Sets for traumatic brain injury. A systematic search with an independent review process selected relevant articles to identify outcome measures in participation in traumatic brain injury. Instruments used in two or more studies were linked to the ICF categories, which identified categories in participation for comparison with the ICF Core Sets for traumatic brain injury. Selected articles (n = 101) identified participation instruments used in two or more studies (n = 9): Community Integration Questionnaire, Craig Handicap Assessment and Reporting Technique, Mayo-Portland Adaptability Inventory-4 Participation Index, Sydney Psychosocial Reintegration Scale Version-2, Participation Assessment with Recombined Tool-Objective, Community Integration Measure, Participation Objective Participation Subjective, Community Integration Questionnaire-2, and Quality of Community Integration Questionnaire. Each instrument was linked to 4-35 unique second-level ICF categories, of which 39-100% related to participation. Instruments addressed 86-100% and 50-100% of the participation categories in the Comprehensive and Brief ICF Core Sets for traumatic brain injury, respectively. Participation measures in traumatic brain injury were compared with the ICF Core Sets for traumatic brain injury. The ICF Core Sets for traumatic brain injury could contribute to the development and selection of participation measures.
Ballert, C; Oberhauser, C; Biering-Sørensen, F; Stucki, G; Cieza, A
2012-10-01
Psychometric study analyzing the data of a cross-sectional, multicentric study with 1048 persons with spinal cord injury (SCI). To shed light on how to apply the Brief Core Sets for SCI of the International Classification of Functioning, Disability and Health (ICF) by determining whether the ICF categories contained in the Core Sets capture differences in overall health. Lasso regression was applied using overall health, rated by the patients and health professionals, as dependent variables and the ICF categories of the Comprehensive ICF Core Sets for SCI as independent variables. The ICF categories that best capture differences in overall health refer to areas of life such as self-care, relationships, economic self-sufficiency and community life. Only about 25% of the ICF categories of the Brief ICF Core Sets for the early post-acute and for long-term contexts were selected in the Lasso regression and differentiate, therefore, among levels of overall health. ICF categories such as d570 Looking after one's health, d870 Economic self-sufficiency, d620 Acquisition of goods and services and d910 Community life, which capture changes in overall health in patients with SCI, should be considered in addition to those of the Brief ICF Core Sets in clinical and epidemiological studies in persons with SCI.
Evaluation of Ceramic Honeycomb Core Compression Behavior at Room Temperature
NASA Technical Reports Server (NTRS)
Bird, Richard K.; Lapointe, Thomas S.
2013-01-01
Room temperature flatwise compression tests were conducted on two varieties of ceramic honeycomb core specimens that have potential for high-temperature structural applications. One set of specimens was fabricated using strips of a commercially-available thin-gage "ceramic paper" sheet molded into a hexagonal core configuration. The other set was fabricated by machining honeycomb core directly from a commercially available rigid insulation tile material. This paper summarizes the results from these tests.
The Mouse Genome Database (MGD): facilitating mouse as a model for human biology and disease.
Eppig, Janan T; Blake, Judith A; Bult, Carol J; Kadin, James A; Richardson, Joel E
2015-01-01
The Mouse Genome Database (MGD, http://www.informatics.jax.org) serves the international biomedical research community as the central resource for integrated genomic, genetic and biological data on the laboratory mouse. To facilitate use of mouse as a model in translational studies, MGD maintains a core of high-quality curated data and integrates experimentally and computationally generated data sets. MGD maintains a unified catalog of genes and genome features, including functional RNAs, QTL and phenotypic loci. MGD curates and provides functional and phenotype annotations for mouse genes using the Gene Ontology and Mammalian Phenotype Ontology. MGD integrates phenotype data and associates mouse genotypes to human diseases, providing critical mouse-human relationships and access to repositories holding mouse models. MGD is the authoritative source of nomenclature for genes, genome features, alleles and strains following guidelines of the International Committee on Standardized Genetic Nomenclature for Mice. A new addition to MGD, the Human-Mouse: Disease Connection, allows users to explore gene-phenotype-disease relationships between human and mouse. MGD has also updated search paradigms for phenotypic allele attributes, incorporated incidental mutation data, added a module for display and exploration of genes and microRNA interactions and adopted the JBrowse genome browser. MGD resources are freely available to the scientific community. © The Author(s) 2014. Published by Oxford University Press on behalf of Nucleic Acids Research.
Gesing, Stefan; Schindler, Daniel; Fränzel, Benjamin; Wolters, Dirk; Nowrousian, Minou
2012-05-01
Ascomycetes develop four major types of fruiting bodies that share a common ancestor, and a set of common core genes most likely controls this process. One way to identify such genes is to search for conserved expression patterns. We analysed microarray data of Fusarium graminearum and Sordaria macrospora, identifying 78 genes with similar expression patterns during fruiting body development. One of these genes was asf1 (anti-silencing function 1), encoding a predicted histone chaperone. asf1 expression is also upregulated during development in the distantly related ascomycete Pyronema confluens. To test whether asf1 plays a role in fungal development, we generated an S. macrospora asf1 deletion mutant. The mutant is sterile and can be complemented to fertility by transformation with the wild-type asf1 and its P. confluens homologue. An ASF1-EGFP fusion protein localizes to the nucleus. By tandem-affinity purification/mass spectrometry as well as yeast two-hybrid analysis, we identified histones H3 and H4 as ASF1 interaction partners. Several developmental genes are dependent on asf1 for correct transcriptional expression. Deletion of the histone chaperone genes rtt106 and cac2 did not cause any developmental phenotypes. These data indicate that asf1 of S. macrospora encodes a conserved histone chaperone that is required for fruiting body development. © 2012 Blackwell Publishing Ltd.
Li, Xinyue; Du, Yu; Du, Pengcheng; Dai, Hang; Fang, Yujie; Li, Zhenpeng; Lv, Na; Zhu, Baoli; Kan, Biao; Wang, Duochun
2016-01-01
SXT/R391 integrative and conjugative elements (ICEs) are self-transmissible mobile genetic elements that are found in most members of Enterobacteriaceae. Here, we determined fifteen SXT/R391 ICEs carried by Proteus isolates from food (4.2%) and diarrhoea patients (17.3%). BLASTn searches against GenBank showed that the fifteen SXT/R391 ICEs were closely related to that from different Enterobacteriaceae species, including Proteus mirabilis. Using core gene phylogenetic analysis, the fifteen SXT/R391 ICEs were grouped into six distinct clusters, including a dominant cluster and three clusters that have not been previously reported in Proteus isolates. The SXT/R391 ICEs shared a common structure with a set of conserved genes, five hotspots and two variable regions, which contained more foreign genes, including drug-resistance genes. Notably, a class A β-lactamase gene was identified in nine SXT/R391 ICEs. Collectively, the ICE-carrying isolates carried resistance genes for 20 tested drugs. Six isolates were resistant to chloramphenicol, kanamycin, streptomycin, trimethoprim-sulfamethoxazole, sulfisoxazole and tetracycline, which are drug resistances commonly encoded by ICEs. Our results demonstrate abundant genetic diversity and multidrug resistance of the SXT/R391 ICEs carried by Proteus isolates, which may have significance for public health. It is therefore necessary to continuously monitor the antimicrobial resistance and related mobile elements among Proteus isolates. PMID:27892525
Estimation and Control for Autonomous Coring from a Rover Manipulator
NASA Technical Reports Server (NTRS)
Hudson, Nicolas; Backes, Paul; DiCicco, Matt; Bajracharya, Max
2010-01-01
A system consisting of a set of estimators and autonomous behaviors has been developed which allows robust coring from a low-mass rover platform, while accommodating for moderate rover slip. A redundant set of sensors, including a force-torque sensor, visual odometry, and accelerometers are used to monitor discrete critical and operational modes, as well as to estimate continuous drill parameters during the coring process. A set of critical failure modes pertinent to shallow coring from a mobile platform is defined, and autonomous behaviors associated with each critical mode are used to maintain nominal coring conditions. Autonomous shallow coring is demonstrated from a low-mass rover using a rotary-percussive coring tool mounted on a 5 degree-of-freedom (DOF) arm. A new architecture of using an arm-stabilized, rotary percussive tool with the robotic arm used to provide the drill z-axis linear feed is validated. Particular attention to hole start using this architecture is addressed. An end-to-end coring sequence is demonstrated, where the rover autonomously detects and then recovers from a series of slip events that exceeded 9 cm total displacement.
Schlessinger, Daniel I; Iyengar, Sanjana; Yanes, Arianna F; Henley, Jill K; Ashchyan, Hovik J; Kurta, Anastasia O; Patel, Payal M; Sheikh, Umar A; Franklin, Matthew J; Hanna, Courtney C; Chen, Brian R; Chiren, Sarah G; Schmitt, Jochen; Deckert, Stefanie; Furlan, Karina C; Poon, Emily; Maher, Ian A; Cartee, Todd V; Sobanko, Joseph F; Alam, Murad
2017-08-01
Facial aging is a concern for many patients. Wrinkles, loss of volume, and discoloration are common physical manifestations of aging skin. Genetic heritage, prior ultraviolet light exposure, and Fitzpatrick skin type may be associated with the rate and type of facial aging. Although many clinical trials assess the correlates of skin aging, there is heterogeneity in the outcomes assessed, which limits the quality of evaluation and comparison of treatment modalities. To address the inconsistency in outcomes, in this project we will develop a core set of outcomes that are to be evaluated in all clinical trials relevant to facial aging. A long list of measureable outcomes will be created from four sources: (1) systematic medical literature review, (2) patient interviews, (3) other published sources, and (4) stakeholder involvement. Two rounds of Delphi processes with homogeneous groups of physicians and patients will be performed to prioritize and condense the list. At a consensus meeting attended by physicians, patients, and stakeholders, outcomes will be further condensed on the basis of participant scores. By the end of the meeting, members will vote and decide on a final recommended set of core outcomes. Subsequent to this, specific measures will be selected or created to assess these outcomes. The aim of this study is to develop a core outcome set and relevant measures for clinical trials relevant to facial aging. We hope to improve the reliability and consistency of outcome reporting of skin aging, thereby enabling improved evaluation of treatment efficacy and patient satisfaction. Core Outcome Measures in Effectiveness Trials (COMET) Initiative, accessible at http://www.comet-initiative.org/studies/details/737 . Core Outcomes Set Initiative, (CSG-COUSIN) accessible at https://www.uniklinikum-dresden.de/de/das-klinikum/universitaetscentren/zegv/cousin/meet-the-teams/project-groups/core-outcome-set-for-the-appearance-of-facial-aging . Protocol version date is 28 July 2016.
Crosstalk of clock gene expression and autophagy in aging
Kalfalah, Faiza; Janke, Linda; Schiavi, Alfonso; Tigges, Julia; Ix, Alexander; Ventura, Natascia; Boege, Fritz; Reinke, Hans
2016-01-01
Autophagy and the circadian clock counteract tissue degeneration and support longevity in many organisms. Accumulating evidence indicates that aging compromises both the circadian clock and autophagy but the mechanisms involved are unknown. Here we show that the expression levels of transcriptional repressor components of the circadian oscillator, most prominently the human Period homologue PER2, are strongly reduced in primary dermal fibroblasts from aged humans, while raising the expression of PER2 in the same cells partially restores diminished autophagy levels. The link between clock gene expression and autophagy is corroborated by the finding that the circadian clock drives cell-autonomous, rhythmic autophagy levels in immortalized murine fibroblasts, and that siRNA-mediated downregulation of PER2 decreases autophagy levels while leaving core clock oscillations intact. Moreover, the Period homologue lin-42 regulates autophagy and life span in the nematode Caenorhabditis elegans, suggesting an evolutionarily conserved role for Period proteins in autophagy control and aging. Taken together, this study identifies circadian clock proteins as set-point regulators of autophagy and puts forward a model, in which age-related changes of clock gene expression promote declining autophagy levels. PMID:27574892
Crosstalk of clock gene expression and autophagy in aging.
Kalfalah, Faiza; Janke, Linda; Schiavi, Alfonso; Tigges, Julia; Ix, Alexander; Ventura, Natascia; Boege, Fritz; Reinke, Hans
2016-08-28
Autophagy and the circadian clock counteract tissue degeneration and support longevity in many organisms. Accumulating evidence indicates that aging compromises both the circadian clock and autophagy but the mechanisms involved are unknown. Here we show that the expression levels of transcriptional repressor components of the circadian oscillator, most prominently the human Period homologue PER2 , are strongly reduced in primary dermal fibroblasts from aged humans, while raising the expression of PER2 in the same cells partially restores diminished autophagy levels. The link between clock gene expression and autophagy is corroborated by the finding that the circadian clock drives cell-autonomous, rhythmic autophagy levels in immortalized murine fibroblasts, and that siRNA-mediated downregulation of PER2 decreases autophagy levels while leaving core clock oscillations intact. Moreover, the Period homologue lin-42 regulates autophagy and life span in the nematode Caenorhabditis elegans , suggesting an evolutionarily conserved role for Period proteins in autophagy control and aging. Taken together, this study identifies circadian clock proteins as set-point regulators of autophagy and puts forward a model, in which age-related changes of clock gene expression promote declining autophagy levels.
Leliaert, Frederik; Marcelino, Vanessa R
2018-01-01
Abstract Chloroplast genomes have undergone tremendous alterations through the evolutionary history of the green algae (Chloroplastida). This study focuses on the evolution of chloroplast genomes in the siphonous green algae (order Bryopsidales). We present five new chloroplast genomes, which along with existing sequences, yield a data set representing all but one families of the order. Using comparative phylogenetic methods, we investigated the evolutionary dynamics of genomic features in the order. Our results show extensive variation in chloroplast genome architecture and intron content. Variation in genome size is accounted for by the amount of intergenic space and freestanding open reading frames that do not show significant homology to standard plastid genes. We show the diversity of these nonstandard genes based on their conserved protein domains, which are often associated with mobile functions (reverse transcriptase/intron maturase, integrases, phage- or plasmid-DNA primases, transposases, integrases, ligases). Investigation of the introns showed proliferation of group II introns in the early evolution of the order and their subsequent loss in the core Halimedineae, possibly through RT-mediated intron loss. PMID:29635329
Effects of nitrogen seeding on core ion thermal transport in JET ILW L-mode plasmas
NASA Astrophysics Data System (ADS)
Bonanomi, N.; Mantica, P.; Citrin, J.; Giroud, C.; Lerche, E.; Sozzi, C.; Taylor, D.; Tsalas, M.; Van Eester, D.; contributors, JET
2018-02-01
A set of experiments was carried out in JET ILW (Joint European Torus with ITER-Like Wall) L-mode plasmas in order to study the effects of light impurities on core ion thermal transport. N was puffed into some discharges and its profile was measured by active Charge Exchange diagnostics, while ICRH power was deposited on- and off-axis in ({\\hspace{0pt}}3He)-D minority scheme in order to have a scan of local heat flux at constant total power with and without N injection. Experimentally, the ion temperature profiles are more peaked for similar heat fluxes when N is injected in the plasma. Gyro-kinetic simulations using the GENE code indicate that a stabilization of Ion Temperature Gradient driven turbulent transport due to main ion dilution and to changes in Te/Ti and s/q is responsible of the enhanced peaking. The quasi-linear models TGLF and QuaLiKiz are tested against the experimental and the gyro-kinetic results.
Molecular Characterization of Shiga Toxin-Producing Escherichia coli Strains Isolated in Poland.
Januszkiewicz, Aleksandra; Rastawicki, Waldemar
2016-08-26
Shiga toxin-producing Escherichia coli (STEC) strains also called verotoxin-producing E. coli (VTEC) represent one of the most important groups of food-borne pathogens that can cause several human diseases such as hemorrhagic colitis (HC) and hemolytic - uremic syndrome (HUS) worldwide. The ability of STEC strains to cause disease is associated with the presence of wide range of identified and putative virulence factors including those encoding Shiga toxin. In this study, we examined the distribution of various virulence determinants among STEC strains isolated in Poland from different sources. A total of 71 Shiga toxin-producing E. coli strains isolated from human, cattle and food over the years 1996-2010 were characterized by microarray and PCR detection of virulence genes. As stx1a subtype was present in all of the tested Shiga toxin 1 producing E. coli strains, a greater diversity of subtypes was found in the gene stx2, which occurred in five subtypes: stx2a, stx2b, stx2c, stx2d, stx2g. Among STEC O157 strains we observed conserved core set of 14 virulence factors, stable in bacteria genome at long intervals of time. There was one cattle STEC isolate which possessed verotoxin gene as well as sta1 gene encoded heat-stable enterotoxin STIa characteristic for enterotoxigenic E. coli. To the best of our knowledge, this is the first comprehensive analysis of virulence gene profiles identified in STEC strains isolated from human, cattle and food in Poland. The results obtained using microarrays technology confirmed high effectiveness of this method in determining STEC virulotypes which provides data suitable for molecular risk assessment of the potential virulence of this bacteria. virulence factors including those encoding Shiga toxin. In this study, we examined the distribution of various virulence determinants among STEC strains isolated in Poland from different sources. A total of 71 Shiga toxin-producing E. coli strains isolated from human, cattle and food over the years 1996-2010 were characterized by microarray and PCR detection of virulence genes. As stx1a subtype was present in all of the tested Shiga toxin 1 producing E. coli strains, a greater diversity of subtypes was found in the gene stx2, which occurred in five subtypes: stx2a, stx2b, stx2c, stx2d, stx2g. Among STEC O157 strains we observed conserved core set of 14 virulence factors, stable in bacteria genome at long intervals of time. There was one cattle STEC isolate which possessed verotoxin gene as well as sta1 gene encoded heat-stable enterotoxin STIa characteristic for enterotoxigenic E. coli. To the best of our knowledge, this is the first comprehensive analysis of virulence gene profiles identified in STEC strains isolated from human, cattle and food in Poland. The results obtained using microarrays technology confirmed high effectiveness of this method in determining STEC virulotypes which provides data suitable for molecular risk assessment of the potential virulence of this bacteria.
Saliva Microbiota Carry Caries-Specific Functional Gene Signatures
Chang, Xingzhi; Yuan, Xiao; Tu, Qichao; Yuan, Tong; Deng, Ye; Hemme, Christopher L.; Van Nostrand, Joy; Cui, Xinping; He, Zhili; Chen, Zhenggang; Guo, Dawei; Yu, Jiangbo; Zhang, Yue; Zhou, Jizhong; Xu, Jian
2014-01-01
Human saliva microbiota is phylogenetically divergent among host individuals yet their roles in health and disease are poorly appreciated. We employed a microbial functional gene microarray, HuMiChip 1.0, to reconstruct the global functional profiles of human saliva microbiota from ten healthy and ten caries-active adults. Saliva microbiota in the pilot population featured a vast diversity of functional genes. No significant distinction in gene number or diversity indices was observed between healthy and caries-active microbiota. However, co-presence network analysis of functional genes revealed that caries-active microbiota was more divergent in non-core genes than healthy microbiota, despite both groups exhibited a similar degree of conservation at their respective core genes. Furthermore, functional gene structure of saliva microbiota could potentially distinguish caries-active patients from healthy hosts. Microbial functions such as Diaminopimelate epimerase, Prephenate dehydrogenase, Pyruvate-formate lyase and N-acetylmuramoyl-L-alanine amidase were significantly linked to caries. Therefore, saliva microbiota carried disease-associated functional signatures, which could be potentially exploited for caries diagnosis. PMID:24533043
Saliva microbiota carry caries-specific functional gene signatures.
Yang, Fang; Ning, Kang; Chang, Xingzhi; Yuan, Xiao; Tu, Qichao; Yuan, Tong; Deng, Ye; Hemme, Christopher L; Van Nostrand, Joy; Cui, Xinping; He, Zhili; Chen, Zhenggang; Guo, Dawei; Yu, Jiangbo; Zhang, Yue; Zhou, Jizhong; Xu, Jian
2014-01-01
Human saliva microbiota is phylogenetically divergent among host individuals yet their roles in health and disease are poorly appreciated. We employed a microbial functional gene microarray, HuMiChip 1.0, to reconstruct the global functional profiles of human saliva microbiota from ten healthy and ten caries-active adults. Saliva microbiota in the pilot population featured a vast diversity of functional genes. No significant distinction in gene number or diversity indices was observed between healthy and caries-active microbiota. However, co-presence network analysis of functional genes revealed that caries-active microbiota was more divergent in non-core genes than healthy microbiota, despite both groups exhibited a similar degree of conservation at their respective core genes. Furthermore, functional gene structure of saliva microbiota could potentially distinguish caries-active patients from healthy hosts. Microbial functions such as Diaminopimelate epimerase, Prephenate dehydrogenase, Pyruvate-formate lyase and N-acetylmuramoyl-L-alanine amidase were significantly linked to caries. Therefore, saliva microbiota carried disease-associated functional signatures, which could be potentially exploited for caries diagnosis.
DOE Office of Scientific and Technical Information (OSTI.GOV)
Bhardwaj, A.; Walker-Kopp, N; Casjens, S
2009-01-01
Bacteriophages of the Podoviridae family use short noncontractile tails to inject their genetic material into Gram-negative bacteria. In phage P22, the tail contains a thin needle, encoded by the phage gene 26, which is essential both for stabilization and for ejection of the packaged viral genome. Bioinformatic analysis of the N-terminal domain of gp26 (residues 1-60) led us to identify a family of genes encoding putative homologues of the tail needle gp26. To validate this idea experimentally and to explore their diversity, we cloned the gp26-like gene from phages HK620, Sf6 and HS1, and characterized these gene products in solution.more » All gp26-like factors contain an elongated {alpha}-helical coiled-coil core consisting of repeating, adjacent trimerization heptads and form trimeric fibers with length ranging between about 240 to 300 {angstrom}. gp26 tail needles display a high level of structural stability in solution, with Tm (temperature of melting) between 85 and 95 C. To determine how the structural stability of these phage fibers correlates with the length of the {alpha}-helical core, we investigated the effect of insertions and deletions in the helical core. In the P22 tail needle, we identified an 85-residue-long helical domain, termed MiCRU (minimal coiled-coil repeat unit), that can be inserted in-frame inside the gp26 helical core, preserving the straight morphology of the fiber. Likewise, we were able to remove three quarters of the helical core of the HS1 tail needle, minimally decreasing the stability of the fiber. We conclude that in the gp26 family of tail needles, structural stability increases nonlinearly with the length of the {alpha}-helical core. Thus, the overall stability of these bacteriophage fibers is not solely dependent on the number of trimerization repeats in the {alpha}-helical core.« less
HCV IRES-Mediated Core Expression in Zebrafish
Zhang, Jing-Pu; Hu, Zhan-Ying; Tong, Jun-Wei; Ding, Cun-Bao; Peng, Zong-Gen; Zhao, Li-Xun; Song, Dan-Qing; Jiang, Jian-Dong
2013-01-01
The lack of small animal models for hepatitis C virus has impeded the discovery and development of anti-HCV drugs. HCV-IRES plays an important role in HCV gene expression, and is an attractive target for antiviral therapy. In this study, we report a zebrafish model with a biscistron expression construct that can co-transcribe GFP and HCV-core genes by human hepatic lipase promoter and zebrafish liver fatty acid binding protein enhancer. HCV core translation was designed mediated by HCV-IRES sequence and gfp was by a canonical cap-dependent mechanism. Results of fluorescence image and in situ hybridization indicate that expression of HCV core and GFP is liver-specific; RT-PCR and Western blotting show that both core and gfp expression are elevated in a time-dependent manner for both transcription and translation. It means that the HCV-IRES exerted its role in this zebrafish model. Furthermore, the liver-pathological impact associated with HCV-infection was detected by examination of gene markers and some of them were elevated, such as adiponectin receptor, heparanase, TGF-β, PDGF-α, etc. The model was used to evaluate three clinical drugs, ribavirin, IFNα-2b and vitamin B12. The results show that vitamin B12 inhibited core expression in mRNA and protein levels in dose-dependent manner, but failed to impact gfp expression. Also VB12 down-regulated some gene transcriptions involved in fat liver, liver fibrosis and HCV-associated pathological process in the larvae. It reveals that HCV-IRES responds to vitamin B12 sensitively in the zebrafish model. Ribavirin did not disturb core expression, hinting that HCV-IRES is not a target site of ribavirin. IFNα-2b was not active, which maybe resulted from its degradation in vivo for the long time. These findings demonstrate the feasibility of the zebrafish model for screening of anti-HCV drugs targeting to HCV-IRES. The zebrafish system provides a novel evidence of using zebrafish as a HCV model organism. PMID:23469178
Pan- and core- network analysis of co-expression genes in a model plant
He, Fei; Maslov, Sergei
2016-12-16
Genome-wide gene expression experiments have been performed using the model plant Arabidopsis during the last decade. Some studies involved construction of coexpression networks, a popular technique used to identify groups of co-regulated genes, to infer unknown gene functions. One approach is to construct a single coexpression network by combining multiple expression datasets generated in different labs. We advocate a complementary approach in which we construct a large collection of 134 coexpression networks based on expression datasets reported in individual publications. To this end we reanalyzed public expression data. To describe this collection of networks we introduced concepts of ‘pan-network’ andmore » ‘core-network’ representing union and intersection between a sizeable fractions of individual networks, respectively. Here, we showed that these two types of networks are different both in terms of their topology and biological function of interacting genes. For example, the modules of the pan-network are enriched in regulatory and signaling functions, while the modules of the core-network tend to include components of large macromolecular complexes such as ribosomes and photosynthetic machinery. Our analysis is aimed to help the plant research community to better explore the information contained within the existing vast collection of gene expression data in Arabidopsis.« less
Pan- and core- network analysis of co-expression genes in a model plant
DOE Office of Scientific and Technical Information (OSTI.GOV)
He, Fei; Maslov, Sergei
Genome-wide gene expression experiments have been performed using the model plant Arabidopsis during the last decade. Some studies involved construction of coexpression networks, a popular technique used to identify groups of co-regulated genes, to infer unknown gene functions. One approach is to construct a single coexpression network by combining multiple expression datasets generated in different labs. We advocate a complementary approach in which we construct a large collection of 134 coexpression networks based on expression datasets reported in individual publications. To this end we reanalyzed public expression data. To describe this collection of networks we introduced concepts of ‘pan-network’ andmore » ‘core-network’ representing union and intersection between a sizeable fractions of individual networks, respectively. Here, we showed that these two types of networks are different both in terms of their topology and biological function of interacting genes. For example, the modules of the pan-network are enriched in regulatory and signaling functions, while the modules of the core-network tend to include components of large macromolecular complexes such as ribosomes and photosynthetic machinery. Our analysis is aimed to help the plant research community to better explore the information contained within the existing vast collection of gene expression data in Arabidopsis.« less
Mining functionally relevant gene sets for analyzing physiologically novel clinical expression data.
Turcan, Sevin; Vetter, Douglas E; Maron, Jill L; Wei, Xintao; Slonim, Donna K
2011-01-01
Gene set analyses have become a standard approach for increasing the sensitivity of transcriptomic studies. However, analytical methods incorporating gene sets require the availability of pre-defined gene sets relevant to the underlying physiology being studied. For novel physiological problems, relevant gene sets may be unavailable or existing gene set databases may bias the results towards only the best-studied of the relevant biological processes. We describe a successful attempt to mine novel functional gene sets for translational projects where the underlying physiology is not necessarily well characterized in existing annotation databases. We choose targeted training data from public expression data repositories and define new criteria for selecting biclusters to serve as candidate gene sets. Many of the discovered gene sets show little or no enrichment for informative Gene Ontology terms or other functional annotation. However, we observe that such gene sets show coherent differential expression in new clinical test data sets, even if derived from different species, tissues, and disease states. We demonstrate the efficacy of this method on a human metabolic data set, where we discover novel, uncharacterized gene sets that are diagnostic of diabetes, and on additional data sets related to neuronal processes and human development. Our results suggest that our approach may be an efficient way to generate a collection of gene sets relevant to the analysis of data for novel clinical applications where existing functional annotation is relatively incomplete.
NASA Astrophysics Data System (ADS)
Seto, K.
2015-12-01
Koji Seto (ReCCLE, Shimane Univ.), Hiroyuki Takata (Pusan Univ.), Kota Katsuki (KIGAM), Takeshi Sonoda (Tokyo Univ. of Agr.) In the coastal area of the Sea of Okhotsk in the east part of Hokkaido located to for subarctic zone, many brackish-water lakes are distributed. Lake Mokoto has two-layer structure of polyhaline surface waters and mixoeuhaline bottom water. The bottom water shows the anoxic conditions in summer season. In this reason, the sediments of Lake Mokoto consist of organic mud with the lamination. The 09Mk-1C and 09Mk-2C cores collected from Lake Mokoto at 2009. In the soft X-ray photograph, the cyclic lamina set is observed in their core. The cyclic lamina set consists of low-, intermedium- and high-density lamina. It is considered that this cyclic lamina set is the verve. According to the meteorological data in Abashiri region, the annually precipitation is high from August to September. Probably, the cyclic lamina set is formed by seasonal change of precipitation. In this study, we are discussed about the relationship between the high-density lamina and precipitation by sedimentologic and geochemical high-resolution analysis. The 09Mk-1C and 09Mk-2C cores collected from Lake Mokoto show the length of 1.78 to 3.87m, respectively. In 09Mk-2C core, Ta-a tephra (AD 1739) was observed at the 3.5m depths. The 09Mk-1C core consist of organic mud with the lamination in all cores. The core top 100 cm in this core shows the black (N1.5/0), and it seems to indicate the seasonal anoxic environment as present. The organic mud below 100cm depth shows black (10YR1.7/1). The sedimentation rate in 09Mk-1C core increase from late 1960's for the age of cyclic lamina set. It is suggest that supply of sediment in Lake Mokoto is increasing by land development in drainage basin. Phosphorus flux in 09Mk-1C core increase from late 1950's. The increasing of phosphorus flux may be caused by excess drainage of pollution from stock farm. In 2015, we were able to take the new core (15Mk-3C core). We have observed a new lamina set in detail, and compared with precipitation in Abashiri Region.
RUCS: rapid identification of PCR primers for unique core sequences.
Thomsen, Martin Christen Frølund; Hasman, Henrik; Westh, Henrik; Kaya, Hülya; Lund, Ole
2017-12-15
Designing PCR primers to target a specific selection of whole genome sequenced strains can be a long, arduous and sometimes impractical task. Such tasks would benefit greatly from an automated tool to both identify unique targets, and to validate the vast number of potential primer pairs for the targets in silico. Here we present RUCS, a program that will find PCR primer pairs and probes for the unique core sequences of a positive genome dataset complement to a negative genome dataset. The resulting primer pairs and probes are in addition to simple selection also validated through a complex in silico PCR simulation. We compared our method, which identifies the unique core sequences, against an existing tool called ssGeneFinder, and found that our method was 6.5-20 times more sensitive. We used RUCS to design primer pairs that would target a set of genomes known to contain the mcr-1 colistin resistance gene. Three of the predicted pairs were chosen for experimental validation using PCR and gel electrophoresis. All three pairs successfully produced an amplicon with the target length for the samples containing mcr-1 and no amplification products were produced for the negative samples. The novel methods presented in this manuscript can reduce the time needed to identify target sequences, and provide a quick virtual PCR validation to eliminate time wasted on ambiguously binding primers. Source code is freely available on https://bitbucket.org/genomicepidemiology/rucs. Web service is freely available on https://cge.cbs.dtu.dk/services/RUCS. mcft@cbs.dtu.dk. Supplementary data are available at Bioinformatics online. © The Author(s) 2017. Published by Oxford University Press.
Zhang, Dapeng; Xiong, Huiling; Mennigen, Jan A; Popesku, Jason T; Marlatt, Vicki L; Martyniuk, Christopher J; Crump, Kate; Cossins, Andrew R; Xia, Xuhua; Trudeau, Vance L
2009-06-05
Many vertebrates, including the goldfish, exhibit seasonal reproductive rhythms, which are a result of interactions between external environmental stimuli and internal endocrine systems in the hypothalamo-pituitary-gonadal axis. While it is long believed that differential expression of neuroendocrine genes contributes to establishing seasonal reproductive rhythms, no systems-level investigation has yet been conducted. In the present study, by analyzing multiple female goldfish brain microarray datasets, we have characterized global gene expression patterns for a seasonal cycle. A core set of genes (873 genes) in the hypothalamus were identified to be differentially expressed between May, August and December, which correspond to physiologically distinct stages that are sexually mature (prespawning), sexual regression, and early gonadal redevelopment, respectively. Expression changes of these genes are also shared by another brain region, the telencephalon, as revealed by multivariate analysis. More importantly, by examining one dataset obtained from fish in October who were kept under long-daylength photoperiod (16 h) typical of the springtime breeding season (May), we observed that the expression of identified genes appears regulated by photoperiod, a major factor controlling vertebrate reproductive cyclicity. Gene ontology analysis revealed that hormone genes and genes functionally involved in G-protein coupled receptor signaling pathway and transmission of nerve impulses are significantly enriched in an expression pattern, whose transition is located between prespawning and sexually regressed stages. The existence of seasonal expression patterns was verified for several genes including isotocin, ependymin II, GABA(A) gamma2 receptor, calmodulin, and aromatase b by independent samplings of goldfish brains from six seasonal time points and real-time PCR assays. Using both theoretical and experimental strategies, we report for the first time global gene expression patterns throughout a breeding season which may account for dynamic neuroendocrine regulation of seasonal reproductive development.
Mennigen, Jan A.; Popesku, Jason T.; Marlatt, Vicki L.; Martyniuk, Christopher J.; Crump, Kate; Cossins, Andrew R.; Xia, Xuhua; Trudeau, Vance L.
2009-01-01
Background Many vertebrates, including the goldfish, exhibit seasonal reproductive rhythms, which are a result of interactions between external environmental stimuli and internal endocrine systems in the hypothalamo-pituitary-gonadal axis. While it is long believed that differential expression of neuroendocrine genes contributes to establishing seasonal reproductive rhythms, no systems-level investigation has yet been conducted. Methodology/Principal Findings In the present study, by analyzing multiple female goldfish brain microarray datasets, we have characterized global gene expression patterns for a seasonal cycle. A core set of genes (873 genes) in the hypothalamus were identified to be differentially expressed between May, August and December, which correspond to physiologically distinct stages that are sexually mature (prespawning), sexual regression, and early gonadal redevelopment, respectively. Expression changes of these genes are also shared by another brain region, the telencephalon, as revealed by multivariate analysis. More importantly, by examining one dataset obtained from fish in October who were kept under long-daylength photoperiod (16 h) typical of the springtime breeding season (May), we observed that the expression of identified genes appears regulated by photoperiod, a major factor controlling vertebrate reproductive cyclicity. Gene ontology analysis revealed that hormone genes and genes functionally involved in G-protein coupled receptor signaling pathway and transmission of nerve impulses are significantly enriched in an expression pattern, whose transition is located between prespawning and sexually regressed stages. The existence of seasonal expression patterns was verified for several genes including isotocin, ependymin II, GABAA gamma2 receptor, calmodulin, and aromatase b by independent samplings of goldfish brains from six seasonal time points and real-time PCR assays. Conclusions/Significance Using both theoretical and experimental strategies, we report for the first time global gene expression patterns throughout a breeding season which may account for dynamic neuroendocrine regulation of seasonal reproductive development. PMID:19503831
DOE Office of Scientific and Technical Information (OSTI.GOV)
Kwon, Deug-Nam; Park, Mi-Ryung; Park, Jong-Yi
Highlights: {yields} The sequences of -604 to -84 bp of the pUPII promoter contained the region of a putative negative cis-regulatory element. {yields} The core promoter was located in the 5F-1. {yields} Transcription factor HNF4 can directly bind in the pUPII core promoter region, which plays a critical role in controlling promoter activity. {yields} These features of the pUPII promoter are fundamental to development of a target-specific vector. -- Abstract: Uroplakin II (UPII) is a one of the integral membrane proteins synthesized as a major differentiation product of mammalian urothelium. UPII gene expression is bladder specific and differentiation dependent, butmore » little is known about its transcription response elements and molecular mechanism. To identify the cis-regulatory elements in the pig UPII (pUPII) gene promoter region, we constructed pUPII 5' upstream region deletion mutants and demonstrated that each of the deletion mutants participates in controlling the expression of the pUPII gene in human bladder carcinoma RT4 cells. We also identified a new core promoter region and putative negative cis-regulatory element within a minimal promoter region. In addition, we showed that hepatocyte nuclear factor 4 (HNF4) can directly bind in the pUPII core promoter (5F-1) region, which plays a critical role in controlling promoter activity. Transient cotransfection experiments showed that HNF4 positively regulates pUPII gene promoter activity. Thus, the binding element and its binding protein, HNF4 transcription factor, may be involved in the mechanism that specifically regulates pUPII gene transcription.« less
2009-01-01
Background Bacterial genomes are mosaic structures composed of genes present in every strain of the same species (core genome), and genes present in some but not all strains of a species (accessory genome). The aim of this study was to compare the genetic diversity of core and accessory genes of a Salmonella enterica subspecies enterica serovar Typhimurium (Typhimurium) population isolated from food-animal and human sources in four regions of Mexico. Multilocus sequence typing (MLST) and macrorestriction fingerprints by pulsed-field gel electrophoresis (PFGE) were used to address the core genetic variation, and genes involved in pathogenesis and antibiotic resistance were selected to evaluate the accessory genome. Results We found a low genetic diversity for both housekeeping and accessory genes. Sequence type 19 (ST19) was supported as the founder genotype of STs 213, 302 and 429. We found a temporal pattern in which the derived ST213 is replacing the founder ST19 in the four geographic regions analyzed and a geographic trend in the number of resistance determinants. The distribution of the accessory genes was not random among chromosomal genotypes. We detected strong associations among the different accessory genes and the multilocus chromosomal genotypes (STs). First, the Salmonella virulence plasmid (pSTV) was found mostly in ST19 isolates. Second, the plasmid-borne betalactamase cmy-2 was found only in ST213 isolates. Third, the most abundant integron, IP-1 (dfrA12, orfF and aadA2), was found only in ST213 isolates. Fourth, the Salmonella genomic island (SGI1) was found mainly in a subgroup of ST19 isolates carrying pSTV. The mapping of accessory genes and multilocus genotypes on the dendrogram derived from macrorestiction fingerprints allowed the establishment of genetic subgroups within the population. Conclusion Despite the low levels of genetic diversity of core and accessory genes, the non-random distribution of the accessory genes across chromosomal backgrounds allowed us to discover genetic subgroups within the population. This study provides information about the importance of the accessory genome in generating genetic variability within a bacterial population. PMID:19573249
Waters, Aoife Mi; Tudur Smith, Catrin; Young, Bridget; Jones, Terry M
2014-05-13
The incidence of oropharyngeal cancer is increasing in the developed world. This has led to a large rise in research activity and clinical trials in this area, yet there is no consensus on which outcomes should be measured. As a result, the outcomes measured often differ between trials of comparable interventions, making the combination or comparison of results between trials impossible. Outcomes may also be 'cherry-picked', such that favourable results are reported, and less favourable results withheld. The development of a minimum outcome reporting standard, known as a core outcome set, goes some way to addressing these problems. Core outcome sets are ideally developed using a patient-centred approach so that the outcomes measured are relevant to patients and clinical practice. Core outcome sets drive up the quality and relevance of research by ensuring that the right outcomes are consistently measured and reported in trials in specific areas of health or healthcare. This is a mixed methods study involving three phases to develop a core outcome set for oropharyngeal cancer clinical trials. Firstly, a systematic review will establish which outcomes are measured in published oropharyngeal cancer randomised controlled trials (RCTs). Secondly, qualitative interviews with patients and carers in the UK and the USA will aim to establish which outcomes are important to these stakeholders. Data from these first two stages will be used to develop a comprehensive list of outcomes to be considered for inclusion in the core outcome set. In the third stage, patients and clinicians will participate in an iterative consensus exercise known as a Delphi study to refine the contents of the core outcome set. This protocol lays out the methodology to be implemented in the CONSENSUS study. A core outcome set defines a minimum outcome reporting standard for clinical trials in a particular area of health or healthcare. Its consistent implementation in oropharyngeal cancer clinical trials will improve the quality and relevance of research. This study is registered at the National Institute for Health Research (NIHR) Clinical Research Network (CRN) portfolio, ID 13823 (17 January 2013).
2014-01-01
Background The incidence of oropharyngeal cancer is increasing in the developed world. This has led to a large rise in research activity and clinical trials in this area, yet there is no consensus on which outcomes should be measured. As a result, the outcomes measured often differ between trials of comparable interventions, making the combination or comparison of results between trials impossible. Outcomes may also be ‘cherry-picked’, such that favourable results are reported, and less favourable results withheld. The development of a minimum outcome reporting standard, known as a core outcome set, goes some way to addressing these problems. Core outcome sets are ideally developed using a patient-centred approach so that the outcomes measured are relevant to patients and clinical practice. Core outcome sets drive up the quality and relevance of research by ensuring that the right outcomes are consistently measured and reported in trials in specific areas of health or healthcare. Methods/Design This is a mixed methods study involving three phases to develop a core outcome set for oropharyngeal cancer clinical trials. Firstly, a systematic review will establish which outcomes are measured in published oropharyngeal cancer randomised controlled trials (RCTs). Secondly, qualitative interviews with patients and carers in the UK and the USA will aim to establish which outcomes are important to these stakeholders. Data from these first two stages will be used to develop a comprehensive list of outcomes to be considered for inclusion in the core outcome set. In the third stage, patients and clinicians will participate in an iterative consensus exercise known as a Delphi study to refine the contents of the core outcome set. This protocol lays out the methodology to be implemented in the CONSENSUS study. Discussion A core outcome set defines a minimum outcome reporting standard for clinical trials in a particular area of health or healthcare. Its consistent implementation in oropharyngeal cancer clinical trials will improve the quality and relevance of research. Trials and registration This study is registered at the National Institute for Health Research (NIHR) Clinical Research Network (CRN) portfolio, ID 13823 (17 January 2013). PMID:24885068
Wang, Zhe; Shen, Yan
2017-03-01
The fast growing evidences have indicated that the natural product osthole is a promising drug candidate for fighting several serious human diseases, for example, cancer and inflammation. However, the mode-of-action (MoA) of osthole remains largely incomplete. In this study, we investigated the growth inhibition activity of osthole using fission yeast as a model, with the goal of understanding the osthole's mechanism of action, especially from the molecular level. Microarray analysis indicated that osthole has significant impacts on gene transcription levels (In total, 214 genes are up-regulated, and 97 genes are down-regulated). Gene set enrichment analysis (GSEA) indicated that 11 genes belong to the "Respiration module" category, especially including the components of complex III and V of mitochondrial respiration chain. Based on GSEA and network analysis, we also found that 54 up-regulated genes belong to the "Core Environmental Stress Responses" category, particularly including many transporter genes, which suggests that the rapidly activated nutrient exchange between cell and environment is part of the MoA of osthole. In summary, osthole can greatly impact on fission yeast transcriptome, and it primarily represses the expression levels of the genes in respiration chain, which next causes the inefficiency of ATP production and thus largely explains osthole's growth inhibition activity in Schizosaccharomyces pombe (S. pombe). The complexity of the osthole's MoA shown in previous studies and our current research demonstrates that the omics approach and bioinformatics tools should be applied together to acquire the complete landscape of osthole's growth inhibition activity.
The pangenome of hexaploid bread wheat.
Montenegro, Juan D; Golicz, Agnieszka A; Bayer, Philipp E; Hurgobin, Bhavna; Lee, HueyTyng; Chan, Chon-Kit Kenneth; Visendi, Paul; Lai, Kaitao; Doležel, Jaroslav; Batley, Jacqueline; Edwards, David
2017-06-01
There is an increasing understanding that variation in gene presence-absence plays an important role in the heritability of agronomic traits; however, there have been relatively few studies on variation in gene presence-absence in crop species. Hexaploid wheat is one of the most important food crops in the world and intensive breeding has reduced the genetic diversity of elite cultivars. Major efforts have produced draft genome assemblies for the cultivar Chinese Spring, but it is unknown how well this represents the genome diversity found in current modern elite cultivars. In this study we build an improved reference for Chinese Spring and explore gene diversity across 18 wheat cultivars. We predict a pangenome size of 140 500 ± 102 genes, a core genome of 81 070 ± 1631 genes and an average of 128 656 genes in each cultivar. Functional annotation of the variable gene set suggests that it is enriched for genes that may be associated with important agronomic traits. In addition to variation in gene presence, more than 36 million intervarietal single nucleotide polymorphisms were identified across the pangenome. This study of the wheat pangenome provides insight into genome diversity in elite wheat as a basis for genomics-based improvement of this important crop. A wheat pangenome, GBrowse, is available at http://appliedbioinformatics.com.au/cgi-bin/gb2/gbrowse/WheatPan/, and data are available to download from http://wheatgenome.info/wheat_genome_databases.php. © 2017 The Authors The Plant Journal © 2017 John Wiley & Sons Ltd.
Evaluating the consistency of gene sets used in the analysis of bacterial gene expression data.
Tintle, Nathan L; Sitarik, Alexandra; Boerema, Benjamin; Young, Kylie; Best, Aaron A; Dejongh, Matthew
2012-08-08
Statistical analyses of whole genome expression data require functional information about genes in order to yield meaningful biological conclusions. The Gene Ontology (GO) and Kyoto Encyclopedia of Genes and Genomes (KEGG) are common sources of functionally grouped gene sets. For bacteria, the SEED and MicrobesOnline provide alternative, complementary sources of gene sets. To date, no comprehensive evaluation of the data obtained from these resources has been performed. We define a series of gene set consistency metrics directly related to the most common classes of statistical analyses for gene expression data, and then perform a comprehensive analysis of 3581 Affymetrix® gene expression arrays across 17 diverse bacteria. We find that gene sets obtained from GO and KEGG demonstrate lower consistency than those obtained from the SEED and MicrobesOnline, regardless of gene set size. Despite the widespread use of GO and KEGG gene sets in bacterial gene expression data analysis, the SEED and MicrobesOnline provide more consistent sets for a wide variety of statistical analyses. Increased use of the SEED and MicrobesOnline gene sets in the analysis of bacterial gene expression data may improve statistical power and utility of expression data.
Khalil, Asma; Perry, Helen; Duffy, James; Reed, Keith; Baschat, Ahmet; Deprest, Jan; Hecher, Kurt; Lewi, Liesbeth; Lopriore, Enrico; Oepkes, Dick
2017-07-14
Twin-Twin Transfusion Syndrome (TTTS) is associated with an increased risk of perinatal mortality and morbidity. Several treatment interventions have been described for TTTS, including fetoscopic laser surgery, amnioreduction, septostomy, expectant management, and pregnancy termination. Over the last decade, fetoscopic laser surgery has become the primary treatment. The literature to date reports on many different outcomes, making it difficult to compare results or combine data from individual studies, limiting the value of research to guide clinical practice. With the advent and ongoing development of new therapeutic techniques, this is more important than ever. The development and use of a core outcome set has been proposed to address these issues, prioritising outcomes important to the key stakeholders, including patients. We aim to produce, disseminate, and implement a core outcome set for TTTS. An international steering group has been established to oversee the development of this core outcome set. This group includes healthcare professionals, researchers and patients. A systematic review is planned to identify previously reported outcomes following treatment for TTTS. Following completion, the identified outcomes will be evaluated by stakeholders using an international, multi-perspective online modified Delphi method to build consensus on core outcomes. This method encourages the participants towards consensus 'core' outcomes. All key stakeholders will be invited to participate. The steering group will then hold a consensus meeting to discuss results and form a core outcome set to be introduced and measured. Once core outcomes have been agreed, the next step will be to determine how they should be measured, disseminated, and implemented within an international context. The development, dissemination, and implementation of a core outcome set in TTTS will enable its use in future clinical trials, systematic reviews and clinical practice guidelines. This is likely to advance the quality of research studies and their effective use in order to guide clinical practice and improve patient care, maternal, short-term perinatal outcomes and long-term neurodevelopmental outcomes. Core Outcome Measures in Effectiveness Trials (COMET), 921 Registered on July 2016. International Prospective Register of Systematic Reviews (PROSPERO), CRD42016043999 . Registered on 2 August 2016.
p53 targets chromatin structure alteration to repress alpha-fetoprotein gene expression.
Ogden, S K; Lee, K C; Wernke-Dollries, K; Stratton, S A; Aronow, B; Barton, M C
2001-11-09
Many of the functions ascribed to p53 tumor suppressor protein are mediated through transcription regulation. We have shown that p53 represses hepatic-specific alpha-fetoprotein (AFP) gene expression by direct interaction with a composite HNF-3/p53 DNA binding element. Using solid-phase, chromatin-assembled AFP DNA templates and analysis of chromatin structure and transcription in vitro, we find that p53 binds DNA and alters chromatin structure at the AFP core promoter to regulate transcription. Chromatin assembled in the presence of hepatoma extracts is activated for AFP transcription with an open, accessible core promoter structure. Distal (-850) binding of p53 during chromatin assembly, but not post-assembly, reverses transcription activation concomitant with promoter inaccessibility to restriction enzyme digestion. Inhibition of histone deacetylase activity by trichostatin-A (TSA) addition, prior to and during chromatin assembly, activated chromatin transcription in parallel with increased core promoter accessibility. Chromatin immunoprecipitation analyses showed increased H3 and H4 acetylated histones at the core promoter in the presence of TSA, while histone acetylation remained unchanged at the site of distal p53 binding. Our data reveal that p53 targets chromatin structure alteration at the core promoter, independently of effects on histone acetylation, to establish repressed AFP gene expression.
Seok, Junhee; Davis, Ronald W; Xiao, Wenzhong
2015-01-01
Accumulated biological knowledge is often encoded as gene sets, collections of genes associated with similar biological functions or pathways. The use of gene sets in the analyses of high-throughput gene expression data has been intensively studied and applied in clinical research. However, the main interest remains in finding modules of biological knowledge, or corresponding gene sets, significantly associated with disease conditions. Risk prediction from censored survival times using gene sets hasn't been well studied. In this work, we propose a hybrid method that uses both single gene and gene set information together to predict patient survival risks from gene expression profiles. In the proposed method, gene sets provide context-level information that is poorly reflected by single genes. Complementarily, single genes help to supplement incomplete information of gene sets due to our imperfect biomedical knowledge. Through the tests over multiple data sets of cancer and trauma injury, the proposed method showed robust and improved performance compared with the conventional approaches with only single genes or gene sets solely. Additionally, we examined the prediction result in the trauma injury data, and showed that the modules of biological knowledge used in the prediction by the proposed method were highly interpretable in biology. A wide range of survival prediction problems in clinical genomics is expected to benefit from the use of biological knowledge.
Seok, Junhee; Davis, Ronald W.; Xiao, Wenzhong
2015-01-01
Accumulated biological knowledge is often encoded as gene sets, collections of genes associated with similar biological functions or pathways. The use of gene sets in the analyses of high-throughput gene expression data has been intensively studied and applied in clinical research. However, the main interest remains in finding modules of biological knowledge, or corresponding gene sets, significantly associated with disease conditions. Risk prediction from censored survival times using gene sets hasn’t been well studied. In this work, we propose a hybrid method that uses both single gene and gene set information together to predict patient survival risks from gene expression profiles. In the proposed method, gene sets provide context-level information that is poorly reflected by single genes. Complementarily, single genes help to supplement incomplete information of gene sets due to our imperfect biomedical knowledge. Through the tests over multiple data sets of cancer and trauma injury, the proposed method showed robust and improved performance compared with the conventional approaches with only single genes or gene sets solely. Additionally, we examined the prediction result in the trauma injury data, and showed that the modules of biological knowledge used in the prediction by the proposed method were highly interpretable in biology. A wide range of survival prediction problems in clinical genomics is expected to benefit from the use of biological knowledge. PMID:25933378
Core information set for oesophageal cancer surgery.
Blazeby, J M; Macefield, R; Blencowe, N S; Jacobs, M; McNair, A G K; Sprangers, M; Brookes, S T
2015-07-01
Surgeons provide patients with information before surgery, although standards of information are lacking and practice varies. The development and use of a 'core information set' as baseline information before surgery may improve understanding. A core set is a minimum set of information to use in all consultations before a specific procedure. This study developed a core information set for oesophageal cancer surgery. Information was identified from the literature, observations of clinical consultations and patient interviews. This was integrated to create a questionnaire survey. Stakeholders (patients and professionals) were surveyed twice to assess views on importance of information from 'not essential' to 'absolutely essential' using Delphi methods. Items not meeting predefined criteria were discarded after each survey and the final retained items were voted on, in separate patient and professional stakeholder meetings, to agree the core set. Some 67 information items were identified initially from multiple sources. Survey response rates were 76·5 per cent (185 of 242) and 54·8 per cent (126 of 230) for patients and professionals respectively (first round), and over 83 per cent in both groups thereafter. Health professionals rated short-term clinical outcomes most highly (technical complications), whereas patients prioritized information related to long-term benefits. The consensus meetings agreed the final set, which consisted of: in-hospital milestones to recovery, rates of open-and-close surgery, in-hospital mortality, major complications (reoperation), milestones in recovery after discharge, longer-term eating and drinking and overall quality of life, and chances of survival. This study has established a core information set for surgery for oesophageal cancer. © 2015 BJS Society Ltd Published by John Wiley & Sons Ltd.
Shen, Wenfeng; Wei, Daming; Xu, Weimin; Zhu, Xin; Yuan, Shizhong
2010-10-01
Biological computations like electrocardiological modelling and simulation usually require high-performance computing environments. This paper introduces an implementation of parallel computation for computer simulation of electrocardiograms (ECGs) in a personal computer environment with an Intel CPU of Core (TM) 2 Quad Q6600 and a GPU of Geforce 8800GT, with software support by OpenMP and CUDA. It was tested in three parallelization device setups: (a) a four-core CPU without a general-purpose GPU, (b) a general-purpose GPU plus 1 core of CPU, and (c) a four-core CPU plus a general-purpose GPU. To effectively take advantage of a multi-core CPU and a general-purpose GPU, an algorithm based on load-prediction dynamic scheduling was developed and applied to setting (c). In the simulation with 1600 time steps, the speedup of the parallel computation as compared to the serial computation was 3.9 in setting (a), 16.8 in setting (b), and 20.0 in setting (c). This study demonstrates that a current PC with a multi-core CPU and a general-purpose GPU provides a good environment for parallel computations in biological modelling and simulation studies. Copyright 2010 Elsevier Ireland Ltd. All rights reserved.
Yang, Liyan; Cui, Guimei; Wang, Yixue; Hao, Yaoshan; Du, Jianzhong; Zhang, Hongmei; Wang, Changbiao; Zhang, Huanhuan; Wu, Shu-Biao; Sun, Yi
2017-01-01
Plant genetic transformation has arguably been the core of plant improvement in recent decades. Efforts have been made to develop in planta transformation systems due to the limitations present in the tissue-culture-based methods. Herein, we report an improved in planta transformation system, and provide the evidence of reporter gene expression in pollen tube, embryos and stable transgenicity of the plants following pollen-mediated plant transformation with optimized sonication treatment of pollen. The results showed that the aeration at 4°C treatment of pollen grains in sucrose prior to sonication significantly improved the pollen viability leading to improved kernel set and transformation efficiency. Scanning electron microscopy observation revealed that the removal of operculum covering pollen pore by ultrasonication might be one of the reasons for the pollen grains to become competent for transformation. Evidences have shown that the eGfp gene was expressed in the pollen tube and embryos, and the Cry1Ac gene was detected in the subsequent T 1 and T 2 progenies, suggesting the successful transfer of the foreign genes to the recipient plants. The Southern blot analysis of Cry1Ac gene in T 2 progenies and PCR-identified Apr gene segregation in T 2 seedlings confirmed the stable inheritance of the transgene. The outcome illustrated that the pollen-mediated genetic transformation system can be widely applied in the plant improvement programs with apparent advantages over tissue-culture-based transformation methods.
Predicting the reproduction strategies of several microalgae through their genome sequences
NASA Astrophysics Data System (ADS)
Guo, Li; Yang, Guanpin
2014-10-01
Documenting the sex and sexual reproduction of the microalgae is very difficult, as most of the results are based on the microscopic observation that can be heavily influenced by genetic, physiological and environmental conditions. Understanding the reproduction strategy of some microalgae is required to breed them in large scale culture industry. Instead of direct observation of sex and sexual reproduction under microscope, the whole set or the majority of core meiosis genes may evidence the sex and sexual reproduction in the unicellular algae, as the meiosis is necessary for maintaining the genomic stability and the advantages of genetic recombination. So far, the available genome sequences and bioinformatic tools (in this study, homolog searching and phylogenetic analysis) allow us to propose that at least 20 core meiosis genes (among them ≥6 must be meiosis specific) are enough for an alga to maintain its sexual reproduction. According to this assumption and the genome sequences, it is possible that sexual reproduction was carried out by Micromonas pusilla and Cyanidioschyzon merolae, while asexual reproduction was adopted by Bigelowiella natans, Guillardia theta, Nannochloropsis gaditana, N. oceanica, Chlorella variablis, Phaeodactylum tricornutum and Thalassiosira pseudonana. This understanding will facilitate the breeding trials of some economic microalgae (e.g., N. gaditana, N. oceanica, C. variablis and P. tricornutum). However, the reproduction strategies of these microalgae need to be proved by further biological experiments.
López-Pérez, Mario; Gonzaga, Aitor; Rodriguez-Valera, Francisco
2013-01-01
We have compared genomes of Alteromonas macleodii “deep ecotype” isolates from two deep Mediterranean sites and two surface samples from the Aegean and the English Channel. A total of nine different genomes were analyzed. They belong to five clonal frames (CFs) that differ among them by approximately 30,000 single-nucleotide polymorphisms (SNPs) over their core genomes. Two of the CFs contain three strains each with nearly identical genomes (∼100 SNPs over the core genome). One of the CFs had representatives that were isolated from samples taken more than 1,000 km away, 2,500 m deeper, and 5 years apart. These data mark the longest proven persistence of a CF in nature (outside of clinical settings). We have found evidence for frequent recombination events between or within CFs and even with the distantly related A. macleodii surface ecotype. The different CFs had different flexible genomic islands. They can be classified into two groups; one type is additive, that is, containing different numbers of gene cassettes, and is very variable in short time periods (they often varied even within a single CF). The other type was more stable and produced the complete replacement of a genomic fragment by another with different genes. Although this type was more conserved within each CF, we found examples of recombination among distantly related CFs including English Channel and Mediterranean isolates. PMID:23729633
Gyrokinetic simulation of residual turbulence in transport barriers
NASA Astrophysics Data System (ADS)
Jenko, Frank; Told, Daniel; Goerler, Tobias; Brunner, Stephan; Sautter, Olivier
2011-10-01
One of the ultimate aims for gyrokinetic simulation is to describe the formation and evolution of transport barriers. An important step in that direction is the study of the residual turbulence in established barriers - a challenging task in itself, given that a wide range of spatio-temporal scales can be involved. In the present work, we employ the physically comprehensive, nonlocal gyrokinetic turbulence code GENE to study turbulence in both core and edge transport barriers. First, we apply GENE to a set of discharges in the TCV tokamak which exhibit electron ITBs. Nonlinear gyrokinetic simulations are used to examine the influence of a varying current profile on the strength of the barrier. For each case, the transport spectra reveal how much transport (for each channel) is done in the low-k, medium-k, and high-k regimes, respectively. The role of ETG turbulence is discussed. Second, we explore the role of ETG turbulence in a typical ASDEX Upgrade H-mode discharge. Numerical convergence is carefully examined, and new insights on the characteristics of ETG turbulence in the edge will be discussed, focusing particularly on the role of streamers, which had been found to be a necessary ingredient for experimentally relevant ETG transport in core plasmas. The radial dependence of the resulting electron heat diffusivity is also examined and a simple ETG model is presented which can be used in future edge modeling efforts.
Predicting the reproduction strategies of several microalgae through their genome sequences
NASA Astrophysics Data System (ADS)
Guo, Li; Yang, Guanpin
2015-06-01
Documenting the sex and sexual reproduction of the microalgae is very difficult, as most of the results are based on the microscopic observation that can be heavily influenced by genetic, physiological and environmental conditions. Understanding the reproduction strategy of some microalgae is required to breed them in large scale culture industry. Instead of direct observation of sex and sexual reproduction under microscope, the whole set or the majority of core meiosis genes may evidence the sex and sexual reproduction in the unicellular algae, as the meiosis is necessary for maintaining the genomic stability and the advantages of genetic recombination. So far, the available genome sequences and bioinformatic tools (in this study, homolog searching and phylogenetic analysis) allow us to propose that at least 20 core meiosis genes (among them ≥6 must be meiosis specific) are enough for an alga to maintain its sexual reproduction. According to this assumption and the genome sequences, it is possible that sexual reproduction was carried out by Micromonas pusilla and Cyanidioschyzon merolae, while asexual reproduction was adopted by Bigelowiella natans, Guillardia theta, Nannochloropsis gaditana, N. oceanica, Chlorella variablis, Phaeodactylum tricornutum and Thalassiosira pseudonana. This understanding will facilitate the breeding trials of some economic microalgae ( e.g., N. gaditana, N. oceanica, C. variablis and P. tricornutum). However, the reproduction strategies of these microalgae need to be proved by further biological experiments.
Gene set analysis of purine and pyrimidine antimetabolites cancer therapies.
Fridley, Brooke L; Batzler, Anthony; Li, Liang; Li, Fang; Matimba, Alice; Jenkins, Gregory D; Ji, Yuan; Wang, Liewei; Weinshilboum, Richard M
2011-11-01
Responses to therapies, either with regard to toxicities or efficacy, are expected to involve complex relationships of gene products within the same molecular pathway or functional gene set. Therefore, pathways or gene sets, as opposed to single genes, may better reflect the true underlying biology and may be more appropriate units for analysis of pharmacogenomic studies. Application of such methods to pharmacogenomic studies may enable the detection of more subtle effects of multiple genes in the same pathway that may be missed by assessing each gene individually. A gene set analysis of 3821 gene sets is presented assessing the association between basal messenger RNA expression and drug cytotoxicity using ethnically defined human lymphoblastoid cell lines for two classes of drugs: pyrimidines [gemcitabine (dFdC) and arabinoside] and purines [6-thioguanine and 6-mercaptopurine]. The gene set nucleoside-diphosphatase activity was found to be significantly associated with both dFdC and arabinoside, whereas gene set γ-aminobutyric acid catabolic process was associated with dFdC and 6-thioguanine. These gene sets were significantly associated with the phenotype even after adjusting for multiple testing. In addition, five associated gene sets were found in common between the pyrimidines and two gene sets for the purines (3',5'-cyclic-AMP phosphodiesterase activity and γ-aminobutyric acid catabolic process) with a P value of less than 0.0001. Functional validation was attempted with four genes each in gene sets for thiopurine and pyrimidine antimetabolites. All four genes selected from the pyrimidine gene sets (PSME3, CANT1, ENTPD6, ADRM1) were validated, but only one (PDE4D) was validated for the thiopurine gene sets. In summary, results from the gene set analysis of pyrimidine and purine therapies, used often in the treatment of various cancers, provide novel insight into the relationship between genomic variation and drug response.
MAGMA: Generalized Gene-Set Analysis of GWAS Data
de Leeuw, Christiaan A.; Mooij, Joris M.; Heskes, Tom; Posthuma, Danielle
2015-01-01
By aggregating data for complex traits in a biologically meaningful way, gene and gene-set analysis constitute a valuable addition to single-marker analysis. However, although various methods for gene and gene-set analysis currently exist, they generally suffer from a number of issues. Statistical power for most methods is strongly affected by linkage disequilibrium between markers, multi-marker associations are often hard to detect, and the reliance on permutation to compute p-values tends to make the analysis computationally very expensive. To address these issues we have developed MAGMA, a novel tool for gene and gene-set analysis. The gene analysis is based on a multiple regression model, to provide better statistical performance. The gene-set analysis is built as a separate layer around the gene analysis for additional flexibility. This gene-set analysis also uses a regression structure to allow generalization to analysis of continuous properties of genes and simultaneous analysis of multiple gene sets and other gene properties. Simulations and an analysis of Crohn’s Disease data are used to evaluate the performance of MAGMA and to compare it to a number of other gene and gene-set analysis tools. The results show that MAGMA has significantly more power than other tools for both the gene and the gene-set analysis, identifying more genes and gene sets associated with Crohn’s Disease while maintaining a correct type 1 error rate. Moreover, the MAGMA analysis of the Crohn’s Disease data was found to be considerably faster as well. PMID:25885710
MAGMA: generalized gene-set analysis of GWAS data.
de Leeuw, Christiaan A; Mooij, Joris M; Heskes, Tom; Posthuma, Danielle
2015-04-01
By aggregating data for complex traits in a biologically meaningful way, gene and gene-set analysis constitute a valuable addition to single-marker analysis. However, although various methods for gene and gene-set analysis currently exist, they generally suffer from a number of issues. Statistical power for most methods is strongly affected by linkage disequilibrium between markers, multi-marker associations are often hard to detect, and the reliance on permutation to compute p-values tends to make the analysis computationally very expensive. To address these issues we have developed MAGMA, a novel tool for gene and gene-set analysis. The gene analysis is based on a multiple regression model, to provide better statistical performance. The gene-set analysis is built as a separate layer around the gene analysis for additional flexibility. This gene-set analysis also uses a regression structure to allow generalization to analysis of continuous properties of genes and simultaneous analysis of multiple gene sets and other gene properties. Simulations and an analysis of Crohn's Disease data are used to evaluate the performance of MAGMA and to compare it to a number of other gene and gene-set analysis tools. The results show that MAGMA has significantly more power than other tools for both the gene and the gene-set analysis, identifying more genes and gene sets associated with Crohn's Disease while maintaining a correct type 1 error rate. Moreover, the MAGMA analysis of the Crohn's Disease data was found to be considerably faster as well.
Kottner, J; Jacobi, L; Hahnel, E; Alam, M; Balzer, K; Beeckman, D; Busard, C; Chalmers, J; Deckert, S; Eleftheriadou, V; Furlan, K; Horbach, S E R; Kirkham, J; Nast, A; Spuls, P; Thiboutot, D; Thorlacius, L; Weller, K; Williams, H C; Schmitt, J
2018-04-01
Results of clinical trials are the most important information source for generating external clinical evidence. The use of different outcomes across trials, which investigate similar interventions for similar patient groups, significantly limits the interpretation, comparability and clinical application of trial results. Core outcome sets (COSs) aim to overcome this limitation. A COS is an agreed standardized collection of outcomes that should be measured and reported in all clinical trials for a specific clinical condition. The Core Outcome Set Initiative within the Cochrane Skin Group (CSG-COUSIN) supports the development of core outcomes in dermatology. In the second CSG-COUSIN meeting held in 2017, 11 COS development groups working on skin diseases presented their current work. The presentations and discussions identified the following overarching methodological challenges for COS development in dermatology: it is not always easy to define the disease focus of a COS; the optimal method for outcome domain identification and level of detail needed to specify such domains is challenging to many; decision rules within Delphi surveys need to be improved; appropriate ways of patient involvement are not always clear. In addition, there appear to be outcome domains that may be relevant as potential core outcome domains for the majority of skin diseases. The close collaboration between methodologists in the Core Outcome Set Initiative and the international Cochrane Skin Group has major advantages for trialists, systematic reviewers and COS developers. © 2018 British Association of Dermatologists.
Eikrem, Oystein S; Strauss, Philipp; Beisland, Christian; Scherer, Andreas; Landolt, Lea; Flatberg, Arnar; Leh, Sabine; Beisvag, Vidar; Skogstrand, Trude; Hjelle, Karin; Shresta, Anjana; Marti, Hans-Peter
2016-12-01
A previous study by this group demonstrated the feasibility of RNA sequencing (RNAseq) technology for capturing disease biology of clear cell renal cell carcinoma (ccRCC), and presented initial results for carbonic anhydrase-9 (CA9) and tumor necrosis factor-α-induced protein-6 (TNFAIP6) as possible biomarkers of ccRCC (discovery set) [Eikrem et al. PLoS One 2016;11:e0149743]. To confirm these results, the previous study is expanded, and RNAseq data from additional matched ccRCC and normal renal biopsies are analyzed (confirmation set). Two core biopsies from patients (n = 12) undergoing partial or full nephrectomy were obtained with a 16 g needle. RNA sequencing libraries were generated with the Illumina TruSeq ® Access library preparation protocol. Comparative analysis was done using linear modeling (voom/Limma; R Bioconductor). The formalin-fixed and paraffin-embedded discovery and confirmation data yielded 8957 and 11,047 detected transcripts, respectively. The two data sets shared 1193 of differentially expressed genes with each other. The average expression and the log 2 -fold changes of differentially expressed transcripts in both data sets correlated, with R² = .95 and R² = .94, respectively. Among transcripts with the highest fold changes were CA9, neuronal pentraxin-2 and uromodulin. Epithelial-mesenchymal transition was highlighted by differential expression of, for example, transforming growth factor-β 1 and delta-like ligand-4. The diagnostic accuracy of CA9 was 100% and 93.9% when using the discovery set as the training set and the confirmation data as the test set, and vice versa, respectively. These data further support TNFAIP6 as a novel biomarker of ccRCC. TNFAIP6 had combined accuracy of 98.5% in the two data sets. This study provides confirmatory data on the potential use of CA9 and TNFAIP6 as biomarkers of ccRCC. Thus, next-generation sequencing expands the clinical application of tissue analyses.
78 FR 71617 - Agency Information Collection Activities: Proposed Collection; Comment Request
Federal Register 2010, 2011, 2012, 2013, 2014
2013-11-29
... agencies that have prescription drug programs are required to perform prospective and retrospective drug... study to validate the core competency set among the workforce; (2) establishing the core competency set...
Schmitt, Jochen; Apfelbacher, Christian; Spuls, Phyllis I; Thomas, Kim S; Simpson, Eric L; Furue, Masutaka; Chalmers, Joanne; Williams, Hywel C
2015-01-01
Core outcome sets (COSs) are consensus-derived minimum sets of outcomes to be assessed in a specific situation. COSs are being increasingly developed to limit outcome-reporting bias, allow comparisons across trials, and strengthen clinical decision making. Despite the increasing interest in outcomes research, methods to develop COSs have not yet been standardized. The aim of this paper is to present the Harmonizing Outcomes Measures for Eczema (HOME) roadmap for the development and implementation of COSs, which was developed on the basis of our experience in the standardization of outcome measurements for atopic eczema. Following the establishment of a panel representing all relevant stakeholders and a research team experienced in outcomes research, the scope and setting of the core set should be defined. The next steps are the definition of a core set of outcome domains such as symptoms or quality of life, followed by the identification or development and validation of appropriate outcome measurement instruments to measure these core domains. Finally, the consented COS needs to be disseminated, implemented, and reviewed. We believe that the HOME roadmap is a useful methodological framework to develop COSs in dermatology, with the ultimate goal of better decision making and promoting patient-centered health care.
Zhang, Bing; Schmoyer, Denise; Kirov, Stefan; Snoddy, Jay
2004-01-01
Background Microarray and other high-throughput technologies are producing large sets of interesting genes that are difficult to analyze directly. Bioinformatics tools are needed to interpret the functional information in the gene sets. Results We have created a web-based tool for data analysis and data visualization for sets of genes called GOTree Machine (GOTM). This tool was originally intended to analyze sets of co-regulated genes identified from microarray analysis but is adaptable for use with other gene sets from other high-throughput analyses. GOTree Machine generates a GOTree, a tree-like structure to navigate the Gene Ontology Directed Acyclic Graph for input gene sets. This system provides user friendly data navigation and visualization. Statistical analysis helps users to identify the most important Gene Ontology categories for the input gene sets and suggests biological areas that warrant further study. GOTree Machine is available online at . Conclusion GOTree Machine has a broad application in functional genomic, proteomic and other high-throughput methods that generate large sets of interesting genes; its primary purpose is to help users sort for interesting patterns in gene sets. PMID:14975175
Schlessinger, Daniel I; Iyengar, Sanjana; Yanes, Arianna F; Chiren, Sarah G; Godinez-Puig, Victoria; Chen, Brian R; Kurta, Anastasia O; Schmitt, Jochen; Deckert, Stefanie; Furlan, Karina C; Poon, Emily; Cartee, Todd V; Maher, Ian A; Alam, Murad; Sobanko, Joseph F
2017-07-12
Squamous cell carcinoma (SCC) is a common skin cancer that poses a risk of metastasis. Clinical investigations into SCC treatment are common, but the outcomes reported are highly variable, omitted, or clinically irrelevant. The outcome heterogeneity and reporting bias of these studies leave clinicians unable to accurately compare studies. Core outcome sets (COSs) are an agreed minimum set of outcomes recommended to be measured and reported in all clinical trials of a given condition or disease. Although COSs are under development for several dermatologic conditions, work has yet to be done to identify core outcomes specific for SCC. Outcome extraction for COS generation will occur via four methods: (1) systematic literature review; (2) patient interviews; (3) other published sources; and (4) input from stakeholders in medicine, pharmacy, and other relevant industries. The list of outcomes will be revaluated by the Measuring PRiority Outcome Variables via Excellence in Dermatologic surgery (IMPROVED) Steering Committee. Delphi processes will be performed separately by expert clinicians and patients to condense the list of outcomes generated. A consensus meeting with relevant stakeholders will be conducted after the Delphi exercise to further select outcomes, taking into account participant scores. At the end of the meeting, members will vote and decide on a final recommended set of core outcomes. The Core Outcome Measures in Effectiveness Trials (COMET) organization and the Cochrane Skin Group - Core Outcome Set Initiative (CSG-COUSIN) will serve as advisers throughout the COS generation process. Comparison of clinical trials via systematic reviews and meta-analyses is facilitated when investigators study outcomes that are relevant and similar. The aim of this project is to develop a COS to guide use for future clinical trials.
Gene set analysis using variance component tests.
Huang, Yen-Tsung; Lin, Xihong
2013-06-28
Gene set analyses have become increasingly important in genomic research, as many complex diseases are contributed jointly by alterations of numerous genes. Genes often coordinate together as a functional repertoire, e.g., a biological pathway/network and are highly correlated. However, most of the existing gene set analysis methods do not fully account for the correlation among the genes. Here we propose to tackle this important feature of a gene set to improve statistical power in gene set analyses. We propose to model the effects of an independent variable, e.g., exposure/biological status (yes/no), on multiple gene expression values in a gene set using a multivariate linear regression model, where the correlation among the genes is explicitly modeled using a working covariance matrix. We develop TEGS (Test for the Effect of a Gene Set), a variance component test for the gene set effects by assuming a common distribution for regression coefficients in multivariate linear regression models, and calculate the p-values using permutation and a scaled chi-square approximation. We show using simulations that type I error is protected under different choices of working covariance matrices and power is improved as the working covariance approaches the true covariance. The global test is a special case of TEGS when correlation among genes in a gene set is ignored. Using both simulation data and a published diabetes dataset, we show that our test outperforms the commonly used approaches, the global test and gene set enrichment analysis (GSEA). We develop a gene set analyses method (TEGS) under the multivariate regression framework, which directly models the interdependence of the expression values in a gene set using a working covariance. TEGS outperforms two widely used methods, GSEA and global test in both simulation and a diabetes microarray data.
Shariati J, Vahid; Malboobi, Mohammad Ali; Tabrizi, Zeinab; Tavakol, Elahe; Owilia, Parviz; Safari, Maryam
2017-11-15
In this study, we provide a comparative genomic analysis of Pantoea agglomerans strain P5 and 10 closely related strains based on phylogenetic analyses. A next-generation shotgun strategy was implemented using the Illumina HiSeq 2500 technology followed by core- and pan-genome analysis. The genome of P. agglomerans strain P5 contains an assembly size of 5082485 bp with 55.4% G + C content. P. agglomerans consists of 2981 core and 3159 accessory genes for Coding DNA Sequences (CDSs) based on the pan-genome analysis. Strain P5 can be grouped closely with strains PG734 and 299 R using pan and core genes, respectively. All the predicted and annotated gene sequences were allocated to KEGG pathways. Accordingly, genes involved in plant growth-promoting (PGP) ability, including phosphate solubilization, IAA and siderophore production, acetoin and 2,3-butanediol synthesis and bacterial secretion, were assigned. This study provides an in-depth view of the PGP characteristics of strain P5, highlighting its potential use in agriculture as a biofertilizer.
Modular architecture of the T4 phage superfamily: A conserved core genome and a plastic periphery
DOE Office of Scientific and Technical Information (OSTI.GOV)
Comeau, Andre M.; Bertrand, Claire; Letarov, Andrei
2007-06-05
Among the most numerous objects in the biosphere, phages show enormous diversity in morphology and genetic content. We have sequenced 7 T4-like phages and compared their genome architecture. All seven phages share a core genome with T4 that is interrupted by several hyperplastic regions (HPRs) where most of their divergence occurs. The core primarily includes homologues of essential T4 genes, such as the virion structure and DNA replication genes. In contrast, the HPRs contain mostly novel genes of unknown function and origin. A few of the HPR genes that can be assigned putative functions, such as a series of novelmore » Internal Proteins, are implicated in phage adaptation to the host. Thus, the T4-like genome appears to be partitioned into discrete segments that fulfil different functions and behave differently in evolution. Such partitioning may be critical for these large and complex phages to maintain their flexibility, while simultaneously allowing them to conserve their highly successful virion design and mode of replication.« less
GARNET--gene set analysis with exploration of annotation relations.
Rho, Kyoohyoung; Kim, Bumjin; Jang, Youngjun; Lee, Sanghyun; Bae, Taejeong; Seo, Jihae; Seo, Chaehwa; Lee, Jihyun; Kang, Hyunjung; Yu, Ungsik; Kim, Sunghoon; Lee, Sanghyuk; Kim, Wan Kyu
2011-02-15
Gene set analysis is a powerful method of deducing biological meaning for an a priori defined set of genes. Numerous tools have been developed to test statistical enrichment or depletion in specific pathways or gene ontology (GO) terms. Major difficulties towards biological interpretation are integrating diverse types of annotation categories and exploring the relationships between annotation terms of similar information. GARNET (Gene Annotation Relationship NEtwork Tools) is an integrative platform for gene set analysis with many novel features. It includes tools for retrieval of genes from annotation database, statistical analysis & visualization of annotation relationships, and managing gene sets. In an effort to allow access to a full spectrum of amassed biological knowledge, we have integrated a variety of annotation data that include the GO, domain, disease, drug, chromosomal location, and custom-defined annotations. Diverse types of molecular networks (pathways, transcription and microRNA regulations, protein-protein interaction) are also included. The pair-wise relationship between annotation gene sets was calculated using kappa statistics. GARNET consists of three modules--gene set manager, gene set analysis and gene set retrieval, which are tightly integrated to provide virtually automatic analysis for gene sets. A dedicated viewer for annotation network has been developed to facilitate exploration of the related annotations. GARNET (gene annotation relationship network tools) is an integrative platform for diverse types of gene set analysis, where complex relationships among gene annotations can be easily explored with an intuitive network visualization tool (http://garnet.isysbio.org/ or http://ercsb.ewha.ac.kr/garnet/).
Production and pathogenicity of hepatitis C virus core gene products
Li, Hui-Chun; Ma, Hsin-Chieh; Yang, Chee-Hing; Lo, Shih-Yen
2014-01-01
Hepatitis C virus (HCV) is a major cause of chronic liver diseases, including steatosis, cirrhosis and hepatocellular carcinoma, and its infection is also associated with insulin resistance and type 2 diabetes mellitus. HCV, belonging to the Flaviviridae family, is a small enveloped virus whose positive-stranded RNA genome encoding a polyprotein. The HCV core protein is cleaved first at residue 191 by the host signal peptidase and further cleaved by the host signal peptide peptidase at about residue 177 to generate the mature core protein (a.a. 1-177) and the cleaved peptide (a.a. 178-191). Core protein could induce insulin resistance, steatosis and even hepatocellular carcinoma through various mechanisms. The peptide (a.a. 178-191) may play a role in the immune response. The polymorphism of this peptide is associated with the cellular lipid drop accumulation, contributing to steatosis development. In addition to the conventional open reading frame (ORF), in the +1 frame, an ORF overlaps with the core protein-coding sequence and encodes the alternative reading frame proteins (ARFP or core+1). ARFP/core+1/F protein could enhance hepatocyte growth and may regulate iron metabolism. In this review, we briefly summarized the current knowledge regarding the production of different core gene products and their roles in viral pathogenesis. PMID:24966583
Phylogeographic reconstruction of a bacterial species with high levels of lateral gene transfer
Pearson, T.; Giffard, P.; Beckstrom-Sternberg, S.; Auerbach, R.; Hornstra, H.; Tuanyok, A.; Price, E.P.; Glass, M.B.; Leadem, B.; Beckstrom-Sternberg, J. S.; Allan, G.J.; Foster, J.T.; Wagner, D.M.; Okinaka, R.T.; Sim, S.H.; Pearson, O.; Wu, Z.; Chang, J.; Kaul, R.; Hoffmaster, A.R.; Brettin, T.S.; Robison, R.A.; Mayo, M.; Gee, J.E.; Tan, P.; Currie, B.J.; Keim, P.
2009-01-01
Background: Phylogeographic reconstruction of some bacterial populations is hindered by low diversity coupled with high levels of lateral gene transfer. A comparison of recombination levels and diversity at seven housekeeping genes for eleven bacterial species, most of which are commonly cited as having high levels of lateral gene transfer shows that the relative contributions of homologous recombination versus mutation for Burkholderia pseudomallei is over two times higher than for Streptococcus pneumoniae and is thus the highest value yet reported in bacteria. Despite the potential for homologous recombination to increase diversity, B. pseudomallei exhibits a relative lack of diversity at these loci. In these situations, whole genome genotyping of orthologous shared single nucleotide polymorphism loci, discovered using next generation sequencing technologies, can provide very large data sets capable of estimating core phylogenetic relationships. We compared and searched 43 whole genome sequences of B. pseudomallei and its closest relatives for single nucleotide polymorphisms in orthologous shared regions to use in phylogenetic reconstruction. Results: Bayesian phylogenetic analyses of >14,000 single nucleotide polymorphisms yielded completely resolved trees for these 43 strains with high levels of statistical support. These results enable a better understanding of a separate analysis of population differentiation among >1,700 B. pseudomallei isolates as defined by sequence data from seven housekeeping genes. We analyzed this larger data set for population structure and allele sharing that can be attributed to lateral gene transfer. Our results suggest that despite an almost panmictic population, we can detect two distinct populations of B. pseudomallei that conform to biogeographic patterns found in many plant and animal species. That is, separation along Wallace's Line, a biogeographic boundary between Southeast Asia and Australia. Conclusion: We describe an Australian origin for B. pseudomallei, characterized by a single introduction event into Southeast Asia during a recent glacial period, and variable levels of lateral gene transfer within populations. These patterns provide insights into mechanisms of genetic diversification in B. pseudomallei and its closest relatives, and provide a framework for integrating the traditionally separate fields of population genetics and phylogenetics for other bacterial species with high levels of lateral gene transfer. ?? 2009 Pearson et al; licensee BioMed Central Ltd.
Lemieux, Claude; Vincent, Antony T; Labarre, Aurélie; Otis, Christian; Turmel, Monique
2015-12-01
The class Chlorophyceae (Chlorophyta) includes morphologically and ecologically diverse green algae. Most of the documented species belong to the clade formed by the Chlamydomonadales (also called Volvocales) and Sphaeropleales. Although studies based on the nuclear 18S rRNA gene or a few combined genes have shed light on the diversity and phylogenetic structure of the Chlamydomonadales, the positions of many of the monophyletic groups identified remain uncertain. Here, we used a chloroplast phylogenomic approach to delineate the relationships among these lineages. To generate the analyzed amino acid and nucleotide data sets, we sequenced the chloroplast DNAs (cpDNAs) of 24 chlorophycean taxa; these included representatives from 16 of the 21 primary clades previously recognized in the Chlamydomonadales, two taxa from a coccoid lineage (Jenufa) that was suspected to be sister to the Golenkiniaceae, and two sphaeroplealeans. Using Bayesian and/or maximum likelihood inference methods, we analyzed an amino acid data set that was assembled from 69 cpDNA-encoded proteins of 73 core chlorophyte (including 33 chlorophyceans), as well as two nucleotide data sets that were generated from the 69 genes coding for these proteins and 29 RNA-coding genes. The protein and gene phylogenies were congruent and robustly resolved the branching order of most of the investigated lineages. Within the Chlamydomonadales, 22 taxa formed an assemblage of five major clades/lineages. The earliest-diverging clade displayed Hafniomonas laevis and the Crucicarteria, and was followed by the Radicarteria and then by the Chloromonadinia. The latter lineage was sister to two superclades, one consisting of the Oogamochlamydinia and Reinhardtinia and the other of the Caudivolvoxa and Xenovolvoxa. To our surprise, the Jenufa species and the two spine-bearing green algae belonging to the Golenkinia and Treubaria genera were recovered in a highly supported monophyletic group that also included three taxa representing distinct families of the Sphaeropleales (Bracteacoccaceae, Mychonastaceae, and Scenedesmaceae). Our phylogenomic study advances our knowledge regarding the circumscription and internal structure of the Chlamydomonadales, suggesting that a previously unrecognized lineage is sister to the Sphaeropleales. In addition, it offers new insights into the flagellar structures of the founding members of both the Chlamydomonadales and Sphaeropleales.
Estimation of gene induction enables a relevance-based ranking of gene sets.
Bartholomé, Kilian; Kreutz, Clemens; Timmer, Jens
2009-07-01
In order to handle and interpret the vast amounts of data produced by microarray experiments, the analysis of sets of genes with a common biological functionality has been shown to be advantageous compared to single gene analyses. Some statistical methods have been proposed to analyse the differential gene expression of gene sets in microarray experiments. However, most of these methods either require threshhold values to be chosen for the analysis, or they need some reference set for the determination of significance. We present a method that estimates the number of differentially expressed genes in a gene set without requiring a threshold value for significance of genes. The method is self-contained (i.e., it does not require a reference set for comparison). In contrast to other methods which are focused on significance, our approach emphasizes the relevance of the regulation of gene sets. The presented method measures the degree of regulation of a gene set and is a useful tool to compare the induction of different gene sets and place the results of microarray experiments into the biological context. An R-package is available.
Functional cohesion of gene sets determined by latent semantic indexing of PubMed abstracts.
Xu, Lijing; Furlotte, Nicholas; Lin, Yunyue; Heinrich, Kevin; Berry, Michael W; George, Ebenezer O; Homayouni, Ramin
2011-04-14
High-throughput genomic technologies enable researchers to identify genes that are co-regulated with respect to specific experimental conditions. Numerous statistical approaches have been developed to identify differentially expressed genes. Because each approach can produce distinct gene sets, it is difficult for biologists to determine which statistical approach yields biologically relevant gene sets and is appropriate for their study. To address this issue, we implemented Latent Semantic Indexing (LSI) to determine the functional coherence of gene sets. An LSI model was built using over 1 million Medline abstracts for over 20,000 mouse and human genes annotated in Entrez Gene. The gene-to-gene LSI-derived similarities were used to calculate a literature cohesion p-value (LPv) for a given gene set using a Fisher's exact test. We tested this method against genes in more than 6,000 functional pathways annotated in Gene Ontology (GO) and found that approximately 75% of gene sets in GO biological process category and 90% of the gene sets in GO molecular function and cellular component categories were functionally cohesive (LPv<0.05). These results indicate that the LPv methodology is both robust and accurate. Application of this method to previously published microarray datasets demonstrated that LPv can be helpful in selecting the appropriate feature extraction methods. To enable real-time calculation of LPv for mouse or human gene sets, we developed a web tool called Gene-set Cohesion Analysis Tool (GCAT). GCAT can complement other gene set enrichment approaches by determining the overall functional cohesion of data sets, taking into account both explicit and implicit gene interactions reported in the biomedical literature. GCAT is freely available at http://binf1.memphis.edu/gcat.
ERIC Educational Resources Information Center
Renom, Marta; Conrad, Andrea; Bascuñana, Helena; Cieza, Alarcos; Galán, Ingrid; Kesselring, Jürg; Coenen, Michaela
2014-01-01
Background: The Comprehensive International Classification of Functioning, Disability and Health (ICF) Core Set for Multiple Sclerosis (MS) is a comprehensive framework to structure the information obtained in multidisciplinary clinical settings according to the biopsychosocial perspective of the International Classification of Functioning,…
The COG database: an updated version includes eukaryotes
Tatusov, Roman L; Fedorova, Natalie D; Jackson, John D; Jacobs, Aviva R; Kiryutin, Boris; Koonin, Eugene V; Krylov, Dmitri M; Mazumder, Raja; Mekhedov, Sergei L; Nikolskaya, Anastasia N; Rao, B Sridhar; Smirnov, Sergei; Sverdlov, Alexander V; Vasudevan, Sona; Wolf, Yuri I; Yin, Jodie J; Natale, Darren A
2003-01-01
Background The availability of multiple, essentially complete genome sequences of prokaryotes and eukaryotes spurred both the demand and the opportunity for the construction of an evolutionary classification of genes from these genomes. Such a classification system based on orthologous relationships between genes appears to be a natural framework for comparative genomics and should facilitate both functional annotation of genomes and large-scale evolutionary studies. Results We describe here a major update of the previously developed system for delineation of Clusters of Orthologous Groups of proteins (COGs) from the sequenced genomes of prokaryotes and unicellular eukaryotes and the construction of clusters of predicted orthologs for 7 eukaryotic genomes, which we named KOGs after eukaryotic orthologous groups. The COG collection currently consists of 138,458 proteins, which form 4873 COGs and comprise 75% of the 185,505 (predicted) proteins encoded in 66 genomes of unicellular organisms. The eukaryotic orthologous groups (KOGs) include proteins from 7 eukaryotic genomes: three animals (the nematode Caenorhabditis elegans, the fruit fly Drosophila melanogaster and Homo sapiens), one plant, Arabidopsis thaliana, two fungi (Saccharomyces cerevisiae and Schizosaccharomyces pombe), and the intracellular microsporidian parasite Encephalitozoon cuniculi. The current KOG set consists of 4852 clusters of orthologs, which include 59,838 proteins, or ~54% of the analyzed eukaryotic 110,655 gene products. Compared to the coverage of the prokaryotic genomes with COGs, a considerably smaller fraction of eukaryotic genes could be included into the KOGs; addition of new eukaryotic genomes is expected to result in substantial increase in the coverage of eukaryotic genomes with KOGs. Examination of the phyletic patterns of KOGs reveals a conserved core represented in all analyzed species and consisting of ~20% of the KOG set. This conserved portion of the KOG set is much greater than the ubiquitous portion of the COG set (~1% of the COGs). In part, this difference is probably due to the small number of included eukaryotic genomes, but it could also reflect the relative compactness of eukaryotes as a clade and the greater evolutionary stability of eukaryotic genomes. Conclusion The updated collection of orthologous protein sets for prokaryotes and eukaryotes is expected to be a useful platform for functional annotation of newly sequenced genomes, including those of complex eukaryotes, and genome-wide evolutionary studies. PMID:12969510
The Gene Set Builder: collation, curation, and distribution of sets of genes
Yusuf, Dimas; Lim, Jonathan S; Wasserman, Wyeth W
2005-01-01
Background In bioinformatics and genomics, there are many applications designed to investigate the common properties for a set of genes. Often, these multi-gene analysis tools attempt to reveal sequential, functional, and expressional ties. However, while tremendous effort has been invested in developing tools that can analyze a set of genes, minimal effort has been invested in developing tools that can help researchers compile, store, and annotate gene sets in the first place. As a result, the process of making or accessing a set often involves tedious and time consuming steps such as finding identifiers for each individual gene. These steps are often repeated extensively to shift from one identifier type to another; or to recreate a published set. In this paper, we present a simple online tool which – with the help of the gene catalogs Ensembl and GeneLynx – can help researchers build and annotate sets of genes quickly and easily. Description The Gene Set Builder is a database-driven, web-based tool designed to help researchers compile, store, export, and share sets of genes. This application supports the 17 eukaryotic genomes found in version 32 of the Ensembl database, which includes species from yeast to human. User-created information such as sets and customized annotations are stored to facilitate easy access. Gene sets stored in the system can be "exported" in a variety of output formats – as lists of identifiers, in tables, or as sequences. In addition, gene sets can be "shared" with specific users to facilitate collaborations or fully released to provide access to published results. The application also features a Perl API (Application Programming Interface) for direct connectivity to custom analysis tools. A downloadable Quick Reference guide and an online tutorial are available to help new users learn its functionalities. Conclusion The Gene Set Builder is an Ensembl-facilitated online tool designed to help researchers compile and manage sets of genes in a user-friendly environment. The application can be accessed via . PMID:16371163
Core labeling of adenovirus with EGFP
DOE Office of Scientific and Technical Information (OSTI.GOV)
Le, Long P.; Le, Helen N.; Nelson, Amy R.
2006-08-01
The study of adenovirus could greatly benefit from diverse methods of virus detection. Recently, it has been demonstrated that carboxy-terminal EGFP fusions of adenovirus core proteins Mu, V, and VII properly localize to the nucleus and display novel function in the cell. Based on these observations, we hypothesized that the core proteins may serve as targets for labeling the adenovirus core with fluorescent proteins. To this end, we constructed various chimeric expression vectors with fusion core genes (Mu-EGFP, V-EGFP, preVII-EGFP, and matVII-EGFP) while maintaining expression of the native proteins. Expression of the fusion core proteins was suboptimal using E1 expressionmore » vectors with both conventional CMV and modified (with adenovirus tripartite leader sequence) CMV5 promoters, resulting in non-labeled viral particles. However, robust expression equivalent to the native protein was observed when the fusion genes were placed in the deleted E3 region. The efficient Ad-wt-E3-V-EGFP and Ad-wt-E3-preVII-EGFP expression vectors were labeled allowing visualization of purified virus and tracking of the viral core during early infection. The vectors maintained their viral function, including viral DNA replication, viral DNA encapsidation, cytopathic effect, and thermostability. Core labeling offers a means to track the adenovirus core in vector targeting studies as well as basic adenovirus virology.« less
Mason, Mike J; Fan, Guoping; Plath, Kathrin; Zhou, Qing; Horvath, Steve
2009-01-01
Background Recent work has revealed that a core group of transcription factors (TFs) regulates the key characteristics of embryonic stem (ES) cells: pluripotency and self-renewal. Current efforts focus on identifying genes that play important roles in maintaining pluripotency and self-renewal in ES cells and aim to understand the interactions among these genes. To that end, we investigated the use of unsigned and signed network analysis to identify pluripotency and differentiation related genes. Results We show that signed networks provide a better systems level understanding of the regulatory mechanisms of ES cells than unsigned networks, using two independent murine ES cell expression data sets. Specifically, using signed weighted gene co-expression network analysis (WGCNA), we found a pluripotency module and a differentiation module, which are not identified in unsigned networks. We confirmed the importance of these modules by incorporating genome-wide TF binding data for key ES cell regulators. Interestingly, we find that the pluripotency module is enriched with genes related to DNA damage repair and mitochondrial function in addition to transcriptional regulation. Using a connectivity measure of module membership, we not only identify known regulators of ES cells but also show that Mrpl15, Msh6, Nrf1, Nup133, Ppif, Rbpj, Sh3gl2, and Zfp39, among other genes, have important roles in maintaining ES cell pluripotency and self-renewal. We also report highly significant relationships between module membership and epigenetic modifications (histone modifications and promoter CpG methylation status), which are known to play a role in controlling gene expression during ES cell self-renewal and differentiation. Conclusion Our systems biologic re-analysis of gene expression, transcription factor binding, epigenetic and gene ontology data provides a novel integrative view of ES cell biology. PMID:19619308
Nejat, Naghmeh; Cahill, David M; Vadamalai, Ganesan; Ziemann, Mark; Rookes, James; Naderali, Neda
2015-10-01
Invasive phytoplasmas wreak havoc on coconut palms worldwide, leading to high loss of income, food insecurity and extreme poverty of farmers in producing countries. Phytoplasmas as strictly biotrophic insect-transmitted bacterial pathogens instigate distinct changes in developmental processes and defence responses of the infected plants and manipulate plants to their own advantage; however, little is known about the cellular and molecular mechanisms underlying host-phytoplasma interactions. Further, phytoplasma-mediated transcriptional alterations in coconut palm genes have not yet been identified. This study evaluated the whole transcriptome profiles of naturally infected leaves of Cocos nucifera ecotype Malayan Red Dwarf in response to yellow decline phytoplasma from group 16SrXIV, using RNA-Seq technique. Transcriptomics-based analysis reported here identified genes involved in coconut innate immunity. The number of down-regulated genes in response to phytoplasma infection exceeded the number of genes up-regulated. Of the 39,873 differentially expressed unigenes, 21,860 unigenes were suppressed and 18,013 were induced following infection. Comparative analysis revealed that genes associated with defence signalling against biotic stimuli were significantly overexpressed in phytoplasma-infected leaves versus healthy coconut leaves. Genes involving cell rescue and defence, cellular transport, oxidative stress, hormone stimulus and metabolism, photosynthesis reduction, transcription and biosynthesis of secondary metabolites were differentially represented. Our transcriptome analysis unveiled a core set of genes associated with defence of coconut in response to phytoplasma attack, although several novel defence response candidate genes with unknown function have also been identified. This study constitutes valuable sequence resource for uncovering the resistance genes and/or susceptibility genes which can be used as genetic tools in disease resistance breeding.
2011-01-01
Background Simpler biological systems should be easier to understand and to engineer towards pre-defined goals. One way to achieve biological simplicity is through genome minimization. Here we looked for genomic islands in the fresh water cyanobacteria Synechococcus elongatus PCC 7942 (genome size 2.7 Mb) that could be used as targets for deletion. We also looked for conserved genes that might be essential for cell survival. Results By using a combination of methods we identified 170 xenologs, 136 ORFans and 1401 core genes in the genome of S. elongatus PCC 7942. These represent 6.5%, 5.2% and 53.6% of the annotated genes respectively. We considered that genes in genomic islands could be found if they showed a combination of: a) unusual G+C content; b) unusual phylogenetic similarity; and/or c) a small number of the highly iterated palindrome 1 (HIP1) motif plus an unusual codon usage. The origin of the largest genomic island by horizontal gene transfer (HGT) could be corroborated by lack of coverage among metagenomic sequences from a fresh water microbialite. Evidence is also presented that xenologous genes tend to cluster in operons. Interestingly, most genes coding for proteins with a diguanylate cyclase domain are predicted to be xenologs, suggesting a role for horizontal gene transfer in the evolution of Synechococcus sensory systems. Conclusions Our estimates of genomic islands in PCC 7942 are larger than those predicted by other published methods like SIGI-HMM. Our results set a guide to non-essential genes in S. elongatus PCC 7942 indicating a path towards the engineering of a model photoautotrophic bacterial cell. PMID:21226929
The common transcriptional subnetworks of the grape berry skin in the late stages of ripening.
Ghan, Ryan; Petereit, Juli; Tillett, Richard L; Schlauch, Karen A; Toubiana, David; Fait, Aaron; Cramer, Grant R
2017-05-30
Wine grapes are important economically in many countries around the world. Defining the optimum time for grape harvest is a major challenge to the grower and winemaker. Berry skins are an important source of flavor, color and other quality traits in the ripening stage. Senescent-like processes such as chloroplast disorganization and cell death characterize the late ripening stage. To better understand the molecular and physiological processes involved in the late stages of berry ripening, RNA-seq analysis of the skins of seven wine grape cultivars (Cabernet Franc, Cabernet Sauvignon, Merlot, Pinot Noir, Chardonnay, Sauvignon Blanc and Semillon) was performed. RNA-seq analysis identified approximately 2000 common differentially expressed genes for all seven cultivars across four different berry sugar levels (20 to 26 °Brix). Network analyses, both a posteriori (standard) and a priori (gene co-expression network analysis), were used to elucidate transcriptional subnetworks and hub genes associated with traits in the berry skins of the late stages of berry ripening. These independent approaches revealed genes involved in photosynthesis, catabolism, and nucleotide metabolism. The transcript abundance of most photosynthetic genes declined with increasing sugar levels in the berries. The transcript abundance of other processes increased such as nucleic acid metabolism, chromosome organization and lipid catabolism. Weighted gene co-expression network analysis (WGCNA) identified 64 gene modules that were organized into 12 subnetworks of three modules or more and six higher order gene subnetworks. Some gene subnetworks were highly correlated with sugar levels and some subnetworks were highly enriched in the chloroplast and nucleus. The petal R package was utilized independently to construct a true small-world and scale-free complex gene co-expression network model. A subnetwork of 216 genes with the highest connectivity was elucidated, consistent with the module results from WGCNA. Hub genes in these subnetworks were identified including numerous members of the core circadian clock, RNA splicing, proteolysis and chromosome organization. An integrated model was constructed linking light sensing with alternative splicing, chromosome remodeling and the circadian clock. A common set of differentially expressed genes and gene subnetworks from seven different cultivars were examined in the skin of the late stages of grapevine berry ripening. A densely connected gene subnetwork was elucidated involving a complex interaction of berry senescent processes (autophagy), catabolism, the circadian clock, RNA splicing, proteolysis and epigenetic regulation. Hypotheses were induced from these data sets involving sugar accumulation, light, autophagy, epigenetic regulation, and fruit development. This work provides a better understanding of berry development and the transcriptional processes involved in the late stages of ripening.
Aggarwal, Rohit; Rider, Lisa G; Ruperto, Nicolino; Bayat, Nastaran; Erman, Brian; Feldman, Brian M; Oddis, Chester V; Amato, Anthony A; Chinoy, Hector; Cooper, Robert G; Dastmalchi, Maryam; Fiorentino, David; Isenberg, David; Katz, James D; Mammen, Andrew; de Visser, Marianne; Ytterberg, Steven R; Lundberg, Ingrid E; Chung, Lorinda; Danko, Katalin; García-De la Torre, Ignacio; Song, Yeong Wook; Villa, Luca; Rinaldi, Mariangela; Rockette, Howard; Lachenbruch, Peter A; Miller, Frederick W; Vencovsky, Jiri
2017-05-01
To develop response criteria for adult dermatomyositis (DM) and polymyositis (PM). Expert surveys, logistic regression, and conjoint analysis were used to develop 287 definitions using core set measures. Myositis experts rated greater improvement among multiple pairwise scenarios in conjoint analysis surveys, where different levels of improvement in 2 core set measures were presented. The PAPRIKA (Potentially All Pairwise Rankings of All Possible Alternatives) method determined the relative weights of core set measures and conjoint analysis definitions. The performance characteristics of the definitions were evaluated on patient profiles using expert consensus (gold standard) and were validated using data from a clinical trial. The nominal group technique was used to reach consensus. Consensus was reached for a conjoint analysis-based continuous model using absolute percent change in core set measures (physician, patient, and extramuscular global activity, muscle strength, Health Assessment Questionnaire, and muscle enzyme levels). A total improvement score (range 0-100), determined by summing scores for each core set measure, was based on improvement in and relative weight of each core set measure. Thresholds for minimal, moderate, and major improvement were ≥20, ≥40, and ≥60 points in the total improvement score. The same criteria were chosen for juvenile DM, with different improvement thresholds. Sensitivity and specificity in DM/PM patient cohorts were 85% and 92%, 90% and 96%, and 92% and 98% for minimal, moderate, and major improvement, respectively. Definitions were validated in the clinical trial analysis for differentiating the physician rating of improvement (P < 0.001). The response criteria for adult DM/PM consisted of the conjoint analysis model based on absolute percent change in 6 core set measures, with thresholds for minimal, moderate, and major improvement. © 2017, American College of Rheumatology.
Weigl, Martin; Wild, Heike
2017-09-15
To validate the International Classification of Functioning, Disability and Health Comprehensive Core Set for Osteoarthritis from the patient perspective in Europe. This multicenter cross-sectional study involved 375 patients with knee or hip osteoarthritis. Trained health professionals completed the Comprehensive Core Set, and patients completed the Short-Form 36 questionnaire. Content validity was evaluated by calculating prevalences of impairments in body function and structures, limitations in activities and participation and environmental factors, which were either barriers or facilitators. Convergent construct validity was evaluated by correlating the International Classification of Functioning, Disability and Health categories with the Short-Form 36 Physical Component Score and the SF-36 Mental Component Score in a subgroup of 259 patients. The prevalences of all body function, body structure and activities and participation categories were >40%, >32% and >20%, respectively, and all environmental factors were relevant for >16% of patients. Few categories showed relevant differences between knee and hip osteoarthritis. All body function categories and all but two activities and participation categories showed significant correlations with the Physical Component Score. Body functions from the ICF chapter Mental Functions showed higher correlations with the Mental Component Score than with the Physical Component Score. This study supports the validity of the International Classification of Functioning, Disability and Health Comprehensive Core Set for Osteoarthritis. Implications for Rehabilitation Comprehensive International Classification of Functioning, Disability and Health Core Sets were developed as practical tools for application in multidisciplinary assessments. The validity of the Comprehensive International Classification of Functioning, Disability and Health Core Set for Osteoarthritis in this study supports its application in European patients with osteoarthritis. The differences in results between this Europe validation study and a previous Singaporean validation study underscore the need to validate the International Classification of Functioning, Disability and Health Core Sets in different regions of the world.
LCR 5' hypersensitive site specificity for globin gene activation within the active chromatin hub.
Peterson, Kenneth R; Fedosyuk, Halyna; Harju-Baker, Susanna
2012-12-01
The DNaseI hypersensitive sites (HSs) of the human β-globin locus control region (LCR) may function as part of an LCR holocomplex within a larger active chromatin hub (ACH). Differential activation of the globin genes during development may be controlled in part by preferential interaction of each gene with specific individual HSs during globin gene switching, a change in conformation of the LCR holocomplex, or both. To distinguish between these possibilities, human β-globin locus yeast artificial chromosome (β-YAC) lines were produced in which the ε-globin gene was replaced with a second marked β-globin gene (β(m)), coupled to an intact LCR, a 5'HS3 complete deletion (5'ΔHS3) or a 5'HS3 core deletion (5'ΔHS3c). The 5'ΔHS3c mice expressed β(m)-globin throughout development; γ-globin was co-expressed in the embryonic yolk sac, but not in the fetal liver; and wild-type β-globin was co-expressed in adult mice. Although the 5'HS3 core was not required for β(m)-globin expression, previous work showed that the 5'HS3 core is necessary for ε-globin expression during embryonic erythropoiesis. A similar phenotype was observed in 5'HS complete deletion mice, except β(m)-globin expression was higher during primitive erythropoiesis and γ-globin expression continued into fetal definitive erythropoiesis. These data support a site specificity model of LCR HS-globin gene interaction.
Spectral gene set enrichment (SGSE).
Frost, H Robert; Li, Zhigang; Moore, Jason H
2015-03-03
Gene set testing is typically performed in a supervised context to quantify the association between groups of genes and a clinical phenotype. In many cases, however, a gene set-based interpretation of genomic data is desired in the absence of a phenotype variable. Although methods exist for unsupervised gene set testing, they predominantly compute enrichment relative to clusters of the genomic variables with performance strongly dependent on the clustering algorithm and number of clusters. We propose a novel method, spectral gene set enrichment (SGSE), for unsupervised competitive testing of the association between gene sets and empirical data sources. SGSE first computes the statistical association between gene sets and principal components (PCs) using our principal component gene set enrichment (PCGSE) method. The overall statistical association between each gene set and the spectral structure of the data is then computed by combining the PC-level p-values using the weighted Z-method with weights set to the PC variance scaled by Tracy-Widom test p-values. Using simulated data, we show that the SGSE algorithm can accurately recover spectral features from noisy data. To illustrate the utility of our method on real data, we demonstrate the superior performance of the SGSE method relative to standard cluster-based techniques for testing the association between MSigDB gene sets and the variance structure of microarray gene expression data. Unsupervised gene set testing can provide important information about the biological signal held in high-dimensional genomic data sets. Because it uses the association between gene sets and samples PCs to generate a measure of unsupervised enrichment, the SGSE method is independent of cluster or network creation algorithms and, most importantly, is able to utilize the statistical significance of PC eigenvalues to ignore elements of the data most likely to represent noise.
Kumar, Nitin; Lad, Ganesh; Giuntini, Elisa; Kaye, Maria E.; Udomwong, Piyachat; Shamsani, N. Jannah; Young, J. Peter W.; Bailly, Xavier
2015-01-01
Biological species may remain distinct because of genetic isolation or ecological adaptation, but these two aspects do not always coincide. To establish the nature of the species boundary within a local bacterial population, we characterized a sympatric population of the bacterium Rhizobium leguminosarum by genomic sequencing of 72 isolates. Although all strains have 16S rRNA typical of R. leguminosarum, they fall into five genospecies by the criterion of average nucleotide identity (ANI). Many genes, on plasmids as well as the chromosome, support this division: recombination of core genes has been largely within genospecies. Nevertheless, variation in ecological properties, including symbiotic host range and carbon-source utilization, cuts across these genospecies, so that none of these phenotypes is diagnostic of genospecies. This phenotypic variation is conferred by mobile genes. The genospecies meet the Mayr criteria for biological species in respect of their core genes, but do not correspond to coherent ecological groups, so periodic selection may not be effective in purging variation within them. The population structure is incompatible with traditional ‘polyphasic taxonomy′ that requires bacterial species to have both phylogenetic coherence and distinctive phenotypes. More generally, genomics has revealed that many bacterial species share adaptive modules by horizontal gene transfer, and we envisage a more consistent taxonomic framework that explicitly recognizes this. Significant phenotypes should be recognized as ‘biovars' within species that are defined by core gene phylogeny. PMID:25589577
Cavanagh, Jorunn Pauline; Hjerde, Erik; Holden, Matthew T G; Kahlke, Tim; Klingenberg, Claus; Flægstad, Trond; Parkhill, Julian; Bentley, Stephen D; Sollid, Johanna U Ericson
2014-11-01
Staphylococcus haemolyticus is an emerging cause of nosocomial infections, primarily affecting immunocompromised patients. A comparative genomic analysis was performed on clinical S. haemolyticus isolates to investigate their genetic relationship and explore the coding sequences with respect to antimicrobial resistance determinants and putative hospital adaptation. Whole-genome sequencing was performed on 134 isolates of S. haemolyticus from geographically diverse origins (Belgium, 2; Germany, 10; Japan, 13; Norway, 54; Spain, 2; Switzerland, 43; UK, 9; USA, 1). Each genome was individually assembled. Protein coding sequences (CDSs) were predicted and homologous genes were categorized into three types: Type I, core genes, homologues present in all strains; Type II, unique core genes, homologues shared by only a subgroup of strains; and Type III, unique genes, strain-specific CDSs. The phylogenetic relationship between the isolates was built from variable sites in the form of single nucleotide polymorphisms (SNPs) in the core genome and used to construct a maximum likelihood phylogeny. SNPs in the genome core regions divided the isolates into one major group of 126 isolates and one minor group of isolates with highly diverse genomes. The major group was further subdivided into seven clades (A-G), of which four (A-D) encompassed isolates only from Europe. Antimicrobial multiresistance was observed in 77.7% of the collection. High levels of homologous recombination were detected in genes involved in adherence, staphylococcal host adaptation and bacterial cell communication. The presence of several successful and highly resistant clones underlines the adaptive potential of this opportunistic pathogen. © The Author 2014. Published by Oxford University Press on behalf of the British Society for Antimicrobial Chemotherapy.
Cavanagh, Jorunn Pauline; Hjerde, Erik; Holden, Matthew T. G.; Kahlke, Tim; Klingenberg, Claus; Flægstad, Trond; Parkhill, Julian; Bentley, Stephen D.; Sollid, Johanna U. Ericson
2014-01-01
Objectives Staphylococcus haemolyticus is an emerging cause of nosocomial infections, primarily affecting immunocompromised patients. A comparative genomic analysis was performed on clinical S. haemolyticus isolates to investigate their genetic relationship and explore the coding sequences with respect to antimicrobial resistance determinants and putative hospital adaptation. Methods Whole-genome sequencing was performed on 134 isolates of S. haemolyticus from geographically diverse origins (Belgium, 2; Germany, 10; Japan, 13; Norway, 54; Spain, 2; Switzerland, 43; UK, 9; USA, 1). Each genome was individually assembled. Protein coding sequences (CDSs) were predicted and homologous genes were categorized into three types: Type I, core genes, homologues present in all strains; Type II, unique core genes, homologues shared by only a subgroup of strains; and Type III, unique genes, strain-specific CDSs. The phylogenetic relationship between the isolates was built from variable sites in the form of single nucleotide polymorphisms (SNPs) in the core genome and used to construct a maximum likelihood phylogeny. Results SNPs in the genome core regions divided the isolates into one major group of 126 isolates and one minor group of isolates with highly diverse genomes. The major group was further subdivided into seven clades (A–G), of which four (A–D) encompassed isolates only from Europe. Antimicrobial multiresistance was observed in 77.7% of the collection. High levels of homologous recombination were detected in genes involved in adherence, staphylococcal host adaptation and bacterial cell communication. Conclusions The presence of several successful and highly resistant clones underlines the adaptive potential of this opportunistic pathogen. PMID:25038069
Takahashi, Kei-ichiro; Takigawa, Ichigaku; Mamitsuka, Hiroshi
2013-01-01
Detecting biclusters from expression data is useful, since biclusters are coexpressed genes under only part of all given experimental conditions. We present a software called SiBIC, which from a given expression dataset, first exhaustively enumerates biclusters, which are then merged into rather independent biclusters, which finally are used to generate gene set networks, in which a gene set assigned to one node has coexpressed genes. We evaluated each step of this procedure: 1) significance of the generated biclusters biologically and statistically, 2) biological quality of merged biclusters, and 3) biological significance of gene set networks. We emphasize that gene set networks, in which nodes are not genes but gene sets, can be more compact than usual gene networks, meaning that gene set networks are more comprehensible. SiBIC is available at http://utrecht.kuicr.kyoto-u.ac.jp:8080/miami/faces/index.jsp.
Beuscart, Jean-Baptiste; Dalleur, Olivia; Boland, Benoit; Thevelin, Stefanie; Knol, Wilma; Cullinan, Shane; Schneider, Claudio; O'Mahony, Denis; Rodondi, Nicolas; Spinewine, Anne
2017-01-01
Medication review has been advocated to address the challenge of polypharmacy in older patients, yet there is no consensus on how best to evaluate its efficacy. Heterogeneity of outcomes reported in clinical trials can hinder the comparison of clinical trial findings in systematic reviews. Moreover, the outcomes that matter most to older patients might be under-reported or disregarded altogether. A core outcome set can address this issue as it defines a minimum set of outcomes that should be reported in all clinical trials in any particular field of research. As part of the European Commission-funded project, called OPtimising thERapy to prevent Avoidable hospital admissions in the Multimorbid elderly, this paper describes the methods used to develop a core outcome set for clinical trials of medication review in older patients with multimorbidity. The study was designed in several steps. First, a systematic review established which outcomes were measured in published and ongoing clinical trials of medication review in older patients. Second, we undertook semistructured interviews with older patients and carers aimed at identifying additional relevant outcomes. Then, a multilanguage European Delphi survey adapted to older patients was designed. The international Delphi survey was conducted with older patients, health care professionals, researchers, and clinical experts in geriatric pharmacotherapy to validate outcomes to be included in the core outcome set. Consensus meetings were conducted to validate the results. We present the method for developing a core outcome set for medication review in older patients with multimorbidity. This study protocol could be used as a basis to develop core outcome sets in other fields of geriatric research.
Core outcome sets and trial registries.
Clarke, Mike; Williamson, Paula
2015-05-14
Some reasons for registering trials might be considered as self-serving, such as satisfying the requirements of a journal in which the researchers wish to publish their eventual findings or publicising the trial to boost recruitment. Registry entries also help others, including systematic reviewers, to know about ongoing or unpublished studies and contribute to reducing research waste by making it clear what studies are ongoing. Other sources of research waste include inconsistency in outcome measurement across trials in the same area, missing data on important outcomes from some trials, and selective reporting of outcomes. One way to reduce this waste is through the use of core outcome sets: standardised sets of outcomes for research in specific areas of health and social care. These do not restrict the outcomes that will be measured, but provide the minimum to include if a trial is to be of the most use to potential users. We propose that trial registries, such as ISRCTN, encourage researchers to note their use of a core outcome set in their entry. This will help people searching for trials and those worried about selective reporting in closed trials. Trial registries can facilitate these efforts to make new trials as useful as possible and reduce waste. The outcomes section in the entry could prompt the researcher to consider using a core outcome set and facilitate the specification of that core outcome set and its component outcomes through linking to the original core outcome set. In doing this, registries will contribute to the global effort to ensure that trials answer important uncertainties, can be brought together in systematic reviews, and better serve their ultimate aim of improving health and well-being through improving health and social care.
A core outcome set for clinical trials in acute diarrhoea.
Karas, Jacek; Ashkenazi, Shai; Guarino, Alfredo; Lo Vecchio, Andrea; Shamir, Raanan; Vandenplas, Yvan; Szajewska, Hania
2015-04-01
Core outcome sets are the baseline for what should be measured in clinical research and, thus, should serve as a guide for what should be collected and reported. The Consensus Group on Outcome Measures Made in Pediatric Enteral Nutrition Clinical Trials, established in 2012, agreed that consensus on a core set of outcomes with agreed-upon definitions that should be measured and reported in clinical trials was needed. To achieve this goal, six working groups (WGs) were setup, including WG on acute diarrhoea, whose main goal was to develop a core outcome set for trials in acute diarrhoea. The first step identified how published outcomes related to acute diarrhoea were reported. The second focused on the methodology for determining which outcomes to measure in clinical trials. The third employed a two-phase questionnaire study using the Delphi technique to define clinically important outcomes to clinicians and parents. For therapeutic studies, the five most important outcome measures were diarrhoea duration, degree of dehydration, need for hospitalisation (or duration of hospitalisation for inpatients), the proportion of patients recovered by 48 h and adverse effects. The prophylactic core outcome set included prevention of diarrhoea, prevention of dehydration, prevention of hospitalisation and adverse effects. The outcome sets for therapy and prevention can be recommended for use in future trials of patients with gastroenteritis. Their envisioned goal is to decrease study heterogeneity and to ease the comparability of studies. WG's next step is to determine how to measure the outcomes included in the core set. Published by the BMJ Publishing Group Limited. For permission to use (where not already granted under a licence) please go to http://group.bmj.com/group/rights-licensing/permissions.
A model for enhancing Internet medical document retrieval with "medical core metadata".
Malet, G; Munoz, F; Appleyard, R; Hersh, W
1999-01-01
Finding documents on the World Wide Web relevant to a specific medical information need can be difficult. The goal of this work is to define a set of document content description tags, or metadata encodings, that can be used to promote disciplined search access to Internet medical documents. The authors based their approach on a proposed metadata standard, the Dublin Core Metadata Element Set, which has recently been submitted to the Internet Engineering Task Force. Their model also incorporates the National Library of Medicine's Medical Subject Headings (MeSH) vocabulary and MEDLINE-type content descriptions. The model defines a medical core metadata set that can be used to describe the metadata for a wide variety of Internet documents. The authors propose that their medical core metadata set be used to assign metadata to medical documents to facilitate document retrieval by Internet search engines.
A Model for Enhancing Internet Medical Document Retrieval with “Medical Core Metadata”
Malet, Gary; Munoz, Felix; Appleyard, Richard; Hersh, William
1999-01-01
Objective: Finding documents on the World Wide Web relevant to a specific medical information need can be difficult. The goal of this work is to define a set of document content description tags, or metadata encodings, that can be used to promote disciplined search access to Internet medical documents. Design: The authors based their approach on a proposed metadata standard, the Dublin Core Metadata Element Set, which has recently been submitted to the Internet Engineering Task Force. Their model also incorporates the National Library of Medicine's Medical Subject Headings (MeSH) vocabulary and Medline-type content descriptions. Results: The model defines a medical core metadata set that can be used to describe the metadata for a wide variety of Internet documents. Conclusions: The authors propose that their medical core metadata set be used to assign metadata to medical documents to facilitate document retrieval by Internet search engines. PMID:10094069
Defining Outcome Measures for Psoriasis: The IDEOM Report from the GRAPPA 2016 Annual Meeting.
Callis Duffin, Kristina; Gottlieb, Alice B; Merola, Joseph F; Latella, John; Garg, Amit; Armstrong, April W
2017-05-01
The International Dermatology Outcome Measures (IDEOM) psoriasis working group was established to develop core domains and measurements sets for psoriasis clinical trials and ultimately clinical practice. At the 2016 annual meeting of the Group for Research and Assessment of Psoriasis and Psoriatic Arthritis, the IDEOM psoriasis group presented an overview of its progress toward developing this psoriasis core domain set. First, it summarized the February 2016 meeting of all involved with the IDEOM, highlighting patient and payer perspectives on outcome measures. Second, the group presented an overview of the consensus process for developing the core domain set for psoriasis, including previous literature reviews, nominal group exercises, and meeting discussions. Future plans include the development of working groups to review candidate measures for at least 2 of the domains, including primary pathophysiologic manifestations and patient-reported outcomes, and Delphi surveys to gain consensus on the final psoriasis core domain set.
Nourdin-Galindo, Guillermo; Sánchez, Patricio; Molina, Cristian F; Espinoza-Rojas, Daniela A; Oliver, Cristian; Ruiz, Pamela; Vargas-Chacoff, Luis; Cárcamo, Juan G; Figueroa, Jaime E; Mancilla, Marcos; Maracaja-Coutinho, Vinicius; Yañez, Alejandro J
2017-01-01
Piscirickettsia salmonis is the etiological agent of salmonid rickettsial septicemia, a disease that seriously affects the salmonid industry. Despite efforts to genomically characterize P. salmonis , functional information on the life cycle, pathogenesis mechanisms, diagnosis, treatment, and control of this fish pathogen remain lacking. To address this knowledge gap, the present study conducted an in silico pan-genome analysis of 19 P. salmonis strains from distinct geographic locations and genogroups. Results revealed an expected open pan-genome of 3,463 genes and a core-genome of 1,732 genes. Two marked genogroups were identified, as confirmed by phylogenetic and phylogenomic relationships to the LF-89 and EM-90 reference strains, as well as by assessments of genomic structures. Different structural configurations were found for the six identified copies of the ribosomal operon in the P. salmonis genome, indicating translocation throughout the genetic material. Chromosomal divergences in genomic localization and quantity of genetic cassettes were also found for the Dot/Icm type IVB secretion system. To determine divergences between core-genomes, additional pan-genome descriptions were compiled for the so-termed LF and EM genogroups. Open pan-genomes composed of 2,924 and 2,778 genes and core-genomes composed of 2,170 and 2,228 genes were respectively found for the LF and EM genogroups. The core-genomes were functionally annotated using the Gene Ontology, KEGG, and Virulence Factor databases, revealing the presence of several shared groups of genes related to basic function of intracellular survival and bacterial pathogenesis. Additionally, the specific pan-genomes for the LF and EM genogroups were defined, resulting in the identification of 148 and 273 exclusive proteins, respectively. Notably, specific virulence factors linked to adherence, colonization, invasion factors, and endotoxins were established. The obtained data suggest that these genes could be directly associated with inter-genogroup differences in pathogenesis and host-pathogen interactions, information that could be useful in designing novel strategies for diagnosing and controlling P. salmonis infection.
Nourdin-Galindo, Guillermo; Sánchez, Patricio; Molina, Cristian F.; Espinoza-Rojas, Daniela A.; Oliver, Cristian; Ruiz, Pamela; Vargas-Chacoff, Luis; Cárcamo, Juan G.; Figueroa, Jaime E.; Mancilla, Marcos; Maracaja-Coutinho, Vinicius; Yañez, Alejandro J.
2017-01-01
Piscirickettsia salmonis is the etiological agent of salmonid rickettsial septicemia, a disease that seriously affects the salmonid industry. Despite efforts to genomically characterize P. salmonis, functional information on the life cycle, pathogenesis mechanisms, diagnosis, treatment, and control of this fish pathogen remain lacking. To address this knowledge gap, the present study conducted an in silico pan-genome analysis of 19 P. salmonis strains from distinct geographic locations and genogroups. Results revealed an expected open pan-genome of 3,463 genes and a core-genome of 1,732 genes. Two marked genogroups were identified, as confirmed by phylogenetic and phylogenomic relationships to the LF-89 and EM-90 reference strains, as well as by assessments of genomic structures. Different structural configurations were found for the six identified copies of the ribosomal operon in the P. salmonis genome, indicating translocation throughout the genetic material. Chromosomal divergences in genomic localization and quantity of genetic cassettes were also found for the Dot/Icm type IVB secretion system. To determine divergences between core-genomes, additional pan-genome descriptions were compiled for the so-termed LF and EM genogroups. Open pan-genomes composed of 2,924 and 2,778 genes and core-genomes composed of 2,170 and 2,228 genes were respectively found for the LF and EM genogroups. The core-genomes were functionally annotated using the Gene Ontology, KEGG, and Virulence Factor databases, revealing the presence of several shared groups of genes related to basic function of intracellular survival and bacterial pathogenesis. Additionally, the specific pan-genomes for the LF and EM genogroups were defined, resulting in the identification of 148 and 273 exclusive proteins, respectively. Notably, specific virulence factors linked to adherence, colonization, invasion factors, and endotoxins were established. The obtained data suggest that these genes could be directly associated with inter-genogroup differences in pathogenesis and host-pathogen interactions, information that could be useful in designing novel strategies for diagnosing and controlling P. salmonis infection. PMID:29164068
Schmidt, Johanna; Jezberová, Jitka; Koll, Ulrike; Hahn, Martin W.
2016-01-01
ABSTRACT Microdiversification of a planktonic freshwater bacterium was studied by comparing 37 Polynucleobacter asymbioticus strains obtained from three geographically separated sites in the Austrian Alps. Genome comparison of nine strains revealed a core genome of 1.8 Mb, representing 81% of the average genome size. Seventy-five percent of the remaining flexible genome is clustered in genomic islands (GIs). Twenty-four genomic positions could be identified where GIs are potentially located. These positions are occupied strain specifically from a set of 28 GI variants, classified according to similarities in their gene content. One variant, present in 62% of the isolates, encodes a pathway for the degradation of aromatic compounds, and another, found in 78% of the strains, contains an operon for nitrate assimilation. Both variants were shown in ecophysiological tests to be functional, thus providing the potential for microniche partitioning. In addition, detected interspecific horizontal exchange of GIs indicates a large gene pool accessible to Polynucleobacter species. In contrast to core genes, GIs are spread more successfully across spatially separated freshwater habitats. The mobility and functional diversity of GIs allow for rapid evolution, which may be a key aspect for the ubiquitous occurrence of Polynucleobacter bacteria. IMPORTANCE Assessing the ecological relevance of bacterial diversity is a key challenge for current microbial ecology. The polyphasic approach which was applied in this study, including targeted isolation of strains, genome analysis, and ecophysiological tests, is crucial for the linkage of genetic and ecological knowledge. Particularly great importance is attached to the high number of closely related strains which were investigated, represented by genome-wide average nucleotide identities (ANI) larger than 97%. The extent of functional diversification found on this narrow phylogenetic scale is compelling. Moreover, the transfer of metabolically relevant genomic islands between more distant members of the Polynucleobacter community provides important insights toward a better understanding of the evolution of these globally abundant freshwater bacteria. PMID:27836842
Li, D; Kong, Y; Yu, H; Lehtinen, A; Huang, H; Shen, F; Min, L; Zhou, J; Tang, G; Wang, Q
2008-04-01
A novel kind of non-viral gene delivery vector based on transferrin (Tf) as the core component was constructed with high transfection efficiency and low toxicity. The synthesis vector of Tf-PEI600 was confirmed by different physicochemical methods, including (1)H nuclear magnetic resonance, gel permeation chromatography, X-ray and thermogravimetric analysis. The cytotoxicity and gene delivery efficiency of the synthesized vector were verified by in vitro experiments. The agarose gel electrophoresis assay indicated that the novel copolymer Tf-PEI600 could efficiently condense plasmid DNA and the condensed nanoparticles exhibited a spherical shape. As the weight ratio of Tf-PEI600 to DNA reached 15.0, the particle size (about 200 nm) and the zeta potential (about 20 mV) of the nanoparticles became optimal for gene delivery. The methylthiazolyl tetrazolium (MTT) assay showed the cytotoxicity of Tf-PEI600 to be similar to that of PEI600 and much lower than that of PEI25kDa. In gene-delivery experiments with COS-7 cells and HepG2 cells, the Tf-PEI600 showed about a 30- to 53-fold higher efficiency than PEI600 and nearly equal to that of PEI25kDa. These data suggest that Tf-PEI600, with the advantages of low toxicity and high gene-delivery efficiency, might have great prospects in the practice of gene delivery. The core-shell structure of Tf-PEI600 also provided a novel strategy for the construction of non-viral gene delivery vectors.
Chiarotto, Alessandro; Terwee, Caroline B; Deyo, Richard A; Boers, Maarten; Lin, Chung-Wei Christine; Buchbinder, Rachelle; Corbin, Terry P; Costa, Leonardo O P; Foster, Nadine E; Grotle, Margreth; Koes, Bart W; Kovacs, Francisco M; Maher, Chris G; Pearson, Adam M; Peul, Wilco C; Schoene, Mark L; Turk, Dennis C; van Tulder, Maurits W; Ostelo, Raymond W
2014-12-26
Low back pain (LBP) is one of the most disabling and costly disorders affecting modern society, and approximately 90% of patients are labelled as having non-specific LBP (NSLBP). Several interventions for patients with NSLBP have been assessed in clinical trials, but heterogeneous reporting of outcomes in these trials has hindered comparison of results and performance of meta-analyses. Moreover, there is a risk of selective outcome reporting bias. To address these issues, the development of a core outcome set (COS) that should be measured in all clinical trials for a specific health condition has been recommended. A standardized set of outcomes for LBP was proposed in 1998, however, with evolution in COS development methodology, new instruments, interventions, and understanding of measurement properties, it is appropriate to update that proposal. This protocol describes the methods used in the initial step in developing a COS for NSLBP, namely, establishing a core domain set that should be measured in all clinical trials. An International Steering Committee including researchers, clinicians, and patient representatives from four continents was formed to guide the development of this COS. The approach of initiatives like Core Outcome Measures in Effectiveness Trials (COMET) and Outcome Measures in Rheumatology (OMERACT) was followed. Participants were invited to participate in a Delphi study aimed at generating a consensus-based core domain set for NSLBP. A list of potential core domains was drafted and presented to the Delphi participants who were asked to judge which domains were core. Participant suggestions about overlap, aggregation, or addition of potential core domains were addressed during the study. The patients' responses were isolated to assess whether there was substantial disagreement with the rest of the Delphi panel. A priori thresholds for consensus were established before each Delphi round. All participants' responses were analysed from a quantitative and qualitative perspective to ascertain that no substantial discrepancies between the two approaches emerged. We present the initial step in developing a COS for NSLBP. The next step will be to determine which measurement instruments adequately cover the domains.
Effect of the absolute statistic on gene-sampling gene-set analysis methods.
Nam, Dougu
2017-06-01
Gene-set enrichment analysis and its modified versions have commonly been used for identifying altered functions or pathways in disease from microarray data. In particular, the simple gene-sampling gene-set analysis methods have been heavily used for datasets with only a few sample replicates. The biggest problem with this approach is the highly inflated false-positive rate. In this paper, the effect of absolute gene statistic on gene-sampling gene-set analysis methods is systematically investigated. Thus far, the absolute gene statistic has merely been regarded as a supplementary method for capturing the bidirectional changes in each gene set. Here, it is shown that incorporating the absolute gene statistic in gene-sampling gene-set analysis substantially reduces the false-positive rate and improves the overall discriminatory ability. Its effect was investigated by power, false-positive rate, and receiver operating curve for a number of simulated and real datasets. The performances of gene-set analysis methods in one-tailed (genome-wide association study) and two-tailed (gene expression data) tests were also compared and discussed.
Gene essentiality, conservation index and co-evolution of genes in cyanobacteria.
Tiruveedula, Gopi Siva Sai; Wangikar, Pramod P
2017-01-01
Cyanobacteria, a group of photosynthetic prokaryotes, dominate the earth with ~ 1015 g wet biomass. Despite diversity in habitats and an ancient origin, cyanobacterial phylum has retained a significant core genome. Cyanobacteria are being explored for direct conversion of solar energy and carbon dioxide into biofuels. For this, efficient cyanobacterial strains will need to be designed via metabolic engineering. This will require identification of target knockouts to channelize the flow of carbon toward the product of interest while minimizing deletions of essential genes. We propose "Gene Conservation Index" (GCI) as a quick measure to predict gene essentiality in cyanobacteria. GCI is based on phylogenetic profile of a gene constructed with a reduced dataset of cyanobacterial genomes. GCI is the percentage of organism clusters in which the query gene is present in the reduced dataset. Of the 750 genes deemed to be essential in the experimental study on S. elongatus PCC 7942, we found 494 to be conserved across the phylum which largely comprise of the essential metabolic pathways. On the contrary, the conserved but non-essential genes broadly comprise of genes required under stress conditions. Exceptions to this rule include genes such as the glycogen synthesis and degradation enzymes, deoxyribose-phosphate aldolase (DERA), glucose-6-phosphate 1-dehydrogenase (zwf) and fructose-1,6-bisphosphatase class1, which are conserved but non-essential. While the essential genes are to be avoided during gene knockout studies as potentially lethal deletions, the non-essential but conserved set of genes could be interesting targets for metabolic engineering. Further, we identify clusters of co-evolving genes (CCG), which provide insights that may be useful in annotation. Principal component analysis (PCA) plots of the CCGs are demonstrated as data visualization tools that are complementary to the conventional heatmaps. Our dataset consists of phylogenetic profiles for 23,643 non-redundant cyanobacterial genes. We believe that the data and the analysis presented here will be a great resource to the scientific community interested in cyanobacteria.
Structure and Evolution of Chlorate Reduction Composite Transposons
Clark, Iain C.; Melnyk, Ryan A.; Engelbrektson, Anna; Coates, John D.
2013-01-01
ABSTRACT The genes for chlorate reduction in six bacterial strains were analyzed in order to gain insight into the metabolism. A newly isolated chlorate-reducing bacterium (Shewanella algae ACDC) and three previously isolated strains (Ideonella dechloratans, Pseudomonas sp. strain PK, and Dechloromarinus chlorophilus NSS) were genome sequenced and compared to published sequences (Alicycliphilus denitrificans BC plasmid pALIDE01 and Pseudomonas chloritidismutans AW-1). De novo assembly of genomes failed to join regions adjacent to genes involved in chlorate reduction, suggesting the presence of repeat regions. Using a bioinformatics approach and finishing PCRs to connect fragmented contigs, we discovered that chlorate reduction genes are flanked by insertion sequences, forming composite transposons in all four newly sequenced strains. These insertion sequences delineate regions with the potential to move horizontally and define a set of genes that may be important for chlorate reduction. In addition to core metabolic components, we have highlighted several such genes through comparative analysis and visualization. Phylogenetic analysis places chlorate reductase within a functionally diverse clade of type II dimethyl sulfoxide (DMSO) reductases, part of a larger family of enzymes with reactivity toward chlorate. Nucleotide-level forensics of regions surrounding chlorite dismutase (cld), as well as its phylogenetic clustering in a betaproteobacterial Cld clade, indicate that cld has been mobilized at least once from a perchlorate reducer to build chlorate respiration. PMID:23919996
Dia, Ndongo; Lavie, Laurence; Méténier, Guy; Toguebaye, Bhen S; Vivarès, Christian P; Cornillot, Emmanuel
2007-03-01
Microsporidia are fungi-related obligate intracellular parasites that infect numerous animals, including man. Encephalitozoon cuniculi harbours a very small genome (2.9 Mbp) with about 2,000 coding sequences (CDSs). Most repeated CDSs are of unknown function and are distributed in subterminal regions that mark the transitions between subtelomeric rDNA units and chromosome cores. A potential multigenic family (interB) encoding proteins within a size range of 579-641 aa was investigated by PCR and RT-PCR. Thirty members were finally assigned to the E. cuniculi interB family and a predominant interB transcript was found to originate from a newly identified gene on chromosome III. Microsporidian species from eight different genera infecting insects, fishes or mammals, were tested for a possible intra-phylum conservation of interB genes. Only representatives of the Encephalitozoon, Vittaforma and Brachiola genera, differing in host range but all able to invade humans, were positive. Molecular karyotyping of Brachiola algerae showed a complex set of chromosome bands, providing a haploid genome size estimate of 15-20 Mbp. In spite of this large difference in genome complexity, B. algerae and E. cuniculi shared some similar interB gene copies and a common location of interB genes in near-rDNA subterminal regions.
Age-dependent regulation of ERF-VII transcription factor activity in Arabidopsis thaliana.
Giuntoli, Beatrice; Shukla, Vinay; Maggiorelli, Federica; Giorgi, Federico M; Lombardi, Lara; Perata, Pierdomenico; Licausi, Francesco
2017-10-01
The Group VII Ethylene Responsive Factors (ERFs-VII) RAP2.2 and RAP2.12 have been mainly characterized with regard to their contribution as activators of fermentation in plants. However, transcriptional changes measured in conditions that stabilize these transcription factors exceed the mere activation of this biochemical pathway, implying additional roles performed by the ERF-VIIs in other processes. We evaluated gene expression in transgenic Arabidopsis lines expressing a stabilized form of RAP2.12, or hampered in ERF-VII activity, and identified genes affected by this transcriptional regulator and its homologs, including some involved in oxidative stress response, which are not universally induced under anaerobic conditions. The contribution of the ERF-VIIs in regulating this set of genes in response to chemically induced or submergence-stimulated mitochondria malfunctioning was found to depend on the plant developmental stage. A similar age-dependent mechanism also restrained ERF-VII activity upon the core-hypoxic genes, independently of the N-end rule pathway, which is accounted for the control of the anaerobic response. To conclude, this study shed new light on a dual role of ERF-VII proteins under submergence: as positive regulators of the hypoxic response and as repressors of oxidative-stress related genes, depending on the developmental stage at which plants are challenged by stress conditions. © 2017 John Wiley & Sons Ltd.
NASA Technical Reports Server (NTRS)
Winchester, S. K.; Selvamurugan, N.; D'Alonzo, R. C.; Partridge, N. C.
2000-01-01
Collagenase-3 mRNA is initially detectable when osteoblasts cease proliferation, increasing during differentiation and mineralization. We showed that this developmental expression is due to an increase in collagenase-3 gene transcription. Mutation of either the activator protein-1 or the runt domain binding site decreased collagenase-3 promoter activity, demonstrating that these sites are responsible for collagenase-3 gene transcription. The activator protein-1 and runt domain binding sites bind members of the activator protein-1 and core-binding factor family of transcription factors, respectively. We identified core-binding factor a1 binding to the runt domain binding site and JunD in addition to a Fos-related antigen binding to the activator protein-1 site. Overexpression of both c-Fos and c-Jun in osteoblasts or core-binding factor a1 increased collagenase-3 promoter activity. Furthermore, overexpression of c-Fos, c-Jun, and core-binding factor a1 synergistically increased collagenase-3 promoter activity. Mutation of either the activator protein-1 or the runt domain binding site resulted in the inability of c-Fos and c-Jun or core-binding factor a1 to increase collagenase-3 promoter activity, suggesting that there is cooperative interaction between the sites and the proteins. Overexpression of Fra-2 and JunD repressed core-binding factor a1-induced collagenase-3 promoter activity. Our results suggest that members of the activator protein-1 and core-binding factor families, binding to the activator protein-1 and runt domain binding sites are responsible for the developmental regulation of collagenase-3 gene expression in osteoblasts.
Ras-mediated deregulation of the circadian clock in cancer.
Relógio, Angela; Thomas, Philippe; Medina-Pérez, Paula; Reischl, Silke; Bervoets, Sander; Gloc, Ewa; Riemer, Pamela; Mang-Fatehi, Shila; Maier, Bert; Schäfer, Reinhold; Leser, Ulf; Herzel, Hanspeter; Kramer, Achim; Sers, Christine
2014-01-01
Circadian rhythms are essential to the temporal regulation of molecular processes in living systems and as such to life itself. Deregulation of these rhythms leads to failures in biological processes and eventually to the manifestation of pathological phenotypes including cancer. To address the questions as to what are the elicitors of a disrupted clock in cancer, we applied a systems biology approach to correlate experimental, bioinformatics and modelling data from several cell line models for colorectal and skin cancer. We found strong and weak circadian oscillators within the same type of cancer and identified a set of genes, which allows the discrimination between the two oscillator-types. Among those genes are IFNGR2, PITX2, RFWD2, PPARγ, LOXL2, Rab6 and SPARC, all involved in cancer-related pathways. Using a bioinformatics approach, we extended the core-clock network and present its interconnection to the discriminative set of genes. Interestingly, such gene signatures link the clock to oncogenic pathways like the RAS/MAPK pathway. To investigate the potential impact of the RAS/MAPK pathway - a major driver of colorectal carcinogenesis - on the circadian clock, we used a computational model which predicted that perturbation of BMAL1-mediated transcription can generate the circadian phenotypes similar to those observed in metastatic cell lines. Using an inducible RAS expression system, we show that overexpression of RAS disrupts the circadian clock and leads to an increase of the circadian period while RAS inhibition causes a shortening of period length, as predicted by our mathematical simulations. Together, our data demonstrate that perturbations induced by a single oncogene are sufficient to deregulate the mammalian circadian clock.
Lee, Mikyung; Huang, Ruili; Tong, Weida
2016-01-01
Nuclear receptors (NRs) are ligand-activated transcriptional regulators that play vital roles in key biological processes such as growth, differentiation, metabolism, reproduction, and morphogenesis. Disruption of NRs can result in adverse health effects such as NR-mediated endocrine disruption. A comprehensive understanding of core transcriptional targets regulated by NRs helps to elucidate their key biological processes in both toxicological and therapeutic aspects. In this study, we applied a probabilistic graphical model to identify the transcriptional targets of NRs and the biological processes they govern. The Tox21 program profiled a collection of approximate 10 000 environmental chemicals and drugs against a panel of human NRs in a quantitative high-throughput screening format for their NR disruption potential. The Japanese Toxicogenomics Project, one of the most comprehensive efforts in the field of toxicogenomics, generated large-scale gene expression profiles on the effect of 131 compounds (in its first phase of study) at various doses, and different durations, and their combinations. We applied author-topic model to these 2 toxicological datasets, which consists of 11 NRs run in either agonist and/or antagonist mode (18 assays total) and 203 in vitro human gene expression profiles connected by 52 shared drugs. As a result, a set of clusters (topics), which consists of a set of NRs and their associated target genes were determined. Various transcriptional targets of the NRs were identified by assays run in either agonist or antagonist mode. Our results were validated by functional analysis and compared with TRANSFAC data. In summary, our approach resulted in effective identification of associated/affected NRs and their target genes, providing biologically meaningful hypothesis embedded in their relationships. PMID:26643261
34. DESPATCH CORE OVENS, GREY IRON FOUNDRY CORE ROOM, BAKES ...
34. DESPATCH CORE OVENS, GREY IRON FOUNDRY CORE ROOM, BAKES CORES THAT ARE NOT MADE ON HEATED OR COLD BOX CORE MACHINES, TO SET BINDING AGENTS MIXED WITH THE SAND CREATING CORES HARD ENOUGH TO WITHSTAND THE FLOW OF MOLTEN IRON INSIDE A MOLD. - Stockham Pipe & Fittings Company, Grey Iron Foundry, 4000 Tenth Avenue North, Birmingham, Jefferson County, AL
Complete genome of Cobetia marina JCM 21022T and phylogenomic analysis of the family Halomonadaceae
NASA Astrophysics Data System (ADS)
Tang, Xianghai; Xu, Kuipeng; Han, Xiaojuan; Mo, Zhaolan; Mao, Yunxiang
2018-03-01
Cobetia marina is a model proteobacteria in researches on marine biofouling. Its taxonomic nomenclature has been revised many times over the past few decades. To better understand the role of the surface-associated lifestyle of C. marina and the phylogeny of the family Halomonadaceae, we sequenced the entire genome of C. marina JCM 21022T using single molecule real-time sequencing technology (SMRT) and performed comparative genomics and phylogenomics analyses. The circular chromosome was 4 176 300 bp with an average GC content of 62.44% and contained 3 611 predicted coding sequences, 72 tRNA genes, and 21 rRNA genes. The C. marina JCM 21022T genome contained a set of crucial genes involved in surface colonization processes. The comparative genome analysis indicated the significant differences between C. marina JCM 21022T and Cobetia amphilecti KMM 296 (formerly named C. marina KMM 296) resulted from sequence insertions or deletions and chromosomal recombination. Despite these differences, pan and core genome analysis showed similar gene functions between the two strains. The phylogenomic study of the family Halomonadaceae is reported here for the first time. We found that the relationships were well resolved among every genera tested, including Chromohalobacter, Halomonas, Cobetia, Kushneria, Zymobacter, and Halotalea.
Humphry, Matt; Bednarek, Paweł; Kemmerling, Birgit; Koh, Serry; Stein, Mónica; Göbel, Ulrike; Stüber, Kurt; Piślewska-Bednarek, Mariola; Loraine, Ann; Schulze-Lefert, Paul; Somerville, Shauna; Panstruga, Ralph
2010-01-01
At least two components that modulate plant resistance against the fungal powdery mildew disease are ancient and have been conserved since the time of the monocot–dicot split (≈200 Mya). These components are the seven transmembrane domain containing MLO/MLO2 protein and the syntaxin ROR2/PEN1, which act antagonistically and have been identified in the monocot barley (Hordeum vulgare) and the dicot Arabidopsis thaliana, respectively. Additionally, syntaxin-interacting N-ethylmaleimide sensitive factor adaptor protein receptor proteins (VAMP721/722 and SNAP33/34) as well as a myrosinase (PEN2) and an ABC transporter (PEN3) contribute to antifungal resistance in both barley and/or Arabidopsis. Here, we show that these genetically defined defense components share a similar set of coexpressed genes in the two plant species, comprising a statistically significant overrepresentation of gene products involved in regulation of transcription, posttranslational modification, and signaling. Most of the coexpressed Arabidopsis genes possess a common cis-regulatory element that may dictate their coordinated expression. We exploited gene coexpression to uncover numerous components in Arabidopsis involved in antifungal defense. Together, our data provide evidence for an evolutionarily conserved regulon composed of core components and clade/species-specific innovations that functions as a module in plant innate immunity. PMID:21098265
Zhou, Yong; Zhu, Jinyan; Li, Zhengyi; Yi, Chuandeng; Liu, Jun; Zhang, Honggen; Tang, Shuzhu; Gu, Minghong; Liang, Guohua
2009-09-01
Rice plant architecture is an important agronomic trait and a major determinant in high productivity. Panicle erectness is the preferred plant architecture in japonica rice, but the molecular mechanism underlying domestication of the erect panicle remains elusive. Here we report the map-based cloning of a major quantitative trait locus, qPE9-1, which plays an integral role in regulation of rice plant architecture including panicle erectness. The R6547 qPE9-1 gene encodes a 426-amino-acid protein, homologous to the keratin-associated protein 5-4 family. The gene is composed of three Von Willebrand factor type C domains, one transmembrane domain, and one 4-disulfide-core domain. Phenotypic comparisons of a set of near-isogenic lines and transgenic lines reveal that the functional allele (qPE9-1) results in drooping panicles, and the loss-of-function mutation (qpe9-1) leads to more erect panicles. In addition, the qPE9-1 locus regulates panicle and grain length, grain weight, and consequently grain yield. We propose that the panicle erectness trait resulted from a natural random loss-of-function mutation for the qPE9-1 gene and has subsequently been the target of artificial selection during japonica rice breeding.
Complete genome of Cobetia marina JCM 21022T and phylogenomic analysis of the family Halomonadaceae
NASA Astrophysics Data System (ADS)
Tang, Xianghai; Xu, Kuipeng; Han, Xiaojuan; Mo, Zhaolan; Mao, Yunxiang
2016-09-01
Cobetia marina is a model proteobacteria in researches on marine biofouling. Its taxonomic nomenclature has been revised many times over the past few decades. To better understand the role of the surface-associated lifestyle of C. marina and the phylogeny of the family Halomonadaceae, we sequenced the entire genome of C. marina JCM 21022T using single molecule real-time sequencing technology (SMRT) and performed comparative genomics and phylogenomics analyses. The circular chromosome was 4 176 300 bp with an average GC content of 62.44% and contained 3 611 predicted coding sequences, 72 tRNA genes, and 21 rRNA genes. The C. marina JCM 21022T genome contained a set of crucial genes involved in surface colonization processes. The comparative genome analysis indicated the significant diff erences between C. marina JCM 21022T and Cobetia amphilecti KMM 296 (formerly named C. marina KMM 296) resulted from sequence insertions or deletions and chromosomal recombination. Despite these diff erences, pan and core genome analysis showed similar gene functions between the two strains. The phylogenomic study of the family Halomonadaceae is reported here for the first time. We found that the relationships were well resolved among every genera tested, including Chromohalobacter, Halomonas, Cobetia, Kushneria, Zymobacter, and Halotalea.
Molecular mechanisms of system responses to novel stimuli are predictable from public data
Danziger, Samuel A.; Ratushny, Alexander V.; Smith, Jennifer J.; Saleem, Ramsey A.; Wan, Yakun; Arens, Christina E.; Armstrong, Abraham M.; Sitko, Katherine; Chen, Wei-Ming; Chiang, Jung-Hsien; Reiss, David J.; Baliga, Nitin S.; Aitchison, John D.
2014-01-01
Systems scale models provide the foundation for an effective iterative cycle between hypothesis generation, experiment and model refinement. Such models also enable predictions facilitating the understanding of biological complexity and the control of biological systems. Here, we demonstrate the reconstruction of a globally predictive gene regulatory model from public data: a model that can drive rational experiment design and reveal new regulatory mechanisms underlying responses to novel environments. Specifically, using ∼1500 publically available genome-wide transcriptome data sets from Saccharomyces cerevisiae, we have reconstructed an environment and gene regulatory influence network that accurately predicts regulatory mechanisms and gene expression changes on exposure of cells to completely novel environments. Focusing on transcriptional networks that induce peroxisomes biogenesis, the model-guided experiments allow us to expand a core regulatory network to include novel transcriptional influences and linkage across signaling and transcription. Thus, the approach and model provides a multi-scalar picture of gene dynamics and are powerful resources for exploiting extant data to rationally guide experimentation. The techniques outlined here are generally applicable to any biological system, which is especially important when experimental systems are challenging and samples are difficult and expensive to obtain—a common problem in laboratory animal and human studies. PMID:24185701
The limitations of simple gene set enrichment analysis assuming gene independence.
Tamayo, Pablo; Steinhardt, George; Liberzon, Arthur; Mesirov, Jill P
2016-02-01
Since its first publication in 2003, the Gene Set Enrichment Analysis method, based on the Kolmogorov-Smirnov statistic, has been heavily used, modified, and also questioned. Recently a simplified approach using a one-sample t-test score to assess enrichment and ignoring gene-gene correlations was proposed by Irizarry et al. 2009 as a serious contender. The argument criticizes Gene Set Enrichment Analysis's nonparametric nature and its use of an empirical null distribution as unnecessary and hard to compute. We refute these claims by careful consideration of the assumptions of the simplified method and its results, including a comparison with Gene Set Enrichment Analysis's on a large benchmark set of 50 datasets. Our results provide strong empirical evidence that gene-gene correlations cannot be ignored due to the significant variance inflation they produced on the enrichment scores and should be taken into account when estimating gene set enrichment significance. In addition, we discuss the challenges that the complex correlation structure and multi-modality of gene sets pose more generally for gene set enrichment methods. © The Author(s) 2012.
Christensen, Anna L; Petersen, Dana M; Burton, Rachel A; Forsberg, Vanessa C; Devers, Kelly J
2017-01-01
Objectives The objective of this study was to describe factors that influence the ability of state Medicaid agencies to report the Centers for Medicare & Medicaid Services' (CMS) core set of children's health care quality measures (Child Core Set). Methods We conducted a multiple-case study of four high-performing states participating in the Children's Health Insurance Program Reauthorization Act (CHIPRA) Quality Demonstration Grant Program: Illinois, Maine, Pennsylvania, and Oregon. Cases were purposively selected for their diverse measurement approaches and used data from 2010 to 2015, including 154 interviews, semiannual grant progress reports, and annual public reports on Child Core Set measures. We followed Yin's multiple-case study methodology to describe how and why each state increased the number of measures reported to CMS. Results All four states increased the number of Child Core Set measures reported to CMS during the grant period. Each took a different approach to reporting, depending on the available technical, organizational, and behavioral inputs in the state. Reporting capacity was influenced by a state's Medicaid data availability, ability to link to other state data systems, past experience with quality measurement, staff time and technical expertise, and demand for the measures. These factors were enhanced by CHIPRA Quality Demonstration grant funding and other federal capacity building activities, as hypothesized in our conceptual framework. These and other states have made progress reporting the Child Core Set since 2010. Conclusion With financial support and investment in state data systems and organizational factors, states can overcome challenges to reporting most of the Child Core Set measures.