unique genomic signatures: Topics by Science.gov

Sample records for unique genomic signatures

Sequencing Needs for Viral Diagnostics

DOE Office of Scientific and Technical Information (OSTI.GOV)

Gardner, S N; Lam, M; Mulakken, N J

2004-01-26

We built a system to guide decisions regarding the amount of genomic sequencing required to develop diagnostic DNA signatures, which are short sequences that are sufficient to uniquely identify a viral species. We used our existing DNA diagnostic signature prediction pipeline, which selects regions of a target species genome that are conserved among strains of the target (for reliability, to prevent false negatives) and unique relative to other species (for specificity, to avoid false positives). We performed simulations, based on existing sequence data, to assess the number of genome sequences of a target species and of close phylogenetic relatives (''nearmore » neighbors'') that are required to predict diagnostic signature regions that are conserved among strains of the target species and unique relative to other bacterial and viral species. For DNA viruses such as variola (smallpox), three target genomes provide sufficient guidance for selecting species-wide signatures. Three near neighbor genomes are critical for species specificity. In contrast, most RNA viruses require four target genomes and no near neighbor genomes, since lack of conservation among strains is more limiting than uniqueness. SARS and Ebola Zaire are exceptional, as additional target genomes currently do not improve predictions, but near neighbor sequences are urgently needed. Our results also indicate that double stranded DNA viruses are more conserved among strains than are RNA viruses, since in most cases there was at least one conserved signature candidate for the DNA viruses and zero conserved signature candidates for the RNA viruses.« less
An archaeal genomic signature

NASA Technical Reports Server (NTRS)

Graham, D. E.; Overbeek, R.; Olsen, G. J.; Woese, C. R.

2000-01-01

Comparisons of complete genome sequences allow the most objective and comprehensive descriptions possible of a lineage's evolution. This communication uses the completed genomes from four major euryarchaeal taxa to define a genomic signature for the Euryarchaeota and, by extension, the Archaea as a whole. The signature is defined in terms of the set of protein-encoding genes found in at least two diverse members of the euryarchaeal taxa that function uniquely within the Archaea; most signature proteins have no recognizable bacterial or eukaryal homologs. By this definition, 351 clusters of signature proteins have been identified. Functions of most proteins in this signature set are currently unknown. At least 70% of the clusters that contain proteins from all the euryarchaeal genomes also have crenarchaeal homologs. This conservative set, which appears refractory to horizontal gene transfer to the Bacteria or the Eukarya, would seem to reflect the significant innovations that were unique and fundamental to the archaeal "design fabric." Genomic protein signature analysis methods may be extended to characterize the evolution of any phylogenetically defined lineage. The complete set of protein clusters for the archaeal genomic signature is presented as supplementary material (see the PNAS web site, www.pnas.org).
The Laccaria and Tuber Genomes Reveal Unique Signatures of Mycorrhizal Symbiosis Evolution (2010 JGI User Meeting)

DOE Office of Scientific and Technical Information (OSTI.GOV)

Martin, Francis

Francis Martin from the French National Institute for Agricultural Research (INRA) talks on how "The Laccaria and Tuber genomes reveal unique signatures of mycorrhizal symbiosis evolution" on March 24, 2010 at the 5th Annual DOE JGI User Meeting
The spectrum of genomic signatures: from dinucleotides to chaos game representation.

PubMed

Wang, Yingwei; Hill, Kathleen; Singh, Shiva; Kari, Lila

2005-02-14

In the post genomic era, access to complete genome sequence data for numerous diverse species has opened multiple avenues for examining and comparing primary DNA sequence organization of entire genomes. Previously, the concept of a genomic signature was introduced with the observation of species-type specific Dinucleotide Relative Abundance Profiles (DRAPs); dinucleotides were identified as the subsequences with the greatest bias in representation in a majority of genomes. Herein, we demonstrate that DRAP is one particular genomic signature contained within a broader spectrum of signatures. Within this spectrum, an alternative genomic signature, Chaos Game Representation (CGR), provides a unique visualization of patterns in sequence organization. A genomic signature is associated with a particular integer order or subsequence length that represents a measure of the resolution or granularity in the analysis of primary DNA sequence organization. We quantitatively explore the organizational information provided by genomic signatures of different orders through different distance measures, including a novel Image Distance. The Image Distance and other existing distance measures are evaluated by comparing the phylogenetic trees they generate for 26 complete mitochondrial genomes from a diversity of species. The phylogenetic tree generated by the Image Distance is compatible with the known relatedness of species. Quantitative evaluation of the spectrum of genomic signatures may be used to ultimately gain insight into the determinants and biological relevance of the genome signatures.
Microbial Lifestyle and Genome Signatures

PubMed Central

Dutta, Chitra; Paul, Sandip

2012-01-01

Microbes are known for their unique ability to adapt to varying lifestyle and environment, even to the extreme or adverse ones. The genomic architecture of a microbe may bear the signatures not only of its phylogenetic position, but also of the kind of lifestyle to which it is adapted. The present review aims to provide an account of the specific genome signatures observed in microbes acclimatized to distinct lifestyles or ecological niches. Niche-specific signatures identified at different levels of microbial genome organization like base composition, GC-skew, purine-pyrimidine ratio, dinucleotide abundance, codon bias, oligonucleotide composition etc. have been discussed. Among the specific cases highlighted in the review are the phenomena of genome shrinkage in obligatory host-restricted microbes, genome expansion in strictly intra-amoebal pathogens, strand-specific codon usage in intracellular species, acquisition of genome islands in pathogenic or symbiotic organisms, discriminatory genomic traits of marine microbes with distinct trophic strategies, and conspicuous sequence features of certain extremophiles like those adapted to high temperature or high salinity. PMID:23024607
HTSFinder: Powerful Pipeline of DNA Signature Discovery by Parallel and Distributed Computing

PubMed Central

Karimi, Ramin; Hajdu, Andras

2016-01-01

Comprehensive effort for low-cost sequencing in the past few years has led to the growth of complete genome databases. In parallel with this effort, a strong need, fast and cost-effective methods and applications have been developed to accelerate sequence analysis. Identification is the very first step of this task. Due to the difficulties, high costs, and computational challenges of alignment-based approaches, an alternative universal identification method is highly required. Like an alignment-free approach, DNA signatures have provided new opportunities for the rapid identification of species. In this paper, we present an effective pipeline HTSFinder (high-throughput signature finder) with a corresponding k-mer generator GkmerG (genome k-mers generator). Using this pipeline, we determine the frequency of k-mers from the available complete genome databases for the detection of extensive DNA signatures in a reasonably short time. Our application can detect both unique and common signatures in the arbitrarily selected target and nontarget databases. Hadoop and MapReduce as parallel and distributed computing tools with commodity hardware are used in this pipeline. This approach brings the power of high-performance computing into the ordinary desktop personal computers for discovering DNA signatures in large databases such as bacterial genome. A considerable number of detected unique and common DNA signatures of the target database bring the opportunities to improve the identification process not only for polymerase chain reaction and microarray assays but also for more complex scenarios such as metagenomics and next-generation sequencing analysis. PMID:26884678
HTSFinder: Powerful Pipeline of DNA Signature Discovery by Parallel and Distributed Computing.

PubMed

Karimi, Ramin; Hajdu, Andras

2016-01-01

Comprehensive effort for low-cost sequencing in the past few years has led to the growth of complete genome databases. In parallel with this effort, a strong need, fast and cost-effective methods and applications have been developed to accelerate sequence analysis. Identification is the very first step of this task. Due to the difficulties, high costs, and computational challenges of alignment-based approaches, an alternative universal identification method is highly required. Like an alignment-free approach, DNA signatures have provided new opportunities for the rapid identification of species. In this paper, we present an effective pipeline HTSFinder (high-throughput signature finder) with a corresponding k-mer generator GkmerG (genome k-mers generator). Using this pipeline, we determine the frequency of k-mers from the available complete genome databases for the detection of extensive DNA signatures in a reasonably short time. Our application can detect both unique and common signatures in the arbitrarily selected target and nontarget databases. Hadoop and MapReduce as parallel and distributed computing tools with commodity hardware are used in this pipeline. This approach brings the power of high-performance computing into the ordinary desktop personal computers for discovering DNA signatures in large databases such as bacterial genome. A considerable number of detected unique and common DNA signatures of the target database bring the opportunities to improve the identification process not only for polymerase chain reaction and microarray assays but also for more complex scenarios such as metagenomics and next-generation sequencing analysis.
Systematic CpT (ApG) depletion and CpG excess are unique genomic signatures of large DNA viruses infecting invertebrates.

PubMed

Upadhyay, Mohita; Sharma, Neha; Vivekanandan, Perumal

2014-01-01

Differences in the relative abundance of dinucleotides, if any may provide important clues on host-driven evolution of viruses. We studied dinucleotide frequencies of large DNA viruses infecting vertebrates (n = 105; viruses infecting mammals = 99; viruses infecting aves = 6; viruses infecting reptiles = 1) and invertebrates (n = 88; viruses infecting insects = 84; viruses infecting crustaceans = 4). We have identified systematic depletion of CpT(ApG) dinucleotides and over-representation of CpG dinucleotides as the unique genomic signature of large DNA viruses infecting invertebrates. Detailed investigation of this unique genomic signature suggests the existence of invertebrate host-induced pressures specifically targeting CpT(ApG) and CpG dinucleotides. The depletion of CpT dinucleotides among large DNA viruses infecting invertebrates is at least in part, explained by non-canonical DNA methylation by the infected host. Our findings highlight the role of invertebrate host-related factors in shaping virus evolution and they also provide the necessary framework for future studies on evolution, epigenetics and molecular biology of viruses infecting this group of hosts.
Systematic CpT (ApG) Depletion and CpG Excess Are Unique Genomic Signatures of Large DNA Viruses Infecting Invertebrates

PubMed Central

Upadhyay, Mohita; Sharma, Neha; Vivekanandan, Perumal

2014-01-01

Differences in the relative abundance of dinucleotides, if any may provide important clues on host-driven evolution of viruses. We studied dinucleotide frequencies of large DNA viruses infecting vertebrates (n = 105; viruses infecting mammals = 99; viruses infecting aves = 6; viruses infecting reptiles = 1) and invertebrates (n = 88; viruses infecting insects = 84; viruses infecting crustaceans = 4). We have identified systematic depletion of CpT(ApG) dinucleotides and over-representation of CpG dinucleotides as the unique genomic signature of large DNA viruses infecting invertebrates. Detailed investigation of this unique genomic signature suggests the existence of invertebrate host-induced pressures specifically targeting CpT(ApG) and CpG dinucleotides. The depletion of CpT dinucleotides among large DNA viruses infecting invertebrates is at least in part, explained by non-canonical DNA methylation by the infected host. Our findings highlight the role of invertebrate host-related factors in shaping virus evolution and they also provide the necessary framework for future studies on evolution, epigenetics and molecular biology of viruses infecting this group of hosts. PMID:25369195
Genome-wide methylomic and transcriptomic analyses identify subtype-specific epigenetic signatures commonly dysregulated in glioma stem cells and glioblastoma.

PubMed

Pangeni, Rajendra P; Zhang, Zhou; Alvarez, Angel A; Wan, Xuechao; Sastry, Namratha; Lu, Songjian; Shi, Taiping; Huang, Tianzhi; Lei, Charles X; James, C David; Kessler, John A; Brennan, Cameron W; Nakano, Ichiro; Lu, Xinghua; Hu, Bo; Zhang, Wei; Cheng, Shi-Yuan

2018-06-21

Glioma stem cells (GSCs), a subpopulation of tumor cells, contribute to tumor heterogeneity and therapy resistance. Gene expression profiling classified glioblastoma (GBM) and GSCs into four transcriptomically-defined subtypes. Here, we determined the DNA methylation signatures in transcriptomically pre-classified GSC and GBM bulk tumors subtypes. We hypothesized that these DNA methylation signatures correlate with gene expression and are uniquely associated either with only GSCs or only GBM bulk tumors. Additional methylation signatures may be commonly associated with both GSCs and GBM bulk tumors, i.e., common to non-stem-like and stem-like tumor cell populations and correlating with the clinical prognosis of glioma patients. We analyzed Illumina 450K methylation array and expression data from a panel of 23 patient-derived GSCs. We referenced these results with The Cancer Genome Atlas (TCGA) GBM datasets to generate methylomic and transcriptomic signatures for GSCs and GBM bulk tumors of each transcriptomically pre-defined tumor subtype. Survival analyses were carried out for these signature genes using publicly available datasets, including from TCGA. We report that DNA methylation signatures in proneural and mesenchymal tumor subtypes are either unique to GSCs, unique to GBM bulk tumors, or common to both. Further, dysregulated DNA methylation correlates with gene expression and clinical prognoses. Additionally, many previously identified transcriptionally-regulated markers are also dysregulated due to DNA methylation. The subtype-specific DNA methylation signatures described in this study could be useful for refining GBM sub-classification, improving prognostic accuracy, and making therapeutic decisions.
Evaluation of Signature Erosion in Ebola Virus Due to Genomic Drift and Its Impact on the Performance of Diagnostic Assays

PubMed Central

Sozhamannan, Shanmuga; Holland, Mitchell Y.; Hall, Adrienne T.; Negrón, Daniel A.; Ivancich, Mychal; Koehler, Jeffrey W.; Minogue, Timothy D.; Campbell, Catherine E.; Berger, Walter J.; Christopher, George W.; Goodwin, Bruce G.; Smith, Michael A.

2015-01-01

Genome sequence analyses of the 2014 Ebola Virus (EBOV) isolates revealed a potential problem with the diagnostic assays currently in use; i.e., drifting genomic profiles of the virus may affect the sensitivity or even produce false-negative results. We evaluated signature erosion in ebolavirus molecular assays using an in silico approach and found frequent potential false-negative and false-positive results. We further empirically evaluated many EBOV assays, under real time PCR conditions using EBOV Kikwit (1995) and Makona (2014) RNA templates. These results revealed differences in performance between assays but were comparable between the old and new EBOV templates. Using a whole genome approach and a novel algorithm, termed BioVelocity, we identified new signatures that are unique to each of EBOV, Sudan virus (SUDV), and Reston virus (RESTV). Interestingly, many of the current assay signatures do not fall within these regions, indicating a potential drawback in the past assay design strategies. The new signatures identified in this study may be evaluated with real-time reverse transcription PCR (rRT-PCR) assay development and validation. In addition, we discuss regulatory implications and timely availability to impact a rapidly evolving outbreak using existing but perhaps less than optimal assays versus redesign these assays for addressing genomic changes. PMID:26090727
Hypermutation and unique mutational signatures of occupational cholangiocarcinoma in printing workers exposed to haloalkanes.

PubMed

Mimaki, Sachiyo; Totsuka, Yukari; Suzuki, Yutaka; Nakai, Chikako; Goto, Masanori; Kojima, Motohiro; Arakawa, Hirofumi; Takemura, Shigekazu; Tanaka, Shogo; Marubashi, Shigeru; Kinoshita, Masahiko; Matsuda, Tomonari; Shibata, Tatsuhiro; Nakagama, Hitoshi; Ochiai, Atsushi; Kubo, Shoji; Nakamori, Shoji; Esumi, Hiroyasu; Tsuchihara, Katsuya

2016-08-01

Cholangiocarcinoma is a relatively rare cancer, but its incidence is increasing worldwide. Although several risk factors have been suggested, the etiology and pathogenesis of the majority of cholangiocarcinomas remain unclear. Recently, a high incidence of early-onset cholangiocarcinoma was reported among the workers of a printing company in Osaka, Japan. These workers underwent high exposure to organic solvents, mainly haloalkanes such as 1,2-dichloropropane (1,2-DCP) and/or dichloromethane. We performed whole-exome analysis on four cases of cholangiocarcinoma among the printing workers. An average of 44.8 somatic mutations was detected per Mb in the genome of the printing workers' cholangiocarcinoma tissues, approximately 30-fold higher than that found in control common cholangiocarcinoma tissues. Furthermore, C:G-to-T:A transitions with substantial strand bias as well as unique trinucleotide mutational changes of GpCpY to GpTpY and NpCpY to NpTpY or NpApY were predominant in all of the printing workers' cholangiocarcinoma genomes. These results were consistent with the epidemiological observation that they had been exposed to high concentrations of chemical compounds. Whole-genome analysis of Salmonella typhimurium strain TA100 exposed to 1,2-DCP revealed a partial recapitulation of the mutational signature in the printing workers' cholangiocarcinoma. Although our results provide mutational signatures unique to occupational cholangiocarcinoma, the underlying mechanisms of the disease should be further investigated by using appropriate model systems and by comparison with genomic data from other cancers. © The Author 2016. Published by Oxford University Press.
Molecular Biology In Young Women With Breast Cancer: From Tumor Gene Expression To DNA Mutations.

PubMed

Gómez-Flores-Ramos, Liliana; Castro-Sánchez, Andrea; Peña-Curiel, Omar; Mohar-Betancourt, Alejandro

2017-01-01

Young women with breast cancer (YWBC) represent roughly 15% of breast cancer (BC) cases in Latin America and other developing regions. Breast tumors occurring at an early age are more aggressive and have an overall worse prognosis compared to breast tumors in postmenopausal women. The expression of relevant proliferation biomarkers such as endocrine receptors and human epidermal growth factor receptor 2 appears to be unique in YWBC. Moreover, histopathological, molecular, genetic, and genomic studies have shown that YWBC exhibit a higher frequency of aggressive subtypes, differential tumor gene expression, increased genetic susceptibility, and specific genomic signatures, compared to older women with BC. This article reviews the current knowledge on tumor biology and genomic signatures in YWBC.
The genome landscape of indigenous African cattle.

PubMed

Kim, Jaemin; Hanotte, Olivier; Mwai, Okeyo Ally; Dessie, Tadelle; Bashir, Salim; Diallo, Boubacar; Agaba, Morris; Kim, Kwondo; Kwak, Woori; Sung, Samsun; Seo, Minseok; Jeong, Hyeonsoo; Kwon, Taehyung; Taye, Mengistie; Song, Ki-Duk; Lim, Dajeong; Cho, Seoae; Lee, Hyun-Jeong; Yoon, Duhak; Oh, Sung Jong; Kemp, Stephen; Lee, Hak-Kyo; Kim, Heebal

2017-02-20

The history of African indigenous cattle and their adaptation to environmental and human selection pressure is at the root of their remarkable diversity. Characterization of this diversity is an essential step towards understanding the genomic basis of productivity and adaptation to survival under African farming systems. We analyze patterns of African cattle genetic variation by sequencing 48 genomes from five indigenous populations and comparing them to the genomes of 53 commercial taurine breeds. We find the highest genetic diversity among African zebu and sanga cattle. Our search for genomic regions under selection reveals signatures of selection for environmental adaptive traits. In particular, we identify signatures of selection including genes and/or pathways controlling anemia and feeding behavior in the trypanotolerant N'Dama, coat color and horn development in Ankole, and heat tolerance and tick resistance across African cattle especially in zebu breeds. Our findings unravel at the genome-wide level, the unique adaptive diversity of African cattle while emphasizing the opportunities for sustainable improvement of livestock productivity on the continent.
Phylogenomic analyses and molecular signatures for the class Halobacteria and its two major clades: a proposal for division of the class Halobacteria into an emended order Halobacteriales and two new orders, Haloferacales ord. nov. and Natrialbales ord. nov., containing the novel families Haloferacaceae fam. nov. and Natrialbaceae fam. nov.

PubMed

Gupta, Radhey S; Naushad, Sohail; Baker, Sheridan

2015-03-01

The Halobacteria constitute one of the largest groups within the Archaea. The hierarchical relationship among members of this large class, which comprises a single order and a single family, has proven difficult to determine based upon 16S rRNA gene trees and morphological and physiological characteristics. This work reports detailed phylogenetic and comparative genomic studies on >100 halobacterial (haloarchaeal) genomes containing representatives from 30 genera to investigate their evolutionary relationships. In phylogenetic trees reconstructed on the basis of 32 conserved proteins, using both neighbour-joining and maximum-likelihood methods, two major clades (clades A and B) encompassing nearly two-thirds of the sequenced haloarchaeal species were strongly supported. Clades grouping the same species/genera were also supported by the 16S rRNA gene trees and trees for several individual highly conserved proteins (RpoC, EF-Tu, UvrD, GyrA, EF-2/EF-G). In parallel, our comparative analyses of protein sequences from haloarchaeal genomes have identified numerous discrete molecular markers in the form of conserved signature indels (CSI) in protein sequences and conserved signature proteins (CSPs) that are found uniquely in specific groups of haloarchaea. Thirteen CSIs in proteins involved in diverse functions and 68 CSPs that are uniquely present in all or most genome-sequenced haloarchaea provide novel molecular means for distinguishing members of the class Halobacteria from all other prokaryotes. The members of clade A are distinguished from all other haloarchaea by the unique shared presence of two CSIs in the ribose operon protein and small GTP-binding protein and eight CSPs that are found specifically in members of this clade. Likewise, four CSIs in different proteins and five other CSPs are present uniquely in members of clade B and distinguish them from all other haloarchaea. Based upon their specific clustering in phylogenetic trees for different gene/protein sequences and the unique shared presence of large numbers of molecular signatures, members of clades A and B are indicated to be distinct from all other haloarchaea because of their uniquely shared evolutionary histories. Based upon these results, it is proposed that clades A and B be recognized as two new orders, Natrialbales ord. nov. and Haloferacales ord. nov., within the class Halobacteria, containing the novel families Natrialbaceae fam. nov. and Haloferacaceae fam. nov. Other members of the class Halobacteria that are not members of these two orders will remain part of the emended order Halobacteriales in an emended family Halobacteriaceae. © 2015 IUMS.
Defining the Genomic Signature of Totipotency and Pluripotency during Early Human Development

PubMed Central

Galan, Amparo; Diaz-Gimeno, Patricia; Poo, Maria Eugenia; Valbuena, Diana; Sanchez, Eva; Ruiz, Veronica; Dopazo, Joaquin; Montaner, David; Conesa, Ana; Simon, Carlos

2013-01-01

The genetic mechanisms governing human pre-implantation embryo development and the in vitro counterparts, human embryonic stem cells (hESCs), still remain incomplete. Previous global genome studies demonstrated that totipotent blastomeres from day-3 human embryos and pluripotent inner cell masses (ICMs) from blastocysts, display unique and differing transcriptomes. Nevertheless, comparative gene expression analysis has revealed that no significant differences exist between hESCs derived from blastomeres versus those obtained from ICMs, suggesting that pluripotent hESCs involve a new developmental progression. To understand early human stages evolution, we developed an undifferentiation network signature (UNS) and applied it to a differential gene expression profile between single blastomeres from day-3 embryos, ICMs and hESCs. This allowed us to establish a unique signature composed of highly interconnected genes characteristic of totipotency (61 genes), in vivo pluripotency (20 genes), and in vitro pluripotency (107 genes), and which are also proprietary according to functional analysis. This systems biology approach has led to an improved understanding of the molecular and signaling processes governing human pre-implantation embryo development, as well as enabling us to comprehend how hESCs might adapt to in vitro culture conditions. PMID:23614026
A Meta-Assembly of Selection Signatures in Cattle

PubMed Central

Randhawa, Imtiaz A. S.; Khatkar, Mehar S.; Thomson, Peter C.; Raadsma, Herman W.

2016-01-01

Since domestication, significant genetic improvement has been achieved for many traits of commercial importance in cattle, including adaptation, appearance and production. In response to such intense selection pressures, the bovine genome has undergone changes at the underlying regions of functional genetic variants, which are termed “selection signatures”. This article reviews 64 recent (2009–2015) investigations testing genomic diversity for departure from neutrality in worldwide cattle populations. In particular, we constructed a meta-assembly of 16,158 selection signatures for individual breeds and their archetype groups (European, African, Zebu and composite) from 56 genome-wide scans representing 70,743 animals of 90 pure and crossbred cattle breeds. Meta-selection-scores (MSS) were computed by combining published results at every given locus, within a sliding window span. MSS were adjusted for common samples across studies and were weighted for significance thresholds across and within studies. Published selection signatures show extensive coverage across the bovine genome, however, the meta-assembly provides a consensus profile of 263 genomic regions of which 141 were unique (113 were breed-specific) and 122 were shared across cattle archetypes. The most prominent peaks of MSS represent regions under selection across multiple populations and harboured genes of known major effects (coat color, polledness and muscle hypertrophy) and genes known to influence polygenic traits (stature, adaptation, feed efficiency, immunity, behaviour, reproduction, beef and dairy production). As the first meta-assembly of selection signatures, it offers novel insights about the hotspots of selective sweeps in the bovine genome, and this method could equally be applied to other species. PMID:27045296
Distinct Microbial Signatures Associated With Different Breast Cancer Types

PubMed Central

Banerjee, Sagarika; Tian, Tian; Wei, Zhi; Shih, Natalie; Feldman, Michael D.; Peck, Kristen N.; DeMichele, Angela M.; Alwine, James C.; Robertson, Erle S.

2018-01-01

A dysbiotic microbiome can potentially contribute to the pathogenesis of many different diseases including cancer. Breast cancer is the second leading cause of cancer death in women. Thus, we investigated the diversity of the microbiome in the four major types of breast cancer: endocrine receptor (ER) positive, triple positive, Her2 positive and triple negative breast cancers. Using a whole genome and transcriptome amplification and a pan-pathogen microarray (PathoChip) strategy, we detected unique and common viral, bacterial, fungal and parasitic signatures for each of the breast cancer types. These were validated by PCR and Sanger sequencing. Hierarchical cluster analysis of the breast cancer samples, based on their detected microbial signatures, showed distinct patterns for the triple negative and triple positive samples, while the ER positive and Her2 positive samples shared similar microbial signatures. These signatures, unique or common to the different breast cancer types, provide a new line of investigation to gain further insights into prognosis, treatment strategies and clinical outcome, as well as better understanding of the role of the micro-organisms in the development and progression of breast cancer. PMID:29867857
Evolutionary insights from Erwinia amylovora genomics.

PubMed

Smits, Theo H M; Rezzonico, Fabio; Duffy, Brion

2011-08-20

Evolutionary genomics is coming into focus with the recent availability of complete sequences for many bacterial species. A hypothesis on the evolution of virulence factors in the plant pathogen Erwinia amylovora, the causative agent of fire blight, was generated using comparative genomics with the genomes E. amylovora, Erwinia pyrifoliae and Erwinia tasmaniensis. Putative virulence factors were mapped to the proposed genealogy of the genus Erwinia that is based on phylogenetic and genomic data. Ancestral origin of several virulence factors was identified, including levan biosynthesis, sorbitol metabolism, three T3SS and two T6SS. Other factors appeared to have been acquired after divergence of pathogenic species, including a second flagellar gene and two glycosyltransferases involved in amylovoran biosynthesis. E. amylovora singletons include 3 unique T3SS effectors that may explain differential virulence/host ranges. E. amylovora also has a unique T1SS export system, and a unique third T6SS gene cluster. Genetic analysis revealed signatures of foreign DNA suggesting that horizontal gene transfer is responsible for some of these differential features between the three species. Copyright © 2010 Elsevier B.V. All rights reserved.
Genome signature analysis of thermal virus metagenomes reveals Archaea and thermophilic signatures

PubMed Central

Pride, David T; Schoenfeld, Thomas

2008-01-01

Background Metagenomic analysis provides a rich source of biological information for otherwise intractable viral communities. However, study of viral metagenomes has been hampered by its nearly complete reliance on BLAST algorithms for identification of DNA sequences. We sought to develop algorithms for examination of viral metagenomes to identify the origin of sequences independent of BLAST algorithms. We chose viral metagenomes obtained from two hot springs, Bear Paw and Octopus, in Yellowstone National Park, as they represent simple microbial populations where comparatively large contigs were obtained. Thermal spring metagenomes have high proportions of sequences without significant Genbank homology, which has hampered identification of viruses and their linkage with hosts. To analyze each metagenome, we developed a method to classify DNA fragments using genome signature-based phylogenetic classification (GSPC), where metagenomic fragments are compared to a database of oligonucleotide signatures for all previously sequenced Bacteria, Archaea, and viruses. Results From both Bear Paw and Octopus hot springs, each assembled contig had more similarity to other metagenome contigs than to any sequenced microbial genome based on GSPC analysis, suggesting a genome signature common to each of these extreme environments. While viral metagenomes from Bear Paw and Octopus share some similarity, the genome signatures from each locale are largely unique. GSPC using a microbial database predicts most of the Octopus metagenome has archaeal signatures, while bacterial signatures predominate in Bear Paw; a finding consistent with those of Genbank BLAST. When using a viral database, the majority of the Octopus metagenome is predicted to belong to archaeal virus Families Globuloviridae and Fuselloviridae, while none of the Bear Paw metagenome is predicted to belong to archaeal viruses. As expected, when microbial and viral databases are combined, each of the Octopus and Bear Paw metagenomic contigs are predicted to belong to viruses rather than to any Bacteria or Archaea, consistent with the apparent viral origin of both metagenomes. Conclusion That BLAST searches identify no significant homologs for most metagenome contigs, while GSPC suggests their origin as archaeal viruses or bacteriophages, indicates GSPC provides a complementary approach in viral metagenomic analysis. PMID:18798991

Genome signature analysis of thermal virus metagenomes reveals Archaea and thermophilic signatures.

PubMed

Pride, David T; Schoenfeld, Thomas

2008-09-17

Metagenomic analysis provides a rich source of biological information for otherwise intractable viral communities. However, study of viral metagenomes has been hampered by its nearly complete reliance on BLAST algorithms for identification of DNA sequences. We sought to develop algorithms for examination of viral metagenomes to identify the origin of sequences independent of BLAST algorithms. We chose viral metagenomes obtained from two hot springs, Bear Paw and Octopus, in Yellowstone National Park, as they represent simple microbial populations where comparatively large contigs were obtained. Thermal spring metagenomes have high proportions of sequences without significant Genbank homology, which has hampered identification of viruses and their linkage with hosts. To analyze each metagenome, we developed a method to classify DNA fragments using genome signature-based phylogenetic classification (GSPC), where metagenomic fragments are compared to a database of oligonucleotide signatures for all previously sequenced Bacteria, Archaea, and viruses. From both Bear Paw and Octopus hot springs, each assembled contig had more similarity to other metagenome contigs than to any sequenced microbial genome based on GSPC analysis, suggesting a genome signature common to each of these extreme environments. While viral metagenomes from Bear Paw and Octopus share some similarity, the genome signatures from each locale are largely unique. GSPC using a microbial database predicts most of the Octopus metagenome has archaeal signatures, while bacterial signatures predominate in Bear Paw; a finding consistent with those of Genbank BLAST. When using a viral database, the majority of the Octopus metagenome is predicted to belong to archaeal virus Families Globuloviridae and Fuselloviridae, while none of the Bear Paw metagenome is predicted to belong to archaeal viruses. As expected, when microbial and viral databases are combined, each of the Octopus and Bear Paw metagenomic contigs are predicted to belong to viruses rather than to any Bacteria or Archaea, consistent with the apparent viral origin of both metagenomes. That BLAST searches identify no significant homologs for most metagenome contigs, while GSPC suggests their origin as archaeal viruses or bacteriophages, indicates GSPC provides a complementary approach in viral metagenomic analysis.
Phylogenomics and Comparative Genomic Studies Robustly Support Division of the Genus Mycobacterium into an Emended Genus Mycobacterium and Four Novel Genera

PubMed Central

Gupta, Radhey S.; Lo, Brian; Son, Jeen

2018-01-01

The genus Mycobacterium contains 188 species including several major human pathogens as well as numerous other environmental species. We report here comprehensive phylogenomics and comparative genomic analyses on 150 genomes of Mycobacterium species to understand their interrelationships. Phylogenetic trees were constructed for the 150 species based on 1941 core proteins for the genus Mycobacterium, 136 core proteins for the phylum Actinobacteria and 8 other conserved proteins. Additionally, the overall genome similarity amongst the Mycobacterium species was determined based on average amino acid identity of the conserved protein families. The results from these analyses consistently support the existence of five distinct monophyletic groups within the genus Mycobacterium at the highest level, which are designated as the “Tuberculosis-Simiae,” “Terrae,” “Triviale,” “Fortuitum-Vaccae,” and “Abscessus-Chelonae” clades. Some of these clades have also been observed in earlier phylogenetic studies. Of these clades, the “Abscessus-Chelonae” clade forms the deepest branching lineage and does not form a monophyletic grouping with the “Fortuitum-Vaccae” clade of fast-growing species. In parallel, our comparative analyses of proteins from mycobacterial genomes have identified 172 molecular signatures in the form of conserved signature indels and conserved signature proteins, which are uniquely shared by either all Mycobacterium species or by members of the five identified clades. The identified molecular signatures (or synapomorphies) provide strong independent evidence for the monophyly of the genus Mycobacterium and the five described clades and they provide reliable means for the demarcation of these clades and for their diagnostics. Based on the results of our comprehensive phylogenomic analyses and numerous identified molecular signatures, which consistently and strongly support the division of known mycobacterial species into the five described clades, we propose here division of the genus Mycobacterium into an emended genus Mycobacterium encompassing the “Tuberculosis-Simiae” clade, which includes all of the major human pathogens, and four novel genera viz. Mycolicibacterium gen. nov., Mycolicibacter gen. nov., Mycolicibacillus gen. nov. and Mycobacteroides gen. nov. corresponding to the “Fortuitum-Vaccae,” “Terrae,” “Triviale,” and “Abscessus-Chelonae” clades, respectively. With the division of mycobacterial species into these five distinct groups, attention can now be focused on unique genetic and molecular characteristics that differentiate members of these groups. PMID:29497402
Meta-Analysis of DNA Tumor-Viral Integration Site Selection Indicates a Role for Repeats, Gene Expression and Epigenetics

PubMed Central

Doolittle-Hall, Janet M.; Cunningham Glasspoole, Danielle L.; Seaman, William T.; Webster-Cyriaque, Jennifer

2015-01-01

Oncoviruses cause tremendous global cancer burden. For several DNA tumor viruses, human genome integration is consistently associated with cancer development. However, genomic features associated with tumor viral integration are poorly understood. We sought to define genomic determinants for 1897 loci prone to hosting human papillomavirus (HPV), hepatitis B virus (HBV) or Merkel cell polyomavirus (MCPyV). These were compared to HIV, whose enzyme-mediated integration is well understood. A comprehensive catalog of integration sites was constructed from the literature and experimentally-determined HPV integration sites. Features were scored in eight categories (genes, expression, open chromatin, histone modifications, methylation, protein binding, chromatin segmentation and repeats) and compared to random loci. Random forest models determined loci classification and feature selection. HPV and HBV integrants were not fragile site associated. MCPyV preferred integration near sensory perception genes. Unique signatures of integration-associated predictive genomic features were detected. Importantly, repeats, actively-transcribed regions and histone modifications were common tumor viral integration signatures. PMID:26569308
Mycobacterium tuberculosis strains exhibit differential and strain-specific molecular signatures in pulmonary epithelial cells.

PubMed

Mvubu, Nontobeko Eunice; Pillay, Balakrishna; Gamieldien, Junaid; Bishai, William; Pillay, Manormoney

2016-12-01

Although pulmonary epithelial cells are integral to innate and adaptive immune responses during Mycobacterium tuberculosis infection, global transcriptomic changes in these cells remain largely unknown. Changes in gene expression induced in pulmonary epithelial cells infected with M. tuberculosis F15/LAM4/KZN, F11, F28, Beijing and Unique genotypes were investigated by RNA sequencing (RNA-Seq). The Illumina HiSeq 2000 platform generated 50 bp reads that were mapped to the human genome (Hg19) using Tophat (2.0.10). Differential gene expression induced by the different strains in infected relative to the uninfected cells was quantified and compared using Cufflinks (2.1.0) and MeV (4.0.9), respectively. Gene expression varied among the strains with the total number of genes as follows: F15/LAM4/KZN (1187), Beijing (1252), F11 (1639), F28 (870), Unique (886) and H37Rv (1179). A subset of 292 genes was commonly induced by all strains, where 52 genes were down-regulated while 240 genes were up-regulated. Differentially expressed genes were compared among the strains and the number of induced strain-specific gene signatures were as follows: F15/LAM4/KZN (138), Beijing (52), F11 (255), F28 (55), Unique (186) and H37Rv (125). Strain-specific molecular gene signatures associated with functional pathways were observed only for the Unique and H37Rv strains while certain biological functions may be associated with other strain signatures. This study demonstrated that strains of M. tuberculosis induce differential gene expression and strain-specific molecular signatures in pulmonary epithelial cells. Specific signatures induced by clinical strains of M. tuberculosis can be further explored for novel host-associated biomarkers and adjunctive immunotherapies. Copyright © 2016 Elsevier Ltd. All rights reserved.
12-Chemokine Gene Signature Identifies Lymph Node-like Structures in Melanoma: Potential for Patient Selection for Immunotherapy?

NASA Astrophysics Data System (ADS)

Messina, Jane L.; Fenstermacher, David A.; Eschrich, Steven; Qu, Xiaotao; Berglund, Anders E.; Lloyd, Mark C.; Schell, Michael J.; Sondak, Vernon K.; Weber, Jeffrey S.; Mulé, James J.

2012-10-01

We have interrogated a 12-chemokine gene expression signature (GES) on genomic arrays of 14,492 distinct solid tumors and show broad distribution across different histologies. We hypothesized that this 12-chemokine GES might accurately predict a unique intratumoral immune reaction in stage IV (non-locoregional) melanoma metastases. The 12-chemokine GES predicted the presence of unique, lymph node-like structures, containing CD20+ B cell follicles with prominent areas of CD3+ T cells (both CD4+ and CD8+ subsets). CD86+, but not FoxP3+, cells were present within these unique structures as well. The direct correlation between the 12-chemokine GES score and the presence of unique, lymph nodal structures was also associated with better overall survival of the subset of melanoma patients. The use of this novel 12-chemokine GES may reveal basic information on in situ mechanisms of the anti-tumor immune response, potentially leading to improvements in the identification and selection of melanoma patients most suitable for immunotherapy.
Past Exposure to Densely Ionizing Radiation Leaves a Unique Permanent Signature in the Genome

PubMed Central

Hande, M. Prakash; Azizova, Tamara V.; Geard, Charles R.; Burak, Ludmilla E.; Mitchell, Catherine R.; Khokhryakov, Valentin F.; Vasilenko, Evgeny K.; Brenner, David J.

2003-01-01

Speculation has long surrounded the question of whether past exposure to ionizing radiation leaves a unique permanent signature in the genome. Intrachromosomal rearrangements or deletions are produced much more efficiently by densely ionizing radiation than by chemical mutagens, x-rays, or endogenous aging processes. Until recently, such stable intrachromosomal aberrations have been very hard to detect, but a new chromosome band painting technique has made their detection practical. We report the detection and quantification of stable intrachromosomal aberrations in lymphocytes of healthy former nuclear-weapons workers who were exposed to plutonium many years ago. Even many years after occupational exposure, more than half the blood cells of the healthy plutonium workers contain large (>6 Mb) intrachromosomal rearrangements. The yield of these aberrations was highly correlated with plutonium dose to the bone marrow. The control groups contained very few such intrachromosomal aberrations. Quantification of this large-scale chromosomal damage in human populations exposed many years earlier will lead to new insights into the mechanisms and risks of cytogenetic damage. PMID:12679897
Characterization of a species-specific repetitive DNA from a highly endangered wild animal, Rhinoceros unicornis, and assessment of genetic polymorphism by microsatellite associated sequence amplification (MASA).

PubMed

Ali, S; Azfer, M A; Bashamboo, A; Mathur, P K; Malik, P K; Mathur, V B; Raha, A K; Ansari, S

1999-03-04

We have cloned and sequenced a 906bp EcoRI repeat DNA fraction from Rhinoceros unicornis genome. The contig pSS(R)2 is AT rich with 340 A (37.53%), 187 C (20.64%), 173 G (19.09%) and 206 T (22.74%). The sequence contains MALT box, NF-E1, Poly-A signal, lariat consensus sequences, TATA box, translational initiation sequences and several stop codons. Translation of the contig showed seven different types of protein motifs, among which, EGF-like domain cysteine pattern signatures and Bowman-Birk serine protease inhibitor family signatures were prominent. The presence of eukaryotic transcriptional elements, protein signatures and analysis of subset sequences in the 5' region from 1 to 165nt indicating coding potential (test code value=0.97) suggest possible regulatory and/or functional role(s) of these sequences in the rhino genome. Translation of the complementary strand from 906 to 706nt and 190 to 2nt showed proteins of more than 7kDa rich in non-polar residues. This suggests that pSS(R)2 is either a part of, or adjacent to, a functional gene. The contig contains mostly non-consecutive simple repeat units from 2 to 17nt with varying frequencies, of which four base motifs were found to be predominant. Zoo-blot hybridization revealed that pSS(R)2 sequences are unique to R. unicornis genome because they do not cross-hybridize, even with the genomic DNA of South African black rhino Diceros bicornis. Southern blot analysis of R. unicornis genomic DNA with pSS(R)2 and other synthetic oligo probes revealed a high level of genetic homogeneity, which was also substantiated by microsatellite associated sequence amplification (MASA). Owing to its uniqueness, the pSS(R)2 probe has a potential application in the area of conservation biology for unequivocal identification of horn or other body tissues of R. unicornis. The evolutionary aspect of this repeat fraction in the context of comparative genome analysis is discussed.
Population genomics of Fusarium graminearum reveals signatures of divergent evolution within a major cereal pathogen

PubMed Central

2018-01-01

The cereal pathogen Fusarium graminearum is the primary cause of Fusarium head blight (FHB) and a significant threat to food safety and crop production. To elucidate population structure and identify genomic targets of selection within major FHB pathogen populations in North America we sequenced the genomes of 60 diverse F. graminearum isolates. We also assembled the first pan-genome for F. graminearum to clarify population-level differences in gene content potentially contributing to pathogen diversity. Bayesian and phylogenomic analyses revealed genetic structure associated with isolates that produce the novel NX-2 mycotoxin, suggesting a North American population that has remained genetically distinct from other endemic and introduced cereal-infecting populations. Genome scans uncovered distinct signatures of selection within populations, focused in high diversity, frequently recombining regions. These patterns suggested selection for genomic divergence at the trichothecene toxin gene cluster and thirteen additional regions containing genes potentially involved in pathogen specialization. Gene content differences further distinguished populations, in that 121 genes showed population-specific patterns of conservation. Genes that differentiated populations had predicted functions related to pathogenesis, secondary metabolism and antagonistic interactions, though a subset had unique roles in temperature and light sensitivity. Our results indicated that F. graminearum populations are distinguished by dozens of genes with signatures of selection and an array of dispensable accessory genes, suggesting that FHB pathogen populations may be equipped with different traits to exploit the agroecosystem. These findings provide insights into the evolutionary processes and genomic features contributing to population divergence in plant pathogens, and highlight candidate genes for future functional studies of pathogen specialization across evolutionarily and ecologically diverse fungi. PMID:29584736
Repetitive element signature-based visualization, distance computation, and classification of 1766 microbial genomes.

PubMed

Lee, Kang-Hoon; Shin, Kyung-Seop; Lim, Debora; Kim, Woo-Chan; Chung, Byung Chang; Han, Gyu-Bum; Roh, Jeongkyu; Cho, Dong-Ho; Cho, Kiho

2015-07-01

The genomes of living organisms are populated with pleomorphic repetitive elements (REs) of varying densities. Our hypothesis that genomic RE landscapes are species/strain/individual-specific was implemented into the Genome Signature Imaging system to visualize and compute the RE-based signatures of any genome. Following the occurrence profiling of 5-nucleotide REs/words, the information from top-50 frequency words was transformed into a genome-specific signature and visualized as Genome Signature Images (GSIs), using a CMYK scheme. An algorithm for computing distances among GSIs was formulated using the GSIs' variables (word identity, frequency, and frequency order). The utility of the GSI-distance computation system was demonstrated with control genomes. GSI-based computation of genome-relatedness among 1766 microbes (117 archaea and 1649 bacteria) identified their clustering patterns; although the majority paralleled the established classification, some did not. The Genome Signature Imaging system, with its visualization and distance computation functions, enables genome-scale evolutionary studies involving numerous genomes with varying sizes. Copyright © 2015 Elsevier Inc. All rights reserved.
Strategies for the acquisition of transcriptional and epigenetic information in single cells.

PubMed

Li, Guang; Dzilic, Elda; Flores, Nick; Shieh, Alice; Wu, Sean M

2017-03-01

As the basic unit of living organisms, each single cell has unique molecular signatures and functions. Our ability to uncover the transcriptional and epigenetic signature of single cells has been hampered by the lack of tools to explore this area of research. The advent of microfluidic single cell technology along with single cell genome-wide DNA amplification methods had greatly improved our understanding of the expression variation in single cells. Transcriptional expression profile by multiplex qPCR or genome-wide RNA sequencing has enabled us to examine genes expression in single cells in different tissues. With the new tools, the identification of new cellular heterogeneity, novel marker genes, unique subpopulations, and spatial locations of each single cell can be acquired successfully. Epigenetic modifications for each single cell can also be obtained via similar methods. Based on single cell genome sequencing, single cell epigenetic information including histone modifications, DNA methylation, and chromatin accessibility have been explored and provided valuable insights regarding gene regulation and disease prognosis. In this article, we review the development of strategies to obtain single cell transcriptional and epigenetic data. Furthermore, we discuss ways in which single cell studies may help to provide greater understanding of the mechanisms of basic cardiovascular biology that will eventually lead to improvement in our ability to diagnose disease and develop new therapies.
The Divided Bacterial Genome: Structure, Function, and Evolution.

PubMed

diCenzo, George C; Finan, Turlough M

2017-09-01

Approximately 10% of bacterial genomes are split between two or more large DNA fragments, a genome architecture referred to as a multipartite genome. This multipartite organization is found in many important organisms, including plant symbionts, such as the nitrogen-fixing rhizobia, and plant, animal, and human pathogens, including the genera Brucella , Vibrio , and Burkholderia . The availability of many complete bacterial genome sequences means that we can now examine on a broad scale the characteristics of the different types of DNA molecules in a genome. Recent work has begun to shed light on the unique properties of each class of replicon, the unique functional role of chromosomal and nonchromosomal DNA molecules, and how the exploitation of novel niches may have driven the evolution of the multipartite genome. The aims of this review are to (i) outline the literature regarding bacterial genomes that are divided into multiple fragments, (ii) provide a meta-analysis of completed bacterial genomes from 1,708 species as a way of reviewing the abundant information present in these genome sequences, and (iii) provide an encompassing model to explain the evolution and function of the multipartite genome structure. This review covers, among other topics, salient genome terminology; mechanisms of multipartite genome formation; the phylogenetic distribution of multipartite genomes; how each part of a genome differs with respect to genomic signatures, genetic variability, and gene functional annotation; how each DNA molecule may interact; as well as the costs and benefits of this genome structure. Copyright © 2017 American Society for Microbiology.
Accurate read-based metagenome characterization using a hierarchical suite of unique signatures

PubMed Central

Freitas, Tracey Allen K.; Li, Po-E; Scholz, Matthew B.; Chain, Patrick S. G.

2015-01-01

A major challenge in the field of shotgun metagenomics is the accurate identification of organisms present within a microbial community, based on classification of short sequence reads. Though existing microbial community profiling methods have attempted to rapidly classify the millions of reads output from modern sequencers, the combination of incomplete databases, similarity among otherwise divergent genomes, errors and biases in sequencing technologies, and the large volumes of sequencing data required for metagenome sequencing has led to unacceptably high false discovery rates (FDR). Here, we present the application of a novel, gene-independent and signature-based metagenomic taxonomic profiling method with significantly and consistently smaller FDR than any other available method. Our algorithm circumvents false positives using a series of non-redundant signature databases and examines Genomic Origins Through Taxonomic CHAllenge (GOTTCHA). GOTTCHA was tested and validated on 20 synthetic and mock datasets ranging in community composition and complexity, was applied successfully to data generated from spiked environmental and clinical samples, and robustly demonstrates superior performance compared with other available tools. PMID:25765641
Unique signatures of long noncoding RNA expression in response to virus infection and altered innate immune signaling.

PubMed

Peng, Xinxia; Gralinski, Lisa; Armour, Christopher D; Ferris, Martin T; Thomas, Matthew J; Proll, Sean; Bradel-Tretheway, Birgit G; Korth, Marcus J; Castle, John C; Biery, Matthew C; Bouzek, Heather K; Haynor, David R; Frieman, Matthew B; Heise, Mark; Raymond, Christopher K; Baric, Ralph S; Katze, Michael G

2010-10-26

Studies of the host response to virus infection typically focus on protein-coding genes. However, non-protein-coding RNAs (ncRNAs) are transcribed in mammalian cells, and the roles of many of these ncRNAs remain enigmas. Using next-generation sequencing, we performed a whole-transcriptome analysis of the host response to severe acute respiratory syndrome coronavirus (SARS-CoV) infection across four founder mouse strains of the Collaborative Cross. We observed differential expression of approximately 500 annotated, long ncRNAs and 1,000 nonannotated genomic regions during infection. Moreover, studies of a subset of these ncRNAs and genomic regions showed the following. (i) Most were similarly regulated in response to influenza virus infection. (ii) They had distinctive kinetic expression profiles in type I interferon receptor and STAT1 knockout mice during SARS-CoV infection, including unique signatures of ncRNA expression associated with lethal infection. (iii) Over 40% were similarly regulated in vitro in response to both influenza virus infection and interferon treatment. These findings represent the first discovery of the widespread differential expression of long ncRNAs in response to virus infection and suggest that ncRNAs are involved in regulating the host response, including innate immunity. At the same time, virus infection models provide a unique platform for studying the biology and regulation of ncRNAs.
Unique Signatures of Long Noncoding RNA Expression in Response to Virus Infection and Altered Innate Immune Signaling

PubMed Central

Peng, Xinxia; Gralinski, Lisa; Armour, Christopher D.; Ferris, Martin T.; Thomas, Matthew J.; Proll, Sean; Bradel-Tretheway, Birgit G.; Korth, Marcus J.; Castle, John C.; Biery, Matthew C.; Bouzek, Heather K.; Haynor, David R.; Frieman, Matthew B.; Heise, Mark; Raymond, Christopher K.; Baric, Ralph S.; Katze, Michael G.

2010-01-01

Studies of the host response to virus infection typically focus on protein-coding genes. However, non-protein-coding RNAs (ncRNAs) are transcribed in mammalian cells, and the roles of many of these ncRNAs remain enigmas. Using next-generation sequencing, we performed a whole-transcriptome analysis of the host response to severe acute respiratory syndrome coronavirus (SARS-CoV) infection across four founder mouse strains of the Collaborative Cross. We observed differential expression of approximately 500 annotated, long ncRNAs and 1,000 nonannotated genomic regions during infection. Moreover, studies of a subset of these ncRNAs and genomic regions showed the following. (i) Most were similarly regulated in response to influenza virus infection. (ii) They had distinctive kinetic expression profiles in type I interferon receptor and STAT1 knockout mice during SARS-CoV infection, including unique signatures of ncRNA expression associated with lethal infection. (iii) Over 40% were similarly regulated in vitro in response to both influenza virus infection and interferon treatment. These findings represent the first discovery of the widespread differential expression of long ncRNAs in response to virus infection and suggest that ncRNAs are involved in regulating the host response, including innate immunity. At the same time, virus infection models provide a unique platform for studying the biology and regulation of ncRNAs. PMID:20978541
Connecting genes, coexpression modules, and molecular signatures to environmental stress phenotypes in plants

PubMed Central

Weston, David J; Gunter, Lee E; Rogers, Alistair; Wullschleger, Stan D

2008-01-01

Background One of the eminent opportunities afforded by modern genomic technologies is the potential to provide a mechanistic understanding of the processes by which genetic change translates to phenotypic variation and the resultant appearance of distinct physiological traits. Indeed much progress has been made in this area, particularly in biomedicine where functional genomic information can be used to determine the physiological state (e.g., diagnosis) and predict phenotypic outcome (e.g., patient survival). Ecology currently lacks an analogous approach where genomic information can be used to diagnose the presence of a given physiological state (e.g., stress response) and then predict likely phenotypic outcomes (e.g., stress duration and tolerance, fitness). Results Here, we demonstrate that a compendium of genomic signatures can be used to classify the plant abiotic stress phenotype in Arabidopsis according to the architecture of the transcriptome, and then be linked with gene coexpression network analysis to determine the underlying genes governing the phenotypic response. Using this approach, we confirm the existence of known stress responsive pathways and marker genes, report a common abiotic stress responsive transcriptome and relate phenotypic classification to stress duration. Conclusion Linking genomic signatures to gene coexpression analysis provides a unique method of relating an observed plant phenotype to changes in gene expression that underlie that phenotype. Such information is critical to current and future investigations in plant biology and, in particular, to evolutionary ecology, where a mechanistic understanding of adaptive physiological responses to abiotic stress can provide researchers with a tool of great predictive value in understanding species and population level adaptation to climate change. PMID:18248680
Phylogenomic evidence for ancient hybridization in the genomes of living cats (Felidae)

PubMed Central

Li, Gang; Davis, Brian W.; Eizirik, Eduardo; Murphy, William J.

2016-01-01

Inter-species hybridization has been recently recognized as potentially common in wild animals, but the extent to which it shapes modern genomes is still poorly understood. Distinguishing historical hybridization events from other processes leading to phylogenetic discordance among different markers requires a well-resolved species tree that considers all modes of inheritance and overcomes systematic problems due to rapid lineage diversification by sampling large genomic character sets. Here, we assessed genome-wide phylogenetic variation across a diverse mammalian family, Felidae (cats). We combined genotypes from a genome-wide SNP array with additional autosomal, X- and Y-linked variants to sample ∼150 kb of nuclear sequence, in addition to complete mitochondrial genomes generated using light-coverage Illumina sequencing. We present the first robust felid time tree that accounts for unique maternal, paternal, and biparental evolutionary histories. Signatures of phylogenetic discordance were abundant in the genomes of modern cats, in many cases indicating hybridization as the most likely cause. Comparison of big cat whole-genome sequences revealed a substantial reduction of X-linked divergence times across several large recombination cold spots, which were highly enriched for signatures of selection-driven post-divergence hybridization between the ancestors of the snow leopard and lion lineages. These results highlight the mosaic origin of modern felid genomes and the influence of sex chromosomes and sex-biased dispersal in post-speciation gene flow. A complete resolution of the tree of life will require comprehensive genomic sampling of biparental and sex-limited genetic variation to identify and control for phylogenetic conflict caused by ancient admixture and sex-biased differences in genomic transmission. PMID:26518481
Phylogenomic and Molecular Demarcation of the Core Members of the Polyphyletic Pasteurellaceae Genera Actinobacillus, Haemophilus, and Pasteurella

PubMed Central

Naushad, Sohail; Adeolu, Mobolaji; Goel, Nisha; Khadka, Bijendra; Al-Dahwi, Aqeel; Gupta, Radhey S.

2015-01-01

The genera Actinobacillus, Haemophilus, and Pasteurella exhibit extensive polyphyletic branching in phylogenetic trees and do not represent coherent clusters of species. In this study, we have utilized molecular signatures identified through comparative genomic analyses in conjunction with genome based and multilocus sequence based phylogenetic analyses to clarify the phylogenetic and taxonomic boundary of these genera. We have identified large clusters of Actinobacillus, Haemophilus, and Pasteurella species which represent the “sensu stricto” members of these genera. We have identified 3, 7, and 6 conserved signature indels (CSIs), which are specifically shared by sensu stricto members of Actinobacillus, Haemophilus, and Pasteurella, respectively. We have also identified two different sets of CSIs that are unique characteristics of the pathogen containing genera Aggregatibacter and Mannheimia, respectively. It is now possible to demarcate the genera Actinobacillus sensu stricto, Haemophilus sensu stricto, and Pasteurella sensu stricto on the basis of discrete molecular signatures. The other members of the genera Actinobacillus, Haemophilus, and Pasteurella that do not fall within the “sensu stricto” clades and do not contain these molecular signatures should be reclassified as other genera. The CSIs identified here also provide useful diagnostic targets for the identification of current and novel members of the indicated genera. PMID:25821780
Hybridization Reveals the Evolving Genomic Architecture of Speciation

PubMed Central

Kronforst, Marcus R.; Hansen, Matthew E.B.; Crawford, Nicholas G.; Gallant, Jason R.; Zhang, Wei; Kulathinal, Rob J.; Kapan, Durrell D.; Mullen, Sean P.

2014-01-01

SUMMARY The rate at which genomes diverge during speciation is unknown, as are the physical dynamics of the process. Here, we compare full genome sequences of 32 butterflies, representing five species from a hybridizing Heliconius butterfly community, to examine genome-wide patterns of introgression and infer how divergence evolves during the speciation process. Our analyses reveal that initial divergence is restricted to a small fraction of the genome, largely clustered around known wing-patterning genes. Over time, divergence evolves rapidly, due primarily to the origin of new divergent regions. Furthermore, divergent genomic regions display signatures of both selection and adaptive introgression, demonstrating the link between microevolutionary processes acting within species and the origin of species across macroevolutionary timescales. Our results provide a uniquely comprehensive portrait of the evolving species boundary due to the role that hybridization plays in reducing the background accumulation of divergence at neutral sites. PMID:24183670
GeneSigDB—a curated database of gene expression signatures

PubMed Central

Culhane, Aedín C.; Schwarzl, Thomas; Sultana, Razvan; Picard, Kermshlise C.; Picard, Shaita C.; Lu, Tim H.; Franklin, Katherine R.; French, Simon J.; Papenhausen, Gerald; Correll, Mick; Quackenbush, John

2010-01-01

The primary objective of most gene expression studies is the identification of one or more gene signatures; lists of genes whose transcriptional levels are uniquely associated with a specific biological phenotype. Whilst thousands of experimentally derived gene signatures are published, their potential value to the community is limited by their computational inaccessibility. Gene signatures are embedded in published article figures, tables or in supplementary materials, and are frequently presented using non-standard gene or probeset nomenclature. We present GeneSigDB (http://compbio.dfci.harvard.edu/genesigdb) a manually curated database of gene expression signatures. GeneSigDB release 1.0 focuses on cancer and stem cells gene signatures and was constructed from more than 850 publications from which we manually transcribed 575 gene signatures. Most gene signatures (n = 560) were successfully mapped to the genome to extract standardized lists of EnsEMBL gene identifiers. GeneSigDB provides the original gene signature, the standardized gene list and a fully traceable gene mapping history for each gene from the original transcribed data table through to the standardized list of genes. The GeneSigDB web portal is easy to search, allows users to compare their own gene list to those in the database, and download gene signatures in most common gene identifier formats. PMID:19934259
Spatially Resolved Genomic, Stable Isotopic, and Lipid Analyses of a Modern Freshwater Microbialite from Cuatro Ciénegas, Mexico

PubMed Central

Nitti, Anthony; Daniels, Camille A.; Siefert, Janet; Souza, Valeria; Hollander, David

2012-01-01

Abstract Microbialites are biologically mediated carbonate deposits found in diverse environments worldwide. To explore the organisms and processes involved in microbialite formation, this study integrated genomic, lipid, and both organic and inorganic stable isotopic analyses to examine five discrete depth horizons spanning the surface 25 mm of a modern freshwater microbialite from Cuatro Ciénegas, Mexico. Distinct bacterial communities and geochemical signatures were observed in each microbialite layer. Photoautotrophic organisms accounted for approximately 65% of the sequences in the surface community and produced biomass with distinctive lipid biomarker and isotopic (δ13C) signatures. This photoautotrophic biomass was efficiently degraded in the deeper layers by heterotrophic organisms, primarily sulfate-reducing proteobacteria. Two spatially distinct zones of carbonate precipitation were observed within the microbialite, with the first zone corresponding to the phototroph-dominated portion of the microbialite and the second zone associated with the presence of sulfate-reducing heterotrophs. The coupling of photoautotrophic production, heterotrophic decomposition, and remineralization of organic matter led to the incorporation of a characteristic biogenic signature into the inorganic CaCO3 matrix. Overall, spatially resolved multidisciplinary analyses of the microbialite enabled correlations to be made between the distribution of specific organisms, precipitation of carbonate, and preservation of unique lipid and isotopic geochemical signatures. These findings are critical for understanding the formation of modern microbialites and have implications for the interpretation of ancient microbialite records. Key Words: Microbial ecology—Microbe-mineral interactions—Microbial mats—Stromatolites—Genomics. Astrobiology 12, 685–698. PMID:22882001

Transcriptome of interstitial cells of Cajal reveals unique and selective gene signatures

PubMed Central

Park, Paul J.; Fuchs, Robert; Wei, Lai; Jorgensen, Brian G.; Redelman, Doug; Ward, Sean M.; Sanders, Kenton M.

2017-01-01

Transcriptome-scale data can reveal essential clues into understanding the underlying molecular mechanisms behind specific cellular functions and biological processes. Transcriptomics is a continually growing field of research utilized in biomarker discovery. The transcriptomic profile of interstitial cells of Cajal (ICC), which serve as slow-wave electrical pacemakers for gastrointestinal (GI) smooth muscle, has yet to be uncovered. Using copGFP-labeled ICC mice and flow cytometry, we isolated ICC populations from the murine small intestine and colon and obtained their transcriptomes. In analyzing the transcriptome, we identified a unique set of ICC-restricted markers including transcription factors, epigenetic enzymes/regulators, growth factors, receptors, protein kinases/phosphatases, and ion channels/transporters. This analysis provides new and unique insights into the cellular and biological functions of ICC in GI physiology. Additionally, we constructed an interactive ICC genome browser (http://med.unr.edu/physio/transcriptome) based on the UCSC genome database. To our knowledge, this is the first online resource that provides a comprehensive library of all known genetic transcripts expressed in primary ICC. Our genome browser offers a new perspective into the alternative expression of genes in ICC and provides a valuable reference for future functional studies. PMID:28426719
Rapid Evolutionary Rates and Unique Genomic Signatures Discovered in the First Reference Genome for the Southern Ocean Salp, Salpa thompsoni (Urochordata, Thaliacea)

PubMed Central

Jue, Nathaniel K.; Batta-Lona, Paola G.; Trusiak, Sarah; Obergfell, Craig; Bucklin, Ann; O’Neill, Michael J.; O’Neill, Rachel J.

2016-01-01

A preliminary genome sequence has been assembled for the Southern Ocean salp, Salpa thompsoni (Urochordata, Thaliacea). Despite the ecological importance of this species in Antarctic pelagic food webs and its potential role as an indicator of changing Southern Ocean ecosystems in response to climate change, no genomic resources are available for S. thompsoni or any closely related urochordate species. Using a multiple-platform, multiple-individual approach, we have produced a 318,767,936-bp genome sequence, covering >50% of the estimated 602 Mb (±173 Mb) genome size for S. thompsoni. Using a nonredundant set of predicted proteins, >50% (16,823) of sequences showed significant homology to known proteins and ∼38% (12,151) of the total protein predictions were associated with Gene Ontology functional information. We have generated 109,958 SNP variant and 9,782 indel predictions for this species, serving as a resource for future phylogenomic and population genetic studies. Comparing the salp genome to available assemblies for four other urochordates, Botryllus schlosseri, Ciona intestinalis, Ciona savignyi and Oikopleura dioica, we found that S. thompsoni shares the previously estimated rapid rates of evolution for these species. High mutation rates are thus independent of genome size, suggesting that rates of evolution >1.5 times that observed for vertebrates are a broad taxonomic characteristic of urochordates. Tests for positive selection implemented in PAML revealed a small number of genes with sites undergoing rapid evolution, including genes involved in ribosome biogenesis and metabolic and immune process that may be reflective of both adaptation to polar, planktonic environments as well as the complex life history of the salps. Finally, we performed an initial survey of small RNAs, revealing the presence of known, conserved miRNAs, as well as novel miRNA genes; unique piRNAs; and mature miRNA signatures for varying developmental stages. Collectively, these resources provide a genomic foundation supporting S. thompsoni as a model species for further examination of the exceptional rates and patterns of genomic evolution shown by urochordates. Additionally, genomic data will allow for the development of molecular indicators of key life history events and processes and afford new understandings and predictions of impacts of climate change on this key species of Antarctic pelagic ecosystems. PMID:27624472
Genomic analysis of Ugandan and Rwandan chicken ecotypes using a 600 k genotyping array.

PubMed

Fleming, D S; Koltes, J E; Markey, A D; Schmidt, C J; Ashwell, C M; Rothschild, M F; Persia, M E; Reecy, J M; Lamont, S J

2016-05-26

Indigenous populations of animals have developed unique adaptations to their local environments, which may include factors such as response to thermal stress, drought, pathogens and suboptimal nutrition. The survival and subsequent evolution within these local environments can be the result of both natural and artificial selection driving the acquisition of favorable traits, which over time leave genomic signatures in a population. This study's goals are to characterize genomic diversity and identify selection signatures in chickens from equatorial Africa to identify genomic regions that may confer adaptive advantages of these ecotypes to their environments. Indigenous chickens from Uganda (n = 72) and Rwanda (n = 100), plus Kuroilers (n = 24, an Indian breed imported to Africa), were genotyped using the Axiom® 600 k Chicken Genotyping Array. Indigenous ecotypes were defined based upon location of sampling within Africa. The results revealed the presence of admixture among the Ugandan, Rwandan, and Kuroiler populations. Genes within runs of homozygosity consensus regions are linked to gene ontology (GO) terms related to lipid metabolism, immune functions and stress-mediated responses (FDR < 0.15). The genes within regions of signatures of selection are enriched for GO terms related to health and oxidative stress processes. Key genes in these regions had anti-oxidant, apoptosis, and inflammation functions. The study suggests that these populations have alleles under selective pressure from their environment, which may aid in adaptation to harsh environments. The correspondence in gene ontology terms connected to stress-mediated processes across the populations could be related to the similarity of environments or an artifact of the detected admixture.
A New Omics Data Resource of Pleurocybella porrigens for Gene Discovery

PubMed Central

Dohra, Hideo; Someya, Takumi; Takano, Tomoyuki; Harada, Kiyonori; Omae, Saori; Hirai, Hirofumi; Yano, Kentaro; Kawagishi, Hirokazu

2013-01-01

Background Pleurocybella porrigens is a mushroom-forming fungus, which has been consumed as a traditional food in Japan. In 2004, 55 people were poisoned by eating the mushroom and 17 people among them died of acute encephalopathy. Since then, the Japanese government has been alerting Japanese people to take precautions against eating the P . porrigens mushroom. Unfortunately, despite efforts, the molecular mechanism of the encephalopathy remains elusive. The genome and transcriptome sequence data of P . porrigens and the related species, however, are not stored in the public database. To gain the omics data in P . porrigens , we sequenced genome and transcriptome of its fruiting bodies and mycelia by next generation sequencing. Methodology/Principal Findings Short read sequences of genomic DNAs and mRNAs in P . porrigens were generated by Illumina Genome Analyzer. Genome short reads were de novo assembled into scaffolds using Velvet. Comparisons of genome signatures among Agaricales showed that P . porrigens has a unique genome signature. Transcriptome sequences were assembled into contigs (unigenes). Biological functions of unigenes were predicted by Gene Ontology and KEGG pathway analyses. The majority of unigenes would be novel genes without significant counterparts in the public omics databases. Conclusions Functional analyses of unigenes present the existence of numerous novel genes in the basidiomycetes division. The results mean that the omics information such as genome, transcriptome and metabolome in basidiomycetes is short in the current databases. The large-scale omics information on P . porrigens , provided from this research, will give a new data resource for gene discovery in basidiomycetes. PMID:23936076
Genus-Wide Comparative Genomics of Malassezia Delineates Its Phylogeny, Physiology, and Niche Adaptation on Human Skin

PubMed Central

Wu, Guangxi; Zhao, He; Li, Chenhao; Rajapakse, Menaka Priyadarsani; Wong, Wing Cheong; Xu, Jun; Saunders, Charles W.; Reeder, Nancy L.; Reilman, Raymond A.; Scheynius, Annika; Sun, Sheng; Billmyre, Blake Robert; Li, Wenjun; Averette, Anna Floyd; Mieczkowski, Piotr; Heitman, Joseph; Theelen, Bart; Schröder, Markus S.; De Sessions, Paola Florez; Butler, Geraldine; Maurer-Stroh, Sebastian; Boekhout, Teun; Nagarajan, Niranjan; Dawson, Thomas L.

2015-01-01

Malassezia is a unique lipophilic genus in class Malasseziomycetes in Ustilaginomycotina, (Basidiomycota, fungi) that otherwise consists almost exclusively of plant pathogens. Malassezia are typically isolated from warm-blooded animals, are dominant members of the human skin mycobiome and are associated with common skin disorders. To characterize the genetic basis of the unique phenotypes of Malassezia spp., we sequenced the genomes of all 14 accepted species and used comparative genomics against a broad panel of fungal genomes to comprehensively identify distinct features that define the Malassezia gene repertoire: gene gain and loss; selection signatures; and lineage-specific gene family expansions. Our analysis revealed key gene gain events (64) with a single gene conserved across all Malassezia but absent in all other sequenced Basidiomycota. These likely horizontally transferred genes provide intriguing gain-of-function events and prime candidates to explain the emergence of Malassezia. A larger set of genes (741) were lost, with enrichment for glycosyl hydrolases and carbohydrate metabolism, concordant with adaptation to skin’s carbohydrate-deficient environment. Gene family analysis revealed extensive turnover and underlined the importance of secretory lipases, phospholipases, aspartyl proteases, and other peptidases. Combining genomic analysis with a re-evaluation of culture characteristics, we establish the likely lipid-dependence of all Malassezia. Our phylogenetic analysis sheds new light on the relationship between Malassezia and other members of Ustilaginomycotina, as well as phylogenetic lineages within the genus. Overall, our study provides a unique genomic resource for understanding Malassezia niche-specificity and potential virulence, as well as their abundance and distribution in the environment and on human skin. PMID:26539826
Genus-Wide Comparative Genomics of Malassezia Delineates Its Phylogeny, Physiology, and Niche Adaptation on Human Skin.

PubMed

Wu, Guangxi; Zhao, He; Li, Chenhao; Rajapakse, Menaka Priyadarsani; Wong, Wing Cheong; Xu, Jun; Saunders, Charles W; Reeder, Nancy L; Reilman, Raymond A; Scheynius, Annika; Sun, Sheng; Billmyre, Blake Robert; Li, Wenjun; Averette, Anna Floyd; Mieczkowski, Piotr; Heitman, Joseph; Theelen, Bart; Schröder, Markus S; De Sessions, Paola Florez; Butler, Geraldine; Maurer-Stroh, Sebastian; Boekhout, Teun; Nagarajan, Niranjan; Dawson, Thomas L

2015-11-01

Malassezia is a unique lipophilic genus in class Malasseziomycetes in Ustilaginomycotina, (Basidiomycota, fungi) that otherwise consists almost exclusively of plant pathogens. Malassezia are typically isolated from warm-blooded animals, are dominant members of the human skin mycobiome and are associated with common skin disorders. To characterize the genetic basis of the unique phenotypes of Malassezia spp., we sequenced the genomes of all 14 accepted species and used comparative genomics against a broad panel of fungal genomes to comprehensively identify distinct features that define the Malassezia gene repertoire: gene gain and loss; selection signatures; and lineage-specific gene family expansions. Our analysis revealed key gene gain events (64) with a single gene conserved across all Malassezia but absent in all other sequenced Basidiomycota. These likely horizontally transferred genes provide intriguing gain-of-function events and prime candidates to explain the emergence of Malassezia. A larger set of genes (741) were lost, with enrichment for glycosyl hydrolases and carbohydrate metabolism, concordant with adaptation to skin's carbohydrate-deficient environment. Gene family analysis revealed extensive turnover and underlined the importance of secretory lipases, phospholipases, aspartyl proteases, and other peptidases. Combining genomic analysis with a re-evaluation of culture characteristics, we establish the likely lipid-dependence of all Malassezia. Our phylogenetic analysis sheds new light on the relationship between Malassezia and other members of Ustilaginomycotina, as well as phylogenetic lineages within the genus. Overall, our study provides a unique genomic resource for understanding Malassezia niche-specificity and potential virulence, as well as their abundance and distribution in the environment and on human skin.
Genomic scar signatures associated with homologous recombination deficiency predict adverse clinical outcomes in patients with ovarian clear cell carcinoma.

PubMed

Chao, Angel; Lai, Chyong-Huey; Wang, Tzu-Hao; Jung, Shih-Ming; Lee, Yun-Shien; Chang, Wei-Yang; Yang, Lan-Yang; Ku, Fei-Chun; Huang, Huei-Jean; Chao, An-Shine; Wang, Chin-Jung; Chang, Ting-Chang; Wu, Ren-Chin

2018-05-03

We investigated whether genomic scar signatures associated with homologous recombination deficiency (HRD), which include telomeric allelic imbalance (TAI), large-scale transition (LST), and loss of heterozygosity (LOH), can predict clinical outcomes in patients with ovarian clear cell carcinoma (OCCC). We enrolled patients with OCCC (n = 80) and high-grade serous carcinoma (HGSC; n = 92) subjected to primary cytoreductive surgery, most of whom received platinum-based adjuvant chemotherapy. Genomic scar signatures based on genome-wide copy number data were determined in all participants and investigated in relation to prognosis. OCCC had significantly lower genomic scar signature scores than HGSC (p < 0.001). Near-triploid OCCC specimens showed higher TAI and LST scores compared with diploid tumors (p < 0.001). While high scores of these genomic scar signatures were significantly associated with better clinical outcomes in patients with HGSC, the opposite was evident for OCCC. Multivariate survival analysis in patients with OCCC identified high LOH scores as the main independent adverse predictor for both cancer-specific (hazard ratio [HR] = 3.22, p = 0.005) and progression-free survival (HR = 2.54, p = 0.01). In conclusion, genomic scar signatures associated with HRD predict adverse clinical outcomes in patients with OCCC. The LOH score was identified as the strongest prognostic indicator in this patient group. Genomic scar signatures associated with HRD are less frequent in OCCC than in HGSC. Genomic scar signatures associated with HRD have an adverse prognostic impact in patients with OCCC. LOH score is the strongest adverse prognostic factor in patients with OCCC.
The Spirodela polyrhiza genome reveals insights into its neotenous reduction fast growth and aquatic lifestyle

PubMed Central

Wang, W.; Haberer, G.; Gundlach, H.; Gläßer, C.; Nussbaumer, T.; Luo, M.C.; Lomsadze, A.; Borodovsky, M.; Kerstetter, R.A.; Shanklin, J.; Byrant, D.W.; Mockler, T.C.; Appenroth, K.J.; Grimwood, J.; Jenkins, J.; Chow, J.; Choi, C.; Adam, C.; Cao, X.-H.; Fuchs, J.; Schubert, I.; Rokhsar, D.; Schmutz, J.; Michael, T.P.; Mayer, K.F.X.; Messing, J

2014-01-01

The subfamily of the Lemnoideae belongs to a different order than other monocotyledonous species that have been sequenced and comprises aquatic plants that grow rapidly on the water surface. Here we select Spirodela polyrhiza for whole-genome sequencing. We show that Spirodela has a genome with no signs of recent retrotranspositions but signatures of two ancient whole-genome duplications, possibly 95 million years ago (mya), older than those in Arabidopsis and rice. Its genome has only 19,623 predicted protein-coding genes, which is 28% less than the dicotyledonous Arabidopsis thaliana and 50% less than monocotyledonous rice. We propose that at least in part, the neotenous reduction of these aquatic plants is based on readjusted copy numbers of promoters and repressors of the juvenile-to-adult transition. The Spirodela genome, along with its unique biology and physiology, will stimulate new insights into environmental adaptation, ecology, evolution and plant development, and will be instrumental for future bioenergy applications. PMID:24548928
Repertoire of novel sequence signatures for the detection of Candidatus Liberibacter asiaticus by quantitative real-time PCR

PubMed Central

2014-01-01

Background Huanglongbing (HLB) or citrus greening is a devastating disease of citrus. The gram-negative bacterium Candidatus Liberibacter asiaticus (Las) belonging to the α-proteobacteria is responsible for HLB in North America as well as in Asia. Currently, there is no cure for this disease. Early detection and quarantine of Las-infected trees are important management strategies used to prevent HLB from invading HLB-free citrus producing regions. Quantitative real-time PCR (qRT-PCR) based molecular diagnostic assays have been routinely used in the detection and diagnosis of Las. The oligonucleotide primer pairs based on conserved genes or regions, which include 16S rDNA and the β-operon, have been widely employed in the detection of Las by qRT-PCR. The availability of whole genome sequence of Las now allows the design of primers beyond the conserved regions for the detection of Las explicitly. Results We took a complimentary approach by systematically screening the genes in a genome-wide fashion, to identify the unique signatures that are only present in Las by an exhaustive sequence based similarity search against the nucleotide sequence database. Our search resulted in 34 probable unique signatures. Furthermore, by designing the primer pair specific to the identified signatures, we showed that most of our primer sets are able to detect Las from the infected plant and psyllid materials collected from the USA and China by qRT-PCR. Overall, 18 primer pairs of the 34 are found to be highly specific to Las with no cross reactivity to the closely related species Ca. L. americanus (Lam) and Ca. L. africanus (Laf). Conclusions We have designed qRT-PCR primers based on Las specific genes. Among them, 18 are suitable for the detection of Las from Las-infected plant and psyllid samples. The repertoire of primers that we have developed and characterized in this study enhanced the qRT-PCR based molecular diagnosis of HLB. PMID:24533511
Factor models for cancer signatures

NASA Astrophysics Data System (ADS)

Kakushadze, Zura; Yu, Willie

2016-11-01

We present a novel method for extracting cancer signatures by applying statistical risk models (http://ssrn.com/abstract=2732453) from quantitative finance to cancer genome data. Using 1389 whole genome sequenced samples from 14 cancers, we identify an ;overall; mode of somatic mutational noise. We give a prescription for factoring out this noise and source code for fixing the number of signatures. We apply nonnegative matrix factorization (NMF) to genome data aggregated by cancer subtype and filtered using our method. The resultant signatures have substantially lower variability than those from unfiltered data. Also, the computational cost of signature extraction is cut by about a factor of 10. We find 3 novel cancer signatures, including a liver cancer dominant signature (96% contribution) and a renal cell carcinoma signature (70% contribution). Our method accelerates finding new cancer signatures and improves their overall stability. Reciprocally, the methods for extracting cancer signatures could have interesting applications in quantitative finance.
Using Informatics-, Bioinformatics- and Genomics-Based Approaches for the Molecular Surveillance and Detection of Biothreat Agents

NASA Astrophysics Data System (ADS)

Seto, Donald

The convergence and wealth of informatics, bioinformatics and genomics methods and associated resources allow a comprehensive and rapid approach for the surveillance and detection of bacterial and viral organisms. Coupled with the continuing race for the fastest, most cost-efficient and highest-quality DNA sequencing technology, that is, "next generation sequencing", the detection of biological threat agents by `cheaper and faster' means is possible. With the application of improved bioinformatic tools for the understanding of these genomes and for parsing unique pathogen genome signatures, along with `state-of-the-art' informatics which include faster computational methods, equipment and databases, it is feasible to apply new algorithms to biothreat agent detection. Two such methods are high-throughput DNA sequencing-based and resequencing microarray-based identification. These are illustrated and validated by two examples involving human adenoviruses, both from real-world test beds.
Genome analysis of the platypus reveals unique signatures of evolution.

PubMed

Warren, Wesley C; Hillier, LaDeana W; Marshall Graves, Jennifer A; Birney, Ewan; Ponting, Chris P; Grützner, Frank; Belov, Katherine; Miller, Webb; Clarke, Laura; Chinwalla, Asif T; Yang, Shiaw-Pyng; Heger, Andreas; Locke, Devin P; Miethke, Pat; Waters, Paul D; Veyrunes, Frédéric; Fulton, Lucinda; Fulton, Bob; Graves, Tina; Wallis, John; Puente, Xose S; López-Otín, Carlos; Ordóñez, Gonzalo R; Eichler, Evan E; Chen, Lin; Cheng, Ze; Deakin, Janine E; Alsop, Amber; Thompson, Katherine; Kirby, Patrick; Papenfuss, Anthony T; Wakefield, Matthew J; Olender, Tsviya; Lancet, Doron; Huttley, Gavin A; Smit, Arian F A; Pask, Andrew; Temple-Smith, Peter; Batzer, Mark A; Walker, Jerilyn A; Konkel, Miriam K; Harris, Robert S; Whittington, Camilla M; Wong, Emily S W; Gemmell, Neil J; Buschiazzo, Emmanuel; Vargas Jentzsch, Iris M; Merkel, Angelika; Schmitz, Juergen; Zemann, Anja; Churakov, Gennady; Kriegs, Jan Ole; Brosius, Juergen; Murchison, Elizabeth P; Sachidanandam, Ravi; Smith, Carly; Hannon, Gregory J; Tsend-Ayush, Enkhjargal; McMillan, Daniel; Attenborough, Rosalind; Rens, Willem; Ferguson-Smith, Malcolm; Lefèvre, Christophe M; Sharp, Julie A; Nicholas, Kevin R; Ray, David A; Kube, Michael; Reinhardt, Richard; Pringle, Thomas H; Taylor, James; Jones, Russell C; Nixon, Brett; Dacheux, Jean-Louis; Niwa, Hitoshi; Sekita, Yoko; Huang, Xiaoqiu; Stark, Alexander; Kheradpour, Pouya; Kellis, Manolis; Flicek, Paul; Chen, Yuan; Webber, Caleb; Hardison, Ross; Nelson, Joanne; Hallsworth-Pepin, Kym; Delehaunty, Kim; Markovic, Chris; Minx, Pat; Feng, Yucheng; Kremitzki, Colin; Mitreva, Makedonka; Glasscock, Jarret; Wylie, Todd; Wohldmann, Patricia; Thiru, Prathapan; Nhan, Michael N; Pohl, Craig S; Smith, Scott M; Hou, Shunfeng; Nefedov, Mikhail; de Jong, Pieter J; Renfree, Marilyn B; Mardis, Elaine R; Wilson, Richard K

2008-05-08

We present a draft genome sequence of the platypus, Ornithorhynchus anatinus. This monotreme exhibits a fascinating combination of reptilian and mammalian characters. For example, platypuses have a coat of fur adapted to an aquatic lifestyle; platypus females lactate, yet lay eggs; and males are equipped with venom similar to that of reptiles. Analysis of the first monotreme genome aligned these features with genetic innovations. We find that reptile and platypus venom proteins have been co-opted independently from the same gene families; milk protein genes are conserved despite platypuses laying eggs; and immune gene family expansions are directly related to platypus biology. Expansions of protein, non-protein-coding RNA and microRNA families, as well as repeat elements, are identified. Sequencing of this genome now provides a valuable resource for deep mammalian comparative analyses, as well as for monotreme biology and conservation.
Genome analysis of the platypus reveals unique signatures of evolution

PubMed Central

Warren, Wesley C.; Hillier, LaDeana W.; Marshall Graves, Jennifer A.; Birney, Ewan; Ponting, Chris P.; Grützner, Frank; Belov, Katherine; Miller, Webb; Clarke, Laura; Chinwalla, Asif T.; Yang, Shiaw-Pyng; Heger, Andreas; Locke, Devin P.; Miethke, Pat; Waters, Paul D.; Veyrunes, Frédéric; Fulton, Lucinda; Fulton, Bob; Graves, Tina; Wallis, John; Puente, Xose S.; López-Otín, Carlos; Ordóñez, Gonzalo R.; Eichler, Evan E.; Chen, Lin; Cheng, Ze; Deakin, Janine E.; Alsop, Amber; Thompson, Katherine; Kirby, Patrick; Papenfuss, Anthony T.; Wakefield, Matthew J.; Olender, Tsviya; Lancet, Doron; Huttley, Gavin A.; Smit, Arian F. A.; Pask, Andrew; Temple-Smith, Peter; Batzer, Mark A.; Walker, Jerilyn A.; Konkel, Miriam K.; Harris, Robert S.; Whittington, Camilla M.; Wong, Emily S. W.; Gemmell, Neil J.; Buschiazzo, Emmanuel; Vargas Jentzsch, Iris M.; Merkel, Angelika; Schmitz, Juergen; Zemann, Anja; Churakov, Gennady; Kriegs, Jan Ole; Brosius, Juergen; Murchison, Elizabeth P.; Sachidanandam, Ravi; Smith, Carly; Hannon, Gregory J.; Tsend-Ayush, Enkhjargal; McMillan, Daniel; Attenborough, Rosalind; Rens, Willem; Ferguson-Smith, Malcolm; Lefèvre, Christophe M.; Sharp, Julie A.; Nicholas, Kevin R.; Ray, David A.; Kube, Michael; Reinhardt, Richard; Pringle, Thomas H.; Taylor, James; Jones, Russell C.; Nixon, Brett; Dacheux, Jean-Louis; Niwa, Hitoshi; Sekita, Yoko; Huang, Xiaoqiu; Stark, Alexander; Kheradpour, Pouya; Kellis, Manolis; Flicek, Paul; Chen, Yuan; Webber, Caleb; Hardison, Ross; Nelson, Joanne; Hallsworth-Pepin, Kym; Delehaunty, Kim; Markovic, Chris; Minx, Pat; Feng, Yucheng; Kremitzki, Colin; Mitreva, Makedonka; Glasscock, Jarret; Wylie, Todd; Wohldmann, Patricia; Thiru, Prathapan; Nhan, Michael N.; Pohl, Craig S.; Smith, Scott M.; Hou, Shunfeng; Renfree, Marilyn B.; Mardis, Elaine R.; Wilson, Richard K.

2009-01-01

We present a draft genome sequence of the platypus, Ornithorhynchus anatinus. This monotreme exhibits a fascinating combination of reptilian and mammalian characters. For example, platypuses have a coat of fur adapted to an aquatic lifestyle; platypus females lactate, yet lay eggs; and males are equipped with venom similar to that of reptiles. Analysis of the first monotreme genome aligned these features with genetic innovations. We find that reptile and platypus venom proteins have been co-opted independently from the same gene families; milk protein genes are conserved despite platypuses laying eggs; and immune gene family expansions are directly related to platypus biology. Expansions of protein, non-protein-coding RNA and microRNA families, as well as repeat elements, are identified. Sequencing of this genome now provides a valuable resource for deep mammalian comparative analyses, as well as for monotreme biology and conservation. PMID:18464734
Genomic signatures predict migration and spawning failure in wild Canadian salmon.

PubMed

Miller, Kristina M; Li, Shaorong; Kaukinen, Karia H; Ginther, Norma; Hammill, Edd; Curtis, Janelle M R; Patterson, David A; Sierocinski, Thomas; Donnison, Louise; Pavlidis, Paul; Hinch, Scott G; Hruska, Kimberly A; Cooke, Steven J; English, Karl K; Farrell, Anthony P

2011-01-14

Long-term population viability of Fraser River sockeye salmon (Oncorhynchus nerka) is threatened by unusually high levels of mortality as they swim to their spawning areas before they spawn. Functional genomic studies on biopsied gill tissue from tagged wild adults that were tracked through ocean and river environments revealed physiological profiles predictive of successful migration and spawning. We identified a common genomic profile that was correlated with survival in each study. In ocean-tagged fish, a mortality-related genomic signature was associated with a 13.5-fold greater chance of dying en route. In river-tagged fish, the same genomic signature was associated with a 50% increase in mortality before reaching the spawning grounds in one of three stocks tested. At the spawning grounds, the same signature was associated with 3.7-fold greater odds of dying without spawning. Functional analysis raises the possibility that the mortality-related signature reflects a viral infection.
Genomic signatures of rapid adaptive evolution in the bluespotted cornetfish, a Mediterranean Lessepsian invader.

PubMed

Bernardi, Giacomo; Azzurro, Ernesto; Golani, Daniel; Miller, Michael Ryan

2016-07-01

Biological invasions are increasingly creating ecological and economical problems both on land and in aquatic environments. For over a century, the Mediterranean Sea has steadily been invaded by Indian Ocean/Red Sea species (called Lessepsian invaders) via the Suez Canal, with a current estimate of ~450 species. The bluespotted cornetfish, Fistularia commersonii, considered a 'Lessepsian sprinter', entered the Mediterranean in 2000 and by 2007 had spread through the entire basin from Israel to Spain. The situation is unique and interesting both because of its unprecedented rapidity and by the fact that it took this species c. 130 years to immigrate into the Mediterranean. Using genome scans, with restriction site-associated DNA (RAD) sequencing, we evaluated neutral and selected genomic regions for Mediterranean vs. Red Sea cornetfish individuals. We found that few fixed neutral changes were detectable among populations. However, almost half of the genes associated with the 47 outlier loci (potentially under selection) were related to disease resistance and osmoregulation. Due to the short time elapsed from the beginning of the invasion to our sampling, we interpret these changes as signatures of rapid adaptation that may be explained by several mechanisms including preadaptation and strong local selection. Such genomic regions are therefore good candidates to further study their role in invasion success. © 2016 John Wiley & Sons Ltd.
Rapid Evolutionary Rates and Unique Genomic Signatures Discovered in the First Reference Genome for the Southern Ocean Salp, Salpa thompsoni (Urochordata, Thaliacea).

PubMed

Jue, Nathaniel K; Batta-Lona, Paola G; Trusiak, Sarah; Obergfell, Craig; Bucklin, Ann; O'Neill, Michael J; O'Neill, Rachel J

2016-10-30

A preliminary genome sequence has been assembled for the Southern Ocean salp, Salpa thompsoni (Urochordata, Thaliacea). Despite the ecological importance of this species in Antarctic pelagic food webs and its potential role as an indicator of changing Southern Ocean ecosystems in response to climate change, no genomic resources are available for S. thompsoni or any closely related urochordate species. Using a multiple-platform, multiple-individual approach, we have produced a 318,767,936-bp genome sequence, covering >50% of the estimated 602 Mb (±173 Mb) genome size for S. thompsoni Using a nonredundant set of predicted proteins, >50% (16,823) of sequences showed significant homology to known proteins and ∼38% (12,151) of the total protein predictions were associated with Gene Ontology functional information. We have generated 109,958 SNP variant and 9,782 indel predictions for this species, serving as a resource for future phylogenomic and population genetic studies. Comparing the salp genome to available assemblies for four other urochordates, Botryllus schlosseri, Ciona intestinalis, Ciona savignyi and Oikopleura dioica, we found that S. thompsoni shares the previously estimated rapid rates of evolution for these species. High mutation rates are thus independent of genome size, suggesting that rates of evolution >1.5 times that observed for vertebrates are a broad taxonomic characteristic of urochordates. Tests for positive selection implemented in PAML revealed a small number of genes with sites undergoing rapid evolution, including genes involved in ribosome biogenesis and metabolic and immune process that may be reflective of both adaptation to polar, planktonic environments as well as the complex life history of the salps. Finally, we performed an initial survey of small RNAs, revealing the presence of known, conserved miRNAs, as well as novel miRNA genes; unique piRNAs; and mature miRNA signatures for varying developmental stages. Collectively, these resources provide a genomic foundation supporting S. thompsoni as a model species for further examination of the exceptional rates and patterns of genomic evolution shown by urochordates. Additionally, genomic data will allow for the development of molecular indicators of key life history events and processes and afford new understandings and predictions of impacts of climate change on this key species of Antarctic pelagic ecosystems. © The Author(s) 2016. Published by Oxford University Press on behalf of the Society for Molecular Biology and Evolution.
Signatures of selection in tilapia revealed by whole genome resequencing.

PubMed

Xia, Jun Hong; Bai, Zhiyi; Meng, Zining; Zhang, Yong; Wang, Le; Liu, Feng; Jing, Wu; Wan, Zi Yi; Li, Jiale; Lin, Haoran; Yue, Gen Hua

2015-09-16

Natural selection and selective breeding for genetic improvement have left detectable signatures within the genome of a species. Identification of selection signatures is important in evolutionary biology and for detecting genes that facilitate to accelerate genetic improvement. However, selection signatures, including artificial selection and natural selection, have only been identified at the whole genome level in several genetically improved fish species. Tilapia is one of the most important genetically improved fish species in the world. Using next-generation sequencing, we sequenced the genomes of 47 tilapia individuals. We identified a total of 1.43 million high-quality SNPs and found that the LD block sizes ranged from 10-100 kb in tilapia. We detected over a hundred putative selective sweep regions in each line of tilapia. Most selection signatures were located in non-coding regions of the tilapia genome. The Wnt signaling, gonadotropin-releasing hormone receptor and integrin signaling pathways were under positive selection in all improved tilapia lines. Our study provides a genome-wide map of genetic variation and selection footprints in tilapia, which could be important for genetic studies and accelerating genetic improvement of tilapia.
Genome-Wide Mutagenesis in Borrelia burgdorferi.

PubMed

Lin, Tao; Gao, Lihui

2018-01-01

Signature-tagged mutagenesis (STM) is a functional genomics approach to identify bacterial virulence determinants and virulence factors by simultaneously screening multiple mutants in a single host animal, and has been utilized extensively for the study of bacterial pathogenesis, host-pathogen interactions, and spirochete and tick biology. The signature-tagged transposon mutagenesis has been developed to investigate virulence determinants and pathogenesis of Borrelia burgdorferi. Mutants in genes important in virulence are identified by negative selection in which the mutants fail to colonize or disseminate in the animal host and tick vector. STM procedure combined with Luminex Flex ® Map™ technology and next-generation sequencing (e.g., Tn-seq) are the powerful high-throughput tools for the determination of Borrelia burgdorferi virulence determinants. The assessment of multiple tissue sites and two DNA resources at two different time points using Luminex Flex ® Map™ technology provides a robust data set. B. burgdorferi transposon mutant screening indicates that a high proportion of genes are the novel virulence determinants that are required for mouse and tick infection. In this protocol, an effective signature-tagged Himar1-based transposon suicide vector was developed and used to generate a sequence-defined library of nearly 4800 mutants in the infectious B. burgdorferi B31 clone. In STM, signature-tagged suicide vectors are constructed by inserting unique DNA sequences (tags) into the transposable elements. The signature-tagged transposon mutants are generated when transposon suicide vectors are transformed into an infectious B. burgdorferi clone, and the transposable element is transposed into the 5'-TA-3' sequence in the B. burgdorferi genome with the signature tag. The transposon library is created and consists of many sub-libraries, each sub-library has several hundreds of mutants with same tags. A group of mice or ticks are infected with a mixed population of mutants with different tags, after recovered from different tissues of infected mice and ticks, mutants from output pool and input pool are detected using high-throughput, semi-quantitative Luminex ® FLEXMAP™ or next-generation sequencing (Tn-seq) technologies. Thus far, we have created a high-density, sequence-defined transposon library of over 6600 STM mutants for the efficient genome-wide investigation of genes and gene products required for wild-type pathogenesis, host-pathogen interactions, in vitro growth, in vivo survival, physiology, morphology, chemotaxis, motility, structure, metabolism, gene regulation, plasmid maintenance and replication, etc. The insertion sites of 4480 transposon mutants have been determined. About 800 predicted protein-encoding genes in the genome were disrupted in the STM transposon library. The infectivity and some functions of 800 mutants in 500 genes have been determined. Analysis of these transposon mutants has yielded valuable information regarding the genes and gene products important in the pathogenesis and biology of B. burgdorferi and its tick vectors.
Transcriptional profiling of pure fibrolamellar hepatocellular carcinoma reveals an endocrine signature.

PubMed

Malouf, Gabriel G; Job, Sylvie; Paradis, Valérie; Fabre, Monique; Brugières, Laurence; Saintigny, Pierre; Vescovo, Laure; Belghiti, Jacques; Branchereau, Sophie; Faivre, Sandrine; de Reyniès, Aurélien; Raymond, Eric

2014-06-01

Fibrolamellar hepatocellular carcinoma (FLC) is a rare subtype of liver cancer occurring mostly in children and young adults. We have shown that FLC comprises two separate entities: pure (p-FLC) and mixed-FLC (m-FLC), differing in clinical presentation and course. We show that p-FLCs have a distinct gene expression signature different from that of m-FLCs, which have a signature similar to that of classical hepatocellular carcinomas. We found p-FLC profiles to be unique among 263 profiles related to diverse tumoral and nontumoral liver samples. We identified two distinct molecular subgroups of p-FLCs with different outcomes. Pathway analysis of p-FLCs revealed ERBB2 overexpression and an up-regulation of glycolysis, possibly leading to compensatory mitochondrial hyperplasia and oncocytic differentiation. Four of the sixteen genes most significantly overexpressed in p-FLCs were neuroendocrine genes: prohormone convertase 1 (PCSK1); neurotensin; delta/notch-like EGF repeat containing; and calcitonin. PCSK1 overexpression was validated by immunohistochemistry, yielding specific, diffuse staining of the protein throughout the cytoplasm, possibly corresponding to a functional form of this convertase. p-FLCs have a unique transcriptomic signature characterized by the strong expression of specific neuroendocrine genes, suggesting that these tumors may have a cellular origin different from that of HCC. Our data have implications for the use of genomic profiling for diagnosis and selection of targeted therapies in patients with p-FLC. © 2014 by the American Association for the Study of Liver Diseases.
Draft versus finished sequence data for DNA and protein diagnostic signature development

PubMed Central

Gardner, Shea N.; Lam, Marisa W.; Smith, Jason R.; Torres, Clinton L.; Slezak, Tom R.

2005-01-01

Sequencing pathogen genomes is costly, demanding careful allocation of limited sequencing resources. We built a computational Sequencing Analysis Pipeline (SAP) to guide decisions regarding the amount of genomic sequencing necessary to develop high-quality diagnostic DNA and protein signatures. SAP uses simulations to estimate the number of target genomes and close phylogenetic relatives (near neighbors or NNs) to sequence. We use SAP to assess whether draft data are sufficient or finished sequencing is required using Marburg and variola virus sequences. Simulations indicate that intermediate to high-quality draft with error rates of 10−3–10−5 (∼8× coverage) of target organisms is suitable for DNA signature prediction. Low-quality draft with error rates of ∼1% (3× to 6× coverage) of target isolates is inadequate for DNA signature prediction, although low-quality draft of NNs is sufficient, as long as the target genomes are of high quality. For protein signature prediction, sequencing errors in target genomes substantially reduce the detection of amino acid sequence conservation, even if the draft is of high quality. In summary, high-quality draft of target and low-quality draft of NNs appears to be a cost-effective investment for DNA signature prediction, but may lead to underestimation of predicted protein signatures. PMID:16243783

Methyl-CpG island-associated genome signature tags

DOEpatents

Dunn, John J

2014-05-20

Disclosed is a method for analyzing the organismic complexity of a sample through analysis of the nucleic acid in the sample. In the disclosed method, through a series of steps, including digestion with a type II restriction enzyme, ligation of capture adapters and linkers and digestion with a type IIS restriction enzyme, genome signature tags are produced. The sequences of a statistically significant number of the signature tags are determined and the sequences are used to identify and quantify the organisms in the sample. Various embodiments of the invention described herein include methods for using single point genome signature tags to analyze the related families present in a sample, methods for analyzing sequences associated with hyper- and hypo-methylated CpG islands, methods for visualizing organismic complexity change in a sampling location over time and methods for generating the genome signature tag profile of a sample of fragmented DNA.
Detecting the Population Structure and Scanning for Signatures of Selection in Horses (Equus caballus) From Whole-Genome Sequencing Data

PubMed Central

Zhang, Cheng; Ni, Pan; Ahmad, Hafiz Ishfaq; Gemingguli, M; Baizilaitibei, A; Gulibaheti, D; Fang, Yaping; Wang, Haiyang; Asif, Akhtar Rasool; Xiao, Changyi; Chen, Jianhai; Ma, Yunlong; Liu, Xiangdong; Du, Xiaoyong; Zhao, Shuhong

2018-01-01

Animal domestication gives rise to gradual changes at the genomic level through selection in populations. Selective sweeps have been traced in the genomes of many animal species, including humans, cattle, and dogs. However, little is known regarding positional candidate genes and genomic regions that exhibit signatures of selection in domestic horses. In addition, an understanding of the genetic processes underlying horse domestication, especially the origin of Chinese native populations, is still lacking. In our study, we generated whole genome sequences from 4 Chinese native horses and combined them with 48 publicly available full genome sequences, from which 15 341 213 high-quality unique single-nucleotide polymorphism variants were identified. Kazakh and Lichuan horses are 2 typical Asian native breeds that were formed in Kazakh or Northwest China and South China, respectively. We detected 1390 loss-of-function (LoF) variants in protein-coding genes, and gene ontology (GO) enrichment analysis revealed that some LoF-affected genes were overrepresented in GO terms related to the immune response. Bayesian clustering, distance analysis, and principal component analysis demonstrated that the population structure of these breeds largely reflected weak geographic patterns. Kazakh and Lichuan horses were assigned to the same lineage with other Asian native breeds, in agreement with previous studies on the genetic origin of Chinese domestic horses. We applied the composite likelihood ratio method to scan for genomic regions showing signals of recent selection in the horse genome. A total of 1052 genomic windows of 10 kB, corresponding to 933 distinct core regions, significantly exceeded neutral simulations. The GO enrichment analysis revealed that the genes under selective sweeps were overrepresented with GO terms, including “negative regulation of canonical Wnt signaling pathway,” “muscle contraction,” and “axon guidance.” Frequent exercise training in domestic horses may have resulted in changes in the expression of genes related to metabolism, muscle structure, and the nervous system.
Classification and regression tree (CART) analyses of genomic signatures reveal sets of tetramers that discriminate temperature optima of archaea and bacteria

PubMed Central

Dyer, Betsey D.; Kahn, Michael J.; LeBlanc, Mark D.

2008-01-01

Classification and regression tree (CART) analysis was applied to genome-wide tetranucleotide frequencies (genomic signatures) of 195 archaea and bacteria. Although genomic signatures have typically been used to classify evolutionary divergence, in this study, convergent evolution was the focus. Temperature optima for most of the organisms examined could be distinguished by CART analyses of tetranucleotide frequencies. This suggests that pervasive (nonlinear) qualities of genomes may reflect certain environmental conditions (such as temperature) in which those genomes evolved. The predominant use of GAGA and AGGA as the discriminating tetramers in CART models suggests that purine-loading and codon biases of thermophiles may explain some of the results. PMID:19054742
A genomic portrait of haplotype diversity and signatures of selection in indigenous southern African populations.

PubMed

Chimusa, Emile R; Meintjies, Ayton; Tchanga, Milaine; Mulder, Nicola; Seoighe, Cathal; Seioghe, Cathal; Soodyall, Himla; Ramesar, Rajkumar

2015-03-01

We report a study of genome-wide, dense SNP (∼ 900K) and copy number polymorphism data of indigenous southern Africans. We demonstrate the genetic contribution to southern and eastern African populations, which involved admixture between indigenous San, Niger-Congo-speaking and populations of Eurasian ancestry. This finding illustrates the need to account for stratification in genome-wide association studies, and that admixture mapping would likely be a successful approach in these populations. We developed a strategy to detect the signature of selection prior to and following putative admixture events. Several genomic regions show an unusual excess of Niger-Kordofanian, and unusual deficiency of both San and Eurasian ancestry, which were considered the footprints of selection after population admixture. Several SNPs with strong allele frequency differences were observed predominantly between the admixed indigenous southern African populations, and their ancestral Eurasian populations. Interestingly, many candidate genes, which were identified within the genomic regions showing signals for selection, were associated with southern African-specific high-risk, mostly communicable diseases, such as malaria, influenza, tuberculosis, and human immunodeficiency virus/AIDs. This observation suggests a potentially important role that these genes might have played in adapting to the environment. Additionally, our analyses of haplotype structure, linkage disequilibrium, recombination, copy number variation and genome-wide admixture highlight, and support the unique position of San relative to both African and non-African populations. This study contributes to a better understanding of population ancestry and selection in south-eastern African populations; and the data and results obtained will support research into the genetic contributions to infectious as well as non-communicable diseases in the region.
A Genomic Portrait of Haplotype Diversity and Signatures of Selection in Indigenous Southern African Populations

PubMed Central

Chimusa, Emile R.; Meintjies, Ayton; Tchanga, Milaine; Mulder, Nicola; Seoighe, Cathal; Soodyall, Himla; Ramesar, Rajkumar

2015-01-01

We report a study of genome-wide, dense SNP (∼900K) and copy number polymorphism data of indigenous southern Africans. We demonstrate the genetic contribution to southern and eastern African populations, which involved admixture between indigenous San, Niger-Congo-speaking and populations of Eurasian ancestry. This finding illustrates the need to account for stratification in genome-wide association studies, and that admixture mapping would likely be a successful approach in these populations. We developed a strategy to detect the signature of selection prior to and following putative admixture events. Several genomic regions show an unusual excess of Niger-Kordofanian, and unusual deficiency of both San and Eurasian ancestry, which were considered the footprints of selection after population admixture. Several SNPs with strong allele frequency differences were observed predominantly between the admixed indigenous southern African populations, and their ancestral Eurasian populations. Interestingly, many candidate genes, which were identified within the genomic regions showing signals for selection, were associated with southern African-specific high-risk, mostly communicable diseases, such as malaria, influenza, tuberculosis, and human immunodeficiency virus/AIDs. This observation suggests a potentially important role that these genes might have played in adapting to the environment. Additionally, our analyses of haplotype structure, linkage disequilibrium, recombination, copy number variation and genome-wide admixture highlight, and support the unique position of San relative to both African and non-African populations. This study contributes to a better understanding of population ancestry and selection in south-eastern African populations; and the data and results obtained will support research into the genetic contributions to infectious as well as non-communicable diseases in the region. PMID:25811879
Experimental Identification of Actinobacillus pleuropneumoniae Strains L20 and JL03 Heptosyltransferases, Evidence for a New Heptosyltransferase Signature Sequence

PubMed Central

Merino, Susana; Knirel, Yuriy A.; Regué, Miguel; Tomás, Juan M.

2013-01-01

We experimentally identified the activities of six predicted heptosyltransferases in Actinobacillus pleuropneumoniae genome serotype 5b strain L20 and serotype 3 strain JL03. The initial identification was based on a bioinformatic analysis of the amino acid similarity between these putative heptosyltrasferases with others of known function from enteric bacteria and Aeromonas. The putative functions of all the Actinobacillus pleuropneumoniae heptosyltrasferases were determined by using surrogate LPS acceptor molecules from well-defined A. hydrophyla AH-3 and A. salmonicida A450 mutants. Our results show that heptosyltransferases APL_0981 and APJL_1001 are responsible for the transfer of the terminal outer core D-glycero-D-manno-heptose (D,D-Hep) residue although they are not currently included in the CAZY glycosyltransferase 9 family. The WahF heptosyltransferase group signature sequence [S(T/S)(GA)XXH] differs from the heptosyltransferases consensus signature sequence [D(TS)(GA)XXH], because of the substitution of D261 for S261, being unique. PMID:23383222
Signatures of selection in tilapia revealed by whole genome resequencing

PubMed Central

Hong Xia, Jun; Bai, Zhiyi; Meng, Zining; Zhang, Yong; Wang, Le; Liu, Feng; Jing, Wu; Yi Wan, Zi; Li, Jiale; Lin, Haoran; Hua Yue, Gen

2015-01-01

Natural selection and selective breeding for genetic improvement have left detectable signatures within the genome of a species. Identification of selection signatures is important in evolutionary biology and for detecting genes that facilitate to accelerate genetic improvement. However, selection signatures, including artificial selection and natural selection, have only been identified at the whole genome level in several genetically improved fish species. Tilapia is one of the most important genetically improved fish species in the world. Using next-generation sequencing, we sequenced the genomes of 47 tilapia individuals. We identified a total of 1.43 million high-quality SNPs and found that the LD block sizes ranged from 10–100 kb in tilapia. We detected over a hundred putative selective sweep regions in each line of tilapia. Most selection signatures were located in non-coding regions of the tilapia genome. The Wnt signaling, gonadotropin-releasing hormone receptor and integrin signaling pathways were under positive selection in all improved tilapia lines. Our study provides a genome-wide map of genetic variation and selection footprints in tilapia, which could be important for genetic studies and accelerating genetic improvement of tilapia. PMID:26373374
Comparative expression profiling in grape (Vitis vinifera) berries derived from frequency analysis of ESTs and MPSS signatures.

PubMed

Iandolino, Alberto; Nobuta, Kan; da Silva, Francisco Goes; Cook, Douglas R; Meyers, Blake C

2008-05-12

Vitis vinifera (V. vinifera) is the primary grape species cultivated for wine production, with an industry valued annually in the billions of dollars worldwide. In order to sustain and increase grape production, it is necessary to understand the genetic makeup of grape species. Here we performed mRNA profiling using Massively Parallel Signature Sequencing (MPSS) and combined it with available Expressed Sequence Tag (EST) data. These tag-based technologies, which do not require a priori knowledge of genomic sequence, are well-suited for transcriptional profiling. The sequence depth of MPSS allowed us to capture and quantify almost all the transcripts at a specific stage in the development of the grape berry. The number and relative abundance of transcripts from stage II grape berries was defined using Massively Parallel Signature Sequencing (MPSS). A total of 2,635,293 17-base and 2,259,286 20-base signatures were obtained, representing at least 30,737 and 26,878 distinct sequences. The average normalized abundance per signature was approximately 49 TPM (Transcripts Per Million). Comparisons of the MPSS signatures with available Vitis species' ESTs and a unigene set demonstrated that 6,430 distinct contigs and 2,190 singletons have a perfect match to at least one MPSS signature. Among the matched sequences, ESTs were identified from tissues other than berries or from berries at different developmental stages. Additional MPSS signatures not matching to known grape ESTs can extend our knowledge of the V. vinifera transcriptome, particularly when these data are used to assist in annotation of whole genome sequences from Vitis vinifera. The MPSS data presented here not only achieved a higher level of saturation than previous EST based analyses, but in doing so, expand the known set of transcripts of grape berries during the unique stage in development that immediately precedes the onset of ripening. The MPSS dataset also revealed evidence of antisense expression not previously reported in grapes but comparable to that reported in other plant species. Finally, we developed a novel web-based, public resource for utilization of the grape MPSS data [1].
Insights from the complete chloroplast genome into the evolution of Sesamum indicum L.

PubMed

Zhang, Haiyang; Li, Chun; Miao, Hongmei; Xiong, Songjin

2013-01-01

Sesame (Sesamum indicum L.) is one of the oldest oilseed crops. In order to investigate the evolutionary characters according to the Sesame Genome Project, apart from sequencing its nuclear genome, we sequenced the complete chloroplast genome of S. indicum cv. Yuzhi 11 (white seeded) using Illumina and 454 sequencing. Comparisons of chloroplast genomes between S. indicum and the 18 other higher plants were then analyzed. The chloroplast genome of cv. Yuzhi 11 contains 153,338 bp and a total of 114 unique genes (KC569603). The number of chloroplast genes in sesame is the same as that in Nicotiana tabacum, Vitis vinifera and Platanus occidentalis. The variation in the length of the large single-copy (LSC) regions and inverted repeats (IR) in sesame compared to 18 other higher plant species was the main contributor to size variation in the cp genome in these species. The 77 functional chloroplast genes, except for ycf1 and ycf2, were highly conserved. The deletion of the cp ycf1 gene sequence in cp genomes may be due either to its transfer to the nuclear genome, as has occurred in sesame, or direct deletion, as has occurred in Panax ginseng and Cucumis sativus. The sesame ycf2 gene is only 5,721 bp in length and has lost about 1,179 bp. Nucleotides 1-585 of ycf2 when queried in BLAST had hits in the sesame draft genome. Five repeats (R10, R12, R13, R14 and R17) were unique to the sesame chloroplast genome. We also found that IR contraction/expansion in the cp genome alters its rate of evolution. Chloroplast genes and repeats display the signature of convergent evolution in sesame and other species. These findings provide a foundation for further investigation of cp genome evolution in Sesamum and other higher plants.
Genome-Wide Locations of Potential Epimutations Associated with Environmentally Induced Epigenetic Transgenerational Inheritance of Disease Using a Sequential Machine Learning Prediction Approach.

PubMed

Haque, M Muksitul; Holder, Lawrence B; Skinner, Michael K

2015-01-01

Environmentally induced epigenetic transgenerational inheritance of disease and phenotypic variation involves germline transmitted epimutations. The primary epimutations identified involve altered differential DNA methylation regions (DMRs). Different environmental toxicants have been shown to promote exposure (i.e., toxicant) specific signatures of germline epimutations. Analysis of genomic features associated with these epimutations identified low-density CpG regions (<3 CpG / 100bp) termed CpG deserts and a number of unique DNA sequence motifs. The rat genome was annotated for these and additional relevant features. The objective of the current study was to use a machine learning computational approach to predict all potential epimutations in the genome. A number of previously identified sperm epimutations were used as training sets. A novel machine learning approach using a sequential combination of Active Learning and Imbalance Class Learner analysis was developed. The transgenerational sperm epimutation analysis identified approximately 50K individual sites with a 1 kb mean size and 3,233 regions that had a minimum of three adjacent sites with a mean size of 3.5 kb. A select number of the most relevant genomic features were identified with the low density CpG deserts being a critical genomic feature of the features selected. A similar independent analysis with transgenerational somatic cell epimutation training sets identified a smaller number of 1,503 regions of genome-wide predicted sites and differences in genomic feature contributions. The predicted genome-wide germline (sperm) epimutations were found to be distinct from the predicted somatic cell epimutations. Validation of the genome-wide germline predicted sites used two recently identified transgenerational sperm epimutation signature sets from the pesticides dichlorodiphenyltrichloroethane (DDT) and methoxychlor (MXC) exposure lineage F3 generation. Analysis of this positive validation data set showed a 100% prediction accuracy for all the DDT-MXC sperm epimutations. Observations further elucidate the genomic features associated with transgenerational germline epimutations and identify a genome-wide set of potential epimutations that can be used to facilitate identification of epigenetic diagnostics for ancestral environmental exposures and disease susceptibility.
Dissecting genetic and environmental mutation signatures with model organisms.

PubMed

Segovia, Romulo; Tam, Annie S; Stirling, Peter C

2015-08-01

Deep sequencing has impacted on cancer research by enabling routine sequencing of genomes and exomes to identify genetic changes associated with carcinogenesis. Researchers can now use the frequency, type, and context of all mutations in tumor genomes to extract mutation signatures that reflect the driving mutational processes. Identifying mutation signatures, however, may not immediately suggest a mechanism. Consequently, several recent studies have employed deep sequencing of model organisms exposed to discrete genetic or environmental perturbations. These studies exploit the simpler genomes and availability of powerful genetic tools in model organisms to analyze mutation signatures under controlled conditions, forging mechanistic links between mutational processes and signatures. We discuss the power of this approach and suggest that many such studies may be on the horizon. Copyright © 2015 Elsevier Ltd. All rights reserved.
Alu-miRNA interactions modulate transcript isoform diversity in stress response and reveal signatures of positive selection

NASA Astrophysics Data System (ADS)

Pandey, Rajesh; Bhattacharya, Aniket; Bhardwaj, Vivek; Jha, Vineet; Mandal, Amit K.; Mukerji, Mitali

2016-09-01

Primate-specific Alus harbor different regulatory features, including miRNA targets. In this study, we provide evidence for miRNA-mediated modulation of transcript isoform levels during heat-shock response through exaptation of Alu-miRNA sites in mature mRNA. We performed genome-wide expression profiling coupled with functional validation of miRNA target sites within exonized Alus, and analyzed conservation of these targets across primates. We observed that two miRNAs (miR-15a-3p and miR-302d-3p) elevated in stress response, target RAD1, GTSE1, NR2C1, FKBP9 and UBE2I exclusively within Alu. These genes map onto the p53 regulatory network. Ectopic overexpression of miR-15a-3p downregulates GTSE1 and RAD1 at the protein level and enhances cell survival. This Alu-mediated fine-tuning seems to be unique to humans as evident from the absence of orthologous sites in other primate lineages. We further analyzed signatures of selection on Alu-miRNA targets in the genome, using 1000 Genomes Phase-I data. We found that 198 out of 3177 Alu-exonized genes exhibit signatures of selection within Alu-miRNA sites, with 60 of them containing SNPs supported by multiple evidences (global-FST > 0.3, pair-wise-FST > 0.5, Fay-Wu’s H < -20, iHS > 2.0, high ΔDAF) and implicated in p53 network. We propose that by affecting multiple genes, Alu-miRNA interactions have the potential to facilitate population-level adaptations in response to environmental challenges.
Comparative Functional Genomics of Lactobacillus spp. Reveals Possible Mechanisms for Specialization of Vaginal Lactobacilli to Their Environment

PubMed Central

Suzuki, Haruo; Hickey, Roxana J.; Forney, Larry J.

2014-01-01

Lactobacilli are found in a wide variety of habitats. Four species, Lactobacillus crispatus, L. gasseri, L. iners, and L. jensenii, are common and abundant in the human vagina and absent from other habitats. These may be adapted to the vagina and possess characteristics enabling them to thrive in that environment. Furthermore, stable codominance of multiple Lactobacillus species in a single community is infrequently observed. Thus, it is possible that individual vaginal Lactobacillus species possess unique characteristics that confer to them host-specific competitive advantages. We performed comparative functional genomic analyses of representatives of 25 species of Lactobacillus, searching for habitat-specific traits in the genomes of the vaginal lactobacilli. We found that the genomes of the vaginal species were significantly smaller and had significantly lower GC content than those of the nonvaginal species. No protein families were found to be specific to the vaginal species analyzed, but some were either over- or underrepresented relative to nonvaginal species. We also found that within the vaginal species, each genome coded for species-specific protein families. Our results suggest that even though the vaginal species show no general signatures of adaptation to the vaginal environment, each species has specific and perhaps unique ways of interacting with its environment, be it the host or other microbes in the community. These findings will serve as a foundation for further exploring the role of lactobacilli in the ecological dynamics of vaginal microbial communities and their ultimate impact on host health. PMID:24488312
Gene Signature in Sessile Serrated Polyps Identifies Colon Cancer Subtype

PubMed Central

Kanth, Priyanka; Bronner, Mary P.; Boucher, Kenneth M.; Burt, Randall W.; Neklason, Deborah W.; Hagedorn, Curt H.; Delker, Don A.

2016-01-01

Sessile serrated colon adenoma/polyps (SSA/Ps) are found during routine screening colonoscopy and may account for 20–30% of colon cancers. However, differentiating SSA/Ps from hyperplastic polyps (HP) with little risk of cancer is challenging and complementary molecular markers are needed. Additionally, the molecular mechanisms of colon cancer development from SSA/Ps are poorly understood. RNA sequencing was performed on 21 SSA/Ps, 10 HPs, 10 adenomas, 21 uninvolved colon and 20 control colon specimens. Differential expression and leave-one-out cross validation methods were used to define a unique gene signature of SSA/Ps. Our SSA/P gene signature was evaluated in colon cancer RNA-Seq data from The Cancer Genome Atlas (TCGA) to identify a subtype of colon cancers that may develop from SSA/Ps. A total of 1422 differentially expressed genes were found in SSA/Ps relative to controls. Serrated polyposis syndrome (n=12) and sporadic SSA/Ps (n=9) exhibited almost complete (96%) gene overlap. A 51-gene panel in SSA/P showed similar expression in a subset of TCGA colon cancers with high microsatellite instability (MSI-H). A smaller seven-gene panel showed high sensitivity and specificity in identifying BRAF mutant, CpG island methylator phenotype high (CIMP-H) and MLH1 silenced colon cancers. We describe a unique gene signature in SSA/Ps that identifies a subset of colon cancers likely to develop through the serrated pathway. These gene panels may be utilized for improved differentiation of SSA/Ps from HPs and provide insights into novel molecular pathways altered in colon cancer arising from the serrated pathway. PMID:27026680
The topography of mutational processes in breast cancer genomes.

PubMed

Morganella, Sandro; Alexandrov, Ludmil B; Glodzik, Dominik; Zou, Xueqing; Davies, Helen; Staaf, Johan; Sieuwerts, Anieta M; Brinkman, Arie B; Martin, Sancha; Ramakrishna, Manasa; Butler, Adam; Kim, Hyung-Yong; Borg, Åke; Sotiriou, Christos; Futreal, P Andrew; Campbell, Peter J; Span, Paul N; Van Laere, Steven; Lakhani, Sunil R; Eyfjord, Jorunn E; Thompson, Alastair M; Stunnenberg, Hendrik G; van de Vijver, Marc J; Martens, John W M; Børresen-Dale, Anne-Lise; Richardson, Andrea L; Kong, Gu; Thomas, Gilles; Sale, Julian; Rada, Cristina; Stratton, Michael R; Birney, Ewan; Nik-Zainal, Serena

2016-05-02

Somatic mutations in human cancers show unevenness in genomic distribution that correlate with aspects of genome structure and function. These mutations are, however, generated by multiple mutational processes operating through the cellular lineage between the fertilized egg and the cancer cell, each composed of specific DNA damage and repair components and leaving its own characteristic mutational signature on the genome. Using somatic mutation catalogues from 560 breast cancer whole-genome sequences, here we show that each of 12 base substitution, 2 insertion/deletion (indel) and 6 rearrangement mutational signatures present in breast tissue, exhibit distinct relationships with genomic features relating to transcription, DNA replication and chromatin organization. This signature-based approach permits visualization of the genomic distribution of mutational processes associated with APOBEC enzymes, mismatch repair deficiency and homologous recombinational repair deficiency, as well as mutational processes of unknown aetiology. Furthermore, it highlights mechanistic insights including a putative replication-dependent mechanism of APOBEC-related mutagenesis.
Global diversity, population stratification, and selection of human copy number variation

PubMed Central

Sudmant, Peter H.; Mallick, Swapan; Nelson, Bradley J.; Hormozdiari, Fereydoun; Krumm, Niklas; Huddleston, John; Coe, Bradley P.; Baker, Carl; Nordenfelt, Susanne; Bamshad, Michael; Jorde, Lynn B.; Posukh, Olga L.; Sahakyan, Hovhannes; Watkins, W. Scott; Yepiskoposyan, Levon; Abdullah, M. Syafiq; Bravi, Claudio M.; Capelli, Cristian; Hervig, Tor; Wee, Joseph T. S.; Tyler-Smith, Chris; van Driem, George; Romero, Irene Gallego; Jha, Aashish R.; Karachanak-Yankova, Sena; Toncheva, Draga; Comas, David; Henn, Brenna; Kivisild, Toomas; Ruiz-Linares, Andres; Sajantila, Antti; Metspalu, Ene; Parik, Jüri; Villems, Richard; Starikovskaya, Elena B.; Ayodo, George; Beall, Cynthia M.; Di Rienzo, Anna; Hammer, Michael; Khusainova, Rita; Khusnutdinova, Elza; Klitz, William; Winkler, Cheryl; Labuda, Damian; Metspalu, Mait; Tishkoff, Sarah A.; Dryomov, Stanislav; Sukernik, Rem; Patterson, Nick; Reich, David; Eichler, Evan E.

2015-01-01

In order to explore the diversity and selective signatures of duplication and deletion human copy number variants (CNVs), we sequenced 236 individuals from 125 distinct human populations. We observed that duplications exhibit fundamentally different population genetic and selective signatures than deletions and are more likely to be stratified between human populations. Through reconstruction of the ancestral human genome, we identify megabases of DNA lost in different human lineages and pinpoint large duplications that introgressed from the extinct Denisova lineage now found at high frequency exclusively in Oceanic populations. We find that the proportion of CNV base pairs to single nucleotide variant base pairs is greater among non-Africans than it is among African populations, but we conclude that this difference is likely due to unique aspects of non-African population history as opposed to differences in CNV load. PMID:26249230
Signatures of positive selection in East African Shorthorn Zebu: a genome-wide SNP analysis

USDA-ARS?s Scientific Manuscript database

The small East African Shorthorn Zebu is the main indigenous cattle across East Africa. A recent genome wide SNPs analysis has revealed their ancient stable African taurine x Asian zebu admixture. Here, we assess the presence of candidate signature of positive selection in their genome, with the aim...
Unstable genomes elevate transcriptome dynamics

PubMed Central

Stevens, Joshua B.; Liu, Guo; Abdallah, Batoul Y.; Horne, Steven D.; Ye, Karen J.; Bremer, Steven W.; Ye, Christine J.; Krawetz, Stephen A.; Heng, Henry H.

2015-01-01

The challenge of identifying common expression signatures in cancer is well known, however the reason behind this is largely unclear. Traditionally variation in expression signatures has been attributed to technological problems, however recent evidence suggests that chromosome instability (CIN) and resultant karyotypic heterogeneity may be a large contributing factor. Using a well-defined model of immortalization, we systematically compared the pattern of genome alteration and expression dynamics during somatic evolution. Co-measurement of global gene expression and karyotypic alteration throughout the immortalization process reveals that karyotype changes influence gene expression as major structural and numerical karyotypic alterations result in large gene expression deviation. Replicate samples from stages with stable genomes are more similar to each other than are replicate samples with karyotypic heterogeneity. Karyotypic and gene expression change during immortalization is dynamic as each stage of progression has a unique expression pattern. This was further verified by comparing global expression in two replicates grown in one flask with known karyotypes. Replicates with higher karyotypic instability were found to be less similar than replicates with stable karyotypes. This data illustrates the karyotype, transcriptome, and transcriptome determined pathways are in constant flux during somatic cellular evolution (particularly during the macroevolutionary phase) and this flux is an inextricable feature of CIN and essential for cancer formation. The findings presented here underscore the importance of understanding the evolutionary process of cancer in order to design improved treatment modalities. PMID:24122714
Determination of Genetic Structure and Signatures of Selection in Three Strains of Tanzania Shorthorn Zebu, Boran and Friesian Cattle by Genome-Wide SNP Analyses

PubMed Central

Msalya, George; Kim, Eui-Soo; Laisser, Emmanuel L. K.; Kipanyula, Maulilio J.; Karimuribo, Esron D.; Kusiluka, Lughano J. M.; Chenyambuga, Sebastian W.; Rothschild, Max F.

2017-01-01

Background More than 90 percent of cattle in Tanzania belong to the indigenous Tanzania Short Horn Zebu (TSZ) population which has been classified into 12 strains based on historical evidence, morphological characteristics, and geographic distribution. However, specific genetic information of each TSZ population has been lacking and has caused difficulties in designing programs such as selection, crossbreeding, breed improvement or conservation. This study was designed to evaluate the genetic structure, assess genetic relationships, and to identify signatures of selection among cattle of Tanzania with the main goal of understanding genetic relationship, variation and uniqueness among them. Methodology/Principal findings The Illumina Bos indicus SNP 80K BeadChip was used to genotype genome wide SNPs in 168 DNA samples obtained from three strains of TSZ cattle namely Maasai, Tarime and Sukuma as well as two comparative breeds; Boran and Friesian. Population structure and signatures of selection were examined using principal component analysis (PCA), admixture analysis, pairwise distances (FST), integrated haplotype score (iHS), identical by state (IBS) and runs of homozygosity (ROH). There was a low level of inbreeding (F~0.01) in the TSZ population compared to the Boran and Friesian breeds. The analyses of FST, IBS and admixture identified no considerable differentiation between TSZ trains. Importantly, common ancestry in Boran and TSZ were revealed based on admixture and IBD, implying gene flow between two populations. In addition, Friesian ancestry was found in Boran. A few common significant iHS were detected, which may reflect influence of recent selection in each breed or strain. Conclusions Population admixture and selection signatures could be applied to develop conservation plan of TSZ cattle as well as future breeding programs in East African cattle. PMID:28129396
Genomic Comparison of Indigenous African and Northern European Chickens Reveals Putative Mechanisms of Stress Tolerance Related to Environmental Selection Pressure

PubMed Central

Fleming, Damarius S.; Weigend, Steffen; Simianer, Henner; Weigend, Annett; Rothschild, Max; Schmidt, Carl; Ashwell, Chris; Persia, Mike; Reecy, James; Lamont, Susan J.

2017-01-01

Global climate change is increasing the magnitude of environmental stressors, such as temperature, pathogens, and drought, that limit the survivability and sustainability of livestock production. Poultry production and its expansion is dependent upon robust animals that are able to cope with stressors in multiple environments. Understanding the genetic strategies that indigenous, noncommercial breeds have evolved to survive in their environment could help to elucidate molecular mechanisms underlying biological traits of environmental adaptation. We examined poultry from diverse breeds and climates of Africa and Northern Europe for selection signatures that have allowed them to adapt to their indigenous environments. Selection signatures were studied using a combination of population genomic methods that employed FST, integrated haplotype score (iHS), and runs of homozygosity (ROH) procedures. All the analyses indicated differences in environment as a driver of selective pressure in both groups of populations. The analyses revealed unique differences in the genomic regions under selection pressure from the environment for each population. The African chickens showed stronger selection toward stress signaling and angiogenesis, while the Northern European chickens showed more selection pressure toward processes related to energy homeostasis. The results suggest that chromosomes 2 and 27 are the most diverged between populations and the most selected upon within the African (chromosome 27) and Northern European (chromosome 2) birds. Examination of the divergent populations has provided new insight into genes under possible selection related to tolerance of a population’s indigenous environment that may be baselines for examining the genomic contribution to tolerance adaptions. PMID:28341699

Genomic Comparison of Indigenous African and Northern European Chickens Reveals Putative Mechanisms of Stress Tolerance Related to Environmental Selection Pressure.

PubMed

Fleming, Damarius S; Weigend, Steffen; Simianer, Henner; Weigend, Annett; Rothschild, Max; Schmidt, Carl; Ashwell, Chris; Persia, Mike; Reecy, James; Lamont, Susan J

2017-05-05

Global climate change is increasing the magnitude of environmental stressors, such as temperature, pathogens, and drought, that limit the survivability and sustainability of livestock production. Poultry production and its expansion is dependent upon robust animals that are able to cope with stressors in multiple environments. Understanding the genetic strategies that indigenous, noncommercial breeds have evolved to survive in their environment could help to elucidate molecular mechanisms underlying biological traits of environmental adaptation. We examined poultry from diverse breeds and climates of Africa and Northern Europe for selection signatures that have allowed them to adapt to their indigenous environments. Selection signatures were studied using a combination of population genomic methods that employed F ST , integrated haplotype score (iHS), and runs of homozygosity (ROH) procedures. All the analyses indicated differences in environment as a driver of selective pressure in both groups of populations. The analyses revealed unique differences in the genomic regions under selection pressure from the environment for each population. The African chickens showed stronger selection toward stress signaling and angiogenesis, while the Northern European chickens showed more selection pressure toward processes related to energy homeostasis. The results suggest that chromosomes 2 and 27 are the most diverged between populations and the most selected upon within the African (chromosome 27) and Northern European (chromosome 2) birds. Examination of the divergent populations has provided new insight into genes under possible selection related to tolerance of a population's indigenous environment that may be baselines for examining the genomic contribution to tolerance adaptions. Copyright © 2017 Fleming et al.
Genetic signatures of natural selection in a model invasive ascidian

NASA Astrophysics Data System (ADS)

Lin, Yaping; Chen, Yiyong; Yi, Changho; Fong, Jonathan J.; Kim, Won; Rius, Marc; Zhan, Aibin

2017-03-01

Invasive species represent promising models to study species’ responses to rapidly changing environments. Although local adaptation frequently occurs during contemporary range expansion, the associated genetic signatures at both population and genomic levels remain largely unknown. Here, we use genome-wide gene-associated microsatellites to investigate genetic signatures of natural selection in a model invasive ascidian, Ciona robusta. Population genetic analyses of 150 individuals sampled in Korea, New Zealand, South Africa and Spain showed significant genetic differentiation among populations. Based on outlier tests, we found high incidence of signatures of directional selection at 19 loci. Hitchhiking mapping analyses identified 12 directional selective sweep regions, and all selective sweep windows on chromosomes were narrow (~8.9 kb). Further analyses indentified 132 candidate genes under selection. When we compared our genetic data and six crucial environmental variables, 16 putatively selected loci showed significant correlation with these environmental variables. This suggests that the local environmental conditions have left significant signatures of selection at both population and genomic levels. Finally, we identified “plastic” genomic regions and genes that are promising regions to investigate evolutionary responses to rapid environmental change in C. robusta.
The topography of mutational processes in breast cancer genomes

DOE PAGES

Morganella, Sandro; Alexandrov, Ludmil B.; Glodzik, Dominik; ...

2016-01-01

Somatic mutations in human cancers show unevenness in genomic distribution that correlate with aspects of genome structure and function. These mutations are, however, generated by multiple mutational processes operating through the cellular lineage between the fertilized egg and the cancer cell, each composed of specific DNA damage and repair components and leaving its own characteristic mutational signature on the genome. Using somatic mutation catalogues from 560 breast cancer whole-genome sequences, here we show that each of 12 base substitution, 2 insertion/deletion (indel) and 6 rearrangement mutational signatures present in breast tissue, exhibit distinct relationships with genomic features relating to transcription,more » DNA replication and chromatin organization. This signature-based approach permits visualization of the genomic distribution of mutational processes associated with APOBEC enzymes, mismatch repair deficiency and homologous recombinational repair deficiency, as well as mutational processes of unknown aetiology. Lastly, it highlights mechanistic insights including a putative replication-dependent mechanism of APOBEC-related mutagenesis.« less
Gut Microbiome-Based Metagenomic Signature for Non-invasive Detection of Advanced Fibrosis in Human Nonalcoholic Fatty Liver Disease.

PubMed

Loomba, Rohit; Seguritan, Victor; Li, Weizhong; Long, Tao; Klitgord, Niels; Bhatt, Archana; Dulai, Parambir Singh; Caussy, Cyrielle; Bettencourt, Richele; Highlander, Sarah K; Jones, Marcus B; Sirlin, Claude B; Schnabl, Bernd; Brinkac, Lauren; Schork, Nicholas; Chen, Chi-Hua; Brenner, David A; Biggs, William; Yooseph, Shibu; Venter, J Craig; Nelson, Karen E

2017-05-02

The presence of advanced fibrosis in nonalcoholic fatty liver disease (NAFLD) is the most important predictor of liver mortality. There are limited data on the diagnostic accuracy of gut microbiota-derived signature for predicting the presence of advanced fibrosis. In this prospective study, we characterized the gut microbiome compositions using whole-genome shotgun sequencing of DNA extracted from stool samples. This study included 86 uniquely well-characterized patients with biopsy-proven NAFLD, of which 72 had mild/moderate (stage 0-2 fibrosis) NAFLD, and 14 had advanced fibrosis (stage 3 or 4 fibrosis). We identified a set of 40 features (p < 0.006), which included 37 bacterial species that were used to construct a Random Forest classifier model to distinguish mild/moderate NAFLD from advanced fibrosis. The model had a robust diagnostic accuracy (AUC 0.936) for detecting advanced fibrosis. This study provides preliminary evidence for a fecal-microbiome-derived metagenomic signature to detect advanced fibrosis in NAFLD. Copyright © 2017 Elsevier Inc. All rights reserved.
C. elegans whole-genome sequencing reveals mutational signatures related to carcinogens and DNA repair deficiency

PubMed Central

Meier, Bettina; Cooke, Susanna L.; Weiss, Joerg; Bailly, Aymeric P.; Alexandrov, Ludmil B.; Marshall, John; Raine, Keiran; Maddison, Mark; Anderson, Elizabeth; Stratton, Michael R.; Campbell, Peter J.

2014-01-01

Mutation is associated with developmental and hereditary disorders, aging, and cancer. While we understand some mutational processes operative in human disease, most remain mysterious. We used Caenorhabditis elegans whole-genome sequencing to model mutational signatures, analyzing 183 worm populations across 17 DNA repair-deficient backgrounds propagated for 20 generations or exposed to carcinogens. The baseline mutation rate in C. elegans was approximately one per genome per generation, not overtly altered across several DNA repair deficiencies over 20 generations. Telomere erosion led to complex chromosomal rearrangements initiated by breakage–fusion–bridge cycles and completed by simultaneously acquired, localized clusters of breakpoints. Aflatoxin B1 induced substitutions of guanines in a GpC context, as observed in aflatoxin-induced liver cancers. Mutational burden increased with impaired nucleotide excision repair. Cisplatin and mechlorethamine, DNA crosslinking agents, caused dose- and genotype-dependent signatures among indels, substitutions, and rearrangements. Strikingly, both agents induced clustered rearrangements resembling “chromoanasynthesis,” a replication-based mutational signature seen in constitutional genomic disorders, suggesting that interstrand crosslinks may play a pathogenic role in such events. Cisplatin mutagenicity was most pronounced in xpf-1 mutants, suggesting that this gene critically protects cells against platinum chemotherapy. Thus, experimental model systems combined with genome sequencing can recapture and mechanistically explain mutational signatures associated with human disease. PMID:25030888
Signatures of Diversifying Selection in European Pig Breeds

PubMed Central

Wilkinson, Samantha; Lu, Zen H.; Megens, Hendrik-Jan; Archibald, Alan L.; Haley, Chris; Jackson, Ian J.; Groenen, Martien A. M.; Crooijmans, Richard P. M. A.; Ogden, Rob; Wiener, Pamela

2013-01-01

Following domestication, livestock breeds have experienced intense selection pressures for the development of desirable traits. This has resulted in a large diversity of breeds that display variation in many phenotypic traits, such as coat colour, muscle composition, early maturity, growth rate, body size, reproduction, and behaviour. To better understand the relationship between genomic composition and phenotypic diversity arising from breed development, the genomes of 13 traditional and commercial European pig breeds were scanned for signatures of diversifying selection using the Porcine60K SNP chip, applying a between-population (differentiation) approach. Signatures of diversifying selection between breeds were found in genomic regions associated with traits related to breed standard criteria, such as coat colour and ear morphology. Amino acid differences in the EDNRB gene appear to be associated with one of these signatures, and variation in the KITLG gene may be associated with another. Other selection signals were found in genomic regions including QTLs and genes associated with production traits such as reproduction, growth, and fat deposition. Some selection signatures were associated with regions showing evidence of introgression from Asian breeds. When the European breeds were compared with wild boar, genomic regions with high levels of differentiation harboured genes related to bone formation, growth, and fat deposition. PMID:23637623
An algorithm of discovering signatures from DNA databases on a computer cluster.

PubMed

Lee, Hsiao Ping; Sheu, Tzu-Fang

2014-10-05

Signatures are short sequences that are unique and not similar to any other sequence in a database that can be used as the basis to identify different species. Even though several signature discovery algorithms have been proposed in the past, these algorithms require the entirety of databases to be loaded in the memory, thus restricting the amount of data that they can process. It makes those algorithms unable to process databases with large amounts of data. Also, those algorithms use sequential models and have slower discovery speeds, meaning that the efficiency can be improved. In this research, we are debuting the utilization of a divide-and-conquer strategy in signature discovery and have proposed a parallel signature discovery algorithm on a computer cluster. The algorithm applies the divide-and-conquer strategy to solve the problem posed to the existing algorithms where they are unable to process large databases and uses a parallel computing mechanism to effectively improve the efficiency of signature discovery. Even when run with just the memory of regular personal computers, the algorithm can still process large databases such as the human whole-genome EST database which were previously unable to be processed by the existing algorithms. The algorithm proposed in this research is not limited by the amount of usable memory and can rapidly find signatures in large databases, making it useful in applications such as Next Generation Sequencing and other large database analysis and processing. The implementation of the proposed algorithm is available at http://www.cs.pu.edu.tw/~fang/DDCSDPrograms/DDCSD.htm.
Cetaceans evolution: insights from the genome sequences of common minke whales.

PubMed

Park, Jung Youn; An, Yong-Rock; Kanda, Naohisa; An, Chul-Min; An, Hye Suck; Kang, Jung-Ha; Kim, Eun Mi; An, Du-Hae; Jung, Hojin; Joung, Myunghee; Park, Myung Hum; Yoon, Sook Hee; Lee, Bo-Young; Lee, Taeheon; Kim, Kyu-Won; Park, Won Cheoul; Shin, Dong Hyun; Lee, Young Sub; Kim, Jaemin; Kwak, Woori; Kim, Hyeon Jeong; Kwon, Young-Jun; Moon, Sunjin; Kim, Yuseob; Burt, David W; Cho, Seoae; Kim, Heebal

2015-01-22

Whales have captivated the human imagination for millennia. These incredible cetaceans are the only mammals that have adapted to life in the open oceans and have been a source of human food, fuel and tools around the globe. The transition from land to water has led to various aquatic specializations related to hairless skin and ability to regulate their body temperature in cold water. We present four common minke whale (Balaenoptera acutorostrata) genomes with depth of ×13 ~ ×17 coverage and perform resequencing technology without a reference sequence. Our results indicated the time to the most recent common ancestors of common minke whales to be about 2.3574 (95% HPD, 1.1521 - 3.9212) million years ago. Further, we found that genes associated with epilation and tooth-development showed signatures of positive selection, supporting the morphological uniqueness of whales. This whole-genome sequencing offers a chance to better understand the evolutionary journey of one of the largest mammals on earth.
Deciphering the recent phylogenetic expansion of the originally deeply rooted Mycobacterium tuberculosis lineage 7.

PubMed

Yimer, Solomon A; Namouchi, Amine; Zegeye, Ephrem Debebe; Holm-Hansen, Carol; Norheim, Gunnstein; Abebe, Markos; Aseffa, Abraham; Tønjum, Tone

2016-06-30

A deeply rooted phylogenetic lineage of Mycobacterium tuberculosis (M. tuberculosis) termed lineage 7 was discovered in Ethiopia. Whole genome sequencing of 30 lineage 7 strains from patients in Ethiopia was performed. Intra-lineage genome variation was defined and unique characteristics identified with a focus on genes involved in DNA repair, recombination and replication (3R genes). More than 800 mutations specific to M. tuberculosis lineage 7 strains were identified. The proportion of non-synonymous single nucleotide polymorphisms (nsSNPs) in 3R genes was higher after the recent expansion of M. tuberculosis lineage 7 strain started. The proportion of nsSNPs in genes involved in inorganic ion transport and metabolism was significantly higher before the expansion began. A total of 22346 bp deletions were observed. Lineage 7 strains also exhibited a high number of mutations in genes involved in carbohydrate transport and metabolism, transcription, energy production and conversion. We have identified unique genomic signatures of the lineage 7 strains. The high frequency of nsSNP in 3R genes after the phylogenetic expansion may have contributed to recent variability and adaptation. The abundance of mutations in genes involved in inorganic ion transport and metabolism before the expansion period may indicate an adaptive response of lineage 7 strains to enable survival, potentially under environmental stress exposure. As lineage 7 strains originally were phylogenetically deeply rooted, this may indicate fundamental adaptive genomic pathways affecting the fitness of M. tuberculosis as a species.
Calcium/calmodulin-mediated signal network in plants

NASA Technical Reports Server (NTRS)

Yang, Tianbao; Poovaiah, B. W.

2003-01-01

Various extracellular stimuli elicit specific calcium signatures that can be recognized by different calcium sensors. Calmodulin, the predominant calcium receptor, is one of the best-characterized calcium sensors in eukaryotes. In recent years, completion of the Arabidopsis genome project and advances in functional genomics have helped to identify and characterize numerous calmodulin-binding proteins in plants. There are some similarities in Ca(2+)/calmodulin-mediated signaling in plants and animals. However, plants possess multiple calmodulin genes and many calmodulin target proteins, including unique protein kinases and transcription factors. Some of these proteins are likely to act as "hubs" during calcium signal transduction. Hence, a better understanding of the function of these calmodulin target proteins should help in deciphering the Ca(2+)/calmodulin-mediated signal network and its role in plant growth, development and response to environmental stimuli.
Genetic diversity and genomic signatures of selection among cattle breeds from Siberia, eastern and northern Europe.

PubMed

Iso-Touru, T; Tapio, M; Vilkki, J; Kiseleva, T; Ammosov, I; Ivanova, Z; Popov, R; Ozerov, M; Kantanen, J

2016-12-01

Domestication in the near eastern region had a major impact on the gene pool of humpless taurine cattle (Bos taurus). As a result of subsequent natural and artificial selection, hundreds of different breeds have evolved, displaying a broad range of phenotypic traits. Here, 10 Eurasian B. taurus breeds from different biogeographic and production conditions, which exhibit different demographic histories and have been under artificial selection at various intensities, were investigated using the Illumina BovineSNP50 panel to understand their genetic diversity and population structure. In addition, we scanned genomes from eight breeds for signatures of diversifying selection. Our population structure analysis indicated six distinct breed groups, the most divergent being the Yakutian cattle from Siberia. Selection signals were shared (experimental P-value < 0.01) with more than four breeds on chromosomes 6, 7, 13, 16 and 22. The strongest selection signals in the Yakutian cattle were found on chromosomes 7 and 21, where a miRNA gene and genes related to immune system processes are respectively located. In general, genomic regions indicating selection overlapped with known QTL associated with milk production (e.g. on chromosome 19), reproduction (e.g. on chromosome 24) and meat quality (e.g. on chromosome 7). The selection map created in this study shows that native cattle breeds and their genetic resources represent unique material for future breeding. © 2016 Stichting International Foundation for Animal Genetics.
Genetic signatures of natural selection in a model invasive ascidian

PubMed Central

Lin, Yaping; Chen, Yiyong; Yi, Changho; Fong, Jonathan J.; Kim, Won; Rius, Marc; Zhan, Aibin

2017-01-01

Invasive species represent promising models to study species’ responses to rapidly changing environments. Although local adaptation frequently occurs during contemporary range expansion, the associated genetic signatures at both population and genomic levels remain largely unknown. Here, we use genome-wide gene-associated microsatellites to investigate genetic signatures of natural selection in a model invasive ascidian, Ciona robusta. Population genetic analyses of 150 individuals sampled in Korea, New Zealand, South Africa and Spain showed significant genetic differentiation among populations. Based on outlier tests, we found high incidence of signatures of directional selection at 19 loci. Hitchhiking mapping analyses identified 12 directional selective sweep regions, and all selective sweep windows on chromosomes were narrow (~8.9 kb). Further analyses indentified 132 candidate genes under selection. When we compared our genetic data and six crucial environmental variables, 16 putatively selected loci showed significant correlation with these environmental variables. This suggests that the local environmental conditions have left significant signatures of selection at both population and genomic levels. Finally, we identified “plastic” genomic regions and genes that are promising regions to investigate evolutionary responses to rapid environmental change in C. robusta. PMID:28266616
C. elegans whole-genome sequencing reveals mutational signatures related to carcinogens and DNA repair deficiency.

PubMed

Meier, Bettina; Cooke, Susanna L; Weiss, Joerg; Bailly, Aymeric P; Alexandrov, Ludmil B; Marshall, John; Raine, Keiran; Maddison, Mark; Anderson, Elizabeth; Stratton, Michael R; Gartner, Anton; Campbell, Peter J

2014-10-01

Mutation is associated with developmental and hereditary disorders, aging, and cancer. While we understand some mutational processes operative in human disease, most remain mysterious. We used Caenorhabditis elegans whole-genome sequencing to model mutational signatures, analyzing 183 worm populations across 17 DNA repair-deficient backgrounds propagated for 20 generations or exposed to carcinogens. The baseline mutation rate in C. elegans was approximately one per genome per generation, not overtly altered across several DNA repair deficiencies over 20 generations. Telomere erosion led to complex chromosomal rearrangements initiated by breakage-fusion-bridge cycles and completed by simultaneously acquired, localized clusters of breakpoints. Aflatoxin B1 induced substitutions of guanines in a GpC context, as observed in aflatoxin-induced liver cancers. Mutational burden increased with impaired nucleotide excision repair. Cisplatin and mechlorethamine, DNA crosslinking agents, caused dose- and genotype-dependent signatures among indels, substitutions, and rearrangements. Strikingly, both agents induced clustered rearrangements resembling "chromoanasynthesis," a replication-based mutational signature seen in constitutional genomic disorders, suggesting that interstrand crosslinks may play a pathogenic role in such events. Cisplatin mutagenicity was most pronounced in xpf-1 mutants, suggesting that this gene critically protects cells against platinum chemotherapy. Thus, experimental model systems combined with genome sequencing can recapture and mechanistically explain mutational signatures associated with human disease. © 2014 Meier et al.; Published by Cold Spring Harbor Laboratory Press.
Comparative genomics of Beauveria bassiana: uncovering signatures of virulence against mosquitoes.

PubMed

Valero-Jiménez, Claudio A; Faino, Luigi; Spring In't Veld, Daphne; Smit, Sandra; Zwaan, Bas J; van Kan, Jan A L

2016-12-01

Entomopathogenic fungi such as Beauveria bassiana are promising biological agents for control of malaria mosquitoes. Indeed, infection with B. bassiana reduces the lifespan of mosquitoes in the laboratory and in the field. Natural isolates of B. bassiana show up to 10-fold differences in virulence between the most and the least virulent isolate. In this study, we sequenced the genomes of five isolates representing the extremes of low/high virulence and three RNA libraries, and applied a genome comparison approach to uncover genetic mechanisms underpinning virulence. A high-quality, near-complete genome assembly was achieved for the highly virulent isolate Bb8028, which was compared to the assemblies of the four other isolates. Whole genome analysis showed a high level of genetic diversity between the five isolates (2.85-16.8 SNPs/kb), which grouped into two distinct phylogenetic clusters. Mating type gene analysis revealed the presence of either the MAT1-1-1 or the MAT1-2-1 gene. Moreover, a putative new MAT gene (MAT1-2-8) was detected in the MAT1-2 locus. Comparative genome analysis revealed that Bb8028 contains 163 genes exclusive for this isolate. These unique genes have a tendency to cluster in the genome and to be often located near the telomeres. Among the genes unique to Bb8028 are a Non-Ribosomal Peptide Synthetase (NRPS) secondary metabolite gene cluster, a polyketide synthase (PKS) gene, and five genes with homology to bacterial toxins. A survey of candidate virulence genes for B. bassiana is presented. Our results indicate several genes and molecular processes that may underpin virulence towards mosquitoes. Thus, the genome sequences of five isolates of B. bassiana provide a better understanding of the natural variation in virulence and will offer a major resource for future research on this important biological control agent.
Screening of duplicated loci reveals hidden divergence patterns in a complex salmonid genome

USGS Publications Warehouse

Limborg, Morten T.; Larson, Wesley; Seeb, Lisa W.; Seeb, James E.

2017-01-01

A whole-genome duplication (WGD) doubles the entire genomic content of a species and is thought to have catalysed adaptive radiation in some polyploid-origin lineages. However, little is known about general consequences of a WGD because gene duplicates (i.e., paralogs) are commonly filtered in genomic studies; such filtering may remove substantial portions of the genome in data sets from polyploid-origin species. We demonstrate a new method that enables genome-wide scans for signatures of selection at both nonduplicated and duplicated loci by taking locus-specific copy number into account. We apply this method to RAD sequence data from different ecotypes of a polyploid-origin salmonid (Oncorhynchus nerka) and reveal signatures of divergent selection that would have been missed if duplicated loci were filtered. We also find conserved signatures of elevated divergence at pairs of homeologous chromosomes with residual tetrasomic inheritance, suggesting that joint evolution of some nondiverged gene duplicates may affect the adaptive potential of these genes. These findings illustrate that including duplicated loci in genomic analyses enables novel insights into the evolutionary consequences of WGDs and local segmental gene duplications.
Epigenetic Alterations in Epstein-Barr Virus-Associated Diseases.

PubMed

Niller, Hans Helmut; Banati, Ferenc; Salamon, Daniel; Minarovits, Janos

2016-01-01

Latent Epstein-Bar virus genomes undergo epigenetic modifications which are dependent on the respective tissue type and cellular phenotype. These define distinct viral epigenotypes corresponding with latent viral gene expression profiles. Viral Latent Membrane Proteins 1 and 2A can induce cellular DNA methyltransferases, thereby influencing the methylation status of the viral and cellular genomes. Therefore, not only the viral genomes carry epigenetic modifications, but also the cellular genomes adopt major epigenetic alterations upon EBV infection. The distinct cellular epigenotypes of EBV-infected cells differ from the epigenotypes of their normal counterparts. In Burkitt lymphoma (BL), nasopharyngeal carcinoma (NPC) and EBV-associated gastric carcinoma (EBVaGC) significant changes in the host cell methylome with a strong tendency towards CpG island hypermethylation are observed. Hypermethylated genes unique for EBVaGC suggest the existence of an EBV-specific "epigenetic signature". Contrary to the primary malignancies carrying latent EBV genomes, lymphoblastoid cells (LCs) established by EBV infection of peripheral B cells in vitro are characterized by a massive genome-wide demethylation and a significant decrease and redistribution of heterochromatic histone marks. Establishing complete epigenomes of the diverse EBV-associated malignancies shall clarify their similarities and differences and further clarify the contribution of EBV to the pathogenesis, especially for the epithelial malignancies, NPC and EBVaGC.
Revealing misassembled segments in the bovine reference genome by high resolution linkage disequilibrium scan

USDA-ARS?s Scientific Manuscript database

Misassembly signatures, created by shuffling the order of sequences while assembling a genome, can be easily seen by analyzing the unexpected behaviour of the linkage disequilibrium (LD) decay. A heuristic process was proposed to identify those misassembly signatures and presented the ones found in ...
A Perfect Match Genomic Landscape Provides a Unified Framework for the Precise Detection of Variation in Natural and Synthetic Haploid Genomes

PubMed Central

Palacios-Flores, Kim; García-Sotelo, Jair; Castillo, Alejandra; Uribe, Carina; Aguilar, Luis; Morales, Lucía; Gómez-Romero, Laura; Reyes, José; Garciarubio, Alejandro; Boege, Margareta; Dávila, Guillermo

2018-01-01

We present a conceptually simple, sensitive, precise, and essentially nonstatistical solution for the analysis of genome variation in haploid organisms. The generation of a Perfect Match Genomic Landscape (PMGL), which computes intergenome identity with single nucleotide resolution, reveals signatures of variation wherever a query genome differs from a reference genome. Such signatures encode the precise location of different types of variants, including single nucleotide variants, deletions, insertions, and amplifications, effectively introducing the concept of a general signature of variation. The precise nature of variants is then resolved through the generation of targeted alignments between specific sets of sequence reads and known regions of the reference genome. Thus, the perfect match logic decouples the identification of the location of variants from the characterization of their nature, providing a unified framework for the detection of genome variation. We assessed the performance of the PMGL strategy via simulation experiments. We determined the variation profiles of natural genomes and of a synthetic chromosome, both in the context of haploid yeast strains. Our approach uncovered variants that have previously escaped detection. Moreover, our strategy is ideally suited for further refining high-quality reference genomes. The source codes for the automated PMGL pipeline have been deposited in a public repository. PMID:29367403
Proteomic Assessment of Fluid Shifts and Association with Visual Impairment and Intracranial Pressure in Twin Astronauts

NASA Technical Reports Server (NTRS)

Rana, Brinda K.; Stenger, Michael B.; Lee, Stuart M. C.; Macias, Brandon R.; Siamwala, Jamila; Piening, Brian Donald; Hook, Vivian; Ebert, Doug; Patel, Hemal; Smith, Scott;

2016-01-01

BACKGROUND: Astronauts participating in long duration space missions are at an increased risk of physiological disruptions. The development of visual impairment and intracranial pressure (VIIP) syndrome is one of the leading health concerns for crew members on long-duration space missions; microgravity-induced fluid shifts and chronic elevated cabin CO2 may be contributing factors. By studying physiological and molecular changes in one identical twin during his 1-year ISS mission and his ground-based co-twin, this work extends a current NASA-funded investigation to assess space flight induced "Fluid Shifts" in association with the development of VIIP. This twin study uniquely integrates physiological and -omic signatures to further our understanding of the molecular mechanisms underlying space flight-induced VIIP. We are: (i) conducting longitudinal proteomic assessments of plasma to identify fluid regulation-related molecular pathways altered by long-term space flight; and (ii) integrating physiological and proteomic data with genomic data to understand the genomic mechanism by which these proteomic signatures are regulated. PURPOSE: We are exploring proteomic signatures and genomic mechanisms underlying space flight-induced VIIP symptoms with the future goal of developing early biomarkers to detect and monitor the progression of VIIP. This study is first to employ a male monozygous twin pair to systematically determine the impact of fluid distribution in microgravity, integrating a comprehensive set of structural and functional measures with proteomic, metabolomic and genomic data. This project has a broader impact on Earth-based clinical areas, such as traumatic brain injury-induced elevations of intracranial pressure, hydrocephalus, and glaucoma. HYPOTHESIS: We predict that the space-flown twin will experience a space flight-induced alteration in proteins and peptides related to fluid balance, fluid control and brain injury as compared to his pre-flight protein/peptide signatures. Conversely, the trajectory of these protein signatures will remain relatively constant in his ground based co-twin. METHODS: We are using proteomic and standard immunoelectrophoresis techniques to delineate the change in protein signatures throughout the course of a long duration space flight in relation to the development of VIIP. We are also applying a novel cell-based metaboloic organ system assay ("Organs on a Plate") to address how these circulating biomarkers affect physiological processes at the cellular and organ level which could result in VIIP symptoms. These molecular data will be correlated with physiological measures (eg. extra and intracellular fluid volume, vascular filling/flow patterns, MRI, and Optic Coherence Tomography. DISCUSSION: Pre- and in-flight data collection is in progress for the space-flown twin, and similar data have been obtained from the ground-based twin. Biosamples will be batch processed when received from ISS after the conclusion of the 1-year mission. Omic and Physiological measures from the twin astronauts will be compared to similar data being collected on twin subjects who participated in simulated microgravity study. bed rest study.

Population-specific recombination sites within the human MHC region.

PubMed

Lam, T H; Shen, M; Chia, J-M; Chan, S H; Ren, E C

2013-08-01

Genetic rearrangement by recombination is one of the major driving forces for genome evolution, and recombination is known to occur in non-random, discreet recombination sites within the genome. Mapping of recombination sites has proved to be difficult, particularly, in the human MHC region that is complicated by both population variation and highly polymorphic HLA genes. To overcome these problems, HLA-typed individuals from three representative populations: Asian, European and African were used to generate phased HLA haplotypes. Extended haplotype homozygosity (EHH) plots constructed from the phased haplotype data revealed discreet EHH drops corresponding to recombination events and these signatures were observed to be different for each population. Surprisingly, the majority of recombination sites detected are unique to each population, rather than being common. Unique recombination sites account for 56.8% (21/37 of total sites) in the Asian cohort, 50.0% (15/30 sites) in Europeans and 63.2% (24/38 sites) in Africans. Validation carried out at a known sperm typing recombination site of 45 kb (HLA-F-telomeric) showed that EHH was an efficient method to narrow the recombination region to 826 bp, and this was further refined to 660 bp by resequencing. This approach significantly enhanced mapping of the genomic architecture within the human MHC, and will be useful in studies to identify disease risk genes.

Genomic DNA Methylation Signatures Enable Concurrent Diagnosis and Clinical Genetic Variant Classification in Neurodevelopmental Syndromes.

PubMed

Aref-Eshghi, Erfan; Rodenhiser, David I; Schenkel, Laila C; Lin, Hanxin; Skinner, Cindy; Ainsworth, Peter; Paré, Guillaume; Hood, Rebecca L; Bulman, Dennis E; Kernohan, Kristin D; Boycott, Kym M; Campeau, Philippe M; Schwartz, Charles; Sadikovic, Bekim

2018-01-04

Pediatric developmental syndromes present with systemic, complex, and often overlapping clinical features that are not infrequently a consequence of Mendelian inheritance of mutations in genes involved in DNA methylation, establishment of histone modifications, and chromatin remodeling (the "epigenetic machinery"). The mechanistic cross-talk between histone modification and DNA methylation suggests that these syndromes might be expected to display specific DNA methylation signatures that are a reflection of those primary errors associated with chromatin dysregulation. Given the interrelated functions of these chromatin regulatory proteins, we sought to identify DNA methylation epi-signatures that could provide syndrome-specific biomarkers to complement standard clinical diagnostics. In the present study, we examined peripheral blood samples from a large cohort of individuals encompassing 14 Mendelian disorders displaying mutations in the genes encoding proteins of the epigenetic machinery. We demonstrated that specific but partially overlapping DNA methylation signatures are associated with many of these conditions. The degree of overlap among these epi-signatures is minimal, further suggesting that, consistent with the initial event, the downstream changes are unique to every syndrome. In addition, by combining these epi-signatures, we have demonstrated that a machine learning tool can be built to concurrently screen for multiple syndromes with high sensitivity and specificity, and we highlight the utility of this tool in solving ambiguous case subjects presenting with variants of unknown significance, along with its ability to generate accurate predictions for subjects presenting with the overlapping clinical and molecular features associated with the disruption of the epigenetic machinery. Copyright © 2017 American Society of Human Genetics. Published by Elsevier Inc. All rights reserved.
Adaptive genomic divergence under high gene flow between freshwater and brackish-water ecotypes of prickly sculpin (Cottus asper) revealed by Pool-Seq.

PubMed

Dennenmoser, Stefan; Vamosi, Steven M; Nolte, Arne W; Rogers, Sean M

2017-01-01

Understanding the genomic basis of adaptive divergence in the presence of gene flow remains a major challenge in evolutionary biology. In prickly sculpin (Cottus asper), an abundant euryhaline fish in northwestern North America, high genetic connectivity among brackish-water (estuarine) and freshwater (tributary) habitats of coastal rivers does not preclude the build-up of neutral genetic differentiation and emergence of different life history strategies. Because these two habitats present different osmotic niches, we predicted high genetic differentiation at known teleost candidate genes underlying salinity tolerance and osmoregulation. We applied whole-genome sequencing of pooled DNA samples (Pool-Seq) to explore adaptive divergence between two estuarine and two tributary habitats. Paired-end sequence reads were mapped against genomic contigs of European Cottus, and the gene content of candidate regions was explored based on comparisons with the threespine stickleback genome. Genes showing signals of repeated differentiation among brackish-water and freshwater habitats included functions such as ion transport and structural permeability in freshwater gills, which suggests that local adaptation to different osmotic niches might contribute to genomic divergence among habitats. Overall, the presence of both repeated and unique signatures of differentiation across many loci scattered throughout the genome is consistent with polygenic adaptation from standing genetic variation and locally variable selection pressures in the early stages of life history divergence. © 2016 John Wiley & Sons Ltd.
Breeding signatures of rice improvement revealed by a genomic variation map from a large germplasm collection

PubMed Central

Xie, Weibo; Wang, Gongwei; Yuan, Meng; Yao, Wen; Lyu, Kai; Zhao, Hu; Yang, Meng; Li, Pingbo; Zhang, Xing; Yuan, Jing; Wang, Quanxiu; Liu, Fang; Dong, Huaxia; Zhang, Lejing; Li, Xinglei; Meng, Xiangzhou; Zhang, Wan; Xiong, Lizhong; He, Yuqing; Wang, Shiping; Yu, Sibin; Xu, Caiguo; Luo, Jie; Li, Xianghua; Xiao, Jinghua; Lian, Xingming; Zhang, Qifa

2015-01-01

Intensive rice breeding over the past 50 y has dramatically increased productivity especially in the indica subspecies, but our knowledge of the genomic changes associated with such improvement has been limited. In this study, we analyzed low-coverage sequencing data of 1,479 rice accessions from 73 countries, including landraces and modern cultivars. We identified two major subpopulations, indica I (IndI) and indica II (IndII), in the indica subspecies, which corresponded to the two putative heterotic groups resulting from independent breeding efforts. We detected 200 regions spanning 7.8% of the rice genome that had been differentially selected between IndI and IndII, and thus referred to as breeding signatures. These regions included large numbers of known functional genes and loci associated with important agronomic traits revealed by genome-wide association studies. Grain yield was positively correlated with the number of breeding signatures in a variety, suggesting that the number of breeding signatures in a line may be useful for predicting agronomic potential and the selected loci may provide targets for rice improvement. PMID:26358652
Global Implementation of Genomic Medicine: We Are Not Alone

PubMed Central

Manolio, Teri A.; Abramowicz, Marc; Al-Mulla, Fahd; Anderson, Warwick; Balling, Rudi; Berger, Adam C.; Bleyl, Steven; Chakravarti, Aravinda; Chantratita, Wasun; Chisholm, Rex L.; Dissanayake, Vajira H. W.; Dunn, Michael; Dzau, Victor J.; Han, Bok-Ghee; Hubbard, Tim; Kolbe, Anne; Korf, Bruce; Kubo, Michiaki; Lasko, Paul; Leego, Erkki; Mahasirimongkol, Surakameth; Majumdar, Partha P.; Matthijs, Gert; McLeod, Howard L.; Metspalu, Andres; Meulien, Pierre; Miyano, Satoru; Naparstek, Yaakov; O’Rourke, P. Pearl; Patrinos, George P.; Rehm, Heidi L.; Relling, Mary V.; Rennert, Gad; Rodriguez, Laura Lyman; Roden, Dan M.; Shuldiner, Alan R.; Sinha, Sukdev; Tan, Patrick; Ulfendahl, Mats; Ward, Robyn; Williams, Marc S.; Wong, John E.L.; Green, Eric D.; Ginsburg, Geoffrey S.

2016-01-01

Advances in high-throughput genomic technologies coupled with a growing number of genomic results potentially useful in clinical care have led to ground-breaking genomic medicine implementation programs in various nations. Many of these innovative programs capitalize on unique local capabilities arising from the structure of their health care systems or their cultural or political milieu, as well as from unusual burdens of disease or risk alleles. Many such programs are being conducted in relative isolation and might benefit from sharing of approaches and lessons learned in other nations. The National Human Genome Research Institute recently brought together 25 of these groups from around the world to describe and compare projects, examine the current state of implementation and desired near-term capabilities, and identify opportunities for collaboration to promote the responsible implementation of genomic medicine. The wide variety of nascent programs in diverse settings demonstrates that implementation of genomic medicine is expanding globally in varied and highly innovative ways. Opportunities for collaboration abound in the areas of evidence generation, health information technology, education, workforce development, pharmacogenomics, and policy and regulatory issues. Several international organizations that are already facilitating effective research collaborations should engage to ensure implementation proceeds collaboratively without potentially wasteful duplication. Efforts to coalesce these groups around concrete but compelling signature projects, such as global eradication of genetically-mediated drug reactions or developing a truly global genomic variant data resource across a wide number of ethnicities, would accelerate appropriate implementation of genomics to improve clinical care world-wide. PMID:26041702
Prediction of Chemical Respiratory Sensitizers Using GARD, a Novel In Vitro Assay Based on a Genomic Biomarker Signature

PubMed Central

Albrekt, Ann-Sofie; Borrebaeck, Carl A. K.; Lindstedt, Malin

2015-01-01

Background Repeated exposure to certain low molecular weight (LMW) chemical compounds may result in development of allergic reactions in the skin or in the respiratory tract. In most cases, a certain LMW compound selectively sensitize the skin, giving rise to allergic contact dermatitis (ACD), or the respiratory tract, giving rise to occupational asthma (OA). To limit occurrence of allergic diseases, efforts are currently being made to develop predictive assays that accurately identify chemicals capable of inducing such reactions. However, while a few promising methods for prediction of skin sensitization have been described, to date no validated method, in vitro or in vivo, exists that is able to accurately classify chemicals as respiratory sensitizers. Results Recently, we presented the in vitro based Genomic Allergen Rapid Detection (GARD) assay as a novel testing strategy for classification of skin sensitizing chemicals based on measurement of a genomic biomarker signature. We have expanded the applicability domain of the GARD assay to classify also respiratory sensitizers by identifying a separate biomarker signature containing 389 differentially regulated genes for respiratory sensitizers in comparison to non-respiratory sensitizers. By using an independent data set in combination with supervised machine learning, we validated the assay, showing that the identified genomic biomarker is able to accurately classify respiratory sensitizers. Conclusions We have identified a genomic biomarker signature for classification of respiratory sensitizers. Combining this newly identified biomarker signature with our previously identified biomarker signature for classification of skin sensitizers, we have developed a novel in vitro testing strategy with a potent ability to predict both skin and respiratory sensitization in the same sample. PMID:25760038
Insights into Morphology and Disease from the Dog Genome Project

PubMed Central

Schoenebeck, Jeffrey J.; Ostrander, Elaine A.

2017-01-01

Although most modern dog breeds are less than 200 years old, the symbiosis between man and dog is ancient. Since prehistoric times, repeated selection events have transformed the wolf into man’s guardians, laborers, athletes, and companions. The rapid transformation from pack predator to loyal companion is a feat that is arguably unique among domesticated animals. How this transformation came to pass remained a biological mystery until recently: Within the past decade, the deployment of genomic approaches to study population structure, detect signatures of selection, and identify genetic variants that underlie canine phenotypes is ushering into focus novel biological mechanisms that make dogs remarkable. Ironically, the very practices responsible for breed formation also spurned morbidity; today, many diseases are correlated with breed identity. In this review, we discuss man’s best friend in the context of a genetic model to understand paradigms of heritable phenotypes, both desirable and disadvantageous. PMID:25062362
Personalizing therapy for colorectal cancer.

PubMed

Wong, Ashley; Ma, Brigette B Y

2014-01-01

Colorectal cancer (CRC) is the third most commonly diagnosed cancer worldwide. Several important scientific discoveries in the molecular biology of CRC have changed clinical practice in oncology. These included the comprehensive genome-wide profiling of CRC by the Cancer Genome Atlas Network, the discovery of mutations along the RAS-RAF signaling pathway as major determinants of response to antibodies against the epidermal growth factor receptor, the elucidation of new molecular subsets of CRC or gene signatures that may predict clinical outcome after adjuvant chemotherapy, and the innovative targeting of the family of vascular endothelial growth factor and receptors. These new data have allowed oncologists to individualize drug therapy on the basis of a patient's tumor's unique molecular profile, especially in the management of metastatic CRC. This review article will discuss the progress of personalized medicine in the contemporary management of CRC. Copyright © 2014 AGA Institute. Published by Elsevier Inc. All rights reserved.
Evolutionary analysis of vision genes identifies potential drivers of visual differences between giraffe and okapi

PubMed Central

Agaba, Morris; Cavener, Douglas R.

2017-01-01

Background The capacity of visually oriented species to perceive and respond to visual signal is integral to their evolutionary success. Giraffes are closely related to okapi, but the two species have broad range of phenotypic differences including their visual capacities. Vision studies rank giraffe’s visual acuity higher than all other artiodactyls despite sharing similar vision ecological determinants with many of them. The extent to which the giraffe’s unique visual capacity and its difference with okapi is reflected by changes in their vision genes is not understood. Methods The recent availability of giraffe and okapi genomes provided opportunity to identify giraffe and okapi vision genes. Multiple strategies were employed to identify thirty-six candidate mammalian vision genes in giraffe and okapi genomes. Quantification of selection pressure was performed by a combination of branch-site tests of positive selection and clade models of selection divergence through comparing giraffe and okapi vision genes and orthologous sequences from other mammals. Results Signatures of selection were identified in key genes that could potentially underlie giraffe and okapi visual adaptations. Importantly, some genes that contribute to optical transparency of the eye and those that are critical in light signaling pathway were found to show signatures of adaptive evolution or selection divergence. Comparison between giraffe and other ruminants identifies significant selection divergence in CRYAA and OPN1LW. Significant selection divergence was identified in SAG while positive selection was detected in LUM when okapi is compared with ruminants and other mammals. Sequence analysis of OPN1LW showed that at least one of the sites known to affect spectral sensitivity of the red pigment is uniquely divergent between giraffe and other ruminants. Discussion By taking a systemic approach to gene function in vision, the results provide the first molecular clues associated with giraffe and okapi vision adaptations. At least some of the genes that exhibit signature of selection may reflect adaptive response to differences in giraffe and okapi habitat. We hypothesize that requirement for long distance vision associated with predation and communication with conspecifics likely played an important role in the adaptive pressure on giraffe vision genes. PMID:28396824
Evolutionary analysis of vision genes identifies potential drivers of visual differences between giraffe and okapi.

PubMed

Ishengoma, Edson; Agaba, Morris; Cavener, Douglas R

2017-01-01

The capacity of visually oriented species to perceive and respond to visual signal is integral to their evolutionary success. Giraffes are closely related to okapi, but the two species have broad range of phenotypic differences including their visual capacities. Vision studies rank giraffe's visual acuity higher than all other artiodactyls despite sharing similar vision ecological determinants with many of them. The extent to which the giraffe's unique visual capacity and its difference with okapi is reflected by changes in their vision genes is not understood. The recent availability of giraffe and okapi genomes provided opportunity to identify giraffe and okapi vision genes. Multiple strategies were employed to identify thirty-six candidate mammalian vision genes in giraffe and okapi genomes. Quantification of selection pressure was performed by a combination of branch-site tests of positive selection and clade models of selection divergence through comparing giraffe and okapi vision genes and orthologous sequences from other mammals. Signatures of selection were identified in key genes that could potentially underlie giraffe and okapi visual adaptations. Importantly, some genes that contribute to optical transparency of the eye and those that are critical in light signaling pathway were found to show signatures of adaptive evolution or selection divergence. Comparison between giraffe and other ruminants identifies significant selection divergence in CRYAA and OPN1LW . Significant selection divergence was identified in SAG while positive selection was detected in LUM when okapi is compared with ruminants and other mammals. Sequence analysis of OPN1LW showed that at least one of the sites known to affect spectral sensitivity of the red pigment is uniquely divergent between giraffe and other ruminants. By taking a systemic approach to gene function in vision, the results provide the first molecular clues associated with giraffe and okapi vision adaptations. At least some of the genes that exhibit signature of selection may reflect adaptive response to differences in giraffe and okapi habitat. We hypothesize that requirement for long distance vision associated with predation and communication with conspecifics likely played an important role in the adaptive pressure on giraffe vision genes.
Revisiting demographic processes in cattle with genome-wide population genetic analysis

PubMed Central

Orozco-terWengel, Pablo; Barbato, Mario; Nicolazzi, Ezequiel; Biscarini, Filippo; Milanesi, Marco; Davies, Wyn; Williams, Don; Stella, Alessandra; Ajmone-Marsan, Paolo; Bruford, Michael W.

2015-01-01

The domestication of the aurochs took place approximately 10,000 years ago giving rise to the two main types of domestic cattle known today, taurine (Bos taurus) domesticated somewhere on or near the Fertile Crescent, and indicine (Bos indicus) domesticated in the Indus Valley. However, although cattle have historically played a prominent role in human society the exact origin of many extant breeds is not well known. Here we used a combination of medium and high-density Illumina Bovine SNP arrays (i.e., ~54,000 and ~770,000 SNPs, respectively), genotyped for over 1300 animals representing 56 cattle breeds, to describe the relationships among major European cattle breeds and detect patterns of admixture among them. Our results suggest modern cross-breeding and ancient hybridisation events have both played an important role, including with animals of indicine origin. We use these data to identify signatures of selection reflecting both domestication (hypothesized to produce a common signature across breeds) and local adaptation (predicted to exhibit a signature of selection unique to a single breed or group of related breeds with a common history) to uncover additional demographic complexity of modern European cattle. PMID:26082794
Molecular epidemiology demonstrated three emerging clusters of human immunodeficiency virus type 1 subtype B infection in Hong Kong.

PubMed

Leung, Tommy W C; Mak, Darwin; Wong, K H; Wang, Y; Song, Y H; Tsang, D N C; Wong, C; Shao, Y M; Lim, W L

2008-07-01

We conducted a molecular epidemiological study on newly diagnosed human immunodeficiency virus type 1 (HIV-1)-infected patients in Hong Kong to identify the epidemiological linkage of HIV-1 infection in the locality. Reverse transcription polymerase chain reaction (RT-PCR) for HIV-1 was performed on newly diagnosed HIV-1-positive sera collected from January 2002 to December 2006. PCR products correspond to the env C2V3V4 region and gag p17/p24 junction of the HIV-1 genome were nucleotide sequenced. Phylogenetic analyses performed on the acquired nucleotide sequences revealed that CRF01_AE and subtype B were the two dominant HIV-1 subtypes. Analyses also demonstrated the presence of three emerging HIV-1 clusters among the subtype B sequences in Hong Kong. Individual cluster possesses a unique cluster-specific amino acid signature for identification. Data show that one of the clusters (Cluster I) is rapidly expanding. In addition to the unique cluster-specific amino acid signature, the majority of sequences in Cluster I harbor a 6-amino acid insertion at the gag p17/p24 junction in a region that is thought to be closely associated with HIV-1 infectivity.
Genome-wide signals of positive selection in human evolution

PubMed Central

Enard, David; Messer, Philipp W.; Petrov, Dmitri A.

2014-01-01

The role of positive selection in human evolution remains controversial. On the one hand, scans for positive selection have identified hundreds of candidate loci, and the genome-wide patterns of polymorphism show signatures consistent with frequent positive selection. On the other hand, recent studies have argued that many of the candidate loci are false positives and that most genome-wide signatures of adaptation are in fact due to reduction of neutral diversity by linked deleterious mutations, known as background selection. Here we analyze human polymorphism data from the 1000 Genomes Project and detect signatures of positive selection once we correct for the effects of background selection. We show that levels of neutral polymorphism are lower near amino acid substitutions, with the strongest reduction observed specifically near functionally consequential amino acid substitutions. Furthermore, amino acid substitutions are associated with signatures of recent adaptation that should not be generated by background selection, such as unusually long and frequent haplotypes and specific distortions in the site frequency spectrum. We use forward simulations to argue that the observed signatures require a high rate of strongly adaptive substitutions near amino acid changes. We further demonstrate that the observed signatures of positive selection correlate better with the presence of regulatory sequences, as predicted by the ENCODE Project Consortium, than with the positions of amino acid substitutions. Our results suggest that adaptation was frequent in human evolution and provide support for the hypothesis of King and Wilson that adaptive divergence is primarily driven by regulatory changes. PMID:24619126
A Perfect Match Genomic Landscape Provides a Unified Framework for the Precise Detection of Variation in Natural and Synthetic Haploid Genomes.

PubMed

Palacios-Flores, Kim; García-Sotelo, Jair; Castillo, Alejandra; Uribe, Carina; Aguilar, Luis; Morales, Lucía; Gómez-Romero, Laura; Reyes, José; Garciarubio, Alejandro; Boege, Margareta; Dávila, Guillermo

2018-04-01

We present a conceptually simple, sensitive, precise, and essentially nonstatistical solution for the analysis of genome variation in haploid organisms. The generation of a Perfect Match Genomic Landscape (PMGL), which computes intergenome identity with single nucleotide resolution, reveals signatures of variation wherever a query genome differs from a reference genome. Such signatures encode the precise location of different types of variants, including single nucleotide variants, deletions, insertions, and amplifications, effectively introducing the concept of a general signature of variation. The precise nature of variants is then resolved through the generation of targeted alignments between specific sets of sequence reads and known regions of the reference genome. Thus, the perfect match logic decouples the identification of the location of variants from the characterization of their nature, providing a unified framework for the detection of genome variation. We assessed the performance of the PMGL strategy via simulation experiments. We determined the variation profiles of natural genomes and of a synthetic chromosome, both in the context of haploid yeast strains. Our approach uncovered variants that have previously escaped detection. Moreover, our strategy is ideally suited for further refining high-quality reference genomes. The source codes for the automated PMGL pipeline have been deposited in a public repository. Copyright © 2018 by the Genetics Society of America.
GEAR: genomic enrichment analysis of regional DNA copy number changes.

PubMed

Kim, Tae-Min; Jung, Yu-Chae; Rhyu, Mun-Gan; Jung, Myeong Ho; Chung, Yeun-Jun

2008-02-01

We developed an algorithm named GEAR (genomic enrichment analysis of regional DNA copy number changes) for functional interpretation of genome-wide DNA copy number changes identified by array-based comparative genomic hybridization. GEAR selects two types of chromosomal alterations with potential biological relevance, i.e. recurrent and phenotype-specific alterations. Then it performs functional enrichment analysis using a priori selected functional gene sets to identify primary and clinical genomic signatures. The genomic signatures identified by GEAR represent functionally coordinated genomic changes, which can provide clues on the underlying molecular mechanisms related to the phenotypes of interest. GEAR can help the identification of key molecular functions that are activated or repressed in the tumor genomes leading to the improved understanding on the tumor biology. GEAR software is available with online manual in the website, http://www.systemsbiology.co.kr/GEAR/.
Trespassing cancer cells: ‘fingerprinting’ invasive protrusions reveals metastatic culprits

PubMed Central

Klemke, Richard L.

2012-01-01

Metastatic cancer cells produce invasive membrane protrusions called invadopodia and pseudopodia, which play a central role in driving cancer cell dissemination in the body. Malignant cells use these structures to attach to and degrade extracellular matrix proteins, generate force for cell locomotion, and to penetrate the vasculature. Recent work using unique subcellular fractionation methodologies combined with spatial genomic, proteomic, and phosphoproteomic profiling has provided insight into the invadopodiome and pseudopodiome signaling networks that control the protrusion of invasive membranes. Here I highlight how these powerful spatial “omics” approaches reveal important signatures of metastatic cancer cells and possible new therapeutic targets aimed at treating metastatic disease. PMID:22980730
A genome-wide scan for signatures of differential artificial selection in ten cattle breeds.

PubMed

Rothammer, Sophie; Seichter, Doris; Förster, Martin; Medugorac, Ivica

2013-12-21

Since the times of domestication, cattle have been continually shaped by the influence of humans. Relatively recent history, including breed formation and the still enduring enormous improvement of economically important traits, is expected to have left distinctive footprints of selection within the genome. The purpose of this study was to map genome-wide selection signatures in ten cattle breeds and thus improve the understanding of the genome response to strong artificial selection and support the identification of the underlying genetic variants of favoured phenotypes. We analysed 47,651 single nucleotide polymorphisms (SNP) using Cross Population Extended Haplotype Homozygosity (XP-EHH). We set the significance thresholds using the maximum XP-EHH values of two essentially artificially unselected breeds and found up to 229 selection signatures per breed. Through a confirmation process we verified selection for three distinct phenotypes typical for one breed (polledness in Galloway, double muscling in Blanc-Bleu Belge and red coat colour in Red Holstein cattle). Moreover, we detected six genes strongly associated with known QTL for beef or dairy traits (TG, ABCG2, DGAT1, GH1, GHR and the Casein Cluster) within selection signatures of at least one breed. A literature search for genes lying in outstanding signatures revealed further promising candidate genes. However, in concordance with previous genome-wide studies, we also detected a substantial number of signatures without any yet known gene content. These results show the power of XP-EHH analyses in cattle to discover promising candidate genes and raise the hope of identifying phenotypically important variants in the near future. The finding of plausible functional candidates in some short signatures supports this hope. For instance, MAP2K6 is the only annotated gene of two signatures detected in Galloway and Gelbvieh cattle and is already known to be associated with carcass weight, back fat thickness and marbling score in Korean beef cattle. Based on the confirmation process and literature search we deduce that XP-EHH is able to uncover numerous artificial selection targets in subpopulations of domesticated animals.
Comparative genomics of four closely related Clostridium perfringens bacteriophages reveals variable evolution among core genes with therapeutic potential

PubMed Central

2011-01-01

Background Because biotechnological uses of bacteriophage gene products as alternatives to conventional antibiotics will require a thorough understanding of their genomic context, we sequenced and analyzed the genomes of four closely related phages isolated from Clostridium perfringens, an important agricultural and human pathogen. Results Phage whole-genome tetra-nucleotide signatures and proteomic tree topologies correlated closely with host phylogeny. Comparisons of our phage genomes to 26 others revealed three shared COGs; of particular interest within this core genome was an endolysin (PF01520, an N-acetylmuramoyl-L-alanine amidase) and a holin (PF04531). Comparative analyses of the evolutionary history and genomic context of these common phage proteins revealed two important results: 1) strongly significant host-specific sequence variation within the endolysin, and 2) a protein domain architecture apparently unique to our phage genomes in which the endolysin is located upstream of its associated holin. Endolysin sequences from our phages were one of two very distinct genotypes distinguished by variability within the putative enzymatically-active domain. The shared or core genome was comprised of genes with multiple sequence types belonging to five pfam families, and genes belonging to 12 pfam families, including the holin genes, which were nearly identical. Conclusions Significant genomic diversity exists even among closely-related bacteriophages. Holins and endolysins represent conserved functions across divergent phage genomes and, as we demonstrate here, endolysins can have significant variability and host-specificity even among closely-related genomes. Endolysins in our phage genomes may be subject to different selective pressures than the rest of the genome. These findings may have important implications for potential biotechnological applications of phage gene products. PMID:21631945
Evidence of evolutionary history and selective sweeps in the genome of Meishan pig reveals its genetic and phenotypic characterization.

PubMed

Zhao, Pengju; Yu, Ying; Feng, Wen; Du, Heng; Yu, Jian; Kang, Huimin; Zheng, Xianrui; Wang, Zhiquan; Liu, George E; Ernst, Catherine W; Ran, Xueqin; Wang, Jiafu; Liu, Jian-Feng

2018-05-01

Meishan is a pig breed indigenous to China and famous for its high fecundity. The traits of Meishan are strongly associated with its distinct evolutionary history and domestication. However, the genomic evidence linking the domestication of Meishan pigs with its unique features is still poorly understood. The goal of this study is to investigate the genomic signatures and evolutionary evidence related to the phenotypic traits of Meishan via large-scale sequencing. We found that the unique domestication of Meishan pigs occurred in the Taihu Basin area between the Majiabang and Liangzhu Cultures, during which 300 protein-coding genes have underwent positive selection. Notably, enrichment of the FoxO signaling pathway with significant enrichment signal and the harbored gene IGF1R were likely associated with the high fertility of Meishan pigs. Moreover, NFKB1 exhibited strong selective sweep signals and positively participated in hyaluronan biosynthesis as the key gene of NF-kB signaling, which may have resulted in the wrinkled skin and face of Meishan pigs. Particularly, three population-specific synonymous single-nucleotide variants occurred in PYROXD1, MC1R, and FAM83G genes; the T305C substitution in the MCIR gene explained the black coat of the Meishan pigs well. In addition, the shared haplotypes between Meishan and Duroc breeds confirmed the previous Asian-derived introgression and demonstrated the specific contribution of Meishan pigs. These findings will help us explain the unique genetic and phenotypic characteristics of Meishan pigs and offer a plausible method for their utilization of Meishan pigs as valuable genetic resources in pig breeding and as an animal model for human wrinkled skin disease research.
Benefits of Genomic Insights and CRISPR-Cas Signatures to Monitor Potential Pathogens across Drinking Water Production and Distribution Systems

PubMed Central

Zhang, Ya; Kitajima, Masaaki; Whittle, Andrew J.; Liu, Wen-Tso

2017-01-01

The occurrence of pathogenic bacteria in drinking water distribution systems (DWDSs) is a major health concern, and our current understanding is mostly related to pathogenic species such as Legionella pneumophila and Mycobacterium avium but not to bacterial species closely related to them. In this study, genomic-based approaches were used to characterize pathogen-related species in relation to their abundance, diversity, potential pathogenicity, genetic exchange, and distribution across an urban drinking water system. Nine draft genomes recovered from 10 metagenomes were identified as Legionella (4 draft genomes), Mycobacterium (3 draft genomes), Parachlamydia (1 draft genome), and Leptospira (1 draft genome). The pathogenicity potential of these genomes was examined by the presence/absence of virulence machinery, including genes belonging to Type III, IV, and VII secretion systems and their effectors. Several virulence factors known to pathogenic species were detected with these retrieved draft genomes except the Leptospira-related genome. Identical clustered regularly interspaced short palindromic repeats-CRISPR-associated proteins (CRISPR-Cas) genetic signatures were observed in two draft genomes recovered at different stages of the studied system, suggesting that the spacers in CRISPR-Cas could potentially be used as a biomarker in the monitoring of Legionella related strains at an evolutionary scale of several years across different drinking water production and distribution systems. Overall, metagenomics approach was an effective and complementary tool of culturing techniques to gain insights into the pathogenic characteristics and the CRISPR-Cas signatures of pathogen-related species in DWDSs. PMID:29097994
DNA damage in cells exhibiting radiation-induced genomic instability

DOE PAGES

Keszenman, Deborah J.; Kolodiuk, Lucia; Baulch, Janet E.

2015-02-22

Cells exhibiting radiation induced genomic instability exhibit varied spectra of genetic and chromosomal aberrations. Even so, oxidative stress remains a common theme in the initiation and/or perpetuation of this phenomenon. Isolated oxidatively modified bases, abasic sites, DNA single strand breaks and clustered DNA damage are induced in normal mammalian cultured cells and tissues due to endogenous reactive oxygen species generated during normal cellular metabolism in an aerobic environment. While sparse DNA damage may be easily repaired, clustered DNA damage may lead to persistent cytotoxic or mutagenic events that can lead to genomic instability. In this study, we tested the hypothesismore » that DNA damage signatures characterised by altered levels of endogenous, potentially mutagenic, types of DNA damage and chromosomal breakage are related to radiation-induced genomic instability and persistent oxidative stress phenotypes observed in the chromosomally unstable progeny of irradiated cells. The measurement of oxypurine, oxypyrimidine and abasic site endogenous DNA damage showed differences in non-double-strand breaks (DSB) clusters among the three of the four unstable clones evaluated as compared to genomically stable clones and the parental cell line. These three unstable clones also had increased levels of DSB clusters. The results of this study demonstrate that each unstable cell line has a unique spectrum of persistent damage and lead us to speculate that alterations in DNA damage signaling and repair may be related to the perpetuation of genomic instability.« less

Assessing signatures of selection through variation in linkage disequilibrium between taurine and indicine cattle

PubMed Central

2014-01-01

Background Signatures of selection are regions in the genome that have been preferentially increased in frequency and fixed in a population because of their functional importance in specific processes. These regions can be detected because of their lower genetic variability and specific regional linkage disequilibrium (LD) patterns. Methods By comparing the differences in regional LD variation between dairy and beef cattle types, and between indicine and taurine subspecies, we aim at finding signatures of selection for production and adaptation in cattle breeds. The VarLD method was applied to compare the LD variation in the autosomal genome between breeds, including Angus and Brown Swiss, representing taurine breeds, and Nelore and Gir, representing indicine breeds. Genomic regions containing the top 0.01 and 0.1 percentile of signals were characterized using the UMD3.1 Bos taurus genome assembly to identify genes in those regions and compared with previously reported selection signatures and regions with copy number variation. Results For all comparisons, the top 0.01 and 0.1 percentile included 26 and 165 signals and 17 and 125 genes, respectively, including TECRL, BT.23182 or FPPS, CAST, MYOM1, UVRAG and DNAJA1. Conclusions The VarLD method is a powerful tool to identify differences in linkage disequilibrium between cattle populations and putative signatures of selection with potential adaptive and productive importance. PMID:24592996
Mutational signatures reveal the role of RAD52 in p53-independent p21-driven genomic instability.

PubMed

Galanos, Panagiotis; Pappas, George; Polyzos, Alexander; Kotsinas, Athanassios; Svolaki, Ioanna; Giakoumakis, Nickolaos N; Glytsou, Christina; Pateras, Ioannis S; Swain, Umakanta; Souliotis, Vassilis L; Georgakilas, Alexandros G; Geacintov, Nicholas; Scorrano, Luca; Lukas, Claudia; Lukas, Jiri; Livneh, Zvi; Lygerou, Zoi; Chowdhury, Dipanjan; Sørensen, Claus Storgaard; Bartek, Jiri; Gorgoulis, Vassilis G

2018-03-16

Genomic instability promotes evolution and heterogeneity of tumors. Unraveling its mechanistic basis is essential for the design of appropriate therapeutic strategies. In a previous study, we reported an unexpected oncogenic property of p21 WAF1/Cip1 , showing that its chronic expression in a p53-deficient environment causes genomic instability by deregulation of the replication licensing machinery. We now demonstrate that p21 WAF1/Cip1 can further fuel genomic instability by suppressing the repair capacity of low- and high-fidelity pathways that deal with nucleotide abnormalities. Consequently, fewer single nucleotide substitutions (SNSs) occur, while formation of highly deleterious DNA double-strand breaks (DSBs) is enhanced, crafting a characteristic mutational signature landscape. Guided by the mutational signatures formed, we find that the DSBs are repaired by Rad52-dependent break-induced replication (BIR) and single-strand annealing (SSA) repair pathways. Conversely, the error-free synthesis-dependent strand annealing (SDSA) repair route is deficient. Surprisingly, Rad52 is activated transcriptionally in an E2F1-dependent manner, rather than post-translationally as is common for DNA repair factor activation. Our results signify the importance of mutational signatures as guides to disclose the repair history leading to genomic instability. We unveil how chronic p21 WAF1/Cip1 expression rewires the repair process and identifies Rad52 as a source of genomic instability and a candidate therapeutic target.
Mutalisk: a web-based somatic MUTation AnaLyIS toolKit for genomic, transcriptional and epigenomic signatures.

PubMed

Lee, Jongkeun; Lee, Andy Jinseok; Lee, June-Koo; Park, Jongkeun; Kwon, Youngoh; Park, Seongyeol; Chun, Hyonho; Ju, Young Seok; Hong, Dongwan

2018-05-22

Somatic genome mutations occur due to combinations of various intrinsic/extrinsic mutational processes and DNA repair mechanisms. Different molecular processes frequently generate different signatures of somatic mutations in their own favored contexts. As a result, the regional somatic mutation rate is dependent on the local DNA sequence, the DNA replication/RNA transcription dynamics and epigenomic chromatin organization landscape in the genome. Here, we propose an online computational framework, termed Mutalisk, which correlates somatic mutations with various genomic, transcriptional and epigenomic features in order to understand mutational processes that contribute to the generation of the mutations. This user-friendly tool explores the presence of localized hypermutations (kataegis), dissects the spectrum of mutations into the maximum likelihood combination of known mutational signatures and associates the mutation density with numerous regulatory elements in the genome. As a result, global patterns of somatic mutations in any query sample can be efficiently screened, thus enabling a deeper understanding of various mutagenic factors. This tool will facilitate more effective downstream analyses of cancer genome sequences to elucidate the diversity of mutational processes underlying the development and clonal evolution of cancer cells. Mutalisk is freely available at http://mutalisk.org.
Molecular Pathways: Extracting Medical Knowledge from High Throughput Genomic Data

PubMed Central

Goldstein, Theodore; Paull, Evan O.; Ellis, Matthew J.; Stuart, Joshua M.

2013-01-01

High-throughput genomic data that measures RNA expression, DNA copy number, mutation status and protein levels provide us with insights into the molecular pathway structure of cancer. Genomic lesions (amplifications, deletions, mutations) and epigenetic modifications disrupt biochemical cellular pathways. While the number of possible lesions is vast, different genomic alterations may result in concordant expression and pathway activities, producing common tumor subtypes that share similar phenotypic outcomes. How can these data be translated into medical knowledge that provides prognostic and predictive information? First generation mRNA expression signatures such as Genomic Health's Oncotype DX already provide prognostic information, but do not provide therapeutic guidance beyond the current standard of care – which is often inadequate in high-risk patients. Rather than building molecular signatures based on gene expression levels, evidence is growing that signatures based on higher-level quantities such as from genetic pathways may provide important prognostic and diagnostic cues. We provide examples of how activities for molecular entities can be predicted from pathway analysis and how the composite of all such activities, referred to here as the “activitome,” help connect genomic events to clinical factors in order to predict the drivers of poor outcome. PMID:23430023
Penicillin-Binding Protein Transpeptidase Signatures for Tracking and Predicting β-Lactam Resistance Levels in Streptococcus pneumoniae

PubMed Central

Metcalf, Benjamin J.; Chochua, Sopio; Li, Zhongya; Gertz, Robert E.; Walker, Hollis; Hawkins, Paulina A.; Tran, Theresa; Whitney, Cynthia G.; McGee, Lesley; Beall, Bernard W.

2016-01-01

ABSTRACT β-Lactam antibiotics are the drugs of choice to treat pneumococcal infections. The spread of β-lactam-resistant pneumococci is a major concern in choosing an effective therapy for patients. Systematically tracking β-lactam resistance could benefit disease surveillance. Here we developed a classification system in which a pneumococcal isolate is assigned to a “PBP type” based on sequence signatures in the transpeptidase domains (TPDs) of the three critical penicillin-binding proteins (PBPs), PBP1a, PBP2b, and PBP2x. We identified 307 unique PBP types from 2,528 invasive pneumococcal isolates, which had known MICs to six β-lactams based on broth microdilution. We found that increased β-lactam MICs strongly correlated with PBP types containing divergent TPD sequences. The PBP type explained 94 to 99% of variation in MICs both before and after accounting for genomic backgrounds defined by multilocus sequence typing, indicating that genomic backgrounds made little independent contribution to β-lactam MICs at the population level. We further developed and evaluated predictive models of MICs based on PBP type. Compared to microdilution MICs, MICs predicted by PBP type showed essential agreement (MICs agree within 1 dilution) of >98%, category agreement (interpretive results agree) of >94%, a major discrepancy (sensitive isolate predicted as resistant) rate of <3%, and a very major discrepancy (resistant isolate predicted as sensitive) rate of <2% for all six β-lactams. Thus, the PBP transpeptidase signatures are robust indicators of MICs to different β-lactam antibiotics in clinical pneumococcal isolates and serve as an accurate alternative to phenotypic susceptibility testing. PMID:27302760
Signatures of Extended Storage of Used Nuclear Fuel in Casks

DOE Office of Scientific and Technical Information (OSTI.GOV)

Rauch, Eric Benton

2016-09-28

As the amount of used nuclear fuel continues to grow, more and more used nuclear fuel will be transferred to storage casks. A consolidated storage facility is currently in the planning stages for storing these casks, where at least 10,000 MTHM of fuel will be stored. This site will have potentially thousands of casks once it is operational. A facility this large presents new safeguards and nuclear material accounting concerns. A new signature based on the distribution of neutron sources and multiplication within casks was part of the Department of Energy Office of Nuclear Energy’s Material Protection, Account and Controlmore » Technologies (MPACT) campaign. Under this project we looked at fingerprinting each cask's neutron signature. Each cask has a unique set of fuel, with a unique spread of initial enrichment, burnup, cooling time, and power history. The unique set of fuel creates a unique signature of neutron intensity based on the arrangement of the assemblies. The unique arrangement of neutron sources and multiplication produces a reliable and unique identification of the cask that has been shown to be relatively constant over long time periods. The work presented here could be used to restore from a loss of continuity of knowledge at the storage site. This presentation will show the steps used to simulate and form this signature from the start of the effort through its conclusion in September 2016.« less
Signatures of selection in the three-spined stickleback along a small-scale brackish water - freshwater transition zone.

PubMed

Konijnendijk, Nellie; Shikano, Takahito; Daneels, Dorien; Volckaert, Filip A M; Raeymaekers, Joost A M

2015-09-01

Local adaptation is often obvious when gene flow is impeded, such as observed at large spatial scales and across strong ecological contrasts. However, it becomes less certain at small scales such as between adjacent populations or across weak ecological contrasts, when gene flow is strong. While studies on genomic adaptation tend to focus on the former, less is known about the genomic targets of natural selection in the latter situation. In this study, we investigate genomic adaptation in populations of the three-spined stickleback Gasterosteus aculeatus L. across a small-scale ecological transition with salinities ranging from brackish to fresh. Adaptation to salinity has been repeatedly demonstrated in this species. A genome scan based on 87 microsatellite markers revealed only few signatures of selection, likely owing to the constraints that homogenizing gene flow puts on adaptive divergence. However, the detected loci appear repeatedly as targets of selection in similar studies of genomic adaptation in the three-spined stickleback. We conclude that the signature of genomic selection in the face of strong gene flow is weak, yet detectable. We argue that the range of studies of genomic divergence should be extended to include more systems characterized by limited geographical and ecological isolation, which is often a realistic setting in nature.
Molecular signature of pancreatic adenocarcinoma: an insight from genotype to phenotype and challenges for targeted therapy

PubMed Central

Sahin, Ibrahim H; Iacobuzio-Donahue, Christine A; O’Reilly, Eileen M

2016-01-01

Introduction Pancreatic adenocarcinoma remains one of the most clinically challenging cancers despite an in-depth characterization of the molecular underpinnings and biology of this disease. Recent whole-genome-wide studies have elucidated the diverse and complex genetic alterations which generate a unique oncogenic signature for an individual pancreatic cancer patient and which may explain diverse disease behavior in a clinical setting. Areas covered In this review article, we discuss the key oncogenic pathways of pancreatic cancer including RAS-MAPK, PI3KCA and TGF-β signaling, as well as the impact of these pathways on the disease behavior and their potential targetability. The role of tumor suppressors particularly BRCA1 and BRCA2 genes and their role in pancreatic cancer treatment are elaborated upon. We further review recent genomic studies and their impact on future pancreatic cancer treatment. Expert opinion Targeted therapies inhibiting pro-survival pathways have limited impact on pancreatic cancer outcomes. Activation of pro-apoptotic pathways along with suppression of cancer-stem-related pathways may reverse treatment resistance in pancreatic cancer. While targeted therapy or a ‘precision medicine’ approach in pancreatic adenocarcinoma remains an elusive challenge for the majority of patients, there is a real sense of optimism that the strides made in understanding the molecular underpinnings of this disease will translate into improved outcomes. PMID:26439702
21 CFR 11.100 - General requirements.

Code of Federal Regulations, 2010 CFR

2010-04-01

... RECORDS; ELECTRONIC SIGNATURES Electronic Signatures § 11.100 General requirements. (a) Each electronic signature shall be unique to one individual and shall not be reused by, or reassigned to, anyone else. (b... signature, or any element of such electronic signature, the organization shall verify the identity of the...
Alignment-free genome tree inference by learning group-specific distance metrics.

PubMed

Patil, Kaustubh R; McHardy, Alice C

2013-01-01

Understanding the evolutionary relationships between organisms is vital for their in-depth study. Gene-based methods are often used to infer such relationships, which are not without drawbacks. One can now attempt to use genome-scale information, because of the ever increasing number of genomes available. This opportunity also presents a challenge in terms of computational efficiency. Two fundamentally different methods are often employed for sequence comparisons, namely alignment-based and alignment-free methods. Alignment-free methods rely on the genome signature concept and provide a computationally efficient way that is also applicable to nonhomologous sequences. The genome signature contains evolutionary signal as it is more similar for closely related organisms than for distantly related ones. We used genome-scale sequence information to infer taxonomic distances between organisms without additional information such as gene annotations. We propose a method to improve genome tree inference by learning specific distance metrics over the genome signature for groups of organisms with similar phylogenetic, genomic, or ecological properties. Specifically, our method learns a Mahalanobis metric for a set of genomes and a reference taxonomy to guide the learning process. By applying this method to more than a thousand prokaryotic genomes, we showed that, indeed, better distance metrics could be learned for most of the 18 groups of organisms tested here. Once a group-specific metric is available, it can be used to estimate the taxonomic distances for other sequenced organisms from the group. This study also presents a large scale comparison between 10 methods--9 alignment-free and 1 alignment-based.
Resolving prokaryotic taxonomy without rRNA: longer oligonucleotide word lengths improve genome and metagenome taxonomic classification.

PubMed

Alsop, Eric B; Raymond, Jason

2013-01-01

Oligonucleotide signatures, especially tetranucleotide signatures, have been used as method for homology binning by exploiting an organism's inherent biases towards the use of specific oligonucleotide words. Tetranucleotide signatures have been especially useful in environmental metagenomics samples as many of these samples contain organisms from poorly classified phyla which cannot be easily identified using traditional homology methods, including NCBI BLAST. This study examines oligonucleotide signatures across 1,424 completed genomes from across the tree of life, substantially expanding upon previous work. A comprehensive analysis of mononucleotide through nonanucleotide word lengths suggests that longer word lengths substantially improve the classification of DNA fragments across a range of sizes of relevance to high throughput sequencing. We find that, at present, heptanucleotide signatures represent an optimal balance between prediction accuracy and computational time for resolving taxonomy using both genomic and metagenomic fragments. We directly compare the ability of tetranucleotide and heptanucleotide world lengths (tetranucleotide signatures are the current standard for oligonucleotide word usage analyses) for taxonomic binning of metagenome reads. We present evidence that heptanucleotide word lengths consistently provide more taxonomic resolving power, particularly in distinguishing between closely related organisms that are often present in metagenomic samples. This implies that longer oligonucleotide word lengths should replace tetranucleotide signatures for most analyses. Finally, we show that the application of longer word lengths to metagenomic datasets leads to more accurate taxonomic binning of DNA scaffolds and have the potential to substantially improve taxonomic assignment and assembly of metagenomic data.
Understanding mutagenesis through delineation of mutational signatures in human cancer

DOE PAGES

Petljak, Mia; Alexandrov, Ludmil B.

2016-05-04

Each individual cell within a human body acquires a certain number of somatic mutations during a course of its lifetime. These mutations originate from a wide spectra of both endogenous and exogenous mutational processes that leave distinct patterns of mutations, termed mutational signatures, embedded within the genomes of all cells. In recent years, the vast amount of data produced by sequencing of cancer genomes was coupled with novel mathematical models and computational tools to generate the first comprehensive map of mutational signatures in human cancer. Up to date, >30 distinct mutational signatures have been identified, and etiologies have been proposedmore » for many of them. This paper provides a brief historical background on examination of mutational patterns in human cancer, summarizes the knowledge accumulated since introducing the concept of mutational signatures and discusses their future potential applications and perspectives within the field.« less
Genomic signatures of parasite-driven natural selection in north European Atlantic salmon (Salmo salar).

PubMed

Zueva, Ksenia J; Lumme, Jaakko; Veselov, Alexey E; Kent, Matthew P; Primmer, Craig R

2018-06-01

Understanding the genomic basis of host-parasite adaptation is important for predicting the long-term viability of species and developing successful management practices. However, in wild populations, identifying specific signatures of parasite-driven selection often presents a challenge, as it is difficult to unravel the molecular signatures of selection driven by different, but correlated, environmental factors. Furthermore, separating parasite-mediated selection from similar signatures due to genetic drift and population history can also be difficult. Populations of Atlantic salmon (Salmo salar L.) from northern Europe have pronounced differences in their reactions to the parasitic flatworm Gyrodactylus salaris Malmberg 1957 and are therefore a good model to search for specific genomic regions underlying inter-population differences in pathogen response. We used a dense Atlantic salmon SNP array, along with extensive sampling of 43 salmon populations representing the two G. salaris response extremes (extreme susceptibility vs resistant), to screen the salmon genome for signatures of directional selection while attempting to separate the parasite effect from other factors. After combining the results from two independent genome scan analyses, 57 candidate genes potentially under positive selection were identified, out of which 50 were functionally annotated. This candidate gene set was shown to be functionally enriched for lymph node development, focal adhesion genes and anti-viral response, which suggests that the regulation of both innate and acquired immunity might be an important mechanism for salmon response to G. salaris. Overall, our results offer insights into the apparently complex genetic basis of pathogen susceptibility in salmon and highlight methodological challenges for separating the effects of various environmental factors. Copyright © 2018 Elsevier B.V. All rights reserved.
Trespassing cancer cells: 'fingerprinting' invasive protrusions reveals metastatic culprits.

PubMed

Klemke, Richard L

2012-10-01

Metastatic cancer cells produce invasive membrane protrusions called invadopodia and pseudopodia, which play a central role in driving cancer cell dissemination in the body. Malignant cells use these structures to attach to and degrade extracellular matrix proteins, generate force for cell locomotion, and to penetrate the vasculature. Recent work using unique subcellular fractionation methodologies combined with spatial genomic, proteomic, and phosphoproteomic profiling has provided insight into the invadopodiome and pseudopodiome signaling networks that control the protrusion of invasive membranes. Here I highlight how these powerful spatial 'omics' approaches reveal important signatures of metastatic cancer cells and possible new therapeutic targets aimed at treating metastatic disease. Copyright © 2012 Elsevier Ltd. All rights reserved.
Comprehensive evaluation of gene expression signatures in response to electroacupuncture stimulation at Zusanli (ST36) acupoint by transcriptomic analysis.

PubMed

Wu, Jing-Shan; Lo, Hsin-Yi; Li, Chia-Cheng; Chen, Feng-Yuan; Hsiang, Chien-Yun; Ho, Tin-Yun

2017-08-15

Electroacupuncture (EA) has been applied to treat and prevent diseases for years. However, molecular events happened in both the acupunctured site and the internal organs after EA stimulation have not been clarified. Here we applied transcriptomic analysis to explore the gene expression signatures after EA stimulation. Mice were applied EA stimulation at ST36 for 15 min and nine tissues were collected three hours later for microarray analysis. We found that EA affected the expression of genes not only in the acupunctured site but also in the internal organs. EA commonly affected biological networks involved in cytoskeleton and cell adhesion, and also regulated unique process networks in specific organs, such as γ-aminobutyric acid-ergic neurotransmission in brain and inflammation process in lung. In addition, EA affected the expression of genes related to various diseases, such as neurodegenerative diseases in brain and obstructive pulmonary diseases in lung. This report applied, for the first time, a global comprehensive genome-wide approach to analyze the gene expression profiling of acupunctured site and internal organs after EA stimulation. The connection between gene expression signatures, biological processes, and diseases might provide a basis for prediction and explanation on the therapeutic potentials of acupuncture in organs.
Signatures of host specialization and a recent transposable element burst in the dynamic one-speed genome of the fungal barley powdery mildew pathogen.

PubMed

Frantzeskakis, Lamprinos; Kracher, Barbara; Kusch, Stefan; Yoshikawa-Maekawa, Makoto; Bauer, Saskia; Pedersen, Carsten; Spanu, Pietro D; Maekawa, Takaki; Schulze-Lefert, Paul; Panstruga, Ralph

2018-05-22

Powdery mildews are biotrophic pathogenic fungi infecting a number of economically important plants. The grass powdery mildew, Blumeria graminis, has become a model organism to study host specialization of obligate biotrophic fungal pathogens. We resolved the large-scale genomic architecture of B. graminis forma specialis hordei (Bgh) to explore the potential influence of its genome organization on the co-evolutionary process with its host plant, barley (Hordeum vulgare). The near-chromosome level assemblies of the Bgh reference isolate DH14 and one of the most diversified isolates, RACE1, enabled a comparative analysis of these haploid genomes, which are highly enriched with transposable elements (TEs). We found largely retained genome synteny and gene repertoires, yet detected copy number variation (CNV) of secretion signal peptide-containing protein-coding genes (SPs) and locally disrupted synteny blocks. Genes coding for sequence-related SPs are often locally clustered, but neither the SPs nor the TEs reside preferentially in genomic regions with unique features. Extended comparative analysis with different host-specific B. graminis formae speciales revealed the existence of a core suite of SPs, but also isolate-specific SP sets as well as congruence of SP CNV and phylogenetic relationship. We further detected evidence for a recent, lineage-specific expansion of TEs in the Bgh genome. The characteristics of the Bgh genome (largely retained synteny, CNV of SP genes, recently proliferated TEs and a lack of significant compartmentalization) are consistent with a "one-speed" genome that differs in its architecture and (co-)evolutionary pattern from the "two-speed" genomes reported for several other filamentous phytopathogens.
Adult high-grade B-cell lymphoma with Burkitt lymphoma signature: genomic features and potential therapeutic targets.

PubMed

Bouska, Alyssa; Bi, Chengfeng; Lone, Waseem; Zhang, Weiwei; Kedwaii, Ambreen; Heavican, Tayla; Lachel, Cynthia M; Yu, Jiayu; Ferro, Roberto; Eldorghamy, Nanees; Greiner, Timothy C; Vose, Julie; Weisenburger, Dennis D; Gascoyne, Randy D; Rosenwald, Andreas; Ott, German; Campo, Elias; Rimsza, Lisa M; Jaffe, Elaine S; Braziel, Rita M; Siebert, Reiner; Miles, Rodney R; Dave, Sandeep; Reddy, Anupama; Delabie, Jan; Staudt, Louis M; Song, Joo Y; McKeithan, Timothy W; Fu, Kai; Green, Michael; Chan, Wing C; Iqbal, Javeed

2017-10-19

The adult high-grade B-cell lymphomas sharing molecular features with Burkitt lymphoma (BL) are highly aggressive lymphomas with poor clinical outcome. High-resolution structural and functional genomic analysis of adult Burkitt lymphoma (BL) and high-grade B-cell lymphoma with BL gene signature (adult-molecularly defined BL [mBL]) revealed the MYC-ARF-p53 axis as the primary deregulated pathway. Adult-mBL had either unique or more frequent genomic aberrations (del13q14, del17p, gain8q24, and gain18q21) compared with pediatric-mBL, but shared commonly mutated genes. Mutations in genes promoting the tonic B-cell receptor (BCR)→PI3K pathway ( TCF3 and ID3 ) did not differ by age, whereas effectors of chronic BCR→NF-κB signaling were associated with adult-mBL. A subset of adult-mBL had BCL2 translocation and mutation and elevated BCL2 mRNA and protein expression, but had a mutation profile similar to mBL. These double-hit lymphomas may have arisen from a tumor precursor that acquired both BCL2 and MYC translocations and/or KMT2D ( MLL2 ) mutation. Gain/amplification of MIR17HG and its paralogue loci was observed in 50% of adult-mBL. In vitro studies suggested miR-17∼92 's role in constitutive activation of BCR signaling and sensitivity to ibrutinib. Overall integrative analysis identified an interrelated gene network affected by copy number and mutation, leading to disruption of the p53 pathway and the BCR→PI3K or NF-κB activation, which can be further exploited in vivo by small-molecule inhibitors for effective therapy in adult-mBL.
Visualization of genome signatures of eukaryote genomes by batch-learning self-organizing map with a special emphasis on Drosophila genomes.

PubMed

Abe, Takashi; Hamano, Yuta; Ikemura, Toshimichi

2014-01-01

A strategy of evolutionary studies that can compare vast numbers of genome sequences is becoming increasingly important with the remarkable progress of high-throughput DNA sequencing methods. We previously established a sequence alignment-free clustering method "BLSOM" for di-, tri-, and tetranucleotide compositions in genome sequences, which can characterize sequence characteristics (genome signatures) of a wide range of species. In the present study, we generated BLSOMs for tetra- and pentanucleotide compositions in approximately one million sequence fragments derived from 101 eukaryotes, for which almost complete genome sequences were available. BLSOM recognized phylotype-specific characteristics (e.g., key combinations of oligonucleotide frequencies) in the genome sequences, permitting phylotype-specific clustering of the sequences without any information regarding the species. In our detailed examination of 12 Drosophila species, the correlation between their phylogenetic classification and the classification on the BLSOMs was observed to visualize oligonucleotides diagnostic for species-specific clustering.
First draft genome of an iconic clownfish species (Amphiprion frenatus).

PubMed

Marcionetti, Anna; Rossier, Victor; Bertrand, Joris A M; Litsios, Glenn; Salamin, Nicolas

2018-02-17

Clownfishes (or anemonefishes) form an iconic group of coral reef fishes, principally known for their mutualistic interaction with sea anemones. They are characterized by particular life history traits, such as a complex social structure and mating system involving sequential hermaphroditism, coupled with an exceptionally long lifespan. Additionally, clownfishes are considered to be one of the rare groups to have experienced an adaptive radiation in the marine environment. Here, we assembled and annotated the first genome of a clownfish species, the tomato clownfish (Amphiprion frenatus). We obtained 17,801 assembled scaffolds, containing a total of 26,917 genes. The completeness of the assembly and annotation was satisfying, with 96.5% of the Actinopterygii Benchmarking Universal Single-Copy Orthologs (BUSCOs) being retrieved in A. frenatus assembly. The quality of the resulting assembly is comparable to other bony fish assemblies. This resource is valuable for advancing studies of the particular life history traits of clownfishes, as well as being useful for population genetic studies and the development of new phylogenetic markers. It will also open the way to comparative genomics. Indeed, future genomic comparison among closely related fishes may provide means to identify genes related to the unique adaptations to different sea anemone hosts, as well as better characterize the genomic signatures of an adaptive radiation. © 2018 The Authors. Molecular Ecology Resources Published by John Wiley & Sons Ltd.
Bipyrimidine Signatures as a Photoprotective Genome Strategy in G + C-rich Halophilic Archaea.

PubMed

Jones, Daniel L; Baxter, Bonnie K

2016-09-02

Halophilic archaea experience high levels of ultraviolet (UV) light in their environments and demonstrate resistance to UV irradiation. DNA repair systems and carotenoids provide UV protection but do not account for the high resistance observed. Herein, we consider genomic signatures as an additional photoprotective strategy. The predominant forms of UV-induced DNA damage are cyclobutane pyrimidine dimers, most notoriously thymine dimers (T^Ts), which form at adjacent Ts. We tested whether the high G + C content seen in halophilic archaea serves a photoprotective function through limiting T nucleotides, and thus T^T lesions. However, this speculation overlooks the other bipyrimidine sequences, all of which capable of forming photolesions to varying degrees. Therefore, we designed a program to determine the frequencies of the four bipyrimidine pairs (5' to 3': TT, TC, CT, and CC) within genomes of halophilic archaea and four other randomized sample groups for comparison. The outputs for each sampled genome were weighted by the intrinsic photoreactivities of each dinucleotide pair. Statistical methods were employed to investigate intergroup differences. Our findings indicate that the UV-resistance seen in halophilic archaea can be attributed in part to a genomic strategy: high G + C content and the resulting bipyrimidine signature reduces the genomic photoreactivity.

Bipyrimidine Signatures as a Photoprotective Genome Strategy in G + C-rich Halophilic Archaea

PubMed Central

Jones, Daniel L.; Baxter, Bonnie K.

2016-01-01

Halophilic archaea experience high levels of ultraviolet (UV) light in their environments and demonstrate resistance to UV irradiation. DNA repair systems and carotenoids provide UV protection but do not account for the high resistance observed. Herein, we consider genomic signatures as an additional photoprotective strategy. The predominant forms of UV-induced DNA damage are cyclobutane pyrimidine dimers, most notoriously thymine dimers (T^Ts), which form at adjacent Ts. We tested whether the high G + C content seen in halophilic archaea serves a photoprotective function through limiting T nucleotides, and thus T^T lesions. However, this speculation overlooks the other bipyrimidine sequences, all of which capable of forming photolesions to varying degrees. Therefore, we designed a program to determine the frequencies of the four bipyrimidine pairs (5’ to 3’: TT, TC, CT, and CC) within genomes of halophilic archaea and four other randomized sample groups for comparison. The outputs for each sampled genome were weighted by the intrinsic photoreactivities of each dinucleotide pair. Statistical methods were employed to investigate intergroup differences. Our findings indicate that the UV-resistance seen in halophilic archaea can be attributed in part to a genomic strategy: high G + C content and the resulting bipyrimidine signature reduces the genomic photoreactivity. PMID:27598206
Construction of Signature-tagged Mutant Library in Mesorhizobium loti as a Powerful Tool for Functional Genomics

PubMed Central

Shimoda, Yoshikazu; Mitsui, Hisayuki; Kamimatsuse, Hiroko; Minamisawa, Kiwamu; Nishiyama, Eri; Ohtsubo, Yoshiyuki; Nagata, Yuji; Tsuda, Masataka; Shinpo, Sayaka; Watanabe, Akiko; Kohara, Mitsuyo; Yamada, Manabu; Nakamura, Yasukazu; Tabata, Satoshi; Sato, Shusei

2008-01-01

Rhizobia are nitrogen-fixing soil bacteria that establish endosymbiosis with some leguminous plants. The completion of several rhizobial genome sequences provides opportunities for genome-wide functional studies of the physiological roles of many rhizobial genes. In order to carry out genome-wide phenotypic screenings, we have constructed a large mutant library of the nitrogen-fixing symbiotic bacterium, Mesorhizobium loti, by transposon mutagenesis. Transposon insertion mutants were generated using the signature-tagged mutagenesis (STM) technique and a total of 29 330 independent mutants were obtained. Along with the collection of transposon mutants, we have determined the transposon insertion sites for 7892 clones, and confirmed insertions in 3680 non-redundant M. loti genes (50.5% of the total number of M. loti genes). Transposon insertions were randomly distributed throughout the M. loti genome without any bias toward G+C contents of insertion target sites and transposon plasmids used for the mutagenesis. We also show the utility of STM mutants by examining the specificity of signature tags and test screenings for growth- and nodulation-deficient mutants. This defined mutant library allows for genome-wide forward- and reverse-genetic functional studies of M. loti and will serve as an invaluable resource for researchers to further our understanding of rhizobial biology. PMID:18658183
High-resolution characterization of sequence signatures due to non-random cleavage of cell-free DNA.

PubMed

Chandrananda, Dineika; Thorne, Natalie P; Bahlo, Melanie

2015-06-17

High-throughput sequencing of cell-free DNA fragments found in human plasma has been used to non-invasively detect fetal aneuploidy, monitor organ transplants and investigate tumor DNA. However, many biological properties of this extracellular genetic material remain unknown. Research that further characterizes circulating DNA could substantially increase its diagnostic value by allowing the application of more sophisticated bioinformatics tools that lead to an improved signal to noise ratio in the sequencing data. In this study, we investigate various features of cell-free DNA in plasma using deep-sequencing data from two pregnant women (>70X, >50X) and compare them with matched cellular DNA. We utilize a descriptive approach to examine how the biological cleavage of cell-free DNA affects different sequence signatures such as fragment lengths, sequence motifs at fragment ends and the distribution of cleavage sites along the genome. We show that the size distributions of these cell-free DNA molecules are dependent on their autosomal and mitochondrial origin as well as the genomic location within chromosomes. DNA mapping to particular microsatellites and alpha repeat elements display unique size signatures. We show how cell-free fragments occur in clusters along the genome, localizing to nucleosomal arrays and are preferentially cleaved at linker regions by correlating the mapping locations of these fragments with ENCODE annotation of chromatin organization. Our work further demonstrates that cell-free autosomal DNA cleavage is sequence dependent. The region spanning up to 10 positions on either side of the DNA cleavage site show a consistent pattern of preference for specific nucleotides. This sequence motif is present in cleavage sites localized to nucleosomal cores and linker regions but is absent in nucleosome-free mitochondrial DNA. These background signals in cell-free DNA sequencing data stem from the non-random biological cleavage of these fragments. This sequence structure can be harnessed to improve bioinformatics algorithms, in particular for CNV and structural variant detection. Descriptive measures for cell-free DNA features developed here could also be used in biomarker analysis to monitor the changes that occur during different pathological conditions.
Short interspersed transposable elements (SINEs) are excluded from imprinted regions in the human genome.

PubMed

Greally, John M

2002-01-08

To test whether regions undergoing genomic imprinting have unique genomic characteristics, imprinted and nonimprinted human loci were compared for nucleotide and retroelement composition. Maternally and paternally expressed subgroups of imprinted genes were found to differ in terms of guanine and cytosine, CpG, and retroelement content, indicating a segregation into distinct genomic compartments. Imprinted regions have been normally permissive to L1 long interspersed transposable element retroposition during mammalian evolution but universally and significantly lack short interspersed transposable elements (SINEs). The primate-specific Alu SINEs, as well as the more ancient mammalian-wide interspersed repeat SINEs, are found at significantly low densities in imprinted regions. The latter paleogenomic signature indicates that the sequence characteristics of currently imprinted regions existed before the mammalian radiation. Transitions from imprinted to nonimprinted genomic regions in cis are characterized by a sharp inflection in SINE content, demonstrating that this genomic characteristic can help predict the presence and extent of regions undergoing imprinting. During primate evolution, SINE accumulation in imprinted regions occurred at a decreased rate compared with control loci. The constraint on SINE accumulation in imprinted regions may be mediated by an active selection process. This selection could be because of SINEs attracting and spreading methylation, as has been found at other loci. Methylation-induced silencing could lead to deleterious consequences at imprinted loci, where inactivation of one allele is already established, and expression is often essential for embryonic growth and survival.
Short interspersed transposable elements (SINEs) are excluded from imprinted regions in the human genome

PubMed Central

Greally, John M.

2002-01-01

To test whether regions undergoing genomic imprinting have unique genomic characteristics, imprinted and nonimprinted human loci were compared for nucleotide and retroelement composition. Maternally and paternally expressed subgroups of imprinted genes were found to differ in terms of guanine and cytosine, CpG, and retroelement content, indicating a segregation into distinct genomic compartments. Imprinted regions have been normally permissive to L1 long interspersed transposable element retroposition during mammalian evolution but universally and significantly lack short interspersed transposable elements (SINEs). The primate-specific Alu SINEs, as well as the more ancient mammalian-wide interspersed repeat SINEs, are found at significantly low densities in imprinted regions. The latter paleogenomic signature indicates that the sequence characteristics of currently imprinted regions existed before the mammalian radiation. Transitions from imprinted to nonimprinted genomic regions in cis are characterized by a sharp inflection in SINE content, demonstrating that this genomic characteristic can help predict the presence and extent of regions undergoing imprinting. During primate evolution, SINE accumulation in imprinted regions occurred at a decreased rate compared with control loci. The constraint on SINE accumulation in imprinted regions may be mediated by an active selection process. This selection could be because of SINEs attracting and spreading methylation, as has been found at other loci. Methylation-induced silencing could lead to deleterious consequences at imprinted loci, where inactivation of one allele is already established, and expression is often essential for embryonic growth and survival. PMID:11756672
Mutational patterns in chemotherapy resistant muscle-invasive bladder cancer.

PubMed

Liu, David; Abbosh, Philip; Keliher, Daniel; Reardon, Brendan; Miao, Diana; Mouw, Kent; Weiner-Taylor, Amaro; Wankowicz, Stephanie; Han, Garam; Teo, Min Yuen; Cipolla, Catharine; Kim, Jaegil; Iyer, Gopa; Al-Ahmadie, Hikmat; Dulaimi, Essel; Chen, David Y T; Alpaugh, R Katherine; Hoffman-Censits, Jean; Garraway, Levi A; Getz, Gad; Carter, Scott L; Bellmunt, Joaquim; Plimack, Elizabeth R; Rosenberg, Jonathan E; Van Allen, Eliezer M

2017-12-19

Despite continued widespread use, the genomic effects of cisplatin-based chemotherapy and implications for subsequent treatment are incompletely characterized. Here, we analyze whole exome sequencing of matched pre- and post-neoadjuvant cisplatin-based chemotherapy primary bladder tumor samples from 30 muscle-invasive bladder cancer patients. We observe no overall increase in tumor mutational burden post-chemotherapy, though a significant proportion of subclonal mutations are unique to the matched pre- or post-treatment tumor, suggesting chemotherapy-induced and/or spatial heterogeneity. We subsequently identify and validate a novel mutational signature in post-treatment tumors consistent with known characteristics of cisplatin damage and repair. We find that post-treatment tumor heterogeneity predicts worse overall survival, and further observe alterations in cell-cycle and immune checkpoint regulation genes in post-treatment tumors. These results provide insight into the clinical and genomic dynamics of tumor evolution with cisplatin-based chemotherapy, suggest mechanisms of clinical resistance, and inform development of clinically relevant biomarkers and trials of combination therapies.
Eotaxin-3 and a uniquely conserved gene-expression profile in eosinophilic esophagitis

PubMed Central

Blanchard, Carine; Wang, Ning; Stringer, Keith F.; Mishra, Anil; Fulkerson, Patricia C.; Abonia, J. Pablo; Jameson, Sean C.; Kirby, Cassie; Konikoff, Michael R.; Collins, Margaret H.; Cohen, Mitchell B.; Akers, Rachel; Hogan, Simon P.; Assa’ad, Amal H.; Putnam, Philip E.; Aronow, Bruce J.; Rothenberg, Marc E.

2006-01-01

Eosinophilic esophagitis (EE) is an emerging disorder with a poorly understood pathogenesis. In order to define disease mechanisms, we took an empirical approach analyzing esophageal tissue by a genome-wide microarray expression analysis. EE patients had a striking transcript signature involving 1% of the human genome that was remarkably conserved across sex, age, and allergic status and was distinct from that associated with non-EE chronic esophagitis. Notably, the gene encoding the eosinophil-specific chemoattractant eotaxin-3 (also known as CCL26) was the most highly induced gene in EE patients compared with its expression level in healthy individuals. Esophageal eotaxin-3 mRNA and protein levels strongly correlated with tissue eosinophilia and mastocytosis. Furthermore, a single-nucleotide polymorphism in the human eotaxin-3 gene was associated with disease susceptibility. Finally, mice deficient in the eotaxin receptor (also known as CCR3) were protected from experimental EE. These results implicate eotaxin-3 as a critical effector molecule for EE and provide insight into disease pathogenesis. PMID:16453027
Genomic Analyses Reveal the Influence of Geographic Origin, Migration, and Hybridization on Modern Dog Breed Development.

PubMed

Parker, Heidi G; Dreger, Dayna L; Rimbault, Maud; Davis, Brian W; Mullen, Alexandra B; Carpintero-Ramirez, Gretchen; Ostrander, Elaine A

2017-04-25

There are nearly 400 modern domestic dog breeds with a unique histories and genetic profiles. To track the genetic signatures of breed development, we have assembled the most diverse dataset of dog breeds, reflecting their extensive phenotypic variation and heritage. Combining genetic distance, migration, and genome-wide haplotype sharing analyses, we uncover geographic patterns of development and independent origins of common traits. Our analyses reveal the hybrid history of breeds and elucidate the effects of immigration, revealing for the first time a suggestion of New World dog within some modern breeds. Finally, we used cladistics and haplotype sharing to show that some common traits have arisen more than once in the history of the dog. These analyses characterize the complexities of breed development, resolving longstanding questions regarding individual breed origination, the effect of migration on geographically distinct breeds, and, by inference, transfer of trait and disease alleles among dog breeds. Copyright © 2017 The Author(s). Published by Elsevier Inc. All rights reserved.
Genome-wide mutant profiling predicts the mechanism of a Lipid II binding antibiotic.

PubMed

Santiago, Marina; Lee, Wonsik; Fayad, Antoine Abou; Coe, Kathryn A; Rajagopal, Mithila; Do, Truc; Hennessen, Fabienne; Srisuknimit, Veerasak; Müller, Rolf; Meredith, Timothy C; Walker, Suzanne

2018-06-01

Identifying targets of antibacterial compounds remains a challenging step in the development of antibiotics. We have developed a two-pronged functional genomics approach to predict mechanism of action that uses mutant fitness data from antibiotic-treated transposon libraries containing both upregulation and inactivation mutants. We treated a Staphylococcus aureus transposon library containing 690,000 unique insertions with 32 antibiotics. Upregulation signatures identified from directional biases in insertions revealed known molecular targets and resistance mechanisms for the majority of these. Because single-gene upregulation does not always confer resistance, we used a complementary machine-learning approach to predict the mechanism from inactivation mutant fitness profiles. This approach suggested the cell wall precursor Lipid II as the molecular target of the lysocins, a mechanism we have confirmed. We conclude that docking to membrane-anchored Lipid II precedes the selective bacteriolysis that distinguishes these lytic natural products, showing the utility of our approach for nominating the antibiotic mechanism of action.
Copy number variation signature to predict human ancestry

PubMed Central

2012-01-01

Background Copy number variations (CNVs) are genomic structural variants that are found in healthy populations and have been observed to be associated with disease susceptibility. Existing methods for CNV detection are often performed on a sample-by-sample basis, which is not ideal for large datasets where common CNVs must be estimated by comparing the frequency of CNVs in the individual samples. Here we describe a simple and novel approach to locate genome-wide CNVs common to a specific population, using human ancestry as the phenotype. Results We utilized our previously published Genome Alteration Detection Analysis (GADA) algorithm to identify common ancestry CNVs (caCNVs) and built a caCNV model to predict population structure. We identified a 73 caCNV signature using a training set of 225 healthy individuals from European, Asian, and African ancestry. The signature was validated on an independent test set of 300 individuals with similar ancestral background. The error rate in predicting ancestry in this test set was 2% using the 73 caCNV signature. Among the caCNVs identified, several were previously confirmed experimentally to vary by ancestry. Our signature also contains a caCNV region with a single microRNA (MIR270), which represents the first reported variation of microRNA by ancestry. Conclusions We developed a new methodology to identify common CNVs and demonstrated its performance by building a caCNV signature to predict human ancestry with high accuracy. The utility of our approach could be extended to large case–control studies to identify CNV signatures for other phenotypes such as disease susceptibility and drug response. PMID:23270563
The search for biomarkers in the critically ill: a cautionary tale.

PubMed

Moran, John L; Solomon, Patricia J

2018-06-01

The search for biomarkers has been described as a dismal patchwork of fragmented research. We review biomarkers in sepsis in the critically ill in terms of conventional single circulating proteins. Despite sepsis biomarker publications trebling over the past 6 years, currently only one, procalcitonin, has materialised promise. We survey genomic biomarker initiatives, single nucleotide polymorphisms (SNPs) and gene signatures. Despite many SNP associations with sepsis susceptibility and a limited number of genome-wide association studies, the status of these associations is that of genomic signposts only. The standing of gene signatures in the paradigmatic discipline, breast cancer, is described. Uncertainties in the understanding of the sepsis process are documented - the dissociation between blood and tissue element activity, or compartmentalisation. The paradox of the active search for gene signatures to refine the sepsis phenotype and discover target subtypes for new therapies in the absence of such therapies is presented.
Genomic signatures of near-extinction and rebirth of the crested ibis and other endangered bird species.

PubMed

Li, Shengbin; Li, Bo; Cheng, Cheng; Xiong, Zijun; Liu, Qingbo; Lai, Jianghua; Carey, Hannah V; Zhang, Qiong; Zheng, Haibo; Wei, Shuguang; Zhang, Hongbo; Chang, Liao; Liu, Shiping; Zhang, Shanxin; Yu, Bing; Zeng, Xiaofan; Hou, Yong; Nie, Wenhui; Guo, Youmin; Chen, Teng; Han, Jiuqiang; Wang, Jian; Wang, Jun; Chen, Chen; Liu, Jiankang; Stambrook, Peter J; Xu, Ming; Zhang, Guojie; Gilbert, M Thomas P; Yang, Huanming; Jarvis, Erich D; Yu, Jun; Yan, Jianqun

2014-01-01

Nearly one-quarter of all avian species is either threatened or nearly threatened. Of these, 73 species are currently being rescued from going extinct in wildlife sanctuaries. One of the previously most critically-endangered is the crested ibis, Nipponia nippon. Once widespread across North-East Asia, by 1981 only seven individuals from two breeding pairs remained in the wild. The recovering crested ibis populations thus provide an excellent example for conservation genomics since every individual bird has been recruited for genomic and demographic studies. Using high-quality genome sequences of multiple crested ibis individuals, its thriving co-habitant, the little egret, Egretta garzetta, and the recently sequenced genomes of 41 other avian species that are under various degrees of survival threats, including the bald eagle, we carry out comparative analyses for genomic signatures of near extinction events in association with environmental and behavioral attributes of species. We confirm that both loss of genetic diversity and enrichment of deleterious mutations of protein-coding genes contribute to the major genetic defects of the endangered species. We further identify that genetic inbreeding and loss-of-function genes in the crested ibis may all constitute genetic susceptibility to other factors including long-term climate change, over-hunting, and agrochemical overuse. We also establish a genome-wide DNA identification platform for molecular breeding and conservation practices, to facilitate sustainable recovery of endangered species. These findings demonstrate common genomic signatures of population decline across avian species and pave a way for further effort in saving endangered species and enhancing conservation genomic efforts.
The role of parasite-driven selection in shaping landscape genomic structure in red grouse (Lagopus lagopus scotica).

PubMed

Wenzel, Marius A; Douglas, Alex; James, Marianne C; Redpath, Steve M; Piertney, Stuart B

2016-01-01

Landscape genomics promises to provide novel insights into how neutral and adaptive processes shape genome-wide variation within and among populations. However, there has been little emphasis on examining whether individual-based phenotype-genotype relationships derived from approaches such as genome-wide association (GWAS) manifest themselves as a population-level signature of selection in a landscape context. The two may prove irreconcilable as individual-level patterns become diluted by high levels of gene flow and complex phenotypic or environmental heterogeneity. We illustrate this issue with a case study that examines the role of the highly prevalent gastrointestinal nematode Trichostrongylus tenuis in shaping genomic signatures of selection in red grouse (Lagopus lagopus scotica). Individual-level GWAS involving 384 SNPs has previously identified five SNPs that explain variation in T. tenuis burden. Here, we examine whether these same SNPs display population-level relationships between T. tenuis burden and genetic structure across a small-scale landscape of 21 sites with heterogeneous parasite pressure. Moreover, we identify adaptive SNPs showing signatures of directional selection using F(ST) outlier analysis and relate population- and individual-level patterns of multilocus neutral and adaptive genetic structure to T. tenuis burden. The five candidate SNPs for parasite-driven selection were neither associated with T. tenuis burden on a population level, nor under directional selection. Similarly, there was no evidence of parasite-driven selection in SNPs identified as candidates for directional selection. We discuss these results in the context of red grouse ecology and highlight the broader consequences for the utility of landscape genomics approaches for identifying signatures of selection. © 2015 John Wiley & Sons Ltd.
Genome-wide detection of selection signatures in Chinese indigenous Laiwu pigs revealed candidate genes regulating fat deposition in muscle.

PubMed

Chen, Minhui; Wang, Jiying; Wang, Yanping; Wu, Ying; Fu, Jinluan; Liu, Jian-Feng

2018-05-18

Currently, genome-wide scans for positive selection signatures in commercial breed have been investigated. However, few studies have focused on selection footprints of indigenous breeds. Laiwu pig is an invaluable Chinese indigenous pig breed with extremely high proportion of intramuscular fat (IMF), and an excellent model to detect footprint as the result of natural and artificial selection for fat deposition in muscle. In this study, based on GeneSeek Genomic profiler Porcine HD data, three complementary methods, F ST , iHS (integrated haplotype homozygosity score) and CLR (composite likelihood ratio), were implemented to detect selection signatures in the whole genome of Laiwu pigs. Totally, 175 candidate selected regions were obtained by at least two of the three methods, which covered 43.75 Mb genomic regions and corresponded to 1.79% of the genome sequence. Gene annotation of the selected regions revealed a list of functionally important genes for feed intake and fat deposition, reproduction, and immune response. Especially, in accordance to the phenotypic features of Laiwu pigs, among the candidate genes, we identified several genes, NPY1R, NPY5R, PIK3R1 and JAKMIP1, involved in the actions of two sets of neurons, which are central regulators in maintaining the balance between food intake and energy expenditure. Our results identified a number of regions showing signatures of selection, as well as a list of functionally candidate genes with potential effect on phenotypic traits, especially fat deposition in muscle. Our findings provide insights into the mechanisms of artificial selection of fat deposition and further facilitate follow-up functional studies.
Identifying anti-cancer drug response related genes using an integrative analysis of transcriptomic and genomic variations with cell line-based drug perturbations.

PubMed

Sun, Yi; Zhang, Wei; Chen, Yunqin; Ma, Qin; Wei, Jia; Liu, Qi

2016-02-23

Clinical responses to anti-cancer therapies often only benefit a defined subset of patients. Predicting the best treatment strategy hinges on our ability to effectively translate genomic data into actionable information on drug responses. To achieve this goal, we compiled a comprehensive collection of baseline cancer genome data and drug response information derived from a large panel of cancer cell lines. This data set was applied to identify the signature genes relevant to drug sensitivity and their resistance by integrating CNVs and the gene expression of cell lines with in vitro drug responses. We presented an efficient in-silico pipeline for integrating heterogeneous cell line data sources with the simultaneous modeling of drug response values across all the drugs and cell lines. Potential signature genes correlated with drug response (sensitive or resistant) in different cancer types were identified. Using signature genes, our collaborative filtering-based drug response prediction model outperformed the 44 algorithms submitted to the DREAM competition on breast cancer cells. The functions of the identified drug response related signature genes were carefully analyzed at the pathway level and the synthetic lethality level. Furthermore, we validated these signature genes by applying them to the classification of the different subtypes of the TCGA tumor samples, and further uncovered their in vivo implications using clinical patient data. Our work may have promise in translating genomic data into customized marker genes relevant to the response of specific drugs for a specific cancer type of individual patients.
The Valdostana goat: a genome-wide investigation of the distinctiveness of its selective sweep regions.

PubMed

Talenti, Andrea; Bertolini, Francesca; Pagnacco, Giulio; Pilla, Fabio; Ajmone-Marsan, Paolo; Rothschild, Max F; Crepaldi, Paola

2017-04-01

The Valdostana goat is an alpine breed, raised only in the northern Italian region of the Aosta Valley. This breed's main purpose is to produce milk and meat, but is peculiar for its involvement in the "Batailles de Chèvres," a recent tradition of non-cruel fight tournaments. At both the genetic and genomic levels, only a very limited number of studies have been performed with this breed and there are no studies about the genomic signatures left by selection. In this work, 24 unrelated Valdostana animals were screened for runs of homozygosity to identify highly homozygous regions. Then, six different approaches (ROH comparison, Fst single SNPs and windows based, Bayesian, Rsb, and XP-EHH) were applied comparing the Valdostana dataset with 14 other Italian goat breeds to confirm regions that were different among the comparisons. A total of three regions of selection that were also unique among the Valdostana were identified and located on chromosomes 1, 7, and 12 and contained 144 genes. Enrichment analyses detected genes such as cytokines and lymphocyte/leukocyte proliferation genes involved in the regulation of the immune system. A genetic link between an aggressive challenge, cytokines, and immunity has been hypothesized in many studies both in humans and in other species. Possible hypotheses associated with the signals of selection detected could be therefore related to immune-related factors as well as with the peculiar battle competition, or other breed-specific traits, and provided insights for further investigation of these unique regions, for the understanding and safeguard of the Valdostana breed.
Comprehensive sieve analysis of breakthrough HIV-1 sequences in the RV144 vaccine efficacy trial.

PubMed

Edlefsen, Paul T; Rolland, Morgane; Hertz, Tomer; Tovanabutra, Sodsai; Gartland, Andrew J; deCamp, Allan C; Magaret, Craig A; Ahmed, Hasan; Gottardo, Raphael; Juraska, Michal; McCoy, Connor; Larsen, Brendan B; Sanders-Buell, Eric; Carrico, Chris; Menis, Sergey; Kijak, Gustavo H; Bose, Meera; Arroyo, Miguel A; O'Connell, Robert J; Nitayaphan, Sorachai; Pitisuttithum, Punnee; Kaewkungwal, Jaranit; Rerks-Ngarm, Supachai; Robb, Merlin L; Kirys, Tatsiana; Georgiev, Ivelin S; Kwong, Peter D; Scheffler, Konrad; Pond, Sergei L Kosakovsky; Carlson, Jonathan M; Michael, Nelson L; Schief, William R; Mullins, James I; Kim, Jerome H; Gilbert, Peter B

2015-02-01

The RV144 clinical trial showed the partial efficacy of a vaccine regimen with an estimated vaccine efficacy (VE) of 31% for protecting low-risk Thai volunteers against acquisition of HIV-1. The impact of vaccine-induced immune responses can be investigated through sieve analysis of HIV-1 breakthrough infections (infected vaccine and placebo recipients). A V1/V2-targeted comparison of the genomes of HIV-1 breakthrough viruses identified two V2 amino acid sites that differed between the vaccine and placebo groups. Here we extended the V1/V2 analysis to the entire HIV-1 genome using an array of methods based on individual sites, k-mers and genes/proteins. We identified 56 amino acid sites or "signatures" and 119 k-mers that differed between the vaccine and placebo groups. Of those, 19 sites and 38 k-mers were located in the regions comprising the RV144 vaccine (Env-gp120, Gag, and Pro). The nine signature sites in Env-gp120 were significantly enriched for known antibody-associated sites (p = 0.0021). In particular, site 317 in the third variable loop (V3) overlapped with a hotspot of antibody recognition, and sites 369 and 424 were linked to CD4 binding site neutralization. The identified signature sites significantly covaried with other sites across the genome (mean = 32.1) more than did non-signature sites (mean = 0.9) (p < 0.0001), suggesting functional and/or structural relevance of the signature sites. Since signature sites were not preferentially restricted to the vaccine immunogens and because most of the associations were insignificant following correction for multiple testing, we predict that few of the genetic differences are strongly linked to the RV144 vaccine-induced immune pressure. In addition to presenting results of the first complete-genome analysis of the breakthrough infections in the RV144 trial, this work describes a set of statistical methods and tools applicable to analysis of breakthrough infection genomes in general vaccine efficacy trials for diverse pathogens.
Stromal-Based Signatures for the Classification of Gastric Cancer.

PubMed

Uhlik, Mark T; Liu, Jiangang; Falcon, Beverly L; Iyer, Seema; Stewart, Julie; Celikkaya, Hilal; O'Mahony, Marguerita; Sevinsky, Christopher; Lowes, Christina; Douglass, Larry; Jeffries, Cynthia; Bodenmiller, Diane; Chintharlapalli, Sudhakar; Fischl, Anthony; Gerald, Damien; Xue, Qi; Lee, Jee-Yun; Santamaria-Pang, Alberto; Al-Kofahi, Yousef; Sui, Yunxia; Desai, Keyur; Doman, Thompson; Aggarwal, Amit; Carter, Julia H; Pytowski, Bronislaw; Jaminet, Shou-Ching; Ginty, Fiona; Nasir, Aejaz; Nagy, Janice A; Dvorak, Harold F; Benjamin, Laura E

2016-05-01

Treatment of metastatic gastric cancer typically involves chemotherapy and monoclonal antibodies targeting HER2 (ERBB2) and VEGFR2 (KDR). However, reliable methods to identify patients who would benefit most from a combination of treatment modalities targeting the tumor stroma, including new immunotherapy approaches, are still lacking. Therefore, we integrated a mouse model of stromal activation and gastric cancer genomic information to identify gene expression signatures that may inform treatment strategies. We generated a mouse model in which VEGF-A is expressed via adenovirus, enabling a stromal response marked by immune infiltration and angiogenesis at the injection site, and identified distinct stromal gene expression signatures. With these data, we designed multiplexed IHC assays that were applied to human primary gastric tumors and classified each tumor to a dominant stromal phenotype representative of the vascular and immune diversity found in gastric cancer. We also refined the stromal gene signatures and explored their relation to the dominant patient phenotypes identified by recent large-scale studies of gastric cancer genomics (The Cancer Genome Atlas and Asian Cancer Research Group), revealing four distinct stromal phenotypes. Collectively, these findings suggest that a genomics-based systems approach focused on the tumor stroma can be used to discover putative predictive biomarkers of treatment response, especially to antiangiogenesis agents and immunotherapy, thus offering an opportunity to improve patient stratification. Cancer Res; 76(9); 2573-86. ©2016 AACR. ©2016 American Association for Cancer Research.
Signatures of positive selection in African Butana and Kenana dairy zebu cattle.

PubMed

Bahbahani, Hussain; Salim, Bashir; Almathen, Faisal; Al Enezi, Fahad; Mwacharo, Joram M; Hanotte, Olivier

2018-01-01

Butana and Kenana are two types of zebu cattle found in Sudan. They are unique amongst African indigenous zebu cattle because of their high milk production. Aiming to understand their genome structure, we genotyped 25 individuals from each breed using the Illumina BovineHD Genotyping BeadChip. Genetic structure analysis shows that both breeds have an admixed genome composed of an even proportion of indicine (0.75 ± 0.03 in Butana, 0.76 ± 0.006 in Kenana) and taurine (0.23 ± 0.009 in Butana, 0.24 ± 0.006 in Kenana) ancestries. We also observe a proportion of 0.02 to 0.12 of European taurine ancestry in ten individuals of Butana that were sampled from cattle herds in Tamboul area suggesting local crossbreeding with exotic breeds. Signatures of selection analyses (iHS and Rsb) reveal 87 and 61 candidate positive selection regions in Butana and Kenana, respectively. These regions span genes and quantitative trait loci (QTL) associated with biological pathways that are important for adaptation to marginal environments (e.g., immunity, reproduction and heat tolerance). Trypanotolerance QTL are intersecting candidate regions in Kenana cattle indicating selection pressure acting on them, which might be associated with an unexplored level of trypanotolerance in this cattle breed. Several dairy traits QTL are overlapping the identified candidate regions in these two zebu cattle breeds. Our findings underline the potential to improve dairy production in the semi-arid pastoral areas of Africa through breeding improvement strategy of indigenous local breeds.
Transcriptional architecture of the primate neocortex.

PubMed

Bernard, Amy; Lubbers, Laura S; Tanis, Keith Q; Luo, Rui; Podtelezhnikov, Alexei A; Finney, Eva M; McWhorter, Mollie M E; Serikawa, Kyle; Lemon, Tracy; Morgan, Rebecca; Copeland, Catherine; Smith, Kimberly; Cullen, Vivian; Davis-Turak, Jeremy; Lee, Chang-Kyu; Sunkin, Susan M; Loboda, Andrey P; Levine, David M; Stone, David J; Hawrylycz, Michael J; Roberts, Christopher J; Jones, Allan R; Geschwind, Daniel H; Lein, Ed S

2012-03-22

Genome-wide transcriptional profiling was used to characterize the molecular underpinnings of neocortical organization in rhesus macaque, including cortical areal specialization and laminar cell-type diversity. Microarray analysis of individual cortical layers across sensorimotor and association cortices identified robust and specific molecular signatures for individual cortical layers and areas, prominently involving genes associated with specialized neuronal function. Overall, transcriptome-based relationships were related to spatial proximity, being strongest between neighboring cortical areas and between proximal layers. Primary visual cortex (V1) displayed the most distinctive gene expression compared to other cortical regions in rhesus and human, both in the specialized layer 4 as well as other layers. Laminar patterns were more similar between macaque and human compared to mouse, as was the unique V1 profile that was not observed in mouse. These data provide a unique resource detailing neocortical transcription patterns in a nonhuman primate with great similarity in gene expression to human. Copyright © 2012 Elsevier Inc. All rights reserved.

Customized Molecular Phenotyping by Quantitative Gene Expression and Pattern Recognition Analysis

PubMed Central

Akilesh, Shreeram; Shaffer, Daniel J.; Roopenian, Derry

2003-01-01

Description of the molecular phenotypes of pathobiological processes in vivo is a pressing need in genomic biology. We have implemented a high-throughput real-time PCR strategy to establish quantitative expression profiles of a customized set of target genes. It enables rapid, reproducible data acquisition from limited quantities of RNA, permitting serial sampling of mouse blood during disease progression. We developed an easy to use statistical algorithm—Global Pattern Recognition—to readily identify genes whose expression has changed significantly from healthy baseline profiles. This approach provides unique molecular signatures for rheumatoid arthritis, systemic lupus erythematosus, and graft versus host disease, and can also be applied to defining the molecular phenotype of a variety of other normal and pathological processes. PMID:12840047
Lesson 5: Defining Valid Electronic Signatures

EPA Pesticide Factsheets

A valid electronic signature on an electronic document is one that is created with an electronic signature device that is uniquely entitled to a signatory, not compromised, and used by a signatory who is authorized to sign the electronic document.
Cis-regulatory signatures of orthologous stress-associated bZIP transcription factors from rice, sorghum and Arabidopsis based on phylogenetic footprints

PubMed Central

2012-01-01

Background The potential contribution of upstream sequence variation to the unique features of orthologous genes is just beginning to be unraveled. A core subset of stress-associated bZIP transcription factors from rice (Oryza sativa) formed ten clusters of orthologous groups (COG) with genes from the monocot sorghum (Sorghum bicolor) and dicot Arabidopsis (Arabidopsis thaliana). The total cis-regulatory information content of each stress-associated COG was examined by phylogenetic footprinting to reveal ortholog-specific, lineage-specific and species-specific conservation patterns. Results The most apparent pattern observed was the occurrence of spatially conserved ‘core modules’ among the COGs but not among paralogs. These core modules are comprised of various combinations of two to four putative transcription factor binding site (TFBS) classes associated with either developmental or stress-related functions. Outside the core modules are specific stress (ABA, oxidative, abiotic, biotic) or organ-associated signals, which may be functioning as ‘regulatory fine-tuners’ and further define lineage-specific and species-specific cis-regulatory signatures. Orthologous monocot and dicot promoters have distinct TFBS classes involved in disease and oxidative-regulated expression, while the orthologous rice and sorghum promoters have distinct combinations of root-specific signals, a pattern that is not particularly conserved in Arabidopsis. Conclusions Patterns of cis-regulatory conservation imply that each ortholog has distinct signatures, further suggesting that they are potentially unique in a regulatory context despite the presumed conservation of broad biological function during speciation. Based on the observed patterns of conservation, we postulate that core modules are likely primary determinants of basal developmental programming, which may be integrated with and further elaborated by additional intrinsic or extrinsic signals in conjunction with lineage-specific or species-specific regulatory fine-tuners. This synergy may be critical for finer-scale spatio-temporal regulation, hence unique expression profiles of homologous transcription factors from different species with distinct zones of ecological adaptation such as rice, sorghum and Arabidopsis. The patterns revealed from these comparisons set the stage for further empirical validation by functional genomics. PMID:22992304
Signatures of natural selection and ecological differentiation in microbial genomes.

PubMed

Shapiro, B Jesse

2014-01-01

We live in a microbial world. Most of the genetic and metabolic diversity that exists on earth - and has existed for billions of years - is microbial. Making sense of this vast diversity is a daunting task, but one that can be approached systematically by analyzing microbial genome sequences. This chapter explores how the evolutionary forces of recombination and selection act to shape microbial genome sequences, leaving signatures that can be detected using comparative genomics and population-genetic tests for selection. I describe the major classes of tests, paying special attention to their relative strengths and weaknesses when applied to microbes. Specifically, I apply a suite of tests for selection to a set of closely-related bacterial genomes with different microhabitat preferences within the marine water column, shedding light on the genomic mechanisms of ecological differentiation in the wild. I will focus on the joint problem of simultaneously inferring the boundaries between microbial populations, and the selective forces operating within and between populations.
matK-QR classifier: a patterns based approach for plant species identification.

PubMed

More, Ravi Prabhakar; Mane, Rupali Chandrashekhar; Purohit, Hemant J

2016-01-01

DNA barcoding is widely used and most efficient approach that facilitates rapid and accurate identification of plant species based on the short standardized segment of the genome. The nucleotide sequences of maturaseK ( matK ) and ribulose-1, 5-bisphosphate carboxylase ( rbcL ) marker loci are commonly used in plant species identification. Here, we present a new and highly efficient approach for identifying a unique set of discriminating nucleotide patterns to generate a signature (i.e. regular expression) for plant species identification. In order to generate molecular signatures, we used matK and rbcL loci datasets, which encompass 125 plant species in 52 genera reported by the CBOL plant working group. Initially, we performed Multiple Sequence Alignment (MSA) of all species followed by Position Specific Scoring Matrix (PSSM) for both loci to achieve a percentage of discrimination among species. Further, we detected Discriminating Patterns (DP) at genus and species level using PSSM for the matK dataset. Combining DP and consecutive pattern distances, we generated molecular signatures for each species. Finally, we performed a comparative assessment of these signatures with the existing methods including BLASTn, Support Vector Machines (SVM), Jrip-RIPPER, J48 (C4.5 algorithm), and the Naïve Bayes (NB) methods against NCBI-GenBank matK dataset. Due to the higher discrimination success obtained with the matK as compared to the rbcL , we selected matK gene for signature generation. We generated signatures for 60 species based on identified discriminating patterns at genus and species level. Our comparative assessment results suggest that a total of 46 out of 60 species could be correctly identified using generated signatures, followed by BLASTn (34 species), SVM (18 species), C4.5 (7 species), NB (4 species) and RIPPER (3 species) methods As a final outcome of this study, we converted signatures into QR codes and developed a software matK -QR Classifier (http://www.neeri.res.in/matk_classifier/index.htm), which search signatures in the query matK gene sequences and predict corresponding plant species. This novel approach of employing pattern-based signatures opens new avenues for the classification of species. In addition to existing methods, we believe that matK -QR Classifier would be a valuable tool for molecular taxonomists enabling precise identification of plant species.
Integrative ChIP-seq/Microarray Analysis Identifies a CTNNB1 Target Signature Enriched in Intestinal Stem Cells and Colon Cancer

PubMed Central

Watanabe, Kazuhide; Biesinger, Jacob; Salmans, Michael L.; Roberts, Brian S.; Arthur, William T.; Cleary, Michele; Andersen, Bogi; Xie, Xiaohui; Dai, Xing

2014-01-01

Background Deregulation of canonical Wnt/CTNNB1 (beta-catenin) pathway is one of the earliest events in the pathogenesis of colon cancer. Mutations in APC or CTNNB1 are highly frequent in colon cancer and cause aberrant stabilization of CTNNB1, which activates the transcription of Wnt target genes by binding to chromatin via the TCF/LEF transcription factors. Here we report an integrative analysis of genome-wide chromatin occupancy of CTNNB1 by chromatin immunoprecipitation coupled with high-throughput sequencing (ChIP-seq) and gene expression profiling by microarray analysis upon RNAi-mediated knockdown of CTNNB1 in colon cancer cells. Results We observed 3629 CTNNB1 binding peaks across the genome and a significant correlation between CTNNB1 binding and knockdown-induced gene expression change. Our integrative analysis led to the discovery of a direct Wnt target signature composed of 162 genes. Gene ontology analysis of this signature revealed a significant enrichment of Wnt pathway genes, suggesting multiple feedback regulations of the pathway. We provide evidence that this gene signature partially overlaps with the Lgr5+ intestinal stem cell signature, and is significantly enriched in normal intestinal stem cells as well as in clinical colorectal cancer samples. Interestingly, while the expression of the CTNNB1 target gene set does not correlate with survival, elevated expression of negative feedback regulators within the signature predicts better prognosis. Conclusion Our data provide a genome-wide view of chromatin occupancy and gene regulation of Wnt/CTNNB1 signaling in colon cancer cells. PMID:24651522
Integrative ChIP-seq/microarray analysis identifies a CTNNB1 target signature enriched in intestinal stem cells and colon cancer.

PubMed

Watanabe, Kazuhide; Biesinger, Jacob; Salmans, Michael L; Roberts, Brian S; Arthur, William T; Cleary, Michele; Andersen, Bogi; Xie, Xiaohui; Dai, Xing

2014-01-01

Deregulation of canonical Wnt/CTNNB1 (beta-catenin) pathway is one of the earliest events in the pathogenesis of colon cancer. Mutations in APC or CTNNB1 are highly frequent in colon cancer and cause aberrant stabilization of CTNNB1, which activates the transcription of Wnt target genes by binding to chromatin via the TCF/LEF transcription factors. Here we report an integrative analysis of genome-wide chromatin occupancy of CTNNB1 by chromatin immunoprecipitation coupled with high-throughput sequencing (ChIP-seq) and gene expression profiling by microarray analysis upon RNAi-mediated knockdown of CTNNB1 in colon cancer cells. We observed 3629 CTNNB1 binding peaks across the genome and a significant correlation between CTNNB1 binding and knockdown-induced gene expression change. Our integrative analysis led to the discovery of a direct Wnt target signature composed of 162 genes. Gene ontology analysis of this signature revealed a significant enrichment of Wnt pathway genes, suggesting multiple feedback regulations of the pathway. We provide evidence that this gene signature partially overlaps with the Lgr5+ intestinal stem cell signature, and is significantly enriched in normal intestinal stem cells as well as in clinical colorectal cancer samples. Interestingly, while the expression of the CTNNB1 target gene set does not correlate with survival, elevated expression of negative feedback regulators within the signature predicts better prognosis. Our data provide a genome-wide view of chromatin occupancy and gene regulation of Wnt/CTNNB1 signaling in colon cancer cells.
Genetic Diversity on the Sex Chromosomes

PubMed Central

Wilson Sayres, Melissa A

2018-01-01

Abstract Levels and patterns of genetic diversity can provide insights into a population’s history. In species with sex chromosomes, differences between genomic regions with unique inheritance patterns can be used to distinguish between different sets of possible demographic and selective events. This review introduces the differences in population history for sex chromosomes and autosomes, provides the expectations for genetic diversity across the genome under different evolutionary scenarios, and gives an introductory description for how deviations in these expectations are calculated and can be interpreted. Predominantly, diversity on the sex chromosomes has been used to explore and address three research areas: 1) Mating patterns and sex-biased variance in reproductive success, 2) signatures of selection, and 3) evidence for modes of speciation and introgression. After introducing the theory, this review catalogs recent studies of genetic diversity on the sex chromosomes across species within the major research areas that sex chromosomes are typically applied to, arguing that there are broad similarities not only between male-heterogametic (XX/XY) and female-heterogametic (ZZ/ZW) sex determination systems but also any mating system with reduced recombination in a sex-determining region. Further, general patterns of reduced diversity in nonrecombining regions are shared across plants and animals. There are unique patterns across populations with vastly different patterns of mating and speciation, but these do not tend to cluster by taxa or sex determination system. PMID:29635328
Gene Expression Signatures Based on Variability can Robustly Predict Tumor Progression and Prognosis

PubMed Central

Dinalankara, Wikum; Bravo, Héctor Corrada

2015-01-01

Gene expression signatures are commonly used to create cancer prognosis and diagnosis methods, yet only a small number of them are successfully deployed in the clinic since many fail to replicate performance on subsequent validation. A primary reason for this lack of reproducibility is the fact that these signatures attempt to model the highly variable and unstable genomic behavior of cancer. Our group recently introduced gene expression anti-profiles as a robust methodology to derive gene expression signatures based on the observation that while gene expression measurements are highly heterogeneous across tumors of a specific cancer type relative to the normal tissue, their degree of deviation from normal tissue expression in specific genes involved in tissue differentiation is a stable tumor mark that is reproducible across experiments and cancer types. Here we show that constructing gene expression signatures based on variability and the anti-profile approach yields classifiers capable of successfully distinguishing benign growths from cancerous growths based on deviation from normal expression. We then show that this same approach generates stable and reproducible signatures that predict probability of relapse and survival based on tumor gene expression. These results suggest that using the anti-profile framework for the discovery of genomic signatures is an avenue leading to the development of reproducible signatures suitable for adoption in clinical settings. PMID:26078586
Innovative assembly strategy contributes to understanding the evolution and conservation genetics of the endangered Solenodon paradoxus from the island of Hispaniola.

PubMed

Grigorev, Kirill; Kliver, Sergey; Dobrynin, Pavel; Komissarov, Aleksey; Wolfsberger, Walter; Krasheninnikova, Ksenia; Afanador-Herna Ndez, Yashira M; Brandt, Adam L; Paulino, Liz A; Carreras, Rosanna; Rodríguez, Luis E; Nu N Ez, Adrell; Brandt, Jessica R; Silva, Filipe; Herna Ndez-Martich, J David; Majeske, Audrey J; Antunes, Agostinho; Roca, Alfred L; O'Brien, Stephen J; Martínez-Cruzado, Juan Carlos; Oleksyk, Taras K

2018-03-16

Solenodons are insectivores living in Hispaniola and Cuba that form an isolated branch in the tree of placental mammals highly divergent from other eulipothyplan insectivores The history, unique biology and adaptations of these enigmatic venomous species could be illuminated by the availability of genome data, but a whole genome assembly for solenodons has not been previously performed, partially due to the difficulty in obtaining samples from the field. Island isolation and reduced numbers have likely resulted in high homozygosity within the Hispaniolan solenodon (Solenodon paradoxus), thus we tested the performance of several assembly strategies on the genome of this genetically impoverished species. The string-graph based assembly strategy seemed a better choice compared to the conventional de Bruijn graph approach, due to the high levels of homozygosity, which is often a hallmark of endemic or endangered species. A consensus reference genome was assembled from sequences of five individuals from the southern subspecies (S. p. woodi). In addition, we obtained additional sequence from one sample of the northern subspecies (S. p. paradoxus). The resulting genome assemblies were compared to each other, and annotated for genes, with a specific emphasis on venom genes, repeats, variable microsatellite loci and other genomic variants. Phylogenetic positioning and selection signatures were inferred based on 4,416 single copy orthologs from 10 other mammals. We estimated that solenodons diverged from other extant mammals 73.6 Mya. Patterns of SNP variation allowed us to infer population demography, which supported a subspecies split within the Hispaniolan solenodon at least 300 Kya.
Landscape of somatic mutations in 560 breast cancer whole-genome sequences

DOE PAGES

Nik-Zainal, Serena; Davies, Helen; Staaf, Johan; ...

2016-05-02

Here, we analysed whole-genome sequences of 560 breast cancers to advance understanding of the driver mutations conferring clonal advantage and the mutational processes generating somatic mutations. We found that 93 protein-coding cancer genes carried probable driver mutations. Some non-coding regions exhibited high mutation frequencies, but most have distinctive structural features probably causing elevated mutation rates and do not contain driver mutations. Mutational signature analysis was extended to genome rearrangements and revealed twelve base substitution and six rearrangement signatures. Three rearrangement signatures, characterized by tandem duplications or deletions, appear associated with defective homologous-recombination-based DNA repair: one with deficient BRCA1 function, anothermore » with deficient BRCA1 or BRCA2 function, the cause of the third is unknown. This analysis of all classes of somatic mutation across exons, introns and intergenic regions highlights the repertoire of cancer genes and mutational processes operating, and progresses towards a comprehensive account of the somatic genetic basis of breast cancer.« less
Landscape of somatic mutations in 560 breast cancer whole-genome sequences

DOE Office of Scientific and Technical Information (OSTI.GOV)

Nik-Zainal, Serena; Davies, Helen; Staaf, Johan

Here, we analysed whole-genome sequences of 560 breast cancers to advance understanding of the driver mutations conferring clonal advantage and the mutational processes generating somatic mutations. We found that 93 protein-coding cancer genes carried probable driver mutations. Some non-coding regions exhibited high mutation frequencies, but most have distinctive structural features probably causing elevated mutation rates and do not contain driver mutations. Mutational signature analysis was extended to genome rearrangements and revealed twelve base substitution and six rearrangement signatures. Three rearrangement signatures, characterized by tandem duplications or deletions, appear associated with defective homologous-recombination-based DNA repair: one with deficient BRCA1 function, anothermore » with deficient BRCA1 or BRCA2 function, the cause of the third is unknown. This analysis of all classes of somatic mutation across exons, introns and intergenic regions highlights the repertoire of cancer genes and mutational processes operating, and progresses towards a comprehensive account of the somatic genetic basis of breast cancer.« less
Resolution of habitat-associated ecogenomic signatures in bacteriophage genomes and application to microbial source tracking.

PubMed

Ogilvie, Lesley A; Nzakizwanayo, Jonathan; Guppy, Fergus M; Dedi, Cinzia; Diston, David; Taylor, Huw; Ebdon, James; Jones, Brian V

2018-04-01

Just as the expansion in genome sequencing has revealed and permitted the exploitation of phylogenetic signals embedded in bacterial genomes, the application of metagenomics has begun to provide similar insights at the ecosystem level for microbial communities. However, little is known regarding this aspect of bacteriophage associated with microbial ecosystems, and if phage encode discernible habitat-associated signals diagnostic of underlying microbiomes. Here we demonstrate that individual phage can encode clear habitat-related 'ecogenomic signatures', based on relative representation of phage-encoded gene homologues in metagenomic data sets. Furthermore, we show the ecogenomic signature encoded by the gut-associated ɸB124-14 can be used to segregate metagenomes according to environmental origin, and distinguish 'contaminated' environmental metagenomes (subject to simulated in silico human faecal pollution) from uncontaminated data sets. This indicates phage-encoded ecological signals likely possess sufficient discriminatory power for use in biotechnological applications, such as development of microbial source tracking tools for monitoring water quality.
Landscape of somatic mutations in 560 breast cancer whole genome sequences

PubMed Central

Nik-Zainal, Serena; Davies, Helen; Staaf, Johan; Ramakrishna, Manasa; Glodzik, Dominik; Zou, Xueqing; Martincorena, Inigo; Alexandrov, Ludmil B.; Martin, Sancha; Wedge, David C.; Van Loo, Peter; Ju, Young Seok; Smid, Marcel; Brinkman, Arie B; Morganella, Sandro; Aure, Miriam R.; Lingjærde, Ole Christian; Langerød, Anita; Ringnér, Markus; Ahn, Sung-Min; Boyault, Sandrine; Brock, Jane E.; Broeks, Annegien; Butler, Adam; Desmedt, Christine; Dirix, Luc; Dronov, Serge; Fatima, Aquila; Foekens, John A.; Gerstung, Moritz; Hooijer, Gerrit KJ; Jang, Se Jin; Jones, David R.; Kim, Hyung-Yong; King, Tari A.; Krishnamurthy, Savitri; Lee, Hee Jin; Lee, Jeong-Yeon; Li, Yilong; McLaren, Stuart; Menzies, Andrew; Mustonen, Ville; O’Meara, Sarah; Pauporté, Iris; Pivot, Xavier; Purdie, Colin A.; Raine, Keiran; Ramakrishnan, Kamna; Rodríguez-González, F. Germán; Romieu, Gilles; Sieuwerts, Anieta M.; Simpson, Peter T; Shepherd, Rebecca; Stebbings, Lucy; Stefansson, Olafur A; Teague, Jon; Tommasi, Stefania; Treilleux, Isabelle; Van den Eynden, Gert G.; Vermeulen, Peter; Vincent-Salomon, Anne; Yates, Lucy; Caldas, Carlos; van’t Veer, Laura; Tutt, Andrew; Knappskog, Stian; Tan, Benita Kiat Tee; Jonkers, Jos; Borg, Åke; Ueno, Naoto T; Sotiriou, Christos; Viari, Alain; Futreal, P. Andrew; Campbell, Peter J; Span, Paul N.; Van Laere, Steven; Lakhani, Sunil R; Eyfjord, Jorunn E.; Thompson, Alastair M.; Birney, Ewan; Stunnenberg, Hendrik G; van de Vijver, Marc J; Martens, John W.M.; Børresen-Dale, Anne-Lise; Richardson, Andrea L.; Kong, Gu; Thomas, Gilles; Stratton, Michael R.

2016-01-01

We analysed whole genome sequences of 560 breast cancers to advance understanding of the driver mutations conferring clonal advantage and the mutational processes generating somatic mutations. 93 protein-coding cancer genes carried likely driver mutations. Some non-coding regions exhibited high mutation frequencies but most have distinctive structural features probably causing elevated mutation rates and do not harbour driver mutations. Mutational signature analysis was extended to genome rearrangements and revealed 12 base substitution and six rearrangement signatures. Three rearrangement signatures, characterised by tandem duplications or deletions, appear associated with defective homologous recombination based DNA repair: one with deficient BRCA1 function; another with deficient BRCA1 or BRCA2 function; the cause of the third is unknown. This analysis of all classes of somatic mutation across exons, introns and intergenic regions highlights the repertoire of cancer genes and mutational processes operative, and progresses towards a comprehensive account of the somatic genetic basis of breast cancer. PMID:27135926
Genomic comparison of virulent and non-virulent Streptococcus agalactiae in fish.

PubMed

Delannoy, C M J; Zadoks, R N; Crumlish, M; Rodgers, D; Lainson, F A; Ferguson, H W; Turnbull, J; Fontaine, M C

2016-01-01

Streptococcus agalactiae infections in fish are predominantly caused by beta-haemolytic strains of clonal complex (CC) 7, notably its namesake sequence type (ST) 7, or by non-haemolytic strains of CC552, including the globally distributed ST260. In contrast, CC23, including its namesake ST23, has been associated with a wide homeothermic and poikilothermic host range, but never with fish. The aim of this study was to determine whether ST23 is virulent in fish and to identify genomic markers of fish adaptation of S. agalactiae. Intraperitoneal challenge of Nile tilapia, Oreochromis niloticus (Linnaeus), showed that ST260 is lethal at doses down to 10(2) cfu per fish, whereas ST23 does not cause disease at 10(7) cfu per fish. Comparison of the genome sequence of ST260 and ST23 with those of strains derived from fish, cattle and humans revealed the presence of genomic elements that are unique to subpopulations of S. agalactiae that have the ability to infect fish (CC7 and CC552). These loci occurred in clusters exhibiting typical signatures of mobile genetic elements. PCR-based screening of a collection of isolates from multiple host species confirmed the association of selected genes with fish-derived strains. Several fish-associated genes encode proteins that potentially provide fitness in the aquatic environment. © 2014 John Wiley & Sons Ltd.
Delineation of metabolic gene clusters in plant genomes by chromatin signatures

PubMed Central

Yu, Nan; Nützmann, Hans-Wilhelm; MacDonald, James T.; Moore, Ben; Field, Ben; Berriri, Souha; Trick, Martin; Rosser, Susan J.; Kumar, S. Vinod; Freemont, Paul S.; Osbourn, Anne

2016-01-01

Plants are a tremendous source of diverse chemicals, including many natural product-derived drugs. It has recently become apparent that the genes for the biosynthesis of numerous different types of plant natural products are organized as metabolic gene clusters, thereby unveiling a highly unusual form of plant genome architecture and offering novel avenues for discovery and exploitation of plant specialized metabolism. Here we show that these clustered pathways are characterized by distinct chromatin signatures of histone 3 lysine trimethylation (H3K27me3) and histone 2 variant H2A.Z, associated with cluster repression and activation, respectively, and represent discrete windows of co-regulation in the genome. We further demonstrate that knowledge of these chromatin signatures along with chromatin mutants can be used to mine genomes for cluster discovery. The roles of H3K27me3 and H2A.Z in repression and activation of single genes in plants are well known. However, our discovery of highly localized operon-like co-regulated regions of chromatin modification is unprecedented in plants. Our findings raise intriguing parallels with groups of physically linked multi-gene complexes in animals and with clustered pathways for specialized metabolism in filamentous fungi. PMID:26895889
Detection of selection signatures in Piemontese and Marchigiana cattle, two breeds with similar production aptitudes but different selection histories.

PubMed

Sorbolini, Silvia; Marras, Gabriele; Gaspa, Giustino; Dimauro, Corrado; Cellesi, Massimo; Valentini, Alessio; Macciotta, Nicolò Pp

2015-06-23

Domestication and selection are processes that alter the pattern of within- and between-population genetic variability. They can be investigated at the genomic level by tracing the so-called selection signatures. Recently, sequence polymorphisms at the genome-wide level have been investigated in a wide range of animals. A common approach to detect selection signatures is to compare breeds that have been selected for different breeding goals (i.e. dairy and beef cattle). However, genetic variations in different breeds with similar production aptitudes and similar phenotypes can be related to differences in their selection history. In this study, we investigated selection signatures between two Italian beef cattle breeds, Piemontese and Marchigiana, using genotyping data that was obtained with the Illumina BovineSNP50 BeadChip. The comparison was based on the fixation index (Fst), combined with a locally weighted scatterplot smoothing (LOWESS) regression and a control chart approach. In addition, analyses of Fst were carried out to confirm candidate genes. In particular, data were processed using the varLD method, which compares the regional variation of linkage disequilibrium between populations. Genome scans confirmed the presence of selective sweeps in the genomic regions that harbour candidate genes that are known to affect productive traits in cattle such as DGAT1, ABCG2, CAPN3, MSTN and FTO. In addition, several new putative candidate genes (for example ALAS1, ABCB8, ACADS and SOD1) were detected. This study provided evidence on the different selection histories of two cattle breeds and the usefulness of genomic scans to detect selective sweeps even in cattle breeds that are bred for similar production aptitudes.
Systematic bias in genomic classification due to contaminating non-neoplastic tissue in breast tumor samples.

PubMed

Elloumi, Fathi; Hu, Zhiyuan; Li, Yan; Parker, Joel S; Gulley, Margaret L; Amos, Keith D; Troester, Melissa A

2011-06-30

Genomic tests are available to predict breast cancer recurrence and to guide clinical decision making. These predictors provide recurrence risk scores along with a measure of uncertainty, usually a confidence interval. The confidence interval conveys random error and not systematic bias. Standard tumor sampling methods make this problematic, as it is common to have a substantial proportion (typically 30-50%) of a tumor sample comprised of histologically benign tissue. This "normal" tissue could represent a source of non-random error or systematic bias in genomic classification. To assess the performance characteristics of genomic classification to systematic error from normal contamination, we collected 55 tumor samples and paired tumor-adjacent normal tissue. Using genomic signatures from the tumor and paired normal, we evaluated how increasing normal contamination altered recurrence risk scores for various genomic predictors. Simulations of normal tissue contamination caused misclassification of tumors in all predictors evaluated, but different breast cancer predictors showed different types of vulnerability to normal tissue bias. While two predictors had unpredictable direction of bias (either higher or lower risk of relapse resulted from normal contamination), one signature showed predictable direction of normal tissue effects. Due to this predictable direction of effect, this signature (the PAM50) was adjusted for normal tissue contamination and these corrections improved sensitivity and negative predictive value. For all three assays quality control standards and/or appropriate bias adjustment strategies can be used to improve assay reliability. Normal tissue sampled concurrently with tumor is an important source of bias in breast genomic predictors. All genomic predictors show some sensitivity to normal tissue contamination and ideal strategies for mitigating this bias vary depending upon the particular genes and computational methods used in the predictor.
Genomic Signature of Kin Selection in an Ant with Obligately Sterile Workers

PubMed Central

Warner, Michael R.; Mikheyev, Alexander S.

2017-01-01

Abstract Kin selection is thought to drive the evolution of cooperation and conflict, but the specific genes and genome-wide patterns shaped by kin selection are unknown. We identified thousands of genes associated with the sterile ant worker caste, the archetype of an altruistic phenotype shaped by kin selection, and then used population and comparative genomic approaches to study patterns of molecular evolution at these genes. Consistent with population genetic theoretical predictions, worker-upregulated genes experienced reduced selection compared with genes upregulated in reproductive castes. Worker-upregulated genes included more taxonomically restricted genes, indicating that the worker caste has recruited more novel genes, yet these genes also experienced reduced selection. Our study identifies a putative genomic signature of kin selection and helps to integrate emerging sociogenomic data with longstanding social evolution theory. PMID:28419349
Genome-wide signatures of complex introgression and adaptive evolution in the big cats

PubMed Central

Figueiró, Henrique V.; Li, Gang; Trindade, Fernanda J.; Assis, Juliana; Pais, Fabiano; Fernandes, Gabriel; Santos, Sarah H. D.; Hughes, Graham M.; Komissarov, Aleksey; Antunes, Agostinho; Trinca, Cristine S.; Rodrigues, Maíra R.; Linderoth, Tyler; Bi, Ke; Silveira, Leandro; Azevedo, Fernando C. C.; Kantek, Daniel; Ramalho, Emiliano; Brassaloti, Ricardo A.; Villela, Priscilla M. S.; Nunes, Adauto L. V.; Teixeira, Rodrigo H. F.; Morato, Ronaldo G.; Loska, Damian; Saragüeta, Patricia; Gabaldón, Toni; Teeling, Emma C.; O’Brien, Stephen J.; Nielsen, Rasmus; Coutinho, Luiz L.; Oliveira, Guilherme; Murphy, William J.; Eizirik, Eduardo

2017-01-01

The great cats of the genus Panthera comprise a recent radiation whose evolutionary history is poorly understood. Their rapid diversification poses challenges to resolving their phylogeny while offering opportunities to investigate the historical dynamics of adaptive divergence. We report the sequence, de novo assembly, and annotation of the jaguar (Panthera onca) genome, a novel genome sequence for the leopard (Panthera pardus), and comparative analyses encompassing all living Panthera species. Demographic reconstructions indicated that all of these species have experienced variable episodes of population decline during the Pleistocene, ultimately leading to small effective sizes in present-day genomes. We observed pervasive genealogical discordance across Panthera genomes, caused by both incomplete lineage sorting and complex patterns of historical interspecific hybridization. We identified multiple signatures of species-specific positive selection, affecting genes involved in craniofacial and limb development, protein metabolism, hypoxia, reproduction, pigmentation, and sensory perception. There was remarkable concordance in pathways enriched in genomic segments implicated in interspecies introgression and in positive selection, suggesting that these processes were connected. We tested this hypothesis by developing exome capture probes targeting ~19,000 Panthera genes and applying them to 30 wild-caught jaguars. We found at least two genes (DOCK3 and COL4A5, both related to optic nerve development) bearing significant signatures of interspecies introgression and within-species positive selection. These findings indicate that post-speciation admixture has contributed genetic material that facilitated the adaptive evolution of big cat lineages. PMID:28776029

The population genomic signature of environmental selection in the widespread insect-pollinated tree species Frangula alnus at different geographical scales

PubMed Central

De Kort, H; Vandepitte, K; Mergeay, J; Mijnsbrugge, K V; Honnay, O

2015-01-01

The evaluation of the molecular signatures of selection in species lacking an available closely related reference genome remains challenging, yet it may provide valuable fundamental insights into the capacity of populations to respond to environmental cues. We screened 25 native populations of the tree species Frangula alnus subsp. alnus (Rhamnaceae), covering three different geographical scales, for 183 annotated single-nucleotide polymorphisms (SNPs). Standard population genomic outlier screens were combined with individual-based and multivariate landscape genomic approaches to examine the strength of selection relative to neutral processes in shaping genomic variation, and to identify the main environmental agents driving selection. Our results demonstrate a more distinct signature of selection with increasing geographical distance, as indicated by the proportion of SNPs (i) showing exceptional patterns of genetic diversity and differentiation (outliers) and (ii) associated with climate. Both temperature and precipitation have an important role as selective agents in shaping adaptive genomic differentiation in F. alnus subsp. alnus, although their relative importance differed among spatial scales. At the ‘intermediate' and ‘regional' scales, where limited genetic clustering and high population diversity were observed, some indications of natural selection may suggest a major role for gene flow in safeguarding adaptability. High genetic diversity at loci under selection in particular, indicated considerable adaptive potential, which may nevertheless be compromised by the combined effects of climate change and habitat fragmentation. PMID:25944466
Comparative population genomics of Fusarium graminearum reveals adaptive divergence among cereal head blight pathogens

USDA-ARS?s Scientific Manuscript database

In this study we sequenced the genomes of 60 Fusarium graminearum, the major fungal pathogen responsible for Fusarium head blight (FHB) in cereal crops world-wide. To investigate adaptive evolution of FHB pathogens, we performed population-level analyses to characterize genomic structure, signatures...
Genomic and bioinformatics analyses of HAdV-4vac and HAdV-7vac, two human adenovirus (HAdV) strains that constituted original prophylaxis against HAdV-related acute respiratory disease, a reemerging epidemic disease.

PubMed

Purkayastha, Anjan; Su, Jing; McGraw, John; Ditty, Susan E; Hadfield, Ted L; Seto, Jason; Russell, Kevin L; Tibbetts, Clark; Seto, Donald

2005-07-01

Vaccine strains of human adenovirus serotypes 4 and 7 (HAdV-4vac and HAdV-7vac) have been used successfully to prevent adenovirus-related acute respiratory disease outbreaks. The genomes of these two vaccine strains have been sequenced, annotated, and compared with their prototype equivalents with the goals of understanding their genomes for molecular diagnostics applications, vaccine redevelopment, and HAdV pathoepidemiology. These reference genomes are archived in GenBank as HAdV-4vac (35,994 bp; AY594254) and HAdV-7vac (35,240 bp; AY594256). Bioinformatics and comparative whole-genome analyses with their recently reported and archived prototype genomes reveal six mismatches and four insertions-deletions (indels) between the HAdV-4 prototype and vaccine strains, in contrast to the 611 mismatches and 130 indels between the HAdV-7 prototype and vaccine strains. Annotation reveals that the HAdV-4vac and HAdV-7vac genomes contain 51 and 50 coding units, respectively. Neither vaccine strain appears to be attenuated for virulence based on bioinformatics analyses. There is evidence of genome recombination, as the inverted terminal repeat of HAdV-4vac is initially identical to that of species C whereas the prototype is identical to species B1. These vaccine reference sequences yield unique genome signatures for molecular diagnostics. As a molecular forensics application, these references identify the circulating and problematic 1950s era field strains as the original HAdV-4 prototype and the Greider prototype, from which the vaccines are derived. Thus, they are useful for genomic comparisons to current epidemic and reemerging field strains, as well as leading to an understanding of pathoepidemiology among the human adenoviruses.
Genomic and Bioinformatics Analyses of HAdV-4vac and HAdV-7vac, Two Human Adenovirus (HAdV) Strains That Constituted Original Prophylaxis against HAdV-Related Acute Respiratory Disease, a Reemerging Epidemic Disease

PubMed Central

Purkayastha, Anjan; Su, Jing; McGraw, John; Ditty, Susan E.; Hadfield, Ted L.; Seto, Jason; Russell, Kevin L.; Tibbetts, Clark; Seto, Donald

2005-01-01

Vaccine strains of human adenovirus serotypes 4 and 7 (HAdV-4vac and HAdV-7vac) have been used successfully to prevent adenovirus-related acute respiratory disease outbreaks. The genomes of these two vaccine strains have been sequenced, annotated, and compared with their prototype equivalents with the goals of understanding their genomes for molecular diagnostics applications, vaccine redevelopment, and HAdV pathoepidemiology. These reference genomes are archived in GenBank as HAdV-4vac (35,994 bp; AY594254) and HAdV-7vac (35,240 bp; AY594256). Bioinformatics and comparative whole-genome analyses with their recently reported and archived prototype genomes reveal six mismatches and four insertions-deletions (indels) between the HAdV-4 prototype and vaccine strains, in contrast to the 611 mismatches and 130 indels between the HAdV-7 prototype and vaccine strains. Annotation reveals that the HAdV-4vac and HAdV-7vac genomes contain 51 and 50 coding units, respectively. Neither vaccine strain appears to be attenuated for virulence based on bioinformatics analyses. There is evidence of genome recombination, as the inverted terminal repeat of HAdV-4vac is initially identical to that of species C whereas the prototype is identical to species B1. These vaccine reference sequences yield unique genome signatures for molecular diagnostics. As a molecular forensics application, these references identify the circulating and problematic 1950s era field strains as the original HAdV-4 prototype and the Greider prototype, from which the vaccines are derived. Thus, they are useful for genomic comparisons to current epidemic and reemerging field strains, as well as leading to an understanding of pathoepidemiology among the human adenoviruses. PMID:16000418
InFlo: a novel systems biology framework identifies cAMP-CREB1 axis as a key modulator of platinum resistance in ovarian cancer.

PubMed

Dimitrova, N; Nagaraj, A B; Razi, A; Singh, S; Kamalakaran, S; Banerjee, N; Joseph, P; Mankovich, A; Mittal, P; DiFeo, A; Varadan, V

2017-04-27

Characterizing the complex interplay of cellular processes in cancer would enable the discovery of key mechanisms underlying its development and progression. Published approaches to decipher driver mechanisms do not explicitly model tissue-specific changes in pathway networks and the regulatory disruptions related to genomic aberrations in cancers. We therefore developed InFlo, a novel systems biology approach for characterizing complex biological processes using a unique multidimensional framework integrating transcriptomic, genomic and/or epigenomic profiles for any given cancer sample. We show that InFlo robustly characterizes tissue-specific differences in activities of signalling networks on a genome scale using unique probabilistic models of molecular interactions on a per-sample basis. Using large-scale multi-omics cancer datasets, we show that InFlo exhibits higher sensitivity and specificity in detecting pathway networks associated with specific disease states when compared to published pathway network modelling approaches. Furthermore, InFlo's ability to infer the activity of unmeasured signalling network components was also validated using orthogonal gene expression signatures. We then evaluated multi-omics profiles of primary high-grade serous ovarian cancer tumours (N=357) to delineate mechanisms underlying resistance to frontline platinum-based chemotherapy. InFlo was the only algorithm to identify hyperactivation of the cAMP-CREB1 axis as a key mechanism associated with resistance to platinum-based therapy, a finding that we subsequently experimentally validated. We confirmed that inhibition of CREB1 phosphorylation potently sensitized resistant cells to platinum therapy and was effective in killing ovarian cancer stem cells that contribute to both platinum-resistance and tumour recurrence. Thus, we propose InFlo to be a scalable and widely applicable and robust integrative network modelling framework for the discovery of evidence-based biomarkers and therapeutic targets.
DiRE: identifying distant regulatory elements of co-expressed genes

PubMed Central

Gotea, Valer; Ovcharenko, Ivan

2008-01-01

Regulation of gene expression in eukaryotic genomes is established through a complex cooperative activity of proximal promoters and distant regulatory elements (REs) such as enhancers, repressors and silencers. We have developed a web server named DiRE, based on the Enhancer Identification (EI) method, for predicting distant regulatory elements in higher eukaryotic genomes, namely for determining their chromosomal location and functional characteristics. The server uses gene co-expression data, comparative genomics and profiles of transcription factor binding sites (TFBSs) to determine TFBS-association signatures that can be used for discriminating specific regulatory functions. DiRE's unique feature is its ability to detect REs outside of proximal promoter regions, as it takes advantage of the full gene locus to conduct the search. DiRE can predict common REs for any set of input genes for which the user has prior knowledge of co-expression, co-function or other biologically meaningful grouping. The server predicts function-specific REs consisting of clusters of specifically-associated TFBSs and it also scores the association of individual transcription factors (TFs) with the biological function shared by the group of input genes. Its integration with the Array2BIO server allows users to start their analysis with raw microarray expression data. The DiRE web server is freely available at http://dire.dcode.org. PMID:18487623
Genomic analyses in cotton identify signatures of selection and loci associated with fiber quality and yield traits.

PubMed

Fang, Lei; Wang, Qiong; Hu, Yan; Jia, Yinhua; Chen, Jiedan; Liu, Bingliang; Zhang, Zhiyuan; Guan, Xueying; Chen, Shuqi; Zhou, Baoliang; Mei, Gaofu; Sun, Junling; Pan, Zhaoe; He, Shoupu; Xiao, Songhua; Shi, Weijun; Gong, Wenfang; Liu, Jianguang; Ma, Jun; Cai, Caiping; Zhu, Xiefei; Guo, Wangzhen; Du, Xiongming; Zhang, Tianzhen

2017-07-01

Upland cotton (Gossypium hirsutum) is the most important natural fiber crop in the world. The overall genetic diversity among cultivated species of cotton and the genetic changes that occurred during their improvement are poorly understood. Here we report a comprehensive genomic assessment of modern improved upland cotton based on the genome-wide resequencing of 318 landraces and modern improved cultivars or lines. We detected more associated loci for lint yield than for fiber quality, which suggests that lint yield has stronger selection signatures than other traits. We found that two ethylene-pathway-related genes were associated with increased lint yield in improved cultivars. We evaluated the population frequency of each elite allele in historically released cultivar groups and found that 54.8% of the elite genome-wide association study (GWAS) alleles detected were transferred from three founder landraces: Deltapine 15, Stoneville 2B and Uganda Mian. Our results provide a genomic basis for improving cotton cultivars and for further evolutionary analysis of polyploid crops.
Diversity of human copy number variation and multicopy genes.

PubMed

Sudmant, Peter H; Kitzman, Jacob O; Antonacci, Francesca; Alkan, Can; Malig, Maika; Tsalenko, Anya; Sampas, Nick; Bruhn, Laurakay; Shendure, Jay; Eichler, Evan E

2010-10-29

Copy number variants affect both disease and normal phenotypic variation, but those lying within heavily duplicated, highly identical sequence have been difficult to assay. By analyzing short-read mapping depth for 159 human genomes, we demonstrated accurate estimation of absolute copy number for duplications as small as 1.9 kilobase pairs, ranging from 0 to 48 copies. We identified 4.1 million "singly unique nucleotide" positions informative in distinguishing specific copies and used them to genotype the copy and content of specific paralogs within highly duplicated gene families. These data identify human-specific expansions in genes associated with brain development, reveal extensive population genetic diversity, and detect signatures consistent with gene conversion in the human species. Our approach makes ~1000 genes accessible to genetic studies of disease association.
An optimized library for reference-based deconvolution of whole-blood biospecimens assayed using the Illumina HumanMethylationEPIC BeadArray.

PubMed

Salas, Lucas A; Koestler, Devin C; Butler, Rondi A; Hansen, Helen M; Wiencke, John K; Kelsey, Karl T; Christensen, Brock C

2018-05-29

Genome-wide methylation arrays are powerful tools for assessing cell composition of complex mixtures. We compare three approaches to select reference libraries for deconvoluting neutrophil, monocyte, B-lymphocyte, natural killer, and CD4+ and CD8+ T-cell fractions based on blood-derived DNA methylation signatures assayed using the Illumina HumanMethylationEPIC array. The IDOL algorithm identifies a library of 450 CpGs, resulting in an average R 2 = 99.2 across cell types when applied to EPIC methylation data collected on artificial mixtures constructed from the above cell types. Of the 450 CpGs, 69% are unique to EPIC. This library has the potential to reduce unintended technical differences across array platforms.
Isolation and characterization of major histocompatibility complex class II B genes in cranes.

PubMed

Kohyama, Tetsuo I; Akiyama, Takuya; Nishida, Chizuko; Takami, Kazutoshi; Onuma, Manabu; Momose, Kunikazu; Masuda, Ryuichi

2015-11-01

In this study, we isolated and characterized the major histocompatibility complex (MHC) class II B genes in cranes. Genomic sequences spanning exons 1 to 4 were amplified and determined in 13 crane species and three other species closely related to cranes. In all, 55 unique sequences were identified, and at least two polymorphic MHC class II B loci were found in most species. An analysis of sequence polymorphisms showed the signature of positive selection and recombination. A phylogenetic reconstruction based on exon 2 sequences indicated that trans-species polymorphism has persisted for at least 10 million years, whereas phylogenetic analyses of the sequences flanking exon 2 revealed a pattern of concerted evolution. These results suggest that both balancing selection and recombination play important roles in the crane MHC evolution.
Toward understanding dog evolutionary and domestication history.

PubMed

Galibert, Francis; Quignon, Pascale; Hitte, Christophe; André, Catherine

2011-03-01

Dog domestication was probably started very early during the Upper paleolithic period (~35,000 BP), thus well before any other animal or plant domestication. This early process, probably unconscious, is called proto-domestication to distinguish it from the real domestication process that has been dated around 14,000 BC. Genomic DNA analyses have shown recently that domestication started in the Middle East and rapidly expanded into all human populations. Nowadays, the dog population is fragmented in several hundreds of breeds well characterized by their phenotypes that offer a unique spectrum of polymorphism. More recent studies detect genetic signatures that will be useful to highlight breed history as well as the impact of domestication at the DNA level. Copyright © 2011 Académie des sciences. Published by Elsevier SAS. All rights reserved.
Signatures of positive selection in African Butana and Kenana dairy zebu cattle

PubMed Central

Salim, Bashir; Almathen, Faisal; Al Enezi, Fahad; Mwacharo, Joram M.; Hanotte, Olivier

2018-01-01

Butana and Kenana are two types of zebu cattle found in Sudan. They are unique amongst African indigenous zebu cattle because of their high milk production. Aiming to understand their genome structure, we genotyped 25 individuals from each breed using the Illumina BovineHD Genotyping BeadChip. Genetic structure analysis shows that both breeds have an admixed genome composed of an even proportion of indicine (0.75 ± 0.03 in Butana, 0.76 ± 0.006 in Kenana) and taurine (0.23 ± 0.009 in Butana, 0.24 ± 0.006 in Kenana) ancestries. We also observe a proportion of 0.02 to 0.12 of European taurine ancestry in ten individuals of Butana that were sampled from cattle herds in Tamboul area suggesting local crossbreeding with exotic breeds. Signatures of selection analyses (iHS and Rsb) reveal 87 and 61 candidate positive selection regions in Butana and Kenana, respectively. These regions span genes and quantitative trait loci (QTL) associated with biological pathways that are important for adaptation to marginal environments (e.g., immunity, reproduction and heat tolerance). Trypanotolerance QTL are intersecting candidate regions in Kenana cattle indicating selection pressure acting on them, which might be associated with an unexplored level of trypanotolerance in this cattle breed. Several dairy traits QTL are overlapping the identified candidate regions in these two zebu cattle breeds. Our findings underline the potential to improve dairy production in the semi-arid pastoral areas of Africa through breeding improvement strategy of indigenous local breeds. PMID:29300786
Centromere location in Arabidopsis is unaltered by extreme divergence in CENH3 protein sequence

PubMed Central

2017-01-01

During cell division, spindle fibers attach to chromosomes at centromeres. The DNA sequence at regional centromeres is fast evolving with no conserved genetic signature for centromere identity. Instead CENH3, a centromere-specific histone H3 variant, is the epigenetic signature that specifies centromere location across both plant and animal kingdoms. Paradoxically, CENH3 is also adaptively evolving. An ongoing question is whether CENH3 evolution is driven by a functional relationship with the underlying DNA sequence. Here, we demonstrate that despite extensive protein sequence divergence, CENH3 histones from distant species assemble centromeres on the same underlying DNA sequence. We first characterized the organization and diversity of centromere repeats in wild-type Arabidopsis thaliana. We show that A. thaliana CENH3-containing nucleosomes exhibit a strong preference for a unique subset of centromeric repeats. These sequences are largely missing from the genome assemblies and represent the youngest and most homogeneous class of repeats. Next, we tested the evolutionary specificity of this interaction in a background in which the native A. thaliana CENH3 is replaced with CENH3s from distant species. Strikingly, we find that CENH3 from Lepidium oleraceum and Zea mays, although specifying epigenetically weaker centromeres that result in genome elimination upon outcrossing, show a binding pattern on A. thaliana centromere repeats that is indistinguishable from the native CENH3. Our results demonstrate positional stability of a highly diverged CENH3 on independently evolved repeats, suggesting that the sequence specificity of centromeres is determined by a mechanism independent of CENH3. PMID:28223399
The search for loci under selection: trends, biases and progress.

PubMed

Ahrens, Collin W; Rymer, Paul D; Stow, Adam; Bragg, Jason; Dillon, Shannon; Umbers, Kate D L; Dudaniec, Rachael Y

2018-03-01

Detecting genetic variants under selection using F ST outlier analysis (OA) and environmental association analyses (EAAs) are popular approaches that provide insight into the genetic basis of local adaptation. Despite the frequent use of OA and EAA approaches and their increasing attractiveness for detecting signatures of selection, their application to field-based empirical data have not been synthesized. Here, we review 66 empirical studies that use Single Nucleotide Polymorphisms (SNPs) in OA and EAA. We report trends and biases across biological systems, sequencing methods, approaches, parameters, environmental variables and their influence on detecting signatures of selection. We found striking variability in both the use and reporting of environmental data and statistical parameters. For example, linkage disequilibrium among SNPs and numbers of unique SNP associations identified with EAA were rarely reported. The proportion of putatively adaptive SNPs detected varied widely among studies, and decreased with the number of SNPs analysed. We found that genomic sampling effort had a greater impact than biological sampling effort on the proportion of identified SNPs under selection. OA identified a higher proportion of outliers when more individuals were sampled, but this was not the case for EAA. To facilitate repeatability, interpretation and synthesis of studies detecting selection, we recommend that future studies consistently report geographical coordinates, environmental data, model parameters, linkage disequilibrium, and measures of genetic structure. Identifying standards for how OA and EAA studies are designed and reported will aid future transparency and comparability of SNP-based selection studies and help to progress landscape and evolutionary genomics. © 2018 John Wiley & Sons Ltd.
Mutational signatures of DNA mismatch repair deficiency in C. elegans and human cancers.

PubMed

Meier, Bettina; Volkova, Nadezda V; Hong, Ye; Schofield, Pieta; Campbell, Peter J; Gerstung, Moritz; Gartner, Anton

2018-05-01

Throughout their lifetime, cells are subject to extrinsic and intrinsic mutational processes leaving behind characteristic signatures in the genome. DNA mismatch repair (MMR) deficiency leads to hypermutation and is found in different cancer types. Although it is possible to associate mutational signatures extracted from human cancers with possible mutational processes, the exact causation is often unknown. Here, we use C. elegans genome sequencing of pms-2 and mlh-1 knockouts to reveal the mutational patterns linked to C. elegans MMR deficiency and their dependency on endogenous replication errors and errors caused by deletion of the polymerase ε subunit pole-4 Signature extraction from 215 human colorectal and 289 gastric adenocarcinomas revealed three MMR-associated signatures, one of which closely resembles the C. elegans MMR spectrum and strongly discriminates microsatellite stable and unstable tumors (AUC = 98%). A characteristic difference between human and C. elegans MMR deficiency is the lack of elevated levels of N C G > NTG mutations in C. elegans, likely caused by the absence of cytosine (CpG) methylation in worms . The other two human MMR signatures may reflect the interaction between MMR deficiency and other mutagenic processes, but their exact cause remains unknown. In summary, combining information from genetically defined models and cancer samples allows for better aligning mutational signatures to causal mutagenic processes. © 2018 Meier et al.; Published by Cold Spring Harbor Laboratory Press.
Genome-wide SNP genotyping resolves signatures of selection and tetrasomic recombination in peanut

USDA-ARS?s Scientific Manuscript database

Peanut (Arachis hypogaea; 2n=4x=40) is a nutritious food and a good source of vitamins, minerals, and healthy fats. Expansion of genetic and genomic resources for genetic enhancement of cultivated peanut has gained momentum from the sequenced genomes of the diploid ancestors of cultivated peanut. ...
Genome-wide association studies identified novel loci for non-high-density lipoprotein cholesterol and its postprandial lipemic response

USDA-ARS?s Scientific Manuscript database

Non-high-density lipoprotein cholesterol (NHDL) is an independent and superior predictor of CVD risk as compared to low-density lipoprotein alone. It represents a spectrum of atherogenic lipid fractions with possibly a distinct genomic signature. We performed genome-wide association studies (GWAS) t...
Continental-level population differentiation and environmental adaptation in the mushroom Suillus brevipes

PubMed Central

Branco, Sara; Bi, Ke; Liao, Hui-Ling; Gladieux, Pierre; Badouin, Hélène; Ellison, Christopher E.; Nguyen, Nhu H.; Vilgalys, Rytas; Peay, Kabir G.; Taylor, John W.; Bruns, Thomas D.

2016-01-01

Recent advancements in sequencing technology allowed researchers to better address the patterns and mechanisms involved in microbial environmental adaptation at large spatial scales. Here we investigated the genomic basis of adaptation to climate at the continental scale in Suillus brevipes, an ectomycorrhizal fungus symbiotically associated with the roots of pine trees. We used genomic data from 55 individuals in seven locations across North America to perform genome scans to detect signatures of positive selection and assess whether temperature and precipitation were associated with genetic differentiation. We found that S. brevipes exhibited overall strong population differentiation, with potential admixture in Canadian populations. This species also displayed genomic signatures of positive selection as well as genomic sites significantly associated with distinct climatic regimes and abiotic environmental parameters. These genomic regions included genes involved in transmembrane transport of substances and helicase activity potentially involved in cold stress response. Our study sheds light on large-scale environmental adaptation in fungi by identifying putative adaptive genes and providing a framework to further investigate the genetic basis of fungal adaptation. PMID:27761941
Detection of genomic signatures of recent selection in commercial broiler chickens.

PubMed

Fu, Weixuan; Lee, William R; Abasht, Behnam

2016-08-26

Identification of the genomic signatures of recent selection may help uncover causal polymorphisms controlling traits relevant to recent decades of selective breeding in livestock. In this study, we aimed at detecting signatures of recent selection in commercial broiler chickens using genotype information from single nucleotide polymorphisms (SNPs). A total of 565 chickens from five commercial purebred lines, including three broiler sire (male) lines and two broiler dam (female) lines, were genotyped using the 60K SNP Illumina iSelect chicken array. To detect genomic signatures of recent selection, we applied two methods based on population comparison, cross-population extended haplotype homozygosity (XP-EHH) and cross-population composite likelihood ratio (XP-CLR), and further analyzed the results to find genomic regions under recent selection in multiple purebred lines. A total of 321 candidate selection regions spanning approximately 1.45 % of the chicken genome in each line were detected by consensus of results of both XP-EHH and XP-CLR methods. To minimize false discovery due to genetic drift, only 42 of the candidate selection regions that were shared by 2 or more purebred lines were considered as high-confidence selection regions in the study. Of these 42 regions, 20 were 50 kb or less while 4 regions were larger than 0.5 Mb. In total, 91 genes could be found in the 42 regions, among which 19 regions contained only 1 or 2 genes, and 9 regions were located at gene deserts. Our results provide a genome-wide scan of recent selection signatures in five purebred lines of commercial broiler chickens. We found several candidate genes for recent selection in multiple lines, such as SOX6 (Sex Determining Region Y-Box 6) and cTR (Thyroid hormone receptor beta). These genes may have been under recent selection due to their essential roles in growth, development and reproduction in chickens. Furthermore, our results suggest that in some candidate regions, the same or opposite alleles have been under recent selection in multiple lines. Most of the candidate genes in the selection regions are novel, and as such they should be of great interest for future research into the genetic architecture of traits relevant to modern broiler breeding.
Early Detection of NSCLC Using Stromal Markers in Peripheral Blood

DTIC Science & Technology

2017-11-01

transcriptionally altered and the alteration is tumor dependent . The specific transcriptomic signature of circulating myeloid cells may provide us unique...signature, which may be useful for early lung cancer diagnosis. The specific aims are: Aim 1. To identify a NSCLC- dependent transcriptomic signature in...circulating myeloid cells are transcriptionally altered and the alteration is tumor dependent . The specific transcriptomic signature of circulating

Comprehensive Sieve Analysis of Breakthrough HIV-1 Sequences in the RV144 Vaccine Efficacy Trial

PubMed Central

Edlefsen, Paul T.; Rolland, Morgane; Hertz, Tomer; Tovanabutra, Sodsai; Gartland, Andrew J.; deCamp, Allan C.; Magaret, Craig A.; Ahmed, Hasan; Gottardo, Raphael; Juraska, Michal; McCoy, Connor; Larsen, Brendan B.; Sanders-Buell, Eric; Carrico, Chris; Menis, Sergey; Bose, Meera; Arroyo, Miguel A.; O’Connell, Robert J.; Nitayaphan, Sorachai; Pitisuttithum, Punnee; Kaewkungwal, Jaranit; Rerks-Ngarm, Supachai; Robb, Merlin L.; Kirys, Tatsiana; Georgiev, Ivelin S.; Kwong, Peter D.; Scheffler, Konrad; Pond, Sergei L. Kosakovsky; Carlson, Jonathan M.; Michael, Nelson L.; Schief, William R.; Mullins, James I.; Kim, Jerome H.; Gilbert, Peter B.

2015-01-01

The RV144 clinical trial showed the partial efficacy of a vaccine regimen with an estimated vaccine efficacy (VE) of 31% for protecting low-risk Thai volunteers against acquisition of HIV-1. The impact of vaccine-induced immune responses can be investigated through sieve analysis of HIV-1 breakthrough infections (infected vaccine and placebo recipients). A V1/V2-targeted comparison of the genomes of HIV-1 breakthrough viruses identified two V2 amino acid sites that differed between the vaccine and placebo groups. Here we extended the V1/V2 analysis to the entire HIV-1 genome using an array of methods based on individual sites, k-mers and genes/proteins. We identified 56 amino acid sites or “signatures” and 119 k-mers that differed between the vaccine and placebo groups. Of those, 19 sites and 38 k-mers were located in the regions comprising the RV144 vaccine (Env-gp120, Gag, and Pro). The nine signature sites in Env-gp120 were significantly enriched for known antibody-associated sites (p = 0.0021). In particular, site 317 in the third variable loop (V3) overlapped with a hotspot of antibody recognition, and sites 369 and 424 were linked to CD4 binding site neutralization. The identified signature sites significantly covaried with other sites across the genome (mean = 32.1) more than did non-signature sites (mean = 0.9) (p < 0.0001), suggesting functional and/or structural relevance of the signature sites. Since signature sites were not preferentially restricted to the vaccine immunogens and because most of the associations were insignificant following correction for multiple testing, we predict that few of the genetic differences are strongly linked to the RV144 vaccine-induced immune pressure. In addition to presenting results of the first complete-genome analysis of the breakthrough infections in the RV144 trial, this work describes a set of statistical methods and tools applicable to analysis of breakthrough infection genomes in general vaccine efficacy trials for diverse pathogens. PMID:25646817
Blood-Based Gene Expression Profiles Models for Classification of Subsyndromal Symptomatic Depression and Major Depressive Disorder

PubMed Central

Yu, Shunying; Yuan, Chengmei; Hong, Wu; Wang, Zuowei; Cui, Jian; Shi, Tieliu; Fang, Yiru

2012-01-01

Subsyndromal symptomatic depression (SSD) is a subtype of subthreshold depressive and also lead to significant psychosocial functional impairment as same as major depressive disorder (MDD). Several studies have suggested that SSD is a transitory phenomena in the depression spectrum and is thus considered a subtype of depression. However, the pathophysioloy of depression remain largely obscure and studies on SSD are limited. The present study compared the expression profile and made the classification with the leukocytes by using whole-genome cRNA microarrays among drug-free first-episode subjects with SSD, MDD, and matched controls (8 subjects in each group). Support vector machines (SVMs) were utilized for training and testing on candidate signature expression profiles from signature selection step. Firstly, we identified 63 differentially expressed SSD signatures in contrast to control (P< = 5.0E-4) and 30 differentially expressed MDD signatures in contrast to control, respectively. Then, 123 gene signatures were identified with significantly differential expression level between SSD and MDD. Secondly, in order to conduct priority selection for biomarkers for SSD and MDD together, we selected top gene signatures from each group of pair-wise comparison results, and merged the signatures together to generate better profiles used for clearly classify SSD and MDD sets in the same time. In details, we tried different combination of signatures from the three pair-wise compartmental results and finally determined 48 gene expression signatures with 100% accuracy. Our finding suggested that SSD and MDD did not exhibit the same expressed genome signature with peripheral blood leukocyte, and blood cell–derived RNA of these 48 gene models may have significant value for performing diagnostic functions and classifying SSD, MDD, and healthy controls. PMID:22348066
Unique transcriptome signatures and GM-CSF expression in lymphocytes from patients with spondyloarthritis.

PubMed

Al-Mossawi, M H; Chen, L; Fang, H; Ridley, A; de Wit, J; Yager, N; Hammitzsch, A; Pulyakhina, I; Fairfax, B P; Simone, D; Yi, Yao; Bandyopadhyay, S; Doig, K; Gundle, R; Kendrick, B; Powrie, F; Knight, J C; Bowness, P

2017-11-15

Spondyloarthritis encompasses a group of common inflammatory diseases thought to be driven by IL-17A-secreting type-17 lymphocytes. Here we show increased numbers of GM-CSF-producing CD4 and CD8 lymphocytes in the blood and joints of patients with spondyloarthritis, and increased numbers of IL-17A + GM-CSF + double-producing CD4, CD8, γδ and NK cells. GM-CSF production in CD4 T cells occurs both independently and in combination with classical Th1 and Th17 cytokines. Type 3 innate lymphoid cells producing predominantly GM-CSF are expanded in synovial tissues from patients with spondyloarthritis. GM-CSF + CD4 + cells, isolated using a triple cytokine capture approach, have a specific transcriptional signature. Both GM-CSF + and IL-17A + GM-CSF + double-producing CD4 T cells express increased levels of GPR65, a proton-sensing receptor associated with spondyloarthritis in genome-wide association studies and pathogenicity in murine inflammatory disease models. Silencing GPR65 in primary CD4 T cells reduces GM-CSF production. GM-CSF and GPR65 may thus serve as targets for therapeutic intervention of spondyloarthritis.
Identifying signatures of natural selection in Tibetan and Andean populations using dense genome scan data.

PubMed

Bigham, Abigail; Bauchet, Marc; Pinto, Dalila; Mao, Xianyun; Akey, Joshua M; Mei, Rui; Scherer, Stephen W; Julian, Colleen G; Wilson, Megan J; López Herráez, David; Brutsaert, Tom; Parra, Esteban J; Moore, Lorna G; Shriver, Mark D

2010-09-09

High-altitude hypoxia (reduced inspired oxygen tension due to decreased barometric pressure) exerts severe physiological stress on the human body. Two high-altitude regions where humans have lived for millennia are the Andean Altiplano and the Tibetan Plateau. Populations living in these regions exhibit unique circulatory, respiratory, and hematological adaptations to life at high altitude. Although these responses have been well characterized physiologically, their underlying genetic basis remains unknown. We performed a genome scan to identify genes showing evidence of adaptation to hypoxia. We looked across each chromosome to identify genomic regions with previously unknown function with respect to altitude phenotypes. In addition, groups of genes functioning in oxygen metabolism and sensing were examined to test the hypothesis that particular pathways have been involved in genetic adaptation to altitude. Applying four population genetic statistics commonly used for detecting signatures of natural selection, we identified selection-nominated candidate genes and gene regions in these two populations (Andeans and Tibetans) separately. The Tibetan and Andean patterns of genetic adaptation are largely distinct from one another, with both populations showing evidence of positive natural selection in different genes or gene regions. Interestingly, one gene previously known to be important in cellular oxygen sensing, EGLN1 (also known as PHD2), shows evidence of positive selection in both Tibetans and Andeans. However, the pattern of variation for this gene differs between the two populations. Our results indicate that several key HIF-regulatory and targeted genes are responsible for adaptation to high altitude in Andeans and Tibetans, and several different chromosomal regions are implicated in the putative response to selection. These data suggest a genetic role in high-altitude adaption and provide a basis for future genotype/phenotype association studies necessary to confirm the role of selection-nominated candidate genes and gene regions in adaptation to altitude.
Identifying Signatures of Natural Selection in Tibetan and Andean Populations Using Dense Genome Scan Data

PubMed Central

Bigham, Abigail; Bauchet, Marc; Pinto, Dalila; Mao, Xianyun; Akey, Joshua M.; Mei, Rui; Scherer, Stephen W.; Julian, Colleen G.; Wilson, Megan J.; López Herráez, David; Brutsaert, Tom; Parra, Esteban J.; Moore, Lorna G.; Shriver, Mark D.

2010-01-01

High-altitude hypoxia (reduced inspired oxygen tension due to decreased barometric pressure) exerts severe physiological stress on the human body. Two high-altitude regions where humans have lived for millennia are the Andean Altiplano and the Tibetan Plateau. Populations living in these regions exhibit unique circulatory, respiratory, and hematological adaptations to life at high altitude. Although these responses have been well characterized physiologically, their underlying genetic basis remains unknown. We performed a genome scan to identify genes showing evidence of adaptation to hypoxia. We looked across each chromosome to identify genomic regions with previously unknown function with respect to altitude phenotypes. In addition, groups of genes functioning in oxygen metabolism and sensing were examined to test the hypothesis that particular pathways have been involved in genetic adaptation to altitude. Applying four population genetic statistics commonly used for detecting signatures of natural selection, we identified selection-nominated candidate genes and gene regions in these two populations (Andeans and Tibetans) separately. The Tibetan and Andean patterns of genetic adaptation are largely distinct from one another, with both populations showing evidence of positive natural selection in different genes or gene regions. Interestingly, one gene previously known to be important in cellular oxygen sensing, EGLN1 (also known as PHD2), shows evidence of positive selection in both Tibetans and Andeans. However, the pattern of variation for this gene differs between the two populations. Our results indicate that several key HIF-regulatory and targeted genes are responsible for adaptation to high altitude in Andeans and Tibetans, and several different chromosomal regions are implicated in the putative response to selection. These data suggest a genetic role in high-altitude adaption and provide a basis for future genotype/phenotype association studies necessary to confirm the role of selection-nominated candidate genes and gene regions in adaptation to altitude. PMID:20838600
NCI Workshop Report: Clinical and Computational Requirements for Correlating Imaging Phenotypes with Genomics Signatures.

PubMed

Colen, Rivka; Foster, Ian; Gatenby, Robert; Giger, Mary Ellen; Gillies, Robert; Gutman, David; Heller, Matthew; Jain, Rajan; Madabhushi, Anant; Madhavan, Subha; Napel, Sandy; Rao, Arvind; Saltz, Joel; Tatum, James; Verhaak, Roeland; Whitman, Gary

2014-10-01

The National Cancer Institute (NCI) Cancer Imaging Program organized two related workshops on June 26-27, 2013, entitled "Correlating Imaging Phenotypes with Genomics Signatures Research" and "Scalable Computational Resources as Required for Imaging-Genomics Decision Support Systems." The first workshop focused on clinical and scientific requirements, exploring our knowledge of phenotypic characteristics of cancer biological properties to determine whether the field is sufficiently advanced to correlate with imaging phenotypes that underpin genomics and clinical outcomes, and exploring new scientific methods to extract phenotypic features from medical images and relate them to genomics analyses. The second workshop focused on computational methods that explore informatics and computational requirements to extract phenotypic features from medical images and relate them to genomics analyses and improve the accessibility and speed of dissemination of existing NIH resources. These workshops linked clinical and scientific requirements of currently known phenotypic and genotypic cancer biology characteristics with imaging phenotypes that underpin genomics and clinical outcomes. The group generated a set of recommendations to NCI leadership and the research community that encourage and support development of the emerging radiogenomics research field to address short-and longer-term goals in cancer research.
Genome-wide signatures of population bottlenecks and diversifying selection in European wolves

PubMed Central

Pilot, M; Greco, C; vonHoldt, B M; Jędrzejewska, B; Randi, E; Jędrzejewski, W; Sidorovich, V E; Ostrander, E A; Wayne, R K

2014-01-01

Genomic resources developed for domesticated species provide powerful tools for studying the evolutionary history of their wild relatives. Here we use 61K single-nucleotide polymorphisms (SNPs) evenly spaced throughout the canine nuclear genome to analyse evolutionary relationships among the three largest European populations of grey wolves in comparison with other populations worldwide, and investigate genome-wide effects of demographic bottlenecks and signatures of selection. European wolves have a discontinuous range, with large and connected populations in Eastern Europe and relatively smaller, isolated populations in Italy and the Iberian Peninsula. Our results suggest a continuous decline in wolf numbers in Europe since the Late Pleistocene, and long-term isolation and bottlenecks in the Italian and Iberian populations following their divergence from the Eastern European population. The Italian and Iberian populations have low genetic variability and high linkage disequilibrium, but relatively few autozygous segments across the genome. This last characteristic clearly distinguishes them from populations that underwent recent drastic demographic declines or founder events, and implies long-term bottlenecks in these two populations. Although genetic drift due to spatial isolation and bottlenecks seems to be a major evolutionary force diversifying the European populations, we detected 35 loci that are putatively under diversifying selection. Two of these loci flank the canine platelet-derived growth factor gene, which affects bone growth and may influence differences in body size between wolf populations. This study demonstrates the power of population genomics for identifying genetic signals of demographic bottlenecks and detecting signatures of directional selection in bottlenecked populations, despite their low background variability. PMID:24346500
Delineation of metabolic gene clusters in plant genomes by chromatin signatures.

PubMed

Yu, Nan; Nützmann, Hans-Wilhelm; MacDonald, James T; Moore, Ben; Field, Ben; Berriri, Souha; Trick, Martin; Rosser, Susan J; Kumar, S Vinod; Freemont, Paul S; Osbourn, Anne

2016-03-18

Plants are a tremendous source of diverse chemicals, including many natural product-derived drugs. It has recently become apparent that the genes for the biosynthesis of numerous different types of plant natural products are organized as metabolic gene clusters, thereby unveiling a highly unusual form of plant genome architecture and offering novel avenues for discovery and exploitation of plant specialized metabolism. Here we show that these clustered pathways are characterized by distinct chromatin signatures of histone 3 lysine trimethylation (H3K27me3) and histone 2 variant H2A.Z, associated with cluster repression and activation, respectively, and represent discrete windows of co-regulation in the genome. We further demonstrate that knowledge of these chromatin signatures along with chromatin mutants can be used to mine genomes for cluster discovery. The roles of H3K27me3 and H2A.Z in repression and activation of single genes in plants are well known. However, our discovery of highly localized operon-like co-regulated regions of chromatin modification is unprecedented in plants. Our findings raise intriguing parallels with groups of physically linked multi-gene complexes in animals and with clustered pathways for specialized metabolism in filamentous fungi. © The Author(s) 2016. Published by Oxford University Press on behalf of Nucleic Acids Research.
Comprehensive Genomic Characterization of Upper Tract Urothelial Carcinoma.

PubMed

Moss, Tyler J; Qi, Yuan; Xi, Liu; Peng, Bo; Kim, Tae-Beom; Ezzedine, Nader E; Mosqueda, Maribel E; Guo, Charles C; Czerniak, Bogdan A; Ittmann, Michael; Wheeler, David A; Lerner, Seth P; Matin, Surena F

2017-10-01

Upper urinary tract urothelial cancer (UTUC) may have unique etiologic and genomic factors compared to bladder cancer. To characterize the genomic landscape of UTUC and provide insights into its biology using comprehensive integrated genomic analyses. We collected 31 untreated snap-frozen UTUC samples from two institutions and carried out whole-exome sequencing (WES) of DNA, RNA sequencing (RNAseq), and protein analysis. Adjusting for batch effects, consensus mutation calls from independent pipelines identified DNA mutations, gene expression clusters using unsupervised consensus hierarchical clustering (UCHC), and protein expression levels that were correlated with relevant clinical variables, The Cancer Genome Atlas, and other published data. WES identified mutations in FGFR3 (74.1%; 92% low-grade, 60% high-grade), KMT2D (44.4%), PIK3CA (25.9%), and TP53 (22.2%). APOBEC and CpG were the most common mutational signatures. UCHC of RNAseq data segregated samples into four molecular subtypes with the following characteristics. Cluster 1: no PIK3CA mutations, nonsmokers, high-grade
Clustered Mutation Signatures Reveal that Error-Prone DNA Repair Targets Mutations to Active Genes.

PubMed

Supek, Fran; Lehner, Ben

2017-07-27

Many processes can cause the same nucleotide change in a genome, making the identification of the mechanisms causing mutations a difficult challenge. Here, we show that clustered mutations provide a more precise fingerprint of mutagenic processes. Of nine clustered mutation signatures identified from >1,000 tumor genomes, three relate to variable APOBEC activity and three are associated with tobacco smoking. An additional signature matches the spectrum of translesion DNA polymerase eta (POLH). In lymphoid cells, these mutations target promoters, consistent with AID-initiated somatic hypermutation. In solid tumors, however, they are associated with UV exposure and alcohol consumption and target the H3K36me3 chromatin of active genes in a mismatch repair (MMR)-dependent manner. These regions normally have a low mutation rate because error-free MMR also targets H3K36me3 chromatin. Carcinogens and error-prone repair therefore redistribute mutations to the more important regions of the genome, contributing a substantial mutation load in many tumors, including driver mutations. Copyright © 2017 Elsevier Inc. All rights reserved.
Ancient, recurrent phage attacks and recombination shaped dynamic sequence-variable mosaics at the root of phytoplasma genome evolution.

PubMed

Wei, Wei; Davis, Robert E; Jomantiene, Rasa; Zhao, Yan

2008-08-19

Mobile genetic elements have impacted biological evolution across all studied organisms, but evidence for a role in evolutionary emergence of an entire phylogenetic clade has not been forthcoming. We suggest that mobile element predation played a formative role in emergence of the phytoplasma clade. Phytoplasmas are cell wall-less bacteria that cause numerous diseases in plants. Phylogenetic analyses indicate that these transkingdom parasites descended from Gram-positive walled bacteria, but events giving rise to the first phytoplasma have remained unknown. Previously we discovered a unique feature of phytoplasmal genome architecture, genes clustered in sequence-variable mosaics (SVMs), and suggested that such structures formed through recurrent, targeted attacks by mobile elements. In the present study, we discovered that cryptic prophage remnants, originating from phages in the order Caudovirales, formed SVMs and comprised exceptionally large percentages of the chromosomes of 'Candidatus Phytoplasma asteris'-related strains OYM and AYWB, occupying nearly all major nonsyntenic sections, and accounting for most of the size difference between the two genomes. The clustered phage remnants formed genomic islands exhibiting distinct DNA physical signatures, such as dinucleotide relative abundance and codon position GC values. Phytoplasma strain-specific genes identified as phage morons were located in hypervariable regions within individual SVMs, indicating that prophage remnants played important roles in generating phytoplasma genetic diversity. Because no SVM-like structures could be identified in genomes of ancestral relatives including Acholeplasma spp., we hypothesize that ancient phage attacks leading to SVM formation occurred after divergence of phytoplasmas from acholeplasmas, triggering evolution of the phytoplasma clade.
Deep sequencing reveals unique small RNA repertoire that is regulated during head regeneration in Hydra magnipapillata.

PubMed

Krishna, Srikar; Nair, Aparna; Cheedipudi, Sirisha; Poduval, Deepak; Dhawan, Jyotsna; Palakodeti, Dasaradhi; Ghanekar, Yashoda

2013-01-07

Small non-coding RNAs such as miRNAs, piRNAs and endo-siRNAs fine-tune gene expression through post-transcriptional regulation, modulating important processes in development, differentiation, homeostasis and regeneration. Using deep sequencing, we have profiled small non-coding RNAs in Hydra magnipapillata and investigated changes in small RNA expression pattern during head regeneration. Our results reveal a unique repertoire of small RNAs in hydra. We have identified 126 miRNA loci; 123 of these miRNAs are unique to hydra. Less than 50% are conserved across two different strains of Hydra vulgaris tested in this study, indicating a highly diverse nature of hydra miRNAs in contrast to bilaterian miRNAs. We also identified siRNAs derived from precursors with perfect stem-loop structure and that arise from inverted repeats. piRNAs were the most abundant small RNAs in hydra, mapping to transposable elements, the annotated transcriptome and unique non-coding regions on the genome. piRNAs that map to transposable elements and the annotated transcriptome display a ping-pong signature. Further, we have identified several miRNAs and piRNAs whose expression is regulated during hydra head regeneration. Our study defines different classes of small RNAs in this cnidarian model system, which may play a role in orchestrating gene expression essential for hydra regeneration.
Deep sequencing reveals unique small RNA repertoire that is regulated during head regeneration in Hydra magnipapillata

PubMed Central

Krishna, Srikar; Nair, Aparna; Cheedipudi, Sirisha; Poduval, Deepak; Dhawan, Jyotsna; Palakodeti, Dasaradhi; Ghanekar, Yashoda

2013-01-01

Small non-coding RNAs such as miRNAs, piRNAs and endo-siRNAs fine-tune gene expression through post-transcriptional regulation, modulating important processes in development, differentiation, homeostasis and regeneration. Using deep sequencing, we have profiled small non-coding RNAs in Hydra magnipapillata and investigated changes in small RNA expression pattern during head regeneration. Our results reveal a unique repertoire of small RNAs in hydra. We have identified 126 miRNA loci; 123 of these miRNAs are unique to hydra. Less than 50% are conserved across two different strains of Hydra vulgaris tested in this study, indicating a highly diverse nature of hydra miRNAs in contrast to bilaterian miRNAs. We also identified siRNAs derived from precursors with perfect stem–loop structure and that arise from inverted repeats. piRNAs were the most abundant small RNAs in hydra, mapping to transposable elements, the annotated transcriptome and unique non-coding regions on the genome. piRNAs that map to transposable elements and the annotated transcriptome display a ping–pong signature. Further, we have identified several miRNAs and piRNAs whose expression is regulated during hydra head regeneration. Our study defines different classes of small RNAs in this cnidarian model system, which may play a role in orchestrating gene expression essential for hydra regeneration. PMID:23166307
Population genomic scan for candidate signatures of balancing selection to guide antigen characterization in malaria parasites.

PubMed

Amambua-Ngwa, Alfred; Tetteh, Kevin K A; Manske, Magnus; Gomez-Escobar, Natalia; Stewart, Lindsay B; Deerhake, M Elizabeth; Cheeseman, Ian H; Newbold, Christopher I; Holder, Anthony A; Knuepfer, Ellen; Janha, Omar; Jallow, Muminatou; Campino, Susana; Macinnis, Bronwyn; Kwiatkowski, Dominic P; Conway, David J

2012-01-01

Acquired immunity in vertebrates maintains polymorphisms in endemic pathogens, leading to identifiable signatures of balancing selection. To comprehensively survey for genes under such selection in the human malaria parasite Plasmodium falciparum, we generated paired-end short-read sequences of parasites in clinical isolates from an endemic Gambian population, which were mapped to the 3D7 strain reference genome to yield high-quality genome-wide coding sequence data for 65 isolates. A minority of genes did not map reliably, including the hypervariable var, rifin, and stevor families, but 5,056 genes (90.9% of all in the genome) had >70% sequence coverage with minimum read depth of 5 for at least 50 isolates, of which 2,853 genes contained 3 or more single nucleotide polymorphisms (SNPs) for analysis of polymorphic site frequency spectra. Against an overall background of negatively skewed frequencies, as expected from historical population expansion combined with purifying selection, the outlying minority of genes with signatures indicating exceptionally intermediate frequencies were identified. Comparing genes with different stage-specificity, such signatures were most common in those with peak expression at the merozoite stage that invades erythrocytes. Members of clag, PfMC-2TM, surfin, and msp3-like gene families were highly represented, the strongest signature being in the msp3-like gene PF10_0355. Analysis of msp3-like transcripts in 45 clinical and 11 laboratory adapted isolates grown to merozoite-containing schizont stages revealed surprisingly low expression of PF10_0355. In diverse clonal parasite lines the protein product was expressed in a minority of mature schizonts (<1% in most lines and ∼10% in clone HB3), and eight sub-clones of HB3 cultured separately had an intermediate spectrum of positive frequencies (0.9 to 7.5%), indicating phase variable expression of this polymorphic antigen. This and other identified targets of balancing selection are now prioritized for functional study.
27 CFR 73.3 - What terms must I know to understand this part?

Code of Federal Regulations, 2010 CFR

2010-04-01

... and/or actions are both unique to that individual and measurable. Digital signature. An electronic... verified. A signer creates a digital signature by using public-key encryption to transform a message digest of an electronic message. If a recipient of the digital signature has an electronic message, message...
27 CFR 73.3 - What terms must I know to understand this part?

Code of Federal Regulations, 2011 CFR

2011-04-01

... and/or actions are both unique to that individual and measurable. Digital signature. An electronic... verified. A signer creates a digital signature by using public-key encryption to transform a message digest of an electronic message. If a recipient of the digital signature has an electronic message, message...
Decoding the Regulatory Landscape of Ageing in Musculoskeletal Engineered Tissues Using Genome-Wide DNA Methylation and RNASeq

PubMed Central

Peffers, Mandy Jayne; Goljanek-Whysall, Katarzyna; Collins, John; Fang, Yongxiang; Rushton, Michael; Loughlin, John; Proctor, Carole; Clegg, Peter David

2016-01-01

Mesenchymal stem cells (MSC) are capable of multipotent differentiation into connective tissues and as such are an attractive source for autologous cell-based regenerative medicine and tissue engineering. Epigenetic mechanisms, like DNA methylation, contribute to the changes in gene expression in ageing. However there was a lack of sufficient knowledge of the role that differential methylation plays during chondrogenic, osteogenic and tenogenic differentiation from ageing MSCs. This study undertook genome level determination of the effects of DNA methylation on expression in engineered tissues from chronologically aged MSCs. We compiled unique DNA methylation signatures from chondrogenic, osteogenic, and tenogenic engineered tissues derived from young; n = 4 (21.8 years ± 2.4 SD) and old; n = 4 (65.5 years±8.3SD) human MSCs donors using the Illumina HumanMethylation 450 Beadchip arrays and compared these to gene expression by RNA sequencing. Unique and common signatures of global DNA methylation were identified. There were 201, 67 and 32 chondrogenic, osteogenic and tenogenic age-related DE protein-coding genes respectively. Findings inferred the nature of the transcript networks was predominantly for ‘cell death and survival’, ‘cell morphology’, and ‘cell growth and proliferation’. Further studies are required to validate if this gene expression effect translates to cell events. Alternative splicing (AS) was dysregulated in ageing with 119, 21 and 9 differential splicing events identified in chondrogenic, osteogenic and tenogenic respectively, and enrichment in genes associated principally with metabolic processes. Gene ontology analysis of differentially methylated loci indicated age-related enrichment for all engineered tissue types in ‘skeletal system morphogenesis’, ‘regulation of cell proliferation’ and ‘regulation of transcription’ suggesting that dynamic epigenetic modifications may occur in genes associated with shared and distinct pathways dependent upon engineered tissue type. An altered phenotype in engineered tissues was observed with ageing at numerous levels. These changes represent novel insights into the ageing process, with implications for stem cell therapies in older patients. In addition we have identified a number of tissue-dependant pathways, which warrant further studies. PMID:27533049
Signatures of polygenic adaptation associated with climate across the range of a threatened fish species with high genetic connectivity.

PubMed

Harrisson, Katherine A; Amish, Stephen J; Pavlova, Alexandra; Narum, Shawn R; Telonis-Scott, Marina; Rourke, Meaghan L; Lyon, Jarod; Tonkin, Zeb; Gilligan, Dean M; Ingram, Brett A; Lintermans, Mark; Gan, Han Ming; Austin, Christopher M; Luikart, Gordon; Sunnucks, Paul

2017-11-01

Adaptive differences across species' ranges can have important implications for population persistence and conservation management decisions. Despite advances in genomic technologies, detecting adaptive variation in natural populations remains challenging. Key challenges in gene-environment association studies involve distinguishing the effects of drift from those of selection and identifying subtle signatures of polygenic adaptation. We used paired-end restriction site-associated DNA sequencing data (6,605 biallelic single nucleotide polymorphisms; SNPs) to examine population structure and test for signatures of adaptation across the geographic range of an iconic Australian endemic freshwater fish species, the Murray cod Maccullochella peelii. Two univariate gene-association methods identified 61 genomic regions associated with climate variation. We also tested for subtle signatures of polygenic adaptation using a multivariate method (redundancy analysis; RDA). The RDA analysis suggested that climate (temperature- and precipitation-related variables) and geography had similar magnitudes of effect in shaping the distribution of SNP genotypes across the sampled range of Murray cod. Although there was poor agreement among the candidate SNPs identified by the univariate methods, the top 5% of SNPs contributing to significant RDA axes included 67% of the SNPs identified by univariate methods. We discuss the potential implications of our findings for the management of Murray cod and other species generally, particularly in relation to informing conservation actions such as translocations to improve evolutionary resilience of natural populations. Our results highlight the value of using a combination of different approaches, including polygenic methods, when testing for signatures of adaptation in landscape genomic studies. © 2017 John Wiley & Sons Ltd.
Ontology based molecular signatures for immune cell types via gene expression analysis

PubMed Central

2013-01-01

Background New technologies are focusing on characterizing cell types to better understand their heterogeneity. With large volumes of cellular data being generated, innovative methods are needed to structure the resulting data analyses. Here, we describe an ‘Ontologically BAsed Molecular Signature’ (OBAMS) method that identifies novel cellular biomarkers and infers biological functions as characteristics of particular cell types. This method finds molecular signatures for immune cell types based on mapping biological samples to the Cell Ontology (CL) and navigating the space of all possible pairwise comparisons between cell types to find genes whose expression is core to a particular cell type’s identity. Results We illustrate this ontological approach by evaluating expression data available from the Immunological Genome project (IGP) to identify unique biomarkers of mature B cell subtypes. We find that using OBAMS, candidate biomarkers can be identified at every strata of cellular identity from broad classifications to very granular. Furthermore, we show that Gene Ontology can be used to cluster cell types by shared biological processes in order to find candidate genes responsible for somatic hypermutation in germinal center B cells. Moreover, through in silico experiments based on this approach, we have identified genes sets that represent genes overexpressed in germinal center B cells and identify genes uniquely expressed in these B cells compared to other B cell types. Conclusions This work demonstrates the utility of incorporating structured ontological knowledge into biological data analysis – providing a new method for defining novel biomarkers and providing an opportunity for new biological insights. PMID:24004649
Selection signature analysis in Holstein cattle identified genes known to affect reproduction

USDA-ARS?s Scientific Manuscript database

Using direct comparison of 45,878 SNPs between a group of Holstein cattle unselected since 1964 and contemporary Holsteins that on average take 30 days longer for successful conception than the 1964 Holsteins, we conducted selection signature analyses to identify genomic regions associated with dair...

Protein phylogenies and signature sequences: A reappraisal of evolutionary relationships among archaebacteria, eubacteria, and eukaryotes.

PubMed

Gupta, R S

1998-12-01

The presence of shared conserved insertion or deletions (indels) in protein sequences is a special type of signature sequence that shows considerable promise for phylogenetic inference. An alternative model of microbial evolution based on the use of indels of conserved proteins and the morphological features of prokaryotic organisms is proposed. In this model, extant archaebacteria and gram-positive bacteria, which have a simple, single-layered cell wall structure, are termed monoderm prokaryotes. They are believed to be descended from the most primitive organisms. Evidence from indels supports the view that the archaebacteria probably evolved from gram-positive bacteria, and I suggest that this evolution occurred in response to antibiotic selection pressures. Evidence is presented that diderm prokaryotes (i.e., gram-negative bacteria), which have a bilayered cell wall, are derived from monoderm prokaryotes. Signature sequences in different proteins provide a means to define a number of different taxa within prokaryotes (namely, low G+C and high G+C gram-positive, Deinococcus-Thermus, cyanobacteria, chlamydia-cytophaga related, and two different groups of Proteobacteria) and to indicate how they evolved from a common ancestor. Based on phylogenetic information from indels in different protein sequences, it is hypothesized that all eukaryotes, including amitochondriate and aplastidic organisms, received major gene contributions from both an archaebacterium and a gram-negative eubacterium. In this model, the ancestral eukaryotic cell is a chimera that resulted from a unique fusion event between the two separate groups of prokaryotes followed by integration of their genomes.
Protein Phylogenies and Signature Sequences: A Reappraisal of Evolutionary Relationships among Archaebacteria, Eubacteria, and Eukaryotes

PubMed Central

Gupta, Radhey S.

1998-01-01

The presence of shared conserved insertion or deletions (indels) in protein sequences is a special type of signature sequence that shows considerable promise for phylogenetic inference. An alternative model of microbial evolution based on the use of indels of conserved proteins and the morphological features of prokaryotic organisms is proposed. In this model, extant archaebacteria and gram-positive bacteria, which have a simple, single-layered cell wall structure, are termed monoderm prokaryotes. They are believed to be descended from the most primitive organisms. Evidence from indels supports the view that the archaebacteria probably evolved from gram-positive bacteria, and I suggest that this evolution occurred in response to antibiotic selection pressures. Evidence is presented that diderm prokaryotes (i.e., gram-negative bacteria), which have a bilayered cell wall, are derived from monoderm prokaryotes. Signature sequences in different proteins provide a means to define a number of different taxa within prokaryotes (namely, low G+C and high G+C gram-positive, Deinococcus-Thermus, cyanobacteria, chlamydia-cytophaga related, and two different groups of Proteobacteria) and to indicate how they evolved from a common ancestor. Based on phylogenetic information from indels in different protein sequences, it is hypothesized that all eukaryotes, including amitochondriate and aplastidic organisms, received major gene contributions from both an archaebacterium and a gram-negative eubacterium. In this model, the ancestral eukaryotic cell is a chimera that resulted from a unique fusion event between the two separate groups of prokaryotes followed by integration of their genomes. PMID:9841678
Conventional and Non-Conventional Nuclear Material Signatures

NASA Astrophysics Data System (ADS)

Gozani, Tsahi

2009-03-01

The detection and interdiction of concealed special nuclear material (SNM) in all modes of transport is one of the most critical security issues facing the United States and the rest of the world. In principle, detection of nuclear materials is relatively easy because of their unique properties: all of them are radioactive and all emit some characteristic gamma rays. A few emit neutrons as well. These signatures are the basis for passive non-intrusive detection of nuclear materials. The low energy of the radiations necessitates additional means of detection and validation. These are provided by high-energy x-ray radiography and by active inspection based on inducing nuclear reactions in the nuclear materials. Positive confirmation that a nuclear material is present or absent can be provided by interrogation of the inspected object with penetrating probing radiation, such as neutrons and photons. The radiation induces specific reactions in the nuclear material yielding, in turn, penetrating signatures which can be detected outside the inspected object. The "conventional" signatures are first and foremost fission signatures: prompt and delayed neutrons and gamma rays. Their intensity (number per fission) and the fact that they have broad energy (non-discrete, though unique) distributions and certain temporal behaviors are key to their use. The "non- conventional" signatures are not related to the fission process but to the unique nuclear structure of each element or isotope in nature. This can be accessed through the excitation of isotopic nuclear levels (discrete and continuum) by neutron inelastic scattering or gamma resonance fluorescence. Finally there is an atomic signature, namely the high atomic number (Z>74), which obviously includes all the nuclear materials and their possible shielding. The presence of such high-Z elements can be inferred by techniques using high-energy x rays. The conventional signatures have been addressed in another article. Non-conventional signatures and some of their current or potential uses will be discussed here.
Testing an aflatoxin B1 gene signature in rat archival tissues.

PubMed

Merrick, B Alex; Auerbach, Scott S; Stockton, Patricia S; Foley, Julie F; Malarkey, David E; Sills, Robert C; Irwin, Richard D; Tice, Raymond R

2012-05-21

Archival tissues from laboratory studies represent a unique opportunity to explore the relationship between genomic changes and agent-induced disease. In this study, we evaluated the applicability of qPCR for detecting genomic changes in formalin-fixed, paraffin-embedded (FFPE) tissues by determining if a subset of 14 genes from a 90-gene signature derived from microarray data and associated with eventual tumor development could be detected in archival liver, kidney, and lung of rats exposed to aflatoxin B1 (AFB1) for 90 days in feed at 1 ppm. These tissues originated from the same rats used in the microarray study. The 14 genes evaluated were Adam8, Cdh13, Ddit4l, Mybl2, Akr7a3, Akr7a2, Fhit, Wwox, Abcb1b, Abcc3, Cxcl1, Gsta5, Grin2c, and the C8orf46 homologue. The qPCR FFPE liver results were compared to the original liver microarray data and to qPCR results using RNA from fresh frozen liver. Archival liver paraffin blocks yielded 30 to 50 μg of degraded RNA that ranged in size from 0.1 to 4 kB. qPCR results from FFPE and fresh frozen liver samples were positively correlated (p ≤ 0.05) by regression analysis and showed good agreement in direction and proportion of change with microarray data for 11 of 14 genes. All 14 transcripts could be amplified from FFPE kidney RNA except the glutamate receptor gene Grin2c; however, only Abcb1b was significantly upregulated from control. Abundant constitutive transcripts, S18 and β-actin, could be amplified from lung FFPE samples, but the narrow RNA size range (25-500 bp length) prevented consistent detection of target transcripts. Overall, a discrete gene signature derived from prior transcript profiling and representing cell cycle progression, DNA damage response, and xenosensor and detoxication pathways was successfully applied to archival liver and kidney by qPCR and indicated that gene expression changes in response to subchronic AFB1 exposure occurred predominantly in the liver, the primary target for AFB1-induced tumors. We conclude that an evaluation of gene signatures in archival tissues can be an important toxicological tool for evaluating critical molecular events associated with chemical exposures.
Phenotypic and genomic survey on organic acid utilization profile of Pseudomonas mendocina strain S5.2, a vineyard soil isolate.

PubMed

Chong, Teik Min; Chen, Jian-Woon; See-Too, Wah-Seng; Yu, Choo-Yee; Ang, Geik-Yong; Lim, Yan Lue; Yin, Wai-Fong; Grandclément, Catherine; Faure, Denis; Dessaux, Yves; Chan, Kok-Gan

2017-12-01

Root exudates are chemical compounds that are released from living plant roots and provide significant energy, carbon, nitrogen and phosphorus sources for microbes inhabiting the rhizosphere. The exudates shape the microflora associated with the plant, as well as influences the plant health and productivity. Therefore, a better understanding of the trophic link that is established between the plant and the associated bacteria is necessary. In this study, a comprehensive survey on the utilization of grapevine and rootstock related organic acids were conducted on a vineyard soil isolate which is Pseudomonas mendocina strain S5.2. Phenotype microarray analysis has demonstrated that this strain can utilize several organic acids including lactic acid, succinic acid, malic acid, citric acid and fumaric acid as sole growth substrates. Complete genome analysis using single molecule real-time technology revealed that the genome consists of a 5,120,146 bp circular chromosome and a 252,328 bp megaplasmid. A series of genetic determinants associated with the carbon utilization signature of the strain were subsequently identified in the chromosome. Of note, the coexistence of genes encoding several iron-sulfur cluster independent isoenzymes in the genome indicated the importance of these enzymes in the events of iron deficiency. Synteny and comparative analysis have also unraveled the unique features of D-lactate dehydrogenase of strain S5.2 in the study. Collective information of this work has provided insights on the metabolic role of this strain in vineyard soil rhizosphere.
An eleven gene molecular signature for extra-capsular spread in oral squamous cell carcinoma serves as a prognosticator of outcome in patients without nodal metastases.

PubMed

Wang, Weining; Lim, Weng Khong; Leong, Hui Sun; Chong, Fui Teen; Lim, Tony K H; Tan, Daniel S W; Teh, Bin Tean; Iyer, N Gopalakrishna

2015-04-01

Extracapsular spread (ECS) is an important prognostic factor for oral squamous cell carcinoma (OSCC) and is used to guide management. In this study, we aimed to identify an expression profile signature for ECS in node-positive OSCC using data derived from two different sources: a cohort of OSCC patients from our institution (National Cancer Centre Singapore) and The Cancer Genome Atlas (TCGA) head and neck squamous cell carcinoma (HNSCC) cohort. We also sought to determine if this signature could serve as a prognostic factor in node negative cancers. Patients with a histological diagnosis of OSCC were identified from an institutional database and fresh tumor samples were retrieved. RNA was extracted and gene expression profiling was performed using the Affymetrix GeneChip Human Genome U133 Plus 2.0 microarray platform. RNA sequence data and corresponding clinical data for the TCGA HNSCC cohort were downloaded from the TCGA Data Portal. All data analyses were conducted using R package and SPSS. We identified an 11 gene signature (GGH, MTFR1, CDKN3, PSRC1, SMIM3, CA9, IRX4, CPA3, ZSCAN16, CBX7 and ZFP3) which was robust in segregating tumors by ECS status. In node negative patients, patients harboring this ECS signature had a significantly worse overall survival (p=0.04). An eleven gene signature for ECS was derived. Our results also suggest that this signature is prognostic in a separate subset of patients with no nodal metastasis Further validation of this signature on other datasets and immunohistochemical studies are required to establish utility of this signature in stratifying early stage OSCC patients. Copyright © 2014 Elsevier Ltd. All rights reserved.
Immunoprofiles of human Sertoli cells infected with Zika virus reveals unique insights into host-pathogen crosstalk.

PubMed

Strange, Daniel P; Green, Richard; Siemann, David N; Gale, Michael; Verma, Saguna

2018-06-07

Confirmed reports of Zika virus (ZIKV) in seminal fluid months after clearance of viremia suggests that ZIKV can establish persistent infection in the seminiferous tubules, an immune privileged site of the testis. The seminiferous tubule epithelium is mainly composed of Sertoli cells that function to nourish and protect developing germ cells. We recently demonstrated that primary human Sertoli cells (hSeC) were highly susceptible to ZIKV as compared to dengue virus without causing cell death and thus may act as a reservoir for ZIKV in the testes. However, the cellular and immune responses of hSeC to infection with ZIKV or any other virus are not yet characterized. Using genome-wide RNA-seq to compare immunoprofiles of hSeC, we show that the most prominent response to ZIKV at early stage of infection was suppression of cell growth and proliferation functional pathways. Peak virus replication was associated with induction of multiple antiviral defense pathways. Unique ZIKV-associated signatures included dysregulation of germ cell-Sertoli cell junction signaling. This study demonstrates that hSeC are capable of signaling through canonical pro-inflammatory pathways and provides insights into unique cell-type-specific response induced by ZIKV in association with viral persistence in the testes.
Genome-wide scan for selection signatures in six cattle breeds in South Africa.

PubMed

Makina, Sithembile O; Muchadeyi, Farai C; van Marle-Köster, Este; Taylor, Jerry F; Makgahlela, Mahlako L; Maiwashe, Azwihangwisi

2015-11-26

The detection of selection signatures in breeds of livestock species can contribute to the identification of regions of the genome that are, or have been, functionally important and, as a consequence, have been targeted by selection. This study used two approaches to detect signatures of selection within and between six cattle breeds in South Africa, including Afrikaner (n = 44), Nguni (n = 54), Drakensberger (n = 47), Bonsmara (n = 44), Angus (n = 31) and Holstein (n = 29). The first approach was based on the detection of genomic regions in which haplotypes have been driven towards complete fixation within breeds. The second approach identified regions of the genome that had very different allele frequencies between populations (F ST). Forty-seven candidate genomic regions were identified as harbouring putative signatures of selection using both methods. Twelve of these candidate selected regions were shared among the breeds and ten were validated by previous studies. Thirty-three of these regions were successfully annotated and candidate genes were identified. Among these genes the keratin genes (KRT222, KRT24, KRT25, KRT26, and KRT27) and one heat shock protein gene (HSPB9) on chromosome 19 between 42,896,570 and 42,897,840 bp were detected for the Nguni breed. These genes were previously associated with adaptation to tropical environments in Zebu cattle. In addition, a number of candidate genes associated with the nervous system (WNT5B, FMOD, PRELP, and ATP2B), immune response (CYM, CDC6, and CDK10), production (MTPN, IGFBP4, TGFB1, and AJAP1) and reproductive performance (ADIPOR2, OVOS2, and RBBP8) were also detected as being under selection. The results presented here provide a foundation for detecting mutations that underlie genetic variation of traits that have economic importance for cattle breeds in South Africa.
Genomic continuity of Argentinean Mennonites

PubMed Central

Pardo-Seco, Jacobo; Llull, Cintia; Berardi, Gabriela; Gómez, Andrea; Andreatta, Fernando; Martinón-Torres, Federico; Toscanini, Ulises; Salas, Antonio

2016-01-01

Mennonites are Anabaptist communities that originated in Central Europe about 500 years ago. They initially migrated to different European countries, and in the early 18th century they established their first communities in North America, from where they moved to other American regions. We aimed to analyze an Argentinean Mennonite congregation from a genome-wide perspective by way of investigating >580.000 autosomal SNPs. Several analyses show that Argentinean Mennonites have European ancestry without signatures of admixture with other non-European American populations. Among the worldwide datasets used for population comparison, the CEU, which is the best-subrogated Central European population existing in The 1000 Genome Project, is the dataset showing the closest genome affinity to the Mennonites. When compared to other European population samples, the Mennonites show higher inbreeding coefficient values. Argentinean Mennonites show signatures of genetic continuity with no evidence of admixture with Americans of Native American or sub-Saharan African ancestry. Their genome indicates the existence of an increased endogamy compared to other Europeans most likely mirroring their lifestyle that involve small communities and historical consanguineous marriages. PMID:27824108
Signatures of adaptation to plant parasitism in nematode genomes.

PubMed

Bird, David McK; Jones, John T; Opperman, Charles H; Kikuchi, Taisei; Danchin, Etienne G J

2015-02-01

Plant-parasitic nematodes cause considerable damage to global agriculture. The ability to parasitize plants is a derived character that appears to have independently emerged several times in the phylum Nematoda. Morphological convergence to feeding style has been observed, but whether this is emergent from molecular convergence is less obvious. To address this, we assess whether genomic signatures can be associated with plant parasitism by nematodes. In this review, we report genomic features and characteristics that appear to be common in plant-parasitic nematodes while absent or rare in animal parasites, predators or free-living species. Candidate horizontal acquisitions of parasitism genes have systematically been found in all plant-parasitic species investigated at the sequence level. Presence of peptides that mimic plant hormones also appears to be a trait of plant-parasitic species. Annotations of the few genomes of plant-parasitic nematodes available to date have revealed a set of apparently species-specific genes on every occasion. Effector genes, important for parasitism are frequently found among those species-specific genes, indicating poor overlap. Overall, nematodes appear to have developed convergent genomic solutions to adapt to plant parasitism.
Relative extended haplotype homozygosity signals across breeds reveal dairy and beef specific signatures of selection.

PubMed

Bomba, Lorenzo; Nicolazzi, Ezequiel L; Milanesi, Marco; Negrini, Riccardo; Mancini, Giordano; Biscarini, Filippo; Stella, Alessandra; Valentini, Alessio; Ajmone-Marsan, Paolo

2015-04-02

A number of methods are available to scan a genome for selection signatures by evaluating patterns of diversity within and between breeds. Among these, "extended haplotype homozygosity" (EHH) is a reliable approach to detect genome regions under recent selective pressure. The objective of this study was to use this approach to identify regions that are under recent positive selection and shared by the most representative Italian dairy and beef cattle breeds. A total of 3220 animals from Italian Holstein (2179), Italian Brown (775), Simmental (493), Marchigiana (485) and Piedmontese (379) breeds were genotyped with the Illumina BovineSNP50 BeadChip v.1. After standard quality control procedures, genotypes were phased and core haplotypes were identified. The decay of linkage disequilibrium (LD) for each core haplotype was assessed by measuring the EHH. Since accurate estimates of local recombination rates were not available, relative EHH (rEHH) was calculated for each core haplotype. Genomic regions that carry frequent core haplotypes and with significant rEHH values were considered as candidates for recent positive selection. Candidate regions were aligned across to identify signals shared by dairy or beef cattle breeds. Overall, 82 and 87 common regions were detected among dairy and beef cattle breeds, respectively. Bioinformatic analysis identified 244 and 232 genes in these common genomic regions. Gene annotation and pathway analysis showed that these genes are involved in molecular functions that are biologically related to milk or meat production. Our results suggest that a multi-breed approach can lead to the identification of genomic signatures in breeds of cattle that are selected for the same production goal and thus to the localisation of genomic regions of interest in dairy and beef production.
MutSpec: a Galaxy toolbox for streamlined analyses of somatic mutation spectra in human and mouse cancer genomes.

PubMed

Ardin, Maude; Cahais, Vincent; Castells, Xavier; Bouaoun, Liacine; Byrnes, Graham; Herceg, Zdenko; Zavadil, Jiri; Olivier, Magali

2016-04-18

The nature of somatic mutations observed in human tumors at single gene or genome-wide levels can reveal information on past carcinogenic exposures and mutational processes contributing to tumor development. While large amounts of sequencing data are being generated, the associated analysis and interpretation of mutation patterns that may reveal clues about the natural history of cancer present complex and challenging tasks that require advanced bioinformatics skills. To make such analyses accessible to a wider community of researchers with no programming expertise, we have developed within the web-based user-friendly platform Galaxy a first-of-its-kind package called MutSpec. MutSpec includes a set of tools that perform variant annotation and use advanced statistics for the identification of mutation signatures present in cancer genomes and for comparing the obtained signatures with those published in the COSMIC database and other sources. MutSpec offers an accessible framework for building reproducible analysis pipelines, integrating existing methods and scripts developed in-house with publicly available R packages. MutSpec may be used to analyse data from whole-exome, whole-genome or targeted sequencing experiments performed on human or mouse genomes. Results are provided in various formats including rich graphical outputs. An example is presented to illustrate the package functionalities, the straightforward workflow analysis and the richness of the statistics and publication-grade graphics produced by the tool. MutSpec offers an easy-to-use graphical interface embedded in the popular Galaxy platform that can be used by researchers with limited programming or bioinformatics expertise to analyse mutation signatures present in cancer genomes. MutSpec can thus effectively assist in the discovery of complex mutational processes resulting from exogenous and endogenous carcinogenic insults.
Horizontal gene transfer and gene dosage drives adaptation to wood colonization in a tree pathogen

DOE Office of Scientific and Technical Information (OSTI.GOV)

Dhillon, Braham; Feau, Nicolas; Aerts, Andrea L.

Some of the most damaging tree diseases are caused by pathogens that induce cankers, a stem deformation often lethal. To investigate the cause of this adaptation, we sequenced the genomes of poplar pathogens that do and do not cause cankers. We found a unique cluster of genes that produce secondary metabolites and are co-activated when the canker pathogen is grown on poplar wood and leaves. The gene genealogy is discordant with the species phylogeny, showing a signature of horizontal transfer from fungi associated with wood decay. Furthermore, genes encoding hemicellulose-degrading enzymes are up-regulated on poplar wood chips, with some havingmore » been acquired horizontally. In conclusion, we propose that adaptation to colonize poplar woody stems is the result of acquisition of these genes.« less
Horizontal gene transfer and gene dosage drives adaptation to wood colonization in a tree pathogen

DOE PAGES

Dhillon, Braham; Feau, Nicolas; Aerts, Andrea L.; ...

2015-03-02

Some of the most damaging tree diseases are caused by pathogens that induce cankers, a stem deformation often lethal. To investigate the cause of this adaptation, we sequenced the genomes of poplar pathogens that do and do not cause cankers. We found a unique cluster of genes that produce secondary metabolites and are co-activated when the canker pathogen is grown on poplar wood and leaves. The gene genealogy is discordant with the species phylogeny, showing a signature of horizontal transfer from fungi associated with wood decay. Furthermore, genes encoding hemicellulose-degrading enzymes are up-regulated on poplar wood chips, with some havingmore » been acquired horizontally. In conclusion, we propose that adaptation to colonize poplar woody stems is the result of acquisition of these genes.« less
Comprehensive Molecular Characterization of Muscle-Invasive Bladder Cancer. | Office of Cancer Genomics

Cancer.gov

We report a comprehensive analysis of 412 muscle-invasive bladder cancers characterized by multiple TCGA analytical platforms. Fifty-eight genes were significantly mutated, and the overall mutational load was associated with APOBEC-signature mutagenesis. Clustering by mutation signature identified a high-mutation subset with 75% 5-year survival.
Signatures of Pleiotropy, Economy and Convergent Evolution in a Domain-Resolved Map of Human–Virus Protein–Protein Interaction Networks

PubMed Central

Garamszegi, Sara; Franzosa, Eric A.; Xia, Yu

2013-01-01

A central challenge in host-pathogen systems biology is the elucidation of general, systems-level principles that distinguish host-pathogen interactions from within-host interactions. Current analyses of host-pathogen and within-host protein-protein interaction networks are largely limited by their resolution, treating proteins as nodes and interactions as edges. Here, we construct a domain-resolved map of human-virus and within-human protein-protein interaction networks by annotating protein interactions with high-coverage, high-accuracy, domain-centric interaction mechanisms: (1) domain-domain interactions, in which a domain in one protein binds to a domain in a second protein, and (2) domain-motif interactions, in which a domain in one protein binds to a short, linear peptide motif in a second protein. Analysis of these domain-resolved networks reveals, for the first time, significant mechanistic differences between virus-human and within-human interactions at the resolution of single domains. While human proteins tend to compete with each other for domain binding sites by means of sequence similarity, viral proteins tend to compete with human proteins for domain binding sites in the absence of sequence similarity. Independent of their previously established preference for targeting human protein hubs, viral proteins also preferentially target human proteins containing linear motif-binding domains. Compared to human proteins, viral proteins participate in more domain-motif interactions, target more unique linear motif-binding domains per residue, and contain more unique linear motifs per residue. Together, these results suggest that viruses surmount genome size constraints by convergently evolving multiple short linear motifs in order to effectively mimic, hijack, and manipulate complex host processes for their survival. Our domain-resolved analyses reveal unique signatures of pleiotropy, economy, and convergent evolution in viral-host interactions that are otherwise hidden in the traditional binary network, highlighting the power and necessity of high-resolution approaches in host-pathogen systems biology. PMID:24339775
Molecular pathological epidemiology of epigenetics: emerging integrative science to analyze environment, host, and disease.

PubMed

Ogino, Shuji; Lochhead, Paul; Chan, Andrew T; Nishihara, Reiko; Cho, Eunyoung; Wolpin, Brian M; Meyerhardt, Jeffrey A; Meissner, Alexander; Schernhammer, Eva S; Fuchs, Charles S; Giovannucci, Edward

2013-04-01

Epigenetics acts as an interface between environmental/exogenous factors, cellular responses, and pathological processes. Aberrant epigenetic signatures are a hallmark of complex multifactorial diseases (including neoplasms and malignancies such as leukemias, lymphomas, sarcomas, and breast, lung, prostate, liver, and colorectal cancers). Epigenetic signatures (DNA methylation, mRNA and microRNA expression, etc) may serve as biomarkers for risk stratification, early detection, and disease classification, as well as targets for therapy and chemoprevention. In particular, DNA methylation assays are widely applied to formalin-fixed, paraffin-embedded archival tissue specimens as clinical pathology tests. To better understand the interplay between etiological factors, cellular molecular characteristics, and disease evolution, the field of 'molecular pathological epidemiology (MPE)' has emerged as an interdisciplinary integration of 'molecular pathology' and 'epidemiology'. In contrast to traditional epidemiological research including genome-wide association studies (GWAS), MPE is founded on the unique disease principle, that is, each disease process results from unique profiles of exposomes, epigenomes, transcriptomes, proteomes, metabolomes, microbiomes, and interactomes in relation to the macroenvironment and tissue microenvironment. MPE may represent a logical evolution of GWAS, termed 'GWAS-MPE approach'. Although epigenome-wide association study attracts increasing attention, currently, it has a fundamental problem in that each cell within one individual has a unique, time-varying epigenome. Having a similar conceptual framework to systems biology, the holistic MPE approach enables us to link potential etiological factors to specific molecular pathology, and gain novel pathogenic insights on causality. The widespread application of epigenome (eg, methylome) analyses will enhance our understanding of disease heterogeneity, epigenotypes (CpG island methylator phenotype, LINE-1 (long interspersed nucleotide element-1; also called long interspersed nuclear element-1; long interspersed element-1; L1) hypomethylation, etc), and host-disease interactions. In this article, we illustrate increasing contribution of modern pathology to broader public health sciences, which attests pivotal roles of pathologists in the new integrated MPE science towards our ultimate goal of personalized medicine and prevention.
Molecular Pathological Epidemiology of Epigenetics: Emerging Integrative Science to Analyze Environment, Host, and Disease

PubMed Central

Ogino, Shuji; Lochhead, Paul; Chan, Andrew T.; Nishihara, Reiko; Cho, Eunyoung; Wolpin, Brian M.; Meyerhardt, Jeffrey A.; Meissner, Alexander; Schernhammer, Eva S.; Fuchs, Charles S.; Giovannucci, Edward

2013-01-01

Epigenetics acts as an interface between environmental / exogenous factors, cellular responses and pathological processes. Aberrant epigenetic signatures are a hallmark of complex multifactorial diseases, including non-neoplastic disorders (e.g., cardiovascular diseases, hypertension, diabetes mellitus, autoimmune diseases, and some infectious diseases) and neoplasms (e.g., leukemias, lymphomas, sarcomas, and breast, lung, prostate, liver and colorectal cancers). Epigenetic signatures (DNA methylation, mRNA and microRNA expression, etc.) may serve as biomarkers for risk stratification, early detection, and disease classification, as well as targets for therapy and chemoprevention. DNA methylation assays are widely applied to formalin-fixed paraffin-embedded archival tissue specimens as clinical pathology tests. To better understand the interplay between etiologic factors, cellular molecular characteristics, and disease evolution, the field of “Molecular Pathological Epidemiology (MPE)” has emerged as an interdisciplinary integration of “molecular pathology” and “epidemiology”, with a similar conceptual framework to systems biology and network medicine. In contrast to traditional epidemiologic research including genome-wide association studies (GWAS), MPE is founded on the unique disease principle; that is, each disease process results from unique profiles of exposomes, epigenomes, transcriptomes, proteomes, metabolomes, microbiomes, and interactomes in relation to the macro-environment and tissue microenvironment. The widespread application of epigenomics (e.g., methylome) analyses will enhance our understanding of disease heterogeneity, epigenotypes (CpG island methylator phenotype, LINE-1 hypomethylation, etc.), and host-disease interactions. MPE may represent a logical evolution of GWAS, termed “GWAS-MPE approach”. Though epigenome-wide association study attracts increasing attention, currently, it has a fundamental problem in that each cell within one individual has a unique, time-varying epigenome. This article will illustrate increasing contribution of modern pathology to broader public health sciences, which attests pivotal roles of pathologists in the new integrated MPE science towards our ultimate goal of personalized medicine and prevention. PMID:23307060
Signatures of pleiotropy, economy and convergent evolution in a domain-resolved map of human-virus protein-protein interaction networks.

PubMed

Garamszegi, Sara; Franzosa, Eric A; Xia, Yu

2013-01-01

A central challenge in host-pathogen systems biology is the elucidation of general, systems-level principles that distinguish host-pathogen interactions from within-host interactions. Current analyses of host-pathogen and within-host protein-protein interaction networks are largely limited by their resolution, treating proteins as nodes and interactions as edges. Here, we construct a domain-resolved map of human-virus and within-human protein-protein interaction networks by annotating protein interactions with high-coverage, high-accuracy, domain-centric interaction mechanisms: (1) domain-domain interactions, in which a domain in one protein binds to a domain in a second protein, and (2) domain-motif interactions, in which a domain in one protein binds to a short, linear peptide motif in a second protein. Analysis of these domain-resolved networks reveals, for the first time, significant mechanistic differences between virus-human and within-human interactions at the resolution of single domains. While human proteins tend to compete with each other for domain binding sites by means of sequence similarity, viral proteins tend to compete with human proteins for domain binding sites in the absence of sequence similarity. Independent of their previously established preference for targeting human protein hubs, viral proteins also preferentially target human proteins containing linear motif-binding domains. Compared to human proteins, viral proteins participate in more domain-motif interactions, target more unique linear motif-binding domains per residue, and contain more unique linear motifs per residue. Together, these results suggest that viruses surmount genome size constraints by convergently evolving multiple short linear motifs in order to effectively mimic, hijack, and manipulate complex host processes for their survival. Our domain-resolved analyses reveal unique signatures of pleiotropy, economy, and convergent evolution in viral-host interactions that are otherwise hidden in the traditional binary network, highlighting the power and necessity of high-resolution approaches in host-pathogen systems biology.
Signatures of positive selection in East African Shorthorn Zebu: A genome-wide single nucleotide polymorphism analysis

PubMed Central

Bahbahani, Hussain; Clifford, Harry; Wragg, David; Mbole-Kariuki, Mary N; Van Tassell, Curtis; Sonstegard, Tad; Woolhouse, Mark; Hanotte, Olivier

2015-01-01

The small East African Shorthorn Zebu (EASZ) is the main indigenous cattle across East Africa. A recent genome wide SNP analysis revealed an ancient stable African taurine x Asian zebu admixture. Here, we assess the presence of candidate signatures of positive selection in their genome, with the aim to provide qualitative insights about the corresponding selective pressures. Four hundred and twenty-five EASZ and four reference populations (Holstein-Friesian, Jersey, N’Dama and Nellore) were analysed using 46,171 SNPs covering all autosomes and the X chromosome. Following FST and two extended haplotype homozygosity-based (iHS and Rsb) analyses 24 candidate genome regions within 14 autosomes and the X chromosome were revealed, in which 18 and 4 were previously identified in tropical-adapted and commercial breeds, respectively. These regions overlap with 340 bovine QTL. They include 409 annotated genes, in which 37 were considered as candidates. These genes are involved in various biological pathways (e.g. immunity, reproduction, development and heat tolerance). Our results support that different selection pressures (e.g. environmental constraints, human selection, genome admixture constrains) have shaped the genome of EASZ. We argue that these candidate regions represent genome landmarks to be maintained in breeding programs aiming to improve sustainable livestock productivity in the tropics. PMID:26130263

Clock-like mutational processes in human somatic cells

DOE PAGES

Alexandrov, Ludmil B.; Jones, Philip H.; Wedge, David C.; ...

2015-11-09

During the course of a lifetime, somatic cells acquire mutations. Different mutational processes may contribute to the mutations accumulated in a cell, with each imprinting a mutational signature on the cell's genome. Some processes generate mutations throughout life at a constant rate in all individuals, and the number of mutations in a cell attributable to these processes will be proportional to the chronological age of the person. Using mutations from 10,250 cancer genomes across 36 cancer types, we investigated clock-like mutational processes that have been operating in normal human cells. Two mutational signatures show clock-like properties. Both exhibit different mutationmore » rates in different tissues. However, their mutation rates are not correlated, indicating that the underlying processes are subject to different biological influences. For one signature, the rate of cell division may influence its mutation rate. This paper provides the first survey of clock-like mutational processes operating in human somatic cells.« less
Clock-like mutational processes in human somatic cells

PubMed Central

Alexandrov, Ludmil B.; Jones, Philip H.; Wedge, David C.; Sale, Julian E.; Campbell, Peter J.; Nik-Zainal, Serena; Stratton, Michael R.

2016-01-01

During the course of a lifetime somatic cells acquire mutations. Different mutational processes may contribute to the mutations accumulated in a cell, with each imprinting a mutational signature on the cell’s genome. Some processes generate mutations throughout life at a constant rate in all individuals and the number of mutations in a cell attributable to these processes will be proportional to the chronological age of the person. Using mutations from 10,250 cancer genomes across 36 cancer types, we investigated clock-like mutational processes that have been operating in normal human cells. Two mutational signatures show clock-like properties. Both exhibit different mutation rates in different tissues. However, their mutation rates are not correlated indicating that the underlying processes are subject to different biological influences. For one signature, the rate of cell division may influence its mutation rate. This study provides the first survey of clock-like mutational processes operative in human somatic cells. PMID:26551669
A genome-wide scan for selection signatures in Nelore cattle

USDA-ARS?s Scientific Manuscript database

Brazilian Nelore cattle have been selected for growth traits over more than four decades. In recent years, reproductive and meat quality traits have become more important because of increasing consumption, exports and consumer demand. The identification of genomic regions altered by artificial selec...
Discovery of functional elements in 12 Drosophila genomes using evolutionary signatures

PubMed Central

Stark, Alexander; Lin, Michael F.; Kheradpour, Pouya; Pedersen, Jakob S.; Parts, Leopold; Carlson, Joseph W.; Crosby, Madeline A.; Rasmussen, Matthew D.; Roy, Sushmita; Deoras, Ameya N.; Ruby, J. Graham; Brennecke, Julius; Hodges, Emily; Hinrichs, Angie S.; Caspi, Anat; Paten, Benedict; Park, Seung-Won; Han, Mira V.; Maeder, Morgan L.; Polansky, Benjamin J.; Robson, Bryanne E.; Aerts, Stein; van Helden, Jacques; Hassan, Bassem; Gilbert, Donald G.; Eastman, Deborah A.; Rice, Michael; Weir, Michael; Hahn, Matthew W.; Park, Yongkyu; Dewey, Colin N.; Pachter, Lior; Kent, W. James; Haussler, David; Lai, Eric C.; Bartel, David P.; Hannon, Gregory J.; Kaufman, Thomas C.; Eisen, Michael B.; Clark, Andrew G.; Smith, Douglas; Celniker, Susan E.; Gelbart, William M.; Kellis, Manolis

2008-01-01

Sequencing of multiple related species followed by comparative genomics analysis constitutes a powerful approach for the systematic understanding of any genome. Here, we use the genomes of 12 Drosophila species for the de novo discovery of functional elements in the fly. Each type of functional element shows characteristic patterns of change, or ‘evolutionary signatures’, dictated by its precise selective constraints. Such signatures enable recognition of new protein-coding genes and exons, spurious and incorrect gene annotations, and numerous unusual gene structures, including abundant stop-codon readthrough. Similarly, we predict non-protein-coding RNA genes and structures, and new microRNA (miRNA) genes. We provide evidence of miRNA processing and functionality from both hairpin arms and both DNA strands. We identify several classes of pre- and post-transcriptional regulatory motifs, and predict individual motif instances with high confidence. We also study how discovery power scales with the divergence and number of species compared, and we provide general guidelines for comparative studies. PMID:17994088
Use of qualitative environmental and phenotypic variables in the context of allele distribution models: detecting signatures of selection in the genome of Lake Victoria cichlids.

PubMed

Joost, Stéphane; Kalbermatten, Michael; Bezault, Etienne; Seehausen, Ole

2012-01-01

When searching for loci possibly under selection in the genome, an alternative to population genetics theoretical models is to establish allele distribution models (ADM) for each locus to directly correlate allelic frequencies and environmental variables such as precipitation, temperature, or sun radiation. Such an approach implementing multiple logistic regression models in parallel was implemented within a computing program named MATSAM: . Recently, this application was improved in order to support qualitative environmental predictors as well as to permit the identification of associations between genomic variation and individual phenotypes, allowing the detection of loci involved in the genetic architecture of polymorphic characters. Here, we present the corresponding methodological developments and compare the results produced by software implementing population genetics theoretical models (DFDIST: and BAYESCAN: ) and ADM (MATSAM: ) in an empirical context to detect signatures of genomic divergence associated with speciation in Lake Victoria cichlid fishes.
Development of a single nucleotide polymorphism barcode to genotype Plasmodium vivax infections.

PubMed

Baniecki, Mary Lynn; Faust, Aubrey L; Schaffner, Stephen F; Park, Daniel J; Galinsky, Kevin; Daniels, Rachel F; Hamilton, Elizabeth; Ferreira, Marcelo U; Karunaweera, Nadira D; Serre, David; Zimmerman, Peter A; Sá, Juliana M; Wellems, Thomas E; Musset, Lise; Legrand, Eric; Melnikov, Alexandre; Neafsey, Daniel E; Volkman, Sarah K; Wirth, Dyann F; Sabeti, Pardis C

2015-03-01

Plasmodium vivax, one of the five species of Plasmodium parasites that cause human malaria, is responsible for 25-40% of malaria cases worldwide. Malaria global elimination efforts will benefit from accurate and effective genotyping tools that will provide insight into the population genetics and diversity of this parasite. The recent sequencing of P. vivax isolates from South America, Africa, and Asia presents a new opportunity by uncovering thousands of novel single nucleotide polymorphisms (SNPs). Genotyping a selection of these SNPs provides a robust, low-cost method of identifying parasite infections through their unique genetic signature or barcode. Based on our experience in generating a SNP barcode for P. falciparum using High Resolution Melting (HRM), we have developed a similar tool for P. vivax. We selected globally polymorphic SNPs from available P. vivax genome sequence data that were located in putatively selectively neutral sites (i.e., intergenic, intronic, or 4-fold degenerate coding). From these candidate SNPs we defined a barcode consisting of 42 SNPs. We analyzed the performance of the 42-SNP barcode on 87 P. vivax clinical samples from parasite populations in South America (Brazil, French Guiana), Africa (Ethiopia) and Asia (Sri Lanka). We found that the P. vivax barcode is robust, as it requires only a small quantity of DNA (limit of detection 0.3 ng/μl) to yield reproducible genotype calls, and detects polymorphic genotypes with high sensitivity. The markers are informative across all clinical samples evaluated (average minor allele frequency > 0.1). Population genetic and statistical analyses show the barcode captures high degrees of population diversity and differentiates geographically distinct populations. Our 42-SNP barcode provides a robust, informative, and standardized genetic marker set that accurately identifies a genomic signature for P. vivax infections.
Development of a Single Nucleotide Polymorphism Barcode to Genotype Plasmodium vivax Infections

PubMed Central

Baniecki, Mary Lynn; Faust, Aubrey L.; Schaffner, Stephen F.; Park, Daniel J.; Galinsky, Kevin; Daniels, Rachel F.; Hamilton, Elizabeth; Ferreira, Marcelo U.; Karunaweera, Nadira D.; Serre, David; Zimmerman, Peter A.; Sá, Juliana M.; Wellems, Thomas E.; Musset, Lise; Legrand, Eric; Melnikov, Alexandre; Neafsey, Daniel E.; Volkman, Sarah K.; Wirth, Dyann F.; Sabeti, Pardis C.

2015-01-01

Plasmodium vivax, one of the five species of Plasmodium parasites that cause human malaria, is responsible for 25–40% of malaria cases worldwide. Malaria global elimination efforts will benefit from accurate and effective genotyping tools that will provide insight into the population genetics and diversity of this parasite. The recent sequencing of P. vivax isolates from South America, Africa, and Asia presents a new opportunity by uncovering thousands of novel single nucleotide polymorphisms (SNPs). Genotyping a selection of these SNPs provides a robust, low-cost method of identifying parasite infections through their unique genetic signature or barcode. Based on our experience in generating a SNP barcode for P. falciparum using High Resolution Melting (HRM), we have developed a similar tool for P. vivax. We selected globally polymorphic SNPs from available P. vivax genome sequence data that were located in putatively selectively neutral sites (i.e., intergenic, intronic, or 4-fold degenerate coding). From these candidate SNPs we defined a barcode consisting of 42 SNPs. We analyzed the performance of the 42-SNP barcode on 87 P. vivax clinical samples from parasite populations in South America (Brazil, French Guiana), Africa (Ethiopia) and Asia (Sri Lanka). We found that the P. vivax barcode is robust, as it requires only a small quantity of DNA (limit of detection 0.3 ng/μl) to yield reproducible genotype calls, and detects polymorphic genotypes with high sensitivity. The markers are informative across all clinical samples evaluated (average minor allele frequency > 0.1). Population genetic and statistical analyses show the barcode captures high degrees of population diversity and differentiates geographically distinct populations. Our 42-SNP barcode provides a robust, informative, and standardized genetic marker set that accurately identifies a genomic signature for P. vivax infections. PMID:25781890
Centromere location in Arabidopsis is unaltered by extreme divergence in CENH3 protein sequence.

PubMed

Maheshwari, Shamoni; Ishii, Takayoshi; Brown, C Titus; Houben, Andreas; Comai, Luca

2017-03-01

During cell division, spindle fibers attach to chromosomes at centromeres. The DNA sequence at regional centromeres is fast evolving with no conserved genetic signature for centromere identity. Instead CENH3, a centromere-specific histone H3 variant, is the epigenetic signature that specifies centromere location across both plant and animal kingdoms. Paradoxically, CENH3 is also adaptively evolving. An ongoing question is whether CENH3 evolution is driven by a functional relationship with the underlying DNA sequence. Here, we demonstrate that despite extensive protein sequence divergence, CENH3 histones from distant species assemble centromeres on the same underlying DNA sequence. We first characterized the organization and diversity of centromere repeats in wild-type Arabidopsis thaliana We show that A. thaliana CENH3-containing nucleosomes exhibit a strong preference for a unique subset of centromeric repeats. These sequences are largely missing from the genome assemblies and represent the youngest and most homogeneous class of repeats. Next, we tested the evolutionary specificity of this interaction in a background in which the native A. thaliana CENH3 is replaced with CENH3s from distant species. Strikingly, we find that CENH3 from Lepidium oleraceum and Zea mays , although specifying epigenetically weaker centromeres that result in genome elimination upon outcrossing, show a binding pattern on A. thaliana centromere repeats that is indistinguishable from the native CENH3. Our results demonstrate positional stability of a highly diverged CENH3 on independently evolved repeats, suggesting that the sequence specificity of centromeres is determined by a mechanism independent of CENH3. © 2017 Maheshwari et al.; Published by Cold Spring Harbor Laboratory Press.
Cell-type-specific enrichment of risk-associated regulatory elements at ovarian cancer susceptibility loci.

PubMed

Coetzee, Simon G; Shen, Howard C; Hazelett, Dennis J; Lawrenson, Kate; Kuchenbaecker, Karoline; Tyrer, Jonathan; Rhie, Suhn K; Levanon, Keren; Karst, Alison; Drapkin, Ronny; Ramus, Susan J; Couch, Fergus J; Offit, Kenneth; Chenevix-Trench, Georgia; Monteiro, Alvaro N A; Antoniou, Antonis; Freedman, Matthew; Coetzee, Gerhard A; Pharoah, Paul D P; Noushmehr, Houtan; Gayther, Simon A

2015-07-01

Understanding the regulatory landscape of the human genome is a central question in complex trait genetics. Most single-nucleotide polymorphisms (SNPs) associated with cancer risk lie in non-protein-coding regions, implicating regulatory DNA elements as functional targets of susceptibility variants. Here, we describe genome-wide annotation of regions of open chromatin and histone modification in fallopian tube and ovarian surface epithelial cells (FTSECs, OSECs), the debated cellular origins of high-grade serous ovarian cancers (HGSOCs) and in endometriosis epithelial cells (EECs), the likely precursor of clear cell ovarian carcinomas (CCOCs). The regulatory architecture of these cell types was compared with normal human mammary epithelial cells and LNCaP prostate cancer cells. We observed similar positional patterns of global enhancer signatures across the three different ovarian cancer precursor cell types, and evidence of tissue-specific regulatory signatures compared to non-gynecological cell types. We found significant enrichment for risk-associated SNPs intersecting regulatory biofeatures at 17 known HGSOC susceptibility loci in FTSECs (P = 3.8 × 10(-30)), OSECs (P = 2.4 × 10(-23)) and HMECs (P = 6.7 × 10(-15)) but not for EECs (P = 0.45) or LNCaP cells (P = 0.88). Hierarchical clustering of risk SNPs conditioned on the six different cell types indicates FTSECs and OSECs are highly related (96% of samples using multi-scale bootstrapping) suggesting both cell types may be precursors of HGSOC. These data represent the first description of regulatory catalogues of normal precursor cells for different ovarian cancer subtypes, and provide unique insights into the tissue specific regulatory variation with respect to the likely functional targets of germline genetic susceptibility variants for ovarian cancer. © The Author 2015. Published by Oxford University Press. All rights reserved. For Permissions, please email: journals.permissions@oup.com.
Unraveling the Light-Specific Metabolic and Regulatory Signatures of Rice through Combined in Silico Modeling and Multiomics Analysis1[OPEN

PubMed Central

Lim, Sun-Hyung; Kim, Jae Kwang; Ha, Sun-Hwa

2015-01-01

Light quality is an important signaling component upon which plants orchestrate various morphological processes, including seed germination and seedling photomorphogenesis. However, it is still unclear how plants, especially food crops, sense various light qualities and modulate their cellular growth and other developmental processes. Therefore, in this work, we initially profiled the transcripts of a model crop, rice (Oryza sativa), under four different light treatments (blue, green, red, and white) as well as in the dark. Concurrently, we reconstructed a fully compartmentalized genome-scale metabolic model of rice cells, iOS2164, containing 2,164 unique genes, 2,283 reactions, and 1,999 metabolites. We then combined the model with transcriptome profiles to elucidate the light-specific transcriptional signatures of rice metabolism. Clearly, light signals mediated rice gene expressions, differentially regulating numerous metabolic pathways: photosynthesis and secondary metabolism were up-regulated in blue light, whereas reserve carbohydrates degradation was pronounced in the dark. The topological analysis of gene expression data with the rice genome-scale metabolic model further uncovered that phytohormones, such as abscisate, ethylene, gibberellin, and jasmonate, are the key biomarkers of light-mediated regulation, and subsequent analysis of the associated genes’ promoter regions identified several light-specific transcription factors. Finally, the transcriptional control of rice metabolism by red and blue light signals was assessed by integrating the transcriptome and metabolome data with constraint-based modeling. The biological insights gained from this integrative systems biology approach offer several potential applications, such as improving the agronomic traits of food crops and designing light-specific synthetic gene circuits in microbial and mammalian systems. PMID:26453433
Progesterone receptor isoforms, agonists and antagonists differentially reprogram estrogen signaling

PubMed Central

Singhal, Hari; Greene, Marianne E.; Zarnke, Allison L.; Laine, Muriel; Al Abosy, Rose; Chang, Ya-Fang; Dembo, Anna G.; Schoenfelt, Kelly; Vadhi, Raga; Qiu, Xintao; Rao, Prakash; Santhamma, Bindu; Nair, Hareesh B.; Nickisch, Klaus J.; Long, Henry W.; Becker, Lev; Brown, Myles; Greene, Geoffrey L.

2018-01-01

Major roadblocks to developing effective progesterone receptor (PR)-targeted therapies in breast cancer include the lack of highly-specific PR modulators, a poor understanding of the pro- or anti-tumorigenic networks for PR isoforms and ligands, and an incomplete understanding of the cross talk between PR and estrogen receptor (ER) signaling. Through genomic analyses of xenografts treated with various clinically-relevant ER and PR-targeting drugs, we describe how the activation or inhibition of PR differentially reprograms estrogen signaling, resulting in the segregation of transcriptomes into separate PR agonist and antagonist-mediated groups. These findings address an ongoing controversy regarding the clinical utility of PR agonists and antagonists, alone or in combination with tamoxifen, for breast cancer management. Additionally, the two PR isoforms PRA and PRB, bind distinct but overlapping genomic sites and interact with different sets of co-regulators to differentially modulate estrogen signaling to be either pro- or anti-tumorigenic. Of the two isoforms, PRA inhibited gene expression and ER chromatin binding significantly more than PRB. Differential gene expression was observed in PRA and PRB-rich patient tumors and PRA-rich gene signatures had poorer survival outcomes. In support of antiprogestin responsiveness of PRA-rich tumors, gene signatures associated with PR antagonists, but not PR agonists, predicted better survival outcomes. The better patient survival associated with PR antagonists versus PR agonists treatments was further reflected in the higher in vivo anti-tumor activity of therapies that combine tamoxifen with PR antagonists and modulators. This study suggests that distinguishing common effects observed due to concomitant interaction of another receptor with its ligand (agonist or antagonist), from unique isoform and ligand-specific effects will inform the development of biomarkers for patient selection and translation of PR-targeted therapies to the clinic. PMID:29435103
A neural signature of the unique hues

PubMed Central

Forder, Lewis; Bosten, Jenny; He, Xun; Franklin, Anna

2017-01-01

Since at least the 17th century there has been the idea that there are four simple and perceptually pure “unique” hues: red, yellow, green, and blue, and that all other hues are perceived as mixtures of these four hues. However, sustained scientific investigation has not yet provided solid evidence for a neural representation that separates the unique hues from other colors. We measured event-related potentials elicited from unique hues and the ‘intermediate’ hues in between them. We find a neural signature of the unique hues 230 ms after stimulus onset at a post-perceptual stage of visual processing. Specifically, the posterior P2 component over the parieto-occipital lobe peaked significantly earlier for the unique than for the intermediate hues (Z = −2.9, p = 0.004). Having identified a neural marker for unique hues, fundamental questions about the contribution of neural hardwiring, language and environment to the unique hues can now be addressed. PMID:28186142
The molecular basis of breast cancer pathological phenotypes.

PubMed

Heng, Yujing J; Lester, Susan C; Tse, Gary Mk; Factor, Rachel E; Allison, Kimberly H; Collins, Laura C; Chen, Yunn-Yi; Jensen, Kristin C; Johnson, Nicole B; Jeong, Jong Cheol; Punjabi, Rahi; Shin, Sandra J; Singh, Kamaljeet; Krings, Gregor; Eberhard, David A; Tan, Puay Hoon; Korski, Konstanty; Waldman, Frederic M; Gutman, David A; Sanders, Melinda; Reis-Filho, Jorge S; Flanagan, Sydney R; Gendoo, Deena Ma; Chen, Gregory M; Haibe-Kains, Benjamin; Ciriello, Giovanni; Hoadley, Katherine A; Perou, Charles M; Beck, Andrew H

2017-02-01

The histopathological evaluation of morphological features in breast tumours provides prognostic information to guide therapy. Adjunct molecular analyses provide further diagnostic, prognostic and predictive information. However, there is limited knowledge of the molecular basis of morphological phenotypes in invasive breast cancer. This study integrated genomic, transcriptomic and protein data to provide a comprehensive molecular profiling of morphological features in breast cancer. Fifteen pathologists assessed 850 invasive breast cancer cases from The Cancer Genome Atlas (TCGA). Morphological features were significantly associated with genomic alteration, DNA methylation subtype, PAM50 and microRNA subtypes, proliferation scores, gene expression and/or reverse-phase protein assay subtype. Marked nuclear pleomorphism, necrosis, inflammation and a high mitotic count were associated with the basal-like subtype, and had a similar molecular basis. Omics-based signatures were constructed to predict morphological features. The association of morphology transcriptome signatures with overall survival in oestrogen receptor (ER)-positive and ER-negative breast cancer was first assessed by use of the Molecular Taxonomy of Breast Cancer International Consortium (METABRIC) dataset; signatures that remained prognostic in the METABRIC multivariate analysis were further evaluated in five additional datasets. The transcriptomic signature of poorly differentiated epithelial tubules was prognostic in ER-positive breast cancer. No signature was prognostic in ER-negative breast cancer. This study provided new insights into the molecular basis of breast cancer morphological phenotypes. The integration of morphological with molecular data has the potential to refine breast cancer classification, predict response to therapy, enhance our understanding of breast cancer biology, and improve clinical management. This work is publicly accessible at www.dx.ai/tcga_breast. Copyright © 2016 Pathological Society of Great Britain and Ireland. Published by John Wiley & Sons, Ltd. Copyright © 2016 Pathological Society of Great Britain and Ireland. Published by John Wiley & Sons, Ltd.
Revealing the selection history of adaptive loci using genome-wide scans for selection: an example from domestic sheep.

PubMed

Rochus, Christina Marie; Tortereau, Flavie; Plisson-Petit, Florence; Restoux, Gwendal; Moreno-Romieux, Carole; Tosser-Klopp, Gwenola; Servin, Bertrand

2018-01-23

One of the approaches to detect genetics variants affecting fitness traits is to identify their surrounding genomic signatures of past selection. With established methods for detecting selection signatures and the current and future availability of large datasets, such studies should have the power to not only detect these signatures but also to infer their selective histories. Domesticated animals offer a powerful model for these approaches as they adapted rapidly to environmental and human-mediated constraints in a relatively short time. We investigated this question by studying a large dataset of 542 individuals from 27 domestic sheep populations raised in France, genotyped for more than 500,000 SNPs. Population structure analysis revealed that this set of populations harbour a large part of European sheep diversity in a small geographical area, offering a powerful model for the study of adaptation. Identification of extreme SNP and haplotype frequency differences between populations listed 126 genomic regions likely affected by selection. These signatures revealed selection at loci commonly identified as selection targets in many species ("selection hotspots") including ABCG2, LCORL/NCAPG, MSTN, and coat colour genes such as ASIP, MC1R, MITF, and TYRP1. For one of these regions (ABCG2, LCORL/NCAPG), we could propose a historical scenario leading to the introgression of an adaptive allele into a new genetic background. Among selection signatures, we found clear evidence for parallel selection events in different genetic backgrounds, most likely for different mutations. We confirmed this allelic heterogeneity in one case by resequencing the MC1R gene in three black-faced breeds. Our study illustrates how dense genetic data in multiple populations allows the deciphering of evolutionary history of populations and of their adaptive mutations.
Signatures of microevolutionary processes in phylogenetic patterns.

PubMed

Costa, Carolina L N; Lemos-Costa, Paula; Marquitti, Flavia M D; Fernandes, Lucas D; Ramos, Marlon F; Schneider, David M; Martins, Ayana B; Aguiar, Marcus A M

2018-06-23

Phylogenetic trees are representations of evolutionary relationships among species and contain signatures of the processes responsible for the speciation events they display. Inferring processes from tree properties, however, is challenging. To address this problem we analysed a spatially-explicit model of speciation where genome size and mating range can be controlled. We simulated parapatric and sympatric (narrow and wide mating range, respectively) radiations and constructed their phylogenetic trees, computing structural properties such as tree balance and speed of diversification. We showed that parapatric and sympatric speciation are well separated by these structural tree properties. Balanced trees with constant rates of diversification only originate in sympatry and genome size affected both the balance and the speed of diversification of the simulated trees. Comparison with empirical data showed that most of the evolutionary radiations considered to have developed in parapatry or sympatry are in good agreement with model predictions. Even though additional forces other than spatial restriction of gene flow, genome size, and genetic incompatibilities, do play a role in the evolution of species formation, the microevolutionary processes modeled here capture signatures of the diversification pattern of evolutionary radiations, regarding the symmetry and speed of diversification of lineages.
Fission Signatures for Nuclear Material Detection

NASA Astrophysics Data System (ADS)

Gozani, Tsahi

2009-06-01

Detection and interdiction of nuclear materials in all forms of transport is one of the most critical security issues facing the United States and the rest of the civilized world. Naturally emitted gamma rays by these materials, while abundant and detectable when unshielded, are low in energy and readily shielded. X-ray radiography is useful in detecting the possible presence of shielding material. Positive detection of concealed nuclear materials requires methods which unequivocally detect specific attributes of the materials. These methods typically involve active interrogation by penetrating radiation of neutrons, photons or other particles. Fortunately, nuclear materials, probed by various types of radiation, yield very unique and often strong signatures. Paramount among them are the detectable fission signatures, namely prompt neutrons and gamma rays, and delayed neutrons gamma rays. Other useful signatures are the nuclear states excited by neutrons, via inelastic scattering, or photons, via nuclear resonance fluorescence and absorption. The signatures are very different in magnitude, level of specificity, ease of excitation and detection, signal to background ratios, etc. For example, delayed neutrons are very unique to the fission process, but are scarce, have low energy, and hence are easily absorbed. Delayed gamma rays are more abundant but "featureless", and have a higher background from natural sources and more importantly, from activation due to the interrogation sources. The prompt fission signatures need to be measured in the presence of the much higher levels of probing radiation. This requires taking special measures to look for the signatures, sometimes leading to a significant sensitivity loss or a complete inability to detect them. Characteristic gamma rays induced in nuclear materials reflecting their nuclear structure, while rather unique, require very high intensity of interrogation radiation and very high resolution in energy and/or time. The trade off of signatures, their means of stimulation, and methods of detection, will be reviewed.
Development of phoH as a Novel Signature Gene for Assessing Marine Phage Diversity▿

PubMed Central

Goldsmith, Dawn B.; Crosti, Giuseppe; Dwivedi, Bhakti; McDaniel, Lauren D.; Varsani, Arvind; Suttle, Curtis A.; Weinbauer, Markus G.; Sandaa, Ruth-Anne; Breitbart, Mya

2011-01-01

Phages play a key role in the marine environment by regulating the transfer of energy between trophic levels and influencing global carbon and nutrient cycles. The diversity of marine phage communities remains difficult to characterize because of the lack of a signature gene common to all phages. Recent studies have demonstrated the presence of host-derived auxiliary metabolic genes in phage genomes, such as those belonging to the Pho regulon, which regulates phosphate uptake and metabolism under low-phosphate conditions. Among the completely sequenced phage genomes in GenBank, this study identified Pho regulon genes in nearly 40% of the marine phage genomes, while only 4% of nonmarine phage genomes contained these genes. While several Pho regulon genes were identified, phoH was the most prevalent, appearing in 42 out of 602 completely sequenced phage genomes. Phylogenetic analysis demonstrated that phage phoH sequences formed a cluster distinct from those of their bacterial hosts. PCR primers designed to amplify a region of the phoH gene were used to determine the diversity of phage phoH sequences throughout a depth profile in the Sargasso Sea and at six locations worldwide. phoH was present at all sites examined, and a high diversity of phoH sequences was recovered. Most phoH sequences belonged to clusters without any cultured representatives. Each depth and geographic location had a distinct phoH composition, although most phoH clusters were recovered from multiple sites. Overall, phoH is an effective signature gene for examining phage diversity in the marine environment. PMID:21926220
Somatic mutation load of estrogen receptor-positive breast tumors predicts overall survival: an analysis of genome sequence data.

PubMed

Haricharan, Svasti; Bainbridge, Matthew N; Scheet, Paul; Brown, Powel H

2014-07-01

Breast cancer is one of the most commonly diagnosed cancers in women. While there are several effective therapies for breast cancer and important single gene prognostic/predictive markers, more than 40,000 women die from this disease every year. The increasing availability of large-scale genomic datasets provides opportunities for identifying factors that influence breast cancer survival in smaller, well-defined subsets. The purpose of this study was to investigate the genomic landscape of various breast cancer subtypes and its potential associations with clinical outcomes. We used statistical analysis of sequence data generated by the Cancer Genome Atlas initiative including somatic mutation load (SML) analysis, Kaplan-Meier survival curves, gene mutational frequency, and mutational enrichment evaluation to study the genomic landscape of breast cancer. We show that ER(+), but not ER(-), tumors with high SML associate with poor overall survival (HR = 2.02). Further, these high mutation load tumors are enriched for coincident mutations in both DNA damage repair and ER signature genes. While it is known that somatic mutations in specific genes affect breast cancer survival, this study is the first to identify that SML may constitute an important global signature for a subset of ER(+) tumors prone to high mortality. Moreover, although somatic mutations in individual DNA damage genes affect clinical outcome, our results indicate that coincident mutations in DNA damage response and signature ER genes may prove more informative for ER(+) breast cancer survival. Next generation sequencing may prove an essential tool for identifying pathways underlying poor outcomes and for tailoring therapeutic strategies.
Polycomb repressive complex 2 epigenomic signature defines age-associated hypermethylation and gene expression changes

PubMed Central

Dozmorov, Mikhail G

2015-01-01

Although age-associated gene expression and methylation changes have been reported throughout the literature, the unifying epigenomic principles of aging remain poorly understood. Recent explosion in availability and resolution of functional/regulatory genome annotation data (epigenomic data), such as that provided by the ENCODE and Roadmap Epigenomics projects, provides an opportunity for the identification of epigenomic mechanisms potentially altered by age-associated differentially methylated regions (aDMRs) and regulatory signatures in the promoters of age-associated genes (aGENs). In this study we found that aDMRs and aGENs identified in multiple independent studies share a common Polycomb Repressive Complex 2 signature marked by EZH2, SUZ12, CTCF binding sites, repressive H3K27me3, and activating H3K4me1 histone modification marks, and a “poised promoter” chromatin state. This signature is depleted in RNA Polymerase II-associated transcription factor binding sites, activating H3K79me2, H3K36me3, H3K27ac marks, and an “active promoter” chromatin state. The PRC2 signature was shown to be generally stable across cell types. When considering the directionality of methylation changes, we found the PRC2 signature to be associated with aDMRs hypermethylated with age, while hypomethylated aDMRs were associated with enhancers. In contrast, aGENs were associated with the PRC2 signature independently of the directionality of gene expression changes. In this study we demonstrate that the PRC2 signature is the common epigenomic context of genomic regions associated with hypermethylation and gene expression changes in aging. PMID:25880792
Peripheral Blood Gene Expression as a Novel Genomic Biomarker in Complicated Sarcoidosis

PubMed Central

Sweiss, Nadera J.; Chen, Edward S.; Moller, David R.; Knox, Kenneth S.; Ma, Shwu-Fan; Wade, Michael S.; Noth, Imre; Machado, Roberto F.; Garcia, Joe G. N.

2012-01-01

Sarcoidosis, a systemic granulomatous syndrome invariably affecting the lung, typically spontaneously remits but in ∼20% of cases progresses with severe lung dysfunction or cardiac and neurologic involvement (complicated sarcoidosis). Unfortunately, current biomarkers fail to distinguish patients with remitting (uncomplicated) sarcoidosis from other fibrotic lung disorders, and fail to identify individuals at risk for complicated sarcoidosis. We utilized genome-wide peripheral blood gene expression analysis to identify a 20-gene sarcoidosis biomarker signature distinguishing sarcoidosis (n = 39) from healthy controls (n = 35, 86% classification accuracy) and which served as a molecular signature for complicated sarcoidosis (n = 17). As aberrancies in T cell receptor (TCR) signaling, JAK-STAT (JS) signaling, and cytokine-cytokine receptor (CCR) signaling are implicated in sarcoidosis pathogenesis, a 31-gene signature comprised of T cell signaling pathway genes associated with sarcoidosis (TCR/JS/CCR) was compared to the unbiased 20-gene biomarker signature but proved inferior in prediction accuracy in distinguishing complicated from uncomplicated sarcoidosis. Additional validation strategies included significant association of single nucleotide polymorphisms (SNPs) in signature genes with sarcoidosis susceptibility and severity (unbiased signature genes - CX3CR1, FKBP1A, NOG, RBM12B, SENS3, TSHZ2; T cell/JAK-STAT pathway genes such as AKT3, CBLB, DLG1, IFNG, IL2RA, IL7R, ITK, JUN, MALT1, NFATC2, PLCG1, SPRED1). In summary, this validated peripheral blood molecular gene signature appears to be a valuable biomarker in identifying cases with sarcoidoisis and predicting risk for complicated sarcoidosis. PMID:22984568

Fruit and Juice Epigenetic Signatures Are Associated with Independent Immunoregulatory Pathways.

PubMed

Nicodemus-Johnson, Jessie; Sinnott, Robert A

2017-07-14

Epidemiological evidence strongly suggests that fruit consumption promotes many health benefits. Despite the general consensus that fruit and juice are nutritionally similar, epidemiological results for juice consumption are conflicting. Our objective was to use DNA methylation marks to characterize fruit and juice epigenetic signatures within PBMCs and identify shared and independent signatures associated with these groups. Genome-wide DNA methylation marks (Illumina Human Methylation 450k chip) for 2,148 individuals that participated in the Framingham Offspring exam 8 were analyzed for correlations between fruit or juice consumption using standard linear regression. CpG sites with low P -values ( P < 0.01) were characterized using Gene Set Enrichment Analysis (GSEA), Ingenuity Pathway Analysis (IPA), and epigenetic Functional element Overlap analysis of the Results of Genome Wide Association Study Experiments (eFORGE). Fruit and juice-specific low P -value epigenetic signatures were largely independent. Genes near the fruit-specific epigenetic signature were enriched among pathways associated with antigen presentation and chromosome or telomere maintenance, while the juice-specific epigenetic signature was enriched for proinflammatory pathways. IPA and eFORGE analyses implicate fruit and juice-specific epigenetic signatures in the modulation of macrophage (fruit) and B or T cell (juice) activities. These data suggest a role for epigenetic regulation in fruit and juice-specific health benefits and demonstrate independent associations with distinct immune functions and cell types, suggesting that these groups may not confer the same health benefits. Identification of such differences between foods is the first step toward personalized nutrition and ultimately the improvement of human health and longevity.
Fruit and Juice Epigenetic Signatures Are Associated with Independent Immunoregulatory Pathways

PubMed Central

Nicodemus-Johnson, Jessie; Sinnott, Robert A.

2017-01-01

Epidemiological evidence strongly suggests that fruit consumption promotes many health benefits. Despite the general consensus that fruit and juice are nutritionally similar, epidemiological results for juice consumption are conflicting. Our objective was to use DNA methylation marks to characterize fruit and juice epigenetic signatures within PBMCs and identify shared and independent signatures associated with these groups. Genome-wide DNA methylation marks (Illumina Human Methylation 450k chip) for 2,148 individuals that participated in the Framingham Offspring exam 8 were analyzed for correlations between fruit or juice consumption using standard linear regression. CpG sites with low P-values (P < 0.01) were characterized using Gene Set Enrichment Analysis (GSEA), Ingenuity Pathway Analysis (IPA), and epigenetic Functional element Overlap analysis of the Results of Genome Wide Association Study Experiments (eFORGE). Fruit and juice-specific low P-value epigenetic signatures were largely independent. Genes near the fruit-specific epigenetic signature were enriched among pathways associated with antigen presentation and chromosome or telomere maintenance, while the juice-specific epigenetic signature was enriched for proinflammatory pathways. IPA and eFORGE analyses implicate fruit and juice-specific epigenetic signatures in the modulation of macrophage (fruit) and B or T cell (juice) activities. These data suggest a role for epigenetic regulation in fruit and juice-specific health benefits and demonstrate independent associations with distinct immune functions and cell types, suggesting that these groups may not confer the same health benefits. Identification of such differences between foods is the first step toward personalized nutrition and ultimately the improvement of human health and longevity. PMID:28708104
Identification of Genomic Regions Associated with Phenotypic Variation between Dog Breeds using Selection Mapping

PubMed Central

Derrien, Thomas; Axelsson, Erik; Rosengren Pielberg, Gerli; Sigurdsson, Snaevar; Fall, Tove; Seppälä, Eija H.; Hansen, Mark S. T.; Lawley, Cindy T.; Karlsson, Elinor K.; Bannasch, Danika; Vilà, Carles; Lohi, Hannes; Galibert, Francis; Fredholm, Merete; Häggström, Jens; Hedhammar, Åke; André, Catherine; Lindblad-Toh, Kerstin; Hitte, Christophe; Webster, Matthew T.

2011-01-01

The extraordinary phenotypic diversity of dog breeds has been sculpted by a unique population history accompanied by selection for novel and desirable traits. Here we perform a comprehensive analysis using multiple test statistics to identify regions under selection in 509 dogs from 46 diverse breeds using a newly developed high-density genotyping array consisting of >170,000 evenly spaced SNPs. We first identify 44 genomic regions exhibiting extreme differentiation across multiple breeds. Genetic variation in these regions correlates with variation in several phenotypic traits that vary between breeds, and we identify novel associations with both morphological and behavioral traits. We next scan the genome for signatures of selective sweeps in single breeds, characterized by long regions of reduced heterozygosity and fixation of extended haplotypes. These scans identify hundreds of regions, including 22 blocks of homozygosity longer than one megabase in certain breeds. Candidate selection loci are strongly enriched for developmental genes. We chose one highly differentiated region, associated with body size and ear morphology, and characterized it using high-throughput sequencing to provide a list of variants that may directly affect these traits. This study provides a catalogue of genomic regions showing extreme reduction in genetic variation or population differentiation in dogs, including many linked to phenotypic variation. The many blocks of reduced haplotype diversity observed across the genome in dog breeds are the result of both selection and genetic drift, but extended blocks of homozygosity on a megabase scale appear to be best explained by selection. Further elucidation of the variants under selection will help to uncover the genetic basis of complex traits and disease. PMID:22022279
Identification of genomic regions associated with phenotypic variation between dog breeds using selection mapping.

PubMed

Vaysse, Amaury; Ratnakumar, Abhirami; Derrien, Thomas; Axelsson, Erik; Rosengren Pielberg, Gerli; Sigurdsson, Snaevar; Fall, Tove; Seppälä, Eija H; Hansen, Mark S T; Lawley, Cindy T; Karlsson, Elinor K; Bannasch, Danika; Vilà, Carles; Lohi, Hannes; Galibert, Francis; Fredholm, Merete; Häggström, Jens; Hedhammar, Ake; André, Catherine; Lindblad-Toh, Kerstin; Hitte, Christophe; Webster, Matthew T

2011-10-01

The extraordinary phenotypic diversity of dog breeds has been sculpted by a unique population history accompanied by selection for novel and desirable traits. Here we perform a comprehensive analysis using multiple test statistics to identify regions under selection in 509 dogs from 46 diverse breeds using a newly developed high-density genotyping array consisting of >170,000 evenly spaced SNPs. We first identify 44 genomic regions exhibiting extreme differentiation across multiple breeds. Genetic variation in these regions correlates with variation in several phenotypic traits that vary between breeds, and we identify novel associations with both morphological and behavioral traits. We next scan the genome for signatures of selective sweeps in single breeds, characterized by long regions of reduced heterozygosity and fixation of extended haplotypes. These scans identify hundreds of regions, including 22 blocks of homozygosity longer than one megabase in certain breeds. Candidate selection loci are strongly enriched for developmental genes. We chose one highly differentiated region, associated with body size and ear morphology, and characterized it using high-throughput sequencing to provide a list of variants that may directly affect these traits. This study provides a catalogue of genomic regions showing extreme reduction in genetic variation or population differentiation in dogs, including many linked to phenotypic variation. The many blocks of reduced haplotype diversity observed across the genome in dog breeds are the result of both selection and genetic drift, but extended blocks of homozygosity on a megabase scale appear to be best explained by selection. Further elucidation of the variants under selection will help to uncover the genetic basis of complex traits and disease.
Next-generation sequencing of translocation renal cell carcinoma reveals novel RNA splicing partners and frequent mutations of chromatin-remodeling genes.

PubMed

Malouf, Gabriel G; Su, Xiaoping; Yao, Hui; Gao, Jianjun; Xiong, Liangwen; He, Qiuming; Compérat, Eva; Couturier, Jérôme; Molinié, Vincent; Escudier, Bernard; Camparo, Philippe; Doss, Denaha J; Thompson, Erika J; Khayat, David; Wood, Christopher G; Yu, Willie; Teh, Bin T; Weinstein, John; Tannir, Nizar M

2014-08-01

MITF/TFE translocation renal cell carcinoma (TRCC) is a rare subtype of kidney cancer. Its incidence and the genome-wide characterization of its genetic origin have not been fully elucidated. We performed RNA and exome sequencing on an exploratory set of TRCC (n = 7), and validated our findings using The Cancer Genome Atlas (TCGA) clear-cell RCC (ccRCC) dataset (n = 460). Using the TCGA dataset, we identified seven TRCC (1.5%) cases and determined their genomic profile. We discovered three novel partners of MITF/TFE (LUC7L3, KHSRP, and KHDRBS2) that are involved in RNA splicing. TRCC displayed a unique gene expression signature as compared with other RCC types, and showed activation of MITF, the transforming growth factor β1 and the PI3K complex targets. Genes differentially spliced between TRCC and other RCC types were enriched for MITF and ID2 targets. Exome sequencing of TRCC revealed a distinct mutational spectrum as compared with ccRCC, with frequent mutations in chromatin-remodeling genes (six of eight cases, three of which were from the TCGA). In two cases, we identified mutations in INO80D, an ATP-dependent chromatin-remodeling gene, previously shown to control the amplitude of the S phase. Knockdown of INO80D decreased cell proliferation in a novel cell line bearing LUC7L3-TFE3 translocation. This genome-wide study defines the incidence of TRCC within a ccRCC-directed project and expands the genomic spectrum of TRCC by identifying novel MITF/TFE partners involved in RNA splicing and frequent mutations in chromatin-remodeling genes. ©2014 American Association for Cancer Research.
Functional Genomic Characterization of Virulence Factors from Necrotizing Fasciitis-Causing Strains of Aeromonas hydrophila

PubMed Central

Grim, Christopher J.; Kozlova, Elena V.; Ponnusamy, Duraisamy; Fitts, Eric C.; Sha, Jian; Kirtley, Michelle L.; van Lier, Christina J.; Tiner, Bethany L.; Erova, Tatiana E.; Joseph, Sandeep J.; Read, Timothy D.; Shak, Joshua R.; Joseph, Sam W.; Singletary, Ed; Felland, Tracy; Baze, Wallace B.; Horneman, Amy J.

2014-01-01

The genomes of 10 Aeromonas isolates identified and designated Aeromonas hydrophila WI, Riv3, and NF1 to NF4; A. dhakensis SSU; A. jandaei Riv2; and A. caviae NM22 and NM33 were sequenced and annotated. Isolates NF1 to NF4 were from a patient with necrotizing fasciitis (NF). Two environmental isolates (Riv2 and -3) were from the river water from which the NF patient acquired the infection. While isolates NF2 to NF4 were clonal, NF1 was genetically distinct. Outside the conserved core genomes of these 10 isolates, several unique genomic features were identified. The most virulent strains possessed one of the following four virulence factors or a combination of them: cytotoxic enterotoxin, exotoxin A, and type 3 and 6 secretion system effectors AexU and Hcp. In a septicemic-mouse model, SSU, NF1, and Riv2 were the most virulent, while NF2 was moderately virulent. These data correlated with high motility and biofilm formation by the former three isolates. Conversely, in a mouse model of intramuscular infection, NF2 was much more virulent than NF1. Isolates NF2, SSU, and Riv2 disseminated in high numbers from the muscular tissue to the visceral organs of mice, while NF1 reached the liver and spleen in relatively lower numbers on the basis of colony counting and tracking of bioluminescent strains in real time by in vivo imaging. Histopathologically, degeneration of myofibers with significant infiltration of polymorphonuclear cells due to the highly virulent strains was noted. Functional genomic analysis provided data that allowed us to correlate the highly infectious nature of Aeromonas pathotypes belonging to several different species with virulence signatures and their potential ability to cause NF. PMID:24795370
Genome-wide screen identifies a novel prognostic signature for breast cancer survival

DOE PAGES

Mao, Xuan Y.; Lee, Matthew J.; Zhu, Jeffrey; ...

2017-01-21

Large genomic datasets in combination with clinical data can be used as an unbiased tool to identify genes important in patient survival and discover potential therapeutic targets. We used a genome-wide screen to identify 587 genes significantly and robustly deregulated across four independent breast cancer (BC) datasets compared to normal breast tissue. Gene expression of 381 genes was significantly associated with relapse-free survival (RFS) in BC patients. We used a gene co-expression network approach to visualize the genetic architecture in normal breast and BCs. In normal breast tissue, co-expression cliques were identified enriched for cell cycle, gene transcription, cell adhesion,more » cytoskeletal organization and metabolism. In contrast, in BC, only two major co-expression cliques were identified enriched for cell cycle-related processes or blood vessel development, cell adhesion and mammary gland development processes. Interestingly, gene expression levels of 7 genes were found to be negatively correlated with many cell cycle related genes, highlighting these genes as potential tumor suppressors and novel therapeutic targets. A forward-conditional Cox regression analysis was used to identify a 12-gene signature associated with RFS. A prognostic scoring system was created based on the 12-gene signature. This scoring system robustly predicted BC patient RFS in 60 sampling test sets and was further validated in TCGA and METABRIC BC data. Our integrated study identified a 12-gene prognostic signature that could guide adjuvant therapy for BC patients and includes novel potential molecular targets for therapy.« less
Population genomics of the honey bee reveals strong signatures of positive selection on worker traits.

PubMed

Harpur, Brock A; Kent, Clement F; Molodtsova, Daria; Lebon, Jonathan M D; Alqarni, Abdulaziz S; Owayss, Ayman A; Zayed, Amro

2014-02-18

Most theories used to explain the evolution of eusociality rest upon two key assumptions: mutations affecting the phenotype of sterile workers evolve by positive selection if the resulting traits benefit fertile kin, and that worker traits provide the primary mechanism allowing social insects to adapt to their environment. Despite the common view that positive selection drives phenotypic evolution of workers, we know very little about the prevalence of positive selection acting on the genomes of eusocial insects. We mapped the footprints of positive selection in Apis mellifera through analysis of 40 individual genomes, allowing us to identify thousands of genes and regulatory sequences with signatures of adaptive evolution over multiple timescales. We found Apoidea- and Apis-specific genes to be enriched for signatures of positive selection, indicating that novel genes play a disproportionately large role in adaptive evolution of eusocial insects. Worker-biased proteins have higher signatures of adaptive evolution relative to queen-biased proteins, supporting the view that worker traits are key to adaptation. We also found genes regulating worker division of labor to be enriched for signs of positive selection. Finally, genes associated with worker behavior based on analysis of brain gene expression were highly enriched for adaptive protein and cis-regulatory evolution. Our study highlights the significant contribution of worker phenotypes to adaptive evolution in social insects, and provides a wealth of knowledge on the loci that influence fitness in honey bees.
Genome-wide scans between two honeybee populations reveal putative signatures of human-mediated selection.

PubMed

Parejo, M; Wragg, D; Henriques, D; Vignal, A; Neuditschko, M

2017-12-01

Human-mediated selection has left signatures in the genomes of many domesticated animals, including the European dark honeybee, Apis mellifera mellifera, which has been selected by apiculturists for centuries. Using whole-genome sequence information, we investigated selection signatures in spatially separated honeybee subpopulations (Switzerland, n = 39 and France, n = 17). Three different test statistics were calculated in windows of 2 kb (fixation index, cross-population extended haplotype homozygosity and cross-population composite likelihood ratio) and combined into a recently developed composite selection score. Applying a stringent false discovery rate of 0.01, we identified six significant selective sweeps distributed across five chromosomes covering eight genes. These genes are associated with multiple molecular and biological functions, including regulation of transcription, receptor binding and signal transduction. Of particular interest is a selection signature on chromosome 1, which corresponds to the WNT4 gene, the family of which is conserved across the animal kingdom with a variety of functions. In Drosophila melanogaster, WNT4 alleles have been associated with differential wing, cross vein and abdominal phenotypes. Defining phenotypic characteristics of different Apis mellifera ssp., which are typically used as selection criteria, include colour and wing venation pattern. This signal is therefore likely to be a good candidate for human mediated-selection arising from different applied breeding practices in the two managed populations. © 2017 The Authors. Animal Genetics published by John Wiley & Sons Ltd on behalf of Stichting International Foundation for Animal Genetics.
Genome-wide screen identifies a novel prognostic signature for breast cancer survival

DOE Office of Scientific and Technical Information (OSTI.GOV)

Mao, Xuan Y.; Lee, Matthew J.; Zhu, Jeffrey

Large genomic datasets in combination with clinical data can be used as an unbiased tool to identify genes important in patient survival and discover potential therapeutic targets. We used a genome-wide screen to identify 587 genes significantly and robustly deregulated across four independent breast cancer (BC) datasets compared to normal breast tissue. Gene expression of 381 genes was significantly associated with relapse-free survival (RFS) in BC patients. We used a gene co-expression network approach to visualize the genetic architecture in normal breast and BCs. In normal breast tissue, co-expression cliques were identified enriched for cell cycle, gene transcription, cell adhesion,more » cytoskeletal organization and metabolism. In contrast, in BC, only two major co-expression cliques were identified enriched for cell cycle-related processes or blood vessel development, cell adhesion and mammary gland development processes. Interestingly, gene expression levels of 7 genes were found to be negatively correlated with many cell cycle related genes, highlighting these genes as potential tumor suppressors and novel therapeutic targets. A forward-conditional Cox regression analysis was used to identify a 12-gene signature associated with RFS. A prognostic scoring system was created based on the 12-gene signature. This scoring system robustly predicted BC patient RFS in 60 sampling test sets and was further validated in TCGA and METABRIC BC data. Our integrated study identified a 12-gene prognostic signature that could guide adjuvant therapy for BC patients and includes novel potential molecular targets for therapy.« less
Population genomics of the honey bee reveals strong signatures of positive selection on worker traits

PubMed Central

Harpur, Brock A.; Kent, Clement F.; Molodtsova, Daria; Lebon, Jonathan M. D.; Alqarni, Abdulaziz S.; Owayss, Ayman A.; Zayed, Amro

2014-01-01

Most theories used to explain the evolution of eusociality rest upon two key assumptions: mutations affecting the phenotype of sterile workers evolve by positive selection if the resulting traits benefit fertile kin, and that worker traits provide the primary mechanism allowing social insects to adapt to their environment. Despite the common view that positive selection drives phenotypic evolution of workers, we know very little about the prevalence of positive selection acting on the genomes of eusocial insects. We mapped the footprints of positive selection in Apis mellifera through analysis of 40 individual genomes, allowing us to identify thousands of genes and regulatory sequences with signatures of adaptive evolution over multiple timescales. We found Apoidea- and Apis-specific genes to be enriched for signatures of positive selection, indicating that novel genes play a disproportionately large role in adaptive evolution of eusocial insects. Worker-biased proteins have higher signatures of adaptive evolution relative to queen-biased proteins, supporting the view that worker traits are key to adaptation. We also found genes regulating worker division of labor to be enriched for signs of positive selection. Finally, genes associated with worker behavior based on analysis of brain gene expression were highly enriched for adaptive protein and cis-regulatory evolution. Our study highlights the significant contribution of worker phenotypes to adaptive evolution in social insects, and provides a wealth of knowledge on the loci that influence fitness in honey bees. PMID:24488971
Comparison of carnivore, omnivore, and herbivore mammalian genomes with a new leopard assembly.

PubMed

Kim, Soonok; Cho, Yun Sung; Kim, Hak-Min; Chung, Oksung; Kim, Hyunho; Jho, Sungwoong; Seomun, Hong; Kim, Jeongho; Bang, Woo Young; Kim, Changmu; An, Junghwa; Bae, Chang Hwan; Bhak, Youngjune; Jeon, Sungwon; Yoon, Hyejun; Kim, Yumi; Jun, JeHoon; Lee, HyeJin; Cho, Suan; Uphyrkina, Olga; Kostyria, Aleksey; Goodrich, John; Miquelle, Dale; Roelke, Melody; Lewis, John; Yurchenko, Andrey; Bankevich, Anton; Cho, Juok; Lee, Semin; Edwards, Jeremy S; Weber, Jessica A; Cook, Jo; Kim, Sangsoo; Lee, Hang; Manica, Andrea; Lee, Ilbeum; O'Brien, Stephen J; Bhak, Jong; Yeo, Joo-Hong

2016-10-11

There are three main dietary groups in mammals: carnivores, omnivores, and herbivores. Currently, there is limited comparative genomics insight into the evolution of dietary specializations in mammals. Due to recent advances in sequencing technologies, we were able to perform in-depth whole genome analyses of representatives of these three dietary groups. We investigated the evolution of carnivory by comparing 18 representative genomes from across Mammalia with carnivorous, omnivorous, and herbivorous dietary specializations, focusing on Felidae (domestic cat, tiger, lion, cheetah, and leopard), Hominidae, and Bovidae genomes. We generated a new high-quality leopard genome assembly, as well as two wild Amur leopard whole genomes. In addition to a clear contraction in gene families for starch and sucrose metabolism, the carnivore genomes showed evidence of shared evolutionary adaptations in genes associated with diet, muscle strength, agility, and other traits responsible for successful hunting and meat consumption. Additionally, an analysis of highly conserved regions at the family level revealed molecular signatures of dietary adaptation in each of Felidae, Hominidae, and Bovidae. However, unlike carnivores, omnivores and herbivores showed fewer shared adaptive signatures, indicating that carnivores are under strong selective pressure related to diet. Finally, felids showed recent reductions in genetic diversity associated with decreased population sizes, which may be due to the inflexible nature of their strict diet, highlighting their vulnerability and critical conservation status. Our study provides a large-scale family level comparative genomic analysis to address genomic changes associated with dietary specialization. Our genomic analyses also provide useful resources for diet-related genetic and health research.
Comparison of phasing strategies for whole human genomes

PubMed Central

Kirkness, Ewen; Schork, Nicholas J.

2018-01-01

Humans are a diploid species that inherit one set of chromosomes paternally and one homologous set of chromosomes maternally. Unfortunately, most human sequencing initiatives ignore this fact in that they do not directly delineate the nucleotide content of the maternal and paternal copies of the 23 chromosomes individuals possess (i.e., they do not ‘phase’ the genome) often because of the costs and complexities of doing so. We compared 11 different widely-used approaches to phasing human genomes using the publicly available ‘Genome-In-A-Bottle’ (GIAB) phased version of the NA12878 genome as a gold standard. The phasing strategies we compared included laboratory-based assays that prepare DNA in unique ways to facilitate phasing as well as purely computational approaches that seek to reconstruct phase information from general sequencing reads and constructs or population-level haplotype frequency information obtained through a reference panel of haplotypes. To assess the performance of the 11 approaches, we used metrics that included, among others, switch error rates, haplotype block lengths, the proportion of fully phase-resolved genes, phasing accuracy and yield between pairs of SNVs. Our comparisons suggest that a hybrid or combined approach that leverages: 1. population-based phasing using the SHAPEIT software suite, 2. either genome-wide sequencing read data or parental genotypes, and 3. a large reference panel of variant and haplotype frequencies, provides a fast and efficient way to produce highly accurate phase-resolved individual human genomes. We found that for population-based approaches, phasing performance is enhanced with the addition of genome-wide read data; e.g., whole genome shotgun and/or RNA sequencing reads. Further, we found that the inclusion of parental genotype data within a population-based phasing strategy can provide as much as a ten-fold reduction in phasing errors. We also considered a majority voting scheme for the construction of a consensus haplotype combining multiple predictions for enhanced performance and site coverage. Finally, we also identified DNA sequence signatures associated with the genomic regions harboring phasing switch errors, which included regions of low polymorphism or SNV density. PMID:29621242
Conserved Non-Coding Regulatory Signatures in Arabidopsis Co-Expressed Gene Modules

PubMed Central

Spangler, Jacob B.; Ficklin, Stephen P.; Luo, Feng; Freeling, Michael; Feltus, F. Alex

2012-01-01

Complex traits and other polygenic processes require coordinated gene expression. Co-expression networks model mRNA co-expression: the product of gene regulatory networks. To identify regulatory mechanisms underlying coordinated gene expression in a tissue-enriched context, ten Arabidopsis thaliana co-expression networks were constructed after manually sorting 4,566 RNA profiling datasets into aerial, flower, leaf, root, rosette, seedling, seed, shoot, whole plant, and global (all samples combined) groups. Collectively, the ten networks contained 30% of the measurable genes of Arabidopsis and were circumscribed into 5,491 modules. Modules were scrutinized for cis regulatory mechanisms putatively encoded in conserved non-coding sequences (CNSs) previously identified as remnants of a whole genome duplication event. We determined the non-random association of 1,361 unique CNSs to 1,904 co-expression network gene modules. Furthermore, the CNS elements were placed in the context of known gene regulatory networks (GRNs) by connecting 250 CNS motifs with known GRN cis elements. Our results provide support for a regulatory role of some CNS elements and suggest the functional consequences of CNS activation of co-expression in specific gene sets dispersed throughout the genome. PMID:23024789
Conserved non-coding regulatory signatures in Arabidopsis co-expressed gene modules.

PubMed

Spangler, Jacob B; Ficklin, Stephen P; Luo, Feng; Freeling, Michael; Feltus, F Alex

2012-01-01

Complex traits and other polygenic processes require coordinated gene expression. Co-expression networks model mRNA co-expression: the product of gene regulatory networks. To identify regulatory mechanisms underlying coordinated gene expression in a tissue-enriched context, ten Arabidopsis thaliana co-expression networks were constructed after manually sorting 4,566 RNA profiling datasets into aerial, flower, leaf, root, rosette, seedling, seed, shoot, whole plant, and global (all samples combined) groups. Collectively, the ten networks contained 30% of the measurable genes of Arabidopsis and were circumscribed into 5,491 modules. Modules were scrutinized for cis regulatory mechanisms putatively encoded in conserved non-coding sequences (CNSs) previously identified as remnants of a whole genome duplication event. We determined the non-random association of 1,361 unique CNSs to 1,904 co-expression network gene modules. Furthermore, the CNS elements were placed in the context of known gene regulatory networks (GRNs) by connecting 250 CNS motifs with known GRN cis elements. Our results provide support for a regulatory role of some CNS elements and suggest the functional consequences of CNS activation of co-expression in specific gene sets dispersed throughout the genome.
Genetic heterogeneity in cholangiocarcinoma: a major challenge for targeted therapies

PubMed Central

Brandi, Giovanni; Farioli, Andrea; Astolfi, Annalisa; Biasco, Guido; Tavolari, Simona

2015-01-01

Cholangiocarcinoma (CC) encompasses a group of related but distinct malignancies whose lack of a stereotyped genetic signature makes challenging the identification of genomic landscape and the development of effective targeted therapies. Accumulated evidences strongly suggest that the remarkable genetic heterogeneity of CC may be the result of a complex interplay among different causative factors, some shared by most human cancers while others typical of this malignancy. Currently, considerable efforts are ongoing worldwide for the genetic characterization of CC, also using advanced technologies such as next-generation sequencing (NGS). Undoubtedly this technology could offer an unique opportunity to broaden our understanding on CC molecular pathogenesis. Despite this great potential, however, the high complexity in terms of factors potentially contributing to genetic variability in CC calls for a more cautionary application of NGS to this malignancy, in order to avoid possible biases and criticisms in the identification of candidate actionable targets. This approach is further justified by the urgent need to develop effective targeted therapies in this disease. A multidisciplinary approach integrating genomic, functional and clinical studies is therefore mandatory to translate the results obtained by NGS into effective targeted therapies for this orphan disease. PMID:26142706
Genome-wide identification and characterisation of HOT regions in the human genome.

PubMed

Li, Hao; Liu, Feng; Ren, Chao; Bo, Xiaochen; Shu, Wenjie

2016-09-15

HOT (high-occupancy target) regions, which are bound by a surprisingly large number of transcription factors, are considered to be among the most intriguing findings of recent years. An improved understanding of the roles that HOT regions play in biology would be afforded by knowing the constellation of factors that constitute these domains and by identifying HOT regions across the spectrum of human cell types. We characterised and validated HOT regions in embryonic stem cells (ESCs) and produced a catalogue of HOT regions in a broad range of human cell types. We found that HOT regions are associated with genes that control and define the developmental processes of the respective cell and tissue types. We also showed evidence of the developmental persistence of HOT regions at primitive enhancers and demonstrate unique signatures of HOT regions that distinguish them from typical enhancers and super-enhancers. Finally, we performed a dynamic analysis to reveal the dynamical regulation of HOT regions upon H1 differentiation. Taken together, our results provide a resource for the functional exploration of HOT regions and extend our understanding of the key roles of HOT regions in development and differentiation.
Conservation priorities for endangered Indian tigers through a genomic lens.

PubMed

Natesh, Meghana; Atla, Goutham; Nigam, Parag; Jhala, Yadvendradev V; Zachariah, Arun; Borthakur, Udayan; Ramakrishnan, Uma

2017-08-29

Tigers have lost 93% of their historical range worldwide. India plays a vital role in the conservation of tigers since nearly 60% of all wild tigers are currently found here. However, as protected areas are small (<300 km 2 on average), with only a few individuals in each, many of them may not be independently viable. It is thus important to identify and conserve genetically connected populations, as well as to maintain connectivity within them. We collected samples from wild tigers (Panthera tigris tigris) across India and used genome-wide SNPs to infer genetic connectivity. We genotyped 10,184 SNPs from 38 individuals across 17 protected areas and identified three genetically distinct clusters (corresponding to northwest, southern and central India). The northwest cluster was isolated with low variation and high relatedness. The geographically large central cluster included tigers from central, northeastern and northern India, and had the highest variation. Most genetic diversity (62%) was shared among clusters, while unique variation was highest in the central cluster (8.5%) and lowest in the northwestern one (2%). We did not detect signatures of differential selection or local adaptation. We highlight that the northwest population requires conservation attention to ensure persistence of these tigers.
PwRn1, a novel Ty3/gypsy-like retrotransposon of Paragonimus westermani: molecular characters and its differentially preserved mobile potential according to host chromosomal polyploidy.

PubMed

Bae, Young-An; Ahn, Jong-Sook; Kim, Seon-Hee; Rhyu, Mun-Gan; Kong, Yoon; Cho, Seung-Yull

2008-10-14

Retrotransposons have been known to involve in the remodeling and evolution of host genome. These reverse transcribing elements, which show a complex evolutionary pathway with diverse intermediate forms, have been comprehensively analyzed from a wide range of host genomes, while the information remains limited to only a few species in the phylum Platyhelminthes. A LTR retrotransposon and its homologs with a strong phylogenetic affinity toward CsRn1 of Clonorchis sinensis were isolated from a trematode parasite Paragonimus westermani via a degenerate PCR method and from an insect species Anopheles gambiae by in silico analysis of the whole mosquito genome, respectively. These elements, designated PwRn1 and AgCR-1 - AgCR-14 conserved unique features including a t-RNATrp primer binding site and the unusual CHCC signature of Gag proteins. Their flanking LTRs displayed >97% nucleotide identities and thus, these elements were likely to have expanded recently in the trematode and insect genomes. They evolved heterogeneous expression strategies: a single fused ORF, two separate ORFs with an identical reading frame and two ORFs overlapped by -1 frameshifting. Phylogenetic analyses suggested that the elements with the separate ORFs had evolved from an ancestral form(s) with the overlapped ORFs. The mobile potential of PwRn1 was likely to be maintained differentially in association with the karyotype of host genomes, as was examined by the presence/absence of intergenomic polymorphism and mRNA transcripts. Our results on the structural diversity of CsRn1-like elements can provide a molecular tool to dissect a more detailed evolutionary episode of LTR retrotransposons. The PwRn1-associated genomic polymorphism, which is substantial in diploids, will also be informative in addressing genomic diversification following inter-/intra-specific hybridization in P. westermani populations.
Global population genomics and comparisons of selective signatures from two invasions of melon fly, Zeugodacus cucurbitae (Diptera: Tephritidae)

USDA-ARS?s Scientific Manuscript database

Population genetics is a powerful tool for invasion biology and pest management, from tracing invasion pathways to informing management decisions with inference of population demographics. Genomics greatly increases the resolution of population-scale analyses, yet outside of model species with exten...

Genetic Comparison of B. Anthracis and its Close Relatives Using AFLP and PCR Analysis

DOE Office of Scientific and Technical Information (OSTI.GOV)

Jackson, P.J.; Hill, K.K.; Laker, M.T.

1999-02-01

Amplified Fragment length Polymorphism (AFLP) analysis allows a rapid, relatively simple analysis of a large portion of a microbial genome, providing information about the species and its phylogenetic relationship to other microbes (Vos, et al., 1995). The method simply surveys the genome for length and sequence polymorphisms. The pattern identified can be used for comparison to the genomes of other species. Unlike other methods, it does not rely on analysis of a single genetic locus that may bias the interpretation of results and it does not require any prior knowledge of the targeted organism. Moreover, a standard set of reagentsmore » can be applied to any species without using species-specific information or molecular probes. The authors are using AFLP's to rapidly identify different bacterial species. A comparison of AFLP profiles generated from a large battery of B. anthracis strains shows very little variability among different isolates (Keim, et al., 1997). By contrast, there is a significant difference between AFLP profiles generated for any B. anthracis strain and even the most closely related Bacillus species. Sufficient variability is apparent among all known microbial species to allow phylogenetic analysis based on large numbers of genetically unlinked loci. These striking differences among AFLP profiles allow unambiguous identification of previously identified species and phylogenetic placement of newly characterized isolates relative to known species based on a large number of independent genetic loci. Data generated thus far show that the method provides phylogenetic analyses that are consistent with other widely accepted phylogenetic methods. However, AFLP analysis provides a more detailed analysis of the targets and samples a much larger portion of the genome. Consequently, it provides an inexpensive, rapid means of characterizing microbial isolates to further differentiate among strains and closely related microbial species. Such information cannot be rapidly generated by other means. AFLP sample analysis quickly generates a very large amount of molecular information about microbial genomes. However, this information cannot be analyzed rapidly using manual methods. The authors are developing a large archive of electronic AFLP signatures that is being used to identify isolates collected from medical, veterinary, forensic and environmental samples. They are also developing the computational packages necessary to rapidly and unambiguously analyze the AFLP profiles and conduct a phylogenetic comparison of these data relative to information already in the database. They will use this archive and the associated algorithms to determine the species identity of previously uncharacterized isolates and place them phylogenetically relative to other microbes based on their AFLP signatures. This study provides significant new information about microbes with environmental, veterinary and medical significance. This information can be used in further studies to understand the relationships among these species and the factors that distinguish them from one another. It should also allow identification of unique factors that contribute to important microbial traits including pathogenicity and virulence. They are also using AFLP data to identify, isolate and sequence DNA fragments that are unique to particular microbial species and strains. The fragment patterns and sequence information provide insights into the complexity and organization of bacterial genomes relative to one another. They also provide the information necessary for development of species-specific PCR primers that can be used to interrogate complex samples for the presence of B. anthracis, other microbial pathogens or their remnants.« less
Study of recreational land and open space using Skylab imagery

NASA Technical Reports Server (NTRS)

Sattinger, I. J. (Principal Investigator)

1975-01-01

The author has identified the following significant results. An analysis of the statistical uniqueness of each of the signatures of the Gratiot-Saginaw State Game Area was made by computing a matrix of probabilities of misclassification for all possible signature pairs. Within each data set, the 35 signatures were then aggregated into a smaller set of composite signatures by combining groups of signatures having high probabilities of misclassification. Computer separation of forest denisty classes was poor with multispectral scanner data collected on 5 August 1973. Signatures from the scanner data were further analyzed to determine the ranking of spectral channels for computer separation of the scene classes. Probabilities of misclassification were computed for composite signatures using four separate combinations of data source and channel selection.
A genomic copy number signature predicts radiation exposure in post-Chernobyl breast cancer.

PubMed

Wilke, Christina M; Braselmann, Herbert; Hess, Julia; Klymenko, Sergiy V; Chumak, Vadim V; Zakhartseva, Liubov M; Bakhanova, Elena V; Walch, Axel K; Selmansberger, Martin; Samaga, Daniel; Weber, Peter; Schneider, Ludmila; Fend, Falko; Bösmüller, Hans C; Zitzelsberger, Horst; Unger, Kristian

2018-04-16

Breast cancer is the second leading cause of cancer death among women worldwide and besides life style, age and genetic risk factors, exposure to ionizing radiation is known to increase the risk for breast cancer. Further, DNA copy number alterations (CNAs), which can result from radiation-induced double-strand breaks, are frequently occurring in breast cancer cells. We set out to identify a signature of CNAs discriminating breast cancers from radiation-exposed and non-exposed female patients. We analyzed resected breast cancer tissues from 68 exposed female Chernobyl clean-up workers and evacuees and 68 matched non-exposed control patients for CNAs by array comparative genomic hybridization analysis (aCGH). Using a stepwise forward-backward selection approach a non-complex CNA signature, that is, less than ten features, was identified in the training data set, which could be subsequently validated in the validation data set (p value < 0.05). The signature consisted of nine copy number regions located on chromosomal bands 7q11.22-11.23, 7q21.3, 16q24.3, 17q21.31, 20p11.23-11.21, 1p21.1, 2q35, 2q35, 6p22.2. The signature was independent of any clinical characteristics of the patients. In all, we identified a CNA signature that has the potential to allow identification of radiation-associated breast cancer at the individual level. © 2018 UICC.
Mutation signatures of carcinogen exposure: genome-wide detection and new opportunities for cancer prevention

PubMed Central

2014-01-01

Exposure to environmental mutagens is an important cause of human cancer, and measures to reduce mutagenic and carcinogenic exposures have been highly successful at controlling cancer. Until recently, it has been possible to connect the chemical characteristics of mutagens to actual mutations observed in human tumors only indirectly. Now, next-generation sequencing technology enables us to observe in detail the DNA-sequence-level effects of well-known mutagens, such as ultraviolet radiation and tobacco smoke, as well as endogenous mutagenic processes, such as those involving activated DNA cytidine deaminases (APOBECs). We can also observe the effects of less well-known but potent mutagens, including those recently found to be present in some herbal remedies. Crucially, we can now tease apart the superimposed effects of several mutational exposures and processes and determine which ones occurred during the development of individual tumors. Here, we review advances in detecting these mutation signatures and discuss the implications for surveillance and prevention of cancer. The number of sequenced tumors from diverse cancer types and multiple geographic regions is growing explosively, and the genomes of these tumors will bear the signatures of even more diverse mutagenic exposures. Thus, we envision development of wide-ranging compendia of mutation signatures from tumors and a concerted effort to experimentally elucidate the signatures of a large number of mutagens. This information will be used to link signatures observed in tumors to the exposures responsible for them, which will offer unprecedented opportunities for prevention. PMID:25031618
Phylogenetic Framework and Molecular Signatures for the Main Clades of the Phylum Actinobacteria

PubMed Central

Gao, Beile

2012-01-01

Summary: The phylum Actinobacteria harbors many important human pathogens and also provides one of the richest sources of natural products, including numerous antibiotics and other compounds of biotechnological interest. Thus, a reliable phylogeny of this large phylum and the means to accurately identify its different constituent groups are of much interest. Detailed phylogenetic and comparative analyses of >150 actinobacterial genomes reported here form the basis for achieving these objectives. In phylogenetic trees based upon 35 conserved proteins, most of the main groups of Actinobacteria as well as a number of their superageneric clades are resolved. We also describe large numbers of molecular markers consisting of conserved signature indels in protein sequences and whole proteins that are specific for either all Actinobacteria or their different clades (viz., orders, families, genera, and subgenera) at various taxonomic levels. These signatures independently support the existence of different phylogenetic clades, and based upon them, it is now possible to delimit the phylum Actinobacteria (excluding Coriobacteriia) and most of its major groups in clear molecular terms. The species distribution patterns of these markers also provide important information regarding the interrelationships among different main orders of Actinobacteria. The identified molecular markers, in addition to enabling the development of a stable and reliable phylogenetic framework for this phylum, also provide novel and powerful means for the identification of different groups of Actinobacteria in diverse environments. Genetic and biochemical studies on these Actinobacteria-specific markers should lead to the discovery of novel biochemical and/or other properties that are unique to different groups of Actinobacteria. PMID:22390973
Comparing Patterns of Natural Selection across Species Using Selective Signatures

PubMed Central

Shapiro, B. Jesse; Alm, Eric J

2008-01-01

Comparing gene expression profiles over many different conditions has led to insights that were not obvious from single experiments. In the same way, comparing patterns of natural selection across a set of ecologically distinct species may extend what can be learned from individual genome-wide surveys. Toward this end, we show how variation in protein evolutionary rates, after correcting for genome-wide effects such as mutation rate and demographic factors, can be used to estimate the level and types of natural selection acting on genes across different species. We identify unusually rapidly and slowly evolving genes, relative to empirically derived genome-wide and gene family-specific background rates for 744 core protein families in 30 γ-proteobacterial species. We describe the pattern of fast or slow evolution across species as the “selective signature” of a gene. Selective signatures represent a profile of selection across species that is predictive of gene function: pairs of genes with correlated selective signatures are more likely to share the same cellular function, and genes in the same pathway can evolve in concert. For example, glycolysis and phenylalanine metabolism genes evolve rapidly in Idiomarina loihiensis, mirroring an ecological shift in carbon source from sugars to amino acids. In a broader context, our results suggest that the genomic landscape is organized into functional modules even at the level of natural selection, and thus it may be easier than expected to understand the complex evolutionary pressures on a cell. PMID:18266472
Systems Biology Methods for Alzheimer's Disease Research Toward Molecular Signatures, Subtypes, and Stages and Precision Medicine: Application in Cohort Studies and Trials.

PubMed

Castrillo, Juan I; Lista, Simone; Hampel, Harald; Ritchie, Craig W

2018-01-01

Alzheimer's disease (AD) is a complex multifactorial disease, involving a combination of genomic, interactome, and environmental factors, with essential participation of (a) intrinsic genomic susceptibility and (b) a constant dynamic interplay between impaired pathways and central homeostatic networks of nerve cells. The proper investigation of the complexity of AD requires new holistic systems-level approaches, at both the experimental and computational level. Systems biology methods offer the potential to unveil new fundamental insights, basic mechanisms, and networks and their interplay. These may lead to the characterization of mechanism-based molecular signatures, and AD hallmarks at the earliest molecular and cellular levels (and beyond), for characterization of AD subtypes and stages, toward targeted interventions according to the evolving precision medicine paradigm. In this work, an update on advanced systems biology methods and strategies for holistic studies of multifactorial diseases-particularly AD-is presented. This includes next-generation genomics, neuroimaging and multi-omics methods, experimental and computational approaches, relevant disease models, and latest genome editing and single-cell technologies. Their progressive incorporation into basic research, cohort studies, and trials is beginning to provide novel insights into AD essential mechanisms, molecular signatures, and markers toward mechanism-based classification and staging, and tailored interventions. Selected methods which can be applied in cohort studies and trials, with the European Prevention of Alzheimer's Dementia (EPAD) project as a reference example, are presented and discussed.
The role of protozoa-driven selection in shaping human genetic variability.

PubMed

Pozzoli, Uberto; Fumagalli, Matteo; Cagliani, Rachele; Comi, Giacomo P; Bresolin, Nereo; Clerici, Mario; Sironi, Manuela

2010-03-01

Protozoa exert a strong selective pressure in humans. The selection signatures left by these pathogens can be exploited to identify genetic modulators of infection susceptibility. We show that protozoa diversity in different geographic locations is a good measure of protozoa-driven selective pressure; protozoa diversity captured selection signatures at known malaria resistance loci and identified several selected single nucleotide polymorphisms in immune and hemolytic anemia genes. A genome-wide search enabled us to identify 5180 variants mapping to 1145 genes that are subjected to protozoa-driven selective pressure. We provide a genome-wide estimate of protozoa-driven selective pressure and identify candidate susceptibility genes for protozoa-borne diseases. Copyright 2010 Elsevier Ltd. All rights reserved.
A Polyglot Approach to Bioinformatics Data Integration: A Phylogenetic Analysis of HIV-1

PubMed Central

Reisman, Steven; Hatzopoulos, Thomas; Läufer, Konstantin; Thiruvathukal, George K.; Putonti, Catherine

2016-01-01

As sequencing technologies continue to drop in price and increase in throughput, new challenges emerge for the management and accessibility of genomic sequence data. We have developed a pipeline for facilitating the storage, retrieval, and subsequent analysis of molecular data, integrating both sequence and metadata. Taking a polyglot approach involving multiple languages, libraries, and persistence mechanisms, sequence data can be aggregated from publicly available and local repositories. Data are exposed in the form of a RESTful web service, formatted for easy querying, and retrieved for downstream analyses. As a proof of concept, we have developed a resource for annotated HIV-1 sequences. Phylogenetic analyses were conducted for >6,000 HIV-1 sequences revealing spatial and temporal factors influence the evolution of the individual genes uniquely. Nevertheless, signatures of origin can be extrapolated even despite increased globalization. The approach developed here can easily be customized for any species of interest. PMID:26819543
Making the Bend: DNA Tertiary Structure and Protein-DNA Interactions

PubMed Central

Harteis, Sabrina; Schneider, Sabine

2014-01-01

DNA structure functions as an overlapping code to the DNA sequence. Rapid progress in understanding the role of DNA structure in gene regulation, DNA damage recognition and genome stability has been made. The three dimensional structure of both proteins and DNA plays a crucial role for their specific interaction, and proteins can recognise the chemical signature of DNA sequence (“base readout”) as well as the intrinsic DNA structure (“shape recognition”). These recognition mechanisms do not exist in isolation but, depending on the individual interaction partners, are combined to various extents. Driving force for the interaction between protein and DNA remain the unique thermodynamics of each individual DNA-protein pair. In this review we focus on the structures and conformations adopted by DNA, both influenced by and influencing the specific interaction with the corresponding protein binding partner, as well as their underlying thermodynamics. PMID:25026169
Convergent evolution of marine mammals is associated with distinct substitutions in common genes

PubMed Central

Zhou, Xuming; Seim, Inge; Gladyshev, Vadim N.

2015-01-01

Phenotypic convergence is thought to be driven by parallel substitutions coupled with natural selection at the sequence level. Multiple independent evolutionary transitions of mammals to an aquatic environment offer an opportunity to test this thesis. Here, whole genome alignment of coding sequences identified widespread parallel amino acid substitutions in marine mammals; however, the majority of these changes were not unique to these animals. Conversely, we report that candidate aquatic adaptation genes, identified by signatures of likelihood convergence and/or elevated ratio of nonsynonymous to synonymous nucleotide substitution rate, are characterized by very few parallel substitutions and exhibit distinct sequence changes in each group. Moreover, no significant positive correlation was found between likelihood convergence and positive selection in all three marine lineages. These results suggest that convergence in protein coding genes associated with aquatic lifestyle is mainly characterized by independent substitutions and relaxed negative selection. PMID:26549748
Immune signatures of protective spleen memory CD8 T cells.

PubMed

Brinza, Lilia; Djebali, Sophia; Tomkowiak, Martine; Mafille, Julien; Loiseau, Céline; Jouve, Pierre-Emmanuel; de Bernard, Simon; Buffat, Laurent; Lina, Bruno; Ottmann, Michèle; Rosa-Calatrava, Manuel; Schicklin, Stéphane; Bonnefoy, Nathalie; Lauvau, Grégoire; Grau, Morgan; Wencker, Mélanie; Arpin, Christophe; Walzer, Thierry; Leverrier, Yann; Marvel, Jacqueline

2016-11-24

Memory CD8 T lymphocyte populations are remarkably heterogeneous and differ in their ability to protect the host. In order to identify the whole range of qualities uniquely associated with protective memory cells we compared the gene expression signatures of two qualities of memory CD8 T cells sharing the same antigenic-specificity: protective (Influenza-induced, Flu-TM) and non-protective (peptide-induced, TIM) spleen memory CD8 T cells. Although Flu-TM and TIM express classical phenotypic memory markers and are polyfunctional, only Flu-TM protects against a lethal viral challenge. Protective memory CD8 T cells express a unique set of genes involved in migration and survival that correlate with their unique capacity to rapidly migrate within the infected lung parenchyma in response to influenza infection. We also enlighten a new set of poised genes expressed by protective cells that is strongly enriched in cytokines and chemokines such as Ccl1, Ccl9 and Gm-csf. CCL1 and GM-CSF genes are also poised in human memory CD8 T cells. These immune signatures are also induced by two other pathogens (vaccinia virus and Listeria monocytogenes). The immune signatures associated with immune protection were identified on circulating cells, i.e. those that are easily accessible for immuno-monitoring and could help predict vaccines efficacy.
Establishment and Characterization of Novel Human Primary and Metastatic Anaplastic Thyroid Cancer Cell Lines and Their Genomic Evolution Over a Year as a Primagraft

PubMed Central

Okamoto, Ryoko; Nagata, Yasunobu; Kanojia, Deepika; Venkatesan, Subhashree; M. T., Anand; Braunstein, Glenn D.; Said, Jonathan W.; Doan, Ngan B.; Ho, Quoc; Akagi, Tadayuki; Gery, Sigal; Liu, Li-zhen; Tan, Kar Tong; Chng, Wee Joo; Yang, Henry; Ogawa, Seishi; Koeffler, H. Phillip

2015-01-01

Context: Anaplastic thyroid cancer (ATC) has no effective treatment, resulting in a high rate of mortality. We established cell lines from a primary ATC and its lymph node metastasis, and investigated the molecular factors and genomic changes associated with tumor growth. Objective: The aim of the study was to understand the molecular and genomic changes of highly aggressive ATC and its clonal evolution to develop rational therapies. Design: We established unique cell lines from primary (OGK-P) and metastatic (OGK-M) ATC specimen, as well as primagraft from the metastatic ATC, which was serially xeno-transplanted for more than 1 year in NOD scid gamma mice were established. These cell lines and primagraft were used as tools to examine gene expression, copy number changes, and somatic mutations using RNA array, SNP Chip, and whole exome sequencing. Results: Mice carrying sc (OGK-P and OGK-M) tumors developed splenomegaly and neutrophilia with high expression of cytokines including CSF1, CSF2, CSF3, IL-1β, and IL-6. Levels of HIF-1α and its targeted genes were also elevated in these tumors. The treatment of tumor carrying mice with Bevacizumab effectively decreased tumor growth, macrophage infiltration, and peripheral WBCs. SNP chip analysis showed homozygous deletion of exons 3–22 of the PARD3 gene in the cells. Forced expression of PARD3 decreased cell proliferation, motility, and invasiveness, restores cell-cell contacts and enhanced cell adhesion. Next generation exome sequencing identified the somatic changes present in the primary, metastatic, and primagraft tumors demonstrating evolution of the mutational signature over the year of passage in vivo. Conclusion: To our knowledge, we established the first paired human primary and metastatic ATC cell lines offering unique possibilities for comparative functional investigations in vitro and in vivo. Our exome sequencing also identified novel mutations, as well as clonal evolution in both the metastasis and primagraft. PMID:25365311
Identification of a lineage specific zinc responsive genomic island in Mycobacterium avium ssp. paratuberculosis.

PubMed

Eckelt, Elke; Jarek, Michael; Frömke, Cornelia; Meens, Jochen; Goethe, Ralph

2014-12-06

Maintenance of metal homeostasis is crucial in bacterial pathogenicity as metal starvation is the most important mechanism in the nutritional immunity strategy of host cells. Thus, pathogenic bacteria have evolved sensitive metal scavenging systems to overcome this particular host defence mechanism. The ruminant pathogen Mycobacterium avium ssp. paratuberculosis (MAP) displays a unique gut tropism and causes a chronic progressive intestinal inflammation. MAP possesses eight conserved lineage specific large sequence polymorphisms (LSP), which distinguish MAP from its ancestral M. avium ssp. hominissuis or other M. avium subspecies. LSP14 and LSP15 harbour many genes proposed to be involved in metal homeostasis and have been suggested to substitute for a MAP specific, impaired mycobactin synthesis. In the present study, we found that a LSP14 located putative IrtAB-like iron transporter encoded by mptABC was induced by zinc but not by iron starvation. Heterologous reporter gene assays with the lacZ gene under control of the mptABC promoter in M. smegmatis (MSMEG) and in a MSMEG∆furB deletion mutant revealed a zinc dependent, metalloregulator FurB mediated expression of mptABC via a conserved mycobacterial FurB recognition site. Deep sequencing of RNA from MAP cultures treated with the zinc chelator TPEN revealed that 70 genes responded to zinc limitation. Remarkably, 45 of these genes were located on a large genomic island of approximately 90 kb which harboured LSP14 and LSP15. Thirty-five of these genes were predicted to be controlled by FurB, due to the presence of putative binding sites. This clustering of zinc responsive genes was exclusively found in MAP and not in other mycobacteria. Our data revealed a particular genomic signature for MAP given by a unique zinc specific locus, thereby suggesting an exceptional relevance of zinc for the metabolism of MAP. MAP seems to be well adapted to maintain zinc homeostasis which might contribute to the peculiarity of MAP pathogenicity.
Effect of Artificial Selection on Runs of Homozygosity in U.S. Holstein Cattle

PubMed Central

Kim, Eui-Soo; Cole, John B.; Huson, Heather; Wiggans, George R.; Van Tassell, Curtis P.; Crooker, Brian A.; Liu, George; Da, Yang; Sonstegard, Tad S.

2013-01-01

The intensive selection programs for milk made possible by mass artificial insemination increased the similarity among the genomes of North American (NA) Holsteins tremendously since the 1960s. This migration of elite alleles has caused certain regions of the genome to have runs of homozygosity (ROH) occasionally spanning millions of continuous base pairs at a specific locus. In this study, genome signatures of artificial selection in NA Holsteins born between 1953 and 2008 were identified by comparing changes in ROH between three distinct groups under different selective pressure for milk production. The ROH regions were also used to estimate the inbreeding coefficients. The comparisons of genomic autozygosity between groups selected or unselected since 1964 for milk production revealed significant differences with respect to overall ROH frequency and distribution. These results indicate selection has increased overall autozygosity across the genome, whereas the autozygosity in an unselected line has not changed significantly across most of the chromosomes. In addition, ROH distribution was more variable across the genomes of selected animals in comparison to a more even ROH distribution for unselected animals. Further analysis of genome-wide autozygosity changes and the association between traits and haplotypes identified more than 40 genomic regions under selection on several chromosomes (Chr) including Chr 2, 7, 16 and 20. Many of these selection signatures corresponded to quantitative trait loci for milk, fat, and protein yield previously found in contemporary Holsteins. PMID:24348915
Extraction and analysis of signatures from the Gene Expression Omnibus by the crowd

PubMed Central

Wang, Zichen; Monteiro, Caroline D.; Jagodnik, Kathleen M.; Fernandez, Nicolas F.; Gundersen, Gregory W.; Rouillard, Andrew D.; Jenkins, Sherry L.; Feldmann, Axel S.; Hu, Kevin S.; McDermott, Michael G.; Duan, Qiaonan; Clark, Neil R.; Jones, Matthew R.; Kou, Yan; Goff, Troy; Woodland, Holly; Amaral, Fabio M R.; Szeto, Gregory L.; Fuchs, Oliver; Schüssler-Fiorenza Rose, Sophia M.; Sharma, Shvetank; Schwartz, Uwe; Bausela, Xabier Bengoetxea; Szymkiewicz, Maciej; Maroulis, Vasileios; Salykin, Anton; Barra, Carolina M.; Kruth, Candice D.; Bongio, Nicholas J.; Mathur, Vaibhav; Todoric, Radmila D; Rubin, Udi E.; Malatras, Apostolos; Fulp, Carl T.; Galindo, John A.; Motiejunaite, Ruta; Jüschke, Christoph; Dishuck, Philip C.; Lahl, Katharina; Jafari, Mohieddin; Aibar, Sara; Zaravinos, Apostolos; Steenhuizen, Linda H.; Allison, Lindsey R.; Gamallo, Pablo; de Andres Segura, Fernando; Dae Devlin, Tyler; Pérez-García, Vicente; Ma'ayan, Avi

2016-01-01

Gene expression data are accumulating exponentially in public repositories. Reanalysis and integration of themed collections from these studies may provide new insights, but requires further human curation. Here we report a crowdsourcing project to annotate and reanalyse a large number of gene expression profiles from Gene Expression Omnibus (GEO). Through a massive open online course on Coursera, over 70 participants from over 25 countries identify and annotate 2,460 single-gene perturbation signatures, 839 disease versus normal signatures, and 906 drug perturbation signatures. All these signatures are unique and are manually validated for quality. Global analysis of these signatures confirms known associations and identifies novel associations between genes, diseases and drugs. The manually curated signatures are used as a training set to develop classifiers for extracting similar signatures from the entire GEO repository. We develop a web portal to serve these signatures for query, download and visualization. PMID:27667448
Extraction and analysis of signatures from the Gene Expression Omnibus by the crowd.

PubMed

Wang, Zichen; Monteiro, Caroline D; Jagodnik, Kathleen M; Fernandez, Nicolas F; Gundersen, Gregory W; Rouillard, Andrew D; Jenkins, Sherry L; Feldmann, Axel S; Hu, Kevin S; McDermott, Michael G; Duan, Qiaonan; Clark, Neil R; Jones, Matthew R; Kou, Yan; Goff, Troy; Woodland, Holly; Amaral, Fabio M R; Szeto, Gregory L; Fuchs, Oliver; Schüssler-Fiorenza Rose, Sophia M; Sharma, Shvetank; Schwartz, Uwe; Bausela, Xabier Bengoetxea; Szymkiewicz, Maciej; Maroulis, Vasileios; Salykin, Anton; Barra, Carolina M; Kruth, Candice D; Bongio, Nicholas J; Mathur, Vaibhav; Todoric, Radmila D; Rubin, Udi E; Malatras, Apostolos; Fulp, Carl T; Galindo, John A; Motiejunaite, Ruta; Jüschke, Christoph; Dishuck, Philip C; Lahl, Katharina; Jafari, Mohieddin; Aibar, Sara; Zaravinos, Apostolos; Steenhuizen, Linda H; Allison, Lindsey R; Gamallo, Pablo; de Andres Segura, Fernando; Dae Devlin, Tyler; Pérez-García, Vicente; Ma'ayan, Avi

2016-09-26

Gene expression data are accumulating exponentially in public repositories. Reanalysis and integration of themed collections from these studies may provide new insights, but requires further human curation. Here we report a crowdsourcing project to annotate and reanalyse a large number of gene expression profiles from Gene Expression Omnibus (GEO). Through a massive open online course on Coursera, over 70 participants from over 25 countries identify and annotate 2,460 single-gene perturbation signatures, 839 disease versus normal signatures, and 906 drug perturbation signatures. All these signatures are unique and are manually validated for quality. Global analysis of these signatures confirms known associations and identifies novel associations between genes, diseases and drugs. The manually curated signatures are used as a training set to develop classifiers for extracting similar signatures from the entire GEO repository. We develop a web portal to serve these signatures for query, download and visualization.
Whole Genome Characterization, Phylogenetic and Genome Signature Analysis of Human Pandemic H1N1 Virus in Thailand, 2009–2012

PubMed Central

Makkoch, Jarika; Suwannakarn, Kamol; Payungporn, Sunchai; Prachayangprecha, Slinporn; Cheiocharnsin, Thaweesak; Linsuwanon, Piyada; Theamboonlers, Apiradee; Poovorawan, Yong

2012-01-01

Background Three waves of human pandemic influenza occurred in Thailand in 2009–2012. The genome signature features and evolution of pH1N1 need to be characterized to elucidate the aspects responsible for the multiple waves of pandemic. Methodology/Findings Forty whole genome sequences and 584 partial sequences of pH1N1 circulating in Thailand, divided into 1st, 2nd and 3rd wave and post-pandemic were characterized and 77 genome signatures were analyzed. Phylogenetic trees of concatenated whole genome and HA gene sequences were constructed calculating substitution rate and dN/dS of each gene. Phylogenetic analysis showed a distinct pattern of pH1N1 circulation in Thailand, with the first two isolates from May, 2009 belonging to clade 5 while clades 5, 6 and 7 co-circulated during the first wave of pH1N1 pandemic in Thailand. Clade 8 predominated during the second wave and different proportions of the pH1N1 viruses circulating during the third wave and post pandemic period belonged to clades 8, 11.1 and 11.2. The mutation analysis of pH1N1 revealed many adaptive mutations which have become the signature of each clade and may be responsible for the multiple pandemic waves in Thailand, especially with regard to clades 11.1 and 11.2 as evidenced with V731I, G154D of PB1 gene, PA I330V, HA A214T S160G and S202T. The substitution rate of pH1N1 in Thailand ranged from 2.53×10−3±0.02 (M2 genes) to 5.27×10−3±0.03 per site per year (NA gene). Conclusions All results suggested that this virus is still adaptive, maybe to evade the host's immune response and tends to remain in the human host although the dN/dS were under purifying selection in all 8 genes. Due to the gradual evolution of pH1N1 in Thailand, continuous monitoring is essential for evaluation and surveillance to be prepared for and able to control future influenza activities. PMID:23251479
Bat Biology, Genomes, and the Bat1K Project: To Generate Chromosome-Level Genomes for All Living Bat Species.

PubMed

Teeling, Emma C; Vernes, Sonja C; Dávalos, Liliana M; Ray, David A; Gilbert, M Thomas P; Myers, Eugene

2018-02-15

Bats are unique among mammals, possessing some of the rarest mammalian adaptations, including true self-powered flight, laryngeal echolocation, exceptional longevity, unique immunity, contracted genomes, and vocal learning. They provide key ecosystem services, pollinating tropical plants, dispersing seeds, and controlling insect pest populations, thus driving healthy ecosystems. They account for more than 20% of all living mammalian diversity, and their crown-group evolutionary history dates back to the Eocene. Despite their great numbers and diversity, many species are threatened and endangered. Here we announce Bat1K, an initiative to sequence the genomes of all living bat species (n∼1,300) to chromosome-level assembly. The Bat1K genome consortium unites bat biologists (>148 members as of writing), computational scientists, conservation organizations, genome technologists, and any interested individuals committed to a better understanding of the genetic and evolutionary mechanisms that underlie the unique adaptations of bats. Our aim is to catalog the unique genetic diversity present in all living bats to better understand the molecular basis of their unique adaptations; uncover their evolutionary history; link genotype with phenotype; and ultimately better understand, promote, and conserve bats. Here we review the unique adaptations of bats and highlight how chromosome-level genome assemblies can uncover the molecular basis of these traits. We present a novel sequencing and assembly strategy and review the striking societal and scientific benefits that will result from the Bat1K initiative.
Chemicals from the Practice of Healthcare: Challenges and Unknowns Posed by Residues in the Environment

EPA Science Inventory

Medications have unique signatures - real and metaphorical fingerprints, footprints, and shadows. Signatures imparted by manufacturers use distinctive combinations of shapes, colors, and imprints. These serve as rough first tests to aid in visually identifying the types and quant...

Identification of positive selection signatures in pigs by comparing linkage disequilibrium variances.

PubMed

Li, X; Yang, S; Dong, K; Tang, Z; Li, K; Fan, B; Wang, Z; Liu, B

2017-10-01

Selection affects the patterns of linkage disequilibrium (LD) around the site of a beneficial allele with an increase in LD among the hitchhiking alleles. Comparing the differences in regional LD between pig populations could help to identify putative genomic regions with potential adaptations for economic traits. In this study, using Illumina Porcine SNP60K BeadChip genotyping data from 207 Chinese indigenous, 117 South American village and 408 Large White pigs, we estimated the variation of genome-wide LD between populations using the varld program. The top 0.1% standardized VarLD scores were used as a criterion for all comparisons, and compared with LD blocks, a total of four selection signatures on Sus scrofa chromosome (SSC) 7, 9, 13 and 14 were identified in all populations. These signatures overlapped with quantitative trait loci for linoleic acid content, age at puberty, number of muscle fibers per unit area, hip structure and body weight traits in pigs. Among them, one of the signatures (56.5-56.6 Mb on SSC7) in Large White pigs harbored the ADAMTSL3 gene, which is known to affect body length. The findings of this study seem to point toward recent selection in different pig populations. Further investigations are encouraged to confirm the selection signatures detected by varld in the present study. © 2017 Stichting International Foundation for Animal Genetics.
Population genomics of Fusarium graminearum reveals signatures of divergent evolution within a major cereal pathogen

USDA-ARS?s Scientific Manuscript database

The cereal pathogen Fusarium graminearum is the primary cause of Fusarium head blight (FHB) and a significant threat to food safety and crop production. To elucidate population structure and identify genomic targets of selection within major FHB pathogen populations in North America we sequenced the...
Recent artificial selection in U.S. Jersey cattle impacts autozygosity levels of specific genomic regions

USDA-ARS?s Scientific Manuscript database

Genome signatures of artificial selection in U.S. Jersey cattle were identified by examining changes in haplotype homozygosity for a resource population of animals born between 1962 and 2005. Genetic merit of this population changed dramatically during this period for a number of traits, especially ...
Signatures of adaptation in the weedy rice genome

USDA-ARS?s Scientific Manuscript database

Weedy rice is a common problem of by product of domestication that has evolved multiple times from cultivated and wild rice relatives. Here we use whole genome sequences to examine the origin and adaptation of the two major US weedy red rice strains, with a comparison to Chinese weedy red rice. We f...
Comparative analysis of viral RNA signatures on different RIG-I-like receptors

PubMed Central

Sanchez David, Raul Y; Combredet, Chantal; Sismeiro, Odile; Dillies, Marie-Agnès; Jagla, Bernd; Coppée, Jean-Yves; Mura, Marie; Guerbois Galla, Mathilde; Despres, Philippe; Tangy, Frédéric; Komarova, Anastassia V

2016-01-01

The RIG-I-like receptors (RLRs) play a major role in sensing RNA virus infection to initiate and modulate antiviral immunity. They interact with particular viral RNAs, most of them being still unknown. To decipher the viral RNA signature on RLRs during viral infection, we tagged RLRs (RIG-I, MDA5, LGP2) and applied tagged protein affinity purification followed by next-generation sequencing (NGS) of associated RNA molecules. Two viruses with negative- and positive-sense RNA genome were used: measles (MV) and chikungunya (CHIKV). NGS analysis revealed that distinct regions of MV genome were specifically recognized by distinct RLRs: RIG-I recognized defective interfering genomes, whereas MDA5 and LGP2 specifically bound MV nucleoprotein-coding region. During CHIKV infection, RIG-I associated specifically to the 3’ untranslated region of viral genome. This study provides the first comparative view of the viral RNA ligands for RIG-I, MDA5 and LGP2 in the presence of infection. DOI: http://dx.doi.org/10.7554/eLife.11275.001 PMID:27011352
An FDA Perspective on the Regulatory Implications of Complex Signatures to Predict Response to Targeted Therapies

PubMed Central

Beaver, Julia A.; Tzou, Abraham; Blumenthal, Gideon M.; McKee, Amy E.; Kim, Geoffrey; Pazdur, Richard; Philip, Reena

2016-01-01

As technologies evolve, and diagnostics move from detection of single biomarkers toward complex signatures, an increase in the clinical use and regulatory submission of complex signatures is anticipated. However, to date, no complex signatures have been approved as companion diagnostics. In this article, we will describe the potential benefit of complex signatures and their unique regulatory challenges including analytical performance validation, complex signature simulation, and clinical performance evaluation. We also will review the potential regulatory pathways for clearance, approval, or acceptance of complex signatures by the U.S. Food and Drug Administration (FDA). These regulatory pathways include regulations applicable to in vitro diagnostic devices, including companion diagnostic devices, the potential for labeling as a complementary diagnostic, and the biomarker qualification program. PMID:27993967
A mutational signature reveals alterations underlying deficient homologous recombination repair in breast cancer.

PubMed

Polak, Paz; Kim, Jaegil; Braunstein, Lior Z; Karlic, Rosa; Haradhavala, Nicholas J; Tiao, Grace; Rosebrock, Daniel; Livitz, Dimitri; Kübler, Kirsten; Mouw, Kent W; Kamburov, Atanas; Maruvka, Yosef E; Leshchiner, Ignaty; Lander, Eric S; Golub, Todd R; Zick, Aviad; Orthwein, Alexandre; Lawrence, Michael S; Batra, Rajbir N; Caldas, Carlos; Haber, Daniel A; Laird, Peter W; Shen, Hui; Ellisen, Leif W; D'Andrea, Alan D; Chanock, Stephen J; Foulkes, William D; Getz, Gad

2017-10-01

Biallelic inactivation of BRCA1 or BRCA2 is associated with a pattern of genome-wide mutations known as signature 3. By analyzing ∼1,000 breast cancer samples, we confirmed this association and established that germline nonsense and frameshift variants in PALB2, but not in ATM or CHEK2, can also give rise to the same signature. We were able to accurately classify missense BRCA1 or BRCA2 variants known to impair homologous recombination (HR) on the basis of this signature. Finally, we show that epigenetic silencing of RAD51C and BRCA1 by promoter methylation is strongly associated with signature 3 and, in our data set, was highly enriched in basal-like breast cancers in young individuals of African descent.
Genomic signatures among Oncorhynchus nerka ecotypes to inform conservation and management of endangered Sockeye Salmon.

PubMed

Nichols, Krista M; Kozfkay, Christine C; Narum, Shawn R

2016-12-01

Conservation of life history variation is an important consideration for many species with trade-offs in migratory characteristics. Many salmonid species exhibit both resident and migratory strategies that capitalize on benefits in freshwater and marine environments. In this study, we investigated genomic signatures for migratory life history in collections of resident and anadromous Oncorhynchus nerka (Kokanee and Sockeye Salmon, respectively) from two lake systems, using ~2,600 SNPs from restriction-site-associated DNA sequencing (RAD-seq). Differing demographic histories were evident in the two systems where one pair was significantly differentiated (Redfish Lake, F ST = 0.091 [95% confidence interval: 0.087 to 0.095]) but the other pair was not (Alturas Lake, F ST = -0.007 [-0.008 to -0.006]). Outlier and association analyses identified several candidate markers in each population pair, but there was limited evidence for parallel signatures of genomic variation associated with migration. Despite lack of evidence for consistent markers associated with migratory life history in this species, candidate markers were mapped to functional genes and provide evidence for adaptive genetic variation within each lake system. Life history variation has been maintained in these nearly extirpated populations of O. nerka, and conservation efforts to preserve this diversity are important for long-term resiliency of this species.
Breast cancer genome and transcriptome integration implicates specific mutational signatures with immune cell infiltration

PubMed Central

Smid, Marcel; Rodríguez-González, F. Germán; Sieuwerts, Anieta M.; Salgado, Roberto; Prager-Van der Smissen, Wendy J. C.; Vlugt-Daane, Michelle van der; van Galen, Anne; Nik-Zainal, Serena; Staaf, Johan; Brinkman, Arie B.; van de Vijver, Marc J.; Richardson, Andrea L.; Fatima, Aquila; Berentsen, Kim; Butler, Adam; Martin, Sancha; Davies, Helen R.; Debets, Reno; Gelder, Marion E. Meijer-Van; van Deurzen, Carolien H. M.; MacGrogan, Gaëtan; Van den Eynden, Gert G. G. M.; Purdie, Colin; Thompson, Alastair M.; Caldas, Carlos; Span, Paul N.; Simpson, Peter T.; Lakhani, Sunil R.; Van Laere, Steven; Desmedt, Christine; Ringnér, Markus; Tommasi, Stefania; Eyford, Jorunn; Broeks, Annegien; Vincent-Salomon, Anne; Futreal, P. Andrew; Knappskog, Stian; King, Tari; Thomas, Gilles; Viari, Alain; Langerød, Anita; Børresen-Dale, Anne-Lise; Birney, Ewan; Stunnenberg, Hendrik G.; Stratton, Mike; Foekens, John A.; Martens, John W. M.

2016-01-01

A recent comprehensive whole genome analysis of a large breast cancer cohort was used to link known and novel drivers and substitution signatures to the transcriptome of 266 cases. Here, we validate that subtype-specific aberrations show concordant expression changes for, for example, TP53, PIK3CA, PTEN, CCND1 and CDH1. We find that CCND3 expression levels do not correlate with amplification, while increased GATA3 expression in mutant GATA3 cancers suggests GATA3 is an oncogene. In luminal cases the total number of substitutions, irrespective of type, associates with cell cycle gene expression and adverse outcome, whereas the number of mutations of signatures 3 and 13 associates with immune-response specific gene expression, increased numbers of tumour-infiltrating lymphocytes and better outcome. Thus, while earlier reports imply that the sheer number of somatic aberrations could trigger an immune-response, our data suggests that substitutions of a particular type are more effective in doing so than others. PMID:27666519
Immunologic and Virologic Mechanisms for Partial Protection from Intravenous Challenge by an Integration-Defective SIV Vaccine †

PubMed Central

Wang, Chu; Jiang, Chunlai; Gao, Nan; Zhang, Kaikai; Liu, Donglai; Wang, Wei; Cong, Zhe; Qin, Chuan; Ganusov, Vitaly V.; Ferrari, Guido; LaBranche, Celia; Montefiori, David C.; Kong, Wei; Yu, Xianghui; Gao, Feng

2017-01-01

The suppression of viral loads and identification of selection signatures in non-human primates after challenge are indicators for effective human immunodeficiency virus (HIV)/simian immunodeficiency virus (SIV) vaccines. To mimic the protective immunity elicited by attenuated SIV vaccines, we developed an integration-defective SIV (idSIV) vaccine by inactivating integrase, mutating sequence motifs critical for integration, and inserting the cytomegalovirus (CMV) promoter for more efficient expression in the SIVmac239 genome. Chinese rhesus macaques were immunized with idSIV DNA and idSIV particles, and the cellular and humoral immune responses were measured. After the intravenous SIVmac239 challenge, viral loads were monitored and selection signatures in viral genomes from vaccinated monkeys were identified by single genome sequencing. T cell responses, heterologous neutralization against tier-1 viruses, and antibody-dependent cellular cytotoxicity (ADCC) were detected in idSIV-vaccinated macaques post immunization. After challenge, the median peak viral load in the vaccine group was significantly lower than that in the control group. However, this initial viral control did not last as viral set-points were similar between vaccinated and control animals. Selection signatures were identified in Nef, Gag, and Env proteins in vaccinated and control macaques, but these signatures were different, suggesting selection pressure on viruses from vaccine-induced immunity in the vaccinated animals. Our results showed that the idSIV vaccine exerted some pressure on the virus population early during the infection but future modifications are needed in order to induce more potent immune responses. PMID:28574482
Identification of endometrial cancer methylation features using combined methylation analysis methods

PubMed Central

Trimarchi, Michael P.; Yan, Pearlly; Groden, Joanna; Bundschuh, Ralf; Goodfellow, Paul J.

2017-01-01

Background DNA methylation is a stable epigenetic mark that is frequently altered in tumors. DNA methylation features are attractive biomarkers for disease states given the stability of DNA methylation in living cells and in biologic specimens typically available for analysis. Widespread accumulation of methylation in regulatory elements in some cancers (specifically the CpG island methylator phenotype, CIMP) can play an important role in tumorigenesis. High resolution assessment of CIMP for the entire genome, however, remains cost prohibitive and requires quantities of DNA not available for many tissue samples of interest. Genome-wide scans of methylation have been undertaken for large numbers of tumors, and higher resolution analyses for a limited number of cancer specimens. Methods for analyzing such large datasets and integrating findings from different studies continue to evolve. An approach for comparison of findings from a genome-wide assessment of the methylated component of tumor DNA and more widely applied methylation scans was developed. Methods Methylomes for 76 primary endometrial cancer and 12 normal endometrial samples were generated using methylated fragment capture and second generation sequencing, MethylCap-seq. Publically available Infinium HumanMethylation 450 data from The Cancer Genome Atlas (TCGA) were compared to MethylCap-seq data. Results Analysis of methylation in promoter CpG islands (CGIs) identified a subset of tumors with a methylator phenotype. We used a two-stage approach to develop a 13-region methylation signature associated with a “hypermethylator state.” High level methylation for the 13-region methylation signatures was associated with mismatch repair deficiency, high mutation rate, and low somatic copy number alteration in the TCGA test set. In addition, the signature devised showed good agreement with previously described methylation clusters devised by TCGA. Conclusion We identified a methylation signature for a “hypermethylator phenotype” in endometrial cancer and developed methods that may prove useful for identifying extreme methylation phenotypes in other cancers. PMID:28278225
Phosphoproteomic analysis of the non-seed vascular plant model Selaginella moellendorffii

PubMed Central

2014-01-01

Background Selaginella (Selaginella moellendorffii) is a lycophyte which diverged from other vascular plants approximately 410 million years ago. As the first reported non-seed vascular plant genome, Selaginella genome data allow comparative analysis of genetic changes that may be associated with land plant evolution. Proteomics investigations on this lycophyte model have not been extensively reported. Phosphorylation represents the most common post-translational modifications and it is a ubiquitous regulatory mechanism controlling the functional expression of proteins inside living organisms. Results In this study, polyethylene glycol fractionation and immobilized metal ion affinity chromatography were employed to isolate phosphopeptides from wild-growing Selaginella. Using liquid chromatography-tandem mass spectrometry analysis, 1593 unique phosphopeptides spanning 1104 non-redundant phosphosites with confirmed localization on 716 phosphoproteins were identified. Analysis of the Selaginella dataset revealed features that are consistent with other plant phosphoproteomes, such as the relative proportions of phosphorylated Ser, Thr, and Tyr residues, the highest occurrence of phosphosites in the C-terminal regions of proteins, and the localization of phosphorylation events outside protein domains. In addition, a total of 97 highly conserved phosphosites in evolutionary conserved proteins were identified, indicating the conservation of phosphorylation-dependent regulatory mechanisms in phylogenetically distinct plant species. On the other hand, close examination of proteins involved in photosynthesis revealed phosphorylation events which may be unique to Selaginella evolution. Furthermore, phosphorylation motif analyses identified Pro-directed, acidic, and basic signatures which are recognized by typical protein kinases in plants. A group of Selaginella-specific phosphoproteins were found to be enriched in the Pro-directed motif class. Conclusions Our work provides the first large-scale atlas of phosphoproteins in Selaginella which occupies a unique position in the evolution of terrestrial plants. Future research into the functional roles of Selaginella-specific phosphorylation events in photosynthesis and other processes may offer insight into the molecular mechanisms leading to the distinct evolution of lycophytes. PMID:24628833
Genome analysis of canine astroviruses reveals genetic heterogeneity and suggests possible inter-species transmission.

PubMed

Mihalov-Kovács, Eszter; Martella, Vito; Lanave, Gianvito; Bodnar, Livia; Fehér, Enikő; Marton, Szilvia; Kemenesi, Gábor; Jakab, Ferenc; Bányai, Krisztián

2017-03-15

Canine astrovirus RNA was detected in the stools of 17/63 (26.9%) samples, using either a broadly reactive consensus RT-PCR for astroviruses or random RT-PCR coupled with massive deep sequencing. The complete or nearly complete genome sequence of five canine astroviruses was reconstructed that allowed mapping the genome organization and to investigate the genetic diversity of these viruses. The genome was about 6.6kb in length and contained three open reading frames (ORFs) flanked by a 5' UTR, and a 3' UTR plus a poly-A tail. ORF1a and ORF1b overlapped by 43 nucleotides while the ORF2 overlapped by 8 nucleotides with the 3' end of ORF1b. Upon genome comparison, four strains (HUN/2012/2, HUN/2012/6, HUN/2012/115, and HUN/2012/135) were more related genetically to each other and to UK canine astroviruses (88-96% nt identity), whilst strain HUN/2012/126 was more divergent (75-76% nt identity). In the ORF1b and ORF2, strains HUN/2012/2, HUN/2012/6, and HUN/2012/135 were related genetically to other canine astroviruses identified formerly in Europe and China, whereas strain HUN/2012/126 was related genetically to a divergent canine astrovirus strain, ITA/2010/Zoid. For one canine astrovirus, HUN/2012/8, only a 3.2kb portion of the genome, at the 3' end, could be determined. Interestingly, this strain possessed unique genetic signatures (including a longer ORF1b/ORF2 overlap and a longer 3'UTR) and it was divergent in both ORF1b and ORF2 from all other canine astroviruses, with the highest nucleotide sequence identity (68% and 63%, respectively) to a mink astrovirus, thus suggesting a possible event of interspecies transmission. The genetic heterogeneity of canine astroviruses may pose a challenge for the diagnostics and for future prophylaxis strategies. Copyright © 2016 Elsevier B.V. All rights reserved.
Thermal imaging as a biometrics approach to facial signature authentication.

PubMed

Guzman, A M; Goryawala, M; Wang, Jin; Barreto, A; Andrian, J; Rishe, N; Adjouadi, M

2013-01-01

A new thermal imaging framework with unique feature extraction and similarity measurements for face recognition is presented. The research premise is to design specialized algorithms that would extract vasculature information, create a thermal facial signature and identify the individual. The proposed algorithm is fully integrated and consolidates the critical steps of feature extraction through the use of morphological operators, registration using the Linear Image Registration Tool and matching through unique similarity measures designed for this task. The novel approach at developing a thermal signature template using four images taken at various instants of time ensured that unforeseen changes in the vasculature over time did not affect the biometric matching process as the authentication process relied only on consistent thermal features. Thirteen subjects were used for testing the developed technique on an in-house thermal imaging system. The matching using the similarity measures showed an average accuracy of 88.46% for skeletonized signatures and 90.39% for anisotropically diffused signatures. The highly accurate results obtained in the matching process clearly demonstrate the ability of the thermal infrared system to extend in application to other thermal imaging based systems. Empirical results applying this approach to an existing database of thermal images proves this assertion.
CoGAPS matrix factorization algorithm identifies transcriptional changes in AP-2alpha target genes in feedback from therapeutic inhibition of the EGFR network

PubMed Central

Thakar, Manjusha; Howard, Jason D.; Kagohara, Luciane T.; Krigsfeld, Gabriel; Ranaweera, Ruchira S.; Hughes, Robert M.; Perez, Jimena; Jones, Siân; Favorov, Alexander V.; Carey, Jacob; Stein-O'Brien, Genevieve; Gaykalova, Daria A.; Ochs, Michael F.; Chung, Christine H.

2016-01-01

Patients with oncogene driven tumors are treated with targeted therapeutics including EGFR inhibitors. Genomic data from The Cancer Genome Atlas (TCGA) demonstrates molecular alterations to EGFR, MAPK, and PI3K pathways in previously untreated tumors. Therefore, this study uses bioinformatics algorithms to delineate interactions resulting from EGFR inhibitor use in cancer cells with these genetic alterations. We modify the HaCaT keratinocyte cell line model to simulate cancer cells with constitutive activation of EGFR, HRAS, and PI3K in a controlled genetic background. We then measure gene expression after treating modified HaCaT cells with gefitinib, afatinib, and cetuximab. The CoGAPS algorithm distinguishes a gene expression signature associated with the anticipated silencing of the EGFR network. It also infers a feedback signature with EGFR gene expression itself increasing in cells that are responsive to EGFR inhibitors. This feedback signature has increased expression of several growth factor receptors regulated by the AP-2 family of transcription factors. The gene expression signatures for AP-2alpha are further correlated with sensitivity to cetuximab treatment in HNSCC cell lines and changes in EGFR expression in HNSCC tumors with low CDKN2A gene expression. In addition, the AP-2alpha gene expression signatures are also associated with inhibition of MEK, PI3K, and mTOR pathways in the Library of Integrated Network-Based Cellular Signatures (LINCS) data. These results suggest that AP-2 transcription factors are activated as feedback from EGFR network inhibition and may mediate EGFR inhibitor resistance. PMID:27650546
Classifying lower grade glioma cases according to whole genome gene expression.

PubMed

Chen, Baoshi; Liang, Tingyu; Yang, Pei; Wang, Haoyuan; Liu, Yanwei; Yang, Fan; You, Gan

2016-11-08

To identify a gene-based signature as a novel prognostic model in lower grade gliomas. A gene signature developed from HOXA7, SLC2A4RG and MN1 could segregate patients into low and high risk score groups with different overall survival (OS), and was validated in TCGA RNA-seq and GSE16011 mRNA array datasets. Receiver operating characteristic (ROC) was performed to show that the three-gene signature was more sensitive and specific than histology, grade, age, IDH1 mutation and 1p/19q co-deletion. Gene Set Enrichment Analysis (GSEA) and GO analysis showed high-risk samples were associated with tumor associated macrophages (TAMs) and highly invasive phenotypes. Moreover, HOXA7-siRNA inhibited migration and invasion in vitro, and downregulated MMP9 at the protein level in U251 glioma cells. A cohort of 164 glioma specimens from the Chinese Glioma Genome Atlas (CGGA) array database were assessed as the training group. TCGA RNA-seq and GSE16011 mRNA array datasets were used for validation. Regression analyses and linear risk score assessment were performed for the identification of the three-gene signature comprising HOXA7, SLC2A4RG and MN1. We established a three-gene signature for lower grade gliomas, which could independently predict overall survival (OS) of lower grade glioma patients with higher sensitivity and specificity compared with other clinical characteristics. These findings indicate that the three-gene signature is a new prognostic model that could provide improved OS prediction and accurate therapies for lower grade glioma patients.
Development and Validation of an Individualized Immune Prognostic Signature in Early-Stage Nonsquamous Non-Small Cell Lung Cancer.

PubMed

Li, Bailiang; Cui, Yi; Diehn, Maximilian; Li, Ruijiang

2017-11-01

The prevalence of early-stage non-small cell lung cancer (NSCLC) is expected to increase with recent implementation of annual screening programs. Reliable prognostic biomarkers are needed to identify patients at a high risk for recurrence to guide adjuvant therapy. To develop a robust, individualized immune signature that can estimate prognosis in patients with early-stage nonsquamous NSCLC. This retrospective study analyzed the gene expression profiles of frozen tumor tissue samples from 19 public NSCLC cohorts, including 18 microarray data sets and 1 RNA-Seq data set for The Cancer Genome Atlas (TCGA) lung adenocarcinoma cohort. Only patients with nonsquamous NSCLC with clinical annotation were included. Samples were from 2414 patients with nonsquamous NSCLC, divided into a meta-training cohort (729 patients), meta-testing cohort (716 patients), and 3 independent validation cohorts (439, 323, and 207 patients). All patients underwent surgery with a negative surgical margin, received no adjuvant or neoadjuvant therapy, and had publicly available gene expression data and survival information. Data were collected from July 22 through September 8, 2016. Overall survival. Of 2414 patients (1205 men [50%], 1111 women [46%], and 98 of unknown sex [4%]; median age [range], 64 [15-90] years), a prognostic immune signature of 25 gene pairs consisting of 40 unique genes was constructed using the meta-training data set. In the meta-testing and validation cohorts, the immune signature significantly stratified patients into high- vs low-risk groups in terms of overall survival across and within subpopulations with stage I, IA, IB, or II disease and remained as an independent prognostic factor in multivariate analyses (hazard ratio range, 1.72 [95% CI, 1.26-2.33; P < .001] to 2.36 [95% CI, 1.47-3.79; P < .001]) after adjusting for clinical and pathologic factors. Several biological processes, including chemotaxis, were enriched among genes in the immune signature. The percentage of neutrophil infiltration (5.6% vs 1.8%) and necrosis (4.6% vs 1.5%) was significantly higher in the high-risk immune group compared with the low-risk groups in TCGA data set (P < .003). The immune signature achieved a higher accuracy (mean concordance index [C-index], 0.64) than 2 commercialized multigene signatures (mean C-index, 0.53 and 0.61) for estimation of survival in comparable validation cohorts. When integrated with clinical characteristics such as age and stage, the composite clinical and immune signature showed improved prognostic accuracy in all validation data sets relative to molecular signatures alone (mean C-index, 0.70 vs 0.63) and another commercialized clinical-molecular signature (mean C-index, 0.68 vs 0.65). The proposed clinical-immune signature is a promising biomarker for estimating overall survival in nonsquamous NSCLC, including early-stage disease. Prospective studies are needed to test the clinical utility of the biomarker in individualized management of nonsquamous NSCLC.
Genomic profiling of a Hepatocyte growth factor-dependent signature for MET-targeted therapy in glioblastoma.

PubMed

Johnson, Jennifer; Ascierto, Maria Libera; Mittal, Sandeep; Newsome, David; Kang, Liang; Briggs, Michael; Tanner, Kirk; Marincola, Francesco M; Berens, Michael E; Vande Woude, George F; Xie, Qian

2015-09-17

Constitutive MET signaling promotes invasiveness in most primary and recurrent GBM. However, deployment of available MET-targeting agents is confounded by lack of effective biomarkers for selecting suitable patients for treatment. Because endogenous HGF overexpression often causes autocrine MET activation, and also indicates sensitivity to MET inhibitors, we investigated whether it drives the expression of distinct genes which could serve as a signature indicating vulnerability to MET-targeted therapy in GBM. Interrogation of genomic data from TCGA GBM (Student's t test, GBM patients with high and low HGF expression, p ≤ 0.00001) referenced against patient-derived xenograft (PDX) models (Student's t test, sensitive vs. insensitive models, p ≤ 0.005) was used to identify the HGF-dependent signature. Genomic analysis of GBM xenograft models using both human and mouse gene expression microarrays (Student's t test, treated vs. vehicle tumors, p ≤ 0.01) were performed to elucidate the tumor and microenvironment cross talk. A PDX model with EGFR(amp) was tested for MET activation as a mechanism of erlotinib resistance. We identified a group of 20 genes highly associated with HGF overexpression in GBM and were up- or down-regulated only in tumors sensitive to MET inhibitor. The MET inhibitors regulate tumor (human) and host (mouse) cells within the tumor via distinct molecular processes, but overall impede tumor growth by inhibiting cell cycle progression. EGFR (amp) tumors undergo erlotinib resistance responded to a combination of MET and EGFR inhibitors. Combining TCGA primary tumor datasets (human) and xenograft tumor model datasets (human tumor grown in mice) using therapeutic efficacy as an endpoint may serve as a useful approach to discover and develop molecular signatures as therapeutic biomarkers for targeted therapy. The HGF dependent signature may serve as a candidate predictive signature for patient enrollment in clinical trials using MET inhibitors. Human and mouse microarrays maybe used to dissect the tumor-host interactions. Targeting MET in EGFR (amp) GBM may delay the acquired resistance developed during treatment with erlotinib.
Loss of function JAK1 mutations occur at high frequency in cancers with microsatellite instability and are suggestive of immune evasion.

PubMed

Albacker, Lee A; Wu, Jeremy; Smith, Peter; Warmuth, Markus; Stephens, Philip J; Zhu, Ping; Yu, Lihua; Chmielecki, Juliann

2017-01-01

Immune evasion is a well-recognized hallmark of cancer and recent studies with immunotherapy agents have suggested that tumors with increased numbers of neoantigens elicit greater immune responses. We hypothesized that the immune system presents a common selective pressure on high mutation burden tumors and therefore immune evasion mutations would be enriched in high mutation burden tumors. The JAK family of kinases is required for the signaling of a host of immune modulators in tumor, stromal, and immune cells. Therefore, we analyzed alterations in this family for the hypothesized signature of an immune evasion mutation. Here, we searched a database of 61,704 unique solid tumors for alterations in the JAK family kinases (JAK1/2/3, TYK2). We used The Cancer Genome Atlas and Cancer Cell Line Encyclopedia data to confirm and extend our findings by analyzing gene expression patterns. Recurrent frameshift mutations in JAK1 were associated with high mutation burden and microsatellite instability. These mutations occurred in multiple tumor types including endometrial, colorectal, stomach, and prostate carcinomas. Analyzing gene expression signatures in endometrial and stomach adenocarcinomas revealed that tumors with a JAK1 frameshift exhibited reduced expression of interferon response signatures and multiple anti-tumor immune signatures. Importantly, endometrial cancer cell lines exhibited similar gene expression changes that were expected to be tumor cell intrinsic (e.g. interferon response) but not those expected to be tumor cell extrinsic (e.g. NK cells). From these data, we derive two primary conclusions: 1) JAK1 frameshifts are loss of function alterations that represent a potential pan-cancer adaptation to immune responses against tumors with microsatellite instability; 2) The mechanism by which JAK1 loss of function contributes to tumor immune evasion is likely associated with loss of the JAK1-mediated interferon response.
Predictive genomics: a cancer hallmark network framework for predicting tumor clinical phenotypes using genome sequencing data.

PubMed

Wang, Edwin; Zaman, Naif; Mcgee, Shauna; Milanese, Jean-Sébastien; Masoudi-Nejad, Ali; O'Connor-McCourt, Maureen

2015-02-01

Tumor genome sequencing leads to documenting thousands of DNA mutations and other genomic alterations. At present, these data cannot be analyzed adequately to aid in the understanding of tumorigenesis and its evolution. Moreover, we have little insight into how to use these data to predict clinical phenotypes and tumor progression to better design patient treatment. To meet these challenges, we discuss a cancer hallmark network framework for modeling genome sequencing data to predict cancer clonal evolution and associated clinical phenotypes. The framework includes: (1) cancer hallmarks that can be represented by a few molecular/signaling networks. 'Network operational signatures' which represent gene regulatory logics/strengths enable to quantify state transitions and measures of hallmark traits. Thus, sets of genomic alterations which are associated with network operational signatures could be linked to the state/measure of hallmark traits. The network operational signature transforms genotypic data (i.e., genomic alterations) to regulatory phenotypic profiles (i.e., regulatory logics/strengths), to cellular phenotypic profiles (i.e., hallmark traits) which lead to clinical phenotypic profiles (i.e., a collection of hallmark traits). Furthermore, the framework considers regulatory logics of the hallmark networks under tumor evolutionary dynamics and therefore also includes: (2) a self-promoting positive feedback loop that is dominated by a genomic instability network and a cell survival/proliferation network is the main driver of tumor clonal evolution. Surrounding tumor stroma and its host immune systems shape the evolutionary paths; (3) cell motility initiating metastasis is a byproduct of the above self-promoting loop activity during tumorigenesis; (4) an emerging hallmark network which triggers genome duplication dominates a feed-forward loop which in turn could act as a rate-limiting step for tumor formation; (5) mutations and other genomic alterations have specific patterns and tissue-specificity, which are driven by aging and other cancer-inducing agents. This framework represents the logics of complex cancer biology as a myriad of phenotypic complexities governed by a limited set of underlying organizing principles. It therefore adds to our understanding of tumor evolution and tumorigenesis, and moreover, potential usefulness of predicting tumors' evolutionary paths and clinical phenotypes. Strategies of using this framework in conjunction with genome sequencing data in an attempt to predict personalized drug targets, drug resistance, and metastasis for cancer patients, as well as cancer risks for healthy individuals are discussed. Accurate prediction of cancer clonal evolution and clinical phenotypes will have substantial impact on timely diagnosis, personalized treatment and personalized prevention of cancer. Crown Copyright © 2014. Published by Elsevier Ltd. All rights reserved.

Future paradigms for precision oncology

PubMed Central

Klement, Giannoula Lakka; Arkun, Knarik; Valik, Dalibor; Roffidal, Tina; Hashemi, Ali; Klement, Christos; Carmassi, Paolo; Rietman, Edward; Slaby, Ondrej; Mazanek, Pavel; Mudry, Peter; Kovacs, Gabor; Kiss, Csongor; Norga, Koen; Konstantinov, Dobrin; André, Nicolas; Slavc, Irene; van Den Berg, Henk; Kolenova, Alexandra; Kren, Leos; Tuma, Jiri; Skotakova, Jarmila; Sterba, Jaroslav

2016-01-01

Research has exposed cancer to be a heterogeneous disease with a high degree of inter-tumoral and intra-tumoral variability. Individual tumors have unique profiles, and these molecular signatures make the use of traditional histology-based treatments problematic. The conventional diagnostic categories, while necessary for care, thwart the use of molecular information for treatment as molecular characteristics cross tissue types. This is compounded by the struggle to keep abreast the scientific advances made in all fields of science, and by the enormous challenge to organize, cross-reference, and apply molecular data for patient benefit. In order to supplement the site-specific, histology-driven diagnosis with genomic, proteomic and metabolomics information, a paradigm shift in diagnosis and treatment of patients is required. While most physicians are open and keen to use the emerging data for therapy, even those versed in molecular therapeutics are overwhelmed with the amount of available data. It is not surprising that even though The Human Genome Project was completed thirteen years ago, our patients have not benefited from the information. Physicians cannot, and should not be asked to process the gigabytes of genomic and proteomic information on their own in order to provide patients with safe therapies. The following consensus summary identifies the needed for practice changes, proposes potential solutions to the present crisis of informational overload, suggests ways of providing physicians with the tools necessary for interpreting patient specific molecular profiles, and facilitates the implementation of quantitative precision medicine. It also provides two case studies where this approach has been used. PMID:27223079
Diversity, genetic mapping, and signatures of domestication in the carrot (Daucus carota L.) genome, as revealed by Diversity Arrays Technology (DArT) markers

USDA-ARS?s Scientific Manuscript database

Carrot is one of the most economically important vegetables worldwide, however, genetic and genomic resources supporting carrot breeding remain limited. We developed a Diversity Arrays Technology (DArT) platform for wild and cultivated carrot and used it to investigate genetic diversity and to devel...
Correlation of 16S Ribosomal DNA Signature Sequences with Temperature-Dependent Growth Rates of Mesophilic and Psychrotolerant Strains of the Bacillus cereus Group

PubMed Central

Prüß, Birgit M.; Francis, Kevin P.; von Stetten, Felix; Scherer, Siegfried

1999-01-01

Sequences of the 16S ribosomal DNA (rDNA) from psychrotolerant and mesophilic strains of the Bacillus cereus group revealed signatures which were specific for these two thermal groups of bacteria. Further analysis of the genomic DNA from a wide range of food and soil isolates showed that B. cereus group strains have between 6 and 10 copies of 16S rDNA. Moreover, a number of these environmental strains have both rDNA operons with psychrotolerant signatures and rDNA operons with mesophilic signatures. The ability of these isolates to grow at low temperatures correlates with the prevalence of rDNA operons with psychrotolerant signatures, indicating specific nucleotides within the 16S rRNA to play a role in psychrotolerance. PMID:10198030
Whole-Genome Sequencing in Microbial Forensic Analysis of Gamma-Irradiated Microbial Materials.

PubMed

Broomall, Stacey M; Ait Ichou, Mohamed; Krepps, Michael D; Johnsky, Lauren A; Karavis, Mark A; Hubbard, Kyle S; Insalaco, Joseph M; Betters, Janet L; Redmond, Brady W; Rivers, Bryan A; Liem, Alvin T; Hill, Jessica M; Fochler, Edward T; Roth, Pierce A; Rosenzweig, C Nicole; Skowronski, Evan W; Gibbons, Henry S

2016-01-15

Effective microbial forensic analysis of materials used in a potential biological attack requires robust methods of morphological and genetic characterization of the attack materials in order to enable the attribution of the materials to potential sources and to exclude other potential sources. The genetic homogeneity and potential intersample variability of many of the category A to C bioterrorism agents offer a particular challenge to the generation of attributive signatures, potentially requiring whole-genome or proteomic approaches to be utilized. Currently, irradiation of mail is standard practice at several government facilities judged to be at particularly high risk. Thus, initial forensic signatures would need to be recovered from inactivated (nonviable) material. In the study described in this report, we determined the effects of high-dose gamma irradiation on forensic markers of bacterial biothreat agent surrogate organisms with a particular emphasis on the suitability of genomic DNA (gDNA) recovered from such sources as a template for whole-genome analysis. While irradiation of spores and vegetative cells affected the retention of Gram and spore stains and sheared gDNA into small fragments, we found that irradiated material could be utilized to generate accurate whole-genome sequence data on the Illumina and Roche 454 sequencing platforms. Copyright © 2016, American Society for Microbiology. All Rights Reserved.
Natural Selection and Functional Potentials of Human Noncoding Elements Revealed by Analysis of Next Generation Sequencing Data

PubMed Central

Xu, Shuhua

2015-01-01

Noncoding DNA sequences (NCS) have attracted much attention recently due to their functional potentials. Here we attempted to reveal the functional roles of noncoding sequences from the point of view of natural selection that typically indicates the functional potentials of certain genomic elements. We analyzed nearly 37 million single nucleotide polymorphisms (SNPs) of Phase I data of the 1000 Genomes Project. We estimated a series of key parameters of population genetics and molecular evolution to characterize sequence variations of the noncoding genome within and between populations, and identified the natural selection footprints in NCS in worldwide human populations. Our results showed that purifying selection is prevalent and there is substantial constraint of variations in NCS, while positive selectionis more likely to be specific to some particular genomic regions and regional populations. Intriguingly, we observed larger fraction of non-conserved NCS variants with lower derived allele frequency in the genome, indicating possible functional gain of non-conserved NCS. Notably, NCS elements are enriched for potentially functional markers such as eQTLs, TF motif, and DNase I footprints in the genome. More interestingly, some NCS variants associated with diseases such as Alzheimer's disease, Type 1 diabetes, and immune-related bowel disorder (IBD) showed signatures of positive selection, although the majority of NCS variants, reported as risk alleles by genome-wide association studies, showed signatures of negative selection. Our analyses provided compelling evidence of natural selection forces on noncoding sequences in the human genome and advanced our understanding of their functional potentials that play important roles in disease etiology and human evolution. PMID:26053627
Molecular signatures database (MSigDB) 3.0.

PubMed

Liberzon, Arthur; Subramanian, Aravind; Pinchback, Reid; Thorvaldsdóttir, Helga; Tamayo, Pablo; Mesirov, Jill P

2011-06-15

Well-annotated gene sets representing the universe of the biological processes are critical for meaningful and insightful interpretation of large-scale genomic data. The Molecular Signatures Database (MSigDB) is one of the most widely used repositories of such sets. We report the availability of a new version of the database, MSigDB 3.0, with over 6700 gene sets, a complete revision of the collection of canonical pathways and experimental signatures from publications, enhanced annotations and upgrades to the web site. MSigDB is freely available for non-commercial use at http://www.broadinstitute.org/msigdb.
Unique core genomes of the bacterial family vibrionaceae: insights into niche adaptation and speciation.

PubMed

Kahlke, Tim; Goesmann, Alexander; Hjerde, Erik; Willassen, Nils Peder; Haugen, Peik

2012-05-10

The criteria for defining bacterial species and even the concept of bacterial species itself are under debate, and the discussion is apparently intensifying as more genome sequence data is becoming available. However, it is still unclear how the new advances in genomics should be used most efficiently to address this question. In this study we identify genes that are common to any group of genomes in our dataset, to determine whether genes specific to a particular taxon exist and to investigate their potential role in adaptation of bacteria to their specific niche. These genes were named unique core genes. Additionally, we investigate the existence and importance of unique core genes that are found in isolates of phylogenetically non-coherent groups. These groups of isolates, that share a genetic feature without sharing a closest common ancestor, are termed genophyletic groups. The bacterial family Vibrionaceae was used as the model, and we compiled and compared genome sequences of 64 different isolates. Using the software orthoMCL we determined clusters of homologous genes among the investigated genome sequences. We used multilocus sequence analysis to build a host phylogeny and mapped the numbers of unique core genes of all distinct groups of isolates onto the tree. The results show that unique core genes are more likely to be found in monophyletic groups of isolates. Genophyletic groups of isolates, in contrast, are less common especially for large groups of isolate. The subsequent annotation of unique core genes that are present in genophyletic groups indicate a high degree of horizontally transferred genes. Finally, the annotation of the unique core genes of Vibrio cholerae revealed genes involved in aerotaxis and biosynthesis of the iron-chelator vibriobactin. The presented work indicates that genes specific for any taxon inside the bacterial family Vibrionaceae exist. These unique core genes encode conserved metabolic functions that can shed light on the adaptation of a species to its ecological niche. Additionally, our study suggests that unique core genes can be used to aid classification of bacteria and contribute to a bacterial species definition on a genomic level. Furthermore, these genes may be of importance in clinical diagnostics and drug development.
Detecting Signatures of Positive Selection along Defined Branches of a Population Tree Using LSD.

PubMed

Librado, Pablo; Orlando, Ludovic

2018-06-01

Identifying the genomic basis underlying local adaptation is paramount to evolutionary biology, and bears many applications in the fields of conservation biology, crop, and animal breeding, as well as personalized medicine. Although many approaches have been developed to detect signatures of positive selection within single populations and population pairs, the increasing wealth of high-throughput sequencing data requires improved methods capable of handling multiple, and ideally large number of, populations in a single analysis. In this study, we introduce LSD (levels of exclusively shared differences), a fast and flexible framework to perform genome-wide selection scans, along the internal and external branches of a given population tree. We use forward simulations to demonstrate that LSD can identify branches targeted by positive selection with remarkable sensitivity and specificity. We illustrate a range of potential applications by analyzing data from the 1000 Genomes Project and uncover a list of adaptive candidates accompanying the expansion of anatomically modern humans out of Africa and their spread to Europe.
Gyneco-oncological genomics and emerging biomarkers for cancer treatment with immune-checkpoint inhibitors.

PubMed

Curigliano, Giuseppe

2018-05-15

In gynecological cancers tumor infiltrating lymphocytes and upregulation of immune-related gene signatures have been associated with a better prognosis. Knowledge of tumor immunogenicity and associated gene signatures suggests that the tumor immune landscape is a key determinant to define patient prognosis and potentially to predict response to immune-checkpoint inhibitors. The aim of this review is to give an overview of immune gene signatures across gynecology histological cancer types, defining their prognostic and potential predictive role. In the current review we will present data on these gene signatures, on immunohistochemical features and their potential importance to select patients potentially eligible to trials with immune-checkpoint inhibitors. Copyright © 2018 Elsevier Ltd. All rights reserved.
Comparative genomics provides evidence for the 3-hydroxypropionate autotrophic pathway in filamentous anoxygenic phototrophic bacteria and in hot spring microbial mats.

PubMed

Klatt, Christian G; Bryant, Donald A; Ward, David M

2007-08-01

Stable carbon isotope signatures of diagnostic lipid biomarkers have suggested that Roseiflexus spp., the dominant filamentous anoxygenic phototrophic bacteria inhabiting microbial mats of alkaline siliceous hot springs, may be capable of fixing bicarbonate via the 3-hydroxypropionate pathway, which has been characterized in their distant relative, Chloroflexus aurantiacus. The genomes of three filamentous anoxygenic phototrophic Chloroflexi isolates (Roseiflexus sp. RS-1, Roseiflexus castenholzii and Chloroflexus aggregans), but not that of a non-photosynthetic Chloroflexi isolate (Herpetosiphon aurantiacus), were found to contain open reading frames that show a high degree of sequence similarity to genes encoding enzymes in the C. aurantiacus pathway. Metagenomic DNA sequences from the microbial mats of alkaline siliceous hot springs also contain homologues of these genes that are highly similar to genes in both Roseiflexus spp. and Chloroflexus spp. Thus, Roseiflexus spp. appear to have the genetic capacity for carbon dioxide reduction via the 3-hydroxypropionate pathway. This may contribute to heavier carbon isotopic signatures of the cell components of native Roseiflexus populations in mats compared with the signatures of cyanobacterial cell components, as a similar isotopic signature would be expected if Roseiflexus spp. were participating in photoheterotrophic uptake of cyanobacterial photosynthate produced by the reductive pentose phosphate cycle.
Unraveling Molecular Signatures of Immunostimulatory Adjuvants in the Female Genital Tract through Systems Biology

PubMed Central

Brinkenberg, Ingrid; Samuelson, Emma; Thörn, Karolina; Nielsen, Jens; Harandi, Ali M.

2011-01-01

Sexually transmitted infections (STIs) unequivocally represent a major public health concern in both industrialized and developing countries. Previous efforts to develop vaccines for systemic immunization against a large number of STIs in humans have been unsuccessful. There is currently a drive to develop mucosal vaccines and adjuvants for delivery through the genital tract to confer protective immunity against STIs. Identification of molecular signatures that can be used as biomarkers for adjuvant potency can inform rational development of potent mucosal adjuvants. Here, we used systems biology to study global gene expression and signature molecules and pathways in the mouse vagina after treatment with two classes of experimental adjuvants. The Toll-like receptor 9 agonist CpG ODN and the invariant natural killer T cell agonist alpha-galactosylceramide, which we previously identified as equally potent vaginal adjuvants, were selected for this study. Our integrated analysis of genome-wide transcriptome data determined which signature pathways, processes and networks are shared by or otherwise exclusive to these 2 classes of experimental vaginal adjuvants in the mouse vagina. To our knowledge, this is the first integrated genome-wide transcriptome analysis of the effects of immunomodulatory adjuvants on the female genital tract of a mammal. These results could inform rational development of effective mucosal adjuvants for vaccination against STIs. PMID:21666746
Comparative analysis reveals genomic features of stress-induced transcriptional readthrough

PubMed Central

Vilborg, Anna; Sabath, Niv; Wiesel, Yuval; Nathans, Jenny; Levy-Adam, Flonia; Yario, Therese A.; Steitz, Joan A.; Shalgi, Reut

2017-01-01

Transcription is a highly regulated process, and stress-induced changes in gene transcription have been shown to play a major role in stress responses and adaptation. Genome-wide studies reveal prevalent transcription beyond known protein-coding gene loci, generating a variety of RNA classes, most of unknown function. One such class, termed downstream of gene-containing transcripts (DoGs), was reported to result from transcriptional readthrough upon osmotic stress in human cells. However, how widespread the readthrough phenomenon is, and what its causes and consequences are, remain elusive. Here we present a genome-wide mapping of transcriptional readthrough, using nuclear RNA-Seq, comparing heat shock, osmotic stress, and oxidative stress in NIH 3T3 mouse fibroblast cells. We observe massive induction of transcriptional readthrough, both in levels and length, under all stress conditions, with significant, yet not complete, overlap of readthrough-induced loci between different conditions. Importantly, our analyses suggest that stress-induced transcriptional readthrough is not a random failure process, but is rather differentially induced across different conditions. We explore potential regulators and find a role for HSF1 in the induction of a subset of heat shock-induced readthrough transcripts. Analysis of public datasets detected increases in polymerase II occupancy in DoG regions after heat shock, supporting our findings. Interestingly, DoGs tend to be produced in the vicinity of neighboring genes, leading to a marked increase in their antisense-generating potential. Finally, we examine genomic features of readthrough transcription and observe a unique chromatin signature typical of DoG-producing regions, suggesting that readthrough transcription is associated with the maintenance of an open chromatin state. PMID:28928151
Rapid genome-wide evolution in Brassica rapa populations following drought revealed by sequencing of ancestral and descendant gene pools.

PubMed

Franks, Steven J; Kane, Nolan C; O'Hara, Niamh B; Tittes, Silas; Rest, Joshua S

2016-08-01

There is increasing evidence that evolution can occur rapidly in response to selection. Recent advances in sequencing suggest the possibility of documenting genetic changes as they occur in populations, thus uncovering the genetic basis of evolution, particularly if samples are available from both before and after selection. Here, we had a unique opportunity to directly assess genetic changes in natural populations following an evolutionary response to a fluctuation in climate. We analysed genome-wide differences between ancestors and descendants of natural populations of Brassica rapa plants from two locations that rapidly evolved changes in multiple phenotypic traits, including flowering time, following a multiyear late-season drought in California. These ancestor-descendant comparisons revealed evolutionary shifts in allele frequencies in many genes. Some genes showing evolutionary shifts have functions related to drought stress and flowering time, consistent with an adaptive response to selection. Loci differentiated between ancestors and descendants (FST outliers) were generally different from those showing signatures of selection based on site frequency spectrum analysis (Tajima's D), indicating that the loci that evolved in response to the recent drought and those under historical selection were generally distinct. Very few genes showed similar evolutionary responses between two geographically distinct populations, suggesting independent genetic trajectories of evolution yielding parallel phenotypic changes. The results show that selection can result in rapid genome-wide evolutionary shifts in allele frequencies in natural populations, and highlight the usefulness of combining resurrection experiments in natural populations with genomics for studying the genetic basis of adaptive evolution. © 2016 The Authors. Molecular Ecology Published by John Wiley & Sons Ltd.
The reduced genomes of Parcubacteria (OD1) contain signatures of a symbiotic lifestyle

PubMed Central

Nelson, William C.; Stegen, James C.

2015-01-01

Candidate phylum OD1 bacteria (also referred to as Parcubacteria) have been identified in a broad range of anoxic environments through community survey analysis. Although none of these species have been isolated in the laboratory, several genome sequences have been reconstructed from metagenomic sequence data and single-cell sequencing. The organisms have small (generally <1 Mb) genomes with severely reduced metabolic capabilities. We have reconstructed 8 partial to near-complete OD1 genomes from oxic groundwater samples, and compared them against existing genomic data. The conserved core gene set comprises 202 genes, or ~28% of the genomic complement. “Housekeeping” genes and genes for biosynthesis of peptidoglycan and Type IV pilus production are conserved. Gene sets for biosynthesis of cofactors, amino acids, nucleotides, and fatty acids are absent entirely or greatly reduced. The only aspects of energy metabolism conserved are the non-oxidative branch of the pentose-phosphate shunt and central glycolysis. These organisms also lack some activities conserved in almost all other known bacterial genomes, including signal recognition particle, pseudouridine synthase A, and FAD synthase. Pan-genome analysis indicates a broad genotypic diversity and perhaps a highly fluid gene complement, indicating historical adaptation to a wide range of growth environments and a high degree of specialization. The genomes were examined for signatures suggesting either a free-living, streamlined lifestyle, or a symbiotic lifestyle. The lack of biosynthetic capabilities and DNA repair, along with the presence of potential attachment and adhesion proteins suggest that the Parcubacteria are ectosymbionts or parasites of other organisms. The wide diversity of genes that potentially mediate cell-cell contact suggests a broad range of partner/prey organisms across the phylum. PMID:26257709
The reduced genomes of Parcubacteria (OD1) contain signatures of a symbiotic lifestyle

DOE PAGES

Nelson, William C.; Stegen, James C.

2015-07-21

Candidate phylum OD1 bacteria (also referred to as Parcubacteria) have been identified in a broad range of anoxic environments through community survey analysis. Although none of these species have been isolated in the laboratory, several genome sequences have been reconstructed from metagenomic sequence data and single-cell sequencing. The organisms have small (generally <1 Mb) genomes with severely reduced metabolic capabilities. We have reconstructed 8 partial to near-complete OD1 genomes from oxic groundwater samples, and compared them against existing genomic data. The conserved core gene set comprises 202 genes, or ~28% of the genomic complement. “Housekeeping” genes and genes for biosynthesismore » of peptidoglycan and Type IV pilus production are conserved. Gene sets for biosynthesis of cofactors, amino acids, nucleotides, and fatty acids are absent entirely or greatly reduced. The only aspects of energy metabolism conserved are the non-oxidative branch of the pentose-phosphate shunt and central glycolysis. These organisms also lack some activities conserved in almost all other known bacterial genomes, including signal recognition particle, pseudouridine synthase A, and FAD synthase. Pan-genome analysis indicates a broad genotypic diversity and perhaps a highly fluid gene complement, indicating historical adaptation to a wide range of growth environments and a high degree of specialization. The genomes were examined for signatures suggesting either a free-living, streamlined lifestyle, or a symbiotic lifestyle. The lack of biosynthetic capabilities and DNA repair, along with the presence of potential attachment and adhesion proteins suggest that the Parcubacteria are ectosymbionts or parasites of other organisms. The wide diversity of genes that potentially mediate cell-cell contact suggests a broad range of partner/prey organisms across the phylum.« less
The reduced genomes of Parcubacteria (OD1) contain signatures of a symbiotic lifestyle

DOE Office of Scientific and Technical Information (OSTI.GOV)

Nelson, William C.; Stegen, James C.

2015-07-21

Candidate phylum OD1 bacteria (also referred to as Parcubacteria) have been identified in broad range of anoxic environments through community survey analysis. Although none of these species have been isolated in the laboratory, several genome sequences have been reconstructed from metagenomic sequence data and single-cell sequencing. The organisms have small (generally <1 Mb) genomes with severely reduced metabolic capabilities. We have reconstructed 8 partial to near-complete OD1 genomes from oxic groundwater samples, and compared them against existing genomic data. The conserved core gene set comprises 202 genes, or ~28% of the genomic complement. ‘Housekeeping’ genes and genes for biosynthesis ofmore » peptidoglycan and Type IV pilus production are conserved. Gene sets for biosynthesis of cofactors, amino acids, nucleotides and fatty acids are absent entirely or greatly reduced. The only aspects of energy metabolism conserved are the non-oxidative branch of the pentose-phosphate shunt and central glycolysis. These organisms also lack some activities conserved in almost all other known bacterial genomes, including signal recognition particle, pseudouridine synthase A, and FAD synthase. Pan-genome analysis indicates a broad genotypic diversity and perhaps a highly fluid gene complement, indicating historical adaptation to a wide range of growth environments and a high degree of specialization. The genomes were examined for signatures suggesting either a free-living, streamlined lifestyle or a symbiotic lifestyle. The lack of biosynthetic capabilities and DNA repair, along with the presence of potential attachment and adhesion proteins suggest the Parcubacteria are ectosymbionts or parasites of other organisms. The wide diversity of genes that potentially mediate cell-cell contact suggests a broad range of partner/prey organisms across the phylum.« less
Characterization of canine osteosarcoma by array comparative genomic hybridization and RT-qPCR: signatures of genomic imbalance in canine osteosarcoma parallel the human counterpart.

PubMed

Angstadt, Andrea Y; Motsinger-Reif, Alison; Thomas, Rachael; Kisseberth, William C; Guillermo Couto, C; Duval, Dawn L; Nielsen, Dahlia M; Modiano, Jaime F; Breen, Matthew

2011-11-01

Osteosarcoma (OS) is the most commonly diagnosed malignant bone tumor in humans and dogs, characterized in both species by extremely complex karyotypes exhibiting high frequencies of genomic imbalance. Evaluation of genomic signatures in human OS using array comparative genomic hybridization (aCGH) has assisted in uncovering genetic mechanisms that result in disease phenotype. Previous low-resolution (10-20 Mb) aCGH analysis of canine OS identified a wide range of recurrent DNA copy number aberrations, indicating extensive genomic instability. In this study, we profiled 123 canine OS tumors by 1 Mb-resolution aCGH to generate a dataset for direct comparison with current data for human OS, concluding that several high frequency aberrations in canine and human OS are orthologous. To ensure complete coverage of gene annotation, we identified the human refseq genes that map to these orthologous aberrant dog regions and found several candidate genes warranting evaluation for OS involvement. Specifically, subsequenct FISH and qRT-PCR analysis of RUNX2, TUSC3, and PTEN indicated that expression levels correlated with genomic copy number status, showcasing RUNX2 as an OS associated gene and TUSC3 as a possible tumor suppressor candidate. Together these data demonstrate the ability of genomic comparative oncology to identify genetic abberations which may be important for OS progression. Large scale screening of genomic imbalance in canine OS further validates the use of the dog as a suitable model for human cancers, supporting the idea that dysregulation discovered in canine cancers will provide an avenue for complementary study in human counterparts. Copyright © 2011 Wiley-Liss, Inc.
Functional genomics reveals the induction of inflammatory response and metalloproteinase gene expression during lethal Ebola virus infection.

PubMed

Cilloniz, Cristian; Ebihara, Hideki; Ni, Chester; Neumann, Gabriele; Korth, Marcus J; Kelly, Sara M; Kawaoka, Yoshihiro; Feldmann, Heinz; Katze, Michael G

2011-09-01

Ebola virus is the etiologic agent of a lethal hemorrhagic fever in humans and nonhuman primates with mortality rates of up to 90%. Previous studies with Zaire Ebola virus (ZEBOV), mouse-adapted virus (MA-ZEBOV), and mutant viruses (ZEBOV-NP(ma), ZEBOV-VP24(ma), and ZEBOV-NP/VP24(ma)) allowed us to identify the mutations in viral protein 24 (VP24) and nucleoprotein (NP) responsible for acquisition of high virulence in mice. To elucidate specific molecular signatures associated with lethality, we compared global gene expression profiles in spleen samples from mice infected with these viruses and performed an extensive functional analysis. Our analysis showed that the lethal viruses (MA-ZEBOV and ZEBOV-NP/VP24(ma)) elicited a strong expression of genes 72 h after infection. In addition, we found that although the host transcriptional response to ZEBOV-VP24(ma) was nearly the same as that to ZEBOV-NP/VP24(ma), the contribution of a mutation in the NP gene was required for a lethal phenotype. Further analysis indicated that one of the most relevant biological functions differentially regulated by the lethal viruses was the inflammatory response, as was the induction of specific metalloproteinases, which were present in our newly identify functional network that was associated with Ebola virus lethality. Our results suggest that this dysregulated proinflammatory response increased the severity of disease. Consequently, the newly discovered molecular signature could be used as the starting point for the development of new drugs and therapeutics. To our knowledge, this is the first study that clearly defines unique molecular signatures associated with Ebola virus lethality.
Somatic cell nuclear transfer: infinite reproduction of a unique diploid genome.

PubMed

Kishigami, Satoshi; Wakayama, Sayaka; Hosoi, Yoshihiko; Iritani, Akira; Wakayama, Teruhiko

2008-06-10

In mammals, a diploid genome of an individual following fertilization of an egg and a spermatozoon is unique and irreproducible. This implies that the generated unique diploid genome is doomed with the individual ending. Even as cultured cells from the individual, they cannot normally proliferate in perpetuity because of the "Hayflick limit". However, Dolly, the sheep cloned from an adult mammary gland cell, changes this scenario. Somatic cell nuclear transfer (SCNT) enables us to produce offspring without germ cells, that is, to "passage" a unique diploid genome. Animal cloning has also proven to be a powerful research tool for reprogramming in many mammals, notably mouse and cow. The mechanism underlying reprogramming, however, remains largely unknown and, animal cloning has been inefficient as a result. More momentously, in addition to abortion and fetal mortality, some cloned animals display possible premature aging phenotypes including early death and short telomere lengths. Under these inauspicious conditions, is it really possible for SCNT to preserve a diploid genome? Delightfully, in mouse and recently in primate, using SCNT we can produce nuclear transfer ES cells (ntES) more efficiently, which can preserve the eternal lifespan for the "passage" of a unique diploid genome. Further, new somatic cloning technique using histone-deacetylase inhibitors has been developed which can significantly increase the previous cloning rates two to six times. Here, we introduce SCNT and its value as a preservation tool for a diploid genome while reviewing aging of cloned animals on cellular and individual levels.
Fast, Accurate and Automatic Ancient Nucleosome and Methylation Maps with epiPALEOMIX.

PubMed

Hanghøj, Kristian; Seguin-Orlando, Andaine; Schubert, Mikkel; Madsen, Tobias; Pedersen, Jakob Skou; Willerslev, Eske; Orlando, Ludovic

2016-12-01

The first epigenomes from archaic hominins (AH) and ancient anatomically modern humans (AMH) have recently been characterized, based, however, on a limited number of samples. The extent to which ancient genome-wide epigenetic landscapes can be reconstructed thus remains contentious. Here, we present epiPALEOMIX, an open-source and user-friendly pipeline that exploits post-mortem DNA degradation patterns to reconstruct ancient methylomes and nucleosome maps from shotgun and/or capture-enrichment data. Applying epiPALEOMIX to the sequence data underlying 35 ancient genomes including AMH, AH, equids and aurochs, we investigate the temporal, geographical and preservation range of ancient epigenetic signatures. We first assess the quality of inferred ancient epigenetic signatures within well-characterized genomic regions. We find that tissue-specific methylation signatures can be obtained across a wider range of DNA preparation types than previously thought, including when no particular experimental procedures have been used to remove deaminated cytosines prior to sequencing. We identify a large subset of samples for which DNA associated with nucleosomes is protected from post-mortem degradation, and nucleosome positioning patterns can be reconstructed. Finally, we describe parameters and conditions such as DNA damage levels and sequencing depth that limit the preservation of epigenetic signatures in ancient samples. When such conditions are met, we propose that epigenetic profiles of CTCF binding regions can be used to help data authentication. Our work, including epiPALEOMIX, opens for further investigations of ancient epigenomes through time especially aimed at tracking possible epigenetic changes during major evolutionary, environmental, socioeconomic, and cultural shifts. © The Author 2016. Published by Oxford University Press on behalf of the Society for Molecular Biology and Evolution.

Novel recurrently mutated genes and a prognostic mutation signature in colorectal cancer.

PubMed

Yu, Jun; Wu, William K K; Li, Xiangchun; He, Jun; Li, Xiao-Xing; Ng, Simon S M; Yu, Chang; Gao, Zhibo; Yang, Jie; Li, Miao; Wang, Qiaoxiu; Liang, Qiaoyi; Pan, Yi; Tong, Joanna H; To, Ka F; Wong, Nathalie; Zhang, Ning; Chen, Jie; Lu, Youyong; Lai, Paul B S; Chan, Francis K L; Li, Yingrui; Kung, Hsiang-Fu; Yang, Huanming; Wang, Jun; Sung, Joseph J Y

2015-04-01

Characterisation of colorectal cancer (CRC) genomes by next-generation sequencing has led to the discovery of novel recurrently mutated genes. Nevertheless, genomic data has not yet been used for CRC prognostication. To identify recurrent somatic mutations with prognostic significance in patients with CRC. Exome sequencing was performed to identify somatic mutations in tumour tissues of 22 patients with CRC, followed by validation of 187 recurrent and pathway-related genes using targeted capture sequencing in additional 160 cases. Seven significantly mutated genes, including four reported (APC, TP53, KRAS and SMAD4) and three novel recurrently mutated genes (CDH10, FAT4 and DOCK2), exhibited high mutation prevalence (6-14% for novel cancer genes) and higher-than-expected number of non-silent mutations in our CRC cohort. For prognostication, a five-gene-signature (CDH10, COL6A3, SMAD4, TMEM132D, VCAN) was devised, in which mutation(s) in one or more of these genes was significantly associated with better overall survival independent of tumor-node-metastasis (TNM) staging. The median survival time was 80.4 months in the mutant group versus 42.4 months in the wild type group (p=0.0051). The prognostic significance of this signature was successfully verified using the data set from the Cancer Genome Atlas study. The application of next-generation sequencing has led to the identification of three novel significantly mutated genes in CRC and a mutation signature that predicts survival outcomes for stratifying patients with CRC independent of TNM staging. Published by the BMJ Publishing Group Limited. For permission to use (where not already granted under a licence) please go to http://group.bmj.com/group/rights-licensing/permissions.
Human CD30+ B cells represent a unique subset related to Hodgkin lymphoma cells.

PubMed

Weniger, Marc A; Tiacci, Enrico; Schneider, Stefanie; Arnolds, Judith; Rüschenbaum, Sabrina; Duppach, Janine; Seifert, Marc; Döring, Claudia; Hansmann, Martin-Leo; Küppers, Ralf

2018-06-11

Very few B cells in germinal centers (GCs) and extrafollicular (EF) regions of lymph nodes express CD30. Their specific features and relationship to CD30-expressing Hodgkin and Reed/Sternberg (HRS) cells of Hodgkin lymphoma are unclear but highly relevant, because numerous patients with lymphoma are currently treated with an anti-CD30 immunotoxin. We performed a comprehensive analysis of human CD30+ B cells. Phenotypic and IgV gene analyses indicated that CD30+ GC B lymphocytes represent typical GC B cells, and that CD30+ EF B cells are mostly post-GC B cells. The transcriptomes of CD30+ GC and EF B cells largely overlapped, sharing a strong MYC signature, but were strikingly different from conventional GC B cells and memory B and plasma cells, respectively. CD30+ GC B cells represent MYC+ centrocytes redifferentiating into centroblasts; CD30+ EF B cells represent active, proliferating memory B cells. HRS cells shared typical transcriptome patterns with CD30+ B cells, suggesting that they originate from these lymphocytes or acquire their characteristic features during lymphomagenesis. By comparing HRS to normal CD30+ B cells we redefined aberrant and disease-specific features of HRS cells. A remarkable downregulation of genes regulating genomic stability and cytokinesis in HRS cells may explain their genomic instability and multinuclearity.
The impact of genetics on future drug discovery in schizophrenia.

PubMed

Matsumoto, Mitsuyuki; Walton, Noah M; Yamada, Hiroshi; Kondo, Yuji; Marek, Gerard J; Tajinda, Katsunori

2017-07-01

Failures of investigational new drugs (INDs) for schizophrenia have left huge unmet medical needs for patients. Given the recent lackluster results, it is imperative that new drug discovery approaches (and resultant drug candidates) target pathophysiological alterations that are shared in specific, stratified patient populations that are selected based on pre-identified biological signatures. One path to implementing this paradigm is achievable by leveraging recent advances in genetic information and technologies. Genome-wide exome sequencing and meta-analysis of single nucleotide polymorphism (SNP)-based association studies have already revealed rare deleterious variants and SNPs in patient populations. Areas covered: Herein, the authors review the impact that genetics have on the future of schizophrenia drug discovery. The high polygenicity of schizophrenia strongly indicates that this disease is biologically heterogeneous so the identification of unique subgroups (by patient stratification) is becoming increasingly necessary for future investigational new drugs. Expert opinion: The authors propose a pathophysiology-based stratification of genetically-defined subgroups that share deficits in particular biological pathways. Existing tools, including lower-cost genomic sequencing and advanced gene-editing technology render this strategy ever more feasible. Genetically complex psychiatric disorders such as schizophrenia may also benefit from synergistic research with simpler monogenic disorders that share perturbations in similar biological pathways.
Origin and evolution of group XI secretory phospholipase A2 from flax (Linum usitatissimum) based on phylogenetic analysis of conserved domains.

PubMed

Gupta, Payal; Saini, Raman; Dash, Prasanta K

2017-07-01

Phospholipase A 2 (PLA 2 ) belongs to class of lipolytic enzymes (EC 3.1.1.4). Lysophosphatidic acid (LPA) and free fatty acids (FFAs) are the products of PLA 2 catalyzed hydrolysis of phosphoglycerides at sn-2 position. LPA and FFA that act as second mediators involved in the development and maturation of plants and animals. Mining of flax genome identified two phospholipase A 2 encoding genes, viz., LusPLA 2 I and LusPLA 2 II (Linum usitatissimum secretory phospholipase A 2 ). Molecular simulation of LusPLA 2 s with already characterized plant sPLA 2 s revealed the presence of conserved motifs and signature domains necessary to classify them as secretory phospholipase A 2 . Phylogenetic analysis of flax sPLA 2 with representative sPLA 2 s from other organisms revealed that they evolved rapidly via gene duplication/deletion events and shares a common ancestor. Our study is the first report of detailed phylogenetic analysis for secretory phospholipase A 2 in flax. Comparative genomic analysis of two LusPLA 2 s with earlier reported plant sPLA 2 s, based on their gene architectures, sequence similarities, and domain structures are presented elucidating the uniqueness of flax sPLA 2 .
The Miami Barrel: An Innovation in Forensic Firearms Identification

ERIC Educational Resources Information Center

Fadul, Thomas G., Jr.

2009-01-01

The scientific foundation in firearm and tool mark identification is that each firearm/tool produces a signature of identification (striation/impression) that is unique to that firearm/tool, and through examining the individual striations/impressions; the signature can be positively identified to the firearm/tool that produced it. There is no set…
Noble Gas Isotopic Signatures and X-Ray and Electron Diffraction Characteristics of Tagish Lake Carbonaceous Chondrite

NASA Technical Reports Server (NTRS)

Nakamura, T.; Noguchi, T.; Zolensky, M. E.; Takaoka, N.

2001-01-01

Noble gas isotopic signatures and X-ray and electron diffraction characteristics of Tagish Lake indicate that it is a unique carbonaceous chondrite rich in saponite, Fe-Mg-Ca carbonate, primordial noble gases, and presolar grains. Additional information is contained in the original extended abstract.
Genome-wide DNA methylation patterns of bovine blastocysts derived from in vivo embryos subjected to in vitro culture before, during or after embryonic genome activation.

PubMed

Salilew-Wondim, Dessie; Saeed-Zidane, Mohammed; Hoelker, Michael; Gebremedhn, Samuel; Poirier, Mikhaël; Pandey, Hari Om; Tholen, Ernst; Neuhoff, Christiane; Held, Eva; Besenfelder, Urban; Havlicek, Vita; Rings, Franca; Fournier, Eric; Gagné, Dominic; Sirard, Marc-André; Robert, Claude; Gad, Ahmed; Schellander, Karl; Tesfaye, Dawit

2018-06-01

Aberrant DNA methylation patterns of genes required for development are common in in vitro produced embryos. In this regard, we previously identified altered DNA methylation patterns of in vivo developed blastocysts from embryos which spent different stages of development in vitro, indicating carryover effects of suboptimal culture conditions on epigenetic signatures of preimplantation embryos. However, epigenetic responses of in vivo originated embryos to suboptimal culture conditions are not fully understood. Therefore, here we investigated DNA methylation patterns of in vivo derived bovine embryos subjected to in vitro culture condition before, during or after major embryonic genome activation (EGA). For this, in vivo produced 2-, 8- and 16-cell stage embryos were cultured in vitro until the blastocyst stage and blastocysts were used for genome-wide DNA methylation analysis. The 2- and 8-cell flushed embryo groups showed lower blastocyst rates compared to the 16-cell flush group. This was further accompanied by increased numbers of differentially methylated genomic regions (DMRs) in blastocysts of the 2- and 8-cell flush groups compared to the complete in vivo control ones. Moreover, 1623 genomic loci including imprinted genes were hypermethylated in blastocyst of 2-, 8- and 16-cell flushed groups, indicating the presence of genomic regions which are sensitive to the in vitro culture at any stage of embryonic development. Furthermore, hypermethylated genomic loci outnumbered hypomethylated ones in blastocysts of 2- and 16-cell flushed embryo groups, but the opposite occurred in the 8-cell group. Moreover, DMRs which were unique to blastocysts of the 2-cell flushed group and inversely correlated with corresponding mRNA expression levels were involved in plasma membrane lactate transport, amino acid transport and phosphorus metabolic processes, whereas DMRs which were specific to the 8-cell group and inversely correlated with corresponding mRNA expression levels were involved in several biological processes including regulation of fatty acids and steroid biosynthesis processes. In vivo embryos subjected to in vitro culture before and during major embryonic genome activation (EGA) are prone to changes in DNA methylation marks and exposure of in vivo embryos to in vitro culture during the time of EGA increased hypomethylated genomic loci in blastocysts.
Microbes: Agents of Isotopic Change

NASA Astrophysics Data System (ADS)

Fogel, M. L.

2012-12-01

Microbes drive many of the important oxidation and reduction reactions on Earth; digest almost all forms of organic matter; and can serve as both primary and secondary producers. Because of their versatile biochemistry and physiology, they impart unique isotopic signatures to organic and inorganic materials, which have proven to be key measurements for understanding elemental cycling now and throughout Earth's history. Understanding microbial isotope fractionations in laboratory experiments has been important for interpreting isotopic patterns measured in natural settings. In fact, the pairing of simple experiment with natural observation has been the pathway for interpreting the fingerprint of microbial processes in ancient sediments and rocks. Examples of how key experiments have explained stable isotope fractionations by microbes and advanced the field of microbial ecology will be presented. Learning the isotopic signatures of Earth's microbes is a valuable exercise for predicting what isotopic signatures could be displayed by possible extant or extinct extraterrestrial life. Given the potential for discovery on Mars, Enceladus, and other solar system bodies, new methods and techniques for pinpointing what is unique about microbial isotope signatures is particularly relevant.
Isolation, propagation, genome analysis and epidemiology of HKU1 betacoronaviruses

PubMed Central

Shrivastava, Susmita; Berglund, Andrew; Qian, Zhaohui; Góes, Luiz Gustavo Bentim; Halpin, Rebecca A.; Fedorova, Nadia; Ransier, Amy; Weston, Philip A.; Durigon, Edison Luiz; Jerez, José Antonio; Robinson, Christine C.; Town, Christopher D.; Holmes, Kathryn V.

2014-01-01

From 1 January 2009 to 31 May 2013, 15 287 respiratory specimens submitted to the Clinical Virology Laboratory at the Children’s Hospital Colorado were tested for human coronavirus RNA by reverse transcription-PCR. Human coronaviruses HKU1, OC43, 229E and NL63 co-circulated during each of the respiratory seasons but with significant year-to-year variability, and cumulatively accounted for 7.4–15.6 % of all samples tested during the months of peak activity. A total of 79 (0.5 % prevalence) specimens were positive for human betacoronavirus HKU1 RNA. Genotypes HKU1 A and B were both isolated from clinical specimens and propagated on primary human tracheal–bronchial epithelial cells cultured at the air–liquid interface and were neutralized in vitro by human intravenous immunoglobulin and by polyclonal rabbit antibodies to the spike glycoprotein of HKU1. Phylogenetic analysis of the deduced amino acid sequences of seven full-length genomes of Colorado HKU1 viruses and the spike glycoproteins from four additional HKU1 viruses from Colorado and three from Brazil demonstrated remarkable conservation of these sequences with genotypes circulating in Hong Kong and France. Within genotype A, all but one of the Colorado HKU1 sequences formed a unique subclade defined by three amino acid substitutions (W197F, F613Y and S752F) in the spike glycoprotein and exhibited a unique signature in the acidic tandem repeat in the N-terminal region of the nsp3 subdomain. Elucidating the function of and mechanisms responsible for the formation of these varying tandem repeats will increase our understanding of the replication process and pathogenicity of HKU1 and potentially of other coronaviruses. PMID:24394697
The Italian genome reflects the history of Europe and the Mediterranean basin

PubMed Central

Fiorito, Giovanni; Di Gaetano, Cornelia; Guarrera, Simonetta; Rosa, Fabio; Feldman, Marcus W; Piazza, Alberto; Matullo, Giuseppe

2016-01-01

Recent scientific literature has highlighted the relevance of population genetic studies both for disease association mapping in admixed populations and for understanding the history of human migrations. Deeper insight into the history of the Italian population is critical for understanding the peopling of Europe. Because of its crucial position at the centre of the Mediterranean basin, the Italian peninsula has experienced a complex history of colonization and migration whose genetic signatures are still present in contemporary Italians. In this study, we investigated genomic variation in the Italian population using 2.5 million single-nucleotide polymorphisms in a sample of more than 300 unrelated Italian subjects with well-defined geographical origins. We combined several analytical approaches to interpret genome-wide data on 1272 individuals from European, Middle Eastern, and North African populations. We detected three major ancestral components contributing different proportions across the Italian peninsula, and signatures of continuous gene flow within Italy, which have produced remarkable genetic variability among contemporary Italians. In addition, we have extracted novel details about the Italian population's ancestry, identifying the genetic signatures of major historical events in Europe and the Mediterranean basin from the Neolithic (e.g., peopling of Sardinia) to recent times (e.g., ‘barbarian invasion' of Northern and Central Italy). These results are valuable for further genetic, epidemiological and forensic studies in Italy and in Europe. PMID:26554880
Transcriptomes Reveal Genetic Signatures Underlying Physiological Variations Imposed by Different Fermentation Conditions in Lactobacillus plantarum

PubMed Central

Bongers, Roger S.; van Bokhorst-van de Veen, Hermien; Wiersma, Anne; Overmars, Lex; Marco, Maria L.; Kleerebezem, Michiel

2012-01-01

Lactic acid bacteria (LAB) are utilized widely for the fermentation of foods. In the current post-genomic era, tools have been developed that explore genetic diversity among LAB strains aiming to link these variations to differential phenotypes observed in the strains investigated. However, these genotype-phenotype matching approaches fail to assess the role of conserved genes in the determination of physiological characteristics of cultures by environmental conditions. This manuscript describes a complementary approach in which Lactobacillus plantarum WCFS1 was fermented under a variety of conditions that differ in temperature, pH, as well as NaCl, amino acid, and O2 levels. Samples derived from these fermentations were analyzed by full-genome transcriptomics, paralleled by the assessment of physiological characteristics, e.g., maximum growth rate, yield, and organic acid profiles. A data-storage and -mining suite designated FermDB was constructed and exploited to identify correlations between fermentation conditions and industrially relevant physiological characteristics of L. plantarum, as well as the associated transcriptome signatures. Finally, integration of the specific fermentation variables with the transcriptomes enabled the reconstruction of the gene-regulatory networks involved. The fermentation-genomics platform presented here is a valuable complementary approach to earlier described genotype-phenotype matching strategies which allows the identification of transcriptome signatures underlying physiological variations imposed by different fermentation conditions. PMID:22802930
Genes involved in convergent evolution of eusociality in bees

PubMed Central

Woodard, S. Hollis; Fischman, Brielle J.; Venkat, Aarti; Hudson, Matt E.; Varala, Kranthi; Cameron, Sydney A.; Clark, Andrew G.; Robinson, Gene E.

2011-01-01

Eusociality has arisen independently at least 11 times in insects. Despite this convergence, there are striking differences among eusocial lifestyles, ranging from species living in small colonies with overt conflict over reproduction to species in which colonies contain hundreds of thousands of highly specialized sterile workers produced by one or a few queens. Although the evolution of eusociality has been intensively studied, the genetic changes involved in the evolution of eusociality are relatively unknown. We examined patterns of molecular evolution across three independent origins of eusociality by sequencing transcriptomes of nine socially diverse bee species and combining these data with genome sequence from the honey bee Apis mellifera to generate orthologous sequence alignments for 3,647 genes. We found a shared set of 212 genes with a molecular signature of accelerated evolution across all eusocial lineages studied, as well as unique sets of 173 and 218 genes with a signature of accelerated evolution specific to either highly or primitively eusocial lineages, respectively. These results demonstrate that convergent evolution can involve a mosaic pattern of molecular changes in both shared and lineage-specific sets of genes. Genes involved in signal transduction, gland development, and carbohydrate metabolism are among the most prominent rapidly evolving genes in eusocial lineages. These findings provide a starting point for linking specific genetic changes to the evolution of eusociality. PMID:21482769
Rapid Molecular Identification of Pathogenic Yeasts by Pyrosequencing Analysis of 35 Nucleotides of Internal Transcribed Spacer 2 ▿

PubMed Central

Borman, Andrew M.; Linton, Christopher J.; Oliver, Debra; Palmer, Michael D.; Szekely, Adrien; Johnson, Elizabeth M.

2010-01-01

Rapid identification of yeast species isolates from clinical samples is particularly important given their innately variable antifungal susceptibility profiles. Here, we have evaluated the utility of pyrosequencing analysis of a portion of the internal transcribed spacer 2 region (ITS2) for identification of pathogenic yeasts. A total of 477 clinical isolates encompassing 43 different fungal species were subjected to pyrosequencing analysis in a strictly blinded study. The molecular identifications produced by pyrosequencing were compared with those obtained using conventional biochemical tests (AUXACOLOR2) and following PCR amplification and sequencing of the D1-D2 portion of the nuclear 28S large rRNA gene. More than 98% (469/477) of isolates encompassing 40 of the 43 fungal species tested were correctly identified by pyrosequencing of only 35 bp of ITS2. Moreover, BLAST searches of the public synchronized databases with the ITS2 pyrosequencing signature sequences revealed that there was only minimal sequence redundancy in the ITS2 under analysis. In all cases, the pyrosequencing signature sequences were unique to the yeast species (or species complex) under investigation. Finally, when pyrosequencing was combined with the Whatman FTA paper technology for the rapid extraction of fungal genomic DNA, molecular identification could be accomplished within 6 h from the time of starting from pure cultures. PMID:20702674
Utilization of Genomic Signatures to Direct Use of Primary Chemotherapy in Early Stage Breast Cancer

DTIC Science & Technology

2012-07-01

those seen with standard dose AC and TC  By chemotherapy:  AC: 11 of 19 with grade 3 or 4 adverse events (all neutropenia except one treatment...unrelated PE)  TC: 8 of 20 with grade 3 or 4 adverse events (all neutropenia except one docetaxel reaction)  By arm:  Genomically-guided: 14
Global biogeography of SAR11 marine bacteria

PubMed Central

Brown, Mark V; Lauro, Federico M; DeMaere, Matthew Z; Muir, Les; Wilkins, David; Thomas, Torsten; Riddle, Martin J; Fuhrman, Jed A; Andrews-Pfannkoch, Cynthia; Hoffman, Jeffrey M; McQuaid, Jeffrey B; Allen, Andrew; Rintoul, Stephen R; Cavicchioli, Ricardo

2012-01-01

The ubiquitous SAR11 bacterial clade is the most abundant type of organism in the world's oceans, but the reasons for its success are not fully elucidated. We analysed 128 surface marine metagenomes, including 37 new Antarctic metagenomes. The large size of the data set enabled internal transcribed spacer (ITS) regions to be obtained from the Southern polar region, enabling the first global characterization of the distribution of SAR11, from waters spanning temperatures −2 to 30°C. Our data show a stable co-occurrence of phylotypes within both ‘tropical' (>20°C) and ‘polar' (<10°C) biomes, highlighting ecological niche differentiation between major SAR11 subgroups. All phylotypes display transitions in abundance that are strongly correlated with temperature and latitude. By assembling SAR11 genomes from Antarctic metagenome data, we identified specific genes, biases in gene functions and signatures of positive selection in the genomes of the polar SAR11—genomic signatures of adaptive radiation. Our data demonstrate the importance of adaptive radiation in the organism's ability to proliferate throughout the world's oceans, and describe genomic traits characteristic of different phylotypes in specific marine biomes. PMID:22806143
Mutational signature analysis identifies MUTYH deficiency in colorectal cancers and adrenocortical carcinomas: Mutational signature associated with MUTYH deficiency in cancers

DOE Office of Scientific and Technical Information (OSTI.GOV)

Pilati, Camilla; Shinde, Jayendra; Alexandrov, Ludmil B.

Germline alterations in DNA repair genes are implicated in cancer predisposition and can result in characteristic mutational signatures. However, specific mutational signatures associated with base excision repair (BER) defects remain to be characterized. Here, by analysing a series of colorectal cancers (CRCs) using exome sequencing, we identified a particular spectrum of somatic mutations characterized by an enrichment of C > A transversions in NpCpA or NpCpT contexts in three tumours from a MUTYH-associated polyposis (MAP) patient and in two cases harbouring pathogenic germline MUTYH mutations. In two series of adrenocortical carcinomas (ACCs), we identified four tumours with a similar signaturemore » also presenting germline MUTYH mutations. Altogether, these findings demonstrate that MUTYH inactivation results in a particular mutational signature, which may serve as a useful marker of BER-related genomic instability in new cancer types.« less
Mutational signature analysis identifies MUTYH deficiency in colorectal cancers and adrenocortical carcinomas: Mutational signature associated with MUTYH deficiency in cancers

DOE PAGES

Pilati, Camilla; Shinde, Jayendra; Alexandrov, Ludmil B.; ...

2017-03-29

Germline alterations in DNA repair genes are implicated in cancer predisposition and can result in characteristic mutational signatures. However, specific mutational signatures associated with base excision repair (BER) defects remain to be characterized. Here, by analysing a series of colorectal cancers (CRCs) using exome sequencing, we identified a particular spectrum of somatic mutations characterized by an enrichment of C > A transversions in NpCpA or NpCpT contexts in three tumours from a MUTYH-associated polyposis (MAP) patient and in two cases harbouring pathogenic germline MUTYH mutations. In two series of adrenocortical carcinomas (ACCs), we identified four tumours with a similar signaturemore » also presenting germline MUTYH mutations. Altogether, these findings demonstrate that MUTYH inactivation results in a particular mutational signature, which may serve as a useful marker of BER-related genomic instability in new cancer types.« less
Distinct microbiological signatures associated with triple negative breast cancer.

PubMed

Banerjee, Sagarika; Wei, Zhi; Tan, Fei; Peck, Kristen N; Shih, Natalie; Feldman, Michael; Rebbeck, Timothy R; Alwine, James C; Robertson, Erle S

2015-10-15

Infectious agents are the third highest human cancer risk factor and may have a greater role in the origin and/or progression of cancers, and related pathogenesis. Thus, knowing the specific viruses and microbial agents associated with a cancer type may provide insights into cause, diagnosis and treatment. We utilized a pan-pathogen array technology to identify the microbial signatures associated with triple negative breast cancer (TNBC). This technology detects low copy number and fragmented genomes extracted from formalin-fixed paraffin embedded archival tissues. The results, validated by PCR and sequencing, define a microbial signature present in TNBC tissue which was underrepresented in normal tissue. Hierarchical clustering analysis displayed two broad microbial signatures, one prevalent in bacteria and parasites and one prevalent in viruses. These signatures demonstrate a new paradigm in our understanding of the link between microorganisms and cancer, as causative or commensal in the tumor microenvironment and provide new diagnostic potential.
ANME-2D Archaea Catalyze Methane Oxidation in Deep Subsurface Sediments Independent of Nitrate Reduction

NASA Astrophysics Data System (ADS)

Hernsdorf, A. W.; Amano, Y.; Suzuki, Y.; Ise, K.; Thomas, B. C.; Banfield, J. F.

2015-12-01

Terrestrial sediments are an important global reservoir for methane. Microorganisms in the deep subsurface play a critical role in the methane cycle, yet much remains to be learned about their diversity and metabolisms. To provide more comprehensive insight into the microbiology of the methane cycle in the deep subsurface, we conducted a genome-resolved study of samples collected from the Horonobe Underground Research Laboratory (HURL), Japan. Groundwater samples were obtained from three boreholes from a depth range of between 140 m and 250 m in two consecutive years. Groundwater was filtered and metagenomic DNA extracted and sequenced, and the sequence data assembled. Based on the sequences of phylogenetically informative genes on the assembled fragments, we detected a high degree of overlap in community composition across a vertical transect within one borehole at the two sampling times. However, there was comparatively little similarity observed among communities across boreholes. Spatial and temporal abundance patterns were used in combination with tetranucleotide signatures of assembled genome fragments to bin the data and reconstruct over 200 unique draft genomes, of which 137 are considered to be of high quality (>90% complete). The deepest samples from one borehole were highly dominated by an archaeon identified as ANME-2D; this organism was also present at lower abundance in all other samples from that borehole. Also abundant in these microbial communities were novel members of the Gammaproteobacteria, Saccharibacteria (TM7) and Tenericute phyla. Notably, a ~2 Mbp draft genome for the ANME-2D archaeon was reconstructed. As expected, the genome encodes all of the genes predicted to be involved in the reverse methanogenesis pathway. In contrast with the previously reported ANME2-D genome, the HURL ANME-2D genome lacks the capacity to reduce nitrate. However, we identified many multiheme cytochromes with closest similarity to those of the known Fe-reducing/oxidizing archaeon Ferroglobus placidus. Thus, we suggest that ANME2-D may couple methane oxidation to reduction of ferric iron minerals in the sediment and may be generally important as a link between the iron and methane cycles in deep subsurface environments. Such information has important implications for modeling the global carbon cycle.
A Proteomic Signature of Dormancy in the Actinobacterium Micrococcus luteus.

PubMed

Mali, Sujina; Mitchell, Morgan; Havis, Spencer; Bodunrin, Abiodun; Rangel, Jonathan; Olson, Gabriella; Widger, William R; Bark, Steven J

2017-07-15

Dormancy is a protective state in which diverse bacteria, including Mycobacterium tuberculosis , Staphylococcus aureus , Treponema pallidum (syphilis), and Borrelia burgdorferi (Lyme disease), curtail metabolic activity to survive external stresses, including antibiotics. Evidence suggests dormancy consists of a continuum of interrelated states, including viable but nonculturable (VBNC) and persistence states. VBNC and persistence contribute to antibiotic tolerance, reemergence from latent infections, and even quorum sensing and biofilm formation. Previous studies indicate that the protein mechanisms regulating persistence and VBNC states are not well understood. We have queried the VBNC state of Micrococcus luteus NCTC 2665 (MI-2665) by quantitative proteomics combining gel electrophoresis, high-performance liquid chromatography, and tandem mass spectrometry to elucidate some of these mechanisms. MI-2665 is a nonpathogenic actinobacterium containing a small (2.5-Mb), high-GC-content genome which exhibits a well-defined VBNC state induced by nutrient deprivation. The MI-2665 VBNC state demonstrated a loss of protein diversity accompanied by increased levels of 18 proteins that are conserved across actinobacteria, 14 of which have not been previously identified in VNBC. These proteins implicate an anaplerotic strategy in the transition to VBNC, including changes in the glyoxylate shunt, redox and amino acid metabolism, and ribosomal regulatory processes. Our data suggest that MI-2665 is a viable model for dissecting the protein mechanisms underlying the VBNC stress response and provide the first protein-level signature of this state. We expect that this protein signature will enable future studies deciphering the protein mechanisms of dormancy and identify novel therapeutic strategies effective against antibiotic-tolerant bacterial infections. IMPORTANCE Dormancy is a protective state enabling bacteria to survive antibiotics, starvation, and the immune system. Dormancy is comprised of different states, including persistent and viable but nonculturable (VBNC) states that contribute to the spread of bacterial infections. Therefore, it is imperative to identify how bacteria utilize these different dormancy states to survive antibiotic treatment. The objective of our research is to eliminate dormancy as a route to antibiotic tolerance by understanding the proteins that control dormancy in Micrococcus luteus NCTC 2665. This bacterium has unique advantages for studying dormancy, including a small genome and a well-defined and reproducible VBNC state. Our experiments implicate four previously identified and 14 novel proteins upregulated in VBNC that may regulate this critical survival mechanism. Copyright © 2017 American Society for Microbiology.

By their genes ye shall know them: genomic signatures of predatory bacteria

PubMed Central

Pasternak, Zohar; Pietrokovski, Shmuel; Rotem, Or; Gophna, Uri; Lurie-Weinberger, Mor N; Jurkevitch, Edouard

2013-01-01

Predatory bacteria are taxonomically disparate, exhibit diverse predatory strategies and are widely distributed in varied environments. To date, their predatory phenotypes cannot be discerned in genome sequence data thereby limiting our understanding of bacterial predation, and of its impact in nature. Here, we define the ‘predatome,' that is, sets of protein families that reflect the phenotypes of predatory bacteria. The proteomes of all sequenced 11 predatory bacteria, including two de novo sequenced genomes, and 19 non-predatory bacteria from across the phylogenetic and ecological landscapes were compared. Protein families discriminating between the two groups were identified and quantified, demonstrating that differences in the proteomes of predatory and non-predatory bacteria are large and significant. This analysis allows predictions to be made, as we show by confirming from genome data an over-looked bacterial predator. The predatome exhibits deficiencies in riboflavin and amino acids biosynthesis, suggesting that predators obtain them from their prey. In contrast, these genomes are highly enriched in adhesins, proteases and particular metabolic proteins, used for binding to, processing and consuming prey, respectively. Strikingly, predators and non-predators differ in isoprenoid biosynthesis: predators use the mevalonate pathway, whereas non-predators, like almost all bacteria, use the DOXP pathway. By defining predatory signatures in bacterial genomes, the predatory potential they encode can be uncovered, filling an essential gap for measuring bacterial predation in nature. Moreover, we suggest that full-genome proteomic comparisons are applicable to other ecological interactions between microbes, and provide a convenient and rational tool for the functional classification of bacteria. PMID:23190728
Genomic Analysis Reveals Hypoxia Adaptation in the Tibetan Mastiff by Introgression of the Gray Wolf from the Tibetan Plateau.

PubMed

Miao, Benpeng; Wang, Zhen; Li, Yixue

2017-03-01

The Tibetan Mastiff (TM), a native of the Tibetan Plateau, has quickly adapted to the extreme highland environment. Recently, the impact of positive selection on the TM genome was studied and potential hypoxia-adaptive genes were identified. However, the origin of the adaptive variants remains unknown. In this study, we investigated the signature of genetic introgression in the adaptation of TMs with dog and wolf genomic data from different altitudes in close geographic proximity. On a genome-wide scale, the TM was much more closely related to other dogs than wolves. However, using the 'ABBA/BABA' test, we identified genomic regions from the TM that possibly introgressed from Tibetan gray wolf. Several of the regions, including the EPAS1 and HBB loci, also showed the dominant signature of selective sweeps in the TM genome. We validated the introgression of the two loci by excluding the possibility of convergent evolution and ancestral polymorphisms and examined the haplotypes of all available canid genomes. The estimated time of introgression based on a non-coding region of the EPAS1 locus mostly overlapped with the Paleolithic era. Our results demonstrated that the introgression of hypoxia adaptive genes in wolves from the highland played an important role for dogs living in hypoxic environments, which indicated that domestic animals could acquire local adaptation quickly by secondary contact with their wild relatives. © The Author 2016. Published by Oxford University Press on behalf of the Society for Molecular Biology and Evolution. All rights reserved. For permissions, please e-mail: journals.permissions@oup.com.
A comparative analysis of whole genome sequencing of esophageal adenocarcinoma pre- and post-chemotherapy

PubMed Central

Noorani, Ayesha; Lynch, Andy G.; Achilleos, Achilleas; Eldridge, Matthew; Bower, Lawrence; Weaver, Jamie M.J.; Crawte, Jason; Ong, Chin-Ann; Shannon, Nicholas; MacRae, Shona; Grehan, Nicola; Nutzinger, Barbara; O'Donovan, Maria; Hardwick, Richard; Tavaré, Simon; Fitzgerald, Rebecca C.

2017-01-01

The scientific community has avoided using tissue samples from patients that have been exposed to systemic chemotherapy to infer the genomic landscape of a given cancer. Esophageal adenocarcinoma is a heterogeneous, chemoresistant tumor for which the availability and size of pretreatment endoscopic samples are limiting. This study compares whole-genome sequencing data obtained from chemo-naive and chemo-treated samples. The quality of whole-genomic sequencing data is comparable across all samples regardless of chemotherapy status. Inclusion of samples collected post-chemotherapy increased the proportion of late-stage tumors. When comparing matched pre- and post-chemotherapy samples from 10 cases, the mutational signatures, copy number, and SNV mutational profiles reflect the expected heterogeneity in this disease. Analysis of SNVs in relation to allele-specific copy-number changes pinpoints the common ancestor to a point prior to chemotherapy. For cases in which pre- and post-chemotherapy samples do show substantial differences, the timing of the divergence is near-synchronous with endoreduplication. Comparison across a large prospective cohort (62 treatment-naive, 58 chemotherapy-treated samples) reveals no significant differences in the overall mutation rate, mutation signatures, specific recurrent point mutations, or copy-number events in respect to chemotherapy status. In conclusion, whole-genome sequencing of samples obtained following neoadjuvant chemotherapy is representative of the genomic landscape of esophageal adenocarcinoma. Excluding these samples reduces the material available for cataloging and introduces a bias toward the earlier stages of cancer. PMID:28465312
Genome-wide introgression among distantly related Heliconius butterfly species.

PubMed

Zhang, Wei; Dasmahapatra, Kanchon K; Mallet, James; Moreira, Gilson R P; Kronforst, Marcus R

2016-02-27

Although hybridization is thought to be relatively rare in animals, the raw genetic material introduced via introgression may play an important role in fueling adaptation and adaptive radiation. The butterfly genus Heliconius is an excellent system to study hybridization and introgression but most studies have focused on closely related species such as H. cydno and H. melpomene. Here we characterize genome-wide patterns of introgression between H. besckei, the only species with a red and yellow banded 'postman' wing pattern in the tiger-striped silvaniform clade, and co-mimetic H. melpomene nanna. We find a pronounced signature of putative introgression from H. melpomene into H. besckei in the genomic region upstream of the gene optix, known to control red wing patterning, suggesting adaptive introgression of wing pattern mimicry between these two distantly related species. At least 39 additional genomic regions show signals of introgression as strong or stronger than this mimicry locus. Gene flow has been on-going, with evidence of gene exchange at multiple time points, and bidirectional, moving from the melpomene to the silvaniform clade and vice versa. The history of gene exchange has also been complex, with contributions from multiple silvaniform species in addition to H. besckei. We also detect a signature of ancient introgression of the entire Z chromosome between the silvaniform and melpomene/cydno clades. Our study provides a genome-wide portrait of introgression between distantly related butterfly species. We further propose a comprehensive and efficient workflow for gene flow identification in genomic data sets.
Characterization of a Genomic Signature of Pregnancy in the Breast

PubMed Central

Belitskaya-Lévy, Ilana; Zeleniuch-Jacquotte, Anne; Russo, Jose; Russo, Irma H.; Bordás, Pal; Åhman, Janet; Afanasyeva, Yelena; Johansson, Robert; Lenner, Per; Li, Xiaochun; de Cicco, Ricardo López; Peri, Suraj; Ross, Eric; Russo, Patricia A.; Santucci-Pereira, Julia; Sheriff, Fathima S.; Slifker, Michael; Hallmans, Göran; Toniolo, Paolo; Arslan, Alan A.

2012-01-01

The objective of the current study was to comprehensively compare the genomic profiles in the breast of parous and nulliparous postmenopausal women to identify genes that permanently change their expression following pregnancy. The study was designed as a two-phase approach. In the discovery phase, we compared breast genomic profiles of 37 parous with 18 nulliparous postmenopausal women. In the validation phase, confirmation of the genomic patterns observed in the discovery phase was sought in an independent set of 30 parous and 22 nulliparous postmenopausal women. RNA was hybridized to Affymetrix HG_U133 Plus 2.0 oligonucleotide arrays containing probes to 54,675 transcripts; scanned and the images analyzed using Affymetrix GCOS software. Surrogate variable analysis, logistic regression and significance analysis for microarrays were used to identify statistically significant differences in expression of genes. The False Discovery Rate (FDR) approach was used to control for multiple comparisons. We found that 208 genes (305 probe sets) were differentially expressed between parous and nulliparous women in both discovery and validation phases of the study at a FDR of 10% and with at least a 1.25-fold change. These genes are involved in regulation of transcription, centrosome organization, RNA splicing, cell cycle control, adhesion and differentiation. The results provide persuasive evidence that full-term pregnancy induces long-term genomic changes in the breast. The genomic signature of pregnancy could be used as an intermediate marker to assess potential chemopreventive interventions with hormones mimicking the effects of pregnancy for prevention of breast cancer. PMID:21622728
Discovery and Validation of Novel Expression Signature for Postcystectomy Recurrence in High-Risk Bladder Cancer

PubMed Central

Lam, Lucia L.; Ghadessi, Mercedeh; Erho, Nicholas; Vergara, Ismael A.; Alshalalfa, Mohammed; Buerki, Christine; Haddad, Zaid; Sierocinski, Thomas; Triche, Timothy J.; Skinner, Eila C.; Davicioni, Elai; Daneshmand, Siamak; Black, Peter C.

2014-01-01

Background Nearly half of muscle-invasive bladder cancer patients succumb to their disease following cystectomy. Selecting candidates for adjuvant therapy is currently based on clinical parameters with limited predictive power. This study aimed to develop and validate genomic-based signatures that can better identify patients at risk for recurrence than clinical models alone. Methods Transcriptome-wide expression profiles were generated using 1.4 million feature-arrays on archival tumors from 225 patients who underwent radical cystectomy and had muscle-invasive and/or node-positive bladder cancer. Genomic (GC) and clinical (CC) classifiers for predicting recurrence were developed on a discovery set (n = 133). Performances of GC, CC, an independent clinical nomogram (IBCNC), and genomic-clinicopathologic classifiers (G-CC, G-IBCNC) were assessed in the discovery and independent validation (n = 66) sets. GC was further validated on four external datasets (n = 341). Discrimination and prognostic abilities of classifiers were compared using area under receiver-operating characteristic curves (AUCs). All statistical tests were two-sided. Results A 15-feature GC was developed on the discovery set with area under curve (AUC) of 0.77 in the validation set. This was higher than individual clinical variables, IBCNC (AUC = 0.73), and comparable to CC (AUC = 0.78). Performance was improved upon combining GC with clinical nomograms (G-IBCNC, AUC = 0.82; G-CC, AUC = 0.86). G-CC high-risk patients had elevated recurrence probabilities (P < .001), with GC being the best predictor by multivariable analysis (P = .005). Genomic-clinicopathologic classifiers outperformed clinical nomograms by decision curve and reclassification analyses. GC performed the best in validation compared with seven prior signatures. GC markers remained prognostic across four independent datasets. Conclusions The validated genomic-based classifiers outperform clinical models for predicting postcystectomy bladder cancer recurrence. This may be used to better identify patients who need more aggressive management. PMID:25344601
Studying a Complex Tumor—Potential and Pitfalls

PubMed Central

Zheng, Siyuan; Chheda, Milan G.; Verhaak, Roel G.W.

2012-01-01

Glioblastoma multiforme (GBM) is a histopathologically heterogeneous disease with few treatment options. Therapy based on genomic alterations is rapidly gaining popularity because of the high response rate and high specificity. DNA copy number and exon sequencing studies of GBM samples have revealed recurrent genomic alterations in genes such as TP53, EGFR and IDH1 but to date this has not resulted in novel GBM therapies. Identification of expression subtypes have resulted in new insights such as the association between genomic abnormalities and expression signatures. This review describes the types of genomic studies that have been performed and that are underway, the most prominent results and the implications of genomic research for development of clinical treatment modalities. PMID:22290264
Literacy Course Priorities and Signature Aspects of Nine Elementary Initial Licensure Programs

ERIC Educational Resources Information Center

Lenski, Susan; Ganske, Kathy; Chambers, Sandy; Wold, Linda; Dobler, Elizabeth; Grisham, Dana L.; Scales, Roya; Smetana, Linda; Wolsey, Thomas Devere; Yoder, Karen K.; Young, Janet

2013-01-01

The purpose of this article is to describe the first part of a three-phase study to learn what makes an effective elementary literacy initial licensure program. The first step was to identify how nine programs prioritized research-based literacy practices and to identify each program's unique features, which we called "signature aspects." Findings…
A comprehensive study of the genomic differentiation between temperate Dent and Flint maize.

PubMed

Unterseer, Sandra; Pophaly, Saurabh D; Peis, Regina; Westermeier, Peter; Mayer, Manfred; Seidel, Michael A; Haberer, Georg; Mayer, Klaus F X; Ordas, Bernardo; Pausch, Hubert; Tellier, Aurélien; Bauer, Eva; Schön, Chris-Carolin

2016-07-08

Dent and Flint represent two major germplasm pools exploited in maize breeding. Several traits differentiate the two pools, like cold tolerance, early vigor, and flowering time. A comparative investigation of their genomic architecture relevant for quantitative trait expression has not been reported so far. Understanding the genomic differences between germplasm pools may contribute to a better understanding of the complementarity in heterotic patterns exploited in hybrid breeding and of mechanisms involved in adaptation to different environments. We perform whole-genome screens for signatures of selection specific to temperate Dent and Flint maize by comparing high-density genotyping data of 70 American and European Dent and 66 European Flint inbred lines. We find 2.2 % and 1.4 % of the genes are under selective pressure, respectively, and identify candidate genes associated with agronomic traits known to differ between the two pools. Taking flowering time as an example for the differentiation between Dent and Flint, we investigate candidate genes involved in the flowering network by phenotypic analyses in a Dent-Flint introgression library and find that the Flint haplotypes of the candidates promote earlier flowering. Within the flowering network, the majority of Flint candidates are associated with endogenous pathways in contrast to Dent candidate genes, which are mainly involved in response to environmental factors like light and photoperiod. The diversity patterns of the candidates in a unique panel of more than 900 individuals from 38 European landraces indicate a major contribution of landraces from France, Germany, and Spain to the candidate gene diversity of the Flint elite lines. In this study, we report the investigation of pool-specific differences between temperate Dent and Flint on a genome-wide scale. The identified candidate genes represent a promising source for the functional investigation of pool-specific haplotypes in different genetic backgrounds and for the evaluation of their potential for future crop improvement like the adaptation to specific environments.
Functional genomic characterization of virulence factors from necrotizing fasciitis-causing strains of Aeromonas hydrophila.

PubMed

Grim, Christopher J; Kozlova, Elena V; Ponnusamy, Duraisamy; Fitts, Eric C; Sha, Jian; Kirtley, Michelle L; van Lier, Christina J; Tiner, Bethany L; Erova, Tatiana E; Joseph, Sandeep J; Read, Timothy D; Shak, Joshua R; Joseph, Sam W; Singletary, Ed; Felland, Tracy; Baze, Wallace B; Horneman, Amy J; Chopra, Ashok K

2014-07-01

The genomes of 10 Aeromonas isolates identified and designated Aeromonas hydrophila WI, Riv3, and NF1 to NF4; A. dhakensis SSU; A. jandaei Riv2; and A. caviae NM22 and NM33 were sequenced and annotated. Isolates NF1 to NF4 were from a patient with necrotizing fasciitis (NF). Two environmental isolates (Riv2 and -3) were from the river water from which the NF patient acquired the infection. While isolates NF2 to NF4 were clonal, NF1 was genetically distinct. Outside the conserved core genomes of these 10 isolates, several unique genomic features were identified. The most virulent strains possessed one of the following four virulence factors or a combination of them: cytotoxic enterotoxin, exotoxin A, and type 3 and 6 secretion system effectors AexU and Hcp. In a septicemic-mouse model, SSU, NF1, and Riv2 were the most virulent, while NF2 was moderately virulent. These data correlated with high motility and biofilm formation by the former three isolates. Conversely, in a mouse model of intramuscular infection, NF2 was much more virulent than NF1. Isolates NF2, SSU, and Riv2 disseminated in high numbers from the muscular tissue to the visceral organs of mice, while NF1 reached the liver and spleen in relatively lower numbers on the basis of colony counting and tracking of bioluminescent strains in real time by in vivo imaging. Histopathologically, degeneration of myofibers with significant infiltration of polymorphonuclear cells due to the highly virulent strains was noted. Functional genomic analysis provided data that allowed us to correlate the highly infectious nature of Aeromonas pathotypes belonging to several different species with virulence signatures and their potential ability to cause NF. Copyright © 2014, American Society for Microbiology. All Rights Reserved.
Comparative analysis of the domestic cat genome reveals genetic signatures underlying feline biology and domestication.

PubMed

Montague, Michael J; Li, Gang; Gandolfi, Barbara; Khan, Razib; Aken, Bronwen L; Searle, Steven M J; Minx, Patrick; Hillier, LaDeana W; Koboldt, Daniel C; Davis, Brian W; Driscoll, Carlos A; Barr, Christina S; Blackistone, Kevin; Quilez, Javier; Lorente-Galdos, Belen; Marques-Bonet, Tomas; Alkan, Can; Thomas, Gregg W C; Hahn, Matthew W; Menotti-Raymond, Marilyn; O'Brien, Stephen J; Wilson, Richard K; Lyons, Leslie A; Murphy, William J; Warren, Wesley C

2014-12-02

Little is known about the genetic changes that distinguish domestic cat populations from their wild progenitors. Here we describe a high-quality domestic cat reference genome assembly and comparative inferences made with other cat breeds, wildcats, and other mammals. Based upon these comparisons, we identified positively selected genes enriched for genes involved in lipid metabolism that underpin adaptations to a hypercarnivorous diet. We also found positive selection signals within genes underlying sensory processes, especially those affecting vision and hearing in the carnivore lineage. We observed an evolutionary tradeoff between functional olfactory and vomeronasal receptor gene repertoires in the cat and dog genomes, with an expansion of the feline chemosensory system for detecting pheromones at the expense of odorant detection. Genomic regions harboring signatures of natural selection that distinguish domestic cats from their wild congeners are enriched in neural crest-related genes associated with behavior and reward in mouse models, as predicted by the domestication syndrome hypothesis. Our description of a previously unidentified allele for the gloving pigmentation pattern found in the Birman breed supports the hypothesis that cat breeds experienced strong selection on specific mutations drawn from random bred populations. Collectively, these findings provide insight into how the process of domestication altered the ancestral wildcat genome and build a resource for future disease mapping and phylogenomic studies across all members of the Felidae.
Comparative analysis of the domestic cat genome reveals genetic signatures underlying feline biology and domestication

PubMed Central

Li, Gang; Gandolfi, Barbara; Khan, Razib; Aken, Bronwen L.; Searle, Steven M. J.; Minx, Patrick; Hillier, LaDeana W.; Koboldt, Daniel C.; Davis, Brian W.; Driscoll, Carlos A.; Barr, Christina S.; Blackistone, Kevin; Quilez, Javier; Lorente-Galdos, Belen; Marques-Bonet, Tomas; Alkan, Can; Thomas, Gregg W. C.; Hahn, Matthew W.; Menotti-Raymond, Marilyn; O’Brien, Stephen J.; Wilson, Richard K.; Lyons, Leslie A.; Murphy, William J.; Warren, Wesley C.

2014-01-01

Little is known about the genetic changes that distinguish domestic cat populations from their wild progenitors. Here we describe a high-quality domestic cat reference genome assembly and comparative inferences made with other cat breeds, wildcats, and other mammals. Based upon these comparisons, we identified positively selected genes enriched for genes involved in lipid metabolism that underpin adaptations to a hypercarnivorous diet. We also found positive selection signals within genes underlying sensory processes, especially those affecting vision and hearing in the carnivore lineage. We observed an evolutionary tradeoff between functional olfactory and vomeronasal receptor gene repertoires in the cat and dog genomes, with an expansion of the feline chemosensory system for detecting pheromones at the expense of odorant detection. Genomic regions harboring signatures of natural selection that distinguish domestic cats from their wild congeners are enriched in neural crest-related genes associated with behavior and reward in mouse models, as predicted by the domestication syndrome hypothesis. Our description of a previously unidentified allele for the gloving pigmentation pattern found in the Birman breed supports the hypothesis that cat breeds experienced strong selection on specific mutations drawn from random bred populations. Collectively, these findings provide insight into how the process of domestication altered the ancestral wildcat genome and build a resource for future disease mapping and phylogenomic studies across all members of the Felidae. PMID:25385592
Evaluation of high throughput gene expression platforms using a genomic biomarker signature for prediction of skin sensitization.

PubMed

Forreryd, Andy; Johansson, Henrik; Albrekt, Ann-Sofie; Lindstedt, Malin

2014-05-16

Allergic contact dermatitis (ACD) develops upon exposure to certain chemical compounds termed skin sensitizers. To reduce the occurrence of skin sensitizers, chemicals are regularly screened for their capacity to induce sensitization. The recently developed Genomic Allergen Rapid Detection (GARD) assay is an in vitro alternative to animal testing for identification of skin sensitizers, classifying chemicals by evaluating transcriptional levels of a genomic biomarker signature. During assay development and biomarker identification, genome-wide expression analysis was applied using microarrays covering approximately 30,000 transcripts. However, the microarray platform suffers from drawbacks in terms of low sample throughput, high cost per sample and time consuming protocols and is a limiting factor for adaption of GARD into a routine assay for screening of potential sensitizers. With the purpose to simplify assay procedures, improve technical parameters and increase sample throughput, we assessed the performance of three high throughput gene expression platforms--nCounter®, BioMark HD™ and OpenArray®--and correlated their performance metrics against our previously generated microarray data. We measured the levels of 30 transcripts from the GARD biomarker signature across 48 samples. Detection sensitivity, reproducibility, correlations and overall structure of gene expression measurements were compared across platforms. Gene expression data from all of the evaluated platforms could be used to classify most of the sensitizers from non-sensitizers in the GARD assay. Results also showed high data quality and acceptable reproducibility for all platforms but only medium to poor correlations of expression measurements across platforms. In addition, evaluated platforms were superior to the microarray platform in terms of cost efficiency, simplicity of protocols and sample throughput. We evaluated the performance of three non-array based platforms using a limited set of transcripts from the GARD biomarker signature. We demonstrated that it was possible to achieve acceptable discriminatory power in terms of separation between sensitizers and non-sensitizers in the GARD assay while reducing assay costs, simplify assay procedures and increase sample throughput by using an alternative platform, providing a first step towards the goal to prepare GARD for formal validation and adaption of the assay for industrial screening of potential sensitizers.
Association of Distinct Mutational Signatures With Correlates of Increased Immune Activity in Pancreatic Ductal Adenocarcinoma.

PubMed

Connor, Ashton A; Denroche, Robert E; Jang, Gun Ho; Timms, Lee; Kalimuthu, Sangeetha N; Selander, Iris; McPherson, Treasa; Wilson, Gavin W; Chan-Seng-Yue, Michelle A; Borozan, Ivan; Ferretti, Vincent; Grant, Robert C; Lungu, Ilinca M; Costello, Eithne; Greenhalf, William; Palmer, Daniel; Ghaneh, Paula; Neoptolemos, John P; Buchler, Markus; Petersen, Gloria; Thayer, Sarah; Hollingsworth, Michael A; Sherker, Alana; Durocher, Daniel; Dhani, Neesha; Hedley, David; Serra, Stefano; Pollett, Aaron; Roehrl, Michael H A; Bavi, Prashant; Bartlett, John M S; Cleary, Sean; Wilson, Julie M; Alexandrov, Ludmil B; Moore, Malcolm; Wouters, Bradly G; McPherson, John D; Notta, Faiyaz; Stein, Lincoln D; Gallinger, Steven

2017-06-01

Outcomes for patients with pancreatic ductal adenocarcinoma (PDAC) remain poor. Advances in next-generation sequencing provide a route to therapeutic approaches, and integrating DNA and RNA analysis with clinicopathologic data may be a crucial step toward personalized treatment strategies for this disease. To classify PDAC according to distinct mutational processes, and explore their clinical significance. We performed a retrospective cohort study of resected PDAC, using cases collected between 2008 and 2015 as part of the International Cancer Genome Consortium. The discovery cohort comprised 160 PDAC cases from 154 patients (148 primary; 12 metastases) that underwent tumor enrichment prior to whole-genome and RNA sequencing. The replication cohort comprised 95 primary PDAC cases that underwent whole-genome sequencing and expression microarray on bulk biospecimens. Somatic mutations accumulate from sequence-specific processes creating signatures detectable by DNA sequencing. Using nonnegative matrix factorization, we measured the contribution of each signature to carcinogenesis, and used hierarchical clustering to subtype each cohort. We examined expression of antitumor immunity genes across subtypes to uncover biomarkers predictive of response to systemic therapies. The discovery cohort was 53% male (n = 79) and had a median age of 67 (interquartile range, 58-74) years. The replication cohort was 50% male (n = 48) and had a median age of 68 (interquartile range, 60-75) years. Five predominant mutational subtypes were identified that clustered PDAC into 4 major subtypes: age related, double-strand break repair, mismatch repair, and 1 with unknown etiology (signature 8). These were replicated and validated. Signatures were faithfully propagated from primaries to matched metastases, implying their stability during carcinogenesis. Twelve of 27 (45%) double-strand break repair cases lacked germline or somatic events in canonical homologous recombination genes-BRCA1, BRCA2, or PALB2. Double-strand break repair and mismatch repair subtypes were associated with increased expression of antitumor immunity, including activation of CD8-positive T lymphocytes (GZMA and PRF1) and overexpression of regulatory molecules (cytotoxic T-lymphocyte antigen 4, programmed cell death 1, and indolamine 2,3-dioxygenase 1), corresponding to higher frequency of somatic mutations and tumor-specific neoantigens. Signature-based subtyping may guide personalized therapy of PDAC in the context of biomarker-driven prospective trials.
Microbial genomic taxonomy

PubMed Central

2013-01-01

A need for a genomic species definition is emerging from several independent studies worldwide. In this commentary paper, we discuss recent studies on the genomic taxonomy of diverse microbial groups and a unified species definition based on genomics. Accordingly, strains from the same microbial species share >95% Average Amino Acid Identity (AAI) and Average Nucleotide Identity (ANI), >95% identity based on multiple alignment genes, <10 in Karlin genomic signature, and > 70% in silico Genome-to-Genome Hybridization similarity (GGDH). Species of the same genus will form monophyletic groups on the basis of 16S rRNA gene sequences, Multilocus Sequence Analysis (MLSA) and supertree analysis. In addition to the established requirements for species descriptions, we propose that new taxa descriptions should also include at least a draft genome sequence of the type strain in order to obtain a clear outlook on the genomic landscape of the novel microbe. The application of the new genomic species definition put forward here will allow researchers to use genome sequences to define simultaneously coherent phenotypic and genomic groups. PMID:24365132
Mutational Signatures in Cancer (MuSiCa): a web application to implement mutational signatures analysis in cancer samples.

PubMed

Díaz-Gay, Marcos; Vila-Casadesús, Maria; Franch-Expósito, Sebastià; Hernández-Illán, Eva; Lozano, Juan José; Castellví-Bel, Sergi

2018-06-14

Mutational signatures have been proved as a valuable pattern in somatic genomics, mainly regarding cancer, with a potential application as a biomarker in clinical practice. Up to now, several bioinformatic packages to address this topic have been developed in different languages/platforms. MutationalPatterns has arisen as the most efficient tool for the comparison with the signatures currently reported in the Catalogue of Somatic Mutations in Cancer (COSMIC) database. However, the analysis of mutational signatures is nowadays restricted to a small community of bioinformatic experts. In this work we present Mutational Signatures in Cancer (MuSiCa), a new web tool based on MutationalPatterns and built using the Shiny framework in R language. By means of a simple interface suited to non-specialized researchers, it provides a comprehensive analysis of the somatic mutational status of the supplied cancer samples. It permits characterizing the profile and burden of mutations, as well as quantifying COSMIC-reported mutational signatures. It also allows classifying samples according to the above signature contributions. MuSiCa is a helpful web application to characterize mutational signatures in cancer samples. It is accessible online at http://bioinfo.ciberehd.org/GPtoCRC/en/tools.html and source code is freely available at https://github.com/marcos-diazg/musica .
Development of Advanced Technologies for Complete Genomic and Proteomic Characterization of Quantized Human Tumor Cells

DTIC Science & Technology

2014-07-01

establishment of Glioblastoma ( GBM ) cell lines from GBM patient’s tumor samples and quantized cell populations of each of the parental GBM cell lines, we... GBM patients are now well established and from the basis of the molecular characterization of the tumor development and signatures presented by these...analysis of these quantized cell sub populations and have begun to assemble the protein signatures of GBM tumors underpinned by the comprehensive
Molecular characterization of circulating colorectal tumor cells defines genetic signatures for individualized cancer care.

PubMed

Kong, Say Li; Liu, Xingliang; Suhaimi, Nur-Afidah Mohamed; Koh, Kenneth Jia Hao; Hu, Min; Lee, Daniel Yoke San; Cima, Igor; Phyo, Wai Min; Lee, Esther Xing Wei; Tai, Joyce A; Foong, Yu Miin; Vo, Jess Honganh; Koh, Poh Koon; Zhang, Tong; Ying, Jackie Y; Lim, Bing; Tan, Min-Han; Hillmer, Axel M

2017-09-15

Studies on circulating tumor cells (CTCs) have largely focused on platform development and CTC enumeration rather than on the genomic characterization of CTCs. To address this, we performed targeted sequencing of CTCs of colorectal cancer patients and compared the mutations with the matched primary tumors. We collected preoperative blood and matched primary tumor samples from 48 colorectal cancer patients. CTCs were isolated using a label-free microfiltration device on a silicon microsieve. Upon whole genome amplification, we performed amplicon-based targeted sequencing on a panel of 39 druggable and frequently mutated genes on both CTCs and fresh-frozen tumor samples. We developed an analysis pipeline to minimize false-positive detection of somatic mutations in amplified DNA. In 60% of the CTC-enriched blood samples, we detected primary tumor matching mutations. We found a significant positive correlation between the allele frequencies of somatic mutations detected in CTCs and abnormal CEA serum level. Strikingly, we found driver mutations and amplifications in cancer and druggable genes such as APC, KRAS, TP53, ERBB3 , FBXW7 and ERBB2 . In addition, we found that CTCs carried mutation signatures that resembled the signatures of their primary tumors. Cumulatively, our study defined genetic signatures and somatic mutation frequency of colorectal CTCs. The identification of druggable mutations in CTCs of preoperative colorectal cancer patients could lead to more timely and focused therapeutic interventions.
Molecular characterization of circulating colorectal tumor cells defines genetic signatures for individualized cancer care

PubMed Central

Kong, Say Li; Liu, Xingliang; Suhaimi, Nur-Afidah Mohamed; Koh, Kenneth Jia Hao; Hu, Min; Lee, Daniel Yoke San; Cima, Igor; Phyo, Wai Min; Lee, Esther Xing Wei; Tai, Joyce A.; Foong, Yu Miin; Vo, Jess Honganh; Koh, Poh Koon; Zhang, Tong; Ying, Jackie Y.; Lim, Bing; Tan, Min-Han; Hillmer, Axel M.

2017-01-01

Studies on circulating tumor cells (CTCs) have largely focused on platform development and CTC enumeration rather than on the genomic characterization of CTCs. To address this, we performed targeted sequencing of CTCs of colorectal cancer patients and compared the mutations with the matched primary tumors. We collected preoperative blood and matched primary tumor samples from 48 colorectal cancer patients. CTCs were isolated using a label-free microfiltration device on a silicon microsieve. Upon whole genome amplification, we performed amplicon-based targeted sequencing on a panel of 39 druggable and frequently mutated genes on both CTCs and fresh-frozen tumor samples. We developed an analysis pipeline to minimize false-positive detection of somatic mutations in amplified DNA. In 60% of the CTC-enriched blood samples, we detected primary tumor matching mutations. We found a significant positive correlation between the allele frequencies of somatic mutations detected in CTCs and abnormal CEA serum level. Strikingly, we found driver mutations and amplifications in cancer and druggable genes such as APC, KRAS, TP53, ERBB3, FBXW7 and ERBB2. In addition, we found that CTCs carried mutation signatures that resembled the signatures of their primary tumors. Cumulatively, our study defined genetic signatures and somatic mutation frequency of colorectal CTCs. The identification of druggable mutations in CTCs of preoperative colorectal cancer patients could lead to more timely and focused therapeutic interventions. PMID:28978093
Genomic signatures of selection at linked sites: unifying the disparity among species

PubMed Central

Cutter, Asher D.; Payseur, Bret A.

2014-01-01

Population genetics theory supplies powerful predictions about how natural selection interacts with genetic linkage to sculpt the genomic landscape of nucleotide polymorphism. Both the spread of beneficial mutations and removal of deleterious mutations act to depress polymorphism levels, especially in low-recombination regions. However, empiricists have documented extreme disparities among species. Here we characterize the dominant features that could drive variation in linked selection among species, including roles for selective sweeps being ‘hard’ or ‘soft’, and concealing by demography and genomic confounds. We advocate targeted studies of close relatives to unify our understanding of how selection and linkage interact to shape genome evolution. PMID:23478346

Improving the annotation of the Heterorhabditis bacteriophora genome.

PubMed

McLean, Florence; Berger, Duncan; Laetsch, Dominik R; Schwartz, Hillel T; Blaxter, Mark

2018-04-01

Genome assembly and annotation remain exacting tasks. As the tools available for these tasks improve, it is useful to return to data produced with earlier techniques to assess their credibility and correctness. The entomopathogenic nematode Heterorhabditis bacteriophora is widely used to control insect pests in horticulture. The genome sequence for this species was reported to encode an unusually high proportion of unique proteins and a paucity of secreted proteins compared to other related nematodes. We revisited the H. bacteriophora genome assembly and gene predictions to determine whether these unusual characteristics were biological or methodological in origin. We mapped an independent resequencing dataset to the genome and used the blobtools pipeline to identify potential contaminants. While present (0.2% of the genome span, 0.4% of predicted proteins), assembly contamination was not significant. Re-prediction of the gene set using BRAKER1 and published transcriptome data generated a predicted proteome that was very different from the published one. The new gene set had a much reduced complement of unique proteins, better completeness values that were in line with other related species' genomes, and an increased number of proteins predicted to be secreted. It is thus likely that methodological issues drove the apparent uniqueness of the initial H. bacteriophora genome annotation and that similar contamination and misannotation issues affect other published genome assemblies.
Genome analysis and signature discovery for diving and sensory properties of the endangered Chinese alligator

PubMed Central

Wan, Qiu-Hong; Pan, Sheng-Kai; Hu, Li; Zhu, Ying; Xu, Peng-Wei; Xia, Jin-Quan; Chen, Hui; He, Gen-Yun; He, Jing; Ni, Xiao-Wei; Hou, Hao-Long; Liao, Sheng-Guang; Yang, Hai-Qiong; Chen, Ying; Gao, Shu-Kun; Ge, Yun-Fa; Cao, Chang-Chang; Li, Peng-Fei; Fang, Li-Ming; Liao, Li; Zhang, Shu; Wang, Meng-Zhen; Dong, Wei; Fang, Sheng-Guo

2013-01-01

Crocodilians are diving reptiles that can hold their breath under water for long periods of time and are crepuscular animals with excellent sensory abilities. They comprise a sister lineage of birds and have no sex chromosome. Here we report the genome sequence of the endangered Chinese alligator (Alligator sinensis) and describe its unique features. The next-generation sequencing generated 314 Gb of raw sequence, yielding a genome size of 2.3 Gb. A total of 22 200 genes were predicted in Alligator sinensis using a de novo, homology- and RNA-based combined model. The genetic basis of long-diving behavior includes duplication of the bicarbonate-binding hemoglobin gene, co-functioning of routine phosphate-binding and special bicarbonate-binding oxygen transport, and positively selected energy metabolism, ammonium bicarbonate excretion and cardiac muscle contraction. Further, we elucidated the robust Alligator sinensis sensory system, including a significantly expanded olfactory receptor repertoire, rapidly evolving nerve-related cellular components and visual perception, and positive selection of the night vision-related opsin and sound detection-associated otopetrin. We also discovered a well-developed immune system with a considerable number of lineage-specific antigen-presentation genes for adaptive immunity as well as expansion of the tripartite motif-containing C-type lectin and butyrophilin genes for innate immunity and expression of antibacterial peptides. Multifluorescence in situ hybridization showed that alligator chromosome 3, which encodes DMRT1, exhibits significant synteny with chicken chromosome Z. Finally, population history analysis indicated population admixture 0.60-1.05 million years ago, when the Qinghai-Tibetan Plateau was uplifted. PMID:23917531
Early Evolution of Conserved Regulatory Sequences Associated with Development in Vertebrates

PubMed Central

McEwen, Gayle K.; Goode, Debbie K.; Parker, Hugo J.; Woolfe, Adam; Callaway, Heather; Elgar, Greg

2009-01-01

Comparisons between diverse vertebrate genomes have uncovered thousands of highly conserved non-coding sequences, an increasing number of which have been shown to function as enhancers during early development. Despite their extreme conservation over 500 million years from humans to cartilaginous fish, these elements appear to be largely absent in invertebrates, and, to date, there has been little understanding of their mode of action or the evolutionary processes that have modelled them. We have now exploited emerging genomic sequence data for the sea lamprey, Petromyzon marinus, to explore the depth of conservation of this type of element in the earliest diverging extant vertebrate lineage, the jawless fish (agnathans). We searched for conserved non-coding elements (CNEs) at 13 human gene loci and identified lamprey elements associated with all but two of these gene regions. Although markedly shorter and less well conserved than within jawed vertebrates, identified lamprey CNEs are able to drive specific patterns of expression in zebrafish embryos, which are almost identical to those driven by the equivalent human elements. These CNEs are therefore a unique and defining characteristic of all vertebrates. Furthermore, alignment of lamprey and other vertebrate CNEs should permit the identification of persistent sequence signatures that are responsible for common patterns of expression and contribute to the elucidation of the regulatory language in CNEs. Identifying the core regulatory code for development, common to all vertebrates, provides a foundation upon which regulatory networks can be constructed and might also illuminate how large conserved regulatory sequence blocks evolve and become fixed in genomic DNA. PMID:20011110
Conserved DNA methylation patterns in healthy blood cells and extensive changes in leukemia measured by a new quantitative technique

PubMed Central

Jelinek, Jaroslav; Liang, Shoudan; Lu, Yue; He, Rong; Ramagli, Louis S.; Shpall, Elizabeth J.; Estecio, Marcos R.H.; Issa, Jean-Pierre J.

2012-01-01

Genome wide analysis of DNA methylation provides important information in a variety of diseases, including cancer. Here, we describe a simple method, Digital Restriction Enzyme Analysis of Methylation (DREAM), based on next generation sequencing analysis of methylation-specific signatures created by sequential digestion of genomic DNA with SmaI and XmaI enzymes. DREAM provides information on 150,000 unique CpG sites, of which 39,000 are in CpG islands and 30,000 are at transcription start sites of 13,000 RefSeq genes. We analyzed DNA methylation in healthy white blood cells and found methylation patterns to be remarkably uniform. Inter individual differences > 30% were observed only at 227 of 28,331 (0.8%) of autosomal CpG sites. Similarly, > 30% differences were observed at only 59 sites when we comparing the cord and adult blood. These conserved methylation patterns contrasted with extensive changes affecting 18–40% of CpG sites in a patient with acute myeloid leukemia and in two leukemia cell lines. The method is cost effective, quantitative (r2 = 0.93 when compared with bisulfite pyrosequencing) and reproducible (r2 = 0.997). Using 100-fold coverage, DREAM can detect differences in methylation greater than 10% or 30% with a false positive rate below 0.05 or 0.001, respectively. DREAM can be useful in quantifying epigenetic effects of environment and nutrition, correlating developmental epigenetic variation with phenotypes, understanding epigenetics of cancer and chronic diseases, measuring the effects of drugs on DNA methylation or deriving new biological insights into mammalian genomes. PMID:23075513
Systematic discovery and characterization of fly microRNAs using 12 Drosophila genomes

PubMed Central

Stark, Alexander; Kheradpour, Pouya; Parts, Leopold; Brennecke, Julius; Hodges, Emily; Hannon, Gregory J.; Kellis, Manolis

2007-01-01

MicroRNAs (miRNAs) are short regulatory RNAs that inhibit target genes by complementary binding in 3′ untranslated regions (3′ UTRs). They are one of the most abundant classes of regulators, targeting a large fraction of all genes, making their comprehensive study a requirement for understanding regulation and development. Here we use 12 Drosophila genomes to define structural and evolutionary signatures of miRNA hairpins, which we use for their de novo discovery. We predict >41 novel miRNA genes, which encompass many unique families, and 28 of which are validated experimentally. We also define signals for the precise start position of mature miRNAs, which suggest corrections of previously known miRNAs, often leading to drastic changes in their predicted target spectrum. We show that miRNA discovery power scales with the number and divergence of species compared, suggesting that such approaches can be successful in human as dozens of mammalian genomes become available. Interestingly, for some miRNAs sense and anti-sense hairpins score highly and mature miRNAs from both strands can indeed be found in vivo. Similarly, miRNAs with weak 5′ end predictions show increased in vivo processing of multiple alternate 5′ ends and have fewer predicted targets. Lastly, we show that several miRNA star sequences score highly and are likely functional. For mir-10 in particular, both arms show abundant processing, and both show highly conserved target sites in Hox genes, suggesting a possible cooperation of the two arms, and their role as a master Hox regulator. PMID:17989255
Genome analysis and signature discovery for diving and sensory properties of the endangered Chinese alligator.

PubMed

Wan, Qiu-Hong; Pan, Sheng-Kai; Hu, Li; Zhu, Ying; Xu, Peng-Wei; Xia, Jin-Quan; Chen, Hui; He, Gen-Yun; He, Jing; Ni, Xiao-Wei; Hou, Hao-Long; Liao, Sheng-Guang; Yang, Hai-Qiong; Chen, Ying; Gao, Shu-Kun; Ge, Yun-Fa; Cao, Chang-Chang; Li, Peng-Fei; Fang, Li-Ming; Liao, Li; Zhang, Shu; Wang, Meng-Zhen; Dong, Wei; Fang, Sheng-Guo

2013-09-01

Crocodilians are diving reptiles that can hold their breath under water for long periods of time and are crepuscular animals with excellent sensory abilities. They comprise a sister lineage of birds and have no sex chromosome. Here we report the genome sequence of the endangered Chinese alligator (Alligator sinensis) and describe its unique features. The next-generation sequencing generated 314 Gb of raw sequence, yielding a genome size of 2.3 Gb. A total of 22 200 genes were predicted in Alligator sinensis using a de novo, homology- and RNA-based combined model. The genetic basis of long-diving behavior includes duplication of the bicarbonate-binding hemoglobin gene, co-functioning of routine phosphate-binding and special bicarbonate-binding oxygen transport, and positively selected energy metabolism, ammonium bicarbonate excretion and cardiac muscle contraction. Further, we elucidated the robust Alligator sinensis sensory system, including a significantly expanded olfactory receptor repertoire, rapidly evolving nerve-related cellular components and visual perception, and positive selection of the night vision-related opsin and sound detection-associated otopetrin. We also discovered a well-developed immune system with a considerable number of lineage-specific antigen-presentation genes for adaptive immunity as well as expansion of the tripartite motif-containing C-type lectin and butyrophilin genes for innate immunity and expression of antibacterial peptides. Multifluorescence in situ hybridization showed that alligator chromosome 3, which encodes DMRT1, exhibits significant synteny with chicken chromosome Z. Finally, population history analysis indicated population admixture 0.60-1.05 million years ago, when the Qinghai-Tibetan Plateau was uplifted.
RNA-directed DNA methylation involves co-transcriptional small-RNA-guided slicing of polymerase V transcripts in Arabidopsis.

PubMed

Liu, Wanlu; Duttke, Sascha H; Hetzel, Jonathan; Groth, Martin; Feng, Suhua; Gallego-Bartolome, Javier; Zhong, Zhenhui; Kuo, Hsuan Yu; Wang, Zonghua; Zhai, Jixian; Chory, Joanne; Jacobsen, Steven E

2018-03-01

Small RNAs regulate chromatin modifications such as DNA methylation and gene silencing across eukaryotic genomes. In plants, RNA-directed DNA methylation (RdDM) requires 24-nucleotide small interfering RNAs (siRNAs) that bind to ARGONAUTE 4 (AGO4) and target genomic regions for silencing. RdDM also requires non-coding RNAs transcribed by RNA polymerase V (Pol V) that probably serve as scaffolds for binding of AGO4-siRNA complexes. Here, we used a modified global nuclear run-on protocol followed by deep sequencing to capture Pol V nascent transcripts genome-wide. We uncovered unique characteristics of Pol V RNAs, including a uracil (U) common at position 10. This uracil was complementary to the 5' adenine found in many AGO4-bound 24-nucleotide siRNAs and was eliminated in a siRNA-deficient mutant as well as in the ago4/6/9 triple mutant, suggesting that the +10 U signature is due to siRNA-mediated co-transcriptional slicing of Pol V transcripts. Expression of wild-type AGO4 in ago4/6/9 mutants was able to restore slicing of Pol V transcripts, but a catalytically inactive AGO4 mutant did not correct the slicing defect. We also found that Pol V transcript slicing required SUPPRESSOR OF TY INSERTION 5-LIKE (SPT5L), an elongation factor whose function is not well understood. These results highlight the importance of Pol V transcript slicing in RNA-mediated transcriptional gene silencing, which is a conserved process in many eukaryotes.
High Resolution Genomic Scans Reveal Genetic Architecture Controlling Alcohol Preference in Bidirectionally Selected Rat Model.

PubMed

Lo, Chiao-Ling; Lossie, Amy C; Liang, Tiebing; Liu, Yunlong; Xuei, Xiaoling; Lumeng, Lawrence; Zhou, Feng C; Muir, William M

2016-08-01

Investigations on the influence of nature vs. nurture on Alcoholism (Alcohol Use Disorder) in human have yet to provide a clear view on potential genomic etiologies. To address this issue, we sequenced a replicated animal model system bidirectionally-selected for alcohol preference (AP). This model is uniquely suited to map genetic effects with high reproducibility, and resolution. The origin of the rat lines (an 8-way cross) resulted in small haplotype blocks (HB) with a corresponding high level of resolution. We sequenced DNAs from 40 samples (10 per line of each replicate) to determine allele frequencies and HB. We achieved ~46X coverage per line and replicate. Excessive differentiation in the genomic architecture between lines, across replicates, termed signatures of selection (SS), were classified according to gene and region. We identified SS in 930 genes associated with AP. The majority (50%) of the SS were confined to single gene regions, the greatest numbers of which were in promoters (284) and intronic regions (169) with the least in exon's (4), suggesting that differences in AP were primarily due to alterations in regulatory regions. We confirmed previously identified genes and found many new genes associated with AP. Of those newly identified genes, several demonstrated neuronal function involved in synaptic memory and reward behavior, e.g. ion channels (Kcnf1, Kcnn3, Scn5a), excitatory receptors (Grin2a, Gria3, Grip1), neurotransmitters (Pomc), and synapses (Snap29). This study not only reveals the polygenic architecture of AP, but also emphasizes the importance of regulatory elements, consistent with other complex traits.
Raman spectral signatures as conformational probes of gas phase flexible molecules

NASA Astrophysics Data System (ADS)

Golan, Amir; Mayorkas, Nitzan; Rosenwaks, Salman; Bar, Ilana

2009-07-01

A novel application of ionization-loss stimulated Raman spectroscopy (ILSRS) for monitoring the spectral features of four conformers of a gas phase flexible molecule is reported. The Raman spectral signatures of four conformers of 2-phenylethylamine are well matched by the results of density functional theory calculations, showing bands uniquely identifying the structures. The measurement of spectral signatures by ILSRS in an extended spectral range, with a conventional laser source, is instrumental in facilitating the unraveling of intra- and intermolecular interactions that are significant in biological structure and activity.
Detection of Ionospheric Alfven Resonator Signatures in the Equatorial Ionosphere

NASA Technical Reports Server (NTRS)

Simoes, Fernando; Klenzing, Jeffrey; Ivanov, Stoyan; Pfaff, Robert; Freudenreich, Henry; Bilitza, Dieter; Rowland, Douglas; Bromund, Kenneth; Liebrecht, Maria Carmen; Martin, Steven;

2012-01-01

The ionosphere response resulting from minimum solar activity during cycle 23/24 was unusual and offered unique opportunities for investigating space weather in the near-Earth environment. We report ultra low frequency electric field signatures related to the ionospheric Alfven resonator detected by the Communications/Navigation Outage Forecasting System (C/NOFS) satellite in the equatorial region. These signatures are used to constrain ionospheric empirical models and offer a new approach for monitoring ionosphere dynamics and space weather phenomena, namely aeronomy processes, Alfven wave propagation, and troposphere24 ionosphere-magnetosphere coupling mechanisms.

The microbiota and microbiome in aging: potential implications in health and age-related diseases.

PubMed

Zapata, Heidi J; Quagliarello, Vincent J

2015-04-01

Advances in bacterial deoxyribonucleic acid sequencing allow for characterization of the human commensal bacterial community (microbiota) and its corresponding genome (microbiome). Surveys of healthy adults reveal that a signature composite of bacteria characterizes each unique body habitat (e.g., gut, skin, oral cavity, vagina). A myriad of clinical changes, including a basal proinflammatory state (inflamm-aging), that directly interface with the microbiota of older adults and enhance susceptibility to disease accompany aging. Studies in older adults demonstrate that the gut microbiota correlates with diet, location of residence (e.g., community dwelling, long-term care settings), and basal level of inflammation. Links exist between the microbiota and a variety of clinical problems plaguing older adults, including physical frailty, Clostridium difficile colitis, vulvovaginal atrophy, colorectal carcinoma, and atherosclerotic disease. Manipulation of the microbiota and microbiome of older adults holds promise as an innovative strategy to influence the development of comorbidities associated with aging. © 2015, Copyright the Authors Journal compilation © 2015, The American Geriatrics Society.
Transcriptome analysis of intraspecific competition in Arabidopsis thaliana reveals organ-specific signatures related to nutrient acquisition and general stress response pathways

PubMed Central

2012-01-01

Background Plants are sessile and therefore have to perceive and adjust to changes in their environment. The presence of neighbours leads to a competitive situation where resources and space will be limited. Complex adaptive responses to such situation are poorly understood at the molecular level. Results Using microarrays, we analysed whole-genome expression changes in Arabidopsis thaliana plants subjected to intraspecific competition. The leaf and root transcriptome was strongly altered by competition. Differentially expressed genes were enriched in genes involved in nutrient deficiency (mainly N, P, K), perception of light quality, and responses to abiotic and biotic stresses. Interestingly, performance of the generalist insect Spodoptera littoralis on densely grown plants was significantly reduced, suggesting that plants under competition display enhanced resistance to herbivory. Conclusions This study provides a comprehensive list of genes whose expression is affected by intraspecific competition in Arabidopsis. The outcome is a unique response that involves genes related to light, nutrient deficiency, abiotic stress, and defence responses. PMID:23194435
Discovery of metabolic signatures for predicting whole organism toxicology.

PubMed

Hines, Adam; Staff, Fred J; Widdows, John; Compton, Russell M; Falciani, Francesco; Viant, Mark R

2010-06-01

Toxicological studies in sentinel organisms frequently use biomarkers to assess biological effect. Development of "omic" technologies has enhanced biomarker discovery at the molecular level, providing signatures unique to toxicant mode-of-action (MOA). However, these signatures often lack relevance to organismal responses, such as growth or reproduction, limiting their value for environmental monitoring. Our primary objective was to discover metabolic signatures in chemically exposed organisms that can predict physiological toxicity. Marine mussels (Mytilus edulis) were exposed for 7 days to 12 and 50 microg/l copper and 50 and 350 microg/l pentachlorophenol (PCP), toxicants with unique MOAs. Physiological responses comprised an established measure of organism energetic fitness, scope for growth (SFG). Metabolic fingerprints were measured in the same individuals using nuclear magnetic resonance-based metabolomics. Metabolic signatures predictive of SFG were sought using optimal variable selection strategies and multivariate regression and then tested upon independently field-sampled mussels from rural and industrialized sites. Copper and PCP induced rational metabolic and physiological changes. Measured and predicted SFG were highly correlated for copper (r(2) = 0.55, P = 2.82 x 10(-7)) and PCP (r(2) = 0.66, P = 3.20 x 10(-6)). Predictive metabolites included methionine and arginine/phosphoarginine for copper and allantoin, valine, and methionine for PCP. When tested on field-sampled animals, metabolic signatures predicted considerably reduced fitness of mussels from the contaminated (SFG = 6.0 J/h/g) versus rural (SFG = 15.2 J/h/g) site. We report the first successful discovery of metabolic signatures in chemically exposed environmental organisms that inform on molecular MOA and that can predict physiological toxicity. This could have far-reaching implications for monitoring impacts on environmental health.
The Human Airway Epithelial Basal Cell Transcriptome

PubMed Central

Wang, Rui; Zwick, Rachel K.; Ferris, Barbara; Witover, Bradley; Salit, Jacqueline; Crystal, Ronald G.

2011-01-01

Background The human airway epithelium consists of 4 major cell types: ciliated, secretory, columnar and basal cells. During natural turnover and in response to injury, the airway basal cells function as stem/progenitor cells for the other airway cell types. The objective of this study is to better understand human airway epithelial basal cell biology by defining the gene expression signature of this cell population. Methodology/Principal Findings Bronchial brushing was used to obtain airway epithelium from healthy nonsmokers. Microarrays were used to assess the transcriptome of basal cells purified from the airway epithelium in comparison to the transcriptome of the differentiated airway epithelium. This analysis identified the “human airway basal cell signature” as 1,161 unique genes with >5-fold higher expression level in basal cells compared to differentiated epithelium. The basal cell signature was suppressed when the basal cells differentiated into a ciliated airway epithelium in vitro. The basal cell signature displayed overlap with genes expressed in basal-like cells from other human tissues and with that of murine airway basal cells. Consistent with self-modulation as well as signaling to other airway cell types, the human airway basal cell signature was characterized by genes encoding extracellular matrix components, growth factors and growth factor receptors, including genes related to the EGF and VEGF pathways. Interestingly, while the basal cell signature overlaps that of basal-like cells of other organs, the human airway basal cell signature has features not previously associated with this cell type, including a unique pattern of genes encoding extracellular matrix components, G protein-coupled receptors, neuroactive ligands and receptors, and ion channels. Conclusion/Significance The human airway epithelial basal cell signature identified in the present study provides novel insights into the molecular phenotype and biology of the stem/progenitor cells of the human airway epithelium. PMID:21572528
Genomic taxonomy of vibrios

PubMed Central

Thompson, Cristiane C; Vicente, Ana Carolina P; Souza, Rangel C; Vasconcelos, Ana Tereza R; Vesth, Tammi; Alves, Nelson; Ussery, David W; Iida, Tetsuya; Thompson, Fabiano L

2009-01-01

Background Vibrio taxonomy has been based on a polyphasic approach. In this study, we retrieve useful taxonomic information (i.e. data that can be used to distinguish different taxonomic levels, such as species and genera) from 32 genome sequences of different vibrio species. We use a variety of tools to explore the taxonomic relationship between the sequenced genomes, including Multilocus Sequence Analysis (MLSA), supertrees, Average Amino Acid Identity (AAI), genomic signatures, and Genome BLAST atlases. Our aim is to analyse the usefulness of these tools for species identification in vibrios. Results We have generated four new genome sequences of three Vibrio species, i.e., V. alginolyticus 40B, V. harveyi-like 1DA3, and V. mimicus strains VM573 and VM603, and present a broad analyses of these genomes along with other sequenced Vibrio species. The genome atlas and pangenome plots provide a tantalizing image of the genomic differences that occur between closely related sister species, e.g. V. cholerae and V. mimicus. The vibrio pangenome contains around 26504 genes. The V. cholerae core genome and pangenome consist of 1520 and 6923 genes, respectively. Pangenomes might allow different strains of V. cholerae to occupy different niches. MLSA and supertree analyses resulted in a similar phylogenetic picture, with a clear distinction of four groups (Vibrio core group, V. cholerae-V. mimicus, Aliivibrio spp., and Photobacterium spp.). A Vibrio species is defined as a group of strains that share > 95% DNA identity in MLSA and supertree analysis, > 96% AAI, ≤ 10 genome signature dissimilarity, and > 61% proteome identity. Strains of the same species and species of the same genus will form monophyletic groups on the basis of MLSA and supertree. Conclusion The combination of different analytical and bioinformatics tools will enable the most accurate species identification through genomic computational analysis. This endeavour will culminate in the birth of the online genomic taxonomy whereby researchers and end-users of taxonomy will be able to identify their isolates through a web-based server. This novel approach to microbial systematics will result in a tremendous advance concerning biodiversity discovery, description, and understanding. PMID:19860885
Exploring signatures of positive selection in pigmentation candidate genes in populations of East Asian ancestry

PubMed Central

2013-01-01

Background Currently, there is very limited knowledge about the genes involved in normal pigmentation variation in East Asian populations. We carried out a genome-wide scan of signatures of positive selection using the 1000 Genomes Phase I dataset, in order to identify pigmentation genes showing putative signatures of selective sweeps in East Asia. We applied a broad range of methods to detect signatures of selection including: 1) Tests designed to identify deviations of the Site Frequency Spectrum (SFS) from neutral expectations (Tajima’s D, Fay and Wu’s H and Fu and Li’s D* and F*), 2) Tests focused on the identification of high-frequency haplotypes with extended linkage disequilibrium (iHS and Rsb) and 3) Tests based on genetic differentiation between populations (LSBL). Based on the results obtained from a genome wide analysis of 25 kb windows, we constructed an empirical distribution for each statistic across all windows, and identified pigmentation genes that are outliers in the distribution. Results Our tests identified twenty genes that are relevant for pigmentation biology. Of these, eight genes (ATRN, EDAR, KLHL7, MITF, OCA2, TH, TMEM33 and TRPM1,) were extreme outliers (top 0.1% of the empirical distribution) for at least one statistic, and twelve genes (ADAM17, BNC2, CTSD, DCT, EGFR, LYST, MC1R, MLPH, OPRM1, PDIA6, PMEL (SILV) and TYRP1) were in the top 1% of the empirical distribution for at least one statistic. Additionally, eight of these genes (BNC2, EGFR, LYST, MC1R, OCA2, OPRM1, PMEL (SILV) and TYRP1) have been associated with pigmentary traits in association studies. Conclusions We identified a number of putative pigmentation genes showing extremely unusual patterns of genetic variation in East Asia. Most of these genes are outliers for different tests and/or different populations, and have already been described in previous scans for positive selection, providing strong support to the hypothesis that recent selective sweeps left a signature in these regions. However, it will be necessary to carry out association and functional studies to demonstrate the implication of these genes in normal pigmentation variation. PMID:23848512
Exploring signatures of positive selection in pigmentation candidate genes in populations of East Asian ancestry.

PubMed

Hider, Jessica L; Gittelman, Rachel M; Shah, Tapan; Edwards, Melissa; Rosenbloom, Arnold; Akey, Joshua M; Parra, Esteban J

2013-07-12

Currently, there is very limited knowledge about the genes involved in normal pigmentation variation in East Asian populations. We carried out a genome-wide scan of signatures of positive selection using the 1000 Genomes Phase I dataset, in order to identify pigmentation genes showing putative signatures of selective sweeps in East Asia. We applied a broad range of methods to detect signatures of selection including: 1) Tests designed to identify deviations of the Site Frequency Spectrum (SFS) from neutral expectations (Tajima's D, Fay and Wu's H and Fu and Li's D* and F*), 2) Tests focused on the identification of high-frequency haplotypes with extended linkage disequilibrium (iHS and Rsb) and 3) Tests based on genetic differentiation between populations (LSBL). Based on the results obtained from a genome wide analysis of 25 kb windows, we constructed an empirical distribution for each statistic across all windows, and identified pigmentation genes that are outliers in the distribution. Our tests identified twenty genes that are relevant for pigmentation biology. Of these, eight genes (ATRN, EDAR, KLHL7, MITF, OCA2, TH, TMEM33 and TRPM1,) were extreme outliers (top 0.1% of the empirical distribution) for at least one statistic, and twelve genes (ADAM17, BNC2, CTSD, DCT, EGFR, LYST, MC1R, MLPH, OPRM1, PDIA6, PMEL (SILV) and TYRP1) were in the top 1% of the empirical distribution for at least one statistic. Additionally, eight of these genes (BNC2, EGFR, LYST, MC1R, OCA2, OPRM1, PMEL (SILV) and TYRP1) have been associated with pigmentary traits in association studies. We identified a number of putative pigmentation genes showing extremely unusual patterns of genetic variation in East Asia. Most of these genes are outliers for different tests and/or different populations, and have already been described in previous scans for positive selection, providing strong support to the hypothesis that recent selective sweeps left a signature in these regions. However, it will be necessary to carry out association and functional studies to demonstrate the implication of these genes in normal pigmentation variation.
Detecting signatures of positive selection associated with musical aptitude in the human genome

PubMed Central

Liu, Xuanyao; Kanduri, Chakravarthi; Oikkonen, Jaana; Karma, Kai; Raijas, Pirre; Ukkola-Vuoti, Liisa; Teo, Yik-Ying; Järvelä, Irma

2016-01-01

Abilities related to musical aptitude appear to have a long history in human evolution. To elucidate the molecular and evolutionary background of musical aptitude, we compared genome-wide genotyping data (641 K SNPs) of 148 Finnish individuals characterized for musical aptitude. We assigned signatures of positive selection in a case-control setting using three selection methods: haploPS, XP-EHH and FST. Gene ontology classification revealed that the positive selection regions contained genes affecting inner-ear development. Additionally, literature survey has shown that several of the identified genes were known to be involved in auditory perception (e.g. GPR98, USH2A), cognition and memory (e.g. GRIN2B, IL1A, IL1B, RAPGEF5), reward mechanisms (RGS9), and song perception and production of songbirds (e.g. FOXP1, RGS9, GPR98, GRIN2B). Interestingly, genes related to inner-ear development and cognition were also detected in a previous genome-wide association study of musical aptitude. However, the candidate genes detected in this study were not reported earlier in studies of musical abilities. Identification of genes related to language development (FOXP1 and VLDLR) support the popular hypothesis that music and language share a common genetic and evolutionary background. The findings are consistent with the evolutionary conservation of genes related to auditory processes in other species and provide first empirical evidence for signatures of positive selection for abilities that contribute to musical aptitude. PMID:26879527
Genome-wide methylation and gene expression changes in newborn rats following maternal protein restriction and reversal by folic acid.

PubMed

Altobelli, Gioia; Bogdarina, Irina G; Stupka, Elia; Clark, Adrian J L; Langley-Evans, Simon

2013-01-01

A large body of evidence from human and animal studies demonstrates that the maternal diet during pregnancy can programme physiological and metabolic functions in the developing fetus, effectively determining susceptibility to later disease. The mechanistic basis of such programming is unclear but may involve resetting of epigenetic marks and fetal gene expression. The aim of this study was to evaluate genome-wide DNA methylation and gene expression in the livers of newborn rats exposed to maternal protein restriction. On day one postnatally, there were 618 differentially expressed genes and 1183 differentially methylated regions (FDR 5%). The functional analysis of differentially expressed genes indicated a significant effect on DNA repair/cycle/maintenance functions and of lipid, amino acid metabolism and circadian functions. Enrichment for known biological functions was found to be associated with differentially methylated regions. Moreover, these epigenetically altered regions overlapped genetic loci associated with metabolic and cardiovascular diseases. Both expression changes and DNA methylation changes were largely reversed by supplementing the protein restricted diet with folic acid. Although the epigenetic and gene expression signatures appeared to underpin largely different biological processes, the gene expression profile of DNA methyl transferases was altered, providing a potential link between the two molecular signatures. The data showed that maternal protein restriction is associated with widespread differential gene expression and DNA methylation across the genome, and that folic acid is able to reset both molecular signatures.
Inferring Properties of Ancient Cyanobacteria from Biogeochemical Activity and Genomes of Siderophilic Cyanobacteria

NASA Technical Reports Server (NTRS)

McKay, David S.; Brown, I. I.; Tringe, S. G.; Thomas-Keprta, K. E.; Bryant, D. A.; Sarkisova, S. S.; Malley, K.; Sosa, O.; Klatt, C. G.; McKay, D. S.

2010-01-01

Interrelationships between life and the planetary system could have simultaneously left landmarks in genomes of microbes and physicochemical signatures in the lithosphere. Verifying the links between genomic features in living organisms and the mineralized signatures generated by these organisms will help to reveal traces of life on Earth and beyond. Among contemporary environments, iron-depositing hot springs (IDHS) may represent one of the most appropriate natural models [1] for insights into ancient life since organisms may have originated on Earth and probably Mars in association with hydrothermal activity [2,3]. IDHS also seem to be appropriate models for studying certain biogeochemical processes that could have taken place in the late Archean and,-or early Paleoproterozoic eras [4, 5]. It has been suggested that inorganic polyphosphate (PPi), in chains of tens to hundreds of phosphate residues linked by high-energy bonds, is environmentally ubiquitous and abundant [6]. Cyanobacteria (CB) react to increased heavy metal concentrations and UV by enhanced generation of PPi bodies (PPB) [7], which are believed to be signatures of life [8]. However, the role of PPi in oxygenic prokaryotes for the suppression of oxidative stress induced by high Fe is poorly studied. Here we present preliminary results of a new mechanism of Fe mineralization in oxygenic prokaryotes, the effect of Fe on the generation of PPi bodies in CB, as well as preliminary analysis of the diversity and phylogeny of proteins involved in the prevention of oxidative stress in phototrophs inhabiting IDHS.

Detecting signatures of positive selection associated with musical aptitude in the human genome.

PubMed

Liu, Xuanyao; Kanduri, Chakravarthi; Oikkonen, Jaana; Karma, Kai; Raijas, Pirre; Ukkola-Vuoti, Liisa; Teo, Yik-Ying; Järvelä, Irma

2016-02-16

Abilities related to musical aptitude appear to have a long history in human evolution. To elucidate the molecular and evolutionary background of musical aptitude, we compared genome-wide genotyping data (641 K SNPs) of 148 Finnish individuals characterized for musical aptitude. We assigned signatures of positive selection in a case-control setting using three selection methods: haploPS, XP-EHH and FST. Gene ontology classification revealed that the positive selection regions contained genes affecting inner-ear development. Additionally, literature survey has shown that several of the identified genes were known to be involved in auditory perception (e.g. GPR98, USH2A), cognition and memory (e.g. GRIN2B, IL1A, IL1B, RAPGEF5), reward mechanisms (RGS9), and song perception and production of songbirds (e.g. FOXP1, RGS9, GPR98, GRIN2B). Interestingly, genes related to inner-ear development and cognition were also detected in a previous genome-wide association study of musical aptitude. However, the candidate genes detected in this study were not reported earlier in studies of musical abilities. Identification of genes related to language development (FOXP1 and VLDLR) support the popular hypothesis that music and language share a common genetic and evolutionary background. The findings are consistent with the evolutionary conservation of genes related to auditory processes in other species and provide first empirical evidence for signatures of positive selection for abilities that contribute to musical aptitude.
The impact of age, biogenesis, and genomic clustering on Drosophila microRNA evolution

PubMed Central

Mohammed, Jaaved; Flynt, Alex S.; Siepel, Adam; Lai, Eric C.

2013-01-01

The molecular evolutionary signatures of miRNAs inform our understanding of their emergence, biogenesis, and function. The known signatures of miRNA evolution have derived mostly from the analysis of deeply conserved, canonical loci. In this study, we examine the impact of age, biogenesis pathway, and genomic arrangement on the evolutionary properties of Drosophila miRNAs. Crucial to the accuracy of our results was our curation of high-quality miRNA alignments, which included nearly 150 corrections to ortholog calls and nucleotide sequences of the global 12-way Drosophilid alignments currently available. Using these data, we studied primary sequence conservation, normalized free-energy values, and types of structure-preserving substitutions. We expand upon common miRNA evolutionary patterns that reflect fundamental features of miRNAs that are under functional selection. We observe that melanogaster-subgroup-specific miRNAs, although recently emerged and rapidly evolving, nonetheless exhibit evolutionary signatures that are similar to well-conserved miRNAs and distinct from other structured noncoding RNAs and bulk conserved non-miRNA hairpins. This provides evidence that even young miRNAs may be selected for regulatory activities. More strikingly, we observe that mirtrons and clustered miRNAs both exhibit distinct evolutionary properties relative to solo, well-conserved miRNAs, even after controlling for sequence depth. These studies highlight the previously unappreciated impact of biogenesis strategy and genomic location on the evolutionary dynamics of miRNAs, and affirm that miRNAs do not evolve as a unitary class. PMID:23882112
Range Expansion and the Origin of USA300 North American Epidemic Methicillin-Resistant Staphylococcus aureus

PubMed Central

Challagundla, Lavanya; Luo, Xiao; Tickler, Isabella A.; Coombs, Geoffrey W.; Sordelli, Daniel O.; Brown, Eric L.; Skov, Robert; Larsen, Anders Rhod; Reyes, Jinnethe; Robledo, Iraida E.; Vazquez, Guillermo J.; Rivera, Raul; Fey, Paul D.; Stevenson, Kurt; Wang, Shu-Hua; Kreiswirth, Barry N.; Mediavilla, Jose R.; Arias, Cesar A.; Planet, Paul J.; Nolan, Rathel L.; Tenover, Fred C.; Goering, Richard V.

2018-01-01

ABSTRACT The USA300 North American epidemic (USA300-NAE) clone of methicillin-resistant Staphylococcus aureus has caused a wave of severe skin and soft tissue infections in the United States since it emerged in the early 2000s, but its geographic origin is obscure. Here we use the population genomic signatures expected from the serial founder effects of a geographic range expansion to infer the origin of USA300-NAE and identify polymorphisms associated with its spread. Genome sequences from 357 isolates from 22 U.S. states and territories and seven other countries are compared. We observe two significant signatures of range expansion, including decreases in genetic diversity and increases in derived allele frequency with geographic distance from the Pennsylvania region. These signatures account for approximately half of the core nucleotide variation of this clone, occur genome wide, and are robust to heterogeneity in temporal sampling of isolates, human population density, and recombination detection methods. The potential for positive selection of a gyrA fluoroquinolone resistance allele and several intergenic regions, along with a 2.4 times higher recombination rate in a resistant subclade, is noted. These results are the first to show a pattern of genetic variation that is consistent with a range expansion of an epidemic bacterial clone, and they highlight a rarely considered but potentially common mechanism by which genetic drift may profoundly influence bacterial genetic variation. PMID:29295910
Secure Genomic Computation through Site-Wise Encryption

PubMed Central

Zhao, Yongan; Wang, XiaoFeng; Tang, Haixu

2015-01-01

Commercial clouds provide on-demand IT services for big-data analysis, which have become an attractive option for users who have no access to comparable infrastructure. However, utilizing these services for human genome analysis is highly risky, as human genomic data contains identifiable information of human individuals and their disease susceptibility. Therefore, currently, no computation on personal human genomic data is conducted on public clouds. To address this issue, here we present a site-wise encryption approach to encrypt whole human genome sequences, which can be subject to secure searching of genomic signatures on public clouds. We implemented this method within the Hadoop framework, and tested it on the case of searching disease markers retrieved from the ClinVar database against patients’ genomic sequences. The secure search runs only one order of magnitude slower than the simple search without encryption, indicating our method is ready to be used for secure genomic computation on public clouds. PMID:26306278
Secure Genomic Computation through Site-Wise Encryption.

PubMed

Zhao, Yongan; Wang, XiaoFeng; Tang, Haixu

2015-01-01

Commercial clouds provide on-demand IT services for big-data analysis, which have become an attractive option for users who have no access to comparable infrastructure. However, utilizing these services for human genome analysis is highly risky, as human genomic data contains identifiable information of human individuals and their disease susceptibility. Therefore, currently, no computation on personal human genomic data is conducted on public clouds. To address this issue, here we present a site-wise encryption approach to encrypt whole human genome sequences, which can be subject to secure searching of genomic signatures on public clouds. We implemented this method within the Hadoop framework, and tested it on the case of searching disease markers retrieved from the ClinVar database against patients' genomic sequences. The secure search runs only one order of magnitude slower than the simple search without encryption, indicating our method is ready to be used for secure genomic computation on public clouds.
No genome-wide protein sequence convergence for echolocation.

PubMed

Zou, Zhengting; Zhang, Jianzhi

2015-05-01

Toothed whales and two groups of bats independently acquired echolocation, the ability to locate and identify objects by reflected sound. Echolocation requires physiologically complex and coordinated vocal, auditory, and neural functions, but the molecular basis of the capacity for echolocation is not well understood. A recent study suggested that convergent amino acid substitutions widespread in the proteins of echolocators underlay the convergent origins of mammalian echolocation. Here, we show that genomic signatures of molecular convergence between echolocating lineages are generally no stronger than those between echolocating and comparable nonecholocating lineages. The same is true for the group of 29 hearing-related proteins claimed to be enriched with molecular convergence. Reexamining the previous selection test reveals several flaws and invalidates the asserted evidence for adaptive convergence. Together, these findings indicate that the reported genomic signatures of convergence largely reflect the background level of sequence convergence unrelated to the origins of echolocation. © The Author 2015. Published by Oxford University Press on behalf of the Society for Molecular Biology and Evolution. All rights reserved. For permissions, please e-mail: journals.permissions@oup.com.
Linkage disequilibrium and signatures of positive selection around LINE-1 retrotransposons in the human genome.

PubMed

Kuhn, Alexandre; Ong, Yao Min; Cheng, Ching-Yu; Wong, Tien Yin; Quake, Stephen R; Burkholder, William F

2014-06-03

Insertions of the human-specific subfamily of LINE-1 (L1) retrotransposon are highly polymorphic across individuals and can critically influence the human transcriptome. We hypothesized that L1 insertions could represent genetic variants determining important human phenotypic traits, and performed an integrated analysis of L1 elements and single nucleotide polymorphisms (SNPs) in several human populations. We found that a large fraction of L1s were in high linkage disequilibrium with their surrounding genomic regions and that they were well tagged by SNPs. However, L1 variants were only partially captured by SNPs on standard SNP arrays, so that their potential phenotypic impact would be frequently missed by SNP array-based genome-wide association studies. We next identified potential phenotypic effects of L1s by looking for signatures of natural selection linked to L1 insertions; significant extended haplotype homozygosity was detected around several L1 insertions. This finding suggests that some of these L1 insertions may have been the target of recent positive selection.
Genes Required for Free Phage Production are Essential for Pseudomonas aeruginosa Chronic Lung Infections.

PubMed

Lemieux, Andrée-Ann; Jeukens, Julie; Kukavica-Ibrulj, Irena; Fothergill, Joanne L; Boyle, Brian; Laroche, Jérôme; Tucker, Nicholas P; Winstanley, Craig; Levesque, Roger C

2016-02-01

The opportunistic pathogen Pseudomonas aeruginosa causes chronic lung infection in patients with cystic fibrosis. The Liverpool Epidemic Strain LESB58 is highly resistant to antibiotics, transmissible, and associated with increased morbidity and mortality. Its genome contains 6 prophages and 5 genomic islands. We constructed a polymerase chain reaction (PCR)-based signature-tagged mutagenesis library of 9216 LESB58 mutants and screened the mutants in a rat model of chronic lung infection. A total of 162 mutants were identified as defective for in vivo maintenance, with 11 signature-tagged mutagenesis mutants having insertions in prophage and genomic island genes. Many of these mutants showed both diminished virulence and reduced phage production. Transcription profiling by quantitative PCR and RNA-Seq suggested that disruption of these prophages had a widespread trans-acting effect on the transcriptome. This study demonstrates that temperate phages play a pivotal role in the establishment of infection through modulation of bacterial host gene expression. © The Author 2015. Published by Oxford University Press for the Infectious Diseases Society of America. All rights reserved. For permissions, e-mail journals.permissions@oup.com.
Genomic Signatures of Speciation in Sympatric and Allopatric Hawaiian Picture-Winged Drosophila

PubMed Central

Kang, Lin; Settlage, Robert; McMahon, Wyatt; Michalak, Katarzyna; Tae, Hongseok; Garner, Harold R.; Stacy, Elizabeth A.; Price, Donald K.; Michalak, Pawel

2016-01-01

The Hawaiian archipelago provides a natural arena for understanding adaptive radiation and speciation. The Hawaiian Drosophila are one of the most diverse endemic groups in Hawaiì with up to 1,000 species. We sequenced and analyzed entire genomes of recently diverged species of Hawaiian picture-winged Drosophila, Drosophila silvestris and Drosophila heteroneura from Hawaiì Island, in comparison with Drosophila planitibia, their sister species from Maui, a neighboring island where a common ancestor of all three had likely occurred. Genome-wide single nucleotide polymorphism patterns suggest the more recent origin of D. silvestris and D. heteroneura, as well as a pervasive influence of positive selection on divergence of the three species, with the signatures of positive selection more prominent in sympatry than allopatry. Positively selected genes were significantly enriched for functional terms related to sensory detection and mating, suggesting that sexual selection played an important role in speciation of these species. In particular, sequence variation in Olfactory receptor and Gustatory receptor genes seems to play a major role in adaptive radiation in Hawaiian pictured-winged Drosophila. PMID:27189993
Genomic Expression Patterns in Menstrually-Related Migraine in Adolescents

PubMed Central

Hershey, Andrew; Horn, Paul; Kabbouche, Marielle; O'Brien, Hope; Powers, Scott

2011-01-01

Background Exacerbation of migraine with menses is common in adolescent girls and women with migraine, occurring in up to 60% of females with migraine. These migraines are oftentimes longer and more disabling and may be related to estrogen levels and hormonal fluctuations. Objective This study identifies the unique genomic expression pattern of menstrually-related migraine (MRM) in comparison to migraine occurring outside the menstrual period and headache free controls. Methods Whole blood samples were obtained from female subjects having an acute migraine during their menstrual period (MRM) or outside of their menstrual period (nonMRM) and controls (C) – females having a menstrual period without any history of headache. The mRNA was isolated from these samples and genomic profile was assessed. Affymetrix Human Exon ST 1.0 arrays were used to examine the genomic expression pattern differences between these three groups. Results Blood genomic expression patterns were obtained on 56 subjects (MRM = 18, nonMRM = 18 and C = 20). Unique genomic expression patterns were observed for both MRM and nonMRM. For MRM, 77 genes were identified that were unique to MRM, while 61 genes were commonly expressed for MRM and nonMRM and 127 genes appeared to have a unique expression pattern for nonMRM. In addition, there were 279 genes that differentially expressed for MRM compared to nonMRM that were not differentially expressed for nonMRM. Gene ontology of these samples indicated many of these groups of genes were functionally related and included categories of immunomodulation/inflammation, mitochondrial function and DNA homeostasis. Conclusions Blood genomic patterns can accurately differentiate MRM from nonMRM. These results indicate that MRM involves a unique molecular biology pathway that can be identified with a specific biomarker and suggest that individuals with MRM have a different underlying genetic etiology. PMID:22220971
Sorghum Expressed Sequence Tags Identify Signature Genes for Drought, Pathogenesis, and Skotomorphogenesis from a Milestone Set of 16,801 Unique Transcripts1[w

PubMed Central

Pratt, Lee H.; Liang, Chun; Shah, Manish; Sun, Feng; Wang, Haiming; Reid, St. Patrick; Gingle, Alan R.; Paterson, Andrew H.; Wing, Rod; Dean, Ralph; Klein, Robert; Nguyen, Henry T.; Ma, Hong-mei; Zhao, Xin; Morishige, Daryl T.; Mullet, John E.; Cordonnier-Pratt, Marie-Michèle

2005-01-01

Improved knowledge of the sorghum transcriptome will enhance basic understanding of how plants respond to stresses and serve as a source of genes of value to agriculture. Toward this goal, Sorghum bicolor L. Moench cDNA libraries were prepared from light- and dark-grown seedlings, drought-stressed plants, Colletotrichum-infected seedlings and plants, ovaries, embryos, and immature panicles. Other libraries were prepared with meristems from Sorghum propinquum (Kunth) Hitchc. that had been photoperiodically induced to flower, and with rhizomes from S. propinquum and johnsongrass (Sorghum halepense L. Pers.). A total of 117,682 expressed sequence tags (ESTs) were obtained representing both 3′ and 5′ sequences from about half that number of cDNA clones. A total of 16,801 unique transcripts, representing tentative UniScripts (TUs), were identified from 55,783 3′ ESTs. Of these TUs, 9,032 are represented by two or more ESTs. Collectively, these libraries were predicted to contain a total of approximately 31,000 TUs. Individual libraries, however, were predicted to contain no more than about 6,000 to 9,000, with the exception of light-grown seedlings, which yielded an estimate of close to 13,000. In addition, each library exhibits about the same level of complexity with respect to both the number of TUs preferentially expressed in that library and the frequency with which two or more ESTs is found in only that library. These results indicate that the sorghum genome is expressed in highly selective fashion in the individual organs and in response to the environmental conditions surveyed here. Close to 2,000 differentially expressed TUs were identified among the cDNA libraries examined, of which 775 were differentially expressed at a confidence level of 98%. From these 775 TUs, signature genes were identified defining drought, Colletotrichum infection, skotomorphogenesis (etiolation), ovary, immature panicle, and embryo. PMID:16169961
Generation and characterization of the sea bass Dicentrarchus labrax brain and liver transcriptomes.

PubMed

Magnanou, Elodie; Klopp, Christophe; Noirot, Celine; Besseau, Laurence; Falcón, Jack

2014-07-01

The sea bass Dicentrarchus labrax is the center of interest of an increasing number of basic or applied research investigations, even though few genomic or transcriptomic data is available. Current public data only represent a very partial view of its transcriptome. To fill this need, we characterized brain and liver transcriptomes in a generalist manner that would benefit the entire scientific community. We also tackled some bioinformatics questions, related to the effect of RNA fragment size on the assembly quality. Using Illumina RNA-seq, we sequenced organ pools from both wild and farmed Atlantic and Mediterranean fishes. We built two distinct cDNA libraries per organ that only differed by the length of the selected mRNA fragments. Efficiency of assemblies performed on either or both fragments size differed depending on the organ, but remained very close reflecting the quality of the technical replication. We generated more than 19,538Mbp of data. Over 193million reads were assembled into 35,073 contigs (average length=2374bp; N50=3257). 59% contigs were annotated with SwissProt, which corresponded to 12,517 unique genes. We compared the Gene Ontology (GO) contig distribution between the sea bass and the tilapia. We also looked for brain and liver GO specific signatures as well as KEGG pathway coverage. 23,050 putative micro-satellites and 134,890 putative SNPs were identified. Our sampling strategy and assembly pipeline provided a reliable and broad reference transcriptome for the sea bass. It constitutes an indisputable quantitative and qualitative improvement of the public data, as it provides 5 times more base pairs with fewer and longer contigs. Both organs present unique signatures consistent with their specific physiological functions. The discrepancy in fragment size effect on assembly quality between organs lies in their difference in complexity and thus does not allow prescribing any general strategy. This information on two key organs will facilitate further functional approaches. Copyright © 2014 Elsevier B.V. All rights reserved.
Whole genome detection of signature of positive selection in African cattle reveals selection for thermotolerance.

PubMed

Taye, Mengistie; Lee, Wonseok; Caetano-Anolles, Kelsey; Dessie, Tadelle; Hanotte, Olivier; Mwai, Okeyo Ally; Kemp, Stephen; Cho, Seoae; Oh, Sung Jong; Lee, Hak-Kyo; Kim, Heebal

2017-12-01

As African indigenous cattle evolved in a hot tropical climate, they have developed an inherent thermotolerance; survival mechanisms include a light-colored and shiny coat, increased sweating, and cellular and molecular mechanisms to cope with high environmental temperature. Here, we report the positive selection signature of genes in African cattle breeds which contribute for their heat tolerance mechanisms. We compared the genomes of five indigenous African cattle breeds with the genomes of four commercial cattle breeds using cross-population composite likelihood ratio (XP-CLR) and cross-population extended haplotype homozygosity (XP-EHH) statistical methods. We identified 296 (XP-EHH) and 327 (XP-CLR) positively selected genes. Gene ontology analysis resulted in 41 biological process terms and six Kyoto Encyclopedia of Genes and Genomes pathways. Several genes and pathways were found to be involved in oxidative stress response, osmotic stress response, heat shock response, hair and skin properties, sweat gland development and sweating, feed intake and metabolism, and reproduction functions. The genes and pathways identified directly or indirectly contribute to the superior heat tolerance mechanisms in African cattle populations. The result will improve our understanding of the biological mechanisms of heat tolerance in African cattle breeds and opens an avenue for further study. © 2017 Japanese Society of Animal Science.
Virus-like attachment sites as structural landmarks of plants retrotransposons.

PubMed

Ochoa Cruz, Edgar Andres; Cruz, Guilherme Marcello Queiroga; Vieira, Andréia Prata; Van Sluys, Marie-Anne

2016-01-01

The genomic data available nowadays has enabled the study of repetitive sequences and their relationship to viruses. Among them, long terminal repeat retrotransposons (LTR-RTs) are the largest component of most plant genomes, the Gypsy and Copia superfamilies being the most common. Recently it has been found that Del lineage, an LTR-RT of Gypsy superfamily, has putative virus-like attachment (vl-att) sites. This signature, originally described for retroviruses, is recognized by retroviral integrase conferring specificity to the integration process. Here we retrieved 26,092 putative complete LTR-RTs from 10 lineages found in 10 fully sequenced angiosperm genomes and found putative vl-att sites that are a conserved structural landmark across these genomes. Furthermore, we reveal that each plant genome has a distinguishable LTR-RT lineage amplification pattern that could be related to the vl-att sites diversity. We used these patterns to generate a specific quick-response (QR) code for each genome that could be used as a barcode of identification of plants in the future. The universal distribution of vl-att sites represents a new structural feature common to plant LTR-RTs and retroviruses. This is an important finding that expands the information about the structural similarity between LTR-RT and retroviruses. We speculate that the sequence diversity of vl-att sites could be important for the life cycle of retrotransposons, as it was shown for retroviruses. All the structural vl-att site signatures are strong candidates for further functional studies. Moreover, this is the first identification of specific LTR-RT content and their amplification patterns in a large dataset of LTR-RT lineages and angiosperm genomes. These distribution patterns could be used in the future with biotechnological identification purposes.
Gene-expression signature regulated by the KEAP1-NRF2-CUL3 axis is associated with a poor prognosis in head and neck squamous cell cancer.

PubMed

Namani, Akhileshwar; Matiur Rahaman, Md; Chen, Ming; Tang, Xiuwen

2018-01-06

NRF2 is the key regulator of oxidative stress in normal cells and aberrant expression of the NRF2 pathway due to genetic alterations in the KEAP1 (Kelch-like ECH-associated protein 1)-NRF2 (nuclear factor erythroid 2 like 2)-CUL3 (cullin 3) axis leads to tumorigenesis and drug resistance in many cancers including head and neck squamous cell cancer (HNSCC). The main goal of this study was to identify specific genes regulated by the KEAP1-NRF2-CUL3 axis in HNSCC patients, to assess the prognostic value of this gene signature in different cohorts, and to reveal potential biomarkers. RNA-Seq V2 level 3 data from 279 tumor samples along with 37 adjacent normal samples from patients enrolled in the The Cancer Genome Atlas (TCGA)-HNSCC study were used to identify upregulated genes using two methods (altered KEAP1-NRF2-CUL3 versus normal, and altered KEAP1-NRF2-CUL3 versus wild-type). We then used a new approach to identify the combined gene signature by integrating both datasets and subsequently tested this signature in 4 independent HNSCC datasets to assess its prognostic value. In addition, functional annotation using the DAVID v6.8 database and protein-protein interaction (PPI) analysis using the STRING v10 database were performed on the signature. A signature composed of a subset of 17 genes regulated by the KEAP1-NRF2-CUL3 axis was identified by overlapping both the upregulated genes of altered versus normal (251 genes) and altered versus wild-type (25 genes) datasets. We showed that increased expression was significantly associated with poor survival in 4 independent HNSCC datasets, including the TCGA-HNSCC dataset. Furthermore, Gene Ontology, Kyoto Encyclopedia of Genes and Genomes, and PPI analysis revealed that most of the genes in this signature are associated with drug metabolism and glutathione metabolic pathways. Altogether, our study emphasizes the discovery of a gene signature regulated by the KEAP1-NRF2-CUL3 axis which is strongly associated with tumorigenesis and drug resistance in HNSCC. This 17-gene signature provides potential biomarkers and therapeutic targets for HNSCC cases in which the NRF2 pathway is activated.
Genome Sequence of a Canadian Vibrio parahaemolyticus Isolate with Unique Mobilizing Capacity.

PubMed

Bioteau, Audrey; Huguet, Kévin; Burrus, Vincent; Banerjee, Swapan

2018-06-14

Vibrio parahaemolyticus is a clinically significant marine bacterium implicated in gastroenteritis among consumers of raw or undercooked seafood. This report presents the whole-genome sequence of a unique strain of V. parahaemolyticus isolated from oysters harvested in Canada. © Crown copyright 2018.
Ion channel gene expression predicts survival in glioma patients

PubMed Central

Wang, Rong; Gurguis, Christopher I.; Gu, Wanjun; Ko, Eun A; Lim, Inja; Bang, Hyoweon; Zhou, Tong; Ko, Jae-Hong

2015-01-01

Ion channels are important regulators in cell proliferation, migration, and apoptosis. The malfunction and/or aberrant expression of ion channels may disrupt these important biological processes and influence cancer progression. In this study, we investigate the expression pattern of ion channel genes in glioma. We designate 18 ion channel genes that are differentially expressed in high-grade glioma as a prognostic molecular signature. This ion channel gene expression based signature predicts glioma outcome in three independent validation cohorts. Interestingly, 16 of these 18 genes were down-regulated in high-grade glioma. This signature is independent of traditional clinical, molecular, and histological factors. Resampling tests indicate that the prognostic power of the signature outperforms random gene sets selected from human genome in all the validation cohorts. More importantly, this signature performs better than the random gene signatures selected from glioma-associated genes in two out of three validation datasets. This study implicates ion channels in brain cancer, thus expanding on knowledge of their roles in other cancers. Individualized profiling of ion channel gene expression serves as a superior and independent prognostic tool for glioma patients. PMID:26235283
In vitro downregulated hypoxia transcriptome is associated with poor prognosis in breast cancer.

PubMed

Abu-Jamous, Basel; Buffa, Francesca M; Harris, Adrian L; Nandi, Asoke K

2017-06-15

Hypoxia is a characteristic of breast tumours indicating poor prognosis. Based on the assumption that those genes which are up-regulated under hypoxia in cell-lines are expected to be predictors of poor prognosis in clinical data, many signatures of poor prognosis were identified. However, it was observed that cell line data do not always concur with clinical data, and therefore conclusions from cell line analysis should be considered with caution. As many transcriptomic cell-line datasets from hypoxia related contexts are available, integrative approaches which investigate these datasets collectively, while not ignoring clinical data, are required. We analyse sixteen heterogeneous breast cancer cell-line transcriptomic datasets in hypoxia-related conditions collectively by employing the unique capabilities of the method, UNCLES, which integrates clustering results from multiple datasets and can address questions that cannot be answered by existing methods. This has been demonstrated by comparison with the state-of-the-art iCluster method. From this collection of genome-wide datasets include 15,588 genes, UNCLES identified a relatively high number of genes (>1000 overall) which are consistently co-regulated over all of the datasets, and some of which are still poorly understood and represent new potential HIF targets, such as RSBN1 and KIAA0195. Two main, anti-correlated, clusters were identified; the first is enriched with MYC targets participating in growth and proliferation, while the other is enriched with HIF targets directly participating in the hypoxia response. Surprisingly, in six clinical datasets, some sub-clusters of growth genes are found consistently positively correlated with hypoxia response genes, unlike the observation in cell lines. Moreover, the ability to predict bad prognosis by a combined signature of one sub-cluster of growth genes and one sub-cluster of hypoxia-induced genes appears to be comparable and perhaps greater than that of known hypoxia signatures. We present a clustering approach suitable to integrate data from diverse experimental set-ups. Its application to breast cancer cell line datasets reveals new hypoxia-regulated signatures of genes which behave differently when in vitro (cell-line) data is compared with in vivo (clinical) data, and are of a prognostic value comparable or exceeding the state-of-the-art hypoxia signatures.
Chloroplast DNA sequence of the green alga Oedogonium cardiacum (Chlorophyceae): Unique genome architecture, derived characters shared with the Chaetophorales and novel genes acquired through horizontal transfer

PubMed Central

Brouard, Jean-Simon; Otis, Christian; Lemieux, Claude; Turmel, Monique

2008-01-01

Background To gain insight into the branching order of the five main lineages currently recognized in the green algal class Chlorophyceae and to expand our understanding of chloroplast genome evolution, we have undertaken the sequencing of chloroplast DNA (cpDNA) from representative taxa. The complete cpDNA sequences previously reported for Chlamydomonas (Chlamydomonadales), Scenedesmus (Sphaeropleales), and Stigeoclonium (Chaetophorales) revealed tremendous variability in their architecture, the retention of only few ancestral gene clusters, and derived clusters shared by Chlamydomonas and Scenedesmus. Unexpectedly, our recent phylogenies inferred from these cpDNAs and the partial sequences of three other chlorophycean cpDNAs disclosed two major clades, one uniting the Chlamydomonadales and Sphaeropleales (CS clade) and the other uniting the Oedogoniales, Chaetophorales and Chaetopeltidales (OCC clade). Although molecular signatures provided strong support for this dichotomy and for the branching of the Oedogoniales as the earliest-diverging lineage of the OCC clade, more data are required to validate these phylogenies. We describe here the complete cpDNA sequence of Oedogonium cardiacum (Oedogoniales). Results Like its three chlorophycean homologues, the 196,547-bp Oedogonium chloroplast genome displays a distinctive architecture. This genome is one of the most compact among photosynthetic chlorophytes. It has an atypical quadripartite structure, is intron-rich (17 group I and 4 group II introns), and displays 99 different conserved genes and four long open reading frames (ORFs), three of which are clustered in the spacious inverted repeat of 35,493 bp. Intriguingly, two of these ORFs (int and dpoB) revealed high similarities to genes not usually found in cpDNA. At the gene content and gene order levels, the Oedogonium genome most closely resembles its Stigeoclonium counterpart. Characters shared by these chlorophyceans but missing in members of the CS clade include the retention of psaM, rpl32 and trnL(caa), the loss of petA, the disruption of three ancestral clusters and the presence of five derived gene clusters. Conclusion The Oedogonium chloroplast genome disclosed additional characters that bolster the evidence for a close alliance between the Oedogoniales and Chaetophorales. Our unprecedented finding of int and dpoB in this cpDNA provides a clear example that novel genes were acquired by the chloroplast genome through horizontal transfers, possibly from a mitochondrial genome donor. PMID:18558012
Syntenic block overlap multiplicities with a panel of reference genomes provide a signature of ancient polyploidization events.

PubMed

Zheng, Chunfang; Santos Muñoz, Daniella; Albert, Victor A; Sankoff, David

2015-01-01

Following whole genome duplication (WGD), there is a compact distribution of gene similarities within the genome reflecting duplicate pairs of all the genes in the genome. With time, the distribution broadens and loses volume due to variable decay of duplicate gene similarity and to the process of duplicate gene loss. If there are two WGD, the older one becomes so reduced and broad that it merges with the tail of the distributions resulting from more recent events, and it becomes difficult to distinguish them. The goal of this paper is to advance statistical methods of identifying, or at least counting, the WGD events in the lineage of a given genome. For a set of 15 angiosperm genomes, we analyze all 15 × 14 = 210 ordered pairs of target genome versus reference genome, using SynMap to find syntenic blocks. We consider all sets of B ≥ 2 syntenic blocks in the target genome that overlap in the reference genome as evidence of WGD activity in the target, whether it be one event or several. We hypothesize that in fitting an exponential function to the tail of the empirical distribution f (B) of block multiplicities, the size of the exponent will reflect the amount of WGD in the history of the target genome. By amalgamating the results from all reference genomes, a range of values of SynMap parameters, and alternative cutoff points for the tail, we find a clear pattern whereby multiple-WGD core eudicots have the smallest (negative) exponents, followed by core eudicots with only the single "γ" triplication in their history, followed by a non-core eudicot with a single WGD, followed by the monocots, with a basal angiosperm, the WGD-free Amborella having the largest exponent. The hypothesis that the exponent of the fit to the tail of the multiplicity distribution is a signature of the amount of WGD is verified, but there is also a clear complicating factor in the monocot clade, where a history of multiple WGD is not reflected in a small exponent.

Comparative Genome Structure, Secondary Metabolite, and Effector Coding Capacity across Cochliobolus Pathogens

PubMed Central

Bushley, Kathryn E.; Ohm, Robin A.; Otillar, Robert; Martin, Joel; Schackwitz, Wendy; Grimwood, Jane; MohdZainudin, NurAinIzzati; Xue, Chunsheng; Wang, Rui; Manning, Viola A.; Dhillon, Braham; Tu, Zheng Jin; Steffenson, Brian J.; Salamov, Asaf; Sun, Hui; Lowry, Steve; LaButti, Kurt; Han, James; Copeland, Alex; Lindquist, Erika; Barry, Kerrie; Schmutz, Jeremy; Baker, Scott E.; Ciuffetti, Lynda M.; Grigoriev, Igor V.; Zhong, Shaobin; Turgeon, B. Gillian

2013-01-01

The genomes of five Cochliobolus heterostrophus strains, two Cochliobolus sativus strains, three additional Cochliobolus species (Cochliobolus victoriae, Cochliobolus carbonum, Cochliobolus miyabeanus), and closely related Setosphaeria turcica were sequenced at the Joint Genome Institute (JGI). The datasets were used to identify SNPs between strains and species, unique genomic regions, core secondary metabolism genes, and small secreted protein (SSP) candidate effector encoding genes with a view towards pinpointing structural elements and gene content associated with specificity of these closely related fungi to different cereal hosts. Whole-genome alignment shows that three to five percent of each genome differs between strains of the same species, while a quarter of each genome differs between species. On average, SNP counts among field isolates of the same C. heterostrophus species are more than 25× higher than those between inbred lines and 50× lower than SNPs between Cochliobolus species. The suites of nonribosomal peptide synthetase (NRPS), polyketide synthase (PKS), and SSP–encoding genes are astoundingly diverse among species but remarkably conserved among isolates of the same species, whether inbred or field strains, except for defining examples that map to unique genomic regions. Functional analysis of several strain-unique PKSs and NRPSs reveal a strong correlation with a role in virulence. PMID:23357949
Comparative Genome Structure, Secondary Metabolite, and Effector Coding Capacity across Cochliobolus Pathogens

DOE Office of Scientific and Technical Information (OSTI.GOV)

Condon, Bradford J.; Leng, Yueqiang; Wu, Dongliang

The genomes of five Cochliobolus heterostrophus strains, two Cochliobolus sativus strains, three additional Cochliobolus species (Cochliobolus victoriae, Cochliobolus carbonum, Cochliobolus miyabeanus), and closely related Setosphaeria turcica were sequenced at the Joint Genome Institute (JGI). The datasets were used to identify SNPs between strains and species, unique genomic regions, core secondary metabolism genes, and small secreted protein (SSP) candidate effector encoding genes with a view towards pinpointing structural elements and gene content associated with specificity of these closely related fungi to different cereal hosts. Whole-genome alignment shows that three to five of each genome differs between strains of the same species,more » while a quarter of each genome differs between species. On average, SNP counts among field isolates of the same C. heterostrophus species are more than 25 higher than those between inbred lines and 50 lower than SNPs between Cochliobolus species. The suites of nonribosomal peptide synthetase (NRPS), polyketide synthase (PKS), and SSP encoding genes are astoundingly diverse among species but remarkably conserved among isolates of the same species, whether inbred or field strains, except for defining examples that map to unique genomic regions. Functional analysis of several strain-unique PKSs and NRPSs reveal a strong correlation with a role in virulence.« less
Characterising private and shared signatures of positive selection in 37 Asian populations.

PubMed

Liu, Xuanyao; Lu, Dongsheng; Saw, Woei-Yuh; Shaw, Philip J; Wangkumhang, Pongsakorn; Ngamphiw, Chumpol; Fucharoen, Suthat; Lert-Itthiporn, Worachart; Chin-Inmanu, Kwanrutai; Chau, Tran Nguyen Bich; Anders, Katie; Kasturiratne, Anuradhani; de Silva, H Janaka; Katsuya, Tomohiro; Kimura, Ryosuke; Nabika, Toru; Ohkubo, Takayoshi; Tabara, Yasuharu; Takeuchi, Fumihiko; Yamamoto, Ken; Yokota, Mitsuhiro; Mamatyusupu, Dolikun; Yang, Wenjun; Chung, Yeun-Jun; Jin, Li; Hoh, Boon-Peng; Wickremasinghe, Ananda R; Ong, RickTwee-Hee; Khor, Chiea-Chuen; Dunstan, Sarah J; Simmons, Cameron; Tongsima, Sissades; Suriyaphol, Prapat; Kato, Norihiro; Xu, Shuhua; Teo, Yik-Ying

2017-04-01

The Asian Diversity Project (ADP) assembled 37 cosmopolitan and ethnic minority populations in Asia that have been densely genotyped across over half a million markers to study patterns of genetic diversity and positive natural selection. We performed population structure analyses of the ADP populations and divided these populations into four major groups based on their genographic information. By applying a highly sensitive algorithm haploPS to locate genomic signatures of positive selection, 140 distinct genomic regions exhibiting evidence of positive selection in at least one population were identified. We examined the extent of signal sharing for regions that were selected in multiple populations and observed that populations clustered in a similar fashion to that of how the ancestry clades were phylogenetically defined. In particular, populations predominantly located in South Asia underwent considerably different adaptation as compared with populations from the other geographical regions. Signatures of positive selection present in multiple geographical regions were predicted to be older and have emerged prior to the separation of the populations in the different regions. In contrast, selection signals present in a single population group tended to be of lower frequencies and thus can be attributed to recent evolutionary events.
Cross-study projections of genomic biomarkers: an evaluation in cancer genomics.

PubMed

Lucas, Joseph E; Carvalho, Carlos M; Chen, Julia Ling-Yu; Chi, Jen-Tsan; West, Mike

2009-01-01

Human disease studies using DNA microarrays in both clinical/observational and experimental/controlled studies are having increasing impact on our understanding of the complexity of human diseases. A fundamental concept is the use of gene expression as a "common currency" that links the results of in vitro controlled experiments to in vivo observational human studies. Many studies--in cancer and other diseases--have shown promise in using in vitro cell manipulations to improve understanding of in vivo biology, but experiments often simply fail to reflect the enormous phenotypic variation seen in human diseases. We address this with a framework and methods to dissect, enhance and extend the in vivo utility of in vitro derived gene expression signatures. From an experimentally defined gene expression signature we use statistical factor analysis to generate multiple quantitative factors in human cancer gene expression data. These factors retain their relationship to the original, one-dimensional in vitro signature but better describe the diversity of in vivo biology. In a breast cancer analysis, we show that factors can reflect fundamentally different biological processes linked to molecular and clinical features of human cancers, and that in combination they can improve prediction of clinical outcomes.
Characterising private and shared signatures of positive selection in 37 Asian populations

PubMed Central

Liu, Xuanyao; Lu, Dongsheng; Saw, Woei-Yuh; Shaw, Philip J; Wangkumhang, Pongsakorn; Ngamphiw, Chumpol; Fucharoen, Suthat; Lert-itthiporn, Worachart; Chin-inmanu, Kwanrutai; Chau, Tran Nguyen Bich; Anders, Katie; Kasturiratne, Anuradhani; de Silva, H Janaka; Katsuya, Tomohiro; Kimura, Ryosuke; Nabika, Toru; Ohkubo, Takayoshi; Tabara, Yasuharu; Takeuchi, Fumihiko; Yamamoto, Ken; Yokota, Mitsuhiro; Mamatyusupu, Dolikun; Yang, Wenjun; Chung, Yeun-Jun; Jin, Li; Hoh, Boon-Peng; Wickremasinghe, Ananda R; Ong, RickTwee-Hee; Khor, Chiea-Chuen; Dunstan, Sarah J; Simmons, Cameron; Tongsima, Sissades; Suriyaphol, Prapat; Kato, Norihiro; Xu, Shuhua; Teo, Yik-Ying

2017-01-01

The Asian Diversity Project (ADP) assembled 37 cosmopolitan and ethnic minority populations in Asia that have been densely genotyped across over half a million markers to study patterns of genetic diversity and positive natural selection. We performed population structure analyses of the ADP populations and divided these populations into four major groups based on their genographic information. By applying a highly sensitive algorithm haploPS to locate genomic signatures of positive selection, 140 distinct genomic regions exhibiting evidence of positive selection in at least one population were identified. We examined the extent of signal sharing for regions that were selected in multiple populations and observed that populations clustered in a similar fashion to that of how the ancestry clades were phylogenetically defined. In particular, populations predominantly located in South Asia underwent considerably different adaptation as compared with populations from the other geographical regions. Signatures of positive selection present in multiple geographical regions were predicted to be older and have emerged prior to the separation of the populations in the different regions. In contrast, selection signals present in a single population group tended to be of lower frequencies and thus can be attributed to recent evolutionary events. PMID:28098149
Blind Quantum Signature with Blind Quantum Computation

NASA Astrophysics Data System (ADS)

Li, Wei; Shi, Ronghua; Guo, Ying

2017-04-01

Blind quantum computation allows a client without quantum abilities to interact with a quantum server to perform a unconditional secure computing protocol, while protecting client's privacy. Motivated by confidentiality of blind quantum computation, a blind quantum signature scheme is designed with laconic structure. Different from the traditional signature schemes, the signing and verifying operations are performed through measurement-based quantum computation. Inputs of blind quantum computation are securely controlled with multi-qubit entangled states. The unique signature of the transmitted message is generated by the signer without leaking information in imperfect channels. Whereas, the receiver can verify the validity of the signature using the quantum matching algorithm. The security is guaranteed by entanglement of quantum system for blind quantum computation. It provides a potential practical application for e-commerce in the cloud computing and first-generation quantum computation.
Wild emmer genome architecture and diversity elucidate wheat evolution and domestication.

PubMed

Avni, Raz; Nave, Moran; Barad, Omer; Baruch, Kobi; Twardziok, Sven O; Gundlach, Heidrun; Hale, Iago; Mascher, Martin; Spannagl, Manuel; Wiebe, Krystalee; Jordan, Katherine W; Golan, Guy; Deek, Jasline; Ben-Zvi, Batsheva; Ben-Zvi, Gil; Himmelbach, Axel; MacLachlan, Ron P; Sharpe, Andrew G; Fritz, Allan; Ben-David, Roi; Budak, Hikmet; Fahima, Tzion; Korol, Abraham; Faris, Justin D; Hernandez, Alvaro; Mikel, Mark A; Levy, Avraham A; Steffenson, Brian; Maccaferri, Marco; Tuberosa, Roberto; Cattivelli, Luigi; Faccioli, Primetta; Ceriotti, Aldo; Kashkush, Khalil; Pourkheirandish, Mohammad; Komatsuda, Takao; Eilam, Tamar; Sela, Hanan; Sharon, Amir; Ohad, Nir; Chamovitz, Daniel A; Mayer, Klaus F X; Stein, Nils; Ronen, Gil; Peleg, Zvi; Pozniak, Curtis J; Akhunov, Eduard D; Distelfeld, Assaf

2017-07-07

Wheat ( Triticum spp.) is one of the founder crops that likely drove the Neolithic transition to sedentary agrarian societies in the Fertile Crescent more than 10,000 years ago. Identifying genetic modifications underlying wheat's domestication requires knowledge about the genome of its allo-tetraploid progenitor, wild emmer ( T. turgidum ssp. dicoccoides ). We report a 10.1-gigabase assembly of the 14 chromosomes of wild tetraploid wheat, as well as analyses of gene content, genome architecture, and genetic diversity. With this fully assembled polyploid wheat genome, we identified the causal mutations in Brittle Rachis 1 ( TtBtr1 ) genes controlling shattering, a key domestication trait. A study of genomic diversity among wild and domesticated accessions revealed genomic regions bearing the signature of selection under domestication. This reference assembly will serve as a resource for accelerating the genome-assisted improvement of modern wheat varieties. Copyright © 2017, American Association for the Advancement of Science.
The tiger genome and comparative analysis with lion and snow leopard genomes.

PubMed

Cho, Yun Sung; Hu, Li; Hou, Haolong; Lee, Hang; Xu, Jiaohui; Kwon, Soowhan; Oh, Sukhun; Kim, Hak-Min; Jho, Sungwoong; Kim, Sangsoo; Shin, Young-Ah; Kim, Byung Chul; Kim, Hyunmin; Kim, Chang-Uk; Luo, Shu-Jin; Johnson, Warren E; Koepfli, Klaus-Peter; Schmidt-Küntzel, Anne; Turner, Jason A; Marker, Laurie; Harper, Cindy; Miller, Susan M; Jacobs, Wilhelm; Bertola, Laura D; Kim, Tae Hyung; Lee, Sunghoon; Zhou, Qian; Jung, Hyun-Ju; Xu, Xiao; Gadhvi, Priyvrat; Xu, Pengwei; Xiong, Yingqi; Luo, Yadan; Pan, Shengkai; Gou, Caiyun; Chu, Xiuhui; Zhang, Jilin; Liu, Sanyang; He, Jing; Chen, Ying; Yang, Linfeng; Yang, Yulan; He, Jiaju; Liu, Sha; Wang, Junyi; Kim, Chul Hong; Kwak, Hwanjong; Kim, Jong-Soo; Hwang, Seungwoo; Ko, Junsu; Kim, Chang-Bae; Kim, Sangtae; Bayarlkhagva, Damdin; Paek, Woon Kee; Kim, Seong-Jin; O'Brien, Stephen J; Wang, Jun; Bhak, Jong

2013-01-01

Tigers and their close relatives (Panthera) are some of the world's most endangered species. Here we report the de novo assembly of an Amur tiger whole-genome sequence as well as the genomic sequences of a white Bengal tiger, African lion, white African lion and snow leopard. Through comparative genetic analyses of these genomes, we find genetic signatures that may reflect molecular adaptations consistent with the big cats' hypercarnivorous diet and muscle strength. We report a snow leopard-specific genetic determinant in EGLN1 (Met39>Lys39), which is likely to be associated with adaptation to high altitude. We also detect a TYR260G>A mutation likely responsible for the white lion coat colour. Tiger and cat genomes show similar repeat composition and an appreciably conserved synteny. Genomic data from the five big cats provide an invaluable resource for resolving easily identifiable phenotypes evident in very close, but distinct, species.
The tiger genome and comparative analysis with lion and snow leopard genomes

PubMed Central

Cho, Yun Sung; Hu, Li; Hou, Haolong; Lee, Hang; Xu, Jiaohui; Kwon, Soowhan; Oh, Sukhun; Kim, Hak-Min; Jho, Sungwoong; Kim, Sangsoo; Shin, Young-Ah; Kim, Byung Chul; Kim, Hyunmin; Kim, Chang-uk; Luo, Shu-Jin; Johnson, Warren E.; Koepfli, Klaus-Peter; Schmidt-Küntzel, Anne; Turner, Jason A.; Marker, Laurie; Harper, Cindy; Miller, Susan M.; Jacobs, Wilhelm; Bertola, Laura D.; Kim, Tae Hyung; Lee, Sunghoon; Zhou, Qian; Jung, Hyun-Ju; Xu, Xiao; Gadhvi, Priyvrat; Xu, Pengwei; Xiong, Yingqi; Luo, Yadan; Pan, Shengkai; Gou, Caiyun; Chu, Xiuhui; Zhang, Jilin; Liu, Sanyang; He, Jing; Chen, Ying; Yang, Linfeng; Yang, Yulan; He, Jiaju; Liu, Sha; Wang, Junyi; Kim, Chul Hong; Kwak, Hwanjong; Kim, Jong-Soo; Hwang, Seungwoo; Ko, Junsu; Kim, Chang-Bae; Kim, Sangtae; Bayarlkhagva, Damdin; Paek, Woon Kee; Kim, Seong-Jin; O’Brien, Stephen J.; Wang, Jun; Bhak, Jong

2013-01-01

Tigers and their close relatives (Panthera) are some of the world’s most endangered species. Here we report the de novo assembly of an Amur tiger whole-genome sequence as well as the genomic sequences of a white Bengal tiger, African lion, white African lion and snow leopard. Through comparative genetic analyses of these genomes, we find genetic signatures that may reflect molecular adaptations consistent with the big cats’ hypercarnivorous diet and muscle strength. We report a snow leopard-specific genetic determinant in EGLN1 (Met39>Lys39), which is likely to be associated with adaptation to high altitude. We also detect a TYR260G>A mutation likely responsible for the white lion coat colour. Tiger and cat genomes show similar repeat composition and an appreciably conserved synteny. Genomic data from the five big cats provide an invaluable resource for resolving easily identifiable phenotypes evident in very close, but distinct, species. PMID:24045858
Chemodiversity of dissolved organic matter in the Amazon Basin

NASA Astrophysics Data System (ADS)

Gonsior, Michael; Valle, Juliana; Schmitt-Kopplin, Philippe; Hertkorn, Norbert; Bastviken, David; Luek, Jenna; Harir, Mourad; Bastos, Wanderley; Enrich-Prast, Alex

2016-07-01

Regions in the Amazon Basin have been associated with specific biogeochemical processes, but a detailed chemical classification of the abundant and ubiquitous dissolved organic matter (DOM), beyond specific indicator compounds and bulk measurements, has not yet been established. We sampled water from different locations in the Negro, Madeira/Jamari and Tapajós River areas to characterize the molecular DOM composition and distribution. Ultrahigh-resolution Fourier transform ion cyclotron resonance mass spectrometry (FT-ICR-MS) combined with excitation emission matrix (EEM) fluorescence spectroscopy and parallel factor analysis (PARAFAC) revealed a large proportion of ubiquitous DOM but also unique area-specific molecular signatures. Unique to the DOM of the Rio Negro area was the large abundance of high molecular weight, diverse hydrogen-deficient and highly oxidized molecular ions deviating from known lignin or tannin compositions, indicating substantial oxidative processing of these ultimately plant-derived polyphenols indicative of these black waters. In contrast, unique signatures in the Madeira/Jamari area were defined by presumably labile sulfur- and nitrogen-containing molecules in this white water river system. Waters from the Tapajós main stem did not show any substantial unique molecular signatures relative to those present in the Rio Madeira and Rio Negro, which implied a lower organic molecular complexity in this clear water tributary, even after mixing with the main stem of the Amazon River. Beside ubiquitous DOM at average H / C and O / C elemental ratios, a distinct and significant unique DOM pool prevailed in the black, white and clear water areas that were also highly correlated with EEM-PARAFAC components and define the frameworks for primary production and other aspects of aquatic life.
Genome-Wide Analysis Reveals the Unique Stem Cell Identity of Human Amniocytes

PubMed Central

Maguire, Colin T.; Demarest, Bradley L.; Hill, Jonathon T.; Palmer, James D.; Brothman, Arthur R.; Yost, H. Joseph; Condic, Maureen L.

2013-01-01

Human amniotic fluid contains cells that potentially have important stem cell characteristics, yet the programs controlling their developmental potency are unclear. Here, we provide evidence that amniocytes derived from multiple patients are marked by heterogeneity and variability in expression levels of pluripotency markers. Clonal analysis from multiple patients indicates that amniocytes have large pools of self-renewing cells that have an inherent property to give rise to a distinct amniocyte phenotype with a heterogeneity of pluripotent markers. Significant to their therapeutic potential, genome-wide profiles are distinct at different gestational ages and times in culture, but do not differ between genders. Based on hierarchical clustering and differential expression analyses of the entire transcriptome, amniocytes express canonical regulators associated with pluripotency and stem cell repression. Their profiles are distinct from human embryonic stem cells (ESCs), induced-pluripotent stem cells (iPSCs), and newborn foreskin fibroblasts. Amniocytes have a complex molecular signature, coexpressing trophoblastic, ectodermal, mesodermal, and endodermal cell-type-specific regulators. In contrast to the current view of the ground state of stem cells, ESCs and iPSCs also express high levels of a wide range of cell-type-specific regulators. The coexpression of multilineage differentiation markers combined with the strong expression of a subset of ES cell repressors in amniocytes suggests that these cells have a distinct phenotype that is unlike any other known cell-type or lineage. PMID:23326421
Signatures of cytoplasmic proteins in the exoproteome distinguish community- and hospital-associated methicillin-resistant Staphylococcus aureus USA300 lineages.

PubMed

Mekonnen, Solomon A; Palma Medina, Laura M; Glasner, Corinna; Tsompanidou, Eleni; de Jong, Anne; Grasso, Stefano; Schaffer, Marc; Mäder, Ulrike; Larsen, Anders R; Gumpert, Heidi; Westh, Henrik; Völker, Uwe; Otto, Andreas; Becher, Dörte; van Dijl, Jan Maarten

2017-08-18

Methicillin-resistant Staphylococcus aureus (MRSA) is the common name for a heterogeneous group of highly drug-resistant staphylococci. Two major MRSA classes are distinguished based on epidemiology, namely community-associated (CA) and hospital-associated (HA) MRSA. Notably, the distinction of CA- and HA-MRSA based on molecular traits remains difficult due to the high genomic plasticity of S. aureus. Here we sought to pinpoint global distinguishing features of CA- and HA-MRSA through a comparative genome and proteome analysis of the notorious MRSA lineage USA300. We show for the first time that CA- and HA-MRSA isolates can be distinguished by 2 distinct extracellular protein abundance clusters that are predictive not only for epidemiologic behavior, but also for their growth and survival within epithelial cells. This 'exoproteome profiling' also groups more distantly related HA-MRSA isolates into the HA exoproteome cluster. Comparative genome analysis suggests that these distinctive features of CA- and HA-MRSA isolates relate predominantly to the accessory genome. Intriguingly, the identified exoproteome clusters differ in the relative abundance of typical cytoplasmic proteins, suggesting that signatures of cytoplasmic proteins in the exoproteome represent a new distinguishing feature of CA- and HA-MRSA. Our comparative genome and proteome analysis focuses attention on potentially distinctive roles of 'liberated' cytoplasmic proteins in the epidemiology and intracellular survival of CA- and HA-MRSA isolates. Such extracellular cytoplasmic proteins were recently invoked in staphylococcal virulence, but their implication in the epidemiology of MRSA is unprecedented.
Detecting and Characterizing Genomic Signatures of Positive Selection in Global Populations

PubMed Central

Liu, Xuanyao; Ong, Rick Twee-Hee; Pillai, Esakimuthu Nisha; Elzein, Abier M.; Small, Kerrin S.; Clark, Taane G.; Kwiatkowski, Dominic P.; Teo, Yik-Ying

2013-01-01

Natural selection is a significant force that shapes the architecture of the human genome and introduces diversity across global populations. The question of whether advantageous mutations have arisen in the human genome as a result of single or multiple mutation events remains unanswered except for the fact that there exist a handful of genes such as those that confer lactase persistence, affect skin pigmentation, or cause sickle cell anemia. We have developed a long-range-haplotype method for identifying genomic signatures of positive selection to complement existing methods, such as the integrated haplotype score (iHS) or cross-population extended haplotype homozygosity (XP-EHH), for locating signals across the entire allele frequency spectrum. Our method also locates the founder haplotypes that carry the advantageous variants and infers their corresponding population frequencies. This presents an opportunity to systematically interrogate the whole human genome whether a selection signal shared across different populations is the consequence of a single mutation process followed subsequently by gene flow between populations or of convergent evolution due to the occurrence of multiple independent mutation events either at the same variant or within the same gene. The application of our method to data from 14 populations across the world revealed that positive-selection events tend to cluster in populations of the same ancestry. Comparing the founder haplotypes for events that are present across different populations revealed that convergent evolution is a rare occurrence and that the majority of shared signals stem from the same evolutionary event. PMID:23731540
Comprehensive benchmarking reveals H2BK20 acetylation as a distinctive signature of cell-state-specific enhancers and promoters.

PubMed

Kumar, Vibhor; Rayan, Nirmala Arul; Muratani, Masafumi; Lim, Stefan; Elanggovan, Bavani; Xin, Lixia; Lu, Tess; Makhija, Harshyaa; Poschmann, Jeremie; Lufkin, Thomas; Ng, Huck Hui; Prabhakar, Shyam

2016-05-01

Although over 35 different histone acetylation marks have been described, the overwhelming majority of regulatory genomics studies focus exclusively on H3K27ac and H3K9ac. In order to identify novel epigenomic traits of regulatory elements, we constructed a benchmark set of validated enhancers by performing 140 enhancer assays in human T cells. We tested 40 chromatin signatures on this unbiased enhancer set and identified H2BK20ac, a little-studied histone modification, as the most predictive mark of active enhancers. Notably, we detected a novel class of functionally distinct enhancers enriched in H2BK20ac but lacking H3K27ac, which was present in all examined cell lines and also in embryonic forebrain tissue. H2BK20ac was also unique in highlighting cell-type-specific promoters. In contrast, other acetylation marks were present in all active promoters, regardless of cell-type specificity. In stimulated microglial cells, H2BK20ac was more correlated with cell-state-specific expression changes than H3K27ac, with TGF-beta signaling decoupling the two acetylation marks at a subset of regulatory elements. In summary, our study reveals a previously unknown connection between histone acetylation and cell-type-specific gene regulation and indicates that H2BK20ac profiling can be used to uncover new dimensions of gene regulation. © 2016 Kumar et al.; Published by Cold Spring Harbor Laboratory Press.
Comprehensive benchmarking reveals H2BK20 acetylation as a distinctive signature of cell-state-specific enhancers and promoters

PubMed Central

Kumar, Vibhor; Rayan, Nirmala Arul; Muratani, Masafumi; Lim, Stefan; Elanggovan, Bavani; Xin, Lixia; Lu, Tess; Makhija, Harshyaa; Poschmann, Jeremie; Lufkin, Thomas; Ng, Huck Hui; Prabhakar, Shyam

2016-01-01

Although over 35 different histone acetylation marks have been described, the overwhelming majority of regulatory genomics studies focus exclusively on H3K27ac and H3K9ac. In order to identify novel epigenomic traits of regulatory elements, we constructed a benchmark set of validated enhancers by performing 140 enhancer assays in human T cells. We tested 40 chromatin signatures on this unbiased enhancer set and identified H2BK20ac, a little-studied histone modification, as the most predictive mark of active enhancers. Notably, we detected a novel class of functionally distinct enhancers enriched in H2BK20ac but lacking H3K27ac, which was present in all examined cell lines and also in embryonic forebrain tissue. H2BK20ac was also unique in highlighting cell-type-specific promoters. In contrast, other acetylation marks were present in all active promoters, regardless of cell-type specificity. In stimulated microglial cells, H2BK20ac was more correlated with cell-state-specific expression changes than H3K27ac, with TGF-beta signaling decoupling the two acetylation marks at a subset of regulatory elements. In summary, our study reveals a previously unknown connection between histone acetylation and cell-type-specific gene regulation and indicates that H2BK20ac profiling can be used to uncover new dimensions of gene regulation. PMID:26957309
Four-miRNA signature as a prognostic tool for lung adenocarcinoma.

PubMed

Lin, Yan; Lv, Yufeng; Liang, Rong; Yuan, Chunling; Zhang, Jinyan; He, Dan; Zheng, Xiaowen; Zhang, Jianfeng

2018-01-01

The aim of this study was to generate a novel miRNA expression signature to accurately predict prognosis for patients with lung adenocarcinoma (LUAD). Using expression profiles downloaded from The Cancer Genome Atlas database, we identified multiple miRNAs with differential expression between LUAD and paired healthy tissues. We then evaluated the prognostic values of the differentially expressed miRNAs using univariate/multivariate Cox regression analysis. This analysis was ultimately used to construct a four-miRNA signature that effectively predicted patient survival. Finally, we analyzed potential functional roles of the target genes for these four miRNAs using Gene Ontology and Kyoto Encyclopedia of Genes and Genomes pathway enrichment analyses. Based on our cutoff criteria ( P <0.05 and |log2FC| >1.0), we identified a total of 187 differentially expressed miRNAs, including 148 that were upregulated in LUAD tissues and 39 that were downregulated. Four miRNAs (miR-148a-5p, miR-31-5p, miR-548v, and miR-550a-5p) were independently associated with survival based on Kaplan-Meier analysis. We generated a signature index based on the expression of these four miRNAs and stratified patients into low- and high-risk groups. Patients in the high-risk group had significantly shorter survival times than those in the low-risk group ( P =0.002). A functional enrichment analysis suggested that the target genes of these four miRNAs were involved in protein phosphorylation and the Hippo and sphingolipid signaling pathways. Taken together, our results suggest that our four-miRNA signature can be used as a prognostic tool for patients with LUAD.
Genomic signature analysis of the recently emerged highly pathogenic A(H5N8) avian influenza virus: implying an evolutionary trend for bird-to-human transmission.

PubMed

Xu, Wei; Dai, Yanyan; Hua, Chen; Wang, Qian; Zou, Peng; Deng, Qiwen; Jiang, Shibo; Lu, Lu

2017-12-01

In early 2014, a novel subclade (2.3.4.4) of the highly pathogenic avian influenza (HPAI) A(H5N8) virus caused the first outbreak in domestic ducks and migratory birds in South Korea. Since then, it has spread to 44 countries and regions. To date, no human infections with A(H5N8) virus have been reported, but the possibility cannot be excluded. By analyzing the genomic signatures of A(H5N8) strains, we found that among the 47 species-associated signature positions, three positions exhibited human-like signatures (HLS), including PA-404S, PB2-613I and PB2-702R and that mutation trend of host signatures of avian A(H5N8) is different before and after 2014. About 82% of A(H5N8) isolates collected after January of 2014 carried the 3 HLS (PA-404S/PB2-613I/PB2-702R) in combination, while none of isolates collected before 2014 had this combination. Furthermore, the HA protein had S137A and S227R substitutions in the receptor-binding site and A160T in the glycosylation site, potentially increasing viral ability to bind human-type receptors. Based on these findings, the newly emerged HPAI A(H5N8) isolates show an evolutionary trend toward gaining more HLS and, along with it, the potential for bird-to-human transmissibility. Therefore, more extensive surveillance of this rapidly spreading HPAI A(H5N8) and preparedness against its potential pandemic are urgently needed. Copyright © 2017. Published by Elsevier Masson SAS.
Genome-wide analyses of HTLV-1aD strains from Cape Verde, Africa.

PubMed

Zanella, Louise; Pina-Araujo I, Isabel de; Morgado, Mariza G; Vicente, Ana Carolina

2016-09-01

We characterised and reported the first full-length genomes of Human T-cell Lymphotropic Virus Type 1 subgroup HTLV-1aD (CV21 and CV79). This subgroup is one of the major determinants of HTLV-1 infections in North and West Africa, and recombinant strains involving this subgroup have been recently demonstrated. The CV21 and CV79 strains from Cape Verde/Africa were characterised as pure HTLV-1aD genomes, comparative analyses including HTLV-1 subtypes and subgroups revealed HTLV-1aD signatures in the envelope, pol, and pX regions. These genomes provide original information that will contribute to further studies on HTLV-1a epidemiology and evolution.
Selection Signature Analysis Implicates the PC1/PCSK1 Region for Chicken Abdominal Fat Content

PubMed Central

Wang, Zhipeng; Zhang, Yuandan; Wang, Shouzhi; Wang, Ning; Ma, Li; Leng, Li; Wang, Shengwen; Wang, Qigui; Wang, Yuxiang; Tang, Zhiquan; Li, Ning; Da, Yang; Li, Hui

2012-01-01

We conducted a selection signature analysis using the chicken 60k SNP chip in two chicken lines that had been divergently selected for abdominal fat content (AFC) for 11 generations. The selection signature analysis used multiple signals of selection, including long-range allele frequency differences between the lean and fat lines, long-range heterozygosity changes, linkage disequilibrium, haplotype frequencies, and extended haplotype homozygosity. Multiple signals of selection identified ten signatures on chromosomes 1, 2, 4, 5, 11, 15, 20, 26 and Z. The 0.73 Mb PC1/PCSK1 region of the Z chromosome at 55.43-56.16 Mb was the most heavily selected region. This region had 26 SNP markers and seven genes, Mar-03, SLC12A2, FBN2, ERAP1, CAST, PC1/PCSK1 and ELL2, where PC1/PCSK1 are the chicken/human names for the same gene. The lean and fat lines had two main haplotypes with completely opposite SNP alleles for the 26 SNP markers and were virtually line-specific, and had a recombinant haplotype with nearly equal frequency (0.193 and 0.196) in both lines. Other haplotypes in this region had negligible frequencies. Nine other regions with selection signatures were PAH-IGF1, TRPC4, GJD4-CCNY, NDST4, NOVA1, GALNT9, the ESRP2-GALR1 region with five genes, the SYCP2-CADH4 with six genes, and the TULP1-KIF21B with 14 genes. Genome-wide association analysis showed that nearly all regions with evidence of selection signature had SNP effects with genome-wide significance (P<10–6) on abdominal fat weight and percentage. The results of this study provide specific gene targets for the control of chicken AFC and a potential model of AFC in human obesity. PMID:22792402
Selection signature analysis implicates the PC1/PCSK1 region for chicken abdominal fat content.

PubMed

Zhang, Hui; Hu, Xiaoxiang; Wang, Zhipeng; Zhang, Yuandan; Wang, Shouzhi; Wang, Ning; Ma, Li; Leng, Li; Wang, Shengwen; Wang, Qigui; Wang, Yuxiang; Tang, Zhiquan; Li, Ning; Da, Yang; Li, Hui

2012-01-01

We conducted a selection signature analysis using the chicken 60k SNP chip in two chicken lines that had been divergently selected for abdominal fat content (AFC) for 11 generations. The selection signature analysis used multiple signals of selection, including long-range allele frequency differences between the lean and fat lines, long-range heterozygosity changes, linkage disequilibrium, haplotype frequencies, and extended haplotype homozygosity. Multiple signals of selection identified ten signatures on chromosomes 1, 2, 4, 5, 11, 15, 20, 26 and Z. The 0.73 Mb PC1/PCSK1 region of the Z chromosome at 55.43-56.16 Mb was the most heavily selected region. This region had 26 SNP markers and seven genes, Mar-03, SLC12A2, FBN2, ERAP1, CAST, PC1/PCSK1 and ELL2, where PC1/PCSK1 are the chicken/human names for the same gene. The lean and fat lines had two main haplotypes with completely opposite SNP alleles for the 26 SNP markers and were virtually line-specific, and had a recombinant haplotype with nearly equal frequency (0.193 and 0.196) in both lines. Other haplotypes in this region had negligible frequencies. Nine other regions with selection signatures were PAH-IGF1, TRPC4, GJD4-CCNY, NDST4, NOVA1, GALNT9, the ESRP2-GALR1 region with five genes, the SYCP2-CADH4 with six genes, and the TULP1-KIF21B with 14 genes. Genome-wide association analysis showed that nearly all regions with evidence of selection signature had SNP effects with genome-wide significance (P<10(-6)) on abdominal fat weight and percentage. The results of this study provide specific gene targets for the control of chicken AFC and a potential model of AFC in human obesity.

DNA Methylation Signature of Childhood Chronic Physical Aggression in T Cells of Both Men and Women

PubMed Central

Guillemin, Claire; Provençal, Nadine; Suderman, Matthew; Côté, Sylvana M.; Vitaro, Frank; Hallett, Michael; Tremblay, Richard E.; Szyf, Moshe

2014-01-01

Background High frequency of physical aggression is the central feature of severe conduct disorder and is associated with a wide range of social, mental and physical health problems. We have previously tested the hypothesis that differential DNA methylation signatures in peripheral T cells are associated with a chronic aggression trajectory in males. Despite the fact that sex differences appear to play a pivotal role in determining the development, magnitude and frequency of aggression, most of previous studies focused on males, so little is known about female chronic physical aggression. We therefore tested here whether or not there is a signature of physical aggression in female DNA methylation and, if there is, how it relates to the signature observed in males. Methodology/Principal Findings Methylation profiles were created using the method of methylated DNA immunoprecipitation (MeDIP) followed by microarray hybridization and statistical and bioinformatic analyses on T cell DNA obtained from adult women who were found to be on a chronic physical aggression trajectory (CPA) between 6 and 12 years of age compared to women who followed a normal physical aggression trajectory. We confirmed the existence of a well-defined, genome-wide signature of DNA methylation associated with chronic physical aggression in the peripheral T cells of adult females that includes many of the genes similarly associated with physical aggression in the same cell types of adult males. Conclusions This study in a small number of women presents preliminary evidence for a genome-wide variation in promoter DNA methylation that associates with CPA in women that warrant larger studies for further verification. A significant proportion of these associations were previously observed in men with CPA supporting the hypothesis that the epigenetic signature of early life aggression in females is composed of a component specific to females and another common to both males and females. PMID:24475181
Detection of Low-Copy-Number Genomic DNA Sequences in Individual Bacterial Cells by Using Peptide Nucleic Acid-Assisted Rolling-Circle Amplification and Fluorescence In Situ Hybridization▿ †

PubMed Central

Smolina, Irina; Lee, Charles; Frank-Kamenetskii, Maxim

2007-01-01

An approach is proposed for in situ detection of short signature DNA sequences present in single copies per bacterial genome. The site is locally opened by peptide nucleic acids, and a circular oligonucleotide is assembled. The amplicon generated by rolling circle amplification is detected by hybridization with fluorescently labeled decorator probes. PMID:17293504
Genomic and Expression Profiling of Benign and Malignant Nerve Sheath Tumors in Neurofibromatosis Patients

DTIC Science & Technology

2008-05-01

DAMD17-03-1-0297 Title: Genomic and Expression Pr ofiling of Benign and Malignant Nerve Sheath Tumors in Neurofibromatosis Patients...have determined the gene expression signature for benign and malignant peripheral nerve sheath tumors and found that the major trend in transformation...However, EGFR data in soft tissue neoplasms is limited. Using a variety of benign and malignant spindle cell neoplasms, we assessed EGFR status by
Textural signatures for wetland vegetation

NASA Technical Reports Server (NTRS)

Whitman, R. I.; Marcellus, K. L.

1973-01-01

This investigation indicates that unique textural signatures do exist for specific wetland communities at certain times in the growing season. When photographs with the proper resolution are obtained, the textural features can identify the spectral features of the vegetation community seen with lower resolution mapping data. The development of a matrix of optimum textural signatures is the goal of this research. Seasonal variations of spectral and textural features are particularly important when performing a vegetations analysis of fresh water marshes. This matrix will aid in flight planning, since expected seasonal variations and resolution requirements can be established prior to a given flight mission.
Nitrogen isotopic signatures in the Acapulco meteorite

NASA Technical Reports Server (NTRS)

Sturgeon, G.; Marti, K.

1991-01-01

N isotopic abundances are reported for a bulk sample of the unique meteorite Acapulco. Although the mineral chemistry indicates a high degree of recrystallization under redox conditions between those of H and E chondrites (Palme et al., 1981), the presence of two distinct N isotopic signatures shows that the carriers of these N components were not equilibrated. In stepwise pyrolysis, the larger (65 percent) N component is released mostly below 1000 C and reveals a signature of delta(N-15) = 8.9 + or - 1.2 per mil, which is within the range observed in chondrites. A second 'light' component appears above 1000 C and has a signature of delta(N-15) less than or equal to -110.5 + or - 4.0 per mil (uncorrected for spallation N-15).
Use of signature-tagged mutagenesis to identify virulence determinants in Haemophilus ducreyi responsible for ulcer formation.

PubMed

Yeung, Angela; Cameron, D William; Desjardins, Marc; Lee, B Craig

2011-02-01

Elucidating the molecular mechanisms responsible for chancroid, a genital ulcer disease caused by Haemophilus ducreyi, has been hampered in part by the relative genetic intractability of the organism. A whole genome screen using signature-tagged mutagenesis in the temperature-dependent rabbit model (TDRM) of H. ducreyi infection uncovered 26 mutants with a presumptive attenuated phenotype. Insertions in two previously recognized virulence determinants, hgbA and lspA1, validated this genome scanning technique. Database interrogation allowed assignment of 24 mutants to several functional classes, including transport, metabolism, DNA repair, stress response and gene regulation. The attenuated virulence for a 3 strain with a mutation in hicB was confirmed by individual infection in the TDRM. The results from this preliminary study indicate that this high throughput strategy will further the understanding of the pathogenesis of H. ducreyi infection. Copyright © 2010 Elsevier B.V. All rights reserved.
A species-specific nucleosomal signature defines a periodic distribution of amino acids in proteins.

PubMed

Quintales, Luis; Soriano, Ignacio; Vázquez, Enrique; Segurado, Mónica; Antequera, Francisco

2015-04-01

Nucleosomes are the basic structural units of chromatin. Most of the yeast genome is organized in a pattern of positioned nucleosomes that is stably maintained under a wide range of physiological conditions. In this work, we have searched for sequence determinants associated with positioned nucleosomes in four species of fission and budding yeasts. We show that mononucleosomal DNA follows a highly structured base composition pattern, which differs among species despite the high degree of histone conservation. These nucleosomal signatures are present in transcribed and non-transcribed regions across the genome. In the case of open reading frames, they correctly predict the relative distribution of codons on mononucleosomal DNA, and they also determine a periodicity in the average distribution of amino acids along the proteins. These results establish a direct and species-specific connection between the position of each codon around the histone octamer and protein composition.
Novel Insights into Tree Biology and Genome Evolution as Revealed Through Genomics.

PubMed

Neale, David B; Martínez-García, Pedro J; De La Torre, Amanda R; Montanari, Sara; Wei, Xiao-Xin

2017-04-28

Reference genome sequences are the key to the discovery of genes and gene families that determine traits of interest. Recent progress in sequencing technologies has enabled a rapid increase in genome sequencing of tree species, allowing the dissection of complex characters of economic importance, such as fruit and wood quality and resistance to biotic and abiotic stresses. Although the number of reference genome sequences for trees lags behind those for other plant species, it is not too early to gain insight into the unique features that distinguish trees from nontree plants. Our review of the published data suggests that, although many gene families are conserved among herbaceous and tree species, some gene families, such as those involved in resistance to biotic and abiotic stresses and in the synthesis and transport of sugars, are often expanded in tree genomes. As the genomes of more tree species are sequenced, comparative genomics will further elucidate the complexity of tree genomes and how this relates to traits unique to trees.
Mosaic Graphs and Comparative Genomics in Phage Communities

PubMed Central

Belcaid, Mahdi; Bergeron, Anne

2010-01-01

Abstract Comparing the genomes of two closely related viruses often produces mosaics where nearly identical sequences alternate with sequences that are unique to each genome. When several closely related genomes are compared, the unique sequences are likely to be shared with third genomes, leading to virus mosaic communities. Here we present comparative analysis of sets of Staphylococcus aureus phages that share large identical sequences with up to three other genomes, and with different partners along their genomes. We introduce mosaic graphs to represent these complex recombination events, and use them to illustrate the breath and depth of sequence sharing: some genomes are almost completely made up of shared sequences, while genomes that share very large identical sequences can adopt alternate functional modules. Mosaic graphs also allow us to identify breakpoints that could eventually be used for the construction of recombination networks. These findings have several implications on phage metagenomics assembly, on the horizontal gene transfer paradigm, and more generally on the understanding of the composition and evolutionary dynamics of virus communities. PMID:20874413
Use of locally weighted scatterplot smoothing (LOWESS) regression to study selection signatures in Piedmontese and Italian Brown cattle breeds.

PubMed

Pintus, Elia; Sorbolini, Silvia; Albera, Andrea; Gaspa, Giustino; Dimauro, Corrado; Steri, Roberto; Marras, Gabriele; Macciotta, Nicolò P P

2014-02-01

Selection is the major force affecting local levels of genetic variation in species. The availability of dense marker maps offers new opportunities for a detailed understanding of genetic diversity distribution across the animal genome. Over the last 50 years, cattle breeds have been subjected to intense artificial selection. Consequently, regions controlling traits of economic importance are expected to exhibit selection signatures. The fixation index (Fst ) is an estimate of population differentiation, based on genetic polymorphism data, and it is calculated using the relationship between inbreeding and heterozygosity. In the present study, locally weighted scatterplot smoothing (LOWESS) regression and a control chart approach were used to investigate selection signatures in two cattle breeds with different production aptitudes (dairy and beef). Fst was calculated for 42 514 SNP marker loci distributed across the genome in 749 Italian Brown and 364 Piedmontese bulls. The statistical significance of Fst values was assessed using a control chart. The LOWESS technique was efficient in removing noise from the raw data and was able to highlight selection signatures in chromosomes known to harbour genes affecting dairy and beef traits. Examples include the peaks detected for BTA2 in the region where the myostatin gene is located and for BTA6 in the region harbouring the ABCG2 locus. Moreover, several loci not previously reported in cattle studies were detected. © 2013 The Authors, Animal Genetics © 2013 Stichting International Foundation for Animal Genetics.
A novel multi-network approach reveals tissue-specific cellular modulators of fibrosis in systemic sclerosis.

PubMed

Taroni, Jaclyn N; Greene, Casey S; Martyanov, Viktor; Wood, Tammara A; Christmann, Romy B; Farber, Harrison W; Lafyatis, Robert A; Denton, Christopher P; Hinchcliff, Monique E; Pioli, Patricia A; Mahoney, J Matthew; Whitfield, Michael L

2017-03-23

Systemic sclerosis (SSc) is a multi-organ autoimmune disease characterized by skin fibrosis. Internal organ involvement is heterogeneous. It is unknown whether disease mechanisms are common across all involved affected tissues or if each manifestation has a distinct underlying pathology. We used consensus clustering to compare gene expression profiles of biopsies from four SSc-affected tissues (skin, lung, esophagus, and peripheral blood) from patients with SSc, and the related conditions pulmonary fibrosis (PF) and pulmonary arterial hypertension, and derived a consensus disease-associate signature across all tissues. We used this signature to query tissue-specific functional genomic networks. We performed novel network analyses to contrast the skin and lung microenvironments and to assess the functional role of the inflammatory and fibrotic genes in each organ. Lastly, we tested the expression of macrophage activation state-associated gene sets for enrichment in skin and lung using a Wilcoxon rank sum test. We identified a common pathogenic gene expression signature-an immune-fibrotic axis-indicative of pro-fibrotic macrophages (MØs) in multiple tissues (skin, lung, esophagus, and peripheral blood mononuclear cells) affected by SSc. While the co-expression of these genes is common to all tissues, the functional consequences of this upregulation differ by organ. We used this disease-associated signature to query tissue-specific functional genomic networks to identify common and tissue-specific pathologies of SSc and related conditions. In contrast to skin, in the lung-specific functional network we identify a distinct lung-resident MØ signature associated with lipid stimulation and alternative activation. In keeping with our network results, we find distinct MØ alternative activation transcriptional programs in SSc-associated PF lung and in the skin of patients with an "inflammatory" SSc gene expression signature. Our results suggest that the innate immune system is central to SSc disease processes but that subtle distinctions exist between tissues. Our approach provides a framework for examining molecular signatures of disease in fibrosis and autoimmune diseases and for leveraging publicly available data to understand common and tissue-specific disease processes in complex human diseases.
Genomic analyses provide insights into the history of tomato breeding.

PubMed

Lin, Tao; Zhu, Guangtao; Zhang, Junhong; Xu, Xiangyang; Yu, Qinghui; Zheng, Zheng; Zhang, Zhonghua; Lun, Yaoyao; Li, Shuai; Wang, Xiaoxuan; Huang, Zejun; Li, Junming; Zhang, Chunzhi; Wang, Taotao; Zhang, Yuyang; Wang, Aoxue; Zhang, Yancong; Lin, Kui; Li, Chuanyou; Xiong, Guosheng; Xue, Yongbiao; Mazzucato, Andrea; Causse, Mathilde; Fei, Zhangjun; Giovannoni, James J; Chetelat, Roger T; Zamir, Dani; Städler, Thomas; Li, Jingfu; Ye, Zhibiao; Du, Yongchen; Huang, Sanwen

2014-11-01

The histories of crop domestication and breeding are recorded in genomes. Although tomato is a model species for plant biology and breeding, the nature of human selection that altered its genome remains largely unknown. Here we report a comprehensive analysis of tomato evolution based on the genome sequences of 360 accessions. We provide evidence that domestication and improvement focused on two independent sets of quantitative trait loci (QTLs), resulting in modern tomato fruit ∼100 times larger than its ancestor. Furthermore, we discovered a major genomic signature for modern processing tomatoes, identified the causative variants that confer pink fruit color and precisely visualized the linkage drag associated with wild introgressions. This study outlines the accomplishments as well as the costs of historical selection and provides molecular insights toward further improvement.
GENOMIC ORGANIZATION OF THE SP22 GENE AND A UNIQUE PATTERN OF EXPRESSION IN SPERMATOGENIC CELLS

EPA Science Inventory

GENOMIC ORGANIZATION OF THE SP22 GENE AND A UNIQUE PATTERN OF EXPRESSION IN SPERMATOGENIC CELLS.
JE Welch*, RR Barbee*, JD Suarez*, NL Roberts*, and GR Klinefelter. Reproductive Toxicology Division, NHEERL, U.S. EPA, Research Triangle Park, NC, USA.
Our laboratory has rep...
Genome fluctuations in cyanobacteria reflect evolutionary, developmental and adaptive traits.

PubMed

Larsson, John; Nylander, Johan Aa; Bergman, Birgitta

2011-06-30

Cyanobacteria belong to an ancient group of photosynthetic prokaryotes with pronounced variations in their cellular differentiation strategies, physiological capacities and choice of habitat. Sequencing efforts have shown that genomes within this phylum are equally diverse in terms of size and protein-coding capacity. To increase our understanding of genomic changes in the lineage, the genomes of 58 contemporary cyanobacteria were analysed for shared and unique orthologs. A total of 404 protein families, present in all cyanobacterial genomes, were identified. Two of these are unique to the phylum, corresponding to an AbrB family transcriptional regulator and a gene that escapes functional annotation although its genomic neighbourhood is conserved among the organisms examined. The evolution of cyanobacterial genome sizes involves a mix of gains and losses in the clade encompassing complex cyanobacteria, while a single event of reduction is evident in a clade dominated by unicellular cyanobacteria. Genome sizes and gene family copy numbers evolve at a higher rate in the former clade, and multi-copy genes were predominant in large genomes. Orthologs unique to cyanobacteria exhibiting specific characteristics, such as filament formation, heterocyst differentiation, diazotrophy and symbiotic competence, were also identified. An ancestral character reconstruction suggests that the most recent common ancestor of cyanobacteria had a genome size of approx. 4.5 Mbp and 1678 to 3291 protein-coding genes, 4%-6% of which are unique to cyanobacteria today. The different rates of genome-size evolution and multi-copy gene abundance suggest two routes of genome development in the history of cyanobacteria. The expansion strategy is driven by gene-family enlargment and generates a broad adaptive potential; while the genome streamlining strategy imposes adaptations to highly specific niches, also reflected in their different functional capacities. A few genomes display extreme proliferation of non-coding nucleotides which is likely to be the result of initial expansion of genomes/gene copy number to gain adaptive potential, followed by a shift to a life-style in a highly specific niche (e.g. symbiosis). This transition results in redundancy of genes and gene families, leading to an increase in junk DNA and eventually to gene loss. A few orthologs can be correlated with specific phenotypes in cyanobacteria, such as filament formation and symbiotic competence; these constitute exciting exploratory targets.
Genome fluctuations in cyanobacteria reflect evolutionary, developmental and adaptive traits

PubMed Central

2011-01-01

Background Cyanobacteria belong to an ancient group of photosynthetic prokaryotes with pronounced variations in their cellular differentiation strategies, physiological capacities and choice of habitat. Sequencing efforts have shown that genomes within this phylum are equally diverse in terms of size and protein-coding capacity. To increase our understanding of genomic changes in the lineage, the genomes of 58 contemporary cyanobacteria were analysed for shared and unique orthologs. Results A total of 404 protein families, present in all cyanobacterial genomes, were identified. Two of these are unique to the phylum, corresponding to an AbrB family transcriptional regulator and a gene that escapes functional annotation although its genomic neighbourhood is conserved among the organisms examined. The evolution of cyanobacterial genome sizes involves a mix of gains and losses in the clade encompassing complex cyanobacteria, while a single event of reduction is evident in a clade dominated by unicellular cyanobacteria. Genome sizes and gene family copy numbers evolve at a higher rate in the former clade, and multi-copy genes were predominant in large genomes. Orthologs unique to cyanobacteria exhibiting specific characteristics, such as filament formation, heterocyst differentiation, diazotrophy and symbiotic competence, were also identified. An ancestral character reconstruction suggests that the most recent common ancestor of cyanobacteria had a genome size of approx. 4.5 Mbp and 1678 to 3291 protein-coding genes, 4%-6% of which are unique to cyanobacteria today. Conclusions The different rates of genome-size evolution and multi-copy gene abundance suggest two routes of genome development in the history of cyanobacteria. The expansion strategy is driven by gene-family enlargment and generates a broad adaptive potential; while the genome streamlining strategy imposes adaptations to highly specific niches, also reflected in their different functional capacities. A few genomes display extreme proliferation of non-coding nucleotides which is likely to be the result of initial expansion of genomes/gene copy number to gain adaptive potential, followed by a shift to a life-style in a highly specific niche (e.g. symbiosis). This transition results in redundancy of genes and gene families, leading to an increase in junk DNA and eventually to gene loss. A few orthologs can be correlated with specific phenotypes in cyanobacteria, such as filament formation and symbiotic competence; these constitute exciting exploratory targets. PMID:21718514
Mitochondrial DNA as a non-invasive biomarker: Accurate quantification using real time quantitative PCR without co-amplification of pseudogenes and dilution bias

DOE Office of Scientific and Technical Information (OSTI.GOV)

Malik, Afshan N., E-mail: afshan.malik@kcl.ac.uk; Shahni, Rojeen; Rodriguez-de-Ledesma, Ana

2011-08-19

Highlights: {yields} Mitochondrial dysfunction is central to many diseases of oxidative stress. {yields} 95% of the mitochondrial genome is duplicated in the nuclear genome. {yields} Dilution of untreated genomic DNA leads to dilution bias. {yields} Unique primers and template pretreatment are needed to accurately measure mitochondrial DNA content. -- Abstract: Circulating mitochondrial DNA (MtDNA) is a potential non-invasive biomarker of cellular mitochondrial dysfunction, the latter known to be central to a wide range of human diseases. Changes in MtDNA are usually determined by quantification of MtDNA relative to nuclear DNA (Mt/N) using real time quantitative PCR. We propose that themore » methodology for measuring Mt/N needs to be improved and we have identified that current methods have at least one of the following three problems: (1) As much of the mitochondrial genome is duplicated in the nuclear genome, many commonly used MtDNA primers co-amplify homologous pseudogenes found in the nuclear genome; (2) use of regions from genes such as {beta}-actin and 18S rRNA which are repetitive and/or highly variable for qPCR of the nuclear genome leads to errors; and (3) the size difference of mitochondrial and nuclear genomes cause a 'dilution bias' when template DNA is diluted. We describe a PCR-based method using unique regions in the human mitochondrial genome not duplicated in the nuclear genome; unique single copy region in the nuclear genome and template treatment to remove dilution bias, to accurately quantify MtDNA from human samples.« less
The genome of Eucalyptus grandis.

PubMed

Myburg, Alexander A; Grattapaglia, Dario; Tuskan, Gerald A; Hellsten, Uffe; Hayes, Richard D; Grimwood, Jane; Jenkins, Jerry; Lindquist, Erika; Tice, Hope; Bauer, Diane; Goodstein, David M; Dubchak, Inna; Poliakov, Alexandre; Mizrachi, Eshchar; Kullan, Anand R K; Hussey, Steven G; Pinard, Desre; van der Merwe, Karen; Singh, Pooja; van Jaarsveld, Ida; Silva-Junior, Orzenil B; Togawa, Roberto C; Pappas, Marilia R; Faria, Danielle A; Sansaloni, Carolina P; Petroli, Cesar D; Yang, Xiaohan; Ranjan, Priya; Tschaplinski, Timothy J; Ye, Chu-Yu; Li, Ting; Sterck, Lieven; Vanneste, Kevin; Murat, Florent; Soler, Marçal; Clemente, Hélène San; Saidi, Naijib; Cassan-Wang, Hua; Dunand, Christophe; Hefer, Charles A; Bornberg-Bauer, Erich; Kersting, Anna R; Vining, Kelly; Amarasinghe, Vindhya; Ranik, Martin; Naithani, Sushma; Elser, Justin; Boyd, Alexander E; Liston, Aaron; Spatafora, Joseph W; Dharmwardhana, Palitha; Raja, Rajani; Sullivan, Christopher; Romanel, Elisson; Alves-Ferreira, Marcio; Külheim, Carsten; Foley, William; Carocha, Victor; Paiva, Jorge; Kudrna, David; Brommonschenkel, Sergio H; Pasquali, Giancarlo; Byrne, Margaret; Rigault, Philippe; Tibbits, Josquin; Spokevicius, Antanas; Jones, Rebecca C; Steane, Dorothy A; Vaillancourt, René E; Potts, Brad M; Joubert, Fourie; Barry, Kerrie; Pappas, Georgios J; Strauss, Steven H; Jaiswal, Pankaj; Grima-Pettenati, Jacqueline; Salse, Jérôme; Van de Peer, Yves; Rokhsar, Daniel S; Schmutz, Jeremy

2014-06-19

Eucalypts are the world's most widely planted hardwood trees. Their outstanding diversity, adaptability and growth have made them a global renewable resource of fibre and energy. We sequenced and assembled >94% of the 640-megabase genome of Eucalyptus grandis. Of 36,376 predicted protein-coding genes, 34% occur in tandem duplications, the largest proportion thus far in plant genomes. Eucalyptus also shows the highest diversity of genes for specialized metabolites such as terpenes that act as chemical defence and provide unique pharmaceutical oils. Genome sequencing of the E. grandis sister species E. globulus and a set of inbred E. grandis tree genomes reveals dynamic genome evolution and hotspots of inbreeding depression. The E. grandis genome is the first reference for the eudicot order Myrtales and is placed here sister to the eurosids. This resource expands our understanding of the unique biology of large woody perennials and provides a powerful tool to accelerate comparative biology, breeding and biotechnology.
Application of site and haplotype-frequency based approaches for detecting selection signatures in cattle

PubMed Central

2011-01-01

Background 'Selection signatures' delimit regions of the genome that are, or have been, functionally important and have therefore been under either natural or artificial selection. In this study, two different and complementary methods--integrated Haplotype Homozygosity Score (|iHS|) and population differentiation index (FST)--were applied to identify traces of decades of intensive artificial selection for traits of economic importance in modern cattle. Results We scanned the genome of a diverse set of dairy and beef breeds from Germany, Canada and Australia genotyped with a 50 K SNP panel. Across breeds, a total of 109 extreme |iHS| values exceeded the empirical threshold level of 5% with 19, 27, 9, 10 and 17 outliers in Holstein, Brown Swiss, Australian Angus, Hereford and Simmental, respectively. Annotating the regions harboring clustered |iHS| signals revealed a panel of interesting candidate genes like SPATA17, MGAT1, PGRMC2 and ACTC1, COL23A1, MATN2, respectively, in the context of reproduction and muscle formation. In a further step, a new Bayesian FST-based approach was applied with a set of geographically separated populations including Holstein, Brown Swiss, Simmental, North American Angus and Piedmontese for detecting differentiated loci. In total, 127 regions exceeding the 2.5 per cent threshold of the empirical posterior distribution were identified as extremely differentiated. In a substantial number (56 out of 127 cases) the extreme FST values were found to be positioned in poor gene content regions which deviated significantly (p < 0.05) from the expectation assuming a random distribution. However, significant FST values were found in regions of some relevant genes such as SMCP and FGF1. Conclusions Overall, 236 regions putatively subject to recent positive selection in the cattle genome were detected. Both |iHS| and FST suggested selection in the vicinity of the Sialic acid binding Ig-like lectin 5 gene on BTA18. This region was recently reported to be a major QTL with strong effects on productive life and fertility traits in Holstein cattle. We conclude that high-resolution genome scans of selection signatures can be used to identify genomic regions contributing to within- and inter-breed phenotypic variation. PMID:21679429
A high resolution atlas of gene expression in the domestic sheep (Ovis aries)

PubMed Central

Farquhar, Iseabail L.; Young, Rachel; Lefevre, Lucas; Pridans, Clare; Tsang, Hiu G.; Afrasiabi, Cyrus; Watson, Mick; Whitelaw, C. Bruce; Freeman, Tom C.; Archibald, Alan L.; Hume, David A.

2017-01-01

Sheep are a key source of meat, milk and fibre for the global livestock sector, and an important biomedical model. Global analysis of gene expression across multiple tissues has aided genome annotation and supported functional annotation of mammalian genes. We present a large-scale RNA-Seq dataset representing all the major organ systems from adult sheep and from several juvenile, neonatal and prenatal developmental time points. The Ovis aries reference genome (Oar v3.1) includes 27,504 genes (20,921 protein coding), of which 25,350 (19,921 protein coding) had detectable expression in at least one tissue in the sheep gene expression atlas dataset. Network-based cluster analysis of this dataset grouped genes according to their expression pattern. The principle of ‘guilt by association’ was used to infer the function of uncharacterised genes from their co-expression with genes of known function. We describe the overall transcriptional signatures present in the sheep gene expression atlas and assign those signatures, where possible, to specific cell populations or pathways. The findings are related to innate immunity by focusing on clusters with an immune signature, and to the advantages of cross-breeding by examining the patterns of genes exhibiting the greatest expression differences between purebred and crossbred animals. This high-resolution gene expression atlas for sheep is, to our knowledge, the largest transcriptomic dataset from any livestock species to date. It provides a resource to improve the annotation of the current reference genome for sheep, presenting a model transcriptome for ruminants and insight into gene, cell and tissue function at multiple developmental stages. PMID:28915238
A high resolution atlas of gene expression in the domestic sheep (Ovis aries).

PubMed

Clark, Emily L; Bush, Stephen J; McCulloch, Mary E B; Farquhar, Iseabail L; Young, Rachel; Lefevre, Lucas; Pridans, Clare; Tsang, Hiu G; Wu, Chunlei; Afrasiabi, Cyrus; Watson, Mick; Whitelaw, C Bruce; Freeman, Tom C; Summers, Kim M; Archibald, Alan L; Hume, David A

2017-09-01

Sheep are a key source of meat, milk and fibre for the global livestock sector, and an important biomedical model. Global analysis of gene expression across multiple tissues has aided genome annotation and supported functional annotation of mammalian genes. We present a large-scale RNA-Seq dataset representing all the major organ systems from adult sheep and from several juvenile, neonatal and prenatal developmental time points. The Ovis aries reference genome (Oar v3.1) includes 27,504 genes (20,921 protein coding), of which 25,350 (19,921 protein coding) had detectable expression in at least one tissue in the sheep gene expression atlas dataset. Network-based cluster analysis of this dataset grouped genes according to their expression pattern. The principle of 'guilt by association' was used to infer the function of uncharacterised genes from their co-expression with genes of known function. We describe the overall transcriptional signatures present in the sheep gene expression atlas and assign those signatures, where possible, to specific cell populations or pathways. The findings are related to innate immunity by focusing on clusters with an immune signature, and to the advantages of cross-breeding by examining the patterns of genes exhibiting the greatest expression differences between purebred and crossbred animals. This high-resolution gene expression atlas for sheep is, to our knowledge, the largest transcriptomic dataset from any livestock species to date. It provides a resource to improve the annotation of the current reference genome for sheep, presenting a model transcriptome for ruminants and insight into gene, cell and tissue function at multiple developmental stages.

Massive gene acquisitions in Mycobacterium indicus pranii provide a perspective on mycobacterial evolution

PubMed Central

Saini, Vikram; Raghuvanshi, Saurabh; Khurana, Jitendra P.; Ahmed, Niyaz; Hasnain, Seyed E.; Tyagi, Akhilesh K.; Tyagi, Anil K.

2012-01-01

Understanding the evolutionary and genomic mechanisms responsible for turning the soil-derived saprophytic mycobacteria into lethal intracellular pathogens is a critical step towards the development of strategies for the control of mycobacterial diseases. In this context, Mycobacterium indicus pranii (MIP) is of specific interest because of its unique immunological and evolutionary significance. Evolutionarily, it is the progenitor of opportunistic pathogens belonging to M. avium complex and is endowed with features that place it between saprophytic and pathogenic species. Herein, we have sequenced the complete MIP genome to understand its unique life style, basis of immunomodulation and habitat diversification in mycobacteria. As a case of massive gene acquisitions, 50.5% of MIP open reading frames (ORFs) are laterally acquired. We show, for the first time for Mycobacterium, that MIP genome has mosaic architecture. These gene acquisitions have led to the enrichment of selected gene families critical to MIP physiology. Comparative genomic analysis indicates a higher antigenic potential of MIP imparting it a unique ability for immunomodulation. Besides, it also suggests an important role of genomic fluidity in habitat diversification within mycobacteria and provides a unique view of evolutionary divergence and putative bottlenecks that might have eventually led to intracellular survival and pathogenic attributes in mycobacteria. PMID:22965120
Signatures of co-evolutionary host-pathogen interactions in the genome of the entomopathogenic nematode Steinernema carpocapsae.

PubMed

Flores-Ponce, Mitzi; Vallebueno-Estrada, Miguel; González-Orozco, Eduardo; Ramos-Aboites, Hilda E; García-Chávez, J Noé; Simões, Nelson; Montiel, Rafael

2017-04-26

The entomopathogenic nematode Steinernema carpocapsae has been used worldwide as a biocontrol agent for insect pests, making it an interesting model for understanding parasite-host interactions. Two models propose that these interactions are co-evolutionary processes in such a way that equilibrium is never reached. In one model, known as "arms race", new alleles in relevant genes are fixed in both host and pathogens by directional positive selection, producing recurrent and alternating selective sweeps. In the other model, known as"trench warfare", persistent dynamic fluctuations in allele frequencies are sustained by balancing selection. There are some examples of genes evolving according to both models, however, it is not clear to what extent these interactions might alter genome-level evolutionary patterns and intraspecific diversity. Here we investigate some of these aspects by studying genomic variation in S. carpocapsae and other pathogenic and free-living nematodes from phylogenetic clades IV and V. To look for signatures of an arms-race dynamic, we conducted massive scans to detect directional positive selection in interspecific data. In free-living nematodes, we detected a significantly higher proportion of genes with sites under positive selection than in parasitic nematodes. However, in these genes, we found more enriched Gene Ontology terms in parasites. To detect possible effects of dynamic polymorphisms interactions we looked for signatures of balancing selection in intraspecific genomic data. The observed distribution of Tajima's D values in S. carpocapsae was more skewed to positive values and significantly different from the observed distribution in the free-living Caenorhabditis briggsae. Also, the proportion of significant positive values of Tajima's D was elevated in genes that were differentially expressed after induction with insect tissues as compared to both non-differentially expressed genes and the global scan. Our study provides a first portrait of the effects that lifestyle might have in shaping the patterns of selection at the genomic level. An arms-race between hosts and pathogens seems to be affecting specific genetic functions but not necessarily increasing the number of positively selected genes. Trench warfare dynamics seem to be acting more generally in the genome, likely focusing on genes responding to the interaction, rather than targeting specific genetic functions.
Population structure and genomic inbreeding in nine Swiss dairy cattle populations.

PubMed

Signer-Hasler, Heidi; Burren, Alexander; Neuditschko, Markus; Frischknecht, Mirjam; Garrick, Dorian; Stricker, Christian; Gredler, Birgit; Bapst, Beat; Flury, Christine

2017-11-07

Domestication, breed formation and intensive selection have resulted in divergent cattle breeds that likely exhibit their own genomic signatures. In this study, we used genotypes from 27,612 autosomal single nucleotide polymorphisms to characterize population structure based on 9214 sires representing nine Swiss dairy cattle populations: Brown Swiss (BS), Braunvieh (BV), Original Braunvieh (OB), Holstein (HO), Red Holstein (RH), Swiss Fleckvieh (SF), Simmental (SI), Eringer (ER) and Evolèner (EV). Genomic inbreeding (F ROH ) and signatures of selection were determined by calculating runs of homozygosity (ROH). The results build the basis for a better understanding of the genetic development of Swiss dairy cattle populations and highlight differences between the original populations (i.e. OB, SI, ER and EV) and those that have become more popular in Switzerland as currently reflected by their larger populations (i.e. BS, BV, HO, RH and SF). The levels of genetic diversity were highest and lowest in the SF and BS breeds, respectively. Based on F ST values, we conclude that, among all pairwise comparisons, BS and HO (0.156) differ more than the other pairs of populations. The original Swiss cattle populations OB, SI, ER, and EV are clearly genetically separated from the Swiss cattle populations that are now more common and represented by larger numbers of cows. Mean levels of F ROH ranged from 0.027 (ER) to 0.091 (BS). Three of the original Swiss cattle populations, ER (F ROH : 0.027), OB (F ROH : 0.029), and SI (F ROH : 0.039), showed low levels of genomic inbreeding, whereas it was much higher in EV (F ROH : 0.074). Private signatures of selection for the original Swiss cattle populations are reported for BTA4, 5, 11 and 26. The low levels of genomic inbreeding observed in the original Swiss cattle populations ER, OB and SI compared to the other breeds are explained by a lesser use of artificial insemination and greater use of natural service. Natural service results in more sires having progeny at each generation and thus this breeding practice is likely the major reason for the remarkable levels of genetic diversity retained within these populations. The fact that the EV population is regionally restricted and its small census size of herd-book cows explain its high level of genomic inbreeding.
Texture analysis of radiometric signatures of new sea ice forming in Arctic leads

NASA Technical Reports Server (NTRS)

Eppler, Duane T.; Farmer, L. Dennis

1991-01-01

Analysis of 33.6-GHz, high-resolution, passive microwave images suggests that new sea ice accumulating in open leads is characterized by a unique textural signature which can be used to discriminate new ice forming in this environment from adjacent surfaces of similar radiometric temperature. Ten training areas were selected from the data set, three of which consisted entirely of first-year ice, four entirely of multilayer ice, and three of new ice in open leads in the process of freezing. A simple gradient operator was used to characterize the radiometric texture in each training region in terms of the degree to which radiometric gradients are oriented. New ice in leads has a sufficiently high proportion of well-oriented features to distinguish it uniquely from first-year ice and multiyear ice. The predominance of well-oriented features probably reflects physical processes by which new ice accumulates in open leads. Banded structures, which are evident in aerial photographs of new ice, apparently give rise to the radiometric signature observed, in which the trend of brightness temperature gradients is aligned parallel to lead trends. First-year ice and multiyear ice, which have been subjected to a more random growth and process history, lack this banded structure and therefore are characterized by signatures in which well-aligned elements are less dominant.
Used Fuel Cask Identification through Neutron Profile

DOE Office of Scientific and Technical Information (OSTI.GOV)

Rauch, Eric Benton

2015-11-20

Currently, most spent fuel is stored near reactors. An interim consolidated fuel storage facility would receive fuel from multiple sites and store it in casks on site for decades. For successful operation of such a facility there is need for a way to restore continuity of knowledge if lost as well as a method that will indicate state of fuel inside the cask. Used nuclear fuel is identifiable by its radiation emission, both gamma and neutron. Neutron emission from fission products, multiplication from remaining fissile material, and the unique distribution of both in each cask produce a unique neutron signature.more » If two signatures taken at different times do not match, either changes within the fuel content or misidentification of a cask occurred. It was found that identification of cask loadings works well through the profile of emitted neutrons in simulated real casks. Even casks with similar overall neutron emission or average counts around the circumference can be distinguished from each other by analyzing the profile. In conclusion, (1) identification of unaltered casks through neutron signature profile is viable; (2) collecting the profile provides insight to the condition and intactness of the fuel stored inside the cask; and (3) the signature profile is stable over time.« less
Resonant ultrasound spectroscopy

DOEpatents

Migliori, Albert

1991-01-01

A resonant ultrasound spectroscopy method provides a unique characterization of an object for use in distinguishing similar objects having physical differences greater than a predetermined tolerance. A resonant response spectrum is obtained for a reference object by placing excitation and detection transducers at any accessible location on the object. The spectrum is analyzed to determine the number of resonant response peaks in a predetermined frequency interval. The distribution of the resonance frequencies is then characterized in a manner effective to form a unique signature of the object. In one characterization, a small frequency interval is defined and stepped though the spectrum frequency range. Subsequent objects are similarly characterized where the characterizations serve as signatures effective to distinguish objects that differ from the reference object by more than the predetermined tolerance.
Proposal for chiral-boson search at LHC via their unique new signature

DOE Office of Scientific and Technical Information (OSTI.GOV)

Chizhov, M. V.; Bednyakov, V. A.; Budagov, J. A.

The resonance production of new chiral spin-1 bosons and their detection through the Drell-Yan process at the CERN LHC is considered. Quantitative evaluations of various differential cross sections of the chiral-boson production are made within the CalcHEP package. The new neutral chiral bosons can be observed as a Breit-Wigner resonance peak in the invariant-dilepton-mass distribution, as usual. However, unique new signatures of the chiral bosons exist. First, there is no Jacobian peak in the lepton transverse-momentum distribution. Second, the lepton angular distribution in the Collins-Soper frame for the high on-peak invariant masses of the lepton pairs has a peculiar 'swallowtail'more » shape.« less
Gene-expression signatures of Atlantic salmon's plastic life cycle.

PubMed

Aubin-Horth, Nadia; Letcher, Benjamin H; Hofmann, Hans A

2009-09-15

How genomic expression differs as a function of life history variation is largely unknown. Atlantic salmon exhibits extreme alternative life histories. We defined the gene-expression signatures of wild-caught salmon at two different life stages by comparing the brain expression profiles of mature sneaker males and immature males, and early migrants and late migrants. In addition to life-stage-specific signatures, we discovered a surprisingly large gene set that was differentially regulated-at similar magnitudes, yet in opposite direction-in both life history transitions. We suggest that this co-variation is not a consequence of many independent cellular and molecular switches in the same direction but rather represents the molecular equivalent of a physiological shift orchestrated by one or very few master regulators.
Genomic signatures of evolutionary transitions from solitary to group living

USDA-ARS?s Scientific Manuscript database

Eusociality has evolved rarely, but repeatedly, in vertebrates and invertebrates, and resulted inconvergent morphological, physiological, and behavioural innovations. It is unknown whether similar evolutionary processes are responsible for the repeated origins and further elaborations of eusociality...
Explaining human uniqueness: genome interactions with environment, behaviour and culture.

PubMed

Varki, Ajit; Geschwind, Daniel H; Eichler, Evan E

2008-10-01

What makes us human? Specialists in each discipline respond through the lens of their own expertise. In fact, 'anthropogeny' (explaining the origin of humans) requires a transdisciplinary approach that eschews such barriers. Here we take a genomic and genetic perspective towards molecular variation, explore systems analysis of gene expression and discuss an organ-systems approach. Rejecting any 'genes versus environment' dichotomy, we then consider genome interactions with environment, behaviour and culture, finally speculating that aspects of human uniqueness arose because of a primate evolutionary trend towards increasing and irreversible dependence on learned behaviours and culture - perhaps relaxing allowable thresholds for large-scale genomic diversity.
Optimizing Restriction Site Placement for Synthetic Genomes

NASA Astrophysics Data System (ADS)

Montes, Pablo; Memelli, Heraldo; Ward, Charles; Kim, Joondong; Mitchell, Joseph S. B.; Skiena, Steven

Restriction enzymes are the workhorses of molecular biology. We introduce a new problem that arises in the course of our project to design virus variants to serve as potential vaccines: we wish to modify virus-length genomes to introduce large numbers of unique restriction enzyme recognition sites while preserving wild-type function by substitution of synonymous codons. We show that the resulting problem is NP-Complete, give an exponential-time algorithm, and propose effective heuristics, which we show give excellent results for five sample viral genomes. Our resulting modified genomes have several times more unique restriction sites and reduce the maximum gap between adjacent sites by three to nine-fold.
Explaining human uniqueness: genome interactions with environment, behaviour and culture

PubMed Central

Varki, Ajit; Geschwind, Daniel H.; Eichler, Evan E.

2009-01-01

What makes us human? Specialists in each discipline respond through the lens of their own expertise. In fact, ‘anthropogeny’ (explaining the origin of humans) requires a transdisciplinary approach that eschews such barriers. Here we take a genomic and genetic perspective towards molecular variation, explore systems analysis of gene expression and discuss an organ-systems approach. Rejecting any ‘genes versus environment’ dichotomy, we then consider genome interactions with environment, behaviour and culture, finally speculating that aspects of human uniqueness arose because of a primate evolutionary trend towards increasing and irreversible dependence on learned behaviours and culture — perhaps relaxing allowable thresholds for large-scale genomic diversity. PMID:18802414
A signature correlation study of ground target VHF/UHF ISAR imagery

NASA Astrophysics Data System (ADS)

Gatesman, Andrew J.; Beaudoin, Christopher J.; Giles, Robert H.; Kersey, William T.; Waldman, Jerry; Carter, Steve; Nixon, William E.

2003-09-01

VV and HH-polarized radar signatures of several ground targets were acquired in the VHF/UHF band (171-342 MHz) by using 1/35th scale models and an indoor radar range operating from 6 to 12 GHz. Data were processed into medianized radar cross sections as well as focused, ISAR imagery. Measurement validation was confirmed by comparing the radar cross section of a test object with a method of moments radar cross section prediction code. The signatures of several vehicles from three vehicle classes (tanks, trunks, and TELs) were measured and a signature cross-correlation study was performed. The VHF/UHF band is currently being exploited for its foliage penetration ability, however, the coarse image resolution which results from the relatively long radar wavelengths suggests a more challenging target recognition problem. One of the study's goals was to determine the amount of unique signature content in VHF/UHF ISAR imagery of military ground vehicles. Open-field signatures are compared with each other as well as with simplified shapes of similar size. Signatures were also acquired on one vehicle in a variety of configurations to determine the impact of monitor target variations on the signature content at these frequencies.
Molecular Innovation in Ciliates with Complex Genome Rearrangements

NASA Astrophysics Data System (ADS)

Neme, R.; Landweber, L. F.

2017-07-01

We study molecular innovation in several ciliate species with unique massive genome rearrangements to understand how a radically distinct genome architecture can shape the process of acquiring new functions, genes and structures.
Genome-wide analysis of signatures of selection in populations of African honey bees (Apis mellifera) using new web-based tools.

PubMed

Fuller, Zachary L; Niño, Elina L; Patch, Harland M; Bedoya-Reina, Oscar C; Baumgarten, Tracey; Muli, Elliud; Mumoki, Fiona; Ratan, Aakrosh; McGraw, John; Frazier, Maryann; Masiga, Daniel; Schuster, Stephen; Grozinger, Christina M; Miller, Webb

2015-07-10

With the development of inexpensive, high-throughput sequencing technologies, it has become feasible to examine questions related to population genetics and molecular evolution of non-model species in their ecological contexts on a genome-wide scale. Here, we employed a newly developed suite of integrated, web-based programs to examine population dynamics and signatures of selection across the genome using several well-established tests, including F ST, pN/pS, and McDonald-Kreitman. We applied these techniques to study populations of honey bees (Apis mellifera) in East Africa. In Kenya, there are several described A. mellifera subspecies, which are thought to be localized to distinct ecological regions. We performed whole genome sequencing of 11 worker honey bees from apiaries distributed throughout Kenya and identified 3.6 million putative single-nucleotide polymorphisms. The dense coverage allowed us to apply several computational procedures to study population structure and the evolutionary relationships among the populations, and to detect signs of adaptive evolution across the genome. While there is considerable gene flow among the sampled populations, there are clear distinctions between populations from the northern desert region and those from the temperate, savannah region. We identified several genes showing population genetic patterns consistent with positive selection within African bee populations, and between these populations and European A. mellifera or Asian Apis florea. These results lay the groundwork for future studies of adaptive ecological evolution in honey bees, and demonstrate the use of new, freely available web-based tools and workflows ( http://usegalaxy.org/r/kenyanbee ) that can be applied to any model system with genomic information.
European Chlamydia abortus livestock isolate genomes reveal unusual stability and limited diversity, reflected in geographical signatures.

PubMed

Seth-Smith, H M B; Busó, Leonor Sánchez; Livingstone, M; Sait, M; Harris, S R; Aitchison, K D; Vretou, Evangelia; Siarkou, V I; Laroucau, K; Sachse, K; Longbottom, D; Thomson, N R

2017-05-04

Chlamydia abortus (formerly Chlamydophila abortus) is an economically important livestock pathogen, causing ovine enzootic abortion (OEA), and can also cause zoonotic infections in humans affecting pregnancy outcome. Large-scale genomic studies on other chlamydial species are giving insights into the biology of these organisms but have not yet been performed on C. abortus. Our aim was to investigate a broad collection of European isolates of C. abortus, using next generation sequencing methods, looking at diversity, geographic distribution and genome dynamics. Whole genome sequencing was performed on our collection of 57 C. abortus isolates originating primarily from the UK, Germany, France and Greece, but also from Tunisia, Namibia and the USA. Phylogenetic analysis of a total of 64 genomes shows a deep structural division within the C. abortus species with a major clade displaying limited diversity, in addition to a branch carrying two more distantly related Greek isolates, LLG and POS. Within the major clade, seven further phylogenetic groups can be identified, demonstrating geographical associations. The number of variable nucleotide positions across the sampled isolates is significantly lower than those published for C. trachomatis and C. psittaci. No recombination was identified within C. abortus, and no plasmid was found. Analysis of pseudogenes showed lineage specific loss of some functions, notably with several Pmp and TMH/Inc proteins predicted to be inactivated in many of the isolates studied. The diversity within C. abortus appears to be much lower compared to other species within the genus. There are strong geographical signatures within the phylogeny, indicating clonal expansion within areas of limited livestock transport. No recombination has been identified within this species, showing that different species of Chlamydia may demonstrate different evolutionary dynamics, and that the genome of C. abortus is highly stable.
Signatures of Long-Term Balancing Selection in Human Genomes

PubMed Central

de Filippo, Cesare; Teixeira, João C; Schmidt, Joshua M; Kleinert, Philip; Meyer, Diogo; Andrés, Aida M

2018-01-01

Abstract Balancing selection maintains advantageous diversity in populations through various mechanisms. Although extensively explored from a theoretical perspective, an empirical understanding of its prevalence and targets lags behind our knowledge of positive selection. Here, we describe the Non-central Deviation (NCD), a simple yet powerful statistic to detect long-term balancing selection (LTBS) that quantifies how close frequencies are to expectations under LTBS, and provides the basis for a neutrality test. NCD can be applied to a single locus or genomic data, and can be implemented considering only polymorphisms (NCD1) or also considering fixed differences with respect to an outgroup (NCD2) species. Incorporating fixed differences improves power, and NCD2 has higher power to detect LTBS in humans under different frequencies of the balanced allele(s) than other available methods. Applied to genome-wide data from African and European human populations, in both cases using chimpanzee as an outgroup, NCD2 shows that, albeit not prevalent, LTBS affects a sizable portion of the genome: ∼0.6% of analyzed genomic windows and 0.8% of analyzed positions. Significant windows (P < 0.0001) contain 1.6% of SNPs in the genome, which disproportionally fall within exons and change protein sequence, but are not enriched in putatively regulatory sites. These windows overlap ∼8% of the protein-coding genes, and these have larger number of transcripts than expected by chance even after controlling for gene length. Our catalog includes known targets of LTBS but a majority of them (90%) are novel. As expected, immune-related genes are among those with the strongest signatures, although most candidates are involved in other biological functions, suggesting that LTBS potentially influences diverse human phenotypes. PMID:29608730
Decomposing Oncogenic Transcriptional Signatures to Generate Maps of Divergent Cellular States* | Office of Cancer Genomics

Cancer.gov

The systematic sequencing of the cancer genome has led to the identification of numerous genetic alterations in cancer. However, a deeper understanding of the functional consequences of these alterations is necessary to guide appropriate therapeutic strategies. Here, we describe Onco-GPS (OncoGenic Positioning System), a data-driven analysis framework to organize individual tumor samples with shared oncogenic alterations onto a reference map defined by their underlying cellular states.
Whole-genome landscapes of major melanoma subtypes

DOE Office of Scientific and Technical Information (OSTI.GOV)

Hayward, Nicholas K.; Wilmott, James S.; Waddell, Nicola

Melanoma of the skin is a common cancer only in Europeans, whereas it arises in internal body surfaces (mucosal sites) and on the hands and feet (acral sites) in people throughout the world. We report analysis of whole-genome sequences from cutaneous, acral and mucosal subtypes of melanoma. The heavily mutated landscape of coding and non-coding mutations in cutaneous melanoma resolved novel signatures of mutagenesis attributable to ultraviolet radiation. But, acral and mucosal melanomas were dominated by structural changes and mutation signatures of unknown aetiology, not previously identified in melanoma. The number of genes affected by recurrent mutations disrupting non-coding sequencesmore » was similar to that affected by recurrent mutations to coding sequences. Significantly mutated genes included BRAF, CDKN2A, NRAS and TP53 in cutaneous melanoma, BRAF, NRAS and NF1 in acral melanoma and SF3B1 in mucosal melanoma. Mutations affecting the TERT promoter were the most frequent of all; however, neither they nor ATRX mutations, which correlate with alternative telomere lengthening, were associated with greater telomere length. In most cases, melanomas had potentially actionable mutations, most in components of the mitogen-activated protein kinase and phosphoinositol kinase pathways. The whole-genome mutation landscape of melanoma reveals diverse carcinogenic processes across its subtypes, some unrelated to sun exposure, and extends potential involvement of the non-coding genome in its pathogenesis.« less
Whole-genome landscapes of major melanoma subtypes

DOE PAGES

Hayward, Nicholas K.; Wilmott, James S.; Waddell, Nicola; ...

2017-05-03

Melanoma of the skin is a common cancer only in Europeans, whereas it arises in internal body surfaces (mucosal sites) and on the hands and feet (acral sites) in people throughout the world. We report analysis of whole-genome sequences from cutaneous, acral and mucosal subtypes of melanoma. The heavily mutated landscape of coding and non-coding mutations in cutaneous melanoma resolved novel signatures of mutagenesis attributable to ultraviolet radiation. But, acral and mucosal melanomas were dominated by structural changes and mutation signatures of unknown aetiology, not previously identified in melanoma. The number of genes affected by recurrent mutations disrupting non-coding sequencesmore » was similar to that affected by recurrent mutations to coding sequences. Significantly mutated genes included BRAF, CDKN2A, NRAS and TP53 in cutaneous melanoma, BRAF, NRAS and NF1 in acral melanoma and SF3B1 in mucosal melanoma. Mutations affecting the TERT promoter were the most frequent of all; however, neither they nor ATRX mutations, which correlate with alternative telomere lengthening, were associated with greater telomere length. In most cases, melanomas had potentially actionable mutations, most in components of the mitogen-activated protein kinase and phosphoinositol kinase pathways. The whole-genome mutation landscape of melanoma reveals diverse carcinogenic processes across its subtypes, some unrelated to sun exposure, and extends potential involvement of the non-coding genome in its pathogenesis.« less

Impact of genomics on the understanding of microbial evolution and classification: the importance of Darwin's views on classification.

PubMed

Gupta, Radhey S

2016-07-01

Analyses of genome sequences, by some approaches, suggest that the widespread occurrence of horizontal gene transfers (HGTs) in prokaryotes disguises their evolutionary relationships and have led to questioning of the Darwinian model of evolution for prokaryotes. These inferences are critically examined in the light of comparative genome analysis, characteristic synapomorphies, phylogenetic trees and Darwin's views on examining evolutionary relationships. Genome sequences are enabling discovery of numerous molecular markers (synapomorphies) such as conserved signature indels (CSIs) and conserved signature proteins (CSPs), which are distinctive characteristics of different prokaryotic taxa. Based on these molecular markers, exhibiting high degree of specificity and predictive ability, numerous prokaryotic taxa of different ranks, currently identified based on the 16S rRNA gene trees, can now be reliably demarcated in molecular terms. Within all studied groups, multiple CSIs and CSPs have been identified for successive nested clades providing reliable information regarding their hierarchical relationships and these inferences are not affected by HGTs. These results strongly support Darwin's views on evolution and classification and supplement the current phylogenetic framework based on 16S rRNA in important respects. The identified molecular markers provide important means for developing novel diagnostics, therapeutics and for functional studies providing important insights regarding prokaryotic taxa. © FEMS 2016. All rights reserved. For permissions, please e-mail: journals.permissions@oup.com.
Uncovering the genetic signature of quantitative trait evolution with replicated time series data.

PubMed

Franssen, S U; Kofler, R; Schlötterer, C

2017-01-01

The genetic architecture of adaptation in natural populations has not yet been resolved: it is not clear to what extent the spread of beneficial mutations (selective sweeps) or the response of many quantitative trait loci drive adaptation to environmental changes. Although much attention has been given to the genomic footprint of selective sweeps, the importance of selection on quantitative traits is still not well studied, as the associated genomic signature is extremely difficult to detect. We propose 'Evolve and Resequence' as a promising tool, to study polygenic adaptation of quantitative traits in evolving populations. Simulating replicated time series data we show that adaptation to a new intermediate trait optimum has three characteristic phases that are reflected on the genomic level: (1) directional frequency changes towards the new trait optimum, (2) plateauing of allele frequencies when the new trait optimum has been reached and (3) subsequent divergence between replicated trajectories ultimately leading to the loss or fixation of alleles while the trait value does not change. We explore these 3 phase characteristics for relevant population genetic parameters to provide expectations for various experimental evolution designs. Remarkably, over a broad range of parameters the trajectories of selected alleles display a pattern across replicates, which differs both from neutrality and directional selection. We conclude that replicated time series data from experimental evolution studies provide a promising framework to study polygenic adaptation from whole-genome population genetics data.
Detecting selection signatures between Duroc and Duroc synthetic pig populations using high-density SNP chip.

PubMed

Edea, Z; Hong, J-K; Jung, J-H; Kim, D-W; Kim, Y-M; Kim, E-S; Shin, S S; Jung, Y C; Kim, K-S

2017-08-01

The development of high throughput genotyping techniques has facilitated the identification of selection signatures of pigs. The detection of genomic selection signals in a population subjected to differential selection pressures may provide insights into the genes associated with economically and biologically important traits. To identify genomic regions under selection, we genotyped 488 Duroc (D) pigs and 155 D × Korean native pigs (DKNPs) using the Porcine SNP70K BeadChip. By applying the F ST and extended haplotype homozygosity (EHH-Rsb) methods, we detected genes under directional selection associated with growth/stature (DOCK7, PLCB4, HS2ST1, FBP2 and TG), carcass and meat quality (TG, COL14A1, FBXO5, NR3C1, SNX7, ARHGAP26 and DPYD), number of teats (LOC100153159 and LRRC1), pigmentation (MME) and ear morphology (SOX5), which are all mostly near or at fixation. These results could be a basis for investigating the underlying mutations associated with observed phenotypic variation. Validation using genome-wide association analysis would also facilitate the inclusion of some of these markers in genetic evaluation programs. © 2017 Stichting International Foundation for Animal Genetics.
A negative genetic interaction map in isogenic cancer cell lines reveals cancer cell vulnerabilities

PubMed Central

Vizeacoumar, Franco J; Arnold, Roland; Vizeacoumar, Frederick S; Chandrashekhar, Megha; Buzina, Alla; Young, Jordan T F; Kwan, Julian H M; Sayad, Azin; Mero, Patricia; Lawo, Steffen; Tanaka, Hiromasa; Brown, Kevin R; Baryshnikova, Anastasia; Mak, Anthony B; Fedyshyn, Yaroslav; Wang, Yadong; Brito, Glauber C; Kasimer, Dahlia; Makhnevych, Taras; Ketela, Troy; Datti, Alessandro; Babu, Mohan; Emili, Andrew; Pelletier, Laurence; Wrana, Jeff; Wainberg, Zev; Kim, Philip M; Rottapel, Robert; O'Brien, Catherine A; Andrews, Brenda; Boone, Charles; Moffat, Jason

2013-01-01

Improved efforts are necessary to define the functional product of cancer mutations currently being revealed through large-scale sequencing efforts. Using genome-scale pooled shRNA screening technology, we mapped negative genetic interactions across a set of isogenic cancer cell lines and confirmed hundreds of these interactions in orthogonal co-culture competition assays to generate a high-confidence genetic interaction network of differentially essential or differential essentiality (DiE) genes. The network uncovered examples of conserved genetic interactions, densely connected functional modules derived from comparative genomics with model systems data, functions for uncharacterized genes in the human genome and targetable vulnerabilities. Finally, we demonstrate a general applicability of DiE gene signatures in determining genetic dependencies of other non-isogenic cancer cell lines. For example, the PTEN−/− DiE genes reveal a signature that can preferentially classify PTEN-dependent genotypes across a series of non-isogenic cell lines derived from the breast, pancreas and ovarian cancers. Our reference network suggests that many cancer vulnerabilities remain to be discovered through systematic derivation of a network of differentially essential genes in an isogenic cancer cell model. PMID:24104479
Cattle genome-wide analysis reveals genetic signatures in trypanotolerant N'Dama.

PubMed

Kim, Soo-Jin; Ka, Sojeong; Ha, Jung-Woo; Kim, Jaemin; Yoo, DongAhn; Kim, Kwondo; Lee, Hak-Kyo; Lim, Dajeong; Cho, Seoae; Hanotte, Olivier; Mwai, Okeyo Ally; Dessie, Tadelle; Kemp, Stephen; Oh, Sung Jong; Kim, Heebal

2017-05-12

Indigenous cattle in Africa have adapted to various local environments to acquire superior phenotypes that enhance their survival under harsh conditions. While many studies investigated the adaptation of overall African cattle, genetic characteristics of each breed have been poorly studied. We performed the comparative genome-wide analysis to assess evidence for subspeciation within species at the genetic level in trypanotolerant N'Dama cattle. We analysed genetic variation patterns in N'Dama from the genomes of 101 cattle breeds including 48 samples of five indigenous African cattle breeds and 53 samples of various commercial breeds. Analysis of SNP variances between cattle breeds using wMI, XP-CLR, and XP-EHH detected genes containing N'Dama-specific genetic variants and their potential associations. Functional annotation analysis revealed that these genes are associated with ossification, neurological and immune system. Particularly, the genes involved in bone formation indicate that local adaptation of N'Dama may engage in skeletal growth as well as immune systems. Our results imply that N'Dama might have acquired distinct genotypes associated with growth and regulation of regional diseases including trypanosomiasis. Moreover, this study offers significant insights into identifying genetic signatures for natural and artificial selection of diverse African cattle breeds.
Genomic Signatures of Speciation in Sympatric and Allopatric Hawaiian Picture-Winged Drosophila.

PubMed

Kang, Lin; Settlage, Robert; McMahon, Wyatt; Michalak, Katarzyna; Tae, Hongseok; Garner, Harold R; Stacy, Elizabeth A; Price, Donald K; Michalak, Pawel

2016-05-30

The Hawaiian archipelago provides a natural arena for understanding adaptive radiation and speciation. The Hawaiian Drosophila are one of the most diverse endemic groups in Hawaiì with up to 1,000 species. We sequenced and analyzed entire genomes of recently diverged species of Hawaiian picture-winged Drosophila, Drosophila silvestris and Drosophila heteroneura from Hawaiì Island, in comparison with Drosophila planitibia, their sister species from Maui, a neighboring island where a common ancestor of all three had likely occurred. Genome-wide single nucleotide polymorphism patterns suggest the more recent origin of D. silvestris and D. heteroneura, as well as a pervasive influence of positive selection on divergence of the three species, with the signatures of positive selection more prominent in sympatry than allopatry. Positively selected genes were significantly enriched for functional terms related to sensory detection and mating, suggesting that sexual selection played an important role in speciation of these species. In particular, sequence variation in Olfactory receptor and Gustatory receptor genes seems to play a major role in adaptive radiation in Hawaiian pictured-winged Drosophila. © The Author 2016. Published by Oxford University Press on behalf of the Society for Molecular Biology and Evolution.
Gene discovery in an invasive tephritid model pest species, the Mediterranean fruit fly, Ceratitis capitata

PubMed Central

Gomulski, Ludvik M; Dimopoulos, George; Xi, Zhiyong; Soares, Marcelo B; Bonaldo, Maria F; Malacrida, Anna R; Gasperi, Giuliano

2008-01-01

Background The medfly, Ceratitis capitata, is a highly invasive agricultural pest that has become a model insect for the development of biological control programs. Despite research into the behavior and classical and population genetics of this organism, the quantity of sequence data available is limited. We have utilized an expressed sequence tag (EST) approach to obtain detailed information on transcriptome signatures that relate to a variety of physiological systems in the medfly; this information emphasizes on reproduction, sex determination, and chemosensory perception, since the study was based on normalized cDNA libraries from embryos and adult heads. Results A total of 21,253 high-quality ESTs were obtained from the embryo and head libraries. Clustering analyses performed separately for each library resulted in 5201 embryo and 6684 head transcripts. Considering an estimated 19% overlap in the transcriptomes of the two libraries, they represent about 9614 unique transcripts involved in a wide range of biological processes and molecular functions. Of particular interest are the sequences that share homology with Drosophila genes involved in sex determination, olfaction, and reproductive behavior. The medfly transformer2 (tra2) homolog was identified among the embryonic sequences, and its genomic organization and expression were characterized. Conclusion The sequences obtained in this study represent the first major dataset of expressed genes in a tephritid species of agricultural importance. This resource provides essential information to support the investigation of numerous questions regarding the biology of the medfly and other related species and also constitutes an invaluable tool for the annotation of complete genome sequences. Our study has revealed intriguing findings regarding the transcript regulation of tra2 and other sex determination genes, as well as insights into the comparative genomics of genes implicated in chemosensory reception and reproduction. PMID:18500975
Inflammatory macrophage-associated 3-gene signature predicts subclinical allograft injury and graft survival.

PubMed

Azad, Tej D; Donato, Michele; Heylen, Line; Liu, Andrew B; Shen-Orr, Shai S; Sweeney, Timothy E; Maltzman, Jonathan Scott; Naesens, Maarten; Khatri, Purvesh

2018-01-25

Late allograft failure is characterized by cumulative subclinical insults manifesting over many years. Although immunomodulatory therapies targeting host T cells have improved short-term survival rates, rates of chronic allograft loss remain high. We hypothesized that other immune cell types may drive subclinical injury, ultimately leading to graft failure. We collected whole-genome transcriptome profiles from 15 independent cohorts composed of 1,697 biopsy samples to assess the association of an inflammatory macrophage polarization-specific gene signature with subclinical injury. We applied penalized regression to a subset of the data sets and identified a 3-gene inflammatory macrophage-derived signature. We validated discriminatory power of the 3-gene signature in 3 independent renal transplant data sets with mean AUC of 0.91. In a longitudinal cohort, the 3-gene signature strongly correlated with extent of injury and accurately predicted progression of subclinical injury 18 months before clinical manifestation. The 3-gene signature also stratified patients at high risk of graft failure as soon as 15 days after biopsy. We found that the 3-gene signature also distinguished acute rejection (AR) accurately in 3 heart transplant data sets but not in lung transplant. Overall, we identified a parsimonious signature capable of diagnosing AR, recognizing subclinical injury, and risk-stratifying renal transplant patients. Our results strongly suggest that inflammatory macrophages may be a viable therapeutic target to improve long-term outcomes for organ transplantation patients.
DOE Office of Scientific and Technical Information (OSTI.GOV)

Kishigami, Satoshi; Kinki University, 930 Nishimitani, Kinokawa 599-5993; Wakayama, Sayaka

In mammals, a diploid genome of an individual following fertilization of an egg and a spermatozoon is unique and irreproducible. This implies that the generated unique diploid genome is doomed with the individual ending. Even as cultured cells from the individual, they cannot normally proliferate in perpetuity because of the 'Hayflick limit'. However, Dolly, the sheep cloned from an adult mammary gland cell, changes this scenario. Somatic cell nuclear transfer (SCNT) enables us to produce offspring without germ cells, that is, to 'passage' a unique diploid genome. Animal cloning has also proven to be a powerful research tool for reprogrammingmore » in many mammals, notably mouse and cow. The mechanism underlying reprogramming, however, remains largely unknown and, animal cloning has been inefficient as a result. More momentously, in addition to abortion and fetal mortality, some cloned animals display possible premature aging phenotypes including early death and short telomere lengths. Under these inauspicious conditions, is it really possible for SCNT to preserve a diploid genome? Delightfully, in mouse and recently in primate, using SCNT we can produce nuclear transfer ES cells (ntES) more efficiently, which can preserve the eternal lifespan for the 'passage' of a unique diploid genome. Further, new somatic cloning technique using histone-deacetylase inhibitors has been developed which can significantly increase the previous cloning rates two to six times. Here, we introduce SCNT and its value as a preservation tool for a diploid genome while reviewing aging of cloned animals on cellular and individual levels.« less
Complete genome sequence of Brachyspira intermedia reveals unique genomic features in Brachyspira species and phage-mediated horizontal gene transfer

PubMed Central

2011-01-01

Background Brachyspira spp. colonize the intestines of some mammalian and avian species and show different degrees of enteropathogenicity. Brachyspira intermedia can cause production losses in chickens and strain PWS/AT now becomes the fourth genome to be completed in the genus Brachyspira. Results 15 classes of unique and shared genes were analyzed in B. intermedia, B. murdochii, B. hyodysenteriae and B. pilosicoli. The largest number of unique genes was found in B. intermedia and B. murdochii. This indicates the presence of larger pan-genomes. In general, hypothetical protein annotations are overrepresented among the unique genes. A 3.2 kb plasmid was found in B. intermedia strain PWS/AT. The plasmid was also present in the B. murdochii strain but not in nine other Brachyspira isolates. Within the Brachyspira genomes, genes had been translocated and also frequently switched between leading and lagging strands, a process that can be followed by different AT-skews in the third positions of synonymous codons. We also found evidence that bacteriophages were being remodeled and genes incorporated into them. Conclusions The accessory gene pool shapes species-specific traits. It is also influenced by reductive genome evolution and horizontal gene transfer. Gene-transfer events can cross both species and genus boundaries and bacteriophages appear to play an important role in this process. A mechanism for horizontal gene transfer appears to be gene translocations leading to remodeling of bacteriophages in combination with broad tropism. PMID:21816042
Genome flux and stasis in a five millennium transect of European prehistory

PubMed Central

Gamba, Cristina; Jones, Eppie R.; Teasdale, Matthew D.; McLaughlin, Russell L.; Gonzalez-Fortes, Gloria; Mattiangeli, Valeria; Domboróczki, László; Kővári, Ivett; Pap, Ildikó; Anders, Alexandra; Whittle, Alasdair; Dani, János; Raczky, Pál; Higham, Thomas F. G.; Hofreiter, Michael; Bradley, Daniel G; Pinhasi, Ron

2014-01-01

The Great Hungarian Plain was a crossroads of cultural transformations that have shaped European prehistory. Here we analyse a 5,000-year transect of human genomes, sampled from petrous bones giving consistently excellent endogenous DNA yields, from 13 Hungarian Neolithic, Copper, Bronze and Iron Age burials including two to high (~22 × ) and seven to ~1 × coverage, to investigate the impact of these on Europe’s genetic landscape. These data suggest genomic shifts with the advent of the Neolithic, Bronze and Iron Ages, with interleaved periods of genome stability. The earliest Neolithic context genome shows a European hunter-gatherer genetic signature and a restricted ancestral population size, suggesting direct contact between cultures after the arrival of the first farmers into Europe. The latest, Iron Age, sample reveals an eastern genomic influence concordant with introduced Steppe burial rites. We observe transition towards lighter pigmentation and surprisingly, no Neolithic presence of lactase persistence. PMID:25334030
Genome flux and stasis in a five millennium transect of European prehistory.

PubMed

Gamba, Cristina; Jones, Eppie R; Teasdale, Matthew D; McLaughlin, Russell L; Gonzalez-Fortes, Gloria; Mattiangeli, Valeria; Domboróczki, László; Kővári, Ivett; Pap, Ildikó; Anders, Alexandra; Whittle, Alasdair; Dani, János; Raczky, Pál; Higham, Thomas F G; Hofreiter, Michael; Bradley, Daniel G; Pinhasi, Ron

2014-10-21

The Great Hungarian Plain was a crossroads of cultural transformations that have shaped European prehistory. Here we analyse a 5,000-year transect of human genomes, sampled from petrous bones giving consistently excellent endogenous DNA yields, from 13 Hungarian Neolithic, Copper, Bronze and Iron Age burials including two to high (~22 × ) and seven to ~1 × coverage, to investigate the impact of these on Europe's genetic landscape. These data suggest genomic shifts with the advent of the Neolithic, Bronze and Iron Ages, with interleaved periods of genome stability. The earliest Neolithic context genome shows a European hunter-gatherer genetic signature and a restricted ancestral population size, suggesting direct contact between cultures after the arrival of the first farmers into Europe. The latest, Iron Age, sample reveals an eastern genomic influence concordant with introduced Steppe burial rites. We observe transition towards lighter pigmentation and surprisingly, no Neolithic presence of lactase persistence.
A role for Tn6029 in the evolution of the complex antibiotic resistance gene loci in genomic island 3 in enteroaggregative hemorrhagic Escherichia coli O104:H4.

PubMed

Roy Chowdhury, Piklu; Charles, Ian G; Djordjevic, Steven P

2015-01-01

In enteroaggregative hemorrhagic Escherichia coli (EAHEC) O104 the complex antibiotic resistance gene loci (CRL) found in the region of divergence 1 (RD1) within E. coli genomic island 3 (GI3) contains blaTEM-1, strAB, sul2, tet(A)A, and dfrA7 genes encoding resistance to ampicillin, streptomycin, sulfamethoxazole, tetracycline and trimethoprim respectively. The precise arrangement of antibiotic resistance genes and the role of mobile elements that drove the evolutionary events and created the CRL have not been investigated. We used a combination of bioinformatics and iterative BLASTn searches to determine the micro-evolutionary events that likely led to the formation of the CRL in GI3 using the closed genome sequences of EAHEC O104:H4 strains 2011C-3493 and 2009EL-2050 and high quality draft genomes of EAHEC E. coli O104:H4 isolates from sporadic cases not associated with the initial outbreak. Our analyses indicate that the CRL in GI3 evolved from a progenitor structure that contained an In2-derived class 1 integron in a Tn21/Tn1721 hybrid backbone. Within the hybrid backbone, a Tn6029-family transposon, identified here as Tn6029C abuts the sul1 gene in the 3'-Conserved Segment (-CS) of a class 1 integron generating a unique molecular signature that has only previously been observed in pASL01a, a small plasmid found in commensal E. coli in West Africa. From this common progenitor, independent IS26-mediated events created two novel transposons identified here as Tn6029D and Tn6222 in 2011C-3493 and 2009EL-2050 respectively. Analysis of RD1 within GI3 reveals IS26 has played a crucial role in the assembly of regions within the CRL.
DOE Office of Scientific and Technical Information (OSTI.GOV)

Merkley, Eric D.; Sego, Landon H.; Lin, Andy

Adaptive processes in bacterial species can occur rapidly in laboratory culture, leading to genetic divergence between naturally occurring and laboratory-adapted strains. Differentiating wild and closely-related laboratory strains is clearly important for biodefense and bioforensics; however, DNA sequence data alone has thus far not provided a clear signature, perhaps due to lack of understanding of how diverse genome changes lead to adapted phenotypes. Protein abundance profiles from mass spectrometry-based proteomics analyses are a molecular measure of phenotype. Proteomics data contains sufficient information that powerful statistical methods can uncover signatures that distinguish wild strains of Yersinia pestis from laboratory-adapted strains.
Draft Genome Sequence of the Spore-Forming Probiotic Strain Bacillus coagulans Unique IS-2

PubMed Central

Upadrasta, Aditya; Pitta, Swetha

2016-01-01

Bacillus coagulans Unique IS-2 is a potential spore-forming probiotic that is commercially available on the market. The draft genome sequence presented here provides deep insight into the beneficial features of this strain for its safe use as a probiotic for various human and animal health applications. PMID:27103709
Identification of a unique library of complex, but ordered, arrays of repetitive elements in the human genome and implication of their potential involvement in pathobiology.

PubMed

Lee, Kang-Hoon; Lee, Young-Kwan; Kwon, Deug-Nam; Chiu, Sophia; Chew, Victoria; Rah, Hyungchul; Kujawski, Gregory; Melhem, Ramzi; Hsu, Karen; Chung, Cecilia; Greenhalgh, David G; Cho, Kiho

2011-06-01

Approximately 2% of the human genome is reported to be occupied by genes. Various forms of repetitive elements (REs), both characterized and uncharacterized, are presumed to make up the vast majority of the rest of the genomes of human and other species. In conjunction with a comprehensive annotation of genes, information regarding components of genome biology, such as gene polymorphisms, non-coding RNAs, and certain REs, is found in human genome databases. However, the genome-wide profile of unique RE arrangements formed by different groups of REs has not been fully characterized yet. In this study, the entire human genome was subjected to an unbiased RE survey to establish a whole-genome profile of REs and their arrangements. Due to the limitation in query size within the bl2seq alignment program (National Center for Biotechnology Information [NCBI]) utilized for the RE survey, the entire NCBI reference human genome was fragmented into 6206 units of 0.5M nucleotides. A number of RE arrangements with varying complexities and patterns were identified throughout the genome. Each chromosome had unique profiles of RE arrangements and density, and high levels of RE density were measured near the centromere regions. Subsequently, 175 complex RE arrangements, which were selected throughout the genome, were subjected to a comparison analysis using five different human genome sequences. Interestingly, three of the five human genome databases shared the exactly same arrangement patterns and sequences for all 175 RE arrangement regions (a total of 12,765,625 nucleotides). The findings from this study demonstrate that a substantial fraction of REs in the human genome are clustered into various forms of ordered structures. Further investigations are needed to examine whether some of these ordered RE arrangements contribute to the human pathobiology as a functional genome unit. Copyright © 2011 Elsevier Inc. All rights reserved.
Incorporating genomic, transcriptomic and clinical data: a prognostic and stem cell-like MYC and PRC imbalance in high-risk neuroblastoma.

PubMed

Yang, Xinan Holly; Tang, Fangming; Shin, Jisu; Cunningham, John M

2017-10-03

Previous studies suggested that cancer cells possess traits reminiscent of the biological mechanisms ascribed to normal embryonic stem cells (ESCs) regulated by MYC and Polycomb repressive complex 2 (PRC2). Several poorly differentiated adult tumors showed preferentially high expression levels in targets of MYC, coincident with low expression levels in targets of PRC2. This paper will reveal this ESC-like cancer signature in high-risk neuroblastoma (HR-NB), the most common extracranial solid tumor in children. We systematically assembled genomic variants, gene expression changes, priori knowledge of gene functions, and clinical outcomes to identify prognostic multigene signatures. First, we assigned a new, individualized prognostic index using the relative expressions between the poor- and good-outcome signature genes. We then characterized HR-NB aggressiveness beyond these prognostic multigene signatures through the imbalanced effects of MYC and PRC2 signaling. We further analyzed Retinoic acid (RA)-induced HR-NB cells to model tumor cell differentiation. Finally, we performed in vitro validation on ZFHX3, a cell differentiation marker silenced by PRC2, and compared cell morphology changes before and after blocking PRC2 in HR-NB cells. A significant concurrence existed between exons with verified variants and genes showing MYCN-dependent expression in HR-NB. From these biomarker candidates, we identified two novel prognostic gene-set pairs with multi-scale oncogenic defects. Intriguingly, MYC targets over-represented an unfavorable component of the identified prognostic signatures while PRC2 targets over-represented a favorable component. The cell cycle arrest and neuronal differentiation marker ZFHX3 was identified as one of PRC2-silenced tumor suppressor candidates. Blocking PRC2 reduced tumor cell growth and increased the mRNA expression levels of ZFHX3 in an early treatment stage. This hypothesis-driven systems bioinformatics work offered novel insights into the PRC2-mediated tumor cell growth and differentiation in neuroblastoma, which may exert oncogenic effects together with MYC regulation. Our results propose a prognostic effect of imbalanced MYC and PRC2 moderations in pediatric HR-NB for the first time. This study demonstrates an incorporation of genomic landscapes and transcriptomic profiles into the hypothesis-driven precision prognosis and biomarker discovery. The application of this approach to neuroblastoma, as well as other cancer more broadly, could contribute to reduced relapse and mortality rates in the long term.
Weekly Hydrometeorological Signatures - Characterization of Urban-Induced Streamflow and Rainfall Variability

NASA Astrophysics Data System (ADS)

Schnier, S.; Cai, X.; Sivapalan, M.

2014-12-01

About half of all humans alive today live in cities, with that number projected to grow to 70% by 2050. Because most people live in cities, urban streamflow patterns and precipitation events have a large impact on the global population. Urban environments can alter natural streamflow and precipitation patterns in a localized area. This study introduces a novel way to characterize this interference: the weekly hydrometeorological signature. Daily streamflow and precipitation data is collected from USGS gages around three climatically-different major American cities: Chicago, Los Angeles, and Charlotte. The following hypothesis is tested: a persistent weekly pattern (Monday through Sunday) exists in the hydrometeorological data which is unique to each city. All three cities appear to exhibit a persistent weekly pattern which is unique to that city for various climatological, industrial, and topographic reasons. Further study is needed; however these findings have important implications for understanding urban weather and can serve as a unique identifier, or fingerprint, for human interference to local streamflow and precipitation patterns.
Partial bisulfite conversion for unique template sequencing

PubMed Central

Kumar, Vijay; Rosenbaum, Julie; Wang, Zihua; Forcier, Talitha; Ronemus, Michael; Wigler, Michael

2018-01-01

Abstract We introduce a new protocol, mutational sequencing or muSeq, which uses sodium bisulfite to randomly deaminate unmethylated cytosines at a fixed and tunable rate. The muSeq protocol marks each initial template molecule with a unique mutation signature that is present in every copy of the template, and in every fragmented copy of a copy. In the sequenced read data, this signature is observed as a unique pattern of C-to-T or G-to-A nucleotide conversions. Clustering reads with the same conversion pattern enables accurate count and long-range assembly of initial template molecules from short-read sequence data. We explore count and low-error sequencing by profiling 135 000 restriction fragments in a PstI representation, demonstrating that muSeq improves copy number inference and significantly reduces sporadic sequencer error. We explore long-range assembly in the context of cDNA, generating contiguous transcript clusters greater than 3,000 bp in length. The muSeq assemblies reveal transcriptional diversity not observable from short-read data alone. PMID:29161423
PYTi-NiCr Signatures in the Columbia Hills are Present in Certain Martian Meteorites

NASA Technical Reports Server (NTRS)

Clark, B. C.; Gellert, R.; Ming, D. W.; Morris, R. V.; Mittlefehldt, D. W.; Squyres, S. W.

2006-01-01

Uniquely high levels of phosphorus and titanium were observed in several samples [1-3] by the APXS x-ray fluorescence measurements as the MER Spirit rover climbed Husband Hill (Columbia Hills, Gusev crater, Mars). A careful study of many such samples and their geochemical variability has revealed additional elements in this pattern, and that the derived multi-element signature is also unambiguously manifested in several martian meteorites.

Some links on this page may take you to non-federal websites. Their policies may differ from this site.