Recent advances in understanding the role of nutrition in human genome evolution.
Ye, Kaixiong; Gu, Zhenglong
2011-11-01
Dietary transitions in human history have been suggested to play important roles in the evolution of mankind. Genetic variations caused by adaptation to diet during human evolution could have important health consequences in current society. The advance of sequencing technologies and the rapid accumulation of genome information provide an unprecedented opportunity to comprehensively characterize genetic variations in human populations and unravel the genetic basis of human evolution. Series of selection detection methods, based on various theoretical models and exploiting different aspects of selection signatures, have been developed. Their applications at the species and population levels have respectively led to the identification of human specific selection events that distinguish human from nonhuman primates and local adaptation events that contribute to human diversity. Scrutiny of candidate genes has revealed paradigms of adaptations to specific nutritional components and genome-wide selection scans have verified the prevalence of diet-related selection events and provided many more candidates awaiting further investigation. Understanding the role of diet in human evolution is fundamental for the development of evidence-based, genome-informed nutritional practices in the era of personal genomics.
Dong, Xinran; Wang, Xiao; Zhang, Feng; Tian, Weidong
2016-01-01
Accelerated evolution of regulatory sequence can alter the expression pattern of target genes, and cause phenotypic changes. In this study, we used DNase I hypersensitive sites (DHSs) to annotate putative regulatory sequences in the human genome, and conducted a genome-wide analysis of the effects of accelerated evolution on regulatory sequences. Working under the assumption that local ancient repeat elements of DHSs are under neutral evolution, we discovered that ∼0.44% of DHSs are under accelerated evolution (ace-DHSs). We found that ace-DHSs tend to be more active than background DHSs, and are strongly associated with epigenetic marks of active transcription. The target genes of ace-DHSs are significantly enriched in neuron-related functions, and their expression levels are positively selected in the human brain. Thus, these lines of evidences strongly suggest that accelerated evolution on regulatory sequences plays important role in the evolution of human-specific phenotypes. PMID:27401230
Dong, Xinran; Wang, Xiao; Zhang, Feng; Tian, Weidong
2016-10-01
Accelerated evolution of regulatory sequence can alter the expression pattern of target genes, and cause phenotypic changes. In this study, we used DNase I hypersensitive sites (DHSs) to annotate putative regulatory sequences in the human genome, and conducted a genome-wide analysis of the effects of accelerated evolution on regulatory sequences. Working under the assumption that local ancient repeat elements of DHSs are under neutral evolution, we discovered that ∼0.44% of DHSs are under accelerated evolution (ace-DHSs). We found that ace-DHSs tend to be more active than background DHSs, and are strongly associated with epigenetic marks of active transcription. The target genes of ace-DHSs are significantly enriched in neuron-related functions, and their expression levels are positively selected in the human brain. Thus, these lines of evidences strongly suggest that accelerated evolution on regulatory sequences plays important role in the evolution of human-specific phenotypes. © The Author 2016. Published by Oxford University Press on behalf of the Society for Molecular Biology and Evolution.
The spotted gar genome illuminates vertebrate evolution and facilitates human-teleost comparisons.
Braasch, Ingo; Gehrke, Andrew R; Smith, Jeramiah J; Kawasaki, Kazuhiko; Manousaki, Tereza; Pasquier, Jeremy; Amores, Angel; Desvignes, Thomas; Batzel, Peter; Catchen, Julian; Berlin, Aaron M; Campbell, Michael S; Barrell, Daniel; Martin, Kyle J; Mulley, John F; Ravi, Vydianathan; Lee, Alison P; Nakamura, Tetsuya; Chalopin, Domitille; Fan, Shaohua; Wcisel, Dustin; Cañestro, Cristian; Sydes, Jason; Beaudry, Felix E G; Sun, Yi; Hertel, Jana; Beam, Michael J; Fasold, Mario; Ishiyama, Mikio; Johnson, Jeremy; Kehr, Steffi; Lara, Marcia; Letaw, John H; Litman, Gary W; Litman, Ronda T; Mikami, Masato; Ota, Tatsuya; Saha, Nil Ratan; Williams, Louise; Stadler, Peter F; Wang, Han; Taylor, John S; Fontenot, Quenton; Ferrara, Allyse; Searle, Stephen M J; Aken, Bronwen; Yandell, Mark; Schneider, Igor; Yoder, Jeffrey A; Volff, Jean-Nicolas; Meyer, Axel; Amemiya, Chris T; Venkatesh, Byrappa; Holland, Peter W H; Guiguen, Yann; Bobe, Julien; Shubin, Neil H; Di Palma, Federica; Alföldi, Jessica; Lindblad-Toh, Kerstin; Postlethwait, John H
2016-04-01
To connect human biology to fish biomedical models, we sequenced the genome of spotted gar (Lepisosteus oculatus), whose lineage diverged from teleosts before teleost genome duplication (TGD). The slowly evolving gar genome has conserved in content and size many entire chromosomes from bony vertebrate ancestors. Gar bridges teleosts to tetrapods by illuminating the evolution of immunity, mineralization and development (mediated, for example, by Hox, ParaHox and microRNA genes). Numerous conserved noncoding elements (CNEs; often cis regulatory) undetectable in direct human-teleost comparisons become apparent using gar: functional studies uncovered conserved roles for such cryptic CNEs, facilitating annotation of sequences identified in human genome-wide association studies. Transcriptomic analyses showed that the sums of expression domains and expression levels for duplicated teleost genes often approximate the patterns and levels of expression for gar genes, consistent with subfunctionalization. The gar genome provides a resource for understanding evolution after genome duplication, the origin of vertebrate genomes and the function of human regulatory sequences.
The spotted gar genome illuminates vertebrate evolution and facilitates human-to-teleost comparisons
Braasch, Ingo; Gehrke, Andrew R.; Smith, Jeramiah J.; Kawasaki, Kazuhiko; Manousaki, Tereza; Pasquier, Jeremy; Amores, Angel; Desvignes, Thomas; Batzel, Peter; Catchen, Julian; Berlin, Aaron M.; Campbell, Michael S.; Barrell, Daniel; Martin, Kyle J.; Mulley, John F.; Ravi, Vydianathan; Lee, Alison P.; Nakamura, Tetsuya; Chalopin, Domitille; Fan, Shaohua; Wcisel, Dustin; Cañestro, Cristian; Sydes, Jason; Beaudry, Felix E. G.; Sun, Yi; Hertel, Jana; Beam, Michael J.; Fasold, Mario; Ishiyama, Mikio; Johnson, Jeremy; Kehr, Steffi; Lara, Marcia; Letaw, John H.; Litman, Gary W.; Litman, Ronda T.; Mikami, Masato; Ota, Tatsuya; Saha, Nil Ratan; Williams, Louise; Stadler, Peter F.; Wang, Han; Taylor, John S.; Fontenot, Quenton; Ferrara, Allyse; Searle, Stephen M. J.; Aken, Bronwen; Yandell, Mark; Schneider, Igor; Yoder, Jeffrey A.; Volff, Jean-Nicolas; Meyer, Axel; Amemiya, Chris T.; Venkatesh, Byrappa; Holland, Peter W. H.; Guiguen, Yann; Bobe, Julien; Shubin, Neil H.; Di Palma, Federica; Alföldi, Jessica; Lindblad-Toh, Kerstin; Postlethwait, John H.
2016-01-01
To connect human biology to fish biomedical models, we sequenced the genome of spotted gar (Lepisosteus oculatus), whose lineage diverged from teleosts before the teleost genome duplication (TGD). The slowly evolving gar genome conserved in content and size many entire chromosomes from bony vertebrate ancestors. Gar bridges teleosts to tetrapods by illuminating the evolution of immunity, mineralization, and development (e.g., Hox, ParaHox, and miRNA genes). Numerous conserved non-coding elements (CNEs, often cis-regulatory) undetectable in direct human-teleost comparisons become apparent using gar: functional studies uncovered conserved roles of such cryptic CNEs, facilitating annotation of sequences identified in human genome-wide association studies. Transcriptomic analyses revealed that the sum of expression domains and levels from duplicated teleost genes often approximate patterns and levels of gar genes, consistent with subfunctionalization. The gar genome provides a resource for understanding evolution after genome duplication, the origin of vertebrate genomes, and the function of human regulatory sequences. PMID:26950095
The Evolution of the Human Genome
Simonti, Corinne N.; Capra, John A.
2015-01-01
Human genomes hold a record of the evolutionary forces that have shaped our species. Advances in DNA sequencing, functional genomics, and population genetic modeling have deepened our understanding of human demographic history, natural selection, and many other long-studied topics. These advances have also revealed many previously underappreciated factors that influence the evolution of the human genome, including functional modifications to DNA and histones, conserved 3D topological chromatin domains, structural variation, and heterogeneous mutation patterns along the genome. Using evolutionary theory as a lens to study these phenomena will lead to significant breakthroughs in understanding what makes us human and why we get sick. PMID:26338498
Evolution of Local Mutation Rate and Its Determinants.
Terekhanova, Nadezhda V; Seplyarskiy, Vladimir B; Soldatov, Ruslan A; Bazykin, Georgii A
2017-05-01
Mutation rate varies along the human genome, and part of this variation is explainable by measurable local properties of the DNA molecule. Moreover, mutation rates differ between orthologous genomic regions of different species, but the drivers of this change are unclear. Here, we use data on human divergence from chimpanzee, human rare polymorphism, and human de novo mutations to predict the substitution rate at orthologous regions of non-human mammals. We show that the local mutation rates are very similar between human and apes, implying that their variation has a strong underlying cryptic component not explainable by the known genomic features. Mutation rates become progressively less similar in more distant species, and these changes are partially explainable by changes in the local genomic features of orthologous regions, most importantly, in the recombination rate. However, they are much more rapid, implying that the cryptic component underlying the mutation rate is more ephemeral than the known genomic features. These findings shed light on the determinants of mutation rate evolution. local mutation rate, molecular evolution, recombination rate. © The Author 2017. Published by Oxford University Press on behalf of the Society for Molecular Biology and Evolution.
Comparative Methylome Analyses Identify Epigenetic Regulatory Loci of Human Brain Evolution.
Mendizabal, Isabel; Shi, Lei; Keller, Thomas E; Konopka, Genevieve; Preuss, Todd M; Hsieh, Tzung-Fu; Hu, Enzhi; Zhang, Zhe; Su, Bing; Yi, Soojin V
2016-11-01
How do epigenetic modifications change across species and how do these modifications affect evolution? These are fundamental questions at the forefront of our evolutionary epigenomic understanding. Our previous work investigated human and chimpanzee brain methylomes, but it was limited by the lack of outgroup data which is critical for comparative (epi)genomic studies. Here, we compared whole genome DNA methylation maps from brains of humans, chimpanzees and also rhesus macaques (outgroup) to elucidate DNA methylation changes during human brain evolution. Moreover, we validated that our approach is highly robust by further examining 38 human-specific DMRs using targeted deep genomic and bisulfite sequencing in an independent panel of 37 individuals from five primate species. Our unbiased genome-scan identified human brain differentially methylated regions (DMRs), irrespective of their associations with annotated genes. Remarkably, over half of the newly identified DMRs locate in intergenic regions or gene bodies. Nevertheless, their regulatory potential is on par with those of promoter DMRs. An intriguing observation is that DMRs are enriched in active chromatin loops, suggesting human-specific evolutionary remodeling at a higher-order chromatin structure. These findings indicate that there is substantial reprogramming of epigenomic landscapes during human brain evolution involving noncoding regions. © The Author 2016. Published by Oxford University Press on behalf of the Society for Molecular Biology and Evolution.
Insights into hominid evolution from the gorilla genome sequence
Scally, Aylwyn; Dutheil, Julien Y.; Hillier, LaDeana W.; Jordan, Greg E.; Goodhead, Ian; Herrero, Javier; Hobolth, Asger; Lappalainen, Tuuli; Mailund, Thomas; Marques-Bonet, Tomas; McCarthy, Shane; Montgomery, Stephen H.; Schwalie, Petra C.; Tang, Y. Amy; Ward, Michelle C.; Xue, Yali; Yngvadottir, Bryndis; Alkan, Can; Andersen, Lars N.; Ayub, Qasim; Ball, Edward V.; Beal, Kathryn; Bradley, Brenda J.; Chen, Yuan; Clee, Chris M.; Fitzgerald, Stephen; Graves, Tina A.; Gu, Yong; Heath, Paul; Heger, Andreas; Karakoc, Emre; Kolb-Kokocinski, Anja; Laird, Gavin K.; Lunter, Gerton; Meader, Stephen; Mort, Matthew; Mullikin, James C.; Munch, Kasper; O’Connor, Timothy D.; Phillips, Andrew D.; Prado-Martinez, Javier; Rogers, Anthony S.; Sajjadian, Saba; Schmidt, Dominic; Shaw, Katy; Simpson, Jared T.; Stenson, Peter D.; Turner, Daniel J.; Vigilant, Linda; Vilella, Albert J.; Whitener, Weldon; Zhu, Baoli; Cooper, David N.; de Jong, Pieter; Dermitzakis, Emmanouil T.; Eichler, Evan E.; Flicek, Paul; Goldman, Nick; Mundy, Nicholas I.; Ning, Zemin; Odom, Duncan T.; Ponting, Chris P.; Quail, Michael A.; Ryder, Oliver A.; Searle, Stephen M.; Warren, Wesley C.; Wilson, Richard K.; Schierup, Mikkel H.; Rogers, Jane; Tyler-Smith, Chris; Durbin, Richard
2012-01-01
Summary Gorillas are humans’ closest living relatives after chimpanzees, and are of comparable importance for the study of human origins and evolution. Here we present the assembly and analysis of a genome sequence for the western lowland gorilla, and compare the whole genomes of all extant great ape genera. We propose a synthesis of genetic and fossil evidence consistent with placing the human-chimpanzee and human-chimpanzee-gorilla speciation events at approximately 6 and 10 million years ago (Mya). In 30% of the genome, gorilla is closer to human or chimpanzee than the latter are to each other; this is rarer around coding genes, indicating pervasive selection throughout great ape evolution, and has functional consequences in gene expression. A comparison of protein coding genes reveals approximately 500 genes showing accelerated evolution on each of the gorilla, human and chimpanzee lineages, and evidence for parallel acceleration, particularly of genes involved in hearing. We also compare the western and eastern gorilla species, estimating an average sequence divergence time 1.75 million years ago, but with evidence for more recent genetic exchange and a population bottleneck in the eastern species. The use of the genome sequence in these and future analyses will promote a deeper understanding of great ape biology and evolution. PMID:22398555
Comparative Methylome Analyses Identify Epigenetic Regulatory Loci of Human Brain Evolution
Mendizabal, Isabel; Shi, Lei; Keller, Thomas E.; Konopka, Genevieve; Preuss, Todd M.; Hsieh, Tzung-Fu; Hu, Enzhi; Zhang, Zhe; Su, Bing; Yi, Soojin V.
2016-01-01
How do epigenetic modifications change across species and how do these modifications affect evolution? These are fundamental questions at the forefront of our evolutionary epigenomic understanding. Our previous work investigated human and chimpanzee brain methylomes, but it was limited by the lack of outgroup data which is critical for comparative (epi)genomic studies. Here, we compared whole genome DNA methylation maps from brains of humans, chimpanzees and also rhesus macaques (outgroup) to elucidate DNA methylation changes during human brain evolution. Moreover, we validated that our approach is highly robust by further examining 38 human-specific DMRs using targeted deep genomic and bisulfite sequencing in an independent panel of 37 individuals from five primate species. Our unbiased genome-scan identified human brain differentially methylated regions (DMRs), irrespective of their associations with annotated genes. Remarkably, over half of the newly identified DMRs locate in intergenic regions or gene bodies. Nevertheless, their regulatory potential is on par with those of promoter DMRs. An intriguing observation is that DMRs are enriched in active chromatin loops, suggesting human-specific evolutionary remodeling at a higher-order chromatin structure. These findings indicate that there is substantial reprogramming of epigenomic landscapes during human brain evolution involving noncoding regions. PMID:27563052
A Molecular Phylogeny of Living Primates
Perelman, Polina; Johnson, Warren E.; Roos, Christian; Seuánez, Hector N.; Horvath, Julie E.; Moreira, Miguel A. M.; Kessing, Bailey; Pontius, Joan; Roelke, Melody; Rumpler, Yves; Schneider, Maria Paula C.; Silva, Artur; O'Brien, Stephen J.; Pecon-Slattery, Jill
2011-01-01
Comparative genomic analyses of primates offer considerable potential to define and understand the processes that mold, shape, and transform the human genome. However, primate taxonomy is both complex and controversial, with marginal unifying consensus of the evolutionary hierarchy of extant primate species. Here we provide new genomic sequence (∼8 Mb) from 186 primates representing 61 (∼90%) of the described genera, and we include outgroup species from Dermoptera, Scandentia, and Lagomorpha. The resultant phylogeny is exceptionally robust and illuminates events in primate evolution from ancient to recent, clarifying numerous taxonomic controversies and providing new data on human evolution. Ongoing speciation, reticulate evolution, ancient relic lineages, unequal rates of evolution, and disparate distributions of insertions/deletions among the reconstructed primate lineages are uncovered. Our resolution of the primate phylogeny provides an essential evolutionary framework with far-reaching applications including: human selection and adaptation, global emergence of zoonotic diseases, mammalian comparative genomics, primate taxonomy, and conservation of endangered species. PMID:21436896
Yuan, Bo; Liu, Pengfei; Gupta, Aditya; Beck, Christine R.; Tejomurtula, Anusha; Campbell, Ian M.; Gambin, Tomasz; Simmons, Alexandra D.; Withers, Marjorie A.; Harris, R. Alan; Rogers, Jeffrey; Schwartz, David C.; Lupski, James R.
2015-01-01
Many loci in the human genome harbor complex genomic structures that can result in susceptibility to genomic rearrangements leading to various genomic disorders. Nephronophthisis 1 (NPHP1, MIM# 256100) is an autosomal recessive disorder that can be caused by defects of NPHP1; the gene maps within the human 2q13 region where low copy repeats (LCRs) are abundant. Loss of function of NPHP1 is responsible for approximately 85% of the NPHP1 cases—about 80% of such individuals carry a large recurrent homozygous NPHP1 deletion that occurs via nonallelic homologous recombination (NAHR) between two flanking directly oriented ~45 kb LCRs. Published data revealed a non-pathogenic inversion polymorphism involving the NPHP1 gene flanked by two inverted ~358 kb LCRs. Using optical mapping and array-comparative genomic hybridization, we identified three potential novel structural variant (SV) haplotypes at the NPHP1 locus that may protect a haploid genome from the NPHP1 deletion. Inter-species comparative genomic analyses among primate genomes revealed massive genomic changes during evolution. The aggregated data suggest that dynamic genomic rearrangements occurred historically within the NPHP1 locus and generated SV haplotypes observed in the human population today, which may confer differential susceptibility to genomic instability and the NPHP1 deletion within a personal genome. Our study documents diverse SV haplotypes at a complex LCR-laden human genomic region. Comparative analyses provide a model for how this complex region arose during primate evolution, and studies among humans suggest that intra-species polymorphism may potentially modulate an individual’s susceptibility to acquiring disease-associated alleles. PMID:26641089
Reconstruction and evolutionary history of eutherian chromosomes
Kim, Jaebum; Auvil, Loretta; Capitanu, Boris; Larkin, Denis M.; Ma, Jian; Lewin, Harris A.
2017-01-01
Whole-genome assemblies of 19 placental mammals and two outgroup species were used to reconstruct the order and orientation of syntenic fragments in chromosomes of the eutherian ancestor and six other descendant ancestors leading to human. For ancestral chromosome reconstructions, we developed an algorithm (DESCHRAMBLER) that probabilistically determines the adjacencies of syntenic fragments using chromosome-scale and fragmented genome assemblies. The reconstructed chromosomes of the eutherian, boreoeutherian, and euarchontoglires ancestor each included >80% of the entire length of the human genome, whereas reconstructed chromosomes of the most recent common ancestor of simians, catarrhini, great apes, and humans and chimpanzees included >90% of human genome sequence. These high-coverage reconstructions permitted reliable identification of chromosomal rearrangements over ∼105 My of eutherian evolution. Orangutan was found to have eight chromosomes that were completely conserved in homologous sequence order and orientation with the eutherian ancestor, the largest number for any species. Ruminant artiodactyls had the highest frequency of intrachromosomal rearrangements, and interchromosomal rearrangements dominated in murid rodents. A total of 162 chromosomal breakpoints in evolution of the eutherian ancestral genome to the human genome were identified; however, the rate of rearrangements was significantly lower (0.80/My) during the first ∼60 My of eutherian evolution, then increased to greater than 2.0/My along the five primate lineages studied. Our results significantly expand knowledge of eutherian genome evolution and will facilitate greater understanding of the role of chromosome rearrangements in adaptation, speciation, and the etiology of inherited and spontaneously occurring diseases. PMID:28630326
Pace, John K; Sen, Shurjo K; Batzer, Mark A; Feschotte, Cédric
2009-05-01
DNA double-strand breaks (DSBs) are a common form of cellular damage that can lead to cell death if not repaired promptly. Experimental systems have shown that DSB repair in eukaryotic cells is often imperfect and may result in the insertion of extra chromosomal DNA or the duplication of existing DNA at the breakpoint. These events are thought to be a source of genomic instability and human diseases, but it is unclear whether they have contributed significantly to genome evolution. Here we developed an innovative computational pipeline that takes advantage of the repetitive structure of genomes to detect repair-mediated duplication events (RDs) that occurred in the germline and created insertions of at least 50 bp of genomic DNA. Using this pipeline we identified over 1,000 probable RDs in the human genome. Of these, 824 were intra-chromosomal, closely linked duplications of up to 619 bp bearing the hallmarks of the synthesis-dependent strand-annealing repair pathway. This mechanism has duplicated hundreds of sequences predicted to be functional in the human genome, including exons, UTRs, intron splice sites and transcription factor binding sites. Dating of the duplication events using comparative genomics and experimental validation revealed that the mechanism has operated continuously but with decreasing intensity throughout primate evolution. The mechanism has produced species-specific duplications in all primate species surveyed and is contributing to genomic variation among humans. Finally, we show that RDs have also occurred, albeit at a lower frequency, in non-primate mammals and other vertebrates, indicating that this mechanism has been an important force shaping vertebrate genome evolution.
Cis-regulatory Elements and Human Evolution
Siepel, Adam
2014-01-01
Modification of gene regulation has long been considered an important force in human evolution, particularly through changes to cis-regulatory elements (CREs) that function in transcriptional regulation. For decades, however, the study of cis-regulatory evolution was severely limited by the available data. New data sets describing the locations of CREs and genetic variation within and between species have now made it possible to study CRE evolution much more directly on a genome-wide scale. Here, we review recent research on the evolution of CREs in humans based on large-scale genomic data sets. We consider inferences based on primate divergence, human polymorphism, and combinations of divergence and polymorphism. We then consider “new frontiers” in this field stemming from recent research on transcriptional regulation. PMID:25218861
Li, Jian; Harris, R. Alan; Cheung, Sau Wai; Coarfa, Cristian; Jeong, Mira; Goodell, Margaret A.; White, Lisa D.; Patel, Ankita; Kang, Sung-Hae; Shaw, Chad; Chinault, A. Craig; Gambin, Tomasz; Gambin, Anna; Lupski, James R.; Milosavljevic, Aleksandar
2012-01-01
The hotspots of structural polymorphisms and structural mutability in the human genome remain to be explained mechanistically. We examine associations of structural mutability with germline DNA methylation and with non-allelic homologous recombination (NAHR) mediated by low-copy repeats (LCRs). Combined evidence from four human sperm methylome maps, human genome evolution, structural polymorphisms in the human population, and previous genomic and disease studies consistently points to a strong association of germline hypomethylation and genomic instability. Specifically, methylation deserts, the ∼1% fraction of the human genome with the lowest methylation in the germline, show a tenfold enrichment for structural rearrangements that occurred in the human genome since the branching of chimpanzee and are highly enriched for fast-evolving loci that regulate tissue-specific gene expression. Analysis of copy number variants (CNVs) from 400 human samples identified using a custom-designed array comparative genomic hybridization (aCGH) chip, combined with publicly available structural variation data, indicates that association of structural mutability with germline hypomethylation is comparable in magnitude to the association of structural mutability with LCR–mediated NAHR. Moreover, rare CNVs occurring in the genomes of individuals diagnosed with schizophrenia, bipolar disorder, and developmental delay and de novo CNVs occurring in those diagnosed with autism are significantly more concentrated within hypomethylated regions. These findings suggest a new connection between the epigenome, selective mutability, evolution, and human disease. PMID:22615578
The human genome and the human control of natural evolution.
Sakamoto, H
2001-10-01
Recent advances in research on the Human Genome are provoking many critical problems in the global policy regarding the future status of human beings as well as in that of the whole life system on the earth, and consequently, these advances provoke the serious bioethical and philosophical questions. Firstly, how can we comprehend that we are going to have the complete technology to manipulate the system of the human genome and other non-human genomes? Though no science and technology can be complete, we will, I believe, take possession of an almost complete gene technology in the early stage of the next Century. Gene technology will soon fall into the hands of human beings instead of rendering in the province of God. Secondly, which gene technologies will we actually realize and utilize in the early stages of the 21st Century? Most probably, we will adopt these technologies to health care to treat some apparent bodily diseases, for instance, cancer, hemophilia, ADA deficiency, and so forth, and sooner or later we will adopt gene therapy to germ lines, which, in the long run, suggests the possibility of a future "artificial evolution" instead of the "natural evolution" of the past. Thirdly, how is the new concept of "artificial evolution" justified ethically? I believe this kind of manmade evolution is the only way for human beings to survive into the future global environment. There cannot be any serious ethical objection against the idea of artificial evolution. Fourthly, what is the background philosophy for the concept of "artificial evolution"? I will discuss the nature of modern European humanism with individual dignity and fundamental human rights which has led the philosophy of modern culture and modern society, and I will conclude by suggesting that we should abolish an essential part of modern humanism and newly devise some alternative philosophy to fit the new Millennium.
Viral symbiosis and the holobiontic nature of the human genome.
Ryan, Francis Patrick
2016-01-01
The human genome is a holobiontic union of the mammalian nuclear genome, the mitochondrial genome and large numbers of endogenized retroviral genomes. This article defines and explores this symbiogenetic pattern of evolution, looking at the implications for human genetics, epigenetics, embryogenesis, physiology and the pathogenesis of inborn errors of metabolism and many other diseases. © 2016 APMIS. Published by John Wiley & Sons Ltd.
Kim, Dong Seon; Hahn, Yoonsoo
2012-11-13
Evolution of splice sites is a well-known phenomenon that results in transcript diversity during human evolution. Many novel splice sites are derived from repetitive elements and may not contribute to protein products. Here, we analyzed annotated human protein-coding exons and identified human-specific splice sites that arose after the human-chimpanzee divergence. We analyzed multiple alignments of the annotated human protein-coding exons and their respective orthologous mammalian genome sequences to identify 85 novel splice sites (50 splice acceptors and 35 donors) in the human genome. The novel protein-coding exons, which are expressed either constitutively or alternatively, produce novel protein isoforms by insertion, deletion, or frameshift. We found three cases in which the human-specific isoform conferred novel molecular function in the human cells: the human-specific IMUP protein isoform induces apoptosis of the trophoblast and is implicated in pre-eclampsia; the intronization of a part of SMOX gene exon produces inactive spermine oxidase; the human-specific NUB1 isoform shows reduced interaction with ubiquitin-like proteins, possibly affecting ubiquitin pathways. Although the generation of novel protein isoforms does not equate to adaptive evolution, we propose that these cases are useful candidates for a molecular functional study to identify proteomic changes that might bring about novel phenotypes during human evolution.
Can a few non‐coding mutations make a human brain?
Franchini, Lucía F.
2015-01-01
The recent finding that the human version of a neurodevelopmental enhancer of the Wnt receptor Frizzled 8 (FZD8) gene alters neural progenitor cell cycle timing and brain size is a step forward to understanding human brain evolution. The human brain is distinctive in terms of its cognitive abilities as well as its susceptibility to neurological disease. Identifying which of the millions of genomic changes that occurred during human evolution led to these and other uniquely human traits is extremely challenging. Recent studies have demonstrated that many of the fastest evolving regions of the human genome function as gene regulatory enhancers during embryonic development and that the human‐specific mutations in them might alter expression patterns. However, elucidating molecular and cellular effects of sequence or expression pattern changes is a major obstacle to discovering the genetic bases of the evolution of our species. There is much work to do before human‐specific genetic and genomic changes are linked to complex human traits. Also watch the Video Abstract. PMID:26350501
Random genetic drift, natural selection, and noise in human cranial evolution.
Roseman, Charles C
2016-08-01
This study assesses the extent to which relationships among groups complicate comparative studies of adaptation in recent human cranial variation and the extent to which departures from neutral additive models of evolution hinder the reconstruction of population relationships among groups using cranial morphology. Using a maximum likelihood evolutionary model fitting approach and a mixed population genomic and cranial data set, I evaluate the relative fits of several widely used models of human cranial evolution. Moreover, I compare the goodness of fit of models of cranial evolution constrained by genomic variation to test hypotheses about population specific departures from neutrality. Models from population genomics are much better fits to cranial variation than are traditional models from comparative human biology. There is not enough evolutionary information in the cranium to reconstruct much of recent human evolution but the influence of population history on cranial variation is strong enough to cause comparative studies of adaptation serious difficulties. Deviations from a model of random genetic drift along a tree-like population history show the importance of environmental effects, gene flow, and/or natural selection on human cranial variation. Moreover, there is a strong signal of the effect of natural selection or an environmental factor on a group of humans from Siberia. The evolution of the human cranium is complex and no one evolutionary process has prevailed at the expense of all others. A holistic unification of phenome, genome, and environmental context, gives us a strong point of purchase on these problems, which is unavailable to any one traditional approach alone. Am J Phys Anthropol 160:582-592, 2016. © 2016 Wiley Periodicals, Inc. © 2016 Wiley Periodicals, Inc.
Quantifying the Number of Independent Organelle DNA Insertions in Genome Evolution and Human Health.
Hazkani-Covo, Einat; Martin, William F
2017-05-01
Fragments of organelle genomes are often found as insertions in nuclear DNA. These fragments of mitochondrial DNA (numts) and plastid DNA (nupts) are ubiquitous components of eukaryotic genomes. They are, however, often edited out during the genome assembly process, leading to systematic underestimation of their frequency. Numts and nupts, once inserted, can become further fragmented through subsequent insertion of mobile elements or other recombinational events that disrupt the continuity of the inserted sequence relative to the genuine organelle DNA copy. Because numts and nupts are typically identified through sequence comparison tools such as BLAST, disruption of insertions into smaller fragments can lead to systematic overestimation of numt and nupt frequencies. Accurate identification of numts and nupts is important, however, both for better understanding of their role during evolution, and for monitoring their increasingly evident role in human disease. Human populations are polymorphic for 141 numt loci, five numts are causal to genetic disease, and cancer genomic studies are revealing an abundance of numts associated with tumor progression. Here, we report investigation of salient parameters involved in obtaining accurate estimates of numt and nupt numbers in genome sequence data. Numts and nupts from 44 sequenced eukaryotic genomes reveal lineage-specific differences in the number, relative age and frequency of insertional events as well as lineage-specific dynamics of their postinsertional fragmentation. Our findings outline the main technical parameters influencing accurate identification and frequency estimation of numts in genomic studies pertinent to both evolution and human health. © The Author 2017. Published by Oxford University Press on behalf of the Society for Molecular Biology and Evolution.
Interpreting Mammalian Evolution using Fugu Genome Comparisons
DOE Office of Scientific and Technical Information (OSTI.GOV)
Stubbs, L; Ovcharenko, I; Loots, G G
2004-04-02
Comparative sequence analysis of the human and the pufferfish Fugu rubripes (fugu) genomes has revealed several novel functional coding and noncoding regions in the human genome. In particular, the fugu genome has been extremely valuable for identifying transcriptional regulatory elements in human loci harboring unusually high levels of evolutionary conservation to rodent genomes. In such regions, the large evolutionary distance between human and fishes provides an additional filter through which functional noncoding elements can be detected with high efficiency.
The genomics of selection in dogs and the parallel evolution between dogs and humans.
Wang, Guo-dong; Zhai, Weiwei; Yang, He-chuan; Fan, Ruo-xi; Cao, Xue; Zhong, Li; Wang, Lu; Liu, Fei; Wu, Hong; Cheng, Lu-guang; Poyarkov, Andrei D; Poyarkov, Nikolai A; Tang, Shu-sheng; Zhao, Wen-ming; Gao, Yun; Lv, Xue-mei; Irwin, David M; Savolainen, Peter; Wu, Chung-I; Zhang, Ya-ping
2013-01-01
The genetic bases of demographic changes and artificial selection underlying domestication are of great interest in evolutionary biology. Here we perform whole-genome sequencing of multiple grey wolves, Chinese indigenous dogs and dogs of diverse breeds. Demographic analysis show that the split between wolves and Chinese indigenous dogs occurred 32,000 years ago and that the subsequent bottlenecks were mild. Therefore, dogs may have been under human selection over a much longer time than previously concluded, based on molecular data, perhaps by initially scavenging with humans. Population genetic analysis identifies a list of genes under positive selection during domestication, which overlaps extensively with the corresponding list of positively selected genes in humans. Parallel evolution is most apparent in genes for digestion and metabolism, neurological process and cancer. Our study, for the first time, draws together humans and dogs in their recent genomic evolution.
New genes contribute to genetic and phenotypic novelties in human evolution
Zhang, Yong E.; Long, Manyuan
2014-01-01
New genes in human genomes have been found relevant in evolution and biology of humans. It was conservatively estimated that the human genome encodes more than 300 human-specific genes and 1,000 primate-specific genes. These new arrivals appear to be implicated in brain function and male reproduction. Surprisingly, increasing evidence indicates that they may also bring negative pleiotropic effects, while assuming various possible biological functions as sources of phenotypic novelties, suggesting a non-progressive route for functional evolution. Similar to these fixed new genes, polymorphic new genes were found to contribute to functional evolution within species, e.g. with respect to digestion or disease resistance, revealing that new genes can acquire new or diverged functions in its initial stage as prototypic genes. These progresses have provided new opportunity to explore the genetic basis of human biology and human evolutionary history in a new dimension. PMID:25218862
2012-01-01
Background Francisella is a genus of gram-negative bacterium highly virulent in fishes and human where F. tularensis is causing the serious disease tularaemia in human. Recently Francisella species have been reported to cause mortality in aquaculture species like Atlantic cod and tilapia. We have completed the sequencing and draft assembly of the Francisella noatunensis subsp. orientalisToba04 strain isolated from farmed Tilapia. Compared to other available Francisella genomes, it is most similar to the genome of Francisella philomiragia subsp. philomiragia, a free-living bacterium not virulent to human. Results The genome is rearranged compared to the available Francisella genomes even though we found no IS-elements in the genome. Nearly 16% percent of the predicted ORFs are pseudogenes. Computational pathway analysis indicates that a number of the metabolic pathways are disrupted due to pseudogenes. Comparing the novel genome with other available Francisella genomes, we found around 2.5% of unique genes present in Francisella noatunensis subsp. orientalis Toba04 and a list of genes uniquely present in the human-pathogenic Francisella subspecies. Most of these genes might have transferred from bacterial species through horizontal gene transfer. Comparative analysis between human and fish pathogen also provide insights into genes responsible for pathogenecity. Our analysis of pseudogenes indicates that the evolution of Francisella subspecies’s pseudogenes from Tilapia is old with large number of pseudogenes having more than one inactivating mutation. Conclusions The fish pathogen has lost non-essential genes some time ago. Evolutionary analysis of the Francisella genomes, strongly suggests that human and fish pathogenic Francisella species have evolved independently from free-living metabolically competent Francisella species. These findings will contribute to understanding the evolution of Francisella species and pathogenesis. PMID:23131096
2012-01-01
Background Evolution of splice sites is a well-known phenomenon that results in transcript diversity during human evolution. Many novel splice sites are derived from repetitive elements and may not contribute to protein products. Here, we analyzed annotated human protein-coding exons and identified human-specific splice sites that arose after the human-chimpanzee divergence. Results We analyzed multiple alignments of the annotated human protein-coding exons and their respective orthologous mammalian genome sequences to identify 85 novel splice sites (50 splice acceptors and 35 donors) in the human genome. The novel protein-coding exons, which are expressed either constitutively or alternatively, produce novel protein isoforms by insertion, deletion, or frameshift. We found three cases in which the human-specific isoform conferred novel molecular function in the human cells: the human-specific IMUP protein isoform induces apoptosis of the trophoblast and is implicated in pre-eclampsia; the intronization of a part of SMOX gene exon produces inactive spermine oxidase; the human-specific NUB1 isoform shows reduced interaction with ubiquitin-like proteins, possibly affecting ubiquitin pathways. Conclusions Although the generation of novel protein isoforms does not equate to adaptive evolution, we propose that these cases are useful candidates for a molecular functional study to identify proteomic changes that might bring about novel phenotypes during human evolution. PMID:23148531
Single genome retrieval of context-dependent variability in mutation rates for human germline.
Sahakyan, Aleksandr B; Balasubramanian, Shankar
2017-01-13
Accurate knowledge of the core components of substitution rates is of vital importance to understand genome evolution and dynamics. By performing a single-genome and direct analysis of 39,894 retrotransposon remnants, we reveal sequence context-dependent germline nucleotide substitution rates for the human genome. The rates are characterised through rate constants in a time-domain, and are made available through a dedicated program (Trek) and a stand-alone database. Due to the nature of the method design and the imposed stringency criteria, we expect our rate constants to be good estimates for the rates of spontaneous mutations. Benefiting from such data, we study the short-range nucleotide (up to 7-mer) organisation and the germline basal substitution propensity (BSP) profile of the human genome; characterise novel, CpG-independent, substitution prone and resistant motifs; confirm a decreased tendency of moieties with low BSP to undergo somatic mutations in a number of cancer types; and, produce a Trek-based estimate of the overall mutation rate in human. The extended set of rate constants we report may enrich our resources and help advance our understanding of genome dynamics and evolution, with possible implications for the role of spontaneous mutations in the emergence of pathological genotypes and neutral evolution of proteomes.
The Nature and Evolution of Genomic Diversity in the Mycobacterium tuberculosis Complex.
Brites, Daniela; Gagneux, Sebastien
2017-01-01
The Mycobacterium tuberculosis Complex (MTBC) consists of a clonal group of several mycobacterial lineages pathogenic to a range of different mammalian hosts. In this chapter, we discuss the origins and the evolutionary forces shaping the genomic diversity of the human-adapted MTBC. Advances in whole-genome sequencing have brought invaluable insights into the macro-evolution of the MTBC, and the biogeographical distribution of the different MTBC lineages, the phylogenetic relationships between these lineages. Moreover, micro-evolutionary processes start to be better understood, including those influencing bacterial mutation rates and those governing the fate of new mutations emerging within patients during treatment. Current genomic and epidemiological evidence reflect the fact that, through ecological specialization, the MTBC affecting humans became an obligate and extremely well-adapted human pathogen. Identifying the adaptive traits of human-adapted MTBC and unraveling the bacterial loci that interact with human genomic variation might help identify new targets for developing better vaccines and designing more effective treatments.
Neolithic and medieval virus genomes reveal complex evolution of hepatitis B
Key, Felix M; Kühnert, Denise; Bosse, Esther; Immel, Alexander; Rinne, Christoph; Kornell, Sabin-Christin; Yepes, Diego; Franzenburg, Sören; Heyne, Henrike O; Meier, Thomas; Lösch, Sandra; Meller, Harald; Friederich, Susanne; Nicklisch, Nicole; Alt, Kurt W; Schreiber, Stefan; Tholey, Andreas; Herbig, Alexander; Nebel, Almut
2018-01-01
The hepatitis B virus (HBV) is one of the most widespread human pathogens known today, yet its origin and evolutionary history are still unclear and controversial. Here, we report the analysis of three ancient HBV genomes recovered from human skeletons found at three different archaeological sites in Germany. We reconstructed two Neolithic and one medieval HBV genome by de novo assembly from shotgun DNA sequencing data. Additionally, we observed HBV-specific peptides using paleo-proteomics. Our results demonstrated that HBV has circulated in the European population for at least 7000 years. The Neolithic HBV genomes show a high genomic similarity to each other. In a phylogenetic network, they do not group with any human-associated HBV genome and are most closely related to those infecting African non-human primates. The ancient viruses appear to represent distinct lineages that have no close relatives today and possibly went extinct. Our results reveal the great potential of ancient DNA from human skeletons in order to study the long-time evolution of blood borne viruses. PMID:29745896
Neolithic and Medieval virus genomes reveal complex evolution of Hepatitis B.
Krause-Kyora, Ben; Susat, Julian; Key, Felix M; Kühnert, Denise; Bosse, Esther; Immel, Alexander; Rinne, Christoph; Kornell, Sabin-Christin; Yepes, Diego; Franzenburg, Sören; Heyne, Henrike O; Meier, Thomas; Lösch, Sandra; Meller, Harald; Friederich, Susanne; Nicklisch, Nicole; Alt, Kurt W; Schreiber, Stefan; Tholey, Andreas; Herbig, Alexander; Nebel, Almut; Krause, Johannes
2018-05-10
The hepatitis B virus (HBV) is one of the most widespread human pathogens known today, yet its origin and evolutionary history are still unclear and controversial. Here, we report the analysis of three ancient HBV genomes recovered from human skeletons found at three different archaeological sites in Germany. We reconstructed two Neolithic and one medieval HBV genomes by de novo assembly from shotgun DNA sequencing data. Additionally, we observed HBV-specific peptides using paleo-proteomics. Our results show that HBV circulates in the European population for at least 7000 years. The Neolithic HBV genomes show a high genomic similarity to each other. In a phylogenetic network, they do not group with any human-associated HBV genome and are most closely related to those infecting African non-human primates. These ancient virus forms appear to represent distinct lineages that have no close relatives today and possibly went extinct. Our results reveal the great potential of ancient DNA from human skeletons in order to study the long-time evolution of blood borne viruses. © 2018, Krause-Kyora et al.
Evolutionary interrogation of human biology in well-annotated genomic framework of rhesus macaque.
Zhang, Shi-Jian; Liu, Chu-Jun; Yu, Peng; Zhong, Xiaoming; Chen, Jia-Yu; Yang, Xinzhuang; Peng, Jiguang; Yan, Shouyu; Wang, Chenqu; Zhu, Xiaotong; Xiong, Jingwei; Zhang, Yong E; Tan, Bertrand Chin-Ming; Li, Chuan-Yun
2014-05-01
With genome sequence and composition highly analogous to human, rhesus macaque represents a unique reference for evolutionary studies of human biology. Here, we developed a comprehensive genomic framework of rhesus macaque, the RhesusBase2, for evolutionary interrogation of human genes and the associated regulations. A total of 1,667 next-generation sequencing (NGS) data sets were processed, integrated, and evaluated, generating 51.2 million new functional annotation records. With extensive NGS annotations, RhesusBase2 refined the fine-scale structures in 30% of the macaque Ensembl transcripts, reporting an accurate, up-to-date set of macaque gene models. On the basis of these annotations and accurate macaque gene models, we further developed an NGS-oriented Molecular Evolution Gateway to access and visualize macaque annotations in reference to human orthologous genes and associated regulations (www.rhesusbase.org/molEvo). We highlighted the application of this well-annotated genomic framework in generating hypothetical link of human-biased regulations to human-specific traits, by using mechanistic characterization of the DIEXF gene as an example that provides novel clues to the understanding of digestive system reduction in human evolution. On a global scale, we also identified a catalog of 9,295 human-biased regulatory events, which may represent novel elements that have a substantial impact on shaping human transcriptome and possibly underpin recent human phenotypic evolution. Taken together, we provide an NGS data-driven, information-rich framework that will broadly benefit genomics research in general and serves as an important resource for in-depth evolutionary studies of human biology.
Liu, Yu; Koyutürk, Mehmet; Maxwell, Sean; Xiang, Min; Veigl, Martina; Cooper, Richard S; Tayo, Bamidele O; Li, Li; LaFramboise, Thomas; Wang, Zhenghe; Zhu, Xiaofeng; Chance, Mark R
2014-08-16
Sequences up to several megabases in length have been found to be present in individual genomes but absent in the human reference genome. These sequences may be common in populations, and their absence in the reference genome may indicate rare variants in the genomes of individuals who served as donors for the human genome project. As the reference genome is used in probe design for microarray technology and mapping short reads in next generation sequencing (NGS), this missing sequence could be a source of bias in functional genomic studies and variant analysis. One End Anchor (OEA) and/or orphan reads from paired-end sequencing have been used to identify novel sequences that are absent in reference genome. However, there is no study to investigate the distribution, evolution and functionality of those sequences in human populations. To systematically identify and study the missing common sequences (micSeqs), we extended the previous method by pooling OEA reads from large number of individuals and applying strict filtering methods to remove false sequences. The pipeline was applied to data from phase 1 of the 1000 Genomes Project. We identified 309 micSeqs that are present in at least 1% of the human population, but absent in the reference genome. We confirmed 76% of these 309 micSeqs by comparison to other primate genomes, individual human genomes, and gene expression data. Furthermore, we randomly selected fifteen micSeqs and confirmed their presence using PCR validation in 38 additional individuals. Functional analysis using published RNA-seq and ChIP-seq data showed that eleven micSeqs are highly expressed in human brain and three micSeqs contain transcription factor (TF) binding regions, suggesting they are functional elements. In addition, the identified micSeqs are absent in non-primates and show dynamic acquisition during primate evolution culminating with most micSeqs being present in Africans, suggesting some micSeqs may be important sources of human diversity. 76% of micSeqs were confirmed by a comparative genomics approach. Fourteen micSeqs are expressed in human brain or contain TF binding regions. Some micSeqs are primate-specific, conserved and may play a role in the evolution of primates.
The Evolution of Host Specialization in the Vertebrate Gut Symbiont Lactobacillus reuteri
DOE Office of Scientific and Technical Information (OSTI.GOV)
Frese, Steven A.; Benson, Andrew K.; Tannock, Gerald W.
Recent research has provided mechanistic insight into the important contributions of the gut microbiota to vertebrate biology, but questions remain about the evolutionary processes that have shaped this symbiosis. In the present study, we showed in experiments with gnotobiotic mice that the evolution of Lactobacillus reuteri with rodents resulted in the emergence of host specialization. To identify genomic events marking adaptations to the murine host, we compared the genome of the rodent isolate L. reuteri 100-23 with that of the human isolate L. reuteri F275, and we identified hundreds of genes that were specific to each strain. In order tomore » differentiate true host-specific genome content from strain-level differences, comparative genome hybridizations were performed to query 57 L. reuteri strains originating from six different vertebrate hosts in combination with genome sequence comparisons of nine strains encompassing five phylogenetic lineages of the species. This approach revealed that rodent strains, although showing a high degree of genomic plasticity, possessed a specific genome inventory that was rare or absent in strains from other vertebrate hosts. The distinct genome content of L. reuteri lineages reflected the niche characteristics in the gastrointestinal tracts of their respective hosts, and inactivation of seven out of eight representative rodent-specific genes in L. reuteri 100-23 resulted in impaired ecological performance in the gut of mice. The comparative genomic analyses suggested fundamentally different trends of genome evolution in rodent and human L. reuteri populations, with the former possessing a large and adaptable pan-genome while the latter being subjected to a process of reductive evolution. In conclusion, this study provided experimental evidence and a molecular basis for the evolution of host specificity in a vertebrate gut symbiont, and it identified genomic events that have shaped this process.« less
Quantifying the Number of Independent Organelle DNA Insertions in Genome Evolution and Human Health
Martin, William F.
2017-01-01
Fragments of organelle genomes are often found as insertions in nuclear DNA. These fragments of mitochondrial DNA (numts) and plastid DNA (nupts) are ubiquitous components of eukaryotic genomes. They are, however, often edited out during the genome assembly process, leading to systematic underestimation of their frequency. Numts and nupts, once inserted, can become further fragmented through subsequent insertion of mobile elements or other recombinational events that disrupt the continuity of the inserted sequence relative to the genuine organelle DNA copy. Because numts and nupts are typically identified through sequence comparison tools such as BLAST, disruption of insertions into smaller fragments can lead to systematic overestimation of numt and nupt frequencies. Accurate identification of numts and nupts is important, however, both for better understanding of their role during evolution, and for monitoring their increasingly evident role in human disease. Human populations are polymorphic for 141 numt loci, five numts are causal to genetic disease, and cancer genomic studies are revealing an abundance of numts associated with tumor progression. Here, we report investigation of salient parameters involved in obtaining accurate estimates of numt and nupt numbers in genome sequence data. Numts and nupts from 44 sequenced eukaryotic genomes reveal lineage-specific differences in the number, relative age and frequency of insertional events as well as lineage-specific dynamics of their postinsertional fragmentation. Our findings outline the main technical parameters influencing accurate identification and frequency estimation of numts in genomic studies pertinent to both evolution and human health. PMID:28444372
The first genome sequences of human bocaviruses from Vietnam
Thanh, Tran Tan; Van, Hoang Minh Tu; Hong, Nguyen Thi Thu; Nhu, Le Nguyen Truc; Anh, Nguyen To; Tuan, Ha Manh; Hien, Ho Van; Tuong, Nguyen Manh; Kien, Trinh Trung; Khanh, Truong Huu; Nhan, Le Nguyen Thanh; Hung, Nguyen Thanh; Chau, Nguyen Van Vinh; Thwaites, Guy; van Doorn, H. Rogier; Tan, Le Van
2017-01-01
As part of an ongoing effort to generate complete genome sequences of hand, foot and mouth disease-causing enteroviruses directly from clinical specimens, two complete coding sequences and two partial genomic sequences of human bocavirus 1 (n=3) and 2 (n=1) were co-amplified and sequenced, representing the first genome sequences of human bocaviruses from Vietnam. The sequences may aid future study aiming at understanding the evolution of the virus. PMID:28090592
Farré, Marta; Robinson, Terence J; Ruiz-Herrera, Aurora
2015-05-01
Our understanding of genomic reorganization, the mechanics of genomic transmission to offspring during germ line formation, and how these structural changes contribute to the speciation process, and genetic disease is far from complete. Earlier attempts to understand the mechanism(s) and constraints that govern genome remodeling suffered from being too narrowly focused, and failed to provide a unified and encompassing view of how genomes are organized and regulated inside cells. Here, we propose a new multidisciplinary Integrative Breakage Model for the study of genome evolution. The analysis of the high-level structural organization of genomes (nucleome), together with the functional constrains that accompany genome reshuffling, provide insights into the origin and plasticity of genome organization that may assist with the detection and isolation of therapeutic targets for the treatment of complex human disorders. © 2015 WILEY Periodicals, Inc.
Is mammalian chromosomal evolution driven by regions of genome fragility?
Ruiz-Herrera, Aurora; Castresana, Jose; Robinson, Terence J
2006-01-01
Background A fundamental question in comparative genomics concerns the identification of mechanisms that underpin chromosomal change. In an attempt to shed light on the dynamics of mammalian genome evolution, we analyzed the distribution of syntenic blocks, evolutionary breakpoint regions, and evolutionary breakpoints taken from public databases available for seven eutherian species (mouse, rat, cattle, dog, pig, cat, and horse) and the chicken, and examined these for correspondence with human fragile sites and tandem repeats. Results Our results confirm previous investigations that showed the presence of chromosomal regions in the human genome that have been repeatedly used as illustrated by a high breakpoint accumulation in certain chromosomes and chromosomal bands. We show, however, that there is a striking correspondence between fragile site location, the positions of evolutionary breakpoints, and the distribution of tandem repeats throughout the human genome, which similarly reflect a non-uniform pattern of occurrence. Conclusion These observations provide further evidence that certain chromosomal regions in the human genome have been repeatedly used in the evolutionary process. As a consequence, the genome is a composite of fragile regions prone to reorganization that have been conserved in different lineages, and genomic tracts that do not exhibit the same levels of evolutionary plasticity. PMID:17156441
Within-host evolution of bacterial pathogens
Didelot, Xavier; Walker, A. Sarah; Peto, Tim E.; Crook, Derrick W.; Wilson, Daniel J.
2016-01-01
Whole genome sequencing has opened the way to investigating the dynamics and genomic evolution of bacterial pathogens during colonization and infection of humans. The application of this technology to the longitudinal study of adaptation in the infected host — in particular, the evolution of drug resistance and host adaptation in patients chronically infected with opportunistic pathogens — has revealed remarkable patterns of convergent evolution, pointing to an inherent repeatability of evolution. In this Review, we describe how these studies have advanced our understanding of the mechanisms and principles of within-host genome evolution, and we consider the consequences of findings such as a potent adaptive potential for pathogenicity. Finally, we discuss the possibility that genomics may be used in the future to predict the clinical progression of bacterial infections, and to suggest the best treatment option. PMID:26806595
Within-host evolution of bacterial pathogens.
Didelot, Xavier; Walker, A Sarah; Peto, Tim E; Crook, Derrick W; Wilson, Daniel J
2016-03-01
Whole-genome sequencing has opened the way for investigating the dynamics and genomic evolution of bacterial pathogens during the colonization and infection of humans. The application of this technology to the longitudinal study of adaptation in an infected host--in particular, the evolution of drug resistance and host adaptation in patients who are chronically infected with opportunistic pathogens--has revealed remarkable patterns of convergent evolution, suggestive of an inherent repeatability of evolution. In this Review, we describe how these studies have advanced our understanding of the mechanisms and principles of within-host genome evolution, and we consider the consequences of findings such as a potent adaptive potential for pathogenicity. Finally, we discuss the possibility that genomics may be used in the future to predict the clinical progression of bacterial infections and to suggest the best option for treatment.
KANEKO-ISHINO, Tomoko; ISHINO, Fumitoshi
2015-01-01
Mammals, including human beings, have evolved a unique viviparous reproductive system and a highly developed central nervous system. How did these unique characteristics emerge in mammalian evolution, and what kinds of changes did occur in the mammalian genomes as evolution proceeded? A key conceptual term in approaching these issues is “mammalian-specific genomic functions”, a concept covering both mammalian-specific epigenetics and genetics. Genomic imprinting and LTR retrotransposon-derived genes are reviewed as the representative, mammalian-specific genomic functions that are essential not only for the current mammalian developmental system, but also mammalian evolution itself. First, the essential roles of genomic imprinting in mammalian development, especially related to viviparous reproduction via placental function, as well as the emergence of genomic imprinting in mammalian evolution, are discussed. Second, we introduce the novel concept of “mammalian-specific traits generated by mammalian-specific genes from LTR retrotransposons”, based on the finding that LTR retrotransposons served as a critical driving force in the mammalian evolution via generating mammalian-specific genes. PMID:26666304
Kaneko-Ishino, Tomoko; Ishino, Fumitoshi
2015-01-01
Mammals, including human beings, have evolved a unique viviparous reproductive system and a highly developed central nervous system. How did these unique characteristics emerge in mammalian evolution, and what kinds of changes did occur in the mammalian genomes as evolution proceeded? A key conceptual term in approaching these issues is "mammalian-specific genomic functions", a concept covering both mammalian-specific epigenetics and genetics. Genomic imprinting and LTR retrotransposon-derived genes are reviewed as the representative, mammalian-specific genomic functions that are essential not only for the current mammalian developmental system, but also mammalian evolution itself. First, the essential roles of genomic imprinting in mammalian development, especially related to viviparous reproduction via placental function, as well as the emergence of genomic imprinting in mammalian evolution, are discussed. Second, we introduce the novel concept of "mammalian-specific traits generated by mammalian-specific genes from LTR retrotransposons", based on the finding that LTR retrotransposons served as a critical driving force in the mammalian evolution via generating mammalian-specific genes.
Acharya, Debarun; Ghosh, Tapash C
2016-01-22
Gene duplication is a genetic mutation that creates functionally redundant gene copies that are initially relieved from selective pressures and may adapt themselves to new functions with time. The levels of gene duplication may vary from small-scale duplication (SSD) to whole genome duplication (WGD). Studies with yeast revealed ample differences between these duplicates: Yeast WGD pairs were functionally more similar, less divergent in subcellular localization and contained a lesser proportion of essential genes. In this study, we explored the differences in evolutionary genomic properties of human SSD and WGD genes, with the identifiable human duplicates coming from the two rounds of whole genome duplication occurred early in vertebrate evolution. We observed that these two groups of duplicates were also dissimilar in terms of their evolutionary and genomic properties. But interestingly, this is not like the same observed in yeast. The human WGDs were found to be functionally less similar, diverge more in subcellular level and contain a higher proportion of essential genes than the SSDs, all of which are opposite from yeast. Additionally, we explored that human WGDs were more divergent in their gene expression profile, have higher multifunctionality and are more often associated with disease, and are evolutionarily more conserved than human SSDs. Our study suggests that human WGD duplicates are more divergent and entails the adaptation of WGDs to novel and important functions that consequently lead to their evolutionary conservation in the course of evolution.
Gu, Xun; Wang, Yufeng; Gu, Jianying
2002-06-01
The classical (two-round) hypothesis of vertebrate genome duplication proposes two successive whole-genome duplication(s) (polyploidizations) predating the origin of fishes, a view now being seriously challenged. As the debate largely concerns the relative merits of the 'big-bang mode' theory (large-scale duplication) and the 'continuous mode' theory (constant creation by small-scale duplications), we tested whether a significant proportion of paralogous genes in the contemporary human genome was indeed generated in the early stage of vertebrate evolution. After an extensive search of major databases, we dated 1,739 gene duplication events from the phylogenetic analysis of 749 vertebrate gene families. We found a pattern characterized by two waves (I, II) and an ancient component. Wave I represents a recent gene family expansion by tandem or segmental duplications, whereas wave II, a rapid paralogous gene increase in the early stage of vertebrate evolution, supports the idea of genome duplication(s) (the big-bang mode). Further analysis indicated that large- and small-scale gene duplications both make a significant contribution during the early stage of vertebrate evolution to build the current hierarchy of the human proteome.
Analysis of pig genomes provide insight into porcine demography and evolution
USDA-ARS?s Scientific Manuscript database
For nearly 8,000 years pigs and humans have shared a close and complex relationship, and through domestication and breeding, humans have shaped the genomes of current diverse pig breeds. Here we present the assembly and analysis of the genome sequence of a female domestic pig from the European Duroc...
Divergence of Mammalian Higher Order Chromatin Structure Is Associated with Developmental Loci
Chambers, Emily V.; Bickmore, Wendy A.; Semple, Colin A.
2013-01-01
Several recent studies have examined different aspects of mammalian higher order chromatin structure – replication timing, lamina association and Hi-C inter-locus interactions — and have suggested that most of these features of genome organisation are conserved over evolution. However, the extent of evolutionary divergence in higher order structure has not been rigorously measured across the mammalian genome, and until now little has been known about the characteristics of any divergent loci present. Here, we generate a dataset combining multiple measurements of chromatin structure and organisation over many embryonic cell types for both human and mouse that, for the first time, allows a comprehensive assessment of the extent of structural divergence between mammalian genomes. Comparison of orthologous regions confirms that all measurable facets of higher order structure are conserved between human and mouse, across the vast majority of the detectably orthologous genome. This broad similarity is observed in spite of many loci possessing cell type specific structures. However, we also identify hundreds of regions (from 100 Kb to 2.7 Mb in size) showing consistent evidence of divergence between these species, constituting at least 10% of the orthologous mammalian genome and encompassing many hundreds of human and mouse genes. These regions show unusual shifts in human GC content, are unevenly distributed across both genomes, and are enriched in human subtelomeric regions. Divergent regions are also relatively enriched for genes showing divergent expression patterns between human and mouse ES cells, implying these regions cause divergent regulation. Particular divergent loci are strikingly enriched in genes implicated in vertebrate development, suggesting important roles for structural divergence in the evolution of mammalian developmental programmes. These data suggest that, though relatively rare in the mammalian genome, divergence in higher order chromatin structure has played important roles during evolution. PMID:23592965
Bai, Haihua; Guo, Xiaosen; Zhang, Dong; Narisu, Narisu; Bu, Junjie; Jirimutu, Jirimutu; Liang, Fan; Zhao, Xiang; Xing, Yanping; Wang, Dingzhu; Li, Tongda; Zhang, Yanru; Guan, Baozhu; Yang, Xukui; Yang, Zili; Shuangshan, Shuangshan; Su, Zhe; Wu, Huiguang; Li, Wenjing; Chen, Ming; Zhu, Shilin; Bayinnamula, Bayinnamula; Chang, Yuqi; Gao, Ying; Lan, Tianming; Suyalatu, Suyalatu; Huang, Hui; Su, Yan; Chen, Yujie; Li, Wenqi; Yang, Xu; Feng, Qiang; Wang, Jian; Yang, Huanming; Wang, Jun; Wu, Qizhu; Yin, Ye; Zhou, Huanmin
2014-11-05
Mongolians have played a significant role in modern human evolution, especially after the rise of Genghis Khan (1162[?]-1227). Although the social cultural impacts of Genghis Khan and the Mongolian population have been well documented, explorations of their genome structure and genetic imprints on other human populations have been lacking. We here present the genome of a Mongolian male individual. The genome was de novo assembled using a total of 130.8-fold genomic data produced from massively parallel whole-genome sequencing. We identified high-confidence variation sets, including 3.7 million single nucleotide polymorphisms (SNPs) and 756,234 short insertions and deletions. Functional SNP analysis predicted that the individual has a pathogenic risk for carnitine deficiency. We located the patrilineal inheritance of the Mongolian genome to the lineage D3a through Y haplogroup analysis and inferred that the individual has a common patrilineal ancestor with Tibeto-Burman populations and is likely to be the progeny of the earliest settlers in East Asia. We finally investigated the genetic imprints of Mongolians on other human populations using different approaches. We found varying degrees of gene flows between Mongolians and populations living in Europe, South/Central Asia, and the Indian subcontinent. The analyses demonstrate that the genetic impacts of Mongolians likely resulted from the expansion of the Mongolian Empire in the 13th century. The genome will be of great help in further explorations of modern human evolution and genetic causes of diseases/traits specific to Mongolians. © The Author(s) 2014. Published by Oxford University Press on behalf of the Society for Molecular Biology and Evolution.
Glinsky, Gennadi V
2016-09-19
Thousands of candidate human-specific regulatory sequences (HSRS) have been identified, supporting the hypothesis that unique to human phenotypes result from human-specific alterations of genomic regulatory networks. Collectively, a compendium of multiple diverse families of HSRS that are functionally and structurally divergent from Great Apes could be defined as the backbone of human-specific genomic regulatory networks. Here, the conservation patterns analysis of 18,364 candidate HSRS was carried out requiring that 100% of bases must remap during the alignments of human, chimpanzee, and bonobo sequences. A total of 5,535 candidate HSRS were identified that are: (i) highly conserved in Great Apes; (ii) evolved by the exaptation of highly conserved ancestral DNA; (iii) defined by either the acceleration of mutation rates on the human lineage or the functional divergence from non-human primates. The exaptation of highly conserved ancestral DNA pathway seems mechanistically distinct from the evolution of regulatory DNA segments driven by the species-specific expansion of transposable elements. Genome-wide proximity placement analysis of HSRS revealed that a small fraction of topologically associating domains (TADs) contain more than half of HSRS from four distinct families. TADs that are enriched for HSRS and termed rapidly evolving in humans TADs (revTADs) comprise 0.8-10.3% of 3,127 TADs in the hESC genome. RevTADs manifest distinct correlation patterns between placements of human accelerated regions, human-specific transcription factor-binding sites, and recombination rates. There is a significant enrichment within revTAD boundaries of hESC-enhancers, primate-specific CTCF-binding sites, human-specific RNAPII-binding sites, hCONDELs, and H3K4me3 peaks with human-specific enrichment at TSS in prefrontal cortex neurons (P < 0.0001 in all instances). Present analysis supports the idea that phenotypic divergence of Homo sapiens is driven by the evolution of human-specific genomic regulatory networks via at least two mechanistically distinct pathways of creation of divergent sequences of regulatory DNA: (i) recombination-associated exaptation of the highly conserved ancestral regulatory DNA segments; (ii) human-specific insertions of transposable elements. © The Author 2016. Published by Oxford University Press on behalf of the Society for Molecular Biology and Evolution.
Neighboring Genes Show Correlated Evolution in Gene Expression.
Ghanbarian, Avazeh T; Hurst, Laurence D
2015-07-01
When considering the evolution of a gene's expression profile, we commonly assume that this is unaffected by its genomic neighborhood. This is, however, in contrast to what we know about the lack of autonomy between neighboring genes in gene expression profiles in extant taxa. Indeed, in all eukaryotic genomes genes of similar expression-profile tend to cluster, reflecting chromatin level dynamics. Does it follow that if a gene increases expression in a particular lineage then the genomic neighbors will also increase in their expression or is gene expression evolution autonomous? To address this here we consider evolution of human gene expression since the human-chimp common ancestor, allowing for both variation in estimation of current expression level and error in Bayesian estimation of the ancestral state. We find that in all tissues and both sexes, the change in gene expression of a focal gene on average predicts the change in gene expression of neighbors. The effect is highly pronounced in the immediate vicinity (<100 kb) but extends much further. Sex-specific expression change is also genomically clustered. As genes increasing their expression in humans tend to avoid nuclear lamina domains and be enriched for the gene activator 5-hydroxymethylcytosine, we conclude that, most probably owing to chromatin level control of gene expression, a change in gene expression of one gene likely affects the expression evolution of neighbors, what we term expression piggybacking, an analog of hitchhiking. © The Author 2015. Published by Oxford University Press on behalf of the Society for Molecular Biology and Evolution.
Neighboring Genes Show Correlated Evolution in Gene Expression
Ghanbarian, Avazeh T.; Hurst, Laurence D.
2015-01-01
When considering the evolution of a gene’s expression profile, we commonly assume that this is unaffected by its genomic neighborhood. This is, however, in contrast to what we know about the lack of autonomy between neighboring genes in gene expression profiles in extant taxa. Indeed, in all eukaryotic genomes genes of similar expression-profile tend to cluster, reflecting chromatin level dynamics. Does it follow that if a gene increases expression in a particular lineage then the genomic neighbors will also increase in their expression or is gene expression evolution autonomous? To address this here we consider evolution of human gene expression since the human-chimp common ancestor, allowing for both variation in estimation of current expression level and error in Bayesian estimation of the ancestral state. We find that in all tissues and both sexes, the change in gene expression of a focal gene on average predicts the change in gene expression of neighbors. The effect is highly pronounced in the immediate vicinity (<100 kb) but extends much further. Sex-specific expression change is also genomically clustered. As genes increasing their expression in humans tend to avoid nuclear lamina domains and be enriched for the gene activator 5-hydroxymethylcytosine, we conclude that, most probably owing to chromatin level control of gene expression, a change in gene expression of one gene likely affects the expression evolution of neighbors, what we term expression piggybacking, an analog of hitchhiking. PMID:25743543
Paillot, Romain; Steward, Karen F.; Webb, Katy; Ainslie, Fern; Jourdan, Thibaud; Bason, Nathalie C.; Holroyd, Nancy E.; Mungall, Karen; Quail, Michael A.; Sanders, Mandy; Simmonds, Mark; Willey, David; Brooks, Karen; Aanensen, David M.; Spratt, Brian G.; Jolley, Keith A.; Maiden, Martin C. J.; Kehoe, Michael; Chanter, Neil; Bentley, Stephen D.; Robinson, Carl; Maskell, Duncan J.; Parkhill, Julian; Waller, Andrew S.
2009-01-01
The continued evolution of bacterial pathogens has major implications for both human and animal disease, but the exchange of genetic material between host-restricted pathogens is rarely considered. Streptococcus equi subspecies equi (S. equi) is a host-restricted pathogen of horses that has evolved from the zoonotic pathogen Streptococcus equi subspecies zooepidemicus (S. zooepidemicus). These pathogens share approximately 80% genome sequence identity with the important human pathogen Streptococcus pyogenes. We sequenced and compared the genomes of S. equi 4047 and S. zooepidemicus H70 and screened S. equi and S. zooepidemicus strains from around the world to uncover evidence of the genetic events that have shaped the evolution of the S. equi genome and led to its emergence as a host-restricted pathogen. Our analysis provides evidence of functional loss due to mutation and deletion, coupled with pathogenic specialization through the acquisition of bacteriophage encoding a phospholipase A2 toxin, and four superantigens, and an integrative conjugative element carrying a novel iron acquisition system with similarity to the high pathogenicity island of Yersinia pestis. We also highlight that S. equi, S. zooepidemicus, and S. pyogenes share a common phage pool that enhances cross-species pathogen evolution. We conclude that the complex interplay of functional loss, pathogenic specialization, and genetic exchange between S. equi, S. zooepidemicus, and S. pyogenes continues to influence the evolution of these important streptococci. PMID:19325880
Evidence of a Conserved Molecular Response to Selection for Increased Brain Size in Primates
Harrison, Peter W.; Caravas, Jason A.; Raghanti, Mary Ann; Phillips, Kimberley A.; Mundy, Nicholas I.
2017-01-01
The adaptive significance of human brain evolution has been frequently studied through comparisons with other primates. However, the evolution of increased brain size is not restricted to the human lineage but is a general characteristic of primate evolution. Whether or not these independent episodes of increased brain size share a common genetic basis is unclear. We sequenced and de novo assembled the transcriptome from the neocortical tissue of the most highly encephalized nonhuman primate, the tufted capuchin monkey (Cebus apella). Using this novel data set, we conducted a genome-wide analysis of orthologous brain-expressed protein coding genes to identify evidence of conserved gene–phenotype associations and species-specific adaptations during three independent episodes of brain size increase. We identify a greater number of genes associated with either total brain mass or relative brain size across these six species than show species-specific accelerated rates of evolution in individual large-brained lineages. We test the robustness of these associations in an expanded data set of 13 species, through permutation tests and by analyzing how genome-wide patterns of substitution co-vary with brain size. Many of the genes targeted by selection during brain expansion have glutamatergic functions or roles in cell cycle dynamics. We also identify accelerated evolution in a number of individual capuchin genes whose human orthologs are associated with human neuropsychiatric disorders. These findings demonstrate the value of phenotypically informed genome analyses, and suggest at least some aspects of human brain evolution have occurred through conserved gene–phenotype associations. Understanding these commonalities is essential for distinguishing human-specific selection events from general trends in brain evolution. PMID:28391320
Human genomic disease variants: a neutral evolutionary explanation.
Dudley, Joel T; Kim, Yuseob; Liu, Li; Markov, Glenn J; Gerold, Kristyn; Chen, Rong; Butte, Atul J; Kumar, Sudhir
2012-08-01
Many perspectives on the role of evolution in human health include nonempirical assumptions concerning the adaptive evolutionary origins of human diseases. Evolutionary analyses of the increasing wealth of clinical and population genomic data have begun to challenge these presumptions. In order to systematically evaluate such claims, the time has come to build a common framework for an empirical and intellectual unification of evolution and modern medicine. We review the emerging evidence and provide a supporting conceptual framework that establishes the classical neutral theory of molecular evolution (NTME) as the basis for evaluating disease- associated genomic variations in health and medicine. For over a decade, the NTME has already explained the origins and distribution of variants implicated in diseases and has illuminated the power of evolutionary thinking in genomic medicine. We suggest that a majority of disease variants in modern populations will have neutral evolutionary origins (previously neutral), with a relatively smaller fraction exhibiting adaptive evolutionary origins (previously adaptive). This pattern is expected to hold true for common as well as rare disease variants. Ultimately, a neutral evolutionary perspective will provide medicine with an informative and actionable framework that enables objective clinical assessment beyond convenient tendencies to invoke past adaptive events in human history as a root cause of human disease.
Human genomic disease variants: A neutral evolutionary explanation
Dudley, Joel T.; Kim, Yuseob; Liu, Li; Markov, Glenn J.; Gerold, Kristyn; Chen, Rong; Butte, Atul J.; Kumar, Sudhir
2012-01-01
Many perspectives on the role of evolution in human health include nonempirical assumptions concerning the adaptive evolutionary origins of human diseases. Evolutionary analyses of the increasing wealth of clinical and population genomic data have begun to challenge these presumptions. In order to systematically evaluate such claims, the time has come to build a common framework for an empirical and intellectual unification of evolution and modern medicine. We review the emerging evidence and provide a supporting conceptual framework that establishes the classical neutral theory of molecular evolution (NTME) as the basis for evaluating disease- associated genomic variations in health and medicine. For over a decade, the NTME has already explained the origins and distribution of variants implicated in diseases and has illuminated the power of evolutionary thinking in genomic medicine. We suggest that a majority of disease variants in modern populations will have neutral evolutionary origins (previously neutral), with a relatively smaller fraction exhibiting adaptive evolutionary origins (previously adaptive). This pattern is expected to hold true for common as well as rare disease variants. Ultimately, a neutral evolutionary perspective will provide medicine with an informative and actionable framework that enables objective clinical assessment beyond convenient tendencies to invoke past adaptive events in human history as a root cause of human disease. PMID:22665443
Kim, Soonok; Cho, Yun Sung; Bhak, Jong; O’Brian, Stephen J.; Yeo, Joo-Hong
2017-01-01
Recent advances in genome sequencing technologies have enabled humans to generate and investigate the genomes of wild species. This includes the big cat family, such as tigers, lions, and leopards. Adding the first high quality leopard genome, we have performed an in-depth comparative analysis to identify the genomic signatures in the evolution of felid to become the top predators on land. Our study focused on how the carnivore genomes, as compared to the omnivore or herbivore genomes, shared evolutionary adaptations in genes associated with nutrient metabolism, muscle strength, agility, and other traits responsible for hunting and meat digestion. We found genetic evidence that genomes represent what animals eat through modifying genes. Highly conserved genetically relevant regions were discovered in genomes at the family level. Also, the Felidae family genomes exhibited low levels of genetic diversity associated with decreased population sizes, presumably because of their strict diet, suggesting their vulnerability and critical conservation status. Our findings can be used for human health enhancement, since we share the same genes as cats with some variation. This is an example how wildlife genomes can be a critical resource for human evolution, providing key genetic marker information for disease treatment. PMID:28042784
Hofreiter, Michael
2011-02-01
Ten years after the first draft versions of the human genome were announced, technical progress in both DNA sequencing and ancient DNA analyses has allowed a research team around Ed Green and Svante Pääbo to complete this task from infinitely more difficult hominid samples: a few pieces of bone originating from our closest, albeit extinct, relatives, the Neanderthals. Pulling the Neanderthal sequences out of a sea of contaminating environmental DNA impregnating the bones and at the same time avoiding the problems of contamination with modern human DNA is in itself a remarkable accomplishment. However, the crucial question in the long run is, what can we learn from such genomic data about hominid evolution?
The Evolution and Functional Impact of Human Deletion Variants Shared with Archaic Hominin Genomes
Lin, Yen-Lung; Pavlidis, Pavlos; Karakoc, Emre; Ajay, Jerry; Gokcumen, Omer
2015-01-01
Allele sharing between modern and archaic hominin genomes has been variously interpreted to have originated from ancestral genetic structure or through non-African introgression from archaic hominins. However, evolution of polymorphic human deletions that are shared with archaic hominin genomes has yet to be studied. We identified 427 polymorphic human deletions that are shared with archaic hominin genomes, approximately 87% of which originated before the Human–Neandertal divergence (ancient) and only approximately 9% of which have been introgressed from Neandertals (introgressed). Recurrence, incomplete lineage sorting between human and chimp lineages, and hominid-specific insertions constitute the remaining approximately 4% of allele sharing between humans and archaic hominins. We observed that ancient deletions correspond to more than 13% of all common (>5% allele frequency) deletion variation among modern humans. Our analyses indicate that the genomic landscapes of both ancient and introgressed deletion variants were primarily shaped by purifying selection, eliminating large and exonic variants. We found 17 exonic deletions that are shared with archaic hominin genomes, including those leading to three fusion transcripts. The affected genes are involved in metabolism of external and internal compounds, growth and sperm formation, as well as susceptibility to psoriasis and Crohn’s disease. Our analyses suggest that these “exonic” deletion variants have evolved through different adaptive forces, including balancing and population-specific positive selection. Our findings reveal that genomic structural variants that are shared between humans and archaic hominin genomes are common among modern humans and can influence biomedically and evolutionarily important phenotypes. PMID:25556237
Konkel, Miriam K; Walker, Jerilyn A; Hotard, Ashley B; Ranck, Megan C; Fontenot, Catherine C; Storer, Jessica; Stewart, Chip; Marth, Gabor T; Batzer, Mark A
2015-08-29
The goal of the 1000 Genomes Consortium is to characterize human genome structural variation (SV), including forms of copy number variations such as deletions, duplications, and insertions. Mobile element insertions, particularly Alu elements, are major contributors to genomic SV among humans. During the pilot phase of the project we experimentally validated 645 (611 intergenic and 34 exon targeted) polymorphic "young" Alu insertion events, absent from the human reference genome. Here, we report high resolution sequencing of 343 (322 unique) recent Alu insertion events, along with their respective target site duplications, precise genomic breakpoint coordinates, subfamily assignment, percent divergence, and estimated A-rich tail lengths. All the sequenced Alu loci were derived from the AluY lineage with no evidence of retrotransposition activity involving older Alu families (e.g., AluJ and AluS). AluYa5 is currently the most active Alu subfamily in the human lineage, followed by AluYb8, and many others including three newly identified subfamilies we have termed AluYb7a3, AluYb8b1, and AluYa4a1. This report provides the structural details of 322 unique Alu variants from individual human genomes collectively adding about 100 kb of genomic variation. Many Alu subfamilies are currently active in human populations, including a surprising level of AluY retrotransposition. Human Alu subfamilies exhibit continuous evolution with potential drivers sprouting new Alu lineages. © The Author(s) 2015. Published by Oxford University Press on behalf of the Society for Molecular Biology and Evolution.
Human centromere genomics: now it's personal.
Hayden, Karen E
2012-07-01
Advances in human genomics have accelerated studies in evolution, disease, and cellular regulation. However, centromere sequences, defining the chromosomal interface with spindle microtubules, remain largely absent from ongoing genomic studies and disconnected from functional, genome-wide analyses. This disparity results from the challenge of predicting the linear order of multi-megabase-sized regions that are composed almost entirely of near-identical satellite DNA. Acknowledging these challenges, the field of human centromere genomics possesses the potential to rapidly advance given the availability of individual, or personalized, genome projects matched with the promise of long-read sequencing technologies. Here I review the current genomic model of human centromeres in consideration of those studies involving functional datasets that examine the role of sequence in centromere identity.
Yohn, Chris T; Jiang, Zhaoshi; McGrath, Sean D; Hayden, Karen E; Khaitovich, Philipp; Johnson, Matthew E; Eichler, Marla Y; McPherson, John D; Zhao, Shaying; Pääbo, Svante; Eichler, Evan E
2005-04-01
Retroviral infections of the germline have the potential to episodically alter gene function and genome structure during the course of evolution. Horizontal transmissions between species have been proposed, but little evidence exists for such events in the human/great ape lineage of evolution. Based on analysis of finished BAC chimpanzee genome sequence, we characterize a retroviral element (Pan troglodytes endogenous retrovirus 1 [PTERV1]) that has become integrated in the germline of African great ape and Old World monkey species but is absent from humans and Asian ape genomes. We unambiguously map 287 retroviral integration sites and determine that approximately 95.8% of the insertions occur at non-orthologous regions between closely related species. Phylogenetic analysis of the endogenous retrovirus reveals that the gorilla and chimpanzee elements share a monophyletic origin with a subset of the Old World monkey retroviral elements, but that the average sequence divergence exceeds neutral expectation for a strictly nuclear inherited DNA molecule. Within the chimpanzee, there is a significant integration bias against genes, with only 14 of these insertions mapping within intronic regions. Six out of ten of these genes, for which there are expression data, show significant differences in transcript expression between human and chimpanzee. Our data are consistent with a retroviral infection that bombarded the genomes of chimpanzees and gorillas independently and concurrently, 3-4 million years ago. We speculate on the potential impact of such recent events on the evolution of humans and great apes.
Glinsky, Gennadi V.
2016-01-01
Abstract Thousands of candidate human-specific regulatory sequences (HSRS) have been identified, supporting the hypothesis that unique to human phenotypes result from human-specific alterations of genomic regulatory networks. Collectively, a compendium of multiple diverse families of HSRS that are functionally and structurally divergent from Great Apes could be defined as the backbone of human-specific genomic regulatory networks. Here, the conservation patterns analysis of 18,364 candidate HSRS was carried out requiring that 100% of bases must remap during the alignments of human, chimpanzee, and bonobo sequences. A total of 5,535 candidate HSRS were identified that are: (i) highly conserved in Great Apes; (ii) evolved by the exaptation of highly conserved ancestral DNA; (iii) defined by either the acceleration of mutation rates on the human lineage or the functional divergence from non-human primates. The exaptation of highly conserved ancestral DNA pathway seems mechanistically distinct from the evolution of regulatory DNA segments driven by the species-specific expansion of transposable elements. Genome-wide proximity placement analysis of HSRS revealed that a small fraction of topologically associating domains (TADs) contain more than half of HSRS from four distinct families. TADs that are enriched for HSRS and termed rapidly evolving in humans TADs (revTADs) comprise 0.8–10.3% of 3,127 TADs in the hESC genome. RevTADs manifest distinct correlation patterns between placements of human accelerated regions, human-specific transcription factor-binding sites, and recombination rates. There is a significant enrichment within revTAD boundaries of hESC-enhancers, primate-specific CTCF-binding sites, human-specific RNAPII-binding sites, hCONDELs, and H3K4me3 peaks with human-specific enrichment at TSS in prefrontal cortex neurons (P < 0.0001 in all instances). Present analysis supports the idea that phenotypic divergence of Homo sapiens is driven by the evolution of human-specific genomic regulatory networks via at least two mechanistically distinct pathways of creation of divergent sequences of regulatory DNA: (i) recombination-associated exaptation of the highly conserved ancestral regulatory DNA segments; (ii) human-specific insertions of transposable elements. PMID:27503290
The function and evolution of the Aspergillus genome
Gibbons, John G.; Rokas, Antonis
2012-01-01
Species in the filamentous fungal genus Aspergillus display a wide diversity of lifestyles and are of great importance to humans. The decoding of genome sequences from a dozen species that vary widely in their degree of evolutionary affinity has galvanized studies of the function and evolution of the Aspergillus genome in clinical, industrial, and agricultural environments. Here, we synthesize recent key findings that shed light on the architecture of the Aspergillus genome, on the molecular foundations of the genus’ astounding dexterity and diversity in secondary metabolism, and on the genetic underpinnings of virulence in Aspergillus fumigatus, one of the most lethal fungal pathogens. Many of these insights dramatically expand our knowledge of fungal and microbial eukaryote genome evolution and function and argue that Aspergillus constitutes a superb model clade for the study of functional and comparative genomics. PMID:23084572
Identification of cis-suppression of human disease mutations by comparative genomics.
Jordan, Daniel M; Frangakis, Stephan G; Golzio, Christelle; Cassa, Christopher A; Kurtzberg, Joanne; Davis, Erica E; Sunyaev, Shamil R; Katsanis, Nicholas
2015-08-13
Patterns of amino acid conservation have served as a tool for understanding protein evolution. The same principles have also found broad application in human genomics, driven by the need to interpret the pathogenic potential of variants in patients. Here we performed a systematic comparative genomics analysis of human disease-causing missense variants. We found that an appreciable fraction of disease-causing alleles are fixed in the genomes of other species, suggesting a role for genomic context. We developed a model of genetic interactions that predicts most of these to be simple pairwise compensations. Functional testing of this model on two known human disease genes revealed discrete cis amino acid residues that, although benign on their own, could rescue the human mutations in vivo. This approach was also applied to ab initio gene discovery to support the identification of a de novo disease driver in BTG2 that is subject to protective cis-modification in more than 50 species. Finally, on the basis of our data and models, we developed a computational tool to predict candidate residues subject to compensation. Taken together, our data highlight the importance of cis-genomic context as a contributor to protein evolution; they provide an insight into the complexity of allele effect on phenotype; and they are likely to assist methods for predicting allele pathogenicity.
Evolution and Diversity of the Human Hepatitis D Virus Genome
Huang, Chi-Ruei; Lo, Szecheng J.
2010-01-01
Human hepatitis delta virus (HDV) is the smallest RNA virus in genome. HDV genome is divided into a viroid-like sequence and a protein-coding sequence which could have originated from different resources and the HDV genome was eventually constituted through RNA recombination. The genome subsequently diversified through accumulation of mutations selected by interactions between the mutated RNA and proteins with host factors to successfully form the infectious virions. Therefore, we propose that the conservation of HDV nucleotide sequence is highly related with its functionality. Genome analysis of known HDV isolates shows that the C-terminal coding sequences of large delta antigen (LDAg) are the highest diversity than other regions of protein-coding sequences but they still retain biological functionality to interact with the heavy chain of clathrin can be selected and maintained. Since viruses interact with many host factors, including escaping the host immune response, how to design a program to predict RNA genome evolution is a great challenging work. PMID:20204073
Evidence for Widespread Reticulate Evolution within Human Duplicons
Jackson, Michael S. ; Oliver, Karen ; Loveland, Jane ; Humphray, Sean ; Dunham, Ian ; Rocchi, Mariano ; Viggiano, Luigi ; Park, Jonathan P. ; Hurles, Matthew E. ; Santibanez-Koref, Mauro
2005-01-01
Approximately 5% of the human genome consists of segmental duplications that can cause genomic mutations and may play a role in gene innovation. Reticulate evolutionary processes, such as unequal crossing-over and gene conversion, are known to occur within specific duplicon families, but the broader contribution of these processes to the evolution of human duplications remains poorly characterized. Here, we use phylogenetic profiling to analyze multiple alignments of 24 human duplicon families that span >8 Mb of DNA. Our results indicate that none of them are evolving independently, with all alignments showing sharp discontinuities in phylogenetic signal consistent with reticulation. To analyze these results in more detail, we have developed a quartet method that estimates the relative contribution of nucleotide substitution and reticulate processes to sequence evolution. Our data indicate that most of the duplications show a highly significant excess of sites consistent with reticulate evolution, compared with the number expected by nucleotide substitution alone, with 15 of 30 alignments showing a >20-fold excess over that expected. Using permutation tests, we also show that at least 5% of the total sequence shares 100% sequence identity because of reticulation, a figure that includes 74 independent tracts of perfect identity >2 kb in length. Furthermore, analysis of a subset of alignments indicates that the density of reticulation events is as high as 1 every 4 kb. These results indicate that phylogenetic relationships within recently duplicated human DNA can be rapidly disrupted by reticulate evolution. This finding has important implications for efforts to finish the human genome sequence, complicates comparative sequence analysis of duplicon families, and could profoundly influence the tempo of gene-family evolution. PMID:16252241
The genome sequence of taurine cattle: a window to ruminant biology and evolution.
Elsik, Christine G; Tellam, Ross L; Worley, Kim C; Gibbs, Richard A; Muzny, Donna M; Weinstock, George M; Adelson, David L; Eichler, Evan E; Elnitski, Laura; Guigó, Roderic; Hamernik, Debora L; Kappes, Steve M; Lewin, Harris A; Lynn, David J; Nicholas, Frank W; Reymond, Alexandre; Rijnkels, Monique; Skow, Loren C; Zdobnov, Evgeny M; Schook, Lawrence; Womack, James; Alioto, Tyler; Antonarakis, Stylianos E; Astashyn, Alex; Chapple, Charles E; Chen, Hsiu-Chuan; Chrast, Jacqueline; Câmara, Francisco; Ermolaeva, Olga; Henrichsen, Charlotte N; Hlavina, Wratko; Kapustin, Yuri; Kiryutin, Boris; Kitts, Paul; Kokocinski, Felix; Landrum, Melissa; Maglott, Donna; Pruitt, Kim; Sapojnikov, Victor; Searle, Stephen M; Solovyev, Victor; Souvorov, Alexandre; Ucla, Catherine; Wyss, Carine; Anzola, Juan M; Gerlach, Daniel; Elhaik, Eran; Graur, Dan; Reese, Justin T; Edgar, Robert C; McEwan, John C; Payne, Gemma M; Raison, Joy M; Junier, Thomas; Kriventseva, Evgenia V; Eyras, Eduardo; Plass, Mireya; Donthu, Ravikiran; Larkin, Denis M; Reecy, James; Yang, Mary Q; Chen, Lin; Cheng, Ze; Chitko-McKown, Carol G; Liu, George E; Matukumalli, Lakshmi K; Song, Jiuzhou; Zhu, Bin; Bradley, Daniel G; Brinkman, Fiona S L; Lau, Lilian P L; Whiteside, Matthew D; Walker, Angela; Wheeler, Thomas T; Casey, Theresa; German, J Bruce; Lemay, Danielle G; Maqbool, Nauman J; Molenaar, Adrian J; Seo, Seongwon; Stothard, Paul; Baldwin, Cynthia L; Baxter, Rebecca; Brinkmeyer-Langford, Candice L; Brown, Wendy C; Childers, Christopher P; Connelley, Timothy; Ellis, Shirley A; Fritz, Krista; Glass, Elizabeth J; Herzig, Carolyn T A; Iivanainen, Antti; Lahmers, Kevin K; Bennett, Anna K; Dickens, C Michael; Gilbert, James G R; Hagen, Darren E; Salih, Hanni; Aerts, Jan; Caetano, Alexandre R; Dalrymple, Brian; Garcia, Jose Fernando; Gill, Clare A; Hiendleder, Stefan G; Memili, Erdogan; Spurlock, Diane; Williams, John L; Alexander, Lee; Brownstein, Michael J; Guan, Leluo; Holt, Robert A; Jones, Steven J M; Marra, Marco A; Moore, Richard; Moore, Stephen S; Roberts, Andy; Taniguchi, Masaaki; Waterman, Richard C; Chacko, Joseph; Chandrabose, Mimi M; Cree, Andy; Dao, Marvin Diep; Dinh, Huyen H; Gabisi, Ramatu Ayiesha; Hines, Sandra; Hume, Jennifer; Jhangiani, Shalini N; Joshi, Vandita; Kovar, Christie L; Lewis, Lora R; Liu, Yih-Shin; Lopez, John; Morgan, Margaret B; Nguyen, Ngoc Bich; Okwuonu, Geoffrey O; Ruiz, San Juana; Santibanez, Jireh; Wright, Rita A; Buhay, Christian; Ding, Yan; Dugan-Rocha, Shannon; Herdandez, Judith; Holder, Michael; Sabo, Aniko; Egan, Amy; Goodell, Jason; Wilczek-Boney, Katarzyna; Fowler, Gerald R; Hitchens, Matthew Edward; Lozado, Ryan J; Moen, Charles; Steffen, David; Warren, James T; Zhang, Jingkun; Chiu, Readman; Schein, Jacqueline E; Durbin, K James; Havlak, Paul; Jiang, Huaiyang; Liu, Yue; Qin, Xiang; Ren, Yanru; Shen, Yufeng; Song, Henry; Bell, Stephanie Nicole; Davis, Clay; Johnson, Angela Jolivet; Lee, Sandra; Nazareth, Lynne V; Patel, Bella Mayurkumar; Pu, Ling-Ling; Vattathil, Selina; Williams, Rex Lee; Curry, Stacey; Hamilton, Cerissa; Sodergren, Erica; Wheeler, David A; Barris, Wes; Bennett, Gary L; Eggen, André; Green, Ronnie D; Harhay, Gregory P; Hobbs, Matthew; Jann, Oliver; Keele, John W; Kent, Matthew P; Lien, Sigbjørn; McKay, Stephanie D; McWilliam, Sean; Ratnakumar, Abhirami; Schnabel, Robert D; Smith, Timothy; Snelling, Warren M; Sonstegard, Tad S; Stone, Roger T; Sugimoto, Yoshikazu; Takasuga, Akiko; Taylor, Jeremy F; Van Tassell, Curtis P; Macneil, Michael D; Abatepaulo, Antonio R R; Abbey, Colette A; Ahola, Virpi; Almeida, Iassudara G; Amadio, Ariel F; Anatriello, Elen; Bahadue, Suria M; Biase, Fernando H; Boldt, Clayton R; Carroll, Jeffery A; Carvalho, Wanessa A; Cervelatti, Eliane P; Chacko, Elsa; Chapin, Jennifer E; Cheng, Ye; Choi, Jungwoo; Colley, Adam J; de Campos, Tatiana A; De Donato, Marcos; Santos, Isabel K F de Miranda; de Oliveira, Carlo J F; Deobald, Heather; Devinoy, Eve; Donohue, Kaitlin E; Dovc, Peter; Eberlein, Annett; Fitzsimmons, Carolyn J; Franzin, Alessandra M; Garcia, Gustavo R; Genini, Sem; Gladney, Cody J; Grant, Jason R; Greaser, Marion L; Green, Jonathan A; Hadsell, Darryl L; Hakimov, Hatam A; Halgren, Rob; Harrow, Jennifer L; Hart, Elizabeth A; Hastings, Nicola; Hernandez, Marta; Hu, Zhi-Liang; Ingham, Aaron; Iso-Touru, Terhi; Jamis, Catherine; Jensen, Kirsty; Kapetis, Dimos; Kerr, Tovah; Khalil, Sari S; Khatib, Hasan; Kolbehdari, Davood; Kumar, Charu G; Kumar, Dinesh; Leach, Richard; Lee, Justin C-M; Li, Changxi; Logan, Krystin M; Malinverni, Roberto; Marques, Elisa; Martin, William F; Martins, Natalia F; Maruyama, Sandra R; Mazza, Raffaele; McLean, Kim L; Medrano, Juan F; Moreno, Barbara T; Moré, Daniela D; Muntean, Carl T; Nandakumar, Hari P; Nogueira, Marcelo F G; Olsaker, Ingrid; Pant, Sameer D; Panzitta, Francesca; Pastor, Rosemeire C P; Poli, Mario A; Poslusny, Nathan; Rachagani, Satyanarayana; Ranganathan, Shoba; Razpet, Andrej; Riggs, Penny K; Rincon, Gonzalo; Rodriguez-Osorio, Nelida; Rodriguez-Zas, Sandra L; Romero, Natasha E; Rosenwald, Anne; Sando, Lillian; Schmutz, Sheila M; Shen, Libing; Sherman, Laura; Southey, Bruce R; Lutzow, Ylva Strandberg; Sweedler, Jonathan V; Tammen, Imke; Telugu, Bhanu Prakash V L; Urbanski, Jennifer M; Utsunomiya, Yuri T; Verschoor, Chris P; Waardenberg, Ashley J; Wang, Zhiquan; Ward, Robert; Weikard, Rosemarie; Welsh, Thomas H; White, Stephen N; Wilming, Laurens G; Wunderlich, Kris R; Yang, Jianqi; Zhao, Feng-Qi
2009-04-24
To understand the biology and evolution of ruminants, the cattle genome was sequenced to about sevenfold coverage. The cattle genome contains a minimum of 22,000 genes, with a core set of 14,345 orthologs shared among seven mammalian species of which 1217 are absent or undetected in noneutherian (marsupial or monotreme) genomes. Cattle-specific evolutionary breakpoint regions in chromosomes have a higher density of segmental duplications, enrichment of repetitive elements, and species-specific variations in genes associated with lactation and immune responsiveness. Genes involved in metabolism are generally highly conserved, although five metabolic genes are deleted or extensively diverged from their human orthologs. The cattle genome sequence thus provides a resource for understanding mammalian evolution and accelerating livestock genetic improvement for milk and meat production.
Sequence-Level Mechanisms of Human Epigenome Evolution
Prendergast, James G.D.; Chambers, Emily V.; Semple, Colin A.M.
2014-01-01
DNA methylation and chromatin states play key roles in development and disease. However, the extent of recent evolutionary divergence in the human epigenome and the influential factors that have shaped it are poorly understood. To determine the links between genome sequence and human epigenome evolution, we examined the divergence of DNA methylation and chromatin states following segmental duplication events in the human lineage. Chromatin and DNA methylation states were found to have been generally well conserved following a duplication event, with the evolution of the epigenome largely uncoupled from the total number of genetic changes in the surrounding DNA sequence. However, the epigenome at tissue-specific, distal regulatory regions was observed to be unusually prone to diverge following duplication, with particular sequence differences, altering known sequence motifs, found to be associated with divergence in patterns of DNA methylation and chromatin. Alu elements were found to have played a particularly prominent role in shaping human epigenome evolution, and we show that human-specific AluY insertion events are strongly linked to the evolution of the DNA methylation landscape and gene expression levels, including at key neurological genes in the human brain. Studying paralogous regions within the same sample enables the study of the links between genome and epigenome evolution while controlling for biological and technical variation. We show DNA methylation and chromatin divergence between duplicated regions are linked to the divergence of particular genetic motifs, with Alu elements having played a disproportionate role in the evolution of the epigenome in the human lineage. PMID:24966180
The Red Queen in mitochondria: cyto-nuclear co-evolution, hybrid breakdown and human disease
Chou, Jui-Yu; Leu, Jun-Yi
2015-01-01
Cyto-nuclear incompatibility, a specific form of Dobzhansky-Muller incompatibility caused by incompatible alleles between mitochondrial and nuclear genomes, has been suggested to play a critical role during speciation. Several features of the mitochondrial genome (mtDNA), including high mutation rate, dynamic genomic structure, and uniparental inheritance, make mtDNA more likely to accumulate mutations in the population. Once mtDNA has changed, the nuclear genome needs to play catch-up due to the intimate interactions between these two genomes. In two populations, if cyto-nuclear co-evolution is driven in different directions, it may eventually lead to hybrid incompatibility. Although cyto-nuclear incompatibility has been observed in a wide range of organisms, it remains unclear what type of mutations drives the co-evolution. Currently, evidence supporting adaptive mutations in mtDNA remains limited. On the other hand, it has been known that some mutations allow mtDNA to propagate more efficiently but compromise the host fitness (described as selfish mtDNA). Arms races between such selfish mtDNA and host nuclear genomes can accelerate cyto-nuclear co-evolution and lead to a phenomenon called the Red Queen Effect. Here, we discuss how the Red Queen Effect may contribute to the frequent observation of cyto-nuclear incompatibility and be the underlying driving force of some human mitochondrial diseases. PMID:26042149
Acute myeloid leukaemia: a paradigm for the clonal evolution of cancer?
Grove, Carolyn S.; Vassiliou, George S.
2014-01-01
Acute myeloid leukaemia (AML) is an uncontrolled clonal proliferation of abnormal myeloid progenitor cells in the bone marrow and blood. Advances in cancer genomics have revealed the spectrum of somatic mutations that give rise to human AML and drawn our attention to its molecular evolution and clonal architecture. It is now evident that most AML genomes harbour small numbers of mutations, which are acquired in a stepwise manner. This characteristic, combined with our ability to identify mutations in individual leukaemic cells and our detailed understanding of normal human and murine haematopoiesis, makes AML an excellent model for understanding the principles of cancer evolution. Furthermore, a better understanding of how AML evolves can help us devise strategies to improve the therapy and prognosis of AML patients. Here, we draw from recent advances in genomics, clinical studies and experimental models to describe the current knowledge of the clonal evolution of AML and its implications for the biology and treatment of leukaemias and other cancers. PMID:25056697
The Human Genome Project: applications in the diagnosis and treatment of neurologic disease.
Evans, G A
1998-10-01
The Human Genome Project (HGP), an international program to decode the entire DNA sequence of the human genome in 15 years, represents the largest biological experiment ever conducted. This set of information will contain the blueprint for the construction and operation of a human being. While the primary driving force behind the genome project is the potential to vastly expand the amount of genetic information available for biomedical research, the ramifications for other fields of study in biological research, the biotechnology and pharmaceutical industry, our understanding of evolution, effects on agriculture, and implications for bioethics are likely to be profound.
Rübben, Albert; Nordhoff, Ole
2013-01-01
Summary Most clinically distinguishable malignant tumors are characterized by specific mutations, specific patterns of chromosomal rearrangements and a predominant mechanism of genetic instability but it remains unsolved whether modifications of cancer genomes can be explained solely by mutations and selection through the cancer microenvironment. It has been suggested that internal dynamics of genomic modifications as opposed to the external evolutionary forces have a significant and complex impact on Darwinian species evolution. A similar situation can be expected for somatic cancer evolution as molecular key mechanisms encountered in species evolution also constitute prevalent mutation mechanisms in human cancers. This assumption is developed into a systems approach of carcinogenesis which focuses on possible inner constraints of the genome architecture on lineage selection during somatic cancer evolution. The proposed systems approach can be considered an analogy to the concept of evolvability in species evolution. The principal hypothesis is that permissive or restrictive effects of the genome architecture on lineage selection during somatic cancer evolution exist and have a measurable impact. The systems approach postulates three classes of lineage selection effects of the genome architecture on somatic cancer evolution: i) effects mediated by changes of fitness of cells of cancer lineage, ii) effects mediated by changes of mutation probabilities and iii) effects mediated by changes of gene designation and physical and functional genome redundancy. Physical genome redundancy is the copy number of identical genetic sequences. Functional genome redundancy of a gene or a regulatory element is defined as the number of different genetic elements, regardless of copy number, coding for the same specific biological function within a cancer cell. Complex interactions of the genome architecture on lineage selection may be expected when modifications of the genome architecture have multiple and possibly opposed effects which manifest themselves at disparate times and progression stages. Dissection of putative mechanisms mediating constraints exerted by the genome architecture on somatic cancer evolution may provide an algorithm for understanding and predicting as well as modifying somatic cancer evolution in individual patients. PMID:23336076
The Genome Sequence of Taurine Cattle: A window to ruminant biology and evolution
Elsik, Christine G.; Tellam, Ross L.; Worley, Kim C.
2010-01-01
To understand the biology and evolution of ruminants, the cattle genome was sequenced to ∼7× coverage. The cattle genome contains a minimum of 22,000 genes, with a core set of 14,345 orthologs shared among seven mammalian species of which 1,217 are absent or undetected in non-eutherian (marsupial or monotreme) genomes. Cattle-specific evolutionary breakpoint regions in chromosomes have a higher density of segmental duplications, enrichment of repetitive elements, and species-specific variations in genes associated with lactation and immune responsiveness. Genes involved in metabolism are generally highly conserved, although five metabolic genes are deleted or extensively diverged from their human orthologs. The cattle genome sequence thus provides an enabling resource for understanding mammalian evolution and accelerating livestock genetic improvement for milk and meat production. PMID:19390049
Identification of cis-suppression of human disease mutations by comparative genomics
Jordan, Daniel M.; Frangakis, Stephan G.; Golzio, Christelle; Cassa, Christopher A.; Kurtzberg, Joanne; Davis, Erica E.; Sunyaev, Shamil R.; Katsanis, Nicholas
2015-01-01
Patterns of amino acid conservation have served as a tool for understanding protein evolution1. The same principles have also found broad application in human genomics, driven by the need to interpret the pathogenic potential of variants in patients2. Here we performed a systematic comparative genomics analysis of human disease-causing missense variants. We found that an appreciable fraction of disease-causing alleles are fixed in the genomes of other species, suggesting a role for genomic context. We developed a model of genetic interactions that predicts most of these to be simple pairwise compensations. Functional testing of this model on two known human disease genes3,4 revealed discrete cis amino acid residues that, although benign on their own, could rescue the human mutations in vivo. This approach was also applied to ab initio gene discovery to support the identification of a de novo disease driver in BTG2 that is subject to protective cis-modification in more than 50 species. Finally, on the basis of our data and models, we developed a computational tool to predict candidate residues subject to compensation. Taken together, our data highlight the importance of cis-genomic context as a contributor to protein evolution; they provide an insight into the complexity of allele effect on phenotype; and they are likely to assist methods for predicting allele pathogenicity5,6. PMID:26123021
Chintalapati, Manjusha; Dannemann, Michael; Prüfer, Kay
2017-08-04
Small insertions and deletions occur in humans at a lower rate compared to nucleotide changes, but evolve under more constraint than nucleotide changes. While the evolution of insertions and deletions have been investigated using ape outgroups, the now available genome of a Neandertal can shed light on the evolution of indels in more recent times. We used the Neandertal genome together with several primate outgroup genomes to differentiate between human insertion/deletion changes that likely occurred before the split from Neandertals and those that likely arose later. Changes that pre-date the split from Neandertals show a smaller proportion of deletions than those that occurred later. The presence of a Neandertal-shared allele in Europeans or Asians but the absence in Africans was used to detect putatively introgressed indels in Europeans and Asians. A larger proportion of these variants reside in intergenic regions compared to other modern human variants, and some variants are linked to SNPs that have been associated with traits in modern humans. Our results are in agreement with earlier results that suggested that deletions evolve under more constraint than insertions. When considering Neandertal introgressed variants, we find some evidence that negative selection affected these variants more than other variants segregating in modern humans. Among introgressed variants we also identify indels that may influence the phenotype of their carriers. In particular an introgressed deletion associated with a decrease in the time to menarche may constitute an example of a former Neandertal-specific trait contributing to modern human phenotypic diversity.
DOE Office of Scientific and Technical Information (OSTI.GOV)
Schulman, Al
2009-08-09
Three subfamilies of grasses, the Erhardtoideae (rice), the Panicoideae (maize, sorghum, sugar cane and millet), and the Pooideae (wheat, barley and cool season forage grasses) provide the basis of human nutrition and are poised to become major sources of renewable energy. Here we describe the complete genome sequence of the wild grass Brachypodium distachyon (Brachypodium), the first member of the Pooideae subfamily to be completely sequenced. Comparison of the Brachypodium, rice and sorghum genomes reveals a precise sequence- based history of genome evolution across a broad diversity of the grass family and identifies nested insertions of whole chromosomes into centromericmore » regions as a predominant mechanism driving chromosome evolution in the grasses. The relatively compact genome of Brachypodium is maintained by a balance of retroelement replication and loss. The complete genome sequence of Brachypodium, coupled to its exceptional promise as a model system for grass research, will support the development of new energy and food crops« less
Short template switch events explain mutation clusters in the human genome.
Löytynoja, Ari; Goldman, Nick
2017-06-01
Resequencing efforts are uncovering the extent of genetic variation in humans and provide data to study the evolutionary processes shaping our genome. One recurring puzzle in both intra- and inter-species studies is the high frequency of complex mutations comprising multiple nearby base substitutions or insertion-deletions. We devised a generalized mutation model of template switching during replication that extends existing models of genome rearrangement and used this to study the role of template switch events in the origin of short mutation clusters. Applied to the human genome, our model detects thousands of template switch events during the evolution of human and chimp from their common ancestor and hundreds of events between two independently sequenced human genomes. Although many of these are consistent with a template switch mechanism previously proposed for bacteria, our model also identifies new types of mutations that create short inversions, some flanked by paired inverted repeats. The local template switch process can create numerous complex mutation patterns, including hairpin loop structures, and explains multinucleotide mutations and compensatory substitutions without invoking positive selection, speculative mechanisms, or implausible coincidence. Clustered sequence differences are challenging for current mapping and variant calling methods, and we show that many erroneous variant annotations exist in human reference data. Local template switch events may have been neglected as an explanation for complex mutations because of biases in commonly used analyses. Incorporation of our model into reference-based analysis pipelines and comparisons of de novo assembled genomes will lead to improved understanding of genome variation and evolution. © 2017 Löytynoja and Goldman; Published by Cold Spring Harbor Laboratory Press.
Wu, Qizhu; Yin, Ye; Zhou, Huanmin
2014-01-01
Mongolians have played a significant role in modern human evolution, especially after the rise of Genghis Khan (1162[?]–1227). Although the social cultural impacts of Genghis Khan and the Mongolian population have been well documented, explorations of their genome structure and genetic imprints on other human populations have been lacking. We here present the genome of a Mongolian male individual. The genome was de novo assembled using a total of 130.8-fold genomic data produced from massively parallel whole-genome sequencing. We identified high-confidence variation sets, including 3.7 million single nucleotide polymorphisms (SNPs) and 756,234 short insertions and deletions. Functional SNP analysis predicted that the individual has a pathogenic risk for carnitine deficiency. We located the patrilineal inheritance of the Mongolian genome to the lineage D3a through Y haplogroup analysis and inferred that the individual has a common patrilineal ancestor with Tibeto-Burman populations and is likely to be the progeny of the earliest settlers in East Asia. We finally investigated the genetic imprints of Mongolians on other human populations using different approaches. We found varying degrees of gene flows between Mongolians and populations living in Europe, South/Central Asia, and the Indian subcontinent. The analyses demonstrate that the genetic impacts of Mongolians likely resulted from the expansion of the Mongolian Empire in the 13th century. The genome will be of great help in further explorations of modern human evolution and genetic causes of diseases/traits specific to Mongolians. PMID:25377941
USDA-ARS?s Scientific Manuscript database
We report a chromosome-scale assembly and analysis of the Daucus carota genome, an important source of provitamin A in the human diet and the first sequenced genome among members of the Euasterid II clade. We characterized two new polyploidization events, both occurring after the divergence of carro...
Full-Genome Characterization and Genetic Evolution of West African Isolates of Bagaza Virus
Faye, Oumar; Diagne, Moussa Moise; Fall, Gamou; Sembene, Mbacke; Sall, Amadou Alpha; Faye, Ousmane
2018-01-01
Bagaza virus is a mosquito-borne flavivirus, first isolated in 1966 in Central African Republic. It has currently been identified in mosquito pools collected in the field in West and Central Africa. Emergence in wild birds in Europe and serological evidence in encephalitis patients in India raise questions on its genetic evolution and the diversity of isolates circulating in Africa. To better understand genetic diversity and evolution of Bagaza virus, we describe the full-genome characterization of 11 West African isolates, sampled from 1988 to 2014. Parameters such as genetic distances, N-glycosylation patterns, recombination events, selective pressures, and its codon adaptation to human genes are assessed. Our study is noteworthy for the observation of N-glycosylation and recombination in Bagaza virus and provides insight into its Indian origin from the 13th century. Interestingly, evidence of Bagaza virus codon adaptation to human house-keeping genes is also observed to be higher than those of other flaviviruses well known in human infections. Genetic variations on genome of West African Bagaza virus could play an important role in generating diversity and may promote Bagaza virus adaptation to other vertebrates and become an important threat in human health. PMID:29652824
Full-Genome Characterization and Genetic Evolution of West African Isolates of Bagaza Virus.
Faye, Martin; Faye, Oumar; Diagne, Moussa Moise; Fall, Gamou; Weidmann, Manfred; Sembene, Mbacke; Sall, Amadou Alpha; Faye, Ousmane
2018-04-13
Bagaza virus is a mosquito-borne flavivirus, first isolated in 1966 in Central African Republic. It has currently been identified in mosquito pools collected in the field in West and Central Africa. Emergence in wild birds in Europe and serological evidence in encephalitis patients in India raise questions on its genetic evolution and the diversity of isolates circulating in Africa. To better understand genetic diversity and evolution of Bagaza virus, we describe the full-genome characterization of 11 West African isolates, sampled from 1988 to 2014. Parameters such as genetic distances, N-glycosylation patterns, recombination events, selective pressures, and its codon adaptation to human genes are assessed. Our study is noteworthy for the observation of N-glycosylation and recombination in Bagaza virus and provides insight into its Indian origin from the 13th century. Interestingly, evidence of Bagaza virus codon adaptation to human house-keeping genes is also observed to be higher than those of other flaviviruses well known in human infections. Genetic variations on genome of West African Bagaza virus could play an important role in generating diversity and may promote Bagaza virus adaptation to other vertebrates and become an important threat in human health.
Amores, Angel; Catchen, Julian; Ferrara, Allyse; Fontenot, Quenton; Postlethwait, John H.
2011-01-01
Genomic resources for hundreds of species of evolutionary, agricultural, economic, and medical importance are unavailable due to the expense of well-assembled genome sequences and difficulties with multigenerational studies. Teleost fish provide many models for human disease but possess anciently duplicated genomes that sometimes obfuscate connectivity. Genomic information representing a fish lineage that diverged before the teleost genome duplication (TGD) would provide an outgroup for exploring the mechanisms of evolution after whole-genome duplication. We exploited massively parallel DNA sequencing to develop meiotic maps with thrift and speed by genotyping F1 offspring of a single female and a single male spotted gar (Lepisosteus oculatus) collected directly from nature utilizing only polymorphisms existing in these two wild individuals. Using Stacks, software that automates the calling of genotypes from polymorphisms assayed by Illumina sequencing, we constructed a map containing 8406 markers. RNA-seq on two map-cross larvae provided a reference transcriptome that identified nearly 1000 mapped protein-coding markers and allowed genome-wide analysis of conserved synteny. Results showed that the gar lineage diverged from teleosts before the TGD and its genome is organized more similarly to that of humans than teleosts. Thus, spotted gar provides a critical link between medical models in teleost fish, to which gar is biologically similar, and humans, to which gar is genomically similar. Application of our F1 dense mapping strategy to species with no prior genome information promises to facilitate comparative genomics and provide a scaffold for ordering the numerous contigs arising from next generation genome sequencing. PMID:21828280
Lesecque, Yann; Glémin, Sylvain; Lartillot, Nicolas; Mouchiroud, Dominique; Duret, Laurent
2014-01-01
Recombination is an essential process in eukaryotes, which increases diversity by disrupting genetic linkage between loci and ensures the proper segregation of chromosomes during meiosis. In the human genome, recombination events are clustered in hotspots, whose location is determined by the PRDM9 protein. There is evidence that the location of hotspots evolves rapidly, as a consequence of changes in PRDM9 DNA-binding domain. However, the reasons for these changes and the rate at which they occur are not known. In this study, we investigated the evolution of human hotspot loci and of PRDM9 target motifs, both in modern and archaic human lineages (Denisovan) to quantify the dynamic of hotspot turnover during the recent period of human evolution. We show that present-day human hotspots are young: they have been active only during the last 10% of the time since the divergence from chimpanzee, starting to be operating shortly before the split between Denisovans and modern humans. Surprisingly, however, our analyses indicate that Denisovan recombination hotspots did not overlap with modern human ones, despite sharing similar PRDM9 target motifs. We further show that high-affinity PRDM9 target motifs are subject to a strong self-destructive drive, known as biased gene conversion (BGC), which should lead to the loss of the majority of them in the next 3 MYR. This depletion of PRDM9 genomic targets is expected to decrease fitness, and thereby to favor new PRDM9 alleles binding different motifs. Our refined estimates of the age and life expectancy of human hotspots provide empirical evidence in support of the Red Queen hypothesis of recombination hotspots evolution. PMID:25393762
Lesecque, Yann; Glémin, Sylvain; Lartillot, Nicolas; Mouchiroud, Dominique; Duret, Laurent
2014-11-01
Recombination is an essential process in eukaryotes, which increases diversity by disrupting genetic linkage between loci and ensures the proper segregation of chromosomes during meiosis. In the human genome, recombination events are clustered in hotspots, whose location is determined by the PRDM9 protein. There is evidence that the location of hotspots evolves rapidly, as a consequence of changes in PRDM9 DNA-binding domain. However, the reasons for these changes and the rate at which they occur are not known. In this study, we investigated the evolution of human hotspot loci and of PRDM9 target motifs, both in modern and archaic human lineages (Denisovan) to quantify the dynamic of hotspot turnover during the recent period of human evolution. We show that present-day human hotspots are young: they have been active only during the last 10% of the time since the divergence from chimpanzee, starting to be operating shortly before the split between Denisovans and modern humans. Surprisingly, however, our analyses indicate that Denisovan recombination hotspots did not overlap with modern human ones, despite sharing similar PRDM9 target motifs. We further show that high-affinity PRDM9 target motifs are subject to a strong self-destructive drive, known as biased gene conversion (BGC), which should lead to the loss of the majority of them in the next 3 MYR. This depletion of PRDM9 genomic targets is expected to decrease fitness, and thereby to favor new PRDM9 alleles binding different motifs. Our refined estimates of the age and life expectancy of human hotspots provide empirical evidence in support of the Red Queen hypothesis of recombination hotspots evolution.
Genomics and the Ark: an ecocentric perspective on human history.
Zwart, Hub; Penders, Bart
2011-01-01
Views of ourselves in relationship to the rest of the biosphere are changing. Theocentric and anthropocentric perspectives are giving way to more ecocentric views on the history, present, and future of humankind. Novel sciences, such as genomics, have deepened and broadened our understanding of the process of anthropogenesis, the coming into being of humans. Genomics suggests that early human history must be regarded as a complex narrative of evolving ecosystems, in which human evolution both influenced and was influenced by the evolution of companion species. During the agricultural revolution, human beings designed small-scale artificial ecosystems or evolutionary "Arks," in which networks of plants, animals, and microorganisms coevolved. Currently, our attitude towards this process seems subject to a paradoxical reversal. The boundaries of the Ark have dramatically broadened, and genomics is not only being used to increase our understanding of our ecological past, but may also help us to conserve, reconstruct, or even revivify species and ecosystems to whose degradation or (near) extinction we have contributed. This article explores the role of genomics in the elaboration of a more ecocentric view of ourselves with the help of two examples, namely the renaissance of Paleolithic diets and of Pleistocene parks. It argues that an understanding of the world in ecocentric terms requires new partnerships and mutually beneficial forms of collaboration and convergence between life sciences, social sciences, and the humanities.
Human evolution: a tale from ancient genomes
2017-01-01
The field of human ancient DNA (aDNA) has moved from mitochondrial sequencing that suffered from contamination and provided limited biological insights, to become a fully genomic discipline that is changing our conception of human history. Recent successes include the sequencing of extinct hominins, and true population genomic studies of Bronze Age populations. Among the emerging areas of aDNA research, the analysis of past epigenomes is set to provide more new insights into human adaptation and disease susceptibility through time. Starting as a mere curiosity, ancient human genetics has become a major player in the understanding of our evolutionary history. This article is part of the themed issue ‘Evo-devo in the genomics era, and the origins of morphological diversity’. PMID:27994125
Meta genome-wide network from functional linkages of genes in human gut microbial ecosystems.
Ji, Yan; Shi, Yixiang; Wang, Chuan; Dai, Jianliang; Li, Yixue
2013-03-01
The human gut microbial ecosystem (HGME) exerts an important influence on the human health. In recent researches, meta-genomics provided deep insights into the HGME in terms of gene contents, metabolic processes and genome constitutions of meta-genome. Here we present a novel methodology to investigate the HGME on the basis of a set of functionally coupled genes regardless of their genome origins when considering the co-evolution properties of genes. By analyzing these coupled genes, we showed some basic properties of HGME significantly associated with each other, and further constructed a protein interaction map of human gut meta-genome to discover some functional modules that may relate with essential metabolic processes. Compared with other studies, our method provides a new idea to extract basic function elements from meta-genome systems and investigate complex microbial environment by associating its biological traits with co-evolutionary fingerprints encoded in it.
Holden, Matthew T. G.; Hauser, Heidi; Sanders, Mandy; Ngo, Thi Hoa; Cherevach, Inna; Cronin, Ann; Goodhead, Ian; Mungall, Karen; Quail, Michael A.; Price, Claire; Rabbinowitsch, Ester; Sharp, Sarah; Croucher, Nicholas J.; Chieu, Tran Bich; Thi Hoang Mai, Nguyen; Diep, To Song; Chinh, Nguyen Tran; Kehoe, Michael; Leigh, James A.; Ward, Philip N.; Dowson, Christopher G.; Whatmore, Adrian M.; Chanter, Neil; Iversen, Pernille; Gottschalk, Marcelo; Slater, Josh D.; Smith, Hilde E.; Spratt, Brian G.; Xu, Jianguo; Ye, Changyun; Bentley, Stephen; Barrell, Barclay G.; Schultsz, Constance; Maskell, Duncan J.; Parkhill, Julian
2009-01-01
Background Streptococcus suis is a zoonotic pathogen that infects pigs and can occasionally cause serious infections in humans. S. suis infections occur sporadically in human Europe and North America, but a recent major outbreak has been described in China with high levels of mortality. The mechanisms of S. suis pathogenesis in humans and pigs are poorly understood. Methodology/Principal Findings The sequencing of whole genomes of S. suis isolates provides opportunities to investigate the genetic basis of infection. Here we describe whole genome sequences of three S. suis strains from the same lineage: one from European pigs, and two from human cases from China and Vietnam. Comparative genomic analysis was used to investigate the variability of these strains. S. suis is phylogenetically distinct from other Streptococcus species for which genome sequences are currently available. Accordingly, ∼40% of the ∼2 Mb genome is unique in comparison to other Streptococcus species. Finer genomic comparisons within the species showed a high level of sequence conservation; virtually all of the genome is common to the S. suis strains. The only exceptions are three ∼90 kb regions, present in the two isolates from humans, composed of integrative conjugative elements and transposons. Carried in these regions are coding sequences associated with drug resistance. In addition, small-scale sequence variation has generated pseudogenes in putative virulence and colonization factors. Conclusions/Significance The genomic inventories of genetically related S. suis strains, isolated from distinct hosts and diseases, exhibit high levels of conservation. However, the genomes provide evidence that horizontal gene transfer has contributed to the evolution of drug resistance. PMID:19603075
Pang, Erli; Wu, Xiaomei; Lin, Kui
2016-06-01
Protein evolution plays an important role in the evolution of each genome. Because of their functional nature, in general, most of their parts or sites are differently constrained selectively, particularly by purifying selection. Most previous studies on protein evolution considered individual proteins in their entirety or compared protein-coding sequences with non-coding sequences. Less attention has been paid to the evolution of different parts within each protein of a given genome. To this end, based on PfamA annotation of all human proteins, each protein sequence can be split into two parts: domains or unassigned regions. Using this rationale, single nucleotide polymorphisms (SNPs) in protein-coding sequences from the 1000 Genomes Project were mapped according to two classifications: SNPs occurring within protein domains and those within unassigned regions. With these classifications, we found: the density of synonymous SNPs within domains is significantly greater than that of synonymous SNPs within unassigned regions; however, the density of non-synonymous SNPs shows the opposite pattern. We also found there are signatures of purifying selection on both the domain and unassigned regions. Furthermore, the selective strength on domains is significantly greater than that on unassigned regions. In addition, among all of the human protein sequences, there are 117 PfamA domains in which no SNPs are found. Our results highlight an important aspect of protein domains and may contribute to our understanding of protein evolution.
Recently evolved human-specific methylated regions are enriched in schizophrenia signals.
Banerjee, Niladri; Polushina, Tatiana; Bettella, Francesco; Giddaluru, Sudheer; Steen, Vidar M; Andreassen, Ole A; Le Hellard, Stephanie
2018-05-11
One explanation for the persistence of schizophrenia despite the reduced fertility of patients is that it is a by-product of recent human evolution. This hypothesis is supported by evidence suggesting that recently-evolved genomic regions in humans are involved in the genetic risk for schizophrenia. Using summary statistics from genome-wide association studies (GWAS) of schizophrenia and 11 other phenotypes, we tested for enrichment of association with GWAS traits in regions that have undergone methylation changes in the human lineage compared to Neanderthals and Denisovans, i.e. human-specific differentially methylated regions (DMRs). We used analytical tools that evaluate polygenic enrichment of a subset of genomic variants against all variants. Schizophrenia was the only trait in which DMR SNPs showed clear enrichment of association that passed the genome-wide significance threshold. The enrichment was not observed for Neanderthal or Denisovan DMRs. The enrichment seen in human DMRs is comparable to that for genomic regions tagged by Neanderthal Selective Sweep markers, and stronger than that for Human Accelerated Regions. The enrichment survives multiple testing performed through permutation (n = 10,000) and bootstrapping (n = 5000) in INRICH (p < 0.01). Some enrichment of association with height was observed at the gene level. Regions where DNA methylation modifications have changed during recent human evolution show enrichment of association with schizophrenia and possibly with height. Our study further supports the hypothesis that genetic variants conferring risk of schizophrenia co-occur in genomic regions that have changed as the human species evolved. Since methylation is an epigenetic mark, potentially mediated by environmental changes, our results also suggest that interaction with the environment might have contributed to that association.
Genomic signatures of diet-related shifts during human origins
Babbitt, Courtney C.; Warner, Lisa R.; Fedrigo, Olivier; Wall, Christine E.; Wray, Gregory A.
2011-01-01
There are numerous anthropological analyses concerning the importance of diet during human evolution. Diet is thought to have had a profound influence on the human phenotype, and dietary differences have been hypothesized to contribute to the dramatic morphological changes seen in modern humans as compared with non-human primates. Here, we attempt to integrate the results of new genomic studies within this well-developed anthropological context. We then review the current evidence for adaptation related to diet, both at the level of sequence changes and gene expression. Finally, we propose some ways in which new technologies can help identify specific genomic adaptations that have resulted in metabolic and morphological differences between humans and non-human primates. PMID:21177690
Molecular Epidemiology and Genomics of Group A Streptococcus
Bessen, Debra E.; McShan, W. Michael; Nguyen, Scott V.; Shetty, Amol; Agrawal, Sonia; Tettelin, Hervé
2014-01-01
Streptococcus pyogenes (group A streptococcus; GAS) is a strict human pathogen with a very high prevalence worldwide. This review highlights the genetic organization of the species and the important ecological considerations that impact its evolution. Recent advances are presented on the topics of molecular epidemiology, population biology, molecular basis for genetic change, genome structure and genetic flux, phylogenomics and closely related streptococcal species, and the long- and short-term evolution of GAS. The application of whole genome sequence data to addressing key biological questions is discussed. PMID:25460818
Chalopin, Domitille; Naville, Magali; Plard, Floriane; Galiana, Delphine; Volff, Jean-Nicolas
2015-01-09
Transposable elements (TEs) are major components of vertebrate genomes, with major roles in genome architecture and evolution. In order to characterize both common patterns and lineage-specific differences in TE content and TE evolution, we have compared the mobilomes of 23 vertebrate genomes, including 10 actinopterygian fish, 11 sarcopterygians, and 2 nonbony vertebrates. We found important variations in TE content (from 6% in the pufferfish tetraodon to 55% in zebrafish), with a more important relative contribution of TEs to genome size in fish than in mammals. Some TE superfamilies were found to be widespread in vertebrates, but most elements showed a more patchy distribution, indicative of multiple events of loss or gain. Interestingly, loss of major TE families was observed during the evolution of the sarcopterygian lineage, with a particularly strong reduction in TE diversity in birds and mammals. Phylogenetic trends in TE composition and activity were detected: Teleost fish genomes are dominated by DNA transposons and contain few ancient TE copies, while mammalian genomes have been predominantly shaped by nonlong terminal repeat retrotransposons, along with the persistence of older sequences. Differences were also found within lineages: The medaka fish genome underwent more recent TE amplification than the related platyfish, as observed for LINE retrotransposons in the mouse compared with the human genome. This study allows the identification of putative cases of horizontal transfer of TEs, and to tentatively infer the composition of the ancestral vertebrate mobilome. Taken together, the results obtained highlight the importance of TEs in the structure and evolution of vertebrate genomes, and demonstrate their major impact on genome diversity both between and within lineages. © The Author(s) 2015. Published by Oxford University Press on behalf of the Society for Molecular Biology and Evolution.
The Evolution of Campylobacter jejuni and Campylobacter coli
Sheppard, Samuel K.; Maiden, Martin C.J.
2015-01-01
The global significance of Campylobacter jejuni and Campylobacter coli as gastrointestinal human pathogens has motivated numerous studies to characterize their population biology and evolution. These bacteria are a common component of the intestinal microbiota of numerous bird and mammal species and cause disease in humans, typically via consumption of contaminated meat products, especially poultry meat. Sequence-based molecular typing methods, such as multilocus sequence typing (MLST) and whole genome sequencing (WGS), have been instructive for understanding the epidemiology and evolution of these bacteria and how phenotypic variation relates to the high degree of genetic structuring in C. coli and C. jejuni populations. Here, we describe aspects of the relatively short history of coevolution between humans and pathogenic Campylobacter, by reviewing research investigating how mutation and lateral or horizontal gene transfer (LGT or HGT, respectively) interact to create the observed population structure. These genetic changes occur in a complex fitness landscape with divergent ecologies, including multiple host species, which can lead to rapid adaptation, for example, through frame-shift mutations that alter gene expression or the acquisition of novel genetic elements by HGT. Recombination is a particularly strong evolutionary force in Campylobacter, leading to the emergence of new lineages and even large-scale genome-wide interspecies introgression between C. jejuni and C. coli. The increasing availability of large genome datasets is enhancing understanding of Campylobacter evolution through the application of methods, such as genome-wide association studies, but MLST-derived clonal complex designations remain a useful method for describing population structure. PMID:26101080
Schrider, Daniel R; Kern, Andrew D
2014-06-09
Identifying the complete set of functional elements within the human genome would be a windfall for multiple areas of biological research including medicine, molecular biology, and evolution. Complete knowledge of function would aid in the prioritization of loci when searching for the genetic bases of disease or adaptive phenotypes. Because mutations that disrupt function are disfavored by natural selection, purifying selection leaves a detectable signature within functional elements; accordingly, this signal has been exploited for over a decade through the use of genomic comparisons of distantly related species. While this is so, the functional complement of the genome changes extensively across time and between lineages; therefore, evidence of the current action of purifying selection in humans is essential. Because the removal of deleterious mutations by natural selection also reduces within-species genetic diversity within functional loci, dense population genetic data have the potential to reveal genomic elements that are currently functional. Here, we assess the potential of this approach by examining an ultradeep sample of human mitochondrial genomes (n = 16,411). We show that the high density of polymorphism in this data set precisely delineates regions experiencing purifying selection. Furthermore, we show that the number of segregating alleles at a site is strongly correlated with its divergence across species after accounting for known mutational biases in human mitochondrial DNA (ρ = 0.51; P < 2.2 × 10(-16)). These two measures track one another at a remarkably fine scale across many loci-a correlation that is purely the result of natural selection. Our results demonstrate that genetic variation has the potential to reveal with surprising precision which regions in the genome are currently performing important functions and likely to have deleterious fitness effects when mutated. As more complete human genomes are sequenced, similar power to reveal purifying selection may be achievable in the human nuclear genome. © The Author(s) 2014. Published by Oxford University Press on behalf of the Society for Molecular Biology and Evolution.
Bioinformatics in high school biology curricula: a study of state science standards.
Wefer, Stephen H; Sheppard, Keith
2008-01-01
The proliferation of bioinformatics in modern biology marks a modern revolution in science that promises to influence science education at all levels. This study analyzed secondary school science standards of 49 U.S. states (Iowa has no science framework) and the District of Columbia for content related to bioinformatics. The bioinformatics content of each state's biology standards was analyzed and categorized into nine areas: Human Genome Project/genomics, forensics, evolution, classification, nucleotide variations, medicine, computer use, agriculture/food technology, and science technology and society/socioscientific issues. Findings indicated a generally low representation of bioinformatics-related content, which varied substantially across the different areas, with Human Genome Project/genomics and computer use being the lowest (8%), and evolution being the highest (64%) among states' science frameworks. This essay concludes with recommendations for reworking/rewording existing standards to facilitate the goal of promoting science literacy among secondary school students.
Bioinformatics in High School Biology Curricula: A Study of State Science Standards
Sheppard, Keith
2008-01-01
The proliferation of bioinformatics in modern biology marks a modern revolution in science that promises to influence science education at all levels. This study analyzed secondary school science standards of 49 U.S. states (Iowa has no science framework) and the District of Columbia for content related to bioinformatics. The bioinformatics content of each state's biology standards was analyzed and categorized into nine areas: Human Genome Project/genomics, forensics, evolution, classification, nucleotide variations, medicine, computer use, agriculture/food technology, and science technology and society/socioscientific issues. Findings indicated a generally low representation of bioinformatics-related content, which varied substantially across the different areas, with Human Genome Project/genomics and computer use being the lowest (8%), and evolution being the highest (64%) among states' science frameworks. This essay concludes with recommendations for reworking/rewording existing standards to facilitate the goal of promoting science literacy among secondary school students. PMID:18316818
Analyses of pig genomes provide insight into porcine demography and evolution
Groenen, Martien A. M.; Archibald, Alan L.; Uenishi, Hirohide; Tuggle, Christopher K.; Takeuchi, Yasuhiro; Rothschild, Max F.; Rogel-Gaillard, Claire; Park, Chankyu; Milan, Denis; Megens, Hendrik-Jan; Li, Shengting; Larkin, Denis M.; Kim, Heebal; Frantz, Laurent A. F.; Caccamo, Mario; Ahn, Hyeonju; Aken, Bronwen L.; Anselmo, Anna; Anthon, Christian; Auvil, Loretta; Badaoui, Bouabid; Beattie, Craig W.; Bendixen, Christian; Berman, Daniel; Blecha, Frank; Blomberg, Jonas; Bolund, Lars; Bosse, Mirte; Botti, Sara; Bujie, Zhan; Bystrom, Megan; Capitanu, Boris; Silva, Denise Carvalho; Chardon, Patrick; Chen, Celine; Cheng, Ryan; Choi, Sang-Haeng; Chow, William; Clark, Richard C.; Clee, Christopher; Crooijmans, Richard P. M. A.; Dawson, Harry D.; Dehais, Patrice; De Sapio, Fioravante; Dibbits, Bert; Drou, Nizar; Du, Zhi-Qiang; Eversole, Kellye; Fadista, João; Fairley, Susan; Faraut, Thomas; Faulkner, Geoffrey J.; Fowler, Katie E.; Fredholm, Merete; Fritz, Eric; Gilbert, James G. R.; Giuffra, Elisabetta; Gorodkin, Jan; Griffin, Darren K.; Harrow, Jennifer L.; Hayward, Alexander; Howe, Kerstin; Hu, Zhi-Liang; Humphray, Sean J.; Hunt, Toby; Hornshøj, Henrik; Jeon, Jin-Tae; Jern, Patric; Jones, Matthew; Jurka, Jerzy; Kanamori, Hiroyuki; Kapetanovic, Ronan; Kim, Jaebum; Kim, Jae-Hwan; Kim, Kyu-Won; Kim, Tae-Hun; Larson, Greger; Lee, Kyooyeol; Lee, Kyung-Tai; Leggett, Richard; Lewin, Harris A.; Li, Yingrui; Liu, Wansheng; Loveland, Jane E.; Lu, Yao; Lunney, Joan K.; Ma, Jian; Madsen, Ole; Mann, Katherine; Matthews, Lucy; McLaren, Stuart; Morozumi, Takeya; Murtaugh, Michael P.; Narayan, Jitendra; Nguyen, Dinh Truong; Ni, Peixiang; Oh, Song-Jung; Onteru, Suneel; Panitz, Frank; Park, Eung-Woo; Park, Hong-Seog; Pascal, Geraldine; Paudel, Yogesh; Perez-Enciso, Miguel; Ramirez-Gonzalez, Ricardo; Reecy, James M.; Zas, Sandra Rodriguez; Rohrer, Gary A.; Rund, Lauretta; Sang, Yongming; Schachtschneider, Kyle; Schraiber, Joshua G.; Schwartz, John; Scobie, Linda; Scott, Carol; Searle, Stephen; Servin, Bertrand; Southey, Bruce R.; Sperber, Goran; Stadler, Peter; Sweedler, Jonathan V.; Tafer, Hakim; Thomsen, Bo; Wali, Rashmi; Wang, Jian; Wang, Jun; White, Simon; Xu, Xun; Yerle, Martine; Zhang, Guojie; Zhang, Jianguo; Zhang, Jie; Zhao, Shuhong; Rogers, Jane; Churcher, Carol; Schook, Lawrence B.
2013-01-01
For 10,000 years pigs and humans have shared a close and complex relationship. From domestication to modern breeding practices, humans have shaped the genomes of domestic pigs. Here we present the assembly and analysis of the genome sequence of a female domestic Duroc pig (Sus scrofa) and a comparison with the genomes of wild and domestic pigs from Europe and Asia. Wild pigs emerged in South East Asia and subsequently spread across Eurasia. Our results reveal a deep phylogenetic split between European and Asian wild boars ~1 million years ago, and a selective sweep analysis indicates selection on genes involved in RNA processing and regulation. Genes associated with immune response and olfaction exhibit fast evolution. Pigs have the largest repertoire of functional olfactory receptor genes, reflecting the importance of smell in this scavenging animal. The pig genome sequence provides an important resource for further improvements of this important livestock species, and our identification of many putative disease-causing variants extends the potential of the pig as a biomedical model. PMID:23151582
Austerlitz, Frédéric; Heyer, Evelyne
2018-06-01
Here, we present a synthetic view on how Kimura's Neutral theory has helped us gaining insight on the different evolutionary forces that shape human evolution. We put this perspective in the frame of recent emerging challenges: the use of whole genome data for reconstructing population histories, natural selection on complex polygenic traits, and integrating cultural processes in human evolution.
Westgeest, Kim B; Russell, Colin A; Lin, Xudong; Spronken, Monique I J; Bestebroer, Theo M; Bahl, Justin; van Beek, Ruud; Skepner, Eugene; Halpin, Rebecca A; de Jong, Jan C; Rimmelzwaan, Guus F; Osterhaus, Albert D M E; Smith, Derek J; Wentworth, David E; Fouchier, Ron A M; de Graaf, Miranda
2014-03-01
Influenza A(H3N2) viruses became widespread in humans during the 1968 H3N2 virus pandemic and have been a major cause of influenza epidemics ever since. These viruses evolve continuously by reassortment and genomic evolution. Antigenic drift is the cause for the need to update influenza vaccines frequently. Using two data sets that span the entire period of circulation of human influenza A(H3N2) viruses, it was shown that influenza A(H3N2) virus evolution can be mapped to 13 antigenic clusters. Here we analyzed the full genomes of 286 influenza A(H3N2) viruses from these two data sets to investigate the genomic evolution and reassortment patterns. Numerous reassortment events were found, scattered over the entire period of virus circulation, but most prominently in viruses circulating between 1991 and 1998. Some of these reassortment events persisted over time, and one of these coincided with an antigenic cluster transition. Furthermore, selection pressures and nucleotide and amino acid substitution rates of all proteins were studied, including those of the recently discovered PB1-N40, PA-X, PA-N155, and PA-N182 proteins. Rates of nucleotide and amino acid substitutions were most pronounced for the hemagglutinin, neuraminidase, and PB1-F2 proteins. Selection pressures were highest in hemagglutinin, neuraminidase, matrix 1, and nonstructural protein 1. This study of genotype in relation to antigenic phenotype throughout the period of circulation of human influenza A(H3N2) viruses leads to a better understanding of the evolution of these viruses. Each winter, influenza virus infects approximately 5 to 15% of the world's population, resulting in significant morbidity and mortality. Influenza A(H3N2) viruses evolve continuously by reassortment and genomic evolution. This leads to changes in antigenic recognition (antigenic drift) which make it necessary to update vaccines against influenza A(H3N2) viruses frequently. In this study, the relationship of genetic evolution to antigenic change spanning the entire period of A(H3N2) virus circulation was studied for the first time. The results presented in this study contribute to a better understanding of genetic evolution in correlation with antigenic evolution of influenza A(H3N2) viruses.
Evolution of epigenetic regulation in vertebrate genomes
Lowdon, Rebecca F.; Jang, Hyo Sik; Wang, Ting
2016-01-01
Empirical models of sequence evolution have spurred progress in the field of evolutionary genetics for decades. We are now realizing the importance and complexity of the eukaryotic epigenome. While epigenome analysis has been applied to genomes from single cell eukaryotes to human, comparative analyses are still relatively few, and computational algorithms to quantify epigenome evolution remain scarce. Accordingly, a quantitative model of epigenome evolution remains to be established. Here we review the comparative epigenomics literature and synthesize its overarching themes. We also suggest one mechanism, transcription factor binding site turnover, which relates sequence evolution to epigenetic conservation or divergence. Lastly, we propose a framework for how the field can move forward to build a coherent quantitative model of epigenome evolution. PMID:27080453
González-Peñas, Javier; Arrojo, Manuel; Paz, Eduardo; Brenlla, Julio; Páramo, Mario; Costas, Javier
2015-10-01
Schizophrenia may be considered a human-specific disorder arisen as a maladaptive by-product of human-specific brain evolution. Therefore, genetic variants involved in susceptibility to schizophrenia may be identified among those genes related to acquisition of human-specific traits. NPAS3, a transcription factor involved in central nervous system development and neurogenesis, seems to be implicated in the evolution of human brain, as it is the human gene with most human-specific accelerated elements (HAEs), i.e., .mammalian conserved regulatory sequences with accelerated evolution in the lineage leading to humans after human-chimpanzee split. We hypothesize that any nucleotide variant at the NPAS3 HAEs may lead to altered susceptibility to schizophrenia. Twenty-one variants at these HAEs detected by the 1000 genomes Project, as well as five additional variants taken from psychiatric genome-wide association studies, were genotyped in 538 schizophrenic patients and 539 controls from Galicia. Analyses at the haplotype level or based on the cumulative role of the variants assuming different susceptibility models did not find any significant association in spite of enough power under several plausible scenarios regarding direction of effect and the specific role of rare and common variants. These results suggest that, contrary to our hypothesis, the special evolution of the NPAS3 HAEs in Homo relaxed the strong constraint on sequence that characterized these regions during mammalian evolution, allowing some sequence changes without any effect on schizophrenia risk. © 2015 Wiley Periodicals, Inc.
Recent coselection in human populations revealed by protein-protein interaction network.
Qian, Wei; Zhou, Hang; Tang, Kun
2014-12-21
Genome-wide scans for signals of natural selection in human populations have identified a large number of candidate loci that underlie local adaptations. This is surprising given the relatively short evolutionary time since the divergence of the human population. One hypothesis that has not been formally examined is whether and how the recent human evolution may have been shaped by coselection in the context of complex molecular interactome. In this study, genome-wide signals of selection were scanned in East Asians, Europeans, and Africans using 1000 Genome data, and subsequently mapped onto the protein-protein interaction (PPI) network. We found that the candidate genes of recent positive selection localized significantly closer to each other on the PPI network than expected, revealing substantial clustering of selected genes. Furthermore, gene pairs of shorter PPI network distances showed higher similarities of their recent evolutionary paths than those further apart. Last, subnetworks enriched with recent coselection signals were identified, which are substantially overrepresented in biological pathways related to signal transduction, neurogenesis, and immune function. These results provide the first genome-wide evidence for association of recent selection signals with the PPI network, shedding light on the potential mechanisms of recent coselection in the human genome. © The Author(s) 2014. Published by Oxford University Press on behalf of the Society for Molecular Biology and Evolution.
Chromosomal Rearrangements as Barriers to Genetic Homogenization between Archaic and Modern Humans
Rogers, Rebekah L.
2015-01-01
Chromosomal rearrangements, which shuffle DNA throughout the genome, are an important source of divergence across taxa. Using a paired-end read approach with Illumina sequence data for archaic humans, I identify changes in genome structure that occurred recently in human evolution. Hundreds of rearrangements indicate genomic trafficking between the sex chromosomes and autosomes, raising the possibility of sex-specific changes. Additionally, genes adjacent to genome structure changes in Neanderthals are associated with testis-specific expression, consistent with evolutionary theory that new genes commonly form with expression in the testes. I identify one case of new-gene creation through transposition from the Y chromosome to chromosome 10 that combines the 5′-end of the testis-specific gene Fank1 with previously untranscribed sequence. This new transcript experienced copy number expansion in archaic genomes, indicating rapid genomic change. Among rearrangements identified in Neanderthals, 13% are transposition of selfish genetic elements, whereas 32% appear to be ectopic exchange between repeats. In Denisovan, the pattern is similar but numbers are significantly higher with 18% of rearrangements reflecting transposition and 40% ectopic exchange between distantly related repeats. There is an excess of divergent rearrangements relative to polymorphism in Denisovan, which might result from nonuniform rates of mutation, possibly reflecting a burst of transposable element activity in the lineage that led to Denisovan. Finally, loci containing genome structure changes show diminished rates of introgression from Neanderthals into modern humans, consistent with the hypothesis that rearrangements serve as barriers to gene flow during hybridization. Together, these results suggest that this previously unidentified source of genomic variation has important biological consequences in human evolution. PMID:26399483
Ebola Virus Epidemiology, Transmission, and Evolution during Seven Months in Sierra Leone
Park, Daniel J.; Dudas, Gytis; Wohl, Shirlee; Goba, Augustine; Whitmer, Shannon L.M.; Andersen, Kristian G.; Sealfon, Rachel S.; Ladner, Jason T.; Kugelman, Jeffrey R.; Matranga, Christian B.; Winnicki, Sarah M.; Qu, James; Gire, Stephen K.; Gladden-Young, Adrianne; Jalloh, Simbirie; Nosamiefan, Dolo; Yozwiak, Nathan L.; Moses, Lina M.; Jiang, Pan-Pan; Lin, Aaron E.; Schaffner, Stephen F.; Bird, Brian; Towner, Jonathan; Mamoh, Mambu; Gbakie, Michael; Kanneh, Lansana; Kargbo, David; Massally, James L.B.; Kamara, Fatima K.; Konuwa, Edwin; Sellu, Josephine; Jalloh, Abdul A.; Mustapha, Ibrahim; Foday, Momoh; Yillah, Mohamed; Erickson, Bobbie R.; Sealy, Tara; Blau, Dianna; Paddock, Christopher; Brault, Aaron; Amman, Brian; Basile, Jane; Bearden, Scott; Belser, Jessica; Bergeron, Eric; Campbell, Shelley; Chakrabarti, Ayan; Dodd, Kimberly; Flint, Mike; Gibbons, Aridth; Goodman, Christin; Klena, John; McMullan, Laura; Morgan, Laura; Russell, Brandy; Salzer, Johanna; Sanchez, Angela; Wang, David; Jungreis, Irwin; Tomkins-Tinch, Christopher; Kislyuk, Andrey; Lin, Michael F.; Chapman, Sinead; MacInnis, Bronwyn; Matthews, Ashley; Bochicchio, James; Hensley, Lisa E.; Kuhn, Jens H.; Nusbaum, Chad; Schieffelin, John S.; Birren, Bruce W.; Forget, Marc; Nichol, Stuart T.; Palacios, Gustavo F.; Ndiaye, Daouda; Happi, Christian; Gevao, Sahr M.; Vandi, Mohamed A.; Kargbo, Brima; Holmes, Edward C.; Bedford, Trevor; Gnirke, Andreas; Ströher, Ute; Rambaut, Andrew; Garry, Robert F.; Sabeti, Pardis C.
2015-01-01
Summary The 2013–2015 Ebola virus disease (EVD) epidemic is caused by the Makona variant of Ebola virus (EBOV). Early in the epidemic, genome sequencing provided insights into virus evolution and transmission and offered important information for outbreak response. Here, we analyze sequences from 232 patients sampled over 7 months in Sierra Leone, along with 86 previously released genomes from earlier in the epidemic. We confirm sustained human-to-human transmission within Sierra Leone and find no evidence for import or export of EBOV across national borders after its initial introduction. Using high-depth replicate sequencing, we observe both host-to-host transmission and recurrent emergence of intrahost genetic variants. We trace the increasing impact of purifying selection in suppressing the accumulation of nonsynonymous mutations over time. Finally, we note changes in the mucin-like domain of EBOV glycoprotein that merit further investigation. These findings clarify the movement of EBOV within the region and describe viral evolution during prolonged human-to-human transmission. PMID:26091036
Transposable Elements in Human Cancer: Causes and Consequences of Deregulation.
Anwar, Sumadi Lukman; Wulaningsih, Wahyu; Lehmann, Ulrich
2017-05-04
Transposable elements (TEs) comprise nearly half of the human genome and play an essential role in the maintenance of genomic stability, chromosomal architecture, and transcriptional regulation. TEs are repetitive sequences consisting of RNA transposons, DNA transposons, and endogenous retroviruses that can invade the human genome with a substantial contribution in human evolution and genomic diversity. TEs are therefore firmly regulated from early embryonic development and during the entire course of human life by epigenetic mechanisms, in particular DNA methylation and histone modifications. The deregulation of TEs has been reported in some developmental diseases, as well as for different types of human cancers. To date, the role of TEs, the mechanisms underlying TE reactivation, and the interplay with DNA methylation in human cancers remain largely unexplained. We reviewed the loss of epigenetic regulation and subsequent genomic instability, chromosomal aberrations, transcriptional deregulation, oncogenic activation, and aberrations of non-coding RNAs as the potential mechanisms underlying TE deregulation in human cancers.
Transposable Elements in Human Cancer: Causes and Consequences of Deregulation
Anwar, Sumadi Lukman; Wulaningsih, Wahyu; Lehmann, Ulrich
2017-01-01
Transposable elements (TEs) comprise nearly half of the human genome and play an essential role in the maintenance of genomic stability, chromosomal architecture, and transcriptional regulation. TEs are repetitive sequences consisting of RNA transposons, DNA transposons, and endogenous retroviruses that can invade the human genome with a substantial contribution in human evolution and genomic diversity. TEs are therefore firmly regulated from early embryonic development and during the entire course of human life by epigenetic mechanisms, in particular DNA methylation and histone modifications. The deregulation of TEs has been reported in some developmental diseases, as well as for different types of human cancers. To date, the role of TEs, the mechanisms underlying TE reactivation, and the interplay with DNA methylation in human cancers remain largely unexplained. We reviewed the loss of epigenetic regulation and subsequent genomic instability, chromosomal aberrations, transcriptional deregulation, oncogenic activation, and aberrations of non-coding RNAs as the potential mechanisms underlying TE deregulation in human cancers. PMID:28471386
Applications of the 1000 Genomes Project resources
Zheng-Bradley, Xiangqun
2017-01-01
Abstract The 1000 Genomes Project created a valuable, worldwide reference for human genetic variation. Common uses of the 1000 Genomes dataset include genotype imputation supporting Genome-wide Association Studies, mapping expression Quantitative Trait Loci, filtering non-pathogenic variants from exome, whole genome and cancer genome sequencing projects, and genetic analysis of population structure and molecular evolution. In this article, we will highlight some of the multiple ways that the 1000 Genomes data can be and has been utilized for genetic studies. PMID:27436001
Xu, Shuhua
2015-01-01
Noncoding DNA sequences (NCS) have attracted much attention recently due to their functional potentials. Here we attempted to reveal the functional roles of noncoding sequences from the point of view of natural selection that typically indicates the functional potentials of certain genomic elements. We analyzed nearly 37 million single nucleotide polymorphisms (SNPs) of Phase I data of the 1000 Genomes Project. We estimated a series of key parameters of population genetics and molecular evolution to characterize sequence variations of the noncoding genome within and between populations, and identified the natural selection footprints in NCS in worldwide human populations. Our results showed that purifying selection is prevalent and there is substantial constraint of variations in NCS, while positive selectionis more likely to be specific to some particular genomic regions and regional populations. Intriguingly, we observed larger fraction of non-conserved NCS variants with lower derived allele frequency in the genome, indicating possible functional gain of non-conserved NCS. Notably, NCS elements are enriched for potentially functional markers such as eQTLs, TF motif, and DNase I footprints in the genome. More interestingly, some NCS variants associated with diseases such as Alzheimer's disease, Type 1 diabetes, and immune-related bowel disorder (IBD) showed signatures of positive selection, although the majority of NCS variants, reported as risk alleles by genome-wide association studies, showed signatures of negative selection. Our analyses provided compelling evidence of natural selection forces on noncoding sequences in the human genome and advanced our understanding of their functional potentials that play important roles in disease etiology and human evolution. PMID:26053627
Genome-wide signals of positive selection in human evolution
Enard, David; Messer, Philipp W.; Petrov, Dmitri A.
2014-01-01
The role of positive selection in human evolution remains controversial. On the one hand, scans for positive selection have identified hundreds of candidate loci, and the genome-wide patterns of polymorphism show signatures consistent with frequent positive selection. On the other hand, recent studies have argued that many of the candidate loci are false positives and that most genome-wide signatures of adaptation are in fact due to reduction of neutral diversity by linked deleterious mutations, known as background selection. Here we analyze human polymorphism data from the 1000 Genomes Project and detect signatures of positive selection once we correct for the effects of background selection. We show that levels of neutral polymorphism are lower near amino acid substitutions, with the strongest reduction observed specifically near functionally consequential amino acid substitutions. Furthermore, amino acid substitutions are associated with signatures of recent adaptation that should not be generated by background selection, such as unusually long and frequent haplotypes and specific distortions in the site frequency spectrum. We use forward simulations to argue that the observed signatures require a high rate of strongly adaptive substitutions near amino acid changes. We further demonstrate that the observed signatures of positive selection correlate better with the presence of regulatory sequences, as predicted by the ENCODE Project Consortium, than with the positions of amino acid substitutions. Our results suggest that adaptation was frequent in human evolution and provide support for the hypothesis of King and Wilson that adaptive divergence is primarily driven by regulatory changes. PMID:24619126
Genome Editing of Structural Variations: Modeling and Gene Correction.
Park, Chul-Yong; Sung, Jin Jea; Kim, Dong-Wook
2016-07-01
The analysis of chromosomal structural variations (SVs), such as inversions and translocations, was made possible by the completion of the human genome project and the development of genome-wide sequencing technologies. SVs contribute to genetic diversity and evolution, although some SVs can cause diseases such as hemophilia A in humans. Genome engineering technology using programmable nucleases (e.g., ZFNs, TALENs, and CRISPR/Cas9) has been rapidly developed, enabling precise and efficient genome editing for SV research. Here, we review advances in modeling and gene correction of SVs, focusing on inversion, translocation, and nucleotide repeat expansion. Copyright © 2016 Elsevier Ltd. All rights reserved.
Evolution of the CYP2D gene cluster in humans and four non-human primates.
Yasukochi, Yoshiki; Satta, Yoko
2011-01-01
The human cytochrome P450 2D6 (CYP2D6) is a primary enzyme involved in the metabolism of about 25% of commonly used therapeutic drugs. CYP2D6 belongs to the CYP2D subfamily, a gene cluster located on chromosome 22, which comprises the CYP2D6 gene and pseudogenes CYP2D7P and CYP2D8P. Although the chemical and physiological properties of CYP2D6 have been extensively studied, there has been no study to date on molecular evolution of the CYP2D subfamily in the human genome. Such knowledge could greatly contribute to the understanding of drug metabolism in humans because it makes us to know when and how the current metabolic system has been constructed. The knowledge moreover can be useful to find differences in exogenous substrates in a particular metabolism between human and other animals such as experimental animals. Here, we conducted a preliminary study to investigate the evolution and gene organization of the CYP2D subfamily, focused on humans and four non-human primates (chimpanzees, orangutans, rhesus monkeys, and common marmosets). Our results indicate that CYP2D7P has been duplicated from CYP2D6 before the divergence between humans and great apes, whereas CYP2D6 and CYP2D8P have been already present in the stem lineages of New World monkeys and Catarrhini. Furthermore, the origin of the CYP2D subfamily in the human genome can be traced back to before the divergence between amniotes and amphibians. Our analyses also show that reported chimeric sequences of the CYP2D6 and CYP2D7 genes in the chimpanzee genome appear to be exchanged in its genome database.
Molecular anthropology in the genomic era.
Destro-Bisol, Giovanni; Jobling, Mark A; Rocha, Jorge; Novembre, John; Richards, Martin B; Mulligan, Connie; Batini, Chiara; Manni, Franz
2010-01-01
Molecular Anthropology is a relatively young field of research. In fact, less than 50 years have passed since the symposium "Classification and Human Evolution" (1962, Burg Wartenstein, Austria), where the term was formally introduced by Emil Zuckerkandl. In this time, Molecular Anthropology has developed both methodologically and theoretically and extended its applications, so covering key aspects of human evolution such as the reconstruction of the history of human populations and peopling processes, the characterization of DNA in extinct humans and the role of adaptive processes in shaping the genetic diversity of our species. In the current scientific panorama, molecular anthropologists have to face a double challenge. As members of the anthropological community, we are strongly committed to the integration of biological findings and other lines of evidence (e.g. linguistic and archaeological), while keeping in line with methodological innovations which are moving the approach from the genetic to the genomic level. In this framework, the meeting "DNA Polymorphisms in Human Populations: Molecular Anthropology in the Genomic Era" (Rome, December 3-5, 2009) offered an opportunity for discussion among scholars from different disciplines, while paying attention to the impact of recent methodological innovations. Here we present an overview of the meeting and discuss perspectives and prospects of Molecular Anthropology in the genomic era.
Huang, L; Nesterenko, A; Nie, W; Wang, J; Su, W; Graphodatsky, A S; Yang, F
2008-01-01
Considering the giraffe (Giraffa camelopardalis, GCA, 2n = 30) as a primitive species, its comparative genomic data are critical for our understanding of the karyotype evolution of pecorans. Here, we have established genome-wide chromosomal homologies between giraffe, Chinese muntjac (Muntiacus reevesi, MRE, 2n = 46) and human (Homo sapiens, HSA, 2n = 46) with whole sets of chromosome-specific paints from Chinese muntjac and human, in addition to providing a high-resolution G-banding karyotype of giraffe. Chinese muntjac and human chromosome paints detected 32 and 45 autosomal homologs in the genome of giraffe, respectively. Our results suggest that it would require at least thirteen fissions, six fusions and three intrachromosomal rearrangements to 'transform' the 2n = 44 eutherian ancestral karyotype to the 2n = 58 pecoran ancestral karyotype. During giraffe evolution, some ancestral eutherian syntenies (i.e. association of HSA3/21, 4/8, 7/16, 14/15, 16/19 and two forms of 12/22) have been retained, while several derived syntenies (i.e. associations of human homologous segments 2/1, 2/9, 5/19, 4/12/22, 8/9, and 10/20) have been produced. The reduction of chromosome number in giraffe from the 2n = 58 pecoran ancestral karyotype could be primarily attributed to extensive Robertsonian translocations of ancestral chromosomal segments. More complex chromosomal rearrangements (including tandem fusion, centromere repositioning and pericentric inversion) have happened during the evolution of GCA2 and GCA8. Copyright 2008 S. Karger AG, Basel.
Mycobacterium leprae: genes, pseudogenes and genetic diversity
Singh, Pushpendra; Cole, Stewart T
2011-01-01
Leprosy, which has afflicted human populations for millenia, results from infection with Mycobacterium leprae, an unculturable pathogen with an exceptionally long generation time. Considerable insight into the biology and drug resistance of the leprosy bacillus has been obtained from genomics. M. leprae has undergone reductive evolution and pseudogenes now occupy half of its genome. Comparative genomics of four different strains revealed remarkable conservation of the genome (99.995% identity) yet uncovered 215 polymorphic sites, mainly single nucleotide polymorphisms, and a handful of new pseudogenes. Mapping these polymorphisms in a large panel of strains defined 16 single nucleotide polymorphism-subtypes that showed strong geographical associations and helped retrace the evolution of M. leprae. PMID:21162636
Ribosomal RNA Genes Contribute to the Formation of Pseudogenes and Junk DNA in the Human Genome.
Robicheau, Brent M; Susko, Edward; Harrigan, Amye M; Snyder, Marlene
2017-02-01
Approximately 35% of the human genome can be identified as sequence devoid of a selected-effect function, and not derived from transposable elements or repeated sequences. We provide evidence supporting a known origin for a fraction of this sequence. We show that: 1) highly degraded, but near full length, ribosomal DNA (rDNA) units, including both 45S and Intergenic Spacer (IGS), can be found at multiple sites in the human genome on chromosomes without rDNA arrays, 2) that these rDNA sequences have a propensity for being centromere proximal, and 3) that sequence at all human functional rDNA array ends is divergent from canonical rDNA to the point that it is pseudogenic. We also show that small sequence strings of rDNA (from 45S + IGS) can be found distributed throughout the genome and are identifiable as an "rDNA-like signal", representing 0.26% of the q-arm of HSA21 and ∼2% of the total sequence of other regions tested. The size of sequence strings found in the rDNA-like signal intergrade into the size of sequence strings that make up the full-length degrading rDNA units found scattered throughout the genome. We conclude that the displaced and degrading rDNA sequences are likely of a similar origin but represent different stages in their evolution towards random sequence. Collectively, our data suggests that over vast evolutionary time, rDNA arrays contribute to the production of junk DNA. The concept that the production of rDNA pseudogenes is a by-product of concerted evolution represents a previously under-appreciated process; we demonstrate here its importance. © The Author(s) 2017. Published by Oxford University Press on behalf of the Society for Molecular Biology and Evolution.
Gilbert, Maarten J; Miller, William G; Yee, Emma; Zomer, Aldert L; van der Graaf-van Bloois, Linda; Fitzgerald, Collette; Forbes, Ken J; Méric, Guillaume; Sheppard, Samuel K; Wagenaar, Jaap A; Duim, Birgitta
2016-07-02
Campylobacter fetus currently comprises three recognized subspecies, which display distinct host association. Campylobacter fetus subsp. fetus and C fetus subsp. venerealis are both associated with endothermic mammals, primarily ruminants, whereas C fetus subsp. testudinum is primarily associated with ectothermic reptiles. Both C. fetus subsp. testudinum and C. fetus subsp. fetus have been associated with severe infections, often with a systemic component, in immunocompromised humans. To study the genetic factors associated with the distinct host dichotomy in C. fetus, whole-genome sequencing and comparison of mammal- and reptile-associated C fetus was performed. The genomes of C fetus subsp. testudinum isolated from either reptiles or humans were compared with elucidate the genetic factors associated with pathogenicity in humans. Genomic comparisons showed conservation of gene content and organization among C fetus subspecies, but a clear distinction between mammal- and reptile-associated C fetus was observed. Several genomic regions appeared to be subspecies specific, including a putative tricarballylate catabolism pathway, exclusively present in C fetus subsp. testudinum strains. Within C fetus subsp. testudinum, sapA, sapB, and sapAB type strains were observed. The recombinant locus iamABC (mlaFED) was exclusively associated with invasive C fetus subsp. testudinum strains isolated from humans. A phylogenetic reconstruction was consistent with divergent evolution in host-associated strains and the existence of a barrier to lateral gene transfer between mammal- and reptile-associated C fetus Overall, this study shows that reptile-associated C fetus subsp. testudinum is genetically divergent from mammal-associated C fetus subspecies. © The Author 2016. Published by Oxford University Press on behalf of the Society for Molecular Biology and Evolution.
Sun, Zhihong; Zhang, Wenyi; Guo, Chenyi; Yang, Xianwei; Liu, Wenjun; Wu, Yarong; Song, Yuqin; Kwok, Lai Yu; Cui, Yujun; Menghe, Bilige; Yang, Ruifu; Hu, Liangping; Zhang, Heping
2015-01-01
Bifidobacteria are well known for their human health-promoting effects and are therefore widely applied in the food industry. Members of the Bifidobacterium genus were first identified from the human gastrointestinal tract and were then found to be widely distributed across various ecological niches. Although the genetic diversity of Bifidobacterium has been determined based on several marker genes or a few genomes, the global diversity and evolution scenario for the entire genus remain unresolved. The present study comparatively analyzed the genomes of 45 type strains. We built a robust genealogy for Bifidobacterium based on 402 core genes and defined its root according to the phylogeny of the tree of bacteria. Our results support that all human isolates are of younger lineages, and although species isolated from bees dominate the more ancient lineages, the bee was not necessarily the original host for bifidobacteria. Moreover, the species isolated from different hosts are enriched with specific gene sets, suggesting host-specific adaptation. Notably, bee-specific genes are strongly associated with respiratory metabolism and are potential in helping those bacteria adapt to the oxygen-rich gut environment in bees. This study provides a snapshot of the genetic diversity and evolution of Bifidobacterium, paving the way for future studies on the taxonomy and functional genomics of the genus.
Future potential of the Human Epigenome Project.
Eckhardt, Florian; Beck, Stephan; Gut, Ivo G; Berlin, Kurt
2004-09-01
Deciphering the information encoded in the human genome is key for the further understanding of human biology, physiology and evolution. With the draft sequence of the human genome completed, elucidation of the epigenetic information layer of the human genome becomes accessible. Epigenetic mechanisms are mediated by either chemical modifications of the DNA itself or by modifications of proteins that are closely associated with DNA. Defects of the epigenetic regulation involved in processes such as imprinting, X chromosome inactivation, transcriptional control of genes, as well as mutations affecting DNA methylation enzymes, contribute fundamentally to the etiology of many human diseases. Headed by the Human Epigenome Consortium, the Human Epigenome Project is a joint effort by an international collaboration that aims to identify, catalog and interpret genome-wide DNA methylation patterns of all human genes in all major tissues. Methylation variable positions are thought to reflect gene activity, tissue type and disease state, and are useful epigenetic markers revealing the dynamic state of the genome. Like single nucleotide polymorphisms, methylation variable positions will greatly advance our ability to elucidate and diagnose the molecular basis of human diseases.
Chalopin, Domitille; Naville, Magali; Plard, Floriane; Galiana, Delphine; Volff, Jean-Nicolas
2015-01-01
Transposable elements (TEs) are major components of vertebrate genomes, with major roles in genome architecture and evolution. In order to characterize both common patterns and lineage-specific differences in TE content and TE evolution, we have compared the mobilomes of 23 vertebrate genomes, including 10 actinopterygian fish, 11 sarcopterygians, and 2 nonbony vertebrates. We found important variations in TE content (from 6% in the pufferfish tetraodon to 55% in zebrafish), with a more important relative contribution of TEs to genome size in fish than in mammals. Some TE superfamilies were found to be widespread in vertebrates, but most elements showed a more patchy distribution, indicative of multiple events of loss or gain. Interestingly, loss of major TE families was observed during the evolution of the sarcopterygian lineage, with a particularly strong reduction in TE diversity in birds and mammals. Phylogenetic trends in TE composition and activity were detected: Teleost fish genomes are dominated by DNA transposons and contain few ancient TE copies, while mammalian genomes have been predominantly shaped by nonlong terminal repeat retrotransposons, along with the persistence of older sequences. Differences were also found within lineages: The medaka fish genome underwent more recent TE amplification than the related platyfish, as observed for LINE retrotransposons in the mouse compared with the human genome. This study allows the identification of putative cases of horizontal transfer of TEs, and to tentatively infer the composition of the ancestral vertebrate mobilome. Taken together, the results obtained highlight the importance of TEs in the structure and evolution of vertebrate genomes, and demonstrate their major impact on genome diversity both between and within lineages. PMID:25577199
The Divided Bacterial Genome: Structure, Function, and Evolution.
diCenzo, George C; Finan, Turlough M
2017-09-01
Approximately 10% of bacterial genomes are split between two or more large DNA fragments, a genome architecture referred to as a multipartite genome. This multipartite organization is found in many important organisms, including plant symbionts, such as the nitrogen-fixing rhizobia, and plant, animal, and human pathogens, including the genera Brucella , Vibrio , and Burkholderia . The availability of many complete bacterial genome sequences means that we can now examine on a broad scale the characteristics of the different types of DNA molecules in a genome. Recent work has begun to shed light on the unique properties of each class of replicon, the unique functional role of chromosomal and nonchromosomal DNA molecules, and how the exploitation of novel niches may have driven the evolution of the multipartite genome. The aims of this review are to (i) outline the literature regarding bacterial genomes that are divided into multiple fragments, (ii) provide a meta-analysis of completed bacterial genomes from 1,708 species as a way of reviewing the abundant information present in these genome sequences, and (iii) provide an encompassing model to explain the evolution and function of the multipartite genome structure. This review covers, among other topics, salient genome terminology; mechanisms of multipartite genome formation; the phylogenetic distribution of multipartite genomes; how each part of a genome differs with respect to genomic signatures, genetic variability, and gene functional annotation; how each DNA molecule may interact; as well as the costs and benefits of this genome structure. Copyright © 2017 American Society for Microbiology.
Gene conversion as a secondary mechanism of short interspersed element (SINE) evolution
DOE Office of Scientific and Technical Information (OSTI.GOV)
Kass, D.H.; Batzer, M.A.; Deininger, P.L.
The Alu repetitive family of short interspersed elements (SINEs) in primates can be subdivided into distinct subfamilies by specific diagnostic nucleotide changes. The older subfamilies are generally very abundant, while the younger subfamilies have fewer copies. Some of the youngest Alu elements are absent in the orthologous loci of nonhuman primates, indicative of recent retroposition events, the primary mode of SINE evolutions. PCR analysis of one young Alu subfamily (Sb2) member found in the low-density lipoprotein receptor gene apparently revealed the presence of this element in the green monkey, orangutan, gorilla, and chimpanzee genomes, as well as the human genome.more » However, sequence analysis of these genomes revealed a highly mutated, older, primate-specific Alu element was present at this position in the nonhuman primates. Comparison of the flanking DNA sequences upstream of this Alu insertion corresponded to evolution expected for standard primate phylogeny, but comparison of the Alu repeat sequences revealed that the human element departed from this phylogeny. The change in the human sequence apparently occurred by a gene conversion event only within the Alu element itself, converting it from one of the oldest to one of the youngest Alu subfamilies. Although gene conversions of Alu elements are clearly very rare, this finding shows that such events can occur and contribute to specific cases of SINE subfamily evolution.« less
Impacts of Genome-Wide Analyses on Our Understanding of Human Herpesvirus Diversity and Evolution.
Renner, Daniel W; Szpara, Moriah L
2018-01-01
Until fairly recently, genome-wide evolutionary dynamics and within-host diversity were more commonly examined in the context of small viruses than in the context of large double-stranded DNA viruses such as herpesviruses. The high mutation rates and more compact genomes of RNA viruses have inspired the investigation of population dynamics for these species, and recent data now suggest that herpesviruses might also be considered candidates for population modeling. High-throughput sequencing (HTS) and bioinformatics have expanded our understanding of herpesviruses through genome-wide comparisons of sequence diversity, recombination, allele frequency, and selective pressures. Here we discuss recent data on the mechanisms that generate herpesvirus genomic diversity and underlie the evolution of these virus families. We focus on human herpesviruses, with key insights drawn from veterinary herpesviruses and other large DNA virus families. We consider the impacts of cell culture on herpesvirus genomes and how to accurately describe the viral populations under study. The need for a strong foundation of high-quality genomes is also discussed, since it underlies all secondary genomic analyses such as RNA sequencing (RNA-Seq), chromatin immunoprecipitation, and ribosome profiling. Areas where we foresee future progress, such as the linking of viral genetic differences to phenotypic or clinical outcomes, are highlighted as well. Copyright © 2017 Renner and Szpara.
DOE Office of Scientific and Technical Information (OSTI.GOV)
Korenberg, J.R.
The ultimate goal of this research is to generate and apply novel technologies to speed completion and integration of the human genome map and sequence with biomedical problems. To do this, techniques were developed and genome-wide resources generated. This includes a genome-wide Mapped and Integrated BAC/PAC Resource that has been used for gene finding, map completion and anchoring, breakpoint definition and sequencing. In the last period of the grant, the Human Mapped BAC/PAC Resource was also applied to determine regions of human variation and to develop a novel paradigm of primate evolution through to humans. Further, in order to moremore » rapidly evaluate animal models of human disease, a BAC Map of the mouse was generated in collaboration with the MTI Genome Center, Dr. Bruce Birren.« less
Advances in Cryptococcus genomics: insights into the evolution of pathogenesis.
Cuomo, Christina A; Rhodes, Johanna; Desjardins, Christopher A
2018-01-01
Cryptococcus species are the causative agents of cryptococcal meningitis, a significant source of mortality in immunocompromised individuals. Initial work on the molecular epidemiology of this fungal pathogen utilized genotyping approaches to describe the genetic diversity and biogeography of two species, Cryptococcus neoformans and Cryptococcus gattii. Whole genome sequencing of representatives of both species resulted in reference assemblies enabling a wide array of downstream studies and genomic resources. With the increasing availability of whole genome sequencing, both species have now had hundreds of individual isolates sequenced, providing fine-scale insight into the evolution and diversification of Cryptococcus and allowing for the first genome-wide association studies to identify genetic variants associated with human virulence. Sequencing has also begun to examine the microevolution of isolates during prolonged infection and to identify variants specific to outbreak lineages, highlighting the potential role of hyper-mutation in evolving within short time scales. We can anticipate that further advances in sequencing technology and sequencing microbial genomes at scale, including metagenomics approaches, will continue to refine our view of how the evolution of Cryptococcus drives its success as a pathogen.
The genomic and epidemiological dynamics of human influenza A virus.
Rambaut, Andrew; Pybus, Oliver G; Nelson, Martha I; Viboud, Cecile; Taubenberger, Jeffery K; Holmes, Edward C
2008-05-29
The evolutionary interaction between influenza A virus and the human immune system, manifest as 'antigenic drift' of the viral haemagglutinin, is one of the best described patterns in molecular evolution. However, little is known about the genome-scale evolutionary dynamics of this pathogen. Similarly, how genomic processes relate to global influenza epidemiology, in which the A/H3N2 and A/H1N1 subtypes co-circulate, is poorly understood. Here through an analysis of 1,302 complete viral genomes sampled from temperate populations in both hemispheres, we show that the genomic evolution of influenza A virus is characterized by a complex interplay between frequent reassortment and periodic selective sweeps. The A/H3N2 and A/H1N1 subtypes exhibit different evolutionary dynamics, with diverse lineages circulating in A/H1N1, indicative of weaker antigenic drift. These results suggest a sink-source model of viral ecology in which new lineages are seeded from a persistent influenza reservoir, which we hypothesize to be located in the tropics, to sink populations in temperate regions.
Plasmodium malariae and P. ovale genomes provide insights into malaria parasite evolution
Rutledge, Gavin G.; Böhme, Ulrike; Sanders, Mandy; Reid, Adam J.; Cotton, James A.; Maiga-Ascofare, Oumou; Djimdé, Abdoulaye A.; Apinjoh, Tobias O.; Amenga-Etego, Lucas; Manske, Magnus; Barnwell, John W.; Renaud, François; Ollomo, Benjamin; Prugnolle, Franck; Anstey, Nicholas M.; Auburn, Sarah; Price, Ric N.; McCarthy, James S.; Kwiatkowski, Dominic P.; Newbold, Chris I.; Berriman, Matthew; Otto, Thomas D.
2017-01-01
Elucidation of the evolutionary history and interrelatedness of Plasmodium species that infect humans has been hampered by a lack of genetic information for three human-infective species: P. malariae and two P. ovale species (P. o. curtisi and P. o. wallikeri)1. These species are prevalent across most regions in which malaria is endemic2,3 and are often undetectable by light microscopy4, rendering their study in human populations difficult5. The exact evolutionary relationship of these species to the other human-infective species has been contested6,7. Using a new reference genome for P. malariae and a manually curated draft P. o. curtisi genome, we are now able to accurately place these species within the Plasmodium phylogeny. Sequencing of a P. malariae relative that infects chimpanzees reveals similar signatures of selection in the P. malariae lineage to another Plasmodium lineage shown to be capable of colonization of both human and chimpanzee hosts. Molecular dating suggests that these host adaptations occurred over similar evolutionary timescales. In addition to the core genome that is conserved between species, differences in gene content can be linked to their specific biology. The genome suggests that P. malariae expresses a family of heterodimeric proteins on its surface that have structural similarities to a protein crucial for invasion of red blood cells. The data presented here provide insight into the evolution of the Plasmodium genus as a whole. PMID:28117441
Chen, Chunxia; Cui, Xiaoying; Yu, Jun; Xiao, Jingfa; Kan, Biao
2012-01-01
Salmonella Paratyphi A (S. Paratyphi A) is a highly adapted, human-specific pathogen that causes paratyphoid fever. Cases of paratyphoid fever have recently been increasing, and the disease is becoming a major public health concern, especially in Eastern and Southern Asia. To investigate the genomic variation and evolution of S. Paratyphi A, a pan-genomic analysis was performed on five newly sequenced S. Paratyphi A strains and two other reference strains. A whole genome comparison revealed that the seven genomes are collinear and that their organization is highly conserved. The high rate of substitutions in part of the core genome indicates that there are frequent homologous recombination events. Based on the changes in the pan-genome size and cluster number (both in the core functional genes and core pseudogenes), it can be inferred that the sharply increasing number of pseudogene clusters may have strong correlation with the inactivation of functional genes, and indicates that the S. Paratyphi A genome is being degraded. PMID:23028950
Applications of the 1000 Genomes Project resources.
Zheng-Bradley, Xiangqun; Flicek, Paul
2017-05-01
The 1000 Genomes Project created a valuable, worldwide reference for human genetic variation. Common uses of the 1000 Genomes dataset include genotype imputation supporting Genome-wide Association Studies, mapping expression Quantitative Trait Loci, filtering non-pathogenic variants from exome, whole genome and cancer genome sequencing projects, and genetic analysis of population structure and molecular evolution. In this article, we will highlight some of the multiple ways that the 1000 Genomes data can be and has been utilized for genetic studies. © The Author 2016. Published by Oxford University Press.
Everts-van der Wind, Annelie; Kata, Srinivas R.; Band, Mark R.; Rebeiz, Mark; Larkin, Denis M.; Everts, Robin E.; Green, Cheryl A.; Liu, Lei; Natarajan, Shreedhar; Goldammer, Tom; Lee, Jun Heon; McKay, Stephanie; Womack, James E.; Lewin, Harris A.
2004-01-01
A second-generation 5000 rad radiation hybrid (RH) map of the cattle genome was constructed primarily using cattle ESTs that were targeted to gaps in the existing cattle–human comparative map, as well as to sparsely populated map intervals. A total of 870 targeted markers were added, bringing the number of markers mapped on the RH5000 panel to 1913. Of these, 1463 have significant BLASTN hits (E < e–5) against the human genome sequence. A cattle–human comparative map was created using human genome sequence coordinates of the paired orthologs. One-hundred and ninety-five conserved segments (defined by two or more genes) were identified between the cattle and human genomes, of which 31 are newly discovered and 34 were extended singletons on the first-generation map. The new map represents an improvement of 20% genome-wide comparative coverage compared with the first-generation map. Analysis of gene content within human genome regions where there are gaps in the comparative map revealed gaps with both significantly greater and significantly lower gene content. The new, more detailed cattle–human comparative map provides an improved resource for the analysis of mammalian chromosome evolution, the identification of candidate genes for economically important traits, and for proper alignment of sequence contigs on cattle chromosomes. PMID:15231756
Benard, Emmanuel; Michel, Christian J
2009-08-01
We present here the SEGM web server (Stochastic Evolution of Genetic Motifs) in order to study the evolution of genetic motifs both in the direct evolutionary sense (past-present) and in the inverse evolutionary sense (present-past). The genetic motifs studied can be nucleotides, dinucleotides and trinucleotides. As an example of an application of SEGM and to understand its functionalities, we give an analysis of inverse mutations of splice sites of human genome introns. SEGM is freely accessible at http://lsiit-bioinfo.u-strasbg.fr:8080/webMathematica/SEGM/SEGM.html directly or by the web site http://dpt-info.u-strasbg.fr/~michel/. To our knowledge, this SEGM web server is to date the only computational biology software in this evolutionary approach.
Mapping the Structure and Dynamics of Genomics-Related MeSH Terms Complex Networks
Siqueiros-García, Jesús M.; Hernández-Lemus, Enrique; García-Herrera, Rodrigo; Robina-Galatas, Andrea
2014-01-01
It has been proposed that the history and evolution of scientific ideas may reflect certain aspects of the underlying socio-cognitive frameworks in which science itself is developing. Systematic analyses of the development of scientific knowledge may help us to construct models of the collective dynamics of science. Aiming at scientific rigor, these models should be built upon solid empirical evidence, analyzed with formal tools leading to ever-improving results that support the related conclusions. Along these lines we studied the dynamics and structure of the development of research in genomics as represented by the entire collection of genomics-related scientific papers contained in the PubMed database. The analyzed corpus consisted in more than 49,000 articles published in the years 1987 (first appeareance of the term Genomics) to 2011, categorized by means of the Medical Subheadings (MeSH) content-descriptors. Complex networks were built where two MeSH terms were connected if they are descriptors of the same article(s). The analysis of such networks revealed a complex structure and dynamics that to certain extent resembled small-world networks. The evolution of such networks in time reflected interesting phenomena in the historical development of genomic research, including what seems to be a phase-transition in a period marked by the completion of the first draft of the Human Genome Project. We also found that different disciplinary areas have different dynamic evolution patterns in their MeSH connectivity networks. In the case of areas related to science, changes in topology were somewhat fast while retaining a certain core-stucture, whereas in the humanities, the evolution was pretty slow and the structure resulted highly redundant and in the case of technology related issues, the evolution was very fast and the structure remained tree-like with almost no overlapping terms. PMID:24699262
Differential DNA Methylation Analysis without a Reference Genome.
Klughammer, Johanna; Datlinger, Paul; Printz, Dieter; Sheffield, Nathan C; Farlik, Matthias; Hadler, Johanna; Fritsch, Gerhard; Bock, Christoph
2015-12-22
Genome-wide DNA methylation mapping uncovers epigenetic changes associated with animal development, environmental adaptation, and species evolution. To address the lack of high-throughput methods for DNA methylation analysis in non-model organisms, we developed an integrated approach for studying DNA methylation differences independent of a reference genome. Experimentally, our method relies on an optimized 96-well protocol for reduced representation bisulfite sequencing (RRBS), which we have validated in nine species (human, mouse, rat, cow, dog, chicken, carp, sea bass, and zebrafish). Bioinformatically, we developed the RefFreeDMA software to deduce ad hoc genomes directly from RRBS reads and to pinpoint differentially methylated regions between samples or groups of individuals (http://RefFreeDMA.computational-epigenetics.org). The identified regions are interpreted using motif enrichment analysis and/or cross-mapping to annotated genomes. We validated our method by reference-free analysis of cell-type-specific DNA methylation in the blood of human, cow, and carp. In summary, we present a cost-effective method for epigenome analysis in ecology and evolution, which enables epigenome-wide association studies in natural populations and species without a reference genome. Copyright © 2015 The Authors. Published by Elsevier Inc. All rights reserved.
Adaptive Evolution as a Predictor of Species-Specific Innate Immune Response.
Webb, Andrew E; Gerek, Z Nevin; Morgan, Claire C; Walsh, Thomas A; Loscher, Christine E; Edwards, Scott V; O'Connell, Mary J
2015-07-01
It has been proposed that positive selection may be associated with protein functional change. For example, human and macaque have different outcomes to HIV infection and it has been shown that residues under positive selection in the macaque TRIM5α receptor locate to the region known to influence species-specific response to HIV. In general, however, the relationship between sequence and function has proven difficult to fully elucidate, and it is the role of large-scale studies to help bridge this gap in our understanding by revealing major patterns in the data that correlate genotype with function or phenotype. In this study, we investigate the level of species-specific positive selection in innate immune genes from human and mouse. In total, we analyzed 456 innate immune genes using codon-based models of evolution, comparing human, mouse, and 19 other vertebrate species to identify putative species-specific positive selection. Then we used population genomic data from the recently completed Neanderthal genome project, the 1000 human genomes project, and the 17 laboratory mouse genomes project to determine whether the residues that were putatively positively selected are fixed or variable in these populations. We find evidence of species-specific positive selection on both the human and the mouse branches and we show that the classes of genes under positive selection cluster by function and by interaction. Data from this study provide us with targets to test the relationship between positive selection and protein function and ultimately to test the relationship between positive selection and discordant phenotypes. © The Author 2015. Published by Oxford University Press on behalf of the Society for Molecular Biology and Evolution.
Evolution, revolution and heresy in the genetics of infectious disease susceptibility
Hill, Adrian V. S.
2012-01-01
Infectious pathogens have long been recognized as potentially powerful agents impacting on the evolution of human genetic diversity. Analysis of large-scale case–control studies provides one of the most direct means of identifying human genetic variants that currently impact on susceptibility to particular infectious diseases. For over 50 years candidate gene studies have been used to identify loci for many major causes of human infectious mortality, including malaria, tuberculosis, human immunodeficiency virus/acquired immunodeficiency syndrome, bacterial pneumonia and hepatitis. But with the advent of genome-wide approaches, many new loci have been identified in diverse populations. Genome-wide linkage studies identified a few loci, but genome-wide association studies are proving more successful, and both exome and whole-genome sequencing now offer a revolutionary increase in power. Opinions differ on the extent to which the genetic component to common disease susceptibility is encoded by multiple high frequency or rare variants, and the heretical view that most infectious diseases might even be monogenic has been advocated recently. Review of findings to date suggests that the genetic architecture of infectious disease susceptibility may be importantly different from that of non-infectious diseases, and it is suggested that natural selection may be the driving force underlying this difference. PMID:22312051
Chromosomal Inversions between Human and Chimpanzee Lineages Caused by Retrotransposons
Lee, Jungnam; Han, Kyudong; Meyer, Thomas J.; Kim, Heui-Soo; Batzer, Mark A.
2008-01-01
The long interspersed element-1 (LINE-1 or L1) and Alu elements are the most abundant mobile elements comprising 21% and 11% of the human genome, respectively. Since the divergence of human and chimpanzee lineages, these elements have vigorously created chromosomal rearrangements causing genomic difference between humans and chimpanzees by either increasing or decreasing the size of genome. Here, we report an exotic mechanism, retrotransposon recombination-mediated inversion (RRMI), that usually does not alter the amount of genomic material present. Through the comparison of the human and chimpanzee draft genome sequences, we identified 252 inversions whose respective inversion junctions can clearly be characterized. Our results suggest that L1 and Alu elements cause chromosomal inversions by either forming a secondary structure or providing a fragile site for double-strand breaks. The detailed analysis of the inversion breakpoints showed that L1 and Alu elements are responsible for at least 44% of the 252 inversion loci between human and chimpanzee lineages, including 49 RRMI loci. Among them, three RRMI loci inverted exonic regions in known genes, which implicates this mechanism in generating the genomic and phenotypic differences between human and chimpanzee lineages. This study is the first comprehensive analysis of mobile element bases inversion breakpoints between human and chimpanzee lineages, and highlights their role in primate genome evolution. PMID:19112500
Madrid, Andy; Chopra, Pankaj; Alisch, Reid S.
2018-01-01
Human evolution from non-human primates has seen substantial change in the central nervous system, with the molecular mechanisms underlying human brain evolution remaining largely unknown. Methylation of cytosine at the fifth carbon (5-methylcytosine; 5 mC) is an essential epigenetic mark linked to neurodevelopment, as well as neurological disease. The emergence of another modified form of cytosine (5-hydroxymethylcytosine; 5 hmC) that is enriched in the brain further substantiates a role for these epigenetic marks in neurodevelopment, yet little is known about the evolutionary importance of these marks in brain development. Here, human and monkey brain tissue were profiled, identifying 5,516 and 4,070 loci that were differentially methylated and hydroxymethylated, respectively, between the species. Annotation of these loci to the human genome revealed genes critical for the development of the nervous system and that are associated with intelligence and higher cognitive functioning, such as RELN and GNAS. Moreover, ontological analyses of these differentially methylated and hydroxymethylated genes revealed a significant enrichment of neuronal/immunological–related processes, including neurogenesis and axon development. Finally, the sequences flanking the differentially methylated/hydroxymethylated loci contained a significant enrichment of binding sites for neurodevelopmentally important transcription factors (e.g., OTX1 and PITX1), suggesting that DNA methylation may regulate gene expression by mediating transcription factor binding on these transcripts. Together, these data support dynamic species-specific epigenetic contributions in the evolution and development of the human brain from non-human primates. PMID:29491831
Singh, Param Priya; Arora, Jatin; Isambert, Hervé
2015-07-01
Whole genome duplications (WGD) have now been firmly established in all major eukaryotic kingdoms. In particular, all vertebrates descend from two rounds of WGDs, that occurred in their jawless ancestor some 500 MY ago. Paralogs retained from WGD, also coined 'ohnologs' after Susumu Ohno, have been shown to be typically associated with development, signaling and gene regulation. Ohnologs, which amount to about 20 to 35% of genes in the human genome, have also been shown to be prone to dominant deleterious mutations and frequently implicated in cancer and genetic diseases. Hence, identifying ohnologs is central to better understand the evolution of vertebrates and their susceptibility to genetic diseases. Early computational analyses to identify vertebrate ohnologs relied on content-based synteny comparisons between the human genome and a single invertebrate outgroup genome or within the human genome itself. These approaches are thus limited by lineage specific rearrangements in individual genomes. We report, in this study, the identification of vertebrate ohnologs based on the quantitative assessment and integration of synteny conservation between six amniote vertebrates and six invertebrate outgroups. Such a synteny comparison across multiple genomes is shown to enhance the statistical power of ohnolog identification in vertebrates compared to earlier approaches, by overcoming lineage specific genome rearrangements. Ohnolog gene families can be browsed and downloaded for three statistical confidence levels or recompiled for specific, user-defined, significance criteria at http://ohnologs.curie.fr/. In the light of the importance of WGD on the genetic makeup of vertebrates, our analysis provides a useful resource for researchers interested in gaining further insights on vertebrate evolution and genetic diseases.
Singh, Param Priya; Arora, Jatin; Isambert, Hervé
2015-01-01
Whole genome duplications (WGD) have now been firmly established in all major eukaryotic kingdoms. In particular, all vertebrates descend from two rounds of WGDs, that occurred in their jawless ancestor some 500 MY ago. Paralogs retained from WGD, also coined ‘ohnologs’ after Susumu Ohno, have been shown to be typically associated with development, signaling and gene regulation. Ohnologs, which amount to about 20 to 35% of genes in the human genome, have also been shown to be prone to dominant deleterious mutations and frequently implicated in cancer and genetic diseases. Hence, identifying ohnologs is central to better understand the evolution of vertebrates and their susceptibility to genetic diseases. Early computational analyses to identify vertebrate ohnologs relied on content-based synteny comparisons between the human genome and a single invertebrate outgroup genome or within the human genome itself. These approaches are thus limited by lineage specific rearrangements in individual genomes. We report, in this study, the identification of vertebrate ohnologs based on the quantitative assessment and integration of synteny conservation between six amniote vertebrates and six invertebrate outgroups. Such a synteny comparison across multiple genomes is shown to enhance the statistical power of ohnolog identification in vertebrates compared to earlier approaches, by overcoming lineage specific genome rearrangements. Ohnolog gene families can be browsed and downloaded for three statistical confidence levels or recompiled for specific, user-defined, significance criteria at http://ohnologs.curie.fr/. In the light of the importance of WGD on the genetic makeup of vertebrates, our analysis provides a useful resource for researchers interested in gaining further insights on vertebrate evolution and genetic diseases. PMID:26181593
An overview on genome organization of marine organisms.
Costantini, Maria
2015-12-01
In this review we will concentrate on some general genome features of marine organisms and their evolution, ranging from vertebrate to invertebrates until unicellular organisms. Before genome sequencing, the ultracentrifugation in CsCl led to high resolution of mammalian DNA (without seeing at the sequence). The analytical profile of human DNA showed that the vertebrate genome is a mosaic of isochores, typically megabase-size DNA segments that belong in a small number of families characterized by different GC levels. The recent availability of a number of fully sequenced genomes allowed mapping very precisely the isochores, based on DNA sequences. Since isochores are tightly linked to biological properties such as gene density, replication timing and recombination, the new level of detail provided by the isochore map helped the understanding of genome structure, function and evolution. This led the current level of knowledge and to further insights. Copyright © 2015. Published by Elsevier B.V.
PanCoreGen - Profiling, detecting, annotating protein-coding genes in microbial genomes.
Paul, Sandip; Bhardwaj, Archana; Bag, Sumit K; Sokurenko, Evgeni V; Chattopadhyay, Sujay
2015-12-01
A large amount of genomic data, especially from multiple isolates of a single species, has opened new vistas for microbial genomics analysis. Analyzing the pan-genome (i.e. the sum of genetic repertoire) of microbial species is crucial in understanding the dynamics of molecular evolution, where virulence evolution is of major interest. Here we present PanCoreGen - a standalone application for pan- and core-genomic profiling of microbial protein-coding genes. PanCoreGen overcomes key limitations of the existing pan-genomic analysis tools, and develops an integrated annotation-structure for a species-specific pan-genomic profile. It provides important new features for annotating draft genomes/contigs and detecting unidentified genes in annotated genomes. It also generates user-defined group-specific datasets within the pan-genome. Interestingly, analyzing an example-set of Salmonella genomes, we detect potential footprints of adaptive convergence of horizontally transferred genes in two human-restricted pathogenic serovars - Typhi and Paratyphi A. Overall, PanCoreGen represents a state-of-the-art tool for microbial phylogenomics and pathogenomics study. Copyright © 2015 Elsevier Inc. All rights reserved.
Takeuchi, Fumihiko; Watanabe, Shinya; Baba, Tadashi; Yuzawa, Harumi; Ito, Teruyo; Morimoto, Yuh; Kuroda, Makoto; Cui, Longzhu; Takahashi, Mikio; Ankai, Akiho; Baba, Shin-ichi; Fukui, Shigehiro; Lee, Jean C; Hiramatsu, Keiichi
2005-11-01
Staphylococcus haemolyticus is an opportunistic bacterial pathogen that colonizes human skin and is remarkable for its highly antibiotic-resistant phenotype. We determined the complete genome sequence of S.haemolyticus to better understand its pathogenicity and evolutionary relatedness to the other staphylococcal species. A large proportion of the open reading frames in the genomes of S.haemolyticus, Staphylococcus aureus, and Staphylococcus epidermidis were conserved in their sequence and order on the chromosome. We identified a region of the bacterial chromosome just downstream of the origin of replication that showed little homology among the species but was conserved among strains within a species. This novel region, designated the "oriC environ," likely contributes to the evolution and differentiation of the staphylococcal species, since it was enriched for species-specific nonessential genes that contribute to the biological features of each staphylococcal species. A comparative analysis of the genomes of S.haemolyticus, S.aureus, and S.epidermidis elucidated differences in their biological and genetic characteristics and pathogenic potentials. We identified as many as 82 insertion sequences in the S.haemolyticus chromosome that probably mediated frequent genomic rearrangements, resulting in phenotypic diversification of the strain. Such rearrangements could have brought genomic plasticity to this species and contributed to its acquisition of antibiotic resistance.
Tong, Pin; Monahan, Jack; Prendergast, James G D
2017-03-01
Large-scale gene expression datasets are providing an increasing understanding of the location of cis-eQTLs in the human genome and their role in disease. However, little is currently known regarding the extent of regulatory site-sharing between genes. This is despite it having potentially wide-ranging implications, from the determination of the way in which genetic variants may shape multiple phenotypes to the understanding of the evolution of human gene order. By first identifying the location of non-redundant cis-eQTLs, we show that regulatory site-sharing is a relatively common phenomenon in the human genome, with over 10% of non-redundant regulatory variants linked to the expression of multiple nearby genes. We show that these shared, local regulatory sites are linked to high levels of chromatin looping between the regulatory sites and their associated genes. In addition, these co-regulated gene modules are found to be strongly conserved across mammalian species, suggesting that shared regulatory sites have played an important role in shaping human gene order. The association of these shared cis-eQTLs with multiple genes means they also appear to be unusually important in understanding the genetics of human phenotypes and pleiotropy, with shared regulatory sites more often linked to multiple human phenotypes than other regulatory variants. This study shows that regulatory site-sharing is likely an underappreciated aspect of gene regulation and has important implications for the understanding of various biological phenomena, including how the two and three dimensional structures of the genome have been shaped and the potential causes of disease pleiotropy outside coding regions.
Impact of retrotransposons in pluripotent stem cells.
Tanaka, Yoshiaki; Chung, Leeyup; Park, In-Hyun
2012-12-01
Retrotransposons, which constitute approximately 40% of the human genome, have the capacity to 'jump' across the genome. Their mobility contributes to oncogenesis, evolution, and genomic plasticity of the host genome. Induced pluripotent stem cells as well as embryonic stem cells are more susceptible than differentiated cells to genomic aberrations including insertion, deletion and duplication. Recent studies have revealed specific behaviors of retrotransposons in pluripotent cells. Here, we review recent progress in understanding retrotransposons and provide a perspective on the relationship between retrotransposons and genomic variation in pluripotent stem cells.
The Evolution of Human Genetic Studies of Cleft Lip and Cleft Palate
Marazita, Mary L.
2013-01-01
Orofacial clefts (OFCs)—primarily cleft lip and cleft palate—are among the most common birth defects in all populations worldwide, and have notable population, ethnicity, and gender differences in birth prevalence. Interest in these birth defects goes back centuries, as does formal scientific interest; scientists often used OFCs as examples or evidence during paradigm shifts in human genetics, and have also used virtually every new method of human genetic analysis to deepen our understanding of OFC. This review traces the evolution of human genetic investigations of OFC, highlights the specific insights gained about OFC through the years, and culminates in a review of recent key OFC genetic findings resulting from the powerful tools of the genomics era. Notably, OFC represents a major success for genome-wide approaches, and the field is poised for further breakthroughs in the near future. PMID:22703175
Initial sequencing and comparative analysis of the mouse genome
DOE Office of Scientific and Technical Information (OSTI.GOV)
Waterston, Robert H.; Lindblad-Toh, Kerstin; Birney, Ewan
2002-12-15
The sequence of the mouse genome is a key informational tool for understanding the contents of the human genome and a key experimental tool for biomedical research. Here, we report the results of an international collaboration to produce a high-quality draft sequence of the mouse genome. We also present an initial comparative analysis of the mouse and human genomes, describing some of the insights that can be gleaned from the two sequences. We discuss topics including the analysis of the evolutionary forces shaping the size, structure and sequence of the genomes; the conservation of large-scale synteny across most of themore » genomes; the much lower extent of sequence orthology covering less than half of the genomes; the proportions of the genomes under selection; the number of protein-coding genes; the expansion of gene families related to reproduction and immunity; the evolution of proteins; and the identification of intraspecies polymorphism.« less
Emergence of a Homo sapiens-specific gene family and chromosome 16p11.2 CNV susceptibility.
Nuttle, Xander; Giannuzzi, Giuliana; Duyzend, Michael H; Schraiber, Joshua G; Narvaiza, Iñigo; Sudmant, Peter H; Penn, Osnat; Chiatante, Giorgia; Malig, Maika; Huddleston, John; Benner, Chris; Camponeschi, Francesca; Ciofi-Baffoni, Simone; Stessman, Holly A F; Marchetto, Maria C N; Denman, Laura; Harshman, Lana; Baker, Carl; Raja, Archana; Penewit, Kelsi; Janke, Nicolette; Tang, W Joyce; Ventura, Mario; Banci, Lucia; Antonacci, Francesca; Akey, Joshua M; Amemiya, Chris T; Gage, Fred H; Reymond, Alexandre; Eichler, Evan E
2016-08-11
Genetic differences that specify unique aspects of human evolution have typically been identified by comparative analyses between the genomes of humans and closely related primates, including more recently the genomes of archaic hominins. Not all regions of the genome, however, are equally amenable to such study. Recurrent copy number variation (CNV) at chromosome 16p11.2 accounts for approximately 1% of cases of autism and is mediated by a complex set of segmental duplications, many of which arose recently during human evolution. Here we reconstruct the evolutionary history of the locus and identify bolA family member 2 (BOLA2) as a gene duplicated exclusively in Homo sapiens. We estimate that a 95-kilobase-pair segment containing BOLA2 duplicated across the critical region approximately 282 thousand years ago (ka), one of the latest among a series of genomic changes that dramatically restructured the locus during hominid evolution. All humans examined carried one or more copies of the duplication, which nearly fixed early in the human lineage--a pattern unlikely to have arisen so rapidly in the absence of selection (P < 0.0097). We show that the duplication of BOLA2 led to a novel, human-specific in-frame fusion transcript and that BOLA2 copy number correlates with both RNA expression (r = 0.36) and protein level (r = 0.65), with the greatest expression difference between human and chimpanzee in experimentally derived stem cells. Analyses of 152 patients carrying a chromosome 16p11. rearrangement show that more than 96% of breakpoints occur within the H. sapiens-specific duplication. In summary, the duplicative transposition of BOLA2 at the root of the H. sapiens lineage about 282 ka simultaneously increased copy number of a gene associated with iron homeostasis and predisposed our species to recurrent rearrangements associated with disease.
SINEs as driving forces in genome evolution.
Schmitz, J
2012-01-01
SINEs are short interspersed elements derived from cellular RNAs that repetitively retropose via RNA intermediates and integrate more or less randomly back into the genome. SINEs propagate almost entirely vertically within their host cells and, once established in the germline, are passed on from generation to generation. As non-autonomous elements, their reverse transcription (from RNA to cDNA) and genomic integration depends on the activity of the enzymatic machinery of autonomous retrotransposons, such as long interspersed elements (LINEs). SINEs are widely distributed in eukaryotes, but are especially effectively propagated in mammalian species. For example, more than a million Alu-SINE copies populate the human genome (approximately 13% of genomic space), and few master copies of them are still active. In the organisms where they occur, SINEs are a challenge to genomic integrity, but in the long term also can serve as beneficial building blocks for evolution, contributing to phenotypic heterogeneity and modifying gene regulatory networks. They substantially expand the genomic space and introduce structural variation to the genome. SINEs have the potential to mutate genes, to alter gene expression, and to generate new parts of genes. A balanced distribution and controlled activity of such properties is crucial to maintaining the organism's dynamic and thriving evolution. Copyright © 2012 S. Karger AG, Basel.
Erren, T C; Erren, M
2004-04-01
When David Horrobin suggested that phospholipid and fatty acid metabolism played a major role in human evolution, his 'fat utilization hypothesis' unified intriguing work from paleoanthropology, evolutionary biology, genetic and nervous system research in a novel and coherent lipid-related context. Interestingly, unlike most other evolutionary concepts, the hypothesis allows specific predictions which can be empirically tested in the near future. This paper summarizes some of Horrobin's intriguing propositions and suggests as to how approaches of comparative genomics published in Cell, Nature, Science and elsewhere since 1997 may be used to examine his evolutionary hypothesis. Indeed, systematic investigations of the genomic clock in the species' mitochondrial DNA, the Y and autosomal chromosomes as evidence of evolutionary relationships and distinctions can help to scrutinize associated predictions for their validity, namely that key mutations which differentiate us from Neanderthals and from great apes are in the genes coding for proteins which regulate fat metabolism, and particularly the phospholipid metabolism of the synapses of the brain. It is concluded that beyond clues to humans' relationships with living primates and to the Neanderthals' cognitive performance and their disappearance, the suggested molecular clock analyses may provide crucial insights into the biochemical evolution-and means of possible manipulation-of our brain.
DOE Office of Scientific and Technical Information (OSTI.GOV)
Grzybowska, Ewa A., E-mail: ewag@coi.waw.pl
2012-07-20
Highlights: Black-Right-Pointing-Pointer Functional characteristics of intronless genes (IGs). Black-Right-Pointing-Pointer Diseases associated with IGs. Black-Right-Pointing-Pointer Origin and evolution of IGs. Black-Right-Pointing-Pointer mRNA processing without splicing. -- Abstract: Intronless genes (IGs) constitute approximately 3% of the human genome. Human IGs are essentially different in evolution and functionality from the IGs of unicellular eukaryotes, which represent the majority in their genomes. Functional analysis of IGs has revealed a massive over-representation of signal transduction genes and genes encoding regulatory proteins important for growth, proliferation, and development. IGs also often display tissue-specific expression, usually in the nervous system and testis. These characteristics translate into IG-associatedmore » diseases, mainly neuropathies, developmental disorders, and cancer. IGs represent recent additions to the genome, created mostly by retroposition of processed mRNAs with retained functionality. Processing, nuclear export, and translation of these mRNAs should be hampered dramatically by the lack of splice factors, which normally tightly cover mature transcripts and govern their fate. However, natural IGs manage to maintain satisfactory expression levels. Different mechanisms by which IGs solve the problem of mRNA processing and nuclear export are discussed here, along with their possible impact on reporter studies.« less
USDA-ARS?s Scientific Manuscript database
Early maize adaptation to different agricultural environments was an important process associated with the creation of a stable food supply that allowed the evolution of human civilization in the Americas. To explore the mechanisms of maize adaptation, genomic, transcriptomic and phenomic data were ...
New frontiers in the study of human cultural and genetic evolution.
Ross, Cody T; Richerson, Peter J
2014-12-01
In this review, we discuss the dynamic linkages between culture and the genetic evolution of the human species. We begin by briefly describing the framework of gene-culture coevolutionary (or dual-inheritance) models for human evolutionary change. Until recently, the literature on gene-culture coevolution was composed primarily of mathematical models and formalized theory describing the complex dynamics underlying human behavior, adaptation, and technological evolution, but had little empirical support concerning genetics. The rapid progress in the fields of molecular genetics and genomics, however, is now providing the kinds of data needed to produce rich empirical support for gene-culture coevolutionary models. We briefly outline how theoretical and methodological progress in genome sciences has provided ways for the strength of selection on genes to be evaluated, and then outline how evidence of selection on several key genes can be directly linked to human cultural practices. We then describe some exciting new directions in the empirical study of gene-culture coevolution, and conclude with a discussion of the role of gene-culture evolutionary models in the future integration of medical, biological, and social sciences. Copyright © 2014 Elsevier Ltd. All rights reserved.
Lee, Meonghun; Yoe, Hyun
2015-01-01
The environment promotes evolution. Evolutionary processes represent environmental adaptations over long time scales; evolution of crop genomes is not inducible within the relatively short time span of a human generation. Extreme environmental conditions can accelerate evolution, but such conditions are often stress inducing and disruptive. Artificial growth systems can be used to induce and select genomic variation by changing external environmental conditions, thus, accelerating evolution. By using cloud computing and big-data analysis, we analyzed environmental stress factors for Pleurotus ostreatus by assessing, evaluating, and predicting information of the growth environment. Through the indexing of environmental stress, the growth environment can be precisely controlled and developed into a technology for improving crop quality and production. PMID:25874206
Comprehensive identification and analysis of human accelerated regulatory DNA
Gittelman, Rachel M.; Hun, Enna; Ay, Ferhat; Madeoy, Jennifer; Pennacchio, Len; Noble, William S.; Hawkins, R. David; Akey, Joshua M.
2015-01-01
It has long been hypothesized that changes in gene regulation have played an important role in human evolution, but regulatory DNA has been much more difficult to study compared with protein-coding regions. Recent large-scale studies have created genome-scale catalogs of DNase I hypersensitive sites (DHSs), which demark potentially functional regulatory DNA. To better define regulatory DNA that has been subject to human-specific adaptive evolution, we performed comprehensive evolutionary and population genetics analyses on over 18 million DHSs discovered in 130 cell types. We identified 524 DHSs that are conserved in nonhuman primates but accelerated in the human lineage (haDHS), and estimate that 70% of substitutions in haDHSs are attributable to positive selection. Through extensive computational and experimental analyses, we demonstrate that haDHSs are often active in brain or neuronal cell types; play an important role in regulating the expression of developmentally important genes, including many transcription factors such as SOX6, POU3F2, and HOX genes; and identify striking examples of adaptive regulatory evolution that may have contributed to human-specific phenotypes. More generally, our results reveal new insights into conserved and adaptive regulatory DNA in humans and refine the set of genomic substrates that distinguish humans from their closest living primate relatives. PMID:26104583
Karro, J E; Peifer, M; Hardison, R C; Kollmann, M; von Grünberg, H H
2008-02-01
The distribution of guanine and cytosine nucleotides throughout a genome, or the GC content, is associated with numerous features in mammals; understanding the pattern and evolutionary history of GC content is crucial to our efforts to annotate the genome. The local GC content is decaying toward an equilibrium point, but the causes and rates of this decay, as well as the value of the equilibrium point, remain topics of debate. By comparing the results of 2 methods for estimating local substitution rates, we identify 620 Mb of the human genome in which the rates of the various types of nucleotide substitutions are the same on both strands. These strand-symmetric regions show an exponential decay of local GC content at a pace determined by local substitution rates. DNA segments subjected to higher rates experience disproportionately accelerated decay and are AT rich, whereas segments subjected to lower rates decay more slowly and are GC rich. Although we are unable to draw any conclusions about causal factors, the results support the hypothesis proposed by Khelifi A, Meunier J, Duret L, and Mouchiroud D (2006. GC content evolution of the human and mouse genomes: insights from the study of processed pseudogenes in regions of different recombination rates. J Mol Evol. 62:745-752.) that the isochore structure has been reshaped over time. If rate variation were a determining factor, then the current isochore structure of mammalian genomes could result from the local differences in substitution rates. We predict that under current conditions strand-symmetric portions of the human genome will stabilize at an average GC content of 30% (considerably less than the current 42%), thus confirming that the human genome has not yet reached equilibrium.
Migrating microbes: what pathogens can tell us about population movements and human evolution.
Houldcroft, Charlotte J; Ramond, Jean-Baptiste; Rifkin, Riaan F; Underdown, Simon J
2017-08-01
The biology of human migration can be observed from the co-evolutionary relationship with infectious diseases. While many pathogens are brief, unpleasant visitors to human bodies, others have the ability to become life-long human passengers. The story of a pathogen's genetic code may, therefore, provide insight into the history of its human host. The evolution and distribution of disease in Africa is of particular interest, because of the deep history of human evolution in Africa, the presence of a variety of non-human primates, and tropical reservoirs of emerging infectious diseases. This study explores which pathogens leave traces in the archaeological record, and whether there are realistic prospects that these pathogens can be recovered from sub-Saharan African archaeological contexts. Three stories are then presented of germs on a journey. The first is the story of HIV's spread on the back of colonialism and the railway networks over the last 150 years. The second involves the spread of Schistosoma mansoni, a parasite which shares its history with the trans-Atlantic slave trade and the origins of fresh-water fishing. Finally, we discuss the tantalising hints of hominin migration and interaction found in the genome of human herpes simplex virus 2. Evidence from modern African pathogen genomes can provide data on human behaviour and migration in deep time and contribute to the improvement of human quality-of-life and longevity.
Genetics and the making of Homo sapiens.
Carroll, Sean B
2003-04-24
Understanding the genetic basis of the physical and behavioural traits that distinguish humans from other primates presents one of the great new challenges in biology. Of the millions of base-pair differences between humans and chimpanzees, which particular changes contributed to the evolution of human features after the separation of the Pan and Homo lineages 5-7 million years ago? How can we identify the 'smoking guns' of human genetic evolution from neutral ticks of the molecular evolutionary clock? The magnitude and rate of morphological evolution in hominids suggests that many independent and incremental developmental changes have occurred that, on the basis of recent findings in model animals, are expected to be polygenic and regulatory in nature. Comparative genomics, population genetics, gene-expression analyses and medical genetics have begun to make complementary inroads into the complex genetic architecture of human evolution.
The Evolution of Human Cells in Terms of Protein Innovation
Sardar, Adam J.; Oates, Matt E.; Fang, Hai; Forrest, Alistair R.R.; Kawaji, Hideya; Gough, Julian; Rackham, Owen J.L.
2014-01-01
Humans are composed of hundreds of cell types. As the genomic DNA of each somatic cell is identical, cell type is determined by what is expressed and when. Until recently, little has been reported about the determinants of human cell identity, particularly from the joint perspective of gene evolution and expression. Here, we chart the evolutionary past of all documented human cell types via the collective histories of proteins, the principal product of gene expression. FANTOM5 data provide cell-type–specific digital expression of human protein-coding genes and the SUPERFAMILY resource is used to provide protein domain annotation. The evolutionary epoch in which each protein was created is inferred by comparison with domain annotation of all other completely sequenced genomes. Studying the distribution across epochs of genes expressed in each cell type reveals insights into human cellular evolution in terms of protein innovation. For each cell type, its history of protein innovation is charted based on the genes it expresses. Combining the histories of all cell types enables us to create a timeline of cell evolution. This timeline identifies the possibility that our common ancestor Coelomata (cavity-forming animals) provided the innovation required for the innate immune system, whereas cells which now form the brain of human have followed a trajectory of continually accumulating novel proteins since Opisthokonta (boundary of animals and fungi). We conclude that exaptation of existing domain architectures into new contexts is the dominant source of cell-type–specific domain architectures. PMID:24692656
Schrider, Daniel R.; Kern, Andrew D.
2015-01-01
The comparative genomics revolution of the past decade has enabled the discovery of functional elements in the human genome via sequence comparison. While that is so, an important class of elements, those specific to humans, is entirely missed by searching for sequence conservation across species. Here we present an analysis based on variation data among human genomes that utilizes a supervised machine learning approach for the identification of human-specific purifying selection in the genome. Using only allele frequency information from the complete low-coverage 1000 Genomes Project data set in conjunction with a support vector machine trained from known functional and nonfunctional portions of the genome, we are able to accurately identify portions of the genome constrained by purifying selection. Our method identifies previously known human-specific gains or losses of function and uncovers many novel candidates. Candidate targets for gain and loss of function along the human lineage include numerous putative regulatory regions of genes essential for normal development of the central nervous system, including a significant enrichment of gain of function events near neurotransmitter receptor genes. These results are consistent with regulatory turnover being a key mechanism in the evolution of human-specific characteristics of brain development. Finally, we show that the majority of the genome is unconstrained by natural selection currently, in agreement with what has been estimated from phylogenetic methods but in sharp contrast to estimates based on transcriptomics or other high-throughput functional methods. PMID:26590212
The Evolution of Lineage-Specific Regulatory Activities in the Human Embryonic Limb
Cotney, Justin; Leng, Jing; Yin, Jun; Reilly, Steven K.; DeMare, Laura E.; Emera, Deena; Ayoub, Albert E.; Rakic, Pasko; Noonan, James P.
2013-01-01
SUMMARY The evolution of human anatomical features likely involved changes in gene regulation during development. However, the nature and extent of human-specific developmental regulatory functions remain unknown. We obtained a genome-wide view of cis-regulatory evolution in human embryonic tissues by comparing the histone modification H3K27ac, which provides a quantitative readout of promoter and enhancer activity, during human, rhesus, and mouse limb development. Based on increased H3K27ac, we find that 13% of promoters and 11% of enhancers have gained activity on the human lineage since the human-rhesus divergence. These gains largely arose by modification of ancestral regulatory activities in the limb or potential co-option from other tissues and are likely to have heterogeneous genetic causes. Most enhancers that exhibit gain of activity in humans originated in mammals. Gains at promoters and enhancers in the human limb are associated with increased gene expression, suggesting they include molecular drivers of human morphological evolution. PMID:23827682
Evolutionary Genomics of Wheat
USDA-ARS?s Scientific Manuscript database
Wheat is the world’s largest and most important food crop for direct human consumption, therefore, continued wheat improvement is paramount for feeding an ever-increasing human population. Wheat improvement is tightly associated with the characterization and understanding of wheat evolution and gene...
Lefébure, Tristan; Stanhope, Michael J
2007-01-01
Background The genus Streptococcus is one of the most diverse and important human and agricultural pathogens. This study employs comparative evolutionary analyses of 26 Streptococcus genomes to yield an improved understanding of the relative roles of recombination and positive selection in pathogen adaptation to their hosts. Results Streptococcus genomes exhibit extreme levels of evolutionary plasticity, with high levels of gene gain and loss during species and strain evolution. S. agalactiae has a large pan-genome, with little recombination in its core-genome, while S. pyogenes has a smaller pan-genome and much more recombination of its core-genome, perhaps reflecting the greater habitat, and gene pool, diversity for S. agalactiae compared to S. pyogenes. Core-genome recombination was evident in all lineages (18% to 37% of the core-genome judged to be recombinant), while positive selection was mainly observed during species differentiation (from 11% to 34% of the core-genome). Positive selection pressure was unevenly distributed across lineages and biochemical main role categories. S. suis was the lineage with the greatest level of positive selection pressure, the largest number of unique loci selected, and the largest amount of gene gain and loss. Conclusion Recombination is an important evolutionary force in shaping Streptococcus genomes, not only in the acquisition of significant portions of the genome as lineage specific loci, but also in facilitating rapid evolution of the core-genome. Positive selection, although undoubtedly a slower process, has nonetheless played an important role in adaptation of the core-genome of different Streptococcus species to different hosts. PMID:17475002
Cancer heterogeneity: converting a limitation into a source of biologic information.
Rübben, Albert; Araujo, Arturo
2017-09-08
Analysis of spatial and temporal genetic heterogeneity in human cancers has revealed that somatic cancer evolution in most cancers is not a simple linear process composed of a few sequential steps of mutation acquisitions and clonal expansions. Parallel evolution has been observed in many early human cancers resulting in genetic heterogeneity as well as multilineage progression. Moreover, aneuploidy as well as structural chromosomal aberrations seems to be acquired in a non-linear, punctuated mode where most aberrations occur at early stages of somatic cancer evolution. At later stages, the cancer genomes seem to get stabilized and acquire only few additional rearrangements. While parallel evolution suggests positive selection of driver mutations at early stages of somatic cancer evolution, stabilization of structural aberrations at later stages suggests that negative selection takes effect when cancer cells progressively lose their tolerance towards additional mutation acquisition. Mixing of genetically heterogeneous subclones in cancer samples reduces sensitivity of mutation detection. Moreover, driver mutations present only in a fraction of cancer cells are more likely to be mistaken for passenger mutations. Therefore, genetic heterogeneity may be considered a limitation negatively affecting detection sensitivity of driver mutations. On the other hand, identification of subclones and subclone lineages in human cancers may lead to a more profound understanding of the selective forces which shape somatic cancer evolution in human cancers. Identification of parallel evolution by analyzing spatial heterogeneity may hint to driver mutations which might represent additional therapeutic targets besides driver mutations present in a monoclonal state. Likewise, stabilization of cancer genomes which can be identified by analyzing temporal genetic heterogeneity might hint to genes and pathways which have become essential for survival of cancer cell lineages at later stages of cancer evolution. These genes and pathways might also constitute patient specific therapeutic targets.
Human development, heredity and evolution.
Nishinakamura, Ryuichi; Takasato, Minoru
2017-06-15
From March 27-29 2017, the RIKEN Center for Developmental Biology held a symposium entitled 'Towards Understanding Human Development, Heredity, and Evolution' in Kobe, Japan. Recent advances in technologies including stem cell culture, live imaging, single-cell approaches, next-generation sequencing and genome editing have led to an expansion in our knowledge of human development. Organized by Yoshiya Kawaguchi, Mitinori Saitou, Mototsugu Eiraku, Tomoya Kitajima, Fumio Matsuzaki, Takashi Tsuji and Edith Heard, the symposium covered a broad range of topics including human germline development, epigenetics, organogenesis and evolution. This Meeting Review provides a summary of this timely and exciting symposium, which has convinced us that we are moving into the era of science targeted on humans. © 2017. Published by The Company of Biologists Ltd.
Liu, Gangbiao; Zou, Yangyun; Cheng, Qiqun; Zeng, Yanwu; Gu, Xun; Su, Zhixi
2014-04-01
The age distribution of gene duplication events within the human genome exhibits two waves of duplications along with an ancient component. However, because of functional constraint differences, genes in different functional categories might show dissimilar retention patterns after duplication. It is known that genes in some functional categories are highly duplicated in the early stage of vertebrate evolution. However, the correlations of the age distribution pattern of gene duplication between the different functional categories are still unknown. To investigate this issue, we developed a robust pipeline to date the gene duplication events in the human genome. We successfully estimated about three-quarters of the duplication events within the human genome, along with the age distribution pattern in each Gene Ontology (GO) slim category. We found that some GO slim categories show different distribution patterns when compared to the whole genome. Further hierarchical clustering of the GO slim functional categories enabled grouping into two main clusters. We found that human genes located in the duplicated copy number variant regions, whose duplicate genes have not been fixed in the human population, were mainly enriched in the groups with a high proportion of recently duplicated genes. Moreover, we used a phylogenetic tree-based method to date the age of duplications in three signaling-related gene superfamilies: transcription factors, protein kinases and G-protein coupled receptors. These superfamilies were expressed in different subcellular localizations. They showed a similar age distribution as the signaling-related GO slim categories. We also compared the differences between the age distributions of gene duplications in multiple subcellular localizations. We found that the distribution patterns of the major subcellular localizations were similar to that of the whole genome. This study revealed the whole picture of the evolution patterns of gene functional categories in the human genome.
Duan, Naibin; Bai, Yang; Sun, Honghe; Wang, Nan; Ma, Yumin; Li, Mingjun; Wang, Xin; Jiao, Chen; Legall, Noah; Mao, Linyong; Wan, Sibao; Wang, Kun; He, Tianming; Feng, Shouqian; Zhang, Zongying; Mao, Zhiquan; Shen, Xiang; Chen, Xiaoliu; Jiang, Yuanmao; Wu, Shujing; Yin, Chengmiao; Ge, Shunfeng; Yang, Long; Jiang, Shenghui; Xu, Haifeng; Liu, Jingxuan; Wang, Deyun; Qu, Changzhi; Wang, Yicheng; Zuo, Weifang; Xiang, Li; Liu, Chang; Zhang, Daoyuan; Gao, Yuan; Xu, Yimin; Xu, Kenong; Chao, Thomas; Fazio, Gennaro; Shu, Huairui; Zhong, Gan-Yuan; Cheng, Lailiang; Fei, Zhangjun; Chen, Xuesen
2017-08-15
Human selection has reshaped crop genomes. Here we report an apple genome variation map generated through genome sequencing of 117 diverse accessions. A comprehensive model of apple speciation and domestication along the Silk Road is proposed based on evidence from diverse genomic analyses. Cultivated apples likely originate from Malus sieversii in Kazakhstan, followed by intensive introgressions from M. sylvestris. M. sieversii in Xinjiang of China turns out to be an "ancient" isolated ecotype not directly contributing to apple domestication. We have identified selective sweeps underlying quantitative trait loci/genes of important fruit quality traits including fruit texture and flavor, and provide evidences supporting a model of apple fruit size evolution comprising two major events with one occurring prior to domestication and the other during domestication. This study outlines the genetic basis of apple domestication and evolution, and provides valuable information for facilitating marker-assisted breeding and apple improvement.Apple is one of the most important fruit crops. Here, the authors perform deep genome resequencing of 117 diverse accessions and reveal comprehensive models of apple origin, speciation, domestication, and fruit size evolution as well as candidate genes associated with important agronomic traits.
Tempo and mode of genomic mutations unveil human evolutionary history.
Hara, Yuichiro
2015-01-01
Mutations that have occurred in human genomes provide insight into various aspects of evolutionary history such as speciation events and degrees of natural selection. Comparing genome sequences between human and great apes or among humans is a feasible approach for inferring human evolutionary history. Recent advances in high-throughput or so-called 'next-generation' DNA sequencing technologies have enabled the sequencing of thousands of individual human genomes, as well as a variety of reference genomes of hominids, many of which are publicly available. These sequence data can help to unveil the detailed demographic history of the lineage leading to humans as well as the explosion of modern human population size in the last several thousand years. In addition, high-throughput sequencing illustrates the tempo and mode of de novo mutations, which are producing human genetic variation at this moment. Pedigree-based human genome sequencing has shown that mutation rates vary significantly across the human genome. These studies have also provided an improved timescale of human evolution, because the mutation rate estimated from pedigree analysis is half that estimated from traditional analyses based on molecular phylogeny. Because of the dramatic reduction in sequencing cost, sequencing on-demand samples designed for specific studies is now also becoming popular. To produce data of sufficient quality to meet the requirements of the study, it is necessary to set an explicit sequencing plan that includes the choice of sample collection methods, sequencing platforms, and number of sequence reads.
On the Structural Plasticity of the Human Genome: Chromosomal Inversions Revisited
Alves, Joao M; Lopes, Alexandra M; Chikhi, Lounès; Amorim, António
2012-01-01
With the aid of novel and powerful molecular biology techniques, recent years have witnessed a dramatic increase in the number of studies reporting the involvement of complex structural variants in several genomic disorders. In fact, with the discovery of Copy Number Variants (CNVs) and other forms of unbalanced structural variation, much attention has been directed to the detection and characterization of such rearrangements, as well as the identification of the mechanisms involved in their formation. However, it has long been appreciated that chromosomes can undergo other forms of structural changes - balanced rearrangements - that do not involve quantitative variation of genetic material. Indeed, a particular subtype of balanced rearrangement – inversions – was recently found to be far more common than had been predicted from traditional cytogenetics. Chromosomal inversions alter the orientation of a specific genomic sequence and, unless involving breaks in coding or regulatory regions (and, disregarding complex trans effects, in their close vicinity), appear to be phenotypically silent. Such a surprising finding, which is difficult to reconcile with the classical interpretation of inversions as a mechanism causing subfertility (and ultimately reproductive isolation), motivated a new series of theoretical and empirical studies dedicated to understand their role in human genome evolution and to explore their possible association to complex genetic disorders. With this review, we attempt to describe the latest methodological improvements to inversions detection at a genome wide level, while exploring some of the possible implications of inversion rearrangements on the evolution of the human genome. PMID:23730202
Nakatani, Yoichiro; Takeda, Hiroyuki; Kohara, Yuji; Morishita, Shinichi
2007-09-01
Although several vertebrate genomes have been sequenced, little is known about the genome evolution of early vertebrates and how large-scale genomic changes such as the two rounds of whole-genome duplications (2R WGD) affected evolutionary complexity and novelty in vertebrates. Reconstructing the ancestral vertebrate genome is highly nontrivial because of the difficulty in identifying traces originating from the 2R WGD. To resolve this problem, we developed a novel method capable of pinning down remains of the 2R WGD in the human and medaka fish genomes using invertebrate tunicate and sea urchin genes to define ohnologs, i.e., paralogs produced by the 2R WGD. We validated the reconstruction using the chicken genome, which was not considered in the reconstruction step, and observed that many ancestral proto-chromosomes were retained in the chicken genome and had one-to-one correspondence to chicken microchromosomes, thereby confirming the reconstructed ancestral genomes. Our reconstruction revealed a contrast between the slow karyotype evolution after the second WGD and the rapid, lineage-specific genome reorganizations that occurred in the ancestral lineages of major taxonomic groups such as teleost fishes, amphibians, reptiles, and marsupials.
An Upper Limit on the Functional Fraction of the Human Genome.
Graur, Dan
2017-07-01
For the human population to maintain a constant size from generation to generation, an increase in fertility must compensate for the reduction in the mean fitness of the population caused, among others, by deleterious mutations. The required increase in fertility due to this mutational load depends on the number of sites in the genome that are functional, the mutation rate, and the fraction of deleterious mutations among all mutations in functional regions. These dependencies and the fact that there exists a maximum tolerable replacement level fertility can be used to put an upper limit on the fraction of the human genome that can be functional. Mutational load considerations lead to the conclusion that the functional fraction within the human genome cannot exceed 25%, and is probably considerably lower. © The Author 2017. Published by Oxford University Press on behalf of the Society for Molecular Biology and Evolution.
Genome-wide scans for loci under selection in humans
2005-01-01
Natural selection, which can be defined as the differential contribution of genetic variants to future generations, is the driving force of Darwinian evolution. Identifying regions of the human genome that have been targets of natural selection is an important step in clarifying human evolutionary history and understanding how genetic variation results in phenotypic diversity, it may also facilitate the search for complex disease genes. Technological advances in high-throughput DNA sequencing and single nucleotide polymorphism genotyping have enabled several genome-wide scans of natural selection to be undertaken. Here, some of the observations that are beginning to emerge from these studies will be reviewed, including evidence for geographically restricted selective pressures (ie local adaptation) and a relationship between genes subject to natural selection and human disease. In addition, the paper will highlight several important problems that need to be addressed in future genome-wide studies of natural selection. PMID:16004726
Helminth Genomics: The Implications for Human Health
Brindley, Paul J.; Mitreva, Makedonka; Ghedin, Elodie; Lustigman, Sara
2009-01-01
More than two billion people (one-third of humanity) are infected with parasitic roundworms or flatworms, collectively known as helminth parasites. These infections cause diseases that are responsible for enormous levels of morbidity and mortality, delays in the physical development of children, loss of productivity among the workforce, and maintenance of poverty. Genomes of the major helminth species that affect humans, and many others of agricultural and veterinary significance, are now the subject of intensive genome sequencing and annotation. Draft genome sequences of the filarial worm Brugia malayi and two of the human schistosomes, Schistosoma japonicum and S. mansoni, are now available, among others. These genome data will provide the basis for a comprehensive understanding of the molecular mechanisms involved in helminth nutrition and metabolism, host-dependent development and maturation, immune evasion, and evolution. They are likely also to predict new potential vaccine candidates and drug targets. In this review, we present an overview of these efforts and emphasize the potential impact and importance of these new findings. PMID:19855829
Demographic history, selection and functional diversity of the canine genome.
Ostrander, Elaine A; Wayne, Robert K; Freedman, Adam H; Davis, Brian W
2017-12-01
The domestic dog represents one of the most dramatic long-term evolutionary experiments undertaken by humans. From a large wolf-like progenitor, unparalleled diversity in phenotype and behaviour has developed in dogs, providing a model for understanding the developmental and genomic mechanisms of diversification. We discuss pattern and process in domestication, beginning with general findings about early domestication and problems in documenting selection at the genomic level. Furthermore, we summarize genotype-phenotype studies based first on single nucleotide polymorphism (SNP) genotyping and then with whole-genome data and show how an understanding of evolution informs topics as different as human history, adaptive and deleterious variation, morphological development, ageing, cancer and behaviour.
Evolutionary genomics of Entamoeba
Weedall, Gareth D.; Hall, Neil
2011-01-01
Entamoeba histolytica is a human pathogen that causes amoebic dysentery and leads to significant morbidity and mortality worldwide. Understanding the genome and evolution of the parasite will help explain how, when and why it causes disease. Here we review current knowledge about the evolutionary genomics of Entamoeba: how differences between the genomes of different species may help explain different phenotypes, and how variation among E. histolytica parasites reveals patterns of population structure. The imminent expansion of the amount genome data will greatly improve our knowledge of the genus and of pathogenic species within it. PMID:21288488
Chromosomal duplications in bacteria, fruit flies, and humans
DOE Office of Scientific and Technical Information (OSTI.GOV)
Lupski, J.R.; Weinstock, G.M.; Roth, J.R.
1996-01-01
Tandem duplication of chromosomal segments has been recognized as a frequent mutational mechanism in several genetic model systems. In bacteria, fruit flies, and humans, duplications form by similar molecular mechanisms and appear to be important in genome evolution. 80 refs.
Genomes of all known members of a Plasmodium subgenus reveal paths to virulent human malaria.
Otto, Thomas D; Gilabert, Aude; Crellen, Thomas; Böhme, Ulrike; Arnathau, Céline; Sanders, Mandy; Oyola, Samuel O; Okouga, Alain Prince; Boundenga, Larson; Willaume, Eric; Ngoubangoye, Barthélémy; Moukodoum, Nancy Diamella; Paupy, Christophe; Durand, Patrick; Rougeron, Virginie; Ollomo, Benjamin; Renaud, François; Newbold, Chris; Berriman, Matthew; Prugnolle, Franck
2018-06-01
Plasmodium falciparum, the most virulent agent of human malaria, shares a recent common ancestor with the gorilla parasite Plasmodium praefalciparum. Little is known about the other gorilla- and chimpanzee-infecting species in the same (Laverania) subgenus as P. falciparum, but none of them are capable of establishing repeated infection and transmission in humans. To elucidate underlying mechanisms and the evolutionary history of this subgenus, we have generated multiple genomes from all known Laverania species. The completeness of our dataset allows us to conclude that interspecific gene transfers, as well as convergent evolution, were important in the evolution of these species. Striking copy number and structural variations were observed within gene families and one, stevor, shows a host-specific sequence pattern. The complete genome sequence of the closest ancestor of P. falciparum enables us to estimate the timing of the beginning of speciation to be 40,000-60,000 years ago followed by a population bottleneck around 4,000-6,000 years ago. Our data allow us also to search in detail for the features of P. falciparum that made it the only member of the Laverania able to infect and spread in humans.
A revised timescale for human evolution based on ancient mitochondrial genomes
Johnson, Philip L.F.; Bos, Kirsten; Lari, Martina; Bollongino, Ruth; Sun, Chengkai; Giemsch, Liane; Schmitz, Ralf; Burger, Joachim; Ronchitelli, Anna Maria; Martini, Fabio; Cremonesi, Renata G.; Svoboda, Jiří; Bauer, Peter; Caramelli, David; Castellano, Sergi; Reich, David; Pääbo, Svante; Krause, Johannes
2016-01-01
Summary Background Recent analyses of de novo DNA mutations in modern humans have suggested a nuclear substitution rate that is approximately half that of previous estimates based on fossil calibration. This result has led to suggestions that major events in human evolution occurred far earlier than previously thought. Result Here we use mitochondrial genome sequences from 10 securely dated ancient modern humans spanning 40,000 years as calibration points for the mitochondrial clock, thus yielding a direct estimate of the mitochondrial substitution rate. Our clock yields mitochondrial divergence times that are in agreement with earlier estimates based on calibration points derived from either fossils or archaeological material. In particular, our results imply a separation of non-Africans from the most closely related sub-Saharan African mitochondrial DNAs (haplogroup L3) of less than 62,000-95,000 years ago. Conclusion Though single loci like mitochondrial DNA (mtDNA) can only provide biased estimates of population split times, they can provide valid upper bounds; our results exclude most of the older dates for African and non-African split times recently suggested by de novo mutation rate estimates in the nuclear genome. PMID:23523248
A revised timescale for human evolution based on ancient mitochondrial genomes.
Fu, Qiaomei; Mittnik, Alissa; Johnson, Philip L F; Bos, Kirsten; Lari, Martina; Bollongino, Ruth; Sun, Chengkai; Giemsch, Liane; Schmitz, Ralf; Burger, Joachim; Ronchitelli, Anna Maria; Martini, Fabio; Cremonesi, Renata G; Svoboda, Jiří; Bauer, Peter; Caramelli, David; Castellano, Sergi; Reich, David; Pääbo, Svante; Krause, Johannes
2013-04-08
Recent analyses of de novo DNA mutations in modern humans have suggested a nuclear substitution rate that is approximately half that of previous estimates based on fossil calibration. This result has led to suggestions that major events in human evolution occurred far earlier than previously thought. Here, we use mitochondrial genome sequences from ten securely dated ancient modern humans spanning 40,000 years as calibration points for the mitochondrial clock, thus yielding a direct estimate of the mitochondrial substitution rate. Our clock yields mitochondrial divergence times that are in agreement with earlier estimates based on calibration points derived from either fossils or archaeological material. In particular, our results imply a separation of non-Africans from the most closely related sub-Saharan African mitochondrial DNAs (haplogroup L3) that occurred less than 62-95 kya. Though single loci like mitochondrial DNA (mtDNA) can only provide biased estimates of population divergence times, they can provide valid upper bounds. Our results exclude most of the older dates for African and non-African population divergences recently suggested by de novo mutation rate estimates in the nuclear genome. Copyright © 2013 Elsevier Ltd. All rights reserved.
ERIC Educational Resources Information Center
Qiu, Shuhao
2015-01-01
In order to investigate the complexity of mutations, a computational approach named Genome Evolution by Matrix Algorithms ("GEMA") has been implemented. GEMA models genomic changes, taking into account hundreds of mutations within each individual in a population. By modeling of entire human chromosomes, GEMA precisely mimics real…
Significant expansion of exon-bordering protein domains during animal proteome evolution
Liu, Mingyi; Walch, Heiko; Wu, Shaoping; Grigoriev, Andrei
2005-01-01
We present evidence of remarkable genome-wide mobility and evolutionary expansion for a class of protein domains whose borders locate close to the borders of their encoding exons. These exon-bordering domains are more numerous and widely distributed in the human genome than other domains. They also co-occur with more diverse domains to form a larger variety of domain architectures in human proteins. A systematic comparison of nine animal genomes from nematodes to mammals revealed that exon-bordering domains expanded faster than other protein domains in both abundance and distribution, as well as the diversity of co-occurring domains and the domain architectures of harboring proteins. Furthermore, exon-bordering domains exhibited a particularly strong preference for class 1-1 intron phase. Our findings suggest that exon-bordering domains were amplified and interchanged within a genome more often and/or more successfully than other domains during evolution, probably the result of extensive exon shuffling and gene duplication events. The diverse biological functions of these domains underscore the important role they play in the expansion and diversification of animal proteomes. PMID:15640447
Ancient Recombination Events between Human Herpes Simplex Viruses.
Burrel, Sonia; Boutolleau, David; Ryu, Diane; Agut, Henri; Merkel, Kevin; Leendertz, Fabian H; Calvignac-Spencer, Sébastien
2017-07-01
Herpes simplex viruses 1 and 2 (HSV-1 and HSV-2) are seen as close relatives but also unambiguously considered as evolutionary independent units. Here, we sequenced the genomes of 18 HSV-2 isolates characterized by divergent UL30 gene sequences to further elucidate the evolutionary history of this virus. Surprisingly, genome-wide recombination analyses showed that all HSV-2 genomes sequenced to date contain HSV-1 fragments. Using phylogenomic analyses, we could also show that two main HSV-2 lineages exist. One lineage is mostly restricted to subSaharan Africa whereas the other has reached a global distribution. Interestingly, only the worldwide lineage is characterized by ancient recombination events with HSV-1. Our findings highlight the complexity of HSV-2 evolution, a virus of putative zoonotic origin which later recombined with its human-adapted relative. They also suggest that coinfections with HSV-1 and 2 may have genomic and potentially functional consequences and should therefore be monitored more closely. © The Author 2017. Published by Oxford University Press on behalf of the Society for Molecular Biology and Evolution.
Yasukochi, Yoshiki; Satta, Yoko
2015-03-25
The human cytochrome P450 (CYP) 2D6 gene is a member of the CYP2D gene subfamily, along with the CYP2D7P and CYP2D8P pseudogenes. Although the CYP2D6 enzyme has been studied extensively because of its clinical importance, the evolution of the CYP2D subfamily has not yet been fully understood. Therefore, the goal of this study was to reveal the evolutionary process of the human drug metabolic system. Here, we investigate molecular evolution of the CYP2D subfamily in primates by comparing 14 CYP2D sequences from humans to New World monkey genomes. Window analysis and statistical tests revealed that entire genomic sequences of paralogous genes were extensively homogenized by gene conversion during molecular evolution of CYP2D genes in primates. A neighbor-joining tree based on genomic sequences at the nonsubstrate recognition sites showed that CYP2D6 and CYP2D8 genes were clustered together due to gene conversion. In contrast, a phylogenetic tree using amino acid sequences at substrate recognition sites did not cluster the CYP2D6 and CYP2D8 genes, suggesting that the functional constraint on substrate specificity is one of the causes for purifying selection at the substrate recognition sites. Our results suggest that the CYP2D gene subfamily in primates has evolved to maintain the regioselectivity for a substrate hydroxylation activity between individual enzymes, even though extensive gene conversion has occurred across CYP2D coding sequences. © The Author(s) 2015. Published by Oxford University Press on behalf of the Society for Molecular Biology and Evolution.
Population and clinical genetics of human transposable elements in the (post) genomic era
Rishishwar, Lavanya; Wang, Lu; Clayton, Evan A.; Mariño-Ramírez, Leonardo; McDonald, John F.; Jordan, I. King
2017-01-01
ABSTRACT Recent technological developments—in genomics, bioinformatics and high-throughput experimental techniques—are providing opportunities to study ongoing human transposable element (TE) activity at an unprecedented level of detail. It is now possible to characterize genome-wide collections of TE insertion sites for multiple human individuals, within and between populations, and for a variety of tissue types. Comparison of TE insertion site profiles between individuals captures the germline activity of TEs and reveals insertion site variants that segregate as polymorphisms among human populations, whereas comparison among tissue types ascertains somatic TE activity that generates cellular heterogeneity. In this review, we provide an overview of these new technologies and explore their implications for population and clinical genetic studies of human TEs. We cover both recent published results on human TE insertion activity as well as the prospects for future TE studies related to human evolution and health. PMID:28228978
Evolution of H3N2v viruses in North American swine and humans, 2009-2011
USDA-ARS?s Scientific Manuscript database
Novel H3N2 influenza viruses (H3N2v) containing seven genome segments from swine-lineage triple reassortant H3N2 viruses and a 2009 pandemic H1N1 (H1N1pdm09) matrix protein segment (pM) have been isolated from 12 humans in the United States between August – December 2011. To understand the evolution...
Zeng, Jia; Yi, Soojin V
2014-10-16
Recombination clusters nonuniformly across mammalian genomes at discrete genomic loci referred to as recombination hotspots. Despite their ubiquitous presence, individual hotspots rapidly lose their activities, and the molecular and evolutionary mechanisms underlying such frequent hotspot turnovers (the so-called "recombination hotspot paradox") remain unresolved. Even though some sequence motifs are significantly associated with hotspots, multiple lines of evidence indicate that factors other than underlying sequences, such as epigenetic modifications, may affect the evolution of recombination hotspots. Thus, identifying epigenetic factors that covary with recombination at fine-scale is a promising step for this important research area. It was previously reported that recombination rates correlate with indirect measures of DNA methylation in the human genome. Here, we analyze experimentally determined DNA methylation and histone modification of human sperms, and show that the correlation between DNA methylation and recombination in long-range windows does not hold with respect to the spatial and temporal variation of recombination at hotspots. On the other hand, two histone modifications (H3K4me3 and H3K27me3) overlap extensively with recombination hotspots. Similar trends were observed in mice. These results indicate that specific histone modifications rather than DNA methylation are associated with the rapid evolution of recombination hotspots. Furthermore, many human recombination hotspots occupy "bivalent" chromatin regions that harbor both active (H3K4me3) and repressive (H3K27me3) marks. This may explain why human recombination hotspots tend to occur in nongenic regions, in contrast to yeast and Arabidopsis hotspots that are characterized by generally active chromatins. Our results highlight the dynamic epigenetic underpinnings of recombination hotspot evolution. © The Author(s) 2014. Published by Oxford University Press on behalf of the Society for Molecular Biology and Evolution.
Genetic Structures of Copy Number Variants Revealed by Genotyping Single Sperm
Luo, Minjie; Cui, Xiangfeng; Fredman, David; Brookes, Anthony J.; Azaro, Marco A.; Greenawalt, Danielle M.; Hu, Guohong; Wang, Hui-Yun; Tereshchenko, Irina V.; Lin, Yong; Shentu, Yue; Gao, Richeng; Shen, Li; Li, Honghua
2009-01-01
Background Copy number variants (CNVs) occupy a significant portion of the human genome and may have important roles in meiotic recombination, human genome evolution and gene expression. Many genetic diseases may be underlain by CNVs. However, because of the presence of their multiple copies, variability in copy numbers and the diploidy of the human genome, detailed genetic structure of CNVs cannot be readily studied by available techniques. Methodology/Principal Findings Single sperm samples were used as the primary subjects for the study so that CNV haplotypes in the sperm donors could be studied individually. Forty-eight CNVs characterized in a previous study were analyzed using a microarray-based high-throughput genotyping method after multiplex amplification. Seventeen single nucleotide polymorphisms (SNPs) were also included as controls. Two single-base variants, either allelic or paralogous, could be discriminated for all markers. Microarray data were used to resolve SNP alleles and CNV haplotypes, to quantitatively assess the numbers and compositions of the paralogous segments in each CNV haplotype. Conclusions/Significance This is the first study of the genetic structure of CNVs on a large scale. Resulting information may help understand evolution of the human genome, gain insight into many genetic processes, and discriminate between CNVs and SNPs. The highly sensitive high-throughput experimental system with haploid sperm samples as subjects may be used to facilitate detailed large-scale CNV analysis. PMID:19384415
Takeuchi, Fumihiko; Watanabe, Shinya; Baba, Tadashi; Yuzawa, Harumi; Ito, Teruyo; Morimoto, Yuh; Kuroda, Makoto; Cui, Longzhu; Takahashi, Mikio; Ankai, Akiho; Baba, Shin-ichi; Fukui, Shigehiro; Lee, Jean C.; Hiramatsu, Keiichi
2005-01-01
Staphylococcus haemolyticus is an opportunistic bacterial pathogen that colonizes human skin and is remarkable for its highly antibiotic-resistant phenotype. We determined the complete genome sequence of S.haemolyticus to better understand its pathogenicity and evolutionary relatedness to the other staphylococcal species. A large proportion of the open reading frames in the genomes of S.haemolyticus, Staphylococcus aureus, and Staphylococcus epidermidis were conserved in their sequence and order on the chromosome. We identified a region of the bacterial chromosome just downstream of the origin of replication that showed little homology among the species but was conserved among strains within a species. This novel region, designated the “oriC environ,” likely contributes to the evolution and differentiation of the staphylococcal species, since it was enriched for species-specific nonessential genes that contribute to the biological features of each staphylococcal species. A comparative analysis of the genomes of S.haemolyticus, S.aureus, and S.epidermidis elucidated differences in their biological and genetic characteristics and pathogenic potentials. We identified as many as 82 insertion sequences in the S.haemolyticus chromosome that probably mediated frequent genomic rearrangements, resulting in phenotypic diversification of the strain. Such rearrangements could have brought genomic plasticity to this species and contributed to its acquisition of antibiotic resistance. PMID:16237012
Copin, Richard; Wang, Xueying; Louie, Eddie; Escuyer, Vincent; Coscolla, Mireia; Gagneux, Sebastien; Palmer, Guy H; Ernst, Joel D
2016-12-01
Molecular epidemiological assessments, drug treatment optimization, and development of immunological interventions all depend on understanding pathogen adaptation and genetic variation, which differ for specific pathogens. Mycobacterium tuberculosis is an exceptionally successful human pathogen, yet beyond knowledge that this bacterium has low overall genomic variation but acquires drug resistance mutations, little is known of the factors that drive its population genomic characteristics. Here, we compared the genetic diversity of the bacteria that established infection to the bacterial populations obtained from infected tissues during murine M. tuberculosis pulmonary infection and human disseminated M. bovis BCG infection. We found that new mutations accumulate during in vitro culture, but that in vivo, purifying selection against new mutations dominates, indicating that M. tuberculosis follows a dominant lineage model of evolution. Comparing bacterial populations passaged in T cell-deficient and immunocompetent mice, we found that the presence of T cells is associated with an increase in the diversity of the M. tuberculosis genome. Together, our findings put M. tuberculosis genetic evolution in a new perspective and clarify the impact of T cells on sequence diversity of M. tuberculosis.
Wallberg, Andreas; Glémin, Sylvain; Webster, Matthew T.
2015-01-01
Meiotic recombination is a fundamental cellular process, with important consequences for evolution and genome integrity. However, we know little about how recombination rates vary across the genomes of most species and the molecular and evolutionary determinants of this variation. The honeybee, Apis mellifera, has extremely high rates of meiotic recombination, although the evolutionary causes and consequences of this are unclear. Here we use patterns of linkage disequilibrium in whole genome resequencing data from 30 diploid honeybees to construct a fine-scale map of rates of crossing over in the genome. We find that, in contrast to vertebrate genomes, the recombination landscape is not strongly punctate. Crossover rates strongly correlate with levels of genetic variation, but not divergence, which indicates a pervasive impact of selection on the genome. Germ-line methylated genes have reduced crossover rate, which could indicate a role of methylation in suppressing recombination. Controlling for the effects of methylation, we do not infer a strong association between gene expression patterns and recombination. The site frequency spectrum is strongly skewed from neutral expectations in honeybees: rare variants are dominated by AT-biased mutations, whereas GC-biased mutations are found at higher frequencies, indicative of a major influence of GC-biased gene conversion (gBGC), which we infer to generate an allele fixation bias 5 – 50 times the genomic average estimated in humans. We uncover further evidence that this repair bias specifically affects transitions and favours fixation of CpG sites. Recombination, via gBGC, therefore appears to have profound consequences on genome evolution in honeybees and interferes with the process of natural selection. These findings have important implications for our understanding of the forces driving molecular evolution. PMID:25902173
Samuels, David C.; Boys, Richard J.; Henderson, Daniel A.; Chinnery, Patrick F.
2003-01-01
We applied a hidden Markov model segmentation method to the human mitochondrial genome to identify patterns in the sequence, to compare these patterns to the gene structure of mtDNA and to see whether these patterns reveal additional characteristics important for our understanding of genome evolution, structure and function. Our analysis identified three segmentation categories based upon the sequence transition probabilities. Category 2 segments corresponded to the tRNA and rRNA genes, with a greater strand-symmetry in these segments. Category 1 and 3 segments covered the protein- coding genes and almost all of the non-coding D-loop. Compared to category 1, the mtDNA segments assigned to category 3 had much lower guanine abundance. A comparison to two independent databases of mitochondrial mutations and polymorphisms showed that the high substitution rate of guanine in human mtDNA is largest in the category 3 segments. Analysis of synonymous mutations showed the same pattern. This suggests that this heterogeneity in the mutation rate is partly independent of respiratory chain function and is a direct property of the genome sequence itself. This has important implications for our understanding of mtDNA evolution and its use as a ‘molecular clock’ to determine the rate of population and species divergence. PMID:14530452
2004-12-09
We present here a draft genome sequence of the red jungle fowl, Gallus gallus. Because the chicken is a modern descendant of the dinosaurs and the first non-mammalian amniote to have its genome sequenced, the draft sequence of its genome--composed of approximately one billion base pairs of sequence and an estimated 20,000-23,000 genes--provides a new perspective on vertebrate genome evolution, while also improving the annotation of mammalian genomes. For example, the evolutionary distance between chicken and human provides high specificity in detecting functional elements, both non-coding and coding. Notably, many conserved non-coding sequences are far from genes and cannot be assigned to defined functional classes. In coding regions the evolutionary dynamics of protein domains and orthologous groups illustrate processes that distinguish the lineages leading to birds and mammals. The distinctive properties of avian microchromosomes, together with the inferred patterns of conserved synteny, provide additional insights into vertebrate chromosome architecture.
Swaggart, Kayleigh A.; Pavlicev, Mihaela; Muglia, Louis J.
2015-01-01
The molecular mechanisms controlling human birth timing at term, or resulting in preterm birth, have been the focus of considerable investigation, but limited insights have been gained over the past 50 years. In part, these processes have remained elusive because of divergence in reproductive strategies and physiology shown by model organisms, making extrapolation to humans uncertain. Here, we summarize the evolution of progesterone signaling and variation in pregnancy maintenance and termination. We use this comparative physiology to support the hypothesis that selective pressure on genomic loci involved in the timing of parturition have shaped human birth timing, and that these loci can be identified with comparative genomic strategies. Previous limitations imposed by divergence of mechanisms provide an important new opportunity to elucidate fundamental pathways of parturition control through increasing availability of sequenced genomes and associated reproductive physiology characteristics across diverse organisms. PMID:25646385
Clinical Sequencing Uncovers Origins and Evolution of Lassa Virus.
Andersen, Kristian G; Shapiro, B Jesse; Matranga, Christian B; Sealfon, Rachel; Lin, Aaron E; Moses, Lina M; Folarin, Onikepe A; Goba, Augustine; Odia, Ikponmwonsa; Ehiane, Philomena E; Momoh, Mambu; England, Eleina M; Winnicki, Sarah; Branco, Luis M; Gire, Stephen K; Phelan, Eric; Tariyal, Ridhi; Tewhey, Ryan; Omoniwa, Omowunmi; Fullah, Mohammed; Fonnie, Richard; Fonnie, Mbalu; Kanneh, Lansana; Jalloh, Simbirie; Gbakie, Michael; Saffa, Sidiki; Karbo, Kandeh; Gladden, Adrianne D; Qu, James; Stremlau, Matthew; Nekoui, Mahan; Finucane, Hilary K; Tabrizi, Shervin; Vitti, Joseph J; Birren, Bruce; Fitzgerald, Michael; McCowan, Caryn; Ireland, Andrea; Berlin, Aaron M; Bochicchio, James; Tazon-Vega, Barbara; Lennon, Niall J; Ryan, Elizabeth M; Bjornson, Zach; Milner, Danny A; Lukens, Amanda K; Broodie, Nisha; Rowland, Megan; Heinrich, Megan; Akdag, Marjan; Schieffelin, John S; Levy, Danielle; Akpan, Henry; Bausch, Daniel G; Rubins, Kathleen; McCormick, Joseph B; Lander, Eric S; Günther, Stephan; Hensley, Lisa; Okogbenin, Sylvanus; Schaffner, Stephen F; Okokhere, Peter O; Khan, S Humarr; Grant, Donald S; Akpede, George O; Asogun, Danny A; Gnirke, Andreas; Levin, Joshua Z; Happi, Christian T; Garry, Robert F; Sabeti, Pardis C
2015-08-13
The 2013-2015 West African epidemic of Ebola virus disease (EVD) reminds us of how little is known about biosafety level 4 viruses. Like Ebola virus, Lassa virus (LASV) can cause hemorrhagic fever with high case fatality rates. We generated a genomic catalog of almost 200 LASV sequences from clinical and rodent reservoir samples. We show that whereas the 2013-2015 EVD epidemic is fueled by human-to-human transmissions, LASV infections mainly result from reservoir-to-human infections. We elucidated the spread of LASV across West Africa and show that this migration was accompanied by changes in LASV genome abundance, fatality rates, codon adaptation, and translational efficiency. By investigating intrahost evolution, we found that mutations accumulate in epitopes of viral surface proteins, suggesting selection for immune escape. This catalog will serve as a foundation for the development of vaccines and diagnostics. VIDEO ABSTRACT. Copyright © 2015 Elsevier Inc. All rights reserved.
Clinical sequencing uncovers origins and evolution of Lassa virus
Andersen, Kristian G.; Shapiro, B. Jesse; Matranga, Christian B.; Sealfon, Rachel; Lin, Aaron E.; Moses, Lina M.; Folarin, Onikepe A.; Goba, Augustine; Odia, Ikponmwonsa; Ehiane, Philomena E.; Momoh, Mambu; England, Eleina M.; Winnicki, Sarah; Branco, Luis M.; Gire, Stephen K.; Phelan, Eric; Tariyal, Ridhi; Tewhey, Ryan; Omoniwa, Omowunmi; Fullah, Mohammed; Fonnie, Richard; Fonnie, Mbalu; Kanneh, Lansana; Jalloh, Simbirie; Gbakie, Michael; Saffa, Sidiki; Karbo, Kandeh; Gladden, Adrianne D.; Qu, James; Stremlau, Matthew; Nekoui, Mahan; Finucane, Hilary K.; Tabrizi, Shervin; Vitti, Joseph J.; Birren, Bruce; Fitzgerald, Michael; McCowan, Caryn; Ireland, Andrea; Berlin, Aaron M.; Bochicchio, James; Tazon-Vega, Barbara; Lennon, Niall J.; Ryan, Elizabeth M.; Bjornson, Zach; Milner, Danny A.; Lukens, Amanda K.; Broodie, Nisha; Rowland, Megan; Heinrich, Megan; Akdag, Marjan; Schieffelin, John S.; Levy, Danielle; Akpan, Henry; Bausch, Daniel G.; Rubins, Kathleen; McCormick, Joseph B.; Lander, Eric S.; Günther, Stephan; Hensley, Lisa; Okogbenin, Sylvanus; Schaffner, Stephen F.; Okokhere, Peter O.; Khan, S. Humarr; Grant, Donald S.; Akpede, George O.; Asogun, Danny A.; Gnirke, Andreas; Levin, Joshua Z.; Happi, Christian T.; Garry, Robert F.; Sabeti, Pardis C.
2015-01-01
Summary The 2013-2015 West African epidemic of Ebola virus disease (EVD) reminds us how little is known about biosafety level-4 viruses. Like Ebola virus, Lassa virus (LASV) can cause hemorrhagic fever with high case fatality rates. We generated a genomic catalog of almost 200 LASV sequences from clinical and rodent reservoir samples. We show that whereas the 2013-2015 EVD epidemic is fueled by human-to-human transmissions, LASV infections mainly result from reservoir-to-human infections. We elucidated the spread of LASV across West Africa and show that this migration was accompanied by changes in LASV genome abundance, fatality rates, codon adaptation, and translational efficiency. By investigating intrahost evolution, we found that mutations accumulate in epitopes of viral surface proteins, suggesting selection for immune escape. This catalog will serve as a foundation for the development of vaccines and diagnostics. PMID:26276630
Neanderthal ancestry drives evolution of lipid catabolism in contemporary Europeans
Khrameeva, Ekaterina E.; Bozek, Katarzyna; He, Liu; Yan, Zheng; Jiang, Xi; Wei, Yuning; Tang, Kun; Gelfand, Mikhail S.; Prufer, Kay; Kelso, Janet; Paabo, Svante; Giavalisco, Patrick; Lachmann, Michael; Khaitovich, Philipp
2014-01-01
Although Neanderthals are extinct, fragments of their genomes persist in contemporary humans. Here we show that while the genome-wide frequency of Neanderthal-like sites is approximately constant across all contemporary out-of-Africa populations, genes involved in lipid catabolism contain more than threefold excess of such sites in contemporary humans of European descent. Evolutionally, these genes show significant association with signatures of recent positive selection in the contemporary European, but not Asian or African populations. Functionally, the excess of Neanderthal-like sites in lipid catabolism genes can be linked with a greater divergence of lipid concentrations and enzyme expression levels within this pathway, seen in contemporary Europeans, but not in the other populations. We conclude that sequence variants that evolved in Neanderthals may have given a selective advantage to anatomically modern humans that settled in the same geographical areas. PMID:24690587
The Molecular Basis of Human Brain Evolution.
Enard, Wolfgang
2016-10-24
Humans are a remarkable species, especially because of the remarkable properties of their brain. Since the split from the chimpanzee lineage, the human brain has increased three-fold in size and has acquired abilities for vocal learning, language and intense cooperation. To better understand the molecular basis of these changes is of great biological and biomedical interest. However, all the about 16 million fixed genetic changes that occurred during human evolution are fully correlated with all molecular, cellular, anatomical and behavioral changes that occurred during this time. Hence, as humans and chimpanzees cannot be crossed or genetically manipulated, no direct evidence for linking particular genetic and molecular changes to human brain evolution can be obtained. Here, I sketch a framework how indirect evidence can be obtained and review findings related to the molecular basis of human cognition, vocal learning and brain size. In particular, I discuss how a comprehensive comparative approach, leveraging cellular systems and genomic technologies, could inform the evolution of our brain in the future. Copyright © 2016 Elsevier Ltd. All rights reserved.
The evolution of human warfare.
Pitman, George R
2011-01-01
Here we propose a new theory for the origins and evolution of human warfare as a complex social phenomenon involving several behavioral traits, including aggression, risk taking, male bonding, ingroup altruism, outgroup xenophobia, dominance and subordination, and territoriality, all of which are encoded in the human genome. Among the family of great apes only chimpanzees and humans engage in war; consequently, warfare emerged in their immediate common ancestor that lived in patrilocal groups who fought one another for females. The reasons for warfare changed when the common ancestor females began to immigrate into the groups of their choice, and again, during the agricultural revolution.
Comparative genome map of human and cattle
DOE Office of Scientific and Technical Information (OSTI.GOV)
Solinas-Toldo, S.; Fries, R.; Lengauer, C.
Chromosomal homologies between individual human chromosomes and the bovine karyotype have been established by using a new approach termed Zoo-FISH. Labeled DNA libraries from flow-sorted human chromosomes were used as probes for fluorescence in situ hybridization on cattle chromosomes. All human DNA libraries, except the Y chromosome library, hybridized to one or more cattle chromosomes, identifying and delineating 50 segments of homology, most of them corresponding to the regions of homology as identified by the previous mapping of individual conserved loci. However, Zoo-FISH refines the comparative maps constructed by molecular gene mapping of individual loci by providing information on themore » boundaries of conserved regions in the absence of obvious cytogenetic homologies of human and bovine chromosomes. It allows study of karyotypic evolution and opens new avenues for genomic analysis by facilitating the extrapolation of results from the human genome initiative. 50 refs., 3 figs., 1 tab.« less
Extensive horizontal gene transfer during Staphylococcus aureus co-colonization in vivo.
McCarthy, Alex J; Loeffler, Anette; Witney, Adam A; Gould, Katherine A; Lloyd, David H; Lindsay, Jodi A
2014-09-25
Staphylococcus aureus is a commensal and major pathogen of humans and animals. Comparative genomics of S. aureus populations suggests that colonization of different host species is associated with carriage of mobile genetic elements (MGE), particularly bacteriophages and plasmids capable of encoding virulence, resistance, and immune evasion pathways. Antimicrobial-resistant S. aureus of livestock are a potential zoonotic threat to human health if they adapt to colonize humans efficiently. We utilized the technique of experimental evolution and co-colonized gnotobiotic piglets with both human- and pig-associated variants of the lineage clonal complex 398, and investigated growth and genetic changes over 16 days using whole genome sequencing. The human isolate survived co-colonization on piglets more efficiently than in vitro. During co-colonization, transfer of MGE from the pig to the human isolate was detected within 4 h. Extensive and repeated transfer of two bacteriophages and three plasmids resulted in colonization with isolates carrying a wide variety of mobilomes. Whole genome sequencing of progeny bacteria revealed no acquisition of core genome polymorphisms, highlighting the importance of MGE. Staphylococcus aureus bacteriophage recombination and integration into novel sites was detected experimentally for the first time. During colonization, clones coexisted and diversified rather than a single variant dominating. Unexpectedly, each piglet carried unique populations of bacterial variants, suggesting limited transmission of bacteria between piglets once colonized. Our data show that horizontal gene transfer occurs at very high frequency in vivo and significantly higher than that detectable in vitro. © The Author(s) 2014. Published by Oxford University Press on behalf of the Society for Molecular Biology and Evolution.
Silencing Effect of Hominoid Highly Conserved Noncoding Sequences on Embryonic Brain Development
Mahmoudi Saber, Morteza
2017-01-01
Abstract Superfamily Hominoidea, which consists of Hominidae (humans and great apes) and Hylobatidae (gibbons), is well-known for sharing human-like characteristics, however, the genomic origins of these shared unique phenotypes have mainly remained elusive. To decipher the underlying genomic basis of Hominoidea-restricted phenotypes, we identified and characterized Hominoidea-restricted highly conserved noncoding sequences (HCNSs) that are a class of potential regulatory elements which may be involved in evolution of lineage-specific phenotypes. We discovered 679 such HCNSs from human, chimpanzee, gorilla, orangutan and gibbon genomes. These HCNSs were demonstrated to be under purifying selection but with lineage-restricted characteristics different from old CNSs. A significant proportion of their ancestral sequences had accelerated rates of nucleotide substitutions, insertions and deletions during the evolution of common ancestor of Hominoidea, suggesting the intervention of positive Darwinian selection for creating those HCNSs. In contrary to enhancer elements and similar to silencer sequences, these Hominoidea-restricted HCNSs are located in close proximity of transcription start sites. Their target genes are enriched in the nervous system, development and transcription, and they tend to be remotely located from the nearest coding gene. Chip-seq signals and gene expression patterns suggest that Hominoidea-restricted HCNSs are likely to be functional regulatory elements by imposing silencing effects on their target genes in a tissue-restricted manner during fetal brain development. These HCNSs, emerged through adaptive evolution and conserved through purifying selection, represent a set of promising targets for future functional studies of the evolution of Hominoidea-restricted phenotypes. PMID:28633494
Detecting and characterizing genomic signatures of positive selection in global populations.
Liu, Xuanyao; Ong, Rick Twee-Hee; Pillai, Esakimuthu Nisha; Elzein, Abier M; Small, Kerrin S; Clark, Taane G; Kwiatkowski, Dominic P; Teo, Yik-Ying
2013-06-06
Natural selection is a significant force that shapes the architecture of the human genome and introduces diversity across global populations. The question of whether advantageous mutations have arisen in the human genome as a result of single or multiple mutation events remains unanswered except for the fact that there exist a handful of genes such as those that confer lactase persistence, affect skin pigmentation, or cause sickle cell anemia. We have developed a long-range-haplotype method for identifying genomic signatures of positive selection to complement existing methods, such as the integrated haplotype score (iHS) or cross-population extended haplotype homozygosity (XP-EHH), for locating signals across the entire allele frequency spectrum. Our method also locates the founder haplotypes that carry the advantageous variants and infers their corresponding population frequencies. This presents an opportunity to systematically interrogate the whole human genome whether a selection signal shared across different populations is the consequence of a single mutation process followed subsequently by gene flow between populations or of convergent evolution due to the occurrence of multiple independent mutation events either at the same variant or within the same gene. The application of our method to data from 14 populations across the world revealed that positive-selection events tend to cluster in populations of the same ancestry. Comparing the founder haplotypes for events that are present across different populations revealed that convergent evolution is a rare occurrence and that the majority of shared signals stem from the same evolutionary event. Copyright © 2013 The American Society of Human Genetics. Published by Elsevier Inc. All rights reserved.
Gilbert, Maarten J.; Miller, William G.; Yee, Emma; Zomer, Aldert L.; van der Graaf-van Bloois, Linda; Fitzgerald, Collette; Forbes, Ken J.; Méric, Guillaume; Sheppard, Samuel K.; Wagenaar, Jaap A.; Duim, Birgitta
2016-01-01
Campylobacter fetus currently comprises three recognized subspecies, which display distinct host association. Campylobacter fetus subsp. fetus and C. fetus subsp. venerealis are both associated with endothermic mammals, primarily ruminants, whereas C. fetus subsp. testudinum is primarily associated with ectothermic reptiles. Both C. fetus subsp. testudinum and C. fetus subsp. fetus have been associated with severe infections, often with a systemic component, in immunocompromised humans. To study the genetic factors associated with the distinct host dichotomy in C. fetus, whole-genome sequencing and comparison of mammal- and reptile-associated C. fetus was performed. The genomes of C. fetus subsp. testudinum isolated from either reptiles or humans were compared with elucidate the genetic factors associated with pathogenicity in humans. Genomic comparisons showed conservation of gene content and organization among C. fetus subspecies, but a clear distinction between mammal- and reptile-associated C. fetus was observed. Several genomic regions appeared to be subspecies specific, including a putative tricarballylate catabolism pathway, exclusively present in C. fetus subsp. testudinum strains. Within C. fetus subsp. testudinum, sapA, sapB, and sapAB type strains were observed. The recombinant locus iamABC (mlaFED) was exclusively associated with invasive C. fetus subsp. testudinum strains isolated from humans. A phylogenetic reconstruction was consistent with divergent evolution in host-associated strains and the existence of a barrier to lateral gene transfer between mammal- and reptile-associated C. fetus. Overall, this study shows that reptile-associated C. fetus subsp. testudinum is genetically divergent from mammal-associated C. fetus subspecies. PMID:27333878
The evolution of human influenza A viruses from 1999 to 2006: a complete genome study.
Bragstad, Karoline; Nielsen, Lars P; Fomsgaard, Anders
2008-03-07
Knowledge about the complete genome constellation of seasonal influenza A viruses from different countries is valuable for monitoring and understanding of the evolution and migration of strains. Few complete genome sequences of influenza A viruses from Europe are publicly available at the present time and there have been few longitudinal genome studies of human influenza A viruses. We have studied the evolution of circulating human H3N2, H1N1 and H1N2 influenza A viruses from 1999 to 2006, we analysed 234 Danish human influenza A viruses and characterised 24 complete genomes. H3N2 was the prevalent strain in Denmark during the study period, but H1N1 dominated the 2000-2001 season. H1N2 viruses were first observed in Denmark in 2002-2003. After years of little genetic change in the H1N1 viruses the 2005-2006 season presented H1N1 of greater variability than before. This indicates that H1N1 viruses are evolving and that H1N1 soon is likely to be the prevalent strain again. Generally, the influenza A haemagglutinin (HA) of H3N2 viruses formed seasonal phylogenetic clusters. Different lineages co-circulating within the same season were also observed. The evolution has been stochastic, influenced by small "jumps" in genetic distance rather than constant drift, especially with the introduction of the Fujian-like viruses in 2002-2003. Also evolutionary stasis-periods were observed which might indicate well fit viruses. The evolution of H3N2 viruses have also been influenced by gene reassortments between lineages from different seasons. None of the influenza genes were influenced by strong positive selection pressure. The antigenic site B in H3N2 HA was the preferred site for genetic change during the study period probably because the site A has been masked by glycosylations. Substitutions at CTL-epitopes in the genes coding for the neuraminidase (NA), polymerase acidic protein (PA), matrix protein 1 (M1), non-structural protein 1 (NS1) and especially the nucleoprotein (NP) were observed. The N-linked glycosylation pattern varied during the study period and the H3N2 isolates from 2004 to 2006 were highly glycosylated with ten predicted sequons in HA, the highest amount of glycosylations observed in this study period. The present study is the first to our knowledge to characterise the evolution of complete genomes of influenza A H3N2, H1N1 and H1N2 isolates from Europe over a time period of seven years from 1999 to 2006. More precise knowledge about the circulating strains may have implications for predicting the following season strains and thereby better matching the vaccine composition.
The evolution of human influenza A viruses from 1999 to 2006: A complete genome study
Bragstad, Karoline; Nielsen, Lars P; Fomsgaard, Anders
2008-01-01
Background Knowledge about the complete genome constellation of seasonal influenza A viruses from different countries is valuable for monitoring and understanding of the evolution and migration of strains. Few complete genome sequences of influenza A viruses from Europe are publicly available at the present time and there have been few longitudinal genome studies of human influenza A viruses. We have studied the evolution of circulating human H3N2, H1N1 and H1N2 influenza A viruses from 1999 to 2006, we analysed 234 Danish human influenza A viruses and characterised 24 complete genomes. Results H3N2 was the prevalent strain in Denmark during the study period, but H1N1 dominated the 2000–2001 season. H1N2 viruses were first observed in Denmark in 2002–2003. After years of little genetic change in the H1N1 viruses the 2005–2006 season presented H1N1 of greater variability than before. This indicates that H1N1 viruses are evolving and that H1N1 soon is likely to be the prevalent strain again. Generally, the influenza A haemagglutinin (HA) of H3N2 viruses formed seasonal phylogenetic clusters. Different lineages co-circulating within the same season were also observed. The evolution has been stochastic, influenced by small "jumps" in genetic distance rather than constant drift, especially with the introduction of the Fujian-like viruses in 2002–2003. Also evolutionary stasis-periods were observed which might indicate well fit viruses. The evolution of H3N2 viruses have also been influenced by gene reassortments between lineages from different seasons. None of the influenza genes were influenced by strong positive selection pressure. The antigenic site B in H3N2 HA was the preferred site for genetic change during the study period probably because the site A has been masked by glycosylations. Substitutions at CTL-epitopes in the genes coding for the neuraminidase (NA), polymerase acidic protein (PA), matrix protein 1 (M1), non-structural protein 1 (NS1) and especially the nucleoprotein (NP) were observed. The N-linked glycosylation pattern varied during the study period and the H3N2 isolates from 2004 to 2006 were highly glycosylated with ten predicted sequons in HA, the highest amount of glycosylations observed in this study period. Conclusion The present study is the first to our knowledge to characterise the evolution of complete genomes of influenza A H3N2, H1N1 and H1N2 isolates from Europe over a time period of seven years from 1999 to 2006. More precise knowledge about the circulating strains may have implications for predicting the following season strains and thereby better matching the vaccine composition. PMID:18325125
A Feast of Malaria Parasite Genomes.
Carlton, Jane M; Sullivan, Steven A
2017-03-08
The Plasmodium genus has evolved over time and across hosts, complexifying our understanding of malaria. In a recent Nature paper, Rutledge et al. (2017) describe the genome sequences of three major human malaria parasite species, providing insight into Plasmodium evolution and raising the question of how many species there are. Copyright © 2017 Elsevier Inc. All rights reserved.
Ferris, Elliott; Abegglen, Lisa M; Schiffman, Joshua D; Gregg, Christopher
2018-03-06
The identity of most functional elements in the mammalian genome and the phenotypes they impact are unclear. Here, we perform a genome-wide comparative analysis of patterns of accelerated evolution in species with highly distinctive traits to discover candidate functional elements for clinically important phenotypes. We identify accelerated regions (ARs) in the elephant, hibernating bat, orca, dolphin, naked mole rat, and thirteen-lined ground squirrel lineages in mammalian conserved regions, uncovering ∼33,000 elements that bind hundreds of different regulatory proteins in humans and mice. ARs in the elephant, the largest land mammal, are uniquely enriched near elephant DNA damage response genes. The genomic hotspot for elephant ARs is the E3 ligase subunit of the Fanconi anemia complex, a master regulator of DNA repair. Additionally, ARs in the six species are associated with specific human clinical phenotypes that have apparent concordance with overt traits in each species. Copyright © 2018 The Author(s). Published by Elsevier Inc. All rights reserved.
A decade of human genome project conclusion: Scientific diffusion about our genome knowledge.
Moraes, Fernanda; Góes, Andréa
2016-05-06
The Human Genome Project (HGP) was initiated in 1990 and completed in 2003. It aimed to sequence the whole human genome. Although it represented an advance in understanding the human genome and its complexity, many questions remained unanswered. Other projects were launched in order to unravel the mysteries of our genome, including the ENCyclopedia of DNA Elements (ENCODE). This review aims to analyze the evolution of scientific knowledge related to both the HGP and ENCODE projects. Data were retrieved from scientific articles published in 1990-2014, a period comprising the development and the 10 years following the HGP completion. The fact that only 20,000 genes are protein and RNA-coding is one of the most striking HGP results. A new concept about the organization of genome arose. The ENCODE project was initiated in 2003 and targeted to map the functional elements of the human genome. This project revealed that the human genome is pervasively transcribed. Therefore, it was determined that a large part of the non-protein coding regions are functional. Finally, a more sophisticated view of chromatin structure emerged. The mechanistic functioning of the genome has been redrafted, revealing a much more complex picture. Besides, a gene-centric conception of the organism has to be reviewed. A number of criticisms have emerged against the ENCODE project approaches, raising the question of whether non-conserved but biochemically active regions are truly functional. Thus, HGP and ENCODE projects accomplished a great map of the human genome, but the data generated still requires further in depth analysis. © 2016 by The International Union of Biochemistry and Molecular Biology, 44:215-223, 2016. © 2016 The International Union of Biochemistry and Molecular Biology.
The whole genome sequences and experimentally phased haplotypes of over 100 personal genomes.
Mao, Qing; Ciotlos, Serban; Zhang, Rebecca Yu; Ball, Madeleine P; Chin, Robert; Carnevali, Paolo; Barua, Nina; Nguyen, Staci; Agarwal, Misha R; Clegg, Tom; Connelly, Abram; Vandewege, Ward; Zaranek, Alexander Wait; Estep, Preston W; Church, George M; Drmanac, Radoje; Peters, Brock A
2016-10-11
Since the completion of the Human Genome Project in 2003, it is estimated that more than 200,000 individual whole human genomes have been sequenced. A stunning accomplishment in such a short period of time. However, most of these were sequenced without experimental haplotype data and are therefore missing an important aspect of genome biology. In addition, much of the genomic data is not available to the public and lacks phenotypic information. As part of the Personal Genome Project, blood samples from 184 participants were collected and processed using Complete Genomics' Long Fragment Read technology. Here, we present the experimental whole genome haplotyping and sequencing of these samples to an average read coverage depth of 100X. This is approximately three-fold higher than the read coverage applied to most whole human genome assemblies and ensures the highest quality results. Currently, 114 genomes from this dataset are freely available in the GigaDB repository and are associated with rich phenotypic data; the remaining 70 should be added in the near future as they are approved through the PGP data release process. For reproducibility analyses, 20 genomes were sequenced at least twice using independent LFR barcoded libraries. Seven genomes were also sequenced using Complete Genomics' standard non-barcoded library process. In addition, we report 2.6 million high-quality, rare variants not previously identified in the Single Nucleotide Polymorphisms database or the 1000 Genomes Project Phase 3 data. These genomes represent a unique source of haplotype and phenotype data for the scientific community and should help to expand our understanding of human genome evolution and function.
Boyd, Bret M; Allen, Julie M; de Crécy-Lagard, Valérie; Reed, David L
2014-09-11
The obligate-heritable endosymbionts of insects possess some of the smallest known bacterial genomes. This is likely due to loss of genomic material during symbiosis. The mode and rate of this erosion may change over evolutionary time: faster in newly formed associations and slower in long-established ones. The endosymbionts of human and anthropoid primate lice present a unique opportunity to study genome erosion in newly established (or young) symbionts. This is because we have a detailed phylogenetic history of these endosymbionts with divergence dates for closely related species. This allows for genome evolution to be studied in detail and rates of change to be estimated in a phylogenetic framework. Here, we sequenced the genome of the chimpanzee louse endosymbiont (Candidatus Riesia pediculischaeffi) and compared it with the closely related genome of the human body louse endosymbiont. From this comparison, we found evidence for recent genome erosion leading to gene loss in these endosymbionts. Although gene loss was detected, it was not significantly greater than in older endosymbionts from aphids and ants. Additionally, we searched for genes associated with B-vitamin synthesis in the two louse endosymbiont genomes because these endosymbionts are believed to synthesize essential B vitamins absent in the louse's diet. All of the expected genes were present, except those involved in thiamin synthesis. We failed to find genes encoding for proteins involved in the biosynthesis of thiamin or any complete exogenous means of salvaging thiamin, suggesting there is an undescribed mechanism for the salvage of thiamin. Finally, genes encoding for the pantothenate de novo biosynthesis pathway were located on a plasmid in both taxa along with a heat shock protein. Movement of these genes onto a plasmid may be functionally and evolutionarily significant, potentially increasing production and guarding against the deleterious effects of mutation. These data add to a growing resource of obligate endosymbiont genomes and to our understanding of the rate and mode of genome erosion in obligate animal-associated bacteria. Ultimately sequencing additional louse p-endosymbiont genomes will provide a model system for studying genome evolution in obligate host associated bacteria. Copyright © 2014 Boyd et al.
Genomic Perspectives of Transcriptional Regulation in Forebrain Development
Nord, Alex S.; Pattabiraman, Kartik; Visel, Axel; ...
2015-01-07
The forebrain is the seat of higher-order brain functions, and many human neuropsychiatric disorders are due to genetic defects affecting forebrain development, making it imperative to understand the underlying genetic circuitry. We report that recent progress now makes it possible to begin fully elucidating the genomic regulatory mechanisms that control forebrain gene expression. Here, we discuss the current knowledge of how transcription factors drive gene expression programs through their interactions with cis-acting genomic elements, such as enhancers; how analyses of chromatin and DNA modifications provide insights into gene expression states; and how these approaches yield insights into the evolution ofmore » the human brain.« less
McLaughlin, Paul J; Keegan, Liam P
2014-08-01
Nearly 150 different enzymatically modified forms of the four canonical residues in RNA have been identified. For instance, enzymes of the ADAR (adenosine deaminase acting on RNA) family convert adenosine residues into inosine in cellular dsRNAs. Recent findings show that DNA endonuclease V enzymes have undergone an evolutionary transition from cleaving 3' to deoxyinosine in DNA and ssDNA to cleaving 3' to inosine in dsRNA and ssRNA in humans. Recent work on dsRNA-binding domains of ADARs and other proteins also shows that a degree of sequence specificity is achieved by direct readout in the minor groove. However, the level of sequence specificity observed is much less than that of DNA major groove-binding helix-turn-helix proteins. We suggest that the evolution of DNA-binding proteins following the RNA to DNA genome transition represents the major advantage that DNA genomes have over RNA genomes. We propose that a hypothetical RNA modification, a RRAR (ribose reductase acting on genomic dsRNA) produced the first stretches of DNA in RNA genomes. We discuss why this is the most satisfactory explanation for the origin of DNA. The evolution of this RNA modification and later steps to DNA genomes are likely to have been driven by cellular genome co-evolution with viruses and intragenomic parasites. RNA modifications continue to be involved in host-virus conflicts; in vertebrates, edited cellular dsRNAs with inosine-uracil base pairs appear to be recognized as self RNA and to suppress activation of innate immune sensors that detect viral dsRNA.
Mobile element biology – new possibilities with high-throughput sequencing
Xing, Jinchuan; Witherspoon, David J.; Jorde, Lynn B.
2014-01-01
Mobile elements compose more than half of the human genome, but until recently their large-scale detection was time-consuming and challenging. With the development of new high-throughput sequencing technologies, the complete spectrum of mobile element variation in humans can now be identified and analyzed. Thousands of new mobile element insertions have been discovered, yielding new insights into mobile element biology, evolution, and genomic variation. We review several high-throughput methods, with an emphasis on techniques that specifically target mobile element insertions in humans, and we highlight recent applications of these methods in evolutionary studies and in the analysis of somatic alterations in human cancers. PMID:23312846
Grandi, Nicole; Cadeddu, Marta; Blomberg, Jonas; Mayer, Jens; Tramontano, Enzo
2018-01-19
The genomes of all vertebrates harbor remnants of ancient retroviral infections, having affected the germ line cells during the last 100 million years. These sequences, named Endogenous Retroviruses (ERVs), have been transmitted to the offspring in a Mendelian way, being relatively stable components of the host genome even long after their exogenous counterparts went extinct. Among human ERVs (HERVs), the HERV-W group is of particular interest for our physiology and pathology. A HERV-W provirus in locus 7q21.2 has been coopted during evolution to exert an essential role in placenta, and the group expression has been tentatively linked to Multiple Sclerosis and other diseases. Following up on a detailed analysis of 213 HERV-W insertions in the human genome, we now investigated the ERV-W group genomic spread within primate lineages. We analyzed HERV-W orthologous loci in the genome sequences of 12 non-human primate species belonging to Simiiformes (parvorders Catarrhini and Platyrrhini), Tarsiiformes and to the most primitive Prosimians. Analysis of HERV-W orthologous loci in non-human Catarrhini primates revealed species-specific insertions in the genomes of Chimpanzee (3), Gorilla (4), Orangutan (6), Gibbon (2) and especially Rhesus Macaque (66). Such sequences were acquired in a retroviral fashion and, in the majority of cases, by L1-mediated formation of processed pseudogenes. There were also a number of LTR-LTR homologous recombination events that occurred subsequent to separation of Catarrhini sub-lineages. Moreover, we retrieved 130 sequences in Marmoset and Squirrel Monkeys (family Cebidae, Platyrrhini parvorder), identified as ERV1-1_CJa based on RepBase annotations, which appear closely related to the ERV-W group. Such sequences were also identified in Atelidae and Pitheciidae, representative of the other Platyrrhini families. In contrast, no ERV-W-related sequences were found in genome sequence assemblies of Tarsiiformes and Prosimians. Overall, our analysis now provides a detailed picture of the ERV-W sequences colonization of the primate lineages genomes, revealing the exact dynamics of ERV-W locus formations as well as novel insights into the evolution and origin of the group.
Richards, Vincent P.; Lang, Ping; Pavinski Bitar, Paulina D.; Lefébure, Tristan; Schukken, Ynte H.; Zadoks, Ruth N.; Stanhope, Michael J.
2011-01-01
In addition to causing severe invasive infections in humans, Streptococcus agalactiae, or group B Streptococcus (GBS), is also a major cause of bovine mastitis. Here we provide the first genome sequence for S. agalactiae isolated from a cow diagnosed with clinical mastitis (strain FSL S3-026). Comparison to eight S. agalactiae genomes obtained from human disease isolates revealed 183 genes specific to the bovine strain. Subsequent polymerase chain reaction (PCR) screening for the presence/absence of a subset of these loci in additional bovine and human strains revealed strong differentiation between the two groups (Fisher exact test: p < 0.0001). The majority of the bovine strain-specific genes (~85%) clustered tightly into eight genomic islands, suggesting these genes were acquired through lateral gene transfer (LGT). This bovine GBS also contained an unusually high proportion of insertion sequences (4.3% of the total genome), suggesting frequent genomic rearrangement. Comparison to other mastitis-causing species of bacteria provided strong evidence for two cases of interspecies LGT within the shared bovine environment: bovine S. agalactiae with Streptococcus uberis (nisin U operon) and Streptococcus dysgalactiae subsp. dysgalactiae (lactose operon). We also found evidence for LGT, involving the salivaricin operon, between the bovine S. agalactiae strain and either Streptococcus pyogenes or Streptococcus salivarius. Our findings provide insight intomechanismsfacilitatingenvironmentaladaptationandacquisitionofpotential virulence factors, while highlighting both the key role LGT has played in the recent evolution of the bovine S. agalactiae strain, and the importance of LGT among pathogens within a shared environment. PMID:21536150
Evolution of Influenza A Virus by Mutation and Re-Assortment
Shao, Wenhan; Li, Xinxin; Goraya, Mohsan Ullah; Wang, Song; Chen, Ji-Long
2017-01-01
Influenza A virus (IAV), a highly infectious respiratory pathogen, has continued to be a significant threat to global public health. To complete their life cycle, influenza viruses have evolved multiple strategies to interact with a host. A large number of studies have revealed that the evolution of influenza A virus is mainly mediated through the mutation of the virus itself and the re-assortment of viral genomes derived from various strains. The evolution of influenza A virus through these mechanisms causes worldwide annual epidemics and occasional pandemics. Importantly, influenza A virus can evolve from an animal infected pathogen to a human infected pathogen. The highly pathogenic influenza virus has resulted in stupendous economic losses due to its morbidity and mortality both in human and animals. Influenza viruses fall into a category of viruses that can cause zoonotic infection with stable adaptation to human, leading to sustained horizontal transmission. The rapid mutations of influenza A virus result in the loss of vaccine optimal efficacy, and challenge the complete eradication of the virus. In this review, we highlight the current understanding of influenza A virus evolution caused by the mutation and re-assortment of viral genomes. In addition, we discuss the specific mechanisms by which the virus evolves. PMID:28783091
Sheehan, Michael J; Nachman, Michael W
2014-09-16
Facial recognition plays a key role in human interactions, and there has been great interest in understanding the evolution of human abilities for individual recognition and tracking social relationships. Individual recognition requires sufficient cognitive abilities and phenotypic diversity within a population for discrimination to be possible. Despite the importance of facial recognition in humans, the evolution of facial identity has received little attention. Here we demonstrate that faces evolved to signal individual identity under negative frequency-dependent selection. Faces show elevated phenotypic variation and lower between-trait correlations compared with other traits. Regions surrounding face-associated single nucleotide polymorphisms show elevated diversity consistent with frequency-dependent selection. Genetic variation maintained by identity signalling tends to be shared across populations and, for some loci, predates the origin of Homo sapiens. Studies of human social evolution tend to emphasize cognitive adaptations, but we show that social evolution has shaped patterns of human phenotypic and genetic diversity as well.
A decade of pig genome sequencing: a window on pig domestication and evolution.
Groenen, Martien A M
2016-03-29
Insight into how genomes change and adapt due to selection addresses key questions in evolutionary biology and in domestication of animals and plants by humans. In that regard, the pig and its close relatives found in Africa and Eurasia represent an excellent group of species that enables studies of the effect of both natural and human-mediated selection on the genome. The recent completion of the draft genome sequence of a domestic pig and the development of next-generation sequencing technology during the past decade have created unprecedented possibilities to address these questions in great detail. In this paper, I review recent whole-genome sequencing studies in the pig and closely-related species that provide insight into the demography, admixture and selection of these species and, in particular, how domestication and subsequent selection of Sus scrofa have shaped the genomes of these animals.
Keinath, Melissa C.; Timoshevskiy, Vladimir A.; Timoshevskaya, Nataliya Y.; Tsonis, Panagiotis A.; Voss, S. Randal; Smith, Jeramiah J.
2015-01-01
Vertebrates exhibit substantial diversity in genome size, and some of the largest genomes exist in species that uniquely inform diverse areas of basic and biomedical research. For example, the salamander Ambystoma mexicanum (the Mexican axolotl) is a model organism for studies of regeneration, development and genome evolution, yet its genome is ~10× larger than the human genome. As part of a hierarchical approach toward improving genome resources for the species, we generated 600 Gb of shotgun sequence data and developed methods for sequencing individual laser-captured chromosomes. Based on these data, we estimate that the A. mexicanum genome is ~32 Gb. Notably, as much as 19 Gb of the A. mexicanum genome can potentially be considered single copy, which presumably reflects the evolutionary diversification of mobile elements that accumulated during an ancient episode of genome expansion. Chromosome-targeted sequencing permitted the development of assemblies within the constraints of modern computational platforms, allowed us to place 2062 genes on the two smallest A. mexicanum chromosomes and resolves key events in the history of vertebrate genome evolution. Our analyses show that the capture and sequencing of individual chromosomes is likely to provide valuable information for the systematic sequencing, assembly and scaffolding of large genomes. PMID:26553646
Keinath, Melissa C; Timoshevskiy, Vladimir A; Timoshevskaya, Nataliya Y; Tsonis, Panagiotis A; Voss, S Randal; Smith, Jeramiah J
2015-11-10
Vertebrates exhibit substantial diversity in genome size, and some of the largest genomes exist in species that uniquely inform diverse areas of basic and biomedical research. For example, the salamander Ambystoma mexicanum (the Mexican axolotl) is a model organism for studies of regeneration, development and genome evolution, yet its genome is ~10× larger than the human genome. As part of a hierarchical approach toward improving genome resources for the species, we generated 600 Gb of shotgun sequence data and developed methods for sequencing individual laser-captured chromosomes. Based on these data, we estimate that the A. mexicanum genome is ~32 Gb. Notably, as much as 19 Gb of the A. mexicanum genome can potentially be considered single copy, which presumably reflects the evolutionary diversification of mobile elements that accumulated during an ancient episode of genome expansion. Chromosome-targeted sequencing permitted the development of assemblies within the constraints of modern computational platforms, allowed us to place 2062 genes on the two smallest A. mexicanum chromosomes and resolves key events in the history of vertebrate genome evolution. Our analyses show that the capture and sequencing of individual chromosomes is likely to provide valuable information for the systematic sequencing, assembly and scaffolding of large genomes.
The dog genome map and its use in mammalian comparative genomics.
Switonski, Marek; Szczerbal, Izabela; Nowacka, Joanna
2004-01-01
The dog genome organization was extensively studied in the last ten years. The most important achievements are the well-developed marker genome maps, including over 3200 marker loci, and a survey of the DNA genome sequence. This knowledge, along with the most advanced map of the human genome, turned out to be very useful in comparative genomic studies. On the one hand, it has promoted the development of marker genome maps of other species of the family Canidae (red fox, arctic fox, Chinese raccoon dog) as well as studies on the evolution of their karyotype. But the most important approach is the comparative analysis of human and canine hereditary diseases. At present, causative gene mutations are known for 30 canine hereditary diseases. A majority of them have human counterparts with similar clinical and molecular features. Studies on identification of genes having a major impact on some multifactorial diseases (hip dysplasia, epilepsy) and cancers (multifocal renal cystadenocarcinoma and nodular dermatofibrosis) are advanced. Very promising are the results of gene therapy for certain canine monogenic diseases (haemophilia, hereditary retinal dystrophy, mucopolysaccharidosis), which have human equivalents. The above-mentioned examples prove a very important model role of the dog in studies of human genetic diseases. On the other hand, the identification of gene mutations responsible for hereditary diseases has a substantial impact on breeding strategy in the dog.
Audit, Benjamin; Zaghloul, Lamia; Baker, Antoine; Arneodo, Alain; Chen, Chun-Long; d'Aubenton-Carafa, Yves; Thermes, Claude
2013-01-01
In higher eukaryotes, the absence of specific sequence motifs, marking the origins of replication has been a serious hindrance to the understanding of (i) the mechanisms that regulate the spatio-temporal replication program, and (ii) the links between origins activation, chromatin structure and transcription. In this chapter, we review the partitioning of the human genome into megabased-size replication domains delineated as N-shaped motifs in the strand compositional asymmetry profiles. They collectively span 28.3% of the genome and are bordered by more than 1,000 putative replication origins. We recapitulate the comparison of this partition of the human genome with high-resolution experimental data that confirms that replication domain borders are likely to be preferential replication initiation zones in the germline. In addition, we highlight the specific distribution of experimental and numerical chromatin marks along replication domains. Domain borders correspond to particular open chromatin regions, possibly encoded in the DNA sequence, and around which replication and transcription are highly coordinated. These regions also present a high evolutionary breakpoint density, suggesting that susceptibility to breakage might be linked to local open chromatin fiber state. Altogether, this chapter presents a compartmentalization of the human genome into replication domains that are landmarks of the human genome organization and are likely to play a key role in genome dynamics during evolution and in pathological situations.
Badenhorst, Daleen; Hillier, LaDeana W; Literman, Robert; Montiel, Eugenia Elisabet; Radhakrishnan, Srihari; Shen, Yingjia; Minx, Patrick; Janes, Daniel E; Warren, Wesley C; Edwards, Scott V; Valenzuela, Nicole
2015-06-24
Comparative genomics continues illuminating amniote genome evolution, but for many lineages our understanding remains incomplete. Here, we refine the assembly (CPI 3.0.3 NCBI AHGY00000000.2) and develop a cytogenetic map of the painted turtle (Chrysemys picta-CPI) genome, the first in turtles and in vertebrates with temperature-dependent sex determination. A comparison of turtle genomes with those of chicken, selected nonavian reptiles, and human revealed shared and novel genomic features, such as numerous chromosomal rearrangements. The largest conserved syntenic blocks between birds and turtles exist in four macrochromosomes, whereas rearrangements were evident in these and other chromosomes, disproving that turtles and birds retain fully conserved macrochromosomes for greater than 300 Myr. C-banding revealed large heterochromatic blocks in the centromeric region of only few chromosomes. The nucleolar-organizing region (NOR) mapped to a single CPI microchromosome, whereas in some turtles and lizards the NOR maps to nonhomologous sex-chromosomes, thus revealing independent translocations of the NOR in various reptilian lineages. There was no evidence for recent chromosomal fusions as interstitial telomeric-DNA was absent. Some repeat elements (CR1-like, Gypsy) were enriched in the centromeres of five chromosomes, whereas others were widespread in the CPI genome. Bacterial artificial chromosome (BAC) clones were hybridized to 18 of the 25 CPI chromosomes and anchored to a G-banded ideogram. Several CPI sex-determining genes mapped to five chromosomes, and homology was detected between yet other CPI autosomes and the globally nonhomologous sex chromosomes of chicken, other turtles, and squamates, underscoring the independent evolution of vertebrate sex-determining mechanisms. © The Author(s) 2015. Published by Oxford University Press on behalf of the Society for Molecular Biology and Evolution.
Detecting and Characterizing Genomic Signatures of Positive Selection in Global Populations
Liu, Xuanyao; Ong, Rick Twee-Hee; Pillai, Esakimuthu Nisha; Elzein, Abier M.; Small, Kerrin S.; Clark, Taane G.; Kwiatkowski, Dominic P.; Teo, Yik-Ying
2013-01-01
Natural selection is a significant force that shapes the architecture of the human genome and introduces diversity across global populations. The question of whether advantageous mutations have arisen in the human genome as a result of single or multiple mutation events remains unanswered except for the fact that there exist a handful of genes such as those that confer lactase persistence, affect skin pigmentation, or cause sickle cell anemia. We have developed a long-range-haplotype method for identifying genomic signatures of positive selection to complement existing methods, such as the integrated haplotype score (iHS) or cross-population extended haplotype homozygosity (XP-EHH), for locating signals across the entire allele frequency spectrum. Our method also locates the founder haplotypes that carry the advantageous variants and infers their corresponding population frequencies. This presents an opportunity to systematically interrogate the whole human genome whether a selection signal shared across different populations is the consequence of a single mutation process followed subsequently by gene flow between populations or of convergent evolution due to the occurrence of multiple independent mutation events either at the same variant or within the same gene. The application of our method to data from 14 populations across the world revealed that positive-selection events tend to cluster in populations of the same ancestry. Comparing the founder haplotypes for events that are present across different populations revealed that convergent evolution is a rare occurrence and that the majority of shared signals stem from the same evolutionary event. PMID:23731540
Mosquito genomics. Highly evolvable malaria vectors: the genomes of 16 Anopheles mosquitoes.
Neafsey, Daniel E; Waterhouse, Robert M; Abai, Mohammad R; Aganezov, Sergey S; Alekseyev, Max A; Allen, James E; Amon, James; Arcà, Bruno; Arensburger, Peter; Artemov, Gleb; Assour, Lauren A; Basseri, Hamidreza; Berlin, Aaron; Birren, Bruce W; Blandin, Stephanie A; Brockman, Andrew I; Burkot, Thomas R; Burt, Austin; Chan, Clara S; Chauve, Cedric; Chiu, Joanna C; Christensen, Mikkel; Costantini, Carlo; Davidson, Victoria L M; Deligianni, Elena; Dottorini, Tania; Dritsou, Vicky; Gabriel, Stacey B; Guelbeogo, Wamdaogo M; Hall, Andrew B; Han, Mira V; Hlaing, Thaung; Hughes, Daniel S T; Jenkins, Adam M; Jiang, Xiaofang; Jungreis, Irwin; Kakani, Evdoxia G; Kamali, Maryam; Kemppainen, Petri; Kennedy, Ryan C; Kirmitzoglou, Ioannis K; Koekemoer, Lizette L; Laban, Njoroge; Langridge, Nicholas; Lawniczak, Mara K N; Lirakis, Manolis; Lobo, Neil F; Lowy, Ernesto; MacCallum, Robert M; Mao, Chunhong; Maslen, Gareth; Mbogo, Charles; McCarthy, Jenny; Michel, Kristin; Mitchell, Sara N; Moore, Wendy; Murphy, Katherine A; Naumenko, Anastasia N; Nolan, Tony; Novoa, Eva M; O'Loughlin, Samantha; Oringanje, Chioma; Oshaghi, Mohammad A; Pakpour, Nazzy; Papathanos, Philippos A; Peery, Ashley N; Povelones, Michael; Prakash, Anil; Price, David P; Rajaraman, Ashok; Reimer, Lisa J; Rinker, David C; Rokas, Antonis; Russell, Tanya L; Sagnon, N'Fale; Sharakhova, Maria V; Shea, Terrance; Simão, Felipe A; Simard, Frederic; Slotman, Michel A; Somboon, Pradya; Stegniy, Vladimir; Struchiner, Claudio J; Thomas, Gregg W C; Tojo, Marta; Topalis, Pantelis; Tubio, José M C; Unger, Maria F; Vontas, John; Walton, Catherine; Wilding, Craig S; Willis, Judith H; Wu, Yi-Chieh; Yan, Guiyun; Zdobnov, Evgeny M; Zhou, Xiaofan; Catteruccia, Flaminia; Christophides, George K; Collins, Frank H; Cornman, Robert S; Crisanti, Andrea; Donnelly, Martin J; Emrich, Scott J; Fontaine, Michael C; Gelbart, William; Hahn, Matthew W; Hansen, Immo A; Howell, Paul I; Kafatos, Fotis C; Kellis, Manolis; Lawson, Daniel; Louis, Christos; Luckhart, Shirley; Muskavitch, Marc A T; Ribeiro, José M; Riehle, Michael A; Sharakhov, Igor V; Tu, Zhijian; Zwiebel, Laurence J; Besansky, Nora J
2015-01-02
Variation in vectorial capacity for human malaria among Anopheles mosquito species is determined by many factors, including behavior, immunity, and life history. To investigate the genomic basis of vectorial capacity and explore new avenues for vector control, we sequenced the genomes of 16 anopheline mosquito species from diverse locations spanning ~100 million years of evolution. Comparative analyses show faster rates of gene gain and loss, elevated gene shuffling on the X chromosome, and more intron losses, relative to Drosophila. Some determinants of vectorial capacity, such as chemosensory genes, do not show elevated turnover but instead diversify through protein-sequence changes. This dynamism of anopheline genes and genomes may contribute to their flexible capacity to take advantage of new ecological niches, including adapting to humans as primary hosts. Copyright © 2015, American Association for the Advancement of Science.
Vu, Michael M. K.; Jameson, Nora E.; Masuda, Stuart J.; Lin, Dana; Larralde-Ridaura, Rosa; Lupták, Andrej
2012-01-01
SUMMARY Aptamers are structured macromolecules in vitro evolved to bind molecular targets, whereas in nature they form the ligand-binding domains of riboswitches. Adenosine aptamers of a single structural family were isolated several times from random pools but they have not been identified in genomic sequences. We used two unbiased methods, structure-based bioinformatics and human genome-based in vitro selection, to identify aptamers that form the same adenosine-binding structure in a bacterium, and several vertebrates, including humans. Two of the human aptamers map to introns of RAB3C and FGD3 genes. The RAB3C aptamer binds ATP with dissociation constants about ten times lower than physiological ATP concentration, while the minimal FGD3 aptamer binds ATP only co-transcriptionally. PMID:23102219
Schneider, E; Jensen, L R; Farcas, R; Kondova, I; Bontrop, R E; Navarro, B; Fuchs, E; Kuss, A W; Haaf, T
2012-01-01
The human brain is distinguished by its remarkable size, high energy consumption, and cognitive abilities compared to all other mammals and non-human primates. However, little is known about what has accelerated brain evolution in the human lineage. One possible explanation is that the appearance of advanced communication skills and language has been a driving force of human brain development. The phenotypic adaptations in brain structure and function which occurred on the way to modern humans may be associated with specific molecular signatures in today's human genome and/or transcriptome. Genes that have been linked to language, reading, and/or autism spectrum disorders are prime candidates when searching for genes for human-specific communication abilities. The database and genome-wide expression analyses we present here revealed a clustering of such communication-associated genes (COAG) on human chromosomes X and 7, in particular chromosome 7q31-q36. Compared to the rest of the genome, we found a high number of COAG to be differentially expressed in the cortices of humans and non-human primates (chimpanzee, baboon, and/or marmoset). The role of X-linked genes for the development of human-specific cognitive abilities is well known. We now propose that chromosome 7q31-q36 also represents a hot spot for the evolution of human-specific communication abilities. Selective pressure on the T cell receptor beta locus on chromosome 7q34, which plays a pivotal role in the immune system, could have led to rapid dissemination of positive gene variants in hitchhiking COAG. Copyright © 2012 S. Karger AG, Basel.
The Emerging Field of Human Social Genomics
Slavich, George M.; Cole, Steven W.
2013-01-01
Although we generally experience our bodies as being biologically stable across time and situations, an emerging field of research is demonstrating that external social conditions, especially our subjective perceptions of those conditions, can influence our most basic internal biological processes—namely, the expression of our genes. This research on human social genomics has begun to identify the types of genes that are subject to social-environmental regulation, the neural and molecular mechanisms that mediate the effects of social processes on gene expression, and the genetic polymorphisms that moderate individual differences in genomic sensitivity to social context. The molecular models resulting from this research provide new opportunities for understanding how social and genetic factors interact to shape complex behavioral phenotypes and susceptibility to disease. This research also sheds new light on the evolution of the human genome and challenges the fundamental belief that our molecular makeup is relatively stable and impermeable to social-environmental influence. PMID:23853742
UCbase 2.0: ultraconserved sequences database (2014 update)
Lomonaco, Vincenzo; Martoglia, Riccardo; Mandreoli, Federica; Anderlucci, Laura; Emmett, Warren; Bicciato, Silvio; Taccioli, Cristian
2014-01-01
UCbase 2.0 (http://ucbase.unimore.it) is an update, extension and evolution of UCbase, a Web tool dedicated to the analysis of ultraconserved sequences (UCRs). UCRs are 481 sequences >200 bases sharing 100% identity among human, mouse and rat genomes. They are frequently located in genomic regions known to be involved in cancer or differentially expressed in human leukemias and carcinomas. UCbase 2.0 is a platform-independent Web resource that includes the updated version of the human genome annotation (hg19), information linking disorders to chromosomal coordinates based on the Systematized Nomenclature of Medicine classification, a query tool to search for Single Nucleotide Polymorphisms (SNPs) and a new text box to directly interrogate the database using a MySQL interface. To facilitate the interactive visual interpretation of UCR chromosomal positioning, UCbase 2.0 now includes a graph visualization interface directly linked to UCSC genome browser. Database URL: http://ucbase.unimore.it PMID:24951797
Opening plenary speaker: Human genomics, precision medicine, and advancing human health.
Green, Eric D
2016-08-01
Starting with the launch of the Human Genome Project in 1990, the past quarter-century has brought spectacular achievements in genomics that dramatically empower the study of human biology and disease. The human genomics enterprise is now in the midst of an important transition, as the growing foundation of genomic knowledge is being used by researchers and clinicians to tackle increasingly complex problems in biomedicine. Of particular prominence is the use of revolutionary new DNA sequencing technologies for generating prodigious amounts of DNA sequence data to elucidate the complexities of genome structure, function, and evolution, as well as to unravel the genomic bases of rare and common diseases. Together, these developments are ushering in the era of genomic medicine. Augmenting the advances in human genomics have been innovations in technologies for measuring environmental and lifestyle information, electronic health records, and data science; together, these provide opportunities of unprecedented scale and scope for investigating the underpinnings of health and disease. To capitalize on these opportunities, U.S. President Barack Obama recently announced a major new research endeavor - the U.S. Precision Medicine Initiative. This bold effort will be framed around several key aims, which include accelerating the use of genomically informed approaches to cancer care, making important policy and regulatory changes, and establishing a large research cohort of >1 million volunteers to facilitate precision medicine research. The latter will include making the partnership with all participants a centerpiece feature in the cohort's design and development. The Precision Medicine Initiative represents a broad-based research program that will allow new approaches for individualized medical care to be rigorously tested, so as to establish a new evidence base for advancing clinical practice and, eventually, human health.
2010-01-01
Background Unitary pseudogenes are a class of unprocessed pseudogenes without functioning counterparts in the genome. They constitute only a small fraction of annotated pseudogenes in the human genome. However, as they represent distinct functional losses over time, they shed light on the unique features of humans in primate evolution. Results We have developed a pipeline to detect human unitary pseudogenes through analyzing the global inventory of orthologs between the human genome and its mammalian relatives. We focus on gene losses along the human lineage after the divergence from rodents about 75 million years ago. In total, we identify 76 unitary pseudogenes, including previously annotated ones, and many novel ones. By comparing each of these to its functioning ortholog in other mammals, we can approximately date the creation of each unitary pseudogene (that is, the gene 'death date') and show that for our group of 76, the functional genes appear to be disabled at a fairly uniform rate throughout primate evolution - not all at once, correlated, for instance, with the 'Alu burst'. Furthermore, we identify 11 unitary pseudogenes that are polymorphic - that is, they have both nonfunctional and functional alleles currently segregating in the human population. Comparing them with their orthologs in other primates, we find that two of them are in fact pseudogenes in non-human primates, suggesting that they represent cases of a gene being resurrected in the human lineage. Conclusions This analysis of unitary pseudogenes provides insights into the evolutionary constraints faced by different organisms and the timescales of functional gene loss in humans. PMID:20210993
USDA-ARS?s Scientific Manuscript database
The mitochondrial genome’s non-recombinant mode of inheritance and relatively rapid rate of evolution has promoted its use as a marker for studying the biogeographic history and evolutionary interrelationships among many metazoan species. A modest portion of the mitochondrial genome has been define...
Ajmal, Wajya; Khan, Hiba; Abbasi, Amir Ali
2014-12-01
Understanding the genetic mechanisms underlying the organismal complexity and origin of novelties during vertebrate history is one of the central goals of evolutionary biology. Ohno (1970) was the first to postulate that whole genome duplications (WGD) have played a vital role in the evolution of new gene functions: permitting an increase in morphological, physiological and anatomical complexity during early vertebrate history. Here, we analyze the evolutionary history of human FGFR-bearing paralogon (human autosome 4/5/8/10) by the phylogenetic analysis of multigene families with triplicate and quadruplicate distribution on these chromosomes. Our results categorized the histories of 21 families into discrete co-duplicated groups. Genes of a particular co-duplicated group exhibit identical evolutionary history and have duplicated in concert with each other, whereas genes belonging to different groups have dissimilar histories and have not duplicated concurrently. Taken together with our previously published data, we submit that there is sufficient empirical evidence to disprove the 1R/2R hypothesis and to support the general prediction that vertebrate genome evolved by relatively small-scale, regional duplication events that spread across the history of life. Copyright © 2014 Elsevier Inc. All rights reserved.
Feuk, Lars; MacDonald, Jeffrey R; Tang, Terence; Carson, Andrew R; Li, Martin; Rao, Girish; Khaja, Razi; Scherer, Stephen W
2005-10-01
With a draft genome-sequence assembly for the chimpanzee available, it is now possible to perform genome-wide analyses to identify, at a submicroscopic level, structural rearrangements that have occurred between chimpanzees and humans. The goal of this study was to investigate chromosomal regions that are inverted between the chimpanzee and human genomes. Using the net alignments for the builds of the human and chimpanzee genome assemblies, we identified a total of 1,576 putative regions of inverted orientation, covering more than 154 mega-bases of DNA. The DNA segments are distributed throughout the genome and range from 23 base pairs to 62 mega-bases in length. For the 66 inversions more than 25 kilobases (kb) in length, 75% were flanked on one or both sides by (often unrelated) segmental duplications. Using PCR and fluorescence in situ hybridization we experimentally validated 23 of 27 (85%) semi-randomly chosen regions; the largest novel inversion confirmed was 4.3 mega-bases at human Chromosome 7p14. Gorilla was used as an out-group to assign ancestral status to the variants. All experimentally validated inversion regions were then assayed against a panel of human samples and three of the 23 (13%) regions were found to be polymorphic in the human genome. These polymorphic inversions include 730 kb (at 7p22), 13 kb (at 7q11), and 1 kb (at 16q24) fragments with a 5%, 30%, and 48% minor allele frequency, respectively. Our results suggest that inversions are an important source of variation in primate genome evolution. The finding of at least three novel inversion polymorphisms in humans indicates this type of structural variation may be a more common feature of our genome than previously realized.
Catalog of genetic progression of human cancers: breast cancer.
Desmedt, Christine; Yates, Lucy; Kulka, Janina
2016-03-01
With the rapid development of next-generation sequencing, deeper insights are being gained into the molecular evolution that underlies the development and clinical progression of breast cancer. It is apparent that during evolution, breast cancers acquire thousands of mutations including single base pair substitutions, insertions, deletions, copy number aberrations, and structural rearrangements. As a consequence, at the whole genome level, no two cancers are identical and few cancers even share the same complement of "driver" mutations. Indeed, two samples from the same cancer may also exhibit extensive differences due to constant remodeling of the genome over time. In this review, we summarize recent studies that extend our understanding of the genomic basis of cancer progression. Key biological insights include the following: subclonal diversification begins early in cancer evolution, being detectable even in in situ lesions; geographical stratification of subclonal structure is frequent in primary tumors and can include therapeutically targetable alterations; multiple distant metastases typically arise from a common metastatic ancestor following a "metastatic cascade" model; systemic therapy can unmask preexisting resistant subclones or influence further treatment sensitivity and disease progression. We conclude the review by describing novel approaches such as the analysis of circulating DNA and patient-derived xenografts that promise to further our understanding of the genomic changes occurring during cancer evolution and guide treatment decision making.
Oilseed rape: learning about ancient and recent polyploid evolution from a recent crop species.
Mason, A S; Snowdon, R J
2016-11-01
Oilseed rape (Brassica napus) is one of our youngest crop species, arising several times under cultivation in the last few thousand years and completely unknown in the wild. Oilseed rape originated from hybridisation events between progenitor diploid species B. rapa and B. oleracea, both important vegetable species. The diploid progenitors are also ancient polyploids, with remnants of two previous polyploidisation events evident in the triplicated genome structure. This history of polyploid evolution and human agricultural selection makes B. napus an excellent model with which to investigate processes of genomic evolution and selection in polyploid crops. The ease of de novo interspecific hybridisation, responsiveness to tissue culture, and the close relationship of oilseed rape to the model plant Arabidopsis thaliana, coupled with the recent availability of reference genome sequences and suites of molecular cytogenetic and high-throughput genotyping tools, allow detailed dissection of genetic, genomic and phenotypic interactions in this crop. In this review we discuss the past and present uses of B. napus as a model for polyploid speciation and evolution in crop species, along with current and developing analysis tools and resources. We further outline unanswered questions that may now be tractable to investigation. © 2016 German Botanical Society and The Royal Botanical Society of the Netherlands.
Recent Coselection in Human Populations Revealed by Protein–Protein Interaction Network
Qian, Wei; Zhou, Hang; Tang, Kun
2015-01-01
Genome-wide scans for signals of natural selection in human populations have identified a large number of candidate loci that underlie local adaptations. This is surprising given the relatively short evolutionary time since the divergence of the human population. One hypothesis that has not been formally examined is whether and how the recent human evolution may have been shaped by coselection in the context of complex molecular interactome. In this study, genome-wide signals of selection were scanned in East Asians, Europeans, and Africans using 1000 Genome data, and subsequently mapped onto the protein–protein interaction (PPI) network. We found that the candidate genes of recent positive selection localized significantly closer to each other on the PPI network than expected, revealing substantial clustering of selected genes. Furthermore, gene pairs of shorter PPI network distances showed higher similarities of their recent evolutionary paths than those further apart. Last, subnetworks enriched with recent coselection signals were identified, which are substantially overrepresented in biological pathways related to signal transduction, neurogenesis, and immune function. These results provide the first genome-wide evidence for association of recent selection signals with the PPI network, shedding light on the potential mechanisms of recent coselection in the human genome. PMID:25532814
Birney, Ewan; Stamatoyannopoulos, John A; Dutta, Anindya; Guigó, Roderic; Gingeras, Thomas R; Margulies, Elliott H; Weng, Zhiping; Snyder, Michael; Dermitzakis, Emmanouil T; Thurman, Robert E; Kuehn, Michael S; Taylor, Christopher M; Neph, Shane; Koch, Christoph M; Asthana, Saurabh; Malhotra, Ankit; Adzhubei, Ivan; Greenbaum, Jason A; Andrews, Robert M; Flicek, Paul; Boyle, Patrick J; Cao, Hua; Carter, Nigel P; Clelland, Gayle K; Davis, Sean; Day, Nathan; Dhami, Pawandeep; Dillon, Shane C; Dorschner, Michael O; Fiegler, Heike; Giresi, Paul G; Goldy, Jeff; Hawrylycz, Michael; Haydock, Andrew; Humbert, Richard; James, Keith D; Johnson, Brett E; Johnson, Ericka M; Frum, Tristan T; Rosenzweig, Elizabeth R; Karnani, Neerja; Lee, Kirsten; Lefebvre, Gregory C; Navas, Patrick A; Neri, Fidencio; Parker, Stephen C J; Sabo, Peter J; Sandstrom, Richard; Shafer, Anthony; Vetrie, David; Weaver, Molly; Wilcox, Sarah; Yu, Man; Collins, Francis S; Dekker, Job; Lieb, Jason D; Tullius, Thomas D; Crawford, Gregory E; Sunyaev, Shamil; Noble, William S; Dunham, Ian; Denoeud, France; Reymond, Alexandre; Kapranov, Philipp; Rozowsky, Joel; Zheng, Deyou; Castelo, Robert; Frankish, Adam; Harrow, Jennifer; Ghosh, Srinka; Sandelin, Albin; Hofacker, Ivo L; Baertsch, Robert; Keefe, Damian; Dike, Sujit; Cheng, Jill; Hirsch, Heather A; Sekinger, Edward A; Lagarde, Julien; Abril, Josep F; Shahab, Atif; Flamm, Christoph; Fried, Claudia; Hackermüller, Jörg; Hertel, Jana; Lindemeyer, Manja; Missal, Kristin; Tanzer, Andrea; Washietl, Stefan; Korbel, Jan; Emanuelsson, Olof; Pedersen, Jakob S; Holroyd, Nancy; Taylor, Ruth; Swarbreck, David; Matthews, Nicholas; Dickson, Mark C; Thomas, Daryl J; Weirauch, Matthew T; Gilbert, James; Drenkow, Jorg; Bell, Ian; Zhao, XiaoDong; Srinivasan, K G; Sung, Wing-Kin; Ooi, Hong Sain; Chiu, Kuo Ping; Foissac, Sylvain; Alioto, Tyler; Brent, Michael; Pachter, Lior; Tress, Michael L; Valencia, Alfonso; Choo, Siew Woh; Choo, Chiou Yu; Ucla, Catherine; Manzano, Caroline; Wyss, Carine; Cheung, Evelyn; Clark, Taane G; Brown, James B; Ganesh, Madhavan; Patel, Sandeep; Tammana, Hari; Chrast, Jacqueline; Henrichsen, Charlotte N; Kai, Chikatoshi; Kawai, Jun; Nagalakshmi, Ugrappa; Wu, Jiaqian; Lian, Zheng; Lian, Jin; Newburger, Peter; Zhang, Xueqing; Bickel, Peter; Mattick, John S; Carninci, Piero; Hayashizaki, Yoshihide; Weissman, Sherman; Hubbard, Tim; Myers, Richard M; Rogers, Jane; Stadler, Peter F; Lowe, Todd M; Wei, Chia-Lin; Ruan, Yijun; Struhl, Kevin; Gerstein, Mark; Antonarakis, Stylianos E; Fu, Yutao; Green, Eric D; Karaöz, Ulaş; Siepel, Adam; Taylor, James; Liefer, Laura A; Wetterstrand, Kris A; Good, Peter J; Feingold, Elise A; Guyer, Mark S; Cooper, Gregory M; Asimenos, George; Dewey, Colin N; Hou, Minmei; Nikolaev, Sergey; Montoya-Burgos, Juan I; Löytynoja, Ari; Whelan, Simon; Pardi, Fabio; Massingham, Tim; Huang, Haiyan; Zhang, Nancy R; Holmes, Ian; Mullikin, James C; Ureta-Vidal, Abel; Paten, Benedict; Seringhaus, Michael; Church, Deanna; Rosenbloom, Kate; Kent, W James; Stone, Eric A; Batzoglou, Serafim; Goldman, Nick; Hardison, Ross C; Haussler, David; Miller, Webb; Sidow, Arend; Trinklein, Nathan D; Zhang, Zhengdong D; Barrera, Leah; Stuart, Rhona; King, David C; Ameur, Adam; Enroth, Stefan; Bieda, Mark C; Kim, Jonghwan; Bhinge, Akshay A; Jiang, Nan; Liu, Jun; Yao, Fei; Vega, Vinsensius B; Lee, Charlie W H; Ng, Patrick; Shahab, Atif; Yang, Annie; Moqtaderi, Zarmik; Zhu, Zhou; Xu, Xiaoqin; Squazzo, Sharon; Oberley, Matthew J; Inman, David; Singer, Michael A; Richmond, Todd A; Munn, Kyle J; Rada-Iglesias, Alvaro; Wallerman, Ola; Komorowski, Jan; Fowler, Joanna C; Couttet, Phillippe; Bruce, Alexander W; Dovey, Oliver M; Ellis, Peter D; Langford, Cordelia F; Nix, David A; Euskirchen, Ghia; Hartman, Stephen; Urban, Alexander E; Kraus, Peter; Van Calcar, Sara; Heintzman, Nate; Kim, Tae Hoon; Wang, Kun; Qu, Chunxu; Hon, Gary; Luna, Rosa; Glass, Christopher K; Rosenfeld, M Geoff; Aldred, Shelley Force; Cooper, Sara J; Halees, Anason; Lin, Jane M; Shulha, Hennady P; Zhang, Xiaoling; Xu, Mousheng; Haidar, Jaafar N S; Yu, Yong; Ruan, Yijun; Iyer, Vishwanath R; Green, Roland D; Wadelius, Claes; Farnham, Peggy J; Ren, Bing; Harte, Rachel A; Hinrichs, Angie S; Trumbower, Heather; Clawson, Hiram; Hillman-Jackson, Jennifer; Zweig, Ann S; Smith, Kayla; Thakkapallayil, Archana; Barber, Galt; Kuhn, Robert M; Karolchik, Donna; Armengol, Lluis; Bird, Christine P; de Bakker, Paul I W; Kern, Andrew D; Lopez-Bigas, Nuria; Martin, Joel D; Stranger, Barbara E; Woodroffe, Abigail; Davydov, Eugene; Dimas, Antigone; Eyras, Eduardo; Hallgrímsdóttir, Ingileif B; Huppert, Julian; Zody, Michael C; Abecasis, Gonçalo R; Estivill, Xavier; Bouffard, Gerard G; Guan, Xiaobin; Hansen, Nancy F; Idol, Jacquelyn R; Maduro, Valerie V B; Maskeri, Baishali; McDowell, Jennifer C; Park, Morgan; Thomas, Pamela J; Young, Alice C; Blakesley, Robert W; Muzny, Donna M; Sodergren, Erica; Wheeler, David A; Worley, Kim C; Jiang, Huaiyang; Weinstock, George M; Gibbs, Richard A; Graves, Tina; Fulton, Robert; Mardis, Elaine R; Wilson, Richard K; Clamp, Michele; Cuff, James; Gnerre, Sante; Jaffe, David B; Chang, Jean L; Lindblad-Toh, Kerstin; Lander, Eric S; Koriabine, Maxim; Nefedov, Mikhail; Osoegawa, Kazutoyo; Yoshinaga, Yuko; Zhu, Baoli; de Jong, Pieter J
2007-06-14
We report the generation and analysis of functional data from multiple, diverse experiments performed on a targeted 1% of the human genome as part of the pilot phase of the ENCODE Project. These data have been further integrated and augmented by a number of evolutionary and computational analyses. Together, our results advance the collective knowledge about human genome function in several major areas. First, our studies provide convincing evidence that the genome is pervasively transcribed, such that the majority of its bases can be found in primary transcripts, including non-protein-coding transcripts, and those that extensively overlap one another. Second, systematic examination of transcriptional regulation has yielded new understanding about transcription start sites, including their relationship to specific regulatory sequences and features of chromatin accessibility and histone modification. Third, a more sophisticated view of chromatin structure has emerged, including its inter-relationship with DNA replication and transcriptional regulation. Finally, integration of these new sources of information, in particular with respect to mammalian evolution based on inter- and intra-species sequence comparisons, has yielded new mechanistic and evolutionary insights concerning the functional landscape of the human genome. Together, these studies are defining a path for pursuit of a more comprehensive characterization of human genome function.
Origin of amphibian and avian chromosomes by fission, fusion, and retention of ancestral chromosomes
Voss, Stephen R.; Kump, D. Kevin; Putta, Srikrishna; Pauly, Nathan; Reynolds, Anna; Henry, Rema J.; Basa, Saritha; Walker, John A.; Smith, Jeramiah J.
2011-01-01
Amphibian genomes differ greatly in DNA content and chromosome size, morphology, and number. Investigations of this diversity are needed to identify mechanisms that have shaped the evolution of vertebrate genomes. We used comparative mapping to investigate the organization of genes in the Mexican axolotl (Ambystoma mexicanum), a species that presents relatively few chromosomes (n = 14) and a gigantic genome (>20 pg/N). We show extensive conservation of synteny between Ambystoma, chicken, and human, and a positive correlation between the length of conserved segments and genome size. Ambystoma segments are estimated to be four to 51 times longer than homologous human and chicken segments. Strikingly, genes demarking the structures of 28 chicken chromosomes are ordered among linkage groups defining the Ambystoma genome, and we show that these same chromosomal segments are also conserved in a distantly related anuran amphibian (Xenopus tropicalis). Using linkage relationships from the amphibian maps, we predict that three chicken chromosomes originated by fusion, nine to 14 originated by fission, and 12–17 evolved directly from ancestral tetrapod chromosomes. We further show that some ancestral segments were fused prior to the divergence of salamanders and anurans, while others fused independently and randomly as chromosome numbers were reduced in lineages leading to Ambystoma and Xenopus. The maintenance of gene order relationships between chromosomal segments that have greatly expanded and contracted in salamander and chicken genomes, respectively, suggests selection to maintain synteny relationships and/or extremely low rates of chromosomal rearrangement. Overall, the results demonstrate the value of data from diverse, amphibian genomes in studies of vertebrate genome evolution. PMID:21482624
Orangutan Alu quiescence reveals possible source element: support for ancient backseat drivers
2012-01-01
Background Sequence analysis of the orangutan genome revealed that recent proliferative activity of Alu elements has been uncharacteristically quiescent in the Pongo (orangutan) lineage, compared with all previously studied primate genomes. With relatively few young polymorphic insertions, the genomic landscape of the orangutan seemed like the ideal place to search for a driver, or source element, of Alu retrotransposition. Results Here we report the identification of a nearly pristine insertion possessing all the known putative hallmarks of a retrotranspositionally competent Alu element. It is located in an intronic sequence of the DGKB gene on chromosome 7 and is highly conserved in Hominidae (the great apes), but absent from Hylobatidae (gibbon and siamang). We provide evidence for the evolution of a lineage-specific subfamily of this shared Alu insertion in orangutans and possibly the lineage leading to humans. In the orangutan genome, this insertion contains three orangutan-specific diagnostic mutations which are characteristic of the youngest polymorphic Alu subfamily, AluYe5b5_Pongo. In the Homininae lineage (human, chimpanzee and gorilla), this insertion has acquired three different mutations which are also found in a single human-specific Alu insertion. Conclusions This seemingly stealth-like amplification, ongoing at a very low rate over millions of years of evolution, suggests that this shared insertion may represent an ancient backseat driver of Alu element expansion. PMID:22541534
Orangutan Alu quiescence reveals possible source element: support for ancient backseat drivers.
Walker, Jerilyn A; Konkel, Miriam K; Ullmer, Brygg; Monceaux, Christopher P; Ryder, Oliver A; Hubley, Robert; Smit, Arian Fa; Batzer, Mark A
2012-04-30
Sequence analysis of the orangutan genome revealed that recent proliferative activity of Alu elements has been uncharacteristically quiescent in the Pongo (orangutan) lineage, compared with all previously studied primate genomes. With relatively few young polymorphic insertions, the genomic landscape of the orangutan seemed like the ideal place to search for a driver, or source element, of Alu retrotransposition. Here we report the identification of a nearly pristine insertion possessing all the known putative hallmarks of a retrotranspositionally competent Alu element. It is located in an intronic sequence of the DGKB gene on chromosome 7 and is highly conserved in Hominidae (the great apes), but absent from Hylobatidae (gibbon and siamang). We provide evidence for the evolution of a lineage-specific subfamily of this shared Alu insertion in orangutans and possibly the lineage leading to humans. In the orangutan genome, this insertion contains three orangutan-specific diagnostic mutations which are characteristic of the youngest polymorphic Alu subfamily, AluYe5b5_Pongo. In the Homininae lineage (human, chimpanzee and gorilla), this insertion has acquired three different mutations which are also found in a single human-specific Alu insertion. This seemingly stealth-like amplification, ongoing at a very low rate over millions of years of evolution, suggests that this shared insertion may represent an ancient backseat driver of Alu element expansion.
Two Rounds of Whole Genome Duplication in the Ancestral Vertebrate
Dehal, Paramvir; Boore, Jeffrey L
2005-01-01
The hypothesis that the relatively large and complex vertebrate genome was created by two ancient, whole genome duplications has been hotly debated, but remains unresolved. We reconstructed the evolutionary relationships of all gene families from the complete gene sets of a tunicate, fish, mouse, and human, and then determined when each gene duplicated relative to the evolutionary tree of the organisms. We confirmed the results of earlier studies that there remains little signal of these events in numbers of duplicated genes, gene tree topology, or the number of genes per multigene family. However, when we plotted the genomic map positions of only the subset of paralogous genes that were duplicated prior to the fish–tetrapod split, their global physical organization provides unmistakable evidence of two distinct genome duplication events early in vertebrate evolution indicated by clear patterns of four-way paralogous regions covering a large part of the human genome. Our results highlight the potential for these large-scale genomic events to have driven the evolutionary success of the vertebrate lineage. PMID:16128622
Evolution of language: Lessons from the genome.
Fisher, Simon E
2017-02-01
The post-genomic era is an exciting time for researchers interested in the biology of speech and language. Substantive advances in molecular methodologies have opened up entire vistas of investigation that were not previously possible, or in some cases even imagined. Speculations concerning the origins of human cognitive traits are being transformed into empirically addressable questions, generating specific hypotheses that can be explicitly tested using data collected from both the natural world and experimental settings. In this article, I discuss a number of promising lines of research in this area. For example, the field has begun to identify genes implicated in speech and language skills, including not just disorders but also the normal range of abilities. Such genes provide powerful entry points for gaining insights into neural bases and evolutionary origins, using sophisticated experimental tools from molecular neuroscience and developmental neurobiology. At the same time, sequencing of ancient hominin genomes is giving us an unprecedented view of the molecular genetic changes that have occurred during the evolution of our species. Synthesis of data from these complementary sources offers an opportunity to robustly evaluate alternative accounts of language evolution. Of course, this endeavour remains challenging on many fronts, as I also highlight in the article. Nonetheless, such an integrated approach holds great potential for untangling the complexities of the capacities that make us human.
Yasukochi, Yoshiki; Satta, Yoko
2015-01-01
The human cytochrome P450 (CYP) 2D6 gene is a member of the CYP2D gene subfamily, along with the CYP2D7P and CYP2D8P pseudogenes. Although the CYP2D6 enzyme has been studied extensively because of its clinical importance, the evolution of the CYP2D subfamily has not yet been fully understood. Therefore, the goal of this study was to reveal the evolutionary process of the human drug metabolic system. Here, we investigate molecular evolution of the CYP2D subfamily in primates by comparing 14 CYP2D sequences from humans to New World monkey genomes. Window analysis and statistical tests revealed that entire genomic sequences of paralogous genes were extensively homogenized by gene conversion during molecular evolution of CYP2D genes in primates. A neighbor-joining tree based on genomic sequences at the nonsubstrate recognition sites showed that CYP2D6 and CYP2D8 genes were clustered together due to gene conversion. In contrast, a phylogenetic tree using amino acid sequences at substrate recognition sites did not cluster the CYP2D6 and CYP2D8 genes, suggesting that the functional constraint on substrate specificity is one of the causes for purifying selection at the substrate recognition sites. Our results suggest that the CYP2D gene subfamily in primates has evolved to maintain the regioselectivity for a substrate hydroxylation activity between individual enzymes, even though extensive gene conversion has occurred across CYP2D coding sequences. PMID:25808902
Patumcharoenpol, Preecha; Rujirawat, Thidarat; Lohnoo, Tassanee; Yingyong, Wanta; Vanittanakom, Nongnuch; Kittichotirat, Weerayuth; Krajaejun, Theerapong
2018-02-01
Pythium insidiosum is an aquatic oomycete microorganism that causes the fatal infectious disease, pythiosis, in humans and animals. The organism has been successfully isolated from the environment worldwide. Diagnosis and treatment of pythiosis is difficult and challenging. Genome sequences of P. insidiosum , isolated from humans, are available and accessible in public databases. To further facilitate biology-, pathogenicity-, and evolution-related genomic and genetic studies of P. insidiosum , we report two additional draft genome sequences of the P. insidiosum strain CBS 573.85 (35.6 Mb in size; accession number, BCFO00000000.1) isolated from a horse with pythiosis, and strain CR02 (37.7 Mb in size; accession number, BCFR00000000.1) isolated from the environment.
“Highly evolvable malaria vectors: the genomes of 16 Anopheles mosquitoes”
Neafsey, Daniel E.; Waterhouse, Robert M.; Abai, Mohammad R.; Aganezov, Sergey S.; Alekseyev, Max A.; Allen, James E.; Amon, James; Arcà, Bruno; Arensburger, Peter; Artemov, Gleb; Assour, Lauren A.; Basseri, Hamidreza; Berlin, Aaron; Birren, Bruce W.; Blandin, Stephanie A.; Brockman, Andrew I.; Burkot, Thomas R.; Burt, Austin; Chan, Clara S.; Chauve, Cedric; Chiu, Joanna C.; Christensen, Mikkel; Costantini, Carlo; Davidson, Victoria L.M.; Deligianni, Elena; Dottorini, Tania; Dritsou, Vicky; Gabriel, Stacey B.; Guelbeogo, Wamdaogo M.; Hall, Andrew B.; Han, Mira V.; Hlaing, Thaung; Hughes, Daniel S.T.; Jenkins, Adam M.; Jiang, Xiaofang; Jungreis, Irwin; Kakani, Evdoxia G.; Kamali, Maryam; Kemppainen, Petri; Kennedy, Ryan C.; Kirmitzoglou, Ioannis K.; Koekemoer, Lizette L.; Laban, Njoroge; Langridge, Nicholas; Lawniczak, Mara K.N.; Lirakis, Manolis; Lobo, Neil F.; Lowy, Ernesto; MacCallum, Robert M.; Mao, Chunhong; Maslen, Gareth; Mbogo, Charles; McCarthy, Jenny; Michel, Kristin; Mitchell, Sara N.; Moore, Wendy; Murphy, Katherine A.; Naumenko, Anastasia N.; Nolan, Tony; Novoa, Eva M.; O'Loughlin, Samantha; Oringanje, Chioma; Oshaghi, Mohammad A.; Pakpour, Nazzy; Papathanos, Philippos A.; Peery, Ashley N.; Povelones, Michael; Prakash, Anil; Price, David P.; Rajaraman, Ashok; Reimer, Lisa J.; Rinker, David C.; Rokas, Antonis; Russell, Tanya L.; Sagnon, N'Fale; Sharakhova, Maria V.; Shea, Terrance; Simão, Felipe A.; Simard, Frederic; Slotman, Michel A.; Somboon, Pradya; Stegniy, Vladimir; Struchiner, Claudio J.; Thomas, Gregg W.C.; Tojo, Marta; Topalis, Pantelis; Tubio, José M.C.; Unger, Maria F.; Vontas, John; Walton, Catherine; Wilding, Craig S.; Willis, Judith H.; Wu, Yi-Chieh; Yan, Guiyun; Zdobnov, Evgeny M.; Zhou, Xiaofan; Catteruccia, Flaminia; Christophides, George K.; Collins, Frank H.; Cornman, Robert S.; Crisanti, Andrea; Donnelly, Martin J.; Emrich, Scott J.; Fontaine, Michael C.; Gelbart, William; Hahn, Matthew W.; Hansen, Immo A.; Howell, Paul I.; Kafatos, Fotis C.; Kellis, Manolis; Lawson, Daniel; Louis, Christos; Luckhart, Shirley; Muskavitch, Marc A.T.; Ribeiro, José M.; Riehle, Michael A.; Sharakhov, Igor V.; Tu, Zhijian; Zwiebel, Laurence J.; Besansky, Nora J.
2015-01-01
Variation in vectorial capacity for human malaria among Anopheles mosquito species is determined by many factors, including behavior, immunity, and life history. To investigate the genomic basis of vectorial capacity and explore new avenues for vector control, we sequenced the genomes of 16 anopheline mosquito species from diverse locations spanning ~100 million years of evolution. Comparative analyses show faster rates of gene gain and loss, elevated gene shuffling on the X chromosome, and more intron losses, relative to Drosophila. Some determinants of vectorial capacity, such as chemosensory genes, do not show elevated turnover, but instead diversify through protein-sequence changes. This dynamism of anopheline genes and genomes may contribute to their flexible capacity to take advantage of new ecological niches, including adapting to humans as primary hosts. PMID:25554792
Genomic Insights into the Saccharomyces sensu stricto Complex
Borneman, Anthony R.; Pretorius, Isak S.
2015-01-01
The Saccharomyces sensu stricto group encompasses species ranging from the industrially ubiquitous yeast Saccharomyces cerevisiae to those that are confined to geographically limited environmental niches. The wealth of genomic data that are now available for the Saccharomyces genus is providing unprecedented insights into the genomic processes that can drive speciation and evolution, both in the natural environment and in response to human-driven selective forces during the historical “domestication” of these yeasts for baking, brewing, and winemaking. PMID:25657346
Genomic insights into the Saccharomyces sensu stricto complex.
Borneman, Anthony R; Pretorius, Isak S
2015-02-01
The Saccharomyces sensu stricto group encompasses species ranging from the industrially ubiquitous yeast Saccharomyces cerevisiae to those that are confined to geographically limited environmental niches. The wealth of genomic data that are now available for the Saccharomyces genus is providing unprecedented insights into the genomic processes that can drive speciation and evolution, both in the natural environment and in response to human-driven selective forces during the historical "domestication" of these yeasts for baking, brewing, and winemaking. Copyright © 2015 by the Genetics Society of America.
Vincent, Antony T; Trudel, Mélanie V; Freschi, Luca; Nagar, Vandan; Gagné-Thivierge, Cynthia; Levesque, Roger C; Charette, Steve J
2016-01-12
Aeromonads make up a group of Gram-negative bacteria that includes human and fish pathogens. The Aeromonas salmonicida species has the peculiarity of including five known subspecies. However, few studies of the genomes of A. salmonicida subspecies have been reported to date. We sequenced the genomes of additional A. salmonicida isolates, including three from India, using next-generation sequencing in order to gain a better understanding of the genomic and phylogenetic links between A. salmonicida subspecies. Their relative phylogenetic positions were confirmed by a core genome phylogeny based on 1645 gene sequences. The Indian isolates, which formed a sub-group together with A. salmonicida subsp. pectinolytica, were able to grow at either at 18 °C and 37 °C, unlike the A. salmonicida psychrophilic isolates that did not grow at 37 °C. Amino acid frequencies, GC content, tRNA composition, loss and gain of genes during evolution, pseudogenes as well as genes under positive selection and the mobilome were studied to explain this intraspecies dichotomy. Insertion sequences appeared to be an important driving force that locked the psychrophilic strains into their particular lifestyle in order to conserve their genomic integrity. This observation, based on comparative genomics, is in agreement with previous results showing that insertion sequence mobility induced by heat in A. salmonicida subspecies causes genomic plasticity, resulting in a deleterious effect on the virulence of the bacterium. We provide a proof-of-concept that selfish DNAs play a major role in the evolution of bacterial species by modeling genomes.
Synaptic scaffold evolution generated components of vertebrate cognitive complexity
Nithianantharajah, J.; Komiyama, N.H.; McKechanie, A.; Johnstone, M.; Blackwood, D. H.; St Clair, D.; Emes, R.D.; van de Lagemaat, L. N.; Saksida, L.M.; Bussey, T.J.; Grant, S.G.N.
2014-01-01
The origins and evolution of higher cognitive functions including complex forms of learning, attention and executive functions are unknown. A potential mechanism driving the evolution of vertebrate cognition early in the vertebrate lineage (550 My ago) was genome duplication and subsequent diversification of postsynaptic genes. Here we report the first genetic analysis of a vertebrate gene family in cognitive functions measured using computerized touchscreens. Comparison of mice carrying mutations in all four Dlg paralogs show simple associative learning required Dlg4, while Dlg2 and Dlg3 diversified to play opposing roles in complex cognitive processes. Exploiting the translational utility of touchscreens in humans and mice, testing Dlg2 mutations in both species showed Dlg2’s role in complex learning, cognitive flexibility and attention has been highly conserved over 100 My. Dlg family mutations underlie psychiatric disorders suggesting genome evolution expanded the complexity of vertebrate cognition at the cost of susceptibility to mental illness. PMID:23201973
Parallel independent evolution of pathogenicity within the genus Yersinia
Reuter, Sandra; Connor, Thomas R.; Barquist, Lars; Walker, Danielle; Feltwell, Theresa; Harris, Simon R.; Fookes, Maria; Hall, Miquette E.; Petty, Nicola K.; Fuchs, Thilo M.; Corander, Jukka; Dufour, Muriel; Ringwood, Tamara; Savin, Cyril; Bouchier, Christiane; Martin, Liliane; Miettinen, Minna; Shubin, Mikhail; Riehm, Julia M.; Laukkanen-Ninios, Riikka; Sihvonen, Leila M.; Siitonen, Anja; Skurnik, Mikael; Falcão, Juliana Pfrimer; Fukushima, Hiroshi; Scholz, Holger C.; Prentice, Michael B.; Wren, Brendan W.; Parkhill, Julian; Carniel, Elisabeth; Achtman, Mark; McNally, Alan; Thomson, Nicholas R.
2014-01-01
The genus Yersinia has been used as a model system to study pathogen evolution. Using whole-genome sequencing of all Yersinia species, we delineate the gene complement of the whole genus and define patterns of virulence evolution. Multiple distinct ecological specializations appear to have split pathogenic strains from environmental, nonpathogenic lineages. This split demonstrates that contrary to hypotheses that all pathogenic Yersinia species share a recent common pathogenic ancestor, they have evolved independently but followed parallel evolutionary paths in acquiring the same virulence determinants as well as becoming progressively more limited metabolically. Shared virulence determinants are limited to the virulence plasmid pYV and the attachment invasion locus ail. These acquisitions, together with genomic variations in metabolic pathways, have resulted in the parallel emergence of related pathogens displaying an increasingly specialized lifestyle with a spectrum of virulence potential, an emerging theme in the evolution of other important human pathogens. PMID:24753568
Mechanisms and impact of genetic recombination in the evolution of Streptococcus pneumoniae
Chaguza, Chrispin; Cornick, Jennifer E.; Everett, Dean B.
2015-01-01
Streptococcus pneumoniae (the pneumococcus) is a highly recombinogenic bacterium responsible for a high burden of human disease globally. Genetic recombination, a process in which exogenous DNA is acquired and incorporated into its genome, is a key evolutionary mechanism employed by the pneumococcus to rapidly adapt to selective pressures. The rate at which the pneumococcus acquires genetic variation through recombination is much higher than the rate at which the organism acquires variation through spontaneous mutations. This higher rate of variation allows the pneumococcus to circumvent the host innate and adaptive immune responses, escape clinical interventions, including antibiotic therapy and vaccine introduction. The rapid influx of whole genome sequence (WGS) data and the advent of novel analysis methods and powerful computational tools for population genetics and evolution studies has transformed our understanding of how genetic recombination drives pneumococcal adaptation and evolution. Here we discuss how genetic recombination has impacted upon the evolution of the pneumococcus. PMID:25904996
Mechanisms and impact of genetic recombination in the evolution of Streptococcus pneumoniae.
Chaguza, Chrispin; Cornick, Jennifer E; Everett, Dean B
2015-01-01
Streptococcus pneumoniae (the pneumococcus) is a highly recombinogenic bacterium responsible for a high burden of human disease globally. Genetic recombination, a process in which exogenous DNA is acquired and incorporated into its genome, is a key evolutionary mechanism employed by the pneumococcus to rapidly adapt to selective pressures. The rate at which the pneumococcus acquires genetic variation through recombination is much higher than the rate at which the organism acquires variation through spontaneous mutations. This higher rate of variation allows the pneumococcus to circumvent the host innate and adaptive immune responses, escape clinical interventions, including antibiotic therapy and vaccine introduction. The rapid influx of whole genome sequence (WGS) data and the advent of novel analysis methods and powerful computational tools for population genetics and evolution studies has transformed our understanding of how genetic recombination drives pneumococcal adaptation and evolution. Here we discuss how genetic recombination has impacted upon the evolution of the pneumococcus.
History in the gene: negotiations between molecular and organismal anthropology.
Sommer, Marianne
2008-01-01
In the advertising discourse of human genetic database projects, of genetic ancestry tracing companies, and in popular books on anthropological genetics, what I refer to as the anthropological gene and genome appear as documents of human history, by far surpassing the written record and oral history in scope and accuracy as archives of our past. How did macromolecules become "documents of human evolutionary history"? Historically, molecular anthropology, a term introduced by Emile Zuckerkandl in 1962 to characterize the study of primate phylogeny and human evolution on the molecular level, asserted its claim to the privilege of interpretation regarding hominoid, hominid, and human phylogeny and evolution vis-à-vis other historical sciences such as evolutionary biology, physical anthropology, and paleoanthropology. This process will be discussed on the basis of three key conferences on primate classification and evolution that brought together exponents of the respective fields and that were held in approximately ten-years intervals between the early 1960s and the 1980s. I show how the anthropological gene and genome gained their status as the most fundamental, clean, and direct records of historical information, and how the prioritizing of these epistemic objects was part of a complex involving the objectivity of numbers, logic, and mathematics, the objectivity of machines and instruments, and the objectivity seen to reside in the epistemic objects themselves.
Lefébure, Tristan; Richards, Vince P.; Lang, Ping; Pavinski-Bitar, Paulina; Stanhope, Michael J.
2012-01-01
Streptococcus pyogenes, is an important human pathogen classified within the pyogenic group of streptococci, exclusively adapted to the human host. Our goal was to employ a comparative evolutionary approach to better understand the genomic events concomitant with S. pyogenes human adaptation. As part of ascertaining these events, we sequenced the genome of one of the potential sister species, the agricultural pathogen S. canis, and combined it in a comparative genomics reconciliation analysis with two other closely related species, Streptococcus dysgalactiae and Streptococcus equi, to determine the genes that were gained and lost during S. pyogenes evolution. Genome wide phylogenetic analyses involving 15 Streptococcus species provided convincing support for a clade of S. equi, S. pyogenes, S. dysgalactiae, and S. canis and suggested that the most likely S. pyogenes sister species was S. dysgalactiae. The reconciliation analysis identified 113 genes that were gained on the lineage leading to S. pyogenes. Almost half (46%) of these gained genes were phage associated and 14 showed significant matches to experimentally verified bacteria virulence factors. Subsequent to the origin of S. pyogenes, over half of the phage associated genes were involved in 90 different LGT events, mostly involving different strains of S. pyogenes, but with a high proportion involving the horse specific pathogen S. equi subsp. equi, with the directionality almost exclusively (86%) in the S. pyogenes to S. equi direction. Streptococcus agalactiae appears to have played an important role in the evolution of S. pyogenes with a high proportion of LGTs originating from this species. Overall the analysis suggests that S. pyogenes adaptation to the human host was achieved in part by (i) the integration of new virulence factors (e.g. speB, and the sal locus) and (ii) the construction of new regulation networks (e.g. rgg, and to some extent speB). PMID:22666370
Draft genome sequence of Enterococcus faecium strain LMG 8148.
Michiels, Joran E; Van den Bergh, Bram; Fauvart, Maarten; Michiels, Jan
2016-01-01
Enterococcus faecium, traditionally considered a harmless gut commensal, is emerging as an important nosocomial pathogen showing increasing rates of multidrug resistance. We report the draft genome sequence of E. faecium strain LMG 8148, isolated in 1968 from a human in Gothenburg, Sweden. The draft genome has a total length of 2,697,490 bp, a GC-content of 38.3 %, and 2,402 predicted protein-coding sequences. The isolation of this strain predates the emergence of E. faecium as a nosocomial pathogen. Consequently, its genome can be useful in comparative genomic studies investigating the evolution of E. faecium as a pathogen.
Active Transposition in Genomes
Huang, Cheng Ran Lisa; Burns, Kathleen H.; Boeke, Jef D.
2013-01-01
Transposons are DNA sequences capable of moving in genomes. Early evidence showed their accumulation in many species and suggested their continued activity in at least isolated organisms. In the past decade, with the development of various genomic technologies, it has become abundantly clear that ongoing activity is the rule rather than the exception. Active transposons of various classes are observed throughout plants and animals, including humans. They continue to create new insertions, have an enormous variety of structural and functional impact on genes and genomes, and play important roles in genome evolution. Transposon activities have been identified and measured by employing various strategies. Here, we summarize evidence of current transposon activity in various plant and animal genomes. PMID:23145912
The role of recombination in the origin and evolution of Alu subfamilies.
Teixeira-Silva, Ana; Silva, Raquel M; Carneiro, João; Amorim, António; Azevedo, Luísa
2013-01-01
Alus are the most abundant and successful short interspersed nuclear elements found in primate genomes. In humans, they represent about 10% of the genome, although few are retrotransposition-competent and are clustered into subfamilies according to the source gene from which they evolved. Recombination between them can lead to genomic rearrangements of clinical and evolutionary significance. In this study, we have addressed the role of recombination in the origin of chimeric Alu source genes by the analysis of all known consensus sequences of human Alus. From the allelic diversity of Alu consensus sequences, validated in extant elements resulting from whole genome searches, distinct events of recombination were detected in the origin of particular subfamilies of AluS and AluY source genes. These results demonstrate that at least two subfamilies are likely to have emerged from ectopic Alu-Alu recombination, which stimulates further research regarding the potential of chimeric active Alus to punctuate the genome.
Predicting human genetic interactions from cancer genome evolution.
Lu, Xiaowen; Megchelenbrink, Wout; Notebaart, Richard A; Huynen, Martijn A
2015-01-01
Synthetic Lethal (SL) genetic interactions play a key role in various types of biological research, ranging from understanding genotype-phenotype relationships to identifying drug-targets against cancer. Despite recent advances in empirical measuring SL interactions in human cells, the human genetic interaction map is far from complete. Here, we present a novel approach to predict this map by exploiting patterns in cancer genome evolution. First, we show that empirically determined SL interactions are reflected in various gene presence, absence, and duplication patterns in hundreds of cancer genomes. The most evident pattern that we discovered is that when one member of an SL interaction gene pair is lost, the other gene tends not to be lost, i.e. the absence of co-loss. This observation is in line with expectation, because the loss of an SL interacting pair will be lethal to the cancer cell. SL interactions are also reflected in gene expression profiles, such as an under representation of cases where the genes in an SL pair are both under expressed, and an over representation of cases where one gene of an SL pair is under expressed, while the other one is over expressed. We integrated the various previously unknown cancer genome patterns and the gene expression patterns into a computational model to identify SL pairs. This simple, genome-wide model achieves a high prediction power (AUC = 0.75) for known genetic interactions. It allows us to present for the first time a comprehensive genome-wide list of SL interactions with a high estimated prediction precision, covering up to 591,000 gene pairs. This unique list can potentially be used in various application areas ranging from biotechnology to medical genetics.
Survey of (Meta)genomic Approaches for Understanding Microbial Community Dynamics.
Sharma, Anukriti; Lal, Rup
2017-03-01
Advancement in the next generation sequencing technologies has led to evolution of the field of genomics and metagenomics in a slim duration with nominal cost at precipitous higher rate. While metagenomics and genomics can be separately used to reveal the culture-independent and culture-based microbial evolution, respectively, (meta)genomics together can be used to demonstrate results at population level revealing in-depth complex community interactions for specific ecotypes. The field of metagenomics which started with answering "who is out there?" based on 16S rRNA gene has evolved immensely with the precise organismal reconstruction at species/strain level from the deeply covered metagenome data outweighing the need to isolate bacteria of which 99% are de facto non-cultivable. In this review we have underlined the appeal of metagenomic-derived genomes in providing insights into the evolutionary patterns, growth dynamics, genome/gene-specific sweeps, and durability of environmental pressures. We have demonstrated the use of culture-based genomics and environmental shotgun metagenome data together to elucidate environment specific genome modulations via metagenomic recruitments in terms of gene loss/gain, accessory and core-genome extent. We further illustrated the benefit of (meta)genomics in the understanding of infectious diseases by deducing the relationship between human microbiota and clinical microbiology. This review summarizes the technological advances in the (meta)genomic strategies using the genome and metagenome datasets together to increase the resolution of microbial population studies.
Kilian, Mogens; Riley, David R; Jensen, Anders; Brüggemann, Holger; Tettelin, Hervé
2014-07-22
The bacterium Streptococcus pneumoniae is one of the leading causes of fatal infections affecting humans. Intriguingly, phylogenetic analysis shows that the species constitutes one evolutionary lineage in a cluster of the otherwise commensal Streptococcus mitis strains, with which humans live in harmony. In a comparative analysis of 35 genomes, including phylogenetic analyses of all predicted genes, we have shown that the pathogenic pneumococcus has evolved into a master of genomic flexibility while lineages that evolved into the nonpathogenic S. mitis secured harmonious coexistence with their host by stabilizing an approximately 15%-reduced genome devoid of many virulence genes. Our data further provide evidence that interspecies gene transfer between S. pneumoniae and S. mitis occurs in a unidirectional manner, i.e., from S. mitis to S. pneumoniae. Import of genes from S. mitis and other mitis, anginosus, and salivarius group streptococci ensured allelic replacements and antigenic diversification and has been driving the evolution of the remarkable structural diversity of capsular polysaccharides of S. pneumoniae. Our study explains how the unique structural diversity of the pneumococcal capsule emerged and conceivably will continue to increase and reveals a striking example of the fragile border between the commensal and pathogenic lifestyles. While genomic plasticity enabling quick adaptation to environmental stress is a necessity for the pathogenic streptococci, the commensal lifestyle benefits from stability. Importance: One of the leading causes of fatal infections affecting humans, Streptococcus pneumoniae, and the commensal Streptococcus mitis are closely related obligate symbionts associated with hominids. Faced with a shortage of accessible hosts, the two opposing lifestyles evolved in parallel. We have shown that the nonpathogenic S. mitis secured harmonious coexistence with its host by stabilizing a reduced genome devoid of many virulence genes. Meanwhile, the pathogenic pneumococcus evolved into a master of genomic flexibility and imports genes from S. mitis and other related streptococci. This process ensured antigenic diversification and has been driving the evolution of the remarkable structural diversity of capsular polysaccharides of S. pneumoniae, which conceivably will continue to increase and present a challenge to disease prevention. Copyright © 2014 Kilian et al.
Hargreaves, Katherine R; Flores, Cesar O; Lawley, Trevor D; Clokie, Martha R J
2014-08-26
Clostridium difficile is an important human-pathogenic bacterium causing antibiotic-associated nosocomial infections worldwide. Mobile genetic elements and bacteriophages have helped shape C. difficile genome evolution. In many bacteria, phage infection may be controlled by a form of bacterial immunity called the clustered regularly interspaced short palindromic repeats/CRISPR-associated (CRISPR/Cas) system. This uses acquired short nucleotide sequences (spacers) to target homologous sequences (protospacers) in phage genomes. C. difficile carries multiple CRISPR arrays, and in this paper we examine the relationships between the host- and phage-carried elements of the system. We detected multiple matches between spacers and regions in 31 C. difficile phage and prophage genomes. A subset of the spacers was located in prophage-carried CRISPR arrays. The CRISPR spacer profiles generated suggest that related phages would have similar host ranges. Furthermore, we show that C. difficile strains of the same ribotype could either have similar or divergent CRISPR contents. Both synonymous and nonsynonymous mutations in the protospacer sequences were identified, as well as differences in the protospacer adjacent motif (PAM), which could explain how phages escape this system. This paper illustrates how the distribution and diversity of CRISPR spacers in C. difficile, and its prophages, could modulate phage predation for this pathogen and impact upon its evolution and pathogenicity. Clostridium difficile is a significant bacterial human pathogen which undergoes continual genome evolution, resulting in the emergence of new virulent strains. Phages are major facilitators of genome evolution in other bacterial species, and we use sequence analysis-based approaches in order to examine whether the CRISPR/Cas system could control these interactions across divergent C. difficile strains. The presence of spacer sequences in prophages that are homologous to phage genomes raises an extra level of complexity in this predator-prey microbial system. Our results demonstrate that the impact of phage infection in this system is widespread and that the CRISPR/Cas system is likely to be an important aspect of the evolutionary dynamics in C. difficile. Copyright © 2014 Hargreaves et al.
Andersson, Jan O; Sjögren, Åsa M; Horner, David S; Murphy, Colleen A; Dyal, Patricia L; Svärd, Staffan G; Logsdon, John M; Ragan, Mark A; Hirt, Robert P; Roger, Andrew J
2007-01-01
Background Comparative genomic studies of the mitochondrion-lacking protist group Diplomonadida (diplomonads) has been lacking, although Giardia lamblia has been intensively studied. We have performed a sequence survey project resulting in 2341 expressed sequence tags (EST) corresponding to 853 unique clones, 5275 genome survey sequences (GSS), and eleven finished contigs from the diplomonad fish parasite Spironucleus salmonicida (previously described as S. barkhanus). Results The analyses revealed a compact genome with few, if any, introns and very short 3' untranslated regions. Strikingly different patterns of codon usage were observed in genes corresponding to frequently sampled ESTs versus genes poorly sampled, indicating that translational selection is influencing the codon usage of highly expressed genes. Rigorous phylogenomic analyses identified 84 genes – mostly encoding metabolic proteins – that have been acquired by diplomonads or their relatively close ancestors via lateral gene transfer (LGT). Although most acquisitions were from prokaryotes, more than a dozen represent likely transfers of genes between eukaryotic lineages. Many genes that provide novel insights into the genetic basis of the biology and pathogenicity of this parasitic protist were identified including 149 that putatively encode variant-surface cysteine-rich proteins which are candidate virulence factors. A number of genomic properties that distinguish S. salmonicida from its human parasitic relative G. lamblia were identified such as nineteen putative lineage-specific gene acquisitions, distinct mutational biases and codon usage and distinct polyadenylation signals. Conclusion Our results highlight the power of comparative genomic studies to yield insights into the biology of parasitic protists and the evolution of their genomes, and suggest that genetic exchange between distantly-related protist lineages may be occurring at an appreciable rate in eukaryote genome evolution. PMID:17298675
The Role of Retrotransposons in Gene Family Expansions in the Human and Mouse Genomes
Janoušek, Václav; Laukaitis, Christina M.; Yanchukov, Alexey
2016-01-01
Abstract Retrotransposons comprise a large portion of mammalian genomes. They contribute to structural changes and more importantly to gene regulation. The expansion and diversification of gene families have been implicated as sources of evolutionary novelties. Given the roles retrotransposons play in genomes, their contribution to the evolution of gene families warrants further exploration. In this study, we found a significant association between two major retrotransposon classes, LINEs and LTRs, and lineage-specific gene family expansions in both the human and mouse genomes. The distribution and diversity differ between LINEs and LTRs, suggesting that each has a distinct involvement in gene family expansion. LTRs are associated with open chromatin sites surrounding the gene families, supporting their involvement in gene regulation, whereas LINEs may play a structural role promoting gene duplication. Our findings also suggest that gene family expansions, especially in the mouse genome, undergo two phases. The first phase is characterized by elevated deposition of LTRs and their utilization in reshaping gene regulatory networks. The second phase is characterized by rapid gene family expansion due to continuous accumulation of LINEs and it appears that, in some instances at least, this could become a runaway process. We provide an example in which this has happened and we present a simulation supporting the possibility of the runaway process. Altogether we provide evidence of the contribution of retrotransposons to the expansion and evolution of gene families. Our findings emphasize the putative importance of these elements in diversification and adaptation in the human and mouse lineages. PMID:27503295
Efficient identification of Y chromosome sequences in the human and Drosophila genomes.
Carvalho, Antonio Bernardo; Clark, Andrew G
2013-11-01
Notwithstanding their biological importance, Y chromosomes remain poorly known in most species. A major obstacle to their study is the identification of Y chromosome sequences; due to its high content of repetitive DNA, in most genome projects, the Y chromosome sequence is fragmented into a large number of small, unmapped scaffolds. Identification of Y-linked genes among these fragments has yielded important insights about the origin and evolution of Y chromosomes, but the process is labor intensive, restricting studies to a small number of species. Apart from these fragmentary assemblies, in a few mammalian species, the euchromatic sequence of the Y is essentially complete, owing to painstaking BAC mapping and sequencing. Here we use female short-read sequencing and k-mer comparison to identify Y-linked sequences in two very different genomes, Drosophila virilis and human. Using this method, essentially all D. virilis scaffolds were unambiguously classified as Y-linked or not Y-linked. We found 800 new scaffolds (totaling 8.5 Mbp), and four new genes in the Y chromosome of D. virilis, including JYalpha, a gene involved in hybrid male sterility. Our results also strongly support the preponderance of gene gains over gene losses in the evolution of the Drosophila Y. In the intensively studied human genome, used here as a positive control, we recovered all previously known genes or gene families, plus a small amount (283 kb) of new, unfinished sequence. Hence, this method works in large and complex genomes and can be applied to any species with sex chromosomes.
Prediction of Steps in the Evolution of Variola Virus Host Range
Smithson, Chad; Purdy, Alex; Verster, Adrian J.; Upton, Chris
2014-01-01
Variola virus, the agent of smallpox, has a severely restricted host range (humans) but a devastatingly high mortality rate. Although smallpox has been eradicated by a World Health Organization vaccination program, knowledge of the evolutionary processes by which human super-pathogens such as variola virus arise is important. By analyzing the evolution of variola and other closely related poxviruses at the level of single nucleotide polymorphisms we detected a hotspot of genome variation within the smallpox ortholog of the vaccinia virus O1L gene, which is known to be necessary for efficient replication of vaccinia virus in human cells. These mutations in the variola virus ortholog and the subsequent loss of the functional gene from camelpox virus and taterapox virus, the two closest relatives of variola virus, strongly suggest that changes within this region of the genome may have played a key role in the switch to humans as a host for the ancestral virus and the subsequent host-range restriction that must have occurred to create the phenotype exhibited by smallpox. PMID:24626337
Prediction of steps in the evolution of variola virus host range.
Smithson, Chad; Purdy, Alex; Verster, Adrian J; Upton, Chris
2014-01-01
Variola virus, the agent of smallpox, has a severely restricted host range (humans) but a devastatingly high mortality rate. Although smallpox has been eradicated by a World Health Organization vaccination program, knowledge of the evolutionary processes by which human super-pathogens such as variola virus arise is important. By analyzing the evolution of variola and other closely related poxviruses at the level of single nucleotide polymorphisms we detected a hotspot of genome variation within the smallpox ortholog of the vaccinia virus O1L gene, which is known to be necessary for efficient replication of vaccinia virus in human cells. These mutations in the variola virus ortholog and the subsequent loss of the functional gene from camelpox virus and taterapox virus, the two closest relatives of variola virus, strongly suggest that changes within this region of the genome may have played a key role in the switch to humans as a host for the ancestral virus and the subsequent host-range restriction that must have occurred to create the phenotype exhibited by smallpox.
Greally, John M
2002-01-08
To test whether regions undergoing genomic imprinting have unique genomic characteristics, imprinted and nonimprinted human loci were compared for nucleotide and retroelement composition. Maternally and paternally expressed subgroups of imprinted genes were found to differ in terms of guanine and cytosine, CpG, and retroelement content, indicating a segregation into distinct genomic compartments. Imprinted regions have been normally permissive to L1 long interspersed transposable element retroposition during mammalian evolution but universally and significantly lack short interspersed transposable elements (SINEs). The primate-specific Alu SINEs, as well as the more ancient mammalian-wide interspersed repeat SINEs, are found at significantly low densities in imprinted regions. The latter paleogenomic signature indicates that the sequence characteristics of currently imprinted regions existed before the mammalian radiation. Transitions from imprinted to nonimprinted genomic regions in cis are characterized by a sharp inflection in SINE content, demonstrating that this genomic characteristic can help predict the presence and extent of regions undergoing imprinting. During primate evolution, SINE accumulation in imprinted regions occurred at a decreased rate compared with control loci. The constraint on SINE accumulation in imprinted regions may be mediated by an active selection process. This selection could be because of SINEs attracting and spreading methylation, as has been found at other loci. Methylation-induced silencing could lead to deleterious consequences at imprinted loci, where inactivation of one allele is already established, and expression is often essential for embryonic growth and survival.
Greally, John M.
2002-01-01
To test whether regions undergoing genomic imprinting have unique genomic characteristics, imprinted and nonimprinted human loci were compared for nucleotide and retroelement composition. Maternally and paternally expressed subgroups of imprinted genes were found to differ in terms of guanine and cytosine, CpG, and retroelement content, indicating a segregation into distinct genomic compartments. Imprinted regions have been normally permissive to L1 long interspersed transposable element retroposition during mammalian evolution but universally and significantly lack short interspersed transposable elements (SINEs). The primate-specific Alu SINEs, as well as the more ancient mammalian-wide interspersed repeat SINEs, are found at significantly low densities in imprinted regions. The latter paleogenomic signature indicates that the sequence characteristics of currently imprinted regions existed before the mammalian radiation. Transitions from imprinted to nonimprinted genomic regions in cis are characterized by a sharp inflection in SINE content, demonstrating that this genomic characteristic can help predict the presence and extent of regions undergoing imprinting. During primate evolution, SINE accumulation in imprinted regions occurred at a decreased rate compared with control loci. The constraint on SINE accumulation in imprinted regions may be mediated by an active selection process. This selection could be because of SINEs attracting and spreading methylation, as has been found at other loci. Methylation-induced silencing could lead to deleterious consequences at imprinted loci, where inactivation of one allele is already established, and expression is often essential for embryonic growth and survival. PMID:11756672
Genetic Markers of Human Evolution Are Enriched in Schizophrenia.
Srinivasan, Saurabh; Bettella, Francesco; Mattingsdal, Morten; Wang, Yunpeng; Witoelar, Aree; Schork, Andrew J; Thompson, Wesley K; Zuber, Verena; Winsvold, Bendik S; Zwart, John-Anker; Collier, David A; Desikan, Rahul S; Melle, Ingrid; Werge, Thomas; Dale, Anders M; Djurovic, Srdjan; Andreassen, Ole A
2016-08-15
Why schizophrenia has accompanied humans throughout our history despite its negative effect on fitness remains an evolutionary enigma. It is proposed that schizophrenia is a by-product of the complex evolution of the human brain and a compromise for humans' language, creative thinking, and cognitive abilities. We analyzed recent large genome-wide association studies of schizophrenia and a range of other human phenotypes (anthropometric measures, cardiovascular disease risk factors, immune-mediated diseases) using a statistical framework that draws on polygenic architecture and ancillary information on genetic variants. We used information from the evolutionary proxy measure called the Neanderthal selective sweep (NSS) score. Gene loci associated with schizophrenia are significantly (p = 7.30 × 10(-9)) more prevalent in genomic regions that are likely to have undergone recent positive selection in humans (i.e., with a low NSS score). Variants in brain-related genes with a low NSS score confer significantly higher susceptibility than variants in other brain-related genes. The enrichment is strongest for schizophrenia, but we cannot rule out enrichment for other phenotypes. The false discovery rate conditional on the evolutionary proxy points to 27 candidate schizophrenia susceptibility loci, 12 of which are associated with schizophrenia and other psychiatric disorders or linked to brain development. Our results suggest that there is a polygenic overlap between schizophrenia and NSS score, a marker of human evolution, which is in line with the hypothesis that the persistence of schizophrenia is related to the evolutionary process of becoming human. Copyright © 2016 Society of Biological Psychiatry. Published by Elsevier Inc. All rights reserved.
Ananda, Guruprasad; Hile, Suzanne E.; Breski, Amanda; Wang, Yanli; Kelkar, Yogeshwar; Makova, Kateryna D.; Eckert, Kristin A.
2014-01-01
Interruptions of microsatellite sequences impact genome evolution and can alter disease manifestation. However, human polymorphism levels at interrupted microsatellites (iMSs) are not known at a genome-wide scale, and the pathways for gaining interruptions are poorly understood. Using the 1000 Genomes Phase-1 variant call set, we interrogated mono-, di-, tri-, and tetranucleotide repeats up to 10 units in length. We detected ∼26,000–40,000 iMSs within each of four human population groups (African, European, East Asian, and American). We identified population-specific iMSs within exonic regions, and discovered that known disease-associated iMSs contain alleles present at differing frequencies among the populations. By analyzing longer microsatellites in primate genomes, we demonstrate that single interruptions result in a genome-wide average two- to six-fold reduction in microsatellite mutability, as compared with perfect microsatellites. Centrally located interruptions lowered mutability dramatically, by two to three orders of magnitude. Using a biochemical approach, we tested directly whether the mutability of a specific iMS is lower because of decreased DNA polymerase strand slippage errors. Modeling the adenomatous polyposis coli tumor suppressor gene sequence, we observed that a single base substitution interruption reduced strand slippage error rates five- to 50-fold, relative to a perfect repeat, during synthesis by DNA polymerases α, β, or η. Computationally, we demonstrate that iMSs arise primarily by base substitution mutations within individual human genomes. Our biochemical survey of human DNA polymerase α, β, δ, κ, and η error rates within certain microsatellites suggests that interruptions are created most frequently by low fidelity polymerases. Our combined computational and biochemical results demonstrate that iMSs are abundant in human genomes and are sources of population-specific genetic variation that may affect genome stability. The genome-wide identification of iMSs in human populations presented here has important implications for current models describing the impact of microsatellite polymorphisms on gene expression. PMID:25033203
Functional genomics of lactic acid bacteria: from food to health
2014-01-01
Genome analysis using next generation sequencing technologies has revolutionized the characterization of lactic acid bacteria and complete genomes of all major groups are now available. Comparative genomics has provided new insights into the natural and laboratory evolution of lactic acid bacteria and their environmental interactions. Moreover, functional genomics approaches have been used to understand the response of lactic acid bacteria to their environment. The results have been instrumental in understanding the adaptation of lactic acid bacteria in artisanal and industrial food fermentations as well as their interactions with the human host. Collectively, this has led to a detailed analysis of genes involved in colonization, persistence, interaction and signaling towards to the human host and its health. Finally, massive parallel genome re-sequencing has provided new opportunities in applied genomics, specifically in the characterization of novel non-GMO strains that have potential to be used in the food industry. Here, we provide an overview of the state of the art of these functional genomics approaches and their impact in understanding, applying and designing lactic acid bacteria for food and health. PMID:25186768
Functional genomics of lactic acid bacteria: from food to health.
Douillard, François P; de Vos, Willem M
2014-08-29
Genome analysis using next generation sequencing technologies has revolutionized the characterization of lactic acid bacteria and complete genomes of all major groups are now available. Comparative genomics has provided new insights into the natural and laboratory evolution of lactic acid bacteria and their environmental interactions. Moreover, functional genomics approaches have been used to understand the response of lactic acid bacteria to their environment. The results have been instrumental in understanding the adaptation of lactic acid bacteria in artisanal and industrial food fermentations as well as their interactions with the human host. Collectively, this has led to a detailed analysis of genes involved in colonization, persistence, interaction and signaling towards to the human host and its health. Finally, massive parallel genome re-sequencing has provided new opportunities in applied genomics, specifically in the characterization of novel non-GMO strains that have potential to be used in the food industry. Here, we provide an overview of the state of the art of these functional genomics approaches and their impact in understanding, applying and designing lactic acid bacteria for food and health.
Has Rift Valley fever virus evolved with increasing severity in human populations in East Africa?
Baba, Marycelin; Masiga, Daniel K; Sang, Rosemary; Villinger, Jandouwe
2016-06-22
Rift Valley fever (RVF) outbreaks have occurred across eastern Africa from 1912 to 2010 approximately every 4-15 years, most of which have not been accompanied by significant epidemics in human populations. However, human epidemics during RVF outbreaks in eastern Africa have involved 478 deaths in 1998, 1107 reported cases with 350 deaths from 2006 to 2007 and 1174 cases with 241 deaths in 2008. We review the history of RVF outbreaks in eastern Africa to identify the epidemiological factors that could have influenced its increasing severity in humans. Diverse ecological factors influence outbreak frequency, whereas virus evolution has a greater impact on its virulence in hosts. Several factors could have influenced the lack of information on RVF in humans during earlier outbreaks, but the explosive nature of human RVF epidemics in recent years mirrors the evolutionary trend of the virus. Comparisons between isolates from different outbreaks have revealed an accumulation of genetic mutations and genomic reassortments that have diversified RVF virus genomes over several decades. The threat to humans posed by the diversified RVF virus strains increases the potential public health and socioeconomic impacts of future outbreaks. Understanding the shifting RVF epidemiology as determined by its evolution is key to developing new strategies for outbreak mitigation and prevention of future human RVF casualties.
Medaka: a promising model animal for comparative population genomics
Matsumoto, Yoshifumi; Oota, Hiroki; Asaoka, Yoichi; Nishina, Hiroshi; Watanabe, Koji; Bujnicki, Janusz M; Oda, Shoji; Kawamura, Shoji; Mitani, Hiroshi
2009-01-01
Background Within-species genome diversity has been best studied in humans. The international HapMap project has revealed a tremendous amount of single-nucleotide polymorphisms (SNPs) among humans, many of which show signals of positive selection during human evolution. In most of the cases, however, functional differences between the alleles remain experimentally unverified due to the inherent difficulty of human genetic studies. It would therefore be highly useful to have a vertebrate model with the following characteristics: (1) high within-species genetic diversity, (2) a variety of gene-manipulation protocols already developed, and (3) a completely sequenced genome. Medaka (Oryzias latipes) and its congeneric species, tiny fresh-water teleosts distributed broadly in East and Southeast Asia, meet these criteria. Findings Using Oryzias species from 27 local populations, we conducted a simple screening of nonsynonymous SNPs for 11 genes with apparent orthology between medaka and humans. We found medaka SNPs for which the same sites in human orthologs are known to be highly differentiated among the HapMap populations. Importantly, some of these SNPs show signals of positive selection. Conclusion These results indicate that medaka is a promising model system for comparative population genomics exploring the functional and adaptive significance of allelic differentiations. PMID:19426554
Belkorchia, Abdel; Biderre, Corinne; Militon, Cécile; Polonais, Valérie; Wincker, Patrick; Jubin, Claire; Delbac, Frédéric; Peyretaillade, Eric; Peyret, Pierre
2008-03-01
Brachiola algerae has a broad host spectrum from human to mosquitoes. The successful infection of two mosquito cell lines (Mos55: embryonic cells and Sua 4.0: hemocyte-like cells) and a human cell line (HFF) highlights the efficient adaptive capacity of this microsporidian pathogen. The molecular karyotype of this microsporidian species was determined in the context of the B. algerae genome sequencing project, showing that its haploid genome consists of 30 chromosomal-sized DNAs ranging from 160 to 2240 kbp giving an estimated genome size of 23 Mbp. A contig of 12,269 bp including the DNA sequence of the B. algerae ribosomal transcription unit has been built from initial genomic sequences and the secondary structure of the large subunit rRNA constructed. The data obtained indicate that B. algerae should be an excellent parasitic model to understand genome evolution in relation to infectious capacity.
Human-Specific Duplication and Mosaic Transcripts: The Recent Paralogous Structure of Chromosome 22
Bailey, Jeffrey A. ; Yavor, Amy M. ; Viggiano, Luigi ; Misceo, Doriana ; Horvath, Juliann E. ; Archidiacono, Nicoletta ; Schwartz, Stuart ; Rocchi, Mariano ; Eichler, Evan E.
2002-01-01
In recent decades, comparative chromosomal banding, chromosome painting, and gene-order studies have shown strong conservation of gross chromosome structure and gene order in mammals. However, findings from the human genome sequence suggest an unprecedented degree of recent (<35 million years ago) segmental duplication. This dynamism of segmental duplications has important implications in disease and evolution. Here we present a chromosome-wide view of the structure and evolution of the most highly homologous duplications (⩾1 kb and ⩾90%) on chromosome 22. Overall, 10.8% (3.7/33.8 Mb) of chromosome 22 is duplicated, with an average sequence identity of 95.4%. To organize the duplications into tractable units, intron-exon structure and well-defined duplication boundaries were used to define 78 duplicated modules (minimally shared evolutionary segments) with 157 copies on chromosome 22. Analysis of these modules provides evidence for the creation or modification of 11 novel transcripts. Comparative FISH analyses of human, chimpanzee, gorilla, orangutan, and macaque reveal qualitative and quantitative differences in the distribution of these duplications—consistent with their recent origin. Several duplications appear to be human specific, including a ∼400-kb duplication (99.4%–99.8% sequence identity) that transposed from chromosome 14 to the most proximal pericentromeric region of chromosome 22. Experimental and in silico data further support a pericentromeric gradient of duplications where the most recent duplications transpose adjacent to the centromere. Taken together, these data suggest that segmental duplications have been an ongoing process of primate genome evolution, contributing to recent gene innovation and the dynamic transformation of genome architecture within and among closely related species. PMID:11731936
Zeng, Jia; Yi, Soojin V.
2014-01-01
Recombination clusters nonuniformly across mammalian genomes at discrete genomic loci referred to as recombination hotspots. Despite their ubiquitous presence, individual hotspots rapidly lose their activities, and the molecular and evolutionary mechanisms underlying such frequent hotspot turnovers (the so-called “recombination hotspot paradox”) remain unresolved. Even though some sequence motifs are significantly associated with hotspots, multiple lines of evidence indicate that factors other than underlying sequences, such as epigenetic modifications, may affect the evolution of recombination hotspots. Thus, identifying epigenetic factors that covary with recombination at fine-scale is a promising step for this important research area. It was previously reported that recombination rates correlate with indirect measures of DNA methylation in the human genome. Here, we analyze experimentally determined DNA methylation and histone modification of human sperms, and show that the correlation between DNA methylation and recombination in long-range windows does not hold with respect to the spatial and temporal variation of recombination at hotspots. On the other hand, two histone modifications (H3K4me3 and H3K27me3) overlap extensively with recombination hotspots. Similar trends were observed in mice. These results indicate that specific histone modifications rather than DNA methylation are associated with the rapid evolution of recombination hotspots. Furthermore, many human recombination hotspots occupy “bivalent” chromatin regions that harbor both active (H3K4me3) and repressive (H3K27me3) marks. This may explain why human recombination hotspots tend to occur in nongenic regions, in contrast to yeast and Arabidopsis hotspots that are characterized by generally active chromatins. Our results highlight the dynamic epigenetic underpinnings of recombination hotspot evolution. PMID:25326136
Maeso, Ignacio; Dunwell, Thomas L; Wyatt, Chris D R; Marlétaz, Ferdinand; Vető, Borbála; Bernal, Juan A; Quah, Shan; Irimia, Manuel; Holland, Peter W H
2016-06-13
A central goal of evolutionary biology is to link genomic change to phenotypic evolution. The origin of new transcription factors is a special case of genomic evolution since it brings opportunities for novel regulatory interactions and potentially the emergence of new biological properties. We demonstrate that a group of four homeobox gene families (Argfx, Leutx, Dprx, Tprx), plus a gene newly described here (Pargfx), arose by tandem gene duplication from the retinal-expressed Crx gene, followed by asymmetric sequence evolution. We show these genes arose as part of repeated gene gain and loss events on a dynamic chromosomal region in the stem lineage of placental mammals, on the forerunner of human chromosome 19. The human orthologues of these genes are expressed specifically in early embryo totipotent cells, peaking from 8-cell to morula, prior to cell fate restrictions; cow orthologues have similar expression. To examine biological roles, we used ectopic gene expression in cultured human cells followed by high-throughput RNA-seq and uncovered extensive transcriptional remodelling driven by three of the genes. Comparison to transcriptional profiles of early human embryos suggest roles in activating and repressing a set of developmentally-important genes that spike at 8-cell to morula, rather than a general role in genome activation. We conclude that a dynamic chromosome region spawned a set of evolutionarily new homeobox genes, the ETCHbox genes, specifically in eutherian mammals. After these genes diverged from the parental Crx gene, we argue they were recruited for roles in the preimplantation embryo including activation of genes at the 8-cell stage and repression after morula. We propose these new homeobox gene roles permitted fine-tuning of cell fate decisions necessary for specification and function of embryonic and extra-embryonic tissues utilised in mammalian development and pregnancy.
Genome-wide diversity and selective pressure in the human rhinovirus
Kistler, Amy L; Webster, Dale R; Rouskin, Silvi; Magrini, Vince; Credle, Joel J; Schnurr, David P; Boushey, Homer A; Mardis, Elaine R; Li, Hao; DeRisi, Joseph L
2007-01-01
Background The human rhinoviruses (HRV) are one of the most common and diverse respiratory pathogens of humans. Over 100 distinct HRV serotypes are known, yet only 6 genomes are available. Due to the paucity of HRV genome sequence, little is known about the genetic diversity within HRV or the forces driving this diversity. Previous comparative genome sequence analyses indicate that recombination drives diversification in multiple genera of the picornavirus family, yet it remains unclear if this holds for HRV. Results To resolve this and gain insight into the forces driving diversification in HRV, we generated a representative set of 34 fully sequenced HRVs. Analysis of these genomes shows consistent phylogenies across the genome, conserved non-coding elements, and only limited recombination. However, spikes of genetic diversity at both the nucleotide and amino acid level are detectable within every locus of the genome. Despite this, the HRV genome as a whole is under purifying selective pressure, with islands of diversifying pressure in the VP1, VP2, and VP3 structural genes and two non-structural genes, the 3C protease and 3D polymerase. Mapping diversifying residues in these factors onto available 3-dimensional structures revealed the diversifying capsid residues partition to the external surface of the viral particle in statistically significant proximity to antigenic sites. Diversifying pressure in the pleconaril binding site is confined to a single residue known to confer drug resistance (VP1 191). In contrast, diversifying pressure in the non-structural genes is less clear, mapping both nearby and beyond characterized functional domains of these factors. Conclusion This work provides a foundation for understanding HRV genetic diversity and insight into the underlying biology driving evolution in HRV. It expands our knowledge of the genome sequence space that HRV reference serotypes occupy and how the pattern of genetic diversity across HRV genomes differs from other picornaviruses. It also reveals evidence of diversifying selective pressure in both structural genes known to interact with the host immune system and in domains of unassigned function in the non-structural 3C and 3D genes, raising the possibility that diversification of undiscovered functions in these essential factors may influence HRV fitness and evolution. PMID:17477878
Bolzán, Alejandro D
2017-07-01
By definition, telomeric sequences are located at the very ends or terminal regions of chromosomes. However, several vertebrate species show blocks of (TTAGGG)n repeats present in non-terminal regions of chromosomes, the so-called interstitial telomeric sequences (ITSs), interstitial telomeric repeats or interstitial telomeric bands, which include those intrachromosomal telomeric-like repeats located near (pericentromeric ITSs) or within the centromere (centromeric ITSs) and those telomeric repeats located between the centromere and the telomere (i.e., truly interstitial telomeric sequences) of eukaryotic chromosomes. According with their sequence organization, localization and flanking sequences, ITSs can be classified into four types: 1) short ITSs, 2) subtelomeric ITSs, 3) fusion ITSs, and 4) heterochromatic ITSs. The first three types have been described mainly in the human genome, whereas heterochromatic ITSs have been found in several vertebrate species but not in humans. Several lines of evidence suggest that ITSs play a significant role in genome instability and evolution. This review aims to summarize our current knowledge about the origin, function, instability and evolution of these telomeric-like repeats in vertebrate chromosomes. Copyright © 2017 Elsevier B.V. All rights reserved.
Bull, James J; Barrick, Jeffrey E
2017-12-01
Evolution in the form of selective breeding has long been harnessed as a useful tool by humans. However, rapid evolution can also be a danger to our health and a stumbling block for biotechnology. Unwanted evolution can underlie the emergence of drug and pesticide resistance, cancer, and weeds. It makes live vaccines and engineered cells inherently unreliable and unpredictable, and therefore potentially unsafe. Yet, there are strategies that have been and can possibly be used to stop or slow many types of evolution. We review and classify existing population genetics-inspired methods for arresting evolution. Then, we discuss how genome editing techniques enable a radically new set of approaches to limit evolution. Copyright © 2017 The Authors. Published by Elsevier Ltd.. All rights reserved.
Rogozin, Igor B; Wolf, Yuri I; Sorokin, Alexander V; Mirkin, Boris G; Koonin, Eugene V
2003-09-02
Sequencing of eukaryotic genomes allows one to address major evolutionary problems, such as the evolution of gene structure. We compared the intron positions in 684 orthologous gene sets from 8 complete genomes of animals, plants, fungi, and protists and constructed parsimonious scenarios of evolution of the exon-intron structure for the respective genes. Approximately one-third of the introns in the malaria parasite Plasmodium falciparum are shared with at least one crown group eukaryote; this number indicates that these introns have been conserved through >1.5 billion years of evolution that separate Plasmodium from the crown group. Paradoxically, humans share many more introns with the plant Arabidopsis thaliana than with the fly or nematode. The inferred evolutionary scenario holds that the common ancestor of Plasmodium and the crown group and, especially, the common ancestor of animals, plants, and fungi had numerous introns. Most of these ancestral introns, which are retained in the genomes of vertebrates and plants, have been lost in fungi, nematodes, arthropods, and probably Plasmodium. In addition, numerous introns have been inserted into vertebrate and plant genes, whereas, in other lineages, intron gain was much less prominent.
Uniparental Inheritance Promotes Adaptive Evolution in Cytoplasmic Genomes.
Christie, Joshua R; Beekman, Madeleine
2017-03-01
Eukaryotes carry numerous asexual cytoplasmic genomes (mitochondria and plastids). Lacking recombination, asexual genomes should theoretically suffer from impaired adaptive evolution. Yet, empirical evidence indicates that cytoplasmic genomes experience higher levels of adaptive evolution than predicted by theory. In this study, we use a computational model to show that the unique biology of cytoplasmic genomes-specifically their organization into host cells and their uniparental (maternal) inheritance-enable them to undergo effective adaptive evolution. Uniparental inheritance of cytoplasmic genomes decreases competition between different beneficial substitutions (clonal interference), promoting the accumulation of beneficial substitutions. Uniparental inheritance also facilitates selection against deleterious cytoplasmic substitutions, slowing Muller's ratchet. In addition, uniparental inheritance generally reduces genetic hitchhiking of deleterious substitutions during selective sweeps. Overall, uniparental inheritance promotes adaptive evolution by increasing the level of beneficial substitutions relative to deleterious substitutions. When we assume that cytoplasmic genome inheritance is biparental, decreasing the number of genomes transmitted during gametogenesis (bottleneck) aids adaptive evolution. Nevertheless, adaptive evolution is always more efficient when inheritance is uniparental. Our findings explain empirical observations that cytoplasmic genomes-despite their asexual mode of reproduction-can readily undergo adaptive evolution. © The Author 2016. Published by Oxford University Press on behalf of the Society for Molecular Biology and Evolution.
Molecular networks and the evolution of human cognitive specializations.
Fontenot, Miles; Konopka, Genevieve
2014-12-01
Inroads into elucidating the origins of human cognitive specializations have taken many forms, including genetic, genomic, anatomical, and behavioral assays that typically compare humans to non-human primates. While the integration of all of these approaches is essential for ultimately understanding human cognition, here, we review the usefulness of coexpression network analysis for specifically addressing this question. An increasing number of studies have incorporated coexpression networks into brain expression studies comparing species, disease versus control tissue, brain regions, or developmental time periods. A clearer picture has emerged of the key genes driving brain evolution, as well as the developmental and regional contributions of gene expression patterns important for normal brain development and those misregulated in cognitive diseases. Copyright © 2014 Elsevier Ltd. All rights reserved.
Genome-nutrition divergence: evolving understanding of the malnutrition spectrum.
Eaton, Jacob C; Iannotti, Lora L
2017-11-01
Humans adapted over a period of 2.3 million years to a diet high in quality and diversity. Genome-nutrition divergence describes the misalignment between modern global diets and the genome formed through evolution. A survey of hominin diets over time shows that humans have thrived on a broad range of foods. Earlier diets were highly diverse and nutrient dense, in contrast to modern food systems in which monotonous diets of staple cereals and ultraprocessed foods play a more prominent role. Applying the lens of genome-nutrition divergence to malnutrition reveals shared risk factors for undernutrition and overnutrition at nutrient, food, and environmental levels. Mechanisms for food system shifts, such as crop-neutral agricultural policy, agroecology, and social policy, are explored as a means to realign modern diets with the nutritional patterns to which humans may be better adapted to thrive. © The Author(s) 2017. Published by Oxford University Press on behalf of the International Life Sciences Institute. All rights reserved. For Permissions, please e-mail: journals.permissions@oup.com.
UCbase 2.0: ultraconserved sequences database (2014 update).
Lomonaco, Vincenzo; Martoglia, Riccardo; Mandreoli, Federica; Anderlucci, Laura; Emmett, Warren; Bicciato, Silvio; Taccioli, Cristian
2014-01-01
UCbase 2.0 (http://ucbase.unimore.it) is an update, extension and evolution of UCbase, a Web tool dedicated to the analysis of ultraconserved sequences (UCRs). UCRs are 481 sequences >200 bases sharing 100% identity among human, mouse and rat genomes. They are frequently located in genomic regions known to be involved in cancer or differentially expressed in human leukemias and carcinomas. UCbase 2.0 is a platform-independent Web resource that includes the updated version of the human genome annotation (hg19), information linking disorders to chromosomal coordinates based on the Systematized Nomenclature of Medicine classification, a query tool to search for Single Nucleotide Polymorphisms (SNPs) and a new text box to directly interrogate the database using a MySQL interface. To facilitate the interactive visual interpretation of UCR chromosomal positioning, UCbase 2.0 now includes a graph visualization interface directly linked to UCSC genome browser. Database URL: http://ucbase.unimore.it. © The Author(s) 2014. Published by Oxford University Press.
Population genomics reveals the origin and asexual evolution of human infective trypanosomes
Weir, William; Capewell, Paul; Foth, Bernardo; Clucas, Caroline; Pountain, Andrew; Steketee, Pieter; Veitch, Nicola; Koffi, Mathurin; De Meeûs, Thierry; Kaboré, Jacques; Camara, Mamadou; Cooper, Anneli; Tait, Andy; Jamonneau, Vincent; Bucheton, Bruno; Berriman, Matt; MacLeod, Annette
2016-01-01
Evolutionary theory predicts that the lack of recombination and chromosomal re-assortment in strictly asexual organisms results in homologous chromosomes irreversibly accumulating mutations and thus evolving independently of each other, a phenomenon termed the Meselson effect. We apply a population genomics approach to examine this effect in an important human pathogen, Trypanosoma brucei gambiense. We determine that T.b. gambiense is evolving strictly asexually and is derived from a single progenitor, which emerged within the last 10,000 years. We demonstrate the Meselson effect for the first time at the genome-wide level in any organism and show large regions of loss of heterozygosity, which we hypothesise to be a short-term compensatory mechanism for counteracting deleterious mutations. Our study sheds new light on the genomic and evolutionary consequences of strict asexuality, which this pathogen uses as it exploits a new biological niche, the human population. DOI: http://dx.doi.org/10.7554/eLife.11473.001 PMID:26809473
Adaptive potential of genomic structural variation in human and mammalian evolution.
Radke, David W; Lee, Charles
2015-09-01
Because phenotypic innovations must be genetically heritable for biological evolution to proceed, it is natural to consider new mutation events as well as standing genetic variation as sources for their birth. Previous research has identified a number of single-nucleotide polymorphisms that underlie a subset of adaptive traits in organisms. However, another well-known class of variation, genomic structural variation, could have even greater potential to produce adaptive phenotypes, due to the variety of possible types of alterations (deletions, insertions, duplications, among others) at different genomic positions and with variable lengths. It is from these dramatic genomic alterations, and selection on their phenotypic consequences, that adaptations leading to biological diversification could be derived. In this review, using studies in humans and other mammals, we highlight examples of how phenotypic variation from structural variants might become adaptive in populations and potentially enable biological diversification. Phenotypic change arising from structural variants will be described according to their immediate effect on organismal metabolic processes, immunological response and physical features. Study of population dynamics of segregating structural variation can therefore provide a window into understanding current and historical biological diversification. © The Author 2015. Published by Oxford University Press. All rights reserved. For permissions, please email: journals.permissions@oup.com.
Guerrini, Valentina; Subbian, Selvakumar; Santucci, Pierre; Canaan, Stéphane; Pozzi, Gianni
2016-01-01
Isolates of the human pathogen Mycobacterium tuberculosis recovered from clinical samples exhibit genetic heterogeneity. Such variation may result from the stressful environment encountered by the pathogen inside the macrophage, which is the host cell tubercle bacilli parasitize. To study the evolution of the M. tuberculosis genome during growth inside macrophages, we developed a model of intracellular culture in which bacteria were serially passaged in macrophage-like THP-1 cells for about 80 bacterial generations. Genome sequencing of single bacterial colonies isolated before and after the infection cycles revealed that M. tuberculosis developed mutations at a rate of about 5.7 × 10−9 / bp/ generation, consistent with mutation rates calculated during in vivo infection. Analysis of mutant growth in macrophages and in mice showed that the mutations identified after the cyclic infection conferred no advantage to the mutants relative to wild-type. Furthermore, activity testing of the recombinant protein harboring one of these mutations showed that the presence of the mutation did not affect the enzymatic activity. The serial infection protocol developed in this work to study M. tuberculosis genome microevolution can be applied to exposure to stressors to determine their effect on genome remodeling during intra-macrophage growth. PMID:27959952
Guerrini, Valentina; Subbian, Selvakumar; Santucci, Pierre; Canaan, Stéphane; Gennaro, Maria Laura; Pozzi, Gianni
2016-01-01
Isolates of the human pathogen Mycobacterium tuberculosis recovered from clinical samples exhibit genetic heterogeneity. Such variation may result from the stressful environment encountered by the pathogen inside the macrophage, which is the host cell tubercle bacilli parasitize. To study the evolution of the M. tuberculosis genome during growth inside macrophages, we developed a model of intracellular culture in which bacteria were serially passaged in macrophage-like THP-1 cells for about 80 bacterial generations. Genome sequencing of single bacterial colonies isolated before and after the infection cycles revealed that M. tuberculosis developed mutations at a rate of about 5.7 × 10-9 / bp/ generation, consistent with mutation rates calculated during in vivo infection. Analysis of mutant growth in macrophages and in mice showed that the mutations identified after the cyclic infection conferred no advantage to the mutants relative to wild-type. Furthermore, activity testing of the recombinant protein harboring one of these mutations showed that the presence of the mutation did not affect the enzymatic activity. The serial infection protocol developed in this work to study M. tuberculosis genome microevolution can be applied to exposure to stressors to determine their effect on genome remodeling during intra-macrophage growth.
Parallel or convergent evolution in human population genomic data revealed by genotype networks.
R Vahdati, Ali; Wagner, Andreas
2016-08-02
Genotype networks are representations of genetic variation data that are complementary to phylogenetic trees. A genotype network is a graph whose nodes are genotypes (DNA sequences) with the same broadly defined phenotype. Two nodes are connected if they differ in some minimal way, e.g., in a single nucleotide. We analyze human genome variation data from the 1,000 genomes project, and construct haploid genotype (haplotype) networks for 12,235 protein coding genes. The structure of these networks varies widely among genes, indicating different patterns of variation despite a shared evolutionary history. We focus on those genes whose genotype networks show many cycles, which can indicate homoplasy, i.e., parallel or convergent evolution, on the sequence level. For 42 genes, the observed number of cycles is so large that it cannot be explained by either chance homoplasy or recombination. When analyzing possible explanations, we discovered evidence for positive selection in 21 of these genes and, in addition, a potential role for constrained variation and purifying selection. Balancing selection plays at most a small role. The 42 genes with excess cycles are enriched in functions related to immunity and response to pathogens. Genotype networks are representations of genetic variation data that can help understand unusual patterns of genomic variation.
Transcriptional activation of short interspersed elements by DNA-damaging agents.
Rudin, C M; Thompson, C B
2001-01-01
Short interspersed elements (SINEs), typified by the human Alu repeat, are RNA polymerase III (pol III)-transcribed sequences that replicate within the genome through an RNA intermediate. Replication of SINEs has been extensive in mammalian evolution: an estimated 5% of the human genome consists of Alu repeats. The mechanisms regulating transcription, reverse transcription, and reinsertion of SINE elements in genomic DNA are poorly understood. Here we report that expression of murine SINE transcripts of both the B1 and B2 classes is strongly upregulated after prolonged exposure to cisplatin, etoposide, or gamma radiation. A similar induction of Alu transcripts in human cells occurs under these conditions. This induction is not due to a general upregulation of pol III activity in either species. Genotoxic treatment of murine cells containing an exogenous human Alu element induced Alu transcription. Concomitant with the increased expression of SINEs, an increase in cellular reverse transcriptase was observed after exposure to these same DNA-damaging agents. These findings suggest that genomic damage may be an important activator of SINEs, and that SINE mobility may contribute to secondary malignancy after exposure to DNA-damaging chemotherapy.
Ecological genomics of adaptation and speciation in fungi.
Leducq, Jean-Baptiste
2014-01-01
Fungi play a central role in both ecosystems and human societies. This is in part because they have adopted a large diversity of life history traits to conquer a wide variety of ecological niches. Here, I review recent fungal genomics studies that explored the molecular origins and the adaptive significance of this diversity. First, macro-ecological genomics studies revealed that fungal genomes were highly remodelled during their evolution. This remodelling, in terms of genome organization and size, occurred through the proliferation of non-coding elements, gene compaction, gene loss and the expansion of large families of adaptive genes. These features vary greatly among fungal clades, and are correlated with different life history traits such as multicellularity, pathogenicity, symbiosis, and sexual reproduction. Second, micro-ecological genomics studies, based on population genomics, experimental evolution and quantitative trait loci approaches, have allowed a deeper exploration of early evolutionary steps of the above adaptations. Fungi, and especially budding yeasts, were used intensively to characterize early mutations and chromosomal rearrangements that underlie the acquisition of new adaptive traits allowing them to conquer new ecological niches and potentially leading to speciation. By uncovering the ecological factors and genomic modifications that underline adaptation, these studies showed that Fungi are powerful models for ecological genomics (eco-genomics), and that this approach, so far mainly developed in a few model species, should be expanded to the whole kingdom.
Comparative analysis of protein evolution in the genome of pre-epidemic and epidemic Zika virus.
Ramaiah, Arunachalam; Dai, Lei; Contreras, Deisy; Sinha, Sanjeev; Sun, Ren; Arumugaswami, Vaithilingaraja
2017-07-01
Zika virus (ZIKV) causes microcephaly in congenital infection, neurological disorders, and poor pregnancy outcome and no vaccine is available for use in humans or approved. Although ZIKV was first discovered in 1947, the exact mechanism of virus replication and pathogenesis remains unknown. Recent outbreaks of Zika virus in the Americas clearly suggest a human-mosquito cycle or urban cycle of transmission. Understanding the conserved and adaptive features in the evolution of ZIKV genome will provide a hint on the mechanism of ZIKV adaptation to a new cycle of transmission. Here, we show comprehensive analysis of protein evolution of ZIKV strains including the current 2015-16 outbreak. To identify the constraints on ZIKV evolution, selection pressure at individual codons, immune epitopes and co-evolving sites were analyzed. Phylogenetic trees show that the ZIKV strains of the Asian genotype form distinct cluster and share a common ancestor with African genotype. The TMRCA (Time to the Most Recent Common Ancestor) for the Asian lineage and the subsequently evolved Asian human strains was calculated at 88 and 34years ago, respectively. The proteome of current 2015/16 epidemic ZIKV strains of Asian genotype was found to be genetically conserved due to genome-wide negative selection, with limited positive selection. We identified a total of 16 amino acid substitutions in the epidemic and pre-epidemic strains from human, mosquito, and monkey hosts. Negatively selected amino acid sites of Envelope protein (E-protein) (positions 69, 166, and 174) and NS5 (292, 345, and 587) were located in central dimerization domains and C-terminal RNA-directed RNA polymerase regions, respectively. The predicted 137 (92 CD4 TCEs; 45 CD8 TCEs) immunogenic peptide chains comprising negatively selected amino acid sites can be considered as suitable target for sub-unit vaccine development, as these sites are less likely to generate immune-escape variants due to strong functional constrains operating on them. The targeted changes at the amino acid level may contribute to better adaptation of ZIKV strains to human-mosquito cycle or urban cycle of transmission. Copyright © 2017. Published by Elsevier B.V.
CRISPR Genome Engineering for Human Pluripotent Stem Cell Research
Chaterji, Somali; Ahn, Eun Hyun; Kim, Deok-Ho
2017-01-01
The emergence of targeted and efficient genome editing technologies, such as repurposed bacterial programmable nucleases (e.g., CRISPR-Cas systems), has abetted the development of cell engineering approaches. Lessons learned from the development of RNA-interference (RNA-i) therapies can spur the translation of genome editing, such as those enabling the translation of human pluripotent stem cell engineering. In this review, we discuss the opportunities and the challenges of repurposing bacterial nucleases for genome editing, while appreciating their roles, primarily at the epigenomic granularity. First, we discuss the evolution of high-precision, genome editing technologies, highlighting CRISPR-Cas9. They exist in the form of programmable nucleases, engineered with sequence-specific localizing domains, and with the ability to revolutionize human stem cell technologies through precision targeting with greater on-target activities. Next, we highlight the major challenges that need to be met prior to bench-to-bedside translation, often learning from the path-to-clinic of complementary technologies, such as RNA-i. Finally, we suggest potential bioinformatics developments and CRISPR delivery vehicles that can be deployed to circumvent some of the challenges confronting genome editing technologies en route to the clinic. PMID:29158838
Thèves, C; Biagini, P; Crubézy, E
2014-03-01
Smallpox is an infectious disease that is unique to humans, caused by a poxvirus. It is one of the most lethal of diseases; the virus variant Variola major has a mortality rate of 30%. People surviving this disease have life-long consequences, but also assured immunity. Historically, smallpox was recognized early in human populations. This led to prevention attempts--variolation, quarantine, and the isolation of infected subjects--until Jenner's discovery of the first steps of vaccination in the 18th century. After vaccination campaigns throughout the 19th and 20th centuries, the WHO declared the eradication of smallpox in 1980. With the development of microscopy techniques, the structural characterization of the virus began in the early 20th century. In 1990, the genomes of different smallpox viruses were determined; viruses could be classified in order to investigate their origin, diffusion, and evolution. To study the evolution and possible re-emergence of this viral pathogen, however, researchers can only use viral genomes collected during the 20th century. Cases of smallpox in ancient periods are sometimes well documented, so palaeomicrobiology and, more precisely, the study of ancient smallpox viral strains could be an exceptional opportunity. The analysis of poxvirus fragmented genomes could give new insights into the genetic evolution of the poxvirus. Recently, small fragments of the poxvirus genome were detected. With the genetic information obtained, a new phylogeny of smallpox virus was described. The interest in conducting studies on ancient strains is discussed, in order to explore the natural history of this disease. © 2014 The Authors Clinical Microbiology and Infection © 2014 European Society of Clinical Microbiology and Infectious Diseases.
The origins and impact of primate segmental duplications.
Marques-Bonet, Tomas; Girirajan, Santhosh; Eichler, Evan E
2009-10-01
Duplicated sequences are substrates for the emergence of new genes and are an important source of genetic instability associated with rare and common diseases. Analyses of primate genomes have shown an increase in the proportion of interspersed segmental duplications (SDs) within the genomes of humans and great apes. This contrasts with other mammalian genomes that seem to have their recently duplicated sequences organized in a tandem configuration. In this review, we focus on the mechanistic origin and impact of this difference with respect to evolution, genetic diversity and primate phenotype. Although many genomes will be sequenced in the future, resolution of this aspect of genomic architecture still requires high quality sequences and detailed analyses.
Miklós, István
2003-10-01
As more and more genomes have been sequenced, genomic data is rapidly accumulating. Genome-wide mutations are believed more neutral than local mutations such as substitutions, insertions and deletions, therefore phylogenetic investigations based on inversions, transpositions and inverted transpositions are less biased by the hypothesis on neutral evolution. Although efficient algorithms exist for obtaining the inversion distance of two signed permutations, there is no reliable algorithm when both inversions and transpositions are considered. Moreover, different type of mutations happen with different rates, and it is not clear how to weight them in a distance based approach. We introduce a Markov Chain Monte Carlo method to genome rearrangement based on a stochastic model of evolution, which can estimate the number of different evolutionary events needed to sort a signed permutation. The performance of the method was tested on simulated data, and the estimated numbers of different types of mutations were reliable. Human and Drosophila mitochondrial data were also analysed with the new method. The mixing time of the Markov Chain is short both in terms of CPU times and number of proposals. The source code in C is available on request from the author.
Ilkilic, Ilhan; Paul, Norbert W
2009-03-01
The goal of the Human Genome Diversity Project (HGDP) was to reconstruct the history of human evolution and the historical and geographical distribution of populations with the help of scientific research. Through this kind of research, the entire spectrum of genetic diversity to be found in the human species was to be explored with the hope of generating a better understanding of the history of humankind. An important part of this genome diversity research consists in taking blood and tissue samples from indigenous populations. For various reasons, it has not been possible to execute this project in the planned scope and form to date. Nevertheless, genomic diversity research addresses complex issues which prove to be highly relevant from the perspective of research ethics, transcultural medical ethics, and cultural philosophy. In the article at hand, we discuss these ethical issues as illustrated by the HGDP. This investigation focuses on the confrontation of culturally diverse images of humans and their cosmologies within the framework of genome diversity research and the ethical questions it raises. We argue that in addition to complex questions pertaining to research ethics such as informed consent and autonomy of probands, genome diversity research also has a cultural-philosophical, meta-ethical, and phenomenological dimension which must be taken into account in ethical discourses. Acknowledging this fact, we attempt to show the limits of current guidelines used in international genome diversity studies, following this up by a formulation of theses designed to facilitate an appropriate inquiry and ethical evaluation of intercultural dimensions of genome research.
Evolutionary dynamics of selfish DNA explains the abundance distribution of genomic subsequences
Sheinman, Michael; Ramisch, Anna; Massip, Florian; Arndt, Peter F.
2016-01-01
Since the sequencing of large genomes, many statistical features of their sequences have been found. One intriguing feature is that certain subsequences are much more abundant than others. In fact, abundances of subsequences of a given length are distributed with a scale-free power-law tail, resembling properties of human texts, such as Zipf’s law. Despite recent efforts, the understanding of this phenomenon is still lacking. Here we find that selfish DNA elements, such as those belonging to the Alu family of repeats, dominate the power-law tail. Interestingly, for the Alu elements the power-law exponent increases with the length of the considered subsequences. Motivated by these observations, we develop a model of selfish DNA expansion. The predictions of this model qualitatively and quantitatively agree with the empirical observations. This allows us to estimate parameters for the process of selfish DNA spreading in a genome during its evolution. The obtained results shed light on how evolution of selfish DNA elements shapes non-trivial statistical properties of genomes. PMID:27488939
Single-cell sequencing and tumorigenesis: improved understanding of tumor evolution and metastasis.
Ellsworth, Darrell L; Blackburn, Heather L; Shriver, Craig D; Rabizadeh, Shahrooz; Soon-Shiong, Patrick; Ellsworth, Rachel E
2017-12-01
Extensive genomic and transcriptomic heterogeneity in human cancer often negatively impacts treatment efficacy and survival, thus posing a significant ongoing challenge for modern treatment regimens. State-of-the-art DNA- and RNA-sequencing methods now provide high-resolution genomic and gene expression portraits of individual cells, facilitating the study of complex molecular heterogeneity in cancer. Important developments in single-cell sequencing (SCS) technologies over the past 5 years provide numerous advantages over traditional sequencing methods for understanding the complexity of carcinogenesis, but significant hurdles must be overcome before SCS can be clinically useful. In this review, we: (1) highlight current methodologies and recent technological advances for isolating single cells, single-cell whole-genome and whole-transcriptome amplification using minute amounts of nucleic acids, and SCS, (2) summarize research investigating molecular heterogeneity at the genomic and transcriptomic levels and how this heterogeneity affects clonal evolution and metastasis, and (3) discuss the promise for integrating SCS in the clinical care arena for improved patient care.
DOE Office of Scientific and Technical Information (OSTI.GOV)
Nord, Alex S.; Pattabiraman, Kartik; Visel, Axel
The forebrain is the seat of higher-order brain functions, and many human neuropsychiatric disorders are due to genetic defects affecting forebrain development, making it imperative to understand the underlying genetic circuitry. We report that recent progress now makes it possible to begin fully elucidating the genomic regulatory mechanisms that control forebrain gene expression. Here, we discuss the current knowledge of how transcription factors drive gene expression programs through their interactions with cis-acting genomic elements, such as enhancers; how analyses of chromatin and DNA modifications provide insights into gene expression states; and how these approaches yield insights into the evolution ofmore » the human brain.« less
Evolution of genome size and genomic GC content in carnivorous holokinetics (Droseraceae).
Veleba, Adam; Šmarda, Petr; Zedek, František; Horová, Lucie; Šmerda, Jakub; Bureš, Petr
2017-02-01
Studies in the carnivorous family Lentibulariaceae in the last years resulted in the discovery of the smallest plant genomes and an unusual pattern of genomic GC content evolution. However, scarcity of genomic data in other carnivorous clades still prevents a generalization of the observed patterns. Here the aim was to fill this gap by mapping genome evolution in the second largest carnivorous family, Droseraceae, where this evolution may be affected by chromosomal holokinetism in Drosera METHODS: The genome size and genomic GC content of 71 Droseraceae species were measured by flow cytometry. A dated phylogeny was constructed, and the evolution of both genomic parameters and their relationship to species climatic niches were tested using phylogeny-based statistics. The 2C genome size of Droseraceae varied between 488 and 10 927 Mbp, and the GC content ranged between 37·1 and 44·7 %. The genome sizes and genomic GC content of carnivorous and holocentric species did not differ from those of their non-carnivorous and monocentric relatives. The genomic GC content positively correlated with genome size and annual temperature fluctuations. The genome size and chromosome numbers were inversely correlated in the Australian clade of Drosera CONCLUSIONS: Our results indicate that neither carnivory (nutrient scarcity) nor the holokinetism have a prominent effect on size and DNA base composition of Droseraceae genomes. However, the holokinetic drive seems to affect karyotype evolution in one of the major clades of Drosera Our survey confirmed that the evolution of GC content is tightly connected with the evolution of genome size and also with environmental conditions. © The Author 2016. Published by Oxford University Press on behalf of the Annals of Botany Company. All rights reserved. For Permissions, please email: journals.permissions@oup.com.
USDA-ARS?s Scientific Manuscript database
Transmission of influenza A virus (IAV) from humans to swine occurs with relative frequency and is a critical contributor to swine IAV diversity. Subsequent to the introduction of these human seasonal lineages, there is often reassortment with endemic viruses and antigenic drift. To address whether ...
Vallée, Geneviève C; Muñoz, Daniella Santos; Sankoff, David
2016-11-11
Of the approximately two hundred sequenced plant genomes, how many and which ones were sequenced motivated by strictly or largely scientific considerations, and how many by chiefly economic, in a wide sense, incentives? And how large a role does publication opportunity play? In an integration of multiple disparate databases and other sources of information, we collect and analyze data on the size (number of species) in the plant orders and families containing sequenced genomes, on the trade value of these species, and of all the same-family or same-order species, and on the publication priority within the family and order. These data are subjected to multiple regression and other statistical analyses. We find that despite the initial importance of model organisms, it is clearly economic considerations that outweigh others in the choice of genome to be sequenced. This has important implications for generalizations about plant genomes, since human choices of plants to harvest (and cultivate) will have incurred many biases with respect to phenotypic characteristics and hence of genomic properties, and recent genomic evolution will also have been affected by human agricultural practices.
Alekseyenko, Alexander V.; Kim, Namshin; Lee, Christopher J.
2007-01-01
Association of alternative splicing (AS) with accelerated rates of exon evolution in some organisms has recently aroused widespread interest in its role in evolution of eukaryotic gene structure. Previous studies were limited to analysis of exon creation or lost events in mouse and/or human only. Our multigenome approach provides a way for (1) distinguishing creation and loss events on the large scale; (2) uncovering details of the evolutionary mechanisms involved; (3) estimating the corresponding rates over a wide range of evolutionary times and organisms; and (4) assessing the impact of AS on those evolutionary rates. We use previously unpublished independent analyses of alternative splicing in five species (human, mouse, dog, cow, and zebrafish) from the ASAP database combined with genomewide multiple alignment of 17 genomes to analyze exon creation and loss of both constitutively and alternatively spliced exons in mammals, fish, and birds. Our analysis provides a comprehensive database of exon creation and loss events over 360 million years of vertebrate evolution, including tens of thousands of alternative and constitutive exons. We find that exon inclusion level is inversely related to the rate of exon creation. In addition, we provide a detailed in-depth analysis of mechanisms of exon creation and loss, which suggests that a large fraction of nonrepetitive created exons are results of ab initio creation from purely intronic sequences. Our data indicate an important role for alternative splicing in creation of new exons and provide a useful novel database resource for future genome evolution research. PMID:17369312
Wuhan large pig roundworm virus identified in human feces in Brazil.
Luchs, Adriana; Leal, Elcio; Komninakis, Shirley Vasconcelos; de Pádua Milagres, Flavio Augusto; Brustulin, Rafael; da Aparecida Rodrigues Teles, Maria; Gill, Danielle Elise; Deng, Xutao; Delwart, Eric; Sabino, Ester Cerdeira; da Costa, Antonio Charlys
2018-03-28
We report here the complete genome sequence of a bipartite virus, herein denoted WLPRV/human/BRA/TO-34/201, from a sample collected in 2015 from a two-year-old child in Brazil presenting acute gastroenteritis. The virus has 98-99% identity (segments 2 and 1, respectively) with the Wuhan large pig roundworm virus (unclassified RNA virus) that was recently discovered in the stomachs of pigs from China. This is the first report of a Wuhan large pig roundworm virus detected in human specimens, and the second genome described worldwide. However, the generation of more sequence data and further functional studies are required to fully understand the ecology, epidemiology, and evolution of this new unclassified virus.
The Sequenced Angiosperm Genomes and Genome Databases.
Chen, Fei; Dong, Wei; Zhang, Jiawei; Guo, Xinyue; Chen, Junhao; Wang, Zhengjia; Lin, Zhenguo; Tang, Haibao; Zhang, Liangsheng
2018-01-01
Angiosperms, the flowering plants, provide the essential resources for human life, such as food, energy, oxygen, and materials. They also promoted the evolution of human, animals, and the planet earth. Despite the numerous advances in genome reports or sequencing technologies, no review covers all the released angiosperm genomes and the genome databases for data sharing. Based on the rapid advances and innovations in the database reconstruction in the last few years, here we provide a comprehensive review for three major types of angiosperm genome databases, including databases for a single species, for a specific angiosperm clade, and for multiple angiosperm species. The scope, tools, and data of each type of databases and their features are concisely discussed. The genome databases for a single species or a clade of species are especially popular for specific group of researchers, while a timely-updated comprehensive database is more powerful for address of major scientific mysteries at the genome scale. Considering the low coverage of flowering plants in any available database, we propose construction of a comprehensive database to facilitate large-scale comparative studies of angiosperm genomes and to promote the collaborative studies of important questions in plant biology.
The Sequenced Angiosperm Genomes and Genome Databases
Chen, Fei; Dong, Wei; Zhang, Jiawei; Guo, Xinyue; Chen, Junhao; Wang, Zhengjia; Lin, Zhenguo; Tang, Haibao; Zhang, Liangsheng
2018-01-01
Angiosperms, the flowering plants, provide the essential resources for human life, such as food, energy, oxygen, and materials. They also promoted the evolution of human, animals, and the planet earth. Despite the numerous advances in genome reports or sequencing technologies, no review covers all the released angiosperm genomes and the genome databases for data sharing. Based on the rapid advances and innovations in the database reconstruction in the last few years, here we provide a comprehensive review for three major types of angiosperm genome databases, including databases for a single species, for a specific angiosperm clade, and for multiple angiosperm species. The scope, tools, and data of each type of databases and their features are concisely discussed. The genome databases for a single species or a clade of species are especially popular for specific group of researchers, while a timely-updated comprehensive database is more powerful for address of major scientific mysteries at the genome scale. Considering the low coverage of flowering plants in any available database, we propose construction of a comprehensive database to facilitate large-scale comparative studies of angiosperm genomes and to promote the collaborative studies of important questions in plant biology. PMID:29706973
Linkage of A-to-I RNA Editing in Metazoans and the Impact on Genome Evolution.
Duan, Yuange; Dou, Shengqian; Zhang, Hong; Wu, Changcheng; Wu, Mingming; Lu, Jian
2018-01-01
The adenosine-to-inosine (A-to-I) RNA editomes have been systematically characterized in various metazoan species, and many editing sites were found in clusters. However, it remains unclear whether the clustered editing sites tend to be linked in the same RNA molecules or not. By adopting a method originally designed to detect linkage disequilibrium of DNA mutations, we examined the editomes of ten metazoan species and detected extensive linkage of editing in Drosophila and cephalopods. The prevalent linkages of editing in these two clades, many of which are conserved between closely related species and might be associated with the adaptive proteomic recoding, are maintained by natural selection at the cost of genome evolution. Nevertheless, in worms and humans, we only detected modest proportions of linked editing events, the majority of which were not conserved. Furthermore, the linkage of editing in coding regions of worms and humans might be overall deleterious, which drives the evolution of DNA sites to escape promiscuous editing. Altogether, our results suggest that the linkage landscape of A-to-I editing has evolved during metazoan evolution. This present study also suggests that linkage of editing should be considered in elucidating the functional consequences of RNA editing. © The Author 2017. Published by Oxford University Press on behalf of the Society for Molecular Biology and Evolution.
"Orphan" retrogenes in the human genome.
Ciomborowska, Joanna; Rosikiewicz, Wojciech; Szklarczyk, Damian; Makałowski, Wojciech; Makałowska, Izabela
2013-02-01
Gene duplicates generated via retroposition were long thought to be pseudogenized and consequently decayed. However, a significant number of these genes escaped their evolutionary destiny and evolved into functional genes. Despite multiple studies, the number of functional retrogenes in human and other genomes remains unclear. We performed a comparative analysis of human, chicken, and worm genomes to identify "orphan" retrogenes, that is, retrogenes that have replaced their progenitors. We located 25 such candidates in the human genome. All of these genes were previously known, and the majority has been intensively studied. Despite this, they have never been recognized as retrogenes. Analysis revealed that the phenomenon of replacing parental genes with their retrocopies has been taking place over the entire span of animal evolution. This process was often species specific and contributed to interspecies differences. Surprisingly, these retrogenes, which should evolve in a more relaxed mode, are subject to a very strong purifying selection, which is, on average, two and a half times stronger than other human genes. Also, for retrogenes, they do not show a typical overall tendency for a testis-specific expression. Notably, seven of them are associated with human diseases. Recognizing them as "orphan" retrocopies, which have different regulatory machinery than their parents, is important for any disease studies in model organisms, especially when discoveries made in one species are transferred to humans.
Endangered Species Hold Clues to Human Evolution
Bejerano, Gill; Salama, Sofie R.; Haussler, David
2010-01-01
We report that 18 conserved, and by extension functional, elements in the human genome are the result of retroposon insertions that are evolving under purifying selection in mammals. We show evidence that 1 of the 18 elements regulates the expression of ASXL3 during development by encoding an alternatively spliced exon that causes nonsense-mediated decay of the transcript. The retroposon that gave rise to these functional elements was quickly inactivated in the mammalian ancestor, and all traces of it have been lost due to neutral decay. However, the tuatara has maintained a near-ancestral version of this retroposon in its extant genome, which allows us to connect the 18 human elements to the evolutionary events that created them. We propose that conservation efforts over more than 100 years may not have only prevented the tuatara from going extinct but could have preserved our ability to understand the evolutionary history of functional elements in the human genome. Through simulations, we argue that species with historically low population sizes are more likely to harbor ancient mobile elements for long periods of time and in near-ancestral states, making these species indispensable in understanding the evolutionary origin of functional elements in the human genome. PMID:20332163
Feliziani, Sofía; Moyano, Alejandro J.; Di Rienzo, Julio A.; Krogh Johansen, Helle; Molin, Søren; Smania, Andrea M.
2014-01-01
The advent of high-throughput sequencing techniques has made it possible to follow the genomic evolution of pathogenic bacteria by comparing longitudinally collected bacteria sampled from human hosts. Such studies in the context of chronic airway infections by Pseudomonas aeruginosa in cystic fibrosis (CF) patients have indicated high bacterial population diversity. Such diversity may be driven by hypermutability resulting from DNA mismatch repair system (MRS) deficiency, a common trait evolved by P. aeruginosa strains in CF infections. No studies to date have utilized whole-genome sequencing to investigate within-host population diversity or long-term evolution of mutators in CF airways. We sequenced the genomes of 13 and 14 isolates of P. aeruginosa mutator populations from an Argentinian and a Danish CF patient, respectively. Our collection of isolates spanned 6 and 20 years of patient infection history, respectively. We sequenced 11 isolates from a single sample from each patient to allow in-depth analysis of population diversity. Each patient was infected by clonal populations of bacteria that were dominated by mutators. The in vivo mutation rate of the populations was ∼100 SNPs/year–∼40-fold higher than rates in normo-mutable populations. Comparison of the genomes of 11 isolates from the same sample showed extensive within-patient genomic diversification; the populations were composed of different sub-lineages that had coexisted for many years since the initial colonization of the patient. Analysis of the mutations identified genes that underwent convergent evolution across lineages and sub-lineages, suggesting that the genes were targeted by mutation to optimize pathogenic fitness. Parallel evolution was observed in reduction of overall catabolic capacity of the populations. These findings are useful for understanding the evolution of pathogen populations and identifying new targets for control of chronic infections. PMID:25330091
Darwinian evolution in the light of genomics
Koonin, Eugene V.
2009-01-01
Comparative genomics and systems biology offer unprecedented opportunities for testing central tenets of evolutionary biology formulated by Darwin in the Origin of Species in 1859 and expanded in the Modern Synthesis 100 years later. Evolutionary-genomic studies show that natural selection is only one of the forces that shape genome evolution and is not quantitatively dominant, whereas non-adaptive processes are much more prominent than previously suspected. Major contributions of horizontal gene transfer and diverse selfish genetic elements to genome evolution undermine the Tree of Life concept. An adequate depiction of evolution requires the more complex concept of a network or ‘forest’ of life. There is no consistent tendency of evolution towards increased genomic complexity, and when complexity increases, this appears to be a non-adaptive consequence of evolution under weak purifying selection rather than an adaptation. Several universals of genome evolution were discovered including the invariant distributions of evolutionary rates among orthologous genes from diverse genomes and of paralogous gene family sizes, and the negative correlation between gene expression level and sequence evolution rate. Simple, non-adaptive models of evolution explain some of these universals, suggesting that a new synthesis of evolutionary biology might become feasible in a not so remote future. PMID:19213802
Mutation as a Stress Response and the Regulation of Evolvability
Galhardo, Rodrigo S.; Hastings, P. J.; Rosenberg, Susan M.
2010-01-01
Our concept of a stable genome is evolving to one in which genomes are plastic and responsive to environmental changes. Growing evidence shows that a variety of environmental stresses induce genomic instability in bacteria, yeast, and human cancer cells, generating occasional fitter mutants and potentially accelerating adaptive evolution. The emerging molecular mechanisms of stress-induced mutagenesis vary but share telling common components that underscore two common themes. The first is the regulation of mutagenesis in time by cellular stress responses, which promote random mutations specifically when cells are poorly adapted to their environments, i.e., when they are stressed. A second theme is the possible restriction of random mutagenesis in genomic space, achieved via coupling of mutation-generating machinery to local events such as DNA-break repair or transcription. Such localization may minimize accumulation of deleterious mutations in the genomes of rare fitter mutants, and promote local concerted evolution. Although mutagenesis induced by stresses other than direct damage to DNA was previously controversial, evidence for the existence of various stress-induced mutagenesis programs is now overwhelming and widespread. Such mechanisms probably fuel evolution of microbial pathogenesis and antibiotic-resistance, and tumor progression and chemotherapy resistance, all of which occur under stress, driven by mutations. The emerging commonalities in stress-induced-mutation mechanisms provide hope for new therapeutic interventions for all of these processes. PMID:17917874
Principles of regulatory information conservation between mouse and human.
Cheng, Yong; Ma, Zhihai; Kim, Bong-Hyun; Wu, Weisheng; Cayting, Philip; Boyle, Alan P; Sundaram, Vasavi; Xing, Xiaoyun; Dogan, Nergiz; Li, Jingjing; Euskirchen, Ghia; Lin, Shin; Lin, Yiing; Visel, Axel; Kawli, Trupti; Yang, Xinqiong; Patacsil, Dorrelyn; Keller, Cheryl A; Giardine, Belinda; Kundaje, Anshul; Wang, Ting; Pennacchio, Len A; Weng, Zhiping; Hardison, Ross C; Snyder, Michael P
2014-11-20
To broaden our understanding of the evolution of gene regulation mechanisms, we generated occupancy profiles for 34 orthologous transcription factors (TFs) in human-mouse erythroid progenitor, lymphoblast and embryonic stem-cell lines. By combining the genome-wide transcription factor occupancy repertoires, associated epigenetic signals, and co-association patterns, here we deduce several evolutionary principles of gene regulatory features operating since the mouse and human lineages diverged. The genomic distribution profiles, primary binding motifs, chromatin states, and DNA methylation preferences are well conserved for TF-occupied sequences. However, the extent to which orthologous DNA segments are bound by orthologous TFs varies both among TFs and with genomic location: binding at promoters is more highly conserved than binding at distal elements. Notably, occupancy-conserved TF-occupied sequences tend to be pleiotropic; they function in several tissues and also co-associate with many TFs. Single nucleotide variants at sites with potential regulatory functions are enriched in occupancy-conserved TF-occupied sequences.
Human adaptation and population differentiation in the light of ancient genomes
Key, Felix M.; Fu, Qiaomei; Romagné, Frédéric; Lachmann, Michael; Andrés, Aida M.
2016-01-01
The influence of positive selection sweeps in human evolution is increasingly debated, although our ability to detect them is hampered by inherent uncertainties in the timing of past events. Ancient genomes provide snapshots of allele frequencies in the past and can help address this question. We combine modern and ancient genomic data in a simple statistic (DAnc) to time allele frequency changes, and investigate the role of drift and adaptation in population differentiation. Only 30% of the most strongly differentiated alleles between Africans and Eurasians changed in frequency during the colonization of Eurasia, but in Europe these alleles are enriched in genic and putatively functional alleles to an extent only compatible with local adaptation. Adaptive alleles—especially those associated with pigmentation—are mostly of hunter-gatherer origin, although lactose persistence arose in a haplotype present in farmers. These results provide evidence for a role of local adaptation in human population differentiation. PMID:26988143
LINE-1 protein localization and functional dynamics during the cell cycle
Wudzinska, Aleksandra; Sun, Xiaoji; Andrade, Joshua; Nayak, Shruti; Kahler, David J; Badri, Sana; LaCava, John; Ueberheide, Beatrix; Yun, Chi Y; Fenyö, David
2018-01-01
LINE-1/L1 retrotransposon sequences comprise 17% of the human genome. Among the many classes of mobile genetic elements, L1 is the only autonomous retrotransposon that still drives human genomic plasticity today. Through its co-evolution with the human genome, L1 has intertwined itself with host cell biology. However, a clear understanding of L1’s lifecycle and the processes involved in restricting its insertion and intragenomic spread remains elusive. Here we identify modes of L1 proteins’ entrance into the nucleus, a necessary step for L1 proliferation. Using functional, biochemical, and imaging approaches, we also show a clear cell cycle bias for L1 retrotransposition that peaks during the S phase. Our observations provide a basis for novel interpretations about the nature of nuclear and cytoplasmic L1 ribonucleoproteins (RNPs) and the potential role of DNA replication in L1 retrotransposition. PMID:29309036
de la Fuente, José; Díez-Delgado, Iratxe; Contreras, Marinela; Vicente, Joaquín; Cabezas-Cruz, Alejandro; Tobes, Raquel; Manrique, Marina; López, Vladimir; Romero, Beatriz; Bezos, Javier; Dominguez, Lucas; Sevilla, Iker A; Garrido, Joseba M; Juste, Ramón; Madico, Guillermo; Jones-López, Edward; Gortazar, Christian
2015-11-01
Mycobacteria of the Mycobacterium tuberculosis complex (MTBC) greatly affect humans and animals worldwide. The life cycle of mycobacteria is complex and the mechanisms resulting in pathogen infection and survival in host cells are not fully understood. Recently, comparative genomics analyses have provided new insights into the evolution and adaptation of the MTBC to survive inside the host. However, most of this information has been obtained using M. tuberculosis but not other members of the MTBC such as M. bovis and M. caprae. In this study, the genome of three M. bovis (MB1, MB3, MB4) and one M. caprae (MB2) field isolates with different lesion score, prevalence and host distribution phenotypes were sequenced. Genome sequence information was used for whole-genome and protein-targeted comparative genomics analysis with the aim of finding correlates with phenotypic variation with potential implications for tuberculosis (TB) disease risk assessment and control. At the whole-genome level the results of the first comparative genomics study of field isolates of M. bovis including M. caprae showed that as previously reported for M. tuberculosis, sequential chromosomal nucleotide substitutions were the main driver of the M. bovis genome evolution. The phylogenetic analysis provided a strong support for the M. bovis/M. caprae clade, but supported M. caprae as a separate species. The comparison of the MB1 and MB4 isolates revealed differences in genome sequence, including gene families that are important for bacterial infection and transmission, thus highlighting differences with functional implications between isolates otherwise classified with the same spoligotype. Strategic protein-targeted analysis using the ESX or type VII secretion system, proteins linking stress response with lipid metabolism, host T cell epitopes of mycobacteria, antigens and peptidoglycan assembly protein identified new genetic markers and candidate vaccine antigens that warrant further study to develop tools to evaluate risks for TB disease caused by M. bovis/M.caprae and for TB control in humans and animals.
Linkage of A-to-I RNA Editing in Metazoans and the Impact on Genome Evolution
Duan, Yuange; Dou, Shengqian; Zhang, Hong; Wu, Changcheng; Wu, Mingming
2018-01-01
Abstract The adenosine-to-inosine (A-to-I) RNA editomes have been systematically characterized in various metazoan species, and many editing sites were found in clusters. However, it remains unclear whether the clustered editing sites tend to be linked in the same RNA molecules or not. By adopting a method originally designed to detect linkage disequilibrium of DNA mutations, we examined the editomes of ten metazoan species and detected extensive linkage of editing in Drosophila and cephalopods. The prevalent linkages of editing in these two clades, many of which are conserved between closely related species and might be associated with the adaptive proteomic recoding, are maintained by natural selection at the cost of genome evolution. Nevertheless, in worms and humans, we only detected modest proportions of linked editing events, the majority of which were not conserved. Furthermore, the linkage of editing in coding regions of worms and humans might be overall deleterious, which drives the evolution of DNA sites to escape promiscuous editing. Altogether, our results suggest that the linkage landscape of A-to-I editing has evolved during metazoan evolution. This present study also suggests that linkage of editing should be considered in elucidating the functional consequences of RNA editing. PMID:29048557
Iraola, Gregorio; Pérez, Ruben; Naya, Hugo; Paolicchi, Fernando; Pastor, Eugenia; Valenzuela, Sebastián; Calleros, Lucía; Velilla, Alejandra; Hernández, Martín; Morsella, Claudia
2014-09-04
The genus Campylobacter includes some of the most relevant pathogens for human and animal health; the continuous effort in their characterization has also revealed new species putatively involved in different kind of infections. Nowadays, the available genomic data for the genus comprise a wide variety of species with different pathogenic potential and niche preferences. In this work, we contribute to enlarge this available information presenting the first genome for the species Campylobacter sputorum bv. sputorum and use this and the already sequenced organisms to analyze the emergence and evolution of pathogenicity and niche preferences among Campylobacter species. We found that campylobacters can be unequivocally distinguished in established and putative pathogens depending on their repertory of virulence genes, which have been horizontally acquired from other bacteria because the nonpathogenic Campylobacter ancestor emerged, and posteriorly interchanged between some members of the genus. Additionally, we demonstrated the role of both horizontal gene transfers and diversifying evolution in niche preferences, being able to distinguish genetic features associated to the tropism for oral, genital, and gastrointestinal tissues. In particular, we highlight the role of nonsynonymous evolution of disulphide bond proteins, the invasion antigen B (CiaB), and other secreted proteins in the determination of niche preferences. Our results arise from assessing the previously unmet goal of considering the whole available Campylobacter diversity for genome comparisons, unveiling notorious genetic features that could explain particular phenotypes and set the basis for future research in Campylobacter biology. © The Author(s) 2014. Published by Oxford University Press on behalf of the Society for Molecular Biology and Evolution.
Rühli, Frank Jakobus; Henneberg, Maciej
2013-04-29
Evolutionary medicine (EM) is a growing field focusing on the evolutionary basis of human diseases and their changes through time. To date, the majority of EM studies have used pure theories of hominin macroevolution to explain the present-day state of human health. Here, we propose a different approach by addressing more empirical and health-oriented research concerning past, current and future microevolutionary changes of human structure, functions and pathologies. Studying generation-to-generation changes of human morphology that occurred in historical times, and still occur in present-day populations under the forces of evolution, helps to explain medical conditions and warns clinicians that their current practices may influence future humans. Also, analyzing historic tissue specimens such as mummies is crucial in order to address the molecular evolution of pathogens, of the human genome, and their coadaptations.
A universal genomic coordinate translator for comparative genomics
2014-01-01
Background Genomic duplications constitute major events in the evolution of species, allowing paralogous copies of genes to take on fine-tuned biological roles. Unambiguously identifying the orthology relationship between copies across multiple genomes can be resolved by synteny, i.e. the conserved order of genomic sequences. However, a comprehensive analysis of duplication events and their contributions to evolution would require all-to-all genome alignments, which increases at N2 with the number of available genomes, N. Results Here, we introduce Kraken, software that omits the all-to-all requirement by recursively traversing a graph of pairwise alignments and dynamically re-computing orthology. Kraken scales linearly with the number of targeted genomes, N, which allows for including large numbers of genomes in analyses. We first evaluated the method on the set of 12 Drosophila genomes, finding that orthologous correspondence computed indirectly through a graph of multiple synteny maps comes at minimal cost in terms of sensitivity, but reduces overall computational runtime by an order of magnitude. We then used the method on three well-annotated mammalian genomes, human, mouse, and rat, and show that up to 93% of protein coding transcripts have unambiguous pairwise orthologous relationships across the genomes. On a nucleotide level, 70 to 83% of exons match exactly at both splice junctions, and up to 97% on at least one junction. We last applied Kraken to an RNA-sequencing dataset from multiple vertebrates and diverse tissues, where we confirmed that brain-specific gene family members, i.e. one-to-many or many-to-many homologs, are more highly correlated across species than single-copy (i.e. one-to-one homologous) genes. Not limited to protein coding genes, Kraken also identifies thousands of newly identified transcribed loci, likely non-coding RNAs that are consistently transcribed in human, chimpanzee and gorilla, and maintain significant correlation of expression levels across species. Conclusions Kraken is a computational genome coordinate translator that facilitates cross-species comparisons, distinguishes orthologs from paralogs, and does not require costly all-to-all whole genome mappings. Kraken is freely available under LPGL from http://github.com/nedaz/kraken. PMID:24976580
A universal genomic coordinate translator for comparative genomics.
Zamani, Neda; Sundström, Görel; Meadows, Jennifer R S; Höppner, Marc P; Dainat, Jacques; Lantz, Henrik; Haas, Brian J; Grabherr, Manfred G
2014-06-30
Genomic duplications constitute major events in the evolution of species, allowing paralogous copies of genes to take on fine-tuned biological roles. Unambiguously identifying the orthology relationship between copies across multiple genomes can be resolved by synteny, i.e. the conserved order of genomic sequences. However, a comprehensive analysis of duplication events and their contributions to evolution would require all-to-all genome alignments, which increases at N2 with the number of available genomes, N. Here, we introduce Kraken, software that omits the all-to-all requirement by recursively traversing a graph of pairwise alignments and dynamically re-computing orthology. Kraken scales linearly with the number of targeted genomes, N, which allows for including large numbers of genomes in analyses. We first evaluated the method on the set of 12 Drosophila genomes, finding that orthologous correspondence computed indirectly through a graph of multiple synteny maps comes at minimal cost in terms of sensitivity, but reduces overall computational runtime by an order of magnitude. We then used the method on three well-annotated mammalian genomes, human, mouse, and rat, and show that up to 93% of protein coding transcripts have unambiguous pairwise orthologous relationships across the genomes. On a nucleotide level, 70 to 83% of exons match exactly at both splice junctions, and up to 97% on at least one junction. We last applied Kraken to an RNA-sequencing dataset from multiple vertebrates and diverse tissues, where we confirmed that brain-specific gene family members, i.e. one-to-many or many-to-many homologs, are more highly correlated across species than single-copy (i.e. one-to-one homologous) genes. Not limited to protein coding genes, Kraken also identifies thousands of newly identified transcribed loci, likely non-coding RNAs that are consistently transcribed in human, chimpanzee and gorilla, and maintain significant correlation of expression levels across species. Kraken is a computational genome coordinate translator that facilitates cross-species comparisons, distinguishes orthologs from paralogs, and does not require costly all-to-all whole genome mappings. Kraken is freely available under LPGL from http://github.com/nedaz/kraken.
Structure of Ljungan virus provides insight into genome packaging of this picornavirus
NASA Astrophysics Data System (ADS)
Zhu, Ling; Wang, Xiangxi; Ren, Jingshan; Porta, Claudine; Wenham, Hannah; Ekström, Jens-Ola; Panjwani, Anusha; Knowles, Nick J.; Kotecha, Abhay; Siebert, C. Alistair; Lindberg, A. Michael; Fry, Elizabeth E.; Rao, Zihe; Tuthill, Tobias J.; Stuart, David I.
2015-10-01
Picornaviruses are responsible for a range of human and animal diseases, but how their RNA genome is packaged remains poorly understood. A particularly poorly studied group within this family are those that lack the internal coat protein, VP4. Here we report the atomic structure of one such virus, Ljungan virus, the type member of the genus Parechovirus B, which has been linked to diabetes and myocarditis in humans. The 3.78-Å resolution cryo-electron microscopy structure shows remarkable features, including an extended VP1 C terminus, forming a major protuberance on the outer surface of the virus, and a basic motif at the N terminus of VP3, binding to which orders some 12% of the viral genome. This apparently charge-driven RNA attachment suggests that this branch of the picornaviruses uses a different mechanism of genome encapsidation, perhaps explored early in the evolution of picornaviruses.
Structure of Ljungan virus provides insight into genome packaging of this picornavirus.
Zhu, Ling; Wang, Xiangxi; Ren, Jingshan; Porta, Claudine; Wenham, Hannah; Ekström, Jens-Ola; Panjwani, Anusha; Knowles, Nick J; Kotecha, Abhay; Siebert, C Alistair; Lindberg, A Michael; Fry, Elizabeth E; Rao, Zihe; Tuthill, Tobias J; Stuart, David I
2015-10-08
Picornaviruses are responsible for a range of human and animal diseases, but how their RNA genome is packaged remains poorly understood. A particularly poorly studied group within this family are those that lack the internal coat protein, VP4. Here we report the atomic structure of one such virus, Ljungan virus, the type member of the genus Parechovirus B, which has been linked to diabetes and myocarditis in humans. The 3.78-Å resolution cryo-electron microscopy structure shows remarkable features, including an extended VP1 C terminus, forming a major protuberance on the outer surface of the virus, and a basic motif at the N terminus of VP3, binding to which orders some 12% of the viral genome. This apparently charge-driven RNA attachment suggests that this branch of the picornaviruses uses a different mechanism of genome encapsidation, perhaps explored early in the evolution of picornaviruses.
Badenhorst, Daleen; Hillier, LaDeana W.; Literman, Robert; Montiel, Eugenia Elisabet; Radhakrishnan, Srihari; Shen, Yingjia; Minx, Patrick; Janes, Daniel E.; Warren, Wesley C.; Edwards, Scott V.; Valenzuela, Nicole
2015-01-01
Comparative genomics continues illuminating amniote genome evolution, but for many lineages our understanding remains incomplete. Here, we refine the assembly (CPI 3.0.3 NCBI AHGY00000000.2) and develop a cytogenetic map of the painted turtle (Chrysemys picta—CPI) genome, the first in turtles and in vertebrates with temperature-dependent sex determination. A comparison of turtle genomes with those of chicken, selected nonavian reptiles, and human revealed shared and novel genomic features, such as numerous chromosomal rearrangements. The largest conserved syntenic blocks between birds and turtles exist in four macrochromosomes, whereas rearrangements were evident in these and other chromosomes, disproving that turtles and birds retain fully conserved macrochromosomes for greater than 300 Myr. C-banding revealed large heterochromatic blocks in the centromeric region of only few chromosomes. The nucleolar-organizing region (NOR) mapped to a single CPI microchromosome, whereas in some turtles and lizards the NOR maps to nonhomologous sex-chromosomes, thus revealing independent translocations of the NOR in various reptilian lineages. There was no evidence for recent chromosomal fusions as interstitial telomeric-DNA was absent. Some repeat elements (CR1-like, Gypsy) were enriched in the centromeres of five chromosomes, whereas others were widespread in the CPI genome. Bacterial artificial chromosome (BAC) clones were hybridized to 18 of the 25 CPI chromosomes and anchored to a G-banded ideogram. Several CPI sex-determining genes mapped to five chromosomes, and homology was detected between yet other CPI autosomes and the globally nonhomologous sex chromosomes of chicken, other turtles, and squamates, underscoring the independent evolution of vertebrate sex-determining mechanisms. PMID:26108489
Stress-Induced Mutagenesis: Implications in Cancer and Drug Resistance.
Fitzgerald, Devon M; Hastings, P J; Rosenberg, Susan M
2017-03-01
Genomic instability underlies many cancers and generates genetic variation that drives cancer initiation, progression, and therapy resistance. In contrast with classical assumptions that mutations occur purely stochastically at constant, gradual rates, microbes, plants, flies, and human cancer cells possess mechanisms of mutagenesis that are upregulated by stress responses. These generate transient, genetic-diversity bursts that can propel evolution, specifically when cells are poorly adapted to their environments-that is, when stressed. We review molecular mechanisms of stress-response-dependent (stress-induced) mutagenesis that occur from bacteria to cancer, and are activated by starvation, drugs, hypoxia, and other stressors. We discuss mutagenic DNA break repair in Escherichia coli as a model for mechanisms in cancers. The temporal regulation of mutagenesis by stress responses and spatial restriction in genomes are common themes across the tree of life. Both can accelerate evolution, including the evolution of cancers. We discuss possible anti-evolvability drugs, aimed at targeting mutagenesis and other variation generators, that could be used to delay the evolution of cancer progression and therapy resistance.
Stress-Induced Mutagenesis: Implications in Cancer and Drug Resistance
Fitzgerald, Devon M.; Hastings, P.J.; Rosenberg, Susan M.
2017-01-01
Genomic instability underlies many cancers and generates genetic variation that drives cancer initiation, progression, and therapy resistance. In contrast with classical assumptions that mutations occur purely stochastically at constant, gradual rates, microbes, plants, flies, and human cancer cells possess mechanisms of mutagenesis that are upregulated by stress responses. These generate transient, genetic-diversity bursts that can propel evolution, specifically when cells are poorly adapted to their environments—that is, when stressed. We review molecular mechanisms of stress-response-dependent (stress-induced) mutagenesis that occur from bacteria to cancer, and are activated by starvation, drugs, hypoxia, and other stressors. We discuss mutagenic DNA break repair in Escherichia coli as a model for mechanisms in cancers. The temporal regulation of mutagenesis by stress responses and spatial restriction in genomes are common themes across the tree of life. Both can accelerate evolution, including the evolution of cancers. We discuss possible anti-evolvability drugs, aimed at targeting mutagenesis and other variation generators, that could be used to delay the evolution of cancer progression and therapy resistance. PMID:29399660
Plant comparative genetics after 10 years.
Gale, M D; Devos, K M
1998-10-23
The past 10 years have seen the discovery of unexpected levels of conservation of gene content and gene orders over millions of years of evolution within grasses, crucifers, legumes, some trees, and Solanaceae crops. Within the grasses, which include the three 500-million-ton-plus-per-year crops (wheat, maize, and rice), and the crucifers, which include all the Brassica crops, colinearity looks good enough to do most map-based cloning only in the small genome model species, rice and Arabidopsis. Elsewhere, knowledge gained in a few major crops is being pooled and applied across the board. The extrapolation of information from the well-studied species to orphan crops, which include many tropical species, is providing a solid base for their improvement. Genome rearrangements are giving new insights into evolution. In fact, comparative genetics is the key that will unlock the secrets of crop plants with genomes larger than that of humans.
The draft genome of the parasitic nematode Trichinella spiralis
Mitreva, Makedonka; Jasmer, Douglas P.; Zarlenga, Dante S.; Wang, Zhengyuan; Abubucker, Sahar; Martin, John; Taylor, Christina M.; Yin, Yong; Fulton, Lucinda; Minx, Pat; Yang, Shiaw-Pyng; Warren, Wesley C.; Fulton, Robert S.; Bhonagiri, Veena; Zhang, Xu; Hallsworth-Pepin, Kym; Clifton, Sandra W.; McCarter, James P.; Appleton, Judith; Mardis, Elaine R.; Wilson, Richard K.
2011-01-01
Genome-based studies of metazoan evolution are most informative when phylogenetically diverse species are incorporated in the analysis. As such, evolutionary trends within and outside the phylum Nematoda have been less revealing by focusing only on comparisons involving Caenorhabditis elegans. Herein, we present a draft of the 64 megabase nuclear genome of Trichinella spiralis, containing 15,808 protein coding genes. This parasitic nematode is an extant member of a clade that diverged early in the evolution of the phylum enabling identification of archetypical genes and molecular signatures exclusive to nematodes. Comparative analyses support intrachromosomal rearrangements across the phylum, disproportionate numbers of protein family deaths over births in parasitic vs. a non-parasitic nematode, and a preponderance of gene loss and gain events in nematodes relative to Drosophila melanogaster. This sequence and the panphylum characteristics identified herein will advance evolutionary studies and strategies to combat global parasites of humans, food animals and crops. PMID:21336279
Divergent genome evolution caused by regional variation in DNA gain and loss between human and mouse
Kortschak, R. Daniel
2018-01-01
The forces driving the accumulation and removal of non-coding DNA and ultimately the evolution of genome size in complex organisms are intimately linked to genome structure and organisation. Our analysis provides a novel method for capturing the regional variation of lineage-specific DNA gain and loss events in their respective genomic contexts. To further understand this connection we used comparative genomics to identify genome-wide individual DNA gain and loss events in the human and mouse genomes. Focusing on the distribution of DNA gains and losses, relationships to important structural features and potential impact on biological processes, we found that in autosomes, DNA gains and losses both followed separate lineage-specific accumulation patterns. However, in both species chromosome X was particularly enriched for DNA gain, consistent with its high L1 retrotransposon content required for X inactivation. We found that DNA loss was associated with gene-rich open chromatin regions and DNA gain events with gene-poor closed chromatin regions. Additionally, we found that DNA loss events tended to be smaller than DNA gain events suggesting that they were able to accumulate in gene-rich open chromatin regions due to their reduced capacity to interrupt gene regulatory architecture. GO term enrichment showed that mouse loss hotspots were strongly enriched for terms related to developmental processes. However, these genes were also located in regions with a high density of conserved elements, suggesting that despite high levels of DNA loss, gene regulatory architecture remained conserved. This is consistent with a model in which DNA gain and loss results in turnover or “churning” in regulatory element dense regions of open chromatin, where interruption of regulatory elements is selected against. PMID:29677183
The Role of Recombination in the Origin and Evolution of Alu Subfamilies
Teixeira-Silva, Ana; Silva, Raquel M.; Carneiro, João; Amorim, António; Azevedo, Luísa
2013-01-01
Alus are the most abundant and successful short interspersed nuclear elements found in primate genomes. In humans, they represent about 10% of the genome, although few are retrotransposition-competent and are clustered into subfamilies according to the source gene from which they evolved. Recombination between them can lead to genomic rearrangements of clinical and evolutionary significance. In this study, we have addressed the role of recombination in the origin of chimeric Alu source genes by the analysis of all known consensus sequences of human Alus. From the allelic diversity of Alu consensus sequences, validated in extant elements resulting from whole genome searches, distinct events of recombination were detected in the origin of particular subfamilies of AluS and AluY source genes. These results demonstrate that at least two subfamilies are likely to have emerged from ectopic Alu-Alu recombination, which stimulates further research regarding the potential of chimeric active Alus to punctuate the genome. PMID:23750218
Comparative genomic analysis of the false killer whale (Pseudorca crassidens) LMBR1 locus.
Kim, Dae-Won; Choi, Sang-Haeng; Kim, Ryong Nam; Kim, Sun-Hong; Paik, Sang-Gi; Nam, Seong-Hyeuk; Kim, Dong-Wook; Kim, Aeri; Kang, Aram; Park, Hong-Seog
2010-09-01
The sequencing and comparative genomic analysis of LMBR1 loci in mammals or other species, including human, would be very important in understanding evolutionary genetic changes underlying the evolution of limb development. In this regard, comparative genomic annotation of the false killer whale LMBR1 locus could shed new light on the evolution of limb development. We sequenced two false killer whale BAC clones, corresponding to 156 kb and 144 kb, respectively, harboring the tightly linked RNF32, LMBR1, and NOM1 genes. Our annotation of the false killer whale LMBR1 gene showed that it consists of 17 exons (1473 bp), in contrast to 18 exons (1596 bp) in human, and it displays 93.1% and 95.6% nucleotide and amino acid sequence similarity, respectively, compared with the human gene. In particular, we discovered that exon 10, deleted in the false killer whale LMBR1 gene, is present only in primates, and this fact strongly implies that exon 10 might be crucial in determining primate-specific limb development. ZRS and TFBS sequences have been well conserved across 11 species, suggesting that these regions could be involved in an important function of limb development and limb patterning. The neighboring gene RNF32 showed several lineage-conserved exons, such as exons 2 through 9 conserved in eutherian mammals, exons 3 through 9 conserved in mammals, and exons 5 through 9 conserved in vertebrates. The other neighboring gene, NOM1, had undergone a substitution (ATG→GTA) at the start codon, giving rise to a 36 bp shorter N-terminal sequence compared with the human sequence. Our comparative analysis of the false killer whale LMBR1 genomic locus provides important clues regarding the genetic regions that may play crucial roles in limb development and patterning.
Courseaux, Anouk; Richard, Florence; Grosgeorge, Josiane; Ortola, Christine; Viale, Agnes; Turc-Carel, Claude; Dutrillaux, Bernard; Gaudray, Patrick; Nahon, Jean-Louis
2003-01-01
Recent analyses of the structure of pericentromeric and subtelomeric regions have revealed that these particular regions of human chromosomes are often composed of blocks of duplicated genomic segments that have been associated with rapid evolutionary turnover among the genomes of closely related primates. In the present study, we show that euchromatic regions of human chromosome 5—5p14, 5p13, 5q13, 5q15–5q21—also display such an accumulation of segmental duplications. The structure, organization and evolution of those primate-specific sequences were studied in detail by combining in silico and comparative FISH analyses on human, chimpanzee, gorilla, orangutang, macaca, and capuchin chromosomes. Our results lend support to a two-step model of transposition duplication in the euchromatic regions, with a founder insertional event at the time of divergence between Platyrrhini and Catarrhini (25–35 million years ago) and an apparent burst of inter- and intrachromosomal duplications in the Hominidae lineage. Furthermore, phylogenetic analysis suggests that the chronology and, likely, molecular mechanisms, differ regarding the region of primary insertion—euchromatic versus pericentromeric regions. Lastly, we show that as their counterparts located near the heterochromatic region, the euchromatic segmental duplications have consistently reshaped their region of insertion during primate evolution, creating putative mosaic genes, and they are obvious candidates for causing ectopic rearrangements that have contributed to evolutionary/genomic instability. [Supplemental material is available online at www.genome.org. The following individuals kindly provided reagents, samples, or unpublished information as indicated in the paper: D. Le Paslier, A. McKenzie, J. Melki, C. Sargent, J. Scharf and S. Selig.] PMID:12618367
Has Rift Valley fever virus evolved with increasing severity in human populations in East Africa?
Baba, Marycelin; Masiga, Daniel K; Sang, Rosemary; Villinger, Jandouwe
2016-01-01
Rift Valley fever (RVF) outbreaks have occurred across eastern Africa from 1912 to 2010 approximately every 4–15 years, most of which have not been accompanied by significant epidemics in human populations. However, human epidemics during RVF outbreaks in eastern Africa have involved 478 deaths in 1998, 1107 reported cases with 350 deaths from 2006 to 2007 and 1174 cases with 241 deaths in 2008. We review the history of RVF outbreaks in eastern Africa to identify the epidemiological factors that could have influenced its increasing severity in humans. Diverse ecological factors influence outbreak frequency, whereas virus evolution has a greater impact on its virulence in hosts. Several factors could have influenced the lack of information on RVF in humans during earlier outbreaks, but the explosive nature of human RVF epidemics in recent years mirrors the evolutionary trend of the virus. Comparisons between isolates from different outbreaks have revealed an accumulation of genetic mutations and genomic reassortments that have diversified RVF virus genomes over several decades. The threat to humans posed by the diversified RVF virus strains increases the potential public health and socioeconomic impacts of future outbreaks. Understanding the shifting RVF epidemiology as determined by its evolution is key to developing new strategies for outbreak mitigation and prevention of future human RVF casualties. PMID:27329846
Lei, Huimeng; Yan, Zhangming; Sun, Xiaohong; Zhang, Yue; Wang, Jianhong; Ma, Caihong; Xu, Qunyuan; Wang, Rui; Jarvis, Erich D; Sun, Zhirong
2017-11-01
Human and several nonhuman species share the rare ability of modifying acoustic and/or syntactic features of sounds produced, i.e. vocal learning, which is the important neurobiological and behavioral substrate of human speech/language. This convergent trait was suggested to be associated with significant genomic convergence and best manifested at the ROBO-SLIT axon guidance pathway. Here we verified the significance of such genomic convergence and assessed its functional relevance to human speech/language using human genetic variation data. In normal human populations, we found the affected amino acid sites were well fixed and accompanied with significantly more associated protein-coding SNPs in the same genes than the rest genes. Diseased individuals with speech/language disorders have significant more low frequency protein coding SNPs but they preferentially occurred outside the affected genes. Such patients' SNPs were enriched in several functional categories including two axon guidance pathways (mediated by netrin and semaphorin) that interact with ROBO-SLITs. Four of the six patients have homozygous missense SNPs on PRAME gene family, one youngest gene family in human lineage, which possibly acts upon retinoic acid receptor signaling, similarly as FOXP2, to modulate axon guidance. Taken together, we suggest the axon guidance pathways (e.g. ROBO-SLIT, PRAME gene family) served as common targets for human speech/language evolution and related disorders. Copyright © 2017 Elsevier Inc. All rights reserved.
Adaptation of Enterococcus faecalis to daptomycin reveals an ordered progression to resistance.
Miller, Corwin; Kong, Jiayi; Tran, Truc T; Arias, Cesar A; Saxer, Gerda; Shamoo, Yousif
2013-11-01
With increasing numbers of hospital-acquired antibiotic resistant infections each year and staggering health care costs, there is a clear need for new antimicrobial agents, as well as novel strategies to extend their clinical efficacy. While genomic studies have provided a wealth of information about the alleles associated with adaptation to antibiotics, they do not provide essential information about the relative importance of genomic changes, their order of appearance, or potential epistatic relationships between adaptive changes. Here we used quantitative experimental evolution of a single polymorphic population in continuous culture with whole-genome sequencing and allelic frequency measurements to study daptomycin (DAP) resistance in the vancomycin-resistant clinical pathogen Enterococcus faecalis S613. Importantly, we sustained both planktonic and nonplanktonic (i.e., biofilm) populations in coculture as the concentration of antibiotic was raised, facilitating the development of more ecological complexity than is typically observed in laboratory evolution. Quantitative experimental evolution revealed a clear order and hierarchy of genetic changes leading to resistance, the signaling and metabolic pathways responsible, and the relative importance of these mutations to the evolution of DAP resistance. Despite the relative simplicity of this ex vivo approach compared to the ecological complexity of the human body, we showed that experimental evolution allows for rapid identification of clinically relevant adaptive molecular pathways and new targets for drug design in pathogens.
Five rules for the evolution of cooperation.
Nowak, Martin A
2006-12-08
Cooperation is needed for evolution to construct new levels of organization. Genomes, cells, multicellular organisms, social insects, and human society are all based on cooperation. Cooperation means that selfish replicators forgo some of their reproductive potential to help one another. But natural selection implies competition and therefore opposes cooperation unless a specific mechanism is at work. Here I discuss five mechanisms for the evolution of cooperation: kin selection, direct reciprocity, indirect reciprocity, network reciprocity, and group selection. For each mechanism, a simple rule is derived that specifies whether natural selection can lead to cooperation.
Five Rules for the Evolution of Cooperation
NASA Astrophysics Data System (ADS)
Nowak, Martin A.
2006-12-01
Cooperation is needed for evolution to construct new levels of organization. Genomes, cells, multicellular organisms, social insects, and human society are all based on cooperation. Cooperation means that selfish replicators forgo some of their reproductive potential to help one another. But natural selection implies competition and therefore opposes cooperation unless a specific mechanism is at work. Here I discuss five mechanisms for the evolution of cooperation: kin selection, direct reciprocity, indirect reciprocity, network reciprocity, and group selection. For each mechanism, a simple rule is derived that specifies whether natural selection can lead to cooperation.
The unusual reproductive system of head and body lice (Pediculus humanus)
ANDREWES, S.; CLARK, J. M.; ROSS, L.
2017-01-01
Abstract Insect reproduction is extremely variable, but the implications of alternative genetic systems are often overlooked in studies on the evolution of insecticide resistance. Both ecotypes of Pediculus humanus (Phthiraptera: Pediculidae), the human head and body lice, are human ectoparasites, the control of which is challenged by the recent spread of resistance alleles. The present study conclusively establishes for the first time that both head and body lice reproduce through paternal genome elimination (PGE), an unusual genetic system in which males transmit only their maternally derived chromosomes. Here, we investigate inheritance patterns of parental genomes using a genotyping approach across families of both ecotypes and show that heterozygous males exclusively or preferentially pass on one allele only, whereas females transmit both in a Mendelian fashion. We do however observe occasional transmission of paternal chromosomes through males, representing the first known case of PGE in which whole‐genome meiotic drive is incomplete. Finally, we discuss the potential implications of this finding for the evolution of resistance and invite the development of new theoretical models of how this knowledge might contribute to increasing the success of pediculicide‐based management schemes. PMID:29266297
Genomic Encyclopedia of Type Strains of the Genus Bifidobacterium
Milani, Christian; Lugli, Gabriele Andrea; Duranti, Sabrina; Turroni, Francesca; Bottacini, Francesca; Mangifesta, Marta; Sanchez, Borja; Viappiani, Alice; Mancabelli, Leonardo; Taminiau, Bernard; Delcenserie, Véronique; Barrangou, Rodolphe; Margolles, Abelardo; van Sinderen, Douwe
2014-01-01
Bifidobacteria represent one of the dominant microbial groups that are present in the gut of various animals, being particularly prevalent during the suckling stage of life of humans and other mammals. However, the overall genome structure of this group of microorganisms remains largely unexplored. Here, we sequenced the genomes of 42 representative (sub)species across the Bifidobacterium genus and used this information to explore the overall genetic picture of this bacterial group. Furthermore, the genomic data described here were used to reconstruct the evolutionary development of the Bifidobacterium genus. This reconstruction suggests that its evolution was substantially influenced by genetic adaptations to obtain access to glycans, thereby representing a common and potent evolutionary force in shaping bifidobacterial genomes. PMID:25085493
Uniparental Inheritance Promotes Adaptive Evolution in Cytoplasmic Genomes
Christie, Joshua R.; Beekman, Madeleine
2017-01-01
Eukaryotes carry numerous asexual cytoplasmic genomes (mitochondria and plastids). Lacking recombination, asexual genomes should theoretically suffer from impaired adaptive evolution. Yet, empirical evidence indicates that cytoplasmic genomes experience higher levels of adaptive evolution than predicted by theory. In this study, we use a computational model to show that the unique biology of cytoplasmic genomes—specifically their organization into host cells and their uniparental (maternal) inheritance—enable them to undergo effective adaptive evolution. Uniparental inheritance of cytoplasmic genomes decreases competition between different beneficial substitutions (clonal interference), promoting the accumulation of beneficial substitutions. Uniparental inheritance also facilitates selection against deleterious cytoplasmic substitutions, slowing Muller’s ratchet. In addition, uniparental inheritance generally reduces genetic hitchhiking of deleterious substitutions during selective sweeps. Overall, uniparental inheritance promotes adaptive evolution by increasing the level of beneficial substitutions relative to deleterious substitutions. When we assume that cytoplasmic genome inheritance is biparental, decreasing the number of genomes transmitted during gametogenesis (bottleneck) aids adaptive evolution. Nevertheless, adaptive evolution is always more efficient when inheritance is uniparental. Our findings explain empirical observations that cytoplasmic genomes—despite their asexual mode of reproduction—can readily undergo adaptive evolution. PMID:28025277
Yue, Y; Grossmann, B; Tsend-Ayush, E; Grützner, F; Ferguson-Smith, M A; Yang, F; Haaf, T
2005-01-01
Intrachromosomal duplications play a significant role in human genome pathology and evolution. To better understand the molecular basis of evolutionary chromosome rearrangements, we performed molecular cytogenetic and sequence analyses of the breakpoint region that distinguishes human chromosome 3p12.3 and orangutan chromosome 2. FISH with region-specific BAC clones demonstrated that the breakpoint-flanking sequences are duplicated intrachromosomally on orangutan 2 and human 3q21 as well as at many pericentromeric and subtelomeric sites throughout the genomes. Breakage and rearrangement of the human 3p12.3-homologous region in the orangutan lineage were associated with a partial loss of duplicated sequences in the breakpoint region. Consistent with our FISH mapping results, computational analysis of the human chromosome 3 genomic sequence revealed three 3p12.3-paralogous sequence blocks on human chromosome 3q21 and smaller blocks on the short arm end 3p26-->p25. This is consistent with the view that sequences from an ancestral site at 3q21 were duplicated at 3p12.3 in a common ancestor of orangutan and humans. Our results show that evolutionary chromosome rearrangements are associated with microduplications and microdeletions, contributing to the DNA differences between closely related species. Copyright (c) 2005 S. Karger AG, Basel.
Positioning Genomics in Biology Education: Content Mapping of Undergraduate Biology Textbooks†
Wernick, Naomi L. B.; Ndung’u, Eric; Haughton, Dominique; Ledley, Fred D.
2014-01-01
Biological thought increasingly recognizes the centrality of the genome in constituting and regulating processes ranging from cellular systems to ecology and evolution. In this paper, we ask whether genomics is similarly positioned as a core concept in the instructional sequence for undergraduate biology. Using quantitative methods, we analyzed the order in which core biological concepts were introduced in textbooks for first-year general and human biology. Statistical analysis was performed using self-organizing map algorithms and conventional methods to identify clusters of terms and their relative position in the books. General biology textbooks for both majors and nonmajors introduced genome-related content after text related to cell biology and biological chemistry, but before content describing higher-order biological processes. However, human biology textbooks most often introduced genomic content near the end of the books. These results suggest that genomics is not yet positioned as a core concept in commonly used textbooks for first-year biology and raises questions about whether such textbooks, or courses based on the outline of these textbooks, provide an appropriate foundation for understanding contemporary biological science. PMID:25574293
Positioning genomics in biology education: content mapping of undergraduate biology textbooks.
Wernick, Naomi L B; Ndung'u, Eric; Haughton, Dominique; Ledley, Fred D
2014-12-01
Biological thought increasingly recognizes the centrality of the genome in constituting and regulating processes ranging from cellular systems to ecology and evolution. In this paper, we ask whether genomics is similarly positioned as a core concept in the instructional sequence for undergraduate biology. Using quantitative methods, we analyzed the order in which core biological concepts were introduced in textbooks for first-year general and human biology. Statistical analysis was performed using self-organizing map algorithms and conventional methods to identify clusters of terms and their relative position in the books. General biology textbooks for both majors and nonmajors introduced genome-related content after text related to cell biology and biological chemistry, but before content describing higher-order biological processes. However, human biology textbooks most often introduced genomic content near the end of the books. These results suggest that genomics is not yet positioned as a core concept in commonly used textbooks for first-year biology and raises questions about whether such textbooks, or courses based on the outline of these textbooks, provide an appropriate foundation for understanding contemporary biological science.
Iraola, Gregorio; Pérez, Ruben; Naya, Hugo; Paolicchi, Fernando; Pastor, Eugenia; Valenzuela, Sebastián; Calleros, Lucía; Velilla, Alejandra; Hernández, Martín; Morsella, Claudia
2014-01-01
The genus Campylobacter includes some of the most relevant pathogens for human and animal health; the continuous effort in their characterization has also revealed new species putatively involved in different kind of infections. Nowadays, the available genomic data for the genus comprise a wide variety of species with different pathogenic potential and niche preferences. In this work, we contribute to enlarge this available information presenting the first genome for the species Campylobacter sputorum bv. sputorum and use this and the already sequenced organisms to analyze the emergence and evolution of pathogenicity and niche preferences among Campylobacter species. We found that campylobacters can be unequivocally distinguished in established and putative pathogens depending on their repertory of virulence genes, which have been horizontally acquired from other bacteria because the nonpathogenic Campylobacter ancestor emerged, and posteriorly interchanged between some members of the genus. Additionally, we demonstrated the role of both horizontal gene transfers and diversifying evolution in niche preferences, being able to distinguish genetic features associated to the tropism for oral, genital, and gastrointestinal tissues. In particular, we highlight the role of nonsynonymous evolution of disulphide bond proteins, the invasion antigen B (CiaB), and other secreted proteins in the determination of niche preferences. Our results arise from assessing the previously unmet goal of considering the whole available Campylobacter diversity for genome comparisons, unveiling notorious genetic features that could explain particular phenotypes and set the basis for future research in Campylobacter biology. PMID:25193310
Mekalanos, John J.
2014-01-01
Modern genomic and bioinformatic approaches have been applied to interrogate the V. cholerae genome, the role of genomic elements in cholera disease, and the origin, relatedness, and dissemination of epidemic strains. A universal attribute of choleragenic strains includes a repertoire of pathogenicity islands and virulence genes, namely the CTX–ϕ prophage and Toxin Co-regulated Pilus (TCP) in addition to other virulent genetic elements including those referred to as Seventh Pandemic Islands. During the last decade, the advent of Next Generation Sequencing (NGS) has provided highly resolved and often complete genomic sequences of epidemic isolates in addition to both clinical and environmental strains isolated from geographically unconnected regions. Genomic comparisons of these strains, as was completed during and following the Haitian outbreak in 2010, reveals that most epidemic strains appear closely related, regardless of region of origin. Non-O1 clinical or environmental strains may also possess some virulence islands, but phylogenic analysis of the core genome suggests they are more diverse and distantly related than those isolated during epidemics. Like Haiti, genomic studies that examine both the Vibrio core- and pan-genome in addition to Single Nucleotide Polymorphisms (SNPs) conclude that a number of epidemics are caused by strains that closely resemble those in Asia, and often appear to originate there and then spread globally. The accumulation of SNPs in the epidemic strains over time can then be applied to better understand the evolution of the V. cholerae genome as an etiological agent. PMID:24590676
Stilling, Roman M.; Bordenstein, Seth R.; Dinan, Timothy G.; Cryan, John F.
2014-01-01
The tight association of the human body with trillions of colonizing microbes that we observe today is the result of a long evolutionary history. Only very recently have we started to understand how this symbiosis also affects brain function and behavior. In this hypothesis and theory article, we propose how host-microbe associations potentially influenced mammalian brain evolution and development. In particular, we explore the integration of human brain development with evolution, symbiosis, and RNA biology, which together represent a “social triangle” that drives human social behavior and cognition. We argue that, in order to understand how inter-kingdom communication can affect brain adaptation and plasticity, it is inevitable to consider epigenetic mechanisms as important mediators of genome-microbiome interactions on an individual as well as a transgenerational time scale. Finally, we unite these interpretations with the hologenome theory of evolution. Taken together, we propose a tighter integration of neuroscience fields with host-associated microbiology by taking an evolutionary perspective. PMID:25401092
Genomic analysis of expressed sequence tags in American black bear Ursus americanus
2010-01-01
Background Species of the bear family (Ursidae) are important organisms for research in molecular evolution, comparative physiology and conservation biology, but relatively little genetic sequence information is available for this group. Here we report the development and analyses of the first large scale Expressed Sequence Tag (EST) resource for the American black bear (Ursus americanus). Results Comprehensive analyses of molecular functions, alternative splicing, and tissue-specific expression of 38,757 black bear EST sequences were conducted using the dog genome as a reference. We identified 18 genes, involved in functions such as lipid catabolism, cell cycle, and vesicle-mediated transport, that are showing rapid evolution in the bear lineage Three genes, Phospholamban (PLN), cysteine glycine-rich protein 3 (CSRP3) and Troponin I type 3 (TNNI3), are related to heart contraction, and defects in these genes in humans lead to heart disease. Two genes, biphenyl hydrolase-like (BPHL) and CSRP3, contain positively selected sites in bear. Global analysis of evolution rates of hibernation-related genes in bear showed that they are largely conserved and slowly evolving genes, rather than novel and fast-evolving genes. Conclusion We provide a genomic resource for an important mammalian organism and our study sheds new light on the possible functions and evolution of bear genes. PMID:20338065
Genomic analysis of expressed sequence tags in American black bear Ursus americanus.
Zhao, Sen; Shao, Chunxuan; Goropashnaya, Anna V; Stewart, Nathan C; Xu, Yichi; Tøien, Øivind; Barnes, Brian M; Fedorov, Vadim B; Yan, Jun
2010-03-26
Species of the bear family (Ursidae) are important organisms for research in molecular evolution, comparative physiology and conservation biology, but relatively little genetic sequence information is available for this group. Here we report the development and analyses of the first large scale Expressed Sequence Tag (EST) resource for the American black bear (Ursus americanus). Comprehensive analyses of molecular functions, alternative splicing, and tissue-specific expression of 38,757 black bear EST sequences were conducted using the dog genome as a reference. We identified 18 genes, involved in functions such as lipid catabolism, cell cycle, and vesicle-mediated transport, that are showing rapid evolution in the bear lineage Three genes, Phospholamban (PLN), cysteine glycine-rich protein 3 (CSRP3) and Troponin I type 3 (TNNI3), are related to heart contraction, and defects in these genes in humans lead to heart disease. Two genes, biphenyl hydrolase-like (BPHL) and CSRP3, contain positively selected sites in bear. Global analysis of evolution rates of hibernation-related genes in bear showed that they are largely conserved and slowly evolving genes, rather than novel and fast-evolving genes. We provide a genomic resource for an important mammalian organism and our study sheds new light on the possible functions and evolution of bear genes.
Gerstein, Aleeza C; Nielsen, Kirsten
2017-04-01
Cryptococcus is predominantly an AIDS-related pathogen that causes significant morbidity and mortality in immunocompromised patients. Research studies have historically focused on understanding how the organism causes human disease through the use of in vivo and in vitro model systems to identify virulence factors. Cryptococcus is not an obligate pathogen, however, as human-human transmission is either absent or rare. Selection in the environment must thus be invoked to shape the evolution of this taxa, and directly influences genotypic and trait diversity. Importantly, the evolution and maintenance of pathogenicity must also stem directly from environmental selection. To that end, here we examine abiotic and biotic stresses in the environment, and discuss how they could shape the factors that are commonly identified as important virulence traits. We identify a number of important unanswered questions about Cryptococcus diversity and evolution that are critical for understanding this deadly pathogen, and discuss how implementation of modern sampling and genomic tools could be utilized to answer these questions. Copyright © 2016 John Wiley & Sons, Ltd. Copyright © 2016 John Wiley & Sons, Ltd.
Hunt, Tam
2015-01-01
There is a singular unifying reality underlying every biologic interaction on our planet. In immunology, that which does not kill you makes you different. -William B. Miller, Jr. We are experiencing a revolution in our understanding of inner space on a par with our exponentially increasing understanding of outer space. In biology, we are learning that the genetic and epigenetic complexity within organisms is far deeper than suspected. This is a key theme in William B. Miller Jr.'s book, The Microcosm Within: Evolution and Extinction in the Hologenome. We are learning also that a focus on the human genome alone is misleading when it comes to who we really are as biological entities, and in terms of how we and other creatures have evolved. Rather than being defined by the human genome alone, we are instead defined by the "hologenome," the sum of the human genome and the far larger genetic endowment of the microbiome and symbiotic communities that reside within and around us. Miller is a medical doctor previously in private practice in Pennsylvania and Phoenix, Arizona. This book is his first foray into evolutionary theory. His book could have been titled "The Origin of Variation" because this is his primary focus. He accepts that natural selection plays a role in evolution, but he demotes this mechanism to a less important role than the Modern Synthesis suggests. His main gripe, however, concerns random variation. He argues that random variation is unable to explain the origin and evolution of biological forms that we see in the world around us and in the historical record. Miller suggests that, rather than random variation as the engine of novelty, there is a creative impulse at the heart of cellular life, and even at the level of the genetic aggregate, that generates novelty on a regular basis. I probe this assertion in the interview below. He also highlights the strong role of "exogenous genetic assault" in variation and in his immunological model of evolution.
Hormonal Involvement in Breast Cancer Gene Amplification
2010-10-01
been shown to induce DN A amplification in yeast (Gopalakrishnan et al., 2001; Nguy en et al., 2001; Green et al., 2006) an d increased Cdt1 results in...re-replication in human cells (Dorn et al., 2008). The N- terminus of Cdt1 is important for re-replication, perhaps through interactions with PCNA...evolution of a cancer genome. Genome Res. (Epub. Dec. 3, 2008). Harris TD, Buzby PR, Babcock H, Beer E, Bowers J, Bras lavsky I, Causey M
Tracking the establishment of local endemic populations of an emergent enteric pathogen
Holt, Kathryn E.; Thieu Nga, Tran Vu; Thanh, Duy Pham; Vinh, Ha; Kim, Dong Wook; Vu Tra, My Phan; Campbell, James I.; Hoang, Nguyen Van Minh; Vinh, Nguyen Thanh; Minh, Pham Van; Thuy, Cao Thu; Nga, Tran Thi Thu; Thompson, Corinne; Dung, Tran Thi Ngoc; Nhu, Nguyen Thi Khanh; Vinh, Phat Voong; Tuyet, Pham Thi Ngoc; Phuc, Hoang Le; Lien, Nguyen Thi Nam; Phu, Bui Duc; Ai, Nguyen Thi Thuy; Tien, Nguyen Manh; Dong, Nguyen; Parry, Christopher M.; Hien, Tran Tinh; Farrar, Jeremy J.; Parkhill, Julian; Dougan, Gordon; Thomson, Nicholas R.; Baker, Stephen
2013-01-01
Shigella sonnei is a human-adapted pathogen that is emerging globally as the dominant agent of bacterial dysentery. To investigate local establishment, we sequenced the genomes of 263 Vietnamese S. sonnei isolated over 15 y. Our data show that S. sonnei was introduced into Vietnam in the 1980s and has undergone localized clonal expansion, punctuated by genomic fixation events through periodic selective sweeps. We uncover geographical spread, spatially restricted frontier populations, and convergent evolution through local gene pool sampling. This work provides a unique, high-resolution insight into the microevolution of a pioneering human pathogen during its establishment in a new host population. PMID:24082120
Xiang, Yezi; Huang, Chien-Hsun; Hu, Yi; Wen, Jun; Li, Shisheng; Yi, Tingshuang; Chen, Hongyi; Xiang, Jun; Ma, Hong
2017-02-01
Fruits are the defining feature of angiosperms, likely have contributed to angiosperm successes by protecting and dispersing seeds, and provide foods to humans and other animals, with many morphological types and important ecological and agricultural implications. Rosaceae is a family with ∼3000 species and an extraordinary spectrum of distinct fruits, including fleshy peach, apple, and strawberry prized by their consumers, as well as dry achenetum and follicetum with features facilitating seed dispersal, excellent for studying fruit evolution. To address Rosaceae fruit evolution and other questions, we generated 125 new transcriptomic and genomic datasets and identified hundreds of nuclear genes to reconstruct a well-resolved Rosaceae phylogeny with highly supported monophyly of all subfamilies and tribes. Molecular clock analysis revealed an estimated age of ∼101.6 Ma for crown Rosaceae and divergence times of tribes and genera, providing a geological and climate context for fruit evolution. Phylogenomic analysis yielded strong evidence for numerous whole genome duplications (WGDs), supporting the hypothesis that the apple tribe had a WGD and revealing another one shared by fleshy fruit-bearing members of this tribe, with moderate support for WGDs in the peach tribe and other groups. Ancestral character reconstruction for fruit types supports independent origins of fleshy fruits from dry-fruit ancestors, including the evolution of drupes (e.g., peach) and pomes (e.g., apple) from follicetum, and drupetum (raspberry and blackberry) from achenetum. We propose that WGDs and environmental factors, including animals, contributed to the evolution of the many fruits in Rosaceae, which provide a foundation for understanding fruit evolution. © The Author 2016. Published by Oxford University Press on behalf of the Society for Molecular Biology and Evolution.
Kaján, Győző L; Kajon, Adriana E; Pinto, Alexis Castillo; Bartha, Dániel; Arnberg, Niklas
2017-10-15
A novel human adenovirus was isolated from a pediatric case of acute respiratory disease in Panama City, Panama in 2011. The clinical isolate was initially identified as an intertypic recombinant based on hexon and fiber gene sequencing. Based on the analysis of its complete genome sequence, the novel complex recombinant Human mastadenovirus D (HAdV-D) strain was classified into a new HAdV type: HAdV-84, and it was designated Adenovirus D human/PAN/P309886/2011/84[P43H17F84]. HAdV-D types possess usually an ocular or gastrointestinal tropism, and respiratory association is scarcely reported. The virus has a novel fiber type, most closely related to, but still clearly distant from that of HAdV-36. The predicted fiber is hypothesised to bind sialic acid with lower affinity compared to HAdV-37. Bioinformatic analysis of the complete genomic sequence of HAdV-84 revealed multiple homologous recombination events and provided deeper insight into HAdV evolution. Copyright © 2017 Elsevier B.V. All rights reserved.
Genome-wide analyses of HTLV-1aD strains from Cape Verde, Africa.
Zanella, Louise; Pina-Araujo I, Isabel de; Morgado, Mariza G; Vicente, Ana Carolina
2016-09-01
We characterised and reported the first full-length genomes of Human T-cell Lymphotropic Virus Type 1 subgroup HTLV-1aD (CV21 and CV79). This subgroup is one of the major determinants of HTLV-1 infections in North and West Africa, and recombinant strains involving this subgroup have been recently demonstrated. The CV21 and CV79 strains from Cape Verde/Africa were characterised as pure HTLV-1aD genomes, comparative analyses including HTLV-1 subtypes and subgroups revealed HTLV-1aD signatures in the envelope, pol, and pX regions. These genomes provide original information that will contribute to further studies on HTLV-1a epidemiology and evolution.
2013-01-01
Evolutionary medicine (EM) is a growing field focusing on the evolutionary basis of human diseases and their changes through time. To date, the majority of EM studies have used pure theories of hominin macroevolution to explain the present-day state of human health. Here, we propose a different approach by addressing more empirical and health-oriented research concerning past, current and future microevolutionary changes of human structure, functions and pathologies. Studying generation-to-generation changes of human morphology that occurred in historical times, and still occur in present-day populations under the forces of evolution, helps to explain medical conditions and warns clinicians that their current practices may influence future humans. Also, analyzing historic tissue specimens such as mummies is crucial in order to address the molecular evolution of pathogens, of the human genome, and their coadaptations. PMID:23627943
Evidence for Hitchhiking of Deleterious Mutations within the Human Genome
Chun, Sung; Fay, Justin C.
2011-01-01
Deleterious mutations present a significant obstacle to adaptive evolution. Deleterious mutations can inhibit the spread of linked adaptive mutations through a population; conversely, adaptive substitutions can increase the frequency of linked deleterious mutations and even result in their fixation. To assess the impact of adaptive mutations on linked deleterious mutations, we examined the distribution of deleterious and neutral amino acid polymorphism in the human genome. Within genomic regions that show evidence of recent hitchhiking, we find fewer neutral but a similar number of deleterious SNPs compared to other genomic regions. The higher ratio of deleterious to neutral SNPs is consistent with simulated hitchhiking events and implies that positive selection eliminates some deleterious alleles and increases the frequency of others. The distribution of disease-associated alleles is also altered in hitchhiking regions. Disease alleles within hitchhiking regions have been associated with auto-immune disorders, metabolic diseases, cancers, and mental disorders. Our results suggest that positive selection has had a significant impact on deleterious polymorphism and may be partly responsible for the high frequency of certain human disease alleles. PMID:21901107
Gehring, Walter J
2014-01-01
In this review, the evolution of vision is retraced from its putative origins in cyanobacteria to humans. Circadian oscillatory clocks, phototropism, and phototaxis require the capability to detect light. Photosensory proteins allow us to reconstruct molecular phylogenetic trees. The evolution of animal eyes leading from an ancestral prototype to highly complex image forming eyes can be deciphered on the basis of evolutionary developmental genetic experiments and comparative genomics. As all bilaterian animals share the same master control gene, Pax6, and the same retinal and pigment cell determination genes, we conclude that the different eye-types originated monophyletically and subsequently diversified by divergent, parallel, or convergent evolution. © 2012 Wiley Periodicals, Inc.
SINEs, evolution and genome structure in the opossum.
Gu, Wanjun; Ray, David A; Walker, Jerilyn A; Barnes, Erin W; Gentles, Andrew J; Samollow, Paul B; Jurka, Jerzy; Batzer, Mark A; Pollock, David D
2007-07-01
Short INterspersed Elements (SINEs) are non-autonomous retrotransposons, usually between 100 and 500 base pairs (bp) in length, which are ubiquitous components of eukaryotic genomes. Their activity, distribution, and evolution can be highly informative on genomic structure and evolutionary processes. To determine recent activity, we amplified more than one hundred SINE1 loci in a panel of 43 M. domestica individuals derived from five diverse geographic locations. The SINE1 family has expanded recently enough that many loci were polymorphic, and the SINE1 insertion-based genetic distances among populations reflected geographic distance. Genome-wide comparisons of SINE1 densities and GC content revealed that high SINE1 density is associated with high GC content in a few long and many short spans. Young SINE1s, whether fixed or polymorphic, showed an unbiased GC content preference for insertion, indicating that the GC preference accumulates over long time periods, possibly in periodic bursts. SINE1 evolution is thus broadly similar to human Alu evolution, although it has an independent origin. High GC content adjacent to SINE1s is strongly correlated with bias towards higher AT to GC substitutions and lower GC to AT substitutions. This is consistent with biased gene conversion, and also indicates that like chickens, but unlike eutherian mammals, GC content heterogeneity (isochore structure) is reinforced by substitution processes in the M. domestica genome. Nevertheless, both high and low GC content regions are apparently headed towards lower GC content equilibria, possibly due to a relative shift to lower recombination rates in the recent Monodelphis ancestral lineage. Like eutherians, metatherian (marsupial) mammals have evolved high CpG substitution rates, but this is apparently a convergence in process rather than a shared ancestral state.
Yang, Lei; Neme, Rafik; Wichman, Holly A.; Malik, Harmit S.
2014-01-01
Mammalian genomes comprise many active and fossilized retroelements. The obligate requirement for retroelement integration affords host genomes an opportunity to ‘domesticate’ retroelement genes for their own purpose, leading to important innovations in genome defense and placentation. While many such exaptations involve retroviruses, the L1TD1 gene is the only known domesticated gene whose protein-coding sequence is almost entirely derived from a LINE-1 (L1) retroelement. Human L1TD1 has been shown to play an important role in pluripotency maintenance. To investigate how this role was acquired, we traced the origin and evolution of L1TD1. We find that L1TD1 originated in the common ancestor of eutherian mammals, but was lost or pseudogenized multiple times during mammalian evolution. We also find that L1TD1 has evolved under positive selection during primate and mouse evolution, and that one prosimian L1TD1 has ‘replenished’ itself with a more recent L1 ORF1 from the prosimian genome. These data suggest that L1TD1 has been recurrently selected for functional novelty, perhaps for a role in genome defense. L1TD1 loss is associated with L1 extinction in several megabat lineages, but not in sigmodontine rodents. We hypothesize that L1TD1 could have originally evolved for genome defense against L1 elements. Later, L1TD1 may have become incorporated into pluripotency maintenance in some lineages. Our study highlights the role of retroelement gene domestication in fundamental aspects of mammalian biology, and that such domesticated genes can adopt different functions in different lineages. PMID:25211013
Chiara, Matteo; Caruso, Marta; D'Erchia, Anna Maria; Manzari, Caterina; Fraccalvieri, Rosa; Goffredo, Elisa; Latorre, Laura; Miccolupo, Angela; Padalino, Iolanda; Santagada, Gianfranco; Chiocco, Doriano; Pesole, Graziano; Horner, David S; Parisi, Antonio
2015-07-15
Historically, genome-wide and molecular characterization of the genus Listeria has concentrated on the important human pathogen Listeria monocytogenes and a small number of closely related species, together termed Listeria sensu strictu. More recently, a number of genome sequences for more basal, and nonpathogenic, members of the Listeria genus have become available, facilitating a wider perspective on the evolution of pathogenicity and genome level evolutionary dynamics within the entire genus (termed Listeria sensu lato). Here, we have sequenced the genomes of additional Listeria fleischmannii and Listeria newyorkensis isolates and explored the dynamics of genome evolution in Listeria sensu lato. Our analyses suggest that acquisition of genetic material through gene duplication and divergence as well as through lateral gene transfer (mostly from outside Listeria) is widespread throughout the genus. Novel genetic material is apparently subject to rapid turnover. Multiple lines of evidence point to significant differences in evolutionary dynamics between the most basal Listeria subclade and all other congeners, including both sensu strictu and other sensu lato isolates. Strikingly, these differences are likely attributable to stochastic, population-level processes and contribute to observed variation in genome size across the genus. Notably, our analyses indicate that the common ancestor of Listeria sensu lato lacked flagella, which were acquired by lateral gene transfer by a common ancestor of Listeria grayi and Listeria sensu strictu, whereas a recently functionally characterized pathogenicity island, responsible for the capacity to produce cobalamin and utilize ethanolamine/propane-2-diol, was acquired in an ancestor of Listeria sensu strictu. © The Author(s) 2015. Published by Oxford University Press on behalf of the Society for Molecular Biology and Evolution.
Ogura, Atsushi; Ikeo, Kazuho; Gojobori, Takashi
2004-01-01
Although the camera eye of the octopus is very similar to that of humans, phylogenetic and embryological analyses have suggested that their camera eyes have been acquired independently. It has been known as a typical example of convergent evolution. To study the molecular basis of convergent evolution of camera eyes, we conducted a comparative analysis of gene expression in octopus and human camera eyes. We sequenced 16,432 ESTs of the octopus eye, leading to 1052 nonredundant genes that have matches in the protein database. Comparing these 1052 genes with 13,303 already-known ESTs of the human eye, 729 (69.3%) genes were commonly expressed between the human and octopus eyes. On the contrary, when we compared octopus eye ESTs with human connective tissue ESTs, the expression similarity was quite low. To trace the evolutionary changes that are potentially responsible for camera eye formation, we also compared octopus-eye ESTs with the completed genome sequences of other organisms. We found that 1019 out of the 1052 genes had already existed at the common ancestor of bilateria, and 875 genes were conserved between humans and octopuses. It suggests that a larger number of conserved genes and their similar gene expression may be responsible for the convergent evolution of the camera eye. PMID:15289475
Cloud-based interactive analytics for terabytes of genomic variants data.
Pan, Cuiping; McInnes, Gregory; Deflaux, Nicole; Snyder, Michael; Bingham, Jonathan; Datta, Somalee; Tsao, Philip S
2017-12-01
Large scale genomic sequencing is now widely used to decipher questions in diverse realms such as biological function, human diseases, evolution, ecosystems, and agriculture. With the quantity and diversity these data harbor, a robust and scalable data handling and analysis solution is desired. We present interactive analytics using a cloud-based columnar database built on Dremel to perform information compression, comprehensive quality controls, and biological information retrieval in large volumes of genomic data. We demonstrate such Big Data computing paradigms can provide orders of magnitude faster turnaround for common genomic analyses, transforming long-running batch jobs submitted via a Linux shell into questions that can be asked from a web browser in seconds. Using this method, we assessed a study population of 475 deeply sequenced human genomes for genomic call rate, genotype and allele frequency distribution, variant density across the genome, and pharmacogenomic information. Our analysis framework is implemented in Google Cloud Platform and BigQuery. Codes are available at https://github.com/StanfordBioinformatics/mvp_aaa_codelabs. cuiping@stanford.edu or ptsao@stanford.edu. Supplementary data are available at Bioinformatics online. Published by Oxford University Press 2017. This work is written by US Government employees and are in the public domain in the US.
Cloud-based interactive analytics for terabytes of genomic variants data
Pan, Cuiping; McInnes, Gregory; Deflaux, Nicole; Snyder, Michael; Bingham, Jonathan; Datta, Somalee; Tsao, Philip S
2017-01-01
Abstract Motivation Large scale genomic sequencing is now widely used to decipher questions in diverse realms such as biological function, human diseases, evolution, ecosystems, and agriculture. With the quantity and diversity these data harbor, a robust and scalable data handling and analysis solution is desired. Results We present interactive analytics using a cloud-based columnar database built on Dremel to perform information compression, comprehensive quality controls, and biological information retrieval in large volumes of genomic data. We demonstrate such Big Data computing paradigms can provide orders of magnitude faster turnaround for common genomic analyses, transforming long-running batch jobs submitted via a Linux shell into questions that can be asked from a web browser in seconds. Using this method, we assessed a study population of 475 deeply sequenced human genomes for genomic call rate, genotype and allele frequency distribution, variant density across the genome, and pharmacogenomic information. Availability and implementation Our analysis framework is implemented in Google Cloud Platform and BigQuery. Codes are available at https://github.com/StanfordBioinformatics/mvp_aaa_codelabs. Contact cuiping@stanford.edu or ptsao@stanford.edu Supplementary information Supplementary data are available at Bioinformatics online. PMID:28961771
Wang, Qingguo; Jia, Peilin; Zhao, Zhongming
2015-01-01
Fueled by widespread applications of high-throughput next generation sequencing (NGS) technologies and urgent need to counter threats of pathogenic viruses, large-scale studies were conducted recently to investigate virus integration in host genomes (for example, human tumor genomes) that may cause carcinogenesis or other diseases. A limiting factor in these studies, however, is rapid virus evolution and resulting polymorphisms, which prevent reads from aligning readily to commonly used virus reference genomes, and, accordingly, make virus integration sites difficult to detect. Another confounding factor is host genomic instability as a result of virus insertions. To tackle these challenges and improve our capability to identify cryptic virus-host fusions, we present a new approach that detects Virus intEgration sites through iterative Reference SEquence customization (VERSE). To the best of our knowledge, VERSE is the first approach to improve detection through customizing reference genomes. Using 19 human tumors and cancer cell lines as test data, we demonstrated that VERSE substantially enhanced the sensitivity of virus integration site detection. VERSE is implemented in the open source package VirusFinder 2 that is available at http://bioinfo.mc.vanderbilt.edu/VirusFinder/.
Šmajs, David; Norris, Steven J.; Weinstock, George M.
2013-01-01
Pathogenic uncultivable treponemes, similar to syphilis-causing Treponema pallidum subspecies pallidum, include T. pallidum ssp. pertenue, T. pallidum ssp. endemicum and Treponema carateum, which cause yaws, bejel and pinta, respectively. Genetic analyses of these pathogens revealed striking similarity among these bacteria and also a high degree of similarity to the rabbit pathogen, T. paraluiscuniculi, a treponeme not infectious to humans. Genome comparisons between pallidum and non-pallidum treponemes revealed genes with potential involvement in human infectivity, whereas comparisons between pallidum and pertenue treponemes identified genes possibly involved in the high invasivity of syphilis treponemes. Genetic variability within syphilis strains is considered as the basis of syphilis molecular epidemiology with potential to detect more virulent strains, whereas genetic variability within a single strain is related to its ability to elude the immune system of the host. Genome analyses also shed light on treponemal evolution and on chromosomal targets for molecular diagnostics of treponemal infections. PMID:22198325
Detecting microsatellites within genomes: significant variation among algorithms.
Leclercq, Sébastien; Rivals, Eric; Jarne, Philippe
2007-04-18
Microsatellites are short, tandemly-repeated DNA sequences which are widely distributed among genomes. Their structure, role and evolution can be analyzed based on exhaustive extraction from sequenced genomes. Several dedicated algorithms have been developed for this purpose. Here, we compared the detection efficiency of five of them (TRF, Mreps, Sputnik, STAR, and RepeatMasker). Our analysis was first conducted on the human X chromosome, and microsatellite distributions were characterized by microsatellite number, length, and divergence from a pure motif. The algorithms work with user-defined parameters, and we demonstrate that the parameter values chosen can strongly influence microsatellite distributions. The five algorithms were then compared by fixing parameters settings, and the analysis was extended to three other genomes (Saccharomyces cerevisiae, Neurospora crassa and Drosophila melanogaster) spanning a wide range of size and structure. Significant differences for all characteristics of microsatellites were observed among algorithms, but not among genomes, for both perfect and imperfect microsatellites. Striking differences were detected for short microsatellites (below 20 bp), regardless of motif. Since the algorithm used strongly influences empirical distributions, studies analyzing microsatellite evolution based on a comparison between empirical and theoretical size distributions should therefore be considered with caution. We also discuss why a typological definition of microsatellites limits our capacity to capture their genomic distributions.
Detecting microsatellites within genomes: significant variation among algorithms
Leclercq, Sébastien; Rivals, Eric; Jarne, Philippe
2007-01-01
Background Microsatellites are short, tandemly-repeated DNA sequences which are widely distributed among genomes. Their structure, role and evolution can be analyzed based on exhaustive extraction from sequenced genomes. Several dedicated algorithms have been developed for this purpose. Here, we compared the detection efficiency of five of them (TRF, Mreps, Sputnik, STAR, and RepeatMasker). Results Our analysis was first conducted on the human X chromosome, and microsatellite distributions were characterized by microsatellite number, length, and divergence from a pure motif. The algorithms work with user-defined parameters, and we demonstrate that the parameter values chosen can strongly influence microsatellite distributions. The five algorithms were then compared by fixing parameters settings, and the analysis was extended to three other genomes (Saccharomyces cerevisiae, Neurospora crassa and Drosophila melanogaster) spanning a wide range of size and structure. Significant differences for all characteristics of microsatellites were observed among algorithms, but not among genomes, for both perfect and imperfect microsatellites. Striking differences were detected for short microsatellites (below 20 bp), regardless of motif. Conclusion Since the algorithm used strongly influences empirical distributions, studies analyzing microsatellite evolution based on a comparison between empirical and theoretical size distributions should therefore be considered with caution. We also discuss why a typological definition of microsatellites limits our capacity to capture their genomic distributions. PMID:17442102
Baptista, Rodrigo P; Reis-Cunha, Joao Luis; DeBarry, Jeremy D; Chiari, Egler; Kissinger, Jessica C; Bartholomeu, Daniella C; Macedo, Andrea M
2018-02-14
Next-generation sequencing (NGS) methods are low-cost high-throughput technologies that produce thousands to millions of sequence reads. Despite the high number of raw sequence reads, their short length, relative to Sanger, PacBio or Nanopore reads, complicates the assembly of genomic repeats. Many genome tools are available, but the assembly of highly repetitive genome sequences using only NGS short reads remains challenging. Genome assembly of organisms responsible for important neglected diseases such as Trypanosoma cruzi, the aetiological agent of Chagas disease, is known to be challenging because of their repetitive nature. Only three of six recognized discrete typing units (DTUs) of the parasite have their draft genomes published and therefore genome evolution analyses in the taxon are limited. In this study, we developed a computational workflow to assemble highly repetitive genomes via a combination of de novo and reference-based assembly strategies to better overcome the intrinsic limitations of each, based on Illumina reads. The highly repetitive genome of the human-infecting parasite T. cruzi 231 strain was used as a test subject. The combined-assembly approach shown in this study benefits from the reference-based assembly ability to resolve highly repetitive sequences and from the de novo capacity to assemble genome-specific regions, improving the quality of the assembly. The acceptable confidence obtained by analyzing our results showed that our combined approach is an attractive option to assemble highly repetitive genomes with NGS short reads. Phylogenomic analysis including the 231 strain, the first representative of DTU III whose genome was sequenced, was also performed and provides new insights into T. cruzi genome evolution.
The d4 gene family in the human genome
DOE Office of Scientific and Technical Information (OSTI.GOV)
Chestkov, A.V.; Baka, I.D.; Kost, M.V.
1996-08-15
The d4 domain, a novel zinc finger-like structural motif, was first revealed in the rat neuro-d4 protein. Here we demonstrate that the d4 domain is conserved in evolution and that three related genes form a d4 family in the human genome. The human neuro-d4 is very similar to rat neuro-d4 at both the amino acid and the nucleotide levels. Moreover, the same splice variants have been detected among rat and human neuro-d4 transcripts. This gene has been localized on chromosome 19, and two other genes, members of the d4 family isolated by screening of the human genomic library at lowmore » stringency, have been mapped to chromosomes 11 and 14. The gene on chromosome 11 is the homolog of the ubiquitously expressed mouse gene ubi-d4/requiem, which is required for cell death after deprivation of trophic factors. A gene with a conserved d4 domain has been found in the genome of the nematode Caenorhabditis elegans. The conservation of d4 proteins from nematodes to vertebrates suggests that they have a general importance, but a diversity of d4 proteins expressed in vertebrate nervous systems suggests that some family members have special functions. 11 refs., 2 figs.« less
Mapping cis-Regulatory Domains in the Human Genome UsingMulti-Species Conservation of Synteny
DOE Office of Scientific and Technical Information (OSTI.GOV)
Ahituv, Nadav; Prabhakar, Shyam; Poulin, Francis
2005-06-13
Our inability to associate distant regulatory elements with the genes that they regulate has largely precluded their examination for sequence alterations contributing to human disease. One major obstacle is the large genomic space surrounding targeted genes in which such elements could potentially reside. In order to delineate gene regulatory boundaries we used whole-genome human-mouse-chicken (HMC) and human-mouse-frog (HMF) multiple alignments to compile conserved blocks of synteny (CBS), under the hypothesis that these blocks have been kept intact throughout evolution at least in part by the requirement of regulatory elements to stay linked to the genes that they regulate. A totalmore » of 2,116 and 1,942 CBS>200 kb were assembled for HMC and HMF respectively, encompassing 1.53 and 0.86 Gb of human sequence. To support the existence of complex long-range regulatory domains within these CBS we analyzed the prevalence and distribution of chromosomal aberrations leading to position effects (disruption of a genes regulatory environment), observing a clear bias not only for mapping onto CBS but also for longer CBS size. Our results provide a genome wide data set characterizing the regulatory domains of genes and the conserved regulatory elements within them.« less
Clappier, Emmanuelle; Gerby, Bastien; Sigaux, François; Delord, Marc; Touzri, Farah; Hernandez, Lucie; Ballerini, Paola; Baruchel, André; Pflumio, Françoise; Soulier, Jean
2011-04-11
Genomic studies in human acute lymphoblastic leukemia (ALL) have revealed clonal heterogeneity at diagnosis and clonal evolution at relapse. In this study, we used genome-wide profiling to compare human T cell ALL samples at the time of diagnosis and after engraftment (xenograft) into immunodeficient recipient mice. Compared with paired diagnosis samples, the xenograft leukemia often contained additional genomic lesions in established human oncogenes and/or tumor suppressor genes. Mimicking such genomic lesions by short hairpin RNA-mediated knockdown in diagnosis samples conferred a selective advantage in competitive engraftment experiments, demonstrating that additional lesions can be drivers of increased leukemia-initiating activity. In addition, the xenograft leukemias appeared to arise from minor subclones existing in the patient at diagnosis. Comparison of paired diagnosis and relapse samples showed that, with regard to genetic lesions, xenograft leukemias more frequently more closely resembled relapse samples than bulk diagnosis samples. Moreover, a cell cycle- and mitosis-associated gene expression signature was present in xenograft and relapse samples, and xenograft leukemia exhibited diminished sensitivity to drugs. Thus, the establishment of human leukemia in immunodeficient mice selects and expands a more aggressive malignancy, recapitulating the process of relapse in patients. These findings may contribute to the design of novel strategies to prevent or treat relapse.
Cultural evolution and emergent group-level traits through social heterosis.
Nonacs, Peter; Kapheim, Karen M
2014-06-01
Smaldino proposes emergent properties of human groups, arising when individuals display both differentiation and organization, constitute a novel unit of cultural selection not addressed by current evolutionary theory. We propose existing theoretical frameworks for maintenance of genetic diversity - social heterosis and social genomes - can similarly explain the appearance and maintenance of human cultural diversity (i.e., group-level traits) and collaborative interdependence.
Valenzuela, Nicole
2009-07-01
Painted turtles (Chrysemys picta) are representatives of a vertebrate clade whose biology and phylogenetic position hold a key to our understanding of fundamental aspects of vertebrate evolution. These features make them an ideal emerging model system. Extensive ecological and physiological research provide the context in which to place new research advances in evolutionary genetics, genomics, evolutionary developmental biology, and ecological developmental biology which are enabled by current resources, such as a bacterial artificial chromosome (BAC) library of C. picta, and the imminent development of additional ones such as genome sequences and cDNA and expressed sequence tag (EST) libraries. This integrative approach will allow the research community to continue making advances to provide functional and evolutionary explanations for the lability of biological traits found not only among reptiles but vertebrates in general. Moreover, because humans and reptiles share a common ancestor, and given the ease of using nonplacental vertebrates in experimental biology compared with mammalian embryos, painted turtles are also an emerging model system for biomedical research. For example, painted turtles have been studied to understand many biological responses to overwintering and anoxia, as potential sentinels for environmental xenobiotics, and as a model to decipher the ecology and evolution of sexual development and reproduction. Thus, painted turtles are an excellent reptilian model system for studies with human health, environmental, ecological, and evolutionary significance.
The Evolution and Expression Pattern of Human Overlapping lncRNA and Protein-coding Gene Pairs.
Ning, Qianqian; Li, Yixue; Wang, Zhen; Zhou, Songwen; Sun, Hong; Yu, Guangjun
2017-03-27
Long non-coding RNA overlapping with protein-coding gene (lncRNA-coding pair) is a special type of overlapping genes. Protein-coding overlapping genes have been well studied and increasing attention has been paid to lncRNAs. By studying lncRNA-coding pairs in human genome, we showed that lncRNA-coding pairs were more likely to be generated by overprinting and retaining genes in lncRNA-coding pairs were given higher priority than non-overlapping genes. Besides, the preference of overlapping configurations preserved during evolution was based on the origin of lncRNA-coding pairs. Further investigations showed that lncRNAs promoting the splicing of their embedded protein-coding partners was a unilateral interaction, but the existence of overlapping partners improving the gene expression was bidirectional and the effect was decreased with the increased evolutionary age of genes. Additionally, the expression of lncRNA-coding pairs showed an overall positive correlation and the expression correlation was associated with their overlapping configurations, local genomic environment and evolutionary age of genes. Comparison of the expression correlation of lncRNA-coding pairs between normal and cancer samples found that the lineage-specific pairs including old protein-coding genes may play an important role in tumorigenesis. This work presents a systematically comprehensive understanding of the evolution and the expression pattern of human lncRNA-coding pairs.
Convergent Evolution of Human-Isolated H7N9 Avian Influenza A Viruses.
Xiang, Dan; Shen, Xuejuan; Pu, Zhiqing; Irwin, David M; Liao, Ming; Shen, Yongyi
2018-05-05
Avian influenza A virus H7N9 has caused 5 epidemic waves of human infections in China since 2013. Avian influenza A viruses may face strong selection to adapt to novel conditions when establishing themselves in humans. In this study, we sought to determine whether adaptive evolution had occurred in human-isolated H7N9 viruses. We evaluated all available genomes of H7N9 avian influenza A virus. Maximum likelihood trees were separately reconstructed for all 8 genes. Signals of positive selection and convergent evolution were then detected on branches that lead to changes in host tropism (from avian to human). We found that 3 genes had significant signals of positive selection (all of them P < .05). In addition, we detected 34 sites having significant signals for parallel evolution in 8 genes (all of them P < .05), including 7 well-known sites (Q591K, E627K, and D701N in PB2 gene; R156K, V202A, and L244Q in HA; and R289K in NA) that play roles in crossing species barriers for avian influenza A viruses. Our study suggests that, during infection in humans, H7N9 viruses have undergone adaptive evolution to adapt to their new host environment and that the sites where parallel evolution occurred might play roles in crossing species barriers and respond to the new selection pressures arising from their new host environments.
Genetic Differences Between Great Apes and Humans: Implications for Human Evolution
DOE Office of Scientific and Technical Information (OSTI.GOV)
Varki, Ajit
2004-03-17
When considering protein sequences, humans are 99-100% identical to chimpanzees and bonobos, our closest evolutionary relatives. The evolution of humans (and the unique features of our species) from a common ancestor with these great apes involved many steps, influenced by interactions amongst factors of genetic, developmental, ecological, microbial, climatic, behavioral, cultural and social origin. The genetic factors can be approached by direct comparisons of human and great ape genomes, genes and gene products, and by elucidating biochemical and biological consequences of the differences. We have discovered multiple genetic and biochemical differences between humans and great apes, particularly in relationship tomore » a family of cell surface molecules called sialic acids. These differences have implications for the human condition, ranging from susceptibility or resistance to microbial pathogens; effects on endogenous receptors in the immune system; potential effects on placental signaling; the expression of oncofetal antigens in cancers; consequences of dietary intake of animal foods; and the development of the mammalian brain. This talk will provide an overview of these and other genetic differences between humans and great apes, with attention to differences potentially relevant to the evolution of humans.« less
Genome sequencing and analysis of the model grass Brachypodium distachyon
DOE Office of Scientific and Technical Information (OSTI.GOV)
Yang, Xiaohan; Kalluri, Udaya C; Tuskan, Gerald A
Three subfamilies of grasses, the Ehrhartoideae, Panicoideae and Pooideae, provide the bulk of human nutrition and are poised to become major sources of renewable energy. Here we describe the genome sequence of the wild grass Brachypodium distachyon (Brachypodium), which is, to our knowledge, the first member of the Pooideae subfamily to be sequenced. Comparison of the Brachypodium, rice and sorghum genomes shows a precise history of genome evolution across a broad diversity of the grasses, and establishes a template for analysis of the large genomes of economically important pooid grasses such as wheat. The high-quality genome sequence, coupled with easemore » of cultivation and transformation, small size and rapid life cycle, will help Brachypodium reach its potential as an important model system for developing new energy and food crops.« less
The Trichoplax Genome and the Nature of Placozoans
DOE Office of Scientific and Technical Information (OSTI.GOV)
Srivastava, Mansi; Begovic, Emina; Chapman, Jarrod
2008-08-01
Placozoans are arguably the simplest free-living animals, possibly evoking an early stage in metazoan evolution, yet their biology is poorly understood. Here we report the sequencing and analysis of the {approx}98 million base pair nuclear genome of the placozoan Trichoplax adhaerens. Whole genome phylogenetic analysis suggests that placozoans belong to a 'eumetazoan' clade that includes cnidarians and bilaterians, with sponges as the earliest diverging animals. The compact genome exhibits conserved gene content, gene structure, and synteny relative to the human and other complex eumetazoan genomes. Despite the apparent cellular and organismal simplicity of Trichoplax, its genome encodes a rich arraymore » of transcription factor and signaling pathway genes that are typically associated with diverse cell types and developmental processes in eumetazoans, motivating further searches for cryptic cellular complexity and/or as yet unobserved life history stages.« less
Comparative and demographic analysis of orangutan genomes
Locke, Devin P.; Hillier, LaDeana W.; Warren, Wesley C.; Worley, Kim C.; Nazareth, Lynne V.; Muzny, Donna M.; Yang, Shiaw-Pyng; Wang, Zhengyuan; Chinwalla, Asif T.; Minx, Pat; Mitreva, Makedonka; Cook, Lisa; Delehaunty, Kim D.; Fronick, Catrina; Schmidt, Heather; Fulton, Lucinda A.; Fulton, Robert S.; Nelson, Joanne O.; Magrini, Vincent; Pohl, Craig; Graves, Tina A.; Markovic, Chris; Cree, Andy; Dinh, Huyen H.; Hume, Jennifer; Kovar, Christie L.; Fowler, Gerald R.; Lunter, Gerton; Meader, Stephen; Heger, Andreas; Ponting, Chris P.; Marques-Bonet, Tomas; Alkan, Can; Chen, Lin; Cheng, Ze; Kidd, Jeffrey M.; Eichler, Evan E.; White, Simon; Searle, Stephen; Vilella, Albert J.; Chen, Yuan; Flicek, Paul; Ma, Jian; Raney, Brian; Suh, Bernard; Burhans, Richard; Herrero, Javier; Haussler, David; Faria, Rui; Fernando, Olga; Darré, Fleur; Farré, Domènec; Gazave, Elodie; Oliva, Meritxell; Navarro, Arcadi; Roberto, Roberta; Capozzi, Oronzo; Archidiacono, Nicoletta; Valle, Giuliano Della; Purgato, Stefania; Rocchi, Mariano; Konkel, Miriam K.; Walker, Jerilyn A.; Ullmer, Brygg; Batzer, Mark A.; Smit, Arian F. A.; Hubley, Robert; Casola, Claudio; Schrider, Daniel R.; Hahn, Matthew W.; Quesada, Victor; Puente, Xose S.; Ordoñez, Gonzalo R.; López-Otín, Carlos; Vinar, Tomas; Brejova, Brona; Ratan, Aakrosh; Harris, Robert S.; Miller, Webb; Kosiol, Carolin; Lawson, Heather A.; Taliwal, Vikas; Martins, André L.; Siepel, Adam; RoyChoudhury, Arindam; Ma, Xin; Degenhardt, Jeremiah; Bustamante, Carlos D.; Gutenkunst, Ryan N.; Mailund, Thomas; Dutheil, Julien Y.; Hobolth, Asger; Schierup, Mikkel H.; Chemnick, Leona; Ryder, Oliver A.; Yoshinaga, Yuko; de Jong, Pieter J.; Weinstock, George M.; Rogers, Jeffrey; Mardis, Elaine R.; Gibbs, Richard A.; Wilson, Richard K.
2011-01-01
“Orangutan” is derived from the Malay term “man of the forest” and aptly describes the Southeast Asian great apes native to Sumatra and Borneo. The orangutan species, Pongo abelii (Sumatran) and Pongo pygmaeus (Bornean), are the most phylogenetically distant great apes from humans, thereby providing an informative perspective on hominid evolution. Here we present a Sumatran orangutan draft genome assembly and short read sequence data from five Sumatran and five Bornean orangutan genomes. Our analyses reveal that, compared to other primates, the orangutan genome has many unique features. Structural evolution of the orangutan genome has proceeded much more slowly than other great apes, evidenced by fewer rearrangements, less segmental duplication, a lower rate of gene family turnover and surprisingly quiescent Alu repeats, which have played a major role in restructuring other primate genomes. We also describe the first primate polymorphic neocentromere, found in both Pongo species, emphasizing the gradual evolution of orangutan genome structure. Orangutans have extremely low energy usage for a eutherian mammal1, far lower than their hominid relatives. Adding their genome to the repertoire of sequenced primates illuminates new signals of positive selection in several pathways including glycolipid metabolism. From the population perspective, both Pongo species are deeply diverse; however, Sumatran individuals possess greater diversity than their Bornean counterparts, and more species-specific variation. Our estimate of Bornean/Sumatran speciation time, 400k years ago (ya), is more recent than most previous studies and underscores the complexity of the orangutan speciation process. Despite a smaller modern census population size, the Sumatran effective population size (Ne) expanded exponentially relative to the ancestral Ne after the split, while Bornean Ne declined over the same period. Overall, the resources and analyses presented here offer new opportunities in evolutionary genomics, insights into hominid biology, and an extensive database of variation for conservation efforts. PMID:21270892
Human tumors show a high level of genetic heterogeneity, but the processes that influence the timing and route of metastatic dissemination of the subclones are unknown. Here we have used whole-exome sequencing of 103 matched benign, malignant and metastatic skin tumors from genetically heterogeneous mice to demonstrate that most metastases disseminate synchronously from the primary tumor, supporting parallel rather than linear evolution as the predominant model of metastasis.
Genomic minimalism in the early diverging intestinal parasite Giardia lamblia.
Morrison, Hilary G; McArthur, Andrew G; Gillin, Frances D; Aley, Stephen B; Adam, Rodney D; Olsen, Gary J; Best, Aaron A; Cande, W Zacheus; Chen, Feng; Cipriano, Michael J; Davids, Barbara J; Dawson, Scott C; Elmendorf, Heidi G; Hehl, Adrian B; Holder, Michael E; Huse, Susan M; Kim, Ulandt U; Lasek-Nesselquist, Erica; Manning, Gerard; Nigam, Anuranjini; Nixon, Julie E J; Palm, Daniel; Passamaneck, Nora E; Prabhu, Anjali; Reich, Claudia I; Reiner, David S; Samuelson, John; Svard, Staffan G; Sogin, Mitchell L
2007-09-28
The genome of the eukaryotic protist Giardia lamblia, an important human intestinal parasite, is compact in structure and content, contains few introns or mitochondrial relics, and has simplified machinery for DNA replication, transcription, RNA processing, and most metabolic pathways. Protein kinases comprise the single largest protein class and reflect Giardia's requirement for a complex signal transduction network for coordinating differentiation. Lateral gene transfer from bacterial and archaeal donors has shaped Giardia's genome, and previously unknown gene families, for example, cysteine-rich structural proteins, have been discovered. Unexpectedly, the genome shows little evidence of heterozygosity, supporting recent speculations that this organism is sexual. This genome sequence will not only be valuable for investigating the evolution of eukaryotes, but will also be applied to the search for new therapeutics for this parasite.
Genomic analyses provide insights into the history of tomato breeding.
Lin, Tao; Zhu, Guangtao; Zhang, Junhong; Xu, Xiangyang; Yu, Qinghui; Zheng, Zheng; Zhang, Zhonghua; Lun, Yaoyao; Li, Shuai; Wang, Xiaoxuan; Huang, Zejun; Li, Junming; Zhang, Chunzhi; Wang, Taotao; Zhang, Yuyang; Wang, Aoxue; Zhang, Yancong; Lin, Kui; Li, Chuanyou; Xiong, Guosheng; Xue, Yongbiao; Mazzucato, Andrea; Causse, Mathilde; Fei, Zhangjun; Giovannoni, James J; Chetelat, Roger T; Zamir, Dani; Städler, Thomas; Li, Jingfu; Ye, Zhibiao; Du, Yongchen; Huang, Sanwen
2014-11-01
The histories of crop domestication and breeding are recorded in genomes. Although tomato is a model species for plant biology and breeding, the nature of human selection that altered its genome remains largely unknown. Here we report a comprehensive analysis of tomato evolution based on the genome sequences of 360 accessions. We provide evidence that domestication and improvement focused on two independent sets of quantitative trait loci (QTLs), resulting in modern tomato fruit ∼100 times larger than its ancestor. Furthermore, we discovered a major genomic signature for modern processing tomatoes, identified the causative variants that confer pink fruit color and precisely visualized the linkage drag associated with wild introgressions. This study outlines the accomplishments as well as the costs of historical selection and provides molecular insights toward further improvement.
Genetic Variation and Adaptation in Africa: Implications for Human Evolution and Disease
Gomez, Felicia; Hirbo, Jibril; Tishkoff, Sarah A.
2014-01-01
Because modern humans originated in Africa and have adapted to diverse environments, African populations have high levels of genetic and phenotypic diversity. Thus, genomic studies of diverse African ethnic groups are essential for understanding human evolutionary history and how this leads to differential disease risk in all humans. Comparative studies of genetic diversity within and between African ethnic groups creates an opportunity to reconstruct some of the earliest events in human population history and are useful for identifying patterns of genetic variation that have been influenced by recent natural selection. Here we describe what is currently known about genetic variation and evolutionary history of diverse African ethnic groups. We also describe examples of recent natural selection in African genomes and how these data are informative for understanding the frequency of many genetic traits, including those that cause disease susceptibility in African populations and populations of recent African descent. PMID:24984772
Y chromosome evolution: emerging insights into processes of Y chromosome degeneration
Bachtrog, Doris
2014-01-01
The human Y chromosome is intriguing not only because it harbours the master-switch gene determining gender but also because of its unusual evolutionary trajectory. Previously an autosome, Y chromosome evolution has been characterized by massive gene decay. Recent whole-genome and transcriptome analyses of Y chromosomes in humans and other primates, in Drosophila species as well as in plants have shed light on the current gene content of the Y, its origins and its long-term fate. Comparative analysis of young and old Y chromosomes have given further insights into the evolutionary and molecular forces triggering Y degeneration and its evolutionary destiny. PMID:23329112
The Brassica oleracea genome reveals the asymmetrical evolution of polyploid genomes
Liu, Shengyi; Liu, Yumei; Yang, Xinhua; Tong, Chaobo; Edwards, David; Parkin, Isobel A. P.; Zhao, Meixia; Ma, Jianxin; Yu, Jingyin; Huang, Shunmou; Wang, Xiyin; Wang, Junyi; Lu, Kun; Fang, Zhiyuan; Bancroft, Ian; Yang, Tae-Jin; Hu, Qiong; Wang, Xinfa; Yue, Zhen; Li, Haojie; Yang, Linfeng; Wu, Jian; Zhou, Qing; Wang, Wanxin; King, Graham J; Pires, J. Chris; Lu, Changxin; Wu, Zhangyan; Sampath, Perumal; Wang, Zhuo; Guo, Hui; Pan, Shengkai; Yang, Limei; Min, Jiumeng; Zhang, Dong; Jin, Dianchuan; Li, Wanshun; Belcram, Harry; Tu, Jinxing; Guan, Mei; Qi, Cunkou; Du, Dezhi; Li, Jiana; Jiang, Liangcai; Batley, Jacqueline; Sharpe, Andrew G; Park, Beom-Seok; Ruperao, Pradeep; Cheng, Feng; Waminal, Nomar Espinosa; Huang, Yin; Dong, Caihua; Wang, Li; Li, Jingping; Hu, Zhiyong; Zhuang, Mu; Huang, Yi; Huang, Junyan; Shi, Jiaqin; Mei, Desheng; Liu, Jing; Lee, Tae-Ho; Wang, Jinpeng; Jin, Huizhe; Li, Zaiyun; Li, Xun; Zhang, Jiefu; Xiao, Lu; Zhou, Yongming; Liu, Zhongsong; Liu, Xuequn; Qin, Rui; Tang, Xu; Liu, Wenbin; Wang, Yupeng; Zhang, Yangyong; Lee, Jonghoon; Kim, Hyun Hee; Denoeud, France; Xu, Xun; Liang, Xinming; Hua, Wei; Wang, Xiaowu; Wang, Jun; Chalhoub, Boulos; Paterson, Andrew H
2014-01-01
Polyploidization has provided much genetic variation for plant adaptive evolution, but the mechanisms by which the molecular evolution of polyploid genomes establishes genetic architecture underlying species differentiation are unclear. Brassica is an ideal model to increase knowledge of polyploid evolution. Here we describe a draft genome sequence of Brassica oleracea, comparing it with that of its sister species B. rapa to reveal numerous chromosome rearrangements and asymmetrical gene loss in duplicated genomic blocks, asymmetrical amplification of transposable elements, differential gene co-retention for specific pathways and variation in gene expression, including alternative splicing, among a large number of paralogous and orthologous genes. Genes related to the production of anticancer phytochemicals and morphological variations illustrate consequences of genome duplication and gene divergence, imparting biochemical and morphological variation to B. oleracea. This study provides insights into Brassica genome evolution and will underpin research into the many important crops in this genus. PMID:24852848
Genomic approaches for understanding dengue: insights from the virus, vector, and host.
Sim, Shuzhen; Hibberd, Martin L
2016-03-02
The incidence and geographic range of dengue have increased dramatically in recent decades. Climate change, rapid urbanization and increased global travel have facilitated the spread of both efficient mosquito vectors and the four dengue virus serotypes between population centers. At the same time, significant advances in genomics approaches have provided insights into host-pathogen interactions, immunogenetics, and viral evolution in both humans and mosquitoes. Here, we review these advances and the innovative treatment and control strategies that they are inspiring.
Comparative Genomics in Drosophila.
Oti, Martin; Pane, Attilio; Sammeth, Michael
2018-01-01
Since the pioneering studies of Thomas Hunt Morgan and coworkers at the dawn of the twentieth century, Drosophila melanogaster and its sister species have tremendously contributed to unveil the rules underlying animal genetics, development, behavior, evolution, and human disease. Recent advances in DNA sequencing technologies launched Drosophila into the post-genomic era and paved the way for unprecedented comparative genomics investigations. The complete sequencing and systematic comparison of the genomes from 12 Drosophila species represents a milestone achievement in modern biology, which allowed a plethora of different studies ranging from the annotation of known and novel genomic features to the evolution of chromosomes and, ultimately, of entire genomes. Despite the efforts of countless laboratories worldwide, the vast amount of data that were produced over the past 15 years is far from being fully explored.In this chapter, we will review some of the bioinformatic approaches that were developed to interrogate the genomes of the 12 Drosophila species. Setting off from alignments of the entire genomic sequences, the degree of conservation can be separately evaluated for every region of the genome, providing already first hints about elements that are under purifying selection and therefore likely functional. Furthermore, the careful analysis of repeated sequences sheds light on the evolutionary dynamics of transposons, an enigmatic and fascinating class of mobile elements housed in the genomes of animals and plants. Comparative genomics also aids in the computational identification of the transcriptionally active part of the genome, first and foremost of protein-coding loci, but also of transcribed nevertheless apparently noncoding regions, which were once considered "junk" DNA. Eventually, the synergy between functional and comparative genomics also facilitates in silico and in vivo studies on cis-acting regulatory elements, like transcription factor binding sites, that due to the high degree of sequence variability usually impose increased challenges for bioinformatics approaches.
Cheng, Feixiong; Liu, Chuang; Lin, Chen-Ching; Zhao, Junfei; Jia, Peilin; Li, Wen-Hsiung; Zhao, Zhongming
2015-09-01
Cancer development and progression result from somatic evolution by an accumulation of genomic alterations. The effects of those alterations on the fitness of somatic cells lead to evolutionary adaptations such as increased cell proliferation, angiogenesis, and altered anticancer drug responses. However, there are few general mathematical models to quantitatively examine how perturbations of a single gene shape subsequent evolution of the cancer genome. In this study, we proposed the gene gravity model to study the evolution of cancer genomes by incorporating the genome-wide transcription and somatic mutation profiles of ~3,000 tumors across 9 cancer types from The Cancer Genome Atlas into a broad gene network. We found that somatic mutations of a cancer driver gene may drive cancer genome evolution by inducing mutations in other genes. This functional consequence is often generated by the combined effect of genetic and epigenetic (e.g., chromatin regulation) alterations. By quantifying cancer genome evolution using the gene gravity model, we identified six putative cancer genes (AHNAK, COL11A1, DDX3X, FAT4, STAG2, and SYNE1). The tumor genomes harboring the nonsynonymous somatic mutations in these genes had a higher mutation density at the genome level compared to the wild-type groups. Furthermore, we provided statistical evidence that hypermutation of cancer driver genes on inactive X chromosomes is a general feature in female cancer genomes. In summary, this study sheds light on the functional consequences and evolutionary characteristics of somatic mutations during tumorigenesis by propelling adaptive cancer genome evolution, which would provide new perspectives for cancer research and therapeutics.
Lin, Chen-Ching; Zhao, Junfei; Jia, Peilin; Li, Wen-Hsiung; Zhao, Zhongming
2015-01-01
Cancer development and progression result from somatic evolution by an accumulation of genomic alterations. The effects of those alterations on the fitness of somatic cells lead to evolutionary adaptations such as increased cell proliferation, angiogenesis, and altered anticancer drug responses. However, there are few general mathematical models to quantitatively examine how perturbations of a single gene shape subsequent evolution of the cancer genome. In this study, we proposed the gene gravity model to study the evolution of cancer genomes by incorporating the genome-wide transcription and somatic mutation profiles of ~3,000 tumors across 9 cancer types from The Cancer Genome Atlas into a broad gene network. We found that somatic mutations of a cancer driver gene may drive cancer genome evolution by inducing mutations in other genes. This functional consequence is often generated by the combined effect of genetic and epigenetic (e.g., chromatin regulation) alterations. By quantifying cancer genome evolution using the gene gravity model, we identified six putative cancer genes (AHNAK, COL11A1, DDX3X, FAT4, STAG2, and SYNE1). The tumor genomes harboring the nonsynonymous somatic mutations in these genes had a higher mutation density at the genome level compared to the wild-type groups. Furthermore, we provided statistical evidence that hypermutation of cancer driver genes on inactive X chromosomes is a general feature in female cancer genomes. In summary, this study sheds light on the functional consequences and evolutionary characteristics of somatic mutations during tumorigenesis by propelling adaptive cancer genome evolution, which would provide new perspectives for cancer research and therapeutics. PMID:26352260
2013-01-01
Background We describe the genome of the western painted turtle, Chrysemys picta bellii, one of the most widespread, abundant, and well-studied turtles. We place the genome into a comparative evolutionary context, and focus on genomic features associated with tooth loss, immune function, longevity, sex differentiation and determination, and the species' physiological capacities to withstand extreme anoxia and tissue freezing. Results Our phylogenetic analyses confirm that turtles are the sister group to living archosaurs, and demonstrate an extraordinarily slow rate of sequence evolution in the painted turtle. The ability of the painted turtle to withstand complete anoxia and partial freezing appears to be associated with common vertebrate gene networks, and we identify candidate genes for future functional analyses. Tooth loss shares a common pattern of pseudogenization and degradation of tooth-specific genes with birds, although the rate of accumulation of mutations is much slower in the painted turtle. Genes associated with sex differentiation generally reflect phylogeny rather than convergence in sex determination functionality. Among gene families that demonstrate exceptional expansions or show signatures of strong natural selection, immune function and musculoskeletal patterning genes are consistently over-represented. Conclusions Our comparative genomic analyses indicate that common vertebrate regulatory networks, some of which have analogs in human diseases, are often involved in the western painted turtle's extraordinary physiological capacities. As these regulatory pathways are analyzed at the functional level, the painted turtle may offer important insights into the management of a number of human health disorders. PMID:23537068
Lung evolution as a cipher for physiology
Torday, J. S.; Rehan, V. K.
2009-01-01
In the postgenomic era, we need an algorithm to readily translate genes into physiologic principles. The failure to advance biomedicine is due to the false hope raised in the wake of the Human Genome Project (HGP) by the promise of systems biology as a ready means of reconstructing physiology from genes. like the atom in physics, the cell, not the gene, is the smallest completely functional unit of biology. Trying to reassemble gene regulatory networks without accounting for this fundamental feature of evolution will result in a genomic atlas, but not an algorithm for functional genomics. For example, the evolution of the lung can be “deconvoluted” by applying cell-cell communication mechanisms to all aspects of lung biology development, homeostasis, and regeneration/repair. Gene regulatory networks common to these processes predict ontogeny, phylogeny, and the disease-related consequences of failed signaling. This algorithm elucidates characteristics of vertebrate physiology as a cascade of emergent and contingent cellular adaptational responses. By reducing complex physiological traits to gene regulatory networks and arranging them hierarchically in a self-organizing map, like the periodic table of elements in physics, the first principles of physiology will emerge. PMID:19366785
USDA-ARS?s Scientific Manuscript database
Adaptation to human-induced environmental change has the potential to profoundly influence the genomic architecture of affected species. This is particularly true in agricultural ecosystems, where anthropogenic selection pressure is strong. Heliothis virescens feeds on cotton in its larval stages a...
USDA-ARS?s Scientific Manuscript database
Cycles of whole genome duplication (WGD) and diploidization are hallmarks of eukaryotic genome evolution and speciation. Polyploid wheat (Triticum aestivum) has had a massive increase in genome size largely due to recent WGDs. How these processes may impact the dynamics of gene evolution was studied...
Five rules for the evolution of cooperation
Nowak, Martin A.
2011-01-01
Cooperation is needed for evolution to construct new levels of organization. The emergence of genomes, cells, multi-cellular organisms, social insects and human society are all based on cooperation. Cooperation means that selfish replicators forgo some of their reproductive potential to help one another. But natural selection implies competition and therefore opposes cooperation unless a specific mechanism is at work. Here I discuss five mechanisms for the evolution of cooperation: kin selection, direct reciprocity, indirect reciprocity, network reciprocity and group selection. For each mechanism, a simple rule is derived which specifies whether natural selection can lead to cooperation. PMID:17158317
Nguyen, Quan H; Tellam, Ross L; Naval-Sanchez, Marina; Porto-Neto, Laercio R; Barendse, William; Reverter, Antonio; Hayes, Benjamin; Kijas, James; Dalrymple, Brian P
2018-01-01
Abstract Genome sequences for hundreds of mammalian species are available, but an understanding of their genomic regulatory regions, which control gene expression, is only beginning. A comprehensive prediction of potential active regulatory regions is necessary to functionally study the roles of the majority of genomic variants in evolution, domestication, and animal production. We developed a computational method to predict regulatory DNA sequences (promoters, enhancers, and transcription factor binding sites) in production animals (cows and pigs) and extended its broad applicability to other mammals. The method utilizes human regulatory features identified from thousands of tissues, cell lines, and experimental assays to find homologous regions that are conserved in sequences and genome organization and are enriched for regulatory elements in the genome sequences of other mammalian species. Importantly, we developed a filtering strategy, including a machine learning classification method, to utilize a very small number of species-specific experimental datasets available to select for the likely active regulatory regions. The method finds the optimal combination of sensitivity and accuracy to unbiasedly predict regulatory regions in mammalian species. Furthermore, we demonstrated the utility of the predicted regulatory datasets in cattle for prioritizing variants associated with multiple production and climate change adaptation traits and identifying potential genome editing targets. PMID:29618048
Nguyen, Quan H; Tellam, Ross L; Naval-Sanchez, Marina; Porto-Neto, Laercio R; Barendse, William; Reverter, Antonio; Hayes, Benjamin; Kijas, James; Dalrymple, Brian P
2018-03-01
Genome sequences for hundreds of mammalian species are available, but an understanding of their genomic regulatory regions, which control gene expression, is only beginning. A comprehensive prediction of potential active regulatory regions is necessary to functionally study the roles of the majority of genomic variants in evolution, domestication, and animal production. We developed a computational method to predict regulatory DNA sequences (promoters, enhancers, and transcription factor binding sites) in production animals (cows and pigs) and extended its broad applicability to other mammals. The method utilizes human regulatory features identified from thousands of tissues, cell lines, and experimental assays to find homologous regions that are conserved in sequences and genome organization and are enriched for regulatory elements in the genome sequences of other mammalian species. Importantly, we developed a filtering strategy, including a machine learning classification method, to utilize a very small number of species-specific experimental datasets available to select for the likely active regulatory regions. The method finds the optimal combination of sensitivity and accuracy to unbiasedly predict regulatory regions in mammalian species. Furthermore, we demonstrated the utility of the predicted regulatory datasets in cattle for prioritizing variants associated with multiple production and climate change adaptation traits and identifying potential genome editing targets.
Biological data sciences in genome research
Schatz, Michael C.
2015-01-01
The last 20 years have been a remarkable era for biology and medicine. One of the most significant achievements has been the sequencing of the first human genomes, which has laid the foundation for profound insights into human genetics, the intricacies of regulation and development, and the forces of evolution. Incredibly, as we look into the future over the next 20 years, we see the very real potential for sequencing more than 1 billion genomes, bringing even deeper insight into human genetics as well as the genetics of millions of other species on the planet. Realizing this great potential for medicine and biology, though, will only be achieved through the integration and development of highly scalable computational and quantitative approaches that can keep pace with the rapid improvements to biotechnology. In this perspective, I aim to chart out these future technologies, anticipate the major themes of research, and call out the challenges ahead. One of the largest shifts will be in the training used to prepare the class of 2035 for their highly interdisciplinary world. PMID:26430150
Wild tobacco genomes reveal the evolution of nicotine biosynthesis.
Xu, Shuqing; Brockmöller, Thomas; Navarro-Quezada, Aura; Kuhl, Heiner; Gase, Klaus; Ling, Zhihao; Zhou, Wenwu; Kreitzer, Christoph; Stanke, Mario; Tang, Haibao; Lyons, Eric; Pandey, Priyanka; Pandey, Shree P; Timmermann, Bernd; Gaquerel, Emmanuel; Baldwin, Ian T
2017-06-06
Nicotine, the signature alkaloid of Nicotiana species responsible for the addictive properties of human tobacco smoking, functions as a defensive neurotoxin against attacking herbivores. However, the evolution of the genetic features that contributed to the assembly of the nicotine biosynthetic pathway remains unknown. We sequenced and assembled genomes of two wild tobaccos, Nicotiana attenuata (2.5 Gb) and Nicotiana obtusifolia (1.5 Gb), two ecological models for investigating adaptive traits in nature. We show that after the Solanaceae whole-genome triplication event, a repertoire of rapidly expanding transposable elements (TEs) bloated these Nicotiana genomes, promoted expression divergences among duplicated genes, and contributed to the evolution of herbivory-induced signaling and defenses, including nicotine biosynthesis. The biosynthetic machinery that allows for nicotine synthesis in the roots evolved from the stepwise duplications of two ancient primary metabolic pathways: the polyamine and nicotinamide adenine dinucleotide (NAD) pathways. In contrast to the duplication of the polyamine pathway that is shared among several solanaceous genera producing polyamine-derived tropane alkaloids, we found that lineage-specific duplications within the NAD pathway and the evolution of root-specific expression of the duplicated Solanaceae-specific ethylene response factor that activates the expression of all nicotine biosynthetic genes resulted in the innovative and efficient production of nicotine in the genus Nicotiana Transcription factor binding motifs derived from TEs may have contributed to the coexpression of nicotine biosynthetic pathway genes and coordinated the metabolic flux. Together, these results provide evidence that TEs and gene duplications facilitated the emergence of a key metabolic innovation relevant to plant fitness.
An Organismal CNV Mutator Phenotype Restricted to Early Human Development.
Liu, Pengfei; Yuan, Bo; Carvalho, Claudia M B; Wuster, Arthur; Walter, Klaudia; Zhang, Ling; Gambin, Tomasz; Chong, Zechen; Campbell, Ian M; Coban Akdemir, Zeynep; Gelowani, Violet; Writzl, Karin; Bacino, Carlos A; Lindsay, Sarah J; Withers, Marjorie; Gonzaga-Jauregui, Claudia; Wiszniewska, Joanna; Scull, Jennifer; Stankiewicz, Paweł; Jhangiani, Shalini N; Muzny, Donna M; Zhang, Feng; Chen, Ken; Gibbs, Richard A; Rautenstrauss, Bernd; Cheung, Sau Wai; Smith, Janice; Breman, Amy; Shaw, Chad A; Patel, Ankita; Hurles, Matthew E; Lupski, James R
2017-02-23
De novo copy number variants (dnCNVs) arising at multiple loci in a personal genome have usually been considered to reflect cancer somatic genomic instabilities. We describe a multiple dnCNV (MdnCNV) phenomenon in which individuals with genomic disorders carry five to ten constitutional dnCNVs. These CNVs originate from independent formation incidences, are predominantly tandem duplications or complex gains, exhibit breakpoint junction features reminiscent of replicative repair, and show increased de novo point mutations flanking the rearrangement junctions. The active CNV mutation shower appears to be restricted to a transient perizygotic period. We propose that a defect in the CNV formation process is responsible for the "CNV-mutator state," and this state is dampened after early embryogenesis. The constitutional MdnCNV phenomenon resembles chromosomal instability in various cancers. Investigations of this phenomenon may provide unique access to understanding genomic disorders, structural variant mutagenesis, human evolution, and cancer biology. Copyright © 2017 Elsevier Inc. All rights reserved.
Ancient Recombination Events between Human Herpes Simplex Viruses
Burrel, Sonia; Boutolleau, David; Ryu, Diane; Agut, Henri; Merkel, Kevin; Leendertz, Fabian H.
2017-01-01
Abstract Herpes simplex viruses 1 and 2 (HSV-1 and HSV-2) are seen as close relatives but also unambiguously considered as evolutionary independent units. Here, we sequenced the genomes of 18 HSV-2 isolates characterized by divergent UL30 gene sequences to further elucidate the evolutionary history of this virus. Surprisingly, genome-wide recombination analyses showed that all HSV-2 genomes sequenced to date contain HSV-1 fragments. Using phylogenomic analyses, we could also show that two main HSV-2 lineages exist. One lineage is mostly restricted to subSaharan Africa whereas the other has reached a global distribution. Interestingly, only the worldwide lineage is characterized by ancient recombination events with HSV-1. Our findings highlight the complexity of HSV-2 evolution, a virus of putative zoonotic origin which later recombined with its human-adapted relative. They also suggest that coinfections with HSV-1 and 2 may have genomic and potentially functional consequences and should therefore be monitored more closely. PMID:28369565
A DNA methylation map of human cancer at single base-pair resolution.
Vidal, E; Sayols, S; Moran, S; Guillaumet-Adkins, A; Schroeder, M P; Royo, R; Orozco, M; Gut, M; Gut, I; Lopez-Bigas, N; Heyn, H; Esteller, M
2017-10-05
Although single base-pair resolution DNA methylation landscapes for embryonic and different somatic cell types provided important insights into epigenetic dynamics and cell-type specificity, such comprehensive profiling is incomplete across human cancer types. This prompted us to perform genome-wide DNA methylation profiling of 22 samples derived from normal tissues and associated neoplasms, including primary tumors and cancer cell lines. Unlike their invariant normal counterparts, cancer samples exhibited highly variable CpG methylation levels in a large proportion of the genome, involving progressive changes during tumor evolution. The whole-genome sequencing results from selected samples were replicated in a large cohort of 1112 primary tumors of various cancer types using genome-scale DNA methylation analysis. Specifically, we determined DNA hypermethylation of promoters and enhancers regulating tumor-suppressor genes, with potential cancer-driving effects. DNA hypermethylation events showed evidence of positive selection, mutual exclusivity and tissue specificity, suggesting their active participation in neoplastic transformation. Our data highlight the extensive changes in DNA methylation that occur in cancer onset, progression and dissemination.
Three tiers of genome evolution in reptiles
Organ, Chris L.; Moreno, Ricardo Godínez; Edwards, Scott V.
2008-01-01
Characterization of reptilian genomes is essential for understanding the overall diversity and evolution of amniote genomes, because reptiles, which include birds, constitute a major fraction of the amniote evolutionary tree. To better understand the evolution and diversity of genomic characteristics in Reptilia, we conducted comparative analyses of online sequence data from Alligator mississippiensis (alligator) and Sphenodon punctatus (tuatara) as well as genome size and karyological data from a wide range of reptilian species. At the whole-genome and chromosomal tiers of organization, we find that reptilian genome size distribution is consistent with a model of continuous gradual evolution while genomic compartmentalization, as manifested in the number of microchromosomes and macrochromosomes, appears to have undergone early rapid change. At the sequence level, the third genomic tier, we find that exon size in Alligator is distributed in a pattern matching that of exons in Gallus (chicken), especially in the 101—200 bp size class. A small spike in the fraction of exons in the 301 bp—1 kb size class is also observed for Alligator, but more so for Sphenodon. For introns, we find that members of Reptilia have a larger fraction of introns within the 101 bp–2 kb size class and a lower fraction of introns within the 5–30 kb size class than do mammals. These findings suggest that the mode of reptilian genome evolution varies across three hierarchical levels of the genome, a pattern consistent with a mosaic model of genomic evolution. PMID:21669810
Phylogenomics and the Dynamic Genome Evolution of the Genus Streptococcus
Richards, Vincent P.; Palmer, Sara R.; Pavinski Bitar, Paulina D.; Qin, Xiang; Weinstock, George M.; Highlander, Sarah K.; Town, Christopher D.; Burne, Robert A.; Stanhope, Michael J.
2014-01-01
The genus Streptococcus comprises important pathogens that have a severe impact on human health and are responsible for substantial economic losses to agriculture. Here, we utilize 46 Streptococcus genome sequences (44 species), including eight species sequenced here, to provide the first genomic level insight into the evolutionary history and genetic basis underlying the functional diversity of all major groups of this genus. Gene gain/loss analysis revealed a dynamic pattern of genome evolution characterized by an initial period of gene gain followed by a period of loss, as the major groups within the genus diversified. This was followed by a period of genome expansion associated with the origins of the present extant species. The pattern is concordant with an emerging view that genomes evolve through a dynamic process of expansion and streamlining. A large proportion of the pan-genome has experienced lateral gene transfer (LGT) with causative factors, such as relatedness and shared environment, operating over different evolutionary scales. Multiple gene ontology terms were significantly enriched for each group, and mapping terms onto the phylogeny showed that those corresponding to genes born on branches leading to the major groups represented approximately one-fifth of those enriched. Furthermore, despite the extensive LGT, several biochemical characteristics have been retained since group formation, suggesting genomic cohesiveness through time, and that these characteristics may be fundamental to each group. For example, proteolysis: mitis group; urea metabolism: salivarius group; carbohydrate metabolism: pyogenic group; and transcription regulation: bovis group. PMID:24625962
Rates of genomic divergence in humans, chimpanzees and their lice.
Johnson, Kevin P; Allen, Julie M; Olds, Brett P; Mugisha, Lawrence; Reed, David L; Paige, Ken N; Pittendrigh, Barry R
2014-02-22
The rate of DNA mutation and divergence is highly variable across the tree of life. However, the reasons underlying this variation are not well understood. Comparing the rates of genetic changes between hosts and parasite lineages that diverged at the same time is one way to begin to understand differences in genetic mutation and substitution rates. Such studies have indicated that the rate of genetic divergence in parasites is often faster than that of their hosts when comparing single genes. However, the variation in this relative rate of molecular evolution across different genes in the genome is unknown. We compared the rate of DNA sequence divergence between humans, chimpanzees and their ectoparasitic lice for 1534 protein-coding genes across their genomes. The rate of DNA substitution in these orthologous genes was on average 14 times faster for lice than for humans and chimpanzees. In addition, these rates were positively correlated across genes. Because this correlation only occurred for substitutions that changed the amino acid, this pattern is probably produced by similar functional constraints across the same genes in humans, chimpanzees and their ectoparasites.
Rates of genomic divergence in humans, chimpanzees and their lice
Johnson, Kevin P.; Allen, Julie M.; Olds, Brett P.; Mugisha, Lawrence; Reed, David L.; Paige, Ken N.; Pittendrigh, Barry R.
2014-01-01
The rate of DNA mutation and divergence is highly variable across the tree of life. However, the reasons underlying this variation are not well understood. Comparing the rates of genetic changes between hosts and parasite lineages that diverged at the same time is one way to begin to understand differences in genetic mutation and substitution rates. Such studies have indicated that the rate of genetic divergence in parasites is often faster than that of their hosts when comparing single genes. However, the variation in this relative rate of molecular evolution across different genes in the genome is unknown. We compared the rate of DNA sequence divergence between humans, chimpanzees and their ectoparasitic lice for 1534 protein-coding genes across their genomes. The rate of DNA substitution in these orthologous genes was on average 14 times faster for lice than for humans and chimpanzees. In addition, these rates were positively correlated across genes. Because this correlation only occurred for substitutions that changed the amino acid, this pattern is probably produced by similar functional constraints across the same genes in humans, chimpanzees and their ectoparasites. PMID:24403325
Weill, Jean-Claude; Radman, Miroslav
2004-01-01
Our genome has evolved to perpetuate itself through the maintenance of the species via an uninterrupted chain of reproductive somas. Accordingly, evolution is not concerned with diseases occurring after the soma's reproductive stage. Following Richard Dawkins, we would like to reassert that we indeed live as disposable somas, slaves of our germline genome, but could soon start rebelling against such slavery. Cancer and its relation to the TP53 gene may offer a paradigmatic example. The observation that the latency period in cancer can be prolonged in mice by increasing the number of TP53 genes in their genome, suggests that sooner or later we will have to address the question of heritable disease avoidance via the manipulation of the human germline. PMID:15065661
Genome Studies on Nematophagous and Entomogenous Fungi in China
Zhang, Weiwei; Cheng, Xiaoli; Liu, Xingzhong; Xiang, Meichun
2016-01-01
The nematophagous and entomogenous fungi are natural enemies of nematodes and insects and have been utilized by humans to control agricultural and forestry pests. Some of these fungi have been or are being developed as biological control agents in China and worldwide. Several important nematophagous and entomogenous fungi, including nematode-trapping fungi (Arthrobotrys oligospora and Drechslerella stenobrocha), nematode endoparasite (Hirsutella minnesotensis), insect pathogens (Beauveria bassiana and Metarhizium spp.) and Chinese medicinal fungi (Ophiocordyceps sinensis and Cordyceps militaris), have been genome sequenced and extensively analyzed in China. The biology, evolution, and pharmaceutical application of these fungi and their interacting with host nematodes and insects revealed by genomes, comparing genomes coupled with transcriptomes are summarized and reviewed in this paper. PMID:29376926
de la Fuente, José; Díez-Delgado, Iratxe; Contreras, Marinela; Vicente, Joaquín; Cabezas-Cruz, Alejandro; Tobes, Raquel; Manrique, Marina; López, Vladimir; Romero, Beatriz; Bezos, Javier; Dominguez, Lucas; Sevilla, Iker A.; Garrido, Joseba M.; Juste, Ramón; Madico, Guillermo; Jones-López, Edward; Gortazar, Christian
2015-01-01
Mycobacteria of the Mycobacterium tuberculosis complex (MTBC) greatly affect humans and animals worldwide. The life cycle of mycobacteria is complex and the mechanisms resulting in pathogen infection and survival in host cells are not fully understood. Recently, comparative genomics analyses have provided new insights into the evolution and adaptation of the MTBC to survive inside the host. However, most of this information has been obtained using M. tuberculosis but not other members of the MTBC such as M. bovis and M. caprae. In this study, the genome of three M. bovis (MB1, MB3, MB4) and one M. caprae (MB2) field isolates with different lesion score, prevalence and host distribution phenotypes were sequenced. Genome sequence information was used for whole-genome and protein-targeted comparative genomics analysis with the aim of finding correlates with phenotypic variation with potential implications for tuberculosis (TB) disease risk assessment and control. At the whole-genome level the results of the first comparative genomics study of field isolates of M. bovis including M. caprae showed that as previously reported for M. tuberculosis, sequential chromosomal nucleotide substitutions were the main driver of the M. bovis genome evolution. The phylogenetic analysis provided a strong support for the M. bovis/M. caprae clade, but supported M. caprae as a separate species. The comparison of the MB1 and MB4 isolates revealed differences in genome sequence, including gene families that are important for bacterial infection and transmission, thus highlighting differences with functional implications between isolates otherwise classified with the same spoligotype. Strategic protein-targeted analysis using the ESX or type VII secretion system, proteins linking stress response with lipid metabolism, host T cell epitopes of mycobacteria, antigens and peptidoglycan assembly protein identified new genetic markers and candidate vaccine antigens that warrant further study to develop tools to evaluate risks for TB disease caused by M. bovis/M.caprae and for TB control in humans and animals. PMID:26583774
Genomics and Evolution in Traditional Medicinal Plants: Road to a Healthier Life
Hao, Da-Cheng; Xiao, Pei-Gen
2015-01-01
Medicinal plants have long been utilized in traditional medicine and ethnomedicine worldwide. This review presents a glimpse of the current status of and future trends in medicinal plant genomics, evolution, and phylogeny. These dynamic fields are at the intersection of phytochemistry and plant biology and are concerned with the evolution mechanisms and systematics of medicinal plant genomes, origin and evolution of the plant genotype and metabolic phenotype, interaction between medicinal plant genomes and their environment, the correlation between genomic diversity and metabolite diversity, and so on. Use of the emerging high-end genomic technologies can be expanded from crop plants to traditional medicinal plants, in order to expedite medicinal plant breeding and transform them into living factories of medicinal compounds. The utility of molecular phylogeny and phylogenomics in predicting chemodiversity and bioprospecting is also highlighted within the context of natural-product-based drug discovery and development. Representative case studies of medicinal plant genome, phylogeny, and evolution are summarized to exemplify the expansion of knowledge pedigree and the paradigm shift to the omics-based approaches, which update our awareness about plant genome evolution and enable the molecular breeding of medicinal plants and the sustainable utilization of plant pharmaceutical resources. PMID:26461812
Genomics and Evolution in Traditional Medicinal Plants: Road to a Healthier Life.
Hao, Da-Cheng; Xiao, Pei-Gen
2015-01-01
Medicinal plants have long been utilized in traditional medicine and ethnomedicine worldwide. This review presents a glimpse of the current status of and future trends in medicinal plant genomics, evolution, and phylogeny. These dynamic fields are at the intersection of phytochemistry and plant biology and are concerned with the evolution mechanisms and systematics of medicinal plant genomes, origin and evolution of the plant genotype and metabolic phenotype, interaction between medicinal plant genomes and their environment, the correlation between genomic diversity and metabolite diversity, and so on. Use of the emerging high-end genomic technologies can be expanded from crop plants to traditional medicinal plants, in order to expedite medicinal plant breeding and transform them into living factories of medicinal compounds. The utility of molecular phylogeny and phylogenomics in predicting chemodiversity and bioprospecting is also highlighted within the context of natural-product-based drug discovery and development. Representative case studies of medicinal plant genome, phylogeny, and evolution are summarized to exemplify the expansion of knowledge pedigree and the paradigm shift to the omics-based approaches, which update our awareness about plant genome evolution and enable the molecular breeding of medicinal plants and the sustainable utilization of plant pharmaceutical resources.
The Small Nuclear Genomes of Selaginella Are Associated with a Low Rate of Genome Size Evolution.
Baniaga, Anthony E; Arrigo, Nils; Barker, Michael S
2016-06-03
The haploid nuclear genome size (1C DNA) of vascular land plants varies over several orders of magnitude. Much of this observed diversity in genome size is due to the proliferation and deletion of transposable elements. To date, all vascular land plant lineages with extremely small nuclear genomes represent recently derived states, having ancestors with much larger genome sizes. The Selaginellaceae represent an ancient lineage with extremely small genomes. It is unclear how small nuclear genomes evolved in Selaginella We compared the rates of nuclear genome size evolution in Selaginella and major vascular plant clades in a comparative phylogenetic framework. For the analyses, we collected 29 new flow cytometry estimates of haploid genome size in Selaginella to augment publicly available data. Selaginella possess some of the smallest known haploid nuclear genome sizes, as well as the lowest rate of genome size evolution observed across all vascular land plants included in our analyses. Additionally, our analyses provide strong support for a history of haploid nuclear genome size stasis in Selaginella Our results indicate that Selaginella, similar to other early diverging lineages of vascular land plants, has relatively low rates of genome size evolution. Further, our analyses highlight that a rapid transition to a small genome size is only one route to an extremely small genome. © The Author 2016. Published by Oxford University Press on behalf of the Society for Molecular Biology and Evolution.
2009 Epigenetics Gordon Research Conference (August 9 - 14, 2009)
DOE Office of Scientific and Technical Information (OSTI.GOV)
Jeanie Lee
Epigenetics refers to the study of heritable changes in genome function that occur without a change in primary DNA sequence. The 2009 Gordon Conference in Epigenetics will feature discussion of various epigenetic phenomena, emerging understanding of their underlying mechanisms, and the growing appreciation that human, animal, and plant health all depend on proper epigenetic control. Special emphasis will be placed on genome-environment interactions particularly as they relate to human disease. Towards improving knowledge of molecular mechanisms, the conference will feature international leaders studying the roles of higher order chromatin structure, noncoding RNA, repeat elements, nuclear organization, and morphogenic evolution. Traditionalmore » and new model organisms are selected from plants, fungi, and metazoans.« less
Iacobuzio-Donahue, Christine A
2012-01-01
Pancreatic cancer is a disease caused by the accumulation of genetic alterations in specific genes. Elucidation of the human genome sequence, in conjunction with technical advances in the ability to perform whole exome sequencing, have provided new insight into the mutational spectra characteristic of this lethal tumour type. Most recently, exomic sequencing has been used to clarify the clonal evolution of pancreatic cancer as well as provide time estimates of pancreatic carcinogenesis, indicating that a long window of opportunity may exist for early detection of this disease while in the curative stage. Moving forward, these mutational analyses indicate potential targets for personalised diagnostic and therapeutic intervention as well as the optimal timing for intervention based on the natural history of pancreatic carcinogenesis and progression. PMID:21749982
Evolution of substrate specificity in a retained enzyme driven by gene loss
Juárez-Vázquez, Ana Lilia; Edirisinghe, Janaka N; Verduzco-Castro, Ernesto A; Michalska, Karolina; Wu, Chenggang; Noda-García, Lianet; Babnigg, Gyorgy; Endres, Michael; Medina-Ruíz, Sofía; Santoyo-Flores, Julián; Carrillo-Tripp, Mauricio; Ton-That, Hung; Joachimiak, Andrzej; Henry, Christopher S; Barona-Gómez, Francisco
2017-01-01
The connection between gene loss and the functional adaptation of retained proteins is still poorly understood. We apply phylogenomics and metabolic modeling to detect bacterial species that are evolving by gene loss, with the finding that Actinomycetaceae genomes from human cavities are undergoing sizable reductions, including loss of L-histidine and L-tryptophan biosynthesis. We observe that the dual-substrate phosphoribosyl isomerase A or priA gene, at which these pathways converge, appears to coevolve with the occurrence of trp and his genes. Characterization of a dozen PriA homologs shows that these enzymes adapt from bifunctionality in the largest genomes, to a monofunctional, yet not necessarily specialized, inefficient form in genomes undergoing reduction. These functional changes are accomplished via mutations, which result from relaxation of purifying selection, in residues structurally mapped after sequence and X-ray structural analyses. Our results show how gene loss can drive the evolution of substrate specificity from retained enzymes. DOI: http://dx.doi.org/10.7554/eLife.22679.001 PMID:28362260
Evolution of Substrate Specificity in A Retained Enzyme Driven by Gene Loss
Juarez-Vazquez, Ana L.; Edirisinghe, Janaka N.; Verduzco-Castro, Ernesto A.; ...
2017-03-31
The connection between gene loss and the functional adaptation of retained proteins is still poorly understood. Here, we apply phylogenomics and metabolic modeling to detect bacterial species that are evolving by gene loss, with the finding that Actinomycetaceae genomes from human cavities are undergoing sizable reductions, including loss of L-histidine and L-tryptophan biosynthesis. We also observe that the dual-substrate phosphoribosyl isomerase A or priA gene, at which these pathways converge, appears to coevolve with the occurrence of trp and his genes. Characterization of a dozen PriA homologs shows that these enzymes adapt from bifunctionality in the largest genomes, to amore » monofunctional, yet not necessarily specialized, inefficient form in genomes undergoing reduction. These functional changes are accomplished via mutations, which result from relaxation of purifying selection, in residues structurally mapped after sequence and X-ray structural analyses. These results show how gene loss can drive the evolution of substrate specificity from retained enzymes.« less
Evolution of substrate specificity in a retained enzyme driven by gene loss
Juárez-Vázquez, Ana Lilia; Edirisinghe, Janaka N.; Verduzco-Castro, Ernesto A.; ...
2017-03-31
The connection between gene loss and the functional adaptation of retained proteins is still poorly understood. We apply phylogenomics and metabolic modeling to detect bacterial species that are evolving by gene loss, with the finding that Actinomycetaceae genomes from human cavities are undergoing sizable reductions, including loss of L-histidine and L-tryptophan biosynthesis. We observe that the dual-substrate phosphoribosyl isomerase A or priA gene, at which these pathways converge, appears to coevolve with the occurrence oftrpandhisgenes. Characterization of a dozen PriA homologs shows that these enzymes adapt from bifunctionality in the largest genomes, to a monofunctional, yet not necessarily specialized, inefficientmore » form in genomes undergoing reduction. These functional changes are accomplished via mutations, which result from relaxation of purifying selection, in residues structurally mapped after sequence and X-ray structural analyses. Finally, our results show how gene loss can drive the evolution of substrate specificity from retained enzymes.« less
Evolution of substrate specificity in a retained enzyme driven by gene loss
DOE Office of Scientific and Technical Information (OSTI.GOV)
Juárez-Vázquez, Ana Lilia; Edirisinghe, Janaka N.; Verduzco-Castro, Ernesto A.
The connection between gene loss and the functional adaptation of retained proteins is still poorly understood. We apply phylogenomics and metabolic modeling to detect bacterial species that are evolving by gene loss, with the finding that Actinomycetaceae genomes from human cavities are undergoing sizable reductions, including loss of L-histidine and L-tryptophan biosynthesis. We observe that the dual-substrate phosphoribosyl isomerase A or priA gene, at which these pathways converge, appears to coevolve with the occurrence oftrpandhisgenes. Characterization of a dozen PriA homologs shows that these enzymes adapt from bifunctionality in the largest genomes, to a monofunctional, yet not necessarily specialized, inefficientmore » form in genomes undergoing reduction. These functional changes are accomplished via mutations, which result from relaxation of purifying selection, in residues structurally mapped after sequence and X-ray structural analyses. Finally, our results show how gene loss can drive the evolution of substrate specificity from retained enzymes.« less
Evolution of Substrate Specificity in A Retained Enzyme Driven by Gene Loss
DOE Office of Scientific and Technical Information (OSTI.GOV)
Juarez-Vazquez, Ana L.; Edirisinghe, Janaka N.; Verduzco-Castro, Ernesto A.
The connection between gene loss and the functional adaptation of retained proteins is still poorly understood. Here, we apply phylogenomics and metabolic modeling to detect bacterial species that are evolving by gene loss, with the finding that Actinomycetaceae genomes from human cavities are undergoing sizable reductions, including loss of L-histidine and L-tryptophan biosynthesis. We also observe that the dual-substrate phosphoribosyl isomerase A or priA gene, at which these pathways converge, appears to coevolve with the occurrence of trp and his genes. Characterization of a dozen PriA homologs shows that these enzymes adapt from bifunctionality in the largest genomes, to amore » monofunctional, yet not necessarily specialized, inefficient form in genomes undergoing reduction. These functional changes are accomplished via mutations, which result from relaxation of purifying selection, in residues structurally mapped after sequence and X-ray structural analyses. These results show how gene loss can drive the evolution of substrate specificity from retained enzymes.« less
Hofman, Courtney A; Rick, Torben C; Hawkins, Melissa T R; Funk, W Chris; Ralls, Katherine; Boser, Christina L; Collins, Paul W; Coonan, Tim; King, Julie L; Morrison, Scott A; Newsome, Seth D; Sillett, T Scott; Fleischer, Robert C; Maldonado, Jesus E
2015-01-01
Island endemics are typically differentiated from their mainland progenitors in behavior, morphology, and genetics, often resulting from long-term evolutionary change. To examine mechanisms for the origins of island endemism, we present a phylogeographic analysis of whole mitochondrial genomes from the endangered island fox (Urocyon littoralis), endemic to California's Channel Islands, and mainland gray foxes (U. cinereoargenteus). Previous genetic studies suggested that foxes first appeared on the islands >16,000 years ago, before human arrival (~13,000 cal BP), while archaeological and paleontological data supported a colonization >7000 cal BP. Our results are consistent with initial fox colonization of the northern islands probably by rafting or human introduction ~9200-7100 years ago, followed quickly by human translocation of foxes from the northern to southern Channel Islands. Mitogenomes indicate that island foxes are monophyletic and most closely related to gray foxes from northern California that likely experienced a Holocene climate-induced range shift. Our data document rapid morphological evolution of island foxes (in ~2000 years or less). Despite evidence for bottlenecks, island foxes have generated and maintained multiple mitochondrial haplotypes. This study highlights the intertwined evolutionary history of island foxes and humans, and illustrates a new approach for investigating the evolutionary histories of other island endemics.
Hofman, Courtney A.; Rick, Torben C.; Hawkins, Melissa T. R.; Funk, W. Chris; Ralls, Katherine; Boser, Christina L.; Collins, Paul W.; Coonan, Tim; King, Julie L.; Morrison, Scott A.; Newsome, Seth D.; Sillett, T. Scott; Fleischer, Robert C.; Maldonado, Jesus E.
2015-01-01
Island endemics are typically differentiated from their mainland progenitors in behavior, morphology, and genetics, often resulting from long-term evolutionary change. To examine mechanisms for the origins of island endemism, we present a phylogeographic analysis of whole mitochondrial genomes from the endangered island fox (Urocyon littoralis), endemic to California’s Channel Islands, and mainland gray foxes (U. cinereoargenteus). Previous genetic studies suggested that foxes first appeared on the islands >16,000 years ago, before human arrival (~13,000 cal BP), while archaeological and paleontological data supported a colonization >7000 cal BP. Our results are consistent with initial fox colonization of the northern islands probably by rafting or human introduction ~9200–7100 years ago, followed quickly by human translocation of foxes from the northern to southern Channel Islands. Mitogenomes indicate that island foxes are monophyletic and most closely related to gray foxes from northern California that likely experienced a Holocene climate-induced range shift. Our data document rapid morphological evolution of island foxes (in ~2000 years or less). Despite evidence for bottlenecks, island foxes have generated and maintained multiple mitochondrial haplotypes. This study highlights the intertwined evolutionary history of island foxes and humans, and illustrates a new approach for investigating the evolutionary histories of other island endemics. PMID:25714775
Patterns and processes of Mycobacterium bovis evolution revealed by phylogenomic analyses
USDA-ARS?s Scientific Manuscript database
Mycobacterium bovis is an important animal pathogen worldwide that parasitizes wild and domesticated vertebrate livestock as well as humans. A comparison of the five M. bovis complete genomes from UK, South Korea, Brazil and USA revealed four novel large-scale structural variations of at least 2,000...
77 FR 12605 - Center for Scientific Review; Notice of Closed Meetings
Federal Register 2010, 2011, 2012, 2013, 2014
2012-03-01
... Special Emphasis Panel; Studies of variation, evolution, genomics with statistical and molecular Methods... DEPARTMENT OF HEALTH AND HUMAN SERVICES National Institutes of Health Center for Scientific Review; Notice of Closed Meetings Pursuant to section 10(d) of the Federal Advisory Committee Act, as amended (5...
The unusual reproductive system of head and body lice (Pediculus humanus).
DE LA Filia, A G; Andrewes, S; Clark, J M; Ross, L
2018-06-01
Insect reproduction is extremely variable, but the implications of alternative genetic systems are often overlooked in studies on the evolution of insecticide resistance. Both ecotypes of Pediculus humanus (Phthiraptera: Pediculidae), the human head and body lice, are human ectoparasites, the control of which is challenged by the recent spread of resistance alleles. The present study conclusively establishes for the first time that both head and body lice reproduce through paternal genome elimination (PGE), an unusual genetic system in which males transmit only their maternally derived chromosomes. Here, we investigate inheritance patterns of parental genomes using a genotyping approach across families of both ecotypes and show that heterozygous males exclusively or preferentially pass on one allele only, whereas females transmit both in a Mendelian fashion. We do however observe occasional transmission of paternal chromosomes through males, representing the first known case of PGE in which whole-genome meiotic drive is incomplete. Finally, we discuss the potential implications of this finding for the evolution of resistance and invite the development of new theoretical models of how this knowledge might contribute to increasing the success of pediculicide-based management schemes. © 2017 The Authors. Medical and Veterinary Entomology published by John Wiley & Sons Ltd on behalf of Royal Entomological Society.
Human Xq28 inversion polymorphism: From sex linkage to Genomics--A genetic mother lode.
Kirby, Cait S; Kolber, Natalie; Salih Almohaidi, Asmaa M; Bierwert, Lou Ann; Saunders, Lori; Williams, Steven; Merritt, Robert
2016-01-01
An inversion polymorphism of the filamin and emerin genes at the tip of the long arm of the human X-chromosome serves as the basis of an investigative laboratory in which students learn something new about their own genomes. Long, nearly identical inverted repeats flanking the filamin and emerin genes illustrate how repetitive elements can lead to alterations in genome structure (inversions) through nonallelic homologous recombination. The near identity of the inverted repeats is an example of concerted evolution through gene conversion. While the laboratory in its entirety is designed for college level genetics courses, portions of the laboratory are appropriate for courses at other levels. Because the polymorphism is on the X-chromosome, the laboratory can be used in introductory biology courses to enhance understanding of sex-linkage and to test for Hardy-Weinberg equilibrium in females. More advanced topics, such as chromosome interference, the molecular model for recombination, and inversion heterozygosity suppression of recombination can be explored in upper-level genetics and evolution courses. DNA isolation, restriction digests, ligation, long PCR, and iPCR provide experience with techniques in molecular biology. This investigative laboratory weaves together topics stretching from molecular genetics to cytogenetics and sex-linkage, population genetics and evolutionary genetics. © 2016 The International Union of Biochemistry and Molecular Biology.
Epigenetic functions enriched in transcription factors binding to mouse recombination hotspots.
Wu, Min; Kwoh, Chee-Keong; Przytycka, Teresa M; Li, Jing; Zheng, Jie
2012-06-21
The regulatory mechanism of recombination is a fundamental problem in genomics, with wide applications in genome-wide association studies, birth-defect diseases, molecular evolution, cancer research, etc. In mammalian genomes, recombination events cluster into short genomic regions called "recombination hotspots". Recently, a 13-mer motif enriched in hotspots is identified as a candidate cis-regulatory element of human recombination hotspots; moreover, a zinc finger protein, PRDM9, binds to this motif and is associated with variation of recombination phenotype in human and mouse genomes, thus is a trans-acting regulator of recombination hotspots. However, this pair of cis and trans-regulators covers only a fraction of hotspots, thus other regulators of recombination hotspots remain to be discovered. In this paper, we propose an approach to predicting additional trans-regulators from DNA-binding proteins by comparing their enrichment of binding sites in hotspots. Applying this approach on newly mapped mouse hotspots genome-wide, we confirmed that PRDM9 is a major trans-regulator of hotspots. In addition, a list of top candidate trans-regulators of mouse hotspots is reported. Using GO analysis we observed that the top genes are enriched with function of histone modification, highlighting the epigenetic regulatory mechanisms of recombination hotspots.
Epigenetic functions enriched in transcription factors binding to mouse recombination hotspots
2012-01-01
The regulatory mechanism of recombination is a fundamental problem in genomics, with wide applications in genome-wide association studies, birth-defect diseases, molecular evolution, cancer research, etc. In mammalian genomes, recombination events cluster into short genomic regions called "recombination hotspots". Recently, a 13-mer motif enriched in hotspots is identified as a candidate cis-regulatory element of human recombination hotspots; moreover, a zinc finger protein, PRDM9, binds to this motif and is associated with variation of recombination phenotype in human and mouse genomes, thus is a trans-acting regulator of recombination hotspots. However, this pair of cis and trans-regulators covers only a fraction of hotspots, thus other regulators of recombination hotspots remain to be discovered. In this paper, we propose an approach to predicting additional trans-regulators from DNA-binding proteins by comparing their enrichment of binding sites in hotspots. Applying this approach on newly mapped mouse hotspots genome-wide, we confirmed that PRDM9 is a major trans-regulator of hotspots. In addition, a list of top candidate trans-regulators of mouse hotspots is reported. Using GO analysis we observed that the top genes are enriched with function of histone modification, highlighting the epigenetic regulatory mechanisms of recombination hotspots. PMID:22759569
Pettigrew, Melinda M; Ahearn, Christian P; Gent, Janneane F; Kong, Yong; Gallo, Mary C; Munro, James B; D'Mello, Adonis; Sethi, Sanjay; Tettelin, Hervé; Murphy, Timothy F
2018-04-03
Nontypeable Haemophilus influenzae (NTHi) exclusively colonize and infect humans and are critical to the pathogenesis of chronic obstructive pulmonary disease (COPD). In vitro and animal models do not accurately capture the complex environments encountered by NTHi during human infection. We conducted whole-genome sequencing of 269 longitudinally collected cleared and persistent NTHi from a 15-y prospective study of adults with COPD. Genome sequences were used to elucidate the phylogeny of NTHi isolates, identify genomic changes that occur with persistence in the human airways, and evaluate the effect of selective pressure on 12 candidate vaccine antigens. Strains persisted in individuals with COPD for as long as 1,422 d. Slipped-strand mispairing, mediated by changes in simple sequence repeats in multiple genes during persistence, regulates expression of critical virulence functions, including adherence, nutrient uptake, and modification of surface molecules, and is a major mechanism for survival in the hostile environment of the human airways. A subset of strains underwent a large 400-kb inversion during persistence. NTHi does not undergo significant gene gain or loss during persistence, in contrast to other persistent respiratory tract pathogens. Amino acid sequence changes occurred in 8 of 12 candidate vaccine antigens during persistence, an observation with important implications for vaccine development. These results indicate that NTHi alters its genome during persistence by regulation of critical virulence functions primarily by slipped-strand mispairing, advancing our understanding of how a bacterial pathogen that plays a critical role in COPD adapts to survival in the human respiratory tract.
The Mitonuclear Dimension of Neanderthal and Denisovan Ancestry in Modern Human Genomes
Sharbrough, Joel; Havird, Justin C.; Noe, Gregory R.; Warren, Jessica M.
2017-01-01
Abstract Some human populations interbred with Neanderthals and Denisovans, resulting in substantial contributions to modern-human genomes. Therefore, it is now possible to use genomic data to investigate mechanisms that shaped historical gene flow between humans and our closest hominin relatives. More generally, in eukaryotes, mitonuclear interactions have been argued to play a disproportionate role in generating reproductive isolation. There is no evidence of mtDNA introgression into modern human populations, which means that all introgressed nuclear alleles from archaic hominins must function on a modern-human mitochondrial background. Therefore, mitonuclear interactions are also potentially relevant to hominin evolution. We performed a detailed accounting of mtDNA divergence among hominin lineages and used population-genomic data to test the hypothesis that mitonuclear incompatibilities have preferentially restricted the introgression of nuclear genes with mitochondrial functions. We found a small but significant underrepresentation of introgressed Neanderthal alleles at such nuclear loci. Structural analyses of mitochondrial enzyme complexes revealed that these effects are unlikely to be mediated by physically interacting sites in mitochondrial and nuclear gene products. We did not detect any underrepresentation of introgressed Denisovan alleles at mitochondrial-targeted loci, but this may reflect reduced power because locus-specific estimates of Denisovan introgression are more conservative. Overall, we conclude that genes involved in mitochondrial function may have been subject to distinct selection pressures during the history of introgression from archaic hominins but that mitonuclear incompatibilities have had, at most, a small role in shaping genome-wide introgression patterns, perhaps because of limited functional divergence in mtDNA and interacting nuclear genes. PMID:28854627
The noncoding human genome and the future of personalised medicine.
Cowie, Philip; Hay, Elizabeth A; MacKenzie, Alasdair
2015-01-30
Non-coding cis-regulatory sequences act as the 'eyes' of the genome and their role is to perceive, organise and relay cellular communication information to RNA polymerase II at gene promoters. The evolution of these sequences, that include enhancers, silencers, insulators and promoters, has progressed in multicellular organisms to the extent that cis-regulatory sequences make up as much as 10% of the human genome. Parallel evidence suggests that 75% of polymorphisms associated with heritable disease occur within predicted cis-regulatory sequences that effectively alter the 'perception' of cis-regulatory sequences or render them blind to cell communication cues. Cis-regulatory sequences also act as major functional targets of epigenetic modification thus representing an important conduit through which changes in DNA-methylation affects disease susceptibility. The objectives of the current review are (1) to describe what has been learned about identifying and characterising cis-regulatory sequences since the sequencing of the human genome; (2) to discuss their role in interpreting cell signalling pathways pathways; and (3) outline how this role may be altered by polymorphisms and epigenetic changes. We argue that the importance of the cis-regulatory genome for the interpretation of cellular communication pathways cannot be overstated and understanding its role in health and disease will be critical for the future development of personalised medicine.
Alnazawi, Mohamed; Altaher, Abdallah; Kandeel, Mahmoud
2017-01-01
Middle East Respiratory Syndrome Coronavirus (MERS CoV) is a new emerging viral disease characterized by high fatality rate. Understanding MERS CoV genetic aspects and codon usage pattern is important to understand MERS CoV survival, adaptation, evolution, resistance to innate immunity, and help in finding the unique aspects of the virus for future drug discovery experiments. In this work, we provide comprehensive analysis of 238 MERS CoV full genomes comprised of human (hMERS) and camel (cMERS) isolates of the virus. MERS CoV genome shaping seems to be under compositional and mutational bias, as revealed by preference of A/T over G/C nucleotides, preferred codons, nucleotides at the third position of codons (NT3s), relative synonymous codon usage, hydropathicity (Gravy), and aromaticity (Aromo) indices. Effective number of codons (ENc) analysis reveals a general slight codon usage bias. Codon adaptation index reveals incomplete adaptation to host environment. MERS CoV showed high ability to resist the innate immune response by showing lower CpG frequencies. Neutrality evolution analysis revealed a more significant role of mutation pressure in cMERS over hMERS. Correspondence analysis revealed that MERS CoV genomes have three genetic clusters, which were distinct in their codon usage, host, and geographic distribution. Additionally, virtual screening and binding experiments were able to identify three new virus-encoded helicase binding compounds. These compounds can be used for further optimization of inhibitors.
Genome sequence, comparative analysis and haplotype structure of the domestic dog.
Lindblad-Toh, Kerstin; Wade, Claire M; Mikkelsen, Tarjei S; Karlsson, Elinor K; Jaffe, David B; Kamal, Michael; Clamp, Michele; Chang, Jean L; Kulbokas, Edward J; Zody, Michael C; Mauceli, Evan; Xie, Xiaohui; Breen, Matthew; Wayne, Robert K; Ostrander, Elaine A; Ponting, Chris P; Galibert, Francis; Smith, Douglas R; DeJong, Pieter J; Kirkness, Ewen; Alvarez, Pablo; Biagi, Tara; Brockman, William; Butler, Jonathan; Chin, Chee-Wye; Cook, April; Cuff, James; Daly, Mark J; DeCaprio, David; Gnerre, Sante; Grabherr, Manfred; Kellis, Manolis; Kleber, Michael; Bardeleben, Carolyne; Goodstadt, Leo; Heger, Andreas; Hitte, Christophe; Kim, Lisa; Koepfli, Klaus-Peter; Parker, Heidi G; Pollinger, John P; Searle, Stephen M J; Sutter, Nathan B; Thomas, Rachael; Webber, Caleb; Baldwin, Jennifer; Abebe, Adal; Abouelleil, Amr; Aftuck, Lynne; Ait-Zahra, Mostafa; Aldredge, Tyler; Allen, Nicole; An, Peter; Anderson, Scott; Antoine, Claudel; Arachchi, Harindra; Aslam, Ali; Ayotte, Laura; Bachantsang, Pasang; Barry, Andrew; Bayul, Tashi; Benamara, Mostafa; Berlin, Aaron; Bessette, Daniel; Blitshteyn, Berta; Bloom, Toby; Blye, Jason; Boguslavskiy, Leonid; Bonnet, Claude; Boukhgalter, Boris; Brown, Adam; Cahill, Patrick; Calixte, Nadia; Camarata, Jody; Cheshatsang, Yama; Chu, Jeffrey; Citroen, Mieke; Collymore, Alville; Cooke, Patrick; Dawoe, Tenzin; Daza, Riza; Decktor, Karin; DeGray, Stuart; Dhargay, Norbu; Dooley, Kimberly; Dooley, Kathleen; Dorje, Passang; Dorjee, Kunsang; Dorris, Lester; Duffey, Noah; Dupes, Alan; Egbiremolen, Osebhajajeme; Elong, Richard; Falk, Jill; Farina, Abderrahim; Faro, Susan; Ferguson, Diallo; Ferreira, Patricia; Fisher, Sheila; FitzGerald, Mike; Foley, Karen; Foley, Chelsea; Franke, Alicia; Friedrich, Dennis; Gage, Diane; Garber, Manuel; Gearin, Gary; Giannoukos, Georgia; Goode, Tina; Goyette, Audra; Graham, Joseph; Grandbois, Edward; Gyaltsen, Kunsang; Hafez, Nabil; Hagopian, Daniel; Hagos, Birhane; Hall, Jennifer; Healy, Claire; Hegarty, Ryan; Honan, Tracey; Horn, Andrea; Houde, Nathan; Hughes, Leanne; Hunnicutt, Leigh; Husby, M; Jester, Benjamin; Jones, Charlien; Kamat, Asha; Kanga, Ben; Kells, Cristyn; Khazanovich, Dmitry; Kieu, Alix Chinh; Kisner, Peter; Kumar, Mayank; Lance, Krista; Landers, Thomas; Lara, Marcia; Lee, William; Leger, Jean-Pierre; Lennon, Niall; Leuper, Lisa; LeVine, Sarah; Liu, Jinlei; Liu, Xiaohong; Lokyitsang, Yeshi; Lokyitsang, Tashi; Lui, Annie; Macdonald, Jan; Major, John; Marabella, Richard; Maru, Kebede; Matthews, Charles; McDonough, Susan; Mehta, Teena; Meldrim, James; Melnikov, Alexandre; Meneus, Louis; Mihalev, Atanas; Mihova, Tanya; Miller, Karen; Mittelman, Rachel; Mlenga, Valentine; Mulrain, Leonidas; Munson, Glen; Navidi, Adam; Naylor, Jerome; Nguyen, Tuyen; Nguyen, Nga; Nguyen, Cindy; Nguyen, Thu; Nicol, Robert; Norbu, Nyima; Norbu, Choe; Novod, Nathaniel; Nyima, Tenchoe; Olandt, Peter; O'Neill, Barry; O'Neill, Keith; Osman, Sahal; Oyono, Lucien; Patti, Christopher; Perrin, Danielle; Phunkhang, Pema; Pierre, Fritz; Priest, Margaret; Rachupka, Anthony; Raghuraman, Sujaa; Rameau, Rayale; Ray, Verneda; Raymond, Christina; Rege, Filip; Rise, Cecil; Rogers, Julie; Rogov, Peter; Sahalie, Julie; Settipalli, Sampath; Sharpe, Theodore; Shea, Terrance; Sheehan, Mechele; Sherpa, Ngawang; Shi, Jianying; Shih, Diana; Sloan, Jessie; Smith, Cherylyn; Sparrow, Todd; Stalker, John; Stange-Thomann, Nicole; Stavropoulos, Sharon; Stone, Catherine; Stone, Sabrina; Sykes, Sean; Tchuinga, Pierre; Tenzing, Pema; Tesfaye, Senait; Thoulutsang, Dawa; Thoulutsang, Yama; Topham, Kerri; Topping, Ira; Tsamla, Tsamla; Vassiliev, Helen; Venkataraman, Vijay; Vo, Andy; Wangchuk, Tsering; Wangdi, Tsering; Weiand, Michael; Wilkinson, Jane; Wilson, Adam; Yadav, Shailendra; Yang, Shuli; Yang, Xiaoping; Young, Geneva; Yu, Qing; Zainoun, Joanne; Zembek, Lisa; Zimmer, Andrew; Lander, Eric S
2005-12-08
Here we report a high-quality draft genome sequence of the domestic dog (Canis familiaris), together with a dense map of single nucleotide polymorphisms (SNPs) across breeds. The dog is of particular interest because it provides important evolutionary information and because existing breeds show great phenotypic diversity for morphological, physiological and behavioural traits. We use sequence comparison with the primate and rodent lineages to shed light on the structure and evolution of genomes and genes. Notably, the majority of the most highly conserved non-coding sequences in mammalian genomes are clustered near a small subset of genes with important roles in development. Analysis of SNPs reveals long-range haplotypes across the entire dog genome, and defines the nature of genetic diversity within and across breeds. The current SNP map now makes it possible for genome-wide association studies to identify genes responsible for diseases and traits, with important consequences for human and companion animal health.
Primate-specific evolution of noncoding element insertion into PLA2G4C and human preterm birth
2010-01-01
Background The onset of birth in humans, like other apes, differs from non-primate mammals in its endocrine physiology. We hypothesize that higher primate-specific gene evolution may lead to these differences and target genes involved in human preterm birth, an area of global health significance. Methods We performed a comparative genomics screen of highly conserved noncoding elements and identified PLA2G4C, a phospholipase A isoform involved in prostaglandin biosynthesis as human accelerated. To examine whether this gene demonstrating primate-specific evolution was associated with birth timing, we genotyped and analyzed 8 common single nucleotide polymorphisms (SNPs) in PLA2G4C in US Hispanic (n = 73 preterm, 292 control), US White (n = 147 preterm, 157 control) and US Black (n = 79 preterm, 166 control) mothers. Results Detailed structural and phylogenic analysis of PLA2G4C suggested a short genomic element within the gene duplicated from a paralogous highly conserved element on chromosome 1 specifically in primates. SNPs rs8110925 and rs2307276 in US Hispanics and rs11564620 in US Whites were significant after correcting for multiple tests (p < 0.006). Additionally, rs11564620 (Thr360Pro) was associated with increased metabolite levels of the prostaglandin thromboxane in healthy individuals (p = 0.02), suggesting this variant may affect PLA2G4C activity. Conclusions Our findings suggest that variation in PLA2G4C may influence preterm birth risk by increasing levels of prostaglandins, which are known to regulate labor. PMID:21184677
Gene-culture coevolution in the age of genomics
Richerson, Peter J.; Boyd, Robert; Henrich, Joseph
2010-01-01
The use of socially learned information (culture) is central to human adaptations. We investigate the hypothesis that the process of cultural evolution has played an active, leading role in the evolution of genes. Culture normally evolves more rapidly than genes, creating novel environments that expose genes to new selective pressures. Many human genes that have been shown to be under recent or current selection are changing as a result of new environments created by cultural innovations. Some changed in response to the development of agricultural subsistence systems in the Early and Middle Holocene. Alleles coding for adaptations to diets rich in plant starch (e.g., amylase copy number) and to epidemic diseases evolved as human populations expanded (e.g., sickle cell and G6PD deficiency alleles that provide protection against malaria). Large-scale scans using patterns of linkage disequilibrium to detect recent selection suggest that many more genes evolved in response to agriculture. Genetic change in response to the novel social environment of contemporary modern societies is also likely to be occurring. The functional effects of most of the alleles under selection during the last 10,000 years are currently unknown. Also unknown is the role of paleoenvironmental change in regulating the tempo of hominin evolution. Although the full extent of culture-driven gene-culture coevolution is thus far unknown for the deeper history of the human lineage, theory and some evidence suggest that such effects were profound. Genomic methods promise to have a major impact on our understanding of gene-culture coevolution over the span of hominin evolutionary history. PMID:20445092
An Exact Algorithm to Compute the Double-Cut-and-Join Distance for Genomes with Duplicate Genes.
Shao, Mingfu; Lin, Yu; Moret, Bernard M E
2015-05-01
Computing the edit distance between two genomes is a basic problem in the study of genome evolution. The double-cut-and-join (DCJ) model has formed the basis for most algorithmic research on rearrangements over the last few years. The edit distance under the DCJ model can be computed in linear time for genomes without duplicate genes, while the problem becomes NP-hard in the presence of duplicate genes. In this article, we propose an integer linear programming (ILP) formulation to compute the DCJ distance between two genomes with duplicate genes. We also provide an efficient preprocessing approach to simplify the ILP formulation while preserving optimality. Comparison on simulated genomes demonstrates that our method outperforms MSOAR in computing the edit distance, especially when the genomes contain long duplicated segments. We also apply our method to assign orthologous gene pairs among human, mouse, and rat genomes, where once again our method outperforms MSOAR.
2013-01-01
Background Lyme disease is caused by spirochete bacteria from the Borrelia burgdorferi sensu lato (B. burgdorferi s.l.) species complex. To reconstruct the evolution of B. burgdorferi s.l. and identify the genomic basis of its human virulence, we compared the genomes of 23 B. burgdorferi s.l. isolates from Europe and the United States, including B. burgdorferi sensu stricto (B. burgdorferi s.s., 14 isolates), B. afzelii (2), B. garinii (2), B. “bavariensis” (1), B. spielmanii (1), B. valaisiana (1), B. bissettii (1), and B. “finlandensis” (1). Results Robust B. burgdorferi s.s. and B. burgdorferi s.l. phylogenies were obtained using genome-wide single-nucleotide polymorphisms, despite recombination. Phylogeny-based pan-genome analysis showed that the rate of gene acquisition was higher between species than within species, suggesting adaptive speciation. Strong positive natural selection drives the sequence evolution of lipoproteins, including chromosomally-encoded genes 0102 and 0404, cp26-encoded ospC and b08, and lp54-encoded dbpA, a07, a22, a33, a53, a65. Computer simulations predicted rapid adaptive radiation of genomic groups as population size increases. Conclusions Intra- and inter-specific pan-genome sizes of B. burgdorferi s.l. expand linearly with phylogenetic diversity. Yet gene-acquisition rates in B. burgdorferi s.l. are among the lowest in bacterial pathogens, resulting in high genome stability and few lineage-specific genes. Genome adaptation of B. burgdorferi s.l. is driven predominantly by copy-number and sequence variations of lipoprotein genes. New genomic groups are likely to emerge if the current trend of B. burgdorferi s.l. population expansion continues. PMID:24112474
Genomic anatomy of Escherichia coli O157:H7 outbreaks
Eppinger, Mark; Mammel, Mark K.; Leclerc, Joseph E.; Ravel, Jacques; Cebula, Thomas A.
2011-01-01
The rapid emergence of Escherichia coli O157:H7 from an unknown strain in 1982 to the dominant hemorrhagic E. coli serotype in the United States and the cause of widespread outbreaks of human food-borne illness highlights a need to evaluate critically the extent to which genomic plasticity of this important enteric pathogen contributes to its pathogenic potential and its evolution as well as its adaptation in different ecological niches. Aimed at a better understanding of the evolution of the E. coli O157:H7 pathogenome, the present study presents the high-quality sequencing and comparative phylogenomic analysis of a comprehensive panel of 25 E. coli O157:H7 strains associated with three nearly simultaneous food-borne outbreaks of human disease in the United States. Here we present a population genetic analysis of more than 200 related strains recovered from patients, contaminated produce, and zoonotic sources. High-resolution phylogenomic approaches allow the dynamics of pathogenome evolution to be followed at a high level of phylogenetic accuracy and resolution. SNP discovery and study of genome architecture and prophage content identified numerous biomarkers to assess the extent of genetic diversity within a set of clinical and environmental strains. A total of 1,225 SNPs were identified in the present study and are now available for typing of the E. coli O157:H7 lineage. These data should prove useful for the development of a refined phylogenomic framework for forensic, diagnostic, and epidemiological studies to define better risk in response to novel and emerging E. coli O157:H7 resistance and virulence phenotypes. PMID:22135463
Audit, Benjamin; Zaghloul, Lamia; Vaillant, Cédric; Chevereau, Guillaume; d'Aubenton-Carafa, Yves; Thermes, Claude; Arneodo, Alain
2009-01-01
For years, progress in elucidating the mechanisms underlying replication initiation and its coupling to transcriptional activities and to local chromatin structure has been hampered by the small number (approximately 30) of well-established origins in the human genome and more generally in mammalian genomes. Recent in silico studies of compositional strand asymmetries revealed a high level of organization of human genes around 1000 putative replication origins. Here, by comparing with recently experimentally identified replication origins, we provide further support that these putative origins are active in vivo. We show that regions ∼300-kb wide surrounding most of these putative replication origins that replicate early in the S phase are hypersensitive to DNase I cleavage, hypomethylated and present a significant enrichment in genomic energy barriers that impair nucleosome formation (nucleosome-free regions). This suggests that these putative replication origins are specified by an open chromatin structure favored by the DNA sequence. We discuss how this distinctive attribute makes these origins, further qualified as ‘master’ replication origins, priviledged loci for future research to decipher the human spatio-temporal replication program. Finally, we argue that these ‘master’ origins are likely to play a key role in genome dynamics during evolution and in pathological situations. PMID:19671527
A Genome-Wide Landscape of Retrocopies in Primate Genomes.
Navarro, Fábio C P; Galante, Pedro A F
2015-07-29
Gene duplication is a key factor contributing to phenotype diversity across and within species. Although the availability of complete genomes has led to the extensive study of genomic duplications, the dynamics and variability of gene duplications mediated by retrotransposition are not well understood. Here, we predict mRNA retrotransposition and use comparative genomics to investigate their origin and variability across primates. Analyzing seven anthropoid primate genomes, we found a similar number of mRNA retrotranspositions (∼7,500 retrocopies) in Catarrhini (Old Word Monkeys, including humans), but a surprising large number of retrocopies (∼10,000) in Platyrrhini (New World Monkeys), which may be a by-product of higher long interspersed nuclear element 1 activity in these genomes. By inferring retrocopy orthology, we dated most of the primate retrocopy origins, and estimated a decrease in the fixation rate in recent primate history, implying a smaller number of species-specific retrocopies. Moreover, using RNA-Seq data, we identified approximately 3,600 expressed retrocopies. As expected, most of these retrocopies are located near or within known genes, present tissue-specific and even species-specific expression patterns, and no expression correlation to their parental genes. Taken together, our results provide further evidence that mRNA retrotransposition is an active mechanism in primate evolution and suggest that retrocopies may not only introduce great genetic variability between lineages but also create a large reservoir of potentially functional new genomic loci in primate genomes. © The Author(s) 2015. Published by Oxford University Press on behalf of the Society for Molecular Biology and Evolution.
The evolution of vertebrate Toll-like receptors
Roach, J.C.; Glusman, G.; Rowen, L.; Kaur, A.; Purcell, M.K.; Smith, K.D.; Hood, L.E.; Aderem, A.
2005-01-01
The complete sequences of Takifugu Toll-like receptor (TLR) loci and gene predictions from many draft genomes enable comprehensive molecular phylogenetic analysis. Strong selective pressure for recognition of and response to pathogen-associated molecular patterns has maintained a largely unchanging TLR recognition in all vertebrates. There are six major families of vertebrate TLRs. This repertoire is distinct from that of invertebrates. TLRs within a family recognize a general class of pathogen-associated molecular patterns. Most vertebrates have exactly one gene ortholog for each TLR family. The family including TLR1 has more species-specific adaptations than other families. A major family including TLR11 is represented in humans only by a pseudogene. Coincidental evolution plays a minor role in TLR evolution. The sequencing phase of this study produced finished genomic sequences for the 12 Takifugu rubripes TLRs. In addition, we have produced > 70 gene models, including sequences from the opossum, chicken, frog, dog, sea urchin, and sea squirt. ?? 2005 by The National Academy of Sciences of the USA.
Gao, Shibo; Anderson, Tavis K; Walia, Rasna R; Dorman, Karin S; Janas-Martindale, Alicia; Vincent, Amy L
2017-08-01
Transmission of influenza A virus (IAV) from humans to swine occurs with relative frequency and is a critical contributor to swine IAV diversity. Subsequent to the introduction of these human seasonal lineages, there is often reassortment with endemic viruses and antigenic drift. To address whether particular genome constellations contributed to viral persistence following the introduction of the 2009 H1N1 human pandemic virus to swine in the USA, we collated and analysed 616 whole genomes of swine H1 isolates. For each gene, sequences were aligned, the best-known maximum likelihood phylogeny was inferred, and each virus was assigned a clade based upon its evolutionary history. A time-scaled Bayesian approach was implemented for the haemagglutinin (HA) gene to determine the patterns of genetic diversity over time. From these analyses, we observed an increase in genome diversity across all H1 lineages and clades, with the H1-γ and H1-δ1 genetic clades containing the greatest number of unique genome patterns. We documented 74 genome patterns from 2009 to 2016, of which 3 genome patterns were consistently detected at a significantly higher level than others across the entire time period. Eight genome patterns increased significantly, while five genome patterns were shown to decline in detection over time. Viruses with genome patterns identified as persisting in the US swine population may possess a greater capacity to infect and transmit in swine. This study highlights the emerging genetic diversity of US swine IAV from 2009 to 2016, with implications for swine and public health and vaccine control efforts.
Gao, Shibo; Anderson, Tavis K.; Walia, Rasna R.; Dorman, Karin S.; Janas-Martindale, Alicia
2017-01-01
Transmission of influenza A virus (IAV) from humans to swine occurs with relative frequency and is a critical contributor to swine IAV diversity. Subsequent to the introduction of these human seasonal lineages, there is often reassortment with endemic viruses and antigenic drift. To address whether particular genome constellations contributed to viral persistence following the introduction of the 2009 H1N1 human pandemic virus to swine in the USA, we collated and analysed 616 whole genomes of swine H1 isolates. For each gene, sequences were aligned, the best-known maximum likelihood phylogeny was inferred, and each virus was assigned a clade based upon its evolutionary history. A time-scaled Bayesian approach was implemented for the haemagglutinin (HA) gene to determine the patterns of genetic diversity over time. From these analyses, we observed an increase in genome diversity across all H1 lineages and clades, with the H1-γ and H1-δ1 genetic clades containing the greatest number of unique genome patterns. We documented 74 genome patterns from 2009 to 2016, of which 3 genome patterns were consistently detected at a significantly higher level than others across the entire time period. Eight genome patterns increased significantly, while five genome patterns were shown to decline in detection over time. Viruses with genome patterns identified as persisting in the US swine population may possess a greater capacity to infect and transmit in swine. This study highlights the emerging genetic diversity of US swine IAV from 2009 to 2016, with implications for swine and public health and vaccine control efforts. PMID:28758634
Genome Dynamics in Legionella: The Basis of Versatility and Adaptation to Intracellular Replication
Gomez-Valero, Laura; Buchrieser, Carmen
2013-01-01
Legionella pneumophila is a bacterial pathogen present in aquatic environments that can cause a severe pneumonia called Legionnaires’ disease. Soon after its recognition, it was shown that Legionella replicates inside amoeba, suggesting that bacteria replicating in environmental protozoa are able to exploit conserved signaling pathways in human phagocytic cells. Comparative, evolutionary, and functional genomics suggests that the Legionella–amoeba interaction has shaped this pathogen more than previously thought. A complex evolutionary scenario involving mobile genetic elements, type IV secretion systems, and horizontal gene transfer among Legionella, amoeba, and other organisms seems to take place. This long-lasting coevolution led to the development of very sophisticated virulence strategies and a high level of temporal and spatial fine-tuning of bacteria host–cell interactions. We will discuss current knowledge of the evolution of virulence of Legionella from a genomics perspective and propose our vision of the emergence of this human pathogen from the environment. PMID:23732852
Systematic characterization of deubiquitylating enzymes for roles in maintaining genome integrity.
Nishi, Ryotaro; Wijnhoven, Paul; le Sage, Carlos; Tjeertes, Jorrit; Galanty, Yaron; Forment, Josep V; Clague, Michael J; Urbé, Sylvie; Jackson, Stephen P
2014-10-01
DNA double-strand breaks (DSBs) are perhaps the most toxic of all DNA lesions, with defects in the DNA-damage response to DSBs being associated with various human diseases. Although it is known that DSB repair pathways are tightly regulated by ubiquitylation, we do not yet have a comprehensive understanding of how deubiquitylating enzymes (DUBs) function in DSB responses. Here, by carrying out a multidimensional screening strategy for human DUBs, we identify several with hitherto unknown links to DSB repair, the G2/M DNA-damage checkpoint and genome-integrity maintenance. Phylogenetic analyses reveal functional clustering within certain DUB subgroups, suggesting evolutionally conserved functions and/or related modes of action. Furthermore, we establish that the DUB UCHL5 regulates DSB resection and repair by homologous recombination through protecting its interactor, NFRKB, from degradation. Collectively, our findings extend the list of DUBs promoting the maintenance of genome integrity, and highlight their potential as therapeutic targets for cancer.
Huang, Yi-Fei; Gulko, Brad; Siepel, Adam
2017-04-01
Many genetic variants that influence phenotypes of interest are located outside of protein-coding genes, yet existing methods for identifying such variants have poor predictive power. Here we introduce a new computational method, called LINSIGHT, that substantially improves the prediction of noncoding nucleotide sites at which mutations are likely to have deleterious fitness consequences, and which, therefore, are likely to be phenotypically important. LINSIGHT combines a generalized linear model for functional genomic data with a probabilistic model of molecular evolution. The method is fast and highly scalable, enabling it to exploit the 'big data' available in modern genomics. We show that LINSIGHT outperforms the best available methods in identifying human noncoding variants associated with inherited diseases. In addition, we apply LINSIGHT to an atlas of human enhancers and show that the fitness consequences at enhancers depend on cell type, tissue specificity, and constraints at associated promoters.
Mutants of Cre recombinase with improved accuracy
Eroshenko, Nikolai; Church, George M.
2013-01-01
Despite rapid advances in genome engineering technologies, inserting genes into precise locations in the human genome remains an outstanding problem. It has been suggested that site-specific recombinases can be adapted towards use as transgene delivery vectors. The specificity of recombinases can be altered either with directed evolution or via fusions to modular DNA-binding domains. Unfortunately, both wildtype and altered variants often have detectable activities at off-target sites. Here we use bacterial selections to identify mutations in the dimerization surface of Cre recombinase (R32V, R32M, and 303GVSdup) that improve the accuracy of recombination. The mutants are functional in bacteria, in human cells, and in vitro (except for 303GVSdup, which we did not purify), and have improved selectivity against both model off-target sites and the entire E. coli genome. We propose that destabilizing binding cooperativity may be a general strategy for improving the accuracy of dimeric DNA-binding proteins. PMID:24056590
Genome dynamics in Legionella: the basis of versatility and adaptation to intracellular replication.
Gomez-Valero, Laura; Buchrieser, Carmen
2013-06-01
Legionella pneumophila is a bacterial pathogen present in aquatic environments that can cause a severe pneumonia called Legionnaires' disease. Soon after its recognition, it was shown that Legionella replicates inside amoeba, suggesting that bacteria replicating in environmental protozoa are able to exploit conserved signaling pathways in human phagocytic cells. Comparative, evolutionary, and functional genomics suggests that the Legionella-amoeba interaction has shaped this pathogen more than previously thought. A complex evolutionary scenario involving mobile genetic elements, type IV secretion systems, and horizontal gene transfer among Legionella, amoeba, and other organisms seems to take place. This long-lasting coevolution led to the development of very sophisticated virulence strategies and a high level of temporal and spatial fine-tuning of bacteria host-cell interactions. We will discuss current knowledge of the evolution of virulence of Legionella from a genomics perspective and propose our vision of the emergence of this human pathogen from the environment.
Porse, Andreas; Gumpert, Heidi; Kubicek-Sutherland, Jessica Z; Karami, Nahid; Adlerberth, Ingegerd; Wold, Agnes E; Andersson, Dan I; Sommer, Morten O A
2017-01-01
Elucidating the adaptive strategies and plasticity of bacterial genomes in situ is crucial for understanding the epidemiology and evolution of pathogens threatening human health. While much is known about the evolution of Escherichia coli in controlled laboratory environments, less effort has been made to elucidate the genome dynamics of E. coli in its native settings. Here, we follow the genome dynamics of co-existing E. coli lineages in situ of the infant gut during the first year of life. One E. coli lineage causes a urinary tract infection (UTI) and experiences several alterations of its genomic content during subsequent antibiotic treatment. Interestingly, all isolates of this uropathogenic E. coli strain carried a highly stable plasmid implicated in virulence of diverse pathogenic strains from all over the world. While virulence elements are certainly beneficial during infection scenarios, their role in gut colonization and pathogen persistence is poorly understood. We performed in vivo competitive fitness experiments to assess the role of this highly disseminated virulence plasmid in gut colonization, but found no evidence for a direct benefit of plasmid carriage. Through plasmid stability assays, we demonstrate that this plasmid is maintained in a parasitic manner, by strong first-line inheritance mechanisms, acting on the single-cell level, rather than providing a direct survival advantage in the gut. Investigating the ecology of endemic accessory genetic elements, in their pathogenic hosts and native environment, is of vital importance if we want to understand the evolution and persistence of highly virulent and drug resistant bacterial isolates.
Emergence and Evolution of Hominidae-Specific Coding and Noncoding Genomic Sequences.
Saber, Morteza Mahmoudi; Adeyemi Babarinde, Isaac; Hettiarachchi, Nilmini; Saitou, Naruya
2016-07-12
Family Hominidae, which includes humans and great apes, is recognized for unique complex social behavior and intellectual abilities. Despite the increasing genome data, however, the genomic origin of its phenotypic uniqueness has remained elusive. Clade-specific genes and highly conserved noncoding sequences (HCNSs) are among the high-potential evolutionary candidates involved in driving clade-specific characters and phenotypes. On this premise, we analyzed whole genome sequences along with gene orthology data retrieved from major DNA databases to find Hominidae-specific (HS) genes and HCNSs. We discovered that Down syndrome critical region 4 (DSCR4) is the only experimentally verified gene uniquely present in Hominidae. DSCR4 has no structural homology to any known protein and was inferred to have emerged in several steps through LTR/ERV1, LTR/ERVL retrotransposition, and transversion. Using the genomic distance as neutral evolution threshold, we identified 1,658 HS HCNSs. Polymorphism coverage and derived allele frequency analysis of HS HCNSs showed that these HCNSs are under purifying selection, indicating that they may harbor important functions. They are overrepresented in promoters/untranslated regions, in close proximity of genes involved in sensory perception of sound and developmental process, and also showed a significantly lower nucleosome occupancy probability. Interestingly, many ancestral sequences of the HS HCNSs showed very high evolutionary rates. This suggests that new functions emerged through some kind of positive selection, and then purifying selection started to operate to keep these functions. © The Author(s) 2016. Published by Oxford University Press on behalf of the Society for Molecular Biology and Evolution.
Evolution and genome architecture in fungal plant pathogens.
Möller, Mareike; Stukenbrock, Eva H
2017-12-01
The fungal kingdom comprises some of the most devastating plant pathogens. Sequencing the genomes of fungal pathogens has shown a remarkable variability in genome size and architecture. Population genomic data enable us to understand the mechanisms and the history of changes in genome size and adaptive evolution in plant pathogens. Although transposable elements predominantly have negative effects on their host, fungal pathogens provide prominent examples of advantageous associations between rapidly evolving transposable elements and virulence genes that cause variation in virulence phenotypes. By providing homogeneous environments at large regional scales, managed ecosystems, such as modern agriculture, can be conducive for the rapid evolution and dispersal of pathogens. In this Review, we summarize key examples from fungal plant pathogen genomics and discuss evolutionary processes in pathogenic fungi in the context of molecular evolution, population genomics and agriculture.
de Been, Mark; van Schaik, Willem; Cheng, Lu; Corander, Jukka; Willems, Rob J.
2013-01-01
Reasons for the rising clinical impact of the bacterium Enterococcus faecium include the species’ rapid acquisition of adaptive genetic elements. Here, we focused on the impact of recombination on the evolution of E. faecium. We used the recently developed BratNextGen algorithm to detect recombinant regions in the core genome of 34 E. faecium strains, including three newly sequenced clinical strains. Recombination was found to have a significant impact on the E. faecium genome: of the original 1.2 million positions in the core genome, 0.5 million were predicted to have been affected by recombination in at least one strain. Importantly, strains in one of the two major E. faecium clades (clade B), which contains most of the E. faecium human gut commensals, formed the most important reservoir for donating foreign DNA to the second major E. faecium clade (clade A), which contains most of the clinical isolates. Also, several genomic regions were found to mainly recombine in specific hospital-associated E. faecium strains. One of these regions (the epa-like locus) likely encodes the biosynthesis of cell wall polysaccharides. These findings suggest a crucial role for recombination in the emergence of E. faecium as a successful hospital-associated pathogen. PMID:23882129
The origin of the variola virus.
Babkin, Igor V; Babkina, Irina N
2015-03-10
The question of the origin of smallpox, one of the major menaces to humankind, is a constant concern for the scientific community. Smallpox is caused by the agent referred to as the variola virus (VARV), which belongs to the genus Orthopoxvirus. In the last century, smallpox was declared eradicated from the human community; however, the mechanisms responsible for the emergence of new dangerous pathogens have yet to be unraveled. Evolutionary analyses of the molecular biological genomic data of various orthopoxviruses, involving a wide range of epidemiological and historical information about smallpox, have made it possible to date the emergence of VARV. Comparisons of the VARV genome to the genomes of the most closely related orthopoxviruses and the examination of the distribution their natural hosts' ranges suggest that VARV emerged 3000 to 4000 years ago in the east of the African continent. The VARV evolution rate has been estimated to be approximately 2 × 10-6 substitutions/site/year for the central conserved genomic region and 4 × 10-6 substitutions/site/year for the synonymous substitutions in the genome. Presumably, the introduction of camels to Africa and the concurrent changes to the climate were the particular factors that triggered the divergent evolution of a cowpox-like ancestral virus and thereby led to the emergence of VARV.
Paixão-Côrtes, Vanessa Rodrigues; Viscardi, Lucas Henrique; Salzano, Francisco Mauro; Hünemeier, Tábita; Bortolini, Maria Cátira
2012-12-01
After a brief review of the most recent findings in the study of human evolution, an extensive comparison of the complete genomes of our nearest relative, the chimpanzee (Pan troglodytes), of extant Homo sapiens, archaic Homo neanderthalensis and the Denisova specimen were made. The focus was on non-synonymous mutations, which consequently had an impact on protein levels and these changes were classified according to degree of effect. A total of 10,447 non-synonymous substitutions were found in which the derived allele is fixed or nearly fixed in humans as compared to chimpanzee. Their most frequent location was on chromosome 21. Their presence was then searched in the two archaic genomes. Mutations in 381 genes would imply radical amino acid changes, with a fraction of these related to olfaction and other important physiological processes. Eight new alleles were identified in the Neanderthal and/or Denisova genetic pools. Four others, possibly affecting cognition, occured both in the sapiens and two other archaic genomes. The selective sweep that gave rise to Homo sapiens could, therefore, have initiated before the modern/archaic human divergence.
Chromosomal transfers in mycoplasmas: when minimal genomes go mobile.
Dordet-Frisoni, Emilie; Sagné, Eveline; Baranowski, Eric; Breton, Marc; Nouvel, Laurent Xavier; Blanchard, Alain; Marenda, Marc Serge; Tardy, Florence; Sirand-Pugnet, Pascal; Citti, Christine
2014-11-25
Horizontal gene transfer (HGT) is a main driving force of bacterial evolution and innovation. This phenomenon was long thought to be marginal in mycoplasmas, a large group of self-replicating bacteria characterized by minute genomes as a result of successive gene losses during evolution. Recent comparative genomic analyses challenged this paradigm, but the occurrence of chromosomal exchanges had never been formally addressed in mycoplasmas. Here, we demonstrated the conjugal transfer of large chromosomal regions within and among ruminant mycoplasma species, with the incorporation of the incoming DNA occurring by homologous recombination into the recipient chromosome. By combining classical mating experiments with high-throughput next-generation sequencing, we documented the transfer of almost every position of the mycoplasma chromosome. Mycoplasma conjugation relies on the occurrence of an integrative conjugative element (ICE) in at least one parent cell. While ICE propagates horizontally from ICE-positive to ICE-negative cells, chromosomal transfers (CTs) occurred in the opposite direction, from ICE-negative to ICE-positive cells, independently of ICE movement. These findings challenged the classical mechanisms proposed for other bacteria in which conjugative CTs are driven by conjugative elements, bringing into the spotlight a new means for rapid mycoplasma innovation. Overall, they radically change our current views concerning the evolution of mycoplasmas, with particularly far-reaching implications given that over 50 species are human or animal pathogens. Horizontal gene transfers (HGT) shape bacterial genomes and are key contributors to microbial diversity and innovation. One main mechanism involves conjugation, a process that allows the simultaneous transfer of significant amounts of DNA upon cell-to-cell contact. Recognizing and deciphering conjugal mechanisms are thus essential in understanding the impact of gene flux on bacterial evolution. We addressed this issue in mycoplasmas, the smallest and simplest self-replicating bacteria. In these organisms, HGT was long thought to be marginal. We showed here that nearly every position of the Mycoplasma agalactiae chromosome could be transferred via conjugation, using an unconventional mechanism. The transfer involved DNA blocks containing up to 80 genes that were incorporated into the host chromosome by homologous recombination. These findings radically change our views concerning mycoplasma evolution and adaptation with particularly far-reaching implications given that over 50 species are human or animal pathogens. Copyright © 2014 Dordet-Frisoni et al.
(Meta)genomic insights into the pathogenome of Cellulosimicrobium cellulans
Sharma, Anukriti; Gilbert, Jack A.; Lal, Rup
2016-05-06
Despite having serious clinical manifestations, Cellulosimicrobium cellulans remain under-reported with only three genome sequences available at the time of writing. Genome sequences of C. cellulans LMG16121, C. cellulans J36 and Cellulosimicrobium sp. strain MM were used to determine distribution of pathogenicity islands (PAIs) across C. cellulans, which revealed 49 potential marker genes with known association to human infections, e.g. Fic and VbhA toxin-antitoxin system. Oligonucleotide composition-based analysis of orthologous proteins (n = 791) across three genomes revealed significant negative correlation (P < 0.05) between frequency of optimal codons ( Fopt) and gene G+C content, highlighting the G+C-biased gene conversion (gBGC)more » effect across Cellulosimicrobium strains. Bayesian molecular-clock analysis performed on three virulent PAI proteins (Fic; D-alanyl-D-alanine-carboxypeptidase; transposase) dated the divergence event at 300 million years ago from the most common recent ancestor. Synteny-based annotation of hypothetical proteins highlighted gene transfers from non-pathogenic bacteria as a key factor in the evolution of PAIs. Additonally, deciphering the metagenomic islands using strain MM's genome with environmental data from the site of isolation (hot-spring biofilm) revealed (an)aerobic respiration as population segregation factor across the in situ cohorts. Furthermore, using reference genomes and metagenomic data, our results highlight the emergence and evolution of PAIs in the genus Cellulosimicrobium.« less
Evolution of biological complexity
Adami, Christoph; Ofria, Charles; Collier, Travis C.
2000-01-01
To make a case for or against a trend in the evolution of complexity in biological evolution, complexity needs to be both rigorously defined and measurable. A recent information-theoretic (but intuitively evident) definition identifies genomic complexity with the amount of information a sequence stores about its environment. We investigate the evolution of genomic complexity in populations of digital organisms and monitor in detail the evolutionary transitions that increase complexity. We show that, because natural selection forces genomes to behave as a natural “Maxwell Demon,” within a fixed environment, genomic complexity is forced to increase. PMID:10781045
Allaby, Robin G; Kistler, Logan; Gutaker, Rafal M; Ware, Roselyn; Kitchen, James L; Smith, Oliver; Clarke, Andrew C
2015-02-01
The colonization of the human environment by plants, and the consequent evolution of domesticated forms is increasingly being viewed as a co-evolutionary plant-human process that occurred over a long time period, with evidence for the co-evolutionary relationship between plants and humans reaching ever deeper into the hominin past. This developing view is characterized by a change in emphasis on the drivers of evolution in the case of plants. Rather than individual species being passive recipients of artificial selection pressures and ultimately becoming domesticates, entire plant communities adapted to the human environment. This evolutionary scenario leads to systems level genetic expectations from models that can be explored through ancient DNA and Next Generation Sequencing approaches. Emerging evidence suggests that domesticated genomes fit well with these expectations, with periods of stable complex evolution characterized by large amounts of change associated with relatively small selective value, punctuated by periods in which changes in one-half of the plant-hominin relationship cause rapid, low-complexity adaptation in the other. A corollary of a single plant-hominin co-evolutionary process is that clues about the initiation of the domestication process may well lie deep within the hominin lineage. Copyright © 2014 The Authors. Published by Elsevier Ltd.. All rights reserved.
Genomic comparison of closely related Giant Viruses supports an accordion-like model of evolution.
Filée, Jonathan
2015-01-01
Genome gigantism occurs so far in Phycodnaviridae and Mimiviridae (order Megavirales). Origin and evolution of these Giant Viruses (GVs) remain open questions. Interestingly, availability of a collection of closely related GV genomes enabling genomic comparisons offer the opportunity to better understand the different evolutionary forces acting on these genomes. Whole genome alignment for five groups of viruses belonging to the Mimiviridae and Phycodnaviridae families show that there is no trend of genome expansion or general tendency of genome contraction. Instead, GV genomes accumulated genomic mutations over the time with gene gains compensating the different losses. In addition, each lineage displays specific patterns of genome evolution. Mimiviridae (megaviruses and mimiviruses) and Chlorella Phycodnaviruses evolved mainly by duplications and losses of genes belonging to large paralogous families (including movements of diverse mobiles genetic elements), whereas Micromonas and Ostreococcus Phycodnaviruses derive most of their genetic novelties thought lateral gene transfers. Taken together, these data support an accordion-like model of evolution in which GV genomes have undergone successive steps of gene gain and gene loss, accrediting the hypothesis that genome gigantism appears early, before the diversification of the different GV lineages.
Evolution and Diversity of Transposable Elements in Vertebrate Genomes.
Sotero-Caio, Cibele G; Platt, Roy N; Suh, Alexander; Ray, David A
2017-01-01
Transposable elements (TEs) are selfish genetic elements that mobilize in genomes via transposition or retrotransposition and often make up large fractions of vertebrate genomes. Here, we review the current understanding of vertebrate TE diversity and evolution in the context of recent advances in genome sequencing and assembly techniques. TEs make up 4-60% of assembled vertebrate genomes, and deeply branching lineages such as ray-finned fishes and amphibians generally exhibit a higher TE diversity than the more recent radiations of birds and mammals. Furthermore, the list of taxa with exceptional TE landscapes is growing. We emphasize that the current bottleneck in genome analyses lies in the proper annotation of TEs and provide examples where superficial analyses led to misleading conclusions about genome evolution. Finally, recent advances in long-read sequencing will soon permit access to TE-rich genomic regions that previously resisted assembly including the gigantic, TE-rich genomes of salamanders and lungfishes. © The Author(s) 2017. Published by Oxford University Press on behalf of the Society for Molecular Biology and Evolution.
Y-chromosome evolution: emerging insights into processes of Y-chromosome degeneration.
Bachtrog, Doris
2013-02-01
The human Y chromosome is intriguing not only because it harbours the master-switch gene that determines gender but also because of its unusual evolutionary history. The Y chromosome evolved from an autosome, and its evolution has been characterized by massive gene decay. Recent whole-genome and transcriptome analyses of Y chromosomes in humans and other primates, in Drosophila species and in plants have shed light on the current gene content of the Y chromosome, its origins and its long-term fate. Furthermore, comparative analysis of young and old Y chromosomes has given further insights into the evolutionary and molecular forces triggering Y-chromosome degeneration and into the evolutionary destiny of the Y chromosome.
Neocentromeres: role in human disease, evolution, and centromere study.
Amor, David J; Choo, K H Andy
2002-10-01
The centromere is essential for the proper segregation and inheritance of genetic information. Neocentromeres are ectopic centromeres that originate occasionally from noncentromeric regions of chromosomes. Despite the complete absence of normal centromeric alpha-satellite DNA, human neocentromeres are able to form a primary constriction and assemble a functional kinetochore. Since the discovery and characterization of the first case of a human neocentromere in our laboratory a decade ago, 60 examples of constitutional human neocentromeres distributed widely across the genome have been described. Typically, these are located on marker chromosomes that have been detected in children with developmental delay or congenital abnormalities. Neocentromeres have also been detected in at least two types of human cancer and have been experimentally induced in Drosophila. Current evidence from human and fly studies indicates that neocentromere activity is acquired epigenetically rather than by any alteration to the DNA sequence. Since human neocentromere formation is generally detrimental to the individual, its biological value must lie beyond the individual level, such as in karyotype evolution and speciation.
Evolution Analysis of Simple Sequence Repeats in Plant Genome.
Qin, Zhen; Wang, Yanping; Wang, Qingmei; Li, Aixian; Hou, Fuyun; Zhang, Liming
2015-01-01
Simple sequence repeats (SSRs) are widespread units on genome sequences, and play many important roles in plants. In order to reveal the evolution of plant genomes, we investigated the evolutionary regularities of SSRs during the evolution of plant species and the plant kingdom by analysis of twelve sequenced plant genome sequences. First, in the twelve studied plant genomes, the main SSRs were those which contain repeats of 1-3 nucleotides combination. Second, in mononucleotide SSRs, the A/T percentage gradually increased along with the evolution of plants (except for P. patens). With the increase of SSRs repeat number the percentage of A/T in C. reinhardtii had no significant change, while the percentage of A/T in terrestrial plants species gradually declined. Third, in dinucleotide SSRs, the percentage of AT/TA increased along with the evolution of plant kingdom and the repeat number increased in terrestrial plants species. This trend was more obvious in dicotyledon than monocotyledon. The percentage of CG/GC showed the opposite pattern to the AT/TA. Forth, in trinucleotide SSRs, the percentages of combinations including two or three A/T were in a rising trend along with the evolution of plant kingdom; meanwhile with the increase of SSRs repeat number in plants species, different species chose different combinations as dominant SSRs. SSRs in C. reinhardtii, P. patens, Z. mays and A. thaliana showed their specific patterns related to evolutionary position or specific changes of genome sequences. The results showed that, SSRs not only had the general pattern in the evolution of plant kingdom, but also were associated with the evolution of the specific genome sequence. The study of the evolutionary regularities of SSRs provided new insights for the analysis of the plant genome evolution.
Glinsky, Gennadi V
2018-03-01
Transposable elements have made major evolutionary impacts on creation of primate-specific and human-specific genomic regulatory loci and species-specific genomic regulatory networks (GRNs). Molecular and genetic definitions of human-specific changes to GRNs contributing to development of unique to human phenotypes remain a highly significant challenge. Genome-wide proximity placement analysis of diverse families of human-specific genomic regulatory loci (HSGRL) identified topologically associating domains (TADs) that are significantly enriched for HSGRL and designated rapidly evolving in human TADs. Here, the analysis of HSGRL, hESC-enriched enhancers, super-enhancers (SEs), and specific sub-TAD structures termed super-enhancer domains (SEDs) has been performed. In the hESC genome, 331 of 504 (66%) of SED-harboring TADs contain HSGRL and 68% of SEDs co-localize with HSGRL, suggesting that emergence of HSGRL may have rewired SED-associated GRNs within specific TADs by inserting novel and/or erasing existing non-coding regulatory sequences. Consequently, markedly distinct features of the principal regulatory structures of interphase chromatin evolved in the hESC genome compared to mouse: the SED quantity is 3-fold higher and the median SED size is significantly larger. Concomitantly, the overall TAD quantity is increased by 42% while the median TAD size is significantly decreased (p = 9.11E-37) in the hESC genome. Present analyses illustrate a putative global role for transposable elements and HSGRL in shaping the human-specific features of the interphase chromatin organization and functions, which are facilitated by accelerated creation of novel transcription factor binding sites and new enhancers driven by targeted placement of HSGRL at defined genomic coordinates. A trend toward the convergence of TAD and SED architectures of interphase chromatin in the hESC genome may reflect changes of 3D-folding patterns of linear chromatin fibers designed to enhance both regulatory complexity and functional precision of GRNs by creating predominantly a single gene (or a set of functionally linked genes) per regulatory domain structures. Collectively, present analyses reveal critical evolutionary contributions of transposable elements and distal enhancers to creation of thousands primate- and human-specific elements of a chromatin folding code, which defines the 3D context of interphase chromatin both restricting and facilitating biological functions of GRNs.
[Understanding mitochondrial genome fragmentation in parasitic lice (Insecta: Phthiraptera)].
Dong, Wen-Ge; Guo, Xian-Guo; Jin, Dao-Chao; Xue, Shi-Peng; Qin, Feng; Simon, Song; Stephen, C Barker; Renfu, Shao
2013-07-01
Lice are obligate ectoparasites of mammals and birds. Extensive fragmentation of mitochondrial genomes has been found in some louse species in the families Pediculidae, Pthiridae, Philopteridae and Trichodectidae. For example, the mt genomes of human body louse (Pediculus humanus), head louse (Pediculus capitis), and public louse (Pthirus pubis) have 20, 20 and 14 mini-chromosomes, respectively. These mini-chromosomes might be the results of deletion and recombination of mt genes. The factors and mechanisms of mitochondrial genome fragmentation are currently unknown. The fragmentation might be the results of evolutionary selection or random genetic drift or it is probably related to the lack of mtSSB (mitochondrial single-strand DNA binding protein). Understanding the fragmentation of mitochondrial genomes is of significance for understanding the origin and evolution of mitochondria. This paper reviews the recent advances in the studies of mito-chondrial genome fragmentation in lice, including the phenomena of mitochondrial genome fragmentation, characteristics of fragmented mitochondrial genomes, and some factors and mechanisms possibly leading to the mitochondrial genome fragmentation of lice. Perspectives for future studies on fragmented mt genomes are also discussed.
Pan, Deng; Zhang, Liqing
2007-01-01
Background The rate of gene duplication is an important parameter in the study of evolution, but the influence of gene conversion and technical problems have confounded previous attempts to provide a satisfying estimate. We propose a new strategy to estimate the rate that involves separate quantification of the rates of two different mechanisms of gene duplication and subsequent combination of the two rates, based on their respective contributions to the overall gene duplication rate. Results Previous estimates of gene duplication rates are based on small gene families. Therefore, to assess the applicability of this to families of all sizes, we looked at both two-copy gene families and the entire genome. We studied unequal crossover and retrotransposition, and found that these mechanisms of gene duplication are largely independent and account for a substantial amount of duplicated genes. Unequal crossover contributed more to duplications in the entire genome than retrotransposition did, but this contribution was significantly less in two-copy gene families, and duplicated genes arising from this mechanism are more likely to be retained. Combining rates of duplication using the two mechanisms, we estimated the overall rates to be from approximately 0.515 to 1.49 × 10-3 per gene per million years in human, and from approximately 1.23 to 4.23 × 10-3 in mouse. The rates estimated from two-copy gene families are always lower than those from the entire genome, and so it is not appropriate to use small families to estimate the rate for the entire genome. Conclusion We present a novel strategy for estimating gene duplication rates. Our results show that different mechanisms contribute differently to the evolution of small and large gene families. PMID:17683522
Lineage-Specific Biology Revealed by a Finished Genome Assembly of the Mouse
Hillier, LaDeana W.; Zody, Michael C.; Goldstein, Steve; She, Xinwe; Bult, Carol J.; Agarwala, Richa; Cherry, Joshua L.; DiCuccio, Michael; Hlavina, Wratko; Kapustin, Yuri; Meric, Peter; Maglott, Donna; Birtle, Zoë; Marques, Ana C.; Graves, Tina; Zhou, Shiguo; Teague, Brian; Potamousis, Konstantinos; Churas, Christopher; Place, Michael; Herschleb, Jill; Runnheim, Ron; Forrest, Daniel; Amos-Landgraf, James; Schwartz, David C.; Cheng, Ze; Lindblad-Toh, Kerstin; Eichler, Evan E.; Ponting, Chris P.
2009-01-01
The mouse (Mus musculus) is the premier animal model for understanding human disease and development. Here we show that a comprehensive understanding of mouse biology is only possible with the availability of a finished, high-quality genome assembly. The finished clone-based assembly of the mouse strain C57BL/6J reported here has over 175,000 fewer gaps and over 139 Mb more of novel sequence, compared with the earlier MGSCv3 draft genome assembly. In a comprehensive analysis of this revised genome sequence, we are now able to define 20,210 protein-coding genes, over a thousand more than predicted in the human genome (19,042 genes). In addition, we identified 439 long, non–protein-coding RNAs with evidence for transcribed orthologs in human. We analyzed the complex and repetitive landscape of 267 Mb of sequence that was missing or misassembled in the previously published assembly, and we provide insights into the reasons for its resistance to sequencing and assembly by whole-genome shotgun approaches. Duplicated regions within newly assembled sequence tend to be of more recent ancestry than duplicates in the published draft, correcting our initial understanding of recent evolution on the mouse lineage. These duplicates appear to be largely composed of sequence regions containing transposable elements and duplicated protein-coding genes; of these, some may be fixed in the mouse population, but at least 40% of segmentally duplicated sequences are copy number variable even among laboratory mouse strains. Mouse lineage-specific regions contain 3,767 genes drawn mainly from rapidly-changing gene families associated with reproductive functions. The finished mouse genome assembly, therefore, greatly improves our understanding of rodent-specific biology and allows the delineation of ancestral biological functions that are shared with human from derived functions that are not. PMID:19468303
Quantification of GC-biased gene conversion in the human genome
Glémin, Sylvain; Arndt, Peter F.; Messer, Philipp W.; Petrov, Dmitri; Galtier, Nicolas; Duret, Laurent
2015-01-01
Much evidence indicates that GC-biased gene conversion (gBGC) has a major impact on the evolution of mammalian genomes. However, a detailed quantification of the process is still lacking. The strength of gBGC can be measured from the analysis of derived allele frequency spectra (DAF), but this approach is sensitive to a number of confounding factors. In particular, we show by simulations that the inference is pervasively affected by polymorphism polarization errors and by spatial heterogeneity in gBGC strength. We propose a new general method to quantify gBGC from DAF spectra, incorporating polarization errors, taking spatial heterogeneity into account, and jointly estimating mutation bias. Applying it to human polymorphism data from the 1000 Genomes Project, we show that the strength of gBGC does not differ between hypermutable CpG sites and non-CpG sites, suggesting that in humans gBGC is not caused by the base-excision repair machinery. Genome-wide, the intensity of gBGC is in the nearly neutral area. However, given that recombination occurs primarily within recombination hotspots, 1%–2% of the human genome is subject to strong gBGC. On average, gBGC is stronger in African than in non-African populations, reflecting differences in effective population sizes. However, due to more heterogeneous recombination landscapes, the fraction of the genome affected by strong gBGC is larger in non-African than in African populations. Given that the location of recombination hotspots evolves very rapidly, our analysis predicts that, in the long term, a large fraction of the genome is affected by short episodes of strong gBGC. PMID:25995268
The dynamic evolutionary history of genome size in North American woodland salamanders.
Newman, Catherine E; Gregory, T Ryan; Austin, Christopher C
2017-04-01
The genus Plethodon is the most species-rich salamander genus in North America, and nearly half of its species face an uncertain future. It is also one of the most diverse families in terms of genome sizes, which range from 1C = 18.2 to 69.3 pg, or 5-20 times larger than the human genome. Large genome size in salamanders results in part from accumulation of transposable elements and is associated with various developmental and physiological traits. However, genome sizes have been reported for only 25% of the species of Plethodon (14 of 55). We collected genome size data for Plethodon serratus to supplement an ongoing phylogeographic study, reconstructed the evolutionary history of genome size in Plethodontidae, and inferred probable genome sizes for the 41 species missing empirical data. Results revealed multiple genome size changes in Plethodon: genomes of western Plethodon increased, whereas genomes of eastern Plethodon decreased, followed by additional decreases or subsequent increases. The estimated genome size of P. serratus was 21 pg. New understanding of variation in genome size evolution, along with genome size inferences for previously unstudied taxa, provide a foundation for future studies on the biology of plethodontid salamanders.
The vacuolar protein sorting genes in insects: A comparative genome view.
Li, Zhaofei; Blissard, Gary
2015-07-01
In eukaryotic cells, regulated vesicular trafficking is critical for directing protein transport and for recycling and degradation of membrane lipids and proteins. Through carefully regulated transport vesicles, the endomembrane system performs a large and important array of dynamic cellular functions while maintaining the integrity of the cellular membrane system. Genetic studies in yeast Saccharomyces cerevisiae have identified approximately 50 vacuolar protein sorting (VPS) genes involved in vesicle trafficking, and most of these genes are also characterized in mammals. The VPS proteins form distinct functional complexes, which include complexes known as ESCRT, retromer, CORVET, HOPS, GARP, and PI3K-III. Little is known about the orthologs of VPS proteins in insects. Here, with the newly annotated Manduca sexta genome, we carried out genomic comparative analysis of VPS proteins in yeast, humans, and 13 sequenced insect genomes representing the Orders Hymenoptera, Diptera, Hemiptera, Phthiraptera, Lepidoptera, and Coleoptera. Amino acid sequence alignments and domain/motif structure analyses reveal that most of the components of ESCRT, retromer, CORVET, HOPS, GARP, and PI3K-III are evolutionarily conserved across yeast, insects, and humans. However, in contrast to the VPS gene expansions observed in the human genome, only four VPS genes (VPS13, VPS16, VPS33, and VPS37) were expanded in the six insect Orders. Additionally, VPS2 was expanded only in species from Phthiraptera, Lepidoptera, and Coleoptera. These studies provide a baseline for understanding the evolution of vesicular trafficking across yeast, insect, and human genomes, and also provide a basis for further addressing specific functional roles of VPS proteins in insects. Copyright © 2014 Elsevier Ltd. All rights reserved.
Primates, Lice and Bacteria: Speciation and Genome Evolution in the Symbionts of Hominid Lice
Allen, Julie M.; Nguyen, Nam-Phuong; Vachaspati, Pranjal; Quicksall, Zachary S.; Warnow, Tandy; Mugisha, Lawrence; Johnson, Kevin P.; Reed, David L.
2017-01-01
Abstract Insects with restricted diets rely on symbiotic bacteria to provide essential metabolites missing in their diet. The blood-sucking lice are obligate, host-specific parasites of mammals and are themselves host to symbiotic bacteria. In human lice, these bacterial symbionts supply the lice with B-vitamins. Here, we sequenced the genomes of symbiotic and heritable bacterial of human, chimpanzee, gorilla, and monkey lice and used phylogenomics to investigate their evolutionary relationships. We find that these symbionts have a phylogenetic history reflecting the louse phylogeny, a finding contrary to previous reports of symbiont replacement. Examination of the highly reduced symbiont genomes (0.53–0.57 Mb) reveals much of the genomes are dedicated to vitamin synthesis. This is unchanged in the smallest symbiont genome and one that appears to have been reorganized. Specifically, symbionts from human lice, chimpanzee lice, and gorilla lice carry a small plasmid that encodes synthesis of vitamin B5, a vitamin critical to the bacteria-louse symbiosis. This plasmid is absent in an old world monkey louse symbiont, where this pathway is on its primary chromosome. This suggests the unique genomic configuration brought about by the plasmid is not essential for symbiosis, but once obtained, it has persisted for up to 25 My. We also find evidence that human, chimpanzee, and gorilla louse endosymbionts have lost a pathway for synthesis of vitamin B1, whereas the monkey louse symbiont has retained this pathway. It is unclear whether these changes are adaptive, but they may point to evolutionary responses of louse symbionts to shifts in primate biology. PMID:28419279
Evolution and variation of the yeast (Saccharomyces) genome.
Mortimer, R K
2000-04-01
In this review we describe the role of the yeast Saccharomyces in the development of human societies including the use of this organism in the making of wine, bread, beer, and distilled beverages. We also discuss the tremendous diversity of yeast found in natural (i.e., noninoculated) wine fermentations and the scientific uses of yeast over the past 60 years. In conclusion, we present ideas on the model of "genome renewal" and the use of this model to explain the mode by which yeast has evolved and how diversity can be generated.
On the improbability of intelligent extraterrestrials
NASA Astrophysics Data System (ADS)
Bond, A.
1982-05-01
Discussions relating to the prevalence of extraterrestrial life generally remain ambiguous due to the lack of a suitable model for the development of biology. In this paper a simple model is proposed based on neutral evolution theory which leads to quantitative values for the genome growth rate within a biosphere. It is hypothesised that the genome size is a measure of organism complexity and hence an indicator of the likelihood of intelligence. The calculations suggest that organisms with the complexity of human beings may be rare and only occur with a probability below once per galaxy.
Ancient hepatitis B viruses from the Bronze Age to the Medieval period.
Mühlemann, Barbara; Jones, Terry C; Damgaard, Peter de Barros; Allentoft, Morten E; Shevnina, Irina; Logvin, Andrey; Usmanova, Emma; Panyushkina, Irina P; Boldgiv, Bazartseren; Bazartseren, Tsevel; Tashbaeva, Kadicha; Merz, Victor; Lau, Nina; Smrčka, Václav; Voyakin, Dmitry; Kitov, Egor; Epimakhov, Andrey; Pokutta, Dalia; Vicze, Magdolna; Price, T Douglas; Moiseyev, Vyacheslav; Hansen, Anders J; Orlando, Ludovic; Rasmussen, Simon; Sikora, Martin; Vinner, Lasse; Osterhaus, Albert D M E; Smith, Derek J; Glebe, Dieter; Fouchier, Ron A M; Drosten, Christian; Sjögren, Karl-Göran; Kristiansen, Kristian; Willerslev, Eske
2018-05-09
Hepatitis B virus (HBV) is a major cause of human hepatitis. There is considerable uncertainty about the timescale of its evolution and its association with humans. Here we present 12 full or partial ancient HBV genomes that are between approximately 0.8 and 4.5 thousand years old. The ancient sequences group either within or in a sister relationship with extant human or other ape HBV clades. Generally, the genome properties follow those of modern HBV. The root of the HBV tree is projected to between 8.6 and 20.9 thousand years ago, and we estimate a substitution rate of 8.04 × 10 -6 -1.51 × 10 -5 nucleotide substitutions per site per year. In several cases, the geographical locations of the ancient genotypes do not match present-day distributions. Genotypes that today are typical of Africa and Asia, and a subgenotype from India, are shown to have an early Eurasian presence. The geographical and temporal patterns that we observe in ancient and modern HBV genotypes are compatible with well-documented human migrations during the Bronze and Iron Ages 1,2 . We provide evidence for the creation of HBV genotype A via recombination, and for a long-term association of modern HBV genotypes with humans, including the discovery of a human genotype that is now extinct. These data expose a complexity of HBV evolution that is not evident when considering modern sequences alone.
Rieseberg, Loren
2018-02-06
Loren Rieseberg from the University of British Columbia on "The Sunflower Genome and its Evolution" at the 7th Annual Genomics of Energy & Environment Meeting on March 21, 2012 in Walnut Creek, California.
Evolution of Salmonella Cerro on a dairy farm over an eight-year period
USDA-ARS?s Scientific Manuscript database
Salmonella enterica subsp. enterica serovar Cerro, frequently isolated from dairy cattle and an occasional pathogen of humans, was recurrently isolated over an eight-year period on a dairy farm in south-central Pennsylvania. The genomes of 18 S. Cerro isolates recovered directly from the feces of in...
Systematic genome assessment of B-vitamin biosynthesis suggests co-operation among gut microbes
Magnúsdóttir, Stefanía; Ravcheev, Dmitry; de Crécy-Lagard, Valérie; Thiele, Ines
2015-01-01
The human gut microbiota supplies its host with essential nutrients, including B-vitamins. Using the PubSEED platform, we systematically assessed the genomes of 256 common human gut bacteria for the presence of biosynthesis pathways for eight B-vitamins: biotin, cobalamin, folate, niacin, pantothenate, pyridoxine, riboflavin, and thiamin. On the basis of the presence and absence of genome annotations, we predicted that each of the eight vitamins was produced by 40–65% of the 256 human gut microbes. The distribution of synthesis pathways was diverse; some genomes had all eight biosynthesis pathways, whereas others contained no de novo synthesis pathways. We compared our predictions to experimental data from 16 organisms and found 88% of our predictions to be in agreement with published data. In addition, we identified several pairs of organisms whose vitamin synthesis pathway pattern complemented those of other organisms. This analysis suggests that human gut bacteria actively exchange B-vitamins among each other, thereby enabling the survival of organisms that do not synthesize any of these essential cofactors. This result indicates the co-evolution of the gut microbes in the human gut environment. Our work presents the first comprehensive assessment of the B-vitamin synthesis capabilities of the human gut microbiota. We propose that in addition to diet, the gut microbiota is an important source of B-vitamins, and that changes in the gut microbiota composition can severely affect our dietary B-vitamin requirements. PMID:25941533
Extensive Horizontal Gene Transfer during Staphylococcus aureus Co-colonization In Vivo
McCarthy, Alex J.; Loeffler, Anette; Witney, Adam A.; Gould, Katherine A.; Lloyd, David H.; Lindsay, Jodi A.
2014-01-01
Staphylococcus aureus is a commensal and major pathogen of humans and animals. Comparative genomics of S. aureus populations suggests that colonization of different host species is associated with carriage of mobile genetic elements (MGE), particularly bacteriophages and plasmids capable of encoding virulence, resistance, and immune evasion pathways. Antimicrobial-resistant S. aureus of livestock are a potential zoonotic threat to human health if they adapt to colonize humans efficiently. We utilized the technique of experimental evolution and co-colonized gnotobiotic piglets with both human- and pig-associated variants of the lineage clonal complex 398, and investigated growth and genetic changes over 16 days using whole genome sequencing. The human isolate survived co-colonization on piglets more efficiently than in vitro. During co-colonization, transfer of MGE from the pig to the human isolate was detected within 4 h. Extensive and repeated transfer of two bacteriophages and three plasmids resulted in colonization with isolates carrying a wide variety of mobilomes. Whole genome sequencing of progeny bacteria revealed no acquisition of core genome polymorphisms, highlighting the importance of MGE. Staphylococcus aureus bacteriophage recombination and integration into novel sites was detected experimentally for the first time. During colonization, clones coexisted and diversified rather than a single variant dominating. Unexpectedly, each piglet carried unique populations of bacterial variants, suggesting limited transmission of bacteria between piglets once colonized. Our data show that horizontal gene transfer occurs at very high frequency in vivo and significantly higher than that detectable in vitro. PMID:25260585
Cao, Ke; Zheng, Zhijun; Wang, Lirong; Liu, Xin; Zhu, Gengrui; Fang, Weichao; Cheng, Shifeng; Zeng, Peng; Chen, Changwen; Wang, Xinwei; Xie, Min; Zhong, Xiao; Wang, Xiaoli; Zhao, Pei; Bian, Chao; Zhu, Yinling; Zhang, Jiahui; Ma, Guosheng; Chen, Chengxuan; Li, Yanjun; Hao, Fengge; Li, Yong; Huang, Guodong; Li, Yuxiang; Li, Haiyan; Guo, Jian; Xu, Xun; Wang, Jun
2014-07-31
Recently, many studies utilizing next generation sequencing have investigated plant evolution and domestication in annual crops. Peach, Prunus persica, is a typical perennial fruit crop that has ornamental and edible varieties. Unlike other fruit crops, cultivated peach includes a large number of phenotypes but few polymorphisms. In this study, we explore the genetic basis of domestication in peach and the influence of humans on its evolution. We perform large-scale resequencing of 10 wild and 74 cultivated peach varieties, including 9 ornamental, 23 breeding, and 42 landrace lines. We identify 4.6 million SNPs, a large number of which could explain the phenotypic variation in cultivated peach. Population analysis shows a single domestication event, the speciation of P. persica from wild peach. Ornamental and edible peach both belong to P. persica, along with another geographically separated subgroup, Prunus ferganensis. Our analyses enhance our knowledge of the domestication history of perennial fruit crops, and the dataset we generated could be useful for future research on comparative population genomics.
Pan, Zhangyuan; Li, Shengdi; Liu, Qiuyue; Wang, Zhen; Zhou, Zhengkui; Di, Ran; Miao, Benpeng; Hu, Wenping; Wang, Xiangyu; Hu, Xiaoxiang; Xu, Ze; Wei, Dongkai; He, Xiaoyun; Yuan, Liyun; Guo, Xiaofei; Liang, Benmeng; Wang, Ruichao; Li, Xiaoyu; Cao, Xiaohan; Dong, Xinlong; Xia, Qing; Shi, Hongcai; Hao, Geng; Yang, Jean; Luosang, Cuicheng; Zhao, Yiqiang; Jin, Mei; Zhang, Yingjie; Lv, Shenjin; Li, Fukuan; Ding, Guohui; Chu, Mingxing; Li, Yixue
2018-04-01
Animal domestication has been extensively studied, but the process of feralization remains poorly understood. Here, we performed whole-genome sequencing of 99 sheep and identified a primary genetic divergence between 2 heterogeneous populations in the Tibetan Plateau, including 1 semi-feral lineage. Selective sweep and candidate gene analysis revealed local adaptations of these sheep associated with sensory perception, muscle strength, eating habit, mating process, and aggressive behavior. In particular, a horn-related gene, RXFP2, showed signs of rapid evolution specifically in the semi-feral breeds. A unique haplotype and repressed horn-related tissue expression of RXFP2 were correlated with higher horn length, as well as spiral and horizontally extended horn shape. Semi-feralization has an extensive impact on diverse phenotypic traits of sheep. By acquiring features like those of their wild ancestors, semi-feral sheep were able to regain fitness while in frequent contact with wild surroundings and rare human interventions. This study provides a new insight into the evolution of domestic animals when human interventions are no longer dominant.
Wang, Xumin; Deng, Xin; Zhang, Xiaowei; Hu, Songnian; Yu, Jun
2012-01-01
The complete nucleotide sequences of the chloroplast (cp) and mitochondrial (mt) genomes of resurrection plant Boea hygrometrica (Bh, Gesneriaceae) have been determined with the lengths of 153,493 bp and 510,519 bp, respectively. The smaller chloroplast genome contains more genes (147) with a 72% coding sequence, and the larger mitochondrial genome have less genes (65) with a coding faction of 12%. Similar to other seed plants, the Bh cp genome has a typical quadripartite organization with a conserved gene in each region. The Bh mt genome has three recombinant sequence repeats of 222 bp, 843 bp, and 1474 bp in length, which divide the genome into a single master circle (MC) and four isomeric molecules. Compared to other angiosperms, one remarkable feature of the Bh mt genome is the frequent transfer of genetic material from the cp genome during recent Bh evolution. We also analyzed organellar genome evolution in general regarding genome features as well as compositional dynamics of sequence and gene structure/organization, providing clues for the understanding of the evolution of organellar genomes in plants. The cp-derived sequences including tRNAs found in angiosperm mt genomes support the conclusion that frequent gene transfer events may have begun early in the land plant lineage. PMID:22291979
Evolution and population genomics of the Lyme borreliosis pathogen, Borrelia burgdorferi.
Seifert, Stephanie N; Khatchikian, Camilo E; Zhou, Wei; Brisson, Dustin
2015-04-01
Population genomic studies have the potential to address many unresolved questions about microbial pathogens by facilitating the identification of genes underlying ecologically important traits, such as novel virulence factors and adaptations to humans or other host species. Additionally, this framework improves estimations of population demography and evolutionary history to accurately reconstruct recent epidemics and identify the molecular and environmental factors that resulted in the outbreak. The Lyme disease bacterium, Borrelia burgdorferi, exemplifies the power and promise of the application of population genomics to microbial pathogens. We discuss here the future of evolutionary studies in B. burgdorferi, focusing on the primary evolutionary forces of horizontal gene transfer, natural selection, and migration, as investigations transition from analyses of single genes to genomes. Copyright © 2015 Elsevier Ltd. All rights reserved.
A Syst-OMICS Approach to Ensuring Food Safety and Reducing the Economic Burden of Salmonellosis.
Emond-Rheault, Jean-Guillaume; Jeukens, Julie; Freschi, Luca; Kukavica-Ibrulj, Irena; Boyle, Brian; Dupont, Marie-Josée; Colavecchio, Anna; Barrere, Virginie; Cadieux, Brigitte; Arya, Gitanjali; Bekal, Sadjia; Berry, Chrystal; Burnett, Elton; Cavestri, Camille; Chapin, Travis K; Crouse, Alanna; Daigle, France; Danyluk, Michelle D; Delaquis, Pascal; Dewar, Ken; Doualla-Bell, Florence; Fliss, Ismail; Fong, Karen; Fournier, Eric; Franz, Eelco; Garduno, Rafael; Gill, Alexander; Gruenheid, Samantha; Harris, Linda; Huang, Carol B; Huang, Hongsheng; Johnson, Roger; Joly, Yann; Kerhoas, Maud; Kong, Nguyet; Lapointe, Gisèle; Larivière, Line; Loignon, Stéphanie; Malo, Danielle; Moineau, Sylvain; Mottawea, Walid; Mukhopadhyay, Kakali; Nadon, Céline; Nash, John; Ngueng Feze, Ida; Ogunremi, Dele; Perets, Ann; Pilar, Ana V; Reimer, Aleisha R; Robertson, James; Rohde, John; Sanderson, Kenneth E; Song, Lingqiao; Stephan, Roger; Tamber, Sandeep; Thomassin, Paul; Tremblay, Denise; Usongo, Valentine; Vincent, Caroline; Wang, Siyun; Weadge, Joel T; Wiedmann, Martin; Wijnands, Lucas; Wilson, Emily D; Wittum, Thomas; Yoshida, Catherine; Youfsi, Khadija; Zhu, Lei; Weimer, Bart C; Goodridge, Lawrence; Levesque, Roger C
2017-01-01
The Salmonella Syst-OMICS consortium is sequencing 4,500 Salmonella genomes and building an analysis pipeline for the study of Salmonella genome evolution, antibiotic resistance and virulence genes. Metadata, including phenotypic as well as genomic data, for isolates of the collection are provided through the Salmonella Foodborne Syst-OMICS database (SalFoS), at https://salfos.ibis.ulaval.ca/. Here, we present our strategy and the analysis of the first 3,377 genomes. Our data will be used to draw potential links between strains found in fresh produce, humans, animals and the environment. The ultimate goals are to understand how Salmonella evolves over time, improve the accuracy of diagnostic methods, develop control methods in the field, and identify prognostic markers for evidence-based decisions in epidemiology and surveillance.
Analysis of base and codon usage by rubella virus.
Zhou, Yumei; Chen, Xianfeng; Ushijima, Hiroshi; Frey, Teryl K
2012-05-01
Rubella virus (RUBV), a small, plus-strand RNA virus that is an important human pathogen, has the unique feature that the GC content of its genome (70%) is the highest (by 20%) among RNA viruses. To determine the effect of this GC content on genomic evolution, base and codon usage were analyzed across viruses from eight diverse genotypes of RUBV. Despite differences in frequency of codon use, the favored codons in the RUBV genome matched those in the human genome for 18 of the 20 amino acids, indicating adaptation to the host. Although usage patterns were conserved in corresponding genes in the diverse genotypes, within-genome comparison revealed that both base and codon usages varied regionally, particularly in the hypervariable region (HVR) of the P150 replicase gene. While directional mutation pressure was predominant in determining base and codon usage within most of the genome (with the strongest tendency being towards C's at third codon positions), natural selection was predominant in the HVR region. The GC content of this region was the highest in the genome (>80%), and it was not clear if selection at the nucleotide level accompanied selection at the amino acid level. Dinucleotide frequency analysis of the RUBV genome revealed that TpA usage was lower than expected, similar to mammalian genes; however, CpG usage was not suppressed, and TpG usage was not enhanced, as is the case in mammalian genes.
Evolution of community- and healthcare-associated methicillin-resistant Staphylococcus aureus☆
Uhlemann, Anne-Catrin; Otto, Michael; Lowy, Franklin D.; DeLeo, Frank R.
2013-01-01
Staphylococcus aureus is a prominent cause of human infections globally. The high prevalence of infections is compounded by antibiotic resistance—a significant problem for treatment. Methicillin-resistant S. aureus (MRSA) is endemic in hospitals and healthcare facilities worldwide, and is an increasingly common cause of community-associated bacterial infections in industrialized countries. Although much focus is placed on the role of S. aureus as a human pathogen, it is in fact a human commensal organism that has had a relatively long coexistence with the human host. Many S. aureus infections can be explained by host susceptibility or other predisposing risk factors. On the other hand, the emergence/re-emergence of successful S. aureus clones (referred to as epidemic waves) suggests a rapid bacterial adaption and evolution, which includes the emergence of antibiotic resistance and increased virulence and/or transmissibility. It is within this context that we review our understanding of selected S. aureus epidemic waves, and highlight the use of genome sequencing as a means to better understand the evolution of each lineage. PMID:23648426
Longo, Mark S; Carone, Dawn M; Green, Eric D; O'Neill, Michael J; O'Neill, Rachel J
2009-01-01
Background Large-scale genome rearrangements brought about by chromosome breaks underlie numerous inherited diseases, initiate or promote many cancers and are also associated with karyotype diversification during species evolution. Recent research has shown that these breakpoints are nonrandomly distributed throughout the mammalian genome and many, termed "evolutionary breakpoints" (EB), are specific genomic locations that are "reused" during karyotypic evolution. When the phylogenetic trajectory of orthologous chromosome segments is considered, many of these EB are coincident with ancient centromere activity as well as new centromere formation. While EB have been characterized as repeat-rich regions, it has not been determined whether specific sequences have been retained during evolution that would indicate previous centromere activity or a propensity for new centromere formation. Likewise, the conservation of specific sequence motifs or classes at EBs among divergent mammalian taxa has not been determined. Results To define conserved sequence features of EBs associated with centromere evolution, we performed comparative sequence analysis of more than 4.8 Mb within the tammar wallaby, Macropus eugenii, derived from centromeric regions (CEN), euchromatic regions (EU), and an evolutionary breakpoint (EB) that has undergone convergent breakpoint reuse and past centromere activity in marsupials. We found a dramatic enrichment for long interspersed nucleotide elements (LINE1s) and endogenous retroviruses (ERVs) and a depletion of short interspersed nucleotide elements (SINEs) shared between CEN and EBs. We analyzed the orthologous human EB (14q32.33), known to be associated with translocations in many cancers including multiple myelomas and plasma cell leukemias, and found a conserved distribution of similar repetitive elements. Conclusion Our data indicate that EBs tracked within the class Mammalia harbor sequence features retained since the divergence of marsupials and eutherians that may have predisposed these genomic regions to large-scale chromosomal instability. PMID:19630942
The Language of the Protein Universe
Scaiewicz, Andrea; Levitt, Michael
2015-01-01
Proteins, the main cell machinery which play a major roll in nearly every cellular process, have always been a central focus in biology. We live in the post-genomic era, and inferring information from massive data sets is a steadily growing universal challenge. The increasing availability of fully sequenced genomes can be regarded as the “Rosetta Stone” of the protein universe, allowing the understanding of genomes and their evolution, just as the original Rosetta Stone allowed Champollion to decipher the ancient Egyptian hieroglyphics. In this review, we consider aspects of the protein domain architectures repertoire that are closely related to those of human languages and aim to provide some insights about the language of proteins. PMID:26451980
Rapid evolution in insect pests: the importance of space and time in population genomics studies.
Pélissié, Benjamin; Crossley, Michael S; Cohen, Zachary Paul; Schoville, Sean D
2018-04-01
Pest species in agroecosystems often exhibit patterns of rapid evolution to environmental and human-imposed selection pressures. Although the role of adaptive processes is well accepted, few insect pests have been studied in detail and most research has focused on selection at insecticide resistance candidate genes. Emerging genomic datasets provide opportunities to detect and quantify selection in insect pest populations, and address long-standing questions about mechanisms underlying rapid evolutionary change. We examine the strengths of recent studies that stratify population samples both in space (along environmental gradients and comparing ancestral vs. derived populations) and in time (using chronological sampling, museum specimens and comparative phylogenomics), resulting in critical insights on evolutionary processes, and providing new directions for studying pests in agroecosystems. Copyright © 2018 Elsevier Inc. All rights reserved.
Retroelements and their impact on genome evolution and functioning.
Gogvadze, Elena; Buzdin, Anton
2009-12-01
Retroelements comprise a considerable fraction of eukaryotic genomes. Since their initial discovery by Barbara McClintock in maize DNA, retroelements have been found in genomes of almost all organisms. First considered as a "junk DNA" or genomic parasites, they were shown to influence genome functioning and to promote genetic innovations. For this reason, they were suggested as an important creative force in the genome evolution and adaptation of an organism to altered environmental conditions. In this review, we summarize the up-to-date knowledge of different ways of retroelement involvement in structural and functional evolution of genes and genomes, as well as the mechanisms generated by cells to control their retrotransposition.
Thybert, David; Roller, Maša; Navarro, Fábio C.P.; Fiddes, Ian; Streeter, Ian; Feig, Christine; Martin-Galvez, David; Kolmogorov, Mikhail; Janoušek, Václav; Akanni, Wasiu; Aken, Bronwen; Aldridge, Sarah; Chakrapani, Varshith; Chow, William; Clarke, Laura; Cummins, Carla; Doran, Anthony; Dunn, Matthew; Goodstadt, Leo; Howe, Kerstin; Howell, Matthew; Josselin, Ambre-Aurore; Karn, Robert C.; Laukaitis, Christina M.; Jingtao, Lilue; Martin, Fergal; Muffato, Matthieu; Nachtweide, Stefanie; Quail, Michael A.; Sisu, Cristina; Stanke, Mario; Stefflova, Klara; Van Oosterhout, Cock; Veyrunes, Frederic; Ward, Ben; Yang, Fengtang; Yazdanifar, Golbahar; Zadissa, Amonida; Adams, David J.; Brazma, Alvis; Gerstein, Mark; Paten, Benedict; Pham, Son; Keane, Thomas M.; Odom, Duncan T.; Flicek, Paul
2018-01-01
Understanding the mechanisms driving lineage-specific evolution in both primates and rodents has been hindered by the lack of sister clades with a similar phylogenetic structure having high-quality genome assemblies. Here, we have created chromosome-level assemblies of the Mus caroli and Mus pahari genomes. Together with the Mus musculus and Rattus norvegicus genomes, this set of rodent genomes is similar in divergence times to the Hominidae (human-chimpanzee-gorilla-orangutan). By comparing the evolutionary dynamics between the Muridae and Hominidae, we identified punctate events of chromosome reshuffling that shaped the ancestral karyotype of Mus musculus and Mus caroli between 3 and 6 million yr ago, but that are absent in the Hominidae. Hominidae show between four- and sevenfold lower rates of nucleotide change and feature turnover in both neutral and functional sequences, suggesting an underlying coherence to the Muridae acceleration. Our system of matched, high-quality genome assemblies revealed how specific classes of repeats can play lineage-specific roles in related species. Recent LINE activity has remodeled protein-coding loci to a greater extent across the Muridae than the Hominidae, with functional consequences at the species level such as reproductive isolation. Furthermore, we charted a Muridae-specific retrotransposon expansion at unprecedented resolution, revealing how a single nucleotide mutation transformed a specific SINE element into an active CTCF binding site carrier specifically in Mus caroli, which resulted in thousands of novel, species-specific CTCF binding sites. Our results show that the comparison of matched phylogenetic sets of genomes will be an increasingly powerful strategy for understanding mammalian biology. PMID:29563166
Functional requirements driving the gene duplication in 12 Drosophila species.
Zhong, Yan; Jia, Yanxiao; Gao, Yang; Tian, Dacheng; Yang, Sihai; Zhang, Xiaohui
2013-08-15
Gene duplication supplies the raw materials for novel gene functions and many gene families arisen from duplication experience adaptive evolution. Most studies of young duplicates have focused on mammals, especially humans, whereas reports describing their genome-wide evolutionary patterns across the closely related Drosophila species are rare. The sequenced 12 Drosophila genomes provide the opportunity to address this issue. In our study, 3,647 young duplicate gene families were identified across the 12 Drosophila species and three types of expansions, species-specific, lineage-specific and complex expansions, were detected in these gene families. Our data showed that the species-specific young duplicate genes predominated (86.6%) over the other two types. Interestingly, many independent species-specific expansions in the same gene family have been observed in many species, even including 11 or 12 Drosophila species. Our data also showed that the functional bias observed in these young duplicate genes was mainly related to responses to environmental stimuli and biotic stresses. This study reveals the evolutionary patterns of young duplicates across 12 Drosophila species on a genomic scale. Our results suggest that convergent evolution acts on young duplicate genes after the species differentiation and adaptive evolution may play an important role in duplicate genes for adaption to ecological factors and environmental changes in Drosophila.
Genomic Diversification of Enterococci in Hosts: The Role of the Mobilome
Santagati, Maria; Campanile, Floriana; Stefani, Stefania
2012-01-01
Enterococci are ubiquitous lactic acid bacteria, possessing a flexible nature that allows them to colonize various environments and hosts but also to be opportunistic pathogens. Many papers have contributed to a better understanding of: (i) the taxonomy of this complex group of microorganisms; (ii) intra-species variability; (iii) the role of different pathogenicity traits; and (iv) some markers related to the character of host-specificity, but the reasons of such incredible success of adaptability is still far from being fully explained. Recently, genomic-based studies have improved our understanding of the genome diversity of the most studied species, i.e., E. faecalis and E. faecium. From these studies, what is becoming evident is the role of the mobilome in adding new abilities to colonize new hosts and environments, and eventually in driving their evolution: specific clones associated with human infections or specific hosts can exist, but probably the consideration of these populations as strictly clonal groups is only partially correct. The variable presence of mobile genetic elements may, indeed, be one of the factors involved in the evolution of one specific group in a specific host and/or environment. Certainly more extensive studies using new high throughput technologies are mandatory to fully understand the evolution of predominant clones and species in different hosts and environments. PMID:22435066
Genomic diversification of enterococci in hosts: the role of the mobilome.
Santagati, Maria; Campanile, Floriana; Stefani, Stefania
2012-01-01
Enterococci are ubiquitous lactic acid bacteria, possessing a flexible nature that allows them to colonize various environments and hosts but also to be opportunistic pathogens. Many papers have contributed to a better understanding of: (i) the taxonomy of this complex group of microorganisms; (ii) intra-species variability; (iii) the role of different pathogenicity traits; and (iv) some markers related to the character of host-specificity, but the reasons of such incredible success of adaptability is still far from being fully explained. Recently, genomic-based studies have improved our understanding of the genome diversity of the most studied species, i.e., E. faecalis and E. faecium. From these studies, what is becoming evident is the role of the mobilome in adding new abilities to colonize new hosts and environments, and eventually in driving their evolution: specific clones associated with human infections or specific hosts can exist, but probably the consideration of these populations as strictly clonal groups is only partially correct. The variable presence of mobile genetic elements may, indeed, be one of the factors involved in the evolution of one specific group in a specific host and/or environment. Certainly more extensive studies using new high throughput technologies are mandatory to fully understand the evolution of predominant clones and species in different hosts and environments.
Three crocodilian genomes reveal ancestral patterns of evolution among archosaurs
Green, Richard E; Braun, Edward L; Armstrong, Joel; Earl, Dent; Nguyen, Ngan; Hickey, Glenn; Vandewege, Michael W; St John, John A; Capella-Gutiérrez, Salvador; Castoe, Todd A; Kern, Colin; Fujita, Matthew K; Opazo, Juan C; Jurka, Jerzy; Kojima, Kenji K; Caballero, Juan; Hubley, Robert M; Smit, Arian F; Platt, Roy N; Lavoie, Christine A; Ramakodi, Meganathan P; Finger, John W; Suh, Alexander; Isberg, Sally R; Miles, Lee; Chong, Amanda Y; Jaratlerdsiri, Weerachai; Gongora, Jaime; Moran, Christopher; Iriarte, Andrés; McCormack, John; Burgess, Shane C; Edwards, Scott V; Lyons, Eric; Williams, Christina; Breen, Matthew; Howard, Jason T; Gresham, Cathy R; Peterson, Daniel G; Schmitz, Jürgen; Pollock, David D; Haussler, David; Triplett, Eric W; Zhang, Guojie; Irie, Naoki; Jarvis, Erich D; Brochu, Christopher A; Schmidt, Carl J; McCarthy, Fiona M; Faircloth, Brant C; Hoffmann, Federico G; Glenn, Travis C; Gabaldón, Toni; Paten, Benedict; Ray, David A
2015-01-01
To provide context for the diversifications of archosaurs, the group that includes crocodilians, dinosaurs and birds, we generated draft genomes of three crocodilians, Alligator mississippiensis (the American alligator), Crocodylus porosus (the saltwater crocodile), and Gavialis gangeticus (the Indian gharial). We observed an exceptionally slow rate of genome evolution within crocodilians at all levels, including nucleotide substitutions, indels, transposable element content and movement, gene family evolution, and chromosomal synteny. When placed within the context of related taxa including birds and turtles, this suggests that the common ancestor of all of these taxa also exhibited slow genome evolution and that the relatively rapid evolution of bird genomes represents an autapomorphy within that clade. The data also provided the opportunity to analyze heterozygosity in crocodilians, which indicates a likely reduction in population size for all three taxa through the Pleistocene. Finally, these new data combined with newly published bird genomes allowed us to reconstruct the partial genome of the common ancestor of archosaurs providing a tool to investigate the genetic starting material of crocodilians, birds, and dinosaurs. PMID:25504731
Mitochondrial genome evolution in the Saccharomyces sensu stricto complex.
Ruan, Jiangxing; Cheng, Jian; Zhang, Tongcun; Jiang, Huifeng
2017-01-01
Exploring the evolutionary patterns of mitochondrial genomes is important for our understanding of the Saccharomyces sensu stricto (SSS) group, which is a model system for genomic evolution and ecological analysis. In this study, we first obtained the complete mitochondrial sequences of two important species, Saccharomyces mikatae and Saccharomyces kudriavzevii. We then compared the mitochondrial genomes in the SSS group with those of close relatives, and found that the non-coding regions evolved rapidly, including dramatic expansion of intergenic regions, fast evolution of introns and almost 20-fold higher rearrangement rates than those of the nuclear genomes. However, the coding regions, and especially the protein-coding genes, are more conserved than those in the nuclear genomes of the SSS group. The different evolutionary patterns of coding and non-coding regions in the mitochondrial and nuclear genomes may be related to the origin of the aerobic fermentation lifestyle in this group. Our analysis thus provides novel insights into the evolution of mitochondrial genomes.
The Evolutionary Origin and Genetic Makeup of Domestic Horses.
Librado, Pablo; Fages, Antoine; Gaunitz, Charleen; Leonardi, Michela; Wagner, Stefanie; Khan, Naveed; Hanghøj, Kristian; Alquraishi, Saleh A; Alfarhan, Ahmed H; Al-Rasheid, Khaled A; Der Sarkissian, Clio; Schubert, Mikkel; Orlando, Ludovic
2016-10-01
The horse was domesticated only 5.5 KYA, thousands of years after dogs, cattle, pigs, sheep, and goats. The horse nonetheless represents the domestic animal that most impacted human history; providing us with rapid transportation, which has considerably changed the speed and magnitude of the circulation of goods and people, as well as their cultures and diseases. By revolutionizing warfare and agriculture, horses also deeply influenced the politico-economic trajectory of human societies. Reciprocally, human activities have circled back on the recent evolution of the horse, by creating hundreds of domestic breeds through selective programs, while leading all wild populations to near extinction. Despite being tightly associated with humans, several aspects in the evolution of the domestic horse remain controversial. Here, we review recent advances in comparative genomics and paleogenomics that helped advance our understanding of the genetic foundation of domestic horses. Copyright © 2016 by the Genetics Society of America.
The Evolutionary Origin and Genetic Makeup of Domestic Horses
Librado, Pablo; Fages, Antoine; Gaunitz, Charleen; Leonardi, Michela; Wagner, Stefanie; Khan, Naveed; Hanghøj, Kristian; Alquraishi, Saleh A.; Alfarhan, Ahmed H.; Al-Rasheid, Khaled A.; Der Sarkissian, Clio; Schubert, Mikkel; Orlando, Ludovic
2016-01-01
The horse was domesticated only 5.5 KYA, thousands of years after dogs, cattle, pigs, sheep, and goats. The horse nonetheless represents the domestic animal that most impacted human history; providing us with rapid transportation, which has considerably changed the speed and magnitude of the circulation of goods and people, as well as their cultures and diseases. By revolutionizing warfare and agriculture, horses also deeply influenced the politico-economic trajectory of human societies. Reciprocally, human activities have circled back on the recent evolution of the horse, by creating hundreds of domestic breeds through selective programs, while leading all wild populations to near extinction. Despite being tightly associated with humans, several aspects in the evolution of the domestic horse remain controversial. Here, we review recent advances in comparative genomics and paleogenomics that helped advance our understanding of the genetic foundation of domestic horses. PMID:27729493
Shapiro, James A
2016-06-08
The 21st century genomics-based analysis of evolutionary variation reveals a number of novel features impossible to predict when Dobzhansky and other evolutionary biologists formulated the neo-Darwinian Modern Synthesis in the middle of the last century. These include three distinct realms of cell evolution; symbiogenetic fusions forming eukaryotic cells with multiple genome compartments; horizontal organelle, virus and DNA transfers; functional organization of proteins as systems of interacting domains subject to rapid evolution by exon shuffling and exonization; distributed genome networks integrated by mobile repetitive regulatory signals; and regulation of multicellular development by non-coding lncRNAs containing repetitive sequence components. Rather than single gene traits, all phenotypes involve coordinated activity by multiple interacting cell molecules. Genomes contain abundant and functional repetitive components in addition to the unique coding sequences envisaged in the early days of molecular biology. Combinatorial coding, plus the biochemical abilities cells possess to rearrange DNA molecules, constitute a powerful toolbox for adaptive genome rewriting. That is, cells possess "Read-Write Genomes" they alter by numerous biochemical processes capable of rapidly restructuring cellular DNA molecules. Rather than viewing genome evolution as a series of accidental modifications, we can now study it as a complex biological process of active self-modification.
Parasitism drives host genome evolution: Insights from the Pasteuria ramosa-Daphnia magna system.
Bourgeois, Yann; Roulin, Anne C; Müller, Kristina; Ebert, Dieter
2017-04-01
Because parasitism is thought to play a major role in shaping host genomes, it has been predicted that genomic regions associated with resistance to parasites should stand out in genome scans, revealing signals of selection above the genomic background. To test whether parasitism is indeed such a major factor in host evolution and to better understand host-parasite interaction at the molecular level, we studied genome-wide polymorphisms in 97 genotypes of the planktonic crustacean Daphnia magna originating from three localities across Europe. Daphnia magna is known to coevolve with the bacterial pathogen Pasteuria ramosa for which host genotypes (clonal lines) are either resistant or susceptible. Using association mapping, we identified two genomic regions involved in resistance to P. ramosa, one of which was already known from a previous QTL analysis. We then performed a naïve genome scan to test for signatures of positive selection and found that the two regions identified with the association mapping further stood out as outliers. Several other regions with evidence for selection were also found, but no link between these regions and phenotypic variation could be established. Our results are consistent with the hypothesis that parasitism is driving host genome evolution. © 2017 The Author(s). Evolution © 2017 The Society for the Study of Evolution.