Zhu, Yuan O; Aw, Pauline P K; de Sessions, Paola Florez; Hong, Shuzhen; See, Lee Xian; Hong, Lewis Z; Wilm, Andreas; Li, Chen Hao; Hue, Stephane; Lim, Seng Gee; Nagarajan, Niranjan; Burkholder, William F; Hibberd, Martin
2017-10-27
Viral populations are complex, dynamic, and fast evolving. The evolution of groups of closely related viruses in a competitive environment is termed quasispecies. To fully understand the role that quasispecies play in viral evolution, characterizing the trajectories of viral genotypes in an evolving population is the key. In particular, long-range haplotype information for thousands of individual viruses is critical; yet generating this information is non-trivial. Popular deep sequencing methods generate relatively short reads that do not preserve linkage information, while third generation sequencing methods have higher error rates that make detection of low frequency mutations a bioinformatics challenge. Here we applied BAsE-Seq, an Illumina-based single-virion sequencing technology, to eight samples from four chronic hepatitis B (CHB) patients - once before antiviral treatment and once after viral rebound due to resistance. With single-virion sequencing, we obtained 248-8796 single-virion sequences per sample, which allowed us to find evidence for both hard and soft selective sweeps. We were able to reconstruct population demographic history that was independently verified by clinically collected data. We further verified four of the samples independently through PacBio SMRT and Illumina Pooled deep sequencing. Overall, we showed that single-virion sequencing yields insight into viral evolution and population dynamics in an efficient and high throughput manner. We believe that single-virion sequencing is widely applicable to the study of viral evolution in the context of drug resistance and host adaptation, allows differentiation between soft or hard selective sweeps, and may be useful in the reconstruction of intra-host viral population demographic history.
Ultra-deep mutant spectrum profiling: improving sequencing accuracy using overlapping read pairs.
Chen-Harris, Haiyin; Borucki, Monica K; Torres, Clinton; Slezak, Tom R; Allen, Jonathan E
2013-02-12
High throughput sequencing is beginning to make a transformative impact in the area of viral evolution. Deep sequencing has the potential to reveal the mutant spectrum within a viral sample at high resolution, thus enabling the close examination of viral mutational dynamics both within- and between-hosts. The challenge however, is to accurately model the errors in the sequencing data and differentiate real viral mutations, particularly those that exist at low frequencies, from sequencing errors. We demonstrate that overlapping read pairs (ORP) -- generated by combining short fragment sequencing libraries and longer sequencing reads -- significantly reduce sequencing error rates and improve rare variant detection accuracy. Using this sequencing protocol and an error model optimized for variant detection, we are able to capture a large number of genetic mutations present within a viral population at ultra-low frequency levels (<0.05%). Our rare variant detection strategies have important implications beyond viral evolution and can be applied to any basic and clinical research area that requires the identification of rare mutations.
Leung, Preston; Eltahla, Auda A; Lloyd, Andrew R; Bull, Rowena A; Luciani, Fabio
2017-07-15
With the advent of affordable deep sequencing technologies, detection of low frequency variants within genetically diverse viral populations can now be achieved with unprecedented depth and efficiency. The high-resolution data provided by next generation sequencing technologies is currently recognised as the gold standard in estimation of viral diversity. In the analysis of rapidly mutating viruses, longitudinal deep sequencing datasets from viral genomes during individual infection episodes, as well as at the epidemiological level during outbreaks, now allow for more sophisticated analyses such as statistical estimates of the impact of complex mutation patterns on the evolution of the viral populations both within and between hosts. These analyses are revealing more accurate descriptions of the evolutionary dynamics that underpin the rapid adaptation of these viruses to the host response, and to drug therapies. This review assesses recent developments in methods and provide informative research examples using deep sequencing data generated from rapidly mutating viruses infecting humans, particularly hepatitis C virus (HCV), human immunodeficiency virus (HIV), Ebola virus and influenza virus, to understand the evolution of viral genomes and to explore the relationship between viral mutations and the host adaptive immune response. Finally, we discuss limitations in current technologies, and future directions that take advantage of publically available large deep sequencing datasets. Copyright © 2016 Elsevier B.V. All rights reserved.
Genome sequence diversity and clues to the evolution of variola (smallpox) virus.
Esposito, Joseph J; Sammons, Scott A; Frace, A Michael; Osborne, John D; Olsen-Rasmussen, Melissa; Zhang, Ming; Govil, Dhwani; Damon, Inger K; Kline, Richard; Laker, Miriam; Li, Yu; Smith, Geoffrey L; Meyer, Hermann; Leduc, James W; Wohlhueter, Robert M
2006-08-11
Comparative genomics of 45 epidemiologically varied variola virus isolates from the past 30 years of the smallpox era indicate low sequence diversity, suggesting that there is probably little difference in the isolates' functional gene content. Phylogenetic clustering inferred three clades coincident with their geographical origin and case-fatality rate; the latter implicated putative proteins that mediate viral virulence differences. Analysis of the viral linear DNA genome suggests that its evolution involved direct descent and DNA end-region recombination events. Knowing the sequences will help understand the viral proteome and improve diagnostic test precision, therapeutics, and systems for their assessment.
Sobel Leonard, Ashley; McClain, Micah T; Smith, Gavin J D; Wentworth, David E; Halpin, Rebecca A; Lin, Xudong; Ransier, Amy; Stockwell, Timothy B; Das, Suman R; Gilbert, Anthony S; Lambkin-Williams, Robert; Ginsburg, Geoffrey S; Woods, Christopher W; Koelle, Katia
2016-12-15
Knowledge of influenza virus evolution at the point of transmission and at the intrahost level remains limited, particularly for human hosts. Here, we analyze a unique viral data set of next-generation sequencing (NGS) samples generated from a human influenza challenge study wherein 17 healthy subjects were inoculated with cell- and egg-passaged virus. Nasal wash samples collected from 7 of these subjects were successfully deep sequenced. From these, we characterized changes in the subjects' viral populations during infection and identified differences between the virus in these samples and the viral stock used to inoculate the subjects. We first calculated pairwise genetic distances between the subjects' nasal wash samples, the viral stock, and the influenza virus A/Wisconsin/67/2005 (H3N2) reference strain used to generate the stock virus. These distances revealed that considerable viral evolution occurred at various points in the human challenge study. Further quantitative analyses indicated that (i) the viral stock contained genetic variants that originated and likely were selected for during the passaging process, (ii) direct intranasal inoculation with the viral stock resulted in a selective bottleneck that reduced nonsynonymous genetic diversity in the viral hemagglutinin and nucleoprotein, and (iii) intrahost viral evolution continued over the course of infection. These intrahost evolutionary dynamics were dominated by purifying selection. Our findings indicate that rapid viral evolution can occur during acute influenza infection in otherwise healthy human hosts when the founding population size of the virus is large, as is the case with direct intranasal inoculation. Influenza viruses circulating among humans are known to rapidly evolve over time. However, little is known about how influenza virus evolves across single transmission events and over the course of a single infection. To address these issues, we analyze influenza virus sequences from a human challenge experiment that initiated infection with a cell- and egg-passaged viral stock, which appeared to have adapted during its preparation. We find that the subjects' viral populations differ genetically from the viral stock, with subjects' viral populations having lower representation of the amino-acid-changing variants that arose during viral preparation. We also find that most of the viral evolution occurring over single infections is characterized by further decreases in the frequencies of these amino-acid-changing variants and that only limited intrahost genetic diversification through new mutations is apparent. Our findings indicate that influenza virus populations can undergo rapid genetic changes during acute human infections. Copyright © 2016 Sobel Leonard et al.
Influenza A virus hemagglutinin glycosylation compensates for antibody escape fitness costs.
Kosik, Ivan; Ince, William L; Gentles, Lauren E; Oler, Andrew J; Kosikova, Martina; Angel, Matthew; Magadán, Javier G; Xie, Hang; Brooke, Christopher B; Yewdell, Jonathan W
2018-01-01
Rapid antigenic evolution enables the persistence of seasonal influenza A and B viruses in human populations despite widespread herd immunity. Understanding viral mechanisms that enable antigenic evolution is critical for designing durable vaccines and therapeutics. Here, we utilize the primerID method of error-correcting viral population sequencing to reveal an unexpected role for hemagglutinin (HA) glycosylation in compensating for fitness defects resulting from escape from anti-HA neutralizing antibodies. Antibody-free propagation following antigenic escape rapidly selected viruses with mutations that modulated receptor binding avidity through the addition of N-linked glycans to the HA globular domain. These findings expand our understanding of the viral mechanisms that maintain fitness during antigenic evolution to include glycan addition, and highlight the immense power of high-definition virus population sequencing to reveal novel viral adaptive mechanisms.
Palma, Paolo; Zangari, Paola; Alteri, Claudia; Tchidjou, Hyppolite K; Manno, Emma Concetta; Liuzzi, Giuseppina; Perno, Carlo Federico; Rossi, Paolo; Bertoli, Ada; Bernardi, Stefania
2016-12-09
HIV genetic diversity implicates major challenges for the control of viral infection by the immune system and for the identification of an effective immunotherapeutic strategy. With the present case report we underline as HIV evolution could be effectively halted by early antiretroviral treatment (eART). Few cases supported this evidence due to the difficulty of performing amplification and sequencing analysis in long-term viral suppressed patients. Here, we reported the case of limited HIV-1 viral evolution over time in a successful early treated child. A perinatally HIV-1 infected infant was treated within 7 weeks of age with zidovudine, lamivudine, nevirapine and lopinavir/ritonavir. At antiretroviral treatment (ART) initiation HIV-1 viral load (VL) and CD4 percentage were >500,000 copies/ml and 35%, respectively. Plasma genotypic resistance test showed a wild-type virus. The child reached VL undetectability after 33 weeks of combination antiretroviral therapy (cART) since he maintained a stable VL <40copies/ml. After 116 weeks on ART we were able to perform amplification and sequencing assay on the plasma virus. At this time VL was <40 copies/ml and CD4 percentage was 40%. Again the genotypic resistance test revealed a wild-type virus. The phylogenetic analysis performed on the HIV-1 pol sequences of the mother and the child revealed that sequences clustered with C subtype reference strains and formed a monophyletic cluster distinct from the other C sequences included in the analysis (bootstrap value >90%). Any major evolutionary divergence was detected. eART limits the viral evolution avoiding the emergence of new viral variants. This result may have important implications in host immune control and may sustain the challenge search of new personalized immunotherapeutic approaches to achieve a prolonged viral remission.
Viral genetic variation accounts for a third of variability in HIV-1 set-point viral load in Europe.
Blanquart, François; Wymant, Chris; Cornelissen, Marion; Gall, Astrid; Bakker, Margreet; Bezemer, Daniela; Hall, Matthew; Hillebregt, Mariska; Ong, Swee Hoe; Albert, Jan; Bannert, Norbert; Fellay, Jacques; Fransen, Katrien; Gourlay, Annabelle J; Grabowski, M Kate; Gunsenheimer-Bartmeyer, Barbara; Günthard, Huldrych F; Kivelä, Pia; Kouyos, Roger; Laeyendecker, Oliver; Liitsola, Kirsi; Meyer, Laurence; Porter, Kholoud; Ristola, Matti; van Sighem, Ard; Vanham, Guido; Berkhout, Ben; Kellam, Paul; Reiss, Peter; Fraser, Christophe
2017-06-01
HIV-1 set-point viral load-the approximately stable value of viraemia in the first years of chronic infection-is a strong predictor of clinical outcome and is highly variable across infected individuals. To better understand HIV-1 pathogenesis and the evolution of the viral population, we must quantify the heritability of set-point viral load, which is the fraction of variation in this phenotype attributable to viral genetic variation. However, current estimates of heritability vary widely, from 6% to 59%. Here we used a dataset of 2,028 seroconverters infected between 1985 and 2013 from 5 European countries (Belgium, Switzerland, France, the Netherlands and the United Kingdom) and estimated the heritability of set-point viral load at 31% (CI 15%-43%). Specifically, heritability was measured using models of character evolution describing how viral load evolves on the phylogeny of whole-genome viral sequences. In contrast to previous studies, (i) we measured viral loads using standardized assays on a sample collected in a strict time window of 6 to 24 months after infection, from which the viral genome was also sequenced; (ii) we compared 2 models of character evolution, the classical "Brownian motion" model and another model ("Ornstein-Uhlenbeck") that includes stabilising selection on viral load; (iii) we controlled for covariates, including age and sex, which may inflate estimates of heritability; and (iv) we developed a goodness of fit test based on the correlation of viral loads in cherries of the phylogenetic tree, showing that both models of character evolution fit the data well. An overall heritability of 31% (CI 15%-43%) is consistent with other studies based on regression of viral load in donor-recipient pairs. Thus, about a third of variation in HIV-1 virulence is attributable to viral genetic variation.
Sequential Bottlenecks Drive Viral Evolution in Early Acute Hepatitis C Virus Infection
McElroy, Kerensa; Gaudieri, Silvana; Pham, Son T.; Chopra, Abha; Cameron, Barbara; Maher, Lisa; Dore, Gregory J.; White, Peter A.; Lloyd, Andrew R.
2011-01-01
Hepatitis C is a pandemic human RNA virus, which commonly causes chronic infection and liver disease. The characterization of viral populations that successfully initiate infection, and also those that drive progression to chronicity is instrumental for understanding pathogenesis and vaccine design. A comprehensive and longitudinal analysis of the viral population was conducted in four subjects followed from very early acute infection to resolution of disease outcome. By means of next generation sequencing (NGS) and standard cloning/Sanger sequencing, genetic diversity and viral variants were quantified over the course of the infection at frequencies as low as 0.1%. Phylogenetic analysis of reassembled viral variants revealed acute infection was dominated by two sequential bottleneck events, irrespective of subsequent chronicity or clearance. The first bottleneck was associated with transmission, with one to two viral variants successfully establishing infection. The second occurred approximately 100 days post-infection, and was characterized by a decline in viral diversity. In the two subjects who developed chronic infection, this second bottleneck was followed by the emergence of a new viral population, which evolved from the founder variants via a selective sweep with fixation in a small number of mutated sites. The diversity at sites with non-synonymous mutation was higher in predicted cytotoxic T cell epitopes, suggesting immune-driven evolution. These results provide the first detailed analysis of early within-host evolution of HCV, indicating strong selective forces limit viral evolution in the acute phase of infection. PMID:21912520
Gayral, Philippe; Iskra-Caruana, Marie-Line
2009-07-01
Banana streak virus (BSV) is a plant dsDNA pararetrovirus (family Caulimoviridae, genus badnavirus). Although integration is not an essential step in the BSV replication cycle, the nuclear genome of banana (Musa sp.) contains BSV endogenous pararetrovirus sequences (BSV EPRVs). Some BSV EPRVs are infectious by reconstituting a functional viral genome. Recent studies revealed a large molecular diversity of episomal BSV viruses (i.e., nonintegrated) while others focused on BSV EPRV sequences only. In this study, the evolutionary history of badnavirus integration in banana was inferred from phylogenetic relationships between BSV and BSV EPRVs. The relative evolution rates and selective pressures (d(N)/d(S) ratio) were also compared between endogenous and episomal viral sequences. At least 27 recent independent integration events occurred after the divergence of three banana species, indicating that viral integration is a recent and frequent phenomenon. Relaxation of selective pressure on badnaviral sequences that experienced neutral evolution after integration in the plant genome was recorded. Additionally, a significant decrease (35%) in the EPRV evolution rate was observed compared to BSV, reflecting the difference in the evolution rate between episomal dsDNA viruses and plant genome. The comparison of our results with the evolution rate of the Musa genome and other reverse-transcribing viruses suggests that EPRVs play an active role in episomal BSV diversity and evolution.
DOE Office of Scientific and Technical Information (OSTI.GOV)
López, José L.; Golemba, Marcelo; Hernández, Edgardo
Rhodopsins are broadly distributed. In this work, we analyzed 23 metagenomes corresponding to marine sediment samples from four regions that share cold climate conditions (Norway; Sweden; Argentina and Antarctica). In order to investigate the genes evolution of viral rhodopsins, an initial set of 6224 bacterial rhodopsin sequences according to COG5524 were retrieved from the 23 metagenomes. After selection by the presence of transmembrane domains and alignment, 123 viral (51) and non-viral (72) sequences (>50 amino acids) were finally included in further analysis. Viral rhodopsin genes were homologs of Phaeocystis globosa virus and Organic lake Phycodnavirus. Non-viral microbial rhodopsin genes weremore » ascribed to Bacteroidetes, Planctomycetes, Firmicutes, Actinobacteria, Cyanobacteria, Proteobacteria, Deinococcus-Thermus and Cryptophyta and Fungi. A rescreening using Blastp, using as queries the viral sequences previously described, retrieved 30 sequences (>100 amino acids). Phylogeographic analysis revealed a geographical clustering of the sequences affiliated to the viral group. This clustering was not observed for the microbial non-viral sequences. The phylogenetic reconstruction allowed us to propose the existence of a putative ancestor of viral rhodopsin genes related to Actinobacteria and Chloroflexi. This is the first report about the existence of a phylogeographic association of the viral rhodopsin sequences from marine sediments.« less
Smith, Richard H; Hallwirth, Claus V; Westerman, Michael; Hetherington, Nicola A; Tseng, Yu-Shan; Cecchini, Sylvain; Virag, Tamas; Ziegler, Mona-Larissa; Rogozin, Igor B; Koonin, Eugene V; Agbandje-McKenna, Mavis; Kotin, Robert M; Alexander, Ian E
2016-07-05
Germline endogenous viral elements (EVEs) genetically preserve viral nucleotide sequences useful to the study of viral evolution, gene mutation, and the phylogenetic relationships among host organisms. Here, we describe a lineage-specific, adeno-associated virus (AAV)-derived endogenous viral element (mAAV-EVE1) found within the germline of numerous closely related marsupial species. Molecular screening of a marsupial DNA panel indicated that mAAV-EVE1 occurs specifically within the marsupial suborder Macropodiformes (present-day kangaroos, wallabies, and related macropodoids), to the exclusion of other Diprotodontian lineages. Orthologous mAAV-EVE1 locus sequences from sixteen macropodoid species, representing a speciation history spanning an estimated 30 million years, facilitated compilation of an inferred ancestral sequence that recapitulates the genome of an ancient marsupial AAV that circulated among Australian metatherian fauna sometime during the late Eocene to early Oligocene. In silico gene reconstruction and molecular modelling indicate remarkable conservation of viral structure over a geologic timescale. Characterisation of AAV-EVE loci among disparate species affords insight into AAV evolution and, in the case of macropodoid species, may offer an additional genetic basis for assignment of phylogenetic relationships among the Macropodoidea. From an applied perspective, the identified AAV "fossils" provide novel capsid sequences for use in translational research and clinical applications.
Viral genetic variation accounts for a third of variability in HIV-1 set-point viral load in Europe
Wymant, Chris; Cornelissen, Marion; Gall, Astrid; Bakker, Margreet; Bezemer, Daniela; Hall, Matthew; Hillebregt, Mariska; Ong, Swee Hoe; Albert, Jan; Bannert, Norbert; Fellay, Jacques; Fransen, Katrien; Gourlay, Annabelle J.; Grabowski, M. Kate; Gunsenheimer-Bartmeyer, Barbara; Günthard, Huldrych F.; Kivelä, Pia; Kouyos, Roger; Laeyendecker, Oliver; Liitsola, Kirsi; Meyer, Laurence; Porter, Kholoud; Ristola, Matti; van Sighem, Ard; Vanham, Guido; Berkhout, Ben; Kellam, Paul; Reiss, Peter; Fraser, Christophe
2017-01-01
HIV-1 set-point viral load—the approximately stable value of viraemia in the first years of chronic infection—is a strong predictor of clinical outcome and is highly variable across infected individuals. To better understand HIV-1 pathogenesis and the evolution of the viral population, we must quantify the heritability of set-point viral load, which is the fraction of variation in this phenotype attributable to viral genetic variation. However, current estimates of heritability vary widely, from 6% to 59%. Here we used a dataset of 2,028 seroconverters infected between 1985 and 2013 from 5 European countries (Belgium, Switzerland, France, the Netherlands and the United Kingdom) and estimated the heritability of set-point viral load at 31% (CI 15%–43%). Specifically, heritability was measured using models of character evolution describing how viral load evolves on the phylogeny of whole-genome viral sequences. In contrast to previous studies, (i) we measured viral loads using standardized assays on a sample collected in a strict time window of 6 to 24 months after infection, from which the viral genome was also sequenced; (ii) we compared 2 models of character evolution, the classical “Brownian motion” model and another model (“Ornstein–Uhlenbeck”) that includes stabilising selection on viral load; (iii) we controlled for covariates, including age and sex, which may inflate estimates of heritability; and (iv) we developed a goodness of fit test based on the correlation of viral loads in cherries of the phylogenetic tree, showing that both models of character evolution fit the data well. An overall heritability of 31% (CI 15%–43%) is consistent with other studies based on regression of viral load in donor–recipient pairs. Thus, about a third of variation in HIV-1 virulence is attributable to viral genetic variation. PMID:28604782
Ferns, R Bridget; Tarr, Alexander W; Hue, Stephane; Urbanowicz, Richard A; McClure, C Patrick; Gilson, Richard; Ball, Jonathan K; Nastouli, Eleni; Garson, Jeremy A; Pillay, Deenan
2016-05-01
HIV-1 infected patients who acquire HCV infection have higher rates of chronicity and liver disease progression than patients with HCV mono-infection. Understanding early events in this pathogenic process is important. We applied single genome sequencing of the E1 to NS3 regions and viral pseudotype neutralization assays to explore the consequences of viral quasispecies evolution from pre-seroconversion to chronicity in four co-infected individuals (mean follow up 566 days). We observed that one to three founder viruses were transmitted. Relatively low viral sequence diversity, possibly related to an impaired immune response, due to HIV infection was observed in three patients. However, the fourth patient, after an early purifying selection displayed increasing E2 sequence evolution, possibly related to being on suppressive antiretroviral therapy. Viral pseudotypes generated from HCV variants showed relative resistance to neutralization by autologous plasma but not to plasma collected from later time points, confirming ongoing virus escape from antibody neutralization. Copyright © 2016 Elsevier Inc. All rights reserved.
Setiawan, Laurentia C; Gijsbers, Esther F; van Nuenen, Adrianus C; Kootstra, Neeltje A
2015-08-01
The HLA-B27 allele is over-represented among human immunodeficiency virus type 1-infected long-term non-progressors. In these patients, strong CTL responses targeting HLA-B27-restricted viral epitopes have been associated with long-term asymptomatic survival. Indeed, loss of control of viraemia in HLA-B27 patients has been associated with CTL escape at position 264 in the immunodominant KK10 epitope. This CTL escape mutation in the viral Gag protein has been associated with severe viral attenuation and may require the presence of compensatory mutations before emerging. Here, we studied sequence evolution within HLA-B27-restricted CTL epitopes in the viral Gag protein during the course of infection of seven HLA-B27-positive patients. Longitudinal gag sequences obtained at different time points around the time of AIDS diagnosis were obtained and analysed for the presence of mutations in epitopes restricted by HLA-B27, and for potential compensatory mutations. Sequence variations were observed in the HLA-B27-restricted CTL epitopes IK9 and DR11, and the immunodominant KK10 epitope. However, the presence of sequence variations in the HLA-B27-restricted CTL epitopes could not be associated with an increase in viraemia in the majority of the patients studied. Furthermore, we observed low genetic diversity in the gag region of the viral variants throughout the course of infection, which is indicative of low viral replication and corresponds to the low viral load observed in the HLA-B27-positive patients. These data indicated that control of viral replication can be maintained in HLA-B27-positive patients despite the emergence of viral mutations in HLA-B27-restricted epitopes.
NASA Astrophysics Data System (ADS)
Shekhar, Karthik; Ruberman, Claire F.; Ferguson, Andrew L.; Barton, John P.; Kardar, Mehran; Chakraborty, Arup K.
2013-12-01
Mutational escape from vaccine-induced immune responses has thwarted the development of a successful vaccine against AIDS, whose causative agent is HIV, a highly mutable virus. Knowing the virus' fitness as a function of its proteomic sequence can enable rational design of potent vaccines, as this information can focus vaccine-induced immune responses to target mutational vulnerabilities of the virus. Spin models have been proposed as a means to infer intrinsic fitness landscapes of HIV proteins from patient-derived viral protein sequences. These sequences are the product of nonequilibrium viral evolution driven by patient-specific immune responses and are subject to phylogenetic constraints. How can such sequence data allow inference of intrinsic fitness landscapes? We combined computer simulations and variational theory á la Feynman to show that, in most circumstances, spin models inferred from patient-derived viral sequences reflect the correct rank order of the fitness of mutant viral strains. Our findings are relevant for diverse viruses.
Fluid spatial dynamics of West Nile virus in the USA: Rapid spread in a permissive host environment
Di Giallonardo , Francesca; Geoghegan, Jemma L.; Docherty, Douglas E.; McLean, Robert G.; Zody, Michael C.; Qu, James; Yang, Xiao; Birren, Bruce W.; Malboeuf, Christine M.; Newman, R.; Ip, Hon S.; Holmes, Edward C.
2016-01-01
The introduction of West Nile virus (WNV) into North America in 1999 is a classical example of viral emergence in a new environment, with its subsequent dispersion across the continent having a major impact on local bird populations. Despite the importance of this epizootic, the pattern, dynamics and determinants of WNV spread in its natural hosts remain uncertain. In particular, it is unclear whether the virus encountered major barriers to transmission, or spread in an unconstrained manner, and if specific viral lineages were favored over others indicative of intrinsic differences in fitness. To address these key questions in WNV evolution and ecology we sequenced the complete genomes of approximately 300 avian isolates sampled across the USA between 2001-2012. Phylogenetic analysis revealed a relatively ‘star-like' tree structure, indicative of explosive viral spread in US, although with some replacement of viral genotypes through time. These data are striking in that viral sequences exhibit relatively limited clustering according to geographic region, particularly for those viruses sampled from birds, and no strong phylogenetic association with well sampled avian species. The genome sequence data analysed here also contain relatively little evidence for adaptive evolution, particularly on structural proteins, suggesting that most viral lineages are of similar fitness, and that WNV is well adapted to the ecology of mosquito vectors and diverse avian hosts in the USA. In sum, the molecular evolution of WNV in North America depicts a largely unfettered expansion within a permissive host and geographic population with little evidence of major adaptive barriers.
2017-01-01
ABSTRACT RNA viruses are one of the fastest-evolving biological entities. Within their hosts, they exist as genetically diverse populations (i.e., viral mutant swarms), which are sculpted by different evolutionary mechanisms, such as mutation, natural selection, and genetic drift, and also the interactions between genetic variants within the mutant swarms. To elucidate the mechanisms that modulate the population diversity of an important plant-pathogenic virus, we performed evolution experiments with Potato virus Y (PVY) in potato genotypes that differ in their defense response against the virus. Using deep sequencing of small RNAs, we followed the temporal dynamics of standing and newly generated variations in the evolving viral lineages. A time-sampled approach allowed us to (i) reconstruct theoretical haplotypes in the starting population by using clustering of single nucleotide polymorphisms' trajectories and (ii) use quantitative population genetics approaches to estimate the contribution of selection and genetic drift, and their interplay, to the evolution of the virus. We detected imprints of strong selective sweeps and narrow genetic bottlenecks, followed by the shift in frequency of selected haplotypes. Comparison of patterns of viral evolution in differently susceptible host genotypes indicated possible diversifying evolution of PVY in the less-susceptible host (efficient in the accumulation of salicylic acid). IMPORTANCE High diversity of within-host populations of RNA viruses is an important aspect of their biology, since they represent a reservoir of genetic variants, which can enable quick adaptation of viruses to a changing environment. This study focuses on an important plant virus, Potato virus Y, and describes, at high resolution, temporal changes in the structure of viral populations within different potato genotypes. A novel and easy-to-implement computational approach was established to cluster single nucleotide polymorphisms into viral haplotypes from very short sequencing reads. During the experiment, a shift in the frequency of selected viral haplotypes was observed after a narrow genetic bottleneck, indicating an important role of the genetic drift in the evolution of the virus. On the other hand, a possible case of diversifying selection of the virus was observed in less susceptible host genotypes. PMID:28592544
Di Giallonardo, Francesca; Geoghegan, Jemma L; Docherty, Douglas E; McLean, Robert G; Zody, Michael C; Qu, James; Yang, Xiao; Birren, Bruce W; Malboeuf, Christine M; Newman, Ruchi M; Ip, Hon S; Holmes, Edward C
2016-01-15
The introduction of West Nile virus (WNV) into North America in 1999 is a classic example of viral emergence in a new environment, with its subsequent dispersion across the continent having a major impact on local bird populations. Despite the importance of this epizootic, the pattern, dynamics, and determinants of WNV spread in its natural hosts remain uncertain. In particular, it is unclear whether the virus encountered major barriers to transmission, or spread in an unconstrained manner, and if specific viral lineages were favored over others indicative of intrinsic differences in fitness. To address these key questions in WNV evolution and ecology, we sequenced the complete genomes of approximately 300 avian isolates sampled across the United States between 2001 and 2012. Phylogenetic analysis revealed a relatively star-like tree structure, indicative of explosive viral spread in the United States, although with some replacement of viral genotypes through time. These data are striking in that viral sequences exhibit relatively limited clustering according to geographic region, particularly for those viruses sampled from birds, and no strong phylogenetic association with well-sampled avian species. The genome sequence data analyzed here also contain relatively little evidence for adaptive evolution, particularly of structural proteins, suggesting that most viral lineages are of similar fitness and that WNV is well adapted to the ecology of mosquito vectors and diverse avian hosts in the United States. In sum, the molecular evolution of WNV in North America depicts a largely unfettered expansion within a permissive host and geographic population with little evidence of major adaptive barriers. How viruses spread in new host and geographic environments is central to understanding the emergence and evolution of novel infectious diseases and for predicting their likely impact. The emergence of the vector-borne West Nile virus (WNV) in North America in 1999 represents a classic example of this process. Using approximately 300 new viral genomes sampled from wild birds, we show that WNV experienced an explosive spread with little geographical or host constraints within birds and relatively low levels of adaptive evolution. From its introduction into the state of New York, WNV spread across the United States, reaching California and Florida within 4 years, a migration that is clearly reflected in our genomic sequence data, and with a general absence of distinct geographical clusters of bird viruses. However, some geographically distinct viral lineages were found to circulate in mosquitoes, likely reflecting their limited long-distance movement compared to avian species. Copyright © 2015, American Society for Microbiology. All Rights Reserved.
Cao-Lormeau, Van-Mai; Lambrechts, Louis
2017-01-01
Abstract Like other pathogens with high mutation and replication rates, within-host dengue virus (DENV) populations evolve during infection of their main mosquito vector, Aedes aegypti. Within-host DENV evolution during transmission provides opportunities for adaptation and emergence of novel virus variants. Recent studies of DENV genetic diversity failed to detect convergent evolution of adaptive mutations in mosquito tissues such as midgut and salivary glands, suggesting that convergent positive selection is not a major driver of within-host DENV evolution in the vector. However, it is unknown whether this conclusion extends to the transmitted viral subpopulation because it is technically difficult to sequence DENV genomes in mosquito saliva. Here, we achieved DENV full-genome sequencing by pooling saliva samples collected non-sacrificially from 49 to 163 individual Ae. aegypti mosquitoes previously infected with one of two DENV-1 genotypes. We compared the transmitted viral subpopulations found in the pooled saliva samples collected in time series with the input viral population present in the infectious blood meal. In all pooled saliva samples examined, the full-genome consensus sequence of the input viral population was unchanged. Although the pooling strategy prevents analysis of individual saliva samples, our results demonstrate the lack of strong convergent positive selection during a single round of DENV transmission by Ae. aegypti. This finding reinforces the idea that genetic drift and purifying selection are the dominant evolutionary forces shaping within-host DENV genetic diversity during transmission by mosquitoes. PMID:29497564
Kawano, Yasuhiro; Neeley, Shane; Adachi, Kei; Nakai, Hiroyuki
2013-01-01
Overlapping open reading frames (ORFs) in viral genomes undergo co-evolution; however, how individual amino acids coded by overlapping ORFs are structurally, functionally, and co-evolutionarily constrained remains difficult to address by conventional homologous sequence alignment approaches. We report here a new experimental and computational evolution-based methodology to address this question and report its preliminary application to elucidating a mode of co-evolution of the frame-shifted overlapping ORFs in the adeno-associated virus (AAV) serotype 2 viral genome. These ORFs encode both capsid VP protein and non-structural assembly-activating protein (AAP). To show proof of principle of the new method, we focused on the evolutionarily conserved QVKEVTQ and KSKRSRR motifs, a pair of overlapping heptapeptides in VP and AAP, respectively. In the new method, we first identified a large number of capsid-forming VP3 mutants and functionally competent AAP mutants of these motifs from mutant libraries by experimental directed evolution under no co-evolutionary constraints. We used Illumina sequencing to obtain a large dataset and then statistically assessed the viability of VP and AAP heptapeptide mutants. The obtained heptapeptide information was then integrated into an evolutionary algorithm, with which VP and AAP were co-evolved from random or native nucleotide sequences in silico. As a result, we demonstrate that these two heptapeptide motifs could exhibit high degeneracy if coded by separate nucleotide sequences, and elucidate how overlap-evoked co-evolutionary constraints play a role in making the VP and AAP heptapeptide sequences into the present shape. Specifically, we demonstrate that two valine (V) residues and β-strand propensity in QVKEVTQ are structurally important, the strongly negative and hydrophilic nature of KSKRSRR is functionally important, and overlap-evoked co-evolution imposes strong constraints on serine (S) residues in KSKRSRR, despite high degeneracy of the motifs in the absence of co-evolutionary constraints.
Lee, Justin S; Bevins, Sarah N; Serieys, Laurel E K; Vickers, Winston; Logan, Ken A; Aldredge, Mat; Boydston, Erin E; Lyren, Lisa M; McBride, Roy; Roelke-Parker, Melody; Pecon-Slattery, Jill; Troyer, Jennifer L; Riley, Seth P; Boyce, Walter M; Crooks, Kevin R; VandeWoude, Sue
2014-07-01
Mountain lions (Puma concolor) throughout North and South America are infected with puma lentivirus clade B (PLVB). A second, highly divergent lentiviral clade, PLVA, infects mountain lions in southern California and Florida. Bobcats (Lynx rufus) in these two geographic regions are also infected with PLVA, and to date, this is the only strain of lentivirus identified in bobcats. We sequenced full-length PLV genomes in order to characterize the molecular evolution of PLV in bobcats and mountain lions. Low sequence homology (88% average pairwise identity) and frequent recombination (1 recombination breakpoint per 3 isolates analyzed) were observed in both clades. Viral proteins have markedly different patterns of evolution; sequence homology and negative selection were highest in Gag and Pol and lowest in Vif and Env. A total of 1.7% of sites across the PLV genome evolve under positive selection, indicating that host-imposed selection pressure is an important force shaping PLV evolution. PLVA strains are highly spatially structured, reflecting the population dynamics of their primary host, the bobcat. In contrast, the phylogeography of PLVB reflects the highly mobile mountain lion, with diverse PLVB isolates cocirculating in some areas and genetically related viruses being present in populations separated by thousands of kilometers. We conclude that PLVA and PLVB are two different viral species with distinct feline hosts and evolutionary histories. Importance: An understanding of viral evolution in natural host populations is a fundamental goal of virology, molecular biology, and disease ecology. Here we provide a detailed analysis of puma lentivirus (PLV) evolution in two natural carnivore hosts, the bobcat and mountain lion. Our results illustrate that PLV evolution is a dynamic process that results from high rates of viral mutation/recombination and host-imposed selection pressure. Copyright © 2014, American Society for Microbiology. All Rights Reserved.
Chen, Sunlu; Zheng, Huizhen; Kishima, Yuji
2017-06-01
The interplay of different virus species in a host cell after infection can affect the adaptation of each virus. Endogenous viral elements, such as endogenous pararetroviruses (PRVs), have arisen from vertical inheritance of viral sequences integrated into host germline genomes. As viral genomic fossils, these sequences can thus serve as valuable paleogenomic data to study the long-term evolutionary dynamics of virus-virus interactions, but they have rarely been applied for this purpose. All extant PRVs have been considered autonomous species in their parasitic life cycle in host cells. Here, we provide evidence for multiple non-autonomous PRV species with structural defects in viral activity that have frequently infected ancient grass hosts and adapted through interplay between viruses. Our paleogenomic analyses using endogenous PRVs in grass genomes revealed that these non-autonomous PRV species have participated in interplay with autonomous PRVs in a possible commensal partnership, or, alternatively, with one another in a possible mutualistic partnership. These partnerships, which have been established by the sharing of noncoding regulatory sequences (NRSs) in intergenic regions between two partner viruses, have been further maintained and altered by the sequence homogenization of NRSs between partners. Strikingly, we found that frequent region-specific recombination, rather than mutation selection, is the main causative mechanism of NRS homogenization. Our results, obtained from ancient DNA records of viruses, suggest that adaptation of PRVs has occurred by concerted evolution of NRSs between different virus species in the same host. Our findings further imply that evaluation of within-host NRS interactions within and between populations of viral pathogens may be important.
Hauck, Nastasja C.; Kirpach, Josiane; Kiefer, Christina; Farinelle, Sophie; Morris, Stephen A.; Muller, Claude P.; Lu, I-Na
2018-01-01
To overcome yearly efforts and costs for the production of seasonal influenza vaccines, new approaches for the induction of broadly protective and long-lasting immune responses have been developed in the past decade. To warrant safety and efficacy of the emerging crossreactive vaccine candidates, it is critical to understand the evolution of influenza viruses in response to these new immune pressures. Here we applied unique molecular identifiers in next generation sequencing to analyze the evolution of influenza quasispecies under in vivo antibody pressure targeting the hemagglutinin (HA) long alpha helix (LAH). Our vaccine targeting LAH of hemagglutinin elicited significant seroconversion and protection against homologous and heterologous influenza virus strains in mice. The vaccine not only significantly reduced lung viral titers, but also induced a well-known bottleneck effect by decreasing virus diversity. In contrast to the classical bottleneck effect, here we showed a significant increase in the frequency of viruses with amino acid sequences identical to that of vaccine targeting LAH domain. No escape mutant emerged after vaccination. These results not only support the potential of a universal influenza vaccine targeting the conserved LAH domains, but also clearly demonstrate that the well-established bottleneck effect on viral quasispecies evolution does not necessarily generate escape mutants. PMID:29587397
Liu, Lin; Nardo, David; Li, Eric; Wang, Gary P
2016-03-13
CD4 T-cell depletion from HIV infection leads to a global decline in anti-hepatitis C virus (HCV) envelope neutralizing antibody (nAb) response, which may play a role in accelerating liver fibrosis. An increase in anti-HCV nAb titers has been reported during antiretroviral therapy (ART) but its impact on HCV remains poorly understood. The objective of this study is to determine the effects of ART on long-term HCV evolution. We examined HCV quasispecies structure and long-term evolution in HIV/HCV coinfected patients with ART-induced CD4 T-cell recovery, and compared with patients with CD4 T-cell depletion from delayed ART. We applied a single-variant sequencing (SVS) method to construct authentic viral quasispecies and compared sequence evolution in HCV envelope, the primary target for humoral immune responses, and NS3, a target for cellular immunity, between the two cohorts. The SVS method corrected biases known to skew the proportions of viral variants, revealing authentic HCV quasispeices structures. We observed higher rates of HCV envelope sequence evolution in patients with ART-induced CD4 T-cell recovery, compared with patients with CD4 T-cell depletion from delayed ART (P = 0.03). Evolutionary rates for NS3 were considerably lower than the rates for envelope (P < 0.01), with no significant difference observed between the two groups. ART-induced CD4 T-cell recovery results in rapid sequence evolution in HCV envelope, but not in NS3. These results suggest that suppressive ART disproportionally enhances HCV-specific humoral responses more than cellular responses, resulting in rapid sequence evolution in HCV envelope but not NS3.
Probing the Structures of Viral RNA Regulatory Elements with SHAPE and Related Methodologies
Rausch, Jason W.; Sztuba-Solinska, Joanna; Le Grice, Stuart F. J.
2018-01-01
Viral RNAs were selected by evolution to possess maximum functionality in a minimal sequence. Depending on the classification of the virus and the type of RNA in question, viral RNAs must alternately be replicated, spliced, transcribed, transported from the nucleus into the cytoplasm, translated and/or packaged into nascent virions, and in most cases, provide the sequence and structural determinants to facilitate these processes. One consequence of this compact multifunctionality is that viral RNA structures can be exquisitely complex, often involving intermolecular interactions with RNA or protein, intramolecular interactions between sequence segments separated by several thousands of nucleotides, or specialized motifs such as pseudoknots or kissing loops. The fluidity of viral RNA structure can also present a challenge when attempting to characterize it, as genomic RNAs especially are likely to sample numerous conformations at various stages of the virus life cycle. Here we review advances in chemoenzymatic structure probing that have made it possible to address such challenges with respect to cis-acting elements, full-length viral genomes and long non-coding RNAs that play a major role in regulating viral gene expression. PMID:29375504
The Viral Evolution Core within the AIDS and Cancer Virus Program will extract viral RNA/DNA from cell-free or cell-associated samples. Complementary (cDNA) will be generated as needed, and cDNA or DNA will be diluted to a single copy prior to nested
Nucleic and Amino Acid Sequences Support Structure-Based Viral Classification.
Sinclair, Robert M; Ravantti, Janne J; Bamford, Dennis H
2017-04-15
Viral capsids ensure viral genome integrity by protecting the enclosed nucleic acids. Interactions between the genome and capsid and between individual capsid proteins (i.e., capsid architecture) are intimate and are expected to be characterized by strong evolutionary conservation. For this reason, a capsid structure-based viral classification has been proposed as a way to bring order to the viral universe. The seeming lack of sufficient sequence similarity to reproduce this classification has made it difficult to reject structural convergence as the basis for the classification. We reinvestigate whether the structure-based classification for viral coat proteins making icosahedral virus capsids is in fact supported by previously undetected sequence similarity. Since codon choices can influence nascent protein folding cotranslationally, we searched for both amino acid and nucleotide sequence similarity. To demonstrate the sensitivity of the approach, we identify a candidate gene for the pandoravirus capsid protein. We show that the structure-based classification is strongly supported by amino acid and also nucleotide sequence similarities, suggesting that the similarities are due to common descent. The correspondence between structure-based and sequence-based analyses of the same proteins shown here allow them to be used in future analyses of the relationship between linear sequence information and macromolecular function, as well as between linear sequence and protein folds. IMPORTANCE Viral capsids protect nucleic acid genomes, which in turn encode capsid proteins. This tight coupling of protein shell and nucleic acids, together with strong functional constraints on capsid protein folding and architecture, leads to the hypothesis that capsid protein-coding nucleotide sequences may retain signatures of ancient viral evolution. We have been able to show that this is indeed the case, using the major capsid proteins of viruses forming icosahedral capsids. Importantly, we detected similarity at the nucleotide level between capsid protein-coding regions from viruses infecting cells belonging to all three domains of life, reproducing a previously established structure-based classification of icosahedral viral capsids. Copyright © 2017 Sinclair et al.
Nucleic and Amino Acid Sequences Support Structure-Based Viral Classification
Sinclair, Robert M.; Ravantti, Janne J.
2017-01-01
ABSTRACT Viral capsids ensure viral genome integrity by protecting the enclosed nucleic acids. Interactions between the genome and capsid and between individual capsid proteins (i.e., capsid architecture) are intimate and are expected to be characterized by strong evolutionary conservation. For this reason, a capsid structure-based viral classification has been proposed as a way to bring order to the viral universe. The seeming lack of sufficient sequence similarity to reproduce this classification has made it difficult to reject structural convergence as the basis for the classification. We reinvestigate whether the structure-based classification for viral coat proteins making icosahedral virus capsids is in fact supported by previously undetected sequence similarity. Since codon choices can influence nascent protein folding cotranslationally, we searched for both amino acid and nucleotide sequence similarity. To demonstrate the sensitivity of the approach, we identify a candidate gene for the pandoravirus capsid protein. We show that the structure-based classification is strongly supported by amino acid and also nucleotide sequence similarities, suggesting that the similarities are due to common descent. The correspondence between structure-based and sequence-based analyses of the same proteins shown here allow them to be used in future analyses of the relationship between linear sequence information and macromolecular function, as well as between linear sequence and protein folds. IMPORTANCE Viral capsids protect nucleic acid genomes, which in turn encode capsid proteins. This tight coupling of protein shell and nucleic acids, together with strong functional constraints on capsid protein folding and architecture, leads to the hypothesis that capsid protein-coding nucleotide sequences may retain signatures of ancient viral evolution. We have been able to show that this is indeed the case, using the major capsid proteins of viruses forming icosahedral capsids. Importantly, we detected similarity at the nucleotide level between capsid protein-coding regions from viruses infecting cells belonging to all three domains of life, reproducing a previously established structure-based classification of icosahedral viral capsids. PMID:28122979
Li, Hui; Stoddard, Mark B; Wang, Shuyi; Giorgi, Elena E; Blair, Lily M; Learn, Gerald H; Hahn, Beatrice H; Alter, Harvey J; Busch, Michael P; Fierer, Daniel S; Ribeiro, Ruy M; Perelson, Alan S; Bhattacharya, Tanmoy; Shaw, George M
2016-01-01
Despite the recent development of highly effective anti-hepatitis C virus (HCV) drugs, the global burden of this pathogen remains immense. Control or eradication of HCV will likely require the broad application of antiviral drugs and development of an effective vaccine. A precise molecular identification of transmitted/founder (T/F) HCV genomes that lead to productive clinical infection could play a critical role in vaccine research, as it has for HIV-1. However, the replication schema of these two RNA viruses differ substantially, as do viral responses to innate and adaptive host defenses. These differences raise questions as to the certainty of T/F HCV genome inferences, particularly in cases where multiple closely related sequence lineages have been observed. To clarify these issues and distinguish between competing models of early HCV diversification, we examined seven cases of acute HCV infection in humans and chimpanzees, including three examples of virus transmission between linked donors and recipients. Using single-genome sequencing (SGS) of plasma vRNA, we found that inferred T/F sequences in recipients were identical to viral sequences in their respective donors. Early in infection, HCV genomes generally evolved according to a simple model of random evolution where the coalescent corresponded to the T/F sequence. Closely related sequence lineages could be explained by high multiplicity infection from a donor whose viral sequences had undergone a pretransmission bottleneck due to treatment, immune selection, or recent infection. These findings validate SGS, together with mathematical modeling and phylogenetic analysis, as a novel strategy to infer T/F HCV genome sequences. Despite the recent development of highly effective, interferon-sparing anti-hepatitis C virus (HCV) drugs, the global burden of this pathogen remains immense. Control or eradication of HCV will likely require the broad application of antiviral drugs and the development of an effective vaccine, which could be facilitated by a precise molecular identification of transmitted/founder (T/F) viral genomes and their progeny. We used single-genome sequencing to show that inferred HCV T/F sequences in recipients were identical to viral sequences in their respective donors and that viral genomes generally evolved early in infection according to a simple model of random sequence evolution. Altogether, the findings validate T/F genome inferences and illustrate how T/F sequence identification can illuminate studies of HCV transmission, immunopathogenesis, drug resistance development, and vaccine protection, including sieving effects on breakthrough virus strains. Copyright © 2015 Li et al.
BAsE-Seq: a method for obtaining long viral haplotypes from short sequence reads.
Hong, Lewis Z; Hong, Shuzhen; Wong, Han Teng; Aw, Pauline P K; Cheng, Yan; Wilm, Andreas; de Sessions, Paola F; Lim, Seng Gee; Nagarajan, Niranjan; Hibberd, Martin L; Quake, Stephen R; Burkholder, William F
2014-01-01
We present a method for obtaining long haplotypes, of over 3 kb in length, using a short-read sequencer, Barcode-directed Assembly for Extra-long Sequences (BAsE-Seq). BAsE-Seq relies on transposing a template-specific barcode onto random segments of the template molecule and assembling the barcoded short reads into complete haplotypes. We applied BAsE-Seq on mixed clones of hepatitis B virus and accurately identified haplotypes occurring at frequencies greater than or equal to 0.4%, with >99.9% specificity. Applying BAsE-Seq to a clinical sample, we obtained over 9,000 viral haplotypes, which provided an unprecedented view of hepatitis B virus population structure during chronic infection. BAsE-Seq is readily applicable for monitoring quasispecies evolution in viral diseases.
Li, Ci-Xiu; Shi, Mang; Tian, Jun-Hua; Lin, Xian-Dan; Kang, Yan-Jun; Chen, Liang-Jun; Qin, Xin-Cheng; Xu, Jianguo; Holmes, Edward C; Zhang, Yong-Zhen
2015-01-01
Although arthropods are important viral vectors, the biodiversity of arthropod viruses, as well as the role that arthropods have played in viral origins and evolution, is unclear. Through RNA sequencing of 70 arthropod species we discovered 112 novel viruses that appear to be ancestral to much of the documented genetic diversity of negative-sense RNA viruses, a number of which are also present as endogenous genomic copies. With this greatly enriched diversity we revealed that arthropods contain viruses that fall basal to major virus groups, including the vertebrate-specific arenaviruses, filoviruses, hantaviruses, influenza viruses, lyssaviruses, and paramyxoviruses. We similarly documented a remarkable diversity of genome structures in arthropod viruses, including a putative circular form, that sheds new light on the evolution of genome organization. Hence, arthropods are a major reservoir of viral genetic diversity and have likely been central to viral evolution. DOI: http://dx.doi.org/10.7554/eLife.05378.001 PMID:25633976
What can we learn about lyssavirus genomes using 454 sequencing?
Höper, Dirk; Finke, Stefan; Freuling, Conrad M; Hoffmann, Bernd; Beer, Martin
2012-01-01
The main task of the individual project number four"Whole genome sequencing, virus-host adaptation, and molecular epidemiological analyses of lyssaviruses "within the network" Lyssaviruses--a potential re-emerging public health threat" is to provide high quality complete genome sequences from lyssaviruses. These sequences are analysed in-depth with regard to the diversity of the viral populations as to both quasi-species and so-called defective interfering RNAs. Moreover, the sequence data will facilitate further epidemiological analyses, will provide insight into the evolution of lyssaviruses and will be the basis for the design of novel nucleic acid based diagnostics. The first results presented here indicate that not only high quality full-length lyssavirus genome sequences can be generated, but indeed efficient analysis of the viral population gets feasible.
Dahl, Viktor; Gisslen, Magnus; Hagberg, Lars; Peterson, Julia; Shao, Wei; Spudich, Serena; Price, Richard W.; Palmer, Sarah
2014-01-01
We sequenced the genome of human immunodeficiency virus type 1 (HIV-1) recovered from 70 cerebrospinal fluid (CSF) specimens and 29 plasma samples and corresponding samples obtained before treatment initiation from 17 subjects receiving suppressive therapy. More CSF sequences than plasma sequences were hypermutants. We determined CSF sequences and plasma sequences in specimens obtained from 2 subjects after treatment initiation. In one subject, we found genetically distinct CSF and plasma sequences, indicating that they came from HIV-1 from 2 different compartments, one potentially the central nervous system, during suppressive therapy. In addition, there was little evidence of viral evolution in the CSF during therapy, suggesting that continuous virus replication is not the major cause of viral persistence in the central nervous system. PMID:24338353
Dahl, Viktor; Gisslen, Magnus; Hagberg, Lars; Peterson, Julia; Shao, Wei; Spudich, Serena; Price, Richard W; Palmer, Sarah
2014-05-15
We sequenced the genome of human immunodeficiency virus type 1 (HIV-1) recovered from 70 cerebrospinal fluid (CSF) specimens and 29 plasma samples and corresponding samples obtained before treatment initiation from 17 subjects receiving suppressive therapy. More CSF sequences than plasma sequences were hypermutants. We determined CSF sequences and plasma sequences in specimens obtained from 2 subjects after treatment initiation. In one subject, we found genetically distinct CSF and plasma sequences, indicating that they came from HIV-1 from 2 different compartments, one potentially the central nervous system, during suppressive therapy. In addition, there was little evidence of viral evolution in the CSF during therapy, suggesting that continuous virus replication is not the major cause of viral persistence in the central nervous system.
Canard, Bruno
2018-01-01
Viral RNA-dependent RNA polymerases (RdRps) play a central role not only in viral replication, but also in the genetic evolution of viral RNAs. After binding to an RNA template and selecting 5′-triphosphate ribonucleosides, viral RdRps synthesize an RNA copy according to Watson-Crick base-pairing rules. The copy process sometimes deviates from both the base-pairing rules specified by the template and the natural ribose selectivity and, thus, the process is error-prone due to the intrinsic (in)fidelity of viral RdRps. These enzymes share a number of conserved amino-acid sequence strings, called motifs A–G, which can be defined from a structural and functional point-of-view. A co-relation is gradually emerging between mutations in these motifs and viral genome evolution or observed mutation rates. Here, we review our current knowledge on these motifs and their role on the structural and mechanistic basis of the fidelity of nucleotide selection and RNA synthesis by Flavivirus RdRps. PMID:29385764
Robson, Nicole D.; Telesnitsky, Alice
2000-01-01
Retrovirus plus-strand synthesis is primed by a cleavage remnant of the polypurine tract (PPT) region of viral RNA. In this study, we tested replication properties for Moloney murine leukemia viruses with targeted mutations in the PPT and in conserved sequences upstream, as well as for pools of mutants with randomized sequences in these regions. The importance of maintaining some purine residues within the PPT was indicated both by examining the evolution of random PPT pools and from the replication properties of targeted mutants. Although many different PPT sequences could support efficient replication and one mutant that contained two differences in the core PPT was found to replicate as well as the wild type, some sequences in the core PPT clearly conferred advantages over others. Contributions of sequences upstream of the core PPT were examined with deletion mutants. A conserved T-stretch within the upstream sequence was examined in detail and found to be unimportant to helper functions. Evolution of virus pools containing randomized T-stretch sequences demonstrated marked preference for the wild-type sequence in six of its eight positions. These findings demonstrate that maintenance of the T-rich element is more important to viral replication than is maintenance of the core PPT. PMID:11044073
A metagenomic survey of viral abundance and diversity in mosquitoes from Hubei province.
Shi, Chenyan; Liu, Yi; Hu, Xiaomin; Xiong, Jinfeng; Zhang, Bo; Yuan, Zhiming
2015-01-01
Mosquitoes as one of the most common but important vectors have the potential to transmit or acquire a lot of viruses through biting, however viral flora in mosquitoes and its impact on mosquito-borne disease transmission has not been well investigated and evaluated. In this study, the metagenomic techniquehas been successfully employed in analyzing the abundance and diversity of viral community in three mosquito samples from Hubei, China. Among 92,304 reads produced through a run with 454 GS FLX system, 39% have high similarities with viral sequences belonging to identified bacterial, fungal, animal, plant and insect viruses, and 0.02% were classed into unidentified viral sequences, demonstrating high abundance and diversity of viruses in mosquitoes. Furthermore, two novel viruses in subfamily Densovirinae and family Dicistroviridae were identified, and six torque tenosus virus1 in family Anelloviridae, three porcine parvoviruses in subfamily Parvovirinae and a Culex tritaeniorhynchus rhabdovirus in Family Rhabdoviridae were preliminarily characterized. The viral metagenomic analysis offered us a deep insight into the viral population of mosquito which played an important role in viral initiative or passive transmission and evolution during the process.
Cornelissen, Marion; Gall, Astrid; Vink, Monique; Zorgdrager, Fokla; Binter, Špela; Edwards, Stephanie; Jurriaans, Suzanne; Bakker, Margreet; Ong, Swee Hoe; Gras, Luuk; van Sighem, Ard; Bezemer, Daniela; de Wolf, Frank; Reiss, Peter; Kellam, Paul; Berkhout, Ben; Fraser, Christophe; van der Kuyl, Antoinette C
2017-07-15
The BEEHIVE (Bridging the Evolution and Epidemiology of HIV in Europe) project aims to analyse nearly-complete viral genomes from >3000 HIV-1 infected Europeans using high-throughput deep sequencing techniques to investigate the virus genetic contribution to virulence. Following the development of a computational pipeline, including a new de novo assembler for RNA virus genomes, to generate larger contiguous sequences (contigs) from the abundance of short sequence reads that characterise the data, another area that determines genome sequencing success is the quality and quantity of the input RNA. A pilot experiment with 125 patient plasma samples was performed to investigate the optimal method for isolation of HIV-1 viral RNA for long amplicon genome sequencing. Manual isolation with the QIAamp Viral RNA Mini Kit (Qiagen) was superior over robotically extracted RNA using either the QIAcube robotic system, the mSample Preparation Systems RNA kit with automated extraction by the m2000sp system (Abbott Molecular), or the MagNA Pure 96 System in combination with the MagNA Pure 96 Instrument (Roche Diagnostics). We scored amplification of a set of four HIV-1 amplicons of ∼1.9, 3.6, 3.0 and 3.5kb, and subsequent recovery of near-complete viral genomes. Subsequently, 616 BEEHIVE patient samples were analysed to determine factors that influence successful amplification of the genome in four overlapping amplicons using the QIAamp Viral RNA Kit for viral RNA isolation. Both low plasma viral load and high sample age (stored before 1999) negatively influenced the amplification of viral amplicons >3kb. A plasma viral load of >100,000 copies/ml resulted in successful amplification of all four amplicons for 86% of the samples, this value dropped to only 46% for samples with viral loads of <20,000 copies/ml. Copyright © 2016 The Authors. Published by Elsevier B.V. All rights reserved.
USDA-ARS?s Scientific Manuscript database
Molecular epidemiology and evolution of foot-and-mouth disease virus (FMDV) are widely studied using genomic sequences encoding VP1, the capsid protein containing the most relevant antigenic domains. Although sequencing of the full viral genome is not used as a routine diagnostic or surveillance too...
Quick, Josh; Grubaugh, Nathan D; Pullan, Steven T; Claro, Ingra M; Smith, Andrew D; Gangavarapu, Karthik; Oliveira, Glenn; Robles-Sikisaka, Refugio; Rogers, Thomas F; Beutler, Nathan A; Burton, Dennis R; Lewis-Ximenez, Lia Laura; de Jesus, Jaqueline Goes; Giovanetti, Marta; Hill, Sarah; Black, Allison; Bedford, Trevor; Carroll, Miles W; Nunes, Marcio; Alcantara, Luiz Carlos; Sabino, Ester C; Baylis, Sally A; Faria, Nuno; Loose, Matthew; Simpson, Jared T; Pybus, Oliver G; Andersen, Kristian G; Loman, Nicholas J
2018-01-01
Genome sequencing has become a powerful tool for studying emerging infectious diseases; however, genome sequencing directly from clinical samples without isolation remains challenging for viruses such as Zika, where metagenomic sequencing methods may generate insufficient numbers of viral reads. Here we present a protocol for generating coding-sequence complete genomes comprising an online primer design tool, a novel multiplex PCR enrichment protocol, optimised library preparation methods for the portable MinION sequencer (Oxford Nanopore Technologies) and the Illumina range of instruments, and a bioinformatics pipeline for generating consensus sequences. The MinION protocol does not require an internet connection for analysis, making it suitable for field applications with limited connectivity. Our method relies on multiplex PCR for targeted enrichment of viral genomes from samples containing as few as 50 genome copies per reaction. Viral consensus sequences can be achieved starting with clinical samples in 1-2 days following a simple laboratory workflow. This method has been successfully used by several groups studying Zika virus evolution and is facilitating an understanding of the spread of the virus in the Americas. PMID:28538739
Bacterial RecA Protein Promotes Adenoviral Recombination during In Vitro Infection
Lee, Jeong Yoon; Lee, Ji Sun; Materne, Emma C.; Rajala, Rahul; Ismail, Ashrafali M.; Seto, Donald; Dyer, David W.
2018-01-01
ABSTRACT Adenovirus infections in humans are common and sometimes lethal. Adenovirus-derived vectors are also commonly chosen for gene therapy in human clinical trials. We have shown in previous work that homologous recombination between adenoviral genomes of human adenovirus species D (HAdV-D), the largest and fastest growing HAdV species, is responsible for the rapid evolution of this species. Because adenovirus infection initiates in mucosal epithelia, particularly at the gastrointestinal, respiratory, genitourinary, and ocular surfaces, we sought to determine a possible role for mucosal microbiota in adenovirus genome diversity. By analysis of known recombination hot spots across 38 human adenovirus genomes in species D (HAdV-D), we identified nucleotide sequence motifs similar to bacterial Chi sequences, which facilitate homologous recombination in the presence of bacterial Rec enzymes. These motifs, referred to here as ChiAD, were identified immediately 5′ to the sequence encoding penton base hypervariable loop 2, which expresses the arginine-glycine-aspartate moiety critical to adenoviral cellular entry. Coinfection with two HAdV-Ds in the presence of an Escherichia coli lysate increased recombination; this was blocked in a RecA mutant strain, E. coli DH5α, or upon RecA depletion. Recombination increased in the presence of E. coli lysate despite a general reduction in viral replication. RecA colocalized with viral DNA in HAdV-D-infected cell nuclei and was shown to bind specifically to ChiAD sequences. These results indicate that adenoviruses may repurpose bacterial recombination machinery, a sharing of evolutionary mechanisms across a diverse microbiota, and unique example of viral commensalism. IMPORTANCE Adenoviruses are common human mucosal pathogens of the gastrointestinal, respiratory, and genitourinary tracts and ocular surface. Here, we report finding Chi-like sequences in adenovirus recombination hot spots. Adenovirus coinfection in the presence of bacterial RecA protein facilitated homologous recombination between viruses. Genetic recombination led to evolution of an important external feature on the adenoviral capsid, namely, the penton base protein hypervariable loop 2, which contains the arginine-glycine-aspartic acid motif critical to viral internalization. We speculate that free Rec proteins present in gastrointestinal secretions upon bacterial cell death facilitate the evolution of human adenoviruses through homologous recombination, an example of viral commensalism and the complexity of virus-host interactions, including regional microbiota. PMID:29925671
Quick, Joshua; Grubaugh, Nathan D; Pullan, Steven T; Claro, Ingra M; Smith, Andrew D; Gangavarapu, Karthik; Oliveira, Glenn; Robles-Sikisaka, Refugio; Rogers, Thomas F; Beutler, Nathan A; Burton, Dennis R; Lewis-Ximenez, Lia Laura; de Jesus, Jaqueline Goes; Giovanetti, Marta; Hill, Sarah C; Black, Allison; Bedford, Trevor; Carroll, Miles W; Nunes, Marcio; Alcantara, Luiz Carlos; Sabino, Ester C; Baylis, Sally A; Faria, Nuno R; Loose, Matthew; Simpson, Jared T; Pybus, Oliver G; Andersen, Kristian G; Loman, Nicholas J
2017-06-01
Genome sequencing has become a powerful tool for studying emerging infectious diseases; however, genome sequencing directly from clinical samples (i.e., without isolation and culture) remains challenging for viruses such as Zika, for which metagenomic sequencing methods may generate insufficient numbers of viral reads. Here we present a protocol for generating coding-sequence-complete genomes, comprising an online primer design tool, a novel multiplex PCR enrichment protocol, optimized library preparation methods for the portable MinION sequencer (Oxford Nanopore Technologies) and the Illumina range of instruments, and a bioinformatics pipeline for generating consensus sequences. The MinION protocol does not require an Internet connection for analysis, making it suitable for field applications with limited connectivity. Our method relies on multiplex PCR for targeted enrichment of viral genomes from samples containing as few as 50 genome copies per reaction. Viral consensus sequences can be achieved in 1-2 d by starting with clinical samples and following a simple laboratory workflow. This method has been successfully used by several groups studying Zika virus evolution and is facilitating an understanding of the spread of the virus in the Americas. The protocol can be used to sequence other viral genomes using the online Primal Scheme primer designer software. It is suitable for sequencing either RNA or DNA viruses in the field during outbreaks or as an inexpensive, convenient method for use in the lab.
2D-dynamic representation of DNA sequences as a graphical tool in bioinformatics
NASA Astrophysics Data System (ADS)
Bielińska-Wa̧Ż, D.; Wa̧Ż, P.
2016-10-01
2D-dynamic representation of DNA sequences is briefly reviewed. Some new examples of 2D-dynamic graphs which are the graphical tool of the method are shown. Using the examples of the complete genome sequences of the Zika virus it is shown that the present method can be applied for the study of the evolution of viral genomes.
Evolution of HIV-1 coreceptor usage and coreceptor switching during pregnancy.
Ransy, Doris G; Motorina, Alena; Merindol, Natacha; Akouamba, Bertine S; Samson, Johanne; Lie, Yolanda; Napolitano, Laura A; Lapointe, Normand; Boucher, Marc; Soudeyns, Hugo
2014-03-01
Coreceptor switch from CCR5 to CXCR4 is associated with HIV disease progression. To document the evolution of coreceptor tropism during pregnancy, a longitudinal study of envelope gene sequences was performed in a group of pregnant women infected with HIV-1 of clade B (n=10) or non-B (n=9). Polymerase chain reaction (PCR) amplification of the V1-V3 region was performed on plasma viral RNA, followed by cloning and sequencing. Using geno2pheno and PSSMX4R5, the presence of X4 variants was predicted in nine of 19 subjects (X4 subjects) independent of HIV-1 clade. Six of nine X4 subjects exhibited CD4(+) T cell counts <200 cells/mm(3), and the presence of X4-capable virus was confirmed using a recombinant phenotypic assay in four of seven cases where testing was successful. In five of nine X4 subjects, a statistically significant decline in the geno2pheno false-positive rate was observed during the course of pregnancy, invariably accompanied by progressive increases in the PSSMX4R5 score, the net charge of V3, and the relative representation of X4 sequences. Evolution toward X4 tropism was also echoed in the primary structure of V2, as an accumulation of substitutions associated with CXCR4 tropism was seen in X4 subjects. Results from these experiments provide the first evidence of the ongoing evolution of coreceptor utilization from CCR5 to CXCR4 during pregnancy in a significant fraction of HIV-infected women. These results inform changes in host-pathogen interactions that lead to a directional shaping of viral populations and viral tropism during pregnancy, and provide insights into the biology of HIV transmission from mother to child.
Weiss, Eric R.; Alter, Galit; Ogembo, Javier Gordon; Henderson, Jennifer L.; Tabak, Barbara; Bakiş, Yasin; Somasundaran, Mohan; Garber, Manuel; Selin, Liisa
2016-01-01
ABSTRACT The Epstein-Barr virus (EBV) gp350 glycoprotein interacts with the cellular receptor to mediate viral entry and is thought to be the major target for neutralizing antibodies. To better understand the role of EBV-specific antibodies in the control of viral replication and the evolution of sequence diversity, we measured EBV gp350-specific antibody responses and sequenced the gp350 gene in samples obtained from individuals experiencing primary EBV infection (acute infectious mononucleosis [AIM]) and again 6 months later (during convalescence [CONV]). EBV gp350-specific IgG was detected in the sera of 17 (71%) of 24 individuals at the time of AIM and all 24 (100%) individuals during CONV; binding antibody titers increased from AIM through CONV, reaching levels equivalent to those in age-matched, chronically infected individuals. Antibody-dependent cell-mediated phagocytosis (ADCP) was rarely detected during AIM (4 of 24 individuals; 17%) but was commonly detected during CONV (19 of 24 individuals; 79%). The majority (83%) of samples taken during AIM neutralized infection of primary B cells; all samples obtained at 6 months postdiagnosis neutralized EBV infection of cultured and primary target cells. Deep sequencing revealed interpatient gp350 sequence variation but conservation of the CR2-binding site. The levels of gp350-specific neutralizing activity directly correlated with higher peripheral blood EBV DNA levels during AIM and a greater evolution of diversity in gp350 nucleotide sequences from AIM to CONV. In summary, we conclude that the viral load and EBV gp350 diversity during early infection are associated with the development of neutralizing antibody responses following AIM. IMPORTANCE Antibodies against viral surface proteins can blunt the spread of viral infection by coating viral particles, mediating uptake by immune cells, or blocking interaction with host cell receptors, making them a desirable component of a sterilizing vaccine. The EBV surface protein gp350 is a major target for antibodies. We report the detection of EBV gp350-specific antibodies capable of neutralizing EBV infection in vitro. The majority of gp350-directed vaccines focus on glycoproteins from lab-adapted strains, which may poorly reflect primary viral envelope diversity. We report some of the first primary gp350 sequences, noting that the gp350 host receptor binding site is remarkably stable across patients and time. However, changes in overall gene diversity were detectable during infection. Patients with higher peripheral blood viral loads in primary infection and greater changes in viral diversity generated more efficient antibodies. Our findings provide insight into the generation of functional antibodies, necessary for vaccine development. PMID:27733645
Weiss, Eric R; Alter, Galit; Ogembo, Javier Gordon; Henderson, Jennifer L; Tabak, Barbara; Bakiş, Yasin; Somasundaran, Mohan; Garber, Manuel; Selin, Liisa; Luzuriaga, Katherine
2017-01-01
The Epstein-Barr virus (EBV) gp350 glycoprotein interacts with the cellular receptor to mediate viral entry and is thought to be the major target for neutralizing antibodies. To better understand the role of EBV-specific antibodies in the control of viral replication and the evolution of sequence diversity, we measured EBV gp350-specific antibody responses and sequenced the gp350 gene in samples obtained from individuals experiencing primary EBV infection (acute infectious mononucleosis [AIM]) and again 6 months later (during convalescence [CONV]). EBV gp350-specific IgG was detected in the sera of 17 (71%) of 24 individuals at the time of AIM and all 24 (100%) individuals during CONV; binding antibody titers increased from AIM through CONV, reaching levels equivalent to those in age-matched, chronically infected individuals. Antibody-dependent cell-mediated phagocytosis (ADCP) was rarely detected during AIM (4 of 24 individuals; 17%) but was commonly detected during CONV (19 of 24 individuals; 79%). The majority (83%) of samples taken during AIM neutralized infection of primary B cells; all samples obtained at 6 months postdiagnosis neutralized EBV infection of cultured and primary target cells. Deep sequencing revealed interpatient gp350 sequence variation but conservation of the CR2-binding site. The levels of gp350-specific neutralizing activity directly correlated with higher peripheral blood EBV DNA levels during AIM and a greater evolution of diversity in gp350 nucleotide sequences from AIM to CONV. In summary, we conclude that the viral load and EBV gp350 diversity during early infection are associated with the development of neutralizing antibody responses following AIM. Antibodies against viral surface proteins can blunt the spread of viral infection by coating viral particles, mediating uptake by immune cells, or blocking interaction with host cell receptors, making them a desirable component of a sterilizing vaccine. The EBV surface protein gp350 is a major target for antibodies. We report the detection of EBV gp350-specific antibodies capable of neutralizing EBV infection in vitro The majority of gp350-directed vaccines focus on glycoproteins from lab-adapted strains, which may poorly reflect primary viral envelope diversity. We report some of the first primary gp350 sequences, noting that the gp350 host receptor binding site is remarkably stable across patients and time. However, changes in overall gene diversity were detectable during infection. Patients with higher peripheral blood viral loads in primary infection and greater changes in viral diversity generated more efficient antibodies. Our findings provide insight into the generation of functional antibodies, necessary for vaccine development. Copyright © 2016 American Society for Microbiology.
The kinetics and location of intra-host HIV evolution to evade cellular immunity are predictable
NASA Astrophysics Data System (ADS)
Barton, John; Goonetilleke, Nilu; Butler, Thomas; Walker, Bruce; McMichael, Andrew; Chakraborty, Arup
Human immunodeficiency virus (HIV) evolves within infected persons to escape targeting and clearance by the host immune system, thereby preventing effective immune control of infection. Knowledge of the timing and pathways of escape that result in loss of control of the virus could aid in the design of effective strategies to overcome the challenge of viral diversification and immune escape. We combined methods from statistical physics and evolutionary dynamics to predict the course of in vivo viral sequence evolution in response to T cell-mediated immune pressure in a cohort of 17 persons with acute HIV infection. Our predictions agree well with both the location of documented escape mutations and the clinically observed time to escape. We also find that that the mutational pathways to escape depend on the viral sequence background due to epistatic interactions. The ability to predict escape pathways, and the duration over which control is maintained by specific immune responses prior to escape, could be exploited for the rational design of immunotherapeutic strategies that may enable long-term control of HIV infection.
Zanotto, Paolo Marinho de Andrade; Krakauer, David C.
2008-01-01
We consider the concerted evolution of viral genomes in four families of DNA viruses. Given the high rate of horizontal gene transfer among viruses and their hosts, it is an open question as to how representative particular genes are of the evolutionary history of the complete genome. To address the concerted evolution of viral genes, we compared genomic evolution across four distinct, extant viral families. For all four viral families we constructed DNA-dependent DNA polymerase-based (DdDp) phylogenies and in addition, whole genome sequence, as quantitative descriptions of inter-genome relationships. We found that the history of the polymerase gene was highly predictive of the history of the genome as a whole, which we explain in terms of repeated, co-divergence events of the core DdDp gene accompanied by a number of satellite, accessory genetic loci. We also found that the rate of gene gain in baculovirus and poxviruses proceeds significantly more quickly than the rate of gene loss and that there is convergent acquisition of satellite functions promoting contextual adaptation when distinct viral families infect related hosts. The congruence of the genome and polymerase trees suggests that a large set of viral genes, including polymerase, derive from a phylogenetically conserved core of genes of host origin, secondarily reinforced by gene acquisition from common hosts or co-infecting viruses within the host. A single viral genome can be thought of as a mutualistic network, with the core genes acting as an effective host and the satellite genes as effective symbionts. Larger virus genomes show a greater departure from linkage equilibrium between core and satellites functions. PMID:18941535
A Metagenomic Survey of Viral Abundance and Diversity in Mosquitoes from Hubei Province
Shi, Chenyan; Liu, Yi; Hu, Xiaomin; Xiong, Jinfeng; Zhang, Bo; Yuan, Zhiming
2015-01-01
Mosquitoes as one of the most common but important vectors have the potential to transmit or acquire a lot of viruses through biting, however viral flora in mosquitoes and its impact on mosquito-borne disease transmission has not been well investigated and evaluated. In this study, the metagenomic techniquehas been successfully employed in analyzing the abundance and diversity of viral community in three mosquito samples from Hubei, China. Among 92,304 reads produced through a run with 454 GS FLX system, 39% have high similarities with viral sequences belonging to identified bacterial, fungal, animal, plant and insect viruses, and 0.02% were classed into unidentified viral sequences, demonstrating high abundance and diversity of viruses in mosquitoes. Furthermore, two novel viruses in subfamily Densovirinae and family Dicistroviridae were identified, and six torque tenosus virus1 in family Anelloviridae, three porcine parvoviruses in subfamily Parvovirinae and a Culex tritaeniorhynchus rhabdovirus in Family Rhabdoviridae were preliminarily characterized. The viral metagenomic analysis offered us a deep insight into the viral population of mosquito which played an important role in viral initiative or passive transmission and evolution during the process. PMID:26030271
Salvatori, F; Masiero, S; Giaquinto, C; Wade, C M; Brown, A J; Chieco-Bianchi, L; De Rossi, A
1997-01-01
We addressed the relationship between the origin and evolution of human immunodeficiency virus type 1 (HIV-1) variants and disease outcome in perinatally infected infants by studying the V3 regions of viral variants in samples obtained from five transmitting mothers at delivery and obtained sequentially over the first year of life from their infected infants, two of whom (rapid progressors) rapidly progressed to having AIDS. Phylogenetic analyses disclosed that the V3 sequences from each mother-infant pair clustered together and were clearly distinct from those of the other pairs. Within each pair, the child's sequences formed a monophyletic group, indicating that a single variant initiated the infection in both rapid and slow progressors. Plasma HIV-1 RNA levels increased in all five infants during their first months of life and then declined within the first semester of life only in the three slow progressors. V3 variability increased over time in all infants, but no differences in the pattern of V3 evolution in terms of potential viral phenotype were observed. The numbers of synonymous and nonsynonymous substitutions varied during the first semester of life regardless of viral load, CD4+-cell count, and disease progression. Conversely, during the second semester of life the rate of nonsynonymous substitutions was higher than that of synonymous substitutions in the slow progressors but not in the rapid progressors, thus suggesting a stronger host selective pressure in the former. In view of the proposal that V3 genetic evolution is driven mainly by host immune constraints, these findings suggest that while the immune response to V3 might contribute to regulating viral levels after the first semester of life, it is unlikely to play a determinant role in the initial viral decline soon after birth. PMID:9151863
USDA-ARS?s Scientific Manuscript database
Several FMDV carrier cattle were identified in Vietnam by recovery of infectious virus from oropharyngeal fluid. This report contains the first near-complete genome sequences of seven viruses isolated from a single carrier animal over the course of one year. Understanding within-host viral evolution...
2010-01-01
The human immunodeficiency virus type 1 (HIV-1) coreceptor use and viral evolution were analyzed in blood samples from an HIV-1 infected patient undergoing allogeneic stem cell transplantation (SCT). Coreceptor use was predicted in silico from sequence data obtained from the third variable loop region of the viral envelope gene with two software tools. Viral diversity and evolution was evaluated on the same samples by Bayesian inference and maximum likelihood methods. In addition, phenotypic analysis was done by comparison of viral growth in peripheral blood mononuclear cells and in a CCR5 (R5)-deficient T-cell line which was controlled by a reporter assay confirming viral tropism. In silico coreceptor predictions did not match experimental determinations that showed a consistent R5 tropism. Anti-HIV directed antibodies could be detected before and after the SCT. These preexisting antibodies did not prevent viral rebound after the interruption of antiretroviral therapy during the SCT. Eventually, transplantation and readministration of anti-retroviral drugs lead to sustained increase in CD4 counts and decreased viral load to undetectable levels. Unexpectedly, viral diversity decreased after successful SCT. Our data evidence that only R5-tropic virus was found in the patient before and after transplantation. Therefore, blocking CCR5 receptor during stem cell transplantation might have had beneficial effects and this might apply to more patients undergoing allogeneic stem cell transplantation. Furthermore, we revealed a scenario of HIV-1 dynamic different from the commonly described ones. Analysis of viral evolution shows the decrease of viral diversity even during episodes with bursts in viral load. PMID:20210988
Nakagawa, So; Takahashi, Mahoko Ueda
2016-01-01
In mammals, approximately 10% of genome sequences correspond to endogenous viral elements (EVEs), which are derived from ancient viral infections of germ cells. Although most EVEs have been inactivated, some open reading frames (ORFs) of EVEs obtained functions in the hosts. However, EVE ORFs usually remain unannotated in the genomes, and no databases are available for EVE ORFs. To investigate the function and evolution of EVEs in mammalian genomes, we developed EVE ORF databases for 20 genomes of 19 mammalian species. A total of 736,771 non-overlapping EVE ORFs were identified and archived in a database named gEVE (http://geve.med.u-tokai.ac.jp). The gEVE database provides nucleotide and amino acid sequences, genomic loci and functional annotations of EVE ORFs for all 20 genomes. In analyzing RNA-seq data with the gEVE database, we successfully identified the expressed EVE genes, suggesting that the gEVE database facilitates studies of the genomic analyses of various mammalian species.Database URL: http://geve.med.u-tokai.ac.jp. © The Author(s) 2016. Published by Oxford University Press.
Nakagawa, So; Takahashi, Mahoko Ueda
2016-01-01
In mammals, approximately 10% of genome sequences correspond to endogenous viral elements (EVEs), which are derived from ancient viral infections of germ cells. Although most EVEs have been inactivated, some open reading frames (ORFs) of EVEs obtained functions in the hosts. However, EVE ORFs usually remain unannotated in the genomes, and no databases are available for EVE ORFs. To investigate the function and evolution of EVEs in mammalian genomes, we developed EVE ORF databases for 20 genomes of 19 mammalian species. A total of 736,771 non-overlapping EVE ORFs were identified and archived in a database named gEVE (http://geve.med.u-tokai.ac.jp). The gEVE database provides nucleotide and amino acid sequences, genomic loci and functional annotations of EVE ORFs for all 20 genomes. In analyzing RNA-seq data with the gEVE database, we successfully identified the expressed EVE genes, suggesting that the gEVE database facilitates studies of the genomic analyses of various mammalian species. Database URL: http://geve.med.u-tokai.ac.jp PMID:27242033
Voorhees, Ian E H; Dalziel, Benjamin D; Glaser, Amy; Dubovi, Edward J; Murcia, Pablo R; Newbury, Sandra; Toohey-Kurth, Kathy; Su, Shuo; Kriti, Divya; Van Bakel, Harm; Goodman, Laura B; Leutenegger, Christian; Holmes, Edward C; Parrish, Colin R
2018-06-06
Avian-origin H3N2 canine influenza virus (CIV) transferred to dogs in Asia around 2005, becoming enzootic throughout China and Korea before reaching the USA in early 2015. To understand the post-transfer evolution and epidemiology of this virus, particularly the cause of recent and ongoing increases in incidence in the USA, we performed an integrated analysis of whole-genome sequence data from 64 newly sequenced viruses and comprehensive surveillance data. This reveals that the circulation of H3N2 CIV within the USA is typified by recurrent epidemic burst-fadeout dynamics driven by multiple introductions of virus from Asia. Although all major viral lineages displayed similar rates of genomic sequence evolution, H3N2 CIV consistently exhibited proportionally more non-synonymous substitutions per site compared to avian reservoir viruses, indicative of a large-scale change in selection pressures. Despite these genotypic differences, we found no evidence of adaptive evolution or increased viral transmission, with epidemiological models indicating a basic reproductive number, R 0 , of between 1 and 1.5 across nearly all USA outbreaks, consistent with maintained, but heterogeneous circulation. We propose that CIV's mode of viral circulation may have resulted in evolutionary cul-de-sacs, in which there is little opportunity for the selection of the more transmissible H3N2 CIV phenotypes necessary to enable circulation through a general dog population characterized by widespread contact heterogeneity. CIV must therefore rely on metapopulations of high host density (notably animal shelters) within the greater dog population and reintroduction from other populations or face complete epidemic extinction. IMPORTANCE The relatively recent appearance of influenza A virus (IAV) epidemics in dogs expands our understanding of IAV host-range and ecology, providing useful and relevant models for understanding critical factors involved in viral emergence. Here, we integrate viral whole-genome sequence analysis and comprehensive surveillance data to examine the evolution of the emerging avian-origin H3N2 canine influenza virus (CIV), particularly the factors driving ongoing circulation and recent increase in incidence of the virus within the USA. Our results provide a detailed understanding of how H3N2 CIV achieves sustained circulation within the USA, despite widespread host contact heterogeneity and recurrent epidemic fade-out. Moreover, our findings suggest that the types and intensity of selection pressures an emerging virus experiences are highly dependent on host population structure and ecology, and may inhibit an emerging virus from acquiring sustained epidemic or pandemic circulation. Copyright © 2018 American Society for Microbiology.
Lee, Justin S.; Bevins, Sarah N.; Serieys, Laurel E.K.; Vickers, Winston; Logan, Ken A.; Aldredge, Mat; Boydston, Erin E.; Lyren, Lisa M.; McBride, Roy; Roelke-Parker, Melody; Pecon-Slattery, Jill; Troyer, Jennifer L.; Riley, Seth P.; Boyce, Walter M.; Crooks, Kevin R.; VandeWoude, Sue
2014-01-01
Mountain lions (Puma concolor) throughout North and South America are infected with puma lentivirus clade B (PLVB). A second, highly divergent lentiviral clade, PLVA, infects mountain lions in southern California and Florida. Bobcats (Lynx rufus) in these two geographic regions are also infected with PLVA, and to date, this is the only strain of lentivirus identified in bobcats. We sequenced full-length PLV genomes in order to characterize the molecular evolution of PLV in bobcats and mountain lions. Low sequence homology (88% average pairwise identity) and frequent recombination (1 recombination breakpoint per 3 isolates analyzed) were observed in both clades. Viral proteins have markedly different patterns of evolution; sequence homology and negative selection were highest in Gag and Pol and lowest in Vif and Env. A total of 1.7% of sites across the PLV genome evolve under positive selection, indicating that host-imposed selection pressure is an important force shaping PLV evolution. PLVA strains are highly spatially structured, reflecting the population dynamics of their primary host, the bobcat. In contrast, the phylogeography of PLVB reflects the highly mobile mountain lion, with diverse PLVB isolates cocirculating in some areas and genetically related viruses being present in populations separated by thousands of kilometers. We conclude that PLVA and PLVB are two different viral species with distinct feline hosts and evolutionary histories.
Yoshida, Naoto; Shimura, Hanako; Masuta, Chikara
2018-06-01
Allexiviruses are economically important garlic viruses that are involved in garlic mosaic diseases. In this study, we characterized the allexivirus cysteine-rich protein (CRP) gene located just downstream of the coat protein (CP) gene in the viral genome. We determined the nucleotide sequences of the CP and CRP genes from numerous allexivirus isolates and performed a phylogenetic analysis. According to the resulting phylogenetic tree, we found that allexiviruses were clearly divided into two major groups (group I and group II) based on the sequences of the CP and CRP genes. In addition, the allexiviruses in group II had distinct sequences just before the CRP gene, while group I isolates did not. The inserted sequence between the CP and CRP genes was partially complementary to garlic 18S rRNA. Using a potato virus X vector, we showed that the CRPs affected viral accumulation and symptom induction in Nicotiana benthamiana, suggesting that the allexivirus CRP is a pathogenicity determinant. We assume that the inserted sequences before the CRP gene may have been generated during viral evolution to alter the termination-reinitiation mechanism for coupled translation of CP and CRP.
Genetic variability and evolutionary dynamics of viruses of the family Closteroviridae
Rubio, Luis; Guerri, José; Moreno, Pedro
2013-01-01
RNA viruses have a great potential for genetic variation, rapid evolution and adaptation. Characterization of the genetic variation of viral populations provides relevant information on the processes involved in virus evolution and epidemiology and it is crucial for designing reliable diagnostic tools and developing efficient and durable disease control strategies. Here we performed an updated analysis of sequences available in Genbank and reviewed present knowledge on the genetic variability and evolutionary processes of viruses of the family Closteroviridae. Several factors have shaped the genetic structure and diversity of closteroviruses. (I) A strong negative selection seems to be responsible for the high genetic stability in space and time for some viruses. (2) Long distance migration, probably by human transport of infected propagative plant material, have caused that genetically similar virus isolates are found in distant geographical regions. (3) Recombination between divergent sequence variants have generated new genotypes and plays an important role for the evolution of some viruses of the family Closteroviridae. (4) Interaction between virus strains or between different viruses in mixed infections may alter accumulation of certain strains. (5) Host change or virus transmission by insect vectors induced changes in the viral population structure due to positive selection of sequence variants with higher fitness for host-virus or vector-virus interaction (adaptation) or by genetic drift due to random selection of sequence variants during the population bottleneck associated to the transmission process. PMID:23805130
Ripamonti, Chiara; Leitner, Thomas; Laurén, Anna; Karlsson, Ingrid; Pastore, Angela; Cavarelli, Mariangela; Antonsson, Liselotte; Plebani, Anna; Fenyö, Eva Maria; Scarlatti, Gabriella
2007-12-01
To investigate the immunological and virological factors that may lead to different patterns of disease progression characteristic of HIV-1-infected children, two HIV-1-infected siblings, a slow and a fast progressor, were followed prospectively before the onset of highly active antiretroviral therapy. Viral coreceptor usage, including the use of CCR5/CXCR4 chimeric receptors, macrophage tropism, and sensitivity to the CC-chemokine RANTES, has been studied. An autologous and heterologous neutralizing antibody response has been documented using peripheral blood mononuclear cells- and GHOST(3) cell line-based assays. Viral evolution was investigated by env C2-V3 region sequence analysis. Although both siblings were infected with HIV-1 of the R5 phenotype, their viruses showed important biological differences. In the fast progressor there was a higher RANTES sensitivity of the early virus, an increased trend to change the mode of CCR5 receptor use, and a larger genetic evolution. Both children developed an autologous neutralizing antibody response starting from the second year with evidence of the continuous emergence of resistant variants. A marked viral genetic and phenotypic evolution was documented in the fast progressor sibling, which is accompanied by a high viral RANTES sensitivity and persistent neutralizing antibodies.
Sequence variation of the feline immunodeficiency virus genome and its clinical relevance.
Stickney, A L; Dunowska, M; Cave, N J
2013-06-08
The ongoing evolution of feline immunodeficiency virus (FIV) has resulted in the existence of a diverse continuum of viruses. FIV isolates differ with regards to their mutation and replication rates, plasma viral loads, cell tropism and the ability to induce apoptosis. Clinical disease in FIV-infected cats is also inconsistent. Genomic sequence variation of FIV is likely to be responsible for some of the variation in viral behaviour. The specific genetic sequences that influence these key viral properties remain to be determined. With knowledge of the specific key determinants of pathogenicity, there is the potential for veterinarians in the future to apply this information for prognostic purposes. Genomic sequence variation of FIV also presents an obstacle to effective vaccine development. Most challenge studies demonstrate acceptable efficacy of a dual-subtype FIV vaccine (Fel-O-Vax FIV) against FIV infection under experimental settings; however, vaccine efficacy in the field still remains to be proven. It is important that we discover the key determinants of immunity induced by this vaccine; such data would compliment vaccine field efficacy studies and provide the basis to make informed recommendations on its use.
The molecular biology and evolution of feline immunodeficiency viruses of cougars
Poss, Mary; Ross, Howard; Rodrigo, Allen; Terwee, Julie; VandeWoude, Sue; Biek, Roman
2008-01-01
Feline immunodeficiency virus (FIV) is a lentivirus that has been identified in many members of the family Felidae but domestic cats are the only FIV host in which infection results in disease. We studied FIVpco infection of cougars (Puma concolor) as a model for asymptomatic lentivirus infections to understand the mechanisms of host-virus coexistence. Several natural cougar populations were evaluated to determine if there are any consequences of FIVpco infection on cougar fecundity, survival, or susceptibility to other infections. We have sequenced full length viral genomes and conducted a detailed analysis of viral molecular evolution on these sequences and on genome fragments of serially sampled animals to determine the evolutionary forces experienced by this virus in cougars. In addition, we have evaluated the molecular genetics of FIVpco in a new host, domestic cats, to determine the evolutionary consequences to a host-adapted virus associated with cross-species infection. Our results indicate that there are no significant differences in survival, fecundity or susceptibility to other infections between FIVpco-infected and uninfected cougars. The molecular evolution of FIVpco is characterized by a slower evolutionary rate and an absence of positive selection, but also by proviral and plasma viral loads comparable to those of epidemic lentiviruses such as HIV-1 or FIVfca. Evolutionary and recombination rates and selection profiles change significantly when FIVpco replicates in a new host. PMID:18295904
Evolutionary Dynamics of Influenza A Viruses in US Exhibition Swine
Nelson, Martha I.; Wentworth, David E.; Das, Suman R.; Sreevatsan, Srinand; Killian, Mary L.; Nolting, Jacqueline M.; Slemons, Richard D.; Bowman, Andrew S.
2016-01-01
The role of exhibition swine in influenza A virus transmission was recently demonstrated by >300 infections with influenza A(H3N2) variant viruses among individuals who attended agricultural fairs. Through active influenza A virus surveillance in US exhibition swine and whole-genome sequencing of 380 isolates, we demonstrate that exhibition swine are actively involved in the evolution of influenza A viruses, including zoonotic strains. First, frequent introduction of influenza A viruses from commercial swine populations provides new genetic diversity in exhibition pigs each year locally. Second, genomic reassortment between viruses cocirculating in exhibition swine increases viral diversity. Third, viral migration between exhibition swine in neighboring states demonstrates that movements of exhibition pigs contributes to the spread of genetic diversity. The unexpected frequency of viral exchange between commercial and exhibition swine raises questions about the understudied interface between these populations. Overall, the complexity of viral evolution in exhibition swine indicates that novel viruses are likely to continually reemerge, presenting threats to humans. PMID:26243317
Mathematical modeling of escape of HIV from cytotoxic T lymphocyte responses
NASA Astrophysics Data System (ADS)
Ganusov, Vitaly V.; Neher, Richard A.; Perelson, Alan S.
2013-01-01
Human immunodeficiency virus (HIV-1 or simply HIV) induces a persistent infection, which in the absence of treatment leads to AIDS and death in almost all infected individuals. HIV infection elicits a vigorous immune response starting about 2-3 weeks postinfection that can lower the amount of virus in the body, but which cannot eradicate the virus. How HIV establishes a chronic infection in the face of a strong immune response remains poorly understood. It has been shown that HIV is able to rapidly change its proteins via mutation to evade recognition by virus-specific cytotoxic T lymphocytes (CTLs). Typically, an HIV-infected patient will generate 4-12 CTL responses specific for parts of viral proteins called epitopes. Such CTL responses lead to strong selective pressure to change the viral sequences encoding these epitopes so as to avoid CTL recognition. Indeed, the viral population ‘escapes’ from about half of the CTL responses by mutation in the first year. Here we review experimental data on HIV evolution in response to CTL pressure, mathematical models developed to explain this evolution, and highlight problems associated with the data and previous modeling efforts. We show that estimates of the strength of the epitope-specific CTL response depend on the method used to fit models to experimental data and on the assumptions made regarding how mutants are generated during infection. We illustrate that allowing CTL responses to decay over time may improve the model fit to experimental data and provides higher estimates of the killing efficacy of HIV-specific CTLs. We also propose a novel method for simultaneously estimating the killing efficacy of multiple CTL populations specific for different epitopes of HIV using stochastic simulations. Lastly, we show that current estimates of the efficacy at which HIV-specific CTLs clear virus-infected cells can be improved by more frequent sampling of viral sequences and by combining data on sequence evolution with experimentally measured CTL dynamics.
Grzela, Renata; Nusbaum, Julien; Fieulaine, Sonia; Lavecchia, Francesco; Bienvenut, Willy V; Dian, Cyril; Meinnel, Thierry; Giglione, Carmela
2017-09-08
Prokaryotic proteins must be deformylated before the removal of their first methionine. Peptide deformylase (PDF) is indispensable and guarantees this mechanism. Recent metagenomics studies revealed new idiosyncratic PDF forms as the most abundant family of viral sequences. Little is known regarding these viral PDFs, including the capacity of the corresponding encoded proteins to ensure deformylase activity. We provide here the first evidence that viral PDFs, including the shortest PDF identified to date, Vp16 PDF, display deformylase activity in vivo, despite the absence of the key ribosome-interacting C-terminal region. Moreover, characterization of phage Vp16 PDF underscores unexpected structural and molecular features with the C-terminal Isoleucine residue significantly contributing to deformylase activity both in vitro and in vivo. This residue fully compensates for the absence of the usual long C-domain. Taken together, these data elucidate an unexpected mechanism of enzyme natural evolution and adaptation within viral sequences.
Whitmer, Shannon L M; Albariño, César; Shepard, Samuel S; Dudas, Gytis; Sheth, Mili; Brown, Shelley C; Cannon, Deborah; Erickson, Bobbie R; Gibbons, Aridth; Schuh, Amy; Sealy, Tara; Ervin, Elizabeth; Frace, Mike; Uyeki, Timothy M; Nichol, Stuart T; Ströher, Ute
2016-10-15
Several patients with Ebola virus disease (EVD) managed in the United States have received ZMapp monoclonal antibodies, TKM-Ebola small interfering RNA, brincidofovir, and/or convalescent plasma as investigational therapeutics. To investigate whether treatment selected for Ebola virus (EBOV) mutations conferring resistance, viral sequencing was performed on RNA extracted from clinical blood specimens from patients with EVD following treatment, and putative viral targets were analyzed. We observed no major or minor EBOV mutations within regions targeted by therapeutics. This small subset of patients and clinical specimens suggests that evolution of resistance is not a direct consequence of antiviral treatment. As EVD antiviral treatments are introduced into wider use, it is essential that continuous viral full-genome surveillance is performed, to monitor for the emergence of escape mutations. Published by Oxford University Press for the Infectious Diseases Society of America 2016. This work is written by (a) US Government employee(s) and is in the public domain in the US.
Co-evolution of a broadly neutralizing HIV-1 antibody and founder virus
Liao, Hua-Xin; Lynch, Rebecca; Zhou, Tongqing; Gao, Feng; Alam, S. Munir; Boyd, Scott D.; Fire, Andrew Z.; Roskin, Krishna M.; Schramm, Chaim A.; Zhang, Zhenhai; Zhu, Jiang; Shapiro, Lawrence; Mullikin, James C.; Gnanakaran, S.; Hraber, Peter; Wiehe, Kevin; Kelsoe, Garnett; Yang, Guang; Xia, Shi-Mao; Montefiori, David C.; Parks, Robert; Lloyd, Krissey E.; Scearce, Richard M.; Soderberg, Kelly A.; Cohen, Myron; Kaminga, Gift; Louder, Mark K.; Tran, Lillan M.; Chen, Yue; Cai, Fangping; Chen, Sheri; Moquin, Stephanie; Du, Xiulian; Joyce, Gordon M.; Srivatsan, Sanjay; Zhang, Baoshan; Zheng, Anqi; Shaw, George M.; Hahn, Beatrice H.; Kepler, Thomas B.; Korber, Bette T.M.; Kwong, Peter D.; Mascola, John R.; Haynes, Barton F.
2013-01-01
Current HIV-1 vaccines elicit strain-specific neutralizing antibodies. However, cross-reactive neutralizing antibodies arise in ~20% of HIV-1-infected individuals, and details of their generation could provide a roadmap for effective vaccination. Here we report the isolation, evolution and structure of a broadly neutralizing antibody from an African donor followed from time of infection. The mature antibody, CH103, neutralized ~55% of HIV-1 isolates, and its co-crystal structure with gp120 revealed a novel loop-based mechanism of CD4-binding site recognition. Virus and antibody gene sequencing revealed concomitant virus evolution and antibody maturation. Notably, the CH103-lineage unmutated common ancestor avidly bound the transmitted/founder HIV-1 envelope glycoprotein, and evolution of antibody neutralization breadth was preceded by extensive viral diversification in and near the CH103 epitope. These data elucidate the viral and antibody evolution leading to induction of a lineage of HIV-1 broadly neutralizing antibodies and provide insights into strategies to elicit similar antibodies via vaccination. PMID:23552890
Pou, Christian; Codoñer, Francisco M; Thielen, Alexander; Bellido, Rocío; Pérez-Álvarez, Susana; Cabrera, Cecilia; Dalmau, Judith; Curriu, Marta; Lie, Yolanda; Noguera-Julian, Marc; Puig, Jordi; Martínez-Picado, Javier; Blanco, Julià; Coakley, Eoin; Däumer, Martin; Clotet, Bonaventura; Paredes, Roger
2013-01-01
Technically, HIV-1 tropism can be evaluated in plasma or peripheral blood mononuclear cells (PBMCs). However, only tropism testing of plasma HIV-1 has been validated as a tool to predict virological response to CCR5 antagonists in clinical trials. The preferable tropism testing strategy in subjects with undetectable HIV-1 viremia, in whom plasma tropism testing is not feasible, remains uncertain. We designed a proof-of-concept study including 30 chronically HIV-1-infected individuals who achieved HIV-1 RNA <50 copies/mL during at least 2 years after first-line ART initiation. First, we determined the diagnostic accuracy of 454 and population sequencing of gp120 V3-loops in plasma and PBMCs, as well as of MT-2 assays before ART initiation. The Enhanced Sensitivity Trofile Assay (ESTA) was used as the technical reference standard. 454 sequencing of plasma viruses provided the highest agreement with ESTA. The accuracy of 454 sequencing decreased in PBMCs due to reduced specificity. Population sequencing in plasma and PBMCs was slightly less accurate than plasma 454 sequencing, being less sensitive but more specific. MT-2 assays had low sensitivity but 100% specificity. Then, we used optimized 454 sequence data to investigate viral evolution in PBMCs during viremia suppression and only found evolution of R5 viruses in one subject. No de novo CXCR4-using HIV-1 production was observed over time. Finally, Slatkin-Maddison tests suggested that plasma and cell-associated V3 forms were sometimes compartmentalized. The absence of tropism shifts during viremia suppression suggests that, when available, testing of stored plasma samples is generally safe and informative, provided that HIV-1 suppression is maintained. Tropism testing in PBMCs may not necessarily produce equivalent biological results to plasma, because the structure of viral populations and the diagnostic performance of tropism assays may sometimes vary between compartments. Thereby, proviral DNA tropism testing should be specifically validated in clinical trials before it can be applied to routine clinical decision-making.
Stepanauskas, Ramunas; Fergusson, Elizabeth A; Brown, Joseph; Poulton, Nicole J; Tupper, Ben; Labonté, Jessica M; Becraft, Eric D; Brown, Julia M; Pachiadaki, Maria G; Povilaitis, Tadas; Thompson, Brian P; Mascena, Corianna J; Bellows, Wendy K; Lubys, Arvydas
2017-07-20
Microbial single-cell genomics can be used to provide insights into the metabolic potential, interactions, and evolution of uncultured microorganisms. Here we present WGA-X, a method based on multiple displacement amplification of DNA that utilizes a thermostable mutant of the phi29 polymerase. WGA-X enhances genome recovery from individual microbial cells and viral particles while maintaining ease of use and scalability. The greatest improvements are observed when amplifying high G+C content templates, such as those belonging to the predominant bacteria in agricultural soils. By integrating WGA-X with calibrated index-cell sorting and high-throughput genomic sequencing, we are able to analyze genomic sequences and cell sizes of hundreds of individual, uncultured bacteria, archaea, protists, and viral particles, obtained directly from marine and soil samples, in a single experiment. This approach may find diverse applications in microbiology and in biomedical and forensic studies of humans and other multicellular organisms.Single-cell genomics can be used to study uncultured microorganisms. Here, Stepanauskas et al. present a method combining improved multiple displacement amplification and FACS, to obtain genomic sequences and cell size information from uncultivated microbial cells and viral particles in environmental samples.
Short reads from honey bee (Apis sp.) sequencing projects reflect microbial associate diversity
Hurst, Gregory D.D.
2017-01-01
High throughput (or ‘next generation’) sequencing has transformed most areas of biological research and is now a standard method that underpins empirical study of organismal biology, and (through comparison of genomes), reveals patterns of evolution. For projects focused on animals, these sequencing methods do not discriminate between the primary target of sequencing (the animal genome) and ‘contaminating’ material, such as associated microbes. A common first step is to filter out these contaminants to allow better assembly of the animal genome or transcriptome. Here, we aimed to assess if these ‘contaminations’ provide information with regard to biologically important microorganisms associated with the individual. To achieve this, we examined whether the short read data from Apis retrieved elements of its well established microbiome. To this end, we screened almost 1,000 short read libraries of honey bee (Apis sp.) DNA sequencing project for the presence of microbial sequences, and find sequences from known honey bee microbial associates in at least 11% of them. Further to this, we screened ∼500 Apis RNA sequencing libraries for evidence of viral infections, which were found to be present in about half of them. We then used the data to reconstruct draft genomes of three Apis associated bacteria, as well as several viral strains de novo. We conclude that ‘contamination’ in short read sequencing libraries can provide useful genomic information on microbial taxa known to be associated with the target organisms, and may even lead to the discovery of novel associations. Finally, we demonstrate that RNAseq samples from experiments commonly carry uneven viral loads across libraries. We note variation in viral presence and load may be a confounding feature of differential gene expression analyses, and as such it should be incorporated as a random factor in analyses. PMID:28717593
Short reads from honey bee (Apis sp.) sequencing projects reflect microbial associate diversity.
Gerth, Michael; Hurst, Gregory D D
2017-01-01
High throughput (or 'next generation') sequencing has transformed most areas of biological research and is now a standard method that underpins empirical study of organismal biology, and (through comparison of genomes), reveals patterns of evolution. For projects focused on animals, these sequencing methods do not discriminate between the primary target of sequencing (the animal genome) and 'contaminating' material, such as associated microbes. A common first step is to filter out these contaminants to allow better assembly of the animal genome or transcriptome. Here, we aimed to assess if these 'contaminations' provide information with regard to biologically important microorganisms associated with the individual. To achieve this, we examined whether the short read data from Apis retrieved elements of its well established microbiome. To this end, we screened almost 1,000 short read libraries of honey bee ( Apis sp.) DNA sequencing project for the presence of microbial sequences, and find sequences from known honey bee microbial associates in at least 11% of them. Further to this, we screened ∼500 Apis RNA sequencing libraries for evidence of viral infections, which were found to be present in about half of them. We then used the data to reconstruct draft genomes of three Apis associated bacteria, as well as several viral strains de novo . We conclude that 'contamination' in short read sequencing libraries can provide useful genomic information on microbial taxa known to be associated with the target organisms, and may even lead to the discovery of novel associations. Finally, we demonstrate that RNAseq samples from experiments commonly carry uneven viral loads across libraries. We note variation in viral presence and load may be a confounding feature of differential gene expression analyses, and as such it should be incorporated as a random factor in analyses.
Vrancken, Bram; Lemey, Philippe; Rambaut, Andrew; Bedford, Trevor; Longdon, Ben; Günthard, Huldrych F.; Suchard, Marc A.
2014-01-01
Phylogenetic signal quantifies the degree to which resemblance in continuously-valued traits reflects phylogenetic relatedness. Measures of phylogenetic signal are widely used in ecological and evolutionary research, and are recently gaining traction in viral evolutionary studies. Standard estimators of phylogenetic signal frequently condition on data summary statistics of the repeated trait observations and fixed phylogenetics trees, resulting in information loss and potential bias. To incorporate the observation process and phylogenetic uncertainty in a model-based approach, we develop a novel Bayesian inference method to simultaneously estimate the evolutionary history and phylogenetic signal from molecular sequence data and repeated multivariate traits. Our approach builds upon a phylogenetic diffusion framework that model continuous trait evolution as a Brownian motion process and incorporates Pagel’s λ transformation parameter to estimate dependence among traits. We provide a computationally efficient inference implementation in the BEAST software package. We evaluate the synthetic performance of the Bayesian estimator of phylogenetic signal against standard estimators, and demonstrate the use of our coherent framework to address several virus-host evolutionary questions, including virulence heritability for HIV, antigenic evolution in influenza and HIV, and Drosophila sensitivity to sigma virus infection. Finally, we discuss model extensions that will make useful contributions to our flexible framework for simultaneously studying sequence and trait evolution. PMID:25780554
Kovaliski, John; Sinclair, Ron; Mutze, Greg; Peacock, David; Strive, Tanja; Abrantes, Joana; Esteves, Pedro J.; Holmes, Edward C.
2015-01-01
Rabbit Haemorrhagic Disease Virus (RHDV) was introduced into Australia in 1995 as a biological control agent against the wild European rabbit (Oryctolagus cuniculus). We evaluated its evolution over a 16 year period (1995–2011) by examining 50 isolates collected throughout Australia, as well as the original inoculum strains. Phylogenetic analysis of capsid protein VP60 sequences of the Australian isolates, compared to those sampled globally, revealed that they form a monophyletic group with the inoculum strains (CAPM V-351 and RHDV351INOC). Strikingly, despite more than 3000 re-releases of RHDV351INOC since 1995, only a single viral lineage has sustained its transmission in the long-term, indicative of a major competitive advantage. In addition, we find evidence for widespread viral gene flow, in which multiple lineages entered individual geographic locations, resulting in a marked turnover of viral lineages with time, as well as a continual increase in viral genetic diversity. The rate of RHDV evolution recorded in Australia – 4.0 (3.3 – 4.7) × 10−3 nucleotide substitutions per site per year – was higher than previously observed in RHDV, and evidence for adaptive evolution was obtained at two VP60 residues. Finally, more intensive study of a single rabbit population (Turretfield) in South Australia provided no evidence for viral persistence between outbreaks, with genetic diversity instead generated by continual strain importation. PMID:24251353
Genome Sequencing and Analysis of Geographically Diverse Clinical Isolates of Herpes Simplex Virus 2
Lamers, Susanna L.; Weiner, Brian; Ray, Stuart C.; Colgrove, Robert C.; Diaz, Fernando; Jing, Lichen; Wang, Kening; Saif, Sakina; Young, Sarah; Henn, Matthew; Laeyendecker, Oliver; Tobian, Aaron A. R.; Cohen, Jeffrey I.; Koelle, David M.; Quinn, Thomas C.; Knipe, David M.
2015-01-01
ABSTRACT Herpes simplex virus 2 (HSV-2), the principal causative agent of recurrent genital herpes, is a highly prevalent viral infection worldwide. Limited information is available on the amount of genomic DNA variation between HSV-2 strains because only two genomes have been determined, the HG52 laboratory strain and the newly sequenced SD90e low-passage-number clinical isolate strain, each from a different geographical area. In this study, we report the nearly complete genome sequences of 34 HSV-2 low-passage-number and laboratory strains, 14 of which were collected in Uganda, 1 in South Africa, 11 in the United States, and 8 in Japan. Our analyses of these genomes demonstrated remarkable sequence conservation, regardless of geographic origin, with the maximum nucleotide divergence between strains being 0.4% across the genome. In contrast, prior studies indicated that HSV-1 genomes exhibit more sequence diversity, as well as geographical clustering. Additionally, unlike HSV-1, little viral recombination between HSV-2 strains could be substantiated. These results are interpreted in light of HSV-2 evolution, epidemiology, and pathogenesis. Finally, the newly generated sequences more closely resemble the low-passage-number SD90e than HG52, supporting the use of the former as the new reference genome of HSV-2. IMPORTANCE Herpes simplex virus 2 (HSV-2) is a causative agent of genital and neonatal herpes. Therefore, knowledge of its DNA genome and genetic variability is central to preventing and treating genital herpes. However, only two full-length HSV-2 genomes have been reported. In this study, we sequenced 34 additional HSV-2 low-passage-number and laboratory viral genomes and initiated analysis of the genetic diversity of HSV-2 strains from around the world. The analysis of these genomes will facilitate research aimed at vaccine development, diagnosis, and the evaluation of clinical manifestations and transmission of HSV-2. This information will also contribute to our understanding of HSV evolution. PMID:26018166
Gavrilin, Gene V.; Cherkasova, Elena A.; Lipskaya, Galina Y.; Kew, Olen M.; Agol, Vadim I.
2000-01-01
We determined nucleotide sequences of the VP1 and 2AB genes and portions of the 2C and 3D genes of two evolving poliovirus lineages: circulating wild viruses of T geotype and Sabin vaccine-derived isolates from an immunodeficient patient. Different regions of the viral RNA were found to evolve nonsynchronously, and the rate of evolution of the 2AB region in the vaccine-derived population was not constant throughout its history. Synonymous replacements occurred not completely randomly, suggesting the need for conservation of certain rare codons (possibly to control translation elongation) and the existence of unidentified constraints in the viral RNA structure. Nevertheless the major contribution to the evolution of the two lineages came from linear accumulation of synonymous substitutions. Therefore, in agreement with current theories of viral evolution, we suggest that the majority of the mutations in both lineages were fixed as a result of successive sampling, from the heterogeneous populations, of random portions containing predominantly neutral and possibly adverse mutations. As a result of such a mode of evolution, the virus fitness may be maintained at a more or less constant level or may decrease unless more-fit variants are stochastically generated. The proposed unifying model of natural poliovirus evolution has important implications for the epidemiology of poliomyelitis. PMID:10906191
Herbeck, Joshua T; Rolland, Morgane; Liu, Yi; McLaughlin, Sherry; McNevin, John; Zhao, Hong; Wong, Kim; Stoddard, Julia N; Raugi, Dana; Sorensen, Stephanie; Genowati, Indira; Birditt, Brian; McKay, Angela; Diem, Kurt; Maust, Brandon S; Deng, Wenjie; Collier, Ann C; Stekler, Joanne D; McElrath, M Juliana; Mullins, James I
2011-08-01
HIV-1 transmission and viral evolution in the first year of infection were studied in 11 individuals representing four transmitter-recipient pairs and three independent seroconverters. Nine of these individuals were enrolled during acute infection; all were men who have sex with men (MSM) infected with HIV-1 subtype B. A total of 475 nearly full-length HIV-1 genome sequences were generated, representing on average 10 genomes per specimen at 2 to 12 visits over the first year of infection. Single founding variants with nearly homogeneous viral populations were detected in eight of the nine individuals who were enrolled during acute HIV-1 infection. Restriction to a single founder variant was not due to a lack of diversity in the transmitter as homogeneous populations were found in recipients from transmitters with chronic infection. Mutational patterns indicative of rapid viral population growth dominated during the first 5 weeks of infection and included a slight contraction of viral genetic diversity over the first 20 to 40 days. Subsequently, selection dominated, most markedly in env and nef. Mutants were detected in the first week and became consensus as early as day 21 after the onset of symptoms of primary HIV infection. We found multiple indications of cytotoxic T lymphocyte (CTL) escape mutations while reversions appeared limited. Putative escape mutations were often rapidly replaced with mutually exclusive mutations nearby, indicating the existence of a maturational escape process, possibly in adaptation to viral fitness constraints or to immune responses against new variants. We showed that establishment of HIV-1 infection is likely due to a biological mechanism that restricts transmission rather than to early adaptive evolution during acute infection. Furthermore, the diversity of HIV strains coupled with complex and individual-specific patterns of CTL escape did not reveal shared sequence characteristics of acute infection that could be harnessed for vaccine design.
Co-Circulation and Evolution of Polioviruses and Species C Enteroviruses in a District of Madagascar
Rakoto-Andrianarivelo, Mala; Guillot, Sophie; Iber, Jane; Balanant, Jean; Blondel, Bruno; Riquet, Franck; Martin, Javier; Kew, Olen; Randriamanalina, Bakolalao; Razafinimpiasa, Lalatiana; Rousset, Dominique; Delpeyroux, Francis
2007-01-01
Between October 2001 and April 2002, five cases of acute flaccid paralysis (AFP) associated with type 2 vaccine-derived polioviruses (VDPVs) were reported in the southern province of the Republic of Madagascar. To determine viral factors that favor the emergence of these pathogenic VDPVs, we analyzed in detail their genomic and phenotypic characteristics and compared them with co-circulating enteroviruses. These VDPVs appeared to belong to two independent recombinant lineages with sequences from the type 2 strain of the oral poliovaccine (OPV) in the 5′-half of the genome and sequences derived from unidentified species C enteroviruses (HEV-C) in the 3′-half. VDPV strains showed characteristics similar to those of wild neurovirulent viruses including neurovirulence in poliovirus-receptor transgenic mice. We looked for other VDPVs and for circulating enteroviruses in 316 stools collected from healthy children living in the small area where most of the AFP cases occurred. We found vaccine PVs, two VDPVs similar to those found in AFP cases, some echoviruses, and above all, many serotypes of coxsackie A viruses belonging to HEV-C, with substantial genetic diversity. Several coxsackie viruses A17 and A13 carried nucleotide sequences closely related to the 2C and the 3Dpol coding regions of the VDPVs, respectively. There was also evidence of multiple genetic recombination events among the HEV-C resulting in numerous recombinant genotypes. This indicates that co-circulation of HEV-C and OPV strains is associated with evolution by recombination, resulting in unexpectedly extensive viral diversity in small human populations in some tropical regions. This probably contributed to the emergence of recombinant VDPVs. These findings give further insight into viral ecosystems and the evolutionary processes that shape viral biodiversity. PMID:18085822
Lowry, Kym; Woodman, Andrew; Cook, Jonathan; Evans, David J.
2014-01-01
Recombination in enteroviruses provides an evolutionary mechanism for acquiring extensive regions of novel sequence, is suggested to have a role in genotype diversity and is known to have been key to the emergence of novel neuropathogenic variants of poliovirus. Despite the importance of this evolutionary mechanism, the recombination process remains relatively poorly understood. We investigated heterologous recombination using a novel reverse genetic approach that resulted in the isolation of intermediate chimeric intertypic polioviruses bearing genomes with extensive duplicated sequences at the recombination junction. Serial passage of viruses exhibiting such imprecise junctions yielded progeny with increased fitness which had lost the duplicated sequences. Mutations or inhibitors that changed polymerase fidelity or the coalescence of replication complexes markedly altered the yield of recombinants (but did not influence non-replicative recombination) indicating both that the process is replicative and that it may be possible to enhance or reduce recombination-mediated viral evolution if required. We propose that extant recombinants result from a biphasic process in which an initial recombination event is followed by a process of resolution, deleting extraneous sequences and optimizing viral fitness. This process has implications for our wider understanding of ‘evolution by duplication’ in the positive-strand RNA viruses. PMID:24945141
Viljakainen, Lumi; Holmberg, Ida; Abril, Sílvia; Jurvansuu, Jaana
2018-06-25
The Argentine ant (Linepithema humile) is a highly invasive pest, yet very little is known about its viruses. We analysed individual RNA-sequencing data from 48 Argentine ant queens to identify and characterisze their viruses. We discovered eight complete RNA virus genomes - all from different virus families - and one putative partial entomopoxvirus genome. Seven of the nine virus sequences were found from ant samples spanning 7 years, suggesting that these viruses may cause long-term infections within the super-colony. Although all nine viruses successfully infect Argentine ants, they have very different characteristics, such as genome organization, prevalence, loads, activation frequencies and rates of evolution. The eight RNA viruses constituted in total 23 different virus combinations which, based on statistical analysis, were non-random, suggesting that virus compatibility is a factor in infections. We also searched for virus sequences from New Zealand and Californian Argentine ant RNA-sequencing data and discovered that many of the viruses are found on different continents, yet some viruses are prevalent only in certain colonies. The viral loads described here most probably present a normal asymptomatic level of infection; nevertheless, detailed knowledge of Argentine ant viruses may enable the design of viral biocontrol methods against this pest.
Multiple introductions and onward transmission of HIV-1 subtype B strains in Shanghai, China.
Li, Xiaoshan; Zhu, Kexin; Xue, Yile; Wei, Feiran; Gao, Rong; Duerr, Ralf; Fang, Kun; Li, Wei; Song, Yue; Du, Guoping; Yan, Wenjuan; Musa, Taha Hussein; Ge, You; Ji, Yu; Zhong, Ping; Wei, Pingmin
2017-08-01
To investigate the viral genetic evolution, spatial origins and patterns of transmission of HIV-1 subtype B in Shanghai, China. A total of 242 Shanghai subtype B and 1519 reference pol sequences were subjected to phylogenetic inference and genetic transmission network analyses. Phylogenetic analysis revealed that subtype B strains circulating in Shanghai were genetically diverse and closely associated with viral sequence lineages in Beijing (76 of 242 [31.4%]), Central China (Henan/Hebei/Hunan/Hubei) (43 of 242 [17.8%]), Chinese Taiwan (20 of 242 [8.3%]), Japan (6 of 242 [2.5%]), and Korea (7 of 242 [2.9%]), suggesting multiple introductions into Shanghai from mainland China and Taiwan, Japan, and Korea. Interestingly, a monophyletic Shanghai lineage (SH-L) (36 of 242 [14.9%]) of HIV-1 subtype B most likely originated from an Argentine strain, transferred through Liaoning infected individuals. In-depth analyses of 195 Shanghai subtype B sequences revealed that a total of 37.9% (n = 74) sequences contributed to 35 transmission networks, whereof 33.8% (n = 25) of the sequences associated with infected individuals from other provinces. Our new findings reflect the evolution complexity and transmission dynamics of HIV-1 subtype B in Shanghai, which would provide critical information for the design of effective prevention measures against HIV transmission. Copyright © 2017 The British Infection Association. Published by Elsevier Ltd. All rights reserved.
Sunshine, Justine E.; Larsen, Brendan B.; Maust, Brandon; Casey, Ellie; Deng, Wenje; Chen, Lennie; Westfall, Dylan H.; Kim, Moon; Zhao, Hong; Ghorai, Suvankar; Lanxon-Cookson, Erinn; Rolland, Morgane; Collier, Ann C.; Maenza, Janine; Mullins, James I.
2015-01-01
ABSTRACT To understand the interplay between host cytotoxic T-lymphocyte (CTL) responses and the mechanisms by which HIV-1 evades them, we studied viral evolutionary patterns associated with host CTL responses in six linked transmission pairs. HIV-1 sequences corresponding to full-length p17 and p24 gag were generated by 454 pyrosequencing for all pairs near the time of transmission, and seroconverting partners were followed for a median of 847 days postinfection. T-cell responses were screened by gamma interferon/interleukin-2 (IFN-γ/IL-2) FluoroSpot using autologous peptide sets reflecting any Gag variant present in at least 5% of sequence reads in the individual's viral population. While we found little evidence for the occurrence of CTL reversions, CTL escape processes were found to be highly dynamic, with multiple epitope variants emerging simultaneously. We found a correlation between epitope entropy and the number of epitope variants per response (r = 0.43; P = 0.05). In cases in which multiple escape mutations developed within a targeted epitope, a variant with no fitness cost became fixed in the viral population. When multiple mutations within an epitope achieved fitness-balanced escape, these escape mutants were each maintained in the viral population. Additional mutations found to confer escape but undetected in viral populations incurred high fitness costs, suggesting that functional constraints limit the available sites tolerable to escape mutations. These results further our understanding of the impact of CTL escape and reversion from the founder virus in HIV infection and contribute to the identification of immunogenic Gag regions most vulnerable to a targeted T-cell attack. IMPORTANCE Rapid diversification of the viral population is a hallmark of HIV-1 infection, and understanding the selective forces driving the emergence of viral variants can provide critical insight into the interplay between host immune responses and viral evolution. We used deep sequencing to comprehensively follow viral evolution over time in six linked HIV transmission pairs. We then mapped T-cell responses to explore if mutations arose due to adaption to the host and found that escape processes were often highly dynamic, with multiple mutations arising within targeted epitopes. When we explored the impact of these mutations on replicative capacity, we found that dynamic escape processes only resolve with the selection of mutations that conferred escape with no fitness cost to the virus. These results provide further understanding of the complicated viral-host interactions that occur during early HIV-1 infection and may help inform the design of future vaccine immunogens. PMID:26223634
Goller, Katja V; Gabriel, Claudia; Dimna, Mireille Le; Le Potier, Marie-Frédérique; Rossi, Sophie; Staubach, Christoph; Merboth, Matthias; Beer, Martin; Blome, Sandra
2016-03-01
Classical swine fever is a viral disease of pigs that carries tremendous socio-economic impact. In outbreak situations, genetic typing is carried out for the purpose of molecular epidemiology in both domestic pigs and wild boar. These analyses are usually based on harmonized partial sequences. However, for high-resolution analyses towards the understanding of genetic variability and virus evolution, full-genome sequences are more appropriate. In this study, a unique set of representative virus strains was investigated that was collected during an outbreak in French free-ranging wild boar in the Vosges-du-Nord mountains between 2003 and 2007. Comparative sequence and evolutionary analyses of the nearly full-length sequences showed only slow evolution of classical swine fever virus strains over the years and no impact of vaccination on mutation rates. However, substitution rates varied amongst protein genes; furthermore, a spatial and temporal pattern could be observed whereby two separate clusters were formed that coincided with physical barriers.
Efficient error correction for next-generation sequencing of viral amplicons
2012-01-01
Background Next-generation sequencing allows the analysis of an unprecedented number of viral sequence variants from infected patients, presenting a novel opportunity for understanding virus evolution, drug resistance and immune escape. However, sequencing in bulk is error prone. Thus, the generated data require error identification and correction. Most error-correction methods to date are not optimized for amplicon analysis and assume that the error rate is randomly distributed. Recent quality assessment of amplicon sequences obtained using 454-sequencing showed that the error rate is strongly linked to the presence and size of homopolymers, position in the sequence and length of the amplicon. All these parameters are strongly sequence specific and should be incorporated into the calibration of error-correction algorithms designed for amplicon sequencing. Results In this paper, we present two new efficient error correction algorithms optimized for viral amplicons: (i) k-mer-based error correction (KEC) and (ii) empirical frequency threshold (ET). Both were compared to a previously published clustering algorithm (SHORAH), in order to evaluate their relative performance on 24 experimental datasets obtained by 454-sequencing of amplicons with known sequences. All three algorithms show similar accuracy in finding true haplotypes. However, KEC and ET were significantly more efficient than SHORAH in removing false haplotypes and estimating the frequency of true ones. Conclusions Both algorithms, KEC and ET, are highly suitable for rapid recovery of error-free haplotypes obtained by 454-sequencing of amplicons from heterogeneous viruses. The implementations of the algorithms and data sets used for their testing are available at: http://alan.cs.gsu.edu/NGS/?q=content/pyrosequencing-error-correction-algorithm PMID:22759430
Efficient error correction for next-generation sequencing of viral amplicons.
Skums, Pavel; Dimitrova, Zoya; Campo, David S; Vaughan, Gilberto; Rossi, Livia; Forbi, Joseph C; Yokosawa, Jonny; Zelikovsky, Alex; Khudyakov, Yury
2012-06-25
Next-generation sequencing allows the analysis of an unprecedented number of viral sequence variants from infected patients, presenting a novel opportunity for understanding virus evolution, drug resistance and immune escape. However, sequencing in bulk is error prone. Thus, the generated data require error identification and correction. Most error-correction methods to date are not optimized for amplicon analysis and assume that the error rate is randomly distributed. Recent quality assessment of amplicon sequences obtained using 454-sequencing showed that the error rate is strongly linked to the presence and size of homopolymers, position in the sequence and length of the amplicon. All these parameters are strongly sequence specific and should be incorporated into the calibration of error-correction algorithms designed for amplicon sequencing. In this paper, we present two new efficient error correction algorithms optimized for viral amplicons: (i) k-mer-based error correction (KEC) and (ii) empirical frequency threshold (ET). Both were compared to a previously published clustering algorithm (SHORAH), in order to evaluate their relative performance on 24 experimental datasets obtained by 454-sequencing of amplicons with known sequences. All three algorithms show similar accuracy in finding true haplotypes. However, KEC and ET were significantly more efficient than SHORAH in removing false haplotypes and estimating the frequency of true ones. Both algorithms, KEC and ET, are highly suitable for rapid recovery of error-free haplotypes obtained by 454-sequencing of amplicons from heterogeneous viruses.The implementations of the algorithms and data sets used for their testing are available at: http://alan.cs.gsu.edu/NGS/?q=content/pyrosequencing-error-correction-algorithm.
Viral Diversity Threshold for Adaptive Immunity in Prokaryotes
Weinberger, Ariel D.; Wolf, Yuri I.; Lobkovsky, Alexander E.; Gilmore, Michael S.; Koonin, Eugene V.
2012-01-01
ABSTRACT Bacteria and archaea face continual onslaughts of rapidly diversifying viruses and plasmids. Many prokaryotes maintain adaptive immune systems known as clustered regularly interspaced short palindromic repeats (CRISPR) and CRISPR-associated genes (Cas). CRISPR-Cas systems are genomic sensors that serially acquire viral and plasmid DNA fragments (spacers) that are utilized to target and cleave matching viral and plasmid DNA in subsequent genomic invasions, offering critical immunological memory. Only 50% of sequenced bacteria possess CRISPR-Cas immunity, in contrast to over 90% of sequenced archaea. To probe why half of bacteria lack CRISPR-Cas immunity, we combined comparative genomics and mathematical modeling. Analysis of hundreds of diverse prokaryotic genomes shows that CRISPR-Cas systems are substantially more prevalent in thermophiles than in mesophiles. With sequenced bacteria disproportionately mesophilic and sequenced archaea mostly thermophilic, the presence of CRISPR-Cas appears to depend more on environmental temperature than on bacterial-archaeal taxonomy. Mutation rates are typically severalfold higher in mesophilic prokaryotes than in thermophilic prokaryotes. To quantitatively test whether accelerated viral mutation leads microbes to lose CRISPR-Cas systems, we developed a stochastic model of virus-CRISPR coevolution. The model competes CRISPR-Cas-positive (CRISPR-Cas+) prokaryotes against CRISPR-Cas-negative (CRISPR-Cas−) prokaryotes, continually weighing the antiviral benefits conferred by CRISPR-Cas immunity against its fitness costs. Tracking this cost-benefit analysis across parameter space reveals viral mutation rate thresholds beyond which CRISPR-Cas cannot provide sufficient immunity and is purged from host populations. These results offer a simple, testable viral diversity hypothesis to explain why mesophilic bacteria disproportionately lack CRISPR-Cas immunity. More generally, fundamental limits on the adaptability of biological sensors (Lamarckian evolution) are predicted. PMID:23221803
Multiplex Reverse Transcription-PCR for Simultaneous Surveillance of Influenza A and B Viruses
Zhou, Bin; Barnes, John R.; Sessions, October M.; Chou, Tsui-Wen; Wilson, Malania; Stark, Thomas J.; Volk, Michelle; Spirason, Natalie; Halpin, Rebecca A.; Kamaraj, Uma Sangumathi; Ding, Tao; Stockwell, Timothy B.; Ghedin, Elodie; Barr, Ian G.
2017-01-01
ABSTRACT Influenza A and B viruses are the causative agents of annual influenza epidemics that can be severe, and influenza A viruses intermittently cause pandemics. Sequence information from influenza virus genomes is instrumental in determining mechanisms underpinning antigenic evolution and antiviral resistance. However, due to sequence diversity and the dynamics of influenza virus evolution, rapid and high-throughput sequencing of influenza viruses remains a challenge. We developed a single-reaction influenza A/B virus (FluA/B) multiplex reverse transcription-PCR (RT-PCR) method that amplifies the most critical genomic segments (hemagglutinin [HA], neuraminidase [NA], and matrix [M]) of seasonal influenza A and B viruses for next-generation sequencing, regardless of viral type, subtype, or lineage. Herein, we demonstrate that the strategy is highly sensitive and robust. The strategy was validated on thousands of seasonal influenza A and B virus-positive specimens using multiple next-generation sequencing platforms. PMID:28978683
Isakov, Ofer; Bordería, Antonio V; Golan, David; Hamenahem, Amir; Celniker, Gershon; Yoffe, Liron; Blanc, Hervé; Vignuzzi, Marco; Shomron, Noam
2015-07-01
The study of RNA virus populations is a challenging task. Each population of RNA virus is composed of a collection of different, yet related genomes often referred to as mutant spectra or quasispecies. Virologists using deep sequencing technologies face major obstacles when studying virus population dynamics, both experimentally and in natural settings due to the relatively high error rates of these technologies and the lack of high performance pipelines. In order to overcome these hurdles we developed a computational pipeline, termed ViVan (Viral Variance Analysis). ViVan is a complete pipeline facilitating the identification, characterization and comparison of sequence variance in deep sequenced virus populations. Applying ViVan on deep sequenced data obtained from samples that were previously characterized by more classical approaches, we uncovered novel and potentially crucial aspects of virus populations. With our experimental work, we illustrate how ViVan can be used for studies ranging from the more practical, detection of resistant mutations and effects of antiviral treatments, to the more theoretical temporal characterization of the population in evolutionary studies. Freely available on the web at http://www.vivanbioinfo.org : nshomron@post.tau.ac.il Supplementary data are available at Bioinformatics online. © The Author 2015. Published by Oxford University Press.
Wang, Jin; Dong, Hongping; Chionh, Yok Hian; McBee, Megan E.; Sirirungruang, Sasilada; Cunningham, Richard P.; Shi, Pei-Yong; Dedon, Peter C.
2016-01-01
The misincorporation of 2′-deoxyribonucleotides (dNs) into RNA has important implications for the function of non-coding RNAs, the translational fidelity of coding RNAs and the mutagenic evolution of viral RNA genomes. However, quantitative appreciation for the degree to which dN misincorporation occurs is limited by the lack of analytical tools. Here, we report a method to hydrolyze RNA to release 2′-deoxyribonucleotide-ribonucleotide pairs (dNrN) that are then quantified by chromatography-coupled mass spectrometry (LC-MS). Using this platform, we found misincorporated dNs occurring at 1 per 103 to 105 ribonucleotide (nt) in mRNA, rRNAs and tRNA in human cells, Escherichia coli, Saccharomyces cerevisiae and, most abundantly, in the RNA genome of dengue virus. The frequency of dNs varied widely among organisms and sequence contexts, and partly reflected the in vitro discrimination efficiencies of different RNA polymerases against 2′-deoxyribonucleoside 5′-triphosphates (dNTPs). Further, we demonstrate a strong link between dN frequencies in RNA and the balance of dNTPs and ribonucleoside 5′-triphosphates (rNTPs) in the cellular pool, with significant stress-induced variation of dN incorporation. Potential implications of dNs in RNA are discussed, including the possibilities of dN incorporation in RNA as a contributing factor in viral evolution and human disease, and as a host immune defense mechanism against viral infections. PMID:27365049
Tan, Yi; Hassan, Ferdaus; Schuster, Jennifer E.; Simenauer, Ari; Selvarangan, Rangaraj; Halpin, Rebecca A.; Lin, Xudong; Fedorova, Nadia; Stockwell, Timothy B.; Lam, Tommy Tsan-Yuk; Chappell, James D.; Hartert, Tina V.; Holmes, Edward C.
2015-01-01
ABSTRACT In August 2014, an outbreak of enterovirus D68 (EV-D68) occurred in North America, causing severe respiratory disease in children. Due to a lack of complete genome sequence data, there is only a limited understanding of the molecular evolution and epidemiology of EV-D68 during this outbreak, and it is uncertain whether the differing clinical manifestations of EV-D68 infection are associated with specific viral lineages. We developed a high-throughput complete genome sequencing pipeline for EV-D68 that produced a total of 59 complete genomes from respiratory samples with a 95% success rate, including 57 genomes from Kansas City, MO, collected during the 2014 outbreak. With these data in hand, we performed phylogenetic analyses of complete genome and VP1 capsid protein sequences. Notably, we observed considerable genetic diversity among EV-D68 isolates in Kansas City, manifest as phylogenetically distinct lineages, indicative of multiple introductions of this virus into the city. In addition, we identified an intersubclade recombination event within EV-D68, the first recombinant in this virus reported to date. Finally, we found no significant association between EV-D68 genetic variation, either lineages or individual mutations, and a variety of demographic and clinical variables, suggesting that host factors likely play a major role in determining disease severity. Overall, our study revealed the complex pattern of viral evolution within a single geographic locality during a single outbreak, which has implications for the design of effective intervention and prevention strategies. IMPORTANCE Until recently, EV-D68 was considered to be an uncommon human pathogen, associated with mild respiratory illness. However, in 2014 EV-D68 was responsible for more than 1,000 disease cases in North America, including severe respiratory illness in children and acute flaccid myelitis, raising concerns about its potential impact on public health. Despite the emergence of EV-D68, a lack of full-length genome sequences means that little is known about the molecular evolution of this virus within a single geographic locality during a single outbreak. Here, we doubled the number of publicly available complete genome sequences of EV-D68 by performing high-throughput next-generation sequencing, characterized the evolutionary history of this outbreak in detail, identified a recombination event, and investigated whether there was any correlation between the demographic and clinical characteristics of the patients and the viral variant that infected them. Overall, these results will help inform the design of intervention strategies for EV-D68. PMID:26656685
Chang, Suhua; Zhang, Jiajie; Liao, Xiaoyun; Zhu, Xinxing; Wang, Dahai; Zhu, Jiang; Feng, Tao; Zhu, Baoli; Gao, George F; Wang, Jian; Yang, Huanming; Yu, Jun; Wang, Jing
2007-01-01
Frequent outbreaks of highly pathogenic avian influenza and the increasing data available for comparative analysis require a central database specialized in influenza viruses (IVs). We have established the Influenza Virus Database (IVDB) to integrate information and create an analysis platform for genetic, genomic, and phylogenetic studies of the virus. IVDB hosts complete genome sequences of influenza A virus generated by Beijing Institute of Genomics (BIG) and curates all other published IV sequences after expert annotation. Our Q-Filter system classifies and ranks all nucleotide sequences into seven categories according to sequence content and integrity. IVDB provides a series of tools and viewers for comparative analysis of the viral genomes, genes, genetic polymorphisms and phylogenetic relationships. A search system has been developed for users to retrieve a combination of different data types by setting search options. To facilitate analysis of global viral transmission and evolution, the IV Sequence Distribution Tool (IVDT) has been developed to display the worldwide geographic distribution of chosen viral genotypes and to couple genomic data with epidemiological data. The BLAST, multiple sequence alignment and phylogenetic analysis tools were integrated for online data analysis. Furthermore, IVDB offers instant access to pre-computed alignments and polymorphisms of IV genes and proteins, and presents the results as SNP distribution plots and minor allele distributions. IVDB is publicly available at http://influenza.genomics.org.cn.
Using hidden Markov models and observed evolution to annotate viral genomes.
McCauley, Stephen; Hein, Jotun
2006-06-01
ssRNA (single stranded) viral genomes are generally constrained in length and utilize overlapping reading frames to maximally exploit the coding potential within the genome length restrictions. This overlapping coding phenomenon leads to complex evolutionary constraints operating on the genome. In regions which code for more than one protein, silent mutations in one reading frame generally have a protein coding effect in another. To maximize coding flexibility in all reading frames, overlapping regions are often compositionally biased towards amino acids which are 6-fold degenerate with respect to the 64 codon alphabet. Previous methodologies have used this fact in an ad hoc manner to look for overlapping genes by motif matching. In this paper differentiated nucleotide compositional patterns in overlapping regions are incorporated into a probabilistic hidden Markov model (HMM) framework which is used to annotate ssRNA viral genomes. This work focuses on single sequence annotation and applies an HMM framework to ssRNA viral annotation. A description of how the HMM is parameterized, whilst annotating within a missing data framework is given. A Phylogenetic HMM (Phylo-HMM) extension, as applied to 14 aligned HIV2 sequences is also presented. This evolutionary extension serves as an illustration of the potential of the Phylo-HMM framework for ssRNA viral genomic annotation. The single sequence annotation procedure (SSA) is applied to 14 different strains of the HIV2 virus. Further results on alternative ssRNA viral genomes are presented to illustrate more generally the performance of the method. The results of the SSA method are encouraging however there is still room for improvement, and since there is overwhelming evidence to indicate that comparative methods can improve coding sequence (CDS) annotation, the SSA method is extended to a Phylo-HMM to incorporate evolutionary information. The Phylo-HMM extension is applied to the same set of 14 HIV2 sequences which are pre-aligned. The performance improvement that results from including the evolutionary information in the analysis is illustrated.
PAQ: Partition Analysis of Quasispecies.
Baccam, P; Thompson, R J; Fedrigo, O; Carpenter, S; Cornette, J L
2001-01-01
The complexities of genetic data may not be accurately described by any single analytical tool. Phylogenetic analysis is often used to study the genetic relationship among different sequences. Evolutionary models and assumptions are invoked to reconstruct trees that describe the phylogenetic relationship among sequences. Genetic databases are rapidly accumulating large amounts of sequences. Newly acquired sequences, which have not yet been characterized, may require preliminary genetic exploration in order to build models describing the evolutionary relationship among sequences. There are clustering techniques that rely less on models of evolution, and thus may provide nice exploratory tools for identifying genetic similarities. Some of the more commonly used clustering methods perform better when data can be grouped into mutually exclusive groups. Genetic data from viral quasispecies, which consist of closely related variants that differ by small changes, however, may best be partitioned by overlapping groups. We have developed an intuitive exploratory program, Partition Analysis of Quasispecies (PAQ), which utilizes a non-hierarchical technique to partition sequences that are genetically similar. PAQ was used to analyze a data set of human immunodeficiency virus type 1 (HIV-1) envelope sequences isolated from different regions of the brain and another data set consisting of the equine infectious anemia virus (EIAV) regulatory gene rev. Analysis of the HIV-1 data set by PAQ was consistent with phylogenetic analysis of the same data, and the EIAV rev variants were partitioned into two overlapping groups. PAQ provides an additional tool which can be used to glean information from genetic data and can be used in conjunction with other tools to study genetic similarities and genetic evolution of viral quasispecies.
Weiss, Eric R; Lamers, Susanna L; Henderson, Jennifer L; Melnikov, Alexandre; Somasundaran, Mohan; Garber, Manuel; Selin, Liisa; Nusbaum, Chad; Luzuriaga, Katherine
2018-01-15
Over 90% of the world's population is persistently infected with Epstein-Barr virus. While EBV does not cause disease in most individuals, it is the common cause of acute infectious mononucleosis (AIM) and has been associated with several cancers and autoimmune diseases, highlighting a need for a preventive vaccine. At present, very few primary, circulating EBV genomes have been sequenced directly from infected individuals. While low levels of diversity and low viral evolution rates have been predicted for double-stranded DNA (dsDNA) viruses, recent studies have demonstrated appreciable diversity in common dsDNA pathogens (e.g., cytomegalovirus). Here, we report 40 full-length EBV genome sequences obtained from matched oral wash and B cell fractions from a cohort of 10 AIM patients. Both intra- and interpatient diversity were observed across the length of the entire viral genome. Diversity was most pronounced in viral genes required for establishing latent infection and persistence, with appreciable levels of diversity also detected in structural genes, including envelope glycoproteins. Interestingly, intrapatient diversity declined significantly over time ( P < 0.01), and this was particularly evident on comparison of viral genomes sequenced from B cell fractions in early primary infection and convalescence ( P < 0.001). B cell-associated viral genomes were observed to converge, becoming nearly identical to the B95.8 reference genome over time (Spearman rank-order correlation test; r = -0.5589, P = 0.0264). The reduction in diversity was most marked in the EBV latency genes. In summary, our data suggest independent convergence of diverse viral genome sequences toward a reference-like strain within a relatively short period following primary EBV infection. IMPORTANCE Identification of viral proteins with low variability and high immunogenicity is important for the development of a protective vaccine. Knowledge of genome diversity within circulating viral populations is a key step in this process, as is the expansion of intrahost genomic variation during infection. We report full-length EBV genomes sequenced from the blood and oral wash of 10 individuals early in primary infection and during convalescence. Our data demonstrate considerable diversity within the pool of circulating EBV strains, as well as within individual patients. Overall viral diversity decreased from early to persistent infection, particularly in latently infected B cells, which serve as the viral reservoir. Reduction in B cell-associated viral genome diversity coincided with a convergence toward a reference-like EBV genotype. Greater convergence positively correlated with time after infection, suggesting that the reference-like genome is the result of selection. Copyright © 2018 American Society for Microbiology.
Horizontal acquisition of transposable elements and viral sequences: patterns and consequences.
Gilbert, Clément; Feschotte, Cédric
2018-04-01
It is becoming clear that most eukaryotic transposable elements (TEs) owe their evolutionary success in part to horizontal transfer events, which enable them to invade new species. Recent large-scale studies are beginning to unravel the mechanisms and ecological factors underlying this mode of transmission. Viruses are increasingly recognized as vectors in the process but also as a direct source of genetic material horizontally acquired by eukaryotic organisms. Because TEs and endogenous viruses are major catalysts of variation and innovation in genomes, we argue that horizontal inheritance has had a more profound impact in eukaryotic evolution than is commonly appreciated. To support this proposal, we compile a list of examples, including some previously unrecognized, whereby new host functions and phenotypes can be directly attributed to horizontally acquired TE or viral sequences. We predict that the number of examples will rapidly grow in the future as the prevalence of horizontal transfer in the life cycle of TEs becomes even more apparent, firmly establishing this form of non-Mendelian inheritance as a consequential facet of eukaryotic evolution. Copyright © 2018 Elsevier Ltd. All rights reserved.
Inferring HIV Escape Rates from Multi-Locus Genotype Data
Kessinger, Taylor A.; Perelson, Alan S.; Neher, Richard A.
2013-09-03
Cytotoxic T-lymphocytes (CTLs) recognize viral protein fragments displayed by major histocompatibility complex molecules on the surface of virally infected cells and generate an anti-viral response that can kill the infected cells. Virus variants whose protein fragments are not efficiently presented on infected cells or whose fragments are presented but not recognized by CTLs therefore have a competitive advantage and spread rapidly through the population. We present a method that allows a more robust estimation of these escape rates from serially sampled sequence data. The proposed method accounts for competition between multiple escapes by explicitly modeling the accumulation of escape mutationsmore » and the stochastic effects of rare multiple mutants. Applying our method to serially sampled HIV sequence data, we estimate rates of HIV escape that are substantially larger than those previously reported. The method can be extended to complex escapes that require compensatory mutations. We expect our method to be applicable in other contexts such as cancer evolution where time series data is also available.« less
Rissanen, Ilona; Grimes, Jonathan M.; Pawlowski, Alice; Mäntynen, Sari; Harlos, Karl; Bamford, Jaana K.H.; Stuart, David I.
2013-01-01
Summary It has proved difficult to classify viruses unless they are closely related since their rapid evolution hinders detection of remote evolutionary relationships in their genetic sequences. However, structure varies more slowly than sequence, allowing deeper evolutionary relationships to be detected. Bacteriophage P23-77 is an example of a newly identified viral lineage, with members inhabiting extreme environments. We have solved multiple crystal structures of the major capsid proteins VP16 and VP17 of bacteriophage P23-77. They fit the 14 Å resolution cryo-electron microscopy reconstruction of the entire virus exquisitely well, allowing us to propose a model for both the capsid architecture and viral assembly, quite different from previously published models. The structures of the capsid proteins and their mode of association to form the viral capsid suggest that the P23-77-like and adeno-PRD1 lineages of viruses share an extremely ancient common ancestor. PMID:23623731
Next-generation sequencing in clinical virology: Discovery of new viruses.
Datta, Sibnarayan; Budhauliya, Raghvendra; Das, Bidisha; Chatterjee, Soumya; Vanlalhmuaka; Veer, Vijay
2015-08-12
Viruses are a cause of significant health problem worldwide, especially in the developing nations. Due to different anthropological activities, human populations are exposed to different viral pathogens, many of which emerge as outbreaks. In such situations, discovery of novel viruses is utmost important for deciding prevention and treatment strategies. Since last century, a number of different virus discovery methods, based on cell culture inoculation, sequence-independent PCR have been used for identification of a variety of viruses. However, the recent emergence and commercial availability of next-generation sequencers (NGS) has entirely changed the field of virus discovery. These massively parallel sequencing platforms can sequence a mixture of genetic materials from a very heterogeneous mix, with high sensitivity. Moreover, these platforms work in a sequence-independent manner, making them ideal tools for virus discovery. However, for their application in clinics, sample preparation or enrichment is necessary to detect low abundance virus populations. A number of techniques have also been developed for enrichment or viral nucleic acids. In this manuscript, we review the evolution of sequencing; NGS technologies available today as well as widely used virus enrichment technologies. We also discuss the challenges associated with their applications in the clinical virus discovery.
Impacts of Genome-Wide Analyses on Our Understanding of Human Herpesvirus Diversity and Evolution.
Renner, Daniel W; Szpara, Moriah L
2018-01-01
Until fairly recently, genome-wide evolutionary dynamics and within-host diversity were more commonly examined in the context of small viruses than in the context of large double-stranded DNA viruses such as herpesviruses. The high mutation rates and more compact genomes of RNA viruses have inspired the investigation of population dynamics for these species, and recent data now suggest that herpesviruses might also be considered candidates for population modeling. High-throughput sequencing (HTS) and bioinformatics have expanded our understanding of herpesviruses through genome-wide comparisons of sequence diversity, recombination, allele frequency, and selective pressures. Here we discuss recent data on the mechanisms that generate herpesvirus genomic diversity and underlie the evolution of these virus families. We focus on human herpesviruses, with key insights drawn from veterinary herpesviruses and other large DNA virus families. We consider the impacts of cell culture on herpesvirus genomes and how to accurately describe the viral populations under study. The need for a strong foundation of high-quality genomes is also discussed, since it underlies all secondary genomic analyses such as RNA sequencing (RNA-Seq), chromatin immunoprecipitation, and ribosome profiling. Areas where we foresee future progress, such as the linking of viral genetic differences to phenotypic or clinical outcomes, are highlighted as well. Copyright © 2017 Renner and Szpara.
Ellenbecker, Mary; St Goddard, Jeremy; Sundet, Alec; Lanchy, Jean-Marc; Raiford, Douglas; Lodmell, J Stephen
2015-10-01
Rift Valley fever virus (RVFV) is a potent human and livestock pathogen endemic to sub-Saharan Africa and the Arabian Peninsula that has potential to spread to other parts of the world. Although there is no proven effective and safe treatment for RVFV infections, a potential therapeutic target is the virally encoded nucleocapsid protein (N). During the course of infection, N binds to viral RNA, and perturbation of this interaction can inhibit viral replication. To gain insight into how N recognizes viral RNA specifically, we designed an algorithm that uses a distance matrix and multidimensional scaling to compare the predicted secondary structures of known N-binding RNAs, or aptamers, that were isolated and characterized in previous in vitro evolution experiment. These aptamers did not exhibit overt sequence or predicted structure similarity, so we employed bioinformatic methods to propose novel aptamers based on analysis and clustering of secondary structures. We screened and scored the predicted secondary structures of novel randomly generated RNA sequences in silico and selected several of these putative N-binding RNAs whose secondary structures were similar to those of known N-binding RNAs. We found that overall the in silico generated RNA sequences bound well to N in vitro. Furthermore, introduction of these RNAs into cells prior to infection with RVFV inhibited viral replication in cell culture. This proof of concept study demonstrates how the predictive power of bioinformatics and the empirical power of biochemistry can be jointly harnessed to discover, synthesize, and test new RNA sequences that bind tightly to RVFV N protein. The approach would be easily generalizable to other applications. Copyright © 2015 Elsevier Ltd. All rights reserved.
Fulminant hepatitis failure in adults and children from a Public Hospital in Rio de Janeiro, Brazil.
Santos, Damião Carlos Moraes dos; Martinho, José Manoel da Silva Gomes; Pacheco-Moreira, Lucio Filgueiras; Araújo, Cristina Carvalho Viana de; Oliveira, Barbara Cristina Euzebio Pereira Dias de; Lago, Barbara Vieira; Pinto, Marcelo Alves; Paula, Vanessa Salete de
2009-10-01
Fulminant hepatic failure (FHF) is characterized by massive hepatocellular injury, whose physiopathology is still unclear. Hepatitis B (HBV) is probably the most common viral cause of FHF, while hepatitis A (HAV) virus seem occurs less frequently. However, the host and viral factors that determine the outcome of these infections are poorly understood. In the present study, viral load and genotyping determining regions of HAV and HBV genomes were sequenced. Eight FHF patients and one patient with severe acute hepatitis (SAH) were included. Liver and blood samples were collected during liver transplantation or necropsy procedures. HAV-RNA and HBV-DNA were extracted from serum, biopsy and paraffin liver. Nucleotide sequencing of HAV-RNA was performed from VP1/2A and HBV-DNA from PreS/S region. The amplified samples were quantified by Real-Time PCR. The cases of HAV infection were due to subgenotype IA. The cases of HBV infection were due to genotype A2 and D4. The case of HAV/HBV coinfection was infected by genotype IA and D3. Hepatitis A and B infection were associated with genotypes most prevalent in Brazil. In hepatitis A infection the mean of period evolution was 13 days. In hepatitis B, FHF patients infected by genotype D have a shorter period of evolution than FHF patients infected by genotype A (mean 15 v. 53 days). There was no association with genotype-determining region with the severity of hepatitis, however nucleotide differences and high viral load could be observed among FHF.
A bioinformatic analysis of ribonucleotide reductase genes in phage genomes and metagenomes
2013-01-01
Background Ribonucleotide reductase (RNR), the enzyme responsible for the formation of deoxyribonucleotides from ribonucleotides, is found in all domains of life and many viral genomes. RNRs are also amongst the most abundant genes identified in environmental metagenomes. This study focused on understanding the distribution, diversity, and evolution of RNRs in phages (viruses that infect bacteria). Hidden Markov Model profiles were used to analyze the proteins encoded by 685 completely sequenced double-stranded DNA phages and 22 environmental viral metagenomes to identify RNR homologs in cultured phages and uncultured viral communities, respectively. Results RNRs were identified in 128 phage genomes, nearly tripling the number of phages known to encode RNRs. Class I RNR was the most common RNR class observed in phages (70%), followed by class II (29%) and class III (28%). Twenty-eight percent of the phages contained genes belonging to multiple RNR classes. RNR class distribution varied according to phage type, isolation environment, and the host’s ability to utilize oxygen. The majority of the phages containing RNRs are Myoviridae (65%), followed by Siphoviridae (30%) and Podoviridae (3%). The phylogeny and genomic organization of phage and host RNRs reveal several distinct evolutionary scenarios involving horizontal gene transfer, co-evolution, and differential selection pressure. Several putative split RNR genes interrupted by self-splicing introns or inteins were identified, providing further evidence for the role of frequent genetic exchange. Finally, viral metagenomic data indicate that RNRs are prevalent and highly dynamic in uncultured viral communities, necessitating future research to determine the environmental conditions under which RNRs provide a selective advantage. Conclusions This comprehensive study describes the distribution, diversity, and evolution of RNRs in phage genomes and environmental viral metagenomes. The distinct distributions of specific RNR classes amongst phages, combined with the various evolutionary scenarios predicted from RNR phylogenies suggest multiple inheritance sources and different selective forces for RNRs in phages. This study significantly improves our understanding of phage RNRs, providing insight into the diversity and evolution of this important auxiliary metabolic gene as well as the evolution of phages in response to their bacterial hosts and environments. PMID:23391036
The paradox of HBV evolution as revealed from a 16th century mummy
Duggan, Ana T.; Poinar, Debi; Poinar, Hendrik N.
2018-01-01
Hepatitis B virus (HBV) is a ubiquitous viral pathogen associated with large-scale morbidity and mortality in humans. However, there is considerable uncertainty over the time-scale of its origin and evolution. Initial shotgun data from a mid-16th century Italian child mummy, that was previously paleopathologically identified as having been infected with Variola virus (VARV, the agent of smallpox), showed no DNA reads for VARV yet did for hepatitis B virus (HBV). Previously, electron microscopy provided evidence for the presence of VARV in this sample, although similar analyses conducted here did not reveal any VARV particles. We attempted to enrich and sequence for both VARV and HBV DNA. Although we did not recover any reads identified as VARV, we were successful in reconstructing an HBV genome at 163.8X coverage. Strikingly, both the HBV sequence and that of the associated host mitochondrial DNA displayed a nearly identical cytosine deamination pattern near the termini of DNA fragments, characteristic of an ancient origin. In contrast, phylogenetic analyses revealed a close relationship between the putative ancient virus and contemporary HBV strains (of genotype D), at first suggesting contamination. In addressing this paradox we demonstrate that HBV evolution is characterized by a marked lack of temporal structure. This confounds attempts to use molecular clock-based methods to date the origin of this virus over the time-frame sampled so far, and means that phylogenetic measures alone cannot yet be used to determine HBV sequence authenticity. If genuine, this phylogenetic pattern indicates that the genotypes of HBV diversified long before the 16th century, and enables comparison of potential pathogenic similarities between modern and ancient HBV. These results have important implications for our understanding of the emergence and evolution of this common viral pathogen. PMID:29300782
Ebola Virus Epidemiology, Transmission, and Evolution during Seven Months in Sierra Leone
Park, Daniel J.; Dudas, Gytis; Wohl, Shirlee; Goba, Augustine; Whitmer, Shannon L.M.; Andersen, Kristian G.; Sealfon, Rachel S.; Ladner, Jason T.; Kugelman, Jeffrey R.; Matranga, Christian B.; Winnicki, Sarah M.; Qu, James; Gire, Stephen K.; Gladden-Young, Adrianne; Jalloh, Simbirie; Nosamiefan, Dolo; Yozwiak, Nathan L.; Moses, Lina M.; Jiang, Pan-Pan; Lin, Aaron E.; Schaffner, Stephen F.; Bird, Brian; Towner, Jonathan; Mamoh, Mambu; Gbakie, Michael; Kanneh, Lansana; Kargbo, David; Massally, James L.B.; Kamara, Fatima K.; Konuwa, Edwin; Sellu, Josephine; Jalloh, Abdul A.; Mustapha, Ibrahim; Foday, Momoh; Yillah, Mohamed; Erickson, Bobbie R.; Sealy, Tara; Blau, Dianna; Paddock, Christopher; Brault, Aaron; Amman, Brian; Basile, Jane; Bearden, Scott; Belser, Jessica; Bergeron, Eric; Campbell, Shelley; Chakrabarti, Ayan; Dodd, Kimberly; Flint, Mike; Gibbons, Aridth; Goodman, Christin; Klena, John; McMullan, Laura; Morgan, Laura; Russell, Brandy; Salzer, Johanna; Sanchez, Angela; Wang, David; Jungreis, Irwin; Tomkins-Tinch, Christopher; Kislyuk, Andrey; Lin, Michael F.; Chapman, Sinead; MacInnis, Bronwyn; Matthews, Ashley; Bochicchio, James; Hensley, Lisa E.; Kuhn, Jens H.; Nusbaum, Chad; Schieffelin, John S.; Birren, Bruce W.; Forget, Marc; Nichol, Stuart T.; Palacios, Gustavo F.; Ndiaye, Daouda; Happi, Christian; Gevao, Sahr M.; Vandi, Mohamed A.; Kargbo, Brima; Holmes, Edward C.; Bedford, Trevor; Gnirke, Andreas; Ströher, Ute; Rambaut, Andrew; Garry, Robert F.; Sabeti, Pardis C.
2015-01-01
Summary The 2013–2015 Ebola virus disease (EVD) epidemic is caused by the Makona variant of Ebola virus (EBOV). Early in the epidemic, genome sequencing provided insights into virus evolution and transmission and offered important information for outbreak response. Here, we analyze sequences from 232 patients sampled over 7 months in Sierra Leone, along with 86 previously released genomes from earlier in the epidemic. We confirm sustained human-to-human transmission within Sierra Leone and find no evidence for import or export of EBOV across national borders after its initial introduction. Using high-depth replicate sequencing, we observe both host-to-host transmission and recurrent emergence of intrahost genetic variants. We trace the increasing impact of purifying selection in suppressing the accumulation of nonsynonymous mutations over time. Finally, we note changes in the mucin-like domain of EBOV glycoprotein that merit further investigation. These findings clarify the movement of EBOV within the region and describe viral evolution during prolonged human-to-human transmission. PMID:26091036
Keele, Brandon F; Giorgi, Elena E; Salazar-Gonzalez, Jesus F; Decker, Julie M; Pham, Kimmy T; Salazar, Maria G; Sun, Chuanxi; Grayson, Truman; Wang, Shuyi; Li, Hui; Wei, Xiping; Jiang, Chunlai; Kirchherr, Jennifer L; Gao, Feng; Anderson, Jeffery A; Ping, Li-Hua; Swanstrom, Ronald; Tomaras, Georgia D; Blattner, William A; Goepfert, Paul A; Kilby, J Michael; Saag, Michael S; Delwart, Eric L; Busch, Michael P; Cohen, Myron S; Montefiori, David C; Haynes, Barton F; Gaschen, Brian; Athreya, Gayathri S; Lee, Ha Y; Wood, Natasha; Seoighe, Cathal; Perelson, Alan S; Bhattacharya, Tanmoy; Korber, Bette T; Hahn, Beatrice H; Shaw, George M
2008-05-27
The precise identification of the HIV-1 envelope glycoprotein (Env) responsible for productive clinical infection could be instrumental in elucidating the molecular basis of HIV-1 transmission and in designing effective vaccines. Here, we developed a mathematical model of random viral evolution and, together with phylogenetic tree construction, used it to analyze 3,449 complete env sequences derived by single genome amplification from 102 subjects with acute HIV-1 (clade B) infection. Viral env genes evolving from individual transmitted or founder viruses generally exhibited a Poisson distribution of mutations and star-like phylogeny, which coalesced to an inferred consensus sequence at or near the estimated time of virus transmission. Overall, 78 of 102 subjects had evidence of productive clinical infection by a single virus, and 24 others had evidence of productive clinical infection by a minimum of two to five viruses. Phenotypic analysis of transmitted or early founder Envs revealed a consistent pattern of CCR5 dependence, masking of coreceptor binding regions, and equivalent or modestly enhanced resistance to the fusion inhibitor T1249 and broadly neutralizing antibodies compared with Envs from chronically infected subjects. Low multiplicity infection and limited viral evolution preceding peak viremia suggest a finite window of potential vulnerability of HIV-1 to vaccine-elicited immune responses, although phenotypic properties of transmitted Envs pose a formidable defense.
Keele, Brandon F.; Giorgi, Elena E.; Salazar-Gonzalez, Jesus F.; Decker, Julie M.; Pham, Kimmy T.; Salazar, Maria G.; Sun, Chuanxi; Grayson, Truman; Wang, Shuyi; Li, Hui; Wei, Xiping; Jiang, Chunlai; Kirchherr, Jennifer L.; Gao, Feng; Anderson, Jeffery A.; Ping, Li-Hua; Swanstrom, Ronald; Tomaras, Georgia D.; Blattner, William A.; Goepfert, Paul A.; Kilby, J. Michael; Saag, Michael S.; Delwart, Eric L.; Busch, Michael P.; Cohen, Myron S.; Montefiori, David C.; Haynes, Barton F.; Gaschen, Brian; Athreya, Gayathri S.; Lee, Ha Y.; Wood, Natasha; Seoighe, Cathal; Perelson, Alan S.; Bhattacharya, Tanmoy; Korber, Bette T.; Hahn, Beatrice H.; Shaw, George M.
2008-01-01
The precise identification of the HIV-1 envelope glycoprotein (Env) responsible for productive clinical infection could be instrumental in elucidating the molecular basis of HIV-1 transmission and in designing effective vaccines. Here, we developed a mathematical model of random viral evolution and, together with phylogenetic tree construction, used it to analyze 3,449 complete env sequences derived by single genome amplification from 102 subjects with acute HIV-1 (clade B) infection. Viral env genes evolving from individual transmitted or founder viruses generally exhibited a Poisson distribution of mutations and star-like phylogeny, which coalesced to an inferred consensus sequence at or near the estimated time of virus transmission. Overall, 78 of 102 subjects had evidence of productive clinical infection by a single virus, and 24 others had evidence of productive clinical infection by a minimum of two to five viruses. Phenotypic analysis of transmitted or early founder Envs revealed a consistent pattern of CCR5 dependence, masking of coreceptor binding regions, and equivalent or modestly enhanced resistance to the fusion inhibitor T1249 and broadly neutralizing antibodies compared with Envs from chronically infected subjects. Low multiplicity infection and limited viral evolution preceding peak viremia suggest a finite window of potential vulnerability of HIV-1 to vaccine-elicited immune responses, although phenotypic properties of transmitted Envs pose a formidable defense. PMID:18490657
Luque, Daniel; Gómez-Blanco, Josué; Garriga, Damiá; Brilot, Axel F.; González, José M.; Havens, Wendy M.; Carrascosa, José L.; Trus, Benes L.; Verdaguer, Nuria; Ghabrial, Said A.; Castón, José R.
2014-01-01
Viruses evolve so rapidly that sequence-based comparison is not suitable for detecting relatedness among distant viruses. Structure-based comparisons suggest that evolution led to a small number of viral classes or lineages that can be grouped by capsid protein (CP) folds. Here, we report that the CP structure of the fungal dsRNA Penicillium chrysogenum virus (PcV) shows the progenitor fold of the dsRNA virus lineage and suggests a relationship between lineages. Cryo-EM structure at near-atomic resolution showed that the 982-aa PcV CP is formed by a repeated α-helical core, indicative of gene duplication despite lack of sequence similarity between the two halves. Superimposition of secondary structure elements identified a single “hotspot” at which variation is introduced by insertion of peptide segments. Structural comparison of PcV and other distantly related dsRNA viruses detected preferential insertion sites at which the complexity of the conserved α-helical core, made up of ancestral structural motifs that have acted as a skeleton, might have increased, leading to evolution of the highly varied current structures. Analyses of structural motifs only apparent after systematic structural comparisons indicated that the hallmark fold preserved in the dsRNA virus lineage shares a long (spinal) α-helix tangential to the capsid surface with the head-tailed phage and herpesvirus viral lineage. PMID:24821769
Pedersen, Casper-Emil T; Frandsen, Peter; Wekesa, Sabenzia N; Heller, Rasmus; Sangula, Abraham K; Wadsworth, Jemma; Knowles, Nick J; Muwanika, Vincent B; Siegismund, Hans R
2015-01-01
With the emergence of analytical software for the inference of viral evolution, a number of studies have focused on estimating important parameters such as the substitution rate and the time to the most recent common ancestor (tMRCA) for rapidly evolving viruses. Coupled with an increasing abundance of sequence data sampled under widely different schemes, an effort to keep results consistent and comparable is needed. This study emphasizes commonly disregarded problems in the inference of evolutionary rates in viral sequence data when sampling is unevenly distributed on a temporal scale through a study of the foot-and-mouth (FMD) disease virus serotypes SAT 1 and SAT 2. Our study shows that clustered temporal sampling in phylogenetic analyses of FMD viruses will strongly bias the inferences of substitution rates and tMRCA because the inferred rates in such data sets reflect a rate closer to the mutation rate rather than the substitution rate. Estimating evolutionary parameters from viral sequences should be performed with due consideration of the differences in short-term and longer-term evolutionary processes occurring within sets of temporally sampled viruses, and studies should carefully consider how samples are combined.
Canuti, Marta; O'Leary, Kimberly E; Hunter, Bruce D; Spearman, Grant; Ojkic, Davor; Whitney, Hugh G; Lang, Andrew S
2016-01-01
Aleutian mink disease virus (AMDV) causes plasmacytosis, an immune complex-associated syndrome that affects wild and farmed mink. The virus can also infect other small mammals (e.g., ferrets, skunks, ermines, and raccoons), but the disease in these hosts has been studied less. In 2007, a mink plasmacytosis outbreak began on the Island of Newfoundland, and the virus has been endemic in farms since then. In this study, we evaluated the molecular epidemiology of AMDV in farmed and wild animals of Newfoundland since before the beginning of the outbreak and investigated the epidemic in a global context by studying AMDV worldwide, thereby examining its diffusion and phylogeography. Furthermore, AMDV evolution was examined in the context of intensive farming, where host population dynamics strongly influence viral evolution. Partial NS1 sequences and several complete genomes were obtained from Newfoundland viruses and analyzed along with numerous sequences from other locations worldwide that were either obtained as part of this study or from public databases. We observed very high viral diversity within Newfoundland and within single farms, where high rates of co-infection, recombinant viruses and polymorphisms were observed within single infected individuals. Worldwide, we documented a partial geographic distribution of strains, where viruses from different countries co-exist within clades but form country-specific subclades. Finally, we observed the occurrence of recombination and the predominance of negative selection pressure on AMDV proteins. A surprisingly low number of immunoepitopic sites were under diversifying pressure, possibly because AMDV gains no benefit by escaping the immune response as viral entry into target cells is mediated through interactions with antibodies, which therefore contribute to cell infection. In conclusion, the high prevalence of AMDV in farms facilitates the establishment of co-infections that can favor the occurrence of recombination and enhance viral diversity. Viruses are then exchanged between different farms and countries and can be introduced into the wild, with the rapidly evolving viruses producing many parallel lineages.
Ramey, Andy M.; Reeves, Andrew; Poulson, Rebecca L.; Carter, Deborah L.; Davis-Fields, Nicholas; Stallknecht, David E.
2016-01-01
We report here the complete genome sequence of a novel H14N7 subtype influenza A virus (IAV) isolated from a blue-winged teal (Anas discors) harvested in Texas, USA. The genomic characteristics of this IAV strain with a previously undetected subtype combination suggest recent viral evolution within the New World wild-bird IAV reservoir.
The evolution of subtype B HIV-1 tat in the Netherlands during 1985-2012.
van der Kuyl, Antoinette C; Vink, Monique; Zorgdrager, Fokla; Bakker, Margreet; Wymant, Chris; Hall, Matthew; Gall, Astrid; Blanquart, François; Berkhout, Ben; Fraser, Christophe; Cornelissen, Marion
2018-05-02
For the production of viral genomic RNA, HIV-1 is dependent on an early viral protein, Tat, which is required for high-level transcription. The quantity of viral RNA detectable in blood of HIV-1 infected individuals varies dramatically, and a factor involved could be the efficiency of Tat protein variants to stimulate RNA transcription. HIV-1 virulence, measured by set-point viral load, has been observed to increase over time in the Netherlands and elsewhere. Investigation of tat gene evolution in clinical isolates could discover a role of Tat in this changing virulence. A dataset of 291 Dutch HIV-1 subtype B tat genes, derived from full-length HIV-1 genome sequences from samples obtained between 1985-2012, was used to analyse the evolution of Tat. Twenty-two patient-derived tat genes, and the control Tat HXB2 were analysed for their capacity to stimulate expression of an LTR-luciferase reporter gene construct in diverse cell lines, as well as for their ability to complement a tat-defective HIV-1 LAI clone. Analysis of 291 historical tat sequences from the Netherlands showed ample amino acid (aa) variation between isolates, although no specific mutations were selected for over time. Of note, however, the encoded protein varied its length over the years through the loss or gain of stop codons in the second exon. In transmission clusters, a selection against the shorter Tat86 ORF was apparent in favour of the more common Tat101 version, likely due to negative selection against Tat86 itself, although random drift, transmission bottlenecks, or linkage to other variants could also explain the observation. There was no correlation between Tat length and set-point viral load; however, the number of non-intermediate variants in our study was small. In addition, variation in the length of Tat did not significantly change its capacity to stimulate transcription. From 1985 till 2012, variation in the length of the HIV-1 subtype B tat gene is increasingly found in the Dutch epidemic. However, as Tat proteins did not differ significantly in their capacity to stimulate transcription elongation in vitro, the increased HIV-1 virulence seen in recent years could not be linked to an evolving viral Tat protein. Crown Copyright © 2018. Published by Elsevier B.V. All rights reserved.
Allison, Andrew B; Kohler, Dennis J; Ortega, Alicia; Hoover, Elizabeth A; Grove, Daniel M; Holmes, Edward C; Parrish, Colin R
2014-11-01
Canine parvovirus (CPV) emerged as a new pandemic pathogen of dogs in the 1970s and is closely related to feline panleukopenia virus (FPV), a parvovirus of cats and related carnivores. Although both viruses have wide host ranges, analysis of viral sequences recovered from different wild carnivore species, as shown here, demonstrated that>95% were derived from CPV-like viruses, suggesting that CPV is dominant in sylvatic cycles. Many viral sequences showed host-specific mutations in their capsid proteins, which were often close to sites known to control binding to the transferrin receptor (TfR), the host receptor for these carnivore parvoviruses, and which exhibited frequent parallel evolution. To further examine the process of host adaptation, we passaged parvoviruses with alternative backgrounds in cells from different carnivore hosts. Specific mutations were selected in several viruses and these differed depending on both the background of the virus and the host cells in which they were passaged. Strikingly, these in vitro mutations recapitulated many specific changes seen in viruses from natural populations, strongly suggesting they are host adaptive, and which were shown to result in fitness advantages over their parental virus. Comparison of the sequences of the transferrin receptors of the different carnivore species demonstrated that many mutations occurred in and around the apical domain where the virus binds, indicating that viral variants were likely selected through their fit to receptor structures. Some of the viruses accumulated high levels of variation upon passage in alternative hosts, while others could infect multiple different hosts with no or only a few additional mutations. Overall, these studies demonstrate that the evolutionary history of a virus, including how long it has been circulating and in which hosts, as well as its phylogenetic background, has a profound effect on determining viral host range.
Allison, Andrew B.; Kohler, Dennis J.; Ortega, Alicia; Hoover, Elizabeth A.; Grove, Daniel M.; Holmes, Edward C.; Parrish, Colin R.
2014-01-01
Canine parvovirus (CPV) emerged as a new pandemic pathogen of dogs in the 1970s and is closely related to feline panleukopenia virus (FPV), a parvovirus of cats and related carnivores. Although both viruses have wide host ranges, analysis of viral sequences recovered from different wild carnivore species, as shown here, demonstrated that >95% were derived from CPV-like viruses, suggesting that CPV is dominant in sylvatic cycles. Many viral sequences showed host-specific mutations in their capsid proteins, which were often close to sites known to control binding to the transferrin receptor (TfR), the host receptor for these carnivore parvoviruses, and which exhibited frequent parallel evolution. To further examine the process of host adaptation, we passaged parvoviruses with alternative backgrounds in cells from different carnivore hosts. Specific mutations were selected in several viruses and these differed depending on both the background of the virus and the host cells in which they were passaged. Strikingly, these in vitro mutations recapitulated many specific changes seen in viruses from natural populations, strongly suggesting they are host adaptive, and which were shown to result in fitness advantages over their parental virus. Comparison of the sequences of the transferrin receptors of the different carnivore species demonstrated that many mutations occurred in and around the apical domain where the virus binds, indicating that viral variants were likely selected through their fit to receptor structures. Some of the viruses accumulated high levels of variation upon passage in alternative hosts, while others could infect multiple different hosts with no or only a few additional mutations. Overall, these studies demonstrate that the evolutionary history of a virus, including how long it has been circulating and in which hosts, as well as its phylogenetic background, has a profound effect on determining viral host range. PMID:25375184
Croville, Guillaume; Soubies, Sébastien Mathieu; Barbieri, Johanna; Klopp, Christophe; Mariette, Jérôme; Bouchez, Olivier; Camus-Bouclainville, Christelle
2012-01-01
Adaptation of avian influenza viruses (AIVs) from waterfowl to domestic poultry with a deletion in the neuraminidase (NA) stalk has already been reported. The way the virus undergoes this evolution, however, is thus far unclear. We address this question using pyrosequencing of duck and turkey low-pathogenicity AIVs. Ducks and turkeys were sampled at the very beginning of an H6N1 outbreak, and turkeys were swabbed again 8 days later. NA stalk deletions were evidenced in turkeys by Sanger sequencing. To further investigate viral evolution, 454 pyrosequencing was performed: for each set of samples, up to 41,500 reads of ca. 400 bp were generated and aligned. Genetic polymorphisms between duck and turkey viruses were tracked on the whole genome. NA deletion was detected in less than 2% of reads in duck feces but in 100% of reads in turkey tracheal specimens collected at the same time. Further variations in length were observed in NA from turkeys 8 days later. Similarly, minority mutants emerged on the hemagglutinin (HA) gene, with substitutions mostly in the receptor binding site on the globular head. These critical changes suggest a strong evolutionary pressure in turkeys. The increasing performances of next-generation sequencing technologies should enable us to monitor the genomic diversity of avian influenza viruses and early emergence of potentially pathogenic variants within bird flocks. The present study, based on 454 pyrosequencing, suggests that NA deletion, an example of AIV adaptation from waterfowl to domestic poultry, occurs by selection rather than de novo emergence of viral mutants. PMID:22718944
Goodacre, Norman; Aljanahi, Aisha; Nandakumar, Subhiksha; Mikailov, Mike
2018-01-01
ABSTRACT Detection of distantly related viruses by high-throughput sequencing (HTS) is bioinformatically challenging because of the lack of a public database containing all viral sequences, without abundant nonviral sequences, which can extend runtime and obscure viral hits. Our reference viral database (RVDB) includes all viral, virus-related, and virus-like nucleotide sequences (excluding bacterial viruses), regardless of length, and with overall reduced cellular sequences. Semantic selection criteria (SEM-I) were used to select viral sequences from GenBank, resulting in a first-generation viral database (VDB). This database was manually and computationally reviewed, resulting in refined, semantic selection criteria (SEM-R), which were applied to a new download of updated GenBank sequences to create a second-generation VDB. Viral entries in the latter were clustered at 98% by CD-HIT-EST to reduce redundancy while retaining high viral sequence diversity. The viral identity of the clustered representative sequences (creps) was confirmed by BLAST searches in NCBI databases and HMMER searches in PFAM and DFAM databases. The resulting RVDB contained a broad representation of viral families, sequence diversity, and a reduced cellular content; it includes full-length and partial sequences and endogenous nonretroviral elements, endogenous retroviruses, and retrotransposons. Testing of RVDBv10.2, with an in-house HTS transcriptomic data set indicated a significantly faster run for virus detection than interrogating the entirety of the NCBI nonredundant nucleotide database, which contains all viral sequences but also nonviral sequences. RVDB is publically available for facilitating HTS analysis, particularly for novel virus detection. It is meant to be updated on a regular basis to include new viral sequences added to GenBank. IMPORTANCE To facilitate bioinformatics analysis of high-throughput sequencing (HTS) data for the detection of both known and novel viruses, we have developed a new reference viral database (RVDB) that provides a broad representation of different virus species from eukaryotes by including all viral, virus-like, and virus-related sequences (excluding bacteriophages), regardless of their size. In particular, RVDB contains endogenous nonretroviral elements, endogenous retroviruses, and retrotransposons. Sequences were clustered to reduce redundancy while retaining high viral sequence diversity. A particularly useful feature of RVDB is the reduction of cellular sequences, which can enhance the run efficiency of large transcriptomic and genomic data analysis and increase the specificity of virus detection. PMID:29564396
Goodacre, Norman; Aljanahi, Aisha; Nandakumar, Subhiksha; Mikailov, Mike; Khan, Arifa S
2018-01-01
Detection of distantly related viruses by high-throughput sequencing (HTS) is bioinformatically challenging because of the lack of a public database containing all viral sequences, without abundant nonviral sequences, which can extend runtime and obscure viral hits. Our reference viral database (RVDB) includes all viral, virus-related, and virus-like nucleotide sequences (excluding bacterial viruses), regardless of length, and with overall reduced cellular sequences. Semantic selection criteria (SEM-I) were used to select viral sequences from GenBank, resulting in a first-generation viral database (VDB). This database was manually and computationally reviewed, resulting in refined, semantic selection criteria (SEM-R), which were applied to a new download of updated GenBank sequences to create a second-generation VDB. Viral entries in the latter were clustered at 98% by CD-HIT-EST to reduce redundancy while retaining high viral sequence diversity. The viral identity of the clustered representative sequences (creps) was confirmed by BLAST searches in NCBI databases and HMMER searches in PFAM and DFAM databases. The resulting RVDB contained a broad representation of viral families, sequence diversity, and a reduced cellular content; it includes full-length and partial sequences and endogenous nonretroviral elements, endogenous retroviruses, and retrotransposons. Testing of RVDBv10.2, with an in-house HTS transcriptomic data set indicated a significantly faster run for virus detection than interrogating the entirety of the NCBI nonredundant nucleotide database, which contains all viral sequences but also nonviral sequences. RVDB is publically available for facilitating HTS analysis, particularly for novel virus detection. It is meant to be updated on a regular basis to include new viral sequences added to GenBank. IMPORTANCE To facilitate bioinformatics analysis of high-throughput sequencing (HTS) data for the detection of both known and novel viruses, we have developed a new reference viral database (RVDB) that provides a broad representation of different virus species from eukaryotes by including all viral, virus-like, and virus-related sequences (excluding bacteriophages), regardless of their size. In particular, RVDB contains endogenous nonretroviral elements, endogenous retroviruses, and retrotransposons. Sequences were clustered to reduce redundancy while retaining high viral sequence diversity. A particularly useful feature of RVDB is the reduction of cellular sequences, which can enhance the run efficiency of large transcriptomic and genomic data analysis and increase the specificity of virus detection.
Lewis, Nicola S.; Verhagen, Josanne H.; Javakhishvili, Zurab; Russell, Colin A.; Lexmond, Pascal; Westgeest, Kim B.; Bestebroer, Theo M.; Halpin, Rebecca A.; Lin, Xudong; Ransier, Amy; Fedorova, Nadia B.; Stockwell, Timothy B.; Latorre-Margalef, Neus; Olsen, Björn; Smith, Gavin; Bahl, Justin; Wentworth, David E.; Waldenström, Jonas; Fouchier, Ron A. M.
2015-01-01
Low pathogenic avian influenza A viruses (IAVs) have a natural host reservoir in wild waterbirds and the potential to spread to other host species. Here, we investigated the evolutionary, spatial and temporal dynamics of avian IAVs in Eurasian wild birds. We used whole-genome sequences collected as part of an intensive long-term Eurasian wild bird surveillance study, and combined this genetic data with temporal and spatial information to explore the virus evolutionary dynamics. Frequent reassortment and co-circulating lineages were observed for all eight genomic RNA segments over time. There was no apparent species-specific effect on the diversity of the avian IAVs. There was a spatial and temporal relationship between the Eurasian sequences and significant viral migration of avian IAVs from West Eurasia towards Central Eurasia. The observed viral migration patterns differed between segments. Furthermore, we discuss the challenges faced when analysing these surveillance and sequence data, and the caveats to be borne in mind when drawing conclusions from the apparent results of such analyses. PMID:25904147
Convergent evolution and mimicry of protein linear motifs in host-pathogen interactions.
Chemes, Lucía Beatriz; de Prat-Gay, Gonzalo; Sánchez, Ignacio Enrique
2015-06-01
Pathogen linear motif mimics are highly evolvable elements that facilitate rewiring of host protein interaction networks. Host linear motifs and pathogen mimics differ in sequence, leading to thermodynamic and structural differences in the resulting protein-protein interactions. Moreover, the functional output of a mimic depends on the motif and domain repertoire of the pathogen protein. Regulatory evolution mediated by linear motifs can be understood by measuring evolutionary rates, quantifying positive and negative selection and performing phylogenetic reconstructions of linear motif natural history. Convergent evolution of linear motif mimics is widespread among unrelated proteins from viral, prokaryotic and eukaryotic pathogens and can also take place within individual protein phylogenies. Statistics, biochemistry and laboratory models of infection link pathogen linear motifs to phenotypic traits such as tropism, virulence and oncogenicity. In vitro evolution experiments and analysis of natural sequences suggest that changes in linear motif composition underlie pathogen adaptation to a changing environment. Copyright © 2015 Elsevier Ltd. All rights reserved.
Emmenegger, E.J.; Kurath, G.
2002-01-01
Infectious hematopoietic necrosis virus (IHNV) is a pathogen that infects many Pacific salmonid stocks from the watersheds of North America. Previous studies have thoroughly characterized the genetic diversity of IHNV isolates from Alaska and the Hagerman Valley in Idaho. To enhance understanding of the evolution and viral transmission patterns of IHNV within the Pacific Northwest geographic range, we analyzed the G gene of IHNV isolates from the coastal watersheds of Washington State by ribonuclease protection assay (RPA) and nucleotide sequencing. The RPA analysis of 23 isolates indicated that the Skagit basin IHNV isolates were relatively homogeneous as a result of the dominance of one G gene haplotype (S). Sequence analysis of 303 bases in the middle of the G gene (midG region) of 61 isolates confirmed the high frequency of a Skagit River basin sequence and identified another sequence commonly found in isolates from the Lake Washington basin. Overall, both the RPA and sequence analysis showed that the Washington coastal IHNV isolates are genetically homogeneous and have little genetic diversity. This is similar to the genetic diversity pattern of IHNV from Alaska and contrasts sharply with the high genetic diversity demonstrated for IHNV isolates from fish farms along the Snake River in Idaho. The high degree of sequence and haplotype similarity between the Washington coastal IHNV isolates and those from Alaska and British Columbia suggests that they have a common viral ancestor. Phylogenetic analyses of the isolates we studied and those from different regions throughout the virus's geographic range confirms a conserved pattern of evolution of the virus in salmonid stocks north of the Columbia River, which forms Washington's southern border.
NASA Astrophysics Data System (ADS)
Takeda, Haruhiko; Ueda, Yoshihide; Inuzuka, Tadashi; Yamashita, Yukitaka; Osaki, Yukio; Nasu, Akihiro; Umeda, Makoto; Takemura, Ryo; Seno, Hiroshi; Sekine, Akihiro; Marusawa, Hiroyuki
2017-03-01
Resistance-associated variant (RAV) is one of the most significant clinical challenges in treating HCV-infected patients with direct-acting antivirals (DAAs). We investigated the viral dynamics in patients receiving DAAs using third-generation sequencing technology. Among 283 patients with genotype-1b HCV receiving daclatasvir + asunaprevir (DCV/ASV), 32 (11.3%) failed to achieve sustained virological response (SVR). Conventional ultra-deep sequencing of HCV genome was performed in 104 patients (32 non-SVR, 72 SVR), and detected representative RAVs in all non-SVR patients at baseline, including Y93H in 28 (87.5%). Long contiguous sequences spanning NS3 to NS5A regions of each viral clone in 12 sera from 6 representative non-SVR patients were determined by third-generation sequencing, and showed the concurrent presence of several synonymous mutations linked to resistance-associated substitutions in a subpopulation of pre-existing RAVs and dominant isolates at treatment failure. Phylogenetic analyses revealed close genetic distances between pre-existing RAVs and dominant RAVs at treatment failure. In addition, multiple drug-resistant mutations developed on pre-existing RAVs after DCV/ASV in all non-SVR cases. In conclusion, multi-drug resistant viral clones at treatment failure certainly originated from a subpopulation of pre-existing RAVs in HCV-infected patients. Those RAVs were selected for and became dominant with the acquisition of multiple resistance-associated substitutions under DAA treatment pressure.
Fun, Axel; Leitner, Thomas; Vandekerckhove, Linos; Däumer, Martin; Thielen, Alexander; Buchholz, Bernd; Hoepelman, Andy I M; Gisolf, Elizabeth H; Schipper, Pauline J; Wensing, Annemarie M J; Nijhuis, Monique
2018-01-05
Emergence of resistance against integrase inhibitor raltegravir in human immunodeficiency virus type 1 (HIV-1) patients is generally associated with selection of one of three signature mutations: Y143C/R, Q148K/H/R or N155H, representing three distinct resistance pathways. The mechanisms that drive selection of a specific pathway are still poorly understood. We investigated the impact of the HIV-1 genetic background and population dynamics on the emergence of raltegravir resistance. Using deep sequencing we analyzed the integrase coding sequence (CDS) in longitudinal samples from five patients who initiated raltegravir plus optimized background therapy at viral loads > 5000 copies/ml. To investigate the role of the HIV-1 genetic background we created recombinant viruses containing the viral integrase coding region from pre-raltegravir samples from two patients in whom raltegravir resistance developed through different pathways. The in vitro selections performed with these recombinant viruses were designed to mimic natural population bottlenecks. Deep sequencing analysis of the viral integrase CDS revealed that the virological response to raltegravir containing therapy inversely correlated with the relative amount of unique sequence variants that emerged suggesting diversifying selection during drug pressure. In 4/5 patients multiple signature mutations representing different resistance pathways were observed. Interestingly, the resistant population can consist of a single resistant variant that completely dominates the population but also of multiple variants from different resistance pathways that coexist in the viral population. We also found evidence for increased diversification after stronger bottlenecks. In vitro selections with low viral titers, mimicking population bottlenecks, revealed that both recombinant viruses and HXB2 reference virus were able to select mutations from different resistance pathways, although typically only one resistance pathway emerged in each individual culture. The generation of a specific raltegravir resistant variant is not predisposed in the genetic background of the viral integrase CDS. Typically, in the early phases of therapy failure the sequence space is explored and multiple resistance pathways emerge and then compete for dominance which frequently results in a switch of the dominant population over time towards the fittest variant or even multiple variants of similar fitness that can coexist in the viral population.
Nelson, Martha I.; Stucker, Karla M.; Schobel, Seth A.; Dugan, Vivien G.; Nelson, Sarah W.; Sreevatsan, Srinand; Killian, Mary L.; Nolting, Jacqueline M.; Wentworth, David E.
2016-01-01
ABSTRACT The swine-human interface created at agricultural fairs, along with the generation of and maintenance of influenza A virus diversity in exhibition swine, presents an ongoing threat to public health. Nucleotide sequences of influenza A virus isolates collected from exhibition swine in Ohio (n = 262) and Indiana (n = 103) during 2009 to 2013 were used to investigate viral evolution and movement within this niche sector of the swine industry. Phylogenetic and Bayesian analyses were employed to identify introductions of influenza A virus to exhibition swine and study viral population dynamics. In 2013 alone, we identified 10 independent introductions of influenza A virus into Ohio and/or Indiana exhibition swine. Frequently, viruses from the same introduction were identified at multiple fairs within the region, providing evidence of rapid and widespread viral movement within the exhibition swine populations of the two states. While pigs moving from fair to fair to fair is possible in some locations, the concurrent detection of nearly identical strains at several fairs indicates that a common viral source was more likely. Importantly, we detected an association between the high number of human variant H3N2 (H3N2v) virus infections in 2012 and the widespread circulation of influenza A viruses of the same genotype in exhibition swine in Ohio fairs sampled that year. The extent of viral diversity observed in exhibition swine and the rapidity with which it disseminated across long distances indicate that novel strains of influenza A virus will continue to emerge and spread within exhibition swine populations, presenting an ongoing threat to humans. IMPORTANCE Understanding the underlying population dynamics of influenza A viruses in commercial and exhibition swine is central to assessing the risk for human infections with variant viruses, including H3N2v. We used viral genomic sequences from isolates collected from exhibition swine during 2009 to 2013 to understand how the peak of H3N2v cases in 2012 relates to long-term trends in the population dynamics of pandemic viruses recently introduced into commercial and exhibition swine in the United States. The results of our spatial analysis underscore the key role of rapid viral dispersal in spreading multiple genetic lineages throughout a multistate network of agricultural fairs, providing opportunities for divergent lineages to coinfect, reassort, and generate new viral genotypes. The higher genetic diversity of genotypes cocirculating in exhibition swine since 2013 could facilitate the evolution of new reassortants, potentially with even greater ability to cause severe infections in humans or cause human-to-human transmission, highlighting the need for continued vigilance. PMID:27681134
PHYLOSCANNER: Inferring Transmission from Within- and Between-Host Pathogen Genetic Diversity
Hall, Matthew; Ratmann, Oliver; Bonsall, David; Golubchik, Tanya; de Cesare, Mariateresa; Gall, Astrid; Cornelissen, Marion; Fraser, Christophe
2018-01-01
Abstract A central feature of pathogen genomics is that different infectious particles (virions and bacterial cells) within an infected individual may be genetically distinct, with patterns of relatedness among infectious particles being the result of both within-host evolution and transmission from one host to the next. Here, we present a new software tool, phyloscanner, which analyses pathogen diversity from multiple infected hosts. phyloscanner provides unprecedented resolution into the transmission process, allowing inference of the direction of transmission from sequence data alone. Multiply infected individuals are also identified, as they harbor subpopulations of infectious particles that are not connected by within-host evolution, except where recombinant types emerge. Low-level contamination is flagged and removed. We illustrate phyloscanner on both viral and bacterial pathogens, namely HIV-1 sequenced on Illumina and Roche 454 platforms, HCV sequenced with the Oxford Nanopore MinION platform, and Streptococcus pneumoniae with sequences from multiple colonies per individual. phyloscanner is available from https://github.com/BDI-pathogens/phyloscanner. PMID:29186559
Genome-Wide Networks of Amino Acid Covariances Are Common among Viruses
Donlin, Maureen J.; Szeto, Brandon; Gohara, David W.; Aurora, Rajeev
2012-01-01
Coordinated variation among positions in amino acid sequence alignments can reveal genetic dependencies at noncontiguous positions, but methods to assess these interactions are incompletely developed. Previously, we found genome-wide networks of covarying residue positions in the hepatitis C virus genome (R. Aurora, M. J. Donlin, N. A. Cannon, and J. E. Tavis, J. Clin. Invest. 119:225–236, 2009). Here, we asked whether such networks are present in a diverse set of viruses and, if so, what they may imply about viral biology. Viral sequences were obtained for 16 viruses in 13 species from 9 families. The entire viral coding potential for each virus was aligned, all possible amino acid covariances were identified using the observed-minus-expected-squared algorithm at a false-discovery rate of ≤1%, and networks of covariances were assessed using standard methods. Covariances that spanned the viral coding potential were common in all viruses. In all cases, the covariances formed a single network that contained essentially all of the covariances. The hepatitis C virus networks had hub-and-spoke topologies, but all other networks had random topologies with an unusually large number of highly connected nodes. These results indicate that genome-wide networks of genetic associations and the coordinated evolution they imply are very common in viral genomes, that the networks rarely have the hub-and-spoke topology that dominates other biological networks, and that network topologies can vary substantially even within a given viral group. Five examples with hepatitis B virus and poliovirus are presented to illustrate how covariance network analysis can lead to inferences about viral biology. PMID:22238298
Bats, Primates, and the Evolutionary Origins and Diversification of Mammalian Gammaherpesviruses
Rojas-Anaya, Edith; Kolokotronis, Sergios-Orestis; Taboada, Blanca; Loza-Rubio, Elizabeth; Méndez-Ojeda, Maria L.; Osterrieder, Nikolaus
2016-01-01
ABSTRACT Gammaherpesviruses (γHVs) are generally considered host specific and to have codiverged with their hosts over millions of years. This tenet is challenged here by broad-scale phylogenetic analysis of two viral genes using the largest sample of mammalian γHVs to date, integrating for the first time bat γHV sequences available from public repositories and newly generated viral sequences from two vampire bat species (Desmodus rotundus and Diphylla ecaudata). Bat and primate viruses frequently represented deep branches within the supported phylogenies and clustered among viruses from distantly related mammalian taxa. Following evolutionary scenario testing, we determined the number of host-switching and cospeciation events. Cross-species transmissions have occurred much more frequently than previously estimated, and most of the transmissions were attributable to bats and primates. We conclude that the evolution of the Gammaherpesvirinae subfamily has been driven by both cross-species transmissions and subsequent cospeciation within specific viral lineages and that the bat and primate orders may have potentially acted as superspreaders to other mammalian taxa throughout evolutionary history. PMID:27834200
Molecular characterization of occult hepatitis B virus in genotype E-infected subjects.
Zahn, Astrid; Li, Chengyao; Danso, Kwabena; Candotti, Daniel; Owusu-Ofori, Shirley; Temple, Jillian; Allain, Jean-Pierre
2008-02-01
Occult hepatitis B virus (HBV) infection (OBI), defined as the presence of HBV DNA without detectable HBV surface antigen (HBsAg), is frequent in west Africa, where genotype E is prevalent. The prevalence of OBI in 804 blood donors and 1368 pregnant women was 1.7 and 1.5%, respectively. Nine of 32 OBI carriers were evaluated with HBV serology, viral load and complete HBV genome sequence of two to five clones. All samples except one were anti-HBV core antigen-positive and three contained antibodies against HBsAg (anti-HBs). All strains were of genotype E and formed quasispecies with 0.20-1.28% intra-sample sequence variation. Few uncommon mutations (absent in 23 genotype E reference sequences) were found across the entire genome. Two mutations in the core region encoded truncated or abnormal capsid protein, potentially affecting viral production, but were probably rescued by non-mutated variants, as found in one clone. No evidence of escape mutants was found in anti-HBs-carrying samples, as the 'a' region was consistently wild type. OBI carriers constitute approximately 10% of all HBV DNA-viraemic adult Ghanaians. OBI carriers appear as a disparate group, with a very low viral load in common, but multiple origins reflecting decades of natural evolution in an area essentially devoid of human intervention.
Genetic diversity and epidemiology of infectious hematopoietic necrosis virus in Alaska
Emmenegger, E.G; Meyers, T.R.; Burton, T.O.; Kurath, G.
2000-01-01
Forty-two infectious hematopoietic necrosis virus (IHNV) isolates from Alaska were analyzed using the ribonuclease protection assay (RPA) and nucleotide sequencing. RPA analyses, utilizing 4 probes, N5, N3 (N gene), GF (G gene), and NV (NV gene), determined that the haplotypes of all 3 genes demonstrated a consistent spatial pattern. Virus isolates belonging to the most common haplotype groups were distributed throughout Alaska, whereas isolates in small haplotype groups were obtained from only 1 site (hatchery, lake, etc.). The temporal pattern of the GF haplotypes suggested a 'genetic acclimation' of the G gene, possibly due to positive selection on the glycoprotein. A pairwise comparison of the sequence data determined that the maximum nucleotide diversity of the isolates was 2.75% (10 mismatches) for the NV gene, and 1.99% (6 mismatches) for a 301 base pair region of the G gene, indicating that the genetic diversity of IHNV within Alaska is notably lower than in the more southern portions of the IHNV North American range. Phylogenetic analysis of representative Alaskan sequences and sequences of 12 previously characterized IHNV strains from Washington, Oregon, Idaho, California (USA) and British Columbia (Canada) distinguished the isolates into clusters that correlated with geographic origin and indicated that the Alaskan and British Columbia isolates may have a common viral ancestral lineage. Comparisons of multiple isolates from the same site provided epidemiological insights into viral transmission patterns and indicated that viral evolution, viral introduction, and genetic stasis were the mechanisms involved with IHN virus population dynamics in Alaska. The examples of genetic stasis and the overall low sequence heterogeneity of the Alaskan isolates suggested that they are evolutionarily constrained. This study establishes a baseline of genetic fingerprint patterns and sequence groups representing the genetic diversity of Alaskan IHNV isolates. This information could be used to determine the source of an IHN outbreak and to facilitate decisions in fisheries management of Alaskan salmonid stocks.
Wagner, Andreas
2014-07-07
Networks of evolving genotypes can be constructed from the worldwide time-resolved genotyping of pathogens like influenza viruses. Such genotype networks are graphs where neighbouring vertices (viral strains) differ in a single nucleotide or amino acid. A rich trove of network analysis methods can help understand the evolutionary dynamics reflected in the structure of these networks. Here, I analyse a genotype network comprising hundreds of influenza A (H3N2) haemagglutinin genes. The network is rife with cycles that reflect non-random parallel or convergent (homoplastic) evolution. These cycles also show patterns of sequence change characteristic for strong and local evolutionary constraints, positive selection and mutation-limited evolution. Such cycles would not be visible on a phylogenetic tree, illustrating that genotype network analysis can complement phylogenetic analyses. The network also shows a distinct modular or community structure that reflects temporal more than spatial proximity of viral strains, where lowly connected bridge strains connect different modules. These and other organizational patterns illustrate that genotype networks can help us study evolution in action at an unprecedented level of resolution. © 2014 The Author(s) Published by the Royal Society. All rights reserved.
Physical mode of bacteria and virus coevolution
NASA Astrophysics Data System (ADS)
Han, Pu; Niestemski, Liang; Deem, Michael
2013-03-01
Single-cell hosts such as bacteria or archaea possess an adaptive, heritable immune system that protects them from viral invasion. This system, known as the CRISPR-Cas system, allows the host to recognize and incorporate short foreign DNA or RNA sequences from viruses or plasmids. The sequences form what are called ``spacers'' in the CRISPR. Spacers in the CRISPR loci provide a record of the host and predator coevolution history. We develop a physical model to study the dynamics of this coevolution due to immune pressure. Hosts and viruses reproduce, die, and evolve due to viral infection pressure, host immune pressure, and mutation. We will discuss the differing effects of point mutation and recombination on CRISPR evolution. We will also discuss the effect of different spacer deletion mechanisms. We will describe population structure of hosts and viruses, how spacer diversity depends on position within CRISPR, and match of the CRISPR spacers to the virus population.
Recombination and Population Mosaic of a Multifunctional Viral Gene, Adeno-Associated Virus cap
Takeuchi, Yasuhiro; Myers, Richard; Danos, Olivier
2008-01-01
Homologous recombination is a dominant force in evolution and results in genetic mosaics. To detect evidence of recombination events and assess the biological significance of genetic mosaics, genome sequences for various viral populations of reasonably large size are now available in the GenBank. We studied a multi-functional viral gene, the adeno-associated virus (AAV) cap gene, which codes for three capsid proteins, VP1, VP2 and VP3. VP1-3 share a common C-terminal domain corresponding to VP3, which forms the viral core structure, while the VP1 unique N-terminal part contains an enzymatic domain with phospholipase A2 activity. Our recombinant detection program (RecI) revealed five novel recombination events, four of which have their cross-over points in the N-terminal, VP1 and VP2 unique region. Comparison of phylogenetic trees for different cap gene regions confirmed discordant phylogenies for the recombinant sequences. Furthermore, differences in the phylogenetic tree structures for the VP1 unique (VP1u) region and the rest of cap highlighted the mosaic nature of cap gene in the AAV population: two dominant forms of VP1u sequences were identified and these forms are linked to diverse sequences in the rest of cap gene. This observation together with the finding of frequent recombination in the VP1 and 2 unique regions suggests that this region is a recombination hot spot. Recombination events in this region preserve protein blocks of distinctive functions and contribute to convergence in VP1u and divergence of the rest of cap. Additionally the possible biological significance of two dominant VP1u forms is inferred. PMID:18286191
DNA transposons have colonized the genome of the giant virus Pandoravirus salinus.
Sun, Cheng; Feschotte, Cédric; Wu, Zhiqiang; Mueller, Rachel Lockridge
2015-06-12
Transposable elements are mobile DNA sequences that are widely distributed in prokaryotic and eukaryotic genomes, where they represent a major force in genome evolution. However, transposable elements have rarely been documented in viruses, and their contribution to viral genome evolution remains largely unexplored. Pandoraviruses are recently described DNA viruses with genome sizes that exceed those of some prokaryotes, rivaling parasitic eukaryotes. These large genomes appear to include substantial noncoding intergenic spaces, which provide potential locations for transposable element insertions. However, no mobile genetic elements have yet been reported in pandoravirus genomes. Here, we report a family of miniature inverted-repeat transposable elements (MITEs) in the Pandoravirus salinus genome, representing the first description of a virus populated with a canonical transposable element family that proliferated by transposition within the viral genome. The MITE family, which we name Submariner, includes 30 copies with all the hallmarks of MITEs: short length, terminal inverted repeats, TA target site duplication, and no coding capacity. Submariner elements show signs of transposition and are undetectable in the genome of Pandoravirus dulcis, the closest known relative Pandoravirus salinus. We identified a DNA transposon related to Submariner in the genome of Acanthamoeba castellanii, a species thought to host pandoraviruses, which contains remnants of coding sequence for a Tc1/mariner transposase. These observations suggest that the Submariner MITEs of P. salinus belong to the widespread Tc1/mariner superfamily and may have been mobilized by an amoebozoan host. Ten of the 30 MITEs in the P. salinus genome are located within coding regions of predicted genes, while others are close to genes, suggesting that these transposons may have contributed to viral genetic novelty. Our discovery highlights the remarkable ability of DNA transposons to colonize and shape genomes from all domains of life, as well as giant viruses. Our findings continue to blur the division between viral and cellular genomes, adhering to the emerging view that the content, dynamics, and evolution of the genomes of giant viruses do not substantially differ from those of cellular organisms.
[Molecular evolution of the tick-borne encephalitis and Powassan viruses].
Subbotina, E L; Loktev, V B
2012-01-01
The problem of emerging viruses, their genetic diversity and viral evolution in nature are attracting more attention. The phylogenetic analysis and evaluationary rate estimation were made for pathogenic flaviviruses such as tick-borne encephalitis virus (TBEV) and Powassan (PV) circulated in natural foci in Russia. 47 nucleotide sequences of encoded protein E of the TBEV and 17 sequences of NS5 genome region of the PV have been used. It was found that the rate of accumulation of nucleotide substitutions for E genome region of TBEV was approximately 1.4 x 10(-4) and 5.4 x 10(-5) substitutions per site per year for NS5 genome region of PV. The ratio of non-synonymous nucleotide substitutions to synonymous substitution (dN/dS) for viral sequences were estimated of 0.049 for TBEV and 0.098 for PV. Maximum value dN/dS was 0.201-0.220 for sub-cluster of Russian and Canadian strains of PV and the minimum - 0.024 for cluster of Russian and Chinese strains of Far Eastern genotype TBEV. Evaluation of time intervals of evolutionary events associated with these viruses showed that European subtype TBEV are diverged from all-TBEV ancestor within approximately 2750 years and the Siberian and Far Eastern subtypes are emerged about 2250 years ago. The PV was introduced into natural foci of the Primorsky Krai of Russia only about 70 years ago and PV is a very close to Canadian strains of PV. Evolutionary picture for PV in North America is similar to evolution of Siberian and Far Eastern subtypes TBEV in Asia. The divergence time for main genetic groups of TBEV and PV are correlated with historical periods of warming and cooling. These allow to propose a hypothesis that climate changes were essential to the evolution of the flaviviruses in the past millenniums.
Yu, Xiaobo; Bian, Xiaofang; Throop, Andrea; Song, Lusheng; Moral, Lerys Del; Park, Jin; Seiler, Catherine; Fiacco, Michael; Steel, Jason; Hunter, Preston; Saul, Justin; Wang, Jie; Qiu, Ji; Pipas, James M.; LaBaer, Joshua
2014-01-01
Throughout the long history of virus-host co-evolution, viruses have developed delicate strategies to facilitate their invasion and replication of their genome, while silencing the host immune responses through various mechanisms. The systematic characterization of viral protein-host interactions would yield invaluable information in the understanding of viral invasion/evasion, diagnosis and therapeutic treatment of a viral infection, and mechanisms of host biology. With more than 2,000 viral genomes sequenced, only a small percent of them are well investigated. The access of these viral open reading frames (ORFs) in a flexible cloning format would greatly facilitate both in vitro and in vivo virus-host interaction studies. However, the overall progress of viral ORF cloning has been slow. To facilitate viral studies, we are releasing the initiation of our panviral proteome collection of 2,035 ORF clones from 830 viral genes in the Gateway® recombinational cloning system. Here, we demonstrate several uses of our viral collection including highly efficient production of viral proteins using human cell-free expression system in vitro, global identification of host targets for rubella virus using Nucleic Acid Programmable Protein Arrays (NAPPA) containing 10,000 unique human proteins, and detection of host serological responses using micro-fluidic multiplexed immunoassays. The studies presented here begin to elucidate host-viral protein interactions with our systemic utilization of viral ORFs, high-throughput cloning, and proteomic technologies. These valuable plasmid resources will be available to the research community to enable continued viral functional studies. PMID:24955142
Yu, Xiaobo; Bian, Xiaofang; Throop, Andrea; Song, Lusheng; Moral, Lerys Del; Park, Jin; Seiler, Catherine; Fiacco, Michael; Steel, Jason; Hunter, Preston; Saul, Justin; Wang, Jie; Qiu, Ji; Pipas, James M; LaBaer, Joshua
2014-01-01
Throughout the long history of virus-host co-evolution, viruses have developed delicate strategies to facilitate their invasion and replication of their genome, while silencing the host immune responses through various mechanisms. The systematic characterization of viral protein-host interactions would yield invaluable information in the understanding of viral invasion/evasion, diagnosis and therapeutic treatment of a viral infection, and mechanisms of host biology. With more than 2,000 viral genomes sequenced, only a small percent of them are well investigated. The access of these viral open reading frames (ORFs) in a flexible cloning format would greatly facilitate both in vitro and in vivo virus-host interaction studies. However, the overall progress of viral ORF cloning has been slow. To facilitate viral studies, we are releasing the initiation of our panviral proteome collection of 2,035 ORF clones from 830 viral genes in the Gateway® recombinational cloning system. Here, we demonstrate several uses of our viral collection including highly efficient production of viral proteins using human cell-free expression system in vitro, global identification of host targets for rubella virus using Nucleic Acid Programmable Protein Arrays (NAPPA) containing 10,000 unique human proteins, and detection of host serological responses using micro-fluidic multiplexed immunoassays. The studies presented here begin to elucidate host-viral protein interactions with our systemic utilization of viral ORFs, high-throughput cloning, and proteomic technologies. These valuable plasmid resources will be available to the research community to enable continued viral functional studies.
Detection of porcine circovirus type 2 in pigs imported from Indonesia.
Manokaran, Gayathri; Lin, Yueh-Nuo; Soh, Moi-Lien; Lim, Elizabeth Ai-Sim; Lim, Chee-Wee; Tan, Boon-Huan
2008-11-25
We have detected the presence of porcine circovirus (PCV) type 2 in Indonesian pigs imported to Singapore for food consumption. A total of three viral isolates were identified, and to genetically characterise them further, their full genomes were sequenced. Each genome showed a typical organization of PCV type 2, with the three isolates sharing similar genome lengths of 1767 nucleotide (nt) at high nt identities of 99.8-100%, further indicating that the viral isolates were quite homogeneous. Sequence analysis further revealed that the ORF2 genes contain the nt sequence CCCCGC (from nt position 262 to 267) that was previously reported to be associated with PCV type 2, group 1C. The phylogenetic tree was constructed for the ORF2 genes, and the PCV type 2 isolates distributed into two distinctive groups. The Indonesian PCV type 2 clustered tightly with one China isolate, accession number AY035820, as a sub-cluster in group 1C. The sequence and phylogenetic analyses both confirmed that the three Indonesian PCV type 2 isolates belong to group 1C, and that the genetic changes for the three Indonesian isolates were very stable, possibly due to the low-scale evolution.
Molecular hyperdiversity and evolution in very large populations.
Cutter, Asher D; Jovelin, Richard; Dey, Alivia
2013-04-01
The genomic density of sequence polymorphisms critically affects the sensitivity of inferences about ongoing sequence evolution, function and demographic history. Most animal and plant genomes have relatively low densities of polymorphisms, but some species are hyperdiverse with neutral nucleotide heterozygosity exceeding 5%. Eukaryotes with extremely large populations, mimicking bacterial and viral populations, present novel opportunities for studying molecular evolution in sexually reproducing taxa with complex development. In particular, hyperdiverse species can help answer controversial questions about the evolution of genome complexity, the limits of natural selection, modes of adaptation and subtleties of the mutation process. However, such systems have some inherent complications and here we identify topics in need of theoretical developments. Close relatives of the model organisms Caenorhabditis elegans and Drosophila melanogaster provide known examples of hyperdiverse eukaryotes, encouraging functional dissection of resulting molecular evolutionary patterns. We recommend how best to exploit hyperdiverse populations for analysis, for example, in quantifying the impact of noncrossover recombination in genomes and for determining the identity and micro-evolutionary selective pressures on noncoding regulatory elements. © 2013 Blackwell Publishing Ltd.
DOE Office of Scientific and Technical Information (OSTI.GOV)
Korber, Bette; Hraber, Peter; Giorgi, Elena
2009-01-01
Identification of transmitted/founder virus genomes and their progeny by is a novel strategy for probing the molecular basis of HIV-1 transmission and for evaluating the genetic imprint of viral and host factors that act to constrain or facilitate virus replication. Here, we show in a cohort of twelve acutely infected subjects (9 clade B; 3 clade C), that complete genomic sequences of transmitted/founder viruses could be inferred using single genome amplification of plasma viral RNA, direct amplicon sequencing, and a model of random virus evolution. This allowed for the precise identification, chemical synthesis, molecular cloning, and biological analysis of thosemore » viruses actually responsible for productive clinical infection and for a comprehensive mapping of sequential viral genomes and proteomes for mutations that are necessary or incidental to the establishment of HIV-1 persistence. Transmitted/founder viruses were CD4 and CCR5 tropic, replicated preferentially in activated primary T-Iymphocytes but not monocyte-derived macrophages, and were effectively shielded from most heterologous or broadly neutralizing antibodies. By 3 months of infection, the evolving viral quasispecies in three subjects showed mutational fixation at only 2-5 discreet genomic loci. By 6-12 months, mutational fixation was evident at 18-27 genomic loci. Some, but not all, of these mutations were attributable to virus escape from cytotoxic Tlymphocytes or neutralizing antibodies, suggesting that other viral or host factors may influence early HIV -1 fitness.« less
Lamers, S L; Fogel, G B; Liu, E S; Nolan, D J; Salemi, M; Barbier, A E; Rose, R; Singer, E J; McGrath, M S
2017-07-01
HIV cure research is increasingly focused on anatomical tissues as sites for residual HIV replication during combined antiretroviral therapy (cART). Tissue-based HIV could contribute to low-level immune activation and viral rebound over the course of infection and could also influence the development of diseases, such as atherosclerosis, neurological disorders and cancers. cART-treated subjects have a decreased and irregular presence of HIV among tissues, which has resulted in a paucity of actual evidence concerning how or if HIV persists, replicates and evolves in various anatomical sites during therapy. In this study, we pooled 1806 HIV envelope V3 loop sequences from twenty-six tissue types (seventy-one total tissues) of six pre-cART subjects, four subjects with an unknown cART history who died with profound AIDS, and five subjects who died while on cART with an undetectable plasma viral load. A computational approach was used to assess sequences for their ability to utilize specific cellular coreceptors (R5, R5 and X4, or X4). We found that autopsied tissues obtained from virally suppressed cART+ subjects harbored both integrated and expressed viruses with similar coreceptor usage profiles to subjects with no or ineffective cART therapy (i.e., significant plasma viral load at death). The study suggests that tissue microenvironments provide a sanctuary for the continued evolution of HIV despite cART. Copyright © 2017 Elsevier B.V. All rights reserved.
Mina, Thomas; Amini-Bavil-Olyaee, Samad; Shirvani-Dastgerdi, Elham; Trovão, Nídia Sequeira; Van Ranst, Marc; Pourkarim, Mahmoud Reza
2017-04-01
Fulminant hepatitis among different clinical outcomes of hepatitis B virus infection is very rare and manifests high mortality rate, however it has not been investigated in Belgian inhabitants yet. In the frame of a retrospective study between 1995 and 2010, 80 serum samples (in some cases serial samples) archived in Biobank, were collected from 24 patients who had clinically developed fulminant infection of hepatitis B virus. In total, 33 hepatitis B virus (HBV) strains (31 full-length genome and 2 partial viral genes) of different HBV genotypes and subgenotypes including A2, B2, D1, D2, D3 and E, were amplified, sequenced and phylogenetically analyzed. HBV isolated strains from native and exotic patients were characterized by genome variations associated with viral invasiveness. Although several mutations at nucleotide and protein levels were detected, evolutionary analyses revealed a negative selective pressure over the viral genomes. This study revealed influence of immigration through a steady change in the viral epidemiological profile of the Belgian population. Copyright © 2017 Elsevier B.V. All rights reserved.
Evolution and Diversity in Human Herpes Simplex Virus Genomes
Gatherer, Derek; Ochoa, Alejandro; Greenbaum, Benjamin; Dolan, Aidan; Bowden, Rory J.; Enquist, Lynn W.; Legendre, Matthieu; Davison, Andrew J.
2014-01-01
Herpes simplex virus 1 (HSV-1) causes a chronic, lifelong infection in >60% of adults. Multiple recent vaccine trials have failed, with viral diversity likely contributing to these failures. To understand HSV-1 diversity better, we comprehensively compared 20 newly sequenced viral genomes from China, Japan, Kenya, and South Korea with six previously sequenced genomes from the United States, Europe, and Japan. In this diverse collection of passaged strains, we found that one-fifth of the newly sequenced members share a gene deletion and one-third exhibit homopolymeric frameshift mutations (HFMs). Individual strains exhibit genotypic and potential phenotypic variation via HFMs, deletions, short sequence repeats, and single-nucleotide polymorphisms, although the protein sequence identity between strains exceeds 90% on average. In the first genome-scale analysis of positive selection in HSV-1, we found signs of selection in specific proteins and residues, including the fusion protein glycoprotein H. We also confirmed previous results suggesting that recombination has occurred with high frequency throughout the HSV-1 genome. Despite this, the HSV-1 strains analyzed clustered by geographic origin during whole-genome distance analysis. These data shed light on likely routes of HSV-1 adaptation to changing environments and will aid in the selection of vaccine antigens that are invariant worldwide. PMID:24227835
McFaul, Katie; Liptrott, Neill; Cox, Alison; Martin, Phillip; Egan, Deirdre; Owen, Andrew; Kelly, Sarah; Karolia, Zeenat; Shaw, Kate; Bower, Mark; Boffito, Marta
2016-09-01
The use of combination antiretroviral therapy (cART) and cytotoxic chemotherapy for HIV-associated lymphoma runs the risks of inducing HIV drug resistance. This study examined two possible mechanisms: altered expression of membrane drug transporter protein (MTP) and acquisition of mutations in pro-viral DNA. Expression levels of MTP and pro-viral DNA resistance mutation analysis were performed on peripheral blood mononuclear cells (PBMC) before, during, and after chemotherapy. Twenty nine patients completed the three time point estimations. There were no significant variations before, during, and after chemotherapy in the expression of four MTPs: ABCB1, ABCC1, ABCC2, and SLCO3A1 (OATP3A1). Pro-viral DNA sequencing revealed that only one patient developed a new nucleos/tide reverse transcriptase inhibitor-associated mutation (184V) during the course of the study, giving a mutation rate of 0.0027 per person per year. In conclusion, concomitant administration of cytotoxic chemotherapy and cART does not induce expression of MTP. Furthermore, no significant changes in viral resistance were observed pre- and post-chemotherapy, suggesting mutagenic cytotoxic chemotherapy seems not to induce mutations in HIV pro-viral DNA.
Some basic properties of immune selection.
Iwasa, Yoh; Michor, Franziska; Nowak, Martin
2004-07-21
We analyze models for the evolutionary dynamics of viral or other infectious agents within a host. We study how the invasion of a new strain affects the composition and diversity of the viral population. We show that--under strain-specific immunity--the equilibrium abundance of uninfected cells declines during viral evolution. In addition, for cytotoxic immunity the absolute force of infection, and for non-cytotoxic immunity the absolute cellular virulence increases during viral evolution. We prove global stability by means of Lyapunov functions. These unidirectional trends of virus evolution under immune selection do not hold for general cross-reactive immune responses, which introduce frequency-dependent selection among viral strains. Therefore, appropriate cross-reactive immunity can lead to a viral evolution within a host which limits the extent of the disease.
Benmansour, A; Brahimi, M; Tuffereau, C; Coulon, P; Lafay, F; Flamand, A
1992-03-01
The sequence of the glycoprotein gene of a street rabies virus was determined directly using fragments of a rabid dog brain after PCR amplification. Compared with that of the prototype strain CVS, this sequence displayed 10% divergence in overall amino acid composition. However only 6% divergence was noted in the ectodomain suggesting that structural constraints are exerted on this portion of the glycoprotein. A human strain isolated on cell culture from the saliva of a patient with clinical rabies had only five amino acid differences with the canine isolate, an indication of their close relatedness. These differences could have originated during transmission from dog to dog, or from dog to man, or during isolation on cell culture; they are nonetheless indicative of a genetic evolution of street rabies virus. This evolution was further evidenced by the selection of cell-adapted variants which displayed new amino acid substitutions in the glycoprotein. One of them concerned antigenic site III where arginine at position 333 was replaced by glutamine. As expected this substitution conferred resistance to a site IIIa monoclonal antibody (MAb), but surprisingly did not abolish neurovirulence for adult mice. However, a decrease in the neurovirulence of the cell-adapted variant in the presence of a site IIIa specific MAb was noted, suggesting that neurovirulence was due to a subpopulation neutralizable by the MAb. Simultaneous presence of both the parental and variant sequences was indeed evidenced in the brain of a mouse inoculated with the cell-adapted variant; during multiplication in the mouse brain, the frequency of the parental sequence rose from less than 10% to nearly 50%, indicating the selective advantage conferred by arginine 333 in nervous tissue. Altogether these results were suggestive of an intrinsic heterogeneity of street rabies virus. This heterogeneity was further demonstrated by the sequencing of molecular clones of the glycoprotein gene, which revealed that only one-third of the viral genomes present in the brain of a rabid dog had the consensus sequence. Two-thirds of the clones analyzed displayed from one to three amino acid substitutions. Such heterogeneous populations have been referred to as quasispecies, a concept which implies heterogeneous populations kept together in a dynamic equilibrium. This equilibrium could be rapidly displaced, giving the virus the capacity to adapt easily to new environmental conditions.
Piontkivska, Helen; Matos, Luis F; Paul, Sinu; Scharfenberg, Brian; Farmerie, William G; Miyamoto, Michael M; Wayne, Marta L
2016-10-05
Sigma virus (DMelSV) is ubiquitous in natural populations of Drosophila melanogaster. Host-mediated, selective RNA editing of adenosines to inosines (ADAR) may contribute to control of viral infection by preventing transcripts from being transported into the cytoplasm or being translated accurately; or by increasing the viral genomic mutation rate. Previous PCR-based studies showed that ADAR mutations occur in DMelSV at low frequency. Here we use SOLiD TM deep sequencing of flies from a single host population from Athens, GA, USA to comprehensively evaluate patterns of sequence variation in DMelSV with respect to ADAR. GA dinucleotides, which are weak targets of ADAR, are strongly overrepresented in the positive strand of the virus, consistent with selection to generate ADAR resistance on this complement of the transient, double-stranded RNA intermediate in replication and transcription. Potential ADAR sites in a worldwide sample of viruses are more likely to be "resistant" if the sites do not vary among samples. Either variable sites are less constrained and hence are subject to weaker selection than conserved sites, or the variation is driven by ADAR. We also find evidence of mutations segregating within hosts, hereafter referred to as hypervariable sites. Some of these sites were variable only in one or two flies (i.e., rare); others were shared by four or even all five of the flies (i.e., common). Rare and common hypervariable sites were indistinguishable with respect to susceptibility to ADAR; however, polymorphism in rare sites were more likely to be consistent with the action of ADAR than in common ones, again suggesting that ADAR is deleterious to the virus. Thus, in DMelSV, host mutagenesis is constraining viral evolution both within and between hosts. © The Author 2016. Published by Oxford University Press on behalf of the Society for Molecular Biology and Evolution.
Bertels, Frederic; Marzel, Alex; Leventhal, Gabriel; Mitov, Venelin; Fellay, Jacques; Günthard, Huldrych F; Böni, Jürg; Yerly, Sabine; Klimkait, Thomas; Aubert, Vincent; Battegay, Manuel; Rauch, Andri; Cavassini, Matthias; Calmy, Alexandra; Bernasconi, Enos; Schmid, Patrick; Scherrer, Alexandra U; Müller, Viktor; Bonhoeffer, Sebastian; Kouyos, Roger; Regoes, Roland R
2018-01-01
Pathogen strains may differ in virulence because they attain different loads in their hosts, or because they induce different disease-causing mechanisms independent of their load. In evolutionary ecology, the latter is referred to as "per-parasite pathogenicity". Using viral load and CD4+ T-cell measures from 2014 HIV-1 subtype B-infected individuals enrolled in the Swiss HIV Cohort Study, we investigated if virulence-measured as the rate of decline of CD4+ T cells-and per-parasite pathogenicity are heritable from donor to recipient. We estimated heritability by donor-recipient regressions applied to 196 previously identified transmission pairs, and by phylogenetic mixed models applied to a phylogenetic tree inferred from HIV pol sequences. Regressing the CD4+ T-cell declines and per-parasite pathogenicities of the transmission pairs did not yield heritability estimates significantly different from zero. With the phylogenetic mixed model, however, our best estimate for the heritability of the CD4+ T-cell decline is 17% (5-30%), and that of the per-parasite pathogenicity is 17% (4-29%). Further, we confirm that the set-point viral load is heritable, and estimate a heritability of 29% (12-46%). Interestingly, the pattern of evolution of all these traits differs significantly from neutrality, and is most consistent with stabilizing selection for the set-point viral load, and with directional selection for the CD4+ T-cell decline and the per-parasite pathogenicity. Our analysis shows that the viral genotype affects virulence mainly by modulating the per-parasite pathogenicity, while the indirect effect via the set-point viral load is minor. © The Author 2017. Published by Oxford University Press on behalf of the Society for Molecular Biology and Evolution.
Comparing viral metagenomics methods using a highly multiplexed human viral pathogens reagent
Li, Linlin; Deng, Xutao; Mee, Edward T.; Collot-Teixeira, Sophie; Anderson, Rob; Schepelmann, Silke; Minor, Philip D.; Delwart, Eric
2014-01-01
Unbiased metagenomic sequencing holds significant potential as a diagnostic tool for the simultaneous detection of any previously genetically described viral nucleic acids in clinical samples. Viral genome sequences can also inform on likely phenotypes including drug susceptibility or neutralization serotypes. In this study, different variables of the laboratory methods often used to generate viral metagenomics libraries on the efficiency of viral detection and virus genome coverage were compared. A biological reagent consisting of 25 different human RNA and DNA viral pathogens was used to estimate the effect of filtration and nuclease digestion, DNA/RNA extraction methods, pre-amplification and the use of different library preparation kits on the detection of viral nucleic acids. Filtration and nuclease treatment led to slight decreases in the percentage of viral sequence reads and number of viruses detected. For nucleic acid extractions silica spin columns improved viral sequence recovery relative to magnetic beads and Trizol extraction. Pre-amplification using random RT-PCR while generating more viral sequence reads resulted in detection of fewer viruses, more overlapping sequences, and lower genome coverage. The ScriptSeq library preparation method retrieved more viruses and a greater fraction of their genomes than the TruSeq and Nextera methods. Viral metagenomics sequencing was able to simultaneously detect up to 22 different viruses in the biological reagent analyzed including all those detected by qPCR. Further optimization will be required for the detection of viruses in biologically more complex samples such as tissues, blood, or feces. PMID:25497414
Origins and challenges of viral dark matter.
Krishnamurthy, Siddharth R; Wang, David
2017-07-15
The accurate classification of viral dark matter - metagenomic sequences that originate from viruses but do not align to any reference virus sequences - is one of the major obstacles in comprehensively defining the virome. Depending on the sample, viral dark matter can make up from anywhere between 40 and 90% of sequences. This review focuses on the specific nature of dark matter as it relates to viral sequences. We identify three factors that contribute to the existence of viral dark matter: the divergence and length of virus sequences, the limitations of alignment based classification, and limited representation of viruses in reference sequence databases. We then discuss current methods that have been developed to at least partially circumvent these limitations and thereby reduce the extent of viral dark matter. Copyright © 2017 Elsevier B.V. All rights reserved.
Fenton-May, Angharad E.; Dilernia, Dario A.; Kilembe, William; Allen, Susan A.; Borrow, Persephone; Hunter, Eric
2015-01-01
Heterosexual transmission of HIV-1 is characterized by a genetic bottleneck that selects a single viral variant, the transmitted/founder (TF), during most transmission events. To assess viral characteristics influencing HIV-1 transmission, we sequenced 167 near full-length viral genomes and generated 40 infectious molecular clones (IMC) including TF variants and multiple non-transmitted (NT) HIV-1 subtype C variants from six linked heterosexual transmission pairs near the time of transmission. Consensus-like genomes sensitive to donor antibodies were selected for during transmission in these six transmission pairs. However, TF variants did not demonstrate increased viral fitness in terms of particle infectivity or viral replicative capacity in activated peripheral blood mononuclear cells (PBMC) and monocyte-derived dendritic cells (MDDC). In addition, resistance of the TF variant to the antiviral effects of interferon-α (IFN-α) was not significantly different from that of non-transmitted variants from the same transmission pair. Thus neither in vitro viral replicative capacity nor IFN-α resistance discriminated the transmission potential of viruses in the quasispecies of these chronically infected individuals. However, our findings support the hypothesis that within-host evolution of HIV-1 in response to adaptive immune responses reduces viral transmission potential. PMID:26378795
Bovine Enteroviruses as Indicators of Fecal Contamination
Ley, Victoria; Higgins, James; Fayer, Ronald
2002-01-01
Surface waters frequently have been contaminated with human enteric viruses, and it is likely that animal enteric viruses have contaminated surface waters also. Bovine enteroviruses (BEV), found in cattle worldwide, usually cause asymptomatic infections and are excreted in the feces of infected animals in large numbers. In this study, the prevalence and genotype of BEV in a closed herd of cattle were evaluated and compared with BEV found in animals in the immediate environment and in environmental specimens. BEV was found in feces from 76% of cattle, 38% of white-tailed deer, and one of three Canada geese sharing the same pastures, as well as the water obtained from animal watering tanks, from the pasture, from streams running from the pasture to an adjacent river, and from the river, which emptied into the Chesapeake Bay. Furthermore, BEV was found in oysters collected from that river downstream from the farm. These findings suggest that BEV could be used as an indicator of fecal pollution originating from animals (cattle and/or deer). Partial sequence analysis of the viral genomes indicates that different viral variants coexist in the same area. The possibility of identifying the viral strains found in the animals and in the contaminated areas by sequencing the RNA genome, could provide a tool to find the origin of the contamination and should be useful for epidemiological and viral molecular evolution studies. PMID:12089028
Do Viruses Exchange Genes across Superkingdoms of Life?
Malik, Shahana S; Azem-E-Zahra, Syeda; Kim, Kyung Mo; Caetano-Anollés, Gustavo; Nasir, Arshan
2017-01-01
Viruses can be classified into archaeoviruses, bacterioviruses, and eukaryoviruses according to the taxonomy of the infected host. The host-constrained perception of viruses implies preference of genetic exchange between viruses and cellular organisms of their host superkingdoms and viral origins from host cells either via escape or reduction. However, viruses frequently establish non-lytic interactions with organisms and endogenize into the genomes of bacterial endosymbionts that reside in eukaryotic cells. Such interactions create opportunities for genetic exchange between viruses and organisms of non-host superkingdoms. Here, we take an atypical approach to revisit virus-cell interactions by first identifying protein fold structures in the proteomes of archaeoviruses, bacterioviruses, and eukaryoviruses and second by tracing their spread in the proteomes of superkingdoms Archaea, Bacteria, and Eukarya. The exercise quantified protein structural homologies between viruses and organisms of their host and non-host superkingdoms and revealed likely candidates for virus-to-cell and cell-to-virus gene transfers. Unexpected lifestyle-driven genetic affiliations between bacterioviruses and Eukarya and eukaryoviruses and Bacteria were also predicted in addition to a large cohort of protein folds that were universally shared by viral and cellular proteomes and virus-specific protein folds not detected in cellular proteomes. These protein folds provide unique insights into viral origins and evolution that are generally difficult to recover with traditional sequence alignment-dependent evolutionary analyses owing to the fast mutation rates of viral gene sequences.
Consolidation of molecular testing in clinical virology.
Scagnolari, Carolina; Turriziani, Ombretta; Monteleone, Katia; Pierangeli, Alessandra; Antonelli, Guido
2017-04-01
The development of quantitative methods for the detection of viral nucleic acids have significantly improved our ability to manage disease progression and to assess the efficacy of antiviral treatment. Moreover, major advances in molecular technologies during the last decade have allowed the identification of new host genetic markers associated with antiviral drug response but have also strongly revolutionized the way we see and perform virus diagnostics in the coming years. Areas covered: In this review, we describe the history and development of virology diagnostic methods, dedicating particular emphasis on the gradual evolution and recent advances toward the introduction of multiparametric platforms for the syndromic diagnosis. In parallel, we outline the consolidation of viral genome quantification practice in different clinical settings. Expert commentary: More rapid, accurate and affordable molecular technology can be predictable with particular emphasis on emerging techniques (next generation sequencing, digital PCR, point of care testing and syndromic diagnosis) to simplify viral diagnosis in the next future.
Chen, Rubing; Holmes, Edward C
2009-01-05
Revealing the factors that shape the genetic structure of avian influenza viruses (AIVs) in wild bird populations is essential to understanding their evolution. However, the relationship between epidemiological dynamics and patterns of genetic diversity in AIV is not well understood, especially at the continental scale. To address this question, we undertook a phylogeographic analysis of complete genome sequences of AIV sampled from wild birds in North America. In particular, we asked whether host species, geographic location or sampling time played the major role in shaping patterns of viral genetic diversity. Strikingly, our analysis revealed no strong species effect, yet a significant viral clustering by time and place of sampling, as well as the circulation of multiple viral lineages in single locations. These results suggest that AIVs can readily infect many of the bird species that share breeding/feeding areas.
Zika virus evolution and spread in the Americas.
Metsky, Hayden C; Matranga, Christian B; Wohl, Shirlee; Schaffner, Stephen F; Freije, Catherine A; Winnicki, Sarah M; West, Kendra; Qu, James; Baniecki, Mary Lynn; Gladden-Young, Adrianne; Lin, Aaron E; Tomkins-Tinch, Christopher H; Ye, Simon H; Park, Daniel J; Luo, Cynthia Y; Barnes, Kayla G; Shah, Rickey R; Chak, Bridget; Barbosa-Lima, Giselle; Delatorre, Edson; Vieira, Yasmine R; Paul, Lauren M; Tan, Amanda L; Barcellona, Carolyn M; Porcelli, Mario C; Vasquez, Chalmers; Cannons, Andrew C; Cone, Marshall R; Hogan, Kelly N; Kopp, Edgar W; Anzinger, Joshua J; Garcia, Kimberly F; Parham, Leda A; Ramírez, Rosa M Gélvez; Montoya, Maria C Miranda; Rojas, Diana P; Brown, Catherine M; Hennigan, Scott; Sabina, Brandon; Scotland, Sarah; Gangavarapu, Karthik; Grubaugh, Nathan D; Oliveira, Glenn; Robles-Sikisaka, Refugio; Rambaut, Andrew; Gehrke, Lee; Smole, Sandra; Halloran, M Elizabeth; Villar, Luis; Mattar, Salim; Lorenzana, Ivette; Cerbino-Neto, Jose; Valim, Clarissa; Degrave, Wim; Bozza, Patricia T; Gnirke, Andreas; Andersen, Kristian G; Isern, Sharon; Michael, Scott F; Bozza, Fernando A; Souza, Thiago M L; Bosch, Irene; Yozwiak, Nathan L; MacInnis, Bronwyn L; Sabeti, Pardis C
2017-06-15
Although the recent Zika virus (ZIKV) epidemic in the Americas and its link to birth defects have attracted a great deal of attention, much remains unknown about ZIKV disease epidemiology and ZIKV evolution, in part owing to a lack of genomic data. Here we address this gap in knowledge by using multiple sequencing approaches to generate 110 ZIKV genomes from clinical and mosquito samples from 10 countries and territories, greatly expanding the observed viral genetic diversity from this outbreak. We analysed the timing and patterns of introductions into distinct geographic regions; our phylogenetic evidence suggests rapid expansion of the outbreak in Brazil and multiple introductions of outbreak strains into Puerto Rico, Honduras, Colombia, other Caribbean islands, and the continental United States. We find that ZIKV circulated undetected in multiple regions for many months before the first locally transmitted cases were confirmed, highlighting the importance of surveillance of viral infections. We identify mutations with possible functional implications for ZIKV biology and pathogenesis, as well as those that might be relevant to the effectiveness of diagnostic tests.
Garamszegi, Sara; Franzosa, Eric A.; Xia, Yu
2013-01-01
A central challenge in host-pathogen systems biology is the elucidation of general, systems-level principles that distinguish host-pathogen interactions from within-host interactions. Current analyses of host-pathogen and within-host protein-protein interaction networks are largely limited by their resolution, treating proteins as nodes and interactions as edges. Here, we construct a domain-resolved map of human-virus and within-human protein-protein interaction networks by annotating protein interactions with high-coverage, high-accuracy, domain-centric interaction mechanisms: (1) domain-domain interactions, in which a domain in one protein binds to a domain in a second protein, and (2) domain-motif interactions, in which a domain in one protein binds to a short, linear peptide motif in a second protein. Analysis of these domain-resolved networks reveals, for the first time, significant mechanistic differences between virus-human and within-human interactions at the resolution of single domains. While human proteins tend to compete with each other for domain binding sites by means of sequence similarity, viral proteins tend to compete with human proteins for domain binding sites in the absence of sequence similarity. Independent of their previously established preference for targeting human protein hubs, viral proteins also preferentially target human proteins containing linear motif-binding domains. Compared to human proteins, viral proteins participate in more domain-motif interactions, target more unique linear motif-binding domains per residue, and contain more unique linear motifs per residue. Together, these results suggest that viruses surmount genome size constraints by convergently evolving multiple short linear motifs in order to effectively mimic, hijack, and manipulate complex host processes for their survival. Our domain-resolved analyses reveal unique signatures of pleiotropy, economy, and convergent evolution in viral-host interactions that are otherwise hidden in the traditional binary network, highlighting the power and necessity of high-resolution approaches in host-pathogen systems biology. PMID:24339775
Ramey, Andrew M; Goraichuk, Iryna V; Hicks, Joseph T; Dimitrov, Kiril M; Poulson, Rebecca L; Stallknecht, David E; Bahl, Justin; Afonso, Claudio L
2017-03-03
Avian paramyxovirus serotype 1 (APMV-1) viruses are globally distributed, infect wild, peridomestic, and domestic birds, and sometimes lead to outbreaks of disease. Thus, the maintenance, evolution, and spread of APMV-1 viruses are relevant to avian health. In this study we sequenced the fusion gene from 58 APMV-1 isolates recovered from thirteen species of wild birds sampled throughout the USA during 2007-2014. We analyzed sequence information with previously reported data in order to assess contemporary genetic diversity and inter-taxa/inter-region exchange of APMV-1 in wild birds sampled in North America. Our results suggest that wild birds maintain previously undescribed genetic diversity of APMV-1; however, such diversity is unlikely to be pathogenic to domestic poultry. Phylogenetic analyses revealed that APMV-1 diversity detected in wild birds of North America has been found in birds belonging to numerous taxonomic host orders and within hosts inhabiting multiple geographic regions suggesting some level of viral exchange. However, our results also provide statistical support for associations between phylogenetic tree topology and host taxonomic order/region of sample origin which supports restricted exchange among taxa and geographical regions of North America for some APMV-1 sub-genotypes. We identify previously unrecognized genetic diversity of APMV-1 in wild birds in North America which is likely a function of continued viral evolution in reservoir hosts. We did not, however, find support for the emergence or maintenance of APMV-1 strains predicted to be pathogenic to poultry in wild birds of North America outside of the order Suliformes (i.e., cormorants). Furthermore, genetic evidence suggests that ecological drivers or other mechanisms may restrict viral exchange among taxa and regions of North America. Additional and more systematic sampling for APMV-1 in North America would likely provide further inference on viral dynamics for this infectious agent in wild bird populations.
Ramey, Andy M.; Goraichuk, Iryna V.; Hicks, Joseph T.; Dimitrov, Kiril M.; Poulson, Rebecca L.; Stallknecht, David E.; Bahl, Justin; Afonso, Claudio L.
2017-01-01
BackgroundAvian paramyxovirus serotype 1 (APMV-1) viruses are globally distributed, infect wild, peridomestic, and domestic birds, and sometimes lead to outbreaks of disease. Thus, the maintenance, evolution, and spread of APMV-1 viruses are relevant to avian health.MethodsIn this study we sequenced the fusion gene from 58 APMV-1 isolates recovered from thirteen species of wild birds sampled throughout the USA during 2007–2014. We analyzed sequence information with previously reported data in order to assess contemporary genetic diversity and inter-taxa/inter-region exchange of APMV-1 in wild birds sampled in North America.ResultsOur results suggest that wild birds maintain previously undescribed genetic diversity of APMV-1; however, such diversity is unlikely to be pathogenic to domestic poultry. Phylogenetic analyses revealed that APMV-1 diversity detected in wild birds of North America has been found in birds belonging to numerous taxonomic host orders and within hosts inhabiting multiple geographic regions suggesting some level of viral exchange. However, our results also provide statistical support for associations between phylogenetic tree topology and host taxonomic order/region of sample origin which supports restricted exchange among taxa and geographical regions of North America for some APMV-1 sub-genotypes.ConclusionsWe identify previously unrecognized genetic diversity of APMV-1 in wild birds in North America which is likely a function of continued viral evolution in reservoir hosts. We did not, however, find support for the emergence or maintenance of APMV-1 strains predicted to be pathogenic to poultry in wild birds of North America outside of the order Suliformes (i.e., cormorants). Furthermore, genetic evidence suggests that ecological drivers or other mechanisms may restrict viral exchange among taxa and regions of North America. Additional and more systematic sampling for APMV-1 in North America would likely provide further inference on viral dynamics for this infectious agent in wild bird populations.
Garamszegi, Sara; Franzosa, Eric A; Xia, Yu
2013-01-01
A central challenge in host-pathogen systems biology is the elucidation of general, systems-level principles that distinguish host-pathogen interactions from within-host interactions. Current analyses of host-pathogen and within-host protein-protein interaction networks are largely limited by their resolution, treating proteins as nodes and interactions as edges. Here, we construct a domain-resolved map of human-virus and within-human protein-protein interaction networks by annotating protein interactions with high-coverage, high-accuracy, domain-centric interaction mechanisms: (1) domain-domain interactions, in which a domain in one protein binds to a domain in a second protein, and (2) domain-motif interactions, in which a domain in one protein binds to a short, linear peptide motif in a second protein. Analysis of these domain-resolved networks reveals, for the first time, significant mechanistic differences between virus-human and within-human interactions at the resolution of single domains. While human proteins tend to compete with each other for domain binding sites by means of sequence similarity, viral proteins tend to compete with human proteins for domain binding sites in the absence of sequence similarity. Independent of their previously established preference for targeting human protein hubs, viral proteins also preferentially target human proteins containing linear motif-binding domains. Compared to human proteins, viral proteins participate in more domain-motif interactions, target more unique linear motif-binding domains per residue, and contain more unique linear motifs per residue. Together, these results suggest that viruses surmount genome size constraints by convergently evolving multiple short linear motifs in order to effectively mimic, hijack, and manipulate complex host processes for their survival. Our domain-resolved analyses reveal unique signatures of pleiotropy, economy, and convergent evolution in viral-host interactions that are otherwise hidden in the traditional binary network, highlighting the power and necessity of high-resolution approaches in host-pathogen systems biology.
Korber, B T; Kunstman, K J; Patterson, B K; Furtado, M; McEvilly, M M; Levy, R; Wolinsky, S M
1994-01-01
Human immunodeficiency virus type 1 (HIV-1) sequences were generated from blood and from brain tissue obtained by stereotactic biopsy from six patients undergoing a diagnostic neurosurgical procedure. Proviral DNA was directly amplified by nested PCR, and 8 to 36 clones from each sample were sequenced. Phylogenetic analysis of intrapatient envelope V3-V5 region HIV-1 DNA sequence sets revealed that brain viral sequences were clustered relative to the blood viral sequences, suggestive of tissue-specific compartmentalization of the virus in four of the six cases. In the other two cases, the blood and brain virus sequences were intermingled in the phylogenetic analyses, suggesting trafficking of virus between the two tissues. Slide-based PCR-driven in situ hybridization of two of the patients' brain biopsy samples confirmed our interpretation of the intrapatient phylogenetic analyses. Interpatient V3 region brain-derived sequence distances were significantly less than blood-derived sequence distances. Relative to the tip of the loop, the set of brain-derived viral sequences had a tendency towards negative or neutral charge compared with the set of blood-derived viral sequences. Entropy calculations were used as a measure of the variability at each position in alignments of blood and brain viral sequences. A relatively conserved set of positions were found, with a significantly lower entropy in the brain-than in the blood-derived viral sequences. These sites constitute a brain "signature pattern," or a noncontiguous set of amino acids in the V3 region conserved in viral sequences derived from brain tissue. This brain-derived signature pattern was also well preserved among isolates previously characterized in vitro as macrophage tropic. Macrophage-monocyte tropism may be the biological constraint that results in the conservation of the viral brain signature pattern. Images PMID:7933130
Pontremoli, Chiara; Forni, Diego; Cagliani, Rachele; Pozzoli, Uberto; Riva, Stefania; Bravo, Ignacio G; Clerici, Mario; Sironi, Manuela
2017-10-01
The Old World (OW) arenavirus complex includes several species of rodent-borne viruses, some of which (i.e., Lassa virus, LASV and Lymphocytic choriomeningitis virus, LCMV) cause human diseases. Most LCMV and LASV infections are caused by rodent-to-human transmissions. Thus, viral evolution is largely determined by events that occur in the wildlife reservoirs. We used a set of human- and rodent-derived viral sequences to investigate the evolutionary history underlying OW arenavirus speciation, as well as the more recent selective events that accompanied LASV spread in West Africa. We show that the viral RNA polymerase (L protein) was a major positive selection target in OW arenaviruses and during LASV out-of-Nigeria migration. No evidence of selection was observed for the glycoprotein, whereas positive selection acted on the nucleoprotein (NP) during LCMV speciation. Positively selected sites in L and NP are surrounded by highly conserved residues, and the bulk of the viral genome evolves under purifying selection. Several positively selected sites are likely to modulate viral replication/transcription. In both L and NP, structural features (solvent exposed surface area) are important determinants of site-wise evolutionary rate variation. By incorporating several rodent-derived sequences, we also performed an analysis of OW arenavirus codon adaptation to the human host. Results do not support a previously hypothesized role of codon adaptation in disease severity for non-Nigerian strains. In conclusion, L and NP represent the major selection targets and possible determinants of disease presentation; these results suggest that field surveys and experimental studies should primarily focus on these proteins. © 2017 John Wiley & Sons Ltd.
Benmansour, A.; Bascuro, B.; Monnier, A.F.; Vende, P.; Winton, J.R.; de Kinkelin, P.
1997-01-01
To evaluate the genetic diversity of viral haemorrhagic septicaemia virus (VHSV), the sequence of the glycoprotein genes (G) of 11 North American and European isolates were determined. Comparison with the G protein of representative members of the family Rhabdoviridae suggested that VHSV was a different virus species from infectious haemorrhagic necrosis virus (IHNV) and Hirame rhabdovirus (HIRRV). At a higher taxonomic level, VHSV, IHNV and HIRRV formed a group which was genetically closest to the genus Lyssavirus. Compared with each other, the G genes of VHSV displayed a dissimilar overall genetic diversity which correlated with differences in geographical origin. The multiple sequence alignment of the complete G protein, showed that the divergent positions were not uniformly distributed along the sequence. A central region (amino acid position 245-300) accumulated substitutions and appeared to be highly variable. The genetic heterogeneity within a single isolate was high, with an apparent internal mutation frequency of 1.2 x 10(-3) per nucleotide site, attesting the quasispecies nature of the viral population. The phylogeny separated VHSV strains according to the major geographical area of isolation: genotype I for continental Europe, genotype II for the British Isles, and genotype III for North America. Isolates from continental Europe exhibited the highest genetic variability, with sub-groups correlated partially with the serological classification. Neither neutralizing polyclonal sera, nor monoclonal antibodies, were able to discriminate between the genotypes. The overall structure of the phylogenetic tree suggests that VHSV genetic diversity and evolution fit within the model of random change and positive selection operating on quasispecies.
Janes, Holly; Frahm, Nicole; DeCamp, Allan; Rolland, Morgane; Gabriel, Erin; Wolfson, Julian; Hertz, Tomer; Kallas, Esper; Goepfert, Paul; Friedrich, David P.; Corey, Lawrence; Mullins, James I.; McElrath, M. Juliana; Gilbert, Peter
2012-01-01
Background The sieve analysis for the Step trial found evidence that breakthrough HIV-1 sequences for MRKAd5/HIV-1 Gag/Pol/Nef vaccine recipients were more divergent from the vaccine insert than placebo sequences in regions with predicted epitopes. We linked the viral sequence data with immune response and acute viral load data to explore mechanisms for and consequences of the observed sieve effect. Methods Ninety-one male participants (37 placebo and 54 vaccine recipients) were included; viral sequences were obtained at the time of HIV-1 diagnosis. T-cell responses were measured 4 weeks post-second vaccination and at the first or second week post-diagnosis. Acute viral load was obtained at RNA-positive and antibody-negative visits. Findings Vaccine recipients had a greater magnitude of post-infection CD8+ T cell response than placebo recipients (median 1.68% vs 1.18%; p = 0·04) and greater breadth of post-infection response (median 4.5 vs 2; p = 0·06). Viral sequences for vaccine recipients were marginally more divergent from the insert than placebo sequences in regions of Nef targeted by pre-infection immune responses (p = 0·04; Pol p = 0·13; Gag p = 0·89). Magnitude and breadth of pre-infection responses did not correlate with distance of the viral sequence to the insert (p>0·50). Acute log viral load trended lower in vaccine versus placebo recipients (estimated mean 4·7 vs 5·1) but the difference was not significant (p = 0·27). Neither was acute viral load associated with distance of the viral sequence to the insert (p>0·30). Interpretation Despite evidence of anamnestic responses, the sieve effect was not well explained by available measures of T-cell immunogenicity. Sequence divergence from the vaccine was not significantly associated with acute viral load. While point estimates suggested weak vaccine suppression of viral load, the result was not significant and more viral load data would be needed to detect suppression. PMID:22952672
Exploring viral infection using single-cell sequencing.
Rato, Sylvie; Golumbeanu, Monica; Telenti, Amalio; Ciuffi, Angela
2017-07-15
Single-cell sequencing (SCS) has emerged as a valuable tool to study cellular heterogeneity in diverse fields, including virology. By studying the viral and cellular genome and/or transcriptome, the dynamics of viral infection can be investigated at single cell level. Most studies have explored the impact of cell-to-cell variation on the viral life cycle from the point of view of the virus, by analyzing viral sequences, and from the point of view of the cell, mainly by analyzing the cellular host transcriptome. In this review, we will focus on recent studies that use single-cell sequencing to explore viral diversity and cell variability in response to viral replication. Copyright © 2016 The Authors. Published by Elsevier B.V. All rights reserved.
Re-visiting the evolution, dispersal and epidemiology of Zika virus in Asia.
Pettersson, John H-O; Bohlin, Jon; Dupont-Rouzeyrol, Myrielle; Brynildsrud, Ola B; Alfsnes, Kristian; Cao-Lormeau, Van-Mai; Gaunt, Michael W; Falconar, Andrew K; de Lamballerie, Xavier; Eldholm, Vegard; Musso, Didier; Gould, Ernest A
2018-05-09
Based on serological evidence and viral isolation, Zika virus (ZIKV) has circulated for many years relatively benignly in a sylvatic cycle in Africa and an urban cycle in South East Asia (SEA). With the recent availability of limited but novel Indian ZIKV sequences to add to the plethora of SEA sequences, we traced the phylogenetic history and spatio-temporal dispersal pattern of ZIKV in Asia prior to its explosive emergence in the Pacific region and the Americas. These analyses demonstrated that the introduction and dispersal of ZIKV on the Pacific islands were preceded by an extended period of relatively silent transmission in SEA, enabling the virus to expand geographically and evolve adaptively before its unanticipated introduction to immunologically naive populations on the Pacific islands and in the Americas. Our findings reveal new features of the evolution and dispersal of this intriguing virus and may benefit future disease control strategies.
Clinical Sequencing Uncovers Origins and Evolution of Lassa Virus.
Andersen, Kristian G; Shapiro, B Jesse; Matranga, Christian B; Sealfon, Rachel; Lin, Aaron E; Moses, Lina M; Folarin, Onikepe A; Goba, Augustine; Odia, Ikponmwonsa; Ehiane, Philomena E; Momoh, Mambu; England, Eleina M; Winnicki, Sarah; Branco, Luis M; Gire, Stephen K; Phelan, Eric; Tariyal, Ridhi; Tewhey, Ryan; Omoniwa, Omowunmi; Fullah, Mohammed; Fonnie, Richard; Fonnie, Mbalu; Kanneh, Lansana; Jalloh, Simbirie; Gbakie, Michael; Saffa, Sidiki; Karbo, Kandeh; Gladden, Adrianne D; Qu, James; Stremlau, Matthew; Nekoui, Mahan; Finucane, Hilary K; Tabrizi, Shervin; Vitti, Joseph J; Birren, Bruce; Fitzgerald, Michael; McCowan, Caryn; Ireland, Andrea; Berlin, Aaron M; Bochicchio, James; Tazon-Vega, Barbara; Lennon, Niall J; Ryan, Elizabeth M; Bjornson, Zach; Milner, Danny A; Lukens, Amanda K; Broodie, Nisha; Rowland, Megan; Heinrich, Megan; Akdag, Marjan; Schieffelin, John S; Levy, Danielle; Akpan, Henry; Bausch, Daniel G; Rubins, Kathleen; McCormick, Joseph B; Lander, Eric S; Günther, Stephan; Hensley, Lisa; Okogbenin, Sylvanus; Schaffner, Stephen F; Okokhere, Peter O; Khan, S Humarr; Grant, Donald S; Akpede, George O; Asogun, Danny A; Gnirke, Andreas; Levin, Joshua Z; Happi, Christian T; Garry, Robert F; Sabeti, Pardis C
2015-08-13
The 2013-2015 West African epidemic of Ebola virus disease (EVD) reminds us of how little is known about biosafety level 4 viruses. Like Ebola virus, Lassa virus (LASV) can cause hemorrhagic fever with high case fatality rates. We generated a genomic catalog of almost 200 LASV sequences from clinical and rodent reservoir samples. We show that whereas the 2013-2015 EVD epidemic is fueled by human-to-human transmissions, LASV infections mainly result from reservoir-to-human infections. We elucidated the spread of LASV across West Africa and show that this migration was accompanied by changes in LASV genome abundance, fatality rates, codon adaptation, and translational efficiency. By investigating intrahost evolution, we found that mutations accumulate in epitopes of viral surface proteins, suggesting selection for immune escape. This catalog will serve as a foundation for the development of vaccines and diagnostics. VIDEO ABSTRACT. Copyright © 2015 Elsevier Inc. All rights reserved.
Clinical sequencing uncovers origins and evolution of Lassa virus
Andersen, Kristian G.; Shapiro, B. Jesse; Matranga, Christian B.; Sealfon, Rachel; Lin, Aaron E.; Moses, Lina M.; Folarin, Onikepe A.; Goba, Augustine; Odia, Ikponmwonsa; Ehiane, Philomena E.; Momoh, Mambu; England, Eleina M.; Winnicki, Sarah; Branco, Luis M.; Gire, Stephen K.; Phelan, Eric; Tariyal, Ridhi; Tewhey, Ryan; Omoniwa, Omowunmi; Fullah, Mohammed; Fonnie, Richard; Fonnie, Mbalu; Kanneh, Lansana; Jalloh, Simbirie; Gbakie, Michael; Saffa, Sidiki; Karbo, Kandeh; Gladden, Adrianne D.; Qu, James; Stremlau, Matthew; Nekoui, Mahan; Finucane, Hilary K.; Tabrizi, Shervin; Vitti, Joseph J.; Birren, Bruce; Fitzgerald, Michael; McCowan, Caryn; Ireland, Andrea; Berlin, Aaron M.; Bochicchio, James; Tazon-Vega, Barbara; Lennon, Niall J.; Ryan, Elizabeth M.; Bjornson, Zach; Milner, Danny A.; Lukens, Amanda K.; Broodie, Nisha; Rowland, Megan; Heinrich, Megan; Akdag, Marjan; Schieffelin, John S.; Levy, Danielle; Akpan, Henry; Bausch, Daniel G.; Rubins, Kathleen; McCormick, Joseph B.; Lander, Eric S.; Günther, Stephan; Hensley, Lisa; Okogbenin, Sylvanus; Schaffner, Stephen F.; Okokhere, Peter O.; Khan, S. Humarr; Grant, Donald S.; Akpede, George O.; Asogun, Danny A.; Gnirke, Andreas; Levin, Joshua Z.; Happi, Christian T.; Garry, Robert F.; Sabeti, Pardis C.
2015-01-01
Summary The 2013-2015 West African epidemic of Ebola virus disease (EVD) reminds us how little is known about biosafety level-4 viruses. Like Ebola virus, Lassa virus (LASV) can cause hemorrhagic fever with high case fatality rates. We generated a genomic catalog of almost 200 LASV sequences from clinical and rodent reservoir samples. We show that whereas the 2013-2015 EVD epidemic is fueled by human-to-human transmissions, LASV infections mainly result from reservoir-to-human infections. We elucidated the spread of LASV across West Africa and show that this migration was accompanied by changes in LASV genome abundance, fatality rates, codon adaptation, and translational efficiency. By investigating intrahost evolution, we found that mutations accumulate in epitopes of viral surface proteins, suggesting selection for immune escape. This catalog will serve as a foundation for the development of vaccines and diagnostics. PMID:26276630
Focused Evolution of HIV-1 Neutralizing Antibodies Revealed by Structures and Deep Sequencing
DOE Office of Scientific and Technical Information (OSTI.GOV)
Wu, Xueling; Zhou, Tongqing; Zhu, Jiang
2013-03-04
Antibody VRC01 is a human immunoglobulin that neutralizes about 90% of HIV-1 isolates. To understand how such broadly neutralizing antibodies develop, we used x-ray crystallography and 454 pyrosequencing to characterize additional VRC01-like antibodies from HIV-1-infected individuals. Crystal structures revealed a convergent mode of binding for diverse antibodies to the same CD4-binding-site epitope. A functional genomics analysis of expressed heavy and light chains revealed common pathways of antibody-heavy chain maturation, confined to the IGHV1-2*02 lineage, involving dozens of somatic changes, and capable of pairing with different light chains. Broadly neutralizing HIV-1 immunity associated with VRC01-like antibodies thus involves the evolution ofmore » antibodies to a highly affinity-matured state required to recognize an invariant viral structure, with lineages defined from thousands of sequences providing a genetic roadmap of their development.« less
Huang, Youhua; Huang, Xiaohong; Liu, Hong; Gong, Jie; Ouyang, Zhengliang; Cui, Huachun; Cao, Jianhao; Zhao, Yingtao; Wang, Xiujie; Jiang, Yulin; Qin, Qiwei
2009-01-01
Background Soft-shelled turtle iridovirus (STIV) is the causative agent of severe systemic diseases in cultured soft-shelled turtles (Trionyx sinensis). To our knowledge, the only molecular information available on STIV mainly concerns the highly conserved STIV major capsid protein. The complete sequence of the STIV genome is not yet available. Therefore, determining the genome sequence of STIV and providing a detailed bioinformatic analysis of its genome content and evolution status will facilitate further understanding of the taxonomic elements of STIV and the molecular mechanisms of reptile iridovirus pathogenesis. Results We determined the complete nucleotide sequence of the STIV genome using 454 Life Science sequencing technology. The STIV genome is 105 890 bp in length with a base composition of 55.1% G+C. Computer assisted analysis revealed that the STIV genome contains 105 potential open reading frames (ORFs), which encode polypeptides ranging from 40 to 1,294 amino acids and 20 microRNA candidates. Among the putative proteins, 20 share homology with the ancestral proteins of the nuclear and cytoplasmic large DNA viruses (NCLDVs). Comparative genomic analysis showed that STIV has the highest degree of sequence conservation and a colinear arrangement of genes with frog virus 3 (FV3), followed by Tiger frog virus (TFV), Ambystoma tigrinum virus (ATV), Singapore grouper iridovirus (SGIV), Grouper iridovirus (GIV) and other iridovirus isolates. Phylogenetic analysis based on conserved core genes and complete genome sequence of STIV with other virus genomes was performed. Moreover, analysis of the gene gain-and-loss events in the family Iridoviridae suggested that the genes encoded by iridoviruses have evolved for favoring adaptation to different natural host species. Conclusion This study has provided the complete genome sequence of STIV. Phylogenetic analysis suggested that STIV and FV3 are strains of the same viral species belonging to the Ranavirus genus in the Iridoviridae family. Given virus-host co-evolution and the phylogenetic relationship among vertebrates from fish to reptiles, we propose that iridovirus might transmit between reptiles and amphibians and that STIV and FV3 are strains of the same viral species in the Ranavirus genus. PMID:19439104
Limited SHIV env diversification in macaques failing oral antiretroviral pre-exposure prophylaxis.
Zheng, Qi; Ruone, Susan; Switzer, William M; Heneine, Walid; García-Lerma, J Gerardo
2012-05-09
Pre-exposure prophylaxis (PrEP) with daily Truvada [a combination of emtricitabine (FTC) and tenofovir disoproxil fumarate (TDF)] is a novel HIV prevention strategy recently found to prevent HIV transmission among men who have sex with men and heterosexual couples. Acute infection in adherent persons who fail PrEP will inevitably occur under concurrent antiretroviral therapy, thus raising questions regarding the potential impact of PrEP on early viral dynamics. We investigated viral evolution dynamics in a macaque model of PrEP consisting of repeated rectal exposures to SHIV162P3 in the presence of PrEP. Four macaques were infected during daily or intermittent PrEP with FTC or FTC/TDF, and five were untreated controls. SHIV env sequence evolution was monitored by single genome amplification with phylogenetic and sequence analysis. Mean nucleotide divergence from transmitted founder viruses calculated 17 weeks (range = 12-20) post peak viremia was significantly lower in PrEP failures than in control animals (7.2 × 10-3 compared to 1.6 × 10-2 nucleotide substitutions per site per year, respectively, p < 0.0001). Mean virus diversity was also lower in PrEP failures after 17 weeks (0.13% vs. 0.53% in controls, p < 0.0001). Our results in a macaque model of acute HIV infection suggest that infection during PrEP limits early virus evolution likely because of a direct antiviral effect of PrEP and/or reduced target cell availability. Reduced virus diversification during early infection might enhance immune control by slowing the selection of escape mutants.
Hraber, Peter; Korber, Bette; Wagh, Kshitij; ...
2015-10-21
Within-host genetic sequencing from samples collected over time provides a dynamic view of how viruses evade host immunity. Immune-driven mutations might stimulate neutralization breadth by selecting antibodies adapted to cycles of immune escape that generate within-subject epitope diversity. Comprehensive identification of immune-escape mutations is experimentally and computationally challenging. With current technology, many more viral sequences can readily be obtained than can be tested for binding and neutralization, making down-selection necessary. Typically, this is done manually, by picking variants that represent different time-points and branches on a phylogenetic tree. Such strategies are likely to miss many relevant mutations and combinations ofmore » mutations, and to be redundant for other mutations. Longitudinal Antigenic Sequences and Sites from Intrahost Evolution (LASSIE) uses transmitted founder loss to identify virus “hot-spots” under putative immune selection and chooses sequences that represent recurrent mutations in selected sites. LASSIE favors earliest sequences in which mutations arise. Here, with well-characterized longitudinal Env sequences, we confirmed selected sites were concentrated in antibody contacts and selected sequences represented diverse antigenic phenotypes. Finally, practical applications include rapidly identifying immune targets under selective pressure within a subject, selecting minimal sets of reagents for immunological assays that characterize evolving antibody responses, and for immunogens in polyvalent “cocktail” vaccines.« less
Gong, Yu-Nong; Chen, Guang-Wu; Yang, Shu-Li; Lee, Ching-Ju; Shih, Shin-Ru; Tsao, Kuo-Chien
2016-01-01
Forty-two cytopathic effect (CPE)-positive isolates were collected from 2008 to 2012. All isolates could not be identified for known viral pathogens by routine diagnostic assays. They were pooled into 8 groups of 5-6 isolates to reduce the sequencing cost. Next-generation sequencing (NGS) was conducted for each group of mixed samples, and the proposed data analysis pipeline was used to identify viral pathogens in these mixed samples. Polymerase chain reaction (PCR) or enzyme-linked immunosorbent assay (ELISA) was individually conducted for each of these 42 isolates depending on the predicted viral types in each group. Two isolates remained unknown after these tests. Moreover, iteration mapping was implemented for each of these 2 isolates, and predicted human parechovirus (HPeV) in both. In summary, our NGS pipeline detected the following viruses among the 42 isolates: 29 human rhinoviruses (HRVs), 10 HPeVs, 1 human adenovirus (HAdV), 1 echovirus and 1 rotavirus. We then focused on the 10 identified Taiwanese HPeVs because of their reported clinical significance over HRVs. Their genomes were assembled and their genetic diversity was explored. One novel 6-bp deletion was found in one HPeV-1 virus. In terms of nucleotide heterogeneity, 64 genetic variants were detected from these HPeVs using the mapped NGS reads. Most importantly, a recombination event was found between our HPeV-3 and a known HPeV-4 strain in the database. Similar event was detected in the other HPeV-3 strains in the same clade of the phylogenetic tree. These findings demonstrated that the proposed NGS data analysis pipeline identified unknown viruses from the mixed clinical samples, revealed their genetic identity and variants, and characterized their genetic features in terms of viral evolution.
Marandino, Ana; Tomás, Gonzalo; Panzera, Yanina; Greif, Gonzalo; Parodi-Talice, Adriana; Hernández, Martín; Techera, Claudia; Hernández, Diego; Pérez, Ruben
2017-10-01
Infectious bronchitis virus (Gammacoronavirus, Coronaviridae) is a genetically variable RNA virus that causes one of the most persistent respiratory diseases in poultry. The virus is classified in genotypes and lineages with different epidemiological relevance. Two lineages of the GI genotype (11 and 16) have been widely circulating for decades in South America. GI-11 is an exclusive South American lineage while the GI-16 lineage is distributed in Asia, Europe and South America. Here, we obtained the whole genome of two Uruguayan strains of the GI-11 and GI-16 lineages using Illumina high-throughput sequencing. The strains here sequenced are the first obtained in South America for the infectious bronchitis virus and provide new insights into the origin, spreading and evolution of viral variants. The complete genome of the GI-11 and GI-16 strains have 27,621 and 27,638 nucleotides, respectively, and possess the same genomic organization. Phylogenetic incongruence analysis reveals that both strains have a mosaic genome that arose by recombination between Euro Asiatic strains of the GI-16 lineage and ancestral South American GI-11 viruses. The recombination occurred in South America and produced two viral variants that have retained the full-length S1 sequences of the parental lineages but are extremely similar in the rest of their genomes. These recombinant virus have been extraordinary successful, persisting in the continent for several years with a notorious wide geographic distribution. Our findings reveal a singular viral dynamics and emphasize the importance of complete genomic characterization to understand the emergence and evolutionary history of viral variants. Copyright © 2017 Elsevier B.V. All rights reserved.
Gayral, Philippe; Blondin, Laurence; Guidolin, Olivier; Carreel, Françoise; Hippolyte, Isabelle; Perrier, Xavier; Iskra-Caruana, Marie-Line
2010-07-01
Endogenous plant pararetroviruses (EPRVs) are viral sequences of the family Caulimoviridae integrated into the nuclear genome of numerous plant species. The ability of some endogenous sequences of Banana streak viruses (eBSVs) in the genome of banana (Musa sp.) to induce infections just like the virus itself was recently demonstrated (P. Gayral et al., J. Virol. 83:6697-6710, 2008). Although eBSVs probably arose from accidental events, infectious eBSVs constitute an extreme case of parasitism, as well as a newly described strategy for vertical virus transmission in plants. We investigated the early evolutionary stages of infectious eBSV for two distinct BSV species-GF (BSGFV) and Imové (BSImV)-through the study of their distribution, insertion polymorphism, and structure evolution among selected banana genotypes representative of the diversity of 60 wild Musa species and genotypes. To do so, the historical frame of host evolution was analyzed by inferring banana phylogeny from two chloroplast regions-matK and trnL-trnF-as well as from the nuclear genome, using 19 microsatellite loci. We demonstrated that both BSV species integrated recently in banana evolution, circa 640,000 years ago. The two infectious eBSVs were subjected to different selective pressures and showed distinct levels of rearrangement within their final structure. In addition, the molecular phylogenies of integrated and nonintegrated BSVs enabled us to establish the phylogenetic origins of eBSGFV and eBSImV.
Viral Evolution Core | FNLCR Staging
Brandon F. Keele, Ph.D. PI/Senior Principal Investigator, Retroviral Evolution Section Head, Viral Evolution Core Leidos Biomedical Research, Inc. Frederick National Laboratory for Cancer Research Frederick, MD 21702-1201 Tel: 301-846-173
Novosel, D; Tuboly, T; Csagola, A; Lorincz, M; Cubric-Curik, V; Jungic, A; Curik, I; Segalés, J; Cortey, M; Lipej, Z
2014-04-26
Porcine circovirus type 2 (PCV2) causes some of the most significant economic losses in pig production. Several multisystemic syndromes have been attributed to PCV2 infection, which are known as PCV2-associated diseases (PCVDs). This study investigated the origin and evolution of PCV2 sequences in domestic pigs and wild boars affected by PCVDs in Croatia. Viral sequences were recovered from three wild boars diagnosed with PCV2-systemic disease (PCV2-SD), 63 fetuses positive for PCV2 DNA as determined by PCR, 14 domestic pigs affected with PCV2-SD (displaying severe interstitial nephritis) and five domestic pigs with proliferative and necrotising pneumonia. Seventeen complete PCV2 genomes were recovered. Phylogenetic and evolutionary analyses based on median-joining phylogenetic networks, amino acid alignments and principal coordinate analysis were performed using complete genomes, as well as complete and partial ORF sequences for ORF1 and ORF2. Two of the 17 PCV2 sequences belonged to PCV2a, 14 to PCV2b and one was unclustered. PCV2b was the predominant genotype in Croatia and has been linked to international trade as a route of introduction. Correlation between particular viral strains with PCVDs is lacking.
Moreno, Elena; Gallego, Isabel; Gregori, Josep; Lucía-Sanz, Adriana; Soria, María Eugenia; Castro, Victoria; Beach, Nathan M.; Manrubia, Susanna; Quer, Josep; Esteban, Juan Ignacio; Rice, Charles M.; Gómez, Jordi; Gastaminza, Pablo
2017-01-01
ABSTRACT Viral quasispecies evolution upon long-term virus replication in a noncoevolving cellular environment raises relevant general issues, such as the attainment of population equilibrium, compliance with the molecular-clock hypothesis, or stability of the phenotypic profile. Here, we evaluate the adaptation, mutant spectrum dynamics, and phenotypic diversification of hepatitis C virus (HCV) in the course of 200 passages in human hepatoma cells in an experimental design that precluded coevolution of the cells with the virus. Adaptation to the cells was evidenced by increase in progeny production. The rate of accumulation of mutations in the genomic consensus sequence deviated slightly from linearity, and mutant spectrum analyses revealed a complex dynamic of mutational waves, which was sustained beyond passage 100. The virus underwent several phenotypic changes, some of which impacted the virus-host relationship, such as enhanced cell killing, a shift toward higher virion density, and increased shutoff of host cell protein synthesis. Fluctuations in progeny production and failure to reach population equilibrium at the genomic level suggest internal instabilities that anticipate an unpredictable HCV evolution in the complex liver environment. IMPORTANCE Long-term virus evolution in an unperturbed cellular environment can reveal features of virus evolution that cannot be explained by comparing natural viral isolates. In the present study, we investigate genetic and phenotypic changes that occur upon prolonged passage of hepatitis C virus (HCV) in human hepatoma cells in an experimental design in which host cell evolutionary change is prevented. Despite replication in a noncoevolving cellular environment, the virus exhibited internal population disequilibria that did not decline with increased adaptation to the host cells. The diversification of phenotypic traits suggests that disequilibria inherent to viral populations may provide a selective advantage to viruses that can be fully exploited in changing environments. PMID:28275194
Ancient papillomavirus-host co-speciation in Felidae
Rector, Annabel; Lemey, Philippe; Tachezy, Ruth; Mostmans, Sara; Ghim, Shin-Je; Van Doorslaer, Koenraad; Roelke, Melody; Bush, Mitchell; Montali, Richard J; Joslin, Janis; Burk, Robert D; Jenson, Alfred B; Sundberg, John P; Shapiro, Beth; Van Ranst, Marc
2007-01-01
Background Estimating evolutionary rates for slowly evolving viruses such as papillomaviruses (PVs) is not possible using fossil calibrations directly or sequences sampled over a time-scale of decades. An ability to correlate their divergence with a host species, however, can provide a means to estimate evolutionary rates for these viruses accurately. To determine whether such an approach is feasible, we sequenced complete feline PV genomes, previously available only for the domestic cat (Felis domesticus, FdPV1), from four additional, globally distributed feline species: Lynx rufus PV type 1, Puma concolor PV type 1, Panthera leo persica PV type 1, and Uncia uncia PV type 1. Results The feline PVs all belong to the Lambdapapillomavirus genus, and contain an unusual second noncoding region between the early and late protein region, which is only present in members of this genus. Our maximum likelihood and Bayesian phylogenetic analyses demonstrate that the evolutionary relationships between feline PVs perfectly mirror those of their feline hosts, despite a complex and dynamic phylogeographic history. By applying host species divergence times, we provide the first precise estimates for the rate of evolution for each PV gene, with an overall evolutionary rate of 1.95 × 10-8 (95% confidence interval 1.32 × 10-8 to 2.47 × 10-8) nucleotide substitutions per site per year for the viral coding genome. Conclusion Our work provides evidence for long-term virus-host co-speciation of feline PVs, indicating that viral diversity in slowly evolving viruses can be used to investigate host species evolution. These findings, however, should not be extrapolated to other viral lineages without prior confirmation of virus-host co-divergence. PMID:17430578
Combelas, Nicolas; Holmblat, Barbara; Joffret, Marie-Line; Colbère-Garapin, Florence; Delpeyroux, Francis
2011-01-01
Genetic recombination in RNA viruses was discovered many years ago for poliovirus (PV), an enterovirus of the Picornaviridae family, and studied using PV or other picornaviruses as models. Recently, recombination was shown to be a general phenomenon between different types of enteroviruses of the same species. In particular, the interest for this mechanism of genetic plasticity was renewed with the emergence of pathogenic recombinant circulating vaccine-derived polioviruses (cVDPVs), which were implicated in poliomyelitis outbreaks in several regions of the world with insufficient vaccination coverage. Most of these cVDPVs had mosaic genomes constituted of mutated poliovaccine capsid sequences and part or all of the non-structural sequences from other human enteroviruses of species C (HEV-C), in particular coxsackie A viruses. A study in Madagascar showed that recombinant cVDPVs had been co-circulating in a small population of children with many different HEV-C types. This viral ecosystem showed a surprising and extensive biodiversity associated to several types and recombinant genotypes, indicating that intertypic genetic recombination was not only a mechanism of evolution for HEV-C, but an usual mode of genetic plasticity shaping viral diversity. Results suggested that recombination may be, in conjunction with mutations, implicated in the phenotypic diversity of enterovirus strains and in the emergence of new pathogenic strains. Nevertheless, little is known about the rules and mechanisms which govern genetic exchanges between HEV-C types, as well as about the importance of intertypic recombination in generating phenotypic variation. This review summarizes our current knowledge of the mechanisms of evolution of PV, in particular recombination events leading to the emergence of recombinant cVDPVs. PMID:21994791
Combelas, Nicolas; Holmblat, Barbara; Joffret, Marie-Line; Colbère-Garapin, Florence; Delpeyroux, Francis
2011-08-01
Genetic recombination in RNA viruses was discovered many years ago for poliovirus (PV), an enterovirus of the Picornaviridae family, and studied using PV or other picornaviruses as models. Recently, recombination was shown to be a general phenomenon between different types of enteroviruses of the same species. In particular, the interest for this mechanism of genetic plasticity was renewed with the emergence of pathogenic recombinant circulating vaccine-derived polioviruses (cVDPVs), which were implicated in poliomyelitis outbreaks in several regions of the world with insufficient vaccination coverage. Most of these cVDPVs had mosaic genomes constituted of mutated poliovaccine capsid sequences and part or all of the non-structural sequences from other human enteroviruses of species C (HEV-C), in particular coxsackie A viruses. A study in Madagascar showed that recombinant cVDPVs had been co-circulating in a small population of children with many different HEV-C types. This viral ecosystem showed a surprising and extensive biodiversity associated to several types and recombinant genotypes, indicating that intertypic genetic recombination was not only a mechanism of evolution for HEV-C, but an usual mode of genetic plasticity shaping viral diversity. Results suggested that recombination may be, in conjunction with mutations, implicated in the phenotypic diversity of enterovirus strains and in the emergence of new pathogenic strains. Nevertheless, little is known about the rules and mechanisms which govern genetic exchanges between HEV-C types, as well as about the importance of intertypic recombination in generating phenotypic variation. This review summarizes our current knowledge of the mechanisms of evolution of PV, in particular recombination events leading to the emergence of recombinant cVDPVs.
A prokaryotic viral sequence is expressed and conserved in mammalian brain.
Yeh, Yang-Hui; Gunasekharan, Vignesh; Manuelidis, Laura
2017-07-03
A natural and permanent transfer of prokaryotic viral sequences to mammals has not been reported by others. Circular "SPHINX" DNAs <5 kb were previously isolated from nuclease-protected cytoplasmic particles in rodent neuronal cell lines and brain. Two of these DNAs were sequenced after Φ29 polymerase amplification, and they revealed significant but imperfect homology to segments of commensal Acinetobacter phage viruses. These findings were surprising because the brain is isolated from environmental microorganisms. The 1.76-kb DNA sequence (SPHINX 1.8), with an iteron before its ORF, was evaluated here for its expression in neural cells and brain. A rabbit affinity purified antibody generated against a peptide without homology to mammalian sequences labeled a nonglycosylated ∼41-kDa protein (spx1) on Western blots, and the signal was efficiently blocked by the competing peptide. Spx1 was resistant to limited proteinase K digestion, but was unrelated to the expression of host prion protein or its pathologic amyloid form. Remarkably, spx1 concentrated in selected brain synapses, such as those on anterior motor horn neurons that integrate many complex neural inputs. SPHINX 1.8 appears to be involved in tissue-specific differentiation, including essential functions that preserve its propagation during mammalian evolution, possibly via maternal inheritance. The data here indicate that mammals can share and exchange a larger world of prokaryotic viruses than previously envisioned.
Diversity, Distribution, and Evolution of Tomato Viruses in China Uncovered by Small RNA Sequencing.
Xu, Chenxi; Sun, Xuepeng; Taylor, Angela; Jiao, Chen; Xu, Yimin; Cai, Xiaofeng; Wang, Xiaoli; Ge, Chenhui; Pan, Guanghui; Wang, Quanxi; Fei, Zhangjun; Wang, Quanhua
2017-06-01
Tomato is a major vegetable crop that has tremendous popularity. However, viral disease is still a major factor limiting tomato production. Here, we report the tomato virome identified through sequencing small RNAs of 170 field-grown samples collected in China. A total of 22 viruses were identified, including both well-documented and newly detected viruses. The tomato viral community is dominated by a few species, and they exhibit polymorphisms and recombination in the genomes with cold spots and hot spots. Most samples were coinfected by multiple viruses, and the majority of identified viruses are positive-sense single-stranded RNA viruses. Evolutionary analysis of one of the most dominant tomato viruses, Tomato yellow leaf curl virus (TYLCV), predicts its origin and the time back to its most recent common ancestor. The broadly sampled data have enabled us to identify several unreported viruses in tomato, including a completely new virus, which has a genome of ∼13.4 kb and groups with aphid-transmitted viruses in the genus Cytorhabdovirus Although both DNA and RNA viruses can trigger the biogenesis of virus-derived small interfering RNAs (vsiRNAs), we show that features such as length distribution, paired distance, and base selection bias of vsiRNA sequences reflect different plant Dicer-like proteins and Argonautes involved in vsiRNA biogenesis. Collectively, this study offers insights into host-virus interaction in tomato and provides valuable information to facilitate the management of viral diseases. IMPORTANCE Tomato is an important source of micronutrients in the human diet and is extensively consumed around the world. Virus is among the major constraints on tomato production. Categorizing virus species that are capable of infecting tomato and understanding their diversity and evolution are challenging due to difficulties in detecting such fast-evolving biological entities. Here, we report the landscape of the tomato virome in China, the leading country in tomato production. We identified dozens of viruses present in tomato, including both well-documented and completely new viruses. Some newly emerged viruses in tomato were found to spread fast, and therefore, prompt attention is needed to control them. Moreover, we show that the virus genomes exhibit considerable degree of polymorphisms and recombination, and the virus-derived small interfering RNA (vsiRNA) sequences indicate distinct vsiRNA biogenesis mechanisms for different viruses. The Chinese tomato virome that we developed provides valuable information to facilitate the management of tomato viral diseases. Copyright © 2017 American Society for Microbiology.
Diversity, Distribution, and Evolution of Tomato Viruses in China Uncovered by Small RNA Sequencing
Xu, Chenxi; Taylor, Angela; Jiao, Chen; Xu, Yimin; Cai, Xiaofeng; Wang, Xiaoli; Ge, Chenhui; Pan, Guanghui; Wang, Quanxi
2017-01-01
ABSTRACT Tomato is a major vegetable crop that has tremendous popularity. However, viral disease is still a major factor limiting tomato production. Here, we report the tomato virome identified through sequencing small RNAs of 170 field-grown samples collected in China. A total of 22 viruses were identified, including both well-documented and newly detected viruses. The tomato viral community is dominated by a few species, and they exhibit polymorphisms and recombination in the genomes with cold spots and hot spots. Most samples were coinfected by multiple viruses, and the majority of identified viruses are positive-sense single-stranded RNA viruses. Evolutionary analysis of one of the most dominant tomato viruses, Tomato yellow leaf curl virus (TYLCV), predicts its origin and the time back to its most recent common ancestor. The broadly sampled data have enabled us to identify several unreported viruses in tomato, including a completely new virus, which has a genome of ∼13.4 kb and groups with aphid-transmitted viruses in the genus Cytorhabdovirus. Although both DNA and RNA viruses can trigger the biogenesis of virus-derived small interfering RNAs (vsiRNAs), we show that features such as length distribution, paired distance, and base selection bias of vsiRNA sequences reflect different plant Dicer-like proteins and Argonautes involved in vsiRNA biogenesis. Collectively, this study offers insights into host-virus interaction in tomato and provides valuable information to facilitate the management of viral diseases. IMPORTANCE Tomato is an important source of micronutrients in the human diet and is extensively consumed around the world. Virus is among the major constraints on tomato production. Categorizing virus species that are capable of infecting tomato and understanding their diversity and evolution are challenging due to difficulties in detecting such fast-evolving biological entities. Here, we report the landscape of the tomato virome in China, the leading country in tomato production. We identified dozens of viruses present in tomato, including both well-documented and completely new viruses. Some newly emerged viruses in tomato were found to spread fast, and therefore, prompt attention is needed to control them. Moreover, we show that the virus genomes exhibit considerable degree of polymorphisms and recombination, and the virus-derived small interfering RNA (vsiRNA) sequences indicate distinct vsiRNA biogenesis mechanisms for different viruses. The Chinese tomato virome that we developed provides valuable information to facilitate the management of tomato viral diseases. PMID:28331089
Brüwer, Jan D.
2018-01-01
Current research posits that all multicellular organisms live in symbioses with associated microorganisms and form so-called metaorganisms or holobionts. Cnidarian metaorganisms are of specific interest given that stony corals provide the foundation of the globally threatened coral reef ecosystems. To gain first insight into viruses associated with the coral model system Aiptasia (sensu Exaiptasia pallida), we analyzed an existing RNA-Seq dataset of aposymbiotic, partially populated, and fully symbiotic Aiptasia CC7 anemones with Symbiodinium. Our approach included the selective removal of anemone host and algal endosymbiont sequences and subsequent microbial sequence annotation. Of a total of 297 million raw sequence reads, 8.6 million (∼3%) remained after host and endosymbiont sequence removal. Of these, 3,293 sequences could be assigned as of viral origin. Taxonomic annotation of these sequences suggests that Aiptasia is associated with a diverse viral community, comprising 116 viral taxa covering 40 families. The viral assemblage was dominated by viruses from the families Herpesviridae (12.00%), Partitiviridae (9.93%), and Picornaviridae (9.87%). Despite an overall stable viral assemblage, we found that some viral taxa exhibited significant changes in their relative abundance when Aiptasia engaged in a symbiotic relationship with Symbiodinium. Elucidation of viral taxa consistently present across all conditions revealed a core virome of 15 viral taxa from 11 viral families, encompassing many viruses previously reported as members of coral viromes. Despite the non-random selection of viral genetic material due to the nature of the sequencing data analyzed, our study provides a first insight into the viral community associated with Aiptasia. Similarities of the Aiptasia viral community with those of corals corroborate the application of Aiptasia as a model system to study coral holobionts. Further, the change in abundance of certain viral taxa across different symbiotic states suggests a role of viruses in the algal endosymbiosis, but the functional significance of this remains to be determined. PMID:29507840
Laboratory procedures to generate viral metagenomes.
Thurber, Rebecca V; Haynes, Matthew; Breitbart, Mya; Wegley, Linda; Rohwer, Forest
2009-01-01
This collection of laboratory protocols describes the steps to collect viruses from various samples with the specific aim of generating viral metagenome sequence libraries (viromes). Viral metagenomics, the study of uncultured viral nucleic acid sequences from different biomes, relies on several concentration, purification, extraction, sequencing and heuristic bioinformatic methods. No single technique can provide an all-inclusive approach, and therefore the protocols presented here will be discussed in terms of hypothetical projects. However, care must be taken to individualize each step depending on the source and type of viral-particles. This protocol is a description of the processes we have successfully used to: (i) concentrate viral particles from various types of samples, (ii) eliminate contaminating cells and free nucleic acids and (iii) extract, amplify and purify viral nucleic acids. Overall, a sample can be processed to isolate viral nucleic acids suitable for high-throughput sequencing in approximately 1 week.
IMG/VR: a database of cultured and uncultured DNA Viruses and retroviruses
Paez-Espino, David; Chen, I. -Min A.; Palaniappan, Krishna; ...
2016-10-30
Viruses represent the most abundant life forms on the planet. Recent experimental and computational improvements have led to a dramatic increase in the number of viral genome sequences identified primarily from metagenomic samples. As a result of the expanding catalog of metagenomic viral sequences, there exists a need for a comprehensive computational platform integrating all these sequences with associated metadata and analytical tools. Here we present IMG/VR (https://img.jgi.doe.gov/vr/), the largest publicly available database of 3908 isolate reference DNA viruses with 264 413 computationally identified viral contigs from > 6000 ecologically diverse metagenomic samples. Approximately half of the viral contigs aremore » grouped into genetically distinct quasi-species clusters. Microbial hosts are predicted for 20 000 viral sequences, revealing nine microbial phyla previously unreported to be infected by viruses. Viral sequences can be queried using a variety of associated metadata, including habitat type and geographic location of the samples, or taxonomic classification according to hallmark viral genes. IMG/VR has a user-friendly interface that allows users to interrogate all integrated data and interact by comparingwith external sequences, thus serving as an essential resource in the viral genomics community.« less
IMG/VR: a database of cultured and uncultured DNA Viruses and retroviruses
DOE Office of Scientific and Technical Information (OSTI.GOV)
Paez-Espino, David; Chen, I. -Min A.; Palaniappan, Krishna
Viruses represent the most abundant life forms on the planet. Recent experimental and computational improvements have led to a dramatic increase in the number of viral genome sequences identified primarily from metagenomic samples. As a result of the expanding catalog of metagenomic viral sequences, there exists a need for a comprehensive computational platform integrating all these sequences with associated metadata and analytical tools. Here we present IMG/VR (https://img.jgi.doe.gov/vr/), the largest publicly available database of 3908 isolate reference DNA viruses with 264 413 computationally identified viral contigs from > 6000 ecologically diverse metagenomic samples. Approximately half of the viral contigs aremore » grouped into genetically distinct quasi-species clusters. Microbial hosts are predicted for 20 000 viral sequences, revealing nine microbial phyla previously unreported to be infected by viruses. Viral sequences can be queried using a variety of associated metadata, including habitat type and geographic location of the samples, or taxonomic classification according to hallmark viral genes. IMG/VR has a user-friendly interface that allows users to interrogate all integrated data and interact by comparingwith external sequences, thus serving as an essential resource in the viral genomics community.« less
Ceballos, Ana; Andreani, Guadalupe; Ripamonti, Chiara; Dilernia, Dario; Mendez, Ramiro; Rabinovich, Roberto D; Cárdenas, Patricia Coll; Zala, Carlos; Cahn, Pedro; Scarlatti, Gabriella; Martínez Peralta, Liliana
2008-11-01
Mother-to-child transmission (MTCT) of human immunodeficiency virus type 1 (HIV-1) as described for women with an established infection is, in most cases, associated with the transmission of few maternal variants. This study analysed virus variability in four cases of maternal primary infection occurring during pregnancy and/or breastfeeding. Estimated time of seroconversion was at 4 months of pregnancy for one woman (early seroconversion) and during the last months of pregnancy and/or breastfeeding for the remaining three (late seroconversion). The C2V3 envelope region was analysed in samples of mother-child pairs by molecular cloning and sequencing. Comparisons of nucleotide and amino acid sequences as well as phylogenetic analysis were performed. The results showed low variability in the virus population of both mother and child. Maximum-likelihood analysis showed that, in the early pregnancy seroconversion case, a minor viral variant with further evolution in the child was transmitted, which could indicate a selection event in MTCT or a stochastic event, whereas in the late seroconversion cases, the mother's and child's sequences were intermingled, which is compatible with the transmission of multiple viral variants from the mother's major population. These results could be explained by the less pronounced selective pressure exerted by the immune system in the early stages of the mother's infection, which could play a role in MTCT of HIV-1.
Reid-Bayliss, Kate S; Loeb, Lawrence A
2017-08-29
Transcriptional mutagenesis (TM) due to misincorporation during RNA transcription can result in mutant RNAs, or epimutations, that generate proteins with altered properties. TM has long been hypothesized to play a role in aging, cancer, and viral and bacterial evolution. However, inadequate methodologies have limited progress in elucidating a causal association. We present a high-throughput, highly accurate RNA sequencing method to measure epimutations with single-molecule sensitivity. Accurate RNA consensus sequencing (ARC-seq) uniquely combines RNA barcoding and generation of multiple cDNA copies per RNA molecule to eliminate errors introduced during cDNA synthesis, PCR, and sequencing. The stringency of ARC-seq can be scaled to accommodate the quality of input RNAs. We apply ARC-seq to directly assess transcriptome-wide epimutations resulting from RNA polymerase mutants and oxidative stress.
Wood, Natasha; Bhattacharya, Tanmoy; Keele, Brandon F; Giorgi, Elena; Liu, Michael; Gaschen, Brian; Daniels, Marcus; Ferrari, Guido; Haynes, Barton F; McMichael, Andrew; Shaw, George M; Hahn, Beatrice H; Korber, Bette; Seoighe, Cathal
2009-05-01
The pattern of viral diversification in newly infected individuals provides information about the host environment and immune responses typically experienced by the newly transmitted virus. For example, sites that tend to evolve rapidly across multiple early-infection patients could be involved in enabling escape from common early immune responses, could represent adaptation for rapid growth in a newly infected host, or could represent reversion from less fit forms of the virus that were selected for immune escape in previous hosts. Here we investigated the diversification of HIV-1 env coding sequences in 81 very early B subtype infections previously shown to have resulted from transmission or expansion of single viruses (n = 78) or two closely related viruses (n = 3). In these cases, the sequence of the infecting virus can be estimated accurately, enabling inference of both the direction of substitutions as well as distinction between insertion and deletion events. By integrating information across multiple acutely infected hosts, we find evidence of adaptive evolution of HIV-1 env and identify a subset of codon sites that diversified more rapidly than can be explained by a model of neutral evolution. Of 24 such rapidly diversifying sites, 14 were either i) clustered and embedded in CTL epitopes that were verified experimentally or predicted based on the individual's HLA or ii) in a nucleotide context indicative of APOBEC-mediated G-to-A substitutions, despite having excluded heavily hypermutated sequences prior to the analysis. In several cases, a rapidly evolving site was embedded both in an APOBEC motif and in a CTL epitope, suggesting that APOBEC may facilitate early immune escape. Ten rapidly diversifying sites could not be explained by CTL escape or APOBEC hypermutation, including the most frequently mutated site, in the fusion peptide of gp41. We also examined the distribution, extent, and sequence context of insertions and deletions, and we provide evidence that the length variation seen in hypervariable loop regions of the envelope glycoprotein is a consequence of selection and not of mutational hotspots. Our results provide a detailed view of the process of diversification of HIV-1 following transmission, highlighting the role of CTL escape and hypermutation in shaping viral evolution during the establishment of new infections.
Wood, Natasha; Bhattacharya, Tanmoy; Keele, Brandon F.; Giorgi, Elena; Liu, Michael; Gaschen, Brian; Daniels, Marcus; Ferrari, Guido; Haynes, Barton F.; McMichael, Andrew; Shaw, George M.; Hahn, Beatrice H.; Korber, Bette; Seoighe, Cathal
2009-01-01
The pattern of viral diversification in newly infected individuals provides information about the host environment and immune responses typically experienced by the newly transmitted virus. For example, sites that tend to evolve rapidly across multiple early-infection patients could be involved in enabling escape from common early immune responses, could represent adaptation for rapid growth in a newly infected host, or could represent reversion from less fit forms of the virus that were selected for immune escape in previous hosts. Here we investigated the diversification of HIV-1 env coding sequences in 81 very early B subtype infections previously shown to have resulted from transmission or expansion of single viruses (n = 78) or two closely related viruses (n = 3). In these cases, the sequence of the infecting virus can be estimated accurately, enabling inference of both the direction of substitutions as well as distinction between insertion and deletion events. By integrating information across multiple acutely infected hosts, we find evidence of adaptive evolution of HIV-1 env and identify a subset of codon sites that diversified more rapidly than can be explained by a model of neutral evolution. Of 24 such rapidly diversifying sites, 14 were either i) clustered and embedded in CTL epitopes that were verified experimentally or predicted based on the individual's HLA or ii) in a nucleotide context indicative of APOBEC-mediated G-to-A substitutions, despite having excluded heavily hypermutated sequences prior to the analysis. In several cases, a rapidly evolving site was embedded both in an APOBEC motif and in a CTL epitope, suggesting that APOBEC may facilitate early immune escape. Ten rapidly diversifying sites could not be explained by CTL escape or APOBEC hypermutation, including the most frequently mutated site, in the fusion peptide of gp41. We also examined the distribution, extent, and sequence context of insertions and deletions, and we provide evidence that the length variation seen in hypervariable loop regions of the envelope glycoprotein is a consequence of selection and not of mutational hotspots. Our results provide a detailed view of the process of diversification of HIV-1 following transmission, highlighting the role of CTL escape and hypermutation in shaping viral evolution during the establishment of new infections. PMID:19424423
Prophage-mediated defense against viral attack and viral counter-defense
Dedrick, Rebekah M.; Jacobs-Sera, Deborah; Guerrero Bustamante, Carlos A.; Garlena, Rebecca A.; Mavrich, Travis N.; Pope, Welkin H.; Reyes, Juan C Cervantes; Russell, Daniel A.; Adair, Tamarah; Alvey, Richard; Bonilla, J. Alfred; Bricker, Jerald S.; Brown, Bryony R.; Byrnes, Deanna; Cresawn, Steven G.; Davis, William B.; Dickson, Leon A.; Edgington, Nicholas P.; Findley, Ann M.; Golebiewska, Urszula; Grose, Julianne H.; Hayes, Cory F.; Hughes, Lee E.; Hutchison, Keith W.; Isern, Sharon; Johnson, Allison A.; Kenna, Margaret A.; Klyczek, Karen K.; Mageeney, Catherine M.; Michael, Scott F.; Molloy, Sally D.; Montgomery, Matthew T.; Neitzel, James; Page, Shallee T.; Pizzorno, Marie C.; Poxleitner, Marianne K.; Rinehart, Claire A.; Robinson, Courtney J.; Rubin, Michael R.; Teyim, Joseph N.; Vazquez, Edwin; Ware, Vassie C.; Washington, Jacqueline; Hatfull, Graham F.
2017-01-01
Temperate phages are common and prophages are abundant residents of sequenced bacterial genomes. Mycobacteriophages are viruses infecting mycobacterial hosts including Mycobacterium tuberculosis and Mycobacterium smegmatis, encompass substantial genetic diversity, and are commonly temperate. Characterization of ten Cluster N temperate mycobacteriophages reveals at least five distinct prophage-expressed viral defense systems that interfere with infection of lytic and temperate phages that are either closely-related (homotypic defense) or unrelated (heterotypic defense). Target specificity is unpredictable, ranging from a single target phage to one-third of those tested. The defense systems include a single-subunit restriction system, a heterotypic exclusion system, and a predicted (p)ppGpp synthetase, which blocks lytic phage growth, promotes bacterial survival, and enables efficient lysogeny. The predicted (p)ppGpp synthetase coded by the Phrann prophage defends against phage Tweety infection, but Tweety codes for a tetrapeptide repeat protein, gp54, that acts as a highly effective counter-defense system. Prophage-mediated viral defense offers an efficient mechanism for bacterial success in host-virus dynamics, and counter-defense promotes phage co-evolution. PMID:28067906
Belyi, Vladimir A.; Levine, Arnold J.; Skalka, Anna Marie
2010-01-01
Vertebrate genomes contain numerous copies of retroviral sequences, acquired over the course of evolution. Until recently they were thought to be the only type of RNA viruses to be so represented, because integration of a DNA copy of their genome is required for their replication. In this study, an extensive sequence comparison was conducted in which 5,666 viral genes from all known non-retroviral families with single-stranded RNA genomes were matched against the germline genomes of 48 vertebrate species, to determine if such viruses could also contribute to the vertebrate genetic heritage. In 19 of the tested vertebrate species, we discovered as many as 80 high-confidence examples of genomic DNA sequences that appear to be derived, as long ago as 40 million years, from ancestral members of 4 currently circulating virus families with single strand RNA genomes. Surprisingly, almost all of the sequences are related to only two families in the Order Mononegavirales: the Bornaviruses and the Filoviruses, which cause lethal neurological disease and hemorrhagic fevers, respectively. Based on signature landmarks some, and perhaps all, of the endogenous virus-like DNA sequences appear to be LINE element-facilitated integrations derived from viral mRNAs. The integrations represent genes that encode viral nucleocapsid, RNA-dependent-RNA-polymerase, matrix and, possibly, glycoproteins. Integrations are generally limited to one or very few copies of a related viral gene per species, suggesting that once the initial germline integration was obtained (or selected), later integrations failed or provided little advantage to the host. The conservation of relatively long open reading frames for several of the endogenous sequences, the virus-like protein regions represented, and a potential correlation between their presence and a species' resistance to the diseases caused by these pathogens, are consistent with the notion that their products provide some important biological advantage to the species. In addition, the viruses could also benefit, as some resistant species (e.g. bats) may serve as natural reservoirs for their persistence and transmission. Given the stringent limitations imposed in this informatics search, the examples described here should be considered a low estimate of the number of such integration events that have persisted over evolutionary time scales. Clearly, the sources of genetic information in vertebrate genomes are much more diverse than previously suspected. PMID:20686665
Belyi, Vladimir A; Levine, Arnold J; Skalka, Anna Marie
2010-07-29
Vertebrate genomes contain numerous copies of retroviral sequences, acquired over the course of evolution. Until recently they were thought to be the only type of RNA viruses to be so represented, because integration of a DNA copy of their genome is required for their replication. In this study, an extensive sequence comparison was conducted in which 5,666 viral genes from all known non-retroviral families with single-stranded RNA genomes were matched against the germline genomes of 48 vertebrate species, to determine if such viruses could also contribute to the vertebrate genetic heritage. In 19 of the tested vertebrate species, we discovered as many as 80 high-confidence examples of genomic DNA sequences that appear to be derived, as long ago as 40 million years, from ancestral members of 4 currently circulating virus families with single strand RNA genomes. Surprisingly, almost all of the sequences are related to only two families in the Order Mononegavirales: the Bornaviruses and the Filoviruses, which cause lethal neurological disease and hemorrhagic fevers, respectively. Based on signature landmarks some, and perhaps all, of the endogenous virus-like DNA sequences appear to be LINE element-facilitated integrations derived from viral mRNAs. The integrations represent genes that encode viral nucleocapsid, RNA-dependent-RNA-polymerase, matrix and, possibly, glycoproteins. Integrations are generally limited to one or very few copies of a related viral gene per species, suggesting that once the initial germline integration was obtained (or selected), later integrations failed or provided little advantage to the host. The conservation of relatively long open reading frames for several of the endogenous sequences, the virus-like protein regions represented, and a potential correlation between their presence and a species' resistance to the diseases caused by these pathogens, are consistent with the notion that their products provide some important biological advantage to the species. In addition, the viruses could also benefit, as some resistant species (e.g. bats) may serve as natural reservoirs for their persistence and transmission. Given the stringent limitations imposed in this informatics search, the examples described here should be considered a low estimate of the number of such integration events that have persisted over evolutionary time scales. Clearly, the sources of genetic information in vertebrate genomes are much more diverse than previously suspected.
Díaz-Muñoz, Samuel L
2017-01-01
Infection of more than one virus in a host, coinfection, is common across taxa and environments. Viral coinfection can enable genetic exchange, alter the dynamics of infections, and change the course of viral evolution. Yet, a systematic test of the factors explaining variation in viral coinfection across different taxa and environments awaits completion. Here I employ three microbial data sets of virus-host interactions covering cross-infectivity, culture coinfection, and single-cell coinfection (total: 6,564 microbial hosts, 13,103 viruses) to provide a broad, comprehensive picture of the ecological and biological factors shaping viral coinfection. I found evidence that ecology and virus-virus interactions are recurrent factors shaping coinfection patterns. Host ecology was a consistent and strong predictor of coinfection across all three data sets: cross-infectivity, culture coinfection, and single-cell coinfection. Host phylogeny or taxonomy was a less consistent predictor, being weak or absent in the cross-infectivity and single-cell coinfection models, yet it was the strongest predictor in the culture coinfection model. Virus-virus interactions strongly affected coinfection. In the largest test of superinfection exclusion to date, prophage sequences reduced culture coinfection by other prophages, with a weaker effect on extrachromosomal virus coinfection. At the single-cell level, prophage sequences eliminated coinfection. Virus-virus interactions also increased culture coinfection with ssDNA-dsDNA coinfections >2× more likely than ssDNA-only coinfections. The presence of CRISPR spacers was associated with a ∼50% reduction in single-cell coinfection in a marine bacteria, despite the absence of exact spacer matches in any active infection. Collectively, these results suggest the environment bacteria inhabit and the interactions among surrounding viruses are two factors consistently shaping viral coinfection patterns. These findings highlight the role of virus-virus interactions in coinfection with implications for phage therapy, microbiome dynamics, and viral infection treatments.
Divergent evolution of multiple virus-resistance genes from a progenitor in Capsicum spp.
Kim, Saet-Byul; Kang, Won-Hee; Huy, Hoang Ngoc; Yeom, Seon-In; An, Jeong-Tak; Kim, Seungill; Kang, Min-Young; Kim, Hyun Jung; Jo, Yeong Deuk; Ha, Yeaseong; Choi, Doil; Kang, Byoung-Cheorl
2017-01-01
Plants have evolved hundreds of nucleotide-binding and leucine-rich domain proteins (NLRs) as potential intracellular immune receptors, but the evolutionary mechanism leading to the ability to recognize specific pathogen effectors is elusive. Here, we cloned Pvr4 (a Potyvirus resistance gene in Capsicum annuum) and Tsw (a Tomato spotted wilt virus resistance gene in Capsicum chinense) via a genome-based approach using independent segregating populations. The genes both encode typical NLRs and are located at the same locus on pepper chromosome 10. Despite the fact that these two genes recognize completely different viral effectors, the genomic structures and coding sequences of the two genes are strikingly similar. Phylogenetic studies revealed that these two immune receptors diverged from a progenitor gene of a common ancestor. Our results suggest that sequence variations caused by gene duplication and neofunctionalization may underlie the evolution of the ability to specifically recognize different effectors. These findings thereby provide insight into the divergent evolution of plant immune receptors. © 2016 The Authors. New Phytologist © 2016 New Phytologist Trust.
The Effect of Vaccination on the Evolution and Population Dynamics of Avian Paramyxovirus-1
Hudson, Peter J.; Poss, Mary
2010-01-01
Newcastle Disease Virus (NDV) is a pathogenic strain of avian paramyxovirus (aPMV-1) that is among the most serious of disease threats to the poultry industry worldwide. Viral diversity is high in aPMV-1; eight genotypes are recognized based on phylogenetic reconstruction of gene sequences. Modified live vaccines have been developed to decrease the economic losses caused by this virus. Vaccines derived from avirulent genotype II strains were developed in the 1950s and are in use globally, whereas Australian strains belonging to genotype I were developed as vaccines in the 1970s and are used mainly in Asia. In this study, we evaluated the consequences of attenuated live virus vaccination on the evolution of aPMV-1 genotypes. There was phylogenetic incongruence among trees based on individual genes and complete coding region of 54 full length aPMV-1 genomes, suggesting that recombinant sequences were present in the data set. Subsequently, five recombinant genomes were identified, four of which contained sequences from either genotype I or II. The population history of vaccine-related genotype II strains was distinct from other aPMV-1 genotypes; genotype II emerged in the late 19th century and is evolving more slowly than other genotypes, which emerged in the 1960s. Despite vaccination efforts, genotype II viruses have experienced constant population growth to the present. In contrast, other contemporary genotypes showed population declines in the late 1990s. Additionally, genotype I and II viruses, which are circulating in the presence of homotypic vaccine pressure, have unique selection profiles compared to nonvaccine-related strains. Collectively, these data show that vaccination with live attenuated viruses has changed the evolution of aPMV-1 by maintaining a large effective population size of a vaccine-related genotype, allowing for coinfection and recombination of vaccine and wild type strains, and by applying unique selective pressures on viral glycoproteins. PMID:20421950
Johnson, J A; Parra, G I; Levenson, E A; Green, K Y
2017-06-01
Historical outbreaks can be an important source of information in the understanding of norovirus evolution and epidemiology. Here, we revisit an outbreak of undiagnosed gastroenteritis that occurred in Shippensburg, Pennsylvania in 1972. Nearly 5000 people fell ill over the course of 10 days. Symptoms included diarrhea, vomiting, stomach cramps, and fever, lasting for a median of 24 h. Using current techniques, including next-generation sequencing of full-length viral genomic amplicons, we identified an unusual norovirus recombinant (GII.Pg/GII.3) in nine of 15 available stool samples from the outbreak. This particular recombinant virus has not been reported in recent decades, although GII.3 and GII.Pg genotypes have been detected individually in current epidemic strains. The consensus nucleotide sequences were nearly identical among the four viral genomes analysed, although each strain had three to seven positions in the genome with heterogenous non-synonymous nucleotide subpopulations. Two of these resulting amino acid polymorphisms were conserved in frequency among all four cases, consistent with common source exposure and successful transmission of a mixed viral population. Continued investigation of variant nucleotide populations and recombination events among ancestral norovirus strains such as the Shippensburg virus may provide unique insight into the origin of contemporary strains.
Recoding method that removes inhibitory sequences and improves HIV gene expression
DOE Office of Scientific and Technical Information (OSTI.GOV)
Rabadan, Raul; Krasnitz, Michael; Robins, Harlan
The invention relates to inhibitory nucleotide signal sequences or "INS" sequences in the genomes of lentiviruses. In particular the invention relates to the AGG motif present in all viral genomes. The AGG motif may have an inhibitory effect on a virus, for example by reducing the levels of, or maintaining low steady-state levels of, viral RNAs in host cells, and inducing and/or maintaining in viral latency. In one aspect, the invention provides vaccines that contain, or are produced from, viral nucleic acids in which the AGG sequences have been mutated. In another aspect, the invention provides methods and compositions formore » affecting the function of the AGG motif, and methods for identifying other INS sequences in viral genomes.« less
Dynamically correlated mutations drive human Influenza A evolution.
Tria, F; Pompei, S; Loreto, V
2013-01-01
Human Influenza A virus undergoes recurrent changes in the hemagglutinin (HA) surface protein, primarily involved in the human antibody recognition. Relevant antigenic changes, enabling the virus to evade host immune response, have been recognized to occur in parallel to multiple mutations at antigenic sites in HA. Yet, the role of correlated mutations (epistasis) in driving the molecular evolution of the virus still represents a challenging puzzle. Further, though circulation at a global geographic level is key for the survival of Influenza A, its role in shaping the viral phylodynamics remains largely unexplored. Here we show, through a sequence based epidemiological model, that epistatic effects between amino acids substitutions, coupled with a reservoir that mimics worldwide circulating viruses, are key determinants that drive human Influenza A evolution. Our approach explains all the up-to-date observations characterizing the evolution of H3N2 subtype, including phylogenetic properties, nucleotide fixation patterns, and composition of antigenic clusters.
APOBEC3D and APOBEC3F potently promote HIV-1 diversification and evolution in humanized mouse model.
Sato, Kei; Takeuchi, Junko S; Misawa, Naoko; Izumi, Taisuke; Kobayashi, Tomoko; Kimura, Yuichi; Iwami, Shingo; Takaori-Kondo, Akifumi; Hu, Wei-Shau; Aihara, Kazuyuki; Ito, Mamoru; An, Dong Sung; Pathak, Vinay K; Koyanagi, Yoshio
2014-10-01
Several APOBEC3 proteins, particularly APOBEC3D, APOBEC3F, and APOBEC3G, induce G-to-A hypermutations in HIV-1 genome, and abrogate viral replication in experimental systems, but their relative contributions to controlling viral replication and viral genetic variation in vivo have not been elucidated. On the other hand, an HIV-1-encoded protein, Vif, can degrade these APOBEC3 proteins via a ubiquitin/proteasome pathway. Although APOBEC3 proteins have been widely considered as potent restriction factors against HIV-1, it remains unclear which endogenous APOBEC3 protein(s) affect HIV-1 propagation in vivo. Here we use a humanized mouse model and HIV-1 with mutations in Vif motifs that are responsible for specific APOBEC3 interactions, DRMR/AAAA (4A) or YRHHY/AAAAA (5A), and demonstrate that endogenous APOBEC3D/F and APOBEC3G exert strong anti-HIV-1 activity in vivo. We also show that the growth kinetics of 4A HIV-1 negatively correlated with the expression level of APOBEC3F. Moreover, single genome sequencing analyses of viral RNA in plasma of infected mice reveal that 4A HIV-1 is specifically and significantly diversified. Furthermore, a mutated virus that is capable of using both CCR5 and CXCR4 as entry coreceptor is specifically detected in 4A HIV-1-infected mice. Taken together, our results demonstrate that APOBEC3D/F and APOBEC3G fundamentally work as restriction factors against HIV-1 in vivo, but at the same time, that APOBEC3D and APOBEC3F are capable of promoting viral diversification and evolution in vivo.
MOLECULAR EVOLUTION OF WEST NILE VIRUS IN A NORTHERN TEMPERATE REGION: CONNECTICUT, USA 1999–2008
Armstrong, Philip M.; Vossbrinck, Charles R.; Andreadis, Theodore G.; Anderson, John F.; Pesko, Kendra N.; Newman, Ruchi M.; Lennon, Niall J.; Birren, Bruce W.; Ebel, Gregory D.; Henn, Mathew R.
2011-01-01
West Nile virus (WNV) has become firmly established in northeastern U.S., reemerging every summer since its introduction into North America in 1999. To determine whether WNV overwinters locally or is reseeded annually, we examined the patterns of viral lineage persistence and replacement in Connecticut over 10 consecutive transmission seasons by phylogenetic analysis. In addition, we compared the full protein coding sequence among WNV isolates to search for evidence of convergent and adaptive evolution. Viruses sampled from Connecticut segregated into a number of well-supported subclades by year of isolation with few clades persisting ≥2 years. Similar viral strains were dispersed in different locations across the state and divergent strains appeared within a single location during a single transmission season, implying widespread movement and rapid colonization of virus. Numerous amino acid substitutions arose in the population but only one change, V→A at position 159 of the envelope protein, became permanently fixed. Several instances of parallel evolution were identified in independent lineages, including one amino acid change in the NS4A protein that appears to bepositively selected. Our results suggest that annual reemergence of WNV is driven by both reintroduction and local-overwintering of virus. Despite ongoing evolution of WNV, most amino acid variants occurred at low frequencies and were transient in the virus population. PMID:21723580
Continuous Influx of Genetic Material from Host to Virus Populations
Gilbert, Clément; Peccoud, Jean; Chateigner, Aurélien; Moumen, Bouziane
2016-01-01
Many genes of large double-stranded DNA viruses have a cellular origin, suggesting that host-to-virus horizontal transfer (HT) of DNA is recurrent. Yet, the frequency of these transfers has never been assessed in viral populations. Here we used ultra-deep DNA sequencing of 21 baculovirus populations extracted from two moth species to show that a large diversity of moth DNA sequences (n = 86) can integrate into viral genomes during the course of a viral infection. The majority of the 86 different moth DNA sequences are transposable elements (TEs, n = 69) belonging to 10 superfamilies of DNA transposons and three superfamilies of retrotransposons. The remaining 17 sequences are moth sequences of unknown nature. In addition to bona fide DNA transposition, we uncover microhomology-mediated recombination as a mechanism explaining integration of moth sequences into viral genomes. Many sequences integrated multiple times at multiple positions along the viral genome. We detected a total of 27,504 insertions of moth sequences in the 21 viral populations and we calculate that on average, 4.8% of viruses harbor at least one moth sequence in these populations. Despite this substantial proportion, no insertion of moth DNA was maintained in any viral population after 10 successive infection cycles. Hence, there is a constant turnover of host DNA inserted into viral genomes each time the virus infects a moth. Finally, we found that at least 21 of the moth TEs integrated into viral genomes underwent repeated horizontal transfers between various insect species, including some lepidopterans susceptible to baculoviruses. Our results identify host DNA influx as a potent source of genetic diversity in viral populations. They also support a role for baculoviruses as vectors of DNA HT between insects, and call for an evaluation of possible gene or TE spread when using viruses as biopesticides or gene delivery vectors. PMID:26829124
Continuous Influx of Genetic Material from Host to Virus Populations.
Gilbert, Clément; Peccoud, Jean; Chateigner, Aurélien; Moumen, Bouziane; Cordaux, Richard; Herniou, Elisabeth A
2016-02-01
Many genes of large double-stranded DNA viruses have a cellular origin, suggesting that host-to-virus horizontal transfer (HT) of DNA is recurrent. Yet, the frequency of these transfers has never been assessed in viral populations. Here we used ultra-deep DNA sequencing of 21 baculovirus populations extracted from two moth species to show that a large diversity of moth DNA sequences (n = 86) can integrate into viral genomes during the course of a viral infection. The majority of the 86 different moth DNA sequences are transposable elements (TEs, n = 69) belonging to 10 superfamilies of DNA transposons and three superfamilies of retrotransposons. The remaining 17 sequences are moth sequences of unknown nature. In addition to bona fide DNA transposition, we uncover microhomology-mediated recombination as a mechanism explaining integration of moth sequences into viral genomes. Many sequences integrated multiple times at multiple positions along the viral genome. We detected a total of 27,504 insertions of moth sequences in the 21 viral populations and we calculate that on average, 4.8% of viruses harbor at least one moth sequence in these populations. Despite this substantial proportion, no insertion of moth DNA was maintained in any viral population after 10 successive infection cycles. Hence, there is a constant turnover of host DNA inserted into viral genomes each time the virus infects a moth. Finally, we found that at least 21 of the moth TEs integrated into viral genomes underwent repeated horizontal transfers between various insect species, including some lepidopterans susceptible to baculoviruses. Our results identify host DNA influx as a potent source of genetic diversity in viral populations. They also support a role for baculoviruses as vectors of DNA HT between insects, and call for an evaluation of possible gene or TE spread when using viruses as biopesticides or gene delivery vectors.
IMG/VR: a database of cultured and uncultured DNA Viruses and retroviruses.
Paez-Espino, David; Chen, I-Min A; Palaniappan, Krishna; Ratner, Anna; Chu, Ken; Szeto, Ernest; Pillay, Manoj; Huang, Jinghua; Markowitz, Victor M; Nielsen, Torben; Huntemann, Marcel; K Reddy, T B; Pavlopoulos, Georgios A; Sullivan, Matthew B; Campbell, Barbara J; Chen, Feng; McMahon, Katherine; Hallam, Steve J; Denef, Vincent; Cavicchioli, Ricardo; Caffrey, Sean M; Streit, Wolfgang R; Webster, John; Handley, Kim M; Salekdeh, Ghasem H; Tsesmetzis, Nicolas; Setubal, Joao C; Pope, Phillip B; Liu, Wen-Tso; Rivers, Adam R; Ivanova, Natalia N; Kyrpides, Nikos C
2017-01-04
Viruses represent the most abundant life forms on the planet. Recent experimental and computational improvements have led to a dramatic increase in the number of viral genome sequences identified primarily from metagenomic samples. As a result of the expanding catalog of metagenomic viral sequences, there exists a need for a comprehensive computational platform integrating all these sequences with associated metadata and analytical tools. Here we present IMG/VR (https://img.jgi.doe.gov/vr/), the largest publicly available database of 3908 isolate reference DNA viruses with 264 413 computationally identified viral contigs from >6000 ecologically diverse metagenomic samples. Approximately half of the viral contigs are grouped into genetically distinct quasi-species clusters. Microbial hosts are predicted for 20 000 viral sequences, revealing nine microbial phyla previously unreported to be infected by viruses. Viral sequences can be queried using a variety of associated metadata, including habitat type and geographic location of the samples, or taxonomic classification according to hallmark viral genes. IMG/VR has a user-friendly interface that allows users to interrogate all integrated data and interact by comparing with external sequences, thus serving as an essential resource in the viral genomics community. © The Author(s) 2016. Published by Oxford University Press on behalf of Nucleic Acids Research.
Zika virus evolution and spread in the Americas
Metsky, Hayden C.; Matranga, Christian B.; Wohl, Shirlee; Schaffner, Stephen F.; Freije, Catherine A.; Winnicki, Sarah M.; West, Kendra; Qu, James; Baniecki, Mary Lynn; Gladden-Young, Adrianne; Lin, Aaron E.; Tomkins-Tinch, Christopher H.; Ye, Simon H.; Park, Daniel J.; Luo, Cynthia Y.; Barnes, Kayla G.; Shah, Rickey R.; Chak, Bridget; Barbosa-Lima, Giselle; Delatorre, Edson; Vieira, Yasmine R.; Paul, Lauren M.; Tan, Amanda L.; Barcellona, Carolyn M.; Porcelli, Mario C.; Vasquez, Chalmers; Cannons, Andrew C.; Cone, Marshall R.; Hogan, Kelly N.; Kopp, Edgar W.; Anzinger, Joshua J.; Garcia, Kimberly F.; Parham, Leda A.; Gélvez Ramírez, Rosa M.; Miranda Montoya, Maria C.; Rojas, Diana P.; Brown, Catherine M.; Hennigan, Scott; Sabina, Brandon; Scotland, Sarah; Gangavarapu, Karthik; Grubaugh, Nathan D.; Oliveira, Glenn; Robles-Sikisaka, Refugio; Rambaut, Andrew; Gehrke, Lee; Smole, Sandra; Halloran, M. Elizabeth; Villar, Luis; Mattar, Salim; Lorenzana, Ivette; Cerbino-Neto, Jose; Valim, Clarissa; Degrave, Wim; Bozza, Patricia T.; Gnirke, Andreas; Andersen, Kristian G.; Isern, Sharon; Michael, Scott F.; Bozza, Fernando A.; Souza, Thiago M. L.; Bosch, Irene; Yozwiak, Nathan L.; MacInnis, Bronwyn L.; Sabeti, Pardis C.
2017-01-01
Although the recent Zika virus (ZIKV) epidemic in the Americas and its link to birth defects have attracted a great deal of attention1,2, much remains unknown about ZIKV disease epidemiology and ZIKV evolution, in part owing to a lack of genomic data. Here we address this gap in knowledge by using multiple sequencing approaches to generate 110 ZIKV genomes from clinical and mosquito samples from 10 countries and territories, greatly expanding the observed viral genetic diversity from this outbreak. We analysed the timing and patterns of introductions into distinct geographic regions; our phylogenetic evidence suggests rapid expansion of the outbreak in Brazil and multiple introductions of outbreak strains into Puerto Rico, Honduras, Colombia, other Caribbean islands, and the continental United States. We find that ZIKV circulated undetected in multiple regions for many months before the first locally transmitted cases were confirmed, highlighting the importance of surveillance of viral infections. We identify mutations with possible functional implications for ZIKV biology and pathogenesis, as well as those that might be relevant to the effectiveness of diagnostic tests. PMID:28538734
A B cell follicle sanctuary permits persistent productive SIV infection in elite controllers
Fukazawa, Yoshinori; Lum, Richard; Okoye, Afam A.; Park, Haesun; Matsuda, Kenta; Bae, Jin Young; Hagen, Shoko I.; Shoemaker, Rebecca; Deleage, Claire; Lucero, Carissa; Morcock, David; Swanson, Tonya; Legasse, Alfred W.; Axthelm, Michael K.; Hesselgesser, Joseph; Geleziunas, Romas; Hirsch, Vanessa M.; Edlefsen, Paul T.; Piatak, Michael; Estes, Jacob D.; Lifson, Jeffrey D.; Picker, Louis J.
2014-01-01
Chronic phase HIV/SIV replication is reduced by as much as 10,000-fold in elite controllers (EC) compared to typical progressors, but sufficient viral replication persists in EC tissues to allow viral sequence evolution and induce excess immune activation. Here, we show that productive SIV infection in rhesus monkey EC is strikingly restricted to follicular helper CD4+ T cells (TFH), suggesting that while the potent SIV-specific CD8+ T cells of these monkeys can effectively clear productive infection from extra-follicular sites, their relative exclusion from B cell follicles limits elimination of infected TFH. Indeed, CD8+ lymphocyte depletion of EC monkeys resulted in a dramatic re-distribution of productive SIV infection to non-TFH, with TFH restriction resuming upon CD8+ T cell recovery. Thus, B cell follicles constitute sanctuaries for persistent SIV replication in the presence of potent anti-viral CD8+ T cell responses, potentially complicating efforts to cure HIV infection with therapeutic vaccination or T cell immunotherapy. PMID:25599132
Patterns of amino acid conservation in human and animal immunodeficiency viruses.
Voitenko, Olga S; Dhroso, Andi; Feldmann, Anna; Korkin, Dmitry; Kalinina, Olga V
2016-09-01
Due to their high genomic variability, RNA viruses and retroviruses present a unique opportunity for detailed study of molecular evolution. Lentiviruses, with HIV being a notable example, are one of the best studied viral groups: hundreds of thousands of sequences are available together with experimentally resolved three-dimensional structures for most viral proteins. In this work, we use these data to study specific patterns of evolution of the viral proteins, and their relationship to protein interactions and immunogenicity. We propose a method for identification of two types of surface residues clusters with abnormal conservation: extremely conserved and extremely variable clusters. We identify them on the surface of proteins from HIV and other animal immunodeficiency viruses. Both types of clusters are overrepresented on the interaction interfaces of viral proteins with other proteins, nucleic acids or low molecular-weight ligands, both in the viral particle and between the virus and its host. In the immunodeficiency viruses, the interaction interfaces are not more conserved than the corresponding proteins on an average, and we show that extremely conserved clusters coincide with protein-protein interaction hotspots, predicted as the residues with the largest energetic contribution to the interaction. Extremely variable clusters have been identified here for the first time. In the HIV-1 envelope protein gp120, they overlap with known antigenic sites. These antigenic sites also contain many residues from extremely conserved clusters, hence representing a unique interacting interface enriched both in extremely conserved and in extremely variable clusters of residues. This observation may have important implication for antiretroviral vaccine development. A Python package is available at https://bioinf.mpi-inf.mpg.de/publications/viral-ppi-pred/ voitenko@mpi-inf.mpg.de or kalinina@mpi-inf.mpg.de Supplementary data are available at Bioinformatics online. © The Author 2016. Published by Oxford University Press. All rights reserved. For Permissions, please e-mail: journals.permissions@oup.com.
Franzo, Giovanni; Tucciarone, Claudia M; Dotto, Giorgia; Gigli, Alessandra; Ceglie, Letizia; Drigo, Michele
2015-06-01
Porcine circovirus type 2 is one of the most widespread and economically relevant infections of swine. Four genotypes have been recognized, but currently, only three (PCV2a, PCV2b and PCV2d) are effectively circulating. The widespread livestock trade and rapid viral evolution have contributed to determining the high heterogeneity of PCV2 and the dispersal of potentially more virulent strains. Italian swine farming and the related processing industry are relevant in the national economy. Despite the noteworthy losses associated with direct and control measure costs, no data are currently available on the molecular epidemiology of PCV2 in Italy. Our study, which was intended to fill this gap, considered 75 completed genome PCV2 sequences, which were obtained from samples collected from the highly densely populated area of Northern Italy between 2007 and 2014. Phylogenetic analysis and comparison with reference sequences demonstrated the co-circulation, with different prevalences, of PCV2a, PCV2b and PCV2d within the national borders, with PCV2b being the most prevalent. Recombination between different genotypes was also proven to be frequent. Phylogeographic analysis demonstrated that the marked variability of Italian PCV2 strains can be attributable to multiple introduction events. The comparison of the phylogenetic analysis results, the location of different haplotypes and the international commercial routs of live pigs allow the speculation of several links as well as the role of Italy as both an importer and exporter of PCV2 haplotypes, mainly from and to European and Asian countries. A similarly intricate contact network was demonstrated within national borders, with different haplotypes being detected in the same province and different provinces harbouring the same haplotype. Overall, this paper represents the first description of PCV2 in Italy and demonstrates that the high variability of circulating Italian strains is due to multiple introduction events, wide circulation within national boundaries and rapid viral evolution. Copyright © 2015 Elsevier B.V. All rights reserved.
Discovery of DNA viruses in wild-caught mosquitoes using small RNA high throughput sequencing.
Ma, Maijuan; Huang, Yong; Gong, Zhengda; Zhuang, Lu; Li, Cun; Yang, Hong; Tong, Yigang; Liu, Wei; Cao, Wuchun
2011-01-01
Mosquito-borne infectious diseases pose a severe threat to public health in many areas of the world. Current methods for pathogen detection and surveillance are usually dependent on prior knowledge of the etiologic agents involved. Hence, efficient approaches are required for screening wild mosquito populations to detect known and unknown pathogens. In this study, we explored the use of Next Generation Sequencing to identify viral agents in wild-caught mosquitoes. We extracted total RNA from different mosquito species from South China. Small 18-30 bp length RNA molecules were purified, reverse-transcribed into cDNA and sequenced using Illumina GAIIx instrumentation. Bioinformatic analyses to identify putative viral agents were conducted and the results confirmed by PCR. We identified a non-enveloped single-stranded DNA densovirus in the wild-caught Culex pipiens molestus mosquitoes. The majority of the viral transcripts (.>80% of the region) were covered by the small viral RNAs, with a few peaks of very high coverage obtained. The +/- strand sequence ratio of the small RNAs was approximately 7∶1, indicating that the molecules were mainly derived from the viral RNA transcripts. The small viral RNAs overlapped, enabling contig assembly of the viral genome sequence. We identified some small RNAs in the reverse repeat regions of the viral 5'- and 3' -untranslated regions where no transcripts were expected. Our results demonstrate for the first time that high throughput sequencing of small RNA is feasible for identifying viral agents in wild-caught mosquitoes. Our results show that it is possible to detect DNA viruses by sequencing the small RNAs obtained from insects, although the underlying mechanism of small viral RNA biogenesis is unclear. Our data and those of other researchers show that high throughput small RNA sequencing can be used for pathogen surveillance in wild mosquito vectors.
Goldsmith, Dawn B.; Parsons, Rachel J.; Beyene, Damitu; Salamon, Peter
2015-01-01
Deep sequencing of the viral phoH gene, a host-derived auxiliary metabolic gene, was used to track viral diversity throughout the water column at the Bermuda Atlantic Time-series Study (BATS) site in the summer (September) and winter (March) of three years. Viral phoH sequences reveal differences in the viral communities throughout a depth profile and between seasons in the same year. Variation was also detected between the same seasons in subsequent years, though these differences were not as great as the summer/winter distinctions. Over 3,600 phoH operational taxonomic units (OTUs; 97% sequence identity) were identified. Despite high richness, most phoH sequences belong to a few large, common OTUs whereas the majority of the OTUs are small and rare. While many OTUs make sporadic appearances at just a few times or depths, a small number of OTUs dominate the community throughout the seasons, depths, and years. PMID:26157645
Evolution of simeprevir-resistant variants over time by ultra-deep sequencing in HCV genotype 1b.
Akuta, Norio; Suzuki, Fumitaka; Sezaki, Hitomi; Suzuki, Yoshiyuki; Hosaka, Tetsuya; Kobayashi, Masahiro; Kobayashi, Mariko; Saitoh, Satoshi; Ikeda, Kenji; Kumada, Hiromitsu
2014-08-01
Using ultra-deep sequencing technology, the present study was designed to investigate the evolution of simeprevir-resistant variants (amino acid substitutions of aa80, aa155, aa156, and aa168 positions in HCV NS3 region) over time. In Toranomon Hospital, 18 Japanese patients infected with HCV genotype 1b, received triple therapy of simeprevir/PEG-IFN/ribavirin (DRAGON or CONCERT study). Sustained virological response rate was 67%, and that was significantly higher in patients with IL28B rs8099917 TT than in those with non-TT. Six patients, who did not achieve sustained virological response, were tested for resistant variants by ultra-deep sequencing, at the baseline, at the time of re-elevation of viral loads, and at 96 weeks after the completion of treatment. Twelve of 18 resistant variants, detected at re-elevation of viral load, were de novo resistant variants. Ten of 12 de novo resistant variants become undetectable over time, and that five of seven resistant variants, detected at baseline, persisted over time. In one patient, variants of Q80R at baseline (0.3%) increased at 96-week after the cessation of treatment (10.2%), and de novo resistant variants of D168E (0.3%) also increased at 96-week after the cessation of treatment (9.7%). In conclusion, the present study indicates that the emergence of simeprevir-resistant variants after the start of treatment could not be predicted at baseline, and the majority of de novo resistant variants become undetectable over time. Further large-scale prospective studies should be performed to investigate the clinical utility in detecting simeprevir-resistant variants. © 2014 Wiley Periodicals, Inc.
Duchêne, Sebastián; Duchêne, David; Holmes, Edward C; Ho, Simon Y W
2015-07-01
Rates and timescales of viral evolution can be estimated using phylogenetic analyses of time-structured molecular sequences. This involves the use of molecular-clock methods, calibrated by the sampling times of the viral sequences. However, the spread of these sampling times is not always sufficient to allow the substitution rate to be estimated accurately. We conducted Bayesian phylogenetic analyses of simulated virus data to evaluate the performance of the date-randomization test, which is sometimes used to investigate whether time-structured data sets have temporal signal. An estimate of the substitution rate passes this test if its mean does not fall within the 95% credible intervals of rate estimates obtained using replicate data sets in which the sampling times have been randomized. We find that the test sometimes fails to detect rate estimates from data with no temporal signal. This error can be minimized by using a more conservative criterion, whereby the 95% credible interval of the estimate with correct sampling times should not overlap with those obtained with randomized sampling times. We also investigated the behavior of the test when the sampling times are not uniformly distributed throughout the tree, which sometimes occurs in empirical data sets. The test performs poorly in these circumstances, such that a modification to the randomization scheme is needed. Finally, we illustrate the behavior of the test in analyses of nucleotide sequences of cereal yellow dwarf virus. Our results validate the use of the date-randomization test and allow us to propose guidelines for interpretation of its results. © The Author 2015. Published by Oxford University Press on behalf of the Society for Molecular Biology and Evolution. All rights reserved. For permissions, please e-mail: journals.permissions@oup.com.
Experimental evidence that RNA recombination occurs in the Japanese encephalitis virus
DOE Office of Scientific and Technical Information (OSTI.GOV)
Chuang, C.-K.; Chen, W.-J., E-mail: wjchen@mail.cgu.edu.t; Department of Public Health and Parasitology, Chang Gung University, Kwei-San, Tao-Yuan 33332, Taiwan
2009-11-25
Due to the lack of a proofreading function and error-repairing ability of genomic RNA, accumulated mutations are known to be a force driving viral evolution in the genus Flavivirus, including the Japanese encephalitis (JE) virus. Based on sequencing data, RNA recombination was recently postulated to be another factor associated with genomic variations in these viruses. We herein provide experimental evidence to demonstrate the occurrence of RNA recombination in the JE virus using two local pure clones (T1P1-S1 and CJN-S1) respectively derived from the local strains, T1P1 and CJN. Based on results from a restriction fragment length polymorphism (RFLP) assay onmore » the C/preM junction comprising a fragment of 868 nucleotides (nt 10-877), the recombinant progeny virus was primarily formed in BHK-21 cells that had been co-infected with the two clones used in this study. Nine of 20 recombinant forms of the JE virus had a crossover in the nt 123-323 region. Sequencing data derived from these recombinants revealed that no nucleotide deletion or insertion occurred in this region favoring crossovers, indicating that precisely, not aberrantly, homologous recombination was involved. With site-directed mutagenesis, three stem-loop secondary structures were destabilized and re-stabilized in sequence, leading to changes in the frequency of recombination. This suggests that the conformation, not the free energy, of the secondary structure is important in modulating RNA recombination of the virus. It was concluded that because RNA recombination generates genetic diversity in the JE virus, this must be considered particularly in studies of viral evolution, epidemiology, and possible vaccine safety.« less
The Sorcerer II Global Ocean Sampling expedition: expanding the universe of protein families.
Yooseph, Shibu; Sutton, Granger; Rusch, Douglas B; Halpern, Aaron L; Williamson, Shannon J; Remington, Karin; Eisen, Jonathan A; Heidelberg, Karla B; Manning, Gerard; Li, Weizhong; Jaroszewski, Lukasz; Cieplak, Piotr; Miller, Christopher S; Li, Huiying; Mashiyama, Susan T; Joachimiak, Marcin P; van Belle, Christopher; Chandonia, John-Marc; Soergel, David A; Zhai, Yufeng; Natarajan, Kannan; Lee, Shaun; Raphael, Benjamin J; Bafna, Vineet; Friedman, Robert; Brenner, Steven E; Godzik, Adam; Eisenberg, David; Dixon, Jack E; Taylor, Susan S; Strausberg, Robert L; Frazier, Marvin; Venter, J Craig
2007-03-01
Metagenomics projects based on shotgun sequencing of populations of micro-organisms yield insight into protein families. We used sequence similarity clustering to explore proteins with a comprehensive dataset consisting of sequences from available databases together with 6.12 million proteins predicted from an assembly of 7.7 million Global Ocean Sampling (GOS) sequences. The GOS dataset covers nearly all known prokaryotic protein families. A total of 3,995 medium- and large-sized clusters consisting of only GOS sequences are identified, out of which 1,700 have no detectable homology to known families. The GOS-only clusters contain a higher than expected proportion of sequences of viral origin, thus reflecting a poor sampling of viral diversity until now. Protein domain distributions in the GOS dataset and current protein databases show distinct biases. Several protein domains that were previously categorized as kingdom specific are shown to have GOS examples in other kingdoms. About 6,000 sequences (ORFans) from the literature that heretofore lacked similarity to known proteins have matches in the GOS data. The GOS dataset is also used to improve remote homology detection. Overall, besides nearly doubling the number of current proteins, the predicted GOS proteins also add a great deal of diversity to known protein families and shed light on their evolution. These observations are illustrated using several protein families, including phosphatases, proteases, ultraviolet-irradiation DNA damage repair enzymes, glutamine synthetase, and RuBisCO. The diversity added by GOS data has implications for choosing targets for experimental structure characterization as part of structural genomics efforts. Our analysis indicates that new families are being discovered at a rate that is linear or almost linear with the addition of new sequences, implying that we are still far from discovering all protein families in nature.
Ruan, Yi Jun; Wei, Chia Lin; Ee, Ai Ling; Vega, Vinsensius B; Thoreau, Herve; Su, Se Thoe Yun; Chia, Jer-Ming; Ng, Patrick; Chiu, Kuo Ping; Lim, Landri; Zhang, Tao; Peng, Chan Kwai; Lin, Ean Oon Lynette; Lee, Ng Mah; Yee, Sin Leo; Ng, Lisa F P; Chee, Ren Ee; Stanton, Lawrence W; Long, Philip M; Liu, Edison T
2003-05-24
The cause of severe acute respiratory syndrome (SARS) has been identified as a new coronavirus. Whole genome sequence analysis of various isolates might provide an indication of potential strain differences of this new virus. Moreover, mutation analysis will help to develop effective vaccines. We sequenced the entire SARS viral genome of cultured isolates from the index case (SIN2500) presenting in Singapore, from three primary contacts (SIN2774, SIN2748, and SIN2677), and one secondary contact (SIN2679). These sequences were compared with the isolates from Canada (TOR2), Hong Kong (CUHK-W1 and HKU39849), Hanoi (URBANI), Guangzhou (GZ01), and Beijing (BJ01, BJ02, BJ03, BJ04). We identified 129 sequence variations among the 14 isolates, with 16 recurrent variant sequences. Common variant sequences at four loci define two distinct genotypes of the SARS virus. One genotype was linked with infections originating in Hotel M in Hong Kong, the second contained isolates from Hong Kong, Guangzhou, and Beijing with no association with Hotel M (p<0.0001). Moreover, other common sequence variants further distinguished the geographical origins of the isolates, especially between Singapore and Beijing. Despite the recent onset of the SARS epidemic, genetic signatures are emerging that partition the worldwide SARS viral isolates into groups on the basis of contact source history and geography. These signatures can be used to trace sources of infection. In addition, a common variant associated with a non-conservative aminoacid change in the S1 region of the spike protein, suggests that immunological pressures might be starting to influence the evolution of the SARS virus in human populations.
Inferring HIV-1 Transmission Dynamics in Germany From Recently Transmitted Viruses.
Pouran Yousef, Kaveh; Meixenberger, Karolin; Smith, Maureen R; Somogyi, Sybille; Gromöller, Silvana; Schmidt, Daniel; Gunsenheimer-Bartmeyer, Barbara; Hamouda, Osamah; Kücherer, Claudia; von Kleist, Max
2016-11-01
Although HIV continues to spread globally, novel intervention strategies such as treatment as prevention (TasP) may bring the epidemic to a halt. However, their effective implementation requires a profound understanding of the underlying transmission dynamics. We analyzed parameters of the German HIV epidemic based on phylogenetic clustering of viral sequences from recently infected seroconverters with known infection dates. Viral baseline and follow-up pol sequences (n = 1943) from 1159 drug-naïve individuals were selected from a nationwide long-term observational study initiated in 1997. Putative transmission clusters were computed based on a maximum likelihood phylogeny. Using individual follow-up sequences, we optimized our clustering threshold to maximize the likelihood of co-clustering individuals connected by direct transmission. The sizes of putative transmission clusters scaled inversely with their abundance and their distribution exhibited a heavy tail. Clusters based on the optimal clustering threshold were significantly more likely to contain members of the same or bordering German federal states. Interinfection times between co-clustered individuals were significantly shorter (26 weeks; interquartile range: 13-83) than in a null model. Viral intraindividual evolution may be used to select criteria that maximize co-clustering of transmission pairs in the absence of strong adaptive selection pressure. Interinfection times of co-clustered individuals may then be an indicator of the typical time to onward transmission. Our analysis suggests that onward transmission may have occurred early after infection, when individuals are typically unaware of their serological status. The latter argues that TasP should be combined with HIV testing campaigns to reduce the possibility of transmission before TasP initiation.
The Discovery, Distribution, and Evolution of Viruses Associated with Drosophila melanogaster
Webster, Claire L.; Waldron, Fergal M.; Robertson, Shaun; Crowson, Daisy; Ferrari, Giada; Quintana, Juan F.; Brouqui, Jean-Michel; Bayne, Elizabeth H.; Longdon, Ben; Buck, Amy H.; Lazzaro, Brian P.; Akorli, Jewelna; Haddrill, Penelope R.; Obbard, Darren J.
2015-01-01
Drosophila melanogaster is a valuable invertebrate model for viral infection and antiviral immunity, and is a focus for studies of insect-virus coevolution. Here we use a metagenomic approach to identify more than 20 previously undetected RNA viruses and a DNA virus associated with wild D. melanogaster. These viruses not only include distant relatives of known insect pathogens but also novel groups of insect-infecting viruses. By sequencing virus-derived small RNAs, we show that the viruses represent active infections of Drosophila. We find that the RNA viruses differ in the number and properties of their small RNAs, and we detect both siRNAs and a novel miRNA from the DNA virus. Analysis of small RNAs also allows us to identify putative viral sequences that lack detectable sequence similarity to known viruses. By surveying >2,000 individually collected wild adult Drosophila we show that more than 30% of D. melanogaster carry a detectable virus, and more than 6% carry multiple viruses. However, despite a high prevalence of the Wolbachia endosymbiont—which is known to be protective against virus infections in Drosophila—we were unable to detect any relationship between the presence of Wolbachia and the presence of any virus. Using publicly available RNA-seq datasets, we show that the community of viruses in Drosophila laboratories is very different from that seen in the wild, but that some of the newly discovered viruses are nevertheless widespread in laboratory lines and are ubiquitous in cell culture. By sequencing viruses from individual wild-collected flies we show that some viruses are shared between D. melanogaster and D. simulans. Our results provide an essential evolutionary and ecological context for host–virus interaction in Drosophila, and the newly reported viral sequences will help develop D. melanogaster further as a model for molecular and evolutionary virus research. PMID:26172158
Viral genome structures, charge, and sequences are optimal for capsid assembly
NASA Astrophysics Data System (ADS)
Hagan, Michael
2014-03-01
For many viruses, the spontaneous assembly of a capsid shell around the nu-cleic acid (NA) genome is an essential step in the viral life cycle. Capsid formation is a multicomponent, out-of-equilibrium assembly process for which kinetic effects and thermodynamic constraints compete to determine the outcome. Understand-ing how viral components drive highly efficient assembly under these constraints could promote biomedical efforts to block viral propagation, and would elucidate the factors controlling assembly in a wide range of systems containing proteins and polyelectrolytes. This talk will describe coarse-grained models of capsid proteins and NAs with which we investigate the dynamics and thermodynamics of virus assembly. In con-trast to recent theoretical models, we find that capsids spontaneously `overcharge' that is, the NA length which is kinetically and thermodynamically optimal possess-es a negative charge greater than the positive charge of the capsid. When applied to specific virus capsids, the calculated optimal NA lengths closely correspond to the natural viral genome lengths. These results suggest that the features included in this model (i.e. electrostatics, excluded volume, and NA tertiary structure) play key roles in determining assembly thermodynamics and consequently exert selec-tive pressure on viral evolution. I will then discuss mechanisms by which se-quence-specific interactions between NAs and capsid proteins promote selective encapsidation of the viral genome. This work was supported by NIH R01GM108021 and the Brandeis MRSEC NSF-MRSEC-0820492.
Do, Hai Quynh; Trinh, Dinh Thau; Nguyen, Thi Lan; Vu, Thi Thu Hang; Than, Duc Duong; Van Lo, Thi; Yeom, Minjoo; Song, Daesub; Choe, SeEun; An, Dong-Jun; Le, Van Phan
2016-11-17
Porcine respiratory and reproductive syndrome (PRRS) virus is one of the most economically significant pathogens in the Vietnamese swine industry. ORF5, which participates in many functional processes, including virion assembly, entry of the virus into the host cell, and viral adaptation to the host immune response, has been widely used in molecular evolution and phylogeny studies. Knowing of molecular evolution of PRRSV fields strains might contribute to PRRS control in Vietnam. The results showed that phylogenetic analysis indicated that all strains belonged to sub-lineages 8.7 and 5.1. The nucleotide and amino acid identities between strains were 84.5-100% and 82-100%, respectively. Furthermore, the results revealed differences in nucleotide and amino acid identities between the 2 sub-lineage groups. N-glycosylation prediction identified 7 potential N-glycosylation sites and 11 glycotypes. Analyses of the GP5 sequences, revealed 7 sites under positive selective pressure and 25 under negative selective pressure. Phylogenetic analysis based on ORF5 sequence indicated the diversity of PRRSV in Vietnam. Furthermore, the variance of N-glycosylation sites and position under selective pressure were demonstrated. This study expands existing knowledge on the genetic diversity and evolution of PRRSV in Vietnam and assists the effective strategies for PRRS vaccine development in Vietnam.
Du, Yushen; Wu, Nicholas C.; Jiang, Lin; Zhang, Tianhao; Gong, Danyang; Shu, Sara; Wu, Ting-Ting
2016-01-01
ABSTRACT Identification and annotation of functional residues are fundamental questions in protein sequence analysis. Sequence and structure conservation provides valuable information to tackle these questions. It is, however, limited by the incomplete sampling of sequence space in natural evolution. Moreover, proteins often have multiple functions, with overlapping sequences that present challenges to accurate annotation of the exact functions of individual residues by conservation-based methods. Using the influenza A virus PB1 protein as an example, we developed a method to systematically identify and annotate functional residues. We used saturation mutagenesis and high-throughput sequencing to measure the replication capacity of single nucleotide mutations across the entire PB1 protein. After predicting protein stability upon mutations, we identified functional PB1 residues that are essential for viral replication. To further annotate the functional residues important to the canonical or noncanonical functions of viral RNA-dependent RNA polymerase (vRdRp), we performed a homologous-structure analysis with 16 different vRdRp structures. We achieved high sensitivity in annotating the known canonical polymerase functional residues. Moreover, we identified a cluster of noncanonical functional residues located in the loop region of the PB1 β-ribbon. We further demonstrated that these residues were important for PB1 protein nuclear import through the interaction with Ran-binding protein 5. In summary, we developed a systematic and sensitive method to identify and annotate functional residues that are not restrained by sequence conservation. Importantly, this method is generally applicable to other proteins about which homologous-structure information is available. PMID:27803181
Promoter Motifs in NCLDVs: An Evolutionary Perspective
Oliveira, Graziele Pereira; Andrade, Ana Cláudia dos Santos Pereira; Rodrigues, Rodrigo Araújo Lima; Arantes, Thalita Souza; Boratto, Paulo Victor Miranda; Silva, Ludmila Karen dos Santos; Dornas, Fábio Pio; Trindade, Giliane de Souza; Drumond, Betânia Paiva; La Scola, Bernard; Kroon, Erna Geessien; Abrahão, Jônatas Santos
2017-01-01
For many years, gene expression in the three cellular domains has been studied in an attempt to discover sequences associated with the regulation of the transcription process. Some specific transcriptional features were described in viruses, although few studies have been devoted to understanding the evolutionary aspects related to the spread of promoter motifs through related viral families. The discovery of giant viruses and the proposition of the new viral order Megavirales that comprise a monophyletic group, named nucleo-cytoplasmic large DNA viruses (NCLDV), raised new questions in the field. Some putative promoter sequences have already been described for some NCLDV members, bringing new insights into the evolutionary history of these complex microorganisms. In this review, we summarize the main aspects of the transcription regulation process in the three domains of life, followed by a systematic description of what is currently known about promoter regions in several NCLDVs. We also discuss how the analysis of the promoter sequences could bring new ideas about the giant viruses’ evolution. Finally, considering a possible common ancestor for the NCLDV group, we discussed possible promoters’ evolutionary scenarios and propose the term “MEGA-box” to designate an ancestor promoter motif (‘TATATAAAATTGA’) that could be evolved gradually by nucleotides’ gain and loss and point mutations. PMID:28117683
Tal, Asaf; Arbel-Goren, Rinat; Costantino, Nina; Court, Donald L; Stavans, Joel
2014-05-20
The search for specific sequences on long genomes is a key process in many biological contexts. How can specific target sequences be located with high efficiency, within physiologically relevant times? We addressed this question for viral integration, a fundamental mechanism of horizontal gene transfer driving prokaryotic evolution, using the infection of Escherichia coli bacteria with bacteriophage λ and following the establishment of a lysogenic state. Following the targeting process in individual live E. coli cells in real time revealed that λ DNA remains confined near the entry point of a cell following infection. The encounter between the 15-bp-long target sequence on the chromosome and the recombination site on the viral genome is facilitated by the directed motion of bacterial DNA generated during chromosome replication, in conjunction with constrained diffusion of phage DNA. Moving the native bacterial integration site to different locations on the genome and measuring the integration frequency in these strains reveals that the frequencies of the native site and a site symmetric to it relative to the origin are similar, whereas both are significantly higher than when the integration site is moved near the terminus, consistent with the replication-driven mechanism we propose. This novel search mechanism is yet another example of the exquisite coevolution of λ with its host.
Long-term infection and vertical transmission of a gammaretrovirus in a foreign host species.
Sakuma, Toshie; Tonne, Jason M; Malcolm, Jessica A; Thatava, Tayaramma; Ohmine, Seiga; Peng, Kah-Whye; Ikeda, Yasuhiro
2012-01-01
Increasing evidence has indicated natural transspecies transmission of gammaretroviruses; however, viral-host interactions after initial xeno-exposure remain poorly understood. Potential association of xenotropic murine leukemia virus-related virus (XMRV) in patients with prostate cancer and chronic fatigue syndrome has attracted broad interests in this topic. Although recent studies have indicated that XMRV is unlikely a human pathogen, further understanding of XMRV xenoinfection would allow in vivo modeling of the initial steps of gammaretroviral interspecies transmission, evolution and dissemination in a new host population. In this study, we monitored the long-term consequences of XMRV infection and its possible vertical transmission in a permissive foreign host, wild-derived Mus pahari mice. One year post-infection, XMRV-infected mice showed no notable pathological changes, while proviral DNA was detected in three out of eight mice. XMRV-infected mice remained seropositive throughout the study although the levels of gp70 Env- and p30 capsid-specific antibodies gradually decreased. When vertical XMRV transmission was assessed, no viremia, humoral immune responses nor endogenization were observed in nine offspring from infected mothers, yet one offspring was found PCR-positive for XMRV-specific sequences. Amplified viral sequences from the offspring showed several mutations, including one amino acid deletion in the receptor binding domain of Env SU. Our results therefore demonstrate long-term asymptomatic infection, low incidence of vertical transmission and limited evolution of XMRV upon transspecies infection of a permissive new host, Mus pahari.
Intersubtype Differences in the Effect of a Rare p24 Gag Mutation on HIV-1 Replicative Fitness
Chopera, Denis R.; Cotton, Laura A.; Zawaira, Alexander; Mann, Jaclyn K.; Ngandu, Nobubelo K.; Ntale, Roman; Carlson, Jonathan M.; Mlisana, Koleka; Woodman, Zenda; de Assis Rosa, Debra; Martin, Eric; Miura, Toshiyuki; Pereyra, Florencia; Walker, Bruce D.; Gray, Clive M.; Martin, Darren P.; Ndung'u, Thumbi; Brockman, Mark A.; Karim, Salim Abdool
2012-01-01
Certain immune-driven mutations in HIV-1, such as those arising in p24Gag, decrease viral replicative capacity. However, the intersubtype differences in the replicative consequences of such mutations have not been explored. In HIV-1 subtype B, the p24Gag M250I mutation is a rare variant (0.6%) that is enriched among elite controllers (7.2%) (P = 0.0005) and appears to be a rare escape variant selected by HLA-B58 supertype alleles (P < 0.01). In contrast, in subtype C, it is a relatively common minor polymorphic variant (10 to 15%) whose appearance is not associated with a particular HLA allele. Using site-directed mutant viruses, we demonstrate that M250I reduces in vitro viral replicative capacity in both subtype B and subtype C sequences. However, whereas in subtype C downstream compensatory mutations at p24Gag codons 252 and 260 reduce the adverse effects of M250I, fitness costs in subtype B appear difficult to restore. Indeed, patient-derived subtype B sequences harboring M250I exhibited in vitro replicative defects, while those from subtype C did not. The structural implications of M250I were predicted by protein modeling to be greater in subtype B versus C, providing a potential explanation for its lower frequency and enhanced replicative defects in subtype B. In addition to accounting for genetic differences between HIV-1 subtypes, the design of cytotoxic-T-lymphocyte-based vaccines may need to account for differential effects of host-driven viral evolution on viral fitness. PMID:23015721
Zhang, Qian; Jun, Se -Ran; Leuze, Michael; ...
2017-01-19
The development of rapid, economical genome sequencing has shed new light on the classification of viruses. As of October 2016, the National Center for Biotechnology Information (NCBI) database contained >2 million viral genome sequences and a reference set of ~4000 viral genome sequences that cover a wide range of known viral families. Whole-genome sequences can be used to improve viral classification and provide insight into the viral tree of life . However, due to the lack of evolutionary conservation amongst diverse viruses, it is not feasible to build a viral tree of life using traditional phylogenetic methods based on conservedmore » proteins. In this study, we used an alignment-free method that uses k-mers as genomic features for a large-scale comparison of complete viral genomes available in RefSeq. To determine the optimal feature length, k (an essential step in constructing a meaningful dendrogram), we designed a comprehensive strategy that combines three approaches: (1) cumulative relative entropy, (2) average number of common features among genomes, and (3) the Shannon diversity index. This strategy was used to determine k for all 3,905 complete viral genomes in RefSeq. Lastly, the resulting dendrogram shows consistency with the viral taxonomy of the ICTV and the Baltimore classification of viruses.« less
DOE Office of Scientific and Technical Information (OSTI.GOV)
Zhang, Qian; Jun, Se -Ran; Leuze, Michael
The development of rapid, economical genome sequencing has shed new light on the classification of viruses. As of October 2016, the National Center for Biotechnology Information (NCBI) database contained >2 million viral genome sequences and a reference set of ~4000 viral genome sequences that cover a wide range of known viral families. Whole-genome sequences can be used to improve viral classification and provide insight into the viral tree of life . However, due to the lack of evolutionary conservation amongst diverse viruses, it is not feasible to build a viral tree of life using traditional phylogenetic methods based on conservedmore » proteins. In this study, we used an alignment-free method that uses k-mers as genomic features for a large-scale comparison of complete viral genomes available in RefSeq. To determine the optimal feature length, k (an essential step in constructing a meaningful dendrogram), we designed a comprehensive strategy that combines three approaches: (1) cumulative relative entropy, (2) average number of common features among genomes, and (3) the Shannon diversity index. This strategy was used to determine k for all 3,905 complete viral genomes in RefSeq. Lastly, the resulting dendrogram shows consistency with the viral taxonomy of the ICTV and the Baltimore classification of viruses.« less
Zhang, Qian; Jun, Se-Ran; Leuze, Michael; Ussery, David; Nookaew, Intawat
2017-01-01
The development of rapid, economical genome sequencing has shed new light on the classification of viruses. As of October 2016, the National Center for Biotechnology Information (NCBI) database contained >2 million viral genome sequences and a reference set of ~4000 viral genome sequences that cover a wide range of known viral families. Whole-genome sequences can be used to improve viral classification and provide insight into the viral “tree of life”. However, due to the lack of evolutionary conservation amongst diverse viruses, it is not feasible to build a viral tree of life using traditional phylogenetic methods based on conserved proteins. In this study, we used an alignment-free method that uses k-mers as genomic features for a large-scale comparison of complete viral genomes available in RefSeq. To determine the optimal feature length, k (an essential step in constructing a meaningful dendrogram), we designed a comprehensive strategy that combines three approaches: (1) cumulative relative entropy, (2) average number of common features among genomes, and (3) the Shannon diversity index. This strategy was used to determine k for all 3,905 complete viral genomes in RefSeq. The resulting dendrogram shows consistency with the viral taxonomy of the ICTV and the Baltimore classification of viruses. PMID:28102365
Residual Viremia in Treated HIV+ Individuals
DOE Office of Scientific and Technical Information (OSTI.GOV)
Conway, Jessica M.; Perelson, Alan S.
Antiretroviral therapy (ART) effectively controls HIV infection, suppressing HIV viral loads. However, some residual virus remains, below the level of detection, in HIV-infected patients on ART. Furthermore, the source of this viremia is an area of debate: does it derive primarily from activation of infected cells in the latent reservoir, or from ongoing viral replication? Our observations seem to be contradictory: there is evidence of short term evolution, implying that there must be ongoing viral replication, and viral strains should thus evolve. The phylogenetic analyses, and rare emergent drug resistance, suggest no long-term viral evolution, implying that virus derived frommore » activated latent cells must dominate. We use simple deterministic and stochastic models to gain insight into residual viremia dynamics in HIV-infected patients. Our modeling relies on two underlying assumptions for patients on suppressive ART: that latent cell activation drives viral dynamics and that the reproductive ratio of treated infection is less than 1. Nonetheless, the contribution of viral replication to residual viremia in patients on ART may be non-negligible. However, even if the portion of viremia attributable to viral replication is significant, our model predicts (1) that latent reservoir re-seeding remains negligible, and (2) some short-term viral evolution is permitted, but long-term evolution can still be limited: stochastic analysis of our model shows that de novo emergence of drug resistance is rare. Thus, our simple models reconcile the seemingly contradictory observations on residual viremia and, with relatively few parameters, recapitulates HIV viral dynamics observed in patients on suppressive therapy.« less
Residual Viremia in Treated HIV+ Individuals
Conway, Jessica M.; Perelson, Alan S.
2016-01-06
Antiretroviral therapy (ART) effectively controls HIV infection, suppressing HIV viral loads. However, some residual virus remains, below the level of detection, in HIV-infected patients on ART. Furthermore, the source of this viremia is an area of debate: does it derive primarily from activation of infected cells in the latent reservoir, or from ongoing viral replication? Our observations seem to be contradictory: there is evidence of short term evolution, implying that there must be ongoing viral replication, and viral strains should thus evolve. The phylogenetic analyses, and rare emergent drug resistance, suggest no long-term viral evolution, implying that virus derived frommore » activated latent cells must dominate. We use simple deterministic and stochastic models to gain insight into residual viremia dynamics in HIV-infected patients. Our modeling relies on two underlying assumptions for patients on suppressive ART: that latent cell activation drives viral dynamics and that the reproductive ratio of treated infection is less than 1. Nonetheless, the contribution of viral replication to residual viremia in patients on ART may be non-negligible. However, even if the portion of viremia attributable to viral replication is significant, our model predicts (1) that latent reservoir re-seeding remains negligible, and (2) some short-term viral evolution is permitted, but long-term evolution can still be limited: stochastic analysis of our model shows that de novo emergence of drug resistance is rare. Thus, our simple models reconcile the seemingly contradictory observations on residual viremia and, with relatively few parameters, recapitulates HIV viral dynamics observed in patients on suppressive therapy.« less
Caragounis, E-C; Gisslén, M; Lindh, M; Nordborg, C; Westergren, S; Hagberg, L; Svennerholm, B
2008-02-01
HIV-1 infects the central nervous system (CNS) early in the course of infection. However, it is not known to what extent the virus evolves independently within the CNS and whether the HIV-RNA in cerebrospinal fluid (CSF) reflects the viral population replicating within the brain parenchyma or the systemic infection. The aim of this study was to investigate HIV-1 evolution in the CNS and the origin of HIV-1 in CSF. Longitudinally derived paired blood and CSF samples and post-mortem samples from CSF, brain and spleen were collected over a period of up to 63 months from three HIV-1 infected men receiving antiretroviral treatment and presenting with symptoms of AIDS dementia complex (ADC). Phylogenetic analyses of HIV-1 V3, reverse transcriptase (RT) and protease sequences from patient isolates suggest compartmentalization with distinct viral strains in blood, CSF and brain. We found a different pattern of RT and accessory protease mutations in the systemic infection compared to the CNS. We conclude that HIV-1 may to some extent evolve independently in the CNS and the viral population in CSF mainly reflects the infection in the brain parenchyma in patients with ADC. This is of importance in understanding HIV pathogenesis and can have implications on treatment of HIV-1 patients.
Lenzmeier, B A; Giebler, H A; Nyborg, J K
1998-02-01
Efficient human T-cell leukemia virus type 1 (HTLV-1) replication and viral gene expression are dependent upon the virally encoded oncoprotein Tax. To activate HTLV-1 transcription, Tax interacts with the cellular DNA binding protein cyclic AMP-responsive element binding protein (CREB) and recruits the coactivator CREB binding protein (CBP), forming a nucleoprotein complex on the three viral cyclic AMP-responsive elements (CREs) in the HTLV-1 promoter. Short stretches of dG-dC-rich (GC-rich) DNA, immediately flanking each of the viral CREs, are essential for Tax recruitment of CBP in vitro and Tax transactivation in vivo. Although the importance of the viral CRE-flanking sequences is well established, several studies have failed to identify an interaction between Tax and the DNA. The mechanistic role of the viral CRE-flanking sequences has therefore remained enigmatic. In this study, we used high resolution methidiumpropyl-EDTA iron(II) footprinting to show that Tax extended the CREB footprint into the GC-rich DNA flanking sequences of the viral CRE. The Tax-CREB footprint was enhanced but not extended by the KIX domain of CBP, suggesting that the coactivator increased the stability of the nucleoprotein complex. Conversely, the footprint pattern of CREB on a cellular CRE lacking GC-rich flanking sequences did not change in the presence of Tax or Tax plus KIX. The minor-groove DNA binding drug chromomycin A3 bound to the GC-rich flanking sequences and inhibited the association of Tax and the Tax-CBP complex without affecting CREB binding. Tax specifically cross-linked to the viral CRE in the 5'-flanking sequence, and this cross-link was blocked by chromomycin A3. Together, these data support a model where Tax interacts directly with both CREB and the minor-groove viral CRE-flanking sequences to form a high-affinity binding site for the recruitment of CBP to the HTLV-1 promoter.
Brody, Thomas; Yavatkar, Amarendra S; Park, Dong Sun; Kuzin, Alexander; Ross, Jermaine; Odenwald, Ward F
2017-06-01
Flavivirus and Filovirus infections are serious epidemic threats to human populations. Multi-genome comparative analysis of these evolving pathogens affords a view of their essential, conserved sequence elements as well as progressive evolutionary changes. While phylogenetic analysis has yielded important insights, the growing number of available genomic sequences makes comparisons between hundreds of viral strains challenging. We report here a new approach for the comparative analysis of these hemorrhagic fever viruses that can superimpose an unlimited number of one-on-one alignments to identify important features within genomes of interest. We have adapted EvoPrinter alignment algorithms for the rapid comparative analysis of Flavivirus or Filovirus sequences including Zika and Ebola strains. The user can input a full genome or partial viral sequence and then view either individual comparisons or generate color-coded readouts that superimpose hundreds of one-on-one alignments to identify unique or shared identity SNPs that reveal ancestral relationships between strains. The user can also opt to select a database genome in order to access a library of pre-aligned genomes of either 1,094 Flaviviruses or 460 Filoviruses for rapid comparative analysis with all database entries or a select subset. Using EvoPrinter search and alignment programs, we show the following: 1) superimposing alignment data from many related strains identifies lineage identity SNPs, which enable the assessment of sublineage complexity within viral outbreaks; 2) whole-genome SNP profile screens uncover novel Dengue2 and Zika recombinant strains and their parental lineages; 3) differential SNP profiling identifies host cell A-to-I hyper-editing within Ebola and Marburg viruses, and 4) hundreds of superimposed one-on-one Ebola genome alignments highlight ultra-conserved regulatory sequences, invariant amino acid codons and evolutionarily variable protein-encoding domains within a single genome. EvoPrinter allows for the assessment of lineage complexity within Flavivirus or Filovirus outbreaks, identification of recombinant strains, highlights sequences that have undergone host cell A-to-I editing, and identifies unique input and database SNPs within highly conserved sequences. EvoPrinter's ability to superimpose alignment data from hundreds of strains onto a single genome has allowed us to identify unique Zika virus sublineages that are currently spreading in South, Central and North America, the Caribbean, and in China. This new set of integrated alignment programs should serve as a useful addition to existing tools for the comparative analysis of these viruses.
Genomic reassortment of influenza A virus in North American swine, 1998–2011
Detmer, Susan E.; Wentworth, David E.; Tan, Yi; Schwartzbard, Aaron; Halpin, Rebecca A.; Stockwell, Timothy B.; Lin, Xudong; Vincent, Amy L.; Gramer, Marie R.; Holmes, Edward C.
2012-01-01
Revealing the frequency and determinants of reassortment among RNA genome segments is fundamental to understanding basic aspects of the biology and evolution of the influenza virus. To estimate the extent of genomic reassortment in influenza viruses circulating in North American swine, we performed a phylogenetic analysis of 139 whole-genome viral sequences sampled during 1998–2011 and representing seven antigenically distinct viral lineages. The highest amounts of reassortment were detected between the H3 and the internal gene segments (PB2, PB1, PA, NP, M and NS), while the lowest reassortment frequencies were observed among the H1γ, H1pdm and neuraminidase segments, particularly N1. Less reassortment was observed among specific haemagglutinin–neuraminidase combinations that were more prevalent in swine, suggesting that some genome constellations may be evolutionarily more stable. PMID:22993190
Webster, Brian; Ott, Melanie; Greene, Warner C
2013-12-01
Cells that are productively infected by hepatitis C virus (HCV) are refractory to a second infection by HCV via a block in viral replication known as superinfection exclusion. The block occurs at a postentry step and likely involves translation or replication of the secondary viral RNA, but the mechanism is largely unknown. To characterize HCV superinfection exclusion, we selected for an HCV variant that could overcome the block. We produced a high-titer HC-J6/JFH1 (Jc1) viral genome with a fluorescent reporter inserted between NS5A and NS5B and used it to infect Huh7.5 cells containing a Jc1 replicon. With multiple passages of these infected cells, we isolated an HCV variant that can superinfect cells at high levels. Notably, the superinfectious virus rapidly cleared the primary replicon from superinfected cells. Viral competition experiments, using a novel strategy of sequence-barcoding viral strains, as well as superinfection of replicon cells demonstrated that mutations in E1, p7, NS5A, and the poly(U/UC) tract of the 3' untranslated region were important for superinfection. Furthermore, these mutations dramatically increased the infectivity of the virus in naive cells. Interestingly, viruses with a shorter poly(U/UC) and an NS5A domain II mutation were most effective in overcoming the postentry block. Neither of these changes affected viral RNA translation, indicating that the major barrier to postentry exclusion occurs at viral RNA replication. The evolution of the ability to superinfect after less than a month in culture and the concomitant exclusion of the primary replicon suggest that superinfection exclusion dramatically affects viral fitness and dynamics in vivo.
Moreno, Elena; Gallego, Isabel; Gregori, Josep; Lucía-Sanz, Adriana; Soria, María Eugenia; Castro, Victoria; Beach, Nathan M; Manrubia, Susanna; Quer, Josep; Esteban, Juan Ignacio; Rice, Charles M; Gómez, Jordi; Gastaminza, Pablo; Domingo, Esteban; Perales, Celia
2017-05-15
Viral quasispecies evolution upon long-term virus replication in a noncoevolving cellular environment raises relevant general issues, such as the attainment of population equilibrium, compliance with the molecular-clock hypothesis, or stability of the phenotypic profile. Here, we evaluate the adaptation, mutant spectrum dynamics, and phenotypic diversification of hepatitis C virus (HCV) in the course of 200 passages in human hepatoma cells in an experimental design that precluded coevolution of the cells with the virus. Adaptation to the cells was evidenced by increase in progeny production. The rate of accumulation of mutations in the genomic consensus sequence deviated slightly from linearity, and mutant spectrum analyses revealed a complex dynamic of mutational waves, which was sustained beyond passage 100. The virus underwent several phenotypic changes, some of which impacted the virus-host relationship, such as enhanced cell killing, a shift toward higher virion density, and increased shutoff of host cell protein synthesis. Fluctuations in progeny production and failure to reach population equilibrium at the genomic level suggest internal instabilities that anticipate an unpredictable HCV evolution in the complex liver environment. IMPORTANCE Long-term virus evolution in an unperturbed cellular environment can reveal features of virus evolution that cannot be explained by comparing natural viral isolates. In the present study, we investigate genetic and phenotypic changes that occur upon prolonged passage of hepatitis C virus (HCV) in human hepatoma cells in an experimental design in which host cell evolutionary change is prevented. Despite replication in a noncoevolving cellular environment, the virus exhibited internal population disequilibria that did not decline with increased adaptation to the host cells. The diversification of phenotypic traits suggests that disequilibria inherent to viral populations may provide a selective advantage to viruses that can be fully exploited in changing environments. Copyright © 2017 American Society for Microbiology.
Hybridization capture reveals evolution and conservation across the entire Koala retrovirus genome.
Tsangaras, Kyriakos; Siracusa, Matthew C; Nikolaidis, Nikolas; Ishida, Yasuko; Cui, Pin; Vielgrader, Hanna; Helgen, Kristofer M; Roca, Alfred L; Greenwood, Alex D
2014-01-01
The koala retrovirus (KoRV) is the only retrovirus known to be in the midst of invading the germ line of its host species. Hybridization capture and next generation sequencing were used on modern and museum DNA samples of koala (Phascolarctos cinereus) to examine ca. 130 years of evolution across the full KoRV genome. Overall, the entire proviral genome appeared to be conserved across time in sequence, protein structure and transcriptional binding sites. A total of 138 polymorphisms were detected, of which 72 were found in more than one individual. At every polymorphic site in the museum koalas, one of the character states matched that of modern KoRV. Among non-synonymous polymorphisms, radical substitutions involving large physiochemical differences between amino acids were elevated in env, potentially reflecting anti-viral immune pressure or avoidance of receptor interference. Polymorphisms were not detected within two functional regions believed to affect infectivity. Host sequences flanking proviral integration sites were also captured; with few proviral loci shared among koalas. Recently described variants of KoRV, designated KoRV-B and KoRV-J, were not detected in museum samples, suggesting that these variants may be of recent origin.
Hybridization Capture Reveals Evolution and Conservation across the Entire Koala Retrovirus Genome
Ishida, Yasuko; Cui, Pin; Vielgrader, Hanna; Helgen, Kristofer M.; Roca, Alfred L.; Greenwood, Alex D.
2014-01-01
The koala retrovirus (KoRV) is the only retrovirus known to be in the midst of invading the germ line of its host species. Hybridization capture and next generation sequencing were used on modern and museum DNA samples of koala (Phascolarctos cinereus) to examine ca. 130 years of evolution across the full KoRV genome. Overall, the entire proviral genome appeared to be conserved across time in sequence, protein structure and transcriptional binding sites. A total of 138 polymorphisms were detected, of which 72 were found in more than one individual. At every polymorphic site in the museum koalas, one of the character states matched that of modern KoRV. Among non-synonymous polymorphisms, radical substitutions involving large physiochemical differences between amino acids were elevated in env, potentially reflecting anti-viral immune pressure or avoidance of receptor interference. Polymorphisms were not detected within two functional regions believed to affect infectivity. Host sequences flanking proviral integration sites were also captured; with few proviral loci shared among koalas. Recently described variants of KoRV, designated KoRV-B and KoRV-J, were not detected in museum samples, suggesting that these variants may be of recent origin. PMID:24752422
Lewis, Jo E.; Brameld, John M.; Hill, Phil; Barrett, Perry; Ebling, Francis J.P.; Jethwa, Preeti H.
2015-01-01
Introduction The viral 2A sequence has become an attractive alternative to the traditional internal ribosomal entry site (IRES) for simultaneous over-expression of two genes and in combination with recombinant adeno-associated viruses (rAAV) has been used to manipulate gene expression in vitro. New method To develop a rAAV construct in combination with the viral 2A sequence to allow long-term over-expression of the vgf gene and fluorescent marker gene for tracking of the transfected neurones in vivo. Results Transient transfection of the AAV plasmid containing the vgf gene, viral 2A sequence and eGFP into SH-SY5Y cells resulted in eGFP fluorescence comparable to a commercially available reporter construct. This increase in fluorescent cells was accompanied by an increase in VGF mRNA expression. Infusion of the rAAV vector containing the vgf gene, viral 2A sequence and eGFP resulted in eGFP fluorescence in the hypothalamus of both mice and Siberian hamsters, 32 weeks post infusion. In situ hybridisation confirmed that the location of VGF mRNA expression in the hypothalamus corresponded to the eGFP pattern of fluorescence. Comparison with old method The viral 2A sequence is much smaller than the traditional IRES and therefore allowed over-expression of the vgf gene with fluorescent tracking without compromising viral capacity. Conclusion The use of the viral 2A sequence in the AAV plasmid allowed the simultaneous expression of both genes in vitro. When used in combination with rAAV it resulted in long-term over-expression of both genes at equivalent locations in the hypothalamus of both Siberian hamsters and mice, without any adverse effects. PMID:26300182
Dong, Guoying; Luo, Jing; Zhang, Hong; Wang, Chengmin; Duan, Mingxing; Deliberto, Thomas Jude; Nolte, Dale Louis; Ji, Guangju; He, Hongxuan
2011-01-01
H9N2 influenza A viruses have become established worldwide in terrestrial poultry and wild birds, and are occasionally transmitted to mammals including humans and pigs. To comprehensively elucidate the genetic and evolutionary characteristics of H9N2 influenza viruses, we performed a large-scale sequence analysis of 571 viral genomes from the NCBI Influenza Virus Resource Database, representing the spectrum of H9N2 influenza viruses isolated from 1966 to 2009. Our study provides a panoramic framework for better understanding the genesis and evolution of H9N2 influenza viruses, and for describing the history of H9N2 viruses circulating in diverse hosts. Panorama phylogenetic analysis of the eight viral gene segments revealed the complexity and diversity of H9N2 influenza viruses. The 571 H9N2 viral genomes were classified into 74 separate lineages, which had marked host and geographical differences in phylogeny. Panorama genotypical analysis also revealed that H9N2 viruses include at least 98 genotypes, which were further divided according to their HA lineages into seven series (A–G). Phylogenetic analysis of the internal genes showed that H9N2 viruses are closely related to H3, H4, H5, H7, H10, and H14 subtype influenza viruses. Our results indicate that H9N2 viruses have undergone extensive reassortments to generate multiple reassortants and genotypes, suggesting that the continued circulation of multiple genotypical H9N2 viruses throughout the world in diverse hosts has the potential to cause future influenza outbreaks in poultry and epidemics in humans. We propose a nomenclature system for identifying and unifying all lineages and genotypes of H9N2 influenza viruses in order to facilitate international communication on the evolution, ecology and epidemiology of H9N2 influenza viruses. PMID:21386964
Manuel, Menchie; Shan, Chao; Manokaran, Gayathri; Bradrick, Shelton S.; Missé, Dorothée; Shi, Pei-Yong
2017-01-01
Globally re-emerging dengue viruses are transmitted from human-to-human by Aedes mosquitoes. While viral determinants of human pathogenicity have been defined, there is a lack of knowledge of how dengue viruses influence mosquito transmission. Identification of viral determinants of transmission can help identify isolates with high epidemiological potential. Additionally, mechanistic understanding of transmission will lead to better understanding of how dengue viruses harness evolution to cycle between the two hosts. Here, we identified viral determinants of transmission and characterized mechanisms that enhance production of infectious saliva by inhibiting immunity specifically in salivary glands. Combining oral infection of Aedes aegypti mosquitoes and reverse genetics, we identified two 3’ UTR substitutions in epidemic isolates that increased subgenomic flaviviral RNA (sfRNA) quantity, infectious particles in salivary glands and infection rate of saliva, which represents a measure of transmission. We also demonstrated that various 3’UTR modifications similarly affect sfRNA quantity in both whole mosquitoes and human cells, suggesting a shared determinism of sfRNA quantity. Furthermore, higher relative quantity of sfRNA in salivary glands compared to midgut and carcass pointed to sfRNA function in salivary glands. We showed that the Toll innate immune response was preferentially inhibited in salivary glands by viruses with the 3’UTR substitutions associated to high epidemiological fitness and high sfRNA quantity, pointing to a mechanism for higher saliva infection rate. By determining that sfRNA is an immune suppressor in a tissue relevant to mosquito transmission, we propose that 3’UTR/sfRNA sequence evolution shapes dengue epidemiology not only by influencing human pathogenicity but also by increasing mosquito transmission, thereby revealing a viral determinant of epidemiological fitness that is shared between the two hosts. PMID:28753642
Jachiet, Pierre-Alain; Colson, Philippe; Lopez, Philippe; Bapteste, Eric
2014-08-07
Complex nongradual evolutionary processes such as gene remodeling are difficult to model, to visualize, and to investigate systematically. Despite these challenges, the creation of composite (or mosaic) genes by combination of genetic segments from unrelated gene families was established as an important adaptive phenomena in eukaryotic genomes. In contrast, almost no general studies have been conducted to quantify composite genes in viruses. Although viral genome mosaicism has been well-described, the extent of gene mosaicism and its rules of emergence remain largely unexplored. Applying methods from graph theory to inclusive similarity networks, and using data from more than 3,000 complete viral genomes, we provide the first demonstration that composite genes in viruses are 1) functionally biased, 2) involved in key aspects of the arm race between cells and viruses, and 3) can be classified into two distinct types of composite genes in all viral classes. Beyond the quantification of the widespread recombination of genes among different viruses of the same class, we also report a striking sharing of genetic information between viruses of different classes and with different nucleic acid types. This latter discovery provides novel evidence for the existence of a large and complex mobilome network, which appears partly bound by the sharing of genetic information and by the formation of composite genes between mobile entities with different genetic material. Considering that there are around 10E31 viruses on the planet, gene remodeling appears as a hugely significant way of generating and moving novel sequences between different kinds of organisms on Earth. © The Author(s) 2014. Published by Oxford University Press on behalf of the Society for Molecular Biology and Evolution.
Oshima, Koichi; Khiabanian, Hossein; da Silva-Almeida, Ana C.; Tzoneva, Gannie; Abate, Francesco; Ambesi-Impiombato, Alberto; Sanchez-Martin, Marta; Carpenter, Zachary; Penson, Alex; Perez-Garcia, Arianne; Eckert, Cornelia; Nicolas, Concepción; Balbin, Milagros; Sulis, Maria Luisa; Kato, Motohiro; Koh, Katsuyoshi; Paganin, Maddalena; Basso, Giuseppe; Gastier-Foster, Julie M.; Devidas, Meenakshi; Loh, Mignon L.; Kirschner-Schwabe, Renate; Palomero, Teresa; Rabadan, Raul; Ferrando, Adolfo A.
2016-01-01
Although multiagent combination chemotherapy is curative in a significant fraction of childhood acute lymphoblastic leukemia (ALL) patients, 20% of cases relapse and most die because of chemorefractory disease. Here we used whole-exome and whole-genome sequencing to analyze the mutational landscape at relapse in pediatric ALL cases. These analyses identified numerous relapse-associated mutated genes intertwined in chemotherapy resistance-related protein complexes. In this context, RAS-MAPK pathway-activating mutations in the neuroblastoma RAS viral oncogene homolog (NRAS), kirsten rat sarcoma viral oncogene homolog (KRAS), and protein tyrosine phosphatase, nonreceptor type 11 (PTPN11) genes were present in 24 of 55 (44%) cases in our series. Interestingly, some leukemias showed retention or emergence of RAS mutant clones at relapse, whereas in others RAS mutant clones present at diagnosis were replaced by RAS wild-type populations, supporting a role for both positive and negative selection evolutionary pressures in clonal evolution of RAS-mutant leukemia. Consistently, functional dissection of mouse and human wild-type and mutant RAS isogenic leukemia cells demonstrated induction of methotrexate resistance but also improved the response to vincristine in mutant RAS-expressing lymphoblasts. These results highlight the central role of chemotherapy-driven selection as a central mechanism of leukemia clonal evolution in relapsed ALL, and demonstrate a previously unrecognized dual role of RAS mutations as drivers of both sensitivity and resistance to chemotherapy. PMID:27655895
Rodriguez-Roche, Rosmari; Villegas, Elci; Cook, Shelley; Poh Kim, Pauline A.W.; Hinojosa, Yoandri; Rosario, Delfina; Villalobos, Iris; Bendezu, Herminia; Hibberd, Martin L.; Guzman, Maria G.
2012-01-01
During the past three decades there has been a notable increase in dengue disease severity in Venezuela. Nevertheless, the population structure of the viruses being transmitted in this country is not well understood. Here, we present a molecular epidemiological study on dengue viruses (DENV) circulating in Aragua State, Venezuela during 2006–2007. Twenty-one DENV full-length genomes representing all of the four serotypes were amplified and sequenced directly from the serum samples. Notably, only DENV-2 was associated with severe disease. Phylogenetic trees constructed using Bayesian methods indicated that only one genotype was circulating for each serotype. However, extensive viral genetic diversity was found in DENV isolated from the same area during the same period, indicating significant in situ evolution since the introduction of these genotypes. Collectively, the results suggest that the non-structural (NS) proteins may play an important role in DENV evolution, particularly NS1, NS2A and NS4B proteins. The phylogenetic data provide evidence to suggest that multiple introductions of DENV have occurred from the Latin American region into Venezuela and vice versa. The implications of the significant viral genetic diversity generated during hyperendemic transmission, particularly in NS protein are discussed and considered in the context of future development and use of human monoclonal antibodies as antivirals and tetravalent vaccines. PMID:22197765
Low dose rectal inoculation of rhesus macaques by SIV smE660 or SIVmac251 recapitulates
DOE Office of Scientific and Technical Information (OSTI.GOV)
Hraber, Peter; Giorgi, Elena E; Keele, Brandon
2008-01-01
We recently developed a novel strategy to identify transmitted HIV-1 genomes in acutely infected humans using single-genome amplification and a model of random virus evolution. Here, we used this approach to determine the molecular features of simian immunodeficiency virus (SIV) transmission in 18 experimentally infected Indian rhesus macaques. Animals were inoculated intrarectally (i.r.) or intravenously (i.v.) with stocks of SIVmac251 or SIVsmE660 that exhibited sequence diversity typical of early-chronic HIV-1 infection. 987 full-length SIV env sequences (median of 48 per animal) were determined from plasma virion RNA 1--5 wk after infection. i.r. inoculation was followed by productive infection by onemore » or a few viruses (median 1; range 1--5) that diversified randomly with near starlike phylogeny and a Poisson distribution of mutations. Consensus viral sequences from ramp-up and peak viremia were identical to viruses found in the inocula or differed from them by only one or a few nucleotides, providing direct evidence that early plasma viral sequences coalesce to transmitted/founder viruses. i.v. infection was >2,000-fold more efficient than i.r. infection, and viruses transmitted by either route represented the full genetic spectra of the inocula. These findings identify key similarities in mucosal transmission and early diversification between SIV and HIV-1, and thus validate the SIV-macaque mucosal infection model for HIV-1 vaccine and microbicide research.« less
Li, Linlin; Joseph, G. Victoria; Wang, Chunlin; Jones, Morris; Fellers, Gary M.; Kunz, Thomas H.; Delwart, Eric
2010-01-01
Bats are hosts to a variety of viruses capable of zoonotic transmissions. Because of increased contact between bats, humans, and other animal species, the possibility exists for further cross-species transmissions and ensuing disease outbreaks. We describe here full and partial viral genomes identified using metagenomics in the guano of bats from California and Texas. A total of 34% and 58% of 390,000 sequence reads from bat guano in California and Texas, respectively, were related to eukaryotic viruses, and the largest proportion of those infect insects, reflecting the diet of these insectivorous bats, including members of the viral families Dicistroviridae, Iflaviridae, Tetraviridae, and Nodaviridae and the subfamily Densovirinae. The second largest proportion of virus-related sequences infects plants and fungi, likely reflecting the diet of ingested insects, including members of the viral families Luteoviridae, Secoviridae, Tymoviridae, and Partitiviridae and the genus Sobemovirus. Bat guano viruses related to those infecting mammals comprised the third largest group, including members of the viral families Parvoviridae, Circoviridae, Picornaviridae, Adenoviridae, Poxviridae, Astroviridae, and Coronaviridae. No close relative of known human viral pathogens was identified in these bat populations. Phylogenetic analysis was used to clarify the relationship to known viral taxa of novel sequences detected in bat guano samples, showing that some guano viral sequences fall outside existing taxonomic groups. This initial characterization of the bat guano virome, the first metagenomic analysis of viruses in wild mammals using second-generation sequencing, therefore showed the presence of previously unidentified viral species, genera, and possibly families. Viral metagenomics is a useful tool for genetically characterizing viruses present in animals with the known capability of direct or indirect viral zoonosis to humans.
Li, Linlin; Victoria, Joseph G.; Wang, Chunlin; Jones, Morris; Fellers, Gary M.; Kunz, Thomas H.; Delwart, Eric
2010-01-01
Bats are hosts to a variety of viruses capable of zoonotic transmissions. Because of increased contact between bats, humans, and other animal species, the possibility exists for further cross-species transmissions and ensuing disease outbreaks. We describe here full and partial viral genomes identified using metagenomics in the guano of bats from California and Texas. A total of 34% and 58% of 390,000 sequence reads from bat guano in California and Texas, respectively, were related to eukaryotic viruses, and the largest proportion of those infect insects, reflecting the diet of these insectivorous bats, including members of the viral families Dicistroviridae, Iflaviridae, Tetraviridae, and Nodaviridae and the subfamily Densovirinae. The second largest proportion of virus-related sequences infects plants and fungi, likely reflecting the diet of ingested insects, including members of the viral families Luteoviridae, Secoviridae, Tymoviridae, and Partitiviridae and the genus Sobemovirus. Bat guano viruses related to those infecting mammals comprised the third largest group, including members of the viral families Parvoviridae, Circoviridae, Picornaviridae, Adenoviridae, Poxviridae, Astroviridae, and Coronaviridae. No close relative of known human viral pathogens was identified in these bat populations. Phylogenetic analysis was used to clarify the relationship to known viral taxa of novel sequences detected in bat guano samples, showing that some guano viral sequences fall outside existing taxonomic groups. This initial characterization of the bat guano virome, the first metagenomic analysis of viruses in wild mammals using second-generation sequencing, therefore showed the presence of previously unidentified viral species, genera, and possibly families. Viral metagenomics is a useful tool for genetically characterizing viruses present in animals with the known capability of direct or indirect viral zoonosis to humans. PMID:20463061
Li, Linlin; Victoria, Joseph G; Wang, Chunlin; Jones, Morris; Fellers, Gary M; Kunz, Thomas H; Delwart, Eric
2010-07-01
Bats are hosts to a variety of viruses capable of zoonotic transmissions. Because of increased contact between bats, humans, and other animal species, the possibility exists for further cross-species transmissions and ensuing disease outbreaks. We describe here full and partial viral genomes identified using metagenomics in the guano of bats from California and Texas. A total of 34% and 58% of 390,000 sequence reads from bat guano in California and Texas, respectively, were related to eukaryotic viruses, and the largest proportion of those infect insects, reflecting the diet of these insectivorous bats, including members of the viral families Dicistroviridae, Iflaviridae, Tetraviridae, and Nodaviridae and the subfamily Densovirinae. The second largest proportion of virus-related sequences infects plants and fungi, likely reflecting the diet of ingested insects, including members of the viral families Luteoviridae, Secoviridae, Tymoviridae, and Partitiviridae and the genus Sobemovirus. Bat guano viruses related to those infecting mammals comprised the third largest group, including members of the viral families Parvoviridae, Circoviridae, Picornaviridae, Adenoviridae, Poxviridae, Astroviridae, and Coronaviridae. No close relative of known human viral pathogens was identified in these bat populations. Phylogenetic analysis was used to clarify the relationship to known viral taxa of novel sequences detected in bat guano samples, showing that some guano viral sequences fall outside existing taxonomic groups. This initial characterization of the bat guano virome, the first metagenomic analysis of viruses in wild mammals using second-generation sequencing, therefore showed the presence of previously unidentified viral species, genera, and possibly families. Viral metagenomics is a useful tool for genetically characterizing viruses present in animals with the known capability of direct or indirect viral zoonosis to humans.
Hosseinidoust, Zeinab
2017-01-01
Bacteriophages (bacterial viruses) have long been under investigation as vectors for gene therapy. Similar to other viral vectors, the phage coat proteins have evolved over millions of years to protect the viral genome from degradation post injection, offering protection for the valuable therapeutic sequence. However, what sets phage apart from other viral gene delivery vectors is their safety for human use and the relative ease by which foreign molecules can be expressed on the phage outer surface, enabling highly targeted gene delivery. The latter property also makes phage a popular choice for gene therapy target discovery through directed evolution. Although promising, phage-mediated gene therapy faces several outstanding challenges, the most notable being lower gene delivery efficiency compared to animal viruses, vector stability, and nondesirable immune stimulation. This review presents a critical review of promises and challenges of employing phage as gene delivery vehicles as well as an introduction to the concept of phage-based microbiome therapy as the new frontier and perhaps the most promising application of phage-based gene therapy. Copyright© Bentham Science Publishers; For any queries, please email at epub@benthamscience.org.
Fukazawa, Yoshinori; Lum, Richard; Okoye, Afam A; Park, Haesun; Matsuda, Kenta; Bae, Jin Young; Hagen, Shoko I; Shoemaker, Rebecca; Deleage, Claire; Lucero, Carissa; Morcock, David; Swanson, Tonya; Legasse, Alfred W; Axthelm, Michael K; Hesselgesser, Joseph; Geleziunas, Romas; Hirsch, Vanessa M; Edlefsen, Paul T; Piatak, Michael; Estes, Jacob D; Lifson, Jeffrey D; Picker, Louis J
2015-02-01
Chronic-phase HIV and simian immunodeficiency virus (SIV) replication is reduced by as much as 10,000-fold in elite controllers (ECs) compared with typical progressors (TPs), but sufficient viral replication persists in EC tissues to allow viral sequence evolution and induce excess immune activation. Here we show that productive SIV infection in rhesus monkey ECs, but not TPs, is markedly restricted to CD4(+) follicular helper T (TFH) cells, suggesting that these EC monkeys' highly effective SIV-specific CD8(+) T cells can effectively clear productive SIV infection from extrafollicular sites, but their relative exclusion from B cell follicles prevents their elimination of productively infected TFH cells. CD8(+) lymphocyte depletion in EC monkeys resulted in a dramatic re-distribution of productive SIV infection to non-TFH cells, with restriction of productive infection to TFH cells resuming upon CD8(+) T cell recovery. Thus, B cell follicles constitute 'sanctuaries' for persistent SIV replication in the presence of potent anti-viral CD8(+) T cell responses, potentially complicating efforts to cure HIV infection with therapeutic vaccination or T cell immunotherapy.
Single cell genomics of uncultured marine alveolates shows paraphyly of basal dinoflagellates.
Strassert, Jürgen F H; Karnkowska, Anna; Hehenberger, Elisabeth; Del Campo, Javier; Kolisko, Martin; Okamoto, Noriko; Burki, Fabien; Janouškovec, Jan; Poirier, Camille; Leonard, Guy; Hallam, Steven J; Richards, Thomas A; Worden, Alexandra Z; Santoro, Alyson E; Keeling, Patrick J
2018-01-01
Marine alveolates (MALVs) are diverse and widespread early-branching dinoflagellates, but most knowledge of the group comes from a few cultured species that are generally not abundant in natural samples, or from diversity analyses of PCR-based environmental SSU rRNA gene sequences. To more broadly examine MALV genomes, we generated single cell genome sequences from seven individually isolated cells. Genes expected of heterotrophic eukaryotes were found, with interesting exceptions like presence of proteorhodopsin and vacuolar H + -pyrophosphatase. Phylogenetic analysis of concatenated SSU and LSU rRNA gene sequences provided strong support for the paraphyly of MALV lineages. Dinoflagellate viral nucleoproteins were found only in MALV groups that branched as sister to dinokaryotes. Our findings indicate that multiple independent origins of several characteristics early in dinoflagellate evolution, such as a parasitic life style, underlie the environmental diversity of MALVs, and suggest they have more varied trophic modes than previously thought.
Comparative Phylodynamics of Rabbit Hemorrhagic Disease Virus in Australia and New Zealand
Eden, John-Sebastian; Kovaliski, John; Duckworth, Janine A.; Swain, Grace; Mahar, Jackie E.; Strive, Tanja
2015-01-01
ABSTRACT The introduction of rabbit hemorrhagic disease virus (RHDV) into Australia and New Zealand during the 1990s as a means of controlling feral rabbits is an important case study in viral emergence. Both epidemics are exceptional in that the founder viruses share an origin and the timing of their release is known, providing a unique opportunity to compare the evolution of a single virus in distinct naive populations. We examined the evolution and spread of RHDV in Australia and New Zealand through a genome-wide evolutionary analysis, including data from 28 newly sequenced RHDV field isolates. Following the release of the Australian inoculum strain into New Zealand, no subsequent mixing of the populations occurred, with viruses from both countries forming distinct groups. Strikingly, the rate of evolution in the capsid gene was higher in the Australian viruses than in those from New Zealand, most likely due to the presence of transient deleterious mutations in the former. However, estimates of both substitution rates and population dynamics were strongly sample dependent, such that small changes in sample composition had an important impact on evolutionary parameters. Phylogeographic analysis revealed a clear spatial structure in the Australian RHDV strains, with a major division between those viruses from western and eastern states. Importantly, RHDV sequences from the state where the virus was first released, South Australia, had the greatest diversity and were diffuse throughout both geographic lineages, such that this region was likely a source population for the subsequent spread of the virus across the country. IMPORTANCE Most studies of viral emergence lack detailed knowledge about which strains were founders for the outbreak or when these events occurred. Hence, the human-mediated introduction of rabbit hemorrhagic disease virus (RHDV) into Australia and New Zealand from known starting stocks provides a unique opportunity to understand viral evolution and emergence. Within Australia, we revealed a major phylogenetic division between viruses sampled from the east and west of the country, with both regions likely seeded by viruses from South Australia. Despite their common origins, marked differences in evolutionary rates were observed between the Australian and New Zealand RHDV, which led to conflicting conclusions about population growth rates. An analysis of mutational patterns suggested that evolutionary rates have been elevated in the Australian viruses, at least in part due to the presence of low-fitness (deleterious) variants that have yet to be selectively purged. PMID:26157125
Hughes, Joseph; Biek, Roman; Litster, Annette; Willett, Brian J.; Hosie, Margaret J.
2015-01-01
Analysing the evolution of feline immunodeficiency virus (FIV) at the intra-host level is important in order to address whether the diversity and composition of viral quasispecies affect disease progression. We examined the intra-host diversity and the evolutionary rates of the entire env and structural fragments of the env sequences obtained from sequential blood samples in 43 naturally infected domestic cats that displayed different clinical outcomes. We observed in the majority of cats that FIV env showed very low levels of intra-host diversity. We estimated that env evolved at a rate of 1.16×10−3 substitutions per site per year and demonstrated that recombinant sequences evolved faster than non-recombinant sequences. It was evident that the V3–V5 fragment of FIV env displayed higher evolutionary rates in healthy cats than in those with terminal illness. Our study provided the first evidence that the leader sequence of env, rather than the V3–V5 sequence, had the highest intra-host diversity and the highest evolutionary rate of all env fragments, consistent with this region being under a strong selective pressure for genetic variation. Overall, FIV env displayed relatively low intra-host diversity and evolved slowly in naturally infected cats. The maximum evolutionary rate was observed in the leader sequence of env. Although genetic stability is not necessarily a prerequisite for clinical stability, the higher genetic stability of FIV compared with human immunodeficiency virus might explain why many naturally infected cats do not progress rapidly to AIDS. PMID:25535323
Chutiwitoonchai, Nopporn; Kakisaka, Michinori; Yamada, Kazunori; Aida, Yoko
2014-01-01
The assembly of influenza virus progeny virions requires machinery that exports viral genomic ribonucleoproteins from the cell nucleus. Currently, seven nuclear export signal (NES) consensus sequences have been identified in different viral proteins, including NS1, NS2, M1, and NP. The present study examined the roles of viral NES consensus sequences and their significance in terms of viral replication and nuclear export. Mutation of the NP-NES3 consensus sequence resulted in a failure to rescue viruses using a reverse genetics approach, whereas mutation of the NS2-NES1 and NS2-NES2 sequences led to a strong reduction in viral replication kinetics compared with the wild-type sequence. While the viral replication kinetics for other NES mutant viruses were also lower than those of the wild-type, the difference was not so marked. Immunofluorescence analysis after transient expression of NP-NES3, NS2-NES1, or NS2-NES2 proteins in host cells showed that they accumulated in the cell nucleus. These results suggest that the NP-NES3 consensus sequence is mostly required for viral replication. Therefore, each of the hydrophobic (Φ) residues within this NES consensus sequence (Φ1, Φ2, Φ3, or Φ4) was mutated, and its viral replication and nuclear export function were analyzed. No viruses harboring NP-NES3 Φ2 or Φ3 mutants could be rescued. Consistent with this, the NP-NES3 Φ2 and Φ3 mutants showed reduced binding affinity with CRM1 in a pull-down assay, and both accumulated in the cell nucleus. Indeed, a nuclear export assay revealed that these mutant proteins showed lower nuclear export activity than the wild-type protein. Moreover, the Φ2 and Φ3 residues (along with other Φ residues) within the NP-NES3 consensus were highly conserved among different influenza A viruses, including human, avian, and swine. Taken together, these results suggest that the Φ2 and Φ3 residues within the NP-NES3 protein are important for its nuclear export function during viral replication.
Association of coral algal symbionts with a diverse viral community responsive to heat shock.
Brüwer, Jan D; Agrawal, Shobhit; Liew, Yi Jin; Aranda, Manuel; Voolstra, Christian R
2017-08-17
Stony corals provide the structural foundation of coral reef ecosystems and are termed holobionts given they engage in symbioses, in particular with photosynthetic dinoflagellates of the genus Symbiodinium. Besides Symbiodinium, corals also engage with bacteria affecting metabolism, immunity, and resilience of the coral holobiont, but the role of associated viruses is largely unknown. In this regard, the increase of studies using RNA sequencing (RNA-Seq) to assess gene expression provides an opportunity to elucidate viral signatures encompassed within the data via careful delineation of sequence reads and their source of origin. Here, we re-analyzed an RNA-Seq dataset from a cultured coral symbiont (Symbiodinium microadriaticum, Clade A1) across four experimental treatments (control, cold shock, heat shock, dark shock) to characterize associated viral diversity, abundance, and gene expression. Our approach comprised the filtering and removal of host sequence reads, subsequent phylogenetic assignment of sequence reads of putative viral origin, and the assembly and analysis of differentially expressed viral genes. About 15.46% (123 million) of all sequence reads were non-host-related, of which <1% could be classified as archaea, bacteria, or virus. Of these, 18.78% were annotated as virus and comprised a diverse community consistent across experimental treatments. Further, non-host related sequence reads assembled into 56,064 contigs, including 4856 contigs of putative viral origin that featured 43 differentially expressed genes during heat shock. The differentially expressed genes included viral kinases, ubiquitin, and ankyrin repeat proteins (amongst others), which are suggested to help the virus proliferate and inhibit the algal host's antiviral response. Our results suggest that a diverse viral community is associated with coral algal endosymbionts of the genus Symbiodinium, which prompts further research on their ecological role in coral health and resilience.
The not so universal tree of life or the place of viruses in the living world
Brüssow, Harald
2009-01-01
Darwin provided a great unifying theory for biology; its visual expression is the universal tree of life. The tree concept is challenged by the occurrence of horizontal gene transfer and—as summarized in this review—by the omission of viruses. Microbial ecologists have demonstrated that viruses are the most numerous biological entities on earth, outnumbering cells by a factor of 10. Viral genomics have revealed an unexpected size and distinctness of the viral DNA sequence space. Comparative genomics has shown elements of vertical evolution in some groups of viruses. Furthermore, structural biology has demonstrated links between viruses infecting the three domains of life pointing to a very ancient origin of viruses. However, presently viruses do not find a place on the universal tree of life, which is thus only a tree of cellular life. In view of the polythetic nature of current life definitions, viruses cannot be dismissed as non-living material. On earth we have therefore at least two large DNA sequence spaces, one represented by capsid-encoding viruses and another by ribosome-encoding cells. Despite their probable distinct evolutionary origin, both spheres were and are connected by intensive two-way gene transfers. PMID:19571246
Geisler, Christoph
2018-02-07
Adventitious viral contamination in cell substrates used for biologicals production is a major safety concern. A powerful new approach that can be used to identify adventitious viruses is a combination of bioinformatics tools with massively parallel sequencing technology. Typically, this involves mapping or BLASTN searching individual reads against viral nucleotide databases. Although extremely sensitive for known viruses, this approach can easily miss viruses that are too dissimilar to viruses in the database. Moreover, it is computationally intensive and requires reference cell genome databases. To avoid these drawbacks, we set out to develop an alternative approach. We reasoned that searching genome and transcriptome assemblies for adventitious viral contaminants using TBLASTN with a compact viral protein database covering extant viral diversity as the query could be fast and sensitive without a requirement for high performance computing hardware. We tested our approach on Spodoptera frugiperda Sf-RVN, a recently isolated insect cell line, to determine if it was contaminated with one or more adventitious viruses. We used Illumina reads to assemble the Sf-RVN genome and transcriptome and searched them for adventitious viral contaminants using TBLASTN with our viral protein database. We found no evidence of viral contamination, which was substantiated by the fact that our searches otherwise identified diverse sequences encoding virus-like proteins. These sequences included Maverick, R1 LINE, and errantivirus transposons, all of which are common in insect genomes. We also identified previously described as well as novel endogenous viral elements similar to ORFs encoded by diverse insect viruses. Our results demonstrate TBLASTN searching massively parallel sequencing (MPS) assemblies with a compact, manually curated viral protein database is more sensitive for adventitious virus detection than BLASTN, as we identified various sequences that encoded virus-like proteins, but had no similarity to viral sequences at the nucleotide level. Moreover, searches were fast without requiring high performance computing hardware. Our study also documents the enhanced biosafety profile of Sf-RVN as compared to other Sf cell lines, and supports the notion that Sf-RVN is highly suitable for the production of safe biologicals.
VirSorter: mining viral signal from microbial genomic data.
Roux, Simon; Enault, Francois; Hurwitz, Bonnie L; Sullivan, Matthew B
2015-01-01
Viruses of microbes impact all ecosystems where microbes drive key energy and substrate transformations including the oceans, humans and industrial fermenters. However, despite this recognized importance, our understanding of viral diversity and impacts remains limited by too few model systems and reference genomes. One way to fill these gaps in our knowledge of viral diversity is through the detection of viral signal in microbial genomic data. While multiple approaches have been developed and applied for the detection of prophages (viral genomes integrated in a microbial genome), new types of microbial genomic data are emerging that are more fragmented and larger scale, such as Single-cell Amplified Genomes (SAGs) of uncultivated organisms or genomic fragments assembled from metagenomic sequencing. Here, we present VirSorter, a tool designed to detect viral signal in these different types of microbial sequence data in both a reference-dependent and reference-independent manner, leveraging probabilistic models and extensive virome data to maximize detection of novel viruses. Performance testing shows that VirSorter's prophage prediction capability compares to that of available prophage predictors for complete genomes, but is superior in predicting viral sequences outside of a host genome (i.e., from extrachromosomal prophages, lytic infections, or partially assembled prophages). Furthermore, VirSorter outperforms existing tools for fragmented genomic and metagenomic datasets, and can identify viral signal in assembled sequence (contigs) as short as 3kb, while providing near-perfect identification (>95% Recall and 100% Precision) on contigs of at least 10kb. Because VirSorter scales to large datasets, it can also be used in "reverse" to more confidently identify viral sequence in viral metagenomes by sorting away cellular DNA whether derived from gene transfer agents, generalized transduction or contamination. Finally, VirSorter is made available through the iPlant Cyberinfrastructure that provides a web-based user interface interconnected with the required computing resources. VirSorter thus complements existing prophage prediction softwares to better leverage fragmented, SAG and metagenomic datasets in a way that will scale to modern sequencing. Given these features, VirSorter should enable the discovery of new viruses in microbial datasets, and further our understanding of uncultivated viral communities across diverse ecosystems.
VirSorter: mining viral signal from microbial genomic data
Roux, Simon; Enault, Francois; Hurwitz, Bonnie L.
2015-01-01
Viruses of microbes impact all ecosystems where microbes drive key energy and substrate transformations including the oceans, humans and industrial fermenters. However, despite this recognized importance, our understanding of viral diversity and impacts remains limited by too few model systems and reference genomes. One way to fill these gaps in our knowledge of viral diversity is through the detection of viral signal in microbial genomic data. While multiple approaches have been developed and applied for the detection of prophages (viral genomes integrated in a microbial genome), new types of microbial genomic data are emerging that are more fragmented and larger scale, such as Single-cell Amplified Genomes (SAGs) of uncultivated organisms or genomic fragments assembled from metagenomic sequencing. Here, we present VirSorter, a tool designed to detect viral signal in these different types of microbial sequence data in both a reference-dependent and reference-independent manner, leveraging probabilistic models and extensive virome data to maximize detection of novel viruses. Performance testing shows that VirSorter’s prophage prediction capability compares to that of available prophage predictors for complete genomes, but is superior in predicting viral sequences outside of a host genome (i.e., from extrachromosomal prophages, lytic infections, or partially assembled prophages). Furthermore, VirSorter outperforms existing tools for fragmented genomic and metagenomic datasets, and can identify viral signal in assembled sequence (contigs) as short as 3kb, while providing near-perfect identification (>95% Recall and 100% Precision) on contigs of at least 10kb. Because VirSorter scales to large datasets, it can also be used in “reverse” to more confidently identify viral sequence in viral metagenomes by sorting away cellular DNA whether derived from gene transfer agents, generalized transduction or contamination. Finally, VirSorter is made available through the iPlant Cyberinfrastructure that provides a web-based user interface interconnected with the required computing resources. VirSorter thus complements existing prophage prediction softwares to better leverage fragmented, SAG and metagenomic datasets in a way that will scale to modern sequencing. Given these features, VirSorter should enable the discovery of new viruses in microbial datasets, and further our understanding of uncultivated viral communities across diverse ecosystems. PMID:26038737
Lee, Adam; Nolan, Alison; Watson, Jason; Tristem, Michael
2013-09-19
The evolutionary arms race between mammals and retroviruses has long been recognized as one of the oldest host-parasite interactions. Rapid evolution rates in exogenous retroviruses have often made accurate viral age estimations highly problematic. Endogenous retroviruses (ERVs), however, integrate into the germline of their hosts, and are subjected to their evolutionary rates. This study describes, for the first time, a retroviral orthologue predating the divergence of placental mammals, giving it a minimum age of 104-110 Myr. Simultaneously, other orthologous selfish genetic elements (SGEs), inserted into the ERV sequence, provide evidence for the oldest individual mammalian-wide interspersed repeat and medium-reiteration frequency interspersed repeat mammalian repeats, with the same minimum age. The combined use of shared SGEs and reconstruction of viral orthologies defines new limits and increases maximum 'lookback' times, with subsequent implications for the field of paleovirology.
Schulte, Michael B; Draghi, Jeremy A; Plotkin, Joshua B; Andino, Raul
2015-01-01
Life history theory posits that the sequence and timing of events in an organism's lifespan are fine-tuned by evolution to maximize the production of viable offspring. In a virus, a life history strategy is largely manifested in its replication mode. Here, we develop a stochastic mathematical model to infer the replication mode shaping the structure and mutation distribution of a poliovirus population in an intact single infected cell. We measure production of RNA and poliovirus particles through the infection cycle, and use these data to infer the parameters of our model. We find that on average the viral progeny produced from each cell are approximately five generations removed from the infecting virus. Multiple generations within a single cell infection provide opportunities for significant accumulation of mutations per viral genome and for intracellular selection. DOI: http://dx.doi.org/10.7554/eLife.03753.001 PMID:25635405
Bruder, Katherine; Malki, Kema; Cooper, Alexandria; Sible, Emily; Shapiro, Jason W.; Watkins, Siobhan C.; Putonti, Catherine
2016-01-01
Advances in bioinformatics and sequencing technologies have allowed for the analysis of complex microbial communities at an unprecedented rate. While much focus is often placed on the cellular members of these communities, viruses play a pivotal role, particularly bacteria-infecting viruses (bacteriophages); phages mediate global biogeochemical processes and drive microbial evolution through bacterial grazing and horizontal gene transfer. Despite their importance and ubiquity in nature, very little is known about the diversity and structure of viral communities. Though the need for culture-based methods for viral identification has been somewhat circumvented through metagenomic techniques, the analysis of metaviromic data is marred with many unique issues. In this review, we examine the current bioinformatic approaches for metavirome analyses and the inherent challenges facing the field as illustrated by the ongoing efforts in the exploration of freshwater phage populations. PMID:27375355
Ramey, Andy M.; Poulson, Rebecca L.; Gonzalez-Reiche, Ana S.; Perez, Daniel R.; Stalknecht, David E.; Brown, Justin D.
2014-01-01
Recent repeated isolation of H14 hemagglutinin subtype influenza A viruses (IAVs) in the New World waterfowl provides evidence to suggest that host and/or geographic ranges for viruses of this subtype may be expanding. In this study, we used genomic analyses to gain inference on the origin and evolution of H14 viruses in New World waterfowl and conducted an experimental challenge study in mallards (Anas platyrhynchos) to evaluate pathogenicity, viral replication, and transmissibility of a representative viral strain in a natural host species. Genomic characterization of H14 subtype IAVs isolated from New World waterfowl, including three isolates sequenced specifically for this study, revealed high nucleotide identity among individual gene segments (e.g. ≥95% shared identity among H14 HA gene segments). In contrast, lower shared identity was observed among internal gene segments. Furthermore, multiple neuraminidase subtypes were observed for H14 IAVs isolated in the New World. Gene segments of H14 viruses isolated after 2010 shared ancestral genetic lineages with IAVs isolated from wild birds throughout North America. Thus, genomic characterization provided evidence for viral evolution in New World waterfowl through genetic drift and genetic shift since purported introduction from Eurasia. In the challenge study, no clinical disease or lesions were observed among mallards experimentally inoculated with A/blue-winged teal/Texas/AI13-1028/2013(H14N5) or exposed via contact with infected birds. Titers of viral shedding for mallards challenged with the H14N5 IAV were highest at two days post-inoculation (DPI); however shedding was detected up to nine DPI using cloacal swabs. The distribution of viral antigen among mallards infected with H14N5 IAV was largely restricted to enterocytes lining the villi in the lower intestinal tract and in the epithelium of the bursa of Fabricius. Characterization of the infectivity of A/blue-winged teal/Texas/AI13-1028/2013(H14N5) in mallards provides support for similarities in viral replication and shedding as compared to previously described waterfowl-adapted, low pathogenic IAV strains in ducks.
Hartley, Carol A.; Vaz, Paola K.; Diaz-Méndez, Andrés; García, Maricarmen; Spatz, Stephen; Devlin, Joanne M.
2017-01-01
ABSTRACT Recombination is a feature of many alphaherpesviruses that infect people and animals. Infectious laryngotracheitis virus (ILTV; Gallid alphaherpesvirus 1) causes respiratory disease in chickens, resulting in significant production losses in poultry industries worldwide. Natural (field) ILTV recombination is widespread, particularly recombination between attenuated ILTV vaccine strains to create virulent viruses. These virulent recombinants have had a major impact on animal health. Recently, the development of a single nucleotide polymorphism (SNP) genotyping assay for ILTV has helped to understand ILTV recombination in laboratory settings. In this study, we applied this SNP genotyping assay to further examine ILTV recombination in the natural host. Following coinoculation of specific-pathogen-free chickens, we examined the resultant progeny for evidence of viral recombination and characterized the diversity of the recombinants over time. The results showed that ILTV replication and recombination are closely related and that the recombinant viral progeny are most diverse 4 days after coinoculation, which is the peak of viral replication. Further, the locations of recombination breakpoints in a selection of the recombinant progeny, and in field isolates of ILTV from different geographical regions, were examined following full-genome sequencing and used to identify recombination hot spots in the ILTV genome. IMPORTANCE Alphaherpesviruses are common causes of disease in people and animals. Recombination enables genome diversification in many different species of alphaherpesviruses, which can lead to the evolution of higher levels of viral virulence. Using the alphaherpesvirus infectious laryngotracheitis virus (ILTV), we performed coinfections in the natural host (chickens) to demonstrate high levels of virus recombination. Higher levels of diversity in the recombinant progeny coincided with the highest levels of virus replication. In the recombinant progeny, and in field isolates, recombination occurred at greater frequency in recombination hot spot regions of the virus genome. Our results suggest that control measures that aim to limit viral replication could offer the potential to limit virus recombination and thus the evolution of virulence. The development and use of vaccines that are focused on limiting virus replication, rather than vaccines that are focused more on limiting clinical disease, may be indicated in order to better control disease. PMID:28939604
Dilernia, Dario Alberto; Jones, Leandro; Rodriguez, Sabrina; Turk, Gabriela; Rubio, Andrea E.; Pampuro, Sandra; Gomez-Carrillo, Manuel; Bautista, Christian; Deluchi, Gabriel; Benetucci, Jorge; Lasala, María Beatriz; Lourtau, Leonardo; Losso, Marcelo Horacio; Perez, Héctor; Cahn, Pedro; Salomón, Horacio
2008-01-01
Background Cytotoxic T-Lymphocyte (CTL) response drives the evolution of HIV-1 at a host-level by selecting HLA-restricted escape mutations. Dissecting the dynamics of these escape mutations at a population-level would help to understand how HLA-mediated selection drives the evolution of HIV-1. Methodology/Principal Findings We undertook a study of the dynamics of HIV-1 CTL-escape mutations by analyzing through statistical approaches and phylogenetic methods the viral gene gag sequenced in plasma samples collected between the years 1987 and 2006 from 302 drug-naïve HIV-positive patients. By applying logistic regression models and after performing correction for multiple test, we identified 22 potential CTL-escape mutations (p-value<0.05; q-value<0.2); 10 of these associations were confirmed in samples biologically independent by a Bayesian Markov Chain Monte-Carlo method. Analyzing their prevalence back in time we found that escape mutations that are the consensus residue in samples collected after 2003 have actually significantly increased in time in one of either B or F subtype until becoming the most frequent residue, while dominating the other viral subtype. Their estimated prevalence in the viral subtype they did not dominate was lower than 30% for the majority of samples collected at the end of the 80's. In addition, when screening the entire viral region, we found that the 75% of positions significantly changing in time (p<0.05) were located within known CTL epitopes. Conclusions Across HIV Gag protein, the rise of polymorphisms from independent origin during the last twenty years of epidemic in our setting was related to an association with an HLA allele. The fact that these mutations accumulated in one of either B or F subtypes have also dominated the other subtype shows how this selection might be causing a convergence of viral subtypes to variants which are more likely to evade the immune response of the population where they circulate. PMID:18941505
Fonseca-Coronado, Salvador; Escobar-Gutiérrez, Alejandro; Ruiz-Tovar, Karina; Cruz-Rivera, Mayra Yolanda; Rivera-Osorio, Pilar; Vazquez-Pichardo, Mauricio; Carpio-Pedroza, Juan Carlos; Ruíz-Pacheco, Juan Alberto; Cazares, Fernando
2012-01-01
The use of telaprevir and boceprevir, both protease inhibitors (PI), as part of the specifically targeted antiviral therapy for hepatitis C (STAT-C) has significantly improved sustained virologic response (SVR) rates. However, different clinical studies have also identified several mutations associated with viral resistance to both PIs. In the absence of selective pressure, drug-resistant hepatitis C virus (HCV) mutants are generally present at low frequency, making mutation detection challenging. Here, we describe a mismatch amplification mutation assay (MAMA) PCR method for the specific detection of naturally occurring drug-resistant HCV mutants. MAMA PCR successfully identified the corresponding HCV variants, while conventional methods such as direct sequencing, endpoint limiting dilution (EPLD), and bacterial cloning were not sensitive enough to detect circulating drug-resistant mutants in clinical specimens. Ultradeep pyrosequencing was used to confirm the presence of the corresponding HCV mutants. In treatment-naïve patients, the frequency of all resistant variants was below 1%. Deep amplicon sequencing allowed a detailed analysis of the structure of the viral population among these patients, showing that the evolution of the NS3 is limited to a rather small sequence space. Monitoring of HCV drug resistance before and during treatment is likely to provide important information for management of patients undergoing anti-HCV therapy. PMID:22116161
Cantanhêde, Lilian Motta; Fernandes, Flavia Gonçalves; Ferreira, Gabriel Eduardo Melim; Porrozzi, Renato; Ferreira, Ricardo de Godoi Mattos; Cupolillo, Elisa
2018-01-01
Cutaneous leishmaniasis is a neglected parasitic disease that manifests in infected individuals under different phenotypes, with a range of factors contributing to its broad clinical spectrum. One factor, Leishmania RNA Virus 1 (LRV1), has been described as an endosymbiont present in different species of Leishmania. LRV1 significantly worsens the lesion, exacerbating the immune response in both experimentally infected animals and infected individuals. Little is known about the composition and genetic diversity of these viruses. Here, we investigated the relationship between the genetic composition of LRV1 detected in strains of Leishmania (Viannia) braziliensis and L. (V.) guyanensis and the interaction between the endosymbiont and the parasitic species, analyzing an approximately 850 base pair region of the viral genome. We also included one LRV1 sequence detected in L. (V.) shawi, representing the first report of LRV1 in a species other than L. braziliensis and L. guyanensis. The results illustrate the genetic diversity of the LRV1 strains analyzed here, with smaller divergences detected among viral sequences from the same parasite species. Phylogenetic analyses showed that the LRV1 sequences are grouped according to the parasite species and possibly according to the population of the parasite in which the virus was detected, corroborating the hypothesis of joint evolution of the viruses with the speciation of Leishmania parasites.
Potential Links between Hepadnavirus and Bornavirus Sequences in the Host Genome and Cancer.
Honda, Tomoyuki
2017-01-01
Various viruses leave their sequences in the host genomes during infection. Such events occur mainly in retrovirus infection but also sometimes in DNA and non-retroviral RNA virus infections. If viral sequences are integrated into the genomes of germ line cells, the sequences can become inherited as endogenous viral elements (EVEs). The integration events of viral sequences may have oncogenic potential. Because proviral integrations of some retroviruses and/or reactivation of endogenous retroviruses are closely linked to cancers, viral insertions related to non-retroviral viruses also possibly contribute to cancer development. This article focuses on genomic viral sequences derived from two non-retroviral viruses, whose endogenization is already reported, and discusses their possible contributions to cancer. Viral insertions of hepatitis B virus play roles in the development of hepatocellular carcinoma. Endogenous bornavirus-like elements, the only non-retroviral RNA virus-related EVEs found in the human genome, may also be involved in cancer formation. In addition, the possible contribution of the interactions between viruses and retrotransposons, which seem to be a major driving force for generating EVEs related to non-retroviral RNA viruses, to cancers will be discussed. Future studies regarding the possible links described here may open a new avenue for the development of novel therapeutics for tumor virus-related cancers and/or provide novel insights into EVE functions.
Metagenomic characterization of viral communities in Goseong Bay, Korea
NASA Astrophysics Data System (ADS)
Hwang, Jinik; Park, So Yun; Park, Mirye; Lee, Sukchan; Jo, Yeonhwa; Cho, Won Kyong; Lee, Taek-Kyun
2016-12-01
In this study, seawater samples were collected from Goseong Bay, Korea in March 2014 and viral populations were examined by metagenomics assembly. Enrichment of marine viral particles using FeCl3 followed by next-generation sequencing produced numerous sequences. De novo assembly and BLAST search showed that most of the obtained contigs were unknown sequences and only 0.74% of sequences were associated with known viruses. As a result, 138 viruses, including bacteriophages (87%), viruses infecting algae and others (13%) were identified. The identified 138 viruses were divided into 11 orders, 14 families, 34 genera, and 133 species. The dominant viruses were Pelagibacter phage HTVC010P and Roseobacter phage SIO1. The viruses infecting algae, including the Ostreococcus species, accounted for 9.4% of total identified viruses. In addition, we identified pathogenic herpes viruses infecting fishes and giant viruses infecting parasitic acanthamoeba species. This is a comprehensive study to reveal the viral populations in the Goseong Bay using metagenomics. The information associated with the marine viral community in Goseong Bay, Korea will be useful for comparative analysis in other marine viral communities.
Poole, Daniel S.; Yú, Shuǐqìng; Caì, Yíngyún; Dinis, Jorge M.; Müller, Marcel A.; Jordan, Ingo; Friedrich, Thomas C.; Kuhn, Jens H.
2014-01-01
ABSTRACT The recent identification of highly divergent influenza A viruses in bats revealed a new, geographically dispersed viral reservoir. To investigate the molecular mechanisms of host-restricted viral tropism and the potential for transmission of viruses between humans and bats, we exposed a panel of cell lines from bats of diverse species to a prototypical human-origin influenza A virus. All of the tested bat cell lines were susceptible to influenza A virus infection. Experimental evolution of human and avian-like viruses in bat cells resulted in efficient replication and created highly cytopathic variants. Deep sequencing of adapted human influenza A virus revealed a mutation in the PA polymerase subunit not previously described, M285K. Recombinant virus with the PA M285K mutation completely phenocopied the adapted virus. Adaptation of an avian virus-like virus resulted in the canonical PB2 E627K mutation that is required for efficient replication in other mammals. None of the adaptive mutations occurred in the gene for viral hemagglutinin, a gene that frequently acquires changes to recognize host-specific variations in sialic acid receptors. We showed that human influenza A virus uses canonical sialic acid receptors to infect bat cells, even though bat influenza A viruses do not appear to use these receptors for virus entry. Our results demonstrate that bats are unique hosts that select for both a novel mutation and a well-known adaptive mutation in the viral polymerase to support replication. IMPORTANCE Bats constitute well-known reservoirs for viruses that may be transferred into human populations, sometimes with fatal consequences. Influenza A viruses have recently been identified in bats, dramatically expanding the known host range of this virus. Here we investigated the replication of human influenza A virus in bat cell lines and the barriers that the virus faces in this new host. Human influenza A and B viruses infected cells from geographically and evolutionarily diverse New and Old World bats. Viruses mutated during infections in bat cells, resulting in increased replication and cytopathic effects. These mutations were mapped to the viral polymerase and shown to be solely responsible for adaptation to bat cells. Our data suggest that replication of human influenza A viruses in a nonnative host drives the evolution of new variants and may be an important source of genetic diversity. PMID:25142579
Viral dark matter and virus–host interactions resolved from publicly available microbial genomes
DOE Office of Scientific and Technical Information (OSTI.GOV)
Roux, Simon; Hallam, Steven J.; Woyke, Tanja
The ecological importance of viruses is now widely recognized, yet our limited knowledge of viral sequence space and virus–host interactions precludes accurate prediction of their roles and impacts. In this study, we mined publicly available bacterial and archaeal genomic data sets to identify 12,498 high-confidence viral genomes linked to their microbial hosts. These data augment public data sets 10-fold, provide first viral sequences for 13 new bacterial phyla including ecologically abundant phyla, and help taxonomically identify 7–38% of ‘unknown’ sequence space in viromes. Genome- and network-based classification was largely consistent with accepted viral taxonomy and suggested that (i) 264 newmore » viral genera were identified (doubling known genera) and (ii) cross-taxon genomic recombination is limited. Further analyses provided empirical data on extrachromosomal prophages and coinfection prevalences, as well as evaluation of in silico virus–host linkage predictions. Together these findings illustrate the value of mining viral signal from microbial genomes.« less
Viral dark matter and virus-host interactions resolved from publicly available microbial genomes.
Roux, Simon; Hallam, Steven J; Woyke, Tanja; Sullivan, Matthew B
2015-07-22
The ecological importance of viruses is now widely recognized, yet our limited knowledge of viral sequence space and virus-host interactions precludes accurate prediction of their roles and impacts. In this study, we mined publicly available bacterial and archaeal genomic data sets to identify 12,498 high-confidence viral genomes linked to their microbial hosts. These data augment public data sets 10-fold, provide first viral sequences for 13 new bacterial phyla including ecologically abundant phyla, and help taxonomically identify 7-38% of 'unknown' sequence space in viromes. Genome- and network-based classification was largely consistent with accepted viral taxonomy and suggested that (i) 264 new viral genera were identified (doubling known genera) and (ii) cross-taxon genomic recombination is limited. Further analyses provided empirical data on extrachromosomal prophages and coinfection prevalences, as well as evaluation of in silico virus-host linkage predictions. Together these findings illustrate the value of mining viral signal from microbial genomes.
Viral dark matter and virus–host interactions resolved from publicly available microbial genomes
Roux, Simon; Hallam, Steven J.; Woyke, Tanja; ...
2015-07-22
The ecological importance of viruses is now widely recognized, yet our limited knowledge of viral sequence space and virus–host interactions precludes accurate prediction of their roles and impacts. In this study, we mined publicly available bacterial and archaeal genomic data sets to identify 12,498 high-confidence viral genomes linked to their microbial hosts. These data augment public data sets 10-fold, provide first viral sequences for 13 new bacterial phyla including ecologically abundant phyla, and help taxonomically identify 7–38% of ‘unknown’ sequence space in viromes. Genome- and network-based classification was largely consistent with accepted viral taxonomy and suggested that (i) 264 newmore » viral genera were identified (doubling known genera) and (ii) cross-taxon genomic recombination is limited. Further analyses provided empirical data on extrachromosomal prophages and coinfection prevalences, as well as evaluation of in silico virus–host linkage predictions. Together these findings illustrate the value of mining viral signal from microbial genomes.« less
Carré-Mlouka, A; Gaumer, S; Gay, P; Petitjean, A M; Coulondre, C; Dru, P; Bras, F; Dezélée, S; Contamine, D
2007-05-01
Ref(2)P has been described as one of the Drosophila proteins that interacts with the sigma virus cycle. We generated alleles to identify critical residues involved in the restrictive (inhibiting viral multiplication) or permissive (allowing viral multiplication) character of Ref(2)P. We demonstrate that permissive alleles increase the ability of the sigma virus to infect Drosophila when compared to null alleles and we confirm that restrictive alleles decrease this capacity. Moreover, we have created alleles unfunctional in viral cycling while functional for Ref(2)P fly functions. This type of allele had never been observed before and shows that fly- and virus-related activities of Ref(2)P are separable. The viral status of Ref(2)P variants is determined by the amino-terminal PB1 domain polymorphism. In addition, an isolated PB1 domain mimics virus-related functions even if it is similar to a loss of function toward fly-related activities. The evolutionary tree of the Ref(2)P PB1 domain that we could build on the basis of the natural allele sequences is in agreement with an evolution of PB1 domain due to successive transient selection waves.
Lewis, Jo E; Brameld, John M; Hill, Phil; Barrett, Perry; Ebling, Francis J P; Jethwa, Preeti H
2015-12-30
The viral 2A sequence has become an attractive alternative to the traditional internal ribosomal entry site (IRES) for simultaneous over-expression of two genes and in combination with recombinant adeno-associated viruses (rAAV) has been used to manipulate gene expression in vitro. To develop a rAAV construct in combination with the viral 2A sequence to allow long-term over-expression of the vgf gene and fluorescent marker gene for tracking of the transfected neurones in vivo. Transient transfection of the AAV plasmid containing the vgf gene, viral 2A sequence and eGFP into SH-SY5Y cells resulted in eGFP fluorescence comparable to a commercially available reporter construct. This increase in fluorescent cells was accompanied by an increase in VGF mRNA expression. Infusion of the rAAV vector containing the vgf gene, viral 2A sequence and eGFP resulted in eGFP fluorescence in the hypothalamus of both mice and Siberian hamsters, 32 weeks post infusion. In situ hybridisation confirmed that the location of VGF mRNA expression in the hypothalamus corresponded to the eGFP pattern of fluorescence. The viral 2A sequence is much smaller than the traditional IRES and therefore allowed over-expression of the vgf gene with fluorescent tracking without compromising viral capacity. The use of the viral 2A sequence in the AAV plasmid allowed the simultaneous expression of both genes in vitro. When used in combination with rAAV it resulted in long-term over-expression of both genes at equivalent locations in the hypothalamus of both Siberian hamsters and mice, without any adverse effects. Copyright © 2015 The Authors. Published by Elsevier B.V. All rights reserved.
Carnegie, Nicole Bohme; Wang, Rui; Novitsky, Vladimir; De Gruttola, Victor
2014-01-01
Linkage analysis is useful in investigating disease transmission dynamics and the effect of interventions on them, but estimates of probabilities of linkage between infected people from observed data can be biased downward when missingness is informative. We investigate variation in the rates at which subjects' viral genotypes link across groups defined by viral load (low/high) and antiretroviral treatment (ART) status using blood samples from household surveys in the Northeast sector of Mochudi, Botswana. The probability of obtaining a sequence from a sample varies with viral load; samples with low viral load are harder to amplify. Pairwise genetic distances were estimated from aligned nucleotide sequences of HIV-1C env gp120. It is first shown that the probability that randomly selected sequences are linked can be estimated consistently from observed data. This is then used to develop estimates of the probability that a sequence from one group links to at least one sequence from another group under the assumption of independence across pairs. Furthermore, a resampling approach is developed that accounts for the presence of correlation across pairs, with diagnostics for assessing the reliability of the method. Sequences were obtained for 65% of subjects with high viral load (HVL, n = 117), 54% of subjects with low viral load but not on ART (LVL, n = 180), and 45% of subjects on ART (ART, n = 126). The probability of linkage between two individuals is highest if both have HVL, and lowest if one has LVL and the other has LVL or is on ART. Linkage across groups is high for HVL and lower for LVL and ART. Adjustment for missing data increases the group-wise linkage rates by 40–100%, and changes the relative rates between groups. Bias in inferences regarding HIV viral linkage that arise from differential ability to genotype samples can be reduced by appropriate methods for accommodating missing data. PMID:24415932
Carnegie, Nicole Bohme; Wang, Rui; Novitsky, Vladimir; De Gruttola, Victor
2014-01-01
Linkage analysis is useful in investigating disease transmission dynamics and the effect of interventions on them, but estimates of probabilities of linkage between infected people from observed data can be biased downward when missingness is informative. We investigate variation in the rates at which subjects' viral genotypes link across groups defined by viral load (low/high) and antiretroviral treatment (ART) status using blood samples from household surveys in the Northeast sector of Mochudi, Botswana. The probability of obtaining a sequence from a sample varies with viral load; samples with low viral load are harder to amplify. Pairwise genetic distances were estimated from aligned nucleotide sequences of HIV-1C env gp120. It is first shown that the probability that randomly selected sequences are linked can be estimated consistently from observed data. This is then used to develop estimates of the probability that a sequence from one group links to at least one sequence from another group under the assumption of independence across pairs. Furthermore, a resampling approach is developed that accounts for the presence of correlation across pairs, with diagnostics for assessing the reliability of the method. Sequences were obtained for 65% of subjects with high viral load (HVL, n = 117), 54% of subjects with low viral load but not on ART (LVL, n = 180), and 45% of subjects on ART (ART, n = 126). The probability of linkage between two individuals is highest if both have HVL, and lowest if one has LVL and the other has LVL or is on ART. Linkage across groups is high for HVL and lower for LVL and ART. Adjustment for missing data increases the group-wise linkage rates by 40-100%, and changes the relative rates between groups. Bias in inferences regarding HIV viral linkage that arise from differential ability to genotype samples can be reduced by appropriate methods for accommodating missing data.
Fuentes-Pananá, Ezequiel M; Larios-Serrato, Violeta; Méndez-Tenorio, Alfonso; Morales-Sánchez, Abigail; Arias, Carlos F; Torres, Javier
2016-01-01
Gastric (GC) and breast (BrC) cancer are two of the most common and deadly tumours. Different lines of evidence suggest a possible causative role of viral infections for both GC and BrC. Wide genome sequencing (WGS) technologies allow searching for viral agents in tissues of patients with cancer. These technologies have already contributed to establish virus-cancer associations as well as to discovery new tumour viruses. The objective of this study was to document possible associations of viral infection with GC and BrC in Mexican patients. In order to gain idea about cost effective conditions of experimental sequencing, we first carried out an in silico simulation of WGS. The next-generation-platform IlluminaGallx was then used to sequence GC and BrC tumour samples. While we did not find viral sequences in tissues from BrC patients, multiple reads matching Epstein-Barr virus (EBV) sequences were found in GC tissues. An end-point polymerase chain reaction confirmed an enrichment of EBV sequences in one of the GC samples sequenced, validating the next-generation sequencing-bioinformatics pipeline. PMID:26910355
Fuentes-Pananá, Ezequiel M; Larios-Serrato, Violeta; Méndez-Tenorio, Alfonso; Morales-Sánchez, Abigail; Arias, Carlos F; Torres, Javier
2016-03-01
Gastric (GC) and breast (BrC) cancer are two of the most common and deadly tumours. Different lines of evidence suggest a possible causative role of viral infections for both GC and BrC. Wide genome sequencing (WGS) technologies allow searching for viral agents in tissues of patients with cancer. These technologies have already contributed to establish virus-cancer associations as well as to discovery new tumour viruses. The objective of this study was to document possible associations of viral infection with GC and BrC in Mexican patients. In order to gain idea about cost effective conditions of experimental sequencing, we first carried out an in silico simulation of WGS. The next-generation-platform IlluminaGallx was then used to sequence GC and BrC tumour samples. While we did not find viral sequences in tissues from BrC patients, multiple reads matching Epstein-Barr virus (EBV) sequences were found in GC tissues. An end-point polymerase chain reaction confirmed an enrichment of EBV sequences in one of the GC samples sequenced, validating the next-generation sequencing-bioinformatics pipeline.
Coffey, Lark L; Page, Brady L; Greninger, Alexander L; Herring, Belinda L; Russell, Richard C; Doggett, Stephen L; Haniotis, John; Wang, Chunlin; Deng, Xutao; Delwart, Eric L
2014-01-05
Viral metagenomics characterizes known and identifies unknown viruses based on sequence similarities to any previously sequenced viral genomes. A metagenomics approach was used to identify virus sequences in Australian mosquitoes causing cytopathic effects in inoculated mammalian cell cultures. Sequence comparisons revealed strains of Liao Ning virus (Reovirus, Seadornavirus), previously detected only in China, livestock-infecting Stretch Lagoon virus (Reovirus, Orbivirus), two novel dimarhabdoviruses, named Beaumont and North Creek viruses, and two novel orthobunyaviruses, named Murrumbidgee and Salt Ash viruses. The novel virus proteomes diverged by ≥ 50% relative to their closest previously genetically characterized viral relatives. Deep sequencing also generated genomes of Warrego and Wallal viruses, orbiviruses linked to kangaroo blindness, whose genomes had not been fully characterized. This study highlights viral metagenomics in concert with traditional arbovirus surveillance to characterize known and new arboviruses in field-collected mosquitoes. Follow-up epidemiological studies are required to determine whether the novel viruses infect humans. © 2013 Elsevier Inc. All rights reserved.
Rocha, Cheila; Calado, Rita; Borrego, Pedro; ...
2013-10-24
Background: therapy and the majority of HIV-2 infected individuals survive as elite controllers with normal CD4 + T cell counts and low or undetectable plasma viral load. Neutralizing antibodies (Nabs) are thought to play a central role in HIV-2 evolution and pathogenesis. However, the dynamic of the Nab response and resulting HIV-2 escape during acute infection and their impact in HIV-2 evolution and disease progression remain largely unknown. Our objective was to characterize the Nab response and the molecular and phenotypic evolution of HIV-2 in association with Nab escape in the first years of infection in two children infected atmore » birth. As a result, CD4 + T cells decreased from about 50% to below 30% in both children in the first five years of infection and the infecting R5 viruses were replaced by X4 viruses within the same period. With antiretroviral therapy, viral load in child 1 decreased to undetectable levels and CD4 + T cells recovered to normal levels, which have been sustained at least until the age of 12. In contrast, viral load increased in child 2 and she progressed to AIDS and death at age 9. Beginning in the first year of life, child 1 raised high titers of antibodies that neutralized primary R5 isolates more effectively than X4 isolates, both autologous and heterologous. Child 2 raised a weak X4-specific Nab response that decreased sharply as disease progressed. Rate of evolution, nucleotide and amino acid diversity, and positive selection, were significantly higher in the envelope of child 1 compared to child 2. Rates of R5-to-X4 tropism switch, of V1 and V3 sequence diversification, and of convergence of V3 to a β-hairpin structure were related with rate of escape from the neutralizing antibodies. Finally, our data suggests that the molecular and phenotypic evolution of the human immunodeficiency virus type 2 envelope are related with the dynamics of the neutralizing antibody response providing further support for a model in which Nabs play an important role in HIV-2 pathogenesis.« less
DOE Office of Scientific and Technical Information (OSTI.GOV)
Rocha, Cheila; Calado, Rita; Borrego, Pedro
Background: therapy and the majority of HIV-2 infected individuals survive as elite controllers with normal CD4 + T cell counts and low or undetectable plasma viral load. Neutralizing antibodies (Nabs) are thought to play a central role in HIV-2 evolution and pathogenesis. However, the dynamic of the Nab response and resulting HIV-2 escape during acute infection and their impact in HIV-2 evolution and disease progression remain largely unknown. Our objective was to characterize the Nab response and the molecular and phenotypic evolution of HIV-2 in association with Nab escape in the first years of infection in two children infected atmore » birth. As a result, CD4 + T cells decreased from about 50% to below 30% in both children in the first five years of infection and the infecting R5 viruses were replaced by X4 viruses within the same period. With antiretroviral therapy, viral load in child 1 decreased to undetectable levels and CD4 + T cells recovered to normal levels, which have been sustained at least until the age of 12. In contrast, viral load increased in child 2 and she progressed to AIDS and death at age 9. Beginning in the first year of life, child 1 raised high titers of antibodies that neutralized primary R5 isolates more effectively than X4 isolates, both autologous and heterologous. Child 2 raised a weak X4-specific Nab response that decreased sharply as disease progressed. Rate of evolution, nucleotide and amino acid diversity, and positive selection, were significantly higher in the envelope of child 1 compared to child 2. Rates of R5-to-X4 tropism switch, of V1 and V3 sequence diversification, and of convergence of V3 to a β-hairpin structure were related with rate of escape from the neutralizing antibodies. Finally, our data suggests that the molecular and phenotypic evolution of the human immunodeficiency virus type 2 envelope are related with the dynamics of the neutralizing antibody response providing further support for a model in which Nabs play an important role in HIV-2 pathogenesis.« less
Expanding the Diversity of Mycobacteriophages: Insights into Genome Architecture and Evolution
Pope, Welkin H.; Jacobs-Sera, Deborah; Russell, Daniel A.; Peebles, Craig L.; Al-Atrache, Zein; Alcoser, Turi A.; Alexander, Lisa M.; Alfano, Matthew B.; Alford, Samantha T.; Amy, Nichols E.; Anderson, Marie D.; Anderson, Alexander G.; Ang, Andrew A. S.; Ares, Manuel; Barber, Amanda J.; Barker, Lucia P.; Barrett, Jonathan M.; Barshop, William D.; Bauerle, Cynthia M.; Bayles, Ian M.; Belfield, Katherine L.; Best, Aaron A.; Borjon, Agustin; Bowman, Charles A.; Boyer, Christine A.; Bradley, Kevin W.; Bradley, Victoria A.; Broadway, Lauren N.; Budwal, Keshav; Busby, Kayla N.; Campbell, Ian W.; Campbell, Anne M.; Carey, Alyssa; Caruso, Steven M.; Chew, Rebekah D.; Cockburn, Chelsea L.; Cohen, Lianne B.; Corajod, Jeffrey M.; Cresawn, Steven G.; Davis, Kimberly R.; Deng, Lisa; Denver, Dee R.; Dixon, Breyon R.; Ekram, Sahrish; Elgin, Sarah C. R.; Engelsen, Angela E.; English, Belle E. V.; Erb, Marcella L.; Estrada, Crystal; Filliger, Laura Z.; Findley, Ann M.; Forbes, Lauren; Forsyth, Mark H.; Fox, Tyler M.; Fritz, Melissa J.; Garcia, Roberto; George, Zindzi D.; Georges, Anne E.; Gissendanner, Christopher R.; Goff, Shannon; Goldstein, Rebecca; Gordon, Kobie C.; Green, Russell D.; Guerra, Stephanie L.; Guiney-Olsen, Krysta R.; Guiza, Bridget G.; Haghighat, Leila; Hagopian, Garrett V.; Harmon, Catherine J.; Harmson, Jeremy S.; Hartzog, Grant A.; Harvey, Samuel E.; He, Siping; He, Kevin J.; Healy, Kaitlin E.; Higinbotham, Ellen R.; Hildebrandt, Erin N.; Ho, Jason H.; Hogan, Gina M.; Hohenstein, Victoria G.; Holz, Nathan A.; Huang, Vincent J.; Hufford, Ericka L.; Hynes, Peter M.; Jackson, Arrykka S.; Jansen, Erica C.; Jarvik, Jonathan; Jasinto, Paul G.; Jordan, Tuajuanda C.; Kasza, Tomas; Katelyn, Murray A.; Kelsey, Jessica S.; Kerrigan, Larisa A.; Khaw, Daryl; Kim, Junghee; Knutter, Justin Z.; Ko, Ching-Chung; Larkin, Gail V.; Laroche, Jennifer R.; Latif, Asma; Leuba, Kohana D.; Leuba, Sequoia I.; Lewis, Lynn O.; Loesser-Casey, Kathryn E.; Long, Courtney A.; Lopez, A. Javier; Lowery, Nicholas; Lu, Tina Q.; Mac, Victor; Masters, Isaac R.; McCloud, Jazmyn J.; McDonough, Molly J.; Medenbach, Andrew J.; Menon, Anjali; Miller, Rachel; Morgan, Brandon K.; Ng, Patrick C.; Nguyen, Elvis; Nguyen, Katrina T.; Nguyen, Emilie T.; Nicholson, Kaylee M.; Parnell, Lindsay A.; Peirce, Caitlin E.; Perz, Allison M.; Peterson, Luke J.; Pferdehirt, Rachel E.; Philip, Seegren V.; Pogliano, Kit; Pogliano, Joe; Polley, Tamsen; Puopolo, Erica J.; Rabinowitz, Hannah S.; Resiss, Michael J.; Rhyan, Corwin N.; Robinson, Yetta M.; Rodriguez, Lauren L.; Rose, Andrew C.; Rubin, Jeffrey D.; Ruby, Jessica A.; Saha, Margaret S.; Sandoz, James W.; Savitskaya, Judith; Schipper, Dale J.; Schnitzler, Christine E.; Schott, Amanda R.; Segal, J. Bradley; Shaffer, Christopher D.; Sheldon, Kathryn E.; Shepard, Erica M.; Shepardson, Jonathan W.; Shroff, Madav K.; Simmons, Jessica M.; Simms, Erika F.; Simpson, Brandy M.; Sinclair, Kathryn M.; Sjoholm, Robert L.; Slette, Ingrid J.; Spaulding, Blaire C.; Straub, Clark L.; Stukey, Joseph; Sughrue, Trevor; Tang, Tin-Yun; Tatyana, Lyons M.; Taylor, Stephen B.; Taylor, Barbara J.; Temple, Louise M.; Thompson, Jasper V.; Tokarz, Michael P.; Trapani, Stephanie E.; Troum, Alexander P.; Tsay, Jonathan; Tubbs, Anthony T.; Walton, Jillian M.; Wang, Danielle H.; Wang, Hannah; Warner, John R.; Weisser, Emilie G.; Wendler, Samantha C.; Weston-Hafer, Kathleen A.; Whelan, Hilary M.; Williamson, Kurt E.; Willis, Angelica N.; Wirtshafter, Hannah S.; Wong, Theresa W.; Wu, Phillip; Yang, Yun jeong; Yee, Brandon C.; Zaidins, David A.; Zhang, Bo; Zúniga, Melina Y.; Hendrix, Roger W.; Hatfull, Graham F.
2011-01-01
Mycobacteriophages are viruses that infect mycobacterial hosts such as Mycobacterium smegmatis and Mycobacterium tuberculosis. All mycobacteriophages characterized to date are dsDNA tailed phages, and have either siphoviral or myoviral morphotypes. However, their genetic diversity is considerable, and although sixty-two genomes have been sequenced and comparatively analyzed, these likely represent only a small portion of the diversity of the mycobacteriophage population at large. Here we report the isolation, sequencing and comparative genomic analysis of 18 new mycobacteriophages isolated from geographically distinct locations within the United States. Although no clear correlation between location and genome type can be discerned, these genomes expand our knowledge of mycobacteriophage diversity and enhance our understanding of the roles of mobile elements in viral evolution. Expansion of the number of mycobacteriophages grouped within Cluster A provides insights into the basis of immune specificity in these temperate phages, and we also describe a novel example of apparent immunity theft. The isolation and genomic analysis of bacteriophages by freshman college students provides an example of an authentic research experience for novice scientists. PMID:21298013
Expanding the diversity of mycobacteriophages: insights into genome architecture and evolution.
Pope, Welkin H; Jacobs-Sera, Deborah; Russell, Daniel A; Peebles, Craig L; Al-Atrache, Zein; Alcoser, Turi A; Alexander, Lisa M; Alfano, Matthew B; Alford, Samantha T; Amy, Nichols E; Anderson, Marie D; Anderson, Alexander G; Ang, Andrew A S; Ares, Manuel; Barber, Amanda J; Barker, Lucia P; Barrett, Jonathan M; Barshop, William D; Bauerle, Cynthia M; Bayles, Ian M; Belfield, Katherine L; Best, Aaron A; Borjon, Agustin; Bowman, Charles A; Boyer, Christine A; Bradley, Kevin W; Bradley, Victoria A; Broadway, Lauren N; Budwal, Keshav; Busby, Kayla N; Campbell, Ian W; Campbell, Anne M; Carey, Alyssa; Caruso, Steven M; Chew, Rebekah D; Cockburn, Chelsea L; Cohen, Lianne B; Corajod, Jeffrey M; Cresawn, Steven G; Davis, Kimberly R; Deng, Lisa; Denver, Dee R; Dixon, Breyon R; Ekram, Sahrish; Elgin, Sarah C R; Engelsen, Angela E; English, Belle E V; Erb, Marcella L; Estrada, Crystal; Filliger, Laura Z; Findley, Ann M; Forbes, Lauren; Forsyth, Mark H; Fox, Tyler M; Fritz, Melissa J; Garcia, Roberto; George, Zindzi D; Georges, Anne E; Gissendanner, Christopher R; Goff, Shannon; Goldstein, Rebecca; Gordon, Kobie C; Green, Russell D; Guerra, Stephanie L; Guiney-Olsen, Krysta R; Guiza, Bridget G; Haghighat, Leila; Hagopian, Garrett V; Harmon, Catherine J; Harmson, Jeremy S; Hartzog, Grant A; Harvey, Samuel E; He, Siping; He, Kevin J; Healy, Kaitlin E; Higinbotham, Ellen R; Hildebrandt, Erin N; Ho, Jason H; Hogan, Gina M; Hohenstein, Victoria G; Holz, Nathan A; Huang, Vincent J; Hufford, Ericka L; Hynes, Peter M; Jackson, Arrykka S; Jansen, Erica C; Jarvik, Jonathan; Jasinto, Paul G; Jordan, Tuajuanda C; Kasza, Tomas; Katelyn, Murray A; Kelsey, Jessica S; Kerrigan, Larisa A; Khaw, Daryl; Kim, Junghee; Knutter, Justin Z; Ko, Ching-Chung; Larkin, Gail V; Laroche, Jennifer R; Latif, Asma; Leuba, Kohana D; Leuba, Sequoia I; Lewis, Lynn O; Loesser-Casey, Kathryn E; Long, Courtney A; Lopez, A Javier; Lowery, Nicholas; Lu, Tina Q; Mac, Victor; Masters, Isaac R; McCloud, Jazmyn J; McDonough, Molly J; Medenbach, Andrew J; Menon, Anjali; Miller, Rachel; Morgan, Brandon K; Ng, Patrick C; Nguyen, Elvis; Nguyen, Katrina T; Nguyen, Emilie T; Nicholson, Kaylee M; Parnell, Lindsay A; Peirce, Caitlin E; Perz, Allison M; Peterson, Luke J; Pferdehirt, Rachel E; Philip, Seegren V; Pogliano, Kit; Pogliano, Joe; Polley, Tamsen; Puopolo, Erica J; Rabinowitz, Hannah S; Resiss, Michael J; Rhyan, Corwin N; Robinson, Yetta M; Rodriguez, Lauren L; Rose, Andrew C; Rubin, Jeffrey D; Ruby, Jessica A; Saha, Margaret S; Sandoz, James W; Savitskaya, Judith; Schipper, Dale J; Schnitzler, Christine E; Schott, Amanda R; Segal, J Bradley; Shaffer, Christopher D; Sheldon, Kathryn E; Shepard, Erica M; Shepardson, Jonathan W; Shroff, Madav K; Simmons, Jessica M; Simms, Erika F; Simpson, Brandy M; Sinclair, Kathryn M; Sjoholm, Robert L; Slette, Ingrid J; Spaulding, Blaire C; Straub, Clark L; Stukey, Joseph; Sughrue, Trevor; Tang, Tin-Yun; Tatyana, Lyons M; Taylor, Stephen B; Taylor, Barbara J; Temple, Louise M; Thompson, Jasper V; Tokarz, Michael P; Trapani, Stephanie E; Troum, Alexander P; Tsay, Jonathan; Tubbs, Anthony T; Walton, Jillian M; Wang, Danielle H; Wang, Hannah; Warner, John R; Weisser, Emilie G; Wendler, Samantha C; Weston-Hafer, Kathleen A; Whelan, Hilary M; Williamson, Kurt E; Willis, Angelica N; Wirtshafter, Hannah S; Wong, Theresa W; Wu, Phillip; Yang, Yun jeong; Yee, Brandon C; Zaidins, David A; Zhang, Bo; Zúniga, Melina Y; Hendrix, Roger W; Hatfull, Graham F
2011-01-27
Mycobacteriophages are viruses that infect mycobacterial hosts such as Mycobacterium smegmatis and Mycobacterium tuberculosis. All mycobacteriophages characterized to date are dsDNA tailed phages, and have either siphoviral or myoviral morphotypes. However, their genetic diversity is considerable, and although sixty-two genomes have been sequenced and comparatively analyzed, these likely represent only a small portion of the diversity of the mycobacteriophage population at large. Here we report the isolation, sequencing and comparative genomic analysis of 18 new mycobacteriophages isolated from geographically distinct locations within the United States. Although no clear correlation between location and genome type can be discerned, these genomes expand our knowledge of mycobacteriophage diversity and enhance our understanding of the roles of mobile elements in viral evolution. Expansion of the number of mycobacteriophages grouped within Cluster A provides insights into the basis of immune specificity in these temperate phages, and we also describe a novel example of apparent immunity theft. The isolation and genomic analysis of bacteriophages by freshman college students provides an example of an authentic research experience for novice scientists.
Chen, Hui; Deng, Qiang; Ng, Sock Hoon; Lee, Raphael Tze Chuen; Maurer-Stroh, Sebastian; Zhai, Weiwei
2016-12-01
Influenza viruses are often propagated in a diverse set of culturing media and additional substitutions known as passage adaptation can cause extra evolution in the target strain, leading to ineffective vaccines. Using 25,482 H3N2 HA1 sequences curated from Global Initiative on Sharing All Influenza Data and National Center for Biotechnology Information databases, we found that passage adaptation is a very dynamic process that changes over time and evolves in a seesaw like pattern. After crossing the species boundary from bird to human in 1968, the influenza H3N2 virus evolves to be better adapted to the human environment and passaging them in embryonated eggs (i.e., an avian environment) leads to increasingly stronger positive selection. On the contrary, passage adaptation to the mammalian cell lines changes from positive selection to negative selection. Using two statistical tests, we identified 19 codon positions around the receptor binding domain strongly contributing to passage adaptation in the embryonated egg. These sites show strong convergent evolution and overlap extensively with positively selected sites identified in humans, suggesting that passage adaptation can confound many of the earlier studies on influenza evolution. Interestingly, passage adaptation in recent years seems to target a few codon positions in antigenic surface epitopes, which makes it difficult to produce antigenically unaltered vaccines using embryonic eggs. Our study outlines another interesting scenario whereby both convergent and adaptive evolution are working in synchrony driving viral adaptation. Future studies from sequence analysis to vaccine production need to take careful consideration of passage adaptation. © The Author 2016. Published by Oxford University Press on behalf of the Society for Molecular Biology and Evolution. All rights reserved. For permissions, please e-mail: journals.permissions@oup.com.
Miyashita, Shuhei; Ishibashi, Kazuhiro; Kishino, Hirohisa; Ishikawa, Masayuki
2015-01-01
Recent studies on evolutionarily distant viral groups have shown that the number of viral genomes that establish cell infection after cell-to-cell transmission is unexpectedly small (1–20 genomes). This aspect of viral infection appears to be important for the adaptation and survival of viruses. To clarify how the number of viral genomes that establish cell infection is determined, we developed a simulation model of cell infection for tomato mosaic virus (ToMV), a positive-strand RNA virus. The model showed that stochastic processes that govern the replication or degradation of individual genomes result in the infection by a small number of genomes, while a large number of infectious genomes are introduced in the cell. It also predicted two interesting characteristics regarding cell infection patterns: stochastic variation among cells in the number of viral genomes that establish infection and stochastic inequality in the accumulation of their progenies in each cell. Both characteristics were validated experimentally by inoculating tobacco cells with a library of nucleotide sequence–tagged ToMV and analyzing the viral genomes that accumulated in each cell using a high-throughput sequencer. An additional simulation model revealed that these two characteristics enhance selection during tissue infection. The cell infection model also predicted a mechanism that enhances selection at the cellular level: a small difference in the replication abilities of coinfected variants results in a large difference in individual accumulation via the multiple-round formation of the replication complex (i.e., the replication machinery). Importantly, this predicted effect was observed in vivo. The cell infection model was robust to changes in the parameter values, suggesting that other viruses could adopt similar adaptation mechanisms. Taken together, these data reveal a comprehensive picture of viral infection processes including replication, cell-to-cell transmission, and evolution, which are based on the stochastic behavior of the viral genome molecules in each cell. PMID:25781391
A phylogenomic data-driven exploration of viral origins and evolution
Nasir, Arshan; Caetano-Anollés, Gustavo
2015-01-01
The origin of viruses remains mysterious because of their diverse and patchy molecular and functional makeup. Although numerous hypotheses have attempted to explain viral origins, none is backed by substantive data. We take full advantage of the wealth of available protein structural and functional data to explore the evolution of the proteomic makeup of thousands of cells and viruses. Despite the extremely reduced nature of viral proteomes, we established an ancient origin of the “viral supergroup” and the existence of widespread episodes of horizontal transfer of genetic information. Viruses harboring different replicon types and infecting distantly related hosts shared many metabolic and informational protein structural domains of ancient origin that were also widespread in cellular proteomes. Phylogenomic analysis uncovered a universal tree of life and revealed that modern viruses reduced from multiple ancient cells that harbored segmented RNA genomes and coexisted with the ancestors of modern cells. The model for the origin and evolution of viruses and cells is backed by strong genomic and structural evidence and can be reconciled with existing models of viral evolution if one considers viruses to have originated from ancient cells and not from modern counterparts. PMID:26601271
2013-01-01
Background In vertebrates, it has been repeatedly demonstrated that genes encoding proteins involved in pathogen-recognition by adaptive immunity (e.g. MHC) are subject to intensive diversifying selection. On the other hand, the role and the type of selection processes shaping the evolution of innate-immunity genes are currently far less clear. In this study we analysed the natural variation and the evolutionary processes acting on two genes involved in the innate-immunity recognition of Microbe-Associated Molecular Patterns (MAMPs). Results We sequenced genes encoding Toll-like receptor 4 (Tlr4) and 7 (Tlr7), two of the key bacterial- and viral-sensing receptors of innate immunity, across 23 species within the subfamily Murinae. Although we have shown that the phylogeny of both Tlr genes is largely congruent with the phylogeny of rodents based on a comparably sized non-immune sequence dataset, we also identified several potentially important discrepancies. The sequence analyses revealed that major parts of both Tlrs are evolving under strong purifying selection, likely due to functional constraints. Yet, also several signatures of positive selection have been found in both genes, with more intense signal in the bacterial-sensing Tlr4 than in the viral-sensing Tlr7. 92% and 100% of sites evolving under positive selection in Tlr4 and Tlr7, respectively, were located in the extracellular domain. Directly in the Ligand-Binding Region (LBR) of TLR4 we identified two rapidly evolving amino acid residues and one site under positive selection, all three likely involved in species-specific recognition of lipopolysaccharide of gram-negative bacteria. In contrast, all putative sites of LBRTLR7 involved in the detection of viral nucleic acids were highly conserved across rodents. Interspecific differences in the predicted 3D-structure of the LBR of both Tlrs were not related to phylogenetic history, while analyses of protein charges clearly discriminated Rattini and Murini clades. Conclusions In consequence of the constraints given by the receptor protein function purifying selection has been a dominant force in evolution of Tlrs. Nevertheless, our results show that episodic diversifying parasite-mediated selection has shaped the present species-specific variability in rodent Tlrs. The intensity of diversifying selection was higher in Tlr4 than in Tlr7, presumably due to structural properties of their ligands. PMID:24028551
Fenstermacher, Katherine J; Achuthan, Vasudevan; Schneider, Thomas D; DeStefano, Jeffrey J
2018-01-16
DNA polymerases (DNAPs) recognize 3' recessed termini on duplex DNA and carry out nucleotide catalysis. Unlike promoter-specific RNA polymerases (RNAPs), no sequence specificity is required for binding or initiation of catalysis. Despite this, previous results indicate that viral reverse transcriptases bind much more tightly to DNA primers that mimic the polypurine tract. In the current report, primer sequences that bind with high affinity to Taq and Klenow polymerases were identified using a modified Selective Evolution of Ligands by Exponential Enrichment (SELEX) approach. Two Taq -specific primers that bound ∼10 (Taq1) and over 100 (Taq2) times more stably than controls to Taq were identified. Taq1 contained 8 nucleotides (5' -CACTAAAG-3') that matched the phage T3 RNAP "core" promoter. Both primers dramatically outcompeted primers with similar binding thermodynamics in PCR reactions. Similarly, exonuclease minus Klenow polymerase also selected a high affinity primer that contained a related core promoter sequence from phage T7 RNAP (5' -ACTATAG-3'). For both Taq and Klenow, even small modifications to the sequence resulted in large losses in binding affinity suggesting that binding was highly sequence-specific. The results are discussed in the context of possible effects on multi-primer (multiplex) PCR assays, molecular information theory, and the evolution of RNAPs and DNAPs. Importance This work further demonstrates that primer-dependent DNA polymerases can have strong sequence biases leading to dramatically tighter binding to specific sequences. These may be related to biological function, or be a consequences of the structural architecture of the enzyme. New sequence specificity for Taq and Klenow polymerases were uncovered and among them were sequences that contained the core promoter elements from T3 and T7 phage RNA polymerase promoters. This suggests the intriguing possibility that phage RNA polymerases exploited intrinsic binding affinities of ancestral DNA polymerases to develop their promotors. Conversely, DNA polymerases could have evolved from related RNA polymerases and retained the intrinsic binding preference despite there being no clear function for such a preference in DNA biology. Copyright © 2018 American Society for Microbiology.
Dombrovsky, Aviv; Glanz, Eyal; Lachman, Oded; Sela, Noa; Doron-Faigenboim, Adi; Antignus, Yehezkel
2013-01-01
We determined the complete sequence and organization of the genome of a putative member of the genus Polerovirus tentatively named Pepper yellow leaf curl virus (PYLCV). PYLCV has a wider host range than Tobacco vein-distorting virus (TVDV) and has a close serological relationship with Cucurbit aphid-borne yellows virus (CABYV) (both poleroviruses). The extracted viral RNA was subjected to SOLiD next-generation sequence analysis and used as a template for reverse transcription synthesis, which was followed by PCR amplification. The ssRNA genome of PYLCV includes 6,028 nucleotides encoding six open reading frames (ORFs), which is typical of the genus Polerovirus. Comparisons of the deduced amino acid sequences of the PYLCV ORFs 2-4 and ORF5, indicate that there are high levels of similarity between these sequences to ORFs 2-4 of TVDV (84-93%) and to ORF5 of CABYV (87%). Both PYLCV and Pepper vein yellowing virus (PeVYV) contain sequences that point to a common ancestral polerovirus. The recombination breakpoint which is located at CABYV ORF3, which encodes the viral coat protein (CP), may explain the CABYV-like sequences found in the genomes of the pepper infecting viruses PYLCV and PeVYV. Two additional regions unique to PYLCV (PY1 and PY2) were identified between nucleotides 4,962 and 5,061 (ORF 5) and between positions 5,866 and 6,028 in the 3' NCR. Sequence analysis of the pepper-infecting PeVYV revealed three unique regions (Pe1-Pe3) with no similarity to other members of the genus Polerovirus. Genomic analyses of PYLCV and PeVYV suggest that the speciation of these viruses occurred through putative recombination event(s) between poleroviruses co-infecting a common host(s), resulting in the emergence of PYLCV, a novel pathogen with a wider host range. PMID:23936244
Dombrovsky, Aviv; Glanz, Eyal; Lachman, Oded; Sela, Noa; Doron-Faigenboim, Adi; Antignus, Yehezkel
2013-01-01
We determined the complete sequence and organization of the genome of a putative member of the genus Polerovirus tentatively named Pepper yellow leaf curl virus (PYLCV). PYLCV has a wider host range than Tobacco vein-distorting virus (TVDV) and has a close serological relationship with Cucurbit aphid-borne yellows virus (CABYV) (both poleroviruses). The extracted viral RNA was subjected to SOLiD next-generation sequence analysis and used as a template for reverse transcription synthesis, which was followed by PCR amplification. The ssRNA genome of PYLCV includes 6,028 nucleotides encoding six open reading frames (ORFs), which is typical of the genus Polerovirus. Comparisons of the deduced amino acid sequences of the PYLCV ORFs 2-4 and ORF5, indicate that there are high levels of similarity between these sequences to ORFs 2-4 of TVDV (84-93%) and to ORF5 of CABYV (87%). Both PYLCV and Pepper vein yellowing virus (PeVYV) contain sequences that point to a common ancestral polerovirus. The recombination breakpoint which is located at CABYV ORF3, which encodes the viral coat protein (CP), may explain the CABYV-like sequences found in the genomes of the pepper infecting viruses PYLCV and PeVYV. Two additional regions unique to PYLCV (PY1 and PY2) were identified between nucleotides 4,962 and 5,061 (ORF 5) and between positions 5,866 and 6,028 in the 3' NCR. Sequence analysis of the pepper-infecting PeVYV revealed three unique regions (Pe1-Pe3) with no similarity to other members of the genus Polerovirus. Genomic analyses of PYLCV and PeVYV suggest that the speciation of these viruses occurred through putative recombination event(s) between poleroviruses co-infecting a common host(s), resulting in the emergence of PYLCV, a novel pathogen with a wider host range.
Fulton, Benjamin O; Sachs, David; Schwarz, Megan C; Palese, Peter; Evans, Matthew J
2017-08-01
The molecular constraints affecting Zika virus (ZIKV) evolution are not well understood. To investigate ZIKV genetic flexibility, we used transposon mutagenesis to add 15-nucleotide insertions throughout the ZIKV MR766 genome and subsequently deep sequenced the viable mutants. Few ZIKV insertion mutants replicated, which likely reflects a high degree of functional constraints on the genome. The NS1 gene exhibited distinct mutational tolerances at different stages of the screen. This result may define regions of the NS1 protein that are required for the different stages of the viral life cycle. The ZIKV structural genes showed the highest degree of insertional tolerance. Although the envelope (E) protein exhibited particular flexibility, the highly conserved envelope domain II (EDII) fusion loop of the E protein was intolerant of transposon insertions. The fusion loop is also a target of pan-flavivirus antibodies that are generated against other flaviviruses and neutralize a broad range of dengue virus and ZIKV isolates. The genetic restrictions identified within the epitopes in the EDII fusion loop likely explain the sequence and antigenic conservation of these regions in ZIKV and among multiple flaviviruses. Thus, our results provide insights into the genetic restrictions on ZIKV that may affect the evolution of this virus. IMPORTANCE Zika virus recently emerged as a significant human pathogen. Determining the genetic constraints on Zika virus is important for understanding the factors affecting viral evolution. We used a genome-wide transposon mutagenesis screen to identify where mutations were tolerated in replicating viruses. We found that the genetic regions involved in RNA replication were mostly intolerant of mutations. The genes coding for structural proteins were more permissive to mutations. Despite the flexibility observed in these regions, we found that epitopes bound by broadly reactive antibodies were genetically constrained. This finding may explain the genetic conservation of these epitopes among flaviviruses. Copyright © 2017 American Society for Microbiology.
Evolution of bird genomes-a transposon's-eye view.
Kapusta, Aurélie; Suh, Alexander
2017-02-01
Birds, the most species-rich monophyletic group of land vertebrates, have been subject to some of the most intense sequencing efforts to date, making them an ideal case study for recent developments in genomics research. Here, we review how our understanding of bird genomes has changed with the recent sequencing of more than 75 species from all major avian taxa. We illuminate avian genome evolution from a previously neglected perspective: their repetitive genomic parasites, transposable elements (TEs) and endogenous viral elements (EVEs). We show that (1) birds are unique among vertebrates in terms of their genome organization; (2) information about the diversity of avian TEs and EVEs is changing rapidly; (3) flying birds have smaller genomes yet more TEs than flightless birds; (4) current second-generation genome assemblies fail to capture the variation in avian chromosome number and genome size determined with cytogenetics; (5) the genomic microcosm of bird-TE "arms races" has yet to be explored; and (6) upcoming third-generation genome assemblies suggest that birds exhibit stability in gene-rich regions and instability in TE-rich regions. We emphasize that integration of cytogenetics and single-molecule technologies with repeat-resolved genome assemblies is essential for understanding the evolution of (bird) genomes. © 2016 New York Academy of Sciences.
Temporal evolution and potential recombination events in PRRSV strains of Sonora Mexico.
Burgara-Estrella, Alexel; Reséndiz-Sandoval, Mónica; Cortey, Martí; Mateu, Enric; Hernández, Jesús
2014-12-05
The aim of this work was to examine the evolution and potential existence of intragenic recombinations of PRRSV strains in Sonora, Mexico. In this study, 142 serum samples from farms located in Hermosillo (HMO), Cd. Obregón (OBR) and Navojoa (NAV) were sequenced from 2002 to 2012. Ninety non-redundant sequences of ORF5 gene were analyzed for temporal and spatial relationships among strains and the probability of a recombination event. The phylogenetic analysis showed 30 strains grouped into eight groups; 16 strains were closely related among the farms, while 14 were un-related. The first strain in this study was observed in 2002. A number of farms were infected with one or more strains, and in the majority of the strains, the virus was replaced by a new strain. The recombination analysis suggested the presence of four viruses as products of a recombination event; in one case, a virus close related with MLV vaccine was involved as the parent virus. This work shows the evolution of PRRSV in the field, the viral dissemination between farms and the potential recombination events. Our data suggest that PRRSV in Sonora has a specific genetic nature compared with other PRRSV. Copyright © 2014 Elsevier B.V. All rights reserved.
DOE Office of Scientific and Technical Information (OSTI.GOV)
Adekunle, S.S.A.; Wyandt, H.; Mark, H.F.L.
1994-09-01
Recently we mapped the telomeric repeat sequences to 111 interstitial sites in the human genome and to sites of gaps and breaks induced by aphidicolin and sister chromatid exchange sites detected by BrdU. Many of these sites correspond to conserved fragile sites in man, gorilla and chimpazee, to sites of conserved sister chromatid exchange in the mammalian X chromosome, to mutagenic sensitive sites, mapped locations of proto-oncogenes, breakpoints implicated in primate evolution and to breakpoints indicated as the sole anomaly in neoplasia. This observation prompted us to investigate if the interstitial telomeric sites cluster with these sites. An extensive literaturemore » search was carried out to find all the available published sites mentioned above. For comparison, we also carried out a statistical analysis of the clustering of the sites of the telomeric repeats with the gene locations where only nucleotide mutations have been observed as the only chromosomal abnormality. Our results indicate that the telomeric repeats cluster most with fragile sites, mutagenic sensitive sites and breakpoints implicated in primate evolution and least with cancer breakpoints, mapped locations of proto-oncogenes and other genes with nucleotide mutations.« less
Structure, sequence and expression of the hepatitis delta (δ) viral genome
NASA Astrophysics Data System (ADS)
Wang, Kang-Sheng; Choo, Qui-Lim; Weiner, Amy J.; Ou, Jing-Hsiung; Najarian, Richard C.; Thayer, Richard M.; Mullenbach, Guy T.; Denniston, Katherine J.; Gerin, John L.; Houghton, Michael
1986-10-01
Biochemical and electron microscopic data indicate that the human hepatitis δ viral agent contains a covalently closed circular and single-stranded RNA genome that has certain similarities with viroid-like agents from plants. The sequence of the viral genome (1,678 nucleotides) has been determined and an open reading frame within the complementary strand has been shown to encode an antigen that binds specifically to antisera from patients with chronic hepatitis δ viral infections.
O'Neill, F J; Gao, Y; Xu, X
1993-11-01
The DNAs of polyomaviruses ordinarily exist as a single circular molecule of approximately 5000 base pairs. Variants of SV40, BKV and JCV have been described which contain two complementing defective DNA molecules. These defectives, which form a bipartite genome structure, contain either the viral early region or the late region. The defectives have the unique property of being able to tolerate variable sized reiterations of regulatory and terminus region sequences, and portions of the coding region. They can also exchange coding region sequences with other polyomaviruses. It has been suggested that the bipartite genome structure might be a stage in the evolution of polyomaviruses which can uniquely sustain genome and sequence diversity. However, it is not known if the regulatory and terminus region sequences are highly mutable. Also, it is not known if the bipartite genome structure is reversible and what the conditions might be which would favor restoration of the monomolecular genome structure. We addressed the first question by sequencing the reiterated regulatory and terminus regions of E- and L-SV40 DNAs. This revealed a large number of mutations in the regulatory regions of the defective genomes, including deletions, insertions, rearrangements and base substitutions. We also detected insertions and base substitutions in the T-antigen gene. We addressed the second question by introducing into permissive simian cells, E- and L-SV40 genomes which had been engineered to contain only a single regulatory region. Analysis of viral DNA from transfected cells demonstrated recombined genomes containing a wild type monomolecular DNA structure. However, the complete defectives, containing reiterated regulatory regions, could often compete away the wild type genomes. The recombinant monomolecular genomes were isolated, cloned and found to be infectious. All of the DNA alterations identified in one of the regulatory regions of E-SV40 DNA were present in the recombinant monomolecular genomes. These and other findings indicate that the bipartite genome state can sustain many mutations which wtSV40 cannot directly sustain. However, the mutations can later be introduced into the wild type genomes when the E- and L-SV40 DNAs recombine to generate a new monomolecular genome structure.
Nouri, Shahideh; Salem, Nidá; Nigg, Jared C.
2015-01-01
ABSTRACT The Asian citrus psyllid, Diaphorina citri, is the natural vector of the causal agent of Huanglongbing (HLB), or citrus greening disease. Together; HLB and D. citri represent a major threat to world citrus production. As there is no cure for HLB, insect vector management is considered one strategy to help control the disease, and D. citri viruses might be useful. In this study, we used a metagenomic approach to analyze viral sequences associated with the global population of D. citri. By sequencing small RNAs and the transcriptome coupled with bioinformatics analysis, we showed that the virus-like sequences of D. citri are diverse. We identified novel viral sequences belonging to the picornavirus superfamily, the Reoviridae, Parvoviridae, and Bunyaviridae families, and an unclassified positive-sense single-stranded RNA virus. Moreover, a Wolbachia prophage-related sequence was identified. This is the first comprehensive survey to assess the viral community from worldwide populations of an agricultural insect pest. Our results provide valuable information on new putative viruses, some of which may have the potential to be used as biocontrol agents. IMPORTANCE Insects have the most species of all animals, and are hosts to, and vectors of, a great variety of known and unknown viruses. Some of these most likely have the potential to be important fundamental and/or practical resources. In this study, we used high-throughput next-generation sequencing (NGS) technology and bioinformatics analysis to identify putative viruses associated with Diaphorina citri, the Asian citrus psyllid. D. citri is the vector of the bacterium causing Huanglongbing (HLB), currently the most serious threat to citrus worldwide. Here, we report several novel viral sequences associated with D. citri. PMID:26676774
Nouri, Shahideh; Salem, Nidá; Nigg, Jared C; Falk, Bryce W
2015-12-16
The Asian citrus psyllid, Diaphorina citri, is the natural vector of the causal agent of Huanglongbing (HLB), or citrus greening disease. Together; HLB and D. citri represent a major threat to world citrus production. As there is no cure for HLB, insect vector management is considered one strategy to help control the disease, and D. citri viruses might be useful. In this study, we used a metagenomic approach to analyze viral sequences associated with the global population of D. citri. By sequencing small RNAs and the transcriptome coupled with bioinformatics analysis, we showed that the virus-like sequences of D. citri are diverse. We identified novel viral sequences belonging to the picornavirus superfamily, the Reoviridae, Parvoviridae, and Bunyaviridae families, and an unclassified positive-sense single-stranded RNA virus. Moreover, a Wolbachia prophage-related sequence was identified. This is the first comprehensive survey to assess the viral community from worldwide populations of an agricultural insect pest. Our results provide valuable information on new putative viruses, some of which may have the potential to be used as biocontrol agents. Insects have the most species of all animals, and are hosts to, and vectors of, a great variety of known and unknown viruses. Some of these most likely have the potential to be important fundamental and/or practical resources. In this study, we used high-throughput next-generation sequencing (NGS) technology and bioinformatics analysis to identify putative viruses associated with Diaphorina citri, the Asian citrus psyllid. D. citri is the vector of the bacterium causing Huanglongbing (HLB), currently the most serious threat to citrus worldwide. Here, we report several novel viral sequences associated with D. citri. Copyright © 2016, American Society for Microbiology. All Rights Reserved.
Barrero, Roberto A; Napier, Kathryn R; Cunnington, James; Liefting, Lia; Keenan, Sandi; Frampton, Rebekah A; Szabo, Tamas; Bulman, Simon; Hunter, Adam; Ward, Lisa; Whattam, Mark; Bellgard, Matthew I
2017-01-11
Detection and preventing entry of exotic viruses and viroids at the border is critical for protecting plant industries trade worldwide. Existing post entry quarantine screening protocols rely on time-consuming biological indicators and/or molecular assays that require knowledge of infecting viral pathogens. Plants have developed the ability to recognise and respond to viral infections through Dicer-like enzymes that cleave viral sequences into specific small RNA products. Many studies reported the use of a broad range of small RNAs encompassing the product sizes of several Dicer enzymes involved in distinct biological pathways. Here we optimise the assembly of viral sequences by using specific small RNA subsets. We sequenced the small RNA fractions of 21 plants held at quarantine glasshouse facilities in Australia and New Zealand. Benchmarking of several de novo assembler tools yielded SPAdes using a kmer of 19 to produce the best assembly outcomes. We also found that de novo assembly using 21-25 nt small RNAs can result in chimeric assemblies of viral sequences and plant host sequences. Such non-specific assemblies can be resolved by using 21-22 nt or 24 nt small RNAs subsets. Among the 21 selected samples, we identified contigs with sequence similarity to 18 viruses and 3 viroids in 13 samples. Most of the viruses were assembled using only 21-22 nt long virus-derived siRNAs (viRNAs), except for one Citrus endogenous pararetrovirus that was more efficiently assembled using 24 nt long viRNAs. All three viroids found in this study were fully assembled using either 21-22 nt or 24 nt viRNAs. Optimised analysis workflows were customised within the Yabi web-based analytical environment. We present a fully automated viral surveillance and diagnosis web-based bioinformatics toolkit that provides a flexible, user-friendly, robust and scalable interface for the discovery and diagnosis of viral pathogens. We have implemented an automated viral surveillance and diagnosis (VSD) bioinformatics toolkit that produces improved viruses and viroid sequence assemblies. The VSD toolkit provides several optimised and reusable workflows applicable to distinct viral pathogens. We envisage that this resource will facilitate the surveillance and diagnosis viral pathogens in plants, insects and invertebrates.
Gourraud, P A; Karaouni, A; Woo, J M; Schmidt, T; Oksenberg, J R; Hecht, F M; Liegler, T J; Barbour, J D
2011-03-01
We examined single nucleotide polymorphisms (SNP) in the APOBEC3 locus on chromosome 22, paired with population sequences of pro-viral human immunodeficiency virus-1 (HIV-1) vif from peripheral blood mononuclear cells, from 96 recently HIV-1-infected treatment-naive adults. We found evidence for the existence of an APOBEC3H linkage disequilibrium (LD) block associated with variation in GA → AA, or APOBEC3F/H signature, sequence changes in pro-viral HIV-1 vif sequence (top 10 significant SNPs with a significant p = 4.8 × 10(-3)). We identified a common five position risk haplotype distal to APOBEC3H (A3Hrh). These markers were in high LD (D' = 1; r(2) = 0.98) to a previously described A3H "RED" haplotype containing a variant (E121) with enhanced susceptibility to HIV-1 Vif. This association was confirmed by a haplotype analysis. Homozygote carriers of the A3Hrh had lower GA->AA (A3F/H) sequence editing upon pro-viral HIV-1 vif sequence (p = 0.01), and lower HIV-1 RNA levels over time during early, untreated HIV-1 infection, (p = 0.015 mixed effects model). This effect may be due to enhanced susceptibility of A3H forms to HIV-1 Vif mediated viral suppression of sequence editing activity, slowing viral diversification and escape from immune responses. Copyright © 2011 American Society for Histocompatibility and Immunogenetics. Published by Elsevier Inc. All rights reserved.
Cheng, Ji-Hong; Liu, Wen-Chun; Chang, Ting-Tsung; Hsieh, Sun-Yuan; Tseng, Vincent S
2017-10-01
Many studies have suggested that deletions of Hepatitis B Viral (HBV) are associated with the development of progressive liver diseases, even ultimately resulting in hepatocellular carcinoma (HCC). Among the methods for detecting deletions from next-generation sequencing (NGS) data, few methods considered the characteristics of virus, such as high evolution rates and high divergence among the different HBV genomes. Sequencing high divergence HBV genome sequences using the NGS technology outputs millions of reads. Thus, detecting exact breakpoints of deletions from these big and complex data incurs very high computational cost. We proposed a novel analytical method named VirDelect (Virus Deletion Detect), which uses split read alignment base to detect exact breakpoint and diversity variable to consider high divergence in single-end reads data, such that the computational cost can be reduced without losing accuracy. We use four simulated reads datasets and two real pair-end reads datasets of HBV genome sequence to verify VirDelect accuracy by score functions. The experimental results show that VirDelect outperforms the state-of-the-art method Pindel in terms of accuracy score for all simulated datasets and VirDelect had only two base errors even in real datasets. VirDelect is also shown to deliver high accuracy in analyzing the single-end read data as well as pair-end data. VirDelect can serve as an effective and efficient bioinformatics tool for physiologists with high accuracy and efficient performance and applicable to further analysis with characteristics similar to HBV on genome length and high divergence. The software program of VirDelect can be downloaded at https://sourceforge.net/projects/virdelect/. Copyright © 2017. Published by Elsevier Inc.
Vidal, R; González, R; Gil, F
2015-06-10
Innate pathway activation is fundamental for early anti-viral defense in fish, but currently there is insufficient understanding of how salmonid fish identify viral molecules and activate these pathways. The Toll-like receptor (TLR) is believed to play a crucial role in host defense of pathogenic microbes in the innate immune system. In the present study, the full-length cDNA of Salmo salar TLR3 (ssTLR3) was cloned. The ssTLR3 cDNA sequence was 6071 bp long, containing an open reading frame of 2754 bp and encoding 971 amino acids. The TLR group motifs, such as leucine-rich repeat (LRR) domains and Toll-interleukin-1 receptor (TIR) domains, were maintained in ssTLR3, with sixteen LRR domains and one TIR domain. In contrast to descriptions of the TLR3 in rainbow trout and the murine (TATA-less), we found a putative TATA box in the proximal promoter region 29 bp upstream of the transcription start point of ssTLR3. Multiple-sequence alignment analysis of the ssTLR3 protein-coding sequence with other known TLR3 sequences showed the sequence to be conserved among all species analyzed, implying that the function of the TLR3 had been sustained throughout evolution. The ssTLR3 mRNA expression patterns were measured using real-time PCR. The results revealed that TLR3 is widely expressed in various healthy tissues. Individuals challenged with infectious pancreatic necrosis virus and immunostimulated with polyinosinic:polycytidylic acid exhibited increased expression of TLR3 at the mRNA level, indicating that ssTLR3 may be involved in pathogen recognition in the early innate immune system.
Li, Yan; Khalafalla, Abdelmalik Ibrahim; Paden, Clinton R; Yusof, Mohammed F; Eltahir, Yassir M; Al Hammadi, Zulaikha M; Tao, Ying; Queen, Krista; Hosani, Farida Al; Gerber, Susan I; Hall, Aron J; Al Muhairi, Salama; Tong, Suxiang
2017-01-01
Camels are known carriers for many viral pathogens, including Middle East respiratory syndrome coronavirus (MERS-CoV). It is likely that there are additional, as yet unidentified viruses in camels with the potential to cause disease in humans. In this study, we performed metagenomic sequencing analysis on nasopharyngeal swab samples from 108 MERS-CoV-positive dromedary camels from a live animal market in Abu Dhabi, United Arab Emirates. We obtained a total of 846.72 million high-quality reads from these nasopharyngeal swab samples, of which 2.88 million (0.34%) were related to viral sequences while 512.63 million (60.5%) and 50.87 million (6%) matched bacterial and eukaryotic sequences, respectively. Among the viral reads, sequences related to mammalian viruses from 13 genera in 10 viral families were identified, including Coronaviridae, Nairoviridae, Paramyxoviridae, Parvoviridae, Polyomaviridae, Papillomaviridae, Astroviridae, Picornaviridae, Poxviridae, and Genomoviridae. Some viral sequences belong to known camel or human viruses and others are from potentially novel camel viruses with only limited sequence similarity to virus sequences in GenBank. A total of five potentially novel virus species or strains were identified. Co-infection of at least two recently identified camel coronaviruses was detected in 92.6% of the camels in the study. This study provides a comprehensive survey of viruses in the virome of upper respiratory samples in camels that have extensive contact with the human population.
Investigating the viral ecology of global bee communities with high-throughput metagenomics.
Galbraith, David A; Fuller, Zachary L; Ray, Allyson M; Brockmann, Axel; Frazier, Maryann; Gikungu, Mary W; Martinez, J Francisco Iturralde; Kapheim, Karen M; Kerby, Jeffrey T; Kocher, Sarah D; Losyev, Oleksiy; Muli, Elliud; Patch, Harland M; Rosa, Cristina; Sakamoto, Joyce M; Stanley, Scott; Vaudo, Anthony D; Grozinger, Christina M
2018-06-11
Bee viral ecology is a fascinating emerging area of research: viruses exert a range of effects on their hosts, exacerbate impacts of other environmental stressors, and, importantly, are readily shared across multiple bee species in a community. However, our understanding of bee viral communities is limited, as it is primarily derived from studies of North American and European Apis mellifera populations. Here, we examined viruses in populations of A. mellifera and 11 other bee species from 9 countries, across 4 continents and Oceania. We developed a novel pipeline to rapidly and inexpensively screen for bee viruses. This pipeline includes purification of encapsulated RNA/DNA viruses, sequence-independent amplification, high throughput sequencing, integrated assembly of contigs, and filtering to identify contigs specifically corresponding to viral sequences. We identified sequences for (+)ssRNA, (-)ssRNA, dsRNA, and ssDNA viruses. Overall, we found 127 contigs corresponding to novel viruses (i.e. previously not observed in bees), with 27 represented by >0.1% of the reads in a given sample, and 7 contained an RdRp or replicase sequence which could be used for robust phylogenetic analysis. This study provides a sequence-independent pipeline for viral metagenomics analysis, and greatly expands our understanding of the diversity of viruses found in bee communities.
Bayesian nonparametric clustering in phylogenetics: modeling antigenic evolution in influenza.
Cybis, Gabriela B; Sinsheimer, Janet S; Bedford, Trevor; Rambaut, Andrew; Lemey, Philippe; Suchard, Marc A
2018-01-30
Influenza is responsible for up to 500,000 deaths every year, and antigenic variability represents much of its epidemiological burden. To visualize antigenic differences across many viral strains, antigenic cartography methods use multidimensional scaling on binding assay data to map influenza antigenicity onto a low-dimensional space. Analysis of such assay data ideally leads to natural clustering of influenza strains of similar antigenicity that correlate with sequence evolution. To understand the dynamics of these antigenic groups, we present a framework that jointly models genetic and antigenic evolution by combining multidimensional scaling of binding assay data, Bayesian phylogenetic machinery and nonparametric clustering methods. We propose a phylogenetic Chinese restaurant process that extends the current process to incorporate the phylogenetic dependency structure between strains in the modeling of antigenic clusters. With this method, we are able to use the genetic information to better understand the evolution of antigenicity throughout epidemics, as shown in applications of this model to H1N1 influenza. Copyright © 2017 John Wiley & Sons, Ltd. Copyright © 2017 John Wiley & Sons, Ltd.
Immunity: Insect Immune Memory Goes Viral.
Ligoxygakis, Petros
2017-11-20
Adaptive memory in insect immunity has been controversial. In this issue, Andino and co-workers propose that acquisition of viral sequences in the host genome gives rise to anti-sense, anti-viral piRNAs. Such sequences can be regarded as both a genomic archive of past infections and as an armour of potential heritable memory. Copyright © 2017 Elsevier Ltd. All rights reserved.
Viral dark matter and virus–host interactions resolved from publicly available microbial genomes
Roux, Simon; Hallam, Steven J; Woyke, Tanja; Sullivan, Matthew B
2015-01-01
The ecological importance of viruses is now widely recognized, yet our limited knowledge of viral sequence space and virus–host interactions precludes accurate prediction of their roles and impacts. In this study, we mined publicly available bacterial and archaeal genomic data sets to identify 12,498 high-confidence viral genomes linked to their microbial hosts. These data augment public data sets 10-fold, provide first viral sequences for 13 new bacterial phyla including ecologically abundant phyla, and help taxonomically identify 7–38% of ‘unknown’ sequence space in viromes. Genome- and network-based classification was largely consistent with accepted viral taxonomy and suggested that (i) 264 new viral genera were identified (doubling known genera) and (ii) cross-taxon genomic recombination is limited. Further analyses provided empirical data on extrachromosomal prophages and coinfection prevalences, as well as evaluation of in silico virus–host linkage predictions. Together these findings illustrate the value of mining viral signal from microbial genomes. DOI: http://dx.doi.org/10.7554/eLife.08490.001 PMID:26200428
Rizk, Francine; Laverdure, Sylvain; d'Alençon, Emmanuelle; Bossin, Hervé; Dupressoir, Thierry
2018-01-01
The Lepidopteran ambidensovirus 1 isolated from Junonia coenia (hereafter JcDV) is an invertebrate parvovirus considered as a viral transduction vector as well as a potential tool for the biological control of insect pests. Previous works showed that JcDV-based circular plasmids experimentally integrate into insect cells genomic DNA. In order to approach the natural conditions of infection and possible integration, we generated linear JcDV- gfp based molecules which were transfected into non permissive Spodoptera frugiperda ( Sf9 ) cultured cells. Cells were monitored for the expression of green fluorescent protein (GFP) and DNA was analyzed for integration of transduced viral sequences. Non-structural protein modulation of the VP-gene cassette promoter activity was additionally assayed. We show that linear JcDV-derived molecules are capable of long term genomic integration and sustained transgene expression in Sf9 cells. As expected, only the deletion of both inverted terminal repeats (ITR) or the polyadenylation signals of NS and VP genes dramatically impairs the global transduction/expression efficiency. However, all the integrated viral sequences we characterized appear "scrambled" whatever the viral content of the transfected vector. Despite a strong GFP expression, we were unable to recover any full sequence of the original constructs and found rearranged viral and non-viral sequences as well. Cellular flanking sequences were identified as non-coding ones. On the other hand, the kinetics of GFP expression over time led us to investigate the apparent down-regulation by non-structural proteins of the VP-gene cassette promoter. Altogether, our results show that JcDV-derived sequences included in linear DNA molecules are able to drive efficiently the integration and expression of a foreign gene into the genome of insect cells, whatever their composition, provided that at least one ITR is present. However, the transfected sequences were extensively rearranged with cellular DNA during or after random integration in the host cell genome. Lastly, the non-structural proteins seem to participate in the regulation of p9 promoter activity rather than to the integration of viral sequences.
Vector platforms for gene therapy of inherited retinopathies
Trapani, Ivana; Puppo, Agostina; Auricchio, Alberto
2014-01-01
Inherited retinopathies (IR) are common untreatable blinding conditions. Most of them are inherited as monogenic disorders, due to mutations in genes expressed in retinal photoreceptors (PR) and in retinal pigment epithelium (RPE). The retina’s compatibility with gene transfer has made transduction of different retinal cell layers in small and large animal models via viral and non-viral vectors possible. The ongoing identification of novel viruses as well as modifications of existing ones based either on rational design or directed evolution have generated vector variants with improved transduction properties. Dozens of promising proofs of concept have been obtained in IR animal models with both viral and non-viral vectors, and some of them have been relayed to clinical trials. To date, recombinant vectors based on the adeno-associated virus (AAV) represent the most promising tool for retinal gene therapy, given their ability to efficiently deliver therapeutic genes to both PR and RPE and their excellent safety and efficacy profiles in humans. However, AAVs’ limited cargo capacity has prevented application of the viral vector to treatments requiring transfer of genes with a coding sequence larger than 5 kb. Vectors with larger capacity, i.e. nanoparticles, adenoviral and lentiviral vectors are being exploited for gene transfer to the retina in animal models and, more recently, in humans. This review focuses on the available platforms for retinal gene therapy to fight inherited blindness, highlights their main strengths and examines the efforts to overcome some of their limitations. PMID:25124745
Ho, Daniel W H; Sze, Karen M F; Ng, Irene O L
2015-08-28
Viral integration into the human genome upon infection is an important risk factor for various human malignancies. We developed viral integration site detection tool called Virus-Clip, which makes use of information extracted from soft-clipped sequencing reads to identify exact positions of human and virus breakpoints of integration events. With initial read alignment to virus reference genome and streamlined procedures, Virus-Clip delivers a simple, fast and memory-efficient solution to viral integration site detection. Moreover, it can also automatically annotate the integration events with the corresponding affected human genes. Virus-Clip has been verified using whole-transcriptome sequencing data and its detection was validated to have satisfactory sensitivity and specificity. Marked advancement in performance was detected, compared to existing tools. It is applicable to versatile types of data including whole-genome sequencing, whole-transcriptome sequencing, and targeted sequencing. Virus-Clip is available at http://web.hku.hk/~dwhho/Virus-Clip.zip.
Borisenko, A S; Kotus, E V; Kaloshin, A A
2008-01-01
Significant number of scientific publications devoted to inhibition of viral replication by antisense RNA (asRNA) genes shows that this approach is useful for gene therapy of viral infections. To investigate the possibility of suppression of HTLV-1 virus reproduction by asRNA we constructed recombinant plasmids containing asRNA genes against U3 long terminal repeats region and X gene under the control of promoter of myeloproliferative sarcoma virus (MPSV) or without such promoter. Using stable calcium-phosphate transfection method with subsequent selection in the presence of G-418, RaHOS line-based cell clones carrying both asRNA genes and sequences able to bind HTLV-1 transactivator proteins (i.e. "traps" of viral transactivators, TVT) were obtained. Data from dot-hybridization analysis of viral RNA extracted from RaHOS cell clones showed that TVT sequences are able to suppress the viral RNA synthesis on 90% and asRNA against X gene synthesis--on 50%.
Metagenomic Analysis of Viral Communities in (Hado)Pelagic Sediments
Yoshida, Mitsuhiro; Takaki, Yoshihiro; Eitoku, Masamitsu; Nunoura, Takuro; Takai, Ken
2013-01-01
In this study, we analyzed viral metagenomes (viromes) in the sedimentary habitats of three geographically and geologically distinct (hado)pelagic environments in the northwest Pacific; the Izu-Ogasawara Trench (water depth = 9,760 m) (OG), the Challenger Deep in the Mariana Trench (10,325 m) (MA), and the forearc basin off the Shimokita Peninsula (1,181 m) (SH). Virus abundance ranged from 106 to 1011 viruses/cm3 of sediments (down to 30 cm below the seafloor [cmbsf]). We recovered viral DNA assemblages (viromes) from the (hado)pelagic sediment samples and obtained a total of 37,458, 39,882, and 70,882 sequence reads by 454 GS FLX Titanium pyrosequencing from the virome libraries of the OG, MA, and SH (hado)pelagic sediments, respectively. Only 24−30% of the sequence reads from each virome library exhibited significant similarities to the sequences deposited in the public nr protein database (E-value <10−3 in BLAST). Among the sequences identified as potential viral genes based on the BLAST search, 95−99% of the sequence reads in each library were related to genes from single-stranded DNA (ssDNA) viral families, including Microviridae, Circoviridae, and Geminiviridae. A relatively high abundance of sequences related to the genetic markers (major capsid protein [VP1] and replication protein [Rep]) of two ssDNA viral groups were also detected in these libraries, thereby revealing a high genotypic diversity of their viruses (833 genotypes for VP1 and 2,551 genotypes for Rep). A majority of the viral genes predicted from each library were classified into three ssDNA viral protein categories: Rep, VP1, and minor capsid protein. The deep-sea sedimentary viromes were distinct from the viromes obtained from the oceanic and fresh waters and marine eukaryotes, and thus, deep-sea sediments harbor novel viromes, including previously unidentified ssDNA viruses. PMID:23468952
Metagenomic analysis of viral communities in (hado)pelagic sediments.
Yoshida, Mitsuhiro; Takaki, Yoshihiro; Eitoku, Masamitsu; Nunoura, Takuro; Takai, Ken
2013-01-01
In this study, we analyzed viral metagenomes (viromes) in the sedimentary habitats of three geographically and geologically distinct (hado)pelagic environments in the northwest Pacific; the Izu-Ogasawara Trench (water depth = 9,760 m) (OG), the Challenger Deep in the Mariana Trench (10,325 m) (MA), and the forearc basin off the Shimokita Peninsula (1,181 m) (SH). Virus abundance ranged from 10(6) to 10(11) viruses/cm(3) of sediments (down to 30 cm below the seafloor [cmbsf]). We recovered viral DNA assemblages (viromes) from the (hado)pelagic sediment samples and obtained a total of 37,458, 39,882, and 70,882 sequence reads by 454 GS FLX Titanium pyrosequencing from the virome libraries of the OG, MA, and SH (hado)pelagic sediments, respectively. Only 24-30% of the sequence reads from each virome library exhibited significant similarities to the sequences deposited in the public nr protein database (E-value <10(-3) in BLAST). Among the sequences identified as potential viral genes based on the BLAST search, 95-99% of the sequence reads in each library were related to genes from single-stranded DNA (ssDNA) viral families, including Microviridae, Circoviridae, and Geminiviridae. A relatively high abundance of sequences related to the genetic markers (major capsid protein [VP1] and replication protein [Rep]) of two ssDNA viral groups were also detected in these libraries, thereby revealing a high genotypic diversity of their viruses (833 genotypes for VP1 and 2,551 genotypes for Rep). A majority of the viral genes predicted from each library were classified into three ssDNA viral protein categories: Rep, VP1, and minor capsid protein. The deep-sea sedimentary viromes were distinct from the viromes obtained from the oceanic and fresh waters and marine eukaryotes, and thus, deep-sea sediments harbor novel viromes, including previously unidentified ssDNA viruses.
Viral quasispecies inference from 454 pyrosequencing
2013-01-01
Background Many potentially life-threatening infectious viruses are highly mutable in nature. Characterizing the fittest variants within a quasispecies from infected patients is expected to allow unprecedented opportunities to investigate the relationship between quasispecies diversity and disease epidemiology. The advent of next-generation sequencing technologies has allowed the study of virus diversity with high-throughput sequencing, although these methods come with higher rates of errors which can artificially increase diversity. Results Here we introduce a novel computational approach that incorporates base quality scores from next-generation sequencers for reconstructing viral genome sequences that simultaneously infers the number of variants within a quasispecies that are present. Comparisons on simulated and clinical data on dengue virus suggest that the novel approach provides a more accurate inference of the underlying number of variants within the quasispecies, which is vital for clinical efforts in mapping the within-host viral diversity. Sequence alignments generated by our approach are also found to exhibit lower rates of error. Conclusions The ability to infer the viral quasispecies colony that is present within a human host provides the potential for a more accurate classification of the viral phenotype. Understanding the genomics of viruses will be relevant not just to studying how to control or even eradicate these viral infectious diseases, but also in learning about the innate protection in the human host against the viruses. PMID:24308284
The Spleen Is an HIV-1 Sanctuary During Combined Antiretroviral Therapy.
Nolan, David J; Rose, Rebecca; Rodriguez, Patricia H; Salemi, Marco; Singer, Elyse J; Lamers, Susanna L; McGrath, Michael S
2018-01-01
Combined antiretroviral therapy (cART) does not eradicate HIV, which persists for years and can re-establish replication if treatment is stopped. The current challenge is identifying those tissues harboring virus through cART. Here, we used HIV env-nef single genome sequencing and HIV gag droplet digital PCR (ddPCR) to survey 50 tissues from five subjects on cART with no detectable plasma viral load at death. The spleen most consistently contained multiple proviral and expressed sequences (4/5 participants). Spleen-derived HIV demonstrated two distinct phylogenetic patterns: multiple identical sequences, often from different tissues, as well as diverse viral sequences on long terminal branches. Our results suggested that ddPCR may overestimate the size of the tissue-based viral reservoir. The spleen, a lymphatic organ at the intersection of the immune and circulatory systems, may play a key role in viral persistence.
A mathematical model of marine bacteriophage evolution.
Pagliarini, Silvia; Korobeinikov, Andrei
2018-03-01
To explore how particularities of a host cell-virus system, and in particular host cell replication, affect viral evolution, in this paper we formulate a mathematical model of marine bacteriophage evolution. The intrinsic simplicity of real-life phage-bacteria systems, and in particular aquatic systems, for which the assumption of homogeneous mixing is well justified, allows for a reasonably simple model. The model constructed in this paper is based upon the Beretta-Kuang model of bacteria-phage interaction in an aquatic environment (Beretta & Kuang 1998 Math. Biosci. 149 , 57-76. (doi:10.1016/S0025-5564(97)10015-3)). Compared to the original Beretta-Kuang model, the model assumes the existence of a multitude of viral variants which correspond to continuously distributed phenotypes. It is noteworthy that the model is mechanistic (at least as far as the Beretta-Kuang model is mechanistic). Moreover, this model does not include any explicit law or mechanism of evolution; instead it is assumed, in agreement with the principles of Darwinian evolution, that evolution in this system can occur as a result of random mutations and natural selection. Simulations with a simplistic linear fitness landscape (which is chosen for the convenience of demonstration only and is not related to any real-life system) show that a pulse-type travelling wave moving towards increasing Darwinian fitness appears in the phenotype space. This implies that the overall fitness of a viral quasi-species steadily increases with time. That is, the simulations demonstrate that for an uneven fitness landscape random mutations combined with a mechanism of natural selection (for this particular system this is given by the conspecific competition for the resource) lead to the Darwinian evolution. It is noteworthy that in this system the speed of propagation of this wave (and hence the rate of evolution) is not constant but varies, depending on the current viral fitness and the abundance of susceptible bacteria. A specific feature of the original Beretta-Kuang model is that this model exhibits a supercritical Hopf bifurcation, leading to the loss of stability and the rise of self-sustained oscillations in the system. This phenomenon corresponds to the paradox of enrichment in the system. It is remarkable that under the conditions that ensure the bifurcation in the Beretta-Kuang model, the viral evolution model formulated in this paper also exhibits a rise in self-sustained oscillations of the abundance of all interacting populations. The propagation of the travelling wave, however, remains stable under these conditions. The only visible impact of the oscillations on viral evolution is a lower speed of the evolution.
FIV cross-species transmission: an evolutionary prospective
Troyer, Jennifer L.; VandeWoude, Sue; Pecon-Slattery, Jill; McIntosh, Carl; Franklin, Sam; Antunes, Agostinho; Johnson, Warren; O'Brien, Stephen J.
2008-01-01
Feline and primate immunodeficiency viruses (FIVs, SIVs, and HIV) are transmitted via direct contact (e.g. fighting, sexual contact, and mother-offspring transmission). This dynamic likely poses a behavioral barrier to cross-species transmission in the wild. Recently, several host intracellular anti-viral proteins that contribute to species-specificity of primate lentiviruses have been identified revealing adaptive mechanisms that further limit spread of lentiviruses between species. Consistent with these inter-species transmission barriers, phylogenetic evidence supports the prediction that FIV transmission is an exceedingly rare event between free-ranging cat species, though it has occurred occasionally in captive settings. Recently we documented that puma and bobcats in Southern California share an FIV strain, providing an opportunity to evaluate evolution of both viral strains and host intracellular restriction proteins. These studies are facilitated by the availability of the 2X cat genome sequence annotation. In addition, concurrent viral and host genetic analyses have been used to track patterns of migration of the host species and barriers to transmission of the virus within the African lion. These studies illustrate the utility of FIV as a model to discover the variables necessary for establishment and control of lentiviral infections in new species. PMID:18299153
NASA Astrophysics Data System (ADS)
Wodarz, Dominik
2005-12-01
This article reviews mathematical models which have investigated the importance of lytic and non-lytic immune responses for the control of viral infections. Lytic immune responses fight the virus by killing infected cells, while non-lytic immune responses fight the virus by inhibiting viral replication while leaving the infected cell alive. The models suggest which types or combinations of immune responses are required to resolve infections which vary in their characteristics, such as the rate of viral replication and the rate of virus-induced target cell death. This framework is then applied to persistent infections and viral evolution. It is investigated how viral evolution and antigenic escape can influence the relative balance of lytic and non-lytic responses over time, and how this might correlate with the transition from an asymptomatic infection to pathology. This is discussed in the specific context of hepatitis C virus infection.
Murphy, B; Hillman, C; McDonnel, S
2014-01-22
Feline immunodeficiency virus (FIV)-infected cats enter a clinically asymptomatic phase during chronic infection. Despite the lack of overt clinical disease, the asymptomatic phase is characterized by persistent immunologic impairment. In the peripheral blood obtained from cats experimentally infected with FIV-C for approximately 5 years, we identified a persistent inversion of the CD4/CD8 ratio. We cloned and sequenced the FIV-C long terminal repeat containing the viral promoter from cells infected with the inoculating virus and from in vivo-derived peripheral blood mononuclear cells and CD4 T cells isolated at multiple time points throughout the asymptomatic phase. Relative to the inoculating virus, viral sequences amplified from cells isolated from all of the infected animals demonstrated multiple single nucleotide mutations and a short deletion within the viral U3, R and U5 regions. A transcriptionally inactivating proviral mutation in the U3 promoter AP-1 site was identified at multiple time points from all of the infected animals but not within cell-associated viral RNA. In contrast, no mutations were identified within the sequence of the viral dUTPase gene amplified from PBMC isolated at approximately 5 years post-infection relative to the inoculating sequence. The possible implications of these mutations to viral pathogenesis are discussed. Copyright © 2013 Elsevier B.V. All rights reserved.
Knapp, David J H F; Brumme, Zabrina L; Huang, Sheng Yuan; Wynhoven, Brian; Dong, Winnie W Y; Mo, Theresa; Harrigan, P Richard; Brumme, Chanson J
2012-06-01
HLA class I-restricted cytotoxic T lymphocytes and highly active antiretroviral therapy (HAART) exert strong selective pressures on human immunodeficiency virus type 1 (HIV-1), leading to escape mutations compromising virologic control. Immune responses continue to shape HIV-1 evolution after HAART initiation, but the extent and rate at which this occurs remain incompletely quantified. Here, we characterize the incidence and clinical correlates of HLA-associated evolution in HIV-1 Pol after HAART initiation in a large, population-based observational cohort. British Columbia HAART Observational, Medical Evaluation and Research cohort participants with available HLA class I types and longitudinal posttherapy protease/reverse transcriptase sequences were studied (n = 619; median, 5 samples per patient and 5.2 years of follow-up). HLA-associated polymorphisms were defined according to published reference lists. Rates and correlates of immune-mediated HIV-1 evolution were investigated using multivariate Cox proportional hazard models incorporating baseline and time-dependent plasma viral load and CD4 response data. New HLA-associated escape events were observed in 269 (43%) patients during HAART and occurred at 49 of 63 (78%) investigated immune-associated sites in Pol. In time-dependent analyses adjusting for baseline factors, poorer virologic, but not immunologic, response to HAART was associated with increased risk of immune escape of 1.9-fold per log(10) viral load increment (P < .0001). Reversion of escape mutations following HAART initiation was extremely rare. HLA-associated HIV-1 evolution continues during HAART to an extent that is inversely related to the virologic success of therapy. Minimizing the degree of immune escape could represent a secondary benefit of effective HAART.
Amexis, Georgios; Oeth, Paul; Abel, Kenneth; Ivshina, Anna; Pelloquin, Francois; Cantor, Charles R.; Braun, Andreas; Chumakov, Konstantin
2001-01-01
RNA viruses exist as quasispecies, heterogeneous and dynamic mixtures of mutants having one or more consensus sequences. An adequate description of the genomic structure of such viral populations must include the consensus sequence(s) plus a quantitative assessment of sequence heterogeneities. For example, in quality control of live attenuated viral vaccines, the presence of even small quantities of mutants or revertants may indicate incomplete or unstable attenuation that may influence vaccine safety. Previously, we demonstrated the monitoring of oral poliovirus vaccine with the use of mutant analysis by PCR and restriction enzyme cleavage (MAPREC). In this report, we investigate genetic variation in live attenuated mumps virus vaccine by using both MAPREC and a platform (DNA MassArray) based on matrix-assisted laser desorption/ionization time-of-flight (MALDI-TOF) mass spectrometry. Mumps vaccines prepared from the Jeryl Lynn strain typically contain at least two distinct viral substrains, JL1 and JL2, which have been characterized by full length sequencing. We report the development of assays for characterizing sequence variants in these substrains and demonstrate their use in quantitative analysis of substrains and sequence variations in mixed virus cultures and mumps vaccines. The results obtained from both the MAPREC and MALDI-TOF methods showed excellent correlation. This suggests the potential utility of MALDI-TOF for routine quality control of live viral vaccines and for assessment of genetic stability and quantitative monitoring of genetic changes in other RNA viruses of clinical interest. PMID:11593021
Eltahir, Yassir M.; Al Hammadi, Zulaikha M.; Tao, Ying; Queen, Krista; Hosani, Farida Al; Gerber, Susan I.; Hall, Aron J.; Al Muhairi, Salama
2017-01-01
Camels are known carriers for many viral pathogens, including Middle East respiratory syndrome coronavirus (MERS-CoV). It is likely that there are additional, as yet unidentified viruses in camels with the potential to cause disease in humans. In this study, we performed metagenomic sequencing analysis on nasopharyngeal swab samples from 108 MERS-CoV-positive dromedary camels from a live animal market in Abu Dhabi, United Arab Emirates. We obtained a total of 846.72 million high-quality reads from these nasopharyngeal swab samples, of which 2.88 million (0.34%) were related to viral sequences while 512.63 million (60.5%) and 50.87 million (6%) matched bacterial and eukaryotic sequences, respectively. Among the viral reads, sequences related to mammalian viruses from 13 genera in 10 viral families were identified, including Coronaviridae, Nairoviridae, Paramyxoviridae, Parvoviridae, Polyomaviridae, Papillomaviridae, Astroviridae, Picornaviridae, Poxviridae, and Genomoviridae. Some viral sequences belong to known camel or human viruses and others are from potentially novel camel viruses with only limited sequence similarity to virus sequences in GenBank. A total of five potentially novel virus species or strains were identified. Co-infection of at least two recently identified camel coronaviruses was detected in 92.6% of the camels in the study. This study provides a comprehensive survey of viruses in the virome of upper respiratory samples in camels that have extensive contact with the human population. PMID:28902913
Viral Genome DataBase: storing and analyzing genes and proteins from complete viral genomes.
Hiscock, D; Upton, C
2000-05-01
The Viral Genome DataBase (VGDB) contains detailed information of the genes and predicted protein sequences from 15 completely sequenced genomes of large (&100 kb) viruses (2847 genes). The data that is stored includes DNA sequence, protein sequence, GenBank and user-entered notes, molecular weight (MW), isoelectric point (pI), amino acid content, A + T%, nucleotide frequency, dinucleotide frequency and codon use. The VGDB is a mySQL database with a user-friendly JAVA GUI. Results of queries can be easily sorted by any of the individual parameters. The software and additional figures and information are available at http://athena.bioc.uvic.ca/genomes/index.html .
Ball Python Nidovirus: a Candidate Etiologic Agent for Severe Respiratory Disease in Python regius
Stenglein, Mark D.; Jacobson, Elliott R.; Wozniak, Edward J.; Wellehan, James F. X.; Kincaid, Anne; Gordon, Marcus; Porter, Brian F.; Baumgartner, Wes; Stahl, Scott; Kelley, Karen; Towner, Jonathan S.
2014-01-01
ABSTRACT A severe, sometimes fatal respiratory disease has been observed in captive ball pythons (Python regius) since the late 1990s. In order to better understand this disease and its etiology, we collected case and control samples and performed pathological and diagnostic analyses. Electron micrographs revealed filamentous virus-like particles in lung epithelial cells of sick animals. Diagnostic testing for known pathogens did not identify an etiologic agent, so unbiased metagenomic sequencing was performed. Abundant nidovirus-like sequences were identified in cases and were used to assemble the genome of a previously unknown virus in the order Nidovirales. The nidoviruses, which were not previously known to infect nonavian reptiles, are a diverse order that includes important human and veterinary pathogens. The presence of the viral RNA was confirmed in all diseased animals (n = 8) but was not detected in healthy pythons or other snakes (n = 57). Viral RNA levels were generally highest in the lung and other respiratory tract tissues. The 33.5-kb viral genome is the largest RNA genome yet described and shares canonical characteristics with other nidovirus genomes, although several features distinguish this from related viruses. This virus, which we named ball python nidovirus (BPNV), will likely establish a new genus in Torovirinae subfamily. The identification of a novel nidovirus in reptiles contributes to our understanding of the biology and evolution of related viruses, and its association with lung disease in pythons is a promising step toward elucidating an etiology for this long-standing veterinary disease. PMID:25205093
Santos, Luciane Amorim; Gray, Rebecca R; Monteiro-Cunha, Joana Paixão; Strazza, Evandra; Kashima, Simone; Santos, Edson de Souza; Araújo, Thessika Hialla Almeida; Gonçalves, Marilda de Souza; Salemi, Marco; Alcantara, Luiz Carlos Junior
2015-09-01
Characterizing the impact of HIV transmission routes on viral genetic diversity can improve the understanding of the mechanisms of virus evolution and adaptation. HIV vertical transmission can occur in utero, during delivery, or while breastfeeding. The present study investigated the phylodynamics of the HIV-1 env gene in mother-to-child transmission by analyzing one chronically infected pair from Brazil and three acutely infected pairs from Zambia, with three to five time points. Sequences from 25 clones from each sample were obtained and aligned using Clustal X. ML trees were constructed in PhyML using the best evolutionary model. Bayesian analyses testing the relaxed and strict molecular clock were performed using BEAST and a Bayesian Skyline Plot (BSP) was construed. The genetic variability of previously described epitopes was investigated and compared between each individual time point and between mother and child sequences. The relaxed molecular clock was the best-fitted model for all datasets. The tree topologies did not show differentiation in the evolutionary dynamics of the virus circulating in the mother from the viral population in the child. In the BSP, the effective population size was more constant in time in the chronically infected patients while in the acute patients it was possible to detect bottlenecks. The genetic variability within viral epitopes recognized by the human immune system was considerably higher among the chronically infected pair in comparison with acutely infected pairs. These results contribute to a better understanding of HIV-1 evolutionary dynamics in mother-to-child transmission.
Descloux, Elodie; Cao-Lormeau, Van-Mai; Roche, Claudine; De Lamballerie, Xavier
2009-01-01
Background Dengue fever (DF) is an emerging infectious disease in the tropics and subtropics. Determinants of DF epidemiology and factors involved in severe cases—dengue haemorrhagic fever (DHF) and dengue shock syndrome (DSS)—remain imperfectly characterized. Since 2000, serotype 1 (DENV-1) has predominated in the South Pacific. The aim of this study was (i) to determine the origin and (ii) to study the evolutionary relationships of DENV-1 viruses that have circulated in French Polynesia (FP) from the severe 2001 outbreak to the recent 2006 epidemic, and (iii) to analyse the viral intra-host genetic diversity according to clinical presentation. Methodology/Principal Findings Sequences of 181 envelope gene and 12 complete polyproteins of DENV-1 viruses obtained from human sera in FP during the 2001–2006 period were generated. Phylogenetic analysis showed that all DENV-1 FP strains belonged to genotype IV–“South Pacific” and derived from a single introduction event from South-East Asia followed by a 6-year in situ evolution. Although the ratio of nonsynonymous/synonymous substitutions per site indicated strong negative selection, a mutation in the envelope glycoprotein (S222T) appeared in 2002 and was subsequently fixed. It was noted that genetic diversification was very significant during the 2002–2005 period of endemic DENV-1 circulation. For nine DF sera and eight DHF/DSS sera, approximately 40 clones/serum of partial envelope gene were sequenced. Importantly, analysis revealed that the intra-host genetic diversity was significantly lower in severe cases than in classical DF. Conclusions/Significance First, this study showed that DENV-1 epidemiology in FP was different from that described in other South-Pacific islands, characterized by a long sustained viral circulation and the absence of new viral introduction over a 6-year period. Second, a significant part of DENV-1 evolution was observed during the endemic period characterized by the rapid fixation of S222T in the envelope protein that may reflect genetic drift or adaptation to the mosquito vector. Third, for the first time, it is suggested that clinical outcome may be correlated with intra-host genetic diversity. PMID:19652703
Gene repertoire of amoeba-associated giant viruses.
Colson, Philippe; Raoult, Didier
2010-01-01
Acanthamoeba polyphaga mimivirus, Marseillevirus, and Sputnik, a virophage, are intra-amoebal viruses that have been isolated from water collected in cooling towers. They have provided fascinating data and have raised exciting questions about viruses definition and evolution. Mimivirus and Marseillevirus have been classified in the nucleo-cytoplasmic large DNA viruses (NCLDVs) class. Their genomes are the largest and fifth largest viral genomes sequenced so far. The gene repertoire of these amoeba-associated viruses can be divided into four groups: the core genome, genes acquired by lateral gene transfer, duplicated genes, and ORFans. Open reading frames (ORFs) that have homologs in the NCLDVs core gene set represent 2.9 and 6.1% of the Mimivirus and Marseillevirus gene contents, respectively. A substantial proportion of the Mimivirus, Marseillevirus and Sputnik ORFs exhibit sequence similarities to homologs found in bacteria, archaea, eukaryotes or viruses. The large amount of chimeric genes in these viral genomes might have resulted from acquisitions by lateral gene transfers, implicating sympatric bacteria and viruses with an intra-amoebal lifestyle. In addition, lineage-specific gene expansion may have played a major role in the genome shaping. Altogether, the data so far accumulated on amoeba-associated giant viruses are a powerful incentive to isolate and study additional strains to gain better understanding of their pangenome. Copyright 2010 S. Karger AG, Basel.
The transmission dynamics and diversity of human metapneumovirus in Peru.
Pollett, Simon; Trovão, Nidia S; Tan, Yi; Eden, John-Sebastian; Halpin, Rebecca A; Bera, Jayati; Das, Suman R; Wentworth, David; Ocaña, Victor; Mendocilla, Silvia M; Álvarez, Carlos; Calisto, Maria E; Garcia, Josefina; Halsey, Eric; Ampuero, Julia S; Nelson, Martha I; Leguia, Mariana
2017-12-29
The transmission dynamics of human metapneumovirus (HMPV) in tropical countries remain unclear. Further understanding of the genetic diversity of the virus could aid in HMPV vaccine design and improve our understanding of respiratory virus transmission dynamics in low- and middle-income countries. We examined the evolution of HMPV in Peru through phylogenetic analysis of 61 full genome HMPV sequences collected in three ecologically diverse regions of Peru (Lima, Piura, and Iquitos) during 2008-2012, comprising the largest data set of HMPV whole genomes sequenced from any tropical country to date. We revealed extensive genetic diversity generated by frequent viral introductions, with little evidence of local persistence. While considerable viral traffic between non-Peruvian countries and Peru was observed, HMPV epidemics in Peruvian locales were more frequently epidemiologically linked with other sites within Peru. We showed that Iquitos experienced greater HMPV traffic than the similar sized city of Piura by both Bayesian and maximum likelihood methods. There is extensive HMPV genetic diversity even within smaller and relatively less connected cities of Peru and this virus is spatially fluid. Greater diversity of HMPV in Iquitos compared to Piura may relate to higher volumes of human movement, including air traffic to this location. © 2017 The Authors. Influenza and Other Respiratory Viruses Published by John Wiley & Sons Ltd.
Variation and Evolution in the Glutamine-Rich Repeat Region of Drosophila Argonaute-2
Palmer, William H.; Obbard, Darren J.
2016-01-01
RNA interference pathways mediate biological processes through Argonaute-family proteins, which bind small RNAs as guides to silence complementary target nucleic acids . In insects and crustaceans Argonaute-2 silences viral nucleic acids, and therefore acts as a primary effector of innate antiviral immunity. Although the function of the major Argonaute-2 domains, which are conserved across most Argonaute-family proteins, are known, many invertebrate Argonaute-2 homologs contain a glutamine-rich repeat (GRR) region of unknown function at the N-terminus . Here we combine long-read amplicon sequencing of Drosophila Genetic Reference Panel (DGRP) lines with publicly available sequence data from many insect species to show that this region evolves extremely rapidly and is hyper-variable within species. We identify distinct GRR haplotype groups in Drosophila melanogaster, and suggest that one of these haplotype groups has recently risen to high frequency in a North American population. Finally, we use published data from genome-wide association studies of viral resistance in D. melanogaster to test whether GRR haplotypes are associated with survival after virus challenge. We find a marginally significant association with survival after challenge with Drosophila C Virus in the DGRP, but we were unable to replicate this finding using lines from the Drosophila Synthetic Population Resource panel. PMID:27317784
Casartelli, Nicoletta; Di Matteo, Gigliola; Argentini, Claudio; Cancrini, Caterina; Bernardi, Stefania; Castelli, Guido; Scarlatti, Gabriella; Plebani, Anna; Rossi, Paolo; Doria, Margherita
2003-06-13
Evaluation of sequence evolution as well as structural defects and mutations of the human immunodeficiency virus-type 1 (HIV-1) nef gene in relation to disease progression in infected children. We examined a large number of nef alleles sequentially derived from perinatally HIV-1-infected children with different rates of disease progression: six non-progressors (NPs), four rapid progressors (RPs), and three slow progressors (SPs). Nef alleles (182 total) were isolated from patients' peripheral blood mononuclear cells (PBMCs), sequenced and analysed for their evolutionary pattern, frequency of mutations and occurrence of amino acid variations associated with different stages of disease. The evolution rate of the nef gene apparently correlated with CD4+ decline in all progression groups. Evidence for rapid viral turnover and positive selection for changes were found only in two SPs and two RPs respectively. In NPs, a higher proportion of disrupted sequences and mutations at various functional motifs were observed. Furthermore, NP-derived Nef proteins were often changed at residues localized in the folded core domain at cytotoxic T lymphocytes (CTL) epitopes (E(105), K(106), E(110), Y(132), K(164), and R(200)), while other residues outside the core domain are more often changed in RPs (A(43)) and SPs (N(173) and Y(214)). Our results suggest a link between nef gene functions and the progression rate in HIV-1-infected children. Moreover, non-progressor-associated variations in the core domain of Nef, together with the genetic analysis, suggest that nef gene evolution is shaped by an effective immune system in these patients.
Kawasaki, Junna; Kawamura, Maki; Ohsato, Yoshiharu; Ito, Jumpei; Nishigaki, Kazuo
2017-10-15
Recombination events induce significant genetic changes, and this process can result in virus genetic diversity or in the generation of novel pathogenicity. We discovered a new recombinant feline leukemia virus (FeLV) gag gene harboring an unrelated insertion, termed the X region, which was derived from Felis catus endogenous gammaretrovirus 4 (FcERV-gamma4). The identified FcERV-gamma4 proviruses have lost their coding capabilities, but some can express their viral RNA in feline tissues. Although the X-region-carrying recombinant FeLVs appeared to be replication-defective viruses, they were detected in 6.4% of tested FeLV-infected cats. All isolated recombinant FeLV clones commonly incorporated a middle part of the FcERV-gamma4 5'-leader region as an X region. Surprisingly, a sequence corresponding to the portion contained in all X regions is also present in at least 13 endogenous retroviruses (ERVs) observed in the cat, human, primate, and pig genomes. We termed this shared genetic feature the commonly shared (CS) sequence. Despite our phylogenetic analysis indicating that all CS-sequence-carrying ERVs are classified as gammaretroviruses, no obvious closeness was revealed among these ERVs. However, the Shannon entropy in the CS sequence was lower than that in other parts of the provirus genome. Notably, the CS sequence of human endogenous retrovirus T had 73.8% similarity with that of FcERV-gamma4, and specific signals were detected in the human genome by Southern blot analysis using a probe for the FcERV-gamma4 CS sequence. Our results provide an interesting evolutionary history for CS-sequence circulation among several distinct ancestral viruses and a novel recombined virus over a prolonged period. IMPORTANCE Recombination among ERVs or modern viral genomes causes a rapid evolution of retroviruses, and this phenomenon can result in the serious situation of viral disease reemergence. We identified a novel recombinant FeLV gag gene that contains an unrelated sequence, termed the X region. This region originated from the 5' leader of FcERV-gamma4, a replication-incompetent feline ERV. Surprisingly, a sequence corresponding to the X region is also present in the 5' portion of other ERVs, including human endogenous retroviruses. Scattered copies of the ERVs carrying the unique genetic feature, here named the commonly shared (CS) sequence, were found in each host genome, suggesting that ancestral viruses may have captured and maintained the CS sequence. More recently, a novel recombinant FeLV hijacked the CS sequence from inactivated FcERV-gamma4 as the X region. Therefore, tracing the CS sequences can provide unique models for not only the modern reservoir of new recombinant viruses but also the genetic features shared among ancient retroviruses. Copyright © 2017 American Society for Microbiology.
Hou, Weiguo; Wang, Shang; Briggs, Brandon R; Li, Gaoyuan; Xie, Wei; Dong, Hailiang
2018-01-01
Myocyanophages, a group of viruses infecting cyanobacteria, are abundant and play important roles in elemental cycling. Here we investigated the particle-associated viral communities retained on 0.2 μm filters and in sediment samples (representing ancient cyanophage communities) from four ocean and three lake locations, using high-throughput sequencing and a newly designed primer pair targeting a gene fragment (∼145-bp in length) encoding the cyanophage gp23 major capsid protein (MCP). Diverse viral communities were detected in all samples. The fragments of 142-, 145-, and 148-bp in length were most abundant in the amplicons, and most sequences (>92%) belonged to cyanophages. Additionally, different sequencing depths resulted in different diversity estimates of the viral community. Operational taxonomic units obtained from deep sequencing of the MCP gene covered the majority of those obtained from shallow sequencing, suggesting that deep sequencing exhibited a more complete picture of cyanophage community than shallow sequencing. Our results also revealed a wide geographic distribution of marine myocyanophages, i.e., higher dissimilarities of the myocyanophage communities corresponded with the larger distances between the sampling sites. Collectively, this study suggests that the newly designed primer pair can be effectively used to study the community and diversity of myocyanophage from different environments, and the high-throughput sequencing represents a good method to understand viral diversity.
Hou, Weiguo; Wang, Shang; Briggs, Brandon R.; Li, Gaoyuan; Xie, Wei; Dong, Hailiang
2018-01-01
Myocyanophages, a group of viruses infecting cyanobacteria, are abundant and play important roles in elemental cycling. Here we investigated the particle-associated viral communities retained on 0.2 μm filters and in sediment samples (representing ancient cyanophage communities) from four ocean and three lake locations, using high-throughput sequencing and a newly designed primer pair targeting a gene fragment (∼145-bp in length) encoding the cyanophage gp23 major capsid protein (MCP). Diverse viral communities were detected in all samples. The fragments of 142-, 145-, and 148-bp in length were most abundant in the amplicons, and most sequences (>92%) belonged to cyanophages. Additionally, different sequencing depths resulted in different diversity estimates of the viral community. Operational taxonomic units obtained from deep sequencing of the MCP gene covered the majority of those obtained from shallow sequencing, suggesting that deep sequencing exhibited a more complete picture of cyanophage community than shallow sequencing. Our results also revealed a wide geographic distribution of marine myocyanophages, i.e., higher dissimilarities of the myocyanophage communities corresponded with the larger distances between the sampling sites. Collectively, this study suggests that the newly designed primer pair can be effectively used to study the community and diversity of myocyanophage from different environments, and the high-throughput sequencing represents a good method to understand viral diversity.
Russell, Jessica S; Caly, Leon; Kostecki, Renata; McGuinness, Sarah L; Carter, Glen; Bulach, Dieter; Seemann, Torsten; Stinear, Tim P; Baird, Rob; Catton, Mike; Druce, Julian
2018-06-11
Murray Valley Encephalitis virus (MVEV) is a mosquito-borne Flavivirus. Clinical presentation is rare but severe, with a case fatality rate of 15⁻30%. Here we report a case of MVEV from the cerebrospinal fluid (CSF) of a patient in the Northern Territory in Australia. Initial diagnosis was performed using both MVEV-specific real-time, and Pan- Flavivirus conventional, Polymerase Chain Reaction (PCR), with confirmation by Sanger sequencing. Subsequent isolation, the first from CSF, was conducted in Vero cells and the observed cytopathic effect was confirmed by increasing viral titre in the real-time PCR. Isolation allowed for full genome sequencing using the Scriptseq V2 RNASeq library preparation kit. A consensus genome for VIDRL-MVE was generated and phylogenetic analysis identified it as Genotype 2. This is the first reported isolation, and full genome sequencing of MVEV from CSF. It is also the first time Genotype 2 has been identified in humans. As such, this case has significant implications for public health surveillance, epidemiology, and the understanding of MVEV evolution.
Broad Surveys of DNA Viral Diversity Obtained through Viral Metagenomics of Mosquitoes
Ng, Terry Fei Fan; Willner, Dana L.; Lim, Yan Wei; Schmieder, Robert; Chau, Betty; Nilsson, Christina; Anthony, Simon; Ruan, Yijun; Rohwer, Forest; Breitbart, Mya
2011-01-01
Viruses are the most abundant and diverse genetic entities on Earth; however, broad surveys of viral diversity are hindered by the lack of a universal assay for viruses and the inability to sample a sufficient number of individual hosts. This study utilized vector-enabled metagenomics (VEM) to provide a snapshot of the diversity of DNA viruses present in three mosquito samples from San Diego, California. The majority of the sequences were novel, suggesting that the viral community in mosquitoes, as well as the animal and plant hosts they feed on, is highly diverse and largely uncharacterized. Each mosquito sample contained a distinct viral community. The mosquito viromes contained sequences related to a broad range of animal, plant, insect and bacterial viruses. Animal viruses identified included anelloviruses, circoviruses, herpesviruses, poxviruses, and papillomaviruses, which mosquitoes may have obtained from vertebrate hosts during blood feeding. Notably, sequences related to human papillomaviruses were identified in one of the mosquito samples. Sequences similar to plant viruses were identified in all mosquito viromes, which were potentially acquired through feeding on plant nectar. Numerous bacteriophages and insect viruses were also detected, including a novel densovirus likely infecting Culex erythrothorax. Through sampling insect vectors, VEM enables broad survey of viral diversity and has significantly increased our knowledge of the DNA viruses present in mosquitoes. PMID:21674005
Fernandez-Cassi, X; Timoneda, N; Gonzales-Gustavson, E; Abril, J F; Bofill-Mas, S; Girones, R
2017-09-18
Microbial food-borne diseases are still frequently reported despite the implementation of microbial quality legislation to improve food safety. Among all the microbial agents, viruses are the most important causative agents of food-borne outbreaks. The development and application of a new generation of sequencing techniques to test for viral contaminants in fresh produce is an unexplored field that allows for the study of the viral populations that might be transmitted by the fecal-oral route through the consumption of contaminated food. To advance this promising field, parsley was planted and grown under controlled conditions and irrigated using contaminated river water. Viruses polluting the irrigation water and the parsley leaves were studied by using metagenomics. To address possible contamination due to sample manipulation, library preparation, and other sources, parsley plants irrigated with nutritive solution were used as a negative control. In parallel, viruses present in the river water used for plant irrigation were analyzed using the same methodology. It was possible to assign viral taxons from 2.4 to 74.88% of the total reads sequenced depending on the sample. Most of the viral reads detected in the river water were related to the plant viral families Tymoviridae (66.13%) and Virgaviridae (14.45%) and the phage viral families Myoviridae (5.70%), Siphoviridae (5.06%), and Microviridae (2.89%). Less than 1% of the viral reads were related to viral families that infect humans, including members of the Adenoviridae, Reoviridae, Picornaviridae and Astroviridae families. On the surface of the parsley plants, most of the viral reads that were detected were assigned to the Dicistroviridae family (41.52%). Sequences related to important viral pathogens, such as the hepatitis E virus, several picornaviruses from species A and B as well as human sapoviruses and GIV noroviruses were detected. The high diversity of viral sequences found in the parsley plants suggests that irrigation on fecally-tainted food may have a role in the transmission of a wide diversity of viral families. This finding reinforces the idea that the best way to avoid food-borne viral diseases is to introduce good field irrigation and production practices. New strains have been identified that are related to the Picornaviridae and distantly related to the Hepeviridae family. However, the detection of a viral genome alone does not necessarily indicate there is a risk of infection or disease development. Thus, further investigation is crucial for correlating the detection of viral metagenomes in samples with the risk of infection. There is also an urgent need to develop new methods to improve the sensitivity of current Next Generation Sequencing (NGS) techniques in the food safety area. Copyright © 2017 Elsevier B.V. All rights reserved.
Computational clustering for viral reference proteomes
Chen, Chuming; Huang, Hongzhan; Mazumder, Raja; Natale, Darren A.; McGarvey, Peter B.; Zhang, Jian; Polson, Shawn W.; Wang, Yuqi; Wu, Cathy H.
2016-01-01
Motivation: The enormous number of redundant sequenced genomes has hindered efforts to analyze and functionally annotate proteins. As the taxonomy of viruses is not uniformly defined, viral proteomes pose special challenges in this regard. Grouping viruses based on the similarity of their proteins at proteome scale can normalize against potential taxonomic nomenclature anomalies. Results: We present Viral Reference Proteomes (Viral RPs), which are computed from complete virus proteomes within UniProtKB. Viral RPs based on 95, 75, 55, 35 and 15% co-membership in proteome similarity based clusters are provided. Comparison of our computational Viral RPs with UniProt’s curator-selected Reference Proteomes indicates that the two sets are consistent and complementary. Furthermore, each Viral RP represents a cluster of virus proteomes that was consistent with virus or host taxonomy. We provide BLASTP search and FTP download of Viral RP protein sequences, and a browser to facilitate the visualization of Viral RPs. Availability and implementation: http://proteininformationresource.org/rps/viruses/ Contact: chenc@udel.edu Supplementary information: Supplementary data are available at Bioinformatics online. PMID:27153712
2011-01-01
Background Complex mutants can be selected under sequential selective pressure by HBV therapy. To determine hepatitis B virus genomic evolution during antiviral therapy we characterized the HBV quasi-species in a patient who did no respond to therapy following lamivudine breakthrough for a period of 14 years. Case Presentation The polymerase and precore/core genes were amplified and sequenced at determined intervals in a period of 14 years. HBV viral load and HBeAg/Anti-HBe serological profiles as well as amino transferase levels were also measured. A mixture of lamivudine-resistant genotype A2 HBV strains harboring the rtM204V mutation coexisted in the patient following viral breakthrough to lamivudine. The L180M+M204V dominant mutant displayed strong lamivudine-resistance. As therapy was changed to adefovir, then to entecavir, and finally to entecavir-tenofovir the viral load showed fluctuations but lamivudine-resistant strains continued to be selected, with minor contributions to the HBV quasi-species composition of additional resistance-associated mutations. At the end of the 14-year follow up period, high viral loads were predominant, with viral strains harboring the lamivudine-resistance signature rtL180M+M204V. The precore/core frame A1762T and G1764A double mutation was detected before treatment and remaining in this condition during the entire follow-up. Specific entecavir and tenofovir primary resistance-associated mutations were not detected at any time. Plasma concentrations of tenofovir indicated adequate metabolism of the drug. Conclusions We report the selection of HBV mutants carrying well-defined primary resistance mutations that escaped lamivudine in a fourteen-year follow-up period. With the exception of tenofovir resistance mutations, subsequent unselected primary resistance mutations were detected as minor populations into the HBV quasispecies composition during adefovir or entecavir monotherapies. Although tenofovir is considered an appropriate therapeutic alternative for the treatment of entecavir-unresponsive patients, its use was not effective in the case reported here. PMID:21696601
Sarmady, Mahdi; Dampier, William; Tozeren, Aydin
2011-01-01
Virus proteins alter protein pathways of the host toward the synthesis of viral particles by breaking and making edges via binding to host proteins. In this study, we developed a computational approach to predict viral sequence hotspots for binding to host proteins based on sequences of viral and host proteins and literature-curated virus-host protein interactome data. We use a motif discovery algorithm repeatedly on collections of sequences of viral proteins and immediate binding partners of their host targets and choose only those motifs that are conserved on viral sequences and highly statistically enriched among binding partners of virus protein targeted host proteins. Our results match experimental data on binding sites of Nef to host proteins such as MAPK1, VAV1, LCK, HCK, HLA-A, CD4, FYN, and GNB2L1 with high statistical significance but is a poor predictor of Nef binding sites on highly flexible, hoop-like regions. Predicted hotspots recapture CD8 cell epitopes of HIV Nef highlighting their importance in modulating virus-host interactions. Host proteins potentially targeted or outcompeted by Nef appear crowding the T cell receptor, natural killer cell mediated cytotoxicity, and neurotrophin signaling pathways. Scanning of HIV Nef motifs on multiple alignments of hepatitis C protein NS5A produces results consistent with literature, indicating the potential value of the hotspot discovery in advancing our understanding of virus-host crosstalk. PMID:21738584
Structural Plasticity and Rapid Evolution in a Viral RNA Revealed by In Vivo Genetic Selection▿ †
Guo, Rong; Lin, Wai; Zhang, Jiuchun; Simon, Anne E.; Kushner, David B.
2009-01-01
Satellite RNAs usually lack substantial homology with their helper viruses. The 356-nucleotide satC of Turnip crinkle virus (TCV) is unusual in that its 3′-half shares high sequence similarity with the TCV 3′ end. Computer modeling, structure probing, and/or compensatory mutagenesis identified four hairpins and three pseudoknots in this TCV region that participate in replication and/or translation. Two hairpins and two pseudoknots have been confirmed as important for satC replication. One portion of the related 3′ end of satC that remains poorly characterized corresponds to juxtaposed TCV hairpins H4a and H4b and pseudoknot ψ3, which are required for the TCV-specific requirement of translation (V. A. Stupina et al., RNA 14:2379-2393, 2008). Replacement of satC H4a with randomized sequence and scoring for fitness in plants by in vivo genetic selection (SELEX) resulted in winning sequences that contain an H4a-like stem-loop, which can have additional upstream sequence composing a portion of the stem. SELEX of the combined H4a and H4b region in satC generated three distinct groups of winning sequences. One group models into two stem-loops similar to H4a and H4b of TCV. However, the selected sequences in the other two groups model into single hairpins. Evolution of these single-hairpin SELEX winners in plants resulted in satC that can accumulate to wild-type (wt) levels in protoplasts but remain less fit in planta when competed against wt satC. These data indicate that two highly distinct RNA conformations in the H4a and H4b region can mediate satC fitness in protoplasts. PMID:19004956
Bull, Rowena A; Leung, Preston; Gaudieri, Silvana; Deshpande, Pooja; Cameron, Barbara; Walker, Melanie; Chopra, Abha; Lloyd, Andrew R; Luciani, Fabio
2015-05-01
The interaction between hepatitis C virus (HCV) and cellular immune responses during very early infection is critical for disease outcome. To date, the impact of antigen-specific cellular immune responses on the evolution of the viral population establishing infection and on potential escape has not been studied. Understanding these early host-virus dynamics is important for the development of a preventative vaccine. Three subjects who were followed longitudinally from the detection of viremia preseroconversion until disease outcome were analyzed. The evolution of transmitted/founder (T/F) viruses was undertaken using deep sequencing. CD8(+) T cell responses were measured via enzyme-linked immunosorbent spot (ELISpot) assay using HLA class I-restricted T/F epitopes. T/F viruses were rapidly extinguished in all subjects associated with either viral clearance (n = 1) or replacement with viral variants leading to establishment of chronic infection (n = 2). CD8(+) T cell responses against 11 T/F epitopes were detectable by 33 to 44 days postinfection, and 5 of these epitopes had not previously been reported. These responses declined rapidly in those who became chronically infected and were maintained in the subject who cleared infection. Higher-magnitude CD8(+) T cell responses were associated with rapid development of immune escape variants at a rate of up to 0.1 per day. Rapid escape from CD8(+) T cell responses has been quantified for the first time in the early phase of primary HCV infection. These rapid escape dynamics were associated with higher-magnitude CD8(+) T cell responses. These findings raise questions regarding optimal selection of immunogens for HCV vaccine development and suggest that detailed analysis of individual epitopes may be required. A major limitation in our detailed understanding of the role of immune response in HCV clearance has been the lack of data on very early primary infection when the transmitted viral variants successfully establish the acute infection. This study was made possible through the availability of specimens from a unique cohort of asymptomatic primary infection cases in whom the first available viremic samples were collected approximately 3 weeks postinfection and at regular intervals thereafter. The study included detailed examination of both the evolution of the viral population and the host cellular immune responses against the T/F viruses. The findings here provide the first evidence of host cellular responses targeting T/F variants and imposing a strong selective force toward viral escape. The results of this study provide useful insight on how virus escapes the host response and consequently on future analysis of vaccine-induced immunity. Copyright © 2015, American Society for Microbiology. All Rights Reserved.
DOE Office of Scientific and Technical Information (OSTI.GOV)
Aw, Tiong Gim; Howe, Adina; Rose, Joan B.
2014-12-01
Genomic-based molecular techniques are emerging as powerful tools that allow a comprehensive characterization of water and wastewater microbiomes. Most recently, next generation sequencing (NGS) technologies which produce large amounts of sequence data are beginning to impact the field of environmental virology. In this study, NGS and bioinformatics have been employed for the direct detection and characterization of viruses in wastewater and of viruses isolated after cell culture. Viral particles were concentrated and purified from sewage samples by polyethylene glycol precipitation. Viral nucleic acid was extracted and randomly amplified prior to sequencing using Illumina technology, yielding a total of 18 millionmore » sequence reads. Most of the viral sequences detected could not be characterized, indicating the great viral diversity that is yet to be discovered. This sewage virome was dominated by bacteriophages and contained sequences related to known human pathogenic viruses such as adenoviruses (species B, C and F), polyomaviruses JC and BK and enteroviruses (type B). An array of other animal viruses was also found, suggesting unknown zoonotic viruses. This study demonstrated the feasibility of metagenomic approaches to characterize viruses in complex environmental water samples.« less
Bertels, Frederic; Marzel, Alex; Leventhal, Gabriel; Mitov, Venelin; Fellay, Jacques; Günthard, Huldrych F; Böni, Jürg; Yerly, Sabine; Klimkait, Thomas; Aubert, Vincent; Battegay, Manuel; Rauch, Andri; Cavassini, Matthias; Calmy, Alexandra; Bernasconi, Enos; Schmid, Patrick; Scherrer, Alexandra U; Müller, Viktor; Bonhoeffer, Sebastian; Kouyos, Roger; Regoes, Roland R
2018-01-01
Abstract Pathogen strains may differ in virulence because they attain different loads in their hosts, or because they induce different disease-causing mechanisms independent of their load. In evolutionary ecology, the latter is referred to as “per-parasite pathogenicity”. Using viral load and CD4+ T-cell measures from 2014 HIV-1 subtype B-infected individuals enrolled in the Swiss HIV Cohort Study, we investigated if virulence—measured as the rate of decline of CD4+ T cells—and per-parasite pathogenicity are heritable from donor to recipient. We estimated heritability by donor–recipient regressions applied to 196 previously identified transmission pairs, and by phylogenetic mixed models applied to a phylogenetic tree inferred from HIV pol sequences. Regressing the CD4+ T-cell declines and per-parasite pathogenicities of the transmission pairs did not yield heritability estimates significantly different from zero. With the phylogenetic mixed model, however, our best estimate for the heritability of the CD4+ T-cell decline is 17% (5–30%), and that of the per-parasite pathogenicity is 17% (4–29%). Further, we confirm that the set-point viral load is heritable, and estimate a heritability of 29% (12–46%). Interestingly, the pattern of evolution of all these traits differs significantly from neutrality, and is most consistent with stabilizing selection for the set-point viral load, and with directional selection for the CD4+ T-cell decline and the per-parasite pathogenicity. Our analysis shows that the viral genotype affects virulence mainly by modulating the per-parasite pathogenicity, while the indirect effect via the set-point viral load is minor. PMID:29029206
Dugast, Anne-Sophie; Arnold, Kelly; Lofano, Giuseppe; Moore, Sarah; Hoffner, Michelle; Simek, Melissa; Poignard, Pascal; Seaman, Michael; Suscovich, Todd J; Pereyra, Florencia; Walker, Bruce D; Lauffenburger, Doug; Kwon, Douglas S; Keele, Brandon F; Alter, Galit
2017-04-15
Understanding the mechanism(s) by which broadly neutralizing antibodies (bNAbs) emerge naturally following infection is crucial for the development of a protective vaccine against human immunodeficiency virus (HIV). Although previous studies have implicated high viremia and associated immune activation as potential drivers for the development of bNAbs, here we sought to unlink the effect of these 2 parameters by evaluating the key inflammatory predictors of bNAb development in HIV-infected individuals who spontaneously control HIV in the absence of antiretroviral therapy ("controllers"). The breadth of antibody-mediated neutralization against 11 tier 2 or 3 viruses was assessed in 163 clade B spontaneous controllers of HIV. Plasma levels of 17 cytokines were screened in the same set of subjects. The relationship of the inflammatory signature was assessed in the context of viral blips or viral RNA levels in peripheral blood or gastrointestinal biopsies from aviremic controllers (<50 copies RNA/mL) and in the context of viral sequence diversity analysis in the plasma of viremic controllers (<50-2000 copies RNA/mL). A unique inflammatory profile, including high plasma levels of CXCL13, sCD40L, IP10, RANTES, and TNFα, was observed in HIV controllers who developed bNAbs. Interestingly, viral load and tissue viremia, but not intermittent viral blips, were associated with these cytokine profiles. However, viral diversity was not significantly associated with increased breadth in controllers. These results suggest that low antigenic diversity in the setting of a unique inflammatory profile associated with antigen persistence may be linked to the evolution of neutralizing antibody breadth. © The Author 2017. Published by Oxford University Press for the Infectious Diseases Society of America. All rights reserved. For permissions, e-mail: journals.permissions@oup.com.
Cas9 specifies functional viral targets during CRISPR-Cas adaptation.
Heler, Robert; Samai, Poulami; Modell, Joshua W; Weiner, Catherine; Goldberg, Gregory W; Bikard, David; Marraffini, Luciano A
2015-03-12
Clustered regularly interspaced short palindromic repeat (CRISPR) loci and their associated (Cas) proteins provide adaptive immunity against viral infection in prokaryotes. Upon infection, short phage sequences known as spacers integrate between CRISPR repeats and are transcribed into small RNA molecules that guide the Cas9 nuclease to the viral targets (protospacers). Streptococcus pyogenes Cas9 cleavage of the viral genome requires the presence of a 5'-NGG-3' protospacer adjacent motif (PAM) sequence immediately downstream of the viral target. It is not known whether and how viral sequences flanked by the correct PAM are chosen as new spacers. Here we show that Cas9 selects functional spacers by recognizing their PAM during spacer acquisition. The replacement of cas9 with alleles that lack the PAM recognition motif or recognize an NGGNG PAM eliminated or changed PAM specificity during spacer acquisition, respectively. Cas9 associates with other proteins of the acquisition machinery (Cas1, Cas2 and Csn2), presumably to provide PAM-specificity to this process. These results establish a new function for Cas9 in the genesis of prokaryotic immunological memory.
DOE Office of Scientific and Technical Information (OSTI.GOV)
Hraber, Peter; Korber, Bette; Wagh, Kshitij
Within-host genetic sequencing from samples collected over time provides a dynamic view of how viruses evade host immunity. Immune-driven mutations might stimulate neutralization breadth by selecting antibodies adapted to cycles of immune escape that generate within-subject epitope diversity. Comprehensive identification of immune-escape mutations is experimentally and computationally challenging. With current technology, many more viral sequences can readily be obtained than can be tested for binding and neutralization, making down-selection necessary. Typically, this is done manually, by picking variants that represent different time-points and branches on a phylogenetic tree. Such strategies are likely to miss many relevant mutations and combinations ofmore » mutations, and to be redundant for other mutations. Longitudinal Antigenic Sequences and Sites from Intrahost Evolution (LASSIE) uses transmitted founder loss to identify virus “hot-spots” under putative immune selection and chooses sequences that represent recurrent mutations in selected sites. LASSIE favors earliest sequences in which mutations arise. Here, with well-characterized longitudinal Env sequences, we confirmed selected sites were concentrated in antibody contacts and selected sequences represented diverse antigenic phenotypes. Finally, practical applications include rapidly identifying immune targets under selective pressure within a subject, selecting minimal sets of reagents for immunological assays that characterize evolving antibody responses, and for immunogens in polyvalent “cocktail” vaccines.« less
Dorsch-Häsler, Karoline; Fisher, Paul B.; Weinstein, I. Bernard; Ginsberg, Harold S.
1980-01-01
The integration pattern of viral DNA was studied in a number of cell lines transformed by wild-type adenovirus type 5 (Ad5 WT) and two mutants of the DNA-binding protein gene, H5ts125 and H5ts107. The effect of chemical carcinogens on the integration of viral DNA was also investigated. Liquid hybridization (C0t) analyses showed that rat embryo cells transformed by Ad5 WT usually contained only the left-hand end of the viral genome, whereas cell lines transformed by H5ts125 or H5ts107 at either the semipermissive (36°C) or nonpermissive (39.5°C) temperature often contained one to five copies of all or most of the entire adenovirus genome. The arrangement of the integrated adenovirus DNA sequences was determined by cleavage of transformed cell DNA with restriction endonucleases XbaI, EcoRI, or HindIII followed by transfer of separated fragments to nitrocellulose paper and hybridization according to the technique of E. M. Southern (J. Mol. Biol. 98: 503-517, 1975). It was found that the adenovirus genome is integrated as a linear sequence covalently linked to host cell DNA; that the viral DNA is integrated into different host DNA sequences in each cell line studied; that in cell lines that contain multiple copies of the Ad5 genome the viral DNA sequences can be integrated in a single set of host cell DNA sequences and not as concatemers; and that chemical carcinogens do not alter the extent or pattern of viral DNA integration. Images PMID:6246266
de Andrade, Roberto R S; Vaslin, Maite F S
2014-03-07
Next-generation parallel sequencing (NGS) allows the identification of viral pathogens by sequencing the small RNAs of infected hosts. Thus, viral genomes may be assembled from host immune response products without prior virus enrichment, amplification or purification. However, mapping of the vast information obtained presents a bioinformatics challenge. In order to by pass the need of line command and basic bioinformatics knowledge, we develop a mapping software with a graphical interface to the assemblage of viral genomes from small RNA dataset obtained by NGS. SearchSmallRNA was developed in JAVA language version 7 using NetBeans IDE 7.1 software. The program also allows the analysis of the viral small interfering RNAs (vsRNAs) profile; providing an overview of the size distribution and other features of the vsRNAs produced in infected cells. The program performs comparisons between each read sequenced present in a library and a chosen reference genome. Reads showing Hamming distances smaller or equal to an allowed mismatched will be selected as positives and used to the assemblage of a long nucleotide genome sequence. In order to validate the software, distinct analysis using NGS dataset obtained from HIV and two plant viruses were used to reconstruct viral whole genomes. SearchSmallRNA program was able to reconstructed viral genomes using NGS of small RNA dataset with high degree of reliability so it will be a valuable tool for viruses sequencing and discovery. It is accessible and free to all research communities and has the advantage to have an easy-to-use graphical interface. SearchSmallRNA was written in Java and is freely available at http://www.microbiologia.ufrj.br/ssrna/.
2014-01-01
Background Next-generation parallel sequencing (NGS) allows the identification of viral pathogens by sequencing the small RNAs of infected hosts. Thus, viral genomes may be assembled from host immune response products without prior virus enrichment, amplification or purification. However, mapping of the vast information obtained presents a bioinformatics challenge. Methods In order to by pass the need of line command and basic bioinformatics knowledge, we develop a mapping software with a graphical interface to the assemblage of viral genomes from small RNA dataset obtained by NGS. SearchSmallRNA was developed in JAVA language version 7 using NetBeans IDE 7.1 software. The program also allows the analysis of the viral small interfering RNAs (vsRNAs) profile; providing an overview of the size distribution and other features of the vsRNAs produced in infected cells. Results The program performs comparisons between each read sequenced present in a library and a chosen reference genome. Reads showing Hamming distances smaller or equal to an allowed mismatched will be selected as positives and used to the assemblage of a long nucleotide genome sequence. In order to validate the software, distinct analysis using NGS dataset obtained from HIV and two plant viruses were used to reconstruct viral whole genomes. Conclusions SearchSmallRNA program was able to reconstructed viral genomes using NGS of small RNA dataset with high degree of reliability so it will be a valuable tool for viruses sequencing and discovery. It is accessible and free to all research communities and has the advantage to have an easy-to-use graphical interface. Availability and implementation SearchSmallRNA was written in Java and is freely available at http://www.microbiologia.ufrj.br/ssrna/. PMID:24607237
A Pan-HIV Strategy for Complete Genome Sequencing
Yamaguchi, Julie; Alessandri-Gradt, Elodie; Tell, Robert W.; Brennan, Catherine A.
2015-01-01
Molecular surveillance is essential to monitor HIV diversity and track emerging strains. We have developed a universal library preparation method (HIV-SMART [i.e., switching mechanism at 5′ end of RNA transcript]) for next-generation sequencing that harnesses the specificity of HIV-directed priming to enable full genome characterization of all HIV-1 groups (M, N, O, and P) and HIV-2. Broad application of the HIV-SMART approach was demonstrated using a panel of diverse cell-cultured virus isolates. HIV-1 non-subtype B-infected clinical specimens from Cameroon were then used to optimize the protocol to sequence directly from plasma. When multiplexing 8 or more libraries per MiSeq run, full genome coverage at a median ∼2,000× depth was routinely obtained for either sample type. The method reproducibly generated the same consensus sequence, consistently identified viral sequence heterogeneity present in specimens, and at viral loads of ≤4.5 log copies/ml yielded sufficient coverage to permit strain classification. HIV-SMART provides an unparalleled opportunity to identify diverse HIV strains in patient specimens and to determine phylogenetic classification based on the entire viral genome. Easily adapted to sequence any RNA virus, this technology illustrates the utility of next-generation sequencing (NGS) for viral characterization and surveillance. PMID:26699702
Loncoman, Carlos A; Hartley, Carol A; Coppo, Mauricio J C; Vaz, Paola K; Diaz-Méndez, Andrés; Browning, Glenn F; García, Maricarmen; Spatz, Stephen; Devlin, Joanne M
2017-12-01
Recombination is a feature of many alphaherpesviruses that infect people and animals. Infectious laryngotracheitis virus (ILTV; Gallid alphaherpesvirus 1 ) causes respiratory disease in chickens, resulting in significant production losses in poultry industries worldwide. Natural (field) ILTV recombination is widespread, particularly recombination between attenuated ILTV vaccine strains to create virulent viruses. These virulent recombinants have had a major impact on animal health. Recently, the development of a single nucleotide polymorphism (SNP) genotyping assay for ILTV has helped to understand ILTV recombination in laboratory settings. In this study, we applied this SNP genotyping assay to further examine ILTV recombination in the natural host. Following coinoculation of specific-pathogen-free chickens, we examined the resultant progeny for evidence of viral recombination and characterized the diversity of the recombinants over time. The results showed that ILTV replication and recombination are closely related and that the recombinant viral progeny are most diverse 4 days after coinoculation, which is the peak of viral replication. Further, the locations of recombination breakpoints in a selection of the recombinant progeny, and in field isolates of ILTV from different geographical regions, were examined following full-genome sequencing and used to identify recombination hot spots in the ILTV genome. IMPORTANCE Alphaherpesviruses are common causes of disease in people and animals. Recombination enables genome diversification in many different species of alphaherpesviruses, which can lead to the evolution of higher levels of viral virulence. Using the alphaherpesvirus infectious laryngotracheitis virus (ILTV), we performed coinfections in the natural host (chickens) to demonstrate high levels of virus recombination. Higher levels of diversity in the recombinant progeny coincided with the highest levels of virus replication. In the recombinant progeny, and in field isolates, recombination occurred at greater frequency in recombination hot spot regions of the virus genome. Our results suggest that control measures that aim to limit viral replication could offer the potential to limit virus recombination and thus the evolution of virulence. The development and use of vaccines that are focused on limiting virus replication, rather than vaccines that are focused more on limiting clinical disease, may be indicated in order to better control disease. Copyright © 2017 American Society for Microbiology.
A wide extent of inter-strain diversity in virulent and vaccine strains of alphaherpesviruses.
Szpara, Moriah L; Tafuri, Yolanda R; Parsons, Lance; Shamim, S Rafi; Verstrepen, Kevin J; Legendre, Matthieu; Enquist, L W
2011-10-01
Alphaherpesviruses are widespread in the human population, and include herpes simplex virus 1 (HSV-1) and 2, and varicella zoster virus (VZV). These viral pathogens cause epithelial lesions, and then infect the nervous system to cause lifelong latency, reactivation, and spread. A related veterinary herpesvirus, pseudorabies (PRV), causes similar disease in livestock that result in significant economic losses. Vaccines developed for VZV and PRV serve as useful models for the development of an HSV-1 vaccine. We present full genome sequence comparisons of the PRV vaccine strain Bartha, and two virulent PRV isolates, Kaplan and Becker. These genome sequences were determined by high-throughput sequencing and assembly, and present new insights into the attenuation of a mammalian alphaherpesvirus vaccine strain. We find many previously unknown coding differences between PRV Bartha and the virulent strains, including changes to the fusion proteins gH and gB, and over forty other viral proteins. Inter-strain variation in PRV protein sequences is much closer to levels previously observed for HSV-1 than for the highly stable VZV proteome. Almost 20% of the PRV genome contains tandem short sequence repeats (SSRs), a class of nucleic acids motifs whose length-variation has been associated with changes in DNA binding site efficiency, transcriptional regulation, and protein interactions. We find SSRs throughout the herpesvirus family, and provide the first global characterization of SSRs in viruses, both within and between strains. We find SSR length variation between different isolates of PRV and HSV-1, which may provide a new mechanism for phenotypic variation between strains. Finally, we detected a small number of polymorphic bases within each plaque-purified PRV strain, and we characterize the effect of passage and plaque-purification on these polymorphisms. These data add to growing evidence that even plaque-purified stocks of stable DNA viruses exhibit limited sequence heterogeneity, which likely seeds future strain evolution.
Global Avian Influenza Surveillance in Wild Birds: A Strategy to Capture Viral Diversity
Machalaba, Catherine C.; Elwood, Sarah E.; Forcella, Simona; Smith, Kristine M.; Hamilton, Keith; Jebara, Karim B.; Swayne, David E.; Webby, Richard J.; Mumford, Elizabeth; Mazet, Jonna A.K.; Gaidet, Nicolas; Daszak, Peter
2015-01-01
Wild birds play a major role in the evolution, maintenance, and spread of avian influenza viruses. However, surveillance for these viruses in wild birds is sporadic, geographically biased, and often limited to the last outbreak virus. To identify opportunities to optimize wild bird surveillance for understanding viral diversity, we reviewed responses to a World Organisation for Animal Health–administered survey, government reports to this organization, articles on Web of Knowledge, and the Influenza Research Database. At least 119 countries conducted avian influenza virus surveillance in wild birds during 2008–2013, but coordination and standardization was lacking among surveillance efforts, and most focused on limited subsets of influenza viruses. Given high financial and public health burdens of recent avian influenza outbreaks, we call for sustained, cost-effective investments in locations with high avian influenza diversity in wild birds and efforts to promote standardized sampling, testing, and reporting methods, including full-genome sequencing and sharing of isolates with the scientific community. PMID:25811221
Cross-host evolution of severe acute respiratory syndrome coronavirus in palm civet and human
Song, Huai-Dong; Tu, Chang-Chun; Zhang, Guo-Wei; Wang, Sheng-Yue; Zheng, Kui; Lei, Lian-Cheng; Chen, Qiu-Xia; Gao, Yu-Wei; Zhou, Hui-Qiong; Xiang, Hua; Zheng, Hua-Jun; Chern, Shur-Wern Wang; Cheng, Feng; Pan, Chun-Ming; Xuan, Hua; Chen, Sai-Juan; Luo, Hui-Ming; Zhou, Duan-Hua; Liu, Yu-Fei; He, Jian-Feng; Qin, Peng-Zhe; Li, Ling-Hui; Ren, Yu-Qi; Liang, Wen-Jia; Yu, Ye-Dong; Anderson, Larry; Wang, Ming; Xu, Rui-Heng; Wu, Xin-Wei; Zheng, Huan-Ying; Chen, Jin-Ding; Liang, Guodong; Gao, Yang; Liao, Ming; Fang, Ling; Jiang, Li-Yun; Li, Hui; Chen, Fang; Di, Biao; He, Li-Juan; Lin, Jin-Yan; Tong, Suxiang; Kong, Xiangang; Du, Lin; Hao, Pei; Tang, Hua; Bernini, Andrea; Yu, Xiao-Jing; Spiga, Ottavia; Guo, Zong-Ming; Pan, Hai-Yan; He, Wei-Zhong; Manuguerra, Jean-Claude; Fontanet, Arnaud; Danchin, Antoine; Niccolai, Neri; Li, Yi-Xue; Wu, Chung-I; Zhao, Guo-Ping
2005-01-01
The genomic sequences of severe acute respiratory syndrome coronaviruses from human and palm civet of the 2003/2004 outbreak in the city of Guangzhou, China, were nearly identical. Phylogenetic analysis suggested an independent viral invasion from animal to human in this new episode. Combining all existing data but excluding singletons, we identified 202 single-nucleotide variations. Among them, 17 are polymorphic in palm civets only. The ratio of nonsynonymous/synonymous nucleotide substitution in palm civets collected 1 yr apart from different geographic locations is very high, suggesting a rapid evolving process of viral proteins in civet as well, much like their adaptation in the human host in the early 2002–2003 epidemic. Major genetic variations in some critical genes, particularly the Spike gene, seemed essential for the transition from animal-to-human transmission to human-to-human transmission, which eventually caused the first severe acute respiratory syndrome outbreak of 2002/2003. PMID:15695582
Recent studies on the developing human hepatocellular carcinoma.
Gerber, M A
1986-01-01
From our knowledge of characteristic phenotypic changes of the preneoplastic lesions during the stepwise evolution of hepatocellular carcinoma (HCC) in experimental models, we are now beginning to define the structural, histochemical, biochemical, antigenic and molecular properties of early HCC and of the putative preneoplastic changes in human liver. Histological, ultrastructural, morphometric and immunohistochemical studies suggest that adenomatous nodules of regenerating and hyperplastic hepatocytes are more likely to represent direct precursors of HCC than dysplastic hepatocytes. Histochemical and immunomorphological investigations show appreciable functional and phenotypic heterogeneity of human HCC as previously recognized in experimental hepatocarcinogenesis. Studies of altered expression of oncogenes in the regenerating liver and HCC are beginning to define the molecular mechanisms in cell growth and malignant transformation. Although integration of Hepadna viral DNA sequences frequently occurs during persistent infection in man and animals, the exact mechanism of viral oncogenesis remains to be elucidated. It is likely that the development of monoclonal antibodies to surface antigens on transformed hepatocytes will be useful for exploring lineage relationships between the cell populations involved in hepatocarcinogenesis.
Viral factors in influenza pandemic risk assessment
Lipsitch, Marc; Barclay, Wendy; Raman, Rahul; Russell, Charles J; Belser, Jessica A; Cobey, Sarah; Kasson, Peter M; Lloyd-Smith, James O; Maurer-Stroh, Sebastian; Riley, Steven; Beauchemin, Catherine AA; Bedford, Trevor; Friedrich, Thomas C; Handel, Andreas; Herfst, Sander; Murcia, Pablo R; Roche, Benjamin; Wilke, Claus O; Russell, Colin A
2016-01-01
The threat of an influenza A virus pandemic stems from continual virus spillovers from reservoir species, a tiny fraction of which spark sustained transmission in humans. To date, no pandemic emergence of a new influenza strain has been preceded by detection of a closely related precursor in an animal or human. Nonetheless, influenza surveillance efforts are expanding, prompting a need for tools to assess the pandemic risk posed by a detected virus. The goal would be to use genetic sequence and/or biological assays of viral traits to identify those non-human influenza viruses with the greatest risk of evolving into pandemic threats, and/or to understand drivers of such evolution, to prioritize pandemic prevention or response measures. We describe such efforts, identify progress and ongoing challenges, and discuss three specific traits of influenza viruses (hemagglutinin receptor binding specificity, hemagglutinin pH of activation, and polymerase complex efficiency) that contribute to pandemic risk. DOI: http://dx.doi.org/10.7554/eLife.18491.001 PMID:27834632
Zhang, Shu; McDonald, Paul W.; Thompson, Travis A.; Dennis, Allison F.; Akopov, Asmik; Kirkness, Ewen F.; Patton, John T.
2014-01-01
ABSTRACT Rotaviruses (RVs) are 11-segmented, double-stranded RNA viruses that cause severe gastroenteritis in children. In addition to an error-prone genome replication mechanism, RVs can increase their genetic diversity by reassorting genes during host coinfection. Such exchanges allow RVs to acquire advantageous genes and adapt in the face of selective pressures. However, reassortment may also impose fitness costs if it unlinks genes/proteins that have accumulated compensatory, coadaptive mutations and that operate best when kept together. To better understand human RV evolutionary dynamics, we analyzed the genome sequences of 135 strains (genotype G1/G3/G4-P[8]-I1-C1-R1-A1-N1-T1-E1-H1) that were collected at a single location in Washington, DC, during the years 1974 to 1991. Intragenotypic phylogenetic trees were constructed for each viral gene using the nucleotide sequences, thereby defining novel allele level gene constellations (GCs) and illuminating putative reassortment events. The results showed that RVs with distinct GCs cocirculated during the vast majority of the collection years and that some of these GCs persisted in the community unchanged by reassortment. To investigate the influence of protein coadaptation on GC maintenance, we performed a mutual information-based analysis of the concatenated amino acid sequences and identified an extensive covariance network. Unexpectedly, amino acid covariation was highest between VP4 and VP2, which are structural components of the RV virion that are not thought to directly interact. These results suggest that GCs may be influenced by the selective constraints placed on functionally coadapted, albeit noninteracting, viral proteins. This work raises important questions about mutation-reassortment interplay and its impact on human RV evolution. IMPORTANCE Rotaviruses are devastating human pathogens that cause severe diarrhea and kill >450,000 children each year. The virus can evolve by accumulating mutations and by acquiring new genes from other strains via a process called reassortment. However, little is known about the relationship between mutation accumulation and gene reassortment for rotaviruses and how it impacts viral evolution. In this study, we analyzed the genome sequences of human strains found in clinical fecal specimens that were collected at a single hospital over an 18-year time span. We found that many rotaviruses did not reassort their genes but instead maintained them as specific sets (i.e., constellations). By analyzing the encoded proteins, we discovered concurrent amino acid changes among them, which suggests that they are functionally coadapted to operate best when kept together. This study increases our understanding of how rotaviruses evolve over time in the human population. PMID:24942570
Stenglein, Mark D.; Jacobson, Elliott R.; Chang, Li-Wen; Sanders, Chris; Hawkins, Michelle G.; Guzman, David S-M.; Drazenovich, Tracy; Dunker, Freeland; Kamaka, Elizabeth K.; Fisher, Debbie; Reavill, Drury R.; Meola, Linda F.; Levens, Gregory; DeRisi, Joseph L.
2015-01-01
Arenaviruses are one of the largest families of human hemorrhagic fever viruses and are known to infect both mammals and snakes. Arenaviruses package a large (L) and small (S) genome segment in their virions. For segmented RNA viruses like these, novel genotypes can be generated through mutation, recombination, and reassortment. Although it is believed that an ancient recombination event led to the emergence of a new lineage of mammalian arenaviruses, neither recombination nor reassortment has been definitively documented in natural arenavirus infections. Here, we used metagenomic sequencing to survey the viral diversity present in captive arenavirus-infected snakes. From 48 infected animals, we determined the complete or near complete sequence of 210 genome segments that grouped into 23 L and 11 S genotypes. The majority of snakes were multiply infected, with up to 4 distinct S and 11 distinct L segment genotypes in individual animals. This S/L imbalance was typical: in all cases intrahost L segment genotypes outnumbered S genotypes, and a particular S segment genotype dominated in individual animals and at a population level. We corroborated sequencing results by qRT-PCR and virus isolation, and isolates replicated as ensembles in culture. Numerous instances of recombination and reassortment were detected, including recombinant segments with unusual organizations featuring 2 intergenic regions and superfluous content, which were capable of stable replication and transmission despite their atypical structures. Overall, this represents intrahost diversity of an extent and form that goes well beyond what has been observed for arenaviruses or for viruses in general. This diversity can be plausibly attributed to the captive intermingling of sub-clinically infected wild-caught snakes. Thus, beyond providing a unique opportunity to study arenavirus evolution and adaptation, these findings allow the investigation of unintended anthropogenic impacts on viral ecology, diversity, and disease potential. PMID:25993603
DOE Office of Scientific and Technical Information (OSTI.GOV)
Zhou, C
2009-11-12
In FY09 they will (1) complete the implementation, verification, calibration, and sensitivity and scalability analysis of the in-cell virus replication model; (2) complete the design of the cell culture (cell-to-cell infection) model; (3) continue the research, design, and development of their bioinformatics tools: the Web-based structure-alignment-based sequence variability tool and the functional annotation of the genome database; (4) collaborate with the University of California at San Francisco on areas of common interest; and (5) submit journal articles that describe the in-cell model with simulations and the bioinformatics approaches to evaluation of genome variability and fitness.
RNA viruses in trypanosomatid parasites: a historical overview
Grybchuk, Danyil; Kostygov, Alexei Y; Macedo, Diego H; d’Avila-Levy, Claudia M; Yurchenko, Vyacheslav
2018-01-01
Viruses of trypanosomatids are now being extensively studied because of their diversity and the roles they play in flagellates’ biology. Among the most prominent examples are leishmaniaviruses implicated in pathogenesis of Leishmania parasites. Here, we present a historical overview of this field, starting with early reports of virus-like particles on electron microphotographs, and culminating in detailed molecular descriptions of viruses obtained using modern next generation sequencing-based techniques. Because of their diversity, different life cycle strategies and host specificity, we believe that trypanosomatids are a fertile ground for further explorations to better understand viral evolution, routes of transitions, and molecular mechanisms of adaptation to different hosts. PMID:29513877
RNA viruses in trypanosomatid parasites: a historical overview.
Grybchuk, Danyil; Kostygov, Alexei Y; Macedo, Diego H; d'Avila-Levy, Claudia M; Yurchenko, Vyacheslav
2018-02-19
Viruses of trypanosomatids are now being extensively studied because of their diversity and the roles they play in flagellates' biology. Among the most prominent examples are leishmaniaviruses implicated in pathogenesis of Leishmania parasites. Here, we present a historical overview of this field, starting with early reports of virus-like particles on electron microphotographs, and culminating in detailed molecular descriptions of viruses obtained using modern next generation sequencing-based techniques. Because of their diversity, different life cycle strategies and host specificity, we believe that trypanosomatids are a fertile ground for further explorations to better understand viral evolution, routes of transitions, and molecular mechanisms of adaptation to different hosts.
deCamp, Allan C; Rolland, Morgane; Edlefsen, Paul T; Sanders-Buell, Eric; Hall, Breana; Magaret, Craig A; Fiore-Gartland, Andrew J; Juraska, Michal; Carpp, Lindsay N; Karuna, Shelly T; Bose, Meera; LePore, Steven; Miller, Shana; O'Sullivan, Annemarie; Poltavee, Kultida; Bai, Hongjun; Dommaraju, Kalpana; Zhao, Hong; Wong, Kim; Chen, Lennie; Ahmed, Hasan; Goodman, Derrick; Tay, Matthew Z; Gottardo, Raphael; Koup, Richard A; Bailer, Robert; Mascola, John R; Graham, Barney S; Roederer, Mario; O'Connell, Robert J; Michael, Nelson L; Robb, Merlin L; Adams, Elizabeth; D'Souza, Patricia; Kublin, James; Corey, Lawrence; Geraghty, Daniel E; Frahm, Nicole; Tomaras, Georgia D; McElrath, M Juliana; Frenkel, Lisa; Styrchak, Sheila; Tovanabutra, Sodsai; Sobieszczyk, Magdalena E; Hammer, Scott M; Kim, Jerome H; Mullins, James I; Gilbert, Peter B
2017-01-01
Although the HVTN 505 DNA/recombinant adenovirus type 5 vector HIV-1 vaccine trial showed no overall efficacy, analysis of breakthrough HIV-1 sequences in participants can help determine whether vaccine-induced immune responses impacted viruses that caused infection. We analyzed 480 HIV-1 genomes sampled from 27 vaccine and 20 placebo recipients and found that intra-host HIV-1 diversity was significantly lower in vaccine recipients (P ≤ 0.04, Q-values ≤ 0.09) in Gag, Pol, Vif and envelope glycoprotein gp120 (Env-gp120). Furthermore, Env-gp120 sequences from vaccine recipients were significantly more distant from the subtype B vaccine insert than sequences from placebo recipients (P = 0.01, Q-value = 0.12). These vaccine effects were associated with signatures mapping to CD4 binding site and CD4-induced monoclonal antibody footprints. These results suggest either (i) no vaccine efficacy to block acquisition of any viral genotype but vaccine-accelerated Env evolution post-acquisition; or (ii) vaccine efficacy against HIV-1s with Env sequences closest to the vaccine insert combined with increased acquisition due to other factors, potentially including the vaccine vector.
Edlefsen, Paul T.; Sanders-Buell, Eric; Hall, Breana; Magaret, Craig A.; Fiore-Gartland, Andrew J.; Juraska, Michal; Carpp, Lindsay N.; Karuna, Shelly T.; Bose, Meera; LePore, Steven; Miller, Shana; O'Sullivan, Annemarie; Poltavee, Kultida; Bai, Hongjun; Dommaraju, Kalpana; Zhao, Hong; Wong, Kim; Chen, Lennie; Ahmed, Hasan; Goodman, Derrick; Tay, Matthew Z.; Gottardo, Raphael; Koup, Richard A.; Bailer, Robert; Mascola, John R.; Graham, Barney S.; Roederer, Mario; O’Connell, Robert J.; Michael, Nelson L.; Robb, Merlin L.; Adams, Elizabeth; D’Souza, Patricia; Kublin, James; Corey, Lawrence; Geraghty, Daniel E.; Frahm, Nicole; Tomaras, Georgia D.; McElrath, M. Juliana; Frenkel, Lisa; Styrchak, Sheila; Tovanabutra, Sodsai; Sobieszczyk, Magdalena E.; Hammer, Scott M.; Kim, Jerome H.; Mullins, James I.; Gilbert, Peter B.
2017-01-01
Although the HVTN 505 DNA/recombinant adenovirus type 5 vector HIV-1 vaccine trial showed no overall efficacy, analysis of breakthrough HIV-1 sequences in participants can help determine whether vaccine-induced immune responses impacted viruses that caused infection. We analyzed 480 HIV-1 genomes sampled from 27 vaccine and 20 placebo recipients and found that intra-host HIV-1 diversity was significantly lower in vaccine recipients (P ≤ 0.04, Q-values ≤ 0.09) in Gag, Pol, Vif and envelope glycoprotein gp120 (Env-gp120). Furthermore, Env-gp120 sequences from vaccine recipients were significantly more distant from the subtype B vaccine insert than sequences from placebo recipients (P = 0.01, Q-value = 0.12). These vaccine effects were associated with signatures mapping to CD4 binding site and CD4-induced monoclonal antibody footprints. These results suggest either (i) no vaccine efficacy to block acquisition of any viral genotype but vaccine-accelerated Env evolution post-acquisition; or (ii) vaccine efficacy against HIV-1s with Env sequences closest to the vaccine insert combined with increased acquisition due to other factors, potentially including the vaccine vector. PMID:29149197
Pace of Coreceptor Tropism Switch in HIV-1-Infected Individuals after Recent Infection.
Arif, Muhammad Shoaib; Hunter, James; Léda, Ana Rachel; Zukurov, Jean Paulo Lopes; Samer, Sadia; Camargo, Michelle; Galinskas, Juliana; Kallás, Esper Georges; Komninakis, Shirley Vasconcelos; Janini, Luiz Mario; Sucupira, Maria Cecilia; Diaz, Ricardo Sobhie
2017-10-01
HIV-1 entry into target cells influences several aspects of HIV-1 pathogenesis, including viral tropism, HIV-1 transmission and disease progression, and response to entry inhibitors. The evolution from CCR5- to CXCR4-using strains in a given human host is still unpredictable. Here we analyzed timing and predictors for coreceptor evolution among recently HIV-1-infected individuals. Proviral DNA was longitudinally evaluated in 66 individuals using Geno2pheno [coreceptor] Demographics, viral load, CD4 + and CD8 + T cell counts, CCR5Δ32 polymorphisms, GB virus C (GBV-C) coinfection, and HLA profiles were also evaluated. Ultradeep sequencing was performed on initial samples from 11 selected individuals. A tropism switch from CCR5- to CXCR4-using strains was identified in 9/49 (18.4%) individuals. Only a low baseline false-positive rate (FPR) was found to be a significant tropism switch predictor. No minor CXCR4-using variants were identified in initial samples of 4 of 5 R5/non-R5 switchers. Logistic regression analysis showed that patients with an FPR of >40.6% at baseline presented a stable FPR over time whereas lower FPRs tend to progressively decay, leading to emergence of CXCR4-using strains, with a mean evolution time of 27.29 months (range, 8.90 to 64.62). An FPR threshold above 40.6% determined by logistic regression analysis may make it unnecessary to further determine tropism for prediction of disease progression related to emergence of X4 strains or use of CCR5 antagonists. The detection of variants with intermediate FPRs and progressive FPR decay over time not only strengthens the power of Geno2pheno in predicting HIV tropism but also indirectly confirms a continuous evolution from earlier R5 variants toward CXCR4-using strains. IMPORTANCE The introduction of CCR5 antagonists in the antiretroviral arsenal has sparked interest in coreceptors utilized by HIV-1. Despite concentrated efforts, viral and human host features predicting tropism switch are still poorly understood. Limited longitudinal data are available to assess the influence that these factors have on predicting tropism switch and disease progression. The present study describes longitudinal tropism evolution in a group of recently HIV-infected individuals to determine the prevalence and potential correlates of tropism switch. We demonstrated here that a low baseline FPR determined by the Geno2pheno [coreceptor] algorithm can predict tropism evolution from CCR5 to CXCR4 coreceptor use. Copyright © 2017 American Society for Microbiology.
De novo selection of oncogenes.
Chacón, Kelly M; Petti, Lisa M; Scheideman, Elizabeth H; Pirazzoli, Valentina; Politi, Katerina; DiMaio, Daniel
2014-01-07
All cellular proteins are derived from preexisting ones by natural selection. Because of the random nature of this process, many potentially useful protein structures never arose or were discarded during evolution. Here, we used a single round of genetic selection in mouse cells to isolate chemically simple, biologically active transmembrane proteins that do not contain any amino acid sequences from preexisting proteins. We screened a retroviral library expressing hundreds of thousands of proteins consisting of hydrophobic amino acids in random order to isolate four 29-aa proteins that induced focus formation in mouse and human fibroblasts and tumors in mice. These proteins share no amino acid sequences with known cellular or viral proteins, and the simplest of them contains only seven different amino acids. They transformed cells by forming a stable complex with the platelet-derived growth factor β receptor transmembrane domain and causing ligand-independent receptor activation. We term this approach de novo selection and suggest that it can be used to generate structures and activities not observed in nature, create prototypes for novel research reagents and therapeutics, and provide insight into cell biology, transmembrane protein-protein interactions, and possibly virus evolution and the origin of life.
Rapid Sequencing of Complete env Genes from Primary HIV-1 Samples.
Laird Smith, Melissa; Murrell, Ben; Eren, Kemal; Ignacio, Caroline; Landais, Elise; Weaver, Steven; Phung, Pham; Ludka, Colleen; Hepler, Lance; Caballero, Gemma; Pollner, Tristan; Guo, Yan; Richman, Douglas; Poignard, Pascal; Paxinos, Ellen E; Kosakovsky Pond, Sergei L; Smith, Davey M
2016-07-01
The ability to study rapidly evolving viral populations has been constrained by the read length of next-generation sequencing approaches and the sampling depth of single-genome amplification methods. Here, we develop and characterize a method using Pacific Biosciences' Single Molecule, Real-Time (SMRT®) sequencing technology to sequence multiple, intact full-length human immunodeficiency virus-1 env genes amplified from viral RNA populations circulating in blood, and provide computational tools for analyzing and visualizing these data.
Goya, Stephanie; Valinotto, Laura E; Tittarelli, Estefania; Rojo, Gabriel L; Nabaes Jodar, Mercedes S; Greninger, Alexander L; Zaiat, Jonathan J; Marti, Marcelo A; Mistchenko, Alicia S; Viegas, Mariana
2018-01-01
Over the last decade, the number of viral genome sequences deposited in available databases has grown exponentially. However, sequencing methodology vary widely and many published works have relied on viral enrichment by viral culture or nucleic acid amplification with specific primers rather than through unbiased techniques such as metagenomics. The genome of RNA viruses is highly variable and these enrichment methodologies may be difficult to achieve or may bias the results. In order to obtain genomic sequences of human respiratory syncytial virus (HRSV) from positive nasopharyngeal aspirates diverse methodologies were evaluated and compared. A total of 29 nearly complete and complete viral genomes were obtained. The best performance was achieved with a DNase I treatment to the RNA directly extracted from the nasopharyngeal aspirate (NPA), sequence-independent single-primer amplification (SISPA) and library preparation performed with Nextera XT DNA Library Prep Kit with manual normalization. An average of 633,789 and 1,674,845 filtered reads per library were obtained with MiSeq and NextSeq 500 platforms, respectively. The higher output of NextSeq 500 was accompanied by the increasing of duplicated reads percentage generated during SISPA (from an average of 1.5% duplicated viral reads in MiSeq to an average of 74% in NextSeq 500). HRSV genome recovery was not affected by the presence or absence of duplicated reads but the computational demand during the analysis was increased. Considering that only samples with viral load ≥ E+06 copies/ml NPA were tested, no correlation between sample viral loads and number of total filtered reads was observed, nor with the mapped viral reads. The HRSV genomes showed a mean coverage of 98.46% with the best methodology. In addition, genomes of human metapneumovirus (HMPV), human rhinovirus (HRV) and human parainfluenza virus types 1-3 (HPIV1-3) were also obtained with the selected optimal methodology.
Vidigal, Pedro M P; Mafra, Claudio L; Silva, Fernanda M F; Fietto, Juliana L R; Silva Júnior, Abelardo; Almeida, Márcia R
2012-01-01
Porcine circovirus-2 (PCV-2) is an emerging virus associated with a number of different syndromes in pigs known as Porcine Circovirus Associated Diseases (PCVAD). Since its identification and characterization in the early 1990s, PCV-2 has achieved a worldwide distribution, becoming endemic in most pig-producing countries, and is currently considered as the main cause of losses on pig farms. In this study, we analyzed the main routes of the spread of PCV-2 between pig-producing countries using phylogenetic and phylogeographical approaches. A search for PCV-2 genome sequences in GenBank was performed, and the 420 PCV-2 sequences obtained were grouped into haplotypes (group of sequences that showed 100% identity), based on the infinite sites model of genome evolution. A phylogenetic hypothesis was inferred by Bayesian Inference for the classification of viral strains and a haplotype network was constructed by Median Joining to predict the geographical distribution of and genealogical relationships between haplotypes. In order to establish an epidemiological and economic context in these analyses, we considered all information about PCV-2 sequences available in GenBank, including papers published on viral isolation, and live pig trading statistics available on the UN Comtrade database (http://comtrade.un.org/). In these analyses, we identified a strong correlation between the means of PCV-2 dispersal predicted by the haplotype network and the statistics on the international trading of live pigs. This correlation provides a new perspective on the epidemiology of PCV-2, highlighting the importance of the movement of animals around the world in the emergence of new pathogens, and showing the need for effective sanitary barriers when trading live animals. Copyright © 2011 Elsevier B.V. All rights reserved.
Diverse novel astroviruses identified in wild Himalayan marmots.
Ao, Yuan-Yun; Yu, Jie-Mei; Li, Li-Li; Cao, Jing-Yuan; Deng, Hong-Yan; Xin, Yun-Yun; Liu, Meng-Meng; Lin, Lin; Lu, Shan; Xu, Jian-Guo; Duan, Zhao-Jun
2017-04-01
With advances in viral surveillance and next-generation sequencing, highly diverse novel astroviruses (AstVs) and different animal hosts had been discovered in recent years. However, the existence of AstVs in marmots had yet to be shown. Here, we identified two highly divergent strains of AstVs (tentatively named Qinghai Himalayanmarmot AstVs, HHMAstV1 and HHMAstV2), by viral metagenomic analysis in liver tissues isolated from wild Marmota himalayana in China. Overall, 12 of 99 (12.1 %) M. himalayana faecal samples were positive for the presence of genetically diverse AstVs, while only HHMAstV1 and HHMAstV2 were identified in 300 liver samples. The complete genomic sequences of HHMAstV1 and HHMAstV2 were 6681 and 6610 nt in length, respectively, with the typical genomic organization of AstVs. Analysis of the complete ORF 2 sequence showed that these novel AstVs are most closely related to the rabbit AstV, mamastrovirus 23 (with 31.0 and 48.0 % shared amino acid identity, respectively). Phylogenetic analysis of the amino acid sequences of ORF1a, ORF1b and ORF2 indicated that HHMAstV1 and HHMAstV2 form two distinct clusters among the mamastroviruses, and may share a common ancestor with the rabbit-specific mamastrovirus 23. These results suggest that HHMAstV1 and HHMAstV2 are two novel species of the genus Mamastrovirus in the Astroviridae. The remarkable diversity of these novel AstVs will contribute to a greater understanding of the evolution and ecology of AstVs, although additional studies will be needed to understand the clinical significance of these novel AstVs in marmots, as well as in humans.
Lemey, Philippe; Rambaut, Andrew; Bedford, Trevor; Faria, Nuno; Bielejec, Filip; Baele, Guy; Russell, Colin A; Smith, Derek J; Pybus, Oliver G; Brockmann, Dirk; Suchard, Marc A
2014-02-01
Information on global human movement patterns is central to spatial epidemiological models used to predict the behavior of influenza and other infectious diseases. Yet it remains difficult to test which modes of dispersal drive pathogen spread at various geographic scales using standard epidemiological data alone. Evolutionary analyses of pathogen genome sequences increasingly provide insights into the spatial dynamics of influenza viruses, but to date they have largely neglected the wealth of information on human mobility, mainly because no statistical framework exists within which viral gene sequences and empirical data on host movement can be combined. Here, we address this problem by applying a phylogeographic approach to elucidate the global spread of human influenza subtype H3N2 and assess its ability to predict the spatial spread of human influenza A viruses worldwide. Using a framework that estimates the migration history of human influenza while simultaneously testing and quantifying a range of potential predictive variables of spatial spread, we show that the global dynamics of influenza H3N2 are driven by air passenger flows, whereas at more local scales spread is also determined by processes that correlate with geographic distance. Our analyses further confirm a central role for mainland China and Southeast Asia in maintaining a source population for global influenza diversity. By comparing model output with the known pandemic expansion of H1N1 during 2009, we demonstrate that predictions of influenza spatial spread are most accurate when data on human mobility and viral evolution are integrated. In conclusion, the global dynamics of influenza viruses are best explained by combining human mobility data with the spatial information inherent in sampled viral genomes. The integrated approach introduced here offers great potential for epidemiological surveillance through phylogeographic reconstructions and for improving predictive models of disease control.
Marine, Rachel L; Nasko, Daniel J; Wray, Jeffrey; Polson, Shawn W; Wommack, K Eric
2017-01-01
Chaperonins are protein-folding machinery found in all cellular life. Chaperonin genes have been documented within a few viruses, yet, surprisingly, analysis of metagenome sequence data indicated that chaperonin-carrying viruses are common and geographically widespread in marine ecosystems. Also unexpected was the discovery of viral chaperonin sequences related to thermosome proteins of archaea, indicating the presence of virioplankton populations infecting marine archaeal hosts. Virioplankton large subunit chaperonin sequences (GroELs) were divergent from bacterial sequences, indicating that viruses have carried this gene over long evolutionary time. Analysis of viral metagenome contigs indicated that: the order of large and small subunit genes was linked to the phylogeny of GroEL; both lytic and temperate phages may carry group I chaperonin genes; and viruses carrying a GroEL gene likely have large double-stranded DNA (dsDNA) genomes (>70 kb). Given these connections, it is likely that chaperonins are critical to the biology and ecology of virioplankton populations that carry these genes. Moreover, these discoveries raise the intriguing possibility that viral chaperonins may more broadly alter the structure and function of viral and cellular proteins in infected host cells. PMID:28731469
Marine, Rachel L; Nasko, Daniel J; Wray, Jeffrey; Polson, Shawn W; Wommack, K Eric
2017-11-01
Chaperonins are protein-folding machinery found in all cellular life. Chaperonin genes have been documented within a few viruses, yet, surprisingly, analysis of metagenome sequence data indicated that chaperonin-carrying viruses are common and geographically widespread in marine ecosystems. Also unexpected was the discovery of viral chaperonin sequences related to thermosome proteins of archaea, indicating the presence of virioplankton populations infecting marine archaeal hosts. Virioplankton large subunit chaperonin sequences (GroELs) were divergent from bacterial sequences, indicating that viruses have carried this gene over long evolutionary time. Analysis of viral metagenome contigs indicated that: the order of large and small subunit genes was linked to the phylogeny of GroEL; both lytic and temperate phages may carry group I chaperonin genes; and viruses carrying a GroEL gene likely have large double-stranded DNA (dsDNA) genomes (>70 kb). Given these connections, it is likely that chaperonins are critical to the biology and ecology of virioplankton populations that carry these genes. Moreover, these discoveries raise the intriguing possibility that viral chaperonins may more broadly alter the structure and function of viral and cellular proteins in infected host cells.
Vinner, Lasse; Mourier, Tobias; Friis-Nielsen, Jens; Gniadecki, Robert; Dybkaer, Karen; Rosenberg, Jacob; Langhoff, Jill Levin; Cruz, David Flores Santa; Fonager, Jannik; Izarzugaza, Jose M G; Gupta, Ramneek; Sicheritz-Ponten, Thomas; Brunak, Søren; Willerslev, Eske; Nielsen, Lars Peter; Hansen, Anders Johannes
2015-08-19
Although nearly one fifth of all human cancers have an infectious aetiology, the causes for the majority of cancers remain unexplained. Despite the enormous data output from high-throughput shotgun sequencing, viral DNA in a clinical sample typically constitutes a proportion of host DNA that is too small to be detected. Sequence variation among virus genomes complicates application of sequence-specific, and highly sensitive, PCR methods. Therefore, we aimed to develop and characterize a method that permits sensitive detection of sequences despite considerable variation. We demonstrate that our low-stringency in-solution hybridization method enables detection of <100 viral copies. Furthermore, distantly related proviral sequences may be enriched by orders of magnitude, enabling discovery of hitherto unknown viral sequences by high-throughput sequencing. The sensitivity was sufficient to detect retroviral sequences in clinical samples. We used this method to conduct an investigation for novel retrovirus in samples from three cancer types. In accordance with recent studies our investigation revealed no retroviral infections in human B-cell lymphoma cells, cutaneous T-cell lymphoma or colorectal cancer biopsies. Nonetheless, our generally applicable method makes sensitive detection possible and permits sequencing of distantly related sequences from complex material.
Kundu, Rhiannon; Knight, Robin; Dunga, Meenakshi; Peakman, Mark
2018-01-01
Coxsackie B Virus (CBV) infection has been linked to the aetiology of type 1 diabetes (T1D) and vaccination has been proposed as prophylaxis for disease prevention. Serum neutralising antibodies and the presence of viral protein and RNA in tissues have been common tools to examine this potential disease relationship, whilst the role of anti-CBV cytotoxic T cell responses and their targets have not been studied. To address this knowledge gap, we augmented conventional HLA-binding predictive algorithm-based epitope discovery by cross-referencing epitopes with sites of positive natural selection within the CBV3 viral genome, identified using mixed effects models of evolution. Eight epitopes for the common MHC class I allele HLA-A*0201 occur at sites that appear to be positively selected. Furthermore, such epitopes span the viral genome, indicating that effective anti-viral responses may not be restricted to the capsid region. To assess the spectrum of IFNy responses in non-diabetic subjects and recently diagnosed type 1 diabetes (T1D) patients, we stimulated PBMC ex vivo with pools of synthetic peptides based on component-restricted sequences identified in silico. We found responders were more likely to recognize multiple rather than a single CBV peptide pool, indicating that the natural course of infection results in multiple targets for effector memory responses, rather than immunodominant epitopes or viral components. The finding that anti-CBV CD8 T cell immunity is broadly targeted has implications for vaccination strategies and studies on the pathogenesis of CBV-linked diseases.
Kassaye, Seble; Johnston, Elizabeth; McColgan, Bryan; Kantor, Rami; Zijenah, Lynn; Katzenstein, David
2009-01-01
In resource-constrained settings, antiretroviral treatment (ART) is often continued based on clinical and CD4 responses, without virologic monitoring. ART with incomplete viral suppression was assessed in 27 subjects with subtype C HIV-1 by measuring plasma HIV-1 RNA, drug resistance, viral tropism, and evolution in polymerase (pol) and envelope (env) genes. The association between these viral parameters and CD4 cell change over time was analyzed using linear regression models. Increased area under the curve of HIV-1 RNA replication was a predictor of lower CD4 cell gains (p <0.007), while less drug resistance measured as a genotypic susceptibility score (GSS) (p=0.065), and lower rates of evolution in pol and env genes (p= 0.08 and 0.097, respectively) measured as genetic distance were modestly associated with increasing CD4 cell counts. Evolution of pol and env were correlated (R2 = 0.48, p=0.005), however, greater evolution was identified in env vs. pol (p <0.05). CXCR4-usage (X4) was detected in 14/27 (52%) but no differences in CD4 cell change or plasma viremia were associated with X4-usage. Among subtype C HIV-1 infected patients in Zimbabwe receiving incompletely suppressive ART, higher virus replication and lower CD4 cell gains were associated with drug resistance and evolution of polymerase and envelope. PMID:19295330
Neutral Theory and Rapidly Evolving Viral Pathogens.
Frost, Simon D W; Magalis, Brittany Rife; Kosakovsky Pond, Sergei L
2018-06-01
The evolution of viral pathogens is shaped by strong selective forces that are exerted during jumps to new hosts, confrontations with host immune responses and antiviral drugs, and numerous other processes. However, while undeniably strong and frequent, adaptive evolution is largely confined to small parts of information-packed viral genomes, and the majority of observed variation is effectively neutral. The predictions and implications of the neutral theory have proven immensely useful in this context, with applications spanning understanding within-host population structure, tracing the origins and spread of viral pathogens, predicting evolutionary dynamics, and modeling the emergence of drug resistance. We highlight the multiple ways in which the neutral theory has had an impact, which has been accelerated in the age of high-throughput, high-resolution genomics.
A comprehensive and quantitative exploration of thousands of viral genomes
Mahmoudabadi, Gita
2018-01-01
The complete assembly of viral genomes from metagenomic datasets (short genomic sequences gathered from environmental samples) has proven to be challenging, so there are significant blind spots when we view viral genomes through the lens of metagenomics. One approach to overcoming this problem is to leverage the thousands of complete viral genomes that are publicly available. Here we describe our efforts to assemble a comprehensive resource that provides a quantitative snapshot of viral genomic trends – such as gene density, noncoding percentage, and abundances of functional gene categories – across thousands of viral genomes. We have also developed a coarse-grained method for visualizing viral genome organization for hundreds of genomes at once, and have explored the extent of the overlap between bacterial and bacteriophage gene pools. Existing viral classification systems were developed prior to the sequencing era, so we present our analysis in a way that allows us to assess the utility of the different classification systems for capturing genomic trends. PMID:29624169
A comprehensive and quantitative exploration of thousands of viral genomes.
Mahmoudabadi, Gita; Phillips, Rob
2018-04-19
The complete assembly of viral genomes from metagenomic datasets (short genomic sequences gathered from environmental samples) has proven to be challenging, so there are significant blind spots when we view viral genomes through the lens of metagenomics. One approach to overcoming this problem is to leverage the thousands of complete viral genomes that are publicly available. Here we describe our efforts to assemble a comprehensive resource that provides a quantitative snapshot of viral genomic trends - such as gene density, noncoding percentage, and abundances of functional gene categories - across thousands of viral genomes. We have also developed a coarse-grained method for visualizing viral genome organization for hundreds of genomes at once, and have explored the extent of the overlap between bacterial and bacteriophage gene pools. Existing viral classification systems were developed prior to the sequencing era, so we present our analysis in a way that allows us to assess the utility of the different classification systems for capturing genomic trends. © 2018, Mahmoudabadi et al.
Pathways to extinction: beyond the error threshold.
Manrubia, Susanna C; Domingo, Esteban; Lázaro, Ester
2010-06-27
Since the introduction of the quasispecies and the error catastrophe concepts for molecular evolution by Eigen and their subsequent application to viral populations, increased mutagenesis has become a common strategy to cause the extinction of viral infectivity. Nevertheless, the high complexity of virus populations has shown that viral extinction can occur through several other pathways apart from crossing an error threshold. Increases in the mutation rate enhance the appearance of defective forms and promote the selection of mechanisms that are able to counteract the accelerated appearance of mutations. Current models of viral evolution take into account more realistic scenarios that consider compensatory and lethal mutations, a highly redundant genotype-to-phenotype map, rough fitness landscapes relating phenotype and fitness, and where phenotype is described as a set of interdependent traits. Further, viral populations cannot be understood without specifying the characteristics of the environment where they evolve and adapt. Altogether, it turns out that the pathways through which viral quasispecies go extinct are multiple and diverse.
Rojas Sánchez, Patricia; Cobos, Alberto; Navaro, Marisa; Ramos, José Tomas; Pagán, Israel
2017-01-01
Abstract Determining the factors modulating the genetic diversity of HIV-1 populations is essential to understand viral evolution. This study analyzes the relative importance of clinical factors in the intrahost HIV-1 subtype B (HIV-1B) evolution and in the fixation of drug resistance mutations (DRM) during longitudinal pediatric HIV-1 infection. We recovered 162 partial HIV-1B pol sequences (from 3 to 24 per patient) from 24 perinatally infected patients from the Madrid Cohort of HIV-1 infected children and adolescents in a time interval ranging from 2.2 to 20.3 years. We applied machine learning classification methods to analyze the relative importance of 28 clinical/epidemiological/virological factors in the HIV-1B evolution to predict HIV-1B genetic diversity (d), nonsynonymous and synonymous mutations (dN, dS) and DRM presence. Most of the 24 HIV-1B infected pediatric patients were Spanish (91.7%), diagnosed before 2000 (83.3%), and all were antiretroviral therapy experienced. They had from 0.3 to 18.8 years of HIV-1 exposure at sampling time. Most sequences presented DRM. The best-predictor variables for HIV-1B evolutionary parameters were the age of HIV-1 diagnosis for d, the age at first antiretroviral treatment for dN and the year of HIV-1 diagnosis for ds. The year of infection (birth year) and year of sampling seemed to be relevant for fixation of both DRM at large and, considering drug families, to protease inhibitors (PI). This study identifies, for the first time using machine learning, the factors affecting more HIV-1B pol evolution and those affecting DRM fixation in HIV-1B infected pediatric patients. PMID:29044435
A gyrovirus infecting a sea bird
Li, Linlin; Pesavento, Patricia A.; Gaynor, Anne M.; Duerr, Rebecca S.; Phan, Tung Gia; Zhang, Wen; Deng, Xutao
2015-01-01
We characterized the genome of a highly divergent gyrovirus (GyV8) in the spleen and uropygial gland tissues of a diseased northern fulmar (Fulmarus glacialis), a pelagic bird beached in San Francisco, California. No other exogenous viral sequences could be identified using viral metagenomics. The small circular DNA genome shared no significant nucleotide sequence identity, and only 38–42 % amino acid sequence identity in VP1, with any of the previously identified gyroviruses. GyV8 is the first member of the third major phylogenetic clade of this viral genus and the first gyrovirus detected in an avian species other than chicken. PMID:26036564
The nucleotide sequence and genome organization of Plasmopara halstedii virus.
Heller-Dohmen, Marion; Göpfert, Jens C; Pfannstiel, Jens; Spring, Otmar
2011-03-17
Only very few viruses of Oomycetes have been studied in detail. Isometric virions were found in different isolates of the oomycete Plasmopara halstedii, the downy mildew pathogen of sunflower. However, complete nucleotide sequences and data on the genome organization were lacking. Viral RNA of different P. halstedii isolates was subjected to nucleotide sequencing and analysis of the viral genome. The N-terminal sequence of the viral coat protein was determined using Top-Down MALDI-TOF analysis. The complete nucleotide sequences of both single-stranded RNA segments (RNA1 and RNA2) were established. RNA1 consisted of 2793 nucleotides (nt) exclusive its 3' poly(A) tract and a single open-reading frame (ORF1) of 2745 nt. ORF1 was framed by a 5' untranslated region (5' UTR) of 18 nt and a 3' untranslated region (3' UTR) of 30 nt. ORF1 contained motifs of RNA-dependent RNA polymerases (RdRp) and showed similarities to RdRp of Scleropthora macrospora virus A (SmV A) and viruses within the Nodaviridae family. RNA2 consisted of 1526 nt exclusive its 3' poly(A) tract and a second ORF (ORF2) of 1128 nt. ORF2 coded for the single viral coat protein (CP) and was framed by a 5' UTR of 164 nt and a 3' UTR of 234 nt. The deduced amino acid sequence of ORF2 was verified by nano-LC-ESI-MS/MS experiments. Top-Down MALDI-TOF analysis revealed the N-terminal sequence of the CP. The N-terminal sequence represented a region within ORF2 suggesting a proteolytic processing of the CP in vivo. The CP showed similarities to CP of SmV A and viruses within the Tombusviridae family. Fragments of RNA1 (ca. 1.9 kb) and RNA2 (ca. 1.4 kb) were used to analyze the nucleotide sequence variation of virions in different P. halstedii isolates. Viral sequence variation was 0.3% or less regardless of their host's pathotypes, the geographical origin and the sensitivity towards the fungicide metalaxyl. The results showed the presence of a single and new virus type in different P. halstedii isolates. Insignificant viral sequence variation indicated that the virus did not account for differences in pathogenicity of the oomycete P. halstedii.
Holmes, E.C.; Stephenson, A.G.
2014-01-01
Determining the extent and structure of intra-host genetic diversity and the magnitude and impact of population bottlenecks is central to understanding the mechanisms of viral evolution. To determine the nature of viral evolution following systemic movement through a plant, we performed deep sequencing of 23 leaves that grew sequentially along a single Cucurbita pepo vine that was infected with zucchini yellow mosaic virus (ZYMV), and on a leaf that grew in on a side branch. Strikingly, of 112 genetic (i.e. sub-consensus) variants observed in the data set as a whole, only 22 were found in multiple leaves. Similarly, only three of the 13 variants present in the inoculating population were found in the subsequent leaves on the vine. Hence, it appears that systemic movement is characterized by sequential population bottlenecks, although not sufficient to reduce the population to a single virion as multiple variants were consistently transmitted between leaves. In addition, the number of variants within a leaf increases as a function of distance from the inoculated (source) leaf, suggesting that the circulating sap may serve as a continual source of virus. Notably, multiple mutational variants were observed in the cylindrical Inclusion (CI) protein (known to be involved in both cell-to-cell and systemic movement of the virus) that were present in multiple (19/24) leaf samples. These mutations resulted in a conformational change, suggesting that they might confer a selective advantage in systemic movement within the vine. Overall, these data reveal that bottlenecks occur during systemic movement, that variants circulate in the phloem sap throughout the infection process, and that important conformational changes in CI protein may arise during individual infections. PMID:25107623
2011-01-01
Background Foot-and-mouth disease virus (FMDV) uses a highly conserved Arg-Gly-Asp (RGD) triplet for attachment to host cells and this motif is believed to be essential for virus viability. Previous sequence analyses of the 1D-encoding region of an FMDV field isolate (Asia1/JS/CHA/05) and its two derivatives indicated that two viruses, which contained an Arg-Asp-Asp (RDD) or an Arg-Ser-Asp (RSD) triplet instead of the RGD integrin recognition motif, were generated serendipitously upon short-term evolution of field isolate in different biological environments. To examine the influence of single amino acid substitutions in the receptor binding site of the RDD-containing FMD viral genome on virus viability and the ability of non-RGD FMDVs to cause disease in susceptible animals, we constructed an RDD-containing FMDV full-length cDNA clone and derived mutant molecules with RGD or RSD receptor recognition motifs. Following transfection of BSR cells with the full-length genome plasmids, the genetically engineered viruses were examined for their infectious potential in cell culture and susceptible animals. Results Amino acid sequence analysis of the 1D-coding region of different derivatives derived from the Asia1/JS/CHA/05 field isolate revealed that the RDD mutants became dominant or achieved population equilibrium with coexistence of the RGD and RSD subpopulations at an early phase of type Asia1 FMDV quasispecies evolution. Furthermore, the RDD and RSD sequences remained genetically stable for at least 20 passages. Using reverse genetics, the RDD-, RSD-, and RGD-containing FMD viruses were rescued from full-length cDNA clones, and single amino acid substitution in RDD-containing FMD viral genome did not affect virus viability. The genetically engineered viruses replicated stably in BHK-21 cells and had similar growth properties to the parental virus. The RDD parental virus and two non-RGD recombinant viruses were virulent to pigs and bovines that developed typical clinical disease and viremia. Conclusions FMDV quasispecies evolving in a different biological environment gained the capability of selecting different receptor recognition site. The RDD-containing FMD viral genome can accommodate substitutions in the receptor binding site without additional changes in the capsid. The viruses expressing non-RGD receptor binding sites can replicate stably in vitro and produce typical FMD clinical disease in susceptible animals. PMID:21711567
Koppstein, David; Ashour, Joseph; Bartel, David P.
2015-01-01
The influenza polymerase cleaves host RNAs ∼10–13 nucleotides downstream of their 5′ ends and uses this capped fragment to prime viral mRNA synthesis. To better understand this process of cap snatching, we used high-throughput sequencing to determine the 5′ ends of A/WSN/33 (H1N1) influenza mRNAs. The sequences provided clear evidence for nascent-chain realignment during transcription initiation and revealed a strong influence of the viral template on the frequency of realignment. After accounting for the extra nucleotides inserted through realignment, analysis of the capped fragments indicated that the different viral mRNAs were each prepended with a common set of sequences and that the polymerase often cleaved host RNAs after a purine and often primed transcription on a single base pair to either the terminal or penultimate residue of the viral template. We also developed a bioinformatic approach to identify the targeted host transcripts despite limited information content within snatched fragments and found that small nuclear RNAs and small nucleolar RNAs contributed the most abundant capped leaders. These results provide insight into the mechanism of viral transcription initiation and reveal the diversity of the cap-snatched repertoire, showing that noncoding transcripts as well as mRNAs are used to make influenza mRNAs. PMID:25901029
Host-Associated Metagenomics: A Guide to Generating Infectious RNA Viromes
Robert, Catherine; Pascalis, Hervé; Michelle, Caroline; Jardot, Priscilla; Charrel, Rémi; Raoult, Didier; Desnues, Christelle
2015-01-01
Background Metagenomic analyses have been widely used in the last decade to describe viral communities in various environments or to identify the etiology of human, animal, and plant pathologies. Here, we present a simple and standardized protocol that allows for the purification and sequencing of RNA viromes from complex biological samples with an important reduction of host DNA and RNA contaminants, while preserving the infectivity of viral particles. Principal Findings We evaluated different viral purification steps, random reverse transcriptions and sequence-independent amplifications of a pool of representative RNA viruses. Viruses remained infectious after the purification process. We then validated the protocol by sequencing the RNA virome of human body lice engorged in vitro with artificially contaminated human blood. The full genomes of the most abundant viruses absorbed by the lice during the blood meal were successfully sequenced. Interestingly, random amplifications differed in the genome coverage of segmented RNA viruses. Moreover, the majority of reads were taxonomically identified, and only 7–15% of all reads were classified as “unknown”, depending on the random amplification method. Conclusion The protocol reported here could easily be applied to generate RNA viral metagenomes from complex biological samples of different origins. Our protocol allows further virological characterizations of the described viral communities because it preserves the infectivity of viral particles and allows for the isolation of viruses. PMID:26431175
Moser, Lindsey A.; Ramirez-Carvajal, Lisbeth; Puri, Vinita; Pauszek, Steven J.; Matthews, Krystal; Dilley, Kari A.; Mullan, Clancy; McGraw, Jennifer; Khayat, Michael; Beeri, Karen; Yee, Anthony; Dugan, Vivien; Heise, Mark T.; Frieman, Matthew B.; Rodriguez, Luis L.; Bernard, Kristen A.; Wentworth, David E.
2016-01-01
ABSTRACT Several biosafety level 3 and/or 4 (BSL-3/4) pathogens are high-consequence, single-stranded RNA viruses, and their genomes, when introduced into permissive cells, are infectious. Moreover, many of these viruses are select agents (SAs), and their genomes are also considered SAs. For this reason, cDNAs and/or their derivatives must be tested to ensure the absence of infectious virus and/or viral RNA before transfer out of the BSL-3/4 and/or SA laboratory. This tremendously limits the capacity to conduct viral genomic research, particularly the application of next-generation sequencing (NGS). Here, we present a sequence-independent method to rapidly amplify viral genomic RNA while simultaneously abolishing both viral and genomic RNA infectivity across multiple single-stranded positive-sense RNA (ssRNA+) virus families. The process generates barcoded DNA amplicons that range in length from 300 to 1,000 bp, which cannot be used to rescue a virus and are stable to transport at room temperature. Our barcoding approach allows for up to 288 barcoded samples to be pooled into a single library and run across various NGS platforms without potential reconstitution of the viral genome. Our data demonstrate that this approach provides full-length genomic sequence information not only from high-titer virion preparations but it can also recover specific viral sequence from samples with limited starting material in the background of cellular RNA, and it can be used to identify pathogens from unknown samples. In summary, we describe a rapid, universal standard operating procedure that generates high-quality NGS libraries free of infectious virus and infectious viral RNA. IMPORTANCE This report establishes and validates a standard operating procedure (SOP) for select agents (SAs) and other biosafety level 3 and/or 4 (BSL-3/4) RNA viruses to rapidly generate noninfectious, barcoded cDNA amenable for next-generation sequencing (NGS). This eliminates the burden of testing all processed samples derived from high-consequence pathogens prior to transfer from high-containment laboratories to lower-containment facilities for sequencing. Our established protocol can be scaled up for high-throughput sequencing of hundreds of samples simultaneously, which can dramatically reduce the cost and effort required for NGS library construction. NGS data from this SOP can provide complete genome coverage from viral stocks and can also detect virus-specific reads from limited starting material. Our data suggest that the procedure can be implemented and easily validated by institutional biosafety committees across research laboratories. PMID:27822536
Functional Role of Infective Viral Particles on Metal Reduction
DOE Office of Scientific and Technical Information (OSTI.GOV)
Coates, John D.
2014-04-01
A proposed strategy for the remediation of uranium (U) contaminated sites was based on the immobilization of U by reducing the oxidized soluble U, U(VI), to form a reduced insoluble end product, U(IV). Previous studies identified Geobacter sp., including G. sulfurreducens and G. metallireducens, as predominant U(VI)-reducing bacteria under acetate-oxidizing and U(VI)-reducing conditions. Examination of the finished genome sequence annotation of the canonical metal reducing species Geobacter sulfurreducens strain PCA and G. metallireduceans strain GS-15 as well as the draft genome sequence of G. uraniumreducens strain Rf4 identified phage related proteins. In addition, the completed genome for Anaeromyxobacter dehalogenans andmore » the draft genome sequence of Desulfovibrio desulfuricans strain G20, two more model metal-reducing bacteria, also revealed phage related sequences. The presence of these gene sequences indicated that Geobacter spp., Anaeromyxobacter spp., and Desulfovibrio spp. are susceptible to viral infection. Furthermore, viral populations in soils and sedimentary environments in the order of 6.4×10{sup 6}–2.7×10{sup 10} VLP’s cm{sup -3} have been observed. In some cases, viral populations exceed bacterial populations in these environments suggesting that a relationship may exist between viruses and bacteria. Our preliminary screens of samples collected from the ESR FRC indicated that viral like particles were observed in significant numbers. The objective of this study was to investigate the potential functional role viruses play in metal reduction specifically Fe(III) and U(VI) reduction, the environmental parameters affecting viral infection of metal reducing bacteria, and the subsequent effects on U transport.« less
Kim, Kiyeon; Omori, Ryosuke; Ueno, Keisuke; Iida, Sayaka; Ito, Kimihito
2016-01-01
Understanding the evolutionary dynamics of influenza viruses is essential to control both avian and human influenza. Here, we analyze host-specific and segment-specific Tajima's D trends of influenza A virus through a systematic review using viral sequences registered in the National Center for Biotechnology Information. To avoid bias from viral population subdivision, viral sequences were stratified according to their sampling locations and sampling years. As a result, we obtained a total of 580 datasets each of which consists of nucleotide sequences of influenza A viruses isolated from a single population of hosts at a single sampling site within a single year. By analyzing nucleotide sequences in the datasets, we found that Tajima's D values of viral sequences were different depending on hosts and gene segments. Tajima's D values of viruses isolated from chicken and human samples showed negative, suggesting purifying selection or a rapid population growth of the viruses. The negative Tajima's D values in rapidly growing viral population were also observed in computer simulations. Tajima's D values of PB2, PB1, PA, NP, and M genes of the viruses circulating in wild mallards were close to zero, suggesting that these genes have undergone neutral selection in constant-sized population. On the other hand, Tajima's D values of HA and NA genes of these viruses were positive, indicating HA and NA have undergone balancing selection in wild mallards. Taken together, these results indicated the existence of unknown factors that maintain viral subtypes in wild mallards.
Genome signature analysis of thermal virus metagenomes reveals Archaea and thermophilic signatures
Pride, David T; Schoenfeld, Thomas
2008-01-01
Background Metagenomic analysis provides a rich source of biological information for otherwise intractable viral communities. However, study of viral metagenomes has been hampered by its nearly complete reliance on BLAST algorithms for identification of DNA sequences. We sought to develop algorithms for examination of viral metagenomes to identify the origin of sequences independent of BLAST algorithms. We chose viral metagenomes obtained from two hot springs, Bear Paw and Octopus, in Yellowstone National Park, as they represent simple microbial populations where comparatively large contigs were obtained. Thermal spring metagenomes have high proportions of sequences without significant Genbank homology, which has hampered identification of viruses and their linkage with hosts. To analyze each metagenome, we developed a method to classify DNA fragments using genome signature-based phylogenetic classification (GSPC), where metagenomic fragments are compared to a database of oligonucleotide signatures for all previously sequenced Bacteria, Archaea, and viruses. Results From both Bear Paw and Octopus hot springs, each assembled contig had more similarity to other metagenome contigs than to any sequenced microbial genome based on GSPC analysis, suggesting a genome signature common to each of these extreme environments. While viral metagenomes from Bear Paw and Octopus share some similarity, the genome signatures from each locale are largely unique. GSPC using a microbial database predicts most of the Octopus metagenome has archaeal signatures, while bacterial signatures predominate in Bear Paw; a finding consistent with those of Genbank BLAST. When using a viral database, the majority of the Octopus metagenome is predicted to belong to archaeal virus Families Globuloviridae and Fuselloviridae, while none of the Bear Paw metagenome is predicted to belong to archaeal viruses. As expected, when microbial and viral databases are combined, each of the Octopus and Bear Paw metagenomic contigs are predicted to belong to viruses rather than to any Bacteria or Archaea, consistent with the apparent viral origin of both metagenomes. Conclusion That BLAST searches identify no significant homologs for most metagenome contigs, while GSPC suggests their origin as archaeal viruses or bacteriophages, indicates GSPC provides a complementary approach in viral metagenomic analysis. PMID:18798991
Genome signature analysis of thermal virus metagenomes reveals Archaea and thermophilic signatures.
Pride, David T; Schoenfeld, Thomas
2008-09-17
Metagenomic analysis provides a rich source of biological information for otherwise intractable viral communities. However, study of viral metagenomes has been hampered by its nearly complete reliance on BLAST algorithms for identification of DNA sequences. We sought to develop algorithms for examination of viral metagenomes to identify the origin of sequences independent of BLAST algorithms. We chose viral metagenomes obtained from two hot springs, Bear Paw and Octopus, in Yellowstone National Park, as they represent simple microbial populations where comparatively large contigs were obtained. Thermal spring metagenomes have high proportions of sequences without significant Genbank homology, which has hampered identification of viruses and their linkage with hosts. To analyze each metagenome, we developed a method to classify DNA fragments using genome signature-based phylogenetic classification (GSPC), where metagenomic fragments are compared to a database of oligonucleotide signatures for all previously sequenced Bacteria, Archaea, and viruses. From both Bear Paw and Octopus hot springs, each assembled contig had more similarity to other metagenome contigs than to any sequenced microbial genome based on GSPC analysis, suggesting a genome signature common to each of these extreme environments. While viral metagenomes from Bear Paw and Octopus share some similarity, the genome signatures from each locale are largely unique. GSPC using a microbial database predicts most of the Octopus metagenome has archaeal signatures, while bacterial signatures predominate in Bear Paw; a finding consistent with those of Genbank BLAST. When using a viral database, the majority of the Octopus metagenome is predicted to belong to archaeal virus Families Globuloviridae and Fuselloviridae, while none of the Bear Paw metagenome is predicted to belong to archaeal viruses. As expected, when microbial and viral databases are combined, each of the Octopus and Bear Paw metagenomic contigs are predicted to belong to viruses rather than to any Bacteria or Archaea, consistent with the apparent viral origin of both metagenomes. That BLAST searches identify no significant homologs for most metagenome contigs, while GSPC suggests their origin as archaeal viruses or bacteriophages, indicates GSPC provides a complementary approach in viral metagenomic analysis.
Lam, Tommy Tsan-Yuk; Ip, Hon S.; Ghedin, Elodie; Wentworth, David E.; Halpin, Rebecca A.; Stockwell, Timothy B.; Spiro, David J.; Dusek, Robert J.; Bortner, James B.; Hoskins, Jenny; Bales, Bradley D.; Yparraguirre, Dan R.; Holmes, Edward C.
2012-01-01
Despite the importance of migratory birds in the ecology and evolution of avian influenza virus (AIV), there is a lack of information on the patterns of AIV spread at the intra-continental scale. We applied a variety of statistical phylogeographic techniques to a plethora of viral genome sequence data to determine the strength, pattern and determinants of gene flow in AIV sampled from wild birds in North America. These analyses revealed a clear isolation-by-distance of AIV among sampling localities. In addition, we show that phylogeographic models incorporating information on the avian flyway of sampling proved a better fit to the observed sequence data than those specifying homogeneous or random rates of gene flow among localities. In sum, these data strongly suggest that the intra-continental spread of AIV by migratory birds is subject to major ecological barriers, including spatial distance and avian flyway.
Gubala, Aneta; Davis, Steven; Weir, Richard; Melville, Lorna; Cowled, Chris; Boyle, David
2011-09-01
Tibrogargan virus (TIBV) and Coastal Plains virus (CPV) were isolated from cattle in Australia and TIBV has also been isolated from the biting midge Culicoides brevitarsis. Complete genomic sequencing revealed that the viruses share a novel genome structure within the family Rhabdoviridae, each virus containing two additional putative genes between the matrix protein (M) and glycoprotein (G) genes and one between the G and viral RNA polymerase (L) genes. The predicted novel protein products are highly diverged at the sequence level but demonstrate clear conservation of secondary structure elements, suggesting conservation of biological functions. Phylogenetic analyses showed that TIBV and CPV form an independent group within the 'dimarhabdovirus supergroup'. Although no disease has been observed in association with these viruses, antibodies were detected at high prevalence in cattle and buffalo in northern Australia, indicating the need for disease monitoring and further study of this distinctive group of viruses.
Rapid Sequencing of Complete env Genes from Primary HIV-1 Samples
Eren, Kemal; Ignacio, Caroline; Landais, Elise; Weaver, Steven; Phung, Pham; Ludka, Colleen; Hepler, Lance; Caballero, Gemma; Pollner, Tristan; Guo, Yan; Richman, Douglas; Poignard, Pascal; Paxinos, Ellen E.; Kosakovsky Pond, Sergei L.
2016-01-01
Abstract The ability to study rapidly evolving viral populations has been constrained by the read length of next-generation sequencing approaches and the sampling depth of single-genome amplification methods. Here, we develop and characterize a method using Pacific Biosciences’ Single Molecule, Real-Time (SMRT®) sequencing technology to sequence multiple, intact full-length human immunodeficiency virus-1 env genes amplified from viral RNA populations circulating in blood, and provide computational tools for analyzing and visualizing these data. PMID:29492273
Shah, Jigna D; Baller, Joshua; Zhang, Ying; Silverstein, Kevin; Xing, Zheng; Cardona, Carol J
2014-12-01
RNA viruses have been associated with enteritis in poultry and have been isolated from diseased birds. The same viral agents have also been detected in healthy flocks bringing into question their role in health and disease. In order to understand better eukaryotic viruses in the gut, this project focused on evaluating alternative methods to purify and concentrate viral particles, which do not involve the use of density gradients, for generating viral metagenome data. In this study, the sequence outcomes of three tissue processing methods have been evaluated and a data analysis pipeline has been established for RNA viruses from the gastrointestinal tract. In addition, with the use of the best method and increased sequencing depth, a glimpse of the RNA viral community in the gastrointestinal tract of a clinically normal 5-week old turkey is presented. The viruses from the Reoviridae and Astroviridae families together accounted for 76.3% of total viruses identified. The rarefaction curve at the species level further indicated that majority of the species diversity was included with the increased sequencing depth, implying that viruses from other viral families were present in very low abundance. Copyright © 2014 Elsevier B.V. All rights reserved.
Mukherjee, Krishanu; Campos, Henry; Kolaczkowski, Bryan
2013-03-01
RNA interference (RNAi) is a eukaryotic molecular system that serves two primary functions: 1) gene regulation and 2) protection against selfish elements such as viruses and transposable DNA. Although the biochemistry of RNAi has been detailed in model organisms, very little is known about the broad-scale patterns and forces that have shaped RNAi evolution. Here, we provide a comprehensive evolutionary analysis of the Dicer protein family, which carries out the initial RNA recognition and processing steps in the RNAi pathway. We show that Dicer genes duplicated and diversified independently in early animal and plant evolution, coincident with the origins of multicellularity. We identify a strong signature of long-term protein-coding adaptation that has continually reshaped the RNA-binding pocket of the plant Dicer responsible for antiviral immunity, suggesting an evolutionary arms race with viral factors. We also identify key changes in Dicer domain architecture and sequence leading to specialization in either gene-regulatory or protective functions in animal and plant paralogs. As a whole, these results reveal a dynamic picture in which the evolution of Dicer function has driven elaboration of parallel RNAi functional pathways in animals and plants.
Pathogen evolution and disease emergence in carnivores.
McCarthy, Alex J; Shaw, Marie-Anne; Goodman, Simon J
2007-12-22
Emerging infectious diseases constitute some of the most pressing problems for both human and domestic animal health, and biodiversity conservation. Currently it is not clear whether the removal of past constraints on geographical distribution and transmission possibilities for pathogens alone are sufficient to give rise to novel host-pathogen combinations, or whether pathogen evolution is also generally required for establishment in novel hosts. Canine distemper virus (CDV) is a morbillivirus that is prevalent in the world dog population and poses an important conservation threat to a diverse range of carnivores. We performed an extensive phylogenetic and molecular evolution analysis on complete sequences of all CDV genes to assess the role of selection and recombination in shaping viral genetic diversity and driving the emergence of CDV in non-dog hosts. We tested the specific hypothesis that molecular adaptation at known receptor-binding sites of the haemagglutinin gene is associated with independent instances of the spread of CDV to novel non-dog hosts in the wild. This hypothesis was upheld, providing compelling evidence that repeated evolution at known functional sites (in this case residues 530 and 549 of the haemagglutinin molecule) is associated with multiple independent occurrences of disease emergence in a range of novel host species.
Multiplexing Short Primers for Viral Family PCR
DOE Office of Scientific and Technical Information (OSTI.GOV)
Gardner, S N; Hiddessen, A L; Hara, C A
We describe a Multiplex Primer Prediction (MPP) algorithm to build multiplex compatible primer sets for large, diverse, and unalignable sets of target sequences. The MPP algorithm is scalable to larger target sets than other available software, and it does not require a multiple sequence alignment. We applied it to questions in viral detection, and demonstrated that there are no universally conserved priming sequences among viruses and that it could require an unfeasibly large number of primers ({approx}3700 18-mers or {approx}2000 10-mers) to generate amplicons from all sequenced viruses. We then designed primer sets separately for each viral family, and formore » several diverse species such as foot-and-mouth disease virus, hemagglutinin and neuraminidase segments of influenza A virus, Norwalk virus, and HIV-1.« less
McLaughlin, Paul J; Keegan, Liam P
2014-08-01
Nearly 150 different enzymatically modified forms of the four canonical residues in RNA have been identified. For instance, enzymes of the ADAR (adenosine deaminase acting on RNA) family convert adenosine residues into inosine in cellular dsRNAs. Recent findings show that DNA endonuclease V enzymes have undergone an evolutionary transition from cleaving 3' to deoxyinosine in DNA and ssDNA to cleaving 3' to inosine in dsRNA and ssRNA in humans. Recent work on dsRNA-binding domains of ADARs and other proteins also shows that a degree of sequence specificity is achieved by direct readout in the minor groove. However, the level of sequence specificity observed is much less than that of DNA major groove-binding helix-turn-helix proteins. We suggest that the evolution of DNA-binding proteins following the RNA to DNA genome transition represents the major advantage that DNA genomes have over RNA genomes. We propose that a hypothetical RNA modification, a RRAR (ribose reductase acting on genomic dsRNA) produced the first stretches of DNA in RNA genomes. We discuss why this is the most satisfactory explanation for the origin of DNA. The evolution of this RNA modification and later steps to DNA genomes are likely to have been driven by cellular genome co-evolution with viruses and intragenomic parasites. RNA modifications continue to be involved in host-virus conflicts; in vertebrates, edited cellular dsRNAs with inosine-uracil base pairs appear to be recognized as self RNA and to suppress activation of innate immune sensors that detect viral dsRNA.
DOE Office of Scientific and Technical Information (OSTI.GOV)
Immonen, Taina T.; Conway, Jessica M.; Romero-Severson, Ethan O.
HIV-1 is subject to immune pressure exerted by the host, giving variants that escape the immune response an advantage. Virus released from activated latent cells competes against variants that have continually evolved and adapted to host immune pressure. Nevertheless, there is increasing evidence that virus displaying a signal of latency survives in patient plasma despite having reduced fitness due to long-term immune memory. We investigated the survival of virus with latent envelope genomic fragments by simulating within-host HIV-1 sequence evolution and the cycling of viral lineages in and out of the latent reservoir. Our model incorporates a detailed mutation processmore » including nucleotide substitution, recombination, latent reservoir dynamics, diversifying selection pressure driven by the immune response, and purifying selection pressure asserted by deleterious mutations. We evaluated the ability of our model to capture sequence evolution in vivo by comparing our simulated sequences to HIV-1 envelope sequence data from 16 HIV-infected untreated patients. Empirical sequence divergence and diversity measures were qualitatively and quantitatively similar to those of our simulated HIV-1 populations, suggesting that our model invokes realistic trends of HIV-1 genetic evolution. Moreover, reconstructed phylogenies of simulated and patient HIV-1 populations showed similar topological structures. Our simulation results suggest that recombination is a key mechanism facilitating the persistence of virus with latent envelope genomic fragments in the productively infected cell population. Recombination increased the survival probability of latent virus forms approximately 13-fold. Prevalence of virus with latent fragments in productively infected cells was observed in only 2% of simulations when we ignored recombination, while the proportion increased to 27% of simulations when we allowed recombination. We also found that the selection pressures exerted by different fitness landscapes influenced the shape of phylogenies, diversity trends, and survival of virus with latent genomic fragments. Furthermore, our model predicts that the persistence of latent genomic fragments from multiple different ancestral origins increases sequence diversity in plasma for reasonable fitness landscapes.« less
Immonen, Taina T.; Conway, Jessica M.; Romero-Severson, Ethan O.; ...
2015-12-22
HIV-1 is subject to immune pressure exerted by the host, giving variants that escape the immune response an advantage. Virus released from activated latent cells competes against variants that have continually evolved and adapted to host immune pressure. Nevertheless, there is increasing evidence that virus displaying a signal of latency survives in patient plasma despite having reduced fitness due to long-term immune memory. We investigated the survival of virus with latent envelope genomic fragments by simulating within-host HIV-1 sequence evolution and the cycling of viral lineages in and out of the latent reservoir. Our model incorporates a detailed mutation processmore » including nucleotide substitution, recombination, latent reservoir dynamics, diversifying selection pressure driven by the immune response, and purifying selection pressure asserted by deleterious mutations. We evaluated the ability of our model to capture sequence evolution in vivo by comparing our simulated sequences to HIV-1 envelope sequence data from 16 HIV-infected untreated patients. Empirical sequence divergence and diversity measures were qualitatively and quantitatively similar to those of our simulated HIV-1 populations, suggesting that our model invokes realistic trends of HIV-1 genetic evolution. Moreover, reconstructed phylogenies of simulated and patient HIV-1 populations showed similar topological structures. Our simulation results suggest that recombination is a key mechanism facilitating the persistence of virus with latent envelope genomic fragments in the productively infected cell population. Recombination increased the survival probability of latent virus forms approximately 13-fold. Prevalence of virus with latent fragments in productively infected cells was observed in only 2% of simulations when we ignored recombination, while the proportion increased to 27% of simulations when we allowed recombination. We also found that the selection pressures exerted by different fitness landscapes influenced the shape of phylogenies, diversity trends, and survival of virus with latent genomic fragments. Furthermore, our model predicts that the persistence of latent genomic fragments from multiple different ancestral origins increases sequence diversity in plasma for reasonable fitness landscapes.« less
Rose, Rebecca; Lamers, Susanna L.; Nolan, David J.; Maidji, Ekaterina; Faria, N. R.; Pybus, Oliver G.; Dollar, James J.; Maruniak, Samuel A.; McAvoy, Andrew C.; Salemi, Marco; Stoddart, Cheryl A.; Singer, Elyse J.
2016-01-01
ABSTRACT While combined antiretroviral therapy (cART) can result in undetectable plasma viral loads, it does not eradicate HIV infection. Furthermore, HIV-infected individuals while on cART remain at an increased risk of developing serious comorbidities, such as cancer, neurological disease, and atherosclerosis, suggesting that during cART, tissue-based HIV may contribute to such pathologies. We obtained DNA and RNA env, nef, and pol sequences using single-genome sequencing from postmortem tissues of three HIV+ cART-treated (cART+) individuals with undetectable viral load and metastatic cancer at death and performed time-scaled Bayesian evolutionary analyses. We used a sensitive in situ hybridization technique to visualize HIV gag-pol mRNA transcripts in cerebellum and lymph node tissues from one patient. Tissue-associated virus evolved at similar rates in cART+ and cART-naive (cART−) patients. Phylogenetic trees were characterized by two distinct features: (i) branching patterns consistent with constant viral evolution and dispersal among tissues and (ii) very recently derived clades containing both DNA and RNA sequences from multiple tissues. Rapid expansion of virus near death corresponded to wide-spread metastasis. HIV RNA+ cells clustered in cerebellum tissue but were dispersed in lymph node tissue, mirroring the evolutionary patterns observed for that patient. Activated, infiltrating macrophages were associated with HIV RNA. Our data provide evidence that tissues serve as a sanctuary for wild-type HIV during cART and suggest the importance of macrophages as an alternative reservoir and mechanism of virus spread. IMPORTANCE Combined antiretroviral therapy (cART) reduces plasma HIV to undetectable levels; however, removal of cART results in plasma HIV rebound, thus highlighting its inability to entirely rid the body of infection. Additionally, HIV-infected individuals on cART remain at high risk of serious diseases, which suggests a contribution from residual HIV. In this study, we isolated and sequenced HIV from postmortem tissues from three HIV+ cART+ individuals who died with metastatic cancer and had no detectable plasma viral load. Using high-resolution evolutionary analyses, we found that tissue-based HIV continues to replicate, evolve, and migrate among tissues during cART. Furthermore, cancer onset and metastasis coincided with increased HIV expansion, suggesting a linked mechanism. HIV-expressing cells were associated with tissue macrophages, a target of HIV infection. Our results suggest the importance of tissues, and macrophages in particular, as a target for novel anti-HIV therapies. PMID:27466425
Rose, Rebecca; Lamers, Susanna L; Nolan, David J; Maidji, Ekaterina; Faria, N R; Pybus, Oliver G; Dollar, James J; Maruniak, Samuel A; McAvoy, Andrew C; Salemi, Marco; Stoddart, Cheryl A; Singer, Elyse J; McGrath, Michael S
2016-10-15
While combined antiretroviral therapy (cART) can result in undetectable plasma viral loads, it does not eradicate HIV infection. Furthermore, HIV-infected individuals while on cART remain at an increased risk of developing serious comorbidities, such as cancer, neurological disease, and atherosclerosis, suggesting that during cART, tissue-based HIV may contribute to such pathologies. We obtained DNA and RNA env, nef, and pol sequences using single-genome sequencing from postmortem tissues of three HIV(+) cART-treated (cART(+)) individuals with undetectable viral load and metastatic cancer at death and performed time-scaled Bayesian evolutionary analyses. We used a sensitive in situ hybridization technique to visualize HIV gag-pol mRNA transcripts in cerebellum and lymph node tissues from one patient. Tissue-associated virus evolved at similar rates in cART(+) and cART-naive (cART(-)) patients. Phylogenetic trees were characterized by two distinct features: (i) branching patterns consistent with constant viral evolution and dispersal among tissues and (ii) very recently derived clades containing both DNA and RNA sequences from multiple tissues. Rapid expansion of virus near death corresponded to wide-spread metastasis. HIV RNA(+) cells clustered in cerebellum tissue but were dispersed in lymph node tissue, mirroring the evolutionary patterns observed for that patient. Activated, infiltrating macrophages were associated with HIV RNA. Our data provide evidence that tissues serve as a sanctuary for wild-type HIV during cART and suggest the importance of macrophages as an alternative reservoir and mechanism of virus spread. Combined antiretroviral therapy (cART) reduces plasma HIV to undetectable levels; however, removal of cART results in plasma HIV rebound, thus highlighting its inability to entirely rid the body of infection. Additionally, HIV-infected individuals on cART remain at high risk of serious diseases, which suggests a contribution from residual HIV. In this study, we isolated and sequenced HIV from postmortem tissues from three HIV(+) cART(+) individuals who died with metastatic cancer and had no detectable plasma viral load. Using high-resolution evolutionary analyses, we found that tissue-based HIV continues to replicate, evolve, and migrate among tissues during cART. Furthermore, cancer onset and metastasis coincided with increased HIV expansion, suggesting a linked mechanism. HIV-expressing cells were associated with tissue macrophages, a target of HIV infection. Our results suggest the importance of tissues, and macrophages in particular, as a target for novel anti-HIV therapies. Copyright © 2016, American Society for Microbiology. All Rights Reserved.
Bass, David; Moureau, Gregory; Tang, Shuoya; McAlister, Erica; Culverwell, C. Lorna; Glücksman, Edvard; Wang, Hui; Brown, T. David K.; Gould, Ernest A.; Harbach, Ralph E.; de Lamballerie, Xavier; Firth, Andrew E.
2013-01-01
We investigated whether small RNA (sRNA) sequenced from field-collected mosquitoes and chironomids (Diptera) can be used as a proxy signature of viral prevalence within a range of species and viral groups, using sRNAs sequenced from wild-caught specimens, to inform total RNA deep sequencing of samples of particular interest. Using this strategy, we sequenced from adult Anopheles maculipennis s.l. mosquitoes the apparently nearly complete genome of one previously undescribed virus related to chronic bee paralysis virus, and, from a pool of Ochlerotatus caspius and Oc. detritus mosquitoes, a nearly complete entomobirnavirus genome. We also reconstructed long sequences (1503-6557 nt) related to at least nine other viruses. Crucially, several of the sequences detected were reconstructed from host organisms highly divergent from those in which related viruses have been previously isolated or discovered. It is clear that viral transmission and maintenance cycles in nature are likely to be significantly more complex and taxonomically diverse than previously expected. PMID:24260463
Dacheux, Laurent; Cervantes-Gonzalez, Minerva; Guigon, Ghislaine; Thiberge, Jean-Michel; Vandenbogaert, Mathias; Maufrais, Corinne
2014-01-01
The prediction of viral zoonosis epidemics has become a major public health issue. A profound understanding of the viral population in key animal species acting as reservoirs represents an important step towards this goal. Bats harbor diverse viruses, some of which are of particular interest because they cause severe human diseases. However, little is known about the diversity of the global population of viruses found in bats (virome). We determined the viral diversity of five different French insectivorous bat species (nine specimens in total) in close contact with humans. Sequence-independent amplification, high-throughput sequencing with Illumina technology and a dedicated bioinformatics analysis pipeline were used on pooled tissues (brain, liver and lungs). Comparisons of the sequences of contigs and unassembled reads provided a global taxonomic distribution of virus-related sequences for each sample, highlighting differences both within and between bat species. Many viral families were present in these viromes, including viruses known to infect bacteria, plants/fungi, insects or vertebrates, the most relevant being those infecting mammals (Retroviridae, Herpesviridae, Bunyaviridae, Poxviridae, Flaviviridae, Reoviridae, Bornaviridae, Picobirnaviridae). In particular, we detected several new mammalian viruses, including rotaviruses, gammaretroviruses, bornaviruses and bunyaviruses with the identification of the first bat nairovirus. These observations demonstrate that bats naturally harbor viruses from many different families, most of which infect mammals. They may therefore constitute a major reservoir of viral diversity that should be analyzed carefully, to determine the role played by bats in the spread of zoonotic viral infections. PMID:24489870
The Fecal Viral Flora of Wild Rodents
Phan, Tung G.; Kapusinszky, Beatrix; Wang, Chunlin; Rose, Robert K.; Lipton, Howard L.; Delwart, Eric L.
2011-01-01
The frequent interactions of rodents with humans make them a common source of zoonotic infections. To obtain an initial unbiased measure of the viral diversity in the enteric tract of wild rodents we sequenced partially purified, randomly amplified viral RNA and DNA in the feces of 105 wild rodents (mouse, vole, and rat) collected in California and Virginia. We identified in decreasing frequency sequences related to the mammalian viruses families Circoviridae, Picobirnaviridae, Picornaviridae, Astroviridae, Parvoviridae, Papillomaviridae, Adenoviridae, and Coronaviridae. Seventeen small circular DNA genomes containing one or two replicase genes distantly related to the Circoviridae representing several potentially new viral families were characterized. In the Picornaviridae family two new candidate genera as well as a close genetic relative of the human pathogen Aichi virus were characterized. Fragments of the first mouse sapelovirus and picobirnaviruses were identified and the first murine astrovirus genome was characterized. A mouse papillomavirus genome and fragments of a novel adenovirus and adenovirus-associated virus were also sequenced. The next largest fraction of the rodent fecal virome was related to insect viruses of the Densoviridae, Iridoviridae, Polydnaviridae, Dicistroviriade, Bromoviridae, and Virgaviridae families followed by plant virus-related sequences in the Nanoviridae, Geminiviridae, Phycodnaviridae, Secoviridae, Partitiviridae, Tymoviridae, Alphaflexiviridae, and Tombusviridae families reflecting the largely insect and plant rodent diet. Phylogenetic analyses of full and partial viral genomes therefore revealed many previously unreported viral species, genera, and families. The close genetic similarities noted between some rodent and human viruses might reflect past zoonoses. This study increases our understanding of the viral diversity in wild rodents and highlights the large number of still uncharacterized viruses in mammals. PMID:21909269
Viral taxonomy needs a spring clean; its exploration era is over.
Gibbs, Adrian J
2013-08-09
The International Committee on Taxonomy of Viruses has recently changed its approved definition of a viral species, and also discontinued work on its database of virus descriptions. These events indicate that the exploration era of viral taxonomy has ended; over the past century the principles of viral taxonomy have been established, the tools for phylogenetic inference invented, and the ultimate discriminatory data required for taxonomy, namely gene sequences, are now readily available. Further changes would make viral taxonomy more informative. First, the status of a 'taxonomic species' with an italicized name should only be given to viruses that are specifically linked with a single 'type genomic sequence' like those in the NCBI Reference Sequence Database. Secondly all approved taxa should be predominately monophyletic, and uninformative higher taxa disendorsed. These are 'quality assurance' measures and would improve the value of viral nomenclature to its users. The ICTV should also promote the use of a public database, such as Wikipedia, to replace the ICTV database as a store of the primary metadata of individual viruses, and should publish abstracts of the ICTV Reports in that database, so that they are 'Open Access'.
Luft, F; Klaes, R; Nees, M; Dürst, M; Heilmann, V; Melsheimer, P; von Knebel Doeberitz, M
2001-04-01
Human papillomavirus (HPV) genomes usually persist as episomal molecules in HPV associated preneoplastic lesions whereas they are frequently integrated into the host cell genome in HPV-related cancers cells. This suggests that malignant conversion of HPV-infected epithelia is linked to recombination of cellular and viral sequences. Due to technical limitations, precise sequence information on viral-cellular junctions were obtained only for few cell lines and primary lesions. In order to facilitate the molecular analysis of genomic HPV integration, we established a ligation-mediated PCR assay for the detection of integrated papillomavirus sequences (DIPS-PCR). DIPS-PCR was initially used to amplify genomic viral-cellular junctions from HPV-associated cervical cancer cell lines (C4-I, C4-II, SW756, and HeLa) and HPV-immortalized keratinocyte lines (HPKIA, HPKII). In addition to junctions already reported in public data bases, various new fusion fragments were identified. Subsequently, 22 different viral-cellular junctions were amplified from 17 cervical carcinomas and 1 vulval intraepithelial neoplasia (VIN III). Sequence analysis of each junction revealed that the viral E1 open reading frame (ORF) was fused to cellular sequences in 20 of 22 (91%) cases. Chromosomal integration loci mapped to chromosomes 1 (2n), 2 (3n), 7 (2n), 8 (3n), 10 (1n), 14 (5n), 16 (1n), 17 (2n), and mitochondrial DNA (1n), suggesting random distribution of chromosomal integration sites. Precise sequence information obtained by DIPS-PCR was further used to monitor the monoclonal origin of 4 cervical cancers, 1 case of recurrent premalignant lesions and 1 lymph node metastasis. Therefore, DIPS-PCR might allow efficient therapy control and prediction of relapse in patients with HPV-associated anogenital cancers. Copyright 2001 Wiley-Liss, Inc.
Gallei, Andreas; Orlich, Michaela; Thiel, Heinz-Juergen; Becher, Paul
2005-01-01
Several studies have demonstrated that cytopathogenic (cp) pestivirus strains evolve from noncytopathogenic (noncp) viruses by nonhomologous RNA recombination. In addition, two recent reports showed the rapid emergence of noncp Bovine viral diarrhea virus (BVDV) after a few cell culture passages of cp BVDV strains by homologous recombination between identical duplicated viral sequences. To allow the identification of recombination sites from noncp BVDV strains that evolve from cp viruses, we constructed the cp BVDV strains CP442 and CP552. Both harbor duplicated viral sequences of different origin flanking the cellular insertion Nedd8*; the latter is a prerequisite for their cytopathogenicity. In contrast to the previous studies, isolation of noncp strains was possible only after extensive cell culture passages of CP442 and CP552. Sequence analysis of 15 isolated noncp BVDVs confirmed that all recombinant strains lack at least most of Nedd8*. Interestingly, only one strain resulted from homologous recombination while the other 14 strains were generated by nonhomologous recombination. Accordingly, our data suggest that the extent of sequence identity between participating sequences influences both frequency and mode (homologous versus nonhomologous) of RNA recombination in pestiviruses. Further analyses of the noncp recombinant strains revealed that a duplication of 14 codons in the BVDV nonstructural protein 4B (NS4B) gene does not interfere with efficient viral replication. Moreover, an insertion of viral sequences between the NS4A and NS4B genes was well tolerated. These findings thus led to the identification of two genomic loci which appear to be suited for the insertion of heterologous sequences into the genomes of pestiviruses and related viruses. PMID:16254361
Rotavirus I in feces of a cat with diarrhea.
Phan, Tung G; Leutenegger, Christian M; Chan, Roxanne; Delwart, Eric
2017-06-01
A divergent rotavirus I was detected using viral metagenomics in the feces of a cat with diarrhea. The eleven segments of rotavirus I strain Felis catus encoded non-structural and structural proteins with amino acid identities ranging from 25 to 79% to the only two currently sequenced members of that viral species both derived from canine feces. No other eukaryotic viral sequences nor bacterial and protozoan pathogens were detected in this fecal sample suggesting the involvement of rotavirus I in feline diarrhea.
Tilton, John C.; Wilen, Craig B.; Didigu, Chukwuka A.; Sinha, Rohini; Harrison, Jessamina E.; Agrawal-Gamse, Caroline; Henning, Elizabeth A.; Bushman, Frederick D.; Martin, Jeffrey N.; Deeks, Steven G.; Doms, Robert W.
2010-01-01
CCR5 antagonists inhibit HIV entry by binding to a coreceptor and inducing changes in the extracellular loops (ECLs) of CCR5. In this study, we analyzed viruses from 11 treatment-experienced patients who experienced virologic failure on treatment regimens containing the CCR5 antagonist maraviroc (MVC). Viruses from one patient developed high-level resistance to MVC during the course of treatment. Although resistance to one CCR5 antagonist is often associated with broad cross-resistance to other agents, these viruses remained sensitive to most other CCR5 antagonists, including vicriviroc and aplaviroc. MVC resistance was dependent upon mutations within the V3 loop of the viral envelope (Env) protein and was modulated by additional mutations in the V4 loop. Deep sequencing of pretreatment plasma viral RNA indicated that resistance appears to have occurred by evolution of drug-bound CCR5 use, despite the presence of viral sequences predictive of CXCR4 use. Envs obtained from this patient before and during MVC treatment were able to infect cells expressing very low CCR5 levels, indicating highly efficient use of a coreceptor. In contrast to previous reports in which CCR5 antagonist-resistant viruses interact predominantly with the N terminus of CCR5, these MVC-resistant Envs were also dependent upon the drug-modified ECLs of CCR5 for entry. Our results suggest a model of CCR5 cross-resistance whereby viruses that predominantly utilize the N terminus are broadly cross-resistant to multiple CCR5 antagonists, whereas viruses that require both the N terminus and antagonist-specific ECL changes demonstrate a narrow cross-resistance profile. PMID:20702642
Bull, Marta; Learn, Gerald; Genowati, Indira; McKernan, Jennifer; Hitti, Jane; Lockhart, David; Tapia, Kenneth; Holte, Sarah; Dragavon, Joan; Coombs, Robert; Mullins, James; Frenkel, Lisa
2009-09-22
Compartmentalization of HIV-1 between the genital tract and blood was noted in half of 57 women included in 12 studies primarily using cell-free virus. To further understand differences between genital tract and blood viruses of women with chronic HIV-1 infection cell-free and cell-associated virus populations were sequenced from these tissues, reasoning that integrated viral DNA includes variants archived from earlier in infection, and provides a greater array of genotypes for comparisons. Multiple sequences from single-genome-amplification of HIV-1 RNA and DNA from the genital tract and blood of each woman were compared in a cross-sectional study. Maximum likelihood phylogenies were evaluated for evidence of compartmentalization using four statistical tests. Genital tract and blood HIV-1 appears compartmentalized in 7/13 women by >/=2 statistical analyses. These subjects' phylograms were characterized by low diversity genital-specific viral clades interspersed between clades containing both genital and blood sequences. Many of the genital-specific clades contained monotypic HIV-1 sequences. In 2/7 women, HIV-1 populations were significantly compartmentalized across all four statistical tests; both had low diversity genital tract-only clades. Collapsing monotypic variants into a single sequence diminished the prevalence and extent of compartmentalization. Viral sequences did not demonstrate tissue-specific signature amino acid residues, differential immune selection, or co-receptor usage. In women with chronic HIV-1 infection multiple identical sequences suggest proliferation of HIV-1-infected cells, and low diversity tissue-specific phylogenetic clades are consistent with bursts of viral replication. These monotypic and tissue-specific viruses provide statistical support for compartmentalization of HIV-1 between the female genital tract and blood. However, the intermingling of these clades with clades comprised of both genital and blood sequences and the absence of tissue-specific genetic features suggests compartmentalization between blood and genital tract may be due to viral replication and proliferation of infected cells, and questions whether HIV-1 in the female genital tract is distinct from blood.
Going, going, gone: predicting the fate of genomic insertions in plant RNA viruses.
Willemsen, Anouk; Carrasco, José L; Elena, Santiago F; Zwart, Mark P
2018-05-10
Horizontal gene transfer is common among viruses, while they also have highly compact genomes and tend to lose artificial genomic insertions rapidly. Understanding the stability of genomic insertions in viral genomes is therefore relevant for explaining and predicting their evolutionary patterns. Here, we revisit a large body of experimental research on a plant RNA virus, tobacco etch potyvirus (TEV), to identify the patterns underlying the stability of a range of homologous and heterologous insertions in the viral genome. We obtained a wide range of estimates for the recombination rate-the rate at which deletions removing the insertion occur-and these appeared to be independent of the type of insertion and its location. Of the factors we considered, recombination rate was the best predictor of insertion stability, although we could not identify the specific sequence characteristics that would help predict insertion instability. We also considered experimentally the possibility that functional insertions lead to higher mutational robustness through increased redundancy. However, our observations suggest that both functional and non-functional increases in genome size decreased the mutational robustness. Our results therefore demonstrate the importance of recombination rates for predicting the long-term stability and evolution of viral RNA genomes and suggest that there are unexpected drawbacks to increases in genome size for mutational robustness.
Genetic diversity and evolutionary dynamics of Ebola virus in Sierra Leone.
Tong, Yi-Gang; Shi, Wei-Feng; Liu, Di; Qian, Jun; Liang, Long; Bo, Xiao-Chen; Liu, Jun; Ren, Hong-Guang; Fan, Hang; Ni, Ming; Sun, Yang; Jin, Yuan; Teng, Yue; Li, Zhen; Kargbo, David; Dafae, Foday; Kanu, Alex; Chen, Cheng-Chao; Lan, Zhi-Heng; Jiang, Hui; Luo, Yang; Lu, Hui-Jun; Zhang, Xiao-Guang; Yang, Fan; Hu, Yi; Cao, Yu-Xi; Deng, Yong-Qiang; Su, Hao-Xiang; Sun, Yu; Liu, Wen-Sen; Wang, Zhuang; Wang, Cheng-Yu; Bu, Zhao-Yang; Guo, Zhen-Dong; Zhang, Liu-Bo; Nie, Wei-Min; Bai, Chang-Qing; Sun, Chun-Hua; An, Xiao-Ping; Xu, Pei-Song; Zhang, Xiang-Li-Lan; Huang, Yong; Mi, Zhi-Qiang; Yu, Dong; Yao, Hong-Wu; Feng, Yong; Xia, Zhi-Ping; Zheng, Xue-Xing; Yang, Song-Tao; Lu, Bing; Jiang, Jia-Fu; Kargbo, Brima; He, Fu-Chu; Gao, George F; Cao, Wu-Chun
2015-08-06
A novel Ebola virus (EBOV) first identified in March 2014 has infected more than 25,000 people in West Africa, resulting in more than 10,000 deaths. Preliminary analyses of genome sequences of 81 EBOV collected from March to June 2014 from Guinea and Sierra Leone suggest that the 2014 EBOV originated from an independent transmission event from its natural reservoir followed by sustained human-to-human infections. It has been reported that the EBOV genome variation might have an effect on the efficacy of sequence-based virus detection and candidate therapeutics. However, only limited viral information has been available since July 2014, when the outbreak entered a rapid growth phase. Here we describe 175 full-length EBOV genome sequences from five severely stricken districts in Sierra Leone from 28 September to 11 November 2014. We found that the 2014 EBOV has become more phylogenetically and genetically diverse from July to November 2014, characterized by the emergence of multiple novel lineages. The substitution rate for the 2014 EBOV was estimated to be 1.23 × 10(-3) substitutions per site per year (95% highest posterior density interval, 1.04 × 10(-3) to 1.41 × 10(-3) substitutions per site per year), approximating to that observed between previous EBOV outbreaks. The sharp increase in genetic diversity of the 2014 EBOV warrants extensive EBOV surveillance in Sierra Leone, Guinea and Liberia to better understand the viral evolution and transmission dynamics of the ongoing outbreak. These data will facilitate the international efforts to develop vaccines and therapeutics.
Dynamics of Preferential Substrate Recognition in HIV-1 Protease: Redefining the Substrate Envelope
Özen, Ayşegül; Haliloğlu, Türkan; Schiffer, Celia A.
2011-01-01
HIV-1 protease (PR) permits viral maturation by processing the Gag and Gag-Pro-Pol polyproteins. Though HIV-1 PR inhibitors (PIs) are used in combination antiviral therapy, the emergence of drug resistance has limited their efficacy. The rapid evolution of HIV-1 necessitates the consideration of drug resistance in novel drug-design strategies. Drug-resistant HIV-1 PR variants, while no longer efficiently inhibited, continue to efficiently hydrolyze the natural viral substrates. Though highly diverse in sequence, the HIV-1 PR substrates bind in a conserved three-dimensional shape we defined as the “substrate envelope”. We previously showed that resistance mutations arise where PIs protrude beyond the substrate envelope, as these regions are crucial for drug binding but not for substrate recognition. Here, we extend this model by considering the role of protein dynamics in the interaction of HIV-1 PR with its substrates. Seven molecular dynamics simulations of PR-substrate complexes were performed to estimate the conformational flexibility of substrates in their complexes. Interdependency of the substrate-protease interactions may compensate for the variations in cleavage-site sequences, and explain how a diverse set of sequences can be recognized as substrates by the same enzyme. This diversity may be essential for regulating sequential processing of substrates. We also define a dynamic substrate envelope as a more accurate representation of PR-substrate interactions. This dynamic substrate envelope, described by a probability distribution function, is a powerful tool for drug design efforts targeting ensembles of resistant HIV-1 PR variants with the aim of developing drugs that are less susceptible to resistance. PMID:21762811
Diversity and distribution of single-stranded DNA phages in the North Atlantic Ocean
Tucker, Kimberly P; Parsons, Rachel; Symonds, Erin M; Breitbart, Mya
2011-01-01
Knowledge of marine phages is highly biased toward double-stranded DNA (dsDNA) phages; however, recent metagenomic surveys have also identified single-stranded DNA (ssDNA) phages in the oceans. Here, we describe two complete ssDNA phage genomes that were reconstructed from a viral metagenome from 80 m depth at the Bermuda Atlantic Time-series Study (BATS) site in the northwestern Sargasso Sea and examine their spatial and temporal distributions. Both genomes (SARssφ1 and SARssφ2) exhibited similarity to known phages of the Microviridae family in terms of size, GC content, genome organization and protein sequence. PCR amplification of the replication initiation protein (Rep) gene revealed narrow and distinct depth distributions for the newly described ssDNA phages within the upper 200 m of the water column at the BATS site. Comparison of Rep gene sequences obtained from the BATS site over time revealed changes in the diversity of ssDNA phages over monthly time scales, although some nearly identical sequences were recovered from samples collected 4 years apart. Examination of ssDNA phage diversity along transects through the North Atlantic Ocean revealed a positive correlation between genetic distance and geographic distance between sampling sites. Together, the data suggest fundamental differences between the distribution of these ssDNA phages and the distribution of known marine dsDNA phages, possibly because of differences in host range, host distribution, virion stability, or viral evolution mechanisms and rates. Future work needs to elucidate the host ranges for oceanic ssDNA phages and determine their ecological roles in the marine ecosystem. PMID:21124487
Massive activation of archaeal defense genes during viral infection.
Quax, Tessa E F; Voet, Marleen; Sismeiro, Odile; Dillies, Marie-Agnes; Jagla, Bernd; Coppée, Jean-Yves; Sezonov, Guennadi; Forterre, Patrick; van der Oost, John; Lavigne, Rob; Prangishvili, David
2013-08-01
Archaeal viruses display unusually high genetic and morphological diversity. Studies of these viruses proved to be instrumental for the expansion of knowledge on viral diversity and evolution. The Sulfolobus islandicus rod-shaped virus 2 (SIRV2) is a model to study virus-host interactions in Archaea. It is a lytic virus that exploits a unique egress mechanism based on the formation of remarkable pyramidal structures on the host cell envelope. Using whole-transcriptome sequencing, we present here a global map defining host and viral gene expression during the infection cycle of SIRV2 in its hyperthermophilic host S. islandicus LAL14/1. This information was used, in combination with a yeast two-hybrid analysis of SIRV2 protein interactions, to advance current understanding of viral gene functions. As a consequence of SIRV2 infection, transcription of more than one-third of S. islandicus genes was differentially regulated. While expression of genes involved in cell division decreased, those genes playing a role in antiviral defense were activated on a large scale. Expression of genes belonging to toxin-antitoxin and clustered regularly interspaced short palindromic repeat (CRISPR)-Cas systems was specifically pronounced. The observed different degree of activation of various CRISPR-Cas systems highlights the specialized functions they perform. The information on individual gene expression and activation of antiviral defense systems is expected to aid future studies aimed at detailed understanding of the functions and interplay of these systems in vivo.
Divergence and evolution of homologous regions of Bombyx mori nuclear polyhedrosis virus.
Majima, K; Kobara, R; Maeda, S
1993-01-01
Homologous regions (hrs) (hr1,hr2-left,hr2-right,hr3,hr4-left,hr 4-right, and hr5) similar to those found in the Autographa californica nuclear polyhedrosis virus (AcNPV) genome were found in the Bombyx mori NPV (BmNPV) genome. The BmNPV hrs contained two to eight repeats of a homologous nucleotide sequence which were on average about 75 bp long. All of these homologous sequence repeats contained a 26-bp-long palindrome motif with an EcoRI or EcoRI-like site at its core. The consensus sequence of the BmNPV hrs showed 95% conservation with respect to those found in AcNPV. Nucleotide sequence analysis indicated that hr2-left and hr2-right of BmNPV evolved from an ancestor similar to hr2 of AcNPV by inversion, cleavage, and ligation. The polarities of the BmNPV and AcNPV hrs were conserved except for that of hr4-left. Within hr4-right of BmNPV, four repeats of a previously underscribed palindrome motif were found. Bmhr5D, a BmNPV mutant which lacked hr5, replicated at a rate similar to that of wild-type BmNPV in BmN cells and silkworm larvae, indicating that hr5 was not essential for viral replication. After ten passages of Bmhr5D in BmN cells, no detectable changes in its genome were observed by restriction endonuclease analysis. The evolution and divergence of the BmNPV genome are also discussed. Images PMID:8230471
Geldenhuys, Marike; Mortlock, Marinda; Weyer, Jacqueline; Bezuidt, Oliver; Seamark, Ernest C J; Kearney, Teresa; Gleasner, Cheryl; Erkkila, Tracy H; Cui, Helen; Markotter, Wanda
2018-01-01
Species within the Neoromicia bat genus are abundant and widely distributed in Africa. It is common for these insectivorous bats to roost in anthropogenic structures in urban regions. Additionally, Neoromicia capensis have previously been identified as potential hosts for Middle East respiratory syndrome (MERS)-related coronaviruses. This study aimed to ascertain the gastrointestinal virome of these bats, as viruses excreted in fecal material or which may be replicating in rectal or intestinal tissues have the greatest opportunities of coming into contact with other hosts. Samples were collected in five regions of South Africa over eight years. Initial virome composition was determined by viral metagenomic sequencing by pooling samples and enriching for viral particles. Libraries were sequenced on the Illumina MiSeq and NextSeq500 platforms, producing a combined 37 million reads. Bioinformatics analysis of the high throughput sequencing data detected the full genome of a novel species of the Circoviridae family, and also identified sequence data from the Adenoviridae, Coronaviridae, Herpesviridae, Parvoviridae, Papillomaviridae, Phenuiviridae, and Picornaviridae families. Metagenomic sequencing data was insufficient to determine the viral diversity of certain families due to the fragmented coverage of genomes and lack of suitable sequencing depth, as some viruses were detected from the analysis of reads-data only. Follow up conventional PCR assays targeting conserved gene regions for the Adenoviridae, Coronaviridae, and Herpesviridae families were used to confirm metagenomic data and generate additional sequences to determine genetic diversity. The complete coding genome of a MERS-related coronavirus was recovered with additional amplicon sequencing on the MiSeq platform. The new genome shared 97.2% overall nucleotide identity to a previous Neoromicia-associated MERS-related virus, also from South Africa. Conventional PCR analysis detected diverse adenovirus and herpesvirus sequences that were widespread throughout Neoromicia populations in South Africa. Furthermore, similar adenovirus sequences were detected within these populations throughout several years. With the exception of the coronaviruses, the study represents the first report of sequence data from several viral families within a Southern African insectivorous bat genus; highlighting the need for continued investigations in this regard.
Geldenhuys, Marike; Mortlock, Marinda; Weyer, Jacqueline; Bezuidt, Oliver; Seamark, Ernest C. J.; Kearney, Teresa; Gleasner, Cheryl; Erkkila, Tracy H.; Cui, Helen; Markotter, Wanda
2018-01-01
Species within the Neoromicia bat genus are abundant and widely distributed in Africa. It is common for these insectivorous bats to roost in anthropogenic structures in urban regions. Additionally, Neoromicia capensis have previously been identified as potential hosts for Middle East respiratory syndrome (MERS)-related coronaviruses. This study aimed to ascertain the gastrointestinal virome of these bats, as viruses excreted in fecal material or which may be replicating in rectal or intestinal tissues have the greatest opportunities of coming into contact with other hosts. Samples were collected in five regions of South Africa over eight years. Initial virome composition was determined by viral metagenomic sequencing by pooling samples and enriching for viral particles. Libraries were sequenced on the Illumina MiSeq and NextSeq500 platforms, producing a combined 37 million reads. Bioinformatics analysis of the high throughput sequencing data detected the full genome of a novel species of the Circoviridae family, and also identified sequence data from the Adenoviridae, Coronaviridae, Herpesviridae, Parvoviridae, Papillomaviridae, Phenuiviridae, and Picornaviridae families. Metagenomic sequencing data was insufficient to determine the viral diversity of certain families due to the fragmented coverage of genomes and lack of suitable sequencing depth, as some viruses were detected from the analysis of reads-data only. Follow up conventional PCR assays targeting conserved gene regions for the Adenoviridae, Coronaviridae, and Herpesviridae families were used to confirm metagenomic data and generate additional sequences to determine genetic diversity. The complete coding genome of a MERS-related coronavirus was recovered with additional amplicon sequencing on the MiSeq platform. The new genome shared 97.2% overall nucleotide identity to a previous Neoromicia-associated MERS-related virus, also from South Africa. Conventional PCR analysis detected diverse adenovirus and herpesvirus sequences that were widespread throughout Neoromicia populations in South Africa. Furthermore, similar adenovirus sequences were detected within these populations throughout several years. With the exception of the coronaviruses, the study represents the first report of sequence data from several viral families within a Southern African insectivorous bat genus; highlighting the need for continued investigations in this regard. PMID:29579103
Forterre, Patrick
2013-01-01
Viruses have been considered for a long time as by-products of biological evolution. This view is changing now as a result of several recent discoveries. Viral ecologists have shown that viral particles are the most abundant biological entities on our planet, whereas metagenomic analyses have revealed an unexpected abundance and diversity of viral genes in the biosphere. Comparative genomics have highlighted the uniqueness of viral sequences, in contradiction with the traditional view of viruses as pickpockets of cellular genes. On the contrary, cellular genomes, especially eukaryotic ones, turned out to be full of genes derived from viruses or related elements (plasmids, transposons, retroelements and so on). The discovery of unusual viruses infecting archaea has shown that the viral world is much more diverse than previously thought, ruining the traditional dichotomy between bacteriophages and viruses. Finally, the discovery of giant viruses has blurred the traditional image of viruses as small entities. Furthermore, essential clues on virus history have been obtained in the last ten years. In particular, structural analyses of capsid proteins have uncovered deeply rooted homologies between viruses infecting different cellular domains, suggesting that viruses originated before the last universal common ancestor (LUCA). These studies have shown that several lineages of viruses originated independently, i.e., viruses are polyphyletic. From the time of LUCA, viruses have coevolved with their hosts, and viral lineages can be viewed as lianas wrapping around the trunk, branches and leaves of the tree of life. Although viruses are very diverse, with genomes encoding from one to more than one thousand proteins, they can all be simply defined as organisms producing virions. Virions themselves can be defined as infectious particles made of at least one protein associated with the viral nucleic acid, endowed with the capability to protect the viral genome and ensure its delivery to the infected cell. These definitions, which clearly distinguish viruses from plasmids, suggest that infectious RNA molecules that only encode an RNA replicase presently classified among viruses by the ICTV (International Committee for the Taxonomy of Viruses) into families of Endornaviridae and Hypoviridae are in fact RNA plasmids. Since a viral genome should encode for at least one structural protein, these definitions also imply that viruses originated after the emergence of the ribosome in an RNA-protein cellular world. Although virions are the hallmarks of viruses, viruses and virions should not be confused. The infection transforms the ribocell (cell encoding ribosomes and dividing by binary fission) into a virocell (cell producing virions) or ribovirocell (cell that produces virions but can still divide by binary fission). In the ribovirocell, two different organisms, defined by their distinct evolutionary histories, coexist in symbiosis in the same cell. The virocells or ribovirocells are the living forms of the virus, which can be in fine considered to be a living organism. In the virocell, the metabolism is reorganized for the production of virions, while the ability to capture and store free energy is retained, as in other cellular organisms. In the virocell, viral genomes replicate, recombine and evolve, leading to the emergence of new viral proteins and potentially novel functions. Some of these new functions can be later on transferred to the cell, explaining how viruses can play a major (often underestimated) role in the evolution of cellular organisms. The virocell concept thus helps to understand recent hypotheses suggesting that viruses played a critical role in major evolutionary transitions, such as the origin of DNA genomes or else the origin of the eukaryotic nucleus. Finally, it is more and more recognized that viruses are the major source of variation and selection in living organisms (both viruses and cells), the two pillars of darwinism. One can thus conclude that the continuous interaction between viruses and cells, all along the history of life, has been, and still is, a major engine of biological evolution. © Société de Biologie, 2013.
PCR Amplification Strategies towards full-length HIV-1 Genome sequencing.
Liu, Chao Chun; Ji, Hezhao
2018-06-26
The advent of next generation sequencing has enabled greater resolution of viral diversity and improved feasibility of full viral genome sequencing allowing routine HIV-1 full genome sequencing in both research and diagnostic settings. Regardless of the sequencing platform selected, successful PCR amplification of the HIV-1 genome is essential for sequencing template preparation. As such, full HIV-1 genome amplification is a crucial step in dictating the successful and reliable sequencing downstream. Here we reviewed existing PCR protocols leading to HIV-1 full genome sequencing. In addition to the discussion on basic considerations on relevant PCR design, the advantages as well as the pitfalls of published protocols were reviewed. Copyright© Bentham Science Publishers; For any queries, please email at epub@benthamscience.org.
Steel, Olivia; Kraberger, Simona; Sikorski, Alyssa; Young, Laura M; Catchpole, Ryan J; Stevens, Aaron J; Ladley, Jenny J; Coray, Dorien S; Stainton, Daisy; Dayaram, Anisha; Julian, Laurel; van Bysterveldt, Katherine; Varsani, Arvind
2016-09-01
In recent years, innovations in molecular techniques and sequencing technologies have resulted in a rapid expansion in the number of known viral sequences, in particular those with circular replication-associated protein (Rep)-encoding single-stranded (CRESS) DNA genomes. CRESS DNA viruses are present in the virome of many ecosystems and are known to infect a wide range of organisms. A large number of the recently identified CRESS DNA viruses cannot be classified into any known viral families, indicating that the current view of CRESS DNA viral sequence space is greatly underestimated. Animal faecal matter has proven to be a particularly useful source for sampling CRESS DNA viruses in an ecosystem, as it is cost-effective and non-invasive. In this study a viral metagenomic approach was used to explore the diversity of CRESS DNA viruses present in the faeces of domesticated and wild animals in New Zealand. Thirty-eight complete CRESS DNA viral genomes and two circular molecules (that may be defective molecules or single components of multicomponent genomes) were identified from forty-nine individual animal faecal samples. Based on shared genome organisations and sequence similarities, eighteen of the isolates were classified as gemycircularviruses and twelve isolates were classified as smacoviruses. The remaining eight isolates lack significant sequence similarity with any members of known CRESS DNA virus groups. This research adds significantly to our knowledge of CRESS DNA viral diversity in New Zealand, emphasising the prevalence of CRESS DNA viruses in nature, and reinforcing the suggestion that a large proportion of CRESS DNA viruses are yet to be identified. Copyright © 2016 Elsevier B.V. All rights reserved.
Kanda, Teru; Furuse, Yuki; Oshitani, Hitoshi; Kiyono, Tohru
2016-05-01
The Epstein-Barr virus (EBV) is etiologically linked to approximately 10% of gastric cancers, in which viral genomes are maintained as multicopy episomes. EBV-positive gastric cancer cells are incompetent for progeny virus production, making viral DNA cloning extremely difficult. Here we describe a highly efficient strategy for obtaining bacterial artificial chromosome (BAC) clones of EBV episomes by utilizing a CRISPR/Cas9-mediated strand break of the viral genome and subsequent homology-directed repair. EBV strains maintained in two gastric cancer cell lines (SNU719 and YCCEL1) were cloned, and their complete viral genome sequences were determined. Infectious viruses of gastric cancer cell-derived EBVs were reconstituted, and the viruses established stable latent infections in immortalized keratinocytes. While Ras oncoprotein overexpression caused massive vacuolar degeneration and cell death in control keratinocytes, EBV-infected keratinocytes survived in the presence of Ras expression. These results implicate EBV infection in predisposing epithelial cells to malignant transformation by inducing resistance to oncogene-induced cell death. Recent progress in DNA-sequencing technology has accelerated EBV whole-genome sequencing, and the repertoire of sequenced EBV genomes is increasing progressively. Accordingly, the presence of EBV variant strains that may be relevant to EBV-associated diseases has begun to attract interest. Clearly, the determination of additional disease-associated viral genome sequences will facilitate the identification of any disease-specific EBV variants. We found that CRISPR/Cas9-mediated cleavage of EBV episomal DNA enabled the cloning of disease-associated viral strains with unprecedented efficiency. As a proof of concept, two gastric cancer cell-derived EBV strains were cloned, and the infection of epithelial cells with reconstituted viruses provided important clues about the mechanism of EBV-mediated epithelial carcinogenesis. This experimental system should contribute to establishing the relationship between viral genome variation and EBV-associated diseases. Copyright © 2016, American Society for Microbiology. All Rights Reserved.
Liu, Zhong-Yu; Li, Xiao-Feng; Jiang, Tao; Deng, Yong-Qiang; Zhao, Hui; Wang, Hong-Jiang; Ye, Qing; Zhu, Shun-Ya; Qiu, Yang; Zhou, Xi; Qin, E-De; Qin, Cheng-Feng
2013-06-01
cis-Acting elements in the viral genome RNA (vRNA) are essential for the translation, replication, and/or encapsidation of RNA viruses. In this study, a novel conserved cis-acting element was identified in the capsid-coding region of mosquito-borne flavivirus. The downstream of 5' cyclization sequence (5'CS) pseudoknot (DCS-PK) element has a three-stem pseudoknot structure, as demonstrated by structure prediction and biochemical analysis. Using dengue virus as a model, we show that DCS-PK enhances vRNA replication and that its function depends on its secondary structure and specific primary sequence. Mutagenesis revealed that the highly conserved stem 1 and loop 2, which are involved in potential loop-helix interactions, are crucial for DCS-PK function. A predicted loop 1-stem 3 base triple interaction is important for the structural stability and function of DCS-PK. Moreover, the function of DCS-PK depends on its position relative to the 5'CS, and the presence of DCS-PK facilitates the formation of 5'-3' RNA complexes. Taken together, our results reveal that the cis-acting element DCS-PK enhances vRNA replication by regulating genome cyclization, and DCS-PK might interplay with other cis-acting elements to form a functional vRNA cyclization domain, thus playing critical roles during the flavivirus life cycle and evolution.
Liu, Zhong-Yu; Li, Xiao-Feng; Jiang, Tao; Deng, Yong-Qiang; Zhao, Hui; Wang, Hong-Jiang; Ye, Qing; Zhu, Shun-Ya; Qiu, Yang; Zhou, Xi; Qin, E-De
2013-01-01
cis-Acting elements in the viral genome RNA (vRNA) are essential for the translation, replication, and/or encapsidation of RNA viruses. In this study, a novel conserved cis-acting element was identified in the capsid-coding region of mosquito-borne flavivirus. The downstream of 5′ cyclization sequence (5′CS) pseudoknot (DCS-PK) element has a three-stem pseudoknot structure, as demonstrated by structure prediction and biochemical analysis. Using dengue virus as a model, we show that DCS-PK enhances vRNA replication and that its function depends on its secondary structure and specific primary sequence. Mutagenesis revealed that the highly conserved stem 1 and loop 2, which are involved in potential loop-helix interactions, are crucial for DCS-PK function. A predicted loop 1-stem 3 base triple interaction is important for the structural stability and function of DCS-PK. Moreover, the function of DCS-PK depends on its position relative to the 5′CS, and the presence of DCS-PK facilitates the formation of 5′-3′ RNA complexes. Taken together, our results reveal that the cis-acting element DCS-PK enhances vRNA replication by regulating genome cyclization, and DCS-PK might interplay with other cis-acting elements to form a functional vRNA cyclization domain, thus playing critical roles during the flavivirus life cycle and evolution. PMID:23576500
Fitness cost of reassortment in human influenza.
Villa, Mara; Lässig, Michael
2017-11-01
Reassortment, which is the exchange of genome sequence between viruses co-infecting a host cell, plays an important role in the evolution of segmented viruses. In the human influenza virus, reassortment happens most frequently between co-existing variants within the same lineage. This process breaks genetic linkage and fitness correlations between viral genome segments, but the resulting net effect on viral fitness has remained unclear. In this paper, we determine rate and average selective effect of reassortment processes in the human influenza lineage A/H3N2. For the surface proteins hemagglutinin and neuraminidase, reassortant variants with a mean distance of at least 3 nucleotides to their parent strains get established at a rate of about 10-2 in units of the neutral point mutation rate. Our inference is based on a new method to map reassortment events from joint genealogies of multiple genome segments, which is tested by extensive simulations. We show that intra-lineage reassortment processes are, on average, under substantial negative selection that increases in strength with increasing sequence distance between the parent strains. The deleterious effects of reassortment manifest themselves in two ways: there are fewer reassortment events than expected from a null model of neutral reassortment, and reassortant strains have fewer descendants than their non-reassortant counterparts. Our results suggest that influenza evolves under ubiquitous epistasis across proteins, which produces fitness barriers against reassortment even between co-circulating strains within one lineage.
Carlson, Jonathan M.; Chan, Benjamin; Chopera, Denis R.; Brumme, Chanson J.; Markle, Tristan J.; Martin, Eric; Shahid, Aniqa; Anmole, Gursev; Mwimanzi, Philip; Nassab, Pauline; Penney, Kali A.; Rahman, Manal A.; Milloy, M.-J.; Schechter, Martin T.; Markowitz, Martin; Carrington, Mary; Walker, Bruce D.; Wagner, Theresa; Buchbinder, Susan; Fuchs, Jonathan; Koblin, Beryl; Mayer, Kenneth H.; Harrigan, P. Richard; Brockman, Mark A.; Poon, Art F. Y.; Brumme, Zabrina L.
2014-01-01
HLA-restricted immune escape mutations that persist following HIV transmission could gradually spread through the viral population, thereby compromising host antiviral immunity as the epidemic progresses. To assess the extent and phenotypic impact of this phenomenon in an immunogenetically diverse population, we genotypically and functionally compared linked HLA and HIV (Gag/Nef) sequences from 358 historic (1979–1989) and 382 modern (2000–2011) specimens from four key cities in the North American epidemic (New York, Boston, San Francisco, Vancouver). Inferred HIV phylogenies were star-like, with approximately two-fold greater mean pairwise distances in modern versus historic sequences. The reconstructed epidemic ancestral (founder) HIV sequence was essentially identical to the North American subtype B consensus. Consistent with gradual diversification of a “consensus-like” founder virus, the median “background” frequencies of individual HLA-associated polymorphisms in HIV (in individuals lacking the restricting HLA[s]) were ∼2-fold higher in modern versus historic HIV sequences, though these remained notably low overall (e.g. in Gag, medians were 3.7% in the 2000s versus 2.0% in the 1980s). HIV polymorphisms exhibiting the greatest relative spread were those restricted by protective HLAs. Despite these increases, when HIV sequences were analyzed as a whole, their total average burden of polymorphisms that were “pre-adapted” to the average host HLA profile was only ∼2% greater in modern versus historic eras. Furthermore, HLA-associated polymorphisms identified in historic HIV sequences were consistent with those detectable today, with none identified that could explain the few HIV codons where the inferred epidemic ancestor differed from the modern consensus. Results are therefore consistent with slow HIV adaptation to HLA, but at a rate unlikely to yield imminent negative implications for cellular immunity, at least in North America. Intriguingly, temporal changes in protein activity of patient-derived Nef (though not Gag) sequences were observed, suggesting functional implications of population-level HIV evolution on certain viral proteins. PMID:24762668
Metavir 2: new tools for viral metagenome comparison and assembled virome analysis
2014-01-01
Background Metagenomics, based on culture-independent sequencing, is a well-fitted approach to provide insights into the composition, structure and dynamics of environmental viral communities. Following recent advances in sequencing technologies, new challenges arise for existing bioinformatic tools dedicated to viral metagenome (i.e. virome) analysis as (i) the number of viromes is rapidly growing and (ii) large genomic fragments can now be obtained by assembling the huge amount of sequence data generated for each metagenome. Results To face these challenges, a new version of Metavir was developed. First, all Metavir tools have been adapted to support comparative analysis of viromes in order to improve the analysis of multiple datasets. In addition to the sequence comparison previously provided, viromes can now be compared through their k-mer frequencies, their taxonomic compositions, recruitment plots and phylogenetic trees containing sequences from different datasets. Second, a new section has been specifically designed to handle assembled viromes made of thousands of large genomic fragments (i.e. contigs). This section includes an annotation pipeline for uploaded viral contigs (gene prediction, similarity search against reference viral genomes and protein domains) and an extensive comparison between contigs and reference genomes. Contigs and their annotations can be explored on the website through specifically developed dynamic genomic maps and interactive networks. Conclusions The new features of Metavir 2 allow users to explore and analyze viromes composed of raw reads or assembled fragments through a set of adapted tools and a user-friendly interface. PMID:24646187
Haseloff, J; Goelet, P; Zimmern, D; Ahlquist, P; Dasgupta, R; Kaesberg, P
1984-01-01
The plant viruses alfalfa mosaic virus (AMV) and brome mosaic virus (BMV) each divide their genetic information among three RNAs while tobacco mosaic virus (TMV) contains a single genomic RNA. Amino acid sequence comparisons suggest that the single proteins encoded by AMV RNA 1 and BMV RNA 1 and by AMV RNA 2 and BMV RNA 2 are related to the NH2-terminal two-thirds and the COOH-terminal one-third, respectively, of the largest protein encoded by TMV. Separating these two domains in the TMV RNA sequence is an amber termination codon, whose partial suppression allows translation of the downstream domain. Many of the residues that the TMV read-through domain and the segmented plant viruses have in common are also conserved in a read-through domain found in the nonstructural polyprotein of the animal alphaviruses Sindbis and Middelburg. We suggest that, despite substantial differences in gene organization and expression, all of these viruses use related proteins for common functions in RNA replication. Reassortment of functional modules of coding and regulatory sequence from preexisting viral or cellular sources, perhaps via RNA recombination, may be an important mechanism in RNA virus evolution. PMID:6611550
Mutation of HIV-1 genomes in a clinical population treated with the mutagenic nucleoside KP1461.
Mullins, James I; Heath, Laura; Hughes, James P; Kicha, Jessica; Styrchak, Sheila; Wong, Kim G; Rao, Ushnal; Hansen, Alexis; Harris, Kevin S; Laurent, Jean-Pierre; Li, Deyu; Simpson, Jeffrey H; Essigmann, John M; Loeb, Lawrence A; Parkins, Jeffrey
2011-01-14
The deoxycytidine analog KP1212, and its prodrug KP1461, are prototypes of a new class of antiretroviral drugs designed to increase viral mutation rates, with the goal of eventually causing the collapse of the viral population. Here we present an extensive analysis of viral sequences from HIV-1 infected volunteers from the first "mechanism validation" phase II clinical trial of a mutagenic base analog in which individuals previously treated with antiviral drugs received 1600 mg of KP1461 twice per day for 124 days. Plasma viral loads were not reduced, and overall levels of viral mutation were not increased during this short-term study, however, the mutation spectrum of HIV was altered. A large number (N = 105 per sample) of sequences were analyzed, each derived from individual HIV-1 RNA templates, after 0, 56 and 124 days of therapy from 10 treated and 10 untreated control individuals (>7.1 million base pairs of unique viral templates were sequenced). We found that private mutations, those not found in more than one viral sequence and likely to have occurred in the most recent rounds of replication, increased in treated individuals relative to controls after 56 (p = 0.038) and 124 (p = 0.002) days of drug treatment. The spectrum of mutations observed in the treated group showed an excess of A to G and G to A mutations (p = 0.01), and to a lesser extent T to C and C to T mutations (p = 0.09), as predicted by the mechanism of action of the drug. These results validate the proposed mechanism of action in humans and should spur development of this novel antiretroviral approach.
Mutation of HIV-1 Genomes in a Clinical Population Treated with the Mutagenic Nucleoside KP1461
Mullins, James I.; Heath, Laura; Hughes, James P.; Kicha, Jessica; Styrchak, Sheila; Wong, Kim G.; Rao, Ushnal; Hansen, Alexis; Harris, Kevin S.; Laurent, Jean-Pierre; Li, Deyu; Simpson, Jeffrey H.; Essigmann, John M.; Loeb, Lawrence A.; Parkins, Jeffrey
2011-01-01
The deoxycytidine analog KP1212, and its prodrug KP1461, are prototypes of a new class of antiretroviral drugs designed to increase viral mutation rates, with the goal of eventually causing the collapse of the viral population. Here we present an extensive analysis of viral sequences from HIV-1 infected volunteers from the first “mechanism validation” phase II clinical trial of a mutagenic base analog in which individuals previously treated with antiviral drugs received 1600 mg of KP1461 twice per day for 124 days. Plasma viral loads were not reduced, and overall levels of viral mutation were not increased during this short-term study, however, the mutation spectrum of HIV was altered. A large number (N = 105 per sample) of sequences were analyzed, each derived from individual HIV-1 RNA templates, after 0, 56 and 124 days of therapy from 10 treated and 10 untreated control individuals (>7.1 million base pairs of unique viral templates were sequenced). We found that private mutations, those not found in more than one viral sequence and likely to have occurred in the most recent rounds of replication, increased in treated individuals relative to controls after 56 (p = 0.038) and 124 (p = 0.002) days of drug treatment. The spectrum of mutations observed in the treated group showed an excess of A to G and G to A mutations (p = 0.01), and to a lesser extent T to C and C to T mutations (p = 0.09), as predicted by the mechanism of action of the drug. These results validate the proposed mechanism of action in humans and should spur development of this novel antiretroviral approach. PMID:21264288
Anderson, Tavis K; Laegreid, William W; Cerutti, Francesco; Osorio, Fernando A; Nelson, Eric A; Christopher-Hennings, Jane; Goldberg, Tony L
2012-06-15
The extraordinary genetic and antigenic variability of RNA viruses is arguably the greatest challenge to the development of broadly effective vaccines. No single viral variant can induce sufficiently broad immunity, and incorporating all known naturally circulating variants into one multivalent vaccine is not feasible. Furthermore, no objective strategies currently exist to select actual viral variants that should be included or excluded in polyvalent vaccines. To address this problem, we demonstrate a method based on graph theory that quantifies the relative importance of viral variants. We demonstrate our method through application to the envelope glycoprotein gene of a particularly diverse RNA virus of pigs: porcine reproductive and respiratory syndrome virus (PRRSV). Using distance matrices derived from sequence nucleotide difference, amino acid difference and evolutionary distance, we constructed viral networks and used common network statistics to assign each sequence an objective ranking of relative 'importance'. To validate our approach, we use an independent published algorithm to score our top-ranked wild-type variants for coverage of putative T-cell epitopes across the 9383 sequences in our dataset. Top-ranked viruses achieve significantly higher coverage than low-ranked viruses, and top-ranked viruses achieve nearly equal coverage as a synthetic mosaic protein constructed in silico from the same set of 9383 sequences. Our approach relies on the network structure of PRRSV but applies to any diverse RNA virus because it identifies subsets of viral variants that are most important to overall viral diversity. We suggest that this method, through the objective quantification of variant importance, provides criteria for choosing viral variants for further characterization, diagnostics, surveillance and ultimately polyvalent vaccine development.
The Enigmatic Origin of Papillomavirus Protein Domains
Kirsip, Heleri; Gaston, Kevin
2017-01-01
Almost a century has passed since the discovery of papillomaviruses. A few decades of research have given a wealth of information on the molecular biology of papillomaviruses. Several excellent studies have been performed looking at the long- and short-term evolution of these viruses. However, when and how papillomaviruses originate is still a mystery. In this study, we systematically searched the (sequenced) biosphere to find distant homologs of papillomaviral protein domains. Our data show that, even including structural information, which allows us to find deeper evolutionary relationships compared to sequence-only based methods, only half of the protein domains in papillomaviruses have relatives in the rest of the biosphere. We show that the major capsid protein L1 and the replication protein E1 have relatives in several viral families, sharing three protein domains with Polyomaviridae and Parvoviridae. However, only the E1 replication protein has connections with cellular organisms. Most likely, the papillomavirus ancestor is of marine origin, a biotope that is not very well sequenced at the present time. Nevertheless, there is no evidence as to how papillomaviruses originated and how they became vertebrate and epithelium specific. PMID:28832519
The Enigmatic Origin of Papillomavirus Protein Domains.
Puustusmaa, Mikk; Kirsip, Heleri; Gaston, Kevin; Abroi, Aare
2017-08-23
Almost a century has passed since the discovery of papillomaviruses. A few decades of research have given a wealth of information on the molecular biology of papillomaviruses. Several excellent studies have been performed looking at the long- and short-term evolution of these viruses. However, when and how papillomaviruses originate is still a mystery. In this study, we systematically searched the (sequenced) biosphere to find distant homologs of papillomaviral protein domains. Our data show that, even including structural information, which allows us to find deeper evolutionary relationships compared to sequence-only based methods, only half of the protein domains in papillomaviruses have relatives in the rest of the biosphere. We show that the major capsid protein L1 and the replication protein E1 have relatives in several viral families, sharing three protein domains with Polyomaviridae and Parvoviridae . However, only the E1 replication protein has connections with cellular organisms. Most likely, the papillomavirus ancestor is of marine origin, a biotope that is not very well sequenced at the present time. Nevertheless, there is no evidence as to how papillomaviruses originated and how they became vertebrate and epithelium specific.
Transposases are the most abundant, most ubiquitous genes in nature.
Aziz, Ramy K; Breitbart, Mya; Edwards, Robert A
2010-07-01
Genes, like organisms, struggle for existence, and the most successful genes persist and widely disseminate in nature. The unbiased determination of the most successful genes requires access to sequence data from a wide range of phylogenetic taxa and ecosystems, which has finally become achievable thanks to the deluge of genomic and metagenomic sequences. Here, we analyzed 10 million protein-encoding genes and gene tags in sequenced bacterial, archaeal, eukaryotic and viral genomes and metagenomes, and our analysis demonstrates that genes encoding transposases are the most prevalent genes in nature. The finding that these genes, classically considered as selfish genes, outnumber essential or housekeeping genes suggests that they offer selective advantage to the genomes and ecosystems they inhabit, a hypothesis in agreement with an emerging body of literature. Their mobile nature not only promotes dissemination of transposable elements within and between genomes but also leads to mutations and rearrangements that can accelerate biological diversification and--consequently--evolution. By securing their own replication and dissemination, transposases guarantee to thrive so long as nucleic acid-based life forms exist.
Modelling the Evolution and Spread of HIV Immune Escape Mutants
Fryer, Helen R.; Frater, John; Duda, Anna; Roberts, Mick G.; Phillips, Rodney E.; McLean, Angela R.
2010-01-01
During infection with human immunodeficiency virus (HIV), immune pressure from cytotoxic T-lymphocytes (CTLs) selects for viral mutants that confer escape from CTL recognition. These escape variants can be transmitted between individuals where, depending upon their cost to viral fitness and the CTL responses made by the recipient, they may revert. The rates of within-host evolution and their concordant impact upon the rate of spread of escape mutants at the population level are uncertain. Here we present a mathematical model of within-host evolution of escape mutants, transmission of these variants between hosts and subsequent reversion in new hosts. The model is an extension of the well-known SI model of disease transmission and includes three further parameters that describe host immunogenetic heterogeneity and rates of within host viral evolution. We use the model to explain why some escape mutants appear to have stable prevalence whilst others are spreading through the population. Further, we use it to compare diverse datasets on CTL escape, highlighting where different sources agree or disagree on within-host evolutionary rates. The several dozen CTL epitopes we survey from HIV-1 gag, RT and nef reveal a relatively sedate rate of evolution with average rates of escape measured in years and reversion in decades. For many epitopes in HIV, occasional rapid within-host evolution is not reflected in fast evolution at the population level. PMID:21124991
Wymant, Chris; Colijn, Caroline; Danaviah, Siva; Essex, Max; Frost, Simon; Gall, Astrid; Gaseitsiwe, Simani; Grabowski, Mary K.; Gray, Ronald; Guindon, Stephane; von Haeseler, Arndt; Kaleebu, Pontiano; Kendall, Michelle; Kozlov, Alexey; Manasa, Justen; Minh, Bui Quang; Moyo, Sikhulile; Novitsky, Vlad; Nsubuga, Rebecca; Pillay, Sureshnee; Quinn, Thomas C.; Serwadda, David; Ssemwanga, Deogratius; Stamatakis, Alexandros; Trifinopoulos, Jana; Wawer, Maria; Brown, Andy Leigh; de Oliveira, Tulio; Kellam, Paul; Pillay, Deenan; Fraser, Christophe
2017-01-01
Abstract To characterize HIV-1 transmission dynamics in regions where the burden of HIV-1 is greatest, the “Phylogenetics and Networks for Generalised HIV Epidemics in Africa” consortium (PANGEA-HIV) is sequencing full-genome viral isolates from across sub-Saharan Africa. We report the first 3,985 PANGEA-HIV consensus sequences from four cohort sites (Rakai Community Cohort Study, n = 2,833; MRC/UVRI Uganda, n = 701; Mochudi Prevention Project, n = 359; Africa Health Research Institute Resistance Cohort, n = 92). Next-generation sequencing success rates varied: more than 80% of the viral genome from the gag to the nef genes could be determined for all sequences from South Africa, 75% of sequences from Mochudi, 60% of sequences from MRC/UVRI Uganda, and 22% of sequences from Rakai. Partial sequencing failure was primarily associated with low viral load, increased for amplicons closer to the 3′ end of the genome, was not associated with subtype diversity except HIV-1 subtype D, and remained significantly associated with sampling location after controlling for other factors. We assessed the impact of the missing data patterns in PANGEA-HIV sequences on phylogeny reconstruction in simulations. We found a threshold in terms of taxon sampling below which the patchy distribution of missing characters in next-generation sequences (NGS) has an excess negative impact on the accuracy of HIV-1 phylogeny reconstruction, which is attributable to tree reconstruction artifacts that accumulate when branches in viral trees are long. The large number of PANGEA-HIV sequences provides unprecedented opportunities for evaluating HIV-1 transmission dynamics across sub-Saharan Africa and identifying prevention opportunities. Molecular epidemiological analyses of these data must proceed cautiously because sequence sampling remains below the identified threshold and a considerable negative impact of missing characters on phylogeny reconstruction is expected. PMID:28540766
Ratmann, Oliver; Wymant, Chris; Colijn, Caroline; Danaviah, Siva; Essex, M; Frost, Simon D W; Gall, Astrid; Gaiseitsiwe, Simani; Grabowski, Mary; Gray, Ronald; Guindon, Stephane; von Haeseler, Arndt; Kaleebu, Pontiano; Kendall, Michelle; Kozlov, Alexey; Manasa, Justen; Minh, Bui Quang; Moyo, Sikhulile; Novitsky, Vladimir; Nsubuga, Rebecca; Pillay, Sureshnee; Quinn, Thomas C; Serwadda, David; Ssemwanga, Deogratius; Stamatakis, Alexandros; Trifinopoulos, Jana; Wawer, Maria; Leigh Brown, Andrew; de Oliveira, Tulio; Kellam, Paul; Pillay, Deenan; Fraser, Christophe
2017-05-25
To characterize HIV-1 transmission dynamics in regions where the burden of HIV-1 is greatest, the 'Phylogenetics and Networks for Generalised HIV Epidemics in Africa' consortium (PANGEA-HIV) is sequencing full-genome viral isolates from across sub-Saharan Africa. We report the first 3,985 PANGEA-HIV consensus sequences from four cohort sites (Rakai Community Cohort Study, n=2,833; MRC/UVRI Uganda, n=701; Mochudi Prevention Project, n=359; Africa Health Research Institute Resistance Cohort, n=92). Next-generation sequencing success rates varied: more than 80% of the viral genome from the gag to the nef genes could be determined for all sequences from South Africa, 75% of sequences from Mochudi, 60% of sequences from MRC/UVRI Uganda, and 22% of sequences from Rakai. Partial sequencing failure was primarily associated with low viral load, increased for amplicons closer to the 3' end of the genome, was not associated with subtype diversity except HIV-1 subtype D, and remained significantly associated with sampling location after controlling for other factors. We assessed the impact of the missing data patterns in PANGEA-HIV sequences on phylogeny reconstruction in simulations. We found a threshold in terms of taxon sampling below which the patchy distribution of missing characters in next-generation sequences has an excess negative impact on the accuracy of HIV-1 phylogeny reconstruction, which is attributable to tree reconstruction artifacts that accumulate when branches in viral trees are long. The large number of PANGEA-HIV sequences provides unprecedented opportunities for evaluating HIV-1 transmission dynamics across sub-Saharan Africa and identifying prevention opportunities. Molecular epidemiological analyses of these data must proceed cautiously because sequence sampling remains below the identified threshold and a considerable negative impact of missing characters on phylogeny reconstruction is expected.
Atkins, Katherine E; Read, Andrew F; Savill, Nicholas J; Renz, Katrin G; Islam, A F M Fakhrul; Walkden-Brown, Stephen W; Woolhouse, Mark E J
2013-03-01
Marek's disease virus (MDV), a commercially important disease of poultry, has become substantially more virulent over the last 60 years. This evolution was presumably a consequence of changes in virus ecology associated with the intensification of the poultry industry. Here, we assess whether vaccination or reduced host life span could have generated natural selection, which favored more virulent strains. Using previously published experimental data, we estimated viral fitness under a range of cohort durations and vaccine treatments on broiler farms. We found that viral fitness maximized at intermediate virulence, as a result of a trade-off between virulence and transmission previously reported. Our results suggest that vaccination, acting on this trade-off, could have led to the evolution of increased virulence. By keeping the host alive, vaccination prolongs infectious periods of virulent strains. Improvements in host genetics and nutrition, which reduced broiler life spans below 50 days, could have also increased the virulence of the circulating MDV strains because shortened cohort duration reduces the impact of host death on viral fitness. These results illustrate the dramatic impact anthropogenic change can potentially have on pathogen virulence. © 2012 The Author(s). Evolution© 2012 The Society for the Study of Evolution.
Madi, Nada; Al-Nakib, Widad; Mustafa, Abu Salim; Habibi, Nazima
2018-03-01
A metagenomic approach based on target independent next-generation sequencing has become a known method for the detection of both known and novel viruses in clinical samples. This study aimed to use the metagenomic sequencing approach to characterize the viral diversity in respiratory samples from patients with respiratory tract infections. We have investigated 86 respiratory samples received from various hospitals in Kuwait between 2015 and 2016 for the diagnosis of respiratory tract infections. A metagenomic approach using the next-generation sequencer to characterize viruses was used. According to the metagenomic analysis, an average of 145, 019 reads were identified, and 2% of these reads were of viral origin. Also, metagenomic analysis of the viral sequences revealed many known respiratory viruses, which were detected in 30.2% of the clinical samples. Also, sequences of non-respiratory viruses were detected in 14% of the clinical samples, while sequences of non-human viruses were detected in 55.8% of the clinical samples. The average genome coverage of the viruses was 12% with the highest genome coverage of 99.2% for respiratory syncytial virus, and the lowest was 1% for torque teno midi virus 2. Our results showed 47.7% agreement between multiplex Real-Time PCR and metagenomics sequencing in the detection of respiratory viruses in the clinical samples. Though there are some difficulties in using this method to clinical samples such as specimen quality, these observations are indicative of the promising utility of the metagenomic sequencing approach for the identification of respiratory viruses in patients with respiratory tract infections. © 2017 Wiley Periodicals, Inc.
Hirose, Yusuke; Onuki, Mamiko; Tenjimbayashi, Yuri; Mori, Seiichiro; Ishii, Yoshiyuki; Takeuchi, Takamasa; Tasaka, Nobutaka; Satoh, Toyomi; Morisada, Tohru; Iwata, Takashi; Miyamoto, Shingo; Matsumoto, Koji; Sekizawa, Akihiko; Kukimoto, Iwao
2018-06-15
Persistent infection with oncogenic human papillomaviruses (HPVs) causes cervical cancer, accompanied by the accumulation of somatic mutations into the host genome. There are concomitant genetic changes in the HPV genome during viral infection; however, their relevance to cervical carcinogenesis is poorly understood. Here, we explored within-host genetic diversity of HPV by performing deep-sequencing analyses of viral whole-genome sequences in clinical specimens. The whole genomes of HPV types 16, 52, and 58 were amplified by type-specific PCR from total cellular DNA of cervical exfoliated cells collected from patients with cervical intraepithelial neoplasia (CIN) and invasive cervical cancer (ICC) and were deep sequenced. After constructing a reference viral genome sequence for each specimen, nucleotide positions showing changes with >0.5% frequencies compared to the reference sequence were determined for individual samples. In total, 1,052 positions of nucleotide variations were detected in HPV genomes from 151 samples (CIN1, n = 56; CIN2/3, n = 68; ICC, n = 27), with various numbers per sample. Overall, C-to-T and C-to-A substitutions were the dominant changes observed across all histological grades. While C-to-T transitions were predominantly detected in CIN1, their prevalence was decreased in CIN2/3 and fell below that of C-to-A transversions in ICC. Analysis of the trinucleotide context encompassing substituted bases revealed that TpCpN, a preferred target sequence for cellular APOBEC cytosine deaminases, was a primary site for C-to-T substitutions in the HPV genome. These results strongly imply that the APOBEC proteins are drivers of HPV genome mutation, particularly in CIN1 lesions. IMPORTANCE HPVs exhibit surprisingly high levels of genetic diversity, including a large repertoire of minor genomic variants in each viral genotype. Here, by conducting deep-sequencing analyses, we show for the first time a comprehensive snapshot of the within-host genetic diversity of high-risk HPVs during cervical carcinogenesis. Quasispecies harboring minor nucleotide variations in viral whole-genome sequences were extensively observed across different grades of CIN and cervical cancer. Among the within-host variations, C-to-T transitions, a characteristic change mediated by cellular APOBEC cytosine deaminases, were predominantly detected throughout the whole viral genome, most strikingly in low-grade CIN lesions. The results strongly suggest that within-host variations of the HPV genome are primarily generated through the interaction with host cell DNA-editing enzymes and that such within-host variability is an evolutionary source of the genetic diversity of HPVs. Copyright © 2018 American Society for Microbiology.
Error catastrophe and phase transition in the empirical fitness landscape of HIV
NASA Astrophysics Data System (ADS)
Hart, Gregory R.; Ferguson, Andrew L.
2015-03-01
We have translated clinical sequence databases of the p6 HIV protein into an empirical fitness landscape quantifying viral replicative capacity as a function of the amino acid sequence. We show that the viral population resides close to a phase transition in sequence space corresponding to an "error catastrophe" beyond which there is lethal accumulation of mutations. Our model predicts that the phase transition may be induced by drug therapies that elevate the mutation rate, or by forcing mutations at particular amino acids. Applying immune pressure to any combination of killer T-cell targets cannot induce the transition, providing a rationale for why the viral protein can exist close to the error catastrophe without sustaining fatal fitness penalties due to adaptive immunity.
Duda, Anja; Stange, Annett; Lüftenegger, Daniel; Stanke, Nicole; Westphal, Dana; Pietschmann, Thomas; Eastman, Scott W; Linial, Maxine L; Rethwilm, Axel; Lindemann, Dirk
2004-12-01
Analogous to cellular glycoproteins, viral envelope proteins contain N-terminal signal sequences responsible for targeting them to the secretory pathway. The prototype foamy virus (PFV) envelope (Env) shows a highly unusual biosynthesis. Its precursor protein has a type III membrane topology with both the N and C terminus located in the cytoplasm. Coexpression of FV glycoprotein and interaction of its leader peptide (LP) with the viral capsid is essential for viral particle budding and egress. Processing of PFV Env into the particle-associated LP, surface (SU), and transmembrane (TM) subunits occur posttranslationally during transport to the cell surface by yet-unidentified cellular proteases. Here we provide strong evidence that furin itself or a furin-like protease and not the signal peptidase complex is responsible for both processing events. N-terminal protein sequencing of the SU and TM subunits of purified PFV Env-immunoglobulin G immunoadhesin identified furin consensus sequences upstream of both cleavage sites. Mutagenesis analysis of two overlapping furin consensus sequences at the PFV LP/SU cleavage site in the wild-type protein confirmed the sequencing data and demonstrated utilization of only the first site. Fully processed SU was almost completely absent in viral particles of mutants having conserved arginine residues replaced by alanines in the first furin consensus sequence, but normal processing was observed upon mutation of the second motif. Although these mutants displayed a significant loss in infectivity as a result of reduced particle release, no correlation to processing inhibition was observed, since another mutant having normal LP/SU processing had a similar defect.
Ab initio gene identification in metagenomic sequences
Zhu, Wenhan; Lomsadze, Alexandre; Borodovsky, Mark
2010-01-01
We describe an algorithm for gene identification in DNA sequences derived from shotgun sequencing of microbial communities. Accurate ab initio gene prediction in a short nucleotide sequence of anonymous origin is hampered by uncertainty in model parameters. While several machine learning approaches could be proposed to bypass this difficulty, one effective method is to estimate parameters from dependencies, formed in evolution, between frequencies of oligonucleotides in protein-coding regions and genome nucleotide composition. Original version of the method was proposed in 1999 and has been used since for (i) reconstructing codon frequency vector needed for gene finding in viral genomes and (ii) initializing parameters of self-training gene finding algorithms. With advent of new prokaryotic genomes en masse it became possible to enhance the original approach by using direct polynomial and logistic approximations of oligonucleotide frequencies, as well as by separating models for bacteria and archaea. These advances have increased the accuracy of model reconstruction and, subsequently, gene prediction. We describe the refined method and assess its accuracy on known prokaryotic genomes split into short sequences. Also, we show that as a result of application of the new method, several thousands of new genes could be added to existing annotations of several human and mouse gut metagenomes. PMID:20403810
Patterns and rates of viral evolution in HIV-1 subtype B infected females and males.
Dapp, Michael J; Kober, Kord M; Chen, Lennie; Westfall, Dylan H; Wong, Kim; Zhao, Hong; Hall, Breana M; Deng, Wenjie; Sibley, Thomas; Ghorai, Suvankar; Kim, Katie; Chen, Natalie; McHugh, Sarah; Au, Lily; Cohen, Mardge; Anastos, Kathryn; Mullins, James I
2017-01-01
Biological sex differences affect the course of HIV infection, with untreated women having lower viral loads compared to their male counterparts but, for a given viral load, women have a higher rate of progression to AIDS. However, the vast majority of data on viral evolution, a process that is clearly impacted by host immunity and could be impacted by sex differences, has been derived from men. We conducted an intensive analysis of HIV-1 gag and env-gp120 evolution taken over the first 6-11 years of infection from 8 Women's Interagency HIV Study (WIHS) participants who had not received combination antiretroviral therapy (ART). This was compared to similar data previously collected from men, with both groups infected with HIV-1 subtype B. Early virus populations in men and women were generally homogenous with no differences in diversity between sexes. No differences in ensuing nucleotide substitution rates were found between the female and male cohorts studied herein. As previously reported for men, time to peak diversity in env-gp120 in women was positively associated with time to CD4+ cell count below 200 (P = 0.017), and the number of predicted N-linked glycosylation sites generally increased over time, followed by a plateau or decline, with the majority of changes localized to the V1-V2 region. These findings strongly suggest that the sex differences in HIV-1 disease progression attributed to immune system composition and sensitivities are not revealed by, nor do they impact, global patterns of viral evolution, the latter of which proceeds similarly in women and men.
Rapid molecular evolution of human bocavirus revealed by Bayesian coalescent inference.
Zehender, Gianguglielmo; De Maddalena, Chiara; Canuti, Marta; Zappa, Alessandra; Amendola, Antonella; Lai, Alessia; Galli, Massimo; Tanzi, Elisabetta
2010-03-01
Human bocavirus (HBoV) is a linear single-stranded DNA virus belonging to the Parvoviridae family that has recently been isolated from the upper respiratory tract of children with acute respiratory infection. All of the strains observed so far segregate into two genotypes (1 and 2) with a low level of polymorphism. Given the recent description of the infection and the lack of epidemiological and molecular data, we estimated the virus's rates of molecular evolution and population dynamics. A dataset of forty-nine dated VP2 sequences, including also eight new isolates obtained from pharyngeal swabs of Italian patients with acute respiratory tract infections, was submitted to phylogenetic analysis. The model parameters, evolutionary rates and population dynamics were co-estimated using a Bayesian Markov Chain Monte Carlo approach, and site-specific positive and negative selection was also investigated. Recombination was investigated by seven different methods and one suspected recombinant strain was excluded from further analysis. The estimated mean evolutionary rate of HBoV was 8.6x10(-4)subs/site/year, and that of the 1st+2nd codon positions was more than 15 times less than that of the 3rd codon position. Viral population dynamics analysis revealed that the two known genotypes diverged recently (mean tMRCA: 24 years), and that the epidemic due to HBoV genotype 2 grew exponentially at a rate of 1.01year(-1). Selection analysis of the partial VP2 showed that 8.5% of sites were under significant negative pressure and the absence of positive selection. Our results show that, like other parvoviruses, HBoV is characterised by a rapid evolution. The low level of polymorphism is probably due to a relatively recent divergence between the circulating genotypes and strong purifying selection acting on viral antigens.
Hughes, Paul; Deng, Wenjie; Olson, Scott C; Coombs, Robert W; Chung, Michael H; Frenkel, Lisa M
2016-03-01
Accurate analysis of minor populations of drug-resistant HIV requires analysis of a sufficient number of viral templates. We assessed the effect of experimental conditions on the analysis of HIV pol 454 pyrosequences generated from plasma using (1) the "Insertion-deletion (indel) and Carry Forward Correction" (ICC) pipeline, which clusters sequence reads using a nonsubstitution approach and can correct for indels and carry forward errors, and (2) the "Primer Identification (ID)" method, which facilitates construction of a consensus sequence to correct for sequencing errors and allelic skewing. The Primer ID and ICC methods produced similar estimates of viral diversity, but differed in the number of sequence variants generated. Sequence preparation for ICC was comparably simple, but was limited by an inability to assess the number of templates analyzed and allelic skewing. The more costly Primer ID method corrected for allelic skewing and provided the number of viral templates analyzed, which revealed that amplifiable HIV templates varied across specimens and did not correlate with clinical viral load. This latter observation highlights the value of the Primer ID method, which by determining the number of templates amplified, enables more accurate assessment of minority species in the virus population, which may be relevant to prescribing effective antiretroviral therapy.
Estimate of within population incremental selection through branch imbalance in lineage trees
Liberman, Gilad; Benichou, Jennifer I.C.; Maman, Yaakov; Glanville, Jacob; Alter, Idan; Louzoun, Yoram
2016-01-01
Incremental selection within a population, defined as limited fitness changes following mutation, is an important aspect of many evolutionary processes. Strongly advantageous or deleterious mutations are detected using the synonymous to non-synonymous mutations ratio. However, there are currently no precise methods to estimate incremental selection. We here provide for the first time such a detailed method and show its precision in multiple cases of micro-evolution. The proposed method is a novel mixed lineage tree/sequence based method to detect within population selection as defined by the effect of mutations on the average number of offspring. Specifically, we propose to measure the log of the ratio between the number of leaves in lineage trees branches following synonymous and non-synonymous mutations. The method requires a high enough number of sequences, and a large enough number of independent mutations. It assumes that all mutations are independent events. It does not require of a baseline model and is practically not affected by sampling biases. We show the method's wide applicability by testing it on multiple cases of micro-evolution. We show that it can detect genes and inter-genic regions using the selection rate and detect selection pressures in viral proteins and in the immune response to pathogens. PMID:26586802
Pauly, Matthew D; Procario, Megan C; Lauring, Adam S
2017-01-01
Influenza virus’ low replicative fidelity contributes to its capacity for rapid evolution. Clonal sequencing and fluctuation tests have suggested that the influenza virus mutation rate is 2.7 × 10–6 - 3.0 × 10–5 substitutions per nucleotide per strand copied (s/n/r). However, sequencing assays are biased toward mutations with minimal fitness impacts and fluctuation tests typically investigate only a subset of all possible single nucleotide mutations. We developed a fluctuation test based on reversion to fluorescence in a set of virally encoded mutant green fluorescent proteins, which allowed us to measure the rates of selectively neutral mutations representative of the twelve different mutation types. We measured an overall mutation rate of 1.8 × 10–4 s/n/r for PR8 (H1N1) and 2.5 × 10–4 s/n/r for Hong Kong 2014 (H3N2) and a transitional bias of 2.7–3.6. Our data suggest that each replicated genome will have an average of 2–3 mutations and highlight the importance of mutational load in influenza virus evolution. DOI: http://dx.doi.org/10.7554/eLife.26437.001 PMID:28598328
Hepatitis A virus: host interactions, molecular epidemiology and evolution.
Vaughan, Gilberto; Goncalves Rossi, Livia Maria; Forbi, Joseph C; de Paula, Vanessa S; Purdy, Michael A; Xia, Guoliang; Khudyakov, Yury E
2014-01-01
Infection with hepatitis A virus (HAV) is the commonest viral cause of liver disease and presents an important public health problem worldwide. Several unique HAV properties and molecular mechanisms of its interaction with host were recently discovered and should aid in clarifying the pathogenesis of hepatitis A. Genetic characterization of HAV strains have resulted in the identification of different genotypes and subtypes, which exhibit a characteristic worldwide distribution. Shifts in HAV endemicity occurring in different parts of the world, introduction of genetically diverse strains from geographically distant regions, genotype displacement observed in some countries and population expansion detected in the last decades of the 20th century using phylogenetic analysis are important factors contributing to the complex dynamics of HAV infections worldwide. Strong selection pressures, some of which, like usage of deoptimized codons, are unique to HAV, limit genetic variability of the virus. Analysis of subgenomic regions has been proven useful for outbreak investigations. However, sharing short sequences among epidemiologically unrelated strains indicates that specific identification of HAV strains for molecular surveillance can be achieved only using whole-genome sequences. Here, we present up-to-date information on the HAV molecular epidemiology and evolution, and highlight the most relevant features of the HAV-host interactions. Published by Elsevier B.V.
1987-10-13
after multiple passages in vivo and in vitro. J. Gen. Virol. 67, 1741- 1744. Sabin , A.B. (1985). Oral poliovirus vaccine : history of its development...IN (N NEW APPROACHES TO ATTENUATED HEPATITIS A VACCINE DEVELOPMENT: Q) CLONING AND SEQUENCING OF CELL-CULTURE ADAPTED VIRAL cDNA I ANNUAL REPORT...6ll02Bsl0 A 055 11. TITLE (Include Security Classification) New Approaches to Attenuated Hepatitis A Vaccine Development: Cloning and Sequencing of Cell
Evolution of the Digital Society Reveals Balance between Viral and Mass Media Influence
NASA Astrophysics Data System (ADS)
Kleineberg, Kaj-Kolja; Boguñá, Marián
2014-07-01
Online social networks (OSNs) enable researchers to study the social universe at a previously unattainable scale. The worldwide impact and the necessity to sustain the rapid growth of OSNs emphasize the importance of unraveling the laws governing their evolution. Empirical results show that, unlike many real-world growing networked systems, OSNs follow an intricate path that includes a dynamical percolation transition. In light of these results, we present a quantitative two-parameter model that reproduces the entire topological evolution of a quasi-isolated OSN with unprecedented precision from the birth of the network. This allows us to precisely gauge the fundamental macroscopic and microscopic mechanisms involved. Our findings suggest that the coupling between the real preexisting underlying social structure, a viral spreading mechanism, and mass media influence govern the evolution of OSNs. The empirical validation of our model, on a macroscopic scale, reveals that virality is 4-5 times stronger than mass media influence and, on a microscopic scale, individuals have a higher subscription probability if invited by weaker social contacts, in agreement with the "strength of weak ties" paradigm.
Redmond, Catherine J.; Dooley, Katharine E.; Fu, Haiqing; Gillison, Maura L.; Akagi, Keiko; Symer, David E.; Aladjem, Mirit I.
2018-01-01
Integration of human papillomavirus (HPV) genomes into cellular chromatin is common in HPV-associated cancers. Integration is random, and each site is unique depending on how and where the virus integrates. We recently showed that tandemly integrated HPV16 could result in the formation of a super-enhancer-like element that drives transcription of the viral oncogenes. Here, we characterize the chromatin landscape and genomic architecture of this integration locus to elucidate the mechanisms that promoted de novo super-enhancer formation. Using next-generation sequencing and molecular combing/fiber-FISH, we show that ~26 copies of HPV16 are integrated into an intergenic region of chromosome 2p23.2, interspersed with 25 kb of amplified, flanking cellular DNA. This interspersed, co-amplified viral-host pattern is frequent in HPV-associated cancers and here we designate it as Type III integration. An abundant viral-cellular fusion transcript encoding the viral E6/E7 oncogenes is expressed from the integration locus and the chromatin encompassing both the viral enhancer and a region in the adjacent amplified cellular sequences is strongly enriched in the super-enhancer markers H3K27ac and Brd4. Notably, the peak in the amplified cellular sequence corresponds to an epithelial-cell-type specific enhancer. Thus, HPV16 integration generated a super-enhancer-like element composed of tandem interspersed copies of the viral upstream regulatory region and a cellular enhancer, to drive high levels of oncogene expression. PMID:29364907
The quantitative theory of within-host viral evolution
NASA Astrophysics Data System (ADS)
Rouzine, Igor M.; Weinberger, Leor S.
2013-01-01
During the 1990s, a group of virologists and physicists began development of a quantitative theory to explain the rapid evolution of human immunodeficiency virus type 1 (HIV-1). This theory also proved to be instrumental in understanding the rapid emergence of drug resistance in patients. Over the past two decades, this theory expanded to account for a broad array of factors important to viral evolution and propelled development of a generalized theory applicable to a broad range of asexual and partly sexual populations with many evolving sites. Here, we discuss the conceptual and theoretical tools developed to calculate the speed and other parameters of evolution, with a particular focus on the concept of ‘clonal interference’ and its applications to untreated patients.
Cotten, Matthew; Oude Munnink, Bas; Canuti, Marta; Deijs, Martin; Watson, Simon J; Kellam, Paul; van der Hoek, Lia
2014-01-01
We have developed a full genome virus detection process that combines sensitive nucleic acid preparation optimised for virus identification in fecal material with Illumina MiSeq sequencing and a novel post-sequencing virus identification algorithm. Enriched viral nucleic acid was converted to double-stranded DNA and subjected to Illumina MiSeq sequencing. The resulting short reads were processed with a novel iterative Python algorithm SLIM for the identification of sequences with homology to known viruses. De novo assembly was then used to generate full viral genomes. The sensitivity of this process was demonstrated with a set of fecal samples from HIV-1 infected patients. A quantitative assessment of the mammalian, plant, and bacterial virus content of this compartment was generated and the deep sequencing data were sufficient to assembly 12 complete viral genomes from 6 virus families. The method detected high levels of enteropathic viruses that are normally controlled in healthy adults, but may be involved in the pathogenesis of HIV-1 infection and will provide a powerful tool for virus detection and for analyzing changes in the fecal virome associated with HIV-1 progression and pathogenesis.
Cotten, Matthew; Oude Munnink, Bas; Canuti, Marta; Deijs, Martin; Watson, Simon J.; Kellam, Paul; van der Hoek, Lia
2014-01-01
We have developed a full genome virus detection process that combines sensitive nucleic acid preparation optimised for virus identification in fecal material with Illumina MiSeq sequencing and a novel post-sequencing virus identification algorithm. Enriched viral nucleic acid was converted to double-stranded DNA and subjected to Illumina MiSeq sequencing. The resulting short reads were processed with a novel iterative Python algorithm SLIM for the identification of sequences with homology to known viruses. De novo assembly was then used to generate full viral genomes. The sensitivity of this process was demonstrated with a set of fecal samples from HIV-1 infected patients. A quantitative assessment of the mammalian, plant, and bacterial virus content of this compartment was generated and the deep sequencing data were sufficient to assembly 12 complete viral genomes from 6 virus families. The method detected high levels of enteropathic viruses that are normally controlled in healthy adults, but may be involved in the pathogenesis of HIV-1 infection and will provide a powerful tool for virus detection and for analyzing changes in the fecal virome associated with HIV-1 progression and pathogenesis. PMID:24695106
Polyomavirus BK non-coding control region rearrangements in health and disease.
Sharma, Preety M; Gupta, Gaurav; Vats, Abhay; Shapiro, Ron; Randhawa, Parmjeet S
2007-08-01
BK virus is an increasingly recognized pathogen in transplanted patients. DNA sequencing of this virus shows considerable genomic variability. To understand the clinical significance of rearrangements in the non-coding control region (NCCR) of BK virus (BKV), we report a meta-analysis of 507 sequences, including 40 sequences generated in our own laboratory, for associations between rearrangements and disease, tissue tropism, geographic origin, and viral genotype. NCCR rearrangements were less frequent in (a) asymptomatic BKV viruria compared to patients viral nephropathy (1.7% vs. 22.5%), and (b) viral genotype 1 compared to other genotypes (2.4% vs. 11.2%). Rearrangements were commoner in malignancy (78.6%), and Norwegians (45.7%), and less common in East Indians (0%), and Japanese (4.3%). A surprising number of rearranged sequences were reported from mononuclear cells of healthy subjects, whereas most plasma sequences were archetypal. This difference could not be related to potential recombinase activity in lymphocytes, as consensus recombination signal sequences could not be found in the NCCR region. NCCR rearrangements are neither required nor a sufficient condition to produce clinical disease. BKV nephropathy and hemorrhagic cystitis are not associated with any unique NCCR configuration or nucleotide sequence.
Evolution of the viral hemorrhagic septicemia virus: divergence, selection and origin.
He, Mei; Yan, Xue-Chun; Liang, Yang; Sun, Xiao-Wen; Teng, Chun-Bo
2014-08-01
Viral hemorrhagic septicemia virus (VHSV) is an economically significant rhabdovirus that affects an increasing number of freshwater and marine fish species. Extensive studies have been conducted on the molecular epizootiology, genetic diversity, and phylogeny of VHSV. However, there are discrepancies between the reported estimates of the nucleotide substitution rate for the G gene and the divergence times for the genotypes. Herein, Bayesian coalescent analyses were conducted to the time-stamped entire coding sequences of the six VHSV genes. Rate estimates based on the G gene indicated that the marine genotypes/subtypes might not all evolve slower than their major European freshwater counterpart. Age calculations on the six genes revealed that the first bifurcation event of the analyzed isolates might have taken place within the last 300 years, which was much younger than previously thought. Selection analyses suggested that two codons of the G gene might be positively selected. Surveys of codon usage bias showed that the P, M and NV genes exhibited genotype-specific variations. Furthermore, we proposed that VHSV originated from the Pacific Northwest of North America. Copyright © 2014 Elsevier Inc. All rights reserved.
Viruses and miRNAs: More Friends than Foes.
Bruscella, Patrice; Bottini, Silvia; Baudesson, Camille; Pawlotsky, Jean-Michel; Feray, Cyrille; Trabucchi, Michele
2017-01-01
There is evidence that eukaryotic miRNAs (hereafter called host miRNAs) play a role in the replication and propagation of viruses. Expression or targeting of host miRNAs can be involved in cellular antiviral responses. Most times host miRNAs play a role in viral life-cycles and promote infection through complex regulatory pathways. miRNAs can also be encoded by a viral genome and be expressed in the host cell. Viral miRNAs can share common sequences with host miRNAs or have totally different sequences. They can regulate a variety of biological processes involved in viral infection, including apoptosis, evasion of the immune response, or modulation of viral life-cycle phases. Overall, virus/miRNA pathway interaction is defined by a plethora of complex mechanisms, though not yet fully understood. This article review summarizes recent advances and novel biological concepts related to the understanding of miRNA expression, control and function during viral infections. The article also discusses potential therapeutic applications of this particular host-pathogen interaction.
Viruses and miRNAs: More Friends than Foes
Bruscella, Patrice; Bottini, Silvia; Baudesson, Camille; Pawlotsky, Jean-Michel; Feray, Cyrille; Trabucchi, Michele
2017-01-01
There is evidence that eukaryotic miRNAs (hereafter called host miRNAs) play a role in the replication and propagation of viruses. Expression or targeting of host miRNAs can be involved in cellular antiviral responses. Most times host miRNAs play a role in viral life-cycles and promote infection through complex regulatory pathways. miRNAs can also be encoded by a viral genome and be expressed in the host cell. Viral miRNAs can share common sequences with host miRNAs or have totally different sequences. They can regulate a variety of biological processes involved in viral infection, including apoptosis, evasion of the immune response, or modulation of viral life-cycle phases. Overall, virus/miRNA pathway interaction is defined by a plethora of complex mechanisms, though not yet fully understood. This article review summarizes recent advances and novel biological concepts related to the understanding of miRNA expression, control and function during viral infections. The article also discusses potential therapeutic applications of this particular host–pathogen interaction. PMID:28555130
Kamboj, Atul; Hallwirth, Claus V; Alexander, Ian E; McCowage, Geoffrey B; Kramer, Belinda
2017-06-17
The analysis of viral vector genomic integration sites is an important component in assessing the safety and efficiency of patient treatment using gene therapy. Alongside this clinical application, integration site identification is a key step in the genetic mapping of viral elements in mutagenesis screens that aim to elucidate gene function. We have developed a UNIX-based vector integration site analysis pipeline (Ub-ISAP) that utilises a UNIX-based workflow for automated integration site identification and annotation of both single and paired-end sequencing reads. Reads that contain viral sequences of interest are selected and aligned to the host genome, and unique integration sites are then classified as transcription start site-proximal, intragenic or intergenic. Ub-ISAP provides a reliable and efficient pipeline to generate large datasets for assessing the safety and efficiency of integrating vectors in clinical settings, with broader applications in cancer research. Ub-ISAP is available as an open source software package at https://sourceforge.net/projects/ub-isap/ .
Molecular Evolution and Phylogeography of Co-circulating IHNV and VHSV in Italy
Abbadi, Miriam; Fusaro, Alice; Ceolin, Chiara; Casarotto, Claudia; Quartesan, Rosita; Dalla Pozza, Manuela; Cattoli, Giovanni; Toffan, Anna; Holmes, Edward C.; Panzarin, Valentina
2016-01-01
Infectious haematopoietic necrosis virus (IHNV) and viral haemorrhagic septicaemia virus (VHSV) are the most important viral pathogens impacting rainbow trout farming. These viruses are persistent in Italy, where they are responsible for severe disease outbreaks (epizootics) that affect the profitability of the trout industry. Despite the importance of IHNV and VHSV, little is known about their evolution at a local scale, although this is likely to be important for virus eradication and control. To address this issue we performed a detailed molecular evolutionary and epidemiological analysis of IHNV and VHSV in trout farms from northern Italy. Full-length glycoprotein gene sequences of a selection of VHSV (n = 108) and IHNV (n = 89) strains were obtained. This revealed that Italian VHSV strains belong to sublineages Ia1 and Ia2 of genotype Ia and are distributed into 7 genetic clusters. In contrast, all Italian IHNV isolates fell within genogroup E, for which only a single genetic cluster was identified. More striking was that IHNV has evolved more rapidly than VHSV (mean rates of 11 and 7.3 × 10−4 nucleotide substitutions per site, per year, respectively), indicating that these viruses exhibit fundamentally different evolutionary dynamics. The time to the most recent common ancestor of both IHNV and VHSV was consistent with the first reports of these pathogens in Italy. By combining sequence data with epidemiological information it was possible to identify different patterns of virus spread among trout farms, in which adjacent facilities can be infected by either genetically similar or different viruses, and farms located in different water catchments can be infected by identical strains. Overall, these findings highlight the importance of combining molecular and epidemiological information to identify the determinants of IHN and VHS spread, and to provide data that is central to future surveillance strategies and possibly control. PMID:27602026
Borozan, Ivan; Watt, Stuart; Ferretti, Vincent
2015-05-01
Alignment-based sequence similarity searches, while accurate for some type of sequences, can produce incorrect results when used on more divergent but functionally related sequences that have undergone the sequence rearrangements observed in many bacterial and viral genomes. Here, we propose a classification model that exploits the complementary nature of alignment-based and alignment-free similarity measures with the aim to improve the accuracy with which DNA and protein sequences are characterized. Our model classifies sequences using a combined sequence similarity score calculated by adaptively weighting the contribution of different sequence similarity measures. Weights are determined independently for each sequence in the test set and reflect the discriminatory ability of individual similarity measures in the training set. Because the similarity between some sequences is determined more accurately with one type of measure rather than another, our classifier allows different sets of weights to be associated with different sequences. Using five different similarity measures, we show that our model significantly improves the classification accuracy over the current composition- and alignment-based models, when predicting the taxonomic lineage for both short viral sequence fragments and complete viral sequences. We also show that our model can be used effectively for the classification of reads from a real metagenome dataset as well as protein sequences. All the datasets and the code used in this study are freely available at https://collaborators.oicr.on.ca/vferretti/borozan_csss/csss.html. ivan.borozan@gmail.com Supplementary data are available at Bioinformatics online. © The Author 2015. Published by Oxford University Press.
Borozan, Ivan; Watt, Stuart; Ferretti, Vincent
2015-01-01
Motivation: Alignment-based sequence similarity searches, while accurate for some type of sequences, can produce incorrect results when used on more divergent but functionally related sequences that have undergone the sequence rearrangements observed in many bacterial and viral genomes. Here, we propose a classification model that exploits the complementary nature of alignment-based and alignment-free similarity measures with the aim to improve the accuracy with which DNA and protein sequences are characterized. Results: Our model classifies sequences using a combined sequence similarity score calculated by adaptively weighting the contribution of different sequence similarity measures. Weights are determined independently for each sequence in the test set and reflect the discriminatory ability of individual similarity measures in the training set. Because the similarity between some sequences is determined more accurately with one type of measure rather than another, our classifier allows different sets of weights to be associated with different sequences. Using five different similarity measures, we show that our model significantly improves the classification accuracy over the current composition- and alignment-based models, when predicting the taxonomic lineage for both short viral sequence fragments and complete viral sequences. We also show that our model can be used effectively for the classification of reads from a real metagenome dataset as well as protein sequences. Availability and implementation: All the datasets and the code used in this study are freely available at https://collaborators.oicr.on.ca/vferretti/borozan_csss/csss.html. Contact: ivan.borozan@gmail.com Supplementary information: Supplementary data are available at Bioinformatics online. PMID:25573913
O'Flaherty, Brigid M; Li, Yan; Tao, Ying; Paden, Clinton R; Queen, Krista; Zhang, Jing; Dinwiddie, Darrell L; Gross, Stephen M; Schroth, Gary P; Tong, Suxiang
2018-06-01
Next generation sequencing (NGS) technologies have revolutionized the genomics field and are becoming more commonplace for identification of human infectious diseases. However, due to the low abundance of viral nucleic acids (NAs) in relation to host, viral identification using direct NGS technologies often lacks sufficient sensitivity. Here, we describe an approach based on two complementary enrichment strategies that significantly improves the sensitivity of NGS-based virus identification. To start, we developed two sets of DNA probes to enrich virus NAs associated with respiratory diseases. The first set of probes spans the genomes, allowing for identification of known viruses and full genome sequencing, while the second set targets regions conserved among viral families or genera, providing the ability to detect both known and potentially novel members of those virus groups. Efficiency of enrichment was assessed by NGS testing reference virus and clinical samples with known infection. We show significant improvement in viral identification using enriched NGS compared to unenriched NGS. Without enrichment, we observed an average of 0.3% targeted viral reads per sample. However, after enrichment, 50%-99% of the reads per sample were the targeted viral reads for both the reference isolates and clinical specimens using both probe sets. Importantly, dramatic improvements on genome coverage were also observed following virus-specific probe enrichment. The methods described here provide improved sensitivity for virus identification by NGS, allowing for a more comprehensive analysis of disease etiology. © 2018 O'Flaherty et al.; Published by Cold Spring Harbor Laboratory Press.
Deep Sequencing to Identify the Causes of Viral Encephalitis
Chan, Benjamin K.; Wilson, Theodore; Fischer, Kael F.; Kriesel, John D.
2014-01-01
Deep sequencing allows for a rapid, accurate characterization of microbial DNA and RNA sequences in many types of samples. Deep sequencing (also called next generation sequencing or NGS) is being developed to assist with the diagnosis of a wide variety of infectious diseases. In this study, seven frozen brain samples from deceased subjects with recent encephalitis were investigated. RNA from each sample was extracted, randomly reverse transcribed and sequenced. The sequence analysis was performed in a blinded fashion and confirmed with pathogen-specific PCR. This analysis successfully identified measles virus sequences in two brain samples and herpes simplex virus type-1 sequences in three brain samples. No pathogen was identified in the other two brain specimens. These results were concordant with pathogen-specific PCR and partially concordant with prior neuropathological examinations, demonstrating that deep sequencing can accurately identify viral infections in frozen brain tissue. PMID:24699691
Leda, Ana Rachel; Hunter, James; Oliveira, Ursula Castro; Azevedo, Inacio Junqueira; Sucupira, Maria Cecilia Araripe; Diaz, Ricardo Sobhie
2018-04-19
The presence of minority transmitted drug resistance mutations was assessed using ultra-deep sequencing and correlated with disease progression among recently HIV-1-infected individuals from Brazil. Samples at baseline during recent infection and 1 year after the establishment of the infection were analysed. Viral RNA and proviral DNA from 25 individuals were subjected to ultra-deep sequencing of the reverse transcriptase and protease regions of HIV-1. Viral strains carrying transmitted drug resistance mutations were detected in 9 out of the 25 patients, for all major antiretroviral classes, ranging from one to five mutations per patient. Ultra-deep sequencing detected strains with frequencies as low as 1.6% and only strains with frequencies >20% were detected by population plasma sequencing (three patients). Transmitted drug resistance strains with frequencies <14.8% did not persist upon established infection. The presence of transmitted drug resistance mutations was negatively correlated with the viral load and with CD4+ T cell count decay. Transmitted drug resistance mutations representing small percentages of the viral population do not persist during infection because they are negatively selected in the first year after HIV-1 seroconversion.
Application of viromics: a new approach to the understanding of viral infections in humans.
Ramamurthy, Mageshbabu; Sankar, Sathish; Kannangai, Rajesh; Nandagopal, Balaji; Sridharan, Gopalan
2017-12-01
This review is focused at exploring the strengths of modern technology driven data compiled in the areas of virus gene sequencing, virus protein structures and their implication to viral diagnosis and therapy. The information for virome analysis (viromics) is generated by the study of viral genomes (entire nucleotide sequence) and viral genes (coding for protein). Presently, the study of viral infectious diseases in terms of etiopathogenesis and development of newer therapeutics is undergoing rapid changes. Currently, viromics relies on deep sequencing, next generation sequencing (NGS) data and public domain databases like GenBank and unique virus specific databases. Two commonly used NGS platforms: Illumina and Ion Torrent, recommend maximum fragment lengths of about 300 and 400 nucleotides for analysis respectively. Direct detection of viruses in clinical samples is now evolving using these methods. Presently, there are a considerable number of good treatment options for HBV/HIV/HCV. These viruses however show development of drug resistance. The drug susceptibility regions of the genomes are sequenced and the prediction of drug resistance is now possible from 3 public domains available on the web. This has been made possible through advances in the technology with the advent of high throughput sequencing and meta-analysis through sophisticated and easy to use software and the use of high speed computers for bioinformatics. More recently NGS technology has been improved with single-molecule real-time sequencing. Here complete long reads can be obtained with less error overcoming a limitation of the NGS which is inherently prone to software anomalies that arise in the hands of personnel without adequate training. The development in understanding the viruses in terms of their genome, pathobiology, transcriptomics and molecular epidemiology constitutes viromics. It could be stated that these developments will bring about radical changes and advancement especially in the field of antiviral therapy and diagnostic virology.
An RNAi in silico approach to find an optimal shRNA cocktail against HIV-1
2010-01-01
Background HIV-1 can be inhibited by RNA interference in vitro through the expression of short hairpin RNAs (shRNAs) that target conserved genome sequences. In silico shRNA design for HIV has lacked a detailed study of virus variability constituting a possible breaking point in a clinical setting. We designed shRNAs against HIV-1 considering the variability observed in naïve and drug-resistant isolates available at public databases. Methods A Bioperl-based algorithm was developed to automatically scan multiple sequence alignments of HIV, while evaluating the possibility of identifying dominant and subdominant viral variants that could be used as efficient silencing molecules. Student t-test and Bonferroni Dunn correction test were used to assess statistical significance of our findings. Results Our in silico approach identified the most common viral variants within highly conserved genome regions, with a calculated free energy of ≥ -6.6 kcal/mol. This is crucial for strand loading to RISC complex and for a predicted silencing efficiency score, which could be used in combination for achieving over 90% silencing. Resistant and naïve isolate variability revealed that the most frequent shRNA per region targets a maximum of 85% of viral sequences. Adding more divergent sequences maintained this percentage. Specific sequence features that have been found to be related with higher silencing efficiency were hardly accomplished in conserved regions, even when lower entropy values correlated with better scores. We identified a conserved region among most HIV-1 genomes, which meets as many sequence features for efficient silencing. Conclusions HIV-1 variability is an obstacle to achieving absolute silencing using shRNAs designed against a consensus sequence, mainly because there are many functional viral variants. Our shRNA cocktail could be truly effective at silencing dominant and subdominant naïve viral variants. Additionally, resistant isolates might be targeted under specific antiretroviral selective pressure, but in both cases these should be tested exhaustively prior to clinical use. PMID:21172023
Fadrosh, Douglas W.; Andrews-Pfannkoch, Cynthia; Williamson, Shannon J.
2011-01-01
Viruses, particularly bacteriophages (phages), are the most numerous biological entities on Earth1,2. Viruses modulate host cell abundance and diversity, contribute to the cycling of nutrients, alter host cell phenotype, and influence the evolution of both host cell and viral communities through the lateral transfer of genes 3. Numerous studies have highlighted the staggering genetic diversity of viruses and their functional potential in a variety of natural environments. Metagenomic techniques have been used to study the taxonomic diversity and functional potential of complex viral assemblages whose members contain single-stranded DNA (ssDNA), double-stranded DNA (dsDNA) and RNA genotypes 4-9. Current library construction protocols used to study environmental DNA-containing or RNA-containing viruses require an initial nuclease treatment in order to remove nontargeted templates 10. However, a comprehensive understanding of the collective gene complement of the virus community and virus diversity requires knowledge of all members regardless of genome composition. Fractionation of purified nucleic acid subtypes provides an effective mechanism by which to study viral assemblages without sacrificing a subset of the community’s genetic signature. Hydroxyapatite, a crystalline form of calcium phosphate, has been employed in the separation of nucleic acids, as well as proteins and microbes, since the 1960s11. By exploiting the charge interaction between the positively-charged Ca2+ ions of the hydroxyapatite and the negatively charged phosphate backbone of the nucleic acid subtypes, it is possible to preferentially elute each nucleic acid subtype independent of the others. We recently employed this strategy to independently fractionate the genomes of ssDNA, dsDNA and RNA-containing viruses in preparation of DNA sequencing 12. Here, we present a method for the fractionation and recovery of ssDNA, dsDNA and RNA viral nucleic acids from mixed viral assemblages using hydroxyapatite chromotography. PMID:21989424
Nikolaitchik, Olga A.; Burdick, Ryan C.; Gorelick, Robert J.; Keele, Brandon F.; Hu, Wei-Shau; Pathak, Vinay K.
2016-01-01
Although the predominant effect of host restriction APOBEC3 proteins on HIV-1 infection is to block viral replication, they might inadvertently increase retroviral genetic variation by inducing G-to-A hypermutation. Numerous studies have disagreed on the contribution of hypermutation to viral genetic diversity and evolution. Confounding factors contributing to the debate include the extent of lethal (stop codon) and sublethal hypermutation induced by different APOBEC3 proteins, the inability to distinguish between G-to-A mutations induced by APOBEC3 proteins and error-prone viral replication, the potential impact of hypermutation on the frequency of retroviral recombination, and the extent to which viral recombination occurs in vivo, which can reassort mutations in hypermutated genomes. Here, we determined the effects of hypermutation on the HIV-1 recombination rate and its contribution to genetic variation through recombination to generate progeny genomes containing portions of hypermutated genomes without lethal mutations. We found that hypermutation did not significantly affect the rate of recombination, and recombination between hypermutated and wild-type genomes only increased the viral mutation rate by 3.9 × 10−5 mutations/bp/replication cycle in heterozygous virions, which is similar to the HIV-1 mutation rate. Since copackaging of hypermutated and wild-type genomes occurs very rarely in vivo, recombination between hypermutated and wild-type genomes does not significantly contribute to the genetic variation of replicating HIV-1. We also analyzed previously reported hypermutated sequences from infected patients and determined that the frequency of sublethal mutagenesis for A3G and A3F is negligible (4 × 10−21 and1 × 10−11, respectively) and its contribution to viral mutations is far below mutations generated during error-prone reverse transcription. Taken together, we conclude that the contribution of APOBEC3-induced hypermutation to HIV-1 genetic variation is substantially lower than that from mutations during error-prone replication. PMID:27186986
Delviks-Frankenberry, Krista A; Nikolaitchik, Olga A; Burdick, Ryan C; Gorelick, Robert J; Keele, Brandon F; Hu, Wei-Shau; Pathak, Vinay K
2016-05-01
Although the predominant effect of host restriction APOBEC3 proteins on HIV-1 infection is to block viral replication, they might inadvertently increase retroviral genetic variation by inducing G-to-A hypermutation. Numerous studies have disagreed on the contribution of hypermutation to viral genetic diversity and evolution. Confounding factors contributing to the debate include the extent of lethal (stop codon) and sublethal hypermutation induced by different APOBEC3 proteins, the inability to distinguish between G-to-A mutations induced by APOBEC3 proteins and error-prone viral replication, the potential impact of hypermutation on the frequency of retroviral recombination, and the extent to which viral recombination occurs in vivo, which can reassort mutations in hypermutated genomes. Here, we determined the effects of hypermutation on the HIV-1 recombination rate and its contribution to genetic variation through recombination to generate progeny genomes containing portions of hypermutated genomes without lethal mutations. We found that hypermutation did not significantly affect the rate of recombination, and recombination between hypermutated and wild-type genomes only increased the viral mutation rate by 3.9 × 10-5 mutations/bp/replication cycle in heterozygous virions, which is similar to the HIV-1 mutation rate. Since copackaging of hypermutated and wild-type genomes occurs very rarely in vivo, recombination between hypermutated and wild-type genomes does not significantly contribute to the genetic variation of replicating HIV-1. We also analyzed previously reported hypermutated sequences from infected patients and determined that the frequency of sublethal mutagenesis for A3G and A3F is negligible (4 × 10-21 and1 × 10-11, respectively) and its contribution to viral mutations is far below mutations generated during error-prone reverse transcription. Taken together, we conclude that the contribution of APOBEC3-induced hypermutation to HIV-1 genetic variation is substantially lower than that from mutations during error-prone replication.
Lim, Chun Shen; Brown, Chris M
2017-01-01
Structured RNA elements may control virus replication, transcription and translation, and their distinct features are being exploited by novel antiviral strategies. Viral RNA elements continue to be discovered using combinations of experimental and computational analyses. However, the wealth of sequence data, notably from deep viral RNA sequencing, viromes, and metagenomes, necessitates computational approaches being used as an essential discovery tool. In this review, we describe practical approaches being used to discover functional RNA elements in viral genomes. In addition to success stories in new and emerging viruses, these approaches have revealed some surprising new features of well-studied viruses e.g., human immunodeficiency virus, hepatitis C virus, influenza, and dengue viruses. Some notable discoveries were facilitated by new comparative analyses of diverse viral genome alignments. Importantly, comparative approaches for finding RNA elements embedded in coding and non-coding regions differ. With the exponential growth of computer power we have progressed from stem-loop prediction on single sequences to cutting edge 3D prediction, and from command line to user friendly web interfaces. Despite these advances, many powerful, user friendly prediction tools and resources are underutilized by the virology community.
Lim, Chun Shen; Brown, Chris M.
2018-01-01
Structured RNA elements may control virus replication, transcription and translation, and their distinct features are being exploited by novel antiviral strategies. Viral RNA elements continue to be discovered using combinations of experimental and computational analyses. However, the wealth of sequence data, notably from deep viral RNA sequencing, viromes, and metagenomes, necessitates computational approaches being used as an essential discovery tool. In this review, we describe practical approaches being used to discover functional RNA elements in viral genomes. In addition to success stories in new and emerging viruses, these approaches have revealed some surprising new features of well-studied viruses e.g., human immunodeficiency virus, hepatitis C virus, influenza, and dengue viruses. Some notable discoveries were facilitated by new comparative analyses of diverse viral genome alignments. Importantly, comparative approaches for finding RNA elements embedded in coding and non-coding regions differ. With the exponential growth of computer power we have progressed from stem-loop prediction on single sequences to cutting edge 3D prediction, and from command line to user friendly web interfaces. Despite these advances, many powerful, user friendly prediction tools and resources are underutilized by the virology community. PMID:29354101
Marsic, Damien; Govindasamy, Lakshmanan; Currlin, Seth; Markusic, David M; Tseng, Yu-Shan; Herzog, Roland W; Agbandje-McKenna, Mavis; Zolotukhin, Sergei
2014-01-01
Methodologies to improve existing adeno-associated virus (AAV) vectors for gene therapy include either rational approaches or directed evolution to derive capsid variants characterized by superior transduction efficiencies in targeted tissues. Here, we integrated both approaches in one unified design strategy of “virtual family shuffling” to derive a combinatorial capsid library whereby only variable regions on the surface of the capsid are modified. Individual sublibraries were first assembled in order to preselect compatible amino acid residues within restricted surface-exposed regions to minimize the generation of dead-end variants. Subsequently, the successful families were interbred to derive a combined library of ~8 × 105 complexity. Next-generation sequencing of the packaged viral DNA revealed capsid surface areas susceptible to directed evolution, thus providing guidance for future designs. We demonstrated the utility of the library by deriving an AAV2-based vector characterized by a 20-fold higher transduction efficiency in murine liver, now equivalent to that of AAV8. PMID:25048217
Random Amplification and Pyrosequencing for Identification of Novel Viral Genome Sequences
Hang, Jun; Forshey, Brett M.; Kochel, Tadeusz J.; Li, Tao; Solórzano, Víctor Fiestas; Halsey, Eric S.; Kuschner, Robert A.
2012-01-01
ssRNA viruses have high levels of genomic divergence, which can lead to difficulty in genomic characterization of new viruses using traditional PCR amplification and sequencing methods. In this study, random reverse transcription, anchored random PCR amplification, and high-throughput pyrosequencing were used to identify orthobunyavirus sequences from total RNA extracted from viral cultures of acute febrile illness specimens. Draft genome sequence for the orthobunyavirus L segment was assembled and sequentially extended using de novo assembly contigs from pyrosequencing reads and orthobunyavirus sequences in GenBank as guidance. Accuracy and continuous coverage were achieved by mapping all reads to the L segment draft sequence. Subsequently, RT-PCR and Sanger sequencing were used to complete the genome sequence. The complete L segment was found to be 6936 bases in length, encoding a 2248-aa putative RNA polymerase. The identified L segment was distinct from previously published South American orthobunyaviruses, sharing 63% and 54% identity at the nucleotide and amino acid level, respectively, with the complete Oropouche virus L segment and 73% and 81% identity at the nucleotide and amino acid level, respectively, with a partial Caraparu virus L segment. The result demonstrated the effectiveness of a sequence-independent amplification and next-generation sequencing approach for obtaining complete viral genomes from total nucleic acid extracts and its use in pathogen discovery. PMID:22468136
Glycosylation at Asn91 of H1N1 haemagglutinin affects binding to glycan receptors
Jayaraman, Akila; Koh, Xiaoying; Li, Jing; Raman, Rahul; Viswanathan, Karthik; Shriver, Zachary; Sasisekharan, Ram
2012-01-01
The glycoprotein HA (haemagglutinin) on the surface of influenza A virus plays a central role in recognition and binding to specific host cell-surface glycan receptors and in fusion of viral membrane to the host nuclear membrane during viral replication. Given the abundance of HA on the viral surface, this protein is also the primary target for host innate and adaptive immune responses. Although addition of glycosylation sites on HA are a part of viral evolution to evade the host immune responses, there are specific glycosylation sites that are conserved during most of the evolution of the virus. In the present study, it was demonstrated that one such conserved glycosylation site at Asn91 in H1N1 HA critically governs the glycan receptor-binding specificity and hence would potentially impinge on the host adaptation of the virus. PMID:22642577
Glycosylation at Asn91 of H1N1 haemagglutinin affects binding to glycan receptors.
Jayaraman, Akila; Koh, Xiaoying; Li, Jing; Raman, Rahul; Viswanathan, Karthik; Shriver, Zachary; Sasisekharan, Ram
2012-06-15
The glycoprotein HA (haemagglutinin) on the surface of influenza A virus plays a central role in recognition and binding to specific host cell-surface glycan receptors and in fusion of viral membrane to the host nuclear membrane during viral replication. Given the abundance of HA on the viral surface, this protein is also the primary target for host innate and adaptive immune responses. Although addition of glycosylation sites on HA are a part of viral evolution to evade the host immune responses, there are specific glycosylation sites that are conserved during most of the evolution of the virus. In the present study, it was demonstrated that one such conserved glycosylation site at Asn(91) in H1N1 HA critically governs the glycan receptor-binding specificity and hence would potentially impinge on the host adaptation of the virus.
The Role of Graphlets in Viral Processes on Networks
NASA Astrophysics Data System (ADS)
Khorshidi, Samira; Al Hasan, Mohammad; Mohler, George; Short, Martin B.
2018-05-01
Predicting the evolution of viral processes on networks is an important problem with applications arising in biology, the social sciences, and the study of the Internet. In existing works, mean-field analysis based upon degree distribution is used for the prediction of viral spreading across networks of different types. However, it has been shown that degree distribution alone fails to predict the behavior of viruses on some real-world networks and recent attempts have been made to use assortativity to address this shortcoming. In this paper, we show that adding assortativity does not fully explain the variance in the spread of viruses for a number of real-world networks. We propose using the graphlet frequency distribution in combination with assortativity to explain variations in the evolution of viral processes across networks with identical degree distribution. Using a data-driven approach by coupling predictive modeling with viral process simulation on real-world networks, we show that simple regression models based on graphlet frequency distribution can explain over 95% of the variance in virality on networks with the same degree distribution but different network topologies. Our results not only highlight the importance of graphlets but also identify a small collection of graphlets which may have the highest influence over the viral processes on a network.
NASA Astrophysics Data System (ADS)
Champeimont, Raphaël; Laine, Elodie; Hu, Shuang-Wei; Penin, Francois; Carbone, Alessandra
2016-05-01
A novel computational approach of coevolution analysis allowed us to reconstruct the protein-protein interaction network of the Hepatitis C Virus (HCV) at the residue resolution. For the first time, coevolution analysis of an entire viral genome was realized, based on a limited set of protein sequences with high sequence identity within genotypes. The identified coevolving residues constitute highly relevant predictions of protein-protein interactions for further experimental identification of HCV protein complexes. The method can be used to analyse other viral genomes and to predict the associated protein interaction networks.
de Borba, Luana; Villordo, Sergio M; Iglesias, Nestor G; Filomatori, Claudia V; Gebhard, Leopoldo G; Gamarnik, Andrea V
2015-03-01
The dengue virus genome is a dynamic molecule that adopts different conformations in the infected cell. Here, using RNA folding predictions, chemical probing analysis, RNA binding assays, and functional studies, we identified new cis-acting elements present in the capsid coding sequence that facilitate cyclization of the viral RNA by hybridization with a sequence involved in a local dumbbell structure at the viral 3' untranslated region (UTR). The identified interaction differentially enhances viral replication in mosquito and mammalian cells. Copyright © 2015, American Society for Microbiology. All Rights Reserved.
Sites of Retroviral DNA Integration: From Basic Research to Clinical Applications
Serrao, Erik; Engelman, Alan N.
2016-01-01
One of the most crucial steps in the life cycle of a retrovirus is the integration of the viral DNA (vDNA) copy of the RNA genome into the genome of an infected host cell. Integration provides for efficient viral gene expression as well as for the segregation of the viral genomes to daughter cells upon cell division. Some integrated viruses are not well expressed, and cells latently infected with HIV-1 can resist the action of potent antiretroviral drugs and remain dormant for decades. Intensive research has been dedicated to understanding the catalytic mechanism of integration, as well as the viral and cellular determinants that influence integration site distribution throughout the host genome. In this review we summarize the evolution of techniques that have been used to recover and map retroviral integration sites, from the early days that first indicated that integration could occur in multiple cellular DNA locations, to current technologies that map upwards of millions of unique integration sites from single in vitro integration reactions or cell culture infections. We further review important insights gained from the use of such mapping techniques, including the monitoring of cell clonal expansion in patients treated with retrovirus-based gene therapy vectors, or AIDS patients on suppressive antiretroviral therapy (ART). These insights span from integrase (IN) enzyme sequence preferences within target DNA (tDNA) at the sites of integration, to the roles of host cellular proteins in mediating global integration distribution, to the potential relationship between genomic location of vDNA integration site and retroviral latency. PMID:26508664
Jachiet, Pierre-Alain; Colson, Philippe; Lopez, Philippe; Bapteste, Eric
2014-01-01
Complex nongradual evolutionary processes such as gene remodeling are difficult to model, to visualize, and to investigate systematically. Despite these challenges, the creation of composite (or mosaic) genes by combination of genetic segments from unrelated gene families was established as an important adaptive phenomena in eukaryotic genomes. In contrast, almost no general studies have been conducted to quantify composite genes in viruses. Although viral genome mosaicism has been well-described, the extent of gene mosaicism and its rules of emergence remain largely unexplored. Applying methods from graph theory to inclusive similarity networks, and using data from more than 3,000 complete viral genomes, we provide the first demonstration that composite genes in viruses are 1) functionally biased, 2) involved in key aspects of the arm race between cells and viruses, and 3) can be classified into two distinct types of composite genes in all viral classes. Beyond the quantification of the widespread recombination of genes among different viruses of the same class, we also report a striking sharing of genetic information between viruses of different classes and with different nucleic acid types. This latter discovery provides novel evidence for the existence of a large and complex mobilome network, which appears partly bound by the sharing of genetic information and by the formation of composite genes between mobile entities with different genetic material. Considering that there are around 10E31 viruses on the planet, gene remodeling appears as a hugely significant way of generating and moving novel sequences between different kinds of organisms on Earth. PMID:25104113
Chopera, Denis R.; Ntale, Roman; Ndabambi, Nonkululeko; Garrett, Nigel; Gray, Clive M.; Matten, David; Karim, Quarraisha Abdool; Karim, Salim Abdool; Williamson, Carolyn
2016-01-01
Objective HIV-1 escape from cytotoxic T-lymphocytes (CTL) results in the accumulation of HLA-associated mutations in the viral genome. To understand the contribution of early escape to disease progression, this study investigated the evolution and pathogenic implications of CTL escape in a cohort followed from infection for five years. Methods Viral loads and CD4+ counts were monitored in 78 subtype C infected individuals from onset of infection until CD4+ decline to <350 cells/μl or five years post-infection. The gag gene was sequenced and HLA-associated changes between enrolment and 12 months post-infection were mapped. Results HLA-associated escape mutations were identified in 48 (62%) of the participants and were associated with CD4+ decline to <350 copies/ml (p=0.05). Escape mutations in variable Gag proteins (p17 and p7p6) had a greater impact on disease progression than escape in more conserved regions (p24) (p=0.03). The association between HLA-associated escape mutations and CD4+ decline was independent of protective HLA allele (B*57, B*58:01, B*81) expression. Conclusion The high frequency of escape contributed to rapid disease progression in this cohort. While HLA-adaption in both conserved and variable Gag domains in the first year of infection was detrimental to long term clinical outcome, escape in variable domains had greater impact. PMID:27755110
Marais, Armelle; Faure, Chantal; Mustafayev, Eldar; Candresse, Thierry
2015-01-01
Double stranded RNAs from Prunus samples gathered from various surveys were analyzed by a deep-sequencing approach. Contig annotations revealed the presence of a potential new viral species in an Azerbaijani almond tree (Prunus amygdalus) and its genome sequence was completed. Its genomic organization is similar to that of the recently described Apricot vein clearing associated virus (AVCaV) for which two new isolates were also characterized, in a similar fashion, from two Japanese plums (Prunus salicina) from a French germplasm collection. The amino acid identity values between the four proteins encoded by the genome of the new virus have identity levels with those of AVCaV which fall clearly outside the species demarcation criteria. The new virus should therefore be considered as a new species for which the name of Caucasus prunus virus (CPrV) has been proposed. Phylogenetic relationships and nucleotide comparisons suggested that together with AVCaV, CPrV could define a new genus (proposed name: Prunevirus) in the family Betaflexiviridae. A molecular test targeting both members of the new genus was developed, allowing the detection of additional AVCaV isolates, and therefore extending the known geographical distribution and the host range of AVCaV. Moreover, the phylogenetic trees reconstructed with the amino acid sequences of replicase, movement and coat proteins of representative Betaflexiviridae members suggest that Citrus leaf blotch virus (CLBV, type member of the genus Citrivirus) may have evolved from a recombination event involving a Prunevirus, further highlighting the importance of recombination as a driving force in Betaflexiviridae evolution. The sequences reported in the present manuscript have been deposited in the GenBank database under accession numbers KM507061-KM504070.
Marais, Armelle; Faure, Chantal; Mustafayev, Eldar; Candresse, Thierry
2015-01-01
Double stranded RNAs from Prunus samples gathered from various surveys were analyzed by a deep-sequencing approach. Contig annotations revealed the presence of a potential new viral species in an Azerbaijani almond tree (Prunus amygdalus) and its genome sequence was completed. Its genomic organization is similar to that of the recently described Apricot vein clearing associated virus (AVCaV) for which two new isolates were also characterized, in a similar fashion, from two Japanese plums (Prunus salicina) from a French germplasm collection. The amino acid identity values between the four proteins encoded by the genome of the new virus have identity levels with those of AVCaV which fall clearly outside the species demarcation criteria. The new virus should therefore be considered as a new species for which the name of Caucasus prunus virus (CPrV) has been proposed. Phylogenetic relationships and nucleotide comparisons suggested that together with AVCaV, CPrV could define a new genus (proposed name: Prunevirus) in the family Betaflexiviridae. A molecular test targeting both members of the new genus was developed, allowing the detection of additional AVCaV isolates, and therefore extending the known geographical distribution and the host range of AVCaV. Moreover, the phylogenetic trees reconstructed with the amino acid sequences of replicase, movement and coat proteins of representative Betaflexiviridae members suggest that Citrus leaf blotch virus (CLBV, type member of the genus Citrivirus) may have evolved from a recombination event involving a Prunevirus, further highlighting the importance of recombination as a driving force in Betaflexiviridae evolution. The sequences reported in the present manuscript have been deposited in the GenBank database under accession numbers KM507061-KM504070. PMID:26086395
Russo, Alice G; Eden, John-Sebastian; Enosi Tuipulotu, Daniel; Shi, Mang; Selechnik, Daniel; Shine, Richard; Rollins, Lee Ann; Holmes, Edward C; White, Peter A
2018-06-13
Cane toads are a notorious invasive species, inhabiting over 1.2 million km 2 of Australia and threatening native biodiversity. Release of pathogenic cane toad viruses is one possible biocontrol strategy yet is currently hindered by the poorly-described cane toad virome. Metatranscriptomic analysis of 16 cane toad livers revealed the presence of a novel and full-length picornavirus, Rhimavirus A (RhiV-A), a member of a reptile and amphibian specific-cluster of the Picornaviridae basal to the Kobuvirus -like group. In the combined liver transcriptome, we also identified a complete genome sequence of a distinct epsilonretrovirus, R. marina endogenous retrovirus (RMERV). The recently sequenced cane toad genome contains eight complete RMERV proviruses, as well as 21 additional truncated insertions. The oldest full length RMERV provirus was estimated to have inserted 1.9 MYA. To screen for these viral sequences in additional toads, we analysed publicly available transcriptomes from six diverse Australian locations. RhiV-A transcripts were identified in toads sampled from three locations across 1,000 km of Australia, stretching to the current Western Australia (WA) invasion front, whilst RMERV transcripts were observed at all six sites. Lastly, we scanned the cane toad genome for non-retroviral endogenous viral elements, finding three sequences related to small DNA viruses in the family Circoviridae This shows ancestral circoviral infection with subsequent genomic integration. The identification of these current and past viral infections enriches our knowledge of the cane toad virome, an understanding of which will facilitate future work on infection and disease in this important invasive species. Importance Cane toads are poisonous amphibians which were introduced to Australia in 1935 for insect control. Since then, their population has increased dramatically, and they now threat many native Australian species. One potential method to control the population is to release a cane toad virus with high mortality, yet few cane toad viruses have been characterised. This study samples cane toads from different Australian locations and uses an RNA sequencing and computational approach to find new viruses. We report novel complete picornavirus and retrovirus sequences which were genetically similar to viruses infecting frogs, reptiles and fish. Using data generated in other studies, we show that these viral sequences are present in cane toads from distinct Australian locations. Three sequences related to circoviruses were also found in the toad genome. The identification of new viral sequences will aid future studies which investigate their prevalence and potential as agents for biocontrol. Copyright © 2018 American Society for Microbiology.
Yang, Teng-Chieh; Maluf, Nasib Karl
2012-02-21
Human adenovirus (Ad) is an icosahedral, double-stranded DNA virus. Viral DNA packaging refers to the process whereby the viral genome becomes encapsulated by the viral particle. In Ad, activation of the DNA packaging reaction requires at least three viral components: the IVa2 and L4-22K proteins and a section of DNA within the viral genome, called the packaging sequence. Previous studies have shown that the IVa2 and L4-22K proteins specifically bind to conserved elements within the packaging sequence and that these interactions are absolutely required for the observation of DNA packaging. However, the equilibrium mechanism for assembly of IVa2 and L4-22K onto the packaging sequence has not been determined. Here we characterize the assembly of the IVa2 and L4-22K proteins onto truncated packaging sequence DNA by analytical sedimentation velocity and equilibrium methods. At limiting concentrations of L4-22K, we observe a species with two IVa2 monomers and one L4-22K monomer bound to the DNA. In this species, the L4-22K monomer is promoting positive cooperative interactions between the two bound IVa2 monomers. As L4-22K levels are increased, we observe a species with one IVa2 monomer and three L4-22K monomers bound to the DNA. To explain this result, we propose a model in which L4-22K self-assembly on the DNA competes with IVa2 for positive heterocooperative interactions, destabilizing binding of the second IVa2 monomer. Thus, we propose that L4-22K levels control the extent of cooperativity observed between adjacently bound IVa2 monomers. We have also determined the hydrodynamic properties of all observed stoichiometric species; we observe that species with three L4-22K monomers bound have more extended conformations than species with a single L4-22K bound. We suggest this might reflect a molecular switch that controls insertion of the viral DNA into the capsid.
CRISPR-Cas systems exploit viral DNA injection to establish and maintain adaptive immunity.
Modell, Joshua W; Jiang, Wenyan; Marraffini, Luciano A
2017-04-06
Clustered regularly interspaced short palindromic repeats (CRISPR)-Cas systems provide protection against viral and plasmid infection by capturing short DNA sequences from these invaders and integrating them into the CRISPR locus of the prokaryotic host. These sequences, known as spacers, are transcribed into short CRISPR RNA guides that specify the cleavage site of Cas nucleases in the genome of the invader. It is not known when spacer sequences are acquired during viral infection. Here, to investigate this, we tracked spacer acquisition in Staphylococcus aureus cells harbouring a type II CRISPR-Cas9 system after infection with the staphylococcal bacteriophage ϕ12. We found that new spacers were acquired immediately after infection preferentially from the cos site, the viral free DNA end that is first injected into the cell. Analysis of spacer acquisition after infection with mutant phages demonstrated that most spacers are acquired during DNA injection, but not during other stages of the viral cycle that produce free DNA ends, such as DNA replication or packaging. Finally, we showed that spacers acquired from early-injected genomic regions, which direct Cas9 cleavage of the viral DNA immediately after infection, provide better immunity than spacers acquired from late-injected regions. Our results reveal that CRISPR-Cas systems exploit the phage life cycle to generate a pattern of spacer acquisition that ensures a successful CRISPR immune response.
Tumor malignancy is engaged to prokaryotic homolog toolbox.
Fernandes, Janaina; Guedes, Patrícia G; Lage, Celso Luiz S; Rodrigues, Juliany Cola F; Lage, Claudia de Alencar S
2012-04-01
Cancer cells display high proliferation rates and survival provided by high glycolysis, chemoresistance and radioresistance, metabolic features that appear to be activated with malignancy, and seemed to have arisen as early in evolution as in unicellular/prokaryotic organisms. Based on these assumptions, we hypothesize that aggressive phenotypes found in malignant cells may be related to acquired unicellular behavior, launched within a tumor when viral and prokaryotic homologs are overexpressed performing likely robust functions. The ensemble of these expressed viral and prokaryotic close homologs in the proteome of a tumor tissue gives them advantage over normal cells. To assess the hypothesis validity, sequences of human proteins involved in apoptosis, energetic metabolism, cell mobility and adhesion, chemo- and radio-resistance were aligned to homologs present in other life forms, excluding all eukaryotes, using PSI-BLAST, with further corroboration from data available in the literature. The analysis revealed that selected sequences of proteins involved in apoptosis and tumor suppression (as p53 and pRB) scored non-significant (E-value>0.001) with prokaryotic homologs; on the other hand, human proteins involved in cellular chemo- and radio-resistance scored highly significant with prokaryotic and viral homologs (as catalase, E-value=zero). We inferred that such upregulated and/or functionally activated proteins in aggressive malignant cells represent a toolbox of modern human homologs evolved from a similar key set that have granted survival of ancient prokaryotes against extremely harsh environments. According to what has been discussed along this analysis, high mutation rates usually hit hotspots in important conserved protein domains, allowing uncontrolled expansion of more resistant, death-evading malignant clones. That is the case of point mutations in key viral proteins affording viruses escape to chemotherapy, and human homologs of such retroviral proteins (as Ras, Akt and EGFR) can elicit the same phenotype. Furthermore, a corollary to this hypothesis presumes that target-directed anti-cancer therapy should target human protein domains of low similarity to prokaryotic homologs for a well-succeeded anti-cancer therapy. Copyright © 2012 Elsevier Ltd. All rights reserved.
Yue, Ling; Pfafferott, Katja J.; Baalwa, Joshua; ...
2015-01-08
Control of virus replication in HIV-1 infection is critical to delaying disease progression. While cellular immune responses are a key determinant of control, relatively little is known about the contribution of the infecting virus to this process. To gain insight into this interplay between virus and host in viral control, we conducted a detailed analysis of two heterosexual HIV-1 subtype A transmission pairs in which female recipients sharing three HLA class I alleles exhibited contrasting clinical outcomes: R880F controlled virus replication while R463F experienced high viral loads and rapid disease progression. Near full-length single genome amplification defined the infecting transmitted/foundermore » (T/F) virus proteome and subsequent sequence evolution over the first year of infection for both acutely infected recipients. T/F virus replicative capacities were compared in vitro, while the development of the earliest cellular immune response was defined using autologous virus sequence-based peptides. The R880F T/F virus replicated significantly slower in vitro than that transmitted to R463F. While neutralizing antibody responses were similar in both subjects, during acute infection R880F mounted a broad T cell response, the most dominant components of which targeted epitopes from which escape was limited. In contrast, the primary HIV-specific T cell response in R463F was focused on just two epitopes, one of which rapidly escaped. This comprehensive study highlights both the importance of the contribution of the lower replication capacity of the transmitted/founder virus and an associated induction of a broad primary HIV-specific T cell response, which was not undermined by rapid epitope escape, to long-term viral control in HIV-1 infection. It underscores the importance of the earliest CD8 T cell response targeting regions of the virus proteome that cannot mutate without a high fitness cost, further emphasizing the need for vaccines that elicit a breadth of T cell responses to conserved viral epitopes.« less
DOE Office of Scientific and Technical Information (OSTI.GOV)
Yue, Ling; Pfafferott, Katja J.; Baalwa, Joshua
Control of virus replication in HIV-1 infection is critical to delaying disease progression. While cellular immune responses are a key determinant of control, relatively little is known about the contribution of the infecting virus to this process. To gain insight into this interplay between virus and host in viral control, we conducted a detailed analysis of two heterosexual HIV-1 subtype A transmission pairs in which female recipients sharing three HLA class I alleles exhibited contrasting clinical outcomes: R880F controlled virus replication while R463F experienced high viral loads and rapid disease progression. Near full-length single genome amplification defined the infecting transmitted/foundermore » (T/F) virus proteome and subsequent sequence evolution over the first year of infection for both acutely infected recipients. T/F virus replicative capacities were compared in vitro, while the development of the earliest cellular immune response was defined using autologous virus sequence-based peptides. The R880F T/F virus replicated significantly slower in vitro than that transmitted to R463F. While neutralizing antibody responses were similar in both subjects, during acute infection R880F mounted a broad T cell response, the most dominant components of which targeted epitopes from which escape was limited. In contrast, the primary HIV-specific T cell response in R463F was focused on just two epitopes, one of which rapidly escaped. This comprehensive study highlights both the importance of the contribution of the lower replication capacity of the transmitted/founder virus and an associated induction of a broad primary HIV-specific T cell response, which was not undermined by rapid epitope escape, to long-term viral control in HIV-1 infection. It underscores the importance of the earliest CD8 T cell response targeting regions of the virus proteome that cannot mutate without a high fitness cost, further emphasizing the need for vaccines that elicit a breadth of T cell responses to conserved viral epitopes.« less
The Vaginal Eukaryotic DNA Virome and Preterm Birth.
Wylie, Kristine M; Wylie, Todd N; Cahill, Alison G; Macones, George A; Tuuli, Methodius G; Stout, Molly J
2018-05-05
Despite decades of attempts to link infectious agents to preterm birth, an exact causative microbe or community of microbes remains elusive. Culture-independent sequencing of vaginal bacterial communities demonstrates community characteristics are associated with preterm birth, although none are specific enough to apply clinically. Viruses are important components of the vaginal microbiome and have dynamic relationships with vaginal bacterial communities. We hypothesized that vaginal eukaryotic DNA viral communities (the "vaginal virome") either alone or in the context of bacterial communities are associated with preterm birth. The objective of this study was to use high-throughput sequencing to examine the vaginal eukaryotic DNA virome in a cohort of pregnant women and examine associations between vaginal community characteristics and preterm birth. This is a nested case-control study within a prospective cohort study of women with singleton pregnancies, not on supplemental progesterone, and without cervical cerclage in situ. Serial mid-vaginal swabs were obtained at routine prenatal visits. DNA was extracted, bacterial communities were characterized by 16S rRNA gene sequencing, and eukaryotic viral communities were characterized by enrichment of viral nucleic acid with the ViroCap targeted sequence capture panel followed by nucleic acid sequencing. Viral communities were analyzed according to presence/absence of viruses, diversity, dynamics over time, and association with bacterial community data obtained from the same specimens. Sixty subjects contributed 128 vaginal swabs longitudinally across pregnancy. Twenty-four patients delivered preterm. Participants were predominantly African-American (65%). Six families of eukaryotic DNA viruses were detected in the vaginal samples. At least 1 virus was detected in 80% of women. No specific virus or group of viruses was associated with preterm delivery. Higher viral richness was significantly associated with preterm delivery in the full group and in the African American subgroup (P=0.0005 and P=0.0003, respectively). Having both high bacterial diversity and high viral diversity in the first trimester was associated with the highest risk for preterm birth. Higher vaginal viral diversity is associated with preterm birth. Changes in vaginal virome diversity appear similar to changes in the vaginal bacterial microbiome over pregnancy, suggesting that underlying physiology of pregnancy may regulate both bacterial and viral communities. Copyright © 2018 Elsevier Inc. All rights reserved.
May, Jared; Johnson, Philip; Saleem, Huma
2017-01-01
ABSTRACT To maximize the coding potential of viral genomes, internal ribosome entry sites (IRES) can be used to bypass the traditional requirement of a 5′ cap and some/all of the associated translation initiation factors. Although viral IRES typically contain higher-order RNA structure, an unstructured sequence of about 84 nucleotides (nt) immediately upstream of the Turnip crinkle virus (TCV) coat protein (CP) open reading frame (ORF) has been found to promote internal expression of the CP from the genomic RNA (gRNA) both in vitro and in vivo. An absence of extensive RNA structure was predicted using RNA folding algorithms and confirmed by selective 2′-hydroxyl acylation analyzed by primer extension (SHAPE) RNA structure probing. Analysis of the IRES region in vitro by use of both the TCV gRNA and reporter constructs did not reveal any sequence-specific elements but rather suggested that an overall lack of structure was an important feature for IRES activity. The CP IRES is A-rich, independent of orientation, and strongly conserved among viruses in the same genus. The IRES was dependent on eIF4G, but not eIF4E, for activity. Low levels of CP accumulated in vivo in the absence of detectable TCV subgenomic RNAs, strongly suggesting that the IRES was active in the gRNA in vivo. Since the TCV CP also serves as the viral silencing suppressor, early translation of the CP from the viral gRNA is likely important for countering host defenses. Cellular mRNA IRES also lack extensive RNA structures or sequence conservation, suggesting that this viral IRES and cellular IRES may have similar strategies for internal translation initiation. IMPORTANCE Cap-independent translation is a common strategy among positive-sense, single-stranded RNA viruses for bypassing the host cell requirement of a 5′ cap structure. Viral IRES, in general, contain extensive secondary structure that is critical for activity. In contrast, we demonstrate that a region of viral RNA devoid of extensive secondary structure has IRES activity and produces low levels of viral coat protein in vitro and in vivo. Our findings may be applicable to cellular mRNA IRES that also have little or no sequences/structures in common. PMID:28179526
Bondre, Vijay P; Sankararaman, Vasudha; Andhare, Vijaysinh; Tupekar, Manisha; Sapkal, Gajanan N
2016-11-01
Human herpes simplex virus 1 (HSV-1) is the most common cause of sporadic encephalitis in humans that contributes to >10 per cent of the encephalitis cases occurring worldwide. Availability of limited full genome sequences from a small number of isolates resulted in poor understanding of host and viral factors responsible for variable clinical outcome. In this study genetic relationship, extent and source of recombination using full-length genome sequence derived from a newly isolated HSV-1 isolate was studied in comparison with those sampled from patients with varied clinical outcome. Full genome sequence of HSV-1 isolated from cerebrospinal fluid (CSF) of a patient with acute encephalitis syndrome (AES) by inoculation in baby hamster kidney-21 (BHK-21) cells was determined using next-generation sequencing (NGS) technology. Phylogenetic analysis of the newly generated sequence in comparison with 33 additional full-length genomes defined genetic relationship with worldwide distributed strains. The bootscan and similarity plot analysis defined recombination crossovers and similarities between newly isolated Indian HSV-1 with six Asian and a total of 34 worldwide isolated strains. Mapping of 376,332 reads amplified from HSV-1 DNA by NGS generated full-length genome of 151,024 bp from newly isolated Indian HSV-1. Phylogenetic analysis classified worldwide distributed strains into three major evolutionary lineages correlating to their geographic distribution. Lineage 1 containing strains were isolated from America and Europe; lineage 2 contained all the strains from Asian countries along with the North American KOS and RE strains whereas the South African isolates were distributed into two groups under lineage 3. Recombination analysis confirmed events of recombination in Indian HSV-1 genome resulting from mixing of different strains evolved in Asian countries. Our results showed that the full-length genome sequence generated from an Indian HSV-1 isolate shared close genetic relationship with the American KOS and Chinese CR38 strains which belonged to the Asian genetic lineage. Recombination analysis of Indian isolate demonstrated multiple recombination crossover points throughout the genome. This full-length genome sequence amplified from the Indian isolate would be helpful to study HSV evolution, genetic basis of differential pathogenesis, host-virus interactions and viral factors contributing towards differential clinical outcome in human infections.
Bondre, Vijay P.; Sankararaman, Vasudha; Andhare, Vijaysinh; Tupekar, Manisha; Sapkal, Gajanan N.
2016-01-01
Background & objectives: Human herpes simplex virus 1 (HSV-1) is the most common cause of sporadic encephalitis in humans that contributes to >10 per cent of the encephalitis cases occurring worldwide. Availability of limited full genome sequences from a small number of isolates resulted in poor understanding of host and viral factors responsible for variable clinical outcome. In this study genetic relationship, extent and source of recombination using full-length genome sequence derived from a newly isolated HSV-1 isolate was studied in comparison with those sampled from patients with varied clinical outcome. Methods: Full genome sequence of HSV-1 isolated from cerebrospinal fluid (CSF) of a patient with acute encephalitis syndrome (AES) by inoculation in baby hamster kidney-21 (BHK-21) cells was determined using next-generation sequencing (NGS) technology. Phylogenetic analysis of the newly generated sequence in comparison with 33 additional full-length genomes defined genetic relationship with worldwide distributed strains. The bootscan and similarity plot analysis defined recombination crossovers and similarities between newly isolated Indian HSV-1 with six Asian and a total of 34 worldwide isolated strains. Results: Mapping of 376,332 reads amplified from HSV-1 DNA by NGS generated full-length genome of 151,024 bp from newly isolated Indian HSV-1. Phylogenetic analysis classified worldwide distributed strains into three major evolutionary lineages correlating to their geographic distribution. Lineage 1 containing strains were isolated from America and Europe; lineage 2 contained all the strains from Asian countries along with the North American KOS and RE strains whereas the South African isolates were distributed into two groups under lineage 3. Recombination analysis confirmed events of recombination in Indian HSV-1 genome resulting from mixing of different strains evolved in Asian countries. Interpretation & conclusions: Our results showed that the full-length genome sequence generated from an Indian HSV-1 isolate shared close genetic relationship with the American KOS and Chinese CR38 strains which belonged to the Asian genetic lineage. Recombination analysis of Indian isolate demonstrated multiple recombination crossover points throughout the genome. This full-length genome sequence amplified from the Indian isolate would be helpful to study HSV evolution, genetic basis of differential pathogenesis, host-virus interactions and viral factors contributing towards differential clinical outcome in human infections. PMID:28361829
Virus versus Host Plant MicroRNAs: Who Determines the Outcome of the Interaction?
Maghuly, Fatemeh; Ramkat, Rose C.; Laimer, Margit
2014-01-01
Considering the importance of microRNAs (miRNAs) in the regulation of essential processes in plant pathogen interactions, it is not surprising that, while plant miRNA sequences counteract viral attack via antiviral RNA silencing, viruses in turn have developed antihost defense mechanisms blocking these RNA silencing pathways and establish a counter-defense. In the current study, computational and stem-loop Reverse Transcription – Polymerase Chain Reaction (RT-PCR) approaches were employed to a) predict and validate virus encoded mature miRNAs (miRs) in 39 DNA-A sequences of the bipartite genomes of African cassava mosaic virus (ACMV) and East African cassava mosaic virus-Uganda (EACMV-UG) isolates, b) determine whether virus encoded miRs/miRs* generated from the 5′/3′ harpin arms have the capacity to bind to genomic sequences of the host plants Jatropha or cassava and c) investigate whether plant encoded miR/miR* sequences have the potential to bind to the viral genomes. Different viral pre-miRNA hairpin sequences and viral miR/miR* length variants occurring as isomiRs were predicted in both viruses. These miRNAs were located in three Open Reading Frames (ORFs) and in the Intergenic Region (IR). Moreover, various target genes for miRNAs from both viruses were predicted and annotated in the host plant genomes indicating that they are involved in biotic response, metabolic pathways and transcription factors. Plant miRs/miRs* from conserved and highly expressed families were identified, which were shown to have potential targets in the genome of both begomoviruses, representing potential plant miRNAs mediating antiviral defense. This is the first assessment of predicted viral miRs/miRs* of ACMV and EACMV-UG and host plant miRNAs, providing a reference point for miRNA identification in pathogens and their hosts. These findings will improve the understanding of host- pathogen interaction pathways and the function of viral miRNAs in Euphorbiaceous crop plants. PMID:24896088
DOE Office of Scientific and Technical Information (OSTI.GOV)
Weiss, S.B.
Our laboratory has explored the use of short DNA oligomers as targets for activated polycyclic aromatic hydrocarbons, such as benzo(a)pyrene diol epoxide (BPDE), in order to detect alterations in DNA sequence arrangement. In this model system, oligomers alkylated with (+)-BPDE are ligated into M13 viral DNA and used to transfect Escherichia coli. These cells are plated on agar, incubated at 37/sup 0/C, progeny viral clones are selected, amplified, and the viral DNAs isolated are sequenced at the site of oligomer insertion. We have devised a procedure for the preparation of unique duplex DNA oligomers such that the site of oligomermore » alkylation is specific for a single deoxynucleotide species in the two DNA strands. The procedure for oligomer assembly also allows us to vary the position of the alkylated residue in each of the two strands. Using our model system, the results obtained over the past year can be summarized as follows. When nonalkylated oligomer constructs are ligated into M13 viral DNA and used to transfect E. coli, no modifications in DNA sequence arrangement are detected in progeny viral DNAs. On the other hand, with oligomer constructs containing BP-adducts two major types of modifications in DNA sequence arrangement were observed: (1) large deletions, and (2) nonhomologous (illegitimate) recombinants. Both of these DNA modifications result in the complete removal of the oligomer insert. Transfection of E. coli that are recA/sup -/ does not alter these DNA modifications, therefore, it appears that the deletions and recombinants induced by the alkylated inserts are not under control of the RecA gene. As the distance between the alkylated residues in the duplex strands is increased, the number of recombinant events detected is reduced. In addition to the above types of DNA modifications, restoration of the original nucleotide sequence in the alkylated construct was also observed in progeny viral DNAs. 7 refs., 6 figs., 2 tabs.« less
Ip, Hon S.; Wiley, Michael R.; Long, Renee; Gustavo, Palacios; Shearn-Bochsler, Valerie; Whitehouse, Chris A.
2014-01-01
Advances in massively parallel DNA sequencing platforms, commonly termed next-generation sequencing (NGS) technologies, have greatly reduced time, labor, and cost associated with DNA sequencing. Thus, NGS has become a routine tool for new viral pathogen discovery and will likely become the standard for routine laboratory diagnostics of infectious diseases in the near future. This study demonstrated the application of NGS for the rapid identification and characterization of a virus isolated from the brain of an endangered Mississippi sandhill crane. This bird was part of a population restoration effort and was found in an emaciated state several days after Hurricane Isaac passed over the refuge in Mississippi in 2012. Post-mortem examination had identified trichostrongyliasis as the possible cause of death, but because a virus with morphology consistent with a togavirus was isolated from the brain of the bird, an arboviral etiology was strongly suspected. Because individual molecular assays for several known arboviruses were negative, unbiased NGS by Illumina MiSeq was used to definitively identify and characterize the causative viral agent. Whole genome sequencing and phylogenetic analysis revealed the viral isolate to be the Highlands J virus, a known avian pathogen. This study demonstrates the use of unbiased NGS for the rapid detection and characterization of an unidentified viral pathogen and the application of this technology to wildlife disease diagnostics and conservation medicine.
Full-length genomic characterization and molecular evolution of canine parvovirus in China.
Zhou, Ling; Tang, Qinghai; Shi, Lijun; Kong, Miaomiao; Liang, Lin; Mao, Qianqian; Bu, Bin; Yao, Lunguang; Zhao, Kai; Cui, Shangjin; Leal, Élcio
2016-06-01
Canine parvovirus type 2 (CPV-2) can cause acute haemorrhagic enteritis in dogs and myocarditis in puppies. This disease has become one of the most serious infectious diseases of dogs. During 2014 in China, there were many cases of acute infectious diarrhoea in dogs. Some faecal samples were negative for the CPV-2 antigen based on a colloidal gold test strip but were positive based on PCR, and a viral strain was isolated from one such sample. The cytopathic effect on susceptible cells and the results of the immunoperoxidase monolayer assay, PCR, and sequencing indicated that the pathogen was CPV-2. The strain was named CPV-NY-14, and the full-length genome was sequenced and analysed. A maximum likelihood tree was constructed using the full-length genome and all available CPV-2 genomes. New strains have replaced the original strain in Taiwan and Italy, although the CPV-2a strain is still predominant there. However, CPV-2a still causes many cases of acute infectious diarrhoea in dogs in China.
Lescot, Magali; Hingamp, Pascal; Kojima, Kenji K; Villar, Emilie; Romac, Sarah; Veluchamy, Alaguraj; Boccara, Martine; Jaillon, Olivier; Iudicone, Daniele; Bowler, Chris; Wincker, Patrick; Claverie, Jean-Michel; Ogata, Hiroyuki
2016-01-01
Genes encoding reverse transcriptases (RTs) are found in most eukaryotes, often as a component of retrotransposons, as well as in retroviruses and in prokaryotic retroelements. We investigated the abundance, classification and transcriptional status of RTs based on Tara Oceans marine metagenomes and metatranscriptomes encompassing a wide organism size range. Our analyses revealed that RTs predominate large-size fraction metagenomes (>5 μm), where they reached a maximum of 13.5% of the total gene abundance. Metagenomic RTs were widely distributed across the phylogeny of known RTs, but many belonged to previously uncharacterized clades. Metatranscriptomic RTs showed distinct abundance patterns across samples compared with metagenomic RTs. The relative abundances of viral and bacterial RTs among identified RT sequences were higher in metatranscriptomes than in metagenomes and these sequences were detected in all metatranscriptome size fractions. Overall, these observations suggest an active proliferation of various RT-assisted elements, which could be involved in genome evolution or adaptive processes of plankton assemblage. PMID:26613339