Goldsmith, Dawn B.; Parsons, Rachel J.; Beyene, Damitu; Salamon, Peter
2015-01-01
Deep sequencing of the viral phoH gene, a host-derived auxiliary metabolic gene, was used to track viral diversity throughout the water column at the Bermuda Atlantic Time-series Study (BATS) site in the summer (September) and winter (March) of three years. Viral phoH sequences reveal differences in the viral communities throughout a depth profile and between seasons in the same year. Variation was also detected between the same seasons in subsequent years, though these differences were not as great as the summer/winter distinctions. Over 3,600 phoH operational taxonomic units (OTUs; 97% sequence identity) were identified. Despite high richness, most phoH sequences belong to a few large, common OTUs whereas the majority of the OTUs are small and rare. While many OTUs make sporadic appearances at just a few times or depths, a small number of OTUs dominate the community throughout the seasons, depths, and years. PMID:26157645
Exploring viral infection using single-cell sequencing.
Rato, Sylvie; Golumbeanu, Monica; Telenti, Amalio; Ciuffi, Angela
2017-07-15
Single-cell sequencing (SCS) has emerged as a valuable tool to study cellular heterogeneity in diverse fields, including virology. By studying the viral and cellular genome and/or transcriptome, the dynamics of viral infection can be investigated at single cell level. Most studies have explored the impact of cell-to-cell variation on the viral life cycle from the point of view of the virus, by analyzing viral sequences, and from the point of view of the cell, mainly by analyzing the cellular host transcriptome. In this review, we will focus on recent studies that use single-cell sequencing to explore viral diversity and cell variability in response to viral replication. Copyright © 2016 The Authors. Published by Elsevier B.V. All rights reserved.
Viral genetic variation accounts for a third of variability in HIV-1 set-point viral load in Europe.
Blanquart, François; Wymant, Chris; Cornelissen, Marion; Gall, Astrid; Bakker, Margreet; Bezemer, Daniela; Hall, Matthew; Hillebregt, Mariska; Ong, Swee Hoe; Albert, Jan; Bannert, Norbert; Fellay, Jacques; Fransen, Katrien; Gourlay, Annabelle J; Grabowski, M Kate; Gunsenheimer-Bartmeyer, Barbara; Günthard, Huldrych F; Kivelä, Pia; Kouyos, Roger; Laeyendecker, Oliver; Liitsola, Kirsi; Meyer, Laurence; Porter, Kholoud; Ristola, Matti; van Sighem, Ard; Vanham, Guido; Berkhout, Ben; Kellam, Paul; Reiss, Peter; Fraser, Christophe
2017-06-01
HIV-1 set-point viral load-the approximately stable value of viraemia in the first years of chronic infection-is a strong predictor of clinical outcome and is highly variable across infected individuals. To better understand HIV-1 pathogenesis and the evolution of the viral population, we must quantify the heritability of set-point viral load, which is the fraction of variation in this phenotype attributable to viral genetic variation. However, current estimates of heritability vary widely, from 6% to 59%. Here we used a dataset of 2,028 seroconverters infected between 1985 and 2013 from 5 European countries (Belgium, Switzerland, France, the Netherlands and the United Kingdom) and estimated the heritability of set-point viral load at 31% (CI 15%-43%). Specifically, heritability was measured using models of character evolution describing how viral load evolves on the phylogeny of whole-genome viral sequences. In contrast to previous studies, (i) we measured viral loads using standardized assays on a sample collected in a strict time window of 6 to 24 months after infection, from which the viral genome was also sequenced; (ii) we compared 2 models of character evolution, the classical "Brownian motion" model and another model ("Ornstein-Uhlenbeck") that includes stabilising selection on viral load; (iii) we controlled for covariates, including age and sex, which may inflate estimates of heritability; and (iv) we developed a goodness of fit test based on the correlation of viral loads in cherries of the phylogenetic tree, showing that both models of character evolution fit the data well. An overall heritability of 31% (CI 15%-43%) is consistent with other studies based on regression of viral load in donor-recipient pairs. Thus, about a third of variation in HIV-1 virulence is attributable to viral genetic variation.
Sequence variation of the feline immunodeficiency virus genome and its clinical relevance.
Stickney, A L; Dunowska, M; Cave, N J
2013-06-08
The ongoing evolution of feline immunodeficiency virus (FIV) has resulted in the existence of a diverse continuum of viruses. FIV isolates differ with regards to their mutation and replication rates, plasma viral loads, cell tropism and the ability to induce apoptosis. Clinical disease in FIV-infected cats is also inconsistent. Genomic sequence variation of FIV is likely to be responsible for some of the variation in viral behaviour. The specific genetic sequences that influence these key viral properties remain to be determined. With knowledge of the specific key determinants of pathogenicity, there is the potential for veterinarians in the future to apply this information for prognostic purposes. Genomic sequence variation of FIV also presents an obstacle to effective vaccine development. Most challenge studies demonstrate acceptable efficacy of a dual-subtype FIV vaccine (Fel-O-Vax FIV) against FIV infection under experimental settings; however, vaccine efficacy in the field still remains to be proven. It is important that we discover the key determinants of immunity induced by this vaccine; such data would compliment vaccine field efficacy studies and provide the basis to make informed recommendations on its use.
Hirose, Yusuke; Onuki, Mamiko; Tenjimbayashi, Yuri; Mori, Seiichiro; Ishii, Yoshiyuki; Takeuchi, Takamasa; Tasaka, Nobutaka; Satoh, Toyomi; Morisada, Tohru; Iwata, Takashi; Miyamoto, Shingo; Matsumoto, Koji; Sekizawa, Akihiko; Kukimoto, Iwao
2018-06-15
Persistent infection with oncogenic human papillomaviruses (HPVs) causes cervical cancer, accompanied by the accumulation of somatic mutations into the host genome. There are concomitant genetic changes in the HPV genome during viral infection; however, their relevance to cervical carcinogenesis is poorly understood. Here, we explored within-host genetic diversity of HPV by performing deep-sequencing analyses of viral whole-genome sequences in clinical specimens. The whole genomes of HPV types 16, 52, and 58 were amplified by type-specific PCR from total cellular DNA of cervical exfoliated cells collected from patients with cervical intraepithelial neoplasia (CIN) and invasive cervical cancer (ICC) and were deep sequenced. After constructing a reference viral genome sequence for each specimen, nucleotide positions showing changes with >0.5% frequencies compared to the reference sequence were determined for individual samples. In total, 1,052 positions of nucleotide variations were detected in HPV genomes from 151 samples (CIN1, n = 56; CIN2/3, n = 68; ICC, n = 27), with various numbers per sample. Overall, C-to-T and C-to-A substitutions were the dominant changes observed across all histological grades. While C-to-T transitions were predominantly detected in CIN1, their prevalence was decreased in CIN2/3 and fell below that of C-to-A transversions in ICC. Analysis of the trinucleotide context encompassing substituted bases revealed that TpCpN, a preferred target sequence for cellular APOBEC cytosine deaminases, was a primary site for C-to-T substitutions in the HPV genome. These results strongly imply that the APOBEC proteins are drivers of HPV genome mutation, particularly in CIN1 lesions. IMPORTANCE HPVs exhibit surprisingly high levels of genetic diversity, including a large repertoire of minor genomic variants in each viral genotype. Here, by conducting deep-sequencing analyses, we show for the first time a comprehensive snapshot of the within-host genetic diversity of high-risk HPVs during cervical carcinogenesis. Quasispecies harboring minor nucleotide variations in viral whole-genome sequences were extensively observed across different grades of CIN and cervical cancer. Among the within-host variations, C-to-T transitions, a characteristic change mediated by cellular APOBEC cytosine deaminases, were predominantly detected throughout the whole viral genome, most strikingly in low-grade CIN lesions. The results strongly suggest that within-host variations of the HPV genome are primarily generated through the interaction with host cell DNA-editing enzymes and that such within-host variability is an evolutionary source of the genetic diversity of HPVs. Copyright © 2018 American Society for Microbiology.
Viral genetic variation accounts for a third of variability in HIV-1 set-point viral load in Europe
Wymant, Chris; Cornelissen, Marion; Gall, Astrid; Bakker, Margreet; Bezemer, Daniela; Hall, Matthew; Hillebregt, Mariska; Ong, Swee Hoe; Albert, Jan; Bannert, Norbert; Fellay, Jacques; Fransen, Katrien; Gourlay, Annabelle J.; Grabowski, M. Kate; Gunsenheimer-Bartmeyer, Barbara; Günthard, Huldrych F.; Kivelä, Pia; Kouyos, Roger; Laeyendecker, Oliver; Liitsola, Kirsi; Meyer, Laurence; Porter, Kholoud; Ristola, Matti; van Sighem, Ard; Vanham, Guido; Berkhout, Ben; Kellam, Paul; Reiss, Peter; Fraser, Christophe
2017-01-01
HIV-1 set-point viral load—the approximately stable value of viraemia in the first years of chronic infection—is a strong predictor of clinical outcome and is highly variable across infected individuals. To better understand HIV-1 pathogenesis and the evolution of the viral population, we must quantify the heritability of set-point viral load, which is the fraction of variation in this phenotype attributable to viral genetic variation. However, current estimates of heritability vary widely, from 6% to 59%. Here we used a dataset of 2,028 seroconverters infected between 1985 and 2013 from 5 European countries (Belgium, Switzerland, France, the Netherlands and the United Kingdom) and estimated the heritability of set-point viral load at 31% (CI 15%–43%). Specifically, heritability was measured using models of character evolution describing how viral load evolves on the phylogeny of whole-genome viral sequences. In contrast to previous studies, (i) we measured viral loads using standardized assays on a sample collected in a strict time window of 6 to 24 months after infection, from which the viral genome was also sequenced; (ii) we compared 2 models of character evolution, the classical “Brownian motion” model and another model (“Ornstein–Uhlenbeck”) that includes stabilising selection on viral load; (iii) we controlled for covariates, including age and sex, which may inflate estimates of heritability; and (iv) we developed a goodness of fit test based on the correlation of viral loads in cherries of the phylogenetic tree, showing that both models of character evolution fit the data well. An overall heritability of 31% (CI 15%–43%) is consistent with other studies based on regression of viral load in donor–recipient pairs. Thus, about a third of variation in HIV-1 virulence is attributable to viral genetic variation. PMID:28604782
Vinner, Lasse; Mourier, Tobias; Friis-Nielsen, Jens; Gniadecki, Robert; Dybkaer, Karen; Rosenberg, Jacob; Langhoff, Jill Levin; Cruz, David Flores Santa; Fonager, Jannik; Izarzugaza, Jose M G; Gupta, Ramneek; Sicheritz-Ponten, Thomas; Brunak, Søren; Willerslev, Eske; Nielsen, Lars Peter; Hansen, Anders Johannes
2015-08-19
Although nearly one fifth of all human cancers have an infectious aetiology, the causes for the majority of cancers remain unexplained. Despite the enormous data output from high-throughput shotgun sequencing, viral DNA in a clinical sample typically constitutes a proportion of host DNA that is too small to be detected. Sequence variation among virus genomes complicates application of sequence-specific, and highly sensitive, PCR methods. Therefore, we aimed to develop and characterize a method that permits sensitive detection of sequences despite considerable variation. We demonstrate that our low-stringency in-solution hybridization method enables detection of <100 viral copies. Furthermore, distantly related proviral sequences may be enriched by orders of magnitude, enabling discovery of hitherto unknown viral sequences by high-throughput sequencing. The sensitivity was sufficient to detect retroviral sequences in clinical samples. We used this method to conduct an investigation for novel retrovirus in samples from three cancer types. In accordance with recent studies our investigation revealed no retroviral infections in human B-cell lymphoma cells, cutaneous T-cell lymphoma or colorectal cancer biopsies. Nonetheless, our generally applicable method makes sensitive detection possible and permits sequencing of distantly related sequences from complex material.
Amexis, Georgios; Oeth, Paul; Abel, Kenneth; Ivshina, Anna; Pelloquin, Francois; Cantor, Charles R.; Braun, Andreas; Chumakov, Konstantin
2001-01-01
RNA viruses exist as quasispecies, heterogeneous and dynamic mixtures of mutants having one or more consensus sequences. An adequate description of the genomic structure of such viral populations must include the consensus sequence(s) plus a quantitative assessment of sequence heterogeneities. For example, in quality control of live attenuated viral vaccines, the presence of even small quantities of mutants or revertants may indicate incomplete or unstable attenuation that may influence vaccine safety. Previously, we demonstrated the monitoring of oral poliovirus vaccine with the use of mutant analysis by PCR and restriction enzyme cleavage (MAPREC). In this report, we investigate genetic variation in live attenuated mumps virus vaccine by using both MAPREC and a platform (DNA MassArray) based on matrix-assisted laser desorption/ionization time-of-flight (MALDI-TOF) mass spectrometry. Mumps vaccines prepared from the Jeryl Lynn strain typically contain at least two distinct viral substrains, JL1 and JL2, which have been characterized by full length sequencing. We report the development of assays for characterizing sequence variants in these substrains and demonstrate their use in quantitative analysis of substrains and sequence variations in mixed virus cultures and mumps vaccines. The results obtained from both the MAPREC and MALDI-TOF methods showed excellent correlation. This suggests the potential utility of MALDI-TOF for routine quality control of live viral vaccines and for assessment of genetic stability and quantitative monitoring of genetic changes in other RNA viruses of clinical interest. PMID:11593021
NASA Astrophysics Data System (ADS)
Shekhar, Karthik; Ruberman, Claire F.; Ferguson, Andrew L.; Barton, John P.; Kardar, Mehran; Chakraborty, Arup K.
2013-12-01
Mutational escape from vaccine-induced immune responses has thwarted the development of a successful vaccine against AIDS, whose causative agent is HIV, a highly mutable virus. Knowing the virus' fitness as a function of its proteomic sequence can enable rational design of potent vaccines, as this information can focus vaccine-induced immune responses to target mutational vulnerabilities of the virus. Spin models have been proposed as a means to infer intrinsic fitness landscapes of HIV proteins from patient-derived viral protein sequences. These sequences are the product of nonequilibrium viral evolution driven by patient-specific immune responses and are subject to phylogenetic constraints. How can such sequence data allow inference of intrinsic fitness landscapes? We combined computer simulations and variational theory á la Feynman to show that, in most circumstances, spin models inferred from patient-derived viral sequences reflect the correct rank order of the fitness of mutant viral strains. Our findings are relevant for diverse viruses.
Setiawan, Laurentia C; Gijsbers, Esther F; van Nuenen, Adrianus C; Kootstra, Neeltje A
2015-08-01
The HLA-B27 allele is over-represented among human immunodeficiency virus type 1-infected long-term non-progressors. In these patients, strong CTL responses targeting HLA-B27-restricted viral epitopes have been associated with long-term asymptomatic survival. Indeed, loss of control of viraemia in HLA-B27 patients has been associated with CTL escape at position 264 in the immunodominant KK10 epitope. This CTL escape mutation in the viral Gag protein has been associated with severe viral attenuation and may require the presence of compensatory mutations before emerging. Here, we studied sequence evolution within HLA-B27-restricted CTL epitopes in the viral Gag protein during the course of infection of seven HLA-B27-positive patients. Longitudinal gag sequences obtained at different time points around the time of AIDS diagnosis were obtained and analysed for the presence of mutations in epitopes restricted by HLA-B27, and for potential compensatory mutations. Sequence variations were observed in the HLA-B27-restricted CTL epitopes IK9 and DR11, and the immunodominant KK10 epitope. However, the presence of sequence variations in the HLA-B27-restricted CTL epitopes could not be associated with an increase in viraemia in the majority of the patients studied. Furthermore, we observed low genetic diversity in the gag region of the viral variants throughout the course of infection, which is indicative of low viral replication and corresponds to the low viral load observed in the HLA-B27-positive patients. These data indicated that control of viral replication can be maintained in HLA-B27-positive patients despite the emergence of viral mutations in HLA-B27-restricted epitopes.
The nucleotide sequence and genome organization of Plasmopara halstedii virus.
Heller-Dohmen, Marion; Göpfert, Jens C; Pfannstiel, Jens; Spring, Otmar
2011-03-17
Only very few viruses of Oomycetes have been studied in detail. Isometric virions were found in different isolates of the oomycete Plasmopara halstedii, the downy mildew pathogen of sunflower. However, complete nucleotide sequences and data on the genome organization were lacking. Viral RNA of different P. halstedii isolates was subjected to nucleotide sequencing and analysis of the viral genome. The N-terminal sequence of the viral coat protein was determined using Top-Down MALDI-TOF analysis. The complete nucleotide sequences of both single-stranded RNA segments (RNA1 and RNA2) were established. RNA1 consisted of 2793 nucleotides (nt) exclusive its 3' poly(A) tract and a single open-reading frame (ORF1) of 2745 nt. ORF1 was framed by a 5' untranslated region (5' UTR) of 18 nt and a 3' untranslated region (3' UTR) of 30 nt. ORF1 contained motifs of RNA-dependent RNA polymerases (RdRp) and showed similarities to RdRp of Scleropthora macrospora virus A (SmV A) and viruses within the Nodaviridae family. RNA2 consisted of 1526 nt exclusive its 3' poly(A) tract and a second ORF (ORF2) of 1128 nt. ORF2 coded for the single viral coat protein (CP) and was framed by a 5' UTR of 164 nt and a 3' UTR of 234 nt. The deduced amino acid sequence of ORF2 was verified by nano-LC-ESI-MS/MS experiments. Top-Down MALDI-TOF analysis revealed the N-terminal sequence of the CP. The N-terminal sequence represented a region within ORF2 suggesting a proteolytic processing of the CP in vivo. The CP showed similarities to CP of SmV A and viruses within the Tombusviridae family. Fragments of RNA1 (ca. 1.9 kb) and RNA2 (ca. 1.4 kb) were used to analyze the nucleotide sequence variation of virions in different P. halstedii isolates. Viral sequence variation was 0.3% or less regardless of their host's pathotypes, the geographical origin and the sensitivity towards the fungicide metalaxyl. The results showed the presence of a single and new virus type in different P. halstedii isolates. Insignificant viral sequence variation indicated that the virus did not account for differences in pathogenicity of the oomycete P. halstedii.
Extreme heterogeneity of influenza virus infection in single cells
Russell, Alistair B; Trapnell, Cole
2018-01-01
Viral infection can dramatically alter a cell’s transcriptome. However, these changes have mostly been studied by bulk measurements on many cells. Here we use single-cell mRNA sequencing to examine the transcriptional consequences of influenza virus infection. We find extremely wide cell-to-cell variation in the productivity of viral transcription – viral transcripts comprise less than a percent of total mRNA in many infected cells, but a few cells derive over half their mRNA from virus. Some infected cells fail to express at least one viral gene, but this gene absence only partially explains variation in viral transcriptional load. Despite variation in viral load, the relative abundances of viral mRNAs are fairly consistent across infected cells. Activation of innate immune pathways is rare, but some cellular genes co-vary in abundance with the amount of viral mRNA. Overall, our results highlight the complexity of viral infection at the level of single cells. PMID:29451492
Metagenomics of rumen bacteriophage from thirteen lactating dairy cattle
2013-01-01
Background The bovine rumen hosts a diverse and complex community of Eukarya, Bacteria, Archea and viruses (including bacteriophage). The rumen viral population (the rumen virome) has received little attention compared to the rumen microbial population (the rumen microbiome). We used massively parallel sequencing of virus like particles to investigate the diversity of the rumen virome in thirteen lactating Australian Holstein dairy cattle all housed in the same location, 12 of which were sampled on the same day. Results Fourteen putative viral sequence fragments over 30 Kbp in length were assembled and annotated. Many of the putative genes in the assembled contigs showed no homology to previously annotated genes, highlighting the large amount of work still required to fully annotate the functions encoded in viral genomes. The abundance of the contig sequences varied widely between animals, even though the cattle were of the same age, stage of lactation and fed the same diets. Additionally the twelve animals which were co-habited shared a number of their dominant viral contigs. We compared the functional characteristics of our bovine viromes with that of other viromes, as well as rumen microbiomes. At the functional level, we found strong similarities between all of the viral samples, which were highly distinct from the rumen microbiome samples. Conclusions Our findings suggest a large amount of between animal variation in the bovine rumen virome and that co-habiting animals may have more similar viromes than non co-habited animals. We report the deepest sequencing to date of the rumen virome. This work highlights the enormous amount of novelty and variation present in the rumen virome. PMID:24180266
Gourraud, P A; Karaouni, A; Woo, J M; Schmidt, T; Oksenberg, J R; Hecht, F M; Liegler, T J; Barbour, J D
2011-03-01
We examined single nucleotide polymorphisms (SNP) in the APOBEC3 locus on chromosome 22, paired with population sequences of pro-viral human immunodeficiency virus-1 (HIV-1) vif from peripheral blood mononuclear cells, from 96 recently HIV-1-infected treatment-naive adults. We found evidence for the existence of an APOBEC3H linkage disequilibrium (LD) block associated with variation in GA → AA, or APOBEC3F/H signature, sequence changes in pro-viral HIV-1 vif sequence (top 10 significant SNPs with a significant p = 4.8 × 10(-3)). We identified a common five position risk haplotype distal to APOBEC3H (A3Hrh). These markers were in high LD (D' = 1; r(2) = 0.98) to a previously described A3H "RED" haplotype containing a variant (E121) with enhanced susceptibility to HIV-1 Vif. This association was confirmed by a haplotype analysis. Homozygote carriers of the A3Hrh had lower GA->AA (A3F/H) sequence editing upon pro-viral HIV-1 vif sequence (p = 0.01), and lower HIV-1 RNA levels over time during early, untreated HIV-1 infection, (p = 0.015 mixed effects model). This effect may be due to enhanced susceptibility of A3H forms to HIV-1 Vif mediated viral suppression of sequence editing activity, slowing viral diversification and escape from immune responses. Copyright © 2011 American Society for Histocompatibility and Immunogenetics. Published by Elsevier Inc. All rights reserved.
Carnegie, Nicole Bohme; Wang, Rui; Novitsky, Vladimir; De Gruttola, Victor
2014-01-01
Linkage analysis is useful in investigating disease transmission dynamics and the effect of interventions on them, but estimates of probabilities of linkage between infected people from observed data can be biased downward when missingness is informative. We investigate variation in the rates at which subjects' viral genotypes link across groups defined by viral load (low/high) and antiretroviral treatment (ART) status using blood samples from household surveys in the Northeast sector of Mochudi, Botswana. The probability of obtaining a sequence from a sample varies with viral load; samples with low viral load are harder to amplify. Pairwise genetic distances were estimated from aligned nucleotide sequences of HIV-1C env gp120. It is first shown that the probability that randomly selected sequences are linked can be estimated consistently from observed data. This is then used to develop estimates of the probability that a sequence from one group links to at least one sequence from another group under the assumption of independence across pairs. Furthermore, a resampling approach is developed that accounts for the presence of correlation across pairs, with diagnostics for assessing the reliability of the method. Sequences were obtained for 65% of subjects with high viral load (HVL, n = 117), 54% of subjects with low viral load but not on ART (LVL, n = 180), and 45% of subjects on ART (ART, n = 126). The probability of linkage between two individuals is highest if both have HVL, and lowest if one has LVL and the other has LVL or is on ART. Linkage across groups is high for HVL and lower for LVL and ART. Adjustment for missing data increases the group-wise linkage rates by 40–100%, and changes the relative rates between groups. Bias in inferences regarding HIV viral linkage that arise from differential ability to genotype samples can be reduced by appropriate methods for accommodating missing data. PMID:24415932
Carnegie, Nicole Bohme; Wang, Rui; Novitsky, Vladimir; De Gruttola, Victor
2014-01-01
Linkage analysis is useful in investigating disease transmission dynamics and the effect of interventions on them, but estimates of probabilities of linkage between infected people from observed data can be biased downward when missingness is informative. We investigate variation in the rates at which subjects' viral genotypes link across groups defined by viral load (low/high) and antiretroviral treatment (ART) status using blood samples from household surveys in the Northeast sector of Mochudi, Botswana. The probability of obtaining a sequence from a sample varies with viral load; samples with low viral load are harder to amplify. Pairwise genetic distances were estimated from aligned nucleotide sequences of HIV-1C env gp120. It is first shown that the probability that randomly selected sequences are linked can be estimated consistently from observed data. This is then used to develop estimates of the probability that a sequence from one group links to at least one sequence from another group under the assumption of independence across pairs. Furthermore, a resampling approach is developed that accounts for the presence of correlation across pairs, with diagnostics for assessing the reliability of the method. Sequences were obtained for 65% of subjects with high viral load (HVL, n = 117), 54% of subjects with low viral load but not on ART (LVL, n = 180), and 45% of subjects on ART (ART, n = 126). The probability of linkage between two individuals is highest if both have HVL, and lowest if one has LVL and the other has LVL or is on ART. Linkage across groups is high for HVL and lower for LVL and ART. Adjustment for missing data increases the group-wise linkage rates by 40-100%, and changes the relative rates between groups. Bias in inferences regarding HIV viral linkage that arise from differential ability to genotype samples can be reduced by appropriate methods for accommodating missing data.
2011-01-01
Background CD8+ T cells play an important role in control of viral replication during acute and early human immunodeficiency virus type 1 (HIV-1) infection, contributing to containment of the acute viral burst and establishment of the prognostically-important persisting viral load. Understanding mechanisms that impair CD8+ T cell-mediated control of HIV replication in primary infection is thus of importance. This study addressed the relative extent to which HIV-specific T cell responses are impacted by viral mutational escape versus reduction in response avidity during the first year of infection. Results 18 patients presenting with symptomatic primary HIV-1 infection, most of whom subsequently established moderate-high persisting viral loads, were studied. HIV-specific T cell responses were mapped in each individual and responses to a subset of optimally-defined CD8+ T cell epitopes were followed from acute infection onwards to determine whether they were escaped or declined in avidity over time. During the first year of infection, sequence variation occurred in/around 26/33 epitopes studied (79%). In 82% of cases of intra-epitopic sequence variation, the mutation was confirmed to confer escape, although T cell responses were subsequently expanded to variant sequences in some cases. In contrast, < 10% of responses to index sequence epitopes declined in functional avidity over the same time-frame, and a similar proportion of responses actually exhibited an increase in functional avidity during this period. Conclusions Escape appears to constitute a much more important means of viral evasion of CD8+ T cell responses in acute and early HIV infection than decline in functional avidity of epitope-specific T cells. These findings support the design of vaccines to elicit T cell responses that are difficult for the virus to escape. PMID:21635736
Analysis of human herpesvirus-6 IE1 sequence variation in clinical samples.
Stanton, Richard; Wilkinson, Gavin W G; Fox, Julie D
2003-12-01
Herpesvirus immediate early (IE) proteins are known to play key roles in establishing productive infections, regulating reactivation from latency, and creating a cellular environment favourable to viral replication. Human herpesvirus-6 (HHV-6) IE genes have not been studied as intensively as their homologues in the prototype betaherpesvirus human cytomegalovirus (HCMV). Whilst the HCMV IE1 gene is relatively conserved, early studies indicated that HHV-6 IE1 exhibited a high level of sequence variation between HHV-6A and HHV-6B isolates, although the observation was based primarily on virus stocks that had been isolated and propagated in vitro. In this study, we investigated the level of HHV-6 IE1 sequence variation in vivo by direct sequencing of circulating virus in clinical samples without prior in vitro culture. Sequences exactly matching those reported for reference HHV-6 isolates were identified in clinical samples, thus the HHV-6 laboratory strains used in the majority of in vitro studies appear to be representative of virus circulating in vivo with respect to the IE1 gene. The HHV-6 IE1 sequence is also conserved in reference strains that had been passaged extensively in vitro. The high degree of divergence between variant A and B type IE1 sequences was confirmed, but interestingly HHV-6B IE1 sequences were observed to further segregate into two distinct subgroups, with the laboratory strains Z29 and HST representative of these two subgroups. Within each HHV-6B subgroup, a remarkably high level of homology was observed. Thus the HHV-6 IE1 sequence appears highly stable, underlining its potential importance to the viral life cycle. Copyright 2003 Wiley-Liss, Inc.
Evolutionary and biophysical relationships among the papillomavirus E2 proteins.
Blakaj, Dukagjin M; Fernandez-Fuentes, Narcis; Chen, Zigui; Hegde, Rashmi; Fiser, Andras; Burk, Robert D; Brenowitz, Michael
2009-01-01
Infection by human papillomavirus (HPV) may result in clinical conditions ranging from benign warts to invasive cancer. The HPV E2 protein represses oncoprotein transcription and is required for viral replication. HPV E2 binds to palindromic DNA sequences of highly conserved four base pair sequences flanking an identical length variable 'spacer'. E2 proteins directly contact the conserved but not the spacer DNA. Variation in naturally occurring spacer sequences results in differential protein affinity that is dependent on their sensitivity to the spacer DNA's unique conformational and/or dynamic properties. This article explores the biophysical character of this core viral protein with the goal of identifying characteristics that associated with risk of virally caused malignancy. The amino acid sequence, 3d structure and electrostatic features of the E2 protein DNA binding domain are highly conserved; specific interactions with DNA binding sites have also been conserved. In contrast, the E2 protein's transactivation domain does not have extensive surfaces of highly conserved residues. Rather, regions of high conservation are localized to small surface patches. Implications to cancer biology are discussed.
Short reads from honey bee (Apis sp.) sequencing projects reflect microbial associate diversity
Hurst, Gregory D.D.
2017-01-01
High throughput (or ‘next generation’) sequencing has transformed most areas of biological research and is now a standard method that underpins empirical study of organismal biology, and (through comparison of genomes), reveals patterns of evolution. For projects focused on animals, these sequencing methods do not discriminate between the primary target of sequencing (the animal genome) and ‘contaminating’ material, such as associated microbes. A common first step is to filter out these contaminants to allow better assembly of the animal genome or transcriptome. Here, we aimed to assess if these ‘contaminations’ provide information with regard to biologically important microorganisms associated with the individual. To achieve this, we examined whether the short read data from Apis retrieved elements of its well established microbiome. To this end, we screened almost 1,000 short read libraries of honey bee (Apis sp.) DNA sequencing project for the presence of microbial sequences, and find sequences from known honey bee microbial associates in at least 11% of them. Further to this, we screened ∼500 Apis RNA sequencing libraries for evidence of viral infections, which were found to be present in about half of them. We then used the data to reconstruct draft genomes of three Apis associated bacteria, as well as several viral strains de novo. We conclude that ‘contamination’ in short read sequencing libraries can provide useful genomic information on microbial taxa known to be associated with the target organisms, and may even lead to the discovery of novel associations. Finally, we demonstrate that RNAseq samples from experiments commonly carry uneven viral loads across libraries. We note variation in viral presence and load may be a confounding feature of differential gene expression analyses, and as such it should be incorporated as a random factor in analyses. PMID:28717593
Short reads from honey bee (Apis sp.) sequencing projects reflect microbial associate diversity.
Gerth, Michael; Hurst, Gregory D D
2017-01-01
High throughput (or 'next generation') sequencing has transformed most areas of biological research and is now a standard method that underpins empirical study of organismal biology, and (through comparison of genomes), reveals patterns of evolution. For projects focused on animals, these sequencing methods do not discriminate between the primary target of sequencing (the animal genome) and 'contaminating' material, such as associated microbes. A common first step is to filter out these contaminants to allow better assembly of the animal genome or transcriptome. Here, we aimed to assess if these 'contaminations' provide information with regard to biologically important microorganisms associated with the individual. To achieve this, we examined whether the short read data from Apis retrieved elements of its well established microbiome. To this end, we screened almost 1,000 short read libraries of honey bee ( Apis sp.) DNA sequencing project for the presence of microbial sequences, and find sequences from known honey bee microbial associates in at least 11% of them. Further to this, we screened ∼500 Apis RNA sequencing libraries for evidence of viral infections, which were found to be present in about half of them. We then used the data to reconstruct draft genomes of three Apis associated bacteria, as well as several viral strains de novo . We conclude that 'contamination' in short read sequencing libraries can provide useful genomic information on microbial taxa known to be associated with the target organisms, and may even lead to the discovery of novel associations. Finally, we demonstrate that RNAseq samples from experiments commonly carry uneven viral loads across libraries. We note variation in viral presence and load may be a confounding feature of differential gene expression analyses, and as such it should be incorporated as a random factor in analyses.
Use of multiple competitors for quantification of human immunodeficiency virus type 1 RNA in plasma.
Vener, T; Nygren, M; Andersson, A; Uhlén, M; Albert, J; Lundeberg, J
1998-07-01
Quantification of human immunodeficiency virus type 1 (HIV-1) RNA in plasma has rapidly become an important tool in basic HIV research and in the clinical care of infected individuals. Here, a quantitative HIV assay based on competitive reverse transcription-PCR with multiple competitors was developed. Four RNA competitors containing identical PCR primer binding sequences as the viral HIV-1 RNA target were constructed. One of the PCR primers was fluorescently labeled, which facilitated discrimination between the viral RNA and competitor amplicons by fragment analysis with conventional automated sequencers. The coamplification of known amounts of the RNA competitors provided the means to establish internal calibration curves for the individual reactions resulting in exclusion of tube-to-tube variations. Calibration curves were created from the peak areas, which were proportional to the starting amount of each competitor. The fluorescence detection format was expanded to provide a dynamic range of more than 5 log units. This quantitative assay allowed for reproducible analysis of samples containing as few as 40 viral copies of HIV-1 RNA per reaction. The within- and between-run coefficients of variation were <24% (range, 10 to 24) and <36% (range, 27 to 36), respectively. The high reproducibility (standard deviation, <0.13 log) of the overall procedure for quantification of HIV-1 RNA in plasma, including sample preparation, amplification, and detection variations, allowed reliable detection of a 0.5-log change in RNA viral load. The assay could be a useful tool for monitoring HIV-1 disease progression and antiviral treatment and can easily be adapted to the quantification of other pathogens.
Karas, Vlad O; Sinnott-Armstrong, Nicholas A; Varghese, Vici; Shafer, Robert W; Greenleaf, William J; Sherlock, Gavin
2018-01-01
Abstract Much of the within species genetic variation is in the form of single nucleotide polymorphisms (SNPs), typically detected by whole genome sequencing (WGS) or microarray-based technologies. However, WGS produces mostly uninformative reads that perfectly match the reference, while microarrays require genome-specific reagents. We have developed Diff-seq, a sequencing-based mismatch detection assay for SNP discovery without the requirement for specialized nucleic-acid reagents. Diff-seq leverages the Surveyor endonuclease to cleave mismatched DNA molecules that are generated after cross-annealing of a complex pool of DNA fragments. Sequencing libraries enriched for Surveyor-cleaved molecules result in increased coverage at the variant sites. Diff-seq detected all mismatches present in an initial test substrate, with specific enrichment dependent on the identity and context of the variation. Application to viral sequences resulted in increased observation of variant alleles in a biologically relevant context. Diff-Seq has the potential to increase the sensitivity and efficiency of high-throughput sequencing in the detection of variation. PMID:29361139
Kanda, Teru; Furuse, Yuki; Oshitani, Hitoshi; Kiyono, Tohru
2016-05-01
The Epstein-Barr virus (EBV) is etiologically linked to approximately 10% of gastric cancers, in which viral genomes are maintained as multicopy episomes. EBV-positive gastric cancer cells are incompetent for progeny virus production, making viral DNA cloning extremely difficult. Here we describe a highly efficient strategy for obtaining bacterial artificial chromosome (BAC) clones of EBV episomes by utilizing a CRISPR/Cas9-mediated strand break of the viral genome and subsequent homology-directed repair. EBV strains maintained in two gastric cancer cell lines (SNU719 and YCCEL1) were cloned, and their complete viral genome sequences were determined. Infectious viruses of gastric cancer cell-derived EBVs were reconstituted, and the viruses established stable latent infections in immortalized keratinocytes. While Ras oncoprotein overexpression caused massive vacuolar degeneration and cell death in control keratinocytes, EBV-infected keratinocytes survived in the presence of Ras expression. These results implicate EBV infection in predisposing epithelial cells to malignant transformation by inducing resistance to oncogene-induced cell death. Recent progress in DNA-sequencing technology has accelerated EBV whole-genome sequencing, and the repertoire of sequenced EBV genomes is increasing progressively. Accordingly, the presence of EBV variant strains that may be relevant to EBV-associated diseases has begun to attract interest. Clearly, the determination of additional disease-associated viral genome sequences will facilitate the identification of any disease-specific EBV variants. We found that CRISPR/Cas9-mediated cleavage of EBV episomal DNA enabled the cloning of disease-associated viral strains with unprecedented efficiency. As a proof of concept, two gastric cancer cell-derived EBV strains were cloned, and the infection of epithelial cells with reconstituted viruses provided important clues about the mechanism of EBV-mediated epithelial carcinogenesis. This experimental system should contribute to establishing the relationship between viral genome variation and EBV-associated diseases. Copyright © 2016, American Society for Microbiology. All Rights Reserved.
Kravatsky, Yuri; Chechetkin, Vladimir; Fedoseeva, Daria; Gorbacheva, Maria; Kravatskaya, Galina; Kretova, Olga; Tchurikov, Nickolai
2017-11-23
The efficient development of antiviral drugs, including efficient antiviral small interfering RNAs (siRNAs), requires continuous monitoring of the strict correspondence between a drug and the related highly variable viral DNA/RNA target(s). Deep sequencing is able to provide an assessment of both the general target conservation and the frequency of particular mutations in the different target sites. The aim of this study was to develop a reliable bioinformatic pipeline for the analysis of millions of short, deep sequencing reads corresponding to selected highly variable viral sequences that are drug target(s). The suggested bioinformatic pipeline combines the available programs and the ad hoc scripts based on an original algorithm of the search for the conserved targets in the deep sequencing data. We also present the statistical criteria for the threshold of reliable mutation detection and for the assessment of variations between corresponding data sets. These criteria are robust against the possible sequencing errors in the reads. As an example, the bioinformatic pipeline is applied to the study of the conservation of RNA interference (RNAi) targets in human immunodeficiency virus 1 (HIV-1) subtype A. The developed pipeline is freely available to download at the website http://virmut.eimb.ru/. Brief comments and comparisons between VirMut and other pipelines are also presented.
Chauhan, Sushma; Rahman, Hifzur; Mastan, Shaik G; Pamidimarri, D V N Sudheer; Reddy, Muppala P
2018-07-20
Begomoviruses belong to the family Geminiviridae are associated with several disease symptoms, such as mosaic and leaf curling in Jatropha curcas. The molecular characterization of these viral strains will help in developing management strategies to control the disease. In this study, J. curcas that was infected with begomovirus and showed acute leaf curling symptoms were identified. DNA-A segment from pathogenic viral strain was isolated and sequenced. The sequenced genome was assembled and characterized in detail. The full-length DNA-A sequence was covered by primer walking. The genome sequence showed the general organization of DNA-A from begomovirus by the distribution of ORFs in both viral and anti-viral strands. The genome size ranged from 2844 bp-2852 bp. Three strains with minor nucleotide variations were identified, and a phylogenetic analysis was performed by comparing the DNA-A segments from other reported begomovirus isolates. The maximum sequence similarity was observed with Euphorbia yellow mosaic virus (FN435995). In the phylogenetic tree, no clustering was observed with previously reported begomovirus strains isolated from J. curcas host. The strains isolated in this study belong to new begomoviral strain that elicits symptoms of leaf curling in J. curcas. The results indicate that the probable origin of the strains is from Jatropha mosaic virus infecting J. gassypifolia. The strains isolated in this study are referred as Jatropha curcas leaf curl India virus (JCLCIV) based on the major symptoms exhibited by host J. curcas. Copyright © 2018 Elsevier B.V. All rights reserved.
DOE Office of Scientific and Technical Information (OSTI.GOV)
Banfield, Jillian; Breitbart, Mya; VerBerkmoes, Nathan
CRISPRs (clustered regularly interspaced short palindromic repeats) are adaptive immune systems in Bacteria and Archaea. Transcripts of the spacers that separate the repeats confer immunity through sequence identity with a targeted region (proto-spacer) in phage/viral, plasmid, or other foreign DNA. Short sequences immediately flanking the proto-spacer (proto-spacer adjacent motifs—PAMs) are important in both procuring spacers from and providing immunity to targeted sequences. New spacers are incorporated unidirectionally at the leader end of the CRISPR loci, thus recording a timeline of recent viral exposure. In the early phase of our research, we documented extremely rapid diversification of the CRISPR loci inmore » natural populations [Tyson and Banfield, 2008] matched by high levels of sequence variation in natural viral populations [Andersson and Banfield, 2008]. Since then, in a genetically tractable model laboratory system, we have 1) tracked phage mutation and CRISPR diversification, and in a natural model system, we have 2) examined population history via over time, 3) investigated the timescale over which spacers become ineffective and the process by which ineffective spacers are removed, and 4) analyzed viral diversity. In addition to research activities, our group has organized five international CRISPR meetings, the fifth to be held at University of California, Berkeley in June 2012. Most importantly, the project provided the majority of funding support for Christine Sun (Ph.D. 2012).« less
Exploration of sequence space as the basis of viral RNA genome segmentation.
Moreno, Elena; Ojosnegros, Samuel; García-Arriaza, Juan; Escarmís, Cristina; Domingo, Esteban; Perales, Celia
2014-05-06
The mechanisms of viral RNA genome segmentation are unknown. On extensive passage of foot-and-mouth disease virus in baby hamster kidney-21 cells, the virus accumulated multiple point mutations and underwent a transition akin to genome segmentation. The standard single RNA genome molecule was replaced by genomes harboring internal in-frame deletions affecting the L- or capsid-coding region. These genomes were infectious and killed cells by complementation. Here we show that the point mutations in the nonstructural protein-coding region (P2, P3) that accumulated in the standard genome before segmentation increased the relative fitness of the segmented version relative to the standard genome. Fitness increase was documented by intracellular expression of virus-coded proteins and infectious progeny production by RNAs with the internal deletions placed in the sequence context of the parental and evolved genome. The complementation activity involved several viral proteins, one of them being the leader proteinase L. Thus, a history of genetic drift with accumulation of point mutations was needed to allow a major variation in the structure of a viral genome. Thus, exploration of sequence space by a viral genome (in this case an unsegmented RNA) can reach a point of the space in which a totally different genome structure (in this case, a segmented RNA) is favored over the form that performed the exploration.
Genomic stability of adipogenic human adenovirus 36.
Nam, J-H; Na, H-N; Atkinson, R L; Dhurandhar, N V
2014-02-01
Human adenovirus Ad36 increases adiposity in several animal models, including rodents and non-human primates. Importantly, Ad36 is associated with human obesity, which has prompted research to understand its epidemiology and to develop a vaccine to prevent a subgroup of obesity. For this purpose, understanding the genomic stability of Ad36 in vivo and in vitro infections is critical. Here, we examined whether in vitro cell passaging over a 14-year period introduced any genetic variation in Ad36. We sequenced the whole genome of Ad36-which was plaque purified in 1998 from the original strain obtained from American Type Culture Collection, and passaged approximately 12 times over the past 14 years (Ad36-2012). This DNA sequence was compared with a previously published sequence of Ad36 likely obtained from the same source (Ad36-1988). Compared with Ad36-1988, only two nucleotides were altered in Ad36-2012: a T insertion at nucleotide 1862, which may induce early termination of the E1B viral protein, and a T➝C transition at nucleotide 26 136. Virus with the T insertion (designated Ad36-2012-T6) was mixed with wild-type virus lacking the T insertion (designated Ad36-2012-T5) in the viral stock. The transition at nucleotide 26 136 does not change the encoded amino acid (aspartic acid) in the pVIII viral protein. The rate of genetic variation in Ad36 is ∼2.37 × 10(-6) mutations/nucleotide/passage. Of particular importance, there were no mutations in the E4orf1 gene, the critical gene for producing obesity. This very-low-variation rate should reduce concerns about genetic variability when developing Ad36 vaccines or developing assays for detecting Ad36 infection in populations.
Use of Multiple Competitors for Quantification of Human Immunodeficiency Virus Type 1 RNA in Plasma
Vener, Tanya; Nygren, Malin; Andersson, AnnaLena; Uhlén, Mathias; Albert, Jan; Lundeberg, Joakim
1998-01-01
Quantification of human immunodeficiency virus type 1 (HIV-1) RNA in plasma has rapidly become an important tool in basic HIV research and in the clinical care of infected individuals. Here, a quantitative HIV assay based on competitive reverse transcription-PCR with multiple competitors was developed. Four RNA competitors containing identical PCR primer binding sequences as the viral HIV-1 RNA target were constructed. One of the PCR primers was fluorescently labeled, which facilitated discrimination between the viral RNA and competitor amplicons by fragment analysis with conventional automated sequencers. The coamplification of known amounts of the RNA competitors provided the means to establish internal calibration curves for the individual reactions resulting in exclusion of tube-to-tube variations. Calibration curves were created from the peak areas, which were proportional to the starting amount of each competitor. The fluorescence detection format was expanded to provide a dynamic range of more than 5 log units. This quantitative assay allowed for reproducible analysis of samples containing as few as 40 viral copies of HIV-1 RNA per reaction. The within- and between-run coefficients of variation were <24% (range, 10 to 24) and <36% (range, 27 to 36), respectively. The high reproducibility (standard deviation, <0.13 log) of the overall procedure for quantification of HIV-1 RNA in plasma, including sample preparation, amplification, and detection variations, allowed reliable detection of a 0.5-log change in RNA viral load. The assay could be a useful tool for monitoring HIV-1 disease progression and antiviral treatment and can easily be adapted to the quantification of other pathogens. PMID:9650926
A wide extent of inter-strain diversity in virulent and vaccine strains of alphaherpesviruses.
Szpara, Moriah L; Tafuri, Yolanda R; Parsons, Lance; Shamim, S Rafi; Verstrepen, Kevin J; Legendre, Matthieu; Enquist, L W
2011-10-01
Alphaherpesviruses are widespread in the human population, and include herpes simplex virus 1 (HSV-1) and 2, and varicella zoster virus (VZV). These viral pathogens cause epithelial lesions, and then infect the nervous system to cause lifelong latency, reactivation, and spread. A related veterinary herpesvirus, pseudorabies (PRV), causes similar disease in livestock that result in significant economic losses. Vaccines developed for VZV and PRV serve as useful models for the development of an HSV-1 vaccine. We present full genome sequence comparisons of the PRV vaccine strain Bartha, and two virulent PRV isolates, Kaplan and Becker. These genome sequences were determined by high-throughput sequencing and assembly, and present new insights into the attenuation of a mammalian alphaherpesvirus vaccine strain. We find many previously unknown coding differences between PRV Bartha and the virulent strains, including changes to the fusion proteins gH and gB, and over forty other viral proteins. Inter-strain variation in PRV protein sequences is much closer to levels previously observed for HSV-1 than for the highly stable VZV proteome. Almost 20% of the PRV genome contains tandem short sequence repeats (SSRs), a class of nucleic acids motifs whose length-variation has been associated with changes in DNA binding site efficiency, transcriptional regulation, and protein interactions. We find SSRs throughout the herpesvirus family, and provide the first global characterization of SSRs in viruses, both within and between strains. We find SSR length variation between different isolates of PRV and HSV-1, which may provide a new mechanism for phenotypic variation between strains. Finally, we detected a small number of polymorphic bases within each plaque-purified PRV strain, and we characterize the effect of passage and plaque-purification on these polymorphisms. These data add to growing evidence that even plaque-purified stocks of stable DNA viruses exhibit limited sequence heterogeneity, which likely seeds future strain evolution.
Weiss, Eric R; Lamers, Susanna L; Henderson, Jennifer L; Melnikov, Alexandre; Somasundaran, Mohan; Garber, Manuel; Selin, Liisa; Nusbaum, Chad; Luzuriaga, Katherine
2018-01-15
Over 90% of the world's population is persistently infected with Epstein-Barr virus. While EBV does not cause disease in most individuals, it is the common cause of acute infectious mononucleosis (AIM) and has been associated with several cancers and autoimmune diseases, highlighting a need for a preventive vaccine. At present, very few primary, circulating EBV genomes have been sequenced directly from infected individuals. While low levels of diversity and low viral evolution rates have been predicted for double-stranded DNA (dsDNA) viruses, recent studies have demonstrated appreciable diversity in common dsDNA pathogens (e.g., cytomegalovirus). Here, we report 40 full-length EBV genome sequences obtained from matched oral wash and B cell fractions from a cohort of 10 AIM patients. Both intra- and interpatient diversity were observed across the length of the entire viral genome. Diversity was most pronounced in viral genes required for establishing latent infection and persistence, with appreciable levels of diversity also detected in structural genes, including envelope glycoproteins. Interestingly, intrapatient diversity declined significantly over time ( P < 0.01), and this was particularly evident on comparison of viral genomes sequenced from B cell fractions in early primary infection and convalescence ( P < 0.001). B cell-associated viral genomes were observed to converge, becoming nearly identical to the B95.8 reference genome over time (Spearman rank-order correlation test; r = -0.5589, P = 0.0264). The reduction in diversity was most marked in the EBV latency genes. In summary, our data suggest independent convergence of diverse viral genome sequences toward a reference-like strain within a relatively short period following primary EBV infection. IMPORTANCE Identification of viral proteins with low variability and high immunogenicity is important for the development of a protective vaccine. Knowledge of genome diversity within circulating viral populations is a key step in this process, as is the expansion of intrahost genomic variation during infection. We report full-length EBV genomes sequenced from the blood and oral wash of 10 individuals early in primary infection and during convalescence. Our data demonstrate considerable diversity within the pool of circulating EBV strains, as well as within individual patients. Overall viral diversity decreased from early to persistent infection, particularly in latently infected B cells, which serve as the viral reservoir. Reduction in B cell-associated viral genome diversity coincided with a convergence toward a reference-like EBV genotype. Greater convergence positively correlated with time after infection, suggesting that the reference-like genome is the result of selection. Copyright © 2018 American Society for Microbiology.
Goodacre, Norman; Aljanahi, Aisha; Nandakumar, Subhiksha; Mikailov, Mike
2018-01-01
ABSTRACT Detection of distantly related viruses by high-throughput sequencing (HTS) is bioinformatically challenging because of the lack of a public database containing all viral sequences, without abundant nonviral sequences, which can extend runtime and obscure viral hits. Our reference viral database (RVDB) includes all viral, virus-related, and virus-like nucleotide sequences (excluding bacterial viruses), regardless of length, and with overall reduced cellular sequences. Semantic selection criteria (SEM-I) were used to select viral sequences from GenBank, resulting in a first-generation viral database (VDB). This database was manually and computationally reviewed, resulting in refined, semantic selection criteria (SEM-R), which were applied to a new download of updated GenBank sequences to create a second-generation VDB. Viral entries in the latter were clustered at 98% by CD-HIT-EST to reduce redundancy while retaining high viral sequence diversity. The viral identity of the clustered representative sequences (creps) was confirmed by BLAST searches in NCBI databases and HMMER searches in PFAM and DFAM databases. The resulting RVDB contained a broad representation of viral families, sequence diversity, and a reduced cellular content; it includes full-length and partial sequences and endogenous nonretroviral elements, endogenous retroviruses, and retrotransposons. Testing of RVDBv10.2, with an in-house HTS transcriptomic data set indicated a significantly faster run for virus detection than interrogating the entirety of the NCBI nonredundant nucleotide database, which contains all viral sequences but also nonviral sequences. RVDB is publically available for facilitating HTS analysis, particularly for novel virus detection. It is meant to be updated on a regular basis to include new viral sequences added to GenBank. IMPORTANCE To facilitate bioinformatics analysis of high-throughput sequencing (HTS) data for the detection of both known and novel viruses, we have developed a new reference viral database (RVDB) that provides a broad representation of different virus species from eukaryotes by including all viral, virus-like, and virus-related sequences (excluding bacteriophages), regardless of their size. In particular, RVDB contains endogenous nonretroviral elements, endogenous retroviruses, and retrotransposons. Sequences were clustered to reduce redundancy while retaining high viral sequence diversity. A particularly useful feature of RVDB is the reduction of cellular sequences, which can enhance the run efficiency of large transcriptomic and genomic data analysis and increase the specificity of virus detection. PMID:29564396
Goodacre, Norman; Aljanahi, Aisha; Nandakumar, Subhiksha; Mikailov, Mike; Khan, Arifa S
2018-01-01
Detection of distantly related viruses by high-throughput sequencing (HTS) is bioinformatically challenging because of the lack of a public database containing all viral sequences, without abundant nonviral sequences, which can extend runtime and obscure viral hits. Our reference viral database (RVDB) includes all viral, virus-related, and virus-like nucleotide sequences (excluding bacterial viruses), regardless of length, and with overall reduced cellular sequences. Semantic selection criteria (SEM-I) were used to select viral sequences from GenBank, resulting in a first-generation viral database (VDB). This database was manually and computationally reviewed, resulting in refined, semantic selection criteria (SEM-R), which were applied to a new download of updated GenBank sequences to create a second-generation VDB. Viral entries in the latter were clustered at 98% by CD-HIT-EST to reduce redundancy while retaining high viral sequence diversity. The viral identity of the clustered representative sequences (creps) was confirmed by BLAST searches in NCBI databases and HMMER searches in PFAM and DFAM databases. The resulting RVDB contained a broad representation of viral families, sequence diversity, and a reduced cellular content; it includes full-length and partial sequences and endogenous nonretroviral elements, endogenous retroviruses, and retrotransposons. Testing of RVDBv10.2, with an in-house HTS transcriptomic data set indicated a significantly faster run for virus detection than interrogating the entirety of the NCBI nonredundant nucleotide database, which contains all viral sequences but also nonviral sequences. RVDB is publically available for facilitating HTS analysis, particularly for novel virus detection. It is meant to be updated on a regular basis to include new viral sequences added to GenBank. IMPORTANCE To facilitate bioinformatics analysis of high-throughput sequencing (HTS) data for the detection of both known and novel viruses, we have developed a new reference viral database (RVDB) that provides a broad representation of different virus species from eukaryotes by including all viral, virus-like, and virus-related sequences (excluding bacteriophages), regardless of their size. In particular, RVDB contains endogenous nonretroviral elements, endogenous retroviruses, and retrotransposons. Sequences were clustered to reduce redundancy while retaining high viral sequence diversity. A particularly useful feature of RVDB is the reduction of cellular sequences, which can enhance the run efficiency of large transcriptomic and genomic data analysis and increase the specificity of virus detection.
Nikolaitchik, Olga A.; Burdick, Ryan C.; Gorelick, Robert J.; Keele, Brandon F.; Hu, Wei-Shau; Pathak, Vinay K.
2016-01-01
Although the predominant effect of host restriction APOBEC3 proteins on HIV-1 infection is to block viral replication, they might inadvertently increase retroviral genetic variation by inducing G-to-A hypermutation. Numerous studies have disagreed on the contribution of hypermutation to viral genetic diversity and evolution. Confounding factors contributing to the debate include the extent of lethal (stop codon) and sublethal hypermutation induced by different APOBEC3 proteins, the inability to distinguish between G-to-A mutations induced by APOBEC3 proteins and error-prone viral replication, the potential impact of hypermutation on the frequency of retroviral recombination, and the extent to which viral recombination occurs in vivo, which can reassort mutations in hypermutated genomes. Here, we determined the effects of hypermutation on the HIV-1 recombination rate and its contribution to genetic variation through recombination to generate progeny genomes containing portions of hypermutated genomes without lethal mutations. We found that hypermutation did not significantly affect the rate of recombination, and recombination between hypermutated and wild-type genomes only increased the viral mutation rate by 3.9 × 10−5 mutations/bp/replication cycle in heterozygous virions, which is similar to the HIV-1 mutation rate. Since copackaging of hypermutated and wild-type genomes occurs very rarely in vivo, recombination between hypermutated and wild-type genomes does not significantly contribute to the genetic variation of replicating HIV-1. We also analyzed previously reported hypermutated sequences from infected patients and determined that the frequency of sublethal mutagenesis for A3G and A3F is negligible (4 × 10−21 and1 × 10−11, respectively) and its contribution to viral mutations is far below mutations generated during error-prone reverse transcription. Taken together, we conclude that the contribution of APOBEC3-induced hypermutation to HIV-1 genetic variation is substantially lower than that from mutations during error-prone replication. PMID:27186986
Delviks-Frankenberry, Krista A; Nikolaitchik, Olga A; Burdick, Ryan C; Gorelick, Robert J; Keele, Brandon F; Hu, Wei-Shau; Pathak, Vinay K
2016-05-01
Although the predominant effect of host restriction APOBEC3 proteins on HIV-1 infection is to block viral replication, they might inadvertently increase retroviral genetic variation by inducing G-to-A hypermutation. Numerous studies have disagreed on the contribution of hypermutation to viral genetic diversity and evolution. Confounding factors contributing to the debate include the extent of lethal (stop codon) and sublethal hypermutation induced by different APOBEC3 proteins, the inability to distinguish between G-to-A mutations induced by APOBEC3 proteins and error-prone viral replication, the potential impact of hypermutation on the frequency of retroviral recombination, and the extent to which viral recombination occurs in vivo, which can reassort mutations in hypermutated genomes. Here, we determined the effects of hypermutation on the HIV-1 recombination rate and its contribution to genetic variation through recombination to generate progeny genomes containing portions of hypermutated genomes without lethal mutations. We found that hypermutation did not significantly affect the rate of recombination, and recombination between hypermutated and wild-type genomes only increased the viral mutation rate by 3.9 × 10-5 mutations/bp/replication cycle in heterozygous virions, which is similar to the HIV-1 mutation rate. Since copackaging of hypermutated and wild-type genomes occurs very rarely in vivo, recombination between hypermutated and wild-type genomes does not significantly contribute to the genetic variation of replicating HIV-1. We also analyzed previously reported hypermutated sequences from infected patients and determined that the frequency of sublethal mutagenesis for A3G and A3F is negligible (4 × 10-21 and1 × 10-11, respectively) and its contribution to viral mutations is far below mutations generated during error-prone reverse transcription. Taken together, we conclude that the contribution of APOBEC3-induced hypermutation to HIV-1 genetic variation is substantially lower than that from mutations during error-prone replication.
Ebolavirus comparative genomics
Jun, Se-Ran; Leuze, Michael R.; Nookaew, Intawat; ...
2015-07-14
The 2014 Ebola outbreak in West Africa is the largest documented for this virus. We examine the dynamics of this genome, comparing more than one hundred currently available ebolavirus genomes to each other and to other viral genomes. Based on oligomer frequency analysis, the family Filoviridae forms a distinct group from all other sequenced viral genomes. All filovirus genomes sequenced to date encode proteins with similar functions and gene order, although there is considerable divergence in sequences between the three genera Ebolavirus, Cuevavirus, and Marburgvirus within the family Filoviridae. Whereas all ebolavirus genomes are quite similar (multiple sequences of themore » same strain are often identical), variation is most common in the intergenic regions and within specific areas of the genes encoding the glycoprotein (GP), nucleoprotein (NP), and polymerase (L). We predict regions that could contain epitope-binding sites, which might be good vaccine targets. In conclusion, this information, combined with glycosylation sites and experimentally determined epitopes, can identify the most promising regions for the development of therapeutic strategies.« less
Renta, J Y; Cadilla, C L; Vega, M E; Hillyer, G V; Estrada, C; Jiménez, E; Abreu, E; Méndez, I; Gandía, J; Meléndez-Guerrero, L M
1997-11-01
In this study, the HIV-1 variant viruses from ten pregnant women and their infants were isolated and characterized longitudinally in order to determine the role that viral envelope (gp120-V3 loop) gene variation and viral tropism play in vertical transmission. Biological phenotyping of each HIV variant was accomplished by growth in MT-2, and macrophages from healthy and non-HIV-infected donors. Genetic characterization of the variants was accomplished by DNA sequence analysis. All the women enrolled in this study received ZDV therapy. Virus was cultured from eight out of ten env V3-PCR positive mothers. HIV-1 isolates were all non-syncitium inducing variants. None of the mothers were found to transmit HIV, as determined by DNA PCR and quantitative co-cultures on their infants which were seronegative for HIV-1 through one year after birth. Viral cultures from infant blood samples were negative and infants were all healthy. However, nested env V3-PCR detected proviral DNA in five out of ten infants. In contrast, conventional gag-PCR was negative in the same five infants. Sequences of the five maternal-infant pairs were different, suggesting unique infant HIV-1 variants. The three highest maternal viral load values corresponded to infants that were env V3-PCR positive. These results suggest that HIV-1 particles are transmitted from ZDV-treated mothers to infants. Infant follow up is recommended to determine if HIV-1 has been inhibited by the immune system of the infants.
Reliable Detection of Herpes Simplex Virus Sequence Variation by High-Throughput Resequencing.
Morse, Alison M; Calabro, Kaitlyn R; Fear, Justin M; Bloom, David C; McIntyre, Lauren M
2017-08-16
High-throughput sequencing (HTS) has resulted in data for a number of herpes simplex virus (HSV) laboratory strains and clinical isolates. The knowledge of these sequences has been critical for investigating viral pathogenicity. However, the assembly of complete herpesviral genomes, including HSV, is complicated due to the existence of large repeat regions and arrays of smaller reiterated sequences that are commonly found in these genomes. In addition, the inherent genetic variation in populations of isolates for viruses and other microorganisms presents an additional challenge to many existing HTS sequence assembly pipelines. Here, we evaluate two approaches for the identification of genetic variants in HSV1 strains using Illumina short read sequencing data. The first, a reference-based approach, identifies variants from reads aligned to a reference sequence and the second, a de novo assembly approach, identifies variants from reads aligned to de novo assembled consensus sequences. Of critical importance for both approaches is the reduction in the number of low complexity regions through the construction of a non-redundant reference genome. We compared variants identified in the two methods. Our results indicate that approximately 85% of variants are identified regardless of the approach. The reference-based approach to variant discovery captures an additional 15% representing variants divergent from the HSV1 reference possibly due to viral passage. Reference-based approaches are significantly less labor-intensive and identify variants across the genome where de novo assembly-based approaches are limited to regions where contigs have been successfully assembled. In addition, regions of poor quality assembly can lead to false variant identification in de novo consensus sequences. For viruses with a well-assembled reference genome, a reference-based approach is recommended.
Modeling Host Genetic Regulation of Influenza Pathogenesis in the Collaborative Cross
Ferris, Martin T.; Aylor, David L.; Bottomly, Daniel; Whitmore, Alan C.; Aicher, Lauri D.; Bell, Timothy A.; Bradel-Tretheway, Birgit; Bryan, Janine T.; Buus, Ryan J.; Gralinski, Lisa E.; Haagmans, Bart L.; McMillan, Leonard; Miller, Darla R.; Rosenzweig, Elizabeth; Valdar, William; Wang, Jeremy; Churchill, Gary A.; Threadgill, David W.; McWeeney, Shannon K.; Katze, Michael G.; Pardo-Manuel de Villena, Fernando; Baric, Ralph S.; Heise, Mark T.
2013-01-01
Genetic variation contributes to host responses and outcomes following infection by influenza A virus or other viral infections. Yet narrow windows of disease symptoms and confounding environmental factors have made it difficult to identify polymorphic genes that contribute to differential disease outcomes in human populations. Therefore, to control for these confounding environmental variables in a system that models the levels of genetic diversity found in outbred populations such as humans, we used incipient lines of the highly genetically diverse Collaborative Cross (CC) recombinant inbred (RI) panel (the pre-CC population) to study how genetic variation impacts influenza associated disease across a genetically diverse population. A wide range of variation in influenza disease related phenotypes including virus replication, virus-induced inflammation, and weight loss was observed. Many of the disease associated phenotypes were correlated, with viral replication and virus-induced inflammation being predictors of virus-induced weight loss. Despite these correlations, pre-CC mice with unique and novel disease phenotype combinations were observed. We also identified sets of transcripts (modules) that were correlated with aspects of disease. In order to identify how host genetic polymorphisms contribute to the observed variation in disease, we conducted quantitative trait loci (QTL) mapping. We identified several QTL contributing to specific aspects of the host response including virus-induced weight loss, titer, pulmonary edema, neutrophil recruitment to the airways, and transcriptional expression. Existing whole-genome sequence data was applied to identify high priority candidate genes within QTL regions. A key host response QTL was located at the site of the known anti-influenza Mx1 gene. We sequenced the coding regions of Mx1 in the eight CC founder strains, and identified a novel Mx1 allele that showed reduced ability to inhibit viral replication, while maintaining protection from weight loss. PMID:23468633
Heiman, Erica M.; McDonald, Sarah M.; Barro, Mario; Taraporewala, Zenobia F.; Bar-Magen, Tamara; Patton, John T.
2008-01-01
Group A human rotaviruses (HRVs) are the major cause of severe viral gastroenteritis in infants and young children. To gain insight into the level of genetic variation among HRVs, we determined the genome sequences for 10 strains belonging to different VP7 serotypes (G types). The HRVs chosen for this study, D, DS-1, P, ST3, IAL28, Se584, 69M, WI61, A64, and L26, were isolated from infected persons and adapted to cell culture to use as serotype references. Our sequencing results revealed that most of the individual proteins from each HRV belong to one of three genotypes (1, 2, or 3) based on their similarities to proteins of genogroup strains (Wa, DS-1, or AU-1, respectively). Strains D, P, ST3, IAL28, and WI61 encode genotype 1 (Wa-like) proteins, whereas strains DS-1 and 69M encode genotype 2 (DS-1-like) proteins. Of the 10 HRVs sequenced, 3 of them (Se584, A64, and L26) encode proteins belonging to more than one genotype, indicating that they are intergenogroup reassortants. We used amino acid sequence alignments to identify residues that distinguish proteins belonging to HRV genotype 1, 2, or 3. These genotype-specific changes cluster in definitive regions within each viral protein, many of which are sites of known protein-protein interactions. For the intermediate viral capsid protein (VP6), the changes map onto the atomic structure at the VP2-VP6, VP4-VP6, and VP7-VP6 interfaces. The results of this study provide evidence that group A HRV gene constellations exist and may be influenced by interactions among viral proteins during replication. PMID:18786998
Dissection of Influenza Infection In Vivo by Single-Cell RNA Sequencing.
Steuerman, Yael; Cohen, Merav; Peshes-Yaloz, Naama; Valadarsky, Liran; Cohn, Ofir; David, Eyal; Frishberg, Amit; Mayo, Lior; Bacharach, Eran; Amit, Ido; Gat-Viks, Irit
2018-06-01
The influenza virus is a major cause of morbidity and mortality worldwide. Yet, both the impact of intracellular viral replication and the variation in host response across different cell types remain uncharacterized. Here we used single-cell RNA sequencing to investigate the heterogeneity in the response of lung tissue cells to in vivo influenza infection. Analysis of viral and host transcriptomes in the same single cell enabled us to resolve the cellular heterogeneity of bystander (exposed but uninfected) as compared with infected cells. We reveal that all major immune and non-immune cell types manifest substantial fractions of infected cells, albeit at low viral transcriptome loads relative to epithelial cells. We show that all cell types respond primarily with a robust generic transcriptional response, and we demonstrate novel markers specific for influenza-infected as opposed to bystander cells. These findings open new avenues for targeted therapy aimed exclusively at infected cells. Copyright © 2018 Elsevier Inc. All rights reserved.
Genetic variability and evolutionary dynamics of viruses of the family Closteroviridae
Rubio, Luis; Guerri, José; Moreno, Pedro
2013-01-01
RNA viruses have a great potential for genetic variation, rapid evolution and adaptation. Characterization of the genetic variation of viral populations provides relevant information on the processes involved in virus evolution and epidemiology and it is crucial for designing reliable diagnostic tools and developing efficient and durable disease control strategies. Here we performed an updated analysis of sequences available in Genbank and reviewed present knowledge on the genetic variability and evolutionary processes of viruses of the family Closteroviridae. Several factors have shaped the genetic structure and diversity of closteroviruses. (I) A strong negative selection seems to be responsible for the high genetic stability in space and time for some viruses. (2) Long distance migration, probably by human transport of infected propagative plant material, have caused that genetically similar virus isolates are found in distant geographical regions. (3) Recombination between divergent sequence variants have generated new genotypes and plays an important role for the evolution of some viruses of the family Closteroviridae. (4) Interaction between virus strains or between different viruses in mixed infections may alter accumulation of certain strains. (5) Host change or virus transmission by insect vectors induced changes in the viral population structure due to positive selection of sequence variants with higher fitness for host-virus or vector-virus interaction (adaptation) or by genetic drift due to random selection of sequence variants during the population bottleneck associated to the transmission process. PMID:23805130
Angly, Florent E; Willner, Dana; Prieto-Davó, Alejandra; Edwards, Robert A; Schmieder, Robert; Vega-Thurber, Rebecca; Antonopoulos, Dionysios A; Barott, Katie; Cottrell, Matthew T; Desnues, Christelle; Dinsdale, Elizabeth A; Furlan, Mike; Haynes, Matthew; Henn, Matthew R; Hu, Yongfei; Kirchman, David L; McDole, Tracey; McPherson, John D; Meyer, Folker; Miller, R Michael; Mundt, Egbert; Naviaux, Robert K; Rodriguez-Mueller, Beltran; Stevens, Rick; Wegley, Linda; Zhang, Lixin; Zhu, Baoli; Rohwer, Forest
2009-12-01
Metagenomic studies characterize both the composition and diversity of uncultured viral and microbial communities. BLAST-based comparisons have typically been used for such analyses; however, sampling biases, high percentages of unknown sequences, and the use of arbitrary thresholds to find significant similarities can decrease the accuracy and validity of estimates. Here, we present Genome relative Abundance and Average Size (GAAS), a complete software package that provides improved estimates of community composition and average genome length for metagenomes in both textual and graphical formats. GAAS implements a novel methodology to control for sampling bias via length normalization, to adjust for multiple BLAST similarities by similarity weighting, and to select significant similarities using relative alignment lengths. In benchmark tests, the GAAS method was robust to both high percentages of unknown sequences and to variations in metagenomic sequence read lengths. Re-analysis of the Sargasso Sea virome using GAAS indicated that standard methodologies for metagenomic analysis may dramatically underestimate the abundance and importance of organisms with small genomes in environmental systems. Using GAAS, we conducted a meta-analysis of microbial and viral average genome lengths in over 150 metagenomes from four biomes to determine whether genome lengths vary consistently between and within biomes, and between microbial and viral communities from the same environment. Significant differences between biomes and within aquatic sub-biomes (oceans, hypersaline systems, freshwater, and microbialites) suggested that average genome length is a fundamental property of environments driven by factors at the sub-biome level. The behavior of paired viral and microbial metagenomes from the same environment indicated that microbial and viral average genome sizes are independent of each other, but indicative of community responses to stressors and environmental conditions.
Strain Variation in an Emerging Iridovirus of Warm-Water Fishes
Goldberg, Tony L.; Coleman, David A.; Grant, Emily C.; Inendino, Kate R.; Philipp, David P.
2003-01-01
Although iridoviruses vary widely within and among genera with respect to their host range and virulence, variation within iridovirus species has been less extensively characterized. This study explores the nature and extent of intraspecific variation within an emerging iridovirus of North American warm-water fishes, largemouth bass virus (LMBV). Three LMBV isolates recovered from three distinct sources differed genetically and phenotypically. Genetically, the isolates differed in the banding patterns generated from amplified fragment length polymorphism analysis but not in their DNA sequences at two loci of different degrees of evolutionary stability. In vitro, the isolates replicated at identical rates in cell culture, as determined by real-time quantitative PCR of viral particles released into suspension. In vivo, the isolates varied over fivefold in virulence, as measured by the rate at which they induced mortality in juvenile largemouth bass. This variation was reflected in the viral loads of exposed fish, measured using real-time quantitative PCR; the most virulent viral strain also replicated to the highest level in fish. Together, these results justify the designation of these isolates as different strains of LMBV. Strain variation in iridoviruses could help explain why animal populations naturally infected with iridovirus pathogens vary so extensively in their clinical responses to infection. The results of this study are especially relevant to emerging iridoviruses of aquaculture systems and wildlife. PMID:12885900
USDA-ARS?s Scientific Manuscript database
Porcine reproductive and respiratory syndrome virus (PRRSV) is widespread with a high variation in sequence and virulence among the divergent strains and causes an economically destructive disease. A viral ovarian domain protease (vOTU) has been previously identified within the nonstructural protein...
Weiss, Eric R.; Alter, Galit; Ogembo, Javier Gordon; Henderson, Jennifer L.; Tabak, Barbara; Bakiş, Yasin; Somasundaran, Mohan; Garber, Manuel; Selin, Liisa
2016-01-01
ABSTRACT The Epstein-Barr virus (EBV) gp350 glycoprotein interacts with the cellular receptor to mediate viral entry and is thought to be the major target for neutralizing antibodies. To better understand the role of EBV-specific antibodies in the control of viral replication and the evolution of sequence diversity, we measured EBV gp350-specific antibody responses and sequenced the gp350 gene in samples obtained from individuals experiencing primary EBV infection (acute infectious mononucleosis [AIM]) and again 6 months later (during convalescence [CONV]). EBV gp350-specific IgG was detected in the sera of 17 (71%) of 24 individuals at the time of AIM and all 24 (100%) individuals during CONV; binding antibody titers increased from AIM through CONV, reaching levels equivalent to those in age-matched, chronically infected individuals. Antibody-dependent cell-mediated phagocytosis (ADCP) was rarely detected during AIM (4 of 24 individuals; 17%) but was commonly detected during CONV (19 of 24 individuals; 79%). The majority (83%) of samples taken during AIM neutralized infection of primary B cells; all samples obtained at 6 months postdiagnosis neutralized EBV infection of cultured and primary target cells. Deep sequencing revealed interpatient gp350 sequence variation but conservation of the CR2-binding site. The levels of gp350-specific neutralizing activity directly correlated with higher peripheral blood EBV DNA levels during AIM and a greater evolution of diversity in gp350 nucleotide sequences from AIM to CONV. In summary, we conclude that the viral load and EBV gp350 diversity during early infection are associated with the development of neutralizing antibody responses following AIM. IMPORTANCE Antibodies against viral surface proteins can blunt the spread of viral infection by coating viral particles, mediating uptake by immune cells, or blocking interaction with host cell receptors, making them a desirable component of a sterilizing vaccine. The EBV surface protein gp350 is a major target for antibodies. We report the detection of EBV gp350-specific antibodies capable of neutralizing EBV infection in vitro. The majority of gp350-directed vaccines focus on glycoproteins from lab-adapted strains, which may poorly reflect primary viral envelope diversity. We report some of the first primary gp350 sequences, noting that the gp350 host receptor binding site is remarkably stable across patients and time. However, changes in overall gene diversity were detectable during infection. Patients with higher peripheral blood viral loads in primary infection and greater changes in viral diversity generated more efficient antibodies. Our findings provide insight into the generation of functional antibodies, necessary for vaccine development. PMID:27733645
Weiss, Eric R; Alter, Galit; Ogembo, Javier Gordon; Henderson, Jennifer L; Tabak, Barbara; Bakiş, Yasin; Somasundaran, Mohan; Garber, Manuel; Selin, Liisa; Luzuriaga, Katherine
2017-01-01
The Epstein-Barr virus (EBV) gp350 glycoprotein interacts with the cellular receptor to mediate viral entry and is thought to be the major target for neutralizing antibodies. To better understand the role of EBV-specific antibodies in the control of viral replication and the evolution of sequence diversity, we measured EBV gp350-specific antibody responses and sequenced the gp350 gene in samples obtained from individuals experiencing primary EBV infection (acute infectious mononucleosis [AIM]) and again 6 months later (during convalescence [CONV]). EBV gp350-specific IgG was detected in the sera of 17 (71%) of 24 individuals at the time of AIM and all 24 (100%) individuals during CONV; binding antibody titers increased from AIM through CONV, reaching levels equivalent to those in age-matched, chronically infected individuals. Antibody-dependent cell-mediated phagocytosis (ADCP) was rarely detected during AIM (4 of 24 individuals; 17%) but was commonly detected during CONV (19 of 24 individuals; 79%). The majority (83%) of samples taken during AIM neutralized infection of primary B cells; all samples obtained at 6 months postdiagnosis neutralized EBV infection of cultured and primary target cells. Deep sequencing revealed interpatient gp350 sequence variation but conservation of the CR2-binding site. The levels of gp350-specific neutralizing activity directly correlated with higher peripheral blood EBV DNA levels during AIM and a greater evolution of diversity in gp350 nucleotide sequences from AIM to CONV. In summary, we conclude that the viral load and EBV gp350 diversity during early infection are associated with the development of neutralizing antibody responses following AIM. Antibodies against viral surface proteins can blunt the spread of viral infection by coating viral particles, mediating uptake by immune cells, or blocking interaction with host cell receptors, making them a desirable component of a sterilizing vaccine. The EBV surface protein gp350 is a major target for antibodies. We report the detection of EBV gp350-specific antibodies capable of neutralizing EBV infection in vitro The majority of gp350-directed vaccines focus on glycoproteins from lab-adapted strains, which may poorly reflect primary viral envelope diversity. We report some of the first primary gp350 sequences, noting that the gp350 host receptor binding site is remarkably stable across patients and time. However, changes in overall gene diversity were detectable during infection. Patients with higher peripheral blood viral loads in primary infection and greater changes in viral diversity generated more efficient antibodies. Our findings provide insight into the generation of functional antibodies, necessary for vaccine development. Copyright © 2016 American Society for Microbiology.
Salmon, Jérôme; Nonnenmacher, Mathieu; Cazé, Sandrine; Flamant, Patricia; Croissant, Odile; Orth, Gérard; Breitburd, Françoise
2000-01-01
We previously reported the partial characterization of two cottontail rabbit papillomavirus (CRPV) subtypes with strikingly divergent E6 and E7 oncoproteins. We report now the complete nucleotide sequences of these subtypes, referred to as CRPVa4 (7,868 nucleotides) and CRPVb (7,867 nucleotides). The CRPVa4 and CRPVb genomes differed at 238 (3%) nucleotide positions, whereas CRPVa4 and the prototype CRPV differed by only 5 nucleotides. The most variable region (7% nucleotide divergence) included the long regulatory region (LRR) and the E6 and E7 genes. A mutation in the stop codon resulted in an 8-amino-acid-longer CRPVb E4 protein, and a nucleotide deletion reduced the coding capacity of the E5 gene from 101 to 25 amino acids. In domestic rabbits homozygous for a specific haplotype of the DRA and DQA genes of the major histocompatibility complex, warts induced by CRPVb DNA or a chimeric genome containing the CRPVb LRR/E6/E7 region showed an early regression, whereas warts induced by CRPVa4 or a chimeric genome containing the CRPVa4 LRR/E6/E7 region persisted and evolved into carcinomas. In contrast, most CRPVa, CRPVb, and chimeric CRPV DNA-induced warts showed no early regression in rabbits homozygous for another DRA-DQA haplotype. Little, if any, viral replication is usually observed in domestic rabbit warts. When warts induced by CRPVa and CRPVb virions and DNA were compared, the number of cells positive for viral DNA or capsid antigens was found to be greater by 1 order of magnitude for specimens induced by CRPVb. Thus, both sequence variation in the LRR/E6/E7 region and the genetic constitution of the host influence the expression of the oncogenic potential of CRPV. Furthermore, intratype variation may overcome to some extent the host restriction of CRPV replication in domestic rabbits. PMID:11044121
Korber, B T; Osmanov, S; Esparza, J; Myers, G
1994-11-01
The World Health Organization Global Programme on AIDS (WHO/GPA) is conducting a large-scale collaborative study of human immunodeficiency virus type 1 (HIV-1) variation, based in four potential vaccine-trial site countries: Brazil, Rwanda, Thailand, and Uganda. Through the course of this study, it was crucial to keep track of certain attributes of the samples from which the viral nucleotide sequences were derived (e.g., country of origin and viral culture characterization), so that meaningful sequence comparisons could be made. Here we describe a system developed in the context of the WHO/GPA study that summarizes such critical attributes by representing them as standardized characters directly incorporated into sequence names. This nomenclature allows linkage of clinical, phenotypic, and geographic information with molecular data. We propose that other investigators involved in human immunodeficiency virus (HIV) nucleotide sequencing efforts adopt a similar standardized sequence nomenclature to facilitate cross-study sequence comparison. HIV sequence data are being generated at an ever-increasing rate; directly coupled to this increase is our deepening understanding of biological parameters that influence or result from sequence variability. A standardized sequence nomenclature that includes relevant biological information would enable researchers to better utilize the growing body of sequence data, and enhance their ability to interpret the biological implications of their own data through facilitating comparisons with previously published work.
Jo, Yeonhwa; Choi, Hoseong; Kim, Sang-Min; Kim, Sun-Lim; Lee, Bong Choon; Cho, Won Kyong
2016-08-09
Next-generation sequencing (NGS) provides many possibilities for plant virology research. In this study, we performed integrated analyses using plant transcriptome data for plant virus identification using Apple stem grooving virus (ASGV) as an exemplar virus. We used 15 publicly available transcriptome libraries from three different studies, two mRNA-Seq studies and a small RNA-Seq study. We de novo assembled nearly complete genomes of ASGV isolates Fuji and Cuiguan from apple and pear transcriptomes, respectively, and identified single nucleotide variations (SNVs) of ASGV within the transcriptomes. We demonstrated the application of NGS raw data to confirm viral infections in the plant transcriptomes. In addition, we compared the usability of two de novo assemblers, Trinity and Velvet, for virus identification and genome assembly. A phylogenetic tree revealed that ASGV and Citrus tatter leaf virus (CTLV) are the same virus, which was divided into two clades. Recombination analyses identified six recombination events from 21 viral genomes. Taken together, our in silico analyses using NGS data provide a successful application of plant transcriptomes to reveal extensive information associated with viral genome assembly, SNVs, phylogenetic relationships, and genetic recombination.
Natural Variation of Epstein-Barr Virus Genes, Proteins, and Primary MicroRNA.
Correia, Samantha; Palser, Anne; Elgueta Karstegl, Claudio; Middeldorp, Jaap M; Ramayanti, Octavia; Cohen, Jeffrey I; Hildesheim, Allan; Fellner, Maria Dolores; Wiels, Joelle; White, Robert E; Kellam, Paul; Farrell, Paul J
2017-08-01
Viral gene sequences from an enlarged set of about 200 Epstein-Barr virus (EBV) strains, including many primary isolates, have been used to investigate variation in key viral genetic regions, particularly LMP1, Zp, gp350, EBNA1, and the BART microRNA (miRNA) cluster 2. Determination of type 1 and type 2 EBV in saliva samples from people from a wide range of geographic and ethnic backgrounds demonstrates a small percentage of healthy white Caucasian British people carrying predominantly type 2 EBV. Linkage of Zp and gp350 variants to type 2 EBV is likely to be due to their genes being adjacent to the EBNA3 locus, which is one of the major determinants of the type 1/type 2 distinction. A novel classification of EBNA1 DNA binding domains, named QCIGP, results from phylogeny analysis of their protein sequences but is not linked to the type 1/type 2 classification. The BART cluster 2 miRNA region is classified into three major variants through single-nucleotide polymorphisms (SNPs) in the primary miRNA outside the mature miRNA sequences. These SNPs can result in altered levels of expression of some miRNAs from the BART variant frequently present in Chinese and Indonesian nasopharyngeal carcinoma (NPC) samples. The EBV genetic variants identified here provide a basis for future, more directed analysis of association of specific EBV variations with EBV biology and EBV-associated diseases. IMPORTANCE Incidence of diseases associated with EBV varies greatly in different parts of the world. Thus, relationships between EBV genome sequence variation and health, disease, geography, and ethnicity of the host may be important for understanding the role of EBV in diseases and for development of an effective EBV vaccine. This paper provides the most comprehensive analysis so far of variation in specific EBV genes relevant to these diseases and proposed EBV vaccines. By focusing on variation in LMP1, Zp, gp350, EBNA1, and the BART miRNA cluster 2, new relationships with the known type 1/type 2 strains are demonstrated, and a novel classification of EBNA1 and the BART miRNAs is proposed. Copyright © 2017 Correia et al.
DOE Office of Scientific and Technical Information (OSTI.GOV)
López, José L.; Golemba, Marcelo; Hernández, Edgardo
Rhodopsins are broadly distributed. In this work, we analyzed 23 metagenomes corresponding to marine sediment samples from four regions that share cold climate conditions (Norway; Sweden; Argentina and Antarctica). In order to investigate the genes evolution of viral rhodopsins, an initial set of 6224 bacterial rhodopsin sequences according to COG5524 were retrieved from the 23 metagenomes. After selection by the presence of transmembrane domains and alignment, 123 viral (51) and non-viral (72) sequences (>50 amino acids) were finally included in further analysis. Viral rhodopsin genes were homologs of Phaeocystis globosa virus and Organic lake Phycodnavirus. Non-viral microbial rhodopsin genes weremore » ascribed to Bacteroidetes, Planctomycetes, Firmicutes, Actinobacteria, Cyanobacteria, Proteobacteria, Deinococcus-Thermus and Cryptophyta and Fungi. A rescreening using Blastp, using as queries the viral sequences previously described, retrieved 30 sequences (>100 amino acids). Phylogeographic analysis revealed a geographical clustering of the sequences affiliated to the viral group. This clustering was not observed for the microbial non-viral sequences. The phylogenetic reconstruction allowed us to propose the existence of a putative ancestor of viral rhodopsin genes related to Actinobacteria and Chloroflexi. This is the first report about the existence of a phylogeographic association of the viral rhodopsin sequences from marine sediments.« less
The evolution of subtype B HIV-1 tat in the Netherlands during 1985-2012.
van der Kuyl, Antoinette C; Vink, Monique; Zorgdrager, Fokla; Bakker, Margreet; Wymant, Chris; Hall, Matthew; Gall, Astrid; Blanquart, François; Berkhout, Ben; Fraser, Christophe; Cornelissen, Marion
2018-05-02
For the production of viral genomic RNA, HIV-1 is dependent on an early viral protein, Tat, which is required for high-level transcription. The quantity of viral RNA detectable in blood of HIV-1 infected individuals varies dramatically, and a factor involved could be the efficiency of Tat protein variants to stimulate RNA transcription. HIV-1 virulence, measured by set-point viral load, has been observed to increase over time in the Netherlands and elsewhere. Investigation of tat gene evolution in clinical isolates could discover a role of Tat in this changing virulence. A dataset of 291 Dutch HIV-1 subtype B tat genes, derived from full-length HIV-1 genome sequences from samples obtained between 1985-2012, was used to analyse the evolution of Tat. Twenty-two patient-derived tat genes, and the control Tat HXB2 were analysed for their capacity to stimulate expression of an LTR-luciferase reporter gene construct in diverse cell lines, as well as for their ability to complement a tat-defective HIV-1 LAI clone. Analysis of 291 historical tat sequences from the Netherlands showed ample amino acid (aa) variation between isolates, although no specific mutations were selected for over time. Of note, however, the encoded protein varied its length over the years through the loss or gain of stop codons in the second exon. In transmission clusters, a selection against the shorter Tat86 ORF was apparent in favour of the more common Tat101 version, likely due to negative selection against Tat86 itself, although random drift, transmission bottlenecks, or linkage to other variants could also explain the observation. There was no correlation between Tat length and set-point viral load; however, the number of non-intermediate variants in our study was small. In addition, variation in the length of Tat did not significantly change its capacity to stimulate transcription. From 1985 till 2012, variation in the length of the HIV-1 subtype B tat gene is increasingly found in the Dutch epidemic. However, as Tat proteins did not differ significantly in their capacity to stimulate transcription elongation in vitro, the increased HIV-1 virulence seen in recent years could not be linked to an evolving viral Tat protein. Crown Copyright © 2018. Published by Elsevier B.V. All rights reserved.
Retroviral DNA Integration Directed by HIV Integration Protein in Vitro
NASA Astrophysics Data System (ADS)
Bushman, Frederic D.; Fujiwara, Tamio; Craigie, Robert
1990-09-01
Efficient retroviral growth requires integration of a DNA copy of the viral RNA genome into a chromosome of the host. As a first step in analyzing the mechanism of integration of human immunodeficiency virus (HIV) DNA, a cell-free system was established that models the integration reaction. The in vitro system depends on the HIV integration (IN) protein, which was partially purified from insect cells engineered to express IN protein in large quantities. Integration was detected in a biological assay that scores the insertion of a linear DNA containing HIV terminal sequences into a λ DNA target. Some integration products generated in this assay contained five-base pair duplications of the target DNA at the recombination junctions, a characteristic of HIV integration in vivo; the remaining products contained aberrant junctional sequences that may have been produced in a variation of the normal reaction. These results indicate that HIV IN protein is the only viral protein required to insert model HIV DNA sequences into a target DNA in vitro.
Genome-Wide Networks of Amino Acid Covariances Are Common among Viruses
Donlin, Maureen J.; Szeto, Brandon; Gohara, David W.; Aurora, Rajeev
2012-01-01
Coordinated variation among positions in amino acid sequence alignments can reveal genetic dependencies at noncontiguous positions, but methods to assess these interactions are incompletely developed. Previously, we found genome-wide networks of covarying residue positions in the hepatitis C virus genome (R. Aurora, M. J. Donlin, N. A. Cannon, and J. E. Tavis, J. Clin. Invest. 119:225–236, 2009). Here, we asked whether such networks are present in a diverse set of viruses and, if so, what they may imply about viral biology. Viral sequences were obtained for 16 viruses in 13 species from 9 families. The entire viral coding potential for each virus was aligned, all possible amino acid covariances were identified using the observed-minus-expected-squared algorithm at a false-discovery rate of ≤1%, and networks of covariances were assessed using standard methods. Covariances that spanned the viral coding potential were common in all viruses. In all cases, the covariances formed a single network that contained essentially all of the covariances. The hepatitis C virus networks had hub-and-spoke topologies, but all other networks had random topologies with an unusually large number of highly connected nodes. These results indicate that genome-wide networks of genetic associations and the coordinated evolution they imply are very common in viral genomes, that the networks rarely have the hub-and-spoke topology that dominates other biological networks, and that network topologies can vary substantially even within a given viral group. Five examples with hepatitis B virus and poliovirus are presented to illustrate how covariance network analysis can lead to inferences about viral biology. PMID:22238298
Hurwitz, Bonnie L; Westveld, Anton H; Brum, Jennifer R; Sullivan, Matthew B
2014-07-22
Long-standing questions in marine viral ecology are centered on understanding how viral assemblages change along gradients in space and time. However, investigating these fundamental ecological questions has been challenging due to incomplete representation of naturally occurring viral diversity in single gene- or morphology-based studies and an inability to identify up to 90% of reads in viral metagenomes (viromes). Although protein clustering techniques provide a significant advance by helping organize this unknown metagenomic sequence space, they typically use only ∼75% of the data and rely on assembly methods not yet tuned for naturally occurring sequence variation. Here, we introduce an annotation- and assembly-free strategy for comparative metagenomics that combines shared k-mer and social network analyses (regression modeling). This robust statistical framework enables visualization of complex sample networks and determination of ecological factors driving community structure. Application to 32 viromes from the Pacific Ocean Virome dataset identified clusters of samples broadly delineated by photic zone and revealed that geographic region, depth, and proximity to shore were significant predictors of community structure. Within subsets of this dataset, depth, season, and oxygen concentration were significant drivers of viral community structure at a single open ocean station, whereas variability along onshore-offshore transects was driven by oxygen concentration in an area with an oxygen minimum zone and not depth or proximity to shore, as might be expected. Together these results demonstrate that this highly scalable approach using complete metagenomic network-based comparisons can both test and generate hypotheses for ecological investigation of viral and microbial communities in nature.
Hurwitz, Bonnie L.; Westveld, Anton H.; Brum, Jennifer R.; Sullivan, Matthew B.
2014-01-01
Long-standing questions in marine viral ecology are centered on understanding how viral assemblages change along gradients in space and time. However, investigating these fundamental ecological questions has been challenging due to incomplete representation of naturally occurring viral diversity in single gene- or morphology-based studies and an inability to identify up to 90% of reads in viral metagenomes (viromes). Although protein clustering techniques provide a significant advance by helping organize this unknown metagenomic sequence space, they typically use only ∼75% of the data and rely on assembly methods not yet tuned for naturally occurring sequence variation. Here, we introduce an annotation- and assembly-free strategy for comparative metagenomics that combines shared k-mer and social network analyses (regression modeling). This robust statistical framework enables visualization of complex sample networks and determination of ecological factors driving community structure. Application to 32 viromes from the Pacific Ocean Virome dataset identified clusters of samples broadly delineated by photic zone and revealed that geographic region, depth, and proximity to shore were significant predictors of community structure. Within subsets of this dataset, depth, season, and oxygen concentration were significant drivers of viral community structure at a single open ocean station, whereas variability along onshore–offshore transects was driven by oxygen concentration in an area with an oxygen minimum zone and not depth or proximity to shore, as might be expected. Together these results demonstrate that this highly scalable approach using complete metagenomic network-based comparisons can both test and generate hypotheses for ecological investigation of viral and microbial communities in nature. PMID:25002514
Ultra-deep mutant spectrum profiling: improving sequencing accuracy using overlapping read pairs.
Chen-Harris, Haiyin; Borucki, Monica K; Torres, Clinton; Slezak, Tom R; Allen, Jonathan E
2013-02-12
High throughput sequencing is beginning to make a transformative impact in the area of viral evolution. Deep sequencing has the potential to reveal the mutant spectrum within a viral sample at high resolution, thus enabling the close examination of viral mutational dynamics both within- and between-hosts. The challenge however, is to accurately model the errors in the sequencing data and differentiate real viral mutations, particularly those that exist at low frequencies, from sequencing errors. We demonstrate that overlapping read pairs (ORP) -- generated by combining short fragment sequencing libraries and longer sequencing reads -- significantly reduce sequencing error rates and improve rare variant detection accuracy. Using this sequencing protocol and an error model optimized for variant detection, we are able to capture a large number of genetic mutations present within a viral population at ultra-low frequency levels (<0.05%). Our rare variant detection strategies have important implications beyond viral evolution and can be applied to any basic and clinical research area that requires the identification of rare mutations.
Comparing viral metagenomics methods using a highly multiplexed human viral pathogens reagent
Li, Linlin; Deng, Xutao; Mee, Edward T.; Collot-Teixeira, Sophie; Anderson, Rob; Schepelmann, Silke; Minor, Philip D.; Delwart, Eric
2014-01-01
Unbiased metagenomic sequencing holds significant potential as a diagnostic tool for the simultaneous detection of any previously genetically described viral nucleic acids in clinical samples. Viral genome sequences can also inform on likely phenotypes including drug susceptibility or neutralization serotypes. In this study, different variables of the laboratory methods often used to generate viral metagenomics libraries on the efficiency of viral detection and virus genome coverage were compared. A biological reagent consisting of 25 different human RNA and DNA viral pathogens was used to estimate the effect of filtration and nuclease digestion, DNA/RNA extraction methods, pre-amplification and the use of different library preparation kits on the detection of viral nucleic acids. Filtration and nuclease treatment led to slight decreases in the percentage of viral sequence reads and number of viruses detected. For nucleic acid extractions silica spin columns improved viral sequence recovery relative to magnetic beads and Trizol extraction. Pre-amplification using random RT-PCR while generating more viral sequence reads resulted in detection of fewer viruses, more overlapping sequences, and lower genome coverage. The ScriptSeq library preparation method retrieved more viruses and a greater fraction of their genomes than the TruSeq and Nextera methods. Viral metagenomics sequencing was able to simultaneously detect up to 22 different viruses in the biological reagent analyzed including all those detected by qPCR. Further optimization will be required for the detection of viruses in biologically more complex samples such as tissues, blood, or feces. PMID:25497414
Snow, M.; Cunningham, C.O.; Melvin, W.T.; Kurath, G.
1999-01-01
A ribonuclease (RNase) protection assay (RPA) has been used to detect nucleotide sequence variation within the nucleoprotein gene of 39 viral haemorrhagic septicaemia virus (VHSV) isolates of European marine origin. The classification of VHSV isolates based on RPA cleavage patterns permitted the identification of ten distinct groups of viruses based on differences at the molecular level. The nucleotide sequence of representatives of each of these groupings was determined and subjected to phylogenetic analysis. This revealed grouping of the European marine isolates of VHSV into three genotypes circulating within distinct geographic areas. A fourth genotype was identified comprising isolates originating from North America. Phylogenetic analyses indicated that VHSV isolates recovered from wild caught fish around the British Isles were genetically related to isolates responsible for losses in farmed turbot. Furthermore, a relationship between naturally occurring marine isolates and VHSV isolates causing mortality among rainbow trout in continental Europe was demonstrated. Analysis of the nucleoprotein gene identifies distinct lineages of viral haemorrhagic septicaemia virus within the European marine environment. Virus Res. 63, 35-44. Available from:
Origins and challenges of viral dark matter.
Krishnamurthy, Siddharth R; Wang, David
2017-07-15
The accurate classification of viral dark matter - metagenomic sequences that originate from viruses but do not align to any reference virus sequences - is one of the major obstacles in comprehensively defining the virome. Depending on the sample, viral dark matter can make up from anywhere between 40 and 90% of sequences. This review focuses on the specific nature of dark matter as it relates to viral sequences. We identify three factors that contribute to the existence of viral dark matter: the divergence and length of virus sequences, the limitations of alignment based classification, and limited representation of viruses in reference sequence databases. We then discuss current methods that have been developed to at least partially circumvent these limitations and thereby reduce the extent of viral dark matter. Copyright © 2017 Elsevier B.V. All rights reserved.
Molecular characterization of occult hepatitis B virus in genotype E-infected subjects.
Zahn, Astrid; Li, Chengyao; Danso, Kwabena; Candotti, Daniel; Owusu-Ofori, Shirley; Temple, Jillian; Allain, Jean-Pierre
2008-02-01
Occult hepatitis B virus (HBV) infection (OBI), defined as the presence of HBV DNA without detectable HBV surface antigen (HBsAg), is frequent in west Africa, where genotype E is prevalent. The prevalence of OBI in 804 blood donors and 1368 pregnant women was 1.7 and 1.5%, respectively. Nine of 32 OBI carriers were evaluated with HBV serology, viral load and complete HBV genome sequence of two to five clones. All samples except one were anti-HBV core antigen-positive and three contained antibodies against HBsAg (anti-HBs). All strains were of genotype E and formed quasispecies with 0.20-1.28% intra-sample sequence variation. Few uncommon mutations (absent in 23 genotype E reference sequences) were found across the entire genome. Two mutations in the core region encoded truncated or abnormal capsid protein, potentially affecting viral production, but were probably rescued by non-mutated variants, as found in one clone. No evidence of escape mutants was found in anti-HBs-carrying samples, as the 'a' region was consistently wild type. OBI carriers constitute approximately 10% of all HBV DNA-viraemic adult Ghanaians. OBI carriers appear as a disparate group, with a very low viral load in common, but multiple origins reflecting decades of natural evolution in an area essentially devoid of human intervention.
Surface gene variants of hepatitis B Virus in Saudi Patients.
Al-Qudari, Ahmed Y; Amer, Haitham M; Abdo, Ayman A; Hussain, Zahid; Al-Hamoudi, Waleed; Alswat, Khalid; Almajhdi, Fahad N
2016-01-01
Hepatitis B virus (HBV) continues to be one of the most important viral pathogens in humans. Surface (S) protein is the major HBV antigen that mediates virus attachment and entry and determines the virus subtype. Mutations in S gene, particularly in the "a" determinant, can influence virus detection by ELISA and may generate escape mutants. Since no records have documented the S gene mutations in HBV strains circulating in Saudi Arabia, the current study was designed to study sequence variation of S gene in strains circulating in Saudi Arabia and its correlation with clinical and risk factors. A total of 123 HBV-infected patients were recruited for this study. Clinical and biochemical parameters, serological markers, and viral load were determined in all patients. The entire S gene sequence of samples with viral load exceeding 2000 IU/mL was retrieved and exploited in sequence and phylogenetic analysis. A total of 48 mutations (21 unique) were recorded in viral strains in Saudi Arabia, among which 24 (11 unique) changed their respective amino acids. Two amino acid changes were recorded in "a" determinant, including F130L and S135F with no evidence of the vaccine escape mutant G145R in any of the samples. No specific relationship was recognized between the mutation/amino acid change record of HBsAg in strains in Saudi Arabia and clinical or laboratory data. Phylogenetic analysis categorized HBV viral strains in Saudi Arabia as members of subgenotypes D1 and D3. The present report is the first that describes mutation analysis of HBsAg in strains in Saudi Arabia on both nucleotide and amino acid levels. Different substitutions, particularly in major hydrophilic region, may have a potential influence on disease diagnosis, vaccination strategy, and antiviral chemotherapy.
Korber, B T; Kunstman, K J; Patterson, B K; Furtado, M; McEvilly, M M; Levy, R; Wolinsky, S M
1994-01-01
Human immunodeficiency virus type 1 (HIV-1) sequences were generated from blood and from brain tissue obtained by stereotactic biopsy from six patients undergoing a diagnostic neurosurgical procedure. Proviral DNA was directly amplified by nested PCR, and 8 to 36 clones from each sample were sequenced. Phylogenetic analysis of intrapatient envelope V3-V5 region HIV-1 DNA sequence sets revealed that brain viral sequences were clustered relative to the blood viral sequences, suggestive of tissue-specific compartmentalization of the virus in four of the six cases. In the other two cases, the blood and brain virus sequences were intermingled in the phylogenetic analyses, suggesting trafficking of virus between the two tissues. Slide-based PCR-driven in situ hybridization of two of the patients' brain biopsy samples confirmed our interpretation of the intrapatient phylogenetic analyses. Interpatient V3 region brain-derived sequence distances were significantly less than blood-derived sequence distances. Relative to the tip of the loop, the set of brain-derived viral sequences had a tendency towards negative or neutral charge compared with the set of blood-derived viral sequences. Entropy calculations were used as a measure of the variability at each position in alignments of blood and brain viral sequences. A relatively conserved set of positions were found, with a significantly lower entropy in the brain-than in the blood-derived viral sequences. These sites constitute a brain "signature pattern," or a noncontiguous set of amino acids in the V3 region conserved in viral sequences derived from brain tissue. This brain-derived signature pattern was also well preserved among isolates previously characterized in vitro as macrophage tropic. Macrophage-monocyte tropism may be the biological constraint that results in the conservation of the viral brain signature pattern. Images PMID:7933130
Janes, Holly; Frahm, Nicole; DeCamp, Allan; Rolland, Morgane; Gabriel, Erin; Wolfson, Julian; Hertz, Tomer; Kallas, Esper; Goepfert, Paul; Friedrich, David P.; Corey, Lawrence; Mullins, James I.; McElrath, M. Juliana; Gilbert, Peter
2012-01-01
Background The sieve analysis for the Step trial found evidence that breakthrough HIV-1 sequences for MRKAd5/HIV-1 Gag/Pol/Nef vaccine recipients were more divergent from the vaccine insert than placebo sequences in regions with predicted epitopes. We linked the viral sequence data with immune response and acute viral load data to explore mechanisms for and consequences of the observed sieve effect. Methods Ninety-one male participants (37 placebo and 54 vaccine recipients) were included; viral sequences were obtained at the time of HIV-1 diagnosis. T-cell responses were measured 4 weeks post-second vaccination and at the first or second week post-diagnosis. Acute viral load was obtained at RNA-positive and antibody-negative visits. Findings Vaccine recipients had a greater magnitude of post-infection CD8+ T cell response than placebo recipients (median 1.68% vs 1.18%; p = 0·04) and greater breadth of post-infection response (median 4.5 vs 2; p = 0·06). Viral sequences for vaccine recipients were marginally more divergent from the insert than placebo sequences in regions of Nef targeted by pre-infection immune responses (p = 0·04; Pol p = 0·13; Gag p = 0·89). Magnitude and breadth of pre-infection responses did not correlate with distance of the viral sequence to the insert (p>0·50). Acute log viral load trended lower in vaccine versus placebo recipients (estimated mean 4·7 vs 5·1) but the difference was not significant (p = 0·27). Neither was acute viral load associated with distance of the viral sequence to the insert (p>0·30). Interpretation Despite evidence of anamnestic responses, the sieve effect was not well explained by available measures of T-cell immunogenicity. Sequence divergence from the vaccine was not significantly associated with acute viral load. While point estimates suggested weak vaccine suppression of viral load, the result was not significant and more viral load data would be needed to detect suppression. PMID:22952672
DOE Office of Scientific and Technical Information (OSTI.GOV)
Villiers, Etienne P. de, E-mail: e.villiers@cgiar.or; Gallardo, Carmina; Arias, Marisa
Viral molecular epidemiology has traditionally analyzed variation in single genes. Whole genome phylogenetic analysis of 123 concatenated genes from 11 ASFV genomes, including E75, a newly sequenced virulent isolate from Spain, identified two clusters. One contained South African isolates from ticks and warthog, suggesting derivation from a sylvatic transmission cycle. The second contained isolates from West Africa and the Iberian Peninsula. Two isolates, from Kenya and Malawi, were outliers. Of the nine genomes within the clusters, seven were within p72 genotype 1. The 11 genomes sequenced comprised only 5 of the 22 p72 genotypes. Comparison of synonymous and non-synonymous mutationsmore » at the genome level identified 20 genes subject to selection pressure for diversification. A novel gene of the E75 virus evolved by the fusion of two genes within the 360 multicopy family. Comparative genomics reveals high diversity within a limited sample of the ASFV viral gene pool.« less
Wargo, Andrew R.; Kell, Alison M.; Scott, Robert J.; Thorgaard, Gary H.; Kurath, Gael
2012-01-01
Little is known about the factors that drive the high levels of between-host variation in pathogen burden that are frequently observed in viral infections. Here, two factors thought to impact viral load variability, host genetic diversity and stochastic processes linked with viral entry into the host, were examined. This work was conducted with the aquatic vertebrate virus, Infectious hematopoietic necrosis virus (IHNV), in its natural host, rainbow trout. It was found that in controlled in vivo infections of IHNV, a suggestive trend of reduced between-fish viral load variation was observed in a clonal population of isogenic trout compared to a genetically diverse population of out-bred trout. However, this trend was not statistically significant for any of the four viral genotypes examined, and high levels of fish-to-fish variation persisted even in the isogenic trout population. A decrease in fish-to-fish viral load variation was also observed in virus injection challenges that bypassed the host entry step, compared to fish exposed to the virus through the natural water-borne immersion route of infection. This trend was significant for three of the four virus genotypes examined and suggests host entry may play a role in viral load variability. However, high levels of viral load variation also remained in the injection challenges. Together, these results indicate that although host genetic diversity and viral entry may play some role in between-fish viral load variation, they are not major factors. Other biological and non-biological parameters that may influence viral load variation are discussed.
Brüwer, Jan D.
2018-01-01
Current research posits that all multicellular organisms live in symbioses with associated microorganisms and form so-called metaorganisms or holobionts. Cnidarian metaorganisms are of specific interest given that stony corals provide the foundation of the globally threatened coral reef ecosystems. To gain first insight into viruses associated with the coral model system Aiptasia (sensu Exaiptasia pallida), we analyzed an existing RNA-Seq dataset of aposymbiotic, partially populated, and fully symbiotic Aiptasia CC7 anemones with Symbiodinium. Our approach included the selective removal of anemone host and algal endosymbiont sequences and subsequent microbial sequence annotation. Of a total of 297 million raw sequence reads, 8.6 million (∼3%) remained after host and endosymbiont sequence removal. Of these, 3,293 sequences could be assigned as of viral origin. Taxonomic annotation of these sequences suggests that Aiptasia is associated with a diverse viral community, comprising 116 viral taxa covering 40 families. The viral assemblage was dominated by viruses from the families Herpesviridae (12.00%), Partitiviridae (9.93%), and Picornaviridae (9.87%). Despite an overall stable viral assemblage, we found that some viral taxa exhibited significant changes in their relative abundance when Aiptasia engaged in a symbiotic relationship with Symbiodinium. Elucidation of viral taxa consistently present across all conditions revealed a core virome of 15 viral taxa from 11 viral families, encompassing many viruses previously reported as members of coral viromes. Despite the non-random selection of viral genetic material due to the nature of the sequencing data analyzed, our study provides a first insight into the viral community associated with Aiptasia. Similarities of the Aiptasia viral community with those of corals corroborate the application of Aiptasia as a model system to study coral holobionts. Further, the change in abundance of certain viral taxa across different symbiotic states suggests a role of viruses in the algal endosymbiosis, but the functional significance of this remains to be determined. PMID:29507840
Laboratory procedures to generate viral metagenomes.
Thurber, Rebecca V; Haynes, Matthew; Breitbart, Mya; Wegley, Linda; Rohwer, Forest
2009-01-01
This collection of laboratory protocols describes the steps to collect viruses from various samples with the specific aim of generating viral metagenome sequence libraries (viromes). Viral metagenomics, the study of uncultured viral nucleic acid sequences from different biomes, relies on several concentration, purification, extraction, sequencing and heuristic bioinformatic methods. No single technique can provide an all-inclusive approach, and therefore the protocols presented here will be discussed in terms of hypothetical projects. However, care must be taken to individualize each step depending on the source and type of viral-particles. This protocol is a description of the processes we have successfully used to: (i) concentrate viral particles from various types of samples, (ii) eliminate contaminating cells and free nucleic acids and (iii) extract, amplify and purify viral nucleic acids. Overall, a sample can be processed to isolate viral nucleic acids suitable for high-throughput sequencing in approximately 1 week.
IMG/VR: a database of cultured and uncultured DNA Viruses and retroviruses
Paez-Espino, David; Chen, I. -Min A.; Palaniappan, Krishna; ...
2016-10-30
Viruses represent the most abundant life forms on the planet. Recent experimental and computational improvements have led to a dramatic increase in the number of viral genome sequences identified primarily from metagenomic samples. As a result of the expanding catalog of metagenomic viral sequences, there exists a need for a comprehensive computational platform integrating all these sequences with associated metadata and analytical tools. Here we present IMG/VR (https://img.jgi.doe.gov/vr/), the largest publicly available database of 3908 isolate reference DNA viruses with 264 413 computationally identified viral contigs from > 6000 ecologically diverse metagenomic samples. Approximately half of the viral contigs aremore » grouped into genetically distinct quasi-species clusters. Microbial hosts are predicted for 20 000 viral sequences, revealing nine microbial phyla previously unreported to be infected by viruses. Viral sequences can be queried using a variety of associated metadata, including habitat type and geographic location of the samples, or taxonomic classification according to hallmark viral genes. IMG/VR has a user-friendly interface that allows users to interrogate all integrated data and interact by comparingwith external sequences, thus serving as an essential resource in the viral genomics community.« less
IMG/VR: a database of cultured and uncultured DNA Viruses and retroviruses
DOE Office of Scientific and Technical Information (OSTI.GOV)
Paez-Espino, David; Chen, I. -Min A.; Palaniappan, Krishna
Viruses represent the most abundant life forms on the planet. Recent experimental and computational improvements have led to a dramatic increase in the number of viral genome sequences identified primarily from metagenomic samples. As a result of the expanding catalog of metagenomic viral sequences, there exists a need for a comprehensive computational platform integrating all these sequences with associated metadata and analytical tools. Here we present IMG/VR (https://img.jgi.doe.gov/vr/), the largest publicly available database of 3908 isolate reference DNA viruses with 264 413 computationally identified viral contigs from > 6000 ecologically diverse metagenomic samples. Approximately half of the viral contigs aremore » grouped into genetically distinct quasi-species clusters. Microbial hosts are predicted for 20 000 viral sequences, revealing nine microbial phyla previously unreported to be infected by viruses. Viral sequences can be queried using a variety of associated metadata, including habitat type and geographic location of the samples, or taxonomic classification according to hallmark viral genes. IMG/VR has a user-friendly interface that allows users to interrogate all integrated data and interact by comparingwith external sequences, thus serving as an essential resource in the viral genomics community.« less
Evolution and Diversity in Human Herpes Simplex Virus Genomes
Gatherer, Derek; Ochoa, Alejandro; Greenbaum, Benjamin; Dolan, Aidan; Bowden, Rory J.; Enquist, Lynn W.; Legendre, Matthieu; Davison, Andrew J.
2014-01-01
Herpes simplex virus 1 (HSV-1) causes a chronic, lifelong infection in >60% of adults. Multiple recent vaccine trials have failed, with viral diversity likely contributing to these failures. To understand HSV-1 diversity better, we comprehensively compared 20 newly sequenced viral genomes from China, Japan, Kenya, and South Korea with six previously sequenced genomes from the United States, Europe, and Japan. In this diverse collection of passaged strains, we found that one-fifth of the newly sequenced members share a gene deletion and one-third exhibit homopolymeric frameshift mutations (HFMs). Individual strains exhibit genotypic and potential phenotypic variation via HFMs, deletions, short sequence repeats, and single-nucleotide polymorphisms, although the protein sequence identity between strains exceeds 90% on average. In the first genome-scale analysis of positive selection in HSV-1, we found signs of selection in specific proteins and residues, including the fusion protein glycoprotein H. We also confirmed previous results suggesting that recombination has occurred with high frequency throughout the HSV-1 genome. Despite this, the HSV-1 strains analyzed clustered by geographic origin during whole-genome distance analysis. These data shed light on likely routes of HSV-1 adaptation to changing environments and will aid in the selection of vaccine antigens that are invariant worldwide. PMID:24227835
Mina, Thomas; Amini-Bavil-Olyaee, Samad; Shirvani-Dastgerdi, Elham; Trovão, Nídia Sequeira; Van Ranst, Marc; Pourkarim, Mahmoud Reza
2017-04-01
Fulminant hepatitis among different clinical outcomes of hepatitis B virus infection is very rare and manifests high mortality rate, however it has not been investigated in Belgian inhabitants yet. In the frame of a retrospective study between 1995 and 2010, 80 serum samples (in some cases serial samples) archived in Biobank, were collected from 24 patients who had clinically developed fulminant infection of hepatitis B virus. In total, 33 hepatitis B virus (HBV) strains (31 full-length genome and 2 partial viral genes) of different HBV genotypes and subgenotypes including A2, B2, D1, D2, D3 and E, were amplified, sequenced and phylogenetically analyzed. HBV isolated strains from native and exotic patients were characterized by genome variations associated with viral invasiveness. Although several mutations at nucleotide and protein levels were detected, evolutionary analyses revealed a negative selective pressure over the viral genomes. This study revealed influence of immigration through a steady change in the viral epidemiological profile of the Belgian population. Copyright © 2017 Elsevier B.V. All rights reserved.
Recoding method that removes inhibitory sequences and improves HIV gene expression
DOE Office of Scientific and Technical Information (OSTI.GOV)
Rabadan, Raul; Krasnitz, Michael; Robins, Harlan
The invention relates to inhibitory nucleotide signal sequences or "INS" sequences in the genomes of lentiviruses. In particular the invention relates to the AGG motif present in all viral genomes. The AGG motif may have an inhibitory effect on a virus, for example by reducing the levels of, or maintaining low steady-state levels of, viral RNAs in host cells, and inducing and/or maintaining in viral latency. In one aspect, the invention provides vaccines that contain, or are produced from, viral nucleic acids in which the AGG sequences have been mutated. In another aspect, the invention provides methods and compositions formore » affecting the function of the AGG motif, and methods for identifying other INS sequences in viral genomes.« less
McFaul, Katie; Liptrott, Neill; Cox, Alison; Martin, Phillip; Egan, Deirdre; Owen, Andrew; Kelly, Sarah; Karolia, Zeenat; Shaw, Kate; Bower, Mark; Boffito, Marta
2016-09-01
The use of combination antiretroviral therapy (cART) and cytotoxic chemotherapy for HIV-associated lymphoma runs the risks of inducing HIV drug resistance. This study examined two possible mechanisms: altered expression of membrane drug transporter protein (MTP) and acquisition of mutations in pro-viral DNA. Expression levels of MTP and pro-viral DNA resistance mutation analysis were performed on peripheral blood mononuclear cells (PBMC) before, during, and after chemotherapy. Twenty nine patients completed the three time point estimations. There were no significant variations before, during, and after chemotherapy in the expression of four MTPs: ABCB1, ABCC1, ABCC2, and SLCO3A1 (OATP3A1). Pro-viral DNA sequencing revealed that only one patient developed a new nucleos/tide reverse transcriptase inhibitor-associated mutation (184V) during the course of the study, giving a mutation rate of 0.0027 per person per year. In conclusion, concomitant administration of cytotoxic chemotherapy and cART does not induce expression of MTP. Furthermore, no significant changes in viral resistance were observed pre- and post-chemotherapy, suggesting mutagenic cytotoxic chemotherapy seems not to induce mutations in HIV pro-viral DNA.
Yamani, Laura Navika; Utsumi, Takako; Juniastuti; Wandono, Hadi; Widjanarko, Doddy; Triantanoe, Ari; Wasityastuti, Widya; Liang, Yujiao; Okada, Rina; Tanahashi, Toshihito; Murakami, Yoshiki; Azuma, Takeshi; Soetjipto; Lusida, Maria Inge; Hayashi, Yoshitake
2015-01-01
Quasispecies of hepatitis B virus (HBV) with variations in the major hydrophilic region (MHR) of the HBV surface antigen (HBsAg) can evolve during infection, allowing HBV to evade neutralizing antibodies. These escape variants may contribute to chronic infections. In this study, we looked for MHR variants in HBV quasispecies using ultradeep sequencing and evaluated the relationship between these variants and clinical manifestations in infected patients. We enrolled 30 Indonesian patients with hepatitis B infection (11 with chronic hepatitis and 19 with advanced liver disease). The most common subgenotype/subtype of HBV was B3/adw (97%). The HBsAg titer was lower in patients with advanced liver disease than that in patients with chronic hepatitis. The MHR variants were grouped based on the percentage of the viral population affected: major, ≥20% of the total population; intermediate, 5% to <20%; and minor, 1% to <5%. The rates of MHR variation that were present in the major and intermediate viral population were significantly greater in patients with advanced liver disease than those in chronic patients. The most frequent MHR variants related to immune evasion in the major and intermediate populations were P120Q/T, T123A, P127T, Q129H/R, M133L/T, and G145R. The major population of MHR variants causing impaired of HBsAg secretion (e.g., G119R, Q129R, T140I, and G145R) was detected only in advanced liver disease patients. This is the first study to use ultradeep sequencing for the detection of MHR variants of HBV quasispecies in Indonesian patients. We found that a greater number of MHR variations was related to disease severity and reduced likelihood of HBsAg titer. PMID:26202119
Verbist, Bie; Clement, Lieven; Reumers, Joke; Thys, Kim; Vapirev, Alexander; Talloen, Willem; Wetzels, Yves; Meys, Joris; Aerssens, Jeroen; Bijnens, Luc; Thas, Olivier
2015-02-22
Deep-sequencing allows for an in-depth characterization of sequence variation in complex populations. However, technology associated errors may impede a powerful assessment of low-frequency mutations. Fortunately, base calls are complemented with quality scores which are derived from a quadruplet of intensities, one channel for each nucleotide type for Illumina sequencing. The highest intensity of the four channels determines the base that is called. Mismatch bases can often be corrected by the second best base, i.e. the base with the second highest intensity in the quadruplet. A virus variant model-based clustering method, ViVaMBC, is presented that explores quality scores and second best base calls for identifying and quantifying viral variants. ViVaMBC is optimized to call variants at the codon level (nucleotide triplets) which enables immediate biological interpretation of the variants with respect to their antiviral drug responses. Using mixtures of HCV plasmids we show that our method accurately estimates frequencies down to 0.5%. The estimates are unbiased when average coverages of 25,000 are reached. A comparison with the SNP-callers V-Phaser2, ShoRAH, and LoFreq shows that ViVaMBC has a superb sensitivity and specificity for variants with frequencies above 0.4%. Unlike the competitors, ViVaMBC reports a higher number of false-positive findings with frequencies below 0.4% which might partially originate from picking up artificial variants introduced by errors in the sample and library preparation step. ViVaMBC is the first method to call viral variants directly at the codon level. The strength of the approach lies in modeling the error probabilities based on the quality scores. Although the use of second best base calls appeared very promising in our data exploration phase, their utility was limited. They provided a slight increase in sensitivity, which however does not warrant the additional computational cost of running the offline base caller. Apparently a lot of information is already contained in the quality scores enabling the model based clustering procedure to adjust the majority of the sequencing errors. Overall the sensitivity of ViVaMBC is such that technical constraints like PCR errors start to form the bottleneck for low frequency variant detection.
Continuous Influx of Genetic Material from Host to Virus Populations
Gilbert, Clément; Peccoud, Jean; Chateigner, Aurélien; Moumen, Bouziane
2016-01-01
Many genes of large double-stranded DNA viruses have a cellular origin, suggesting that host-to-virus horizontal transfer (HT) of DNA is recurrent. Yet, the frequency of these transfers has never been assessed in viral populations. Here we used ultra-deep DNA sequencing of 21 baculovirus populations extracted from two moth species to show that a large diversity of moth DNA sequences (n = 86) can integrate into viral genomes during the course of a viral infection. The majority of the 86 different moth DNA sequences are transposable elements (TEs, n = 69) belonging to 10 superfamilies of DNA transposons and three superfamilies of retrotransposons. The remaining 17 sequences are moth sequences of unknown nature. In addition to bona fide DNA transposition, we uncover microhomology-mediated recombination as a mechanism explaining integration of moth sequences into viral genomes. Many sequences integrated multiple times at multiple positions along the viral genome. We detected a total of 27,504 insertions of moth sequences in the 21 viral populations and we calculate that on average, 4.8% of viruses harbor at least one moth sequence in these populations. Despite this substantial proportion, no insertion of moth DNA was maintained in any viral population after 10 successive infection cycles. Hence, there is a constant turnover of host DNA inserted into viral genomes each time the virus infects a moth. Finally, we found that at least 21 of the moth TEs integrated into viral genomes underwent repeated horizontal transfers between various insect species, including some lepidopterans susceptible to baculoviruses. Our results identify host DNA influx as a potent source of genetic diversity in viral populations. They also support a role for baculoviruses as vectors of DNA HT between insects, and call for an evaluation of possible gene or TE spread when using viruses as biopesticides or gene delivery vectors. PMID:26829124
Continuous Influx of Genetic Material from Host to Virus Populations.
Gilbert, Clément; Peccoud, Jean; Chateigner, Aurélien; Moumen, Bouziane; Cordaux, Richard; Herniou, Elisabeth A
2016-02-01
Many genes of large double-stranded DNA viruses have a cellular origin, suggesting that host-to-virus horizontal transfer (HT) of DNA is recurrent. Yet, the frequency of these transfers has never been assessed in viral populations. Here we used ultra-deep DNA sequencing of 21 baculovirus populations extracted from two moth species to show that a large diversity of moth DNA sequences (n = 86) can integrate into viral genomes during the course of a viral infection. The majority of the 86 different moth DNA sequences are transposable elements (TEs, n = 69) belonging to 10 superfamilies of DNA transposons and three superfamilies of retrotransposons. The remaining 17 sequences are moth sequences of unknown nature. In addition to bona fide DNA transposition, we uncover microhomology-mediated recombination as a mechanism explaining integration of moth sequences into viral genomes. Many sequences integrated multiple times at multiple positions along the viral genome. We detected a total of 27,504 insertions of moth sequences in the 21 viral populations and we calculate that on average, 4.8% of viruses harbor at least one moth sequence in these populations. Despite this substantial proportion, no insertion of moth DNA was maintained in any viral population after 10 successive infection cycles. Hence, there is a constant turnover of host DNA inserted into viral genomes each time the virus infects a moth. Finally, we found that at least 21 of the moth TEs integrated into viral genomes underwent repeated horizontal transfers between various insect species, including some lepidopterans susceptible to baculoviruses. Our results identify host DNA influx as a potent source of genetic diversity in viral populations. They also support a role for baculoviruses as vectors of DNA HT between insects, and call for an evaluation of possible gene or TE spread when using viruses as biopesticides or gene delivery vectors.
IMG/VR: a database of cultured and uncultured DNA Viruses and retroviruses.
Paez-Espino, David; Chen, I-Min A; Palaniappan, Krishna; Ratner, Anna; Chu, Ken; Szeto, Ernest; Pillay, Manoj; Huang, Jinghua; Markowitz, Victor M; Nielsen, Torben; Huntemann, Marcel; K Reddy, T B; Pavlopoulos, Georgios A; Sullivan, Matthew B; Campbell, Barbara J; Chen, Feng; McMahon, Katherine; Hallam, Steve J; Denef, Vincent; Cavicchioli, Ricardo; Caffrey, Sean M; Streit, Wolfgang R; Webster, John; Handley, Kim M; Salekdeh, Ghasem H; Tsesmetzis, Nicolas; Setubal, Joao C; Pope, Phillip B; Liu, Wen-Tso; Rivers, Adam R; Ivanova, Natalia N; Kyrpides, Nikos C
2017-01-04
Viruses represent the most abundant life forms on the planet. Recent experimental and computational improvements have led to a dramatic increase in the number of viral genome sequences identified primarily from metagenomic samples. As a result of the expanding catalog of metagenomic viral sequences, there exists a need for a comprehensive computational platform integrating all these sequences with associated metadata and analytical tools. Here we present IMG/VR (https://img.jgi.doe.gov/vr/), the largest publicly available database of 3908 isolate reference DNA viruses with 264 413 computationally identified viral contigs from >6000 ecologically diverse metagenomic samples. Approximately half of the viral contigs are grouped into genetically distinct quasi-species clusters. Microbial hosts are predicted for 20 000 viral sequences, revealing nine microbial phyla previously unreported to be infected by viruses. Viral sequences can be queried using a variety of associated metadata, including habitat type and geographic location of the samples, or taxonomic classification according to hallmark viral genes. IMG/VR has a user-friendly interface that allows users to interrogate all integrated data and interact by comparing with external sequences, thus serving as an essential resource in the viral genomics community. © The Author(s) 2016. Published by Oxford University Press on behalf of Nucleic Acids Research.
Pontremoli, Chiara; Forni, Diego; Cagliani, Rachele; Pozzoli, Uberto; Riva, Stefania; Bravo, Ignacio G; Clerici, Mario; Sironi, Manuela
2017-10-01
The Old World (OW) arenavirus complex includes several species of rodent-borne viruses, some of which (i.e., Lassa virus, LASV and Lymphocytic choriomeningitis virus, LCMV) cause human diseases. Most LCMV and LASV infections are caused by rodent-to-human transmissions. Thus, viral evolution is largely determined by events that occur in the wildlife reservoirs. We used a set of human- and rodent-derived viral sequences to investigate the evolutionary history underlying OW arenavirus speciation, as well as the more recent selective events that accompanied LASV spread in West Africa. We show that the viral RNA polymerase (L protein) was a major positive selection target in OW arenaviruses and during LASV out-of-Nigeria migration. No evidence of selection was observed for the glycoprotein, whereas positive selection acted on the nucleoprotein (NP) during LCMV speciation. Positively selected sites in L and NP are surrounded by highly conserved residues, and the bulk of the viral genome evolves under purifying selection. Several positively selected sites are likely to modulate viral replication/transcription. In both L and NP, structural features (solvent exposed surface area) are important determinants of site-wise evolutionary rate variation. By incorporating several rodent-derived sequences, we also performed an analysis of OW arenavirus codon adaptation to the human host. Results do not support a previously hypothesized role of codon adaptation in disease severity for non-Nigerian strains. In conclusion, L and NP represent the major selection targets and possible determinants of disease presentation; these results suggest that field surveys and experimental studies should primarily focus on these proteins. © 2017 John Wiley & Sons Ltd.
Genome Sequencing and Analysis of Geographically Diverse Clinical Isolates of Herpes Simplex Virus 2
Lamers, Susanna L.; Weiner, Brian; Ray, Stuart C.; Colgrove, Robert C.; Diaz, Fernando; Jing, Lichen; Wang, Kening; Saif, Sakina; Young, Sarah; Henn, Matthew; Laeyendecker, Oliver; Tobian, Aaron A. R.; Cohen, Jeffrey I.; Koelle, David M.; Quinn, Thomas C.; Knipe, David M.
2015-01-01
ABSTRACT Herpes simplex virus 2 (HSV-2), the principal causative agent of recurrent genital herpes, is a highly prevalent viral infection worldwide. Limited information is available on the amount of genomic DNA variation between HSV-2 strains because only two genomes have been determined, the HG52 laboratory strain and the newly sequenced SD90e low-passage-number clinical isolate strain, each from a different geographical area. In this study, we report the nearly complete genome sequences of 34 HSV-2 low-passage-number and laboratory strains, 14 of which were collected in Uganda, 1 in South Africa, 11 in the United States, and 8 in Japan. Our analyses of these genomes demonstrated remarkable sequence conservation, regardless of geographic origin, with the maximum nucleotide divergence between strains being 0.4% across the genome. In contrast, prior studies indicated that HSV-1 genomes exhibit more sequence diversity, as well as geographical clustering. Additionally, unlike HSV-1, little viral recombination between HSV-2 strains could be substantiated. These results are interpreted in light of HSV-2 evolution, epidemiology, and pathogenesis. Finally, the newly generated sequences more closely resemble the low-passage-number SD90e than HG52, supporting the use of the former as the new reference genome of HSV-2. IMPORTANCE Herpes simplex virus 2 (HSV-2) is a causative agent of genital and neonatal herpes. Therefore, knowledge of its DNA genome and genetic variability is central to preventing and treating genital herpes. However, only two full-length HSV-2 genomes have been reported. In this study, we sequenced 34 additional HSV-2 low-passage-number and laboratory viral genomes and initiated analysis of the genetic diversity of HSV-2 strains from around the world. The analysis of these genomes will facilitate research aimed at vaccine development, diagnosis, and the evaluation of clinical manifestations and transmission of HSV-2. This information will also contribute to our understanding of HSV evolution. PMID:26018166
Benmansour, A.; Bascuro, B.; Monnier, A.F.; Vende, P.; Winton, J.R.; de Kinkelin, P.
1997-01-01
To evaluate the genetic diversity of viral haemorrhagic septicaemia virus (VHSV), the sequence of the glycoprotein genes (G) of 11 North American and European isolates were determined. Comparison with the G protein of representative members of the family Rhabdoviridae suggested that VHSV was a different virus species from infectious haemorrhagic necrosis virus (IHNV) and Hirame rhabdovirus (HIRRV). At a higher taxonomic level, VHSV, IHNV and HIRRV formed a group which was genetically closest to the genus Lyssavirus. Compared with each other, the G genes of VHSV displayed a dissimilar overall genetic diversity which correlated with differences in geographical origin. The multiple sequence alignment of the complete G protein, showed that the divergent positions were not uniformly distributed along the sequence. A central region (amino acid position 245-300) accumulated substitutions and appeared to be highly variable. The genetic heterogeneity within a single isolate was high, with an apparent internal mutation frequency of 1.2 x 10(-3) per nucleotide site, attesting the quasispecies nature of the viral population. The phylogeny separated VHSV strains according to the major geographical area of isolation: genotype I for continental Europe, genotype II for the British Isles, and genotype III for North America. Isolates from continental Europe exhibited the highest genetic variability, with sub-groups correlated partially with the serological classification. Neither neutralizing polyclonal sera, nor monoclonal antibodies, were able to discriminate between the genotypes. The overall structure of the phylogenetic tree suggests that VHSV genetic diversity and evolution fit within the model of random change and positive selection operating on quasispecies.
Cheng, Chun-Pei; Lan, Kuo-Lun; Liu, Wen-Chun; Chang, Ting-Tsung; Tseng, Vincent S
2016-12-01
Hepatitis B viral (HBV) infection is strongly associated with an increased risk of liver diseases like cirrhosis or hepatocellular carcinoma (HCC). Many lines of evidence suggest that deletions occurring in HBV genomic DNA are highly associated with the activity of HBV via the interplay between aberrant viral proteins release and human immune system. Deletions finding on the HBV whole genome sequences is thus a very important issue though there exist underlying the challenges in mining such big and complex biological data. Although some next generation sequencing (NGS) tools are recently designed for identifying structural variations such as insertions or deletions, their validity is generally committed to human sequences study. This design may not be suitable for viruses due to different species. We propose a graphics processing unit (GPU)-based data mining method called DeF-GPU to efficiently and precisely identify HBV deletions from large NGS data, which generally contain millions of reads. To fit the single instruction multiple data instructions, sequencing reads are referred to as multiple data and the deletion finding procedure is referred to as a single instruction. We use Compute Unified Device Architecture (CUDA) to parallelize the procedures, and further validate DeF-GPU on 5 synthetic and 1 real datasets. Our results suggest that DeF-GPU outperforms the existing commonly-used method Pindel and is able to exactly identify the deletions of our ground truth in few seconds. The source code and other related materials are available at https://sourceforge.net/projects/defgpu/. Copyright © 2016 Elsevier Inc. All rights reserved.
Discovery of DNA viruses in wild-caught mosquitoes using small RNA high throughput sequencing.
Ma, Maijuan; Huang, Yong; Gong, Zhengda; Zhuang, Lu; Li, Cun; Yang, Hong; Tong, Yigang; Liu, Wei; Cao, Wuchun
2011-01-01
Mosquito-borne infectious diseases pose a severe threat to public health in many areas of the world. Current methods for pathogen detection and surveillance are usually dependent on prior knowledge of the etiologic agents involved. Hence, efficient approaches are required for screening wild mosquito populations to detect known and unknown pathogens. In this study, we explored the use of Next Generation Sequencing to identify viral agents in wild-caught mosquitoes. We extracted total RNA from different mosquito species from South China. Small 18-30 bp length RNA molecules were purified, reverse-transcribed into cDNA and sequenced using Illumina GAIIx instrumentation. Bioinformatic analyses to identify putative viral agents were conducted and the results confirmed by PCR. We identified a non-enveloped single-stranded DNA densovirus in the wild-caught Culex pipiens molestus mosquitoes. The majority of the viral transcripts (.>80% of the region) were covered by the small viral RNAs, with a few peaks of very high coverage obtained. The +/- strand sequence ratio of the small RNAs was approximately 7∶1, indicating that the molecules were mainly derived from the viral RNA transcripts. The small viral RNAs overlapped, enabling contig assembly of the viral genome sequence. We identified some small RNAs in the reverse repeat regions of the viral 5'- and 3' -untranslated regions where no transcripts were expected. Our results demonstrate for the first time that high throughput sequencing of small RNA is feasible for identifying viral agents in wild-caught mosquitoes. Our results show that it is possible to detect DNA viruses by sequencing the small RNAs obtained from insects, although the underlying mechanism of small viral RNA biogenesis is unclear. Our data and those of other researchers show that high throughput small RNA sequencing can be used for pathogen surveillance in wild mosquito vectors.
Community-Acquired Poliovirus Infection in Children with Primary Immunodeficiencies in Tunisia
Triki, Hinda; Barbouche, Mohamed Ridha; Bahri, Olfa; Bejaoui, Mohamed; Dellagi, Koussay
2003-01-01
The global polio eradication program recommends the use of massive vaccination campaigns with live vaccine through National Immunization Days (NIDs) to displace the wild virus from the community. Immunodeficient patients may be indirectly infected and become chronic excretors and potential reservoirs of polioviruses, a concern for the posteradication era. This prospective study aimed to assess the risk of community-acquired infection of immunodeficient patients following NIDs, the dynamics of viral excretion and the genetic variation of excreted viruses. Sixteen children with various primary immunodeficiencies, who did not receive the vaccine during the campaign, were investigated. Stool samples were collected weekly, shortly after the NIDs, during at least 3 months, and were processed for viral isolation. Isolates were characterized by three intratypic differentiation methods and partial sequencing of the VP1/2A region. Polioviruses were detected in 4 out of 16 patients (serotype 1 in 3 patients and serotype 3 in 1 patient). Sequencing revealed more than 99% homology with homotypic Sabin strains, suggesting recent infection. Duration of viral excretion ranged from 1 to 7 weeks. Nine out of eleven isolates from the three poliovirus serotype 1-infected patients disclosed a non-Sabin-like phenotype by enzyme-linked immunosorbent assay and had recurrent mutations within or close to the neutralizing antigenic sites. In summary, the risk of secondary infection in immunodeficient patients is within the range previously reported for the general population. Although none of the four infected patients developed prolonged viral excretion, particular viral variants were selected and may be of epidemiological significance. PMID:12624052
Nucleic and Amino Acid Sequences Support Structure-Based Viral Classification.
Sinclair, Robert M; Ravantti, Janne J; Bamford, Dennis H
2017-04-15
Viral capsids ensure viral genome integrity by protecting the enclosed nucleic acids. Interactions between the genome and capsid and between individual capsid proteins (i.e., capsid architecture) are intimate and are expected to be characterized by strong evolutionary conservation. For this reason, a capsid structure-based viral classification has been proposed as a way to bring order to the viral universe. The seeming lack of sufficient sequence similarity to reproduce this classification has made it difficult to reject structural convergence as the basis for the classification. We reinvestigate whether the structure-based classification for viral coat proteins making icosahedral virus capsids is in fact supported by previously undetected sequence similarity. Since codon choices can influence nascent protein folding cotranslationally, we searched for both amino acid and nucleotide sequence similarity. To demonstrate the sensitivity of the approach, we identify a candidate gene for the pandoravirus capsid protein. We show that the structure-based classification is strongly supported by amino acid and also nucleotide sequence similarities, suggesting that the similarities are due to common descent. The correspondence between structure-based and sequence-based analyses of the same proteins shown here allow them to be used in future analyses of the relationship between linear sequence information and macromolecular function, as well as between linear sequence and protein folds. IMPORTANCE Viral capsids protect nucleic acid genomes, which in turn encode capsid proteins. This tight coupling of protein shell and nucleic acids, together with strong functional constraints on capsid protein folding and architecture, leads to the hypothesis that capsid protein-coding nucleotide sequences may retain signatures of ancient viral evolution. We have been able to show that this is indeed the case, using the major capsid proteins of viruses forming icosahedral capsids. Importantly, we detected similarity at the nucleotide level between capsid protein-coding regions from viruses infecting cells belonging to all three domains of life, reproducing a previously established structure-based classification of icosahedral viral capsids. Copyright © 2017 Sinclair et al.
Nucleic and Amino Acid Sequences Support Structure-Based Viral Classification
Sinclair, Robert M.; Ravantti, Janne J.
2017-01-01
ABSTRACT Viral capsids ensure viral genome integrity by protecting the enclosed nucleic acids. Interactions between the genome and capsid and between individual capsid proteins (i.e., capsid architecture) are intimate and are expected to be characterized by strong evolutionary conservation. For this reason, a capsid structure-based viral classification has been proposed as a way to bring order to the viral universe. The seeming lack of sufficient sequence similarity to reproduce this classification has made it difficult to reject structural convergence as the basis for the classification. We reinvestigate whether the structure-based classification for viral coat proteins making icosahedral virus capsids is in fact supported by previously undetected sequence similarity. Since codon choices can influence nascent protein folding cotranslationally, we searched for both amino acid and nucleotide sequence similarity. To demonstrate the sensitivity of the approach, we identify a candidate gene for the pandoravirus capsid protein. We show that the structure-based classification is strongly supported by amino acid and also nucleotide sequence similarities, suggesting that the similarities are due to common descent. The correspondence between structure-based and sequence-based analyses of the same proteins shown here allow them to be used in future analyses of the relationship between linear sequence information and macromolecular function, as well as between linear sequence and protein folds. IMPORTANCE Viral capsids protect nucleic acid genomes, which in turn encode capsid proteins. This tight coupling of protein shell and nucleic acids, together with strong functional constraints on capsid protein folding and architecture, leads to the hypothesis that capsid protein-coding nucleotide sequences may retain signatures of ancient viral evolution. We have been able to show that this is indeed the case, using the major capsid proteins of viruses forming icosahedral capsids. Importantly, we detected similarity at the nucleotide level between capsid protein-coding regions from viruses infecting cells belonging to all three domains of life, reproducing a previously established structure-based classification of icosahedral viral capsids. PMID:28122979
Yam, Alice Wei Yee; Colmant, Agathe M. G.; McLean, Breeanna J.; Prow, Natalie A.; Watterson, Daniel; Hall-Mendelin, Sonja; Warrilow, David; Ng, Mah-Lee; Khromykh, Alexander A.; Hall, Roy A.
2015-01-01
Mosquito-borne viruses encompass a range of virus families, comprising a number of significant human pathogens (e.g., dengue viruses, West Nile virus, Chikungunya virus). Virulent strains of these viruses are continually evolving and expanding their geographic range, thus rapid and sensitive screening assays are required to detect emerging viruses and monitor their prevalence and spread in mosquito populations. Double-stranded RNA (dsRNA) is produced during the replication of many of these viruses as either an intermediate in RNA replication (e.g., flaviviruses, togaviruses) or the double-stranded RNA genome (e.g., reoviruses). Detection and discovery of novel viruses from field and clinical samples usually relies on recognition of antigens or nucleotide sequences conserved within a virus genus or family. However, due to the wide antigenic and genetic variation within and between viral families, many novel or divergent species can be overlooked by these approaches. We have developed two monoclonal antibodies (mAbs) which show co-localised staining with proteins involved in viral RNA replication in immunofluorescence assay (IFA), suggesting specific reactivity to viral dsRNA. By assessing binding against a panel of synthetic dsRNA molecules, we have shown that these mAbs recognise dsRNA greater than 30 base pairs in length in a sequence-independent manner. IFA and enzyme-linked immunosorbent assay (ELISA) were employed to demonstrate detection of a panel of RNA viruses from several families, in a range of cell types. These mAbs, termed monoclonal antibodies to viral RNA intermediates in cells (MAVRIC), have now been incorporated into a high-throughput, economical ELISA-based screening system for the detection and discovery of viruses from mosquito populations. Our results have demonstrated that this simple system enables the efficient detection and isolation of a range of known and novel viruses in cells inoculated with field-caught mosquito samples, and represents a rapid, sequence-independent, and cost-effective approach to virus discovery. PMID:25799391
Ray, Greeshma; Schmitt, Phuong Tieu
2016-01-01
ABSTRACT Paramyxovirus particles are formed by a budding process coordinated by viral matrix (M) proteins. M proteins coalesce at sites underlying infected cell membranes and induce other viral components, including viral glycoproteins and viral ribonucleoprotein complexes (vRNPs), to assemble at these locations from which particles bud. M proteins interact with the nucleocapsid (NP or N) components of vRNPs, and these interactions enable production of infectious, genome-containing virions. For the paramyxoviruses parainfluenza virus 5 (PIV5) and mumps virus, M-NP interaction also contributes to efficient production of virus-like particles (VLPs) in transfected cells. A DLD sequence near the C-terminal end of PIV5 NP protein was previously found to be necessary for M-NP interaction and efficient VLP production. Here, we demonstrate that 15-residue-long, DLD-containing sequences derived from either the PIV5 or Nipah virus nucleocapsid protein C-terminal ends are sufficient to direct packaging of a foreign protein, Renilla luciferase, into budding VLPs. Mumps virus NP protein harbors DWD in place of the DLD sequence found in PIV5 NP protein, and consequently, PIV5 NP protein is incompatible with mumps virus M protein. A single amino acid change converting DLD to DWD within PIV5 NP protein induced compatibility between these proteins and allowed efficient production of mumps VLPs. Our data suggest a model in which paramyxoviruses share an overall common strategy for directing M-NP interactions but with important variations contained within DLD-like sequences that play key roles in defining M/NP protein compatibilities. IMPORTANCE Paramyxoviruses are responsible for a wide range of diseases that affect both humans and animals. Paramyxovirus pathogens include measles virus, mumps virus, human respiratory syncytial virus, and the zoonotic paramyxoviruses Nipah virus and Hendra virus. Infectivity of paramyxovirus particles depends on matrix-nucleocapsid protein interactions which enable efficient packaging of encapsidated viral RNA genomes into budding virions. In this study, we have defined regions near the C-terminal ends of paramyxovirus nucleocapsid proteins that are important for matrix protein interaction and that are sufficient to direct a foreign protein into budding particles. These results advance our basic understanding of paramyxovirus genome packaging interactions and also have implications for the potential use of virus-like particles as protein delivery tools. PMID:26792745
Ray, Greeshma; Schmitt, Phuong Tieu; Schmitt, Anthony P
2016-01-20
Paramyxovirus particles are formed by a budding process coordinated by viral matrix (M) proteins. M proteins coalesce at sites underlying infected cell membranes and induce other viral components, including viral glycoproteins and viral ribonucleoprotein complexes (vRNPs), to assemble at these locations from which particles bud. M proteins interact with the nucleocapsid (NP or N) components of vRNPs, and these interactions enable production of infectious, genome-containing virions. For the paramyxoviruses parainfluenza virus 5 (PIV5) and mumps virus, M-NP interaction also contributes to efficient production of virus-like particles (VLPs) in transfected cells. A DLD sequence near the C-terminal end of PIV5 NP protein was previously found to be necessary for M-NP interaction and efficient VLP production. Here, we demonstrate that 15-residue-long, DLD-containing sequences derived from either the PIV5 or Nipah virus nucleocapsid protein C-terminal ends are sufficient to direct packaging of a foreign protein, Renilla luciferase, into budding VLPs. Mumps virus NP protein harbors DWD in place of the DLD sequence found in PIV5 NP protein, and consequently, PIV5 NP protein is incompatible with mumps virus M protein. A single amino acid change converting DLD to DWD within PIV5 NP protein induced compatibility between these proteins and allowed efficient production of mumps VLPs. Our data suggest a model in which paramyxoviruses share an overall common strategy for directing M-NP interactions but with important variations contained within DLD-like sequences that play key roles in defining M/NP protein compatibilities. Paramyxoviruses are responsible for a wide range of diseases that affect both humans and animals. Paramyxovirus pathogens include measles virus, mumps virus, human respiratory syncytial virus, and the zoonotic paramyxoviruses Nipah virus and Hendra virus. Infectivity of paramyxovirus particles depends on matrix-nucleocapsid protein interactions which enable efficient packaging of encapsidated viral RNA genomes into budding virions. In this study, we have defined regions near the C-terminal ends of paramyxovirus nucleocapsid proteins that are important for matrix protein interaction and that are sufficient to direct a foreign protein into budding particles. These results advance our basic understanding of paramyxovirus genome packaging interactions and also have implications for the potential use of virus-like particles as protein delivery tools. Copyright © 2016, American Society for Microbiology. All Rights Reserved.
Zhu, Yuan O; Aw, Pauline P K; de Sessions, Paola Florez; Hong, Shuzhen; See, Lee Xian; Hong, Lewis Z; Wilm, Andreas; Li, Chen Hao; Hue, Stephane; Lim, Seng Gee; Nagarajan, Niranjan; Burkholder, William F; Hibberd, Martin
2017-10-27
Viral populations are complex, dynamic, and fast evolving. The evolution of groups of closely related viruses in a competitive environment is termed quasispecies. To fully understand the role that quasispecies play in viral evolution, characterizing the trajectories of viral genotypes in an evolving population is the key. In particular, long-range haplotype information for thousands of individual viruses is critical; yet generating this information is non-trivial. Popular deep sequencing methods generate relatively short reads that do not preserve linkage information, while third generation sequencing methods have higher error rates that make detection of low frequency mutations a bioinformatics challenge. Here we applied BAsE-Seq, an Illumina-based single-virion sequencing technology, to eight samples from four chronic hepatitis B (CHB) patients - once before antiviral treatment and once after viral rebound due to resistance. With single-virion sequencing, we obtained 248-8796 single-virion sequences per sample, which allowed us to find evidence for both hard and soft selective sweeps. We were able to reconstruct population demographic history that was independently verified by clinically collected data. We further verified four of the samples independently through PacBio SMRT and Illumina Pooled deep sequencing. Overall, we showed that single-virion sequencing yields insight into viral evolution and population dynamics in an efficient and high throughput manner. We believe that single-virion sequencing is widely applicable to the study of viral evolution in the context of drug resistance and host adaptation, allows differentiation between soft or hard selective sweeps, and may be useful in the reconstruction of intra-host viral population demographic history.
Iftikhar, Romana; Ashfaq, Muhammad; Rasool, Akhtar; Hebert, Paul D N
2016-01-01
Although thrips are globally important crop pests and vectors of viral disease, species identifications are difficult because of their small size and inconspicuous morphological differences. Sequence variation in the mitochondrial COI-5' (DNA barcode) region has proven effective for the identification of species in many groups of insect pests. We analyzed barcode sequence variation among 471 thrips from various plant hosts in north-central Pakistan. The Barcode Index Number (BIN) system assigned these sequences to 55 BINs, while the Automatic Barcode Gap Discovery detected 56 partitions, a count that coincided with the number of monophyletic lineages recognized by Neighbor-Joining analysis and Bayesian inference. Congeneric species showed an average of 19% sequence divergence (range = 5.6% - 27%) at COI, while intraspecific distances averaged 0.6% (range = 0.0% - 7.6%). BIN analysis suggested that all intraspecific divergence >3.0% actually involved a species complex. In fact, sequences for three major pest species (Haplothrips reuteri, Thrips palmi, Thrips tabaci), and one predatory thrips (Aeolothrips intermedius) showed deep intraspecific divergences, providing evidence that each is a cryptic species complex. The study compiles the first barcode reference library for the thrips of Pakistan, and examines global haplotype diversity in four important pest thrips.
2017-01-01
ABSTRACT RNA viruses are one of the fastest-evolving biological entities. Within their hosts, they exist as genetically diverse populations (i.e., viral mutant swarms), which are sculpted by different evolutionary mechanisms, such as mutation, natural selection, and genetic drift, and also the interactions between genetic variants within the mutant swarms. To elucidate the mechanisms that modulate the population diversity of an important plant-pathogenic virus, we performed evolution experiments with Potato virus Y (PVY) in potato genotypes that differ in their defense response against the virus. Using deep sequencing of small RNAs, we followed the temporal dynamics of standing and newly generated variations in the evolving viral lineages. A time-sampled approach allowed us to (i) reconstruct theoretical haplotypes in the starting population by using clustering of single nucleotide polymorphisms' trajectories and (ii) use quantitative population genetics approaches to estimate the contribution of selection and genetic drift, and their interplay, to the evolution of the virus. We detected imprints of strong selective sweeps and narrow genetic bottlenecks, followed by the shift in frequency of selected haplotypes. Comparison of patterns of viral evolution in differently susceptible host genotypes indicated possible diversifying evolution of PVY in the less-susceptible host (efficient in the accumulation of salicylic acid). IMPORTANCE High diversity of within-host populations of RNA viruses is an important aspect of their biology, since they represent a reservoir of genetic variants, which can enable quick adaptation of viruses to a changing environment. This study focuses on an important plant virus, Potato virus Y, and describes, at high resolution, temporal changes in the structure of viral populations within different potato genotypes. A novel and easy-to-implement computational approach was established to cluster single nucleotide polymorphisms into viral haplotypes from very short sequencing reads. During the experiment, a shift in the frequency of selected viral haplotypes was observed after a narrow genetic bottleneck, indicating an important role of the genetic drift in the evolution of the virus. On the other hand, a possible case of diversifying selection of the virus was observed in less susceptible host genotypes. PMID:28592544
Cross-host evolution of severe acute respiratory syndrome coronavirus in palm civet and human
Song, Huai-Dong; Tu, Chang-Chun; Zhang, Guo-Wei; Wang, Sheng-Yue; Zheng, Kui; Lei, Lian-Cheng; Chen, Qiu-Xia; Gao, Yu-Wei; Zhou, Hui-Qiong; Xiang, Hua; Zheng, Hua-Jun; Chern, Shur-Wern Wang; Cheng, Feng; Pan, Chun-Ming; Xuan, Hua; Chen, Sai-Juan; Luo, Hui-Ming; Zhou, Duan-Hua; Liu, Yu-Fei; He, Jian-Feng; Qin, Peng-Zhe; Li, Ling-Hui; Ren, Yu-Qi; Liang, Wen-Jia; Yu, Ye-Dong; Anderson, Larry; Wang, Ming; Xu, Rui-Heng; Wu, Xin-Wei; Zheng, Huan-Ying; Chen, Jin-Ding; Liang, Guodong; Gao, Yang; Liao, Ming; Fang, Ling; Jiang, Li-Yun; Li, Hui; Chen, Fang; Di, Biao; He, Li-Juan; Lin, Jin-Yan; Tong, Suxiang; Kong, Xiangang; Du, Lin; Hao, Pei; Tang, Hua; Bernini, Andrea; Yu, Xiao-Jing; Spiga, Ottavia; Guo, Zong-Ming; Pan, Hai-Yan; He, Wei-Zhong; Manuguerra, Jean-Claude; Fontanet, Arnaud; Danchin, Antoine; Niccolai, Neri; Li, Yi-Xue; Wu, Chung-I; Zhao, Guo-Ping
2005-01-01
The genomic sequences of severe acute respiratory syndrome coronaviruses from human and palm civet of the 2003/2004 outbreak in the city of Guangzhou, China, were nearly identical. Phylogenetic analysis suggested an independent viral invasion from animal to human in this new episode. Combining all existing data but excluding singletons, we identified 202 single-nucleotide variations. Among them, 17 are polymorphic in palm civets only. The ratio of nonsynonymous/synonymous nucleotide substitution in palm civets collected 1 yr apart from different geographic locations is very high, suggesting a rapid evolving process of viral proteins in civet as well, much like their adaptation in the human host in the early 2002–2003 epidemic. Major genetic variations in some critical genes, particularly the Spike gene, seemed essential for the transition from animal-to-human transmission to human-to-human transmission, which eventually caused the first severe acute respiratory syndrome outbreak of 2002/2003. PMID:15695582
Genotype-specific variation in West Nile virus dispersal in California.
Duggal, Nisha K; Reisen, William K; Fang, Ying; Newman, Ruchi M; Yang, Xiao; Ebel, Gregory D; Brault, Aaron C
2015-11-01
West Nile virus (WNV) is an arbovirus that was first reported in North America in New York in 1999 and, by 2003, had spread more than 4000 km to California. However, variation in viral genetics associated with spread is not well understood. Herein, we report sequences for more than 100 WNV isolates made from mosquito pools that were collected from 2003 to 2011 as part of routine surveillance by the California Mosquito-borne Virus Surveillance System. We performed phylogeographic analyses and demonstrated that 5 independent introductions of WNV (1 WN02 genotype strain and 4 SW03 genotype strains) occurred in California. The SW03 genotype of WNV was constrained to the southwestern U.S. and had a more rapid rate of spread. In addition, geographic constraint of WNV strains within a single region for up to 6 years suggest viral maintenance has been driven by resident, rather than migratory, birds and overwintering in mosquitoes. Copyright © 2015 The Authors. Published by Elsevier Inc. All rights reserved.
Díaz-Muñoz, Samuel L
2017-01-01
Infection of more than one virus in a host, coinfection, is common across taxa and environments. Viral coinfection can enable genetic exchange, alter the dynamics of infections, and change the course of viral evolution. Yet, a systematic test of the factors explaining variation in viral coinfection across different taxa and environments awaits completion. Here I employ three microbial data sets of virus-host interactions covering cross-infectivity, culture coinfection, and single-cell coinfection (total: 6,564 microbial hosts, 13,103 viruses) to provide a broad, comprehensive picture of the ecological and biological factors shaping viral coinfection. I found evidence that ecology and virus-virus interactions are recurrent factors shaping coinfection patterns. Host ecology was a consistent and strong predictor of coinfection across all three data sets: cross-infectivity, culture coinfection, and single-cell coinfection. Host phylogeny or taxonomy was a less consistent predictor, being weak or absent in the cross-infectivity and single-cell coinfection models, yet it was the strongest predictor in the culture coinfection model. Virus-virus interactions strongly affected coinfection. In the largest test of superinfection exclusion to date, prophage sequences reduced culture coinfection by other prophages, with a weaker effect on extrachromosomal virus coinfection. At the single-cell level, prophage sequences eliminated coinfection. Virus-virus interactions also increased culture coinfection with ssDNA-dsDNA coinfections >2× more likely than ssDNA-only coinfections. The presence of CRISPR spacers was associated with a ∼50% reduction in single-cell coinfection in a marine bacteria, despite the absence of exact spacer matches in any active infection. Collectively, these results suggest the environment bacteria inhabit and the interactions among surrounding viruses are two factors consistently shaping viral coinfection patterns. These findings highlight the role of virus-virus interactions in coinfection with implications for phage therapy, microbiome dynamics, and viral infection treatments.
Fung, Elisabeth; Hill, Kelly; Hogendoorn, Katja; Glatz, Richard V; Napier, Kathryn R; Bellgard, Matthew I; Barrero, Roberto A
2018-02-01
Bee pollination is critical for improving productivity of one third of all plants or plant products consumed by humans. The health of honey bees is in decline in many countries worldwide, and RNA viruses together with other biological, environmental and anthropogenic factors have been identified as the main causes. The rapid genetic variation of viruses represents a challenge for diagnosis. Thus, application of deep sequencing methods for detection and analysis of viruses has increased over the last years. In this study, we leverage from the innate Dicer-2 mediated antiviral response against viruses to reconstruct complete viral genomes using virus-derived small interfering RNAs (vsiRNAs). Symptomatic A. mellifera larvae collected from hives free of Colony Collapse Disorder (CCD) and the parasitic Varroa mite (Varroa destructor) were used to generate more than 107 million small RNA reads. We show that de novo assembly of insect viral sequences is less fragmented using only 22 nt long vsiRNAs rather than a combination of 21-22 nt small RNAs. Our results show that A. mellifera larvae activate the RNAi immune response in the presence of Sacbrood virus (SBV). We assembled three SBV genomes from three individual larvae from different hives in a single apiary, with 1-2% nucleotide sequence variability among them. We found 3-4% variability between SBV genomes generated in this study and earlier published Australian variants suggesting the presence of different SBV quasispecies within the country. Copyright © 2018. Published by Elsevier Inc.
Zhang, Qian; Jun, Se -Ran; Leuze, Michael; ...
2017-01-19
The development of rapid, economical genome sequencing has shed new light on the classification of viruses. As of October 2016, the National Center for Biotechnology Information (NCBI) database contained >2 million viral genome sequences and a reference set of ~4000 viral genome sequences that cover a wide range of known viral families. Whole-genome sequences can be used to improve viral classification and provide insight into the viral tree of life . However, due to the lack of evolutionary conservation amongst diverse viruses, it is not feasible to build a viral tree of life using traditional phylogenetic methods based on conservedmore » proteins. In this study, we used an alignment-free method that uses k-mers as genomic features for a large-scale comparison of complete viral genomes available in RefSeq. To determine the optimal feature length, k (an essential step in constructing a meaningful dendrogram), we designed a comprehensive strategy that combines three approaches: (1) cumulative relative entropy, (2) average number of common features among genomes, and (3) the Shannon diversity index. This strategy was used to determine k for all 3,905 complete viral genomes in RefSeq. Lastly, the resulting dendrogram shows consistency with the viral taxonomy of the ICTV and the Baltimore classification of viruses.« less
DOE Office of Scientific and Technical Information (OSTI.GOV)
Zhang, Qian; Jun, Se -Ran; Leuze, Michael
The development of rapid, economical genome sequencing has shed new light on the classification of viruses. As of October 2016, the National Center for Biotechnology Information (NCBI) database contained >2 million viral genome sequences and a reference set of ~4000 viral genome sequences that cover a wide range of known viral families. Whole-genome sequences can be used to improve viral classification and provide insight into the viral tree of life . However, due to the lack of evolutionary conservation amongst diverse viruses, it is not feasible to build a viral tree of life using traditional phylogenetic methods based on conservedmore » proteins. In this study, we used an alignment-free method that uses k-mers as genomic features for a large-scale comparison of complete viral genomes available in RefSeq. To determine the optimal feature length, k (an essential step in constructing a meaningful dendrogram), we designed a comprehensive strategy that combines three approaches: (1) cumulative relative entropy, (2) average number of common features among genomes, and (3) the Shannon diversity index. This strategy was used to determine k for all 3,905 complete viral genomes in RefSeq. Lastly, the resulting dendrogram shows consistency with the viral taxonomy of the ICTV and the Baltimore classification of viruses.« less
Zhang, Qian; Jun, Se-Ran; Leuze, Michael; Ussery, David; Nookaew, Intawat
2017-01-01
The development of rapid, economical genome sequencing has shed new light on the classification of viruses. As of October 2016, the National Center for Biotechnology Information (NCBI) database contained >2 million viral genome sequences and a reference set of ~4000 viral genome sequences that cover a wide range of known viral families. Whole-genome sequences can be used to improve viral classification and provide insight into the viral “tree of life”. However, due to the lack of evolutionary conservation amongst diverse viruses, it is not feasible to build a viral tree of life using traditional phylogenetic methods based on conserved proteins. In this study, we used an alignment-free method that uses k-mers as genomic features for a large-scale comparison of complete viral genomes available in RefSeq. To determine the optimal feature length, k (an essential step in constructing a meaningful dendrogram), we designed a comprehensive strategy that combines three approaches: (1) cumulative relative entropy, (2) average number of common features among genomes, and (3) the Shannon diversity index. This strategy was used to determine k for all 3,905 complete viral genomes in RefSeq. The resulting dendrogram shows consistency with the viral taxonomy of the ICTV and the Baltimore classification of viruses. PMID:28102365
Lenzmeier, B A; Giebler, H A; Nyborg, J K
1998-02-01
Efficient human T-cell leukemia virus type 1 (HTLV-1) replication and viral gene expression are dependent upon the virally encoded oncoprotein Tax. To activate HTLV-1 transcription, Tax interacts with the cellular DNA binding protein cyclic AMP-responsive element binding protein (CREB) and recruits the coactivator CREB binding protein (CBP), forming a nucleoprotein complex on the three viral cyclic AMP-responsive elements (CREs) in the HTLV-1 promoter. Short stretches of dG-dC-rich (GC-rich) DNA, immediately flanking each of the viral CREs, are essential for Tax recruitment of CBP in vitro and Tax transactivation in vivo. Although the importance of the viral CRE-flanking sequences is well established, several studies have failed to identify an interaction between Tax and the DNA. The mechanistic role of the viral CRE-flanking sequences has therefore remained enigmatic. In this study, we used high resolution methidiumpropyl-EDTA iron(II) footprinting to show that Tax extended the CREB footprint into the GC-rich DNA flanking sequences of the viral CRE. The Tax-CREB footprint was enhanced but not extended by the KIX domain of CBP, suggesting that the coactivator increased the stability of the nucleoprotein complex. Conversely, the footprint pattern of CREB on a cellular CRE lacking GC-rich flanking sequences did not change in the presence of Tax or Tax plus KIX. The minor-groove DNA binding drug chromomycin A3 bound to the GC-rich flanking sequences and inhibited the association of Tax and the Tax-CBP complex without affecting CREB binding. Tax specifically cross-linked to the viral CRE in the 5'-flanking sequence, and this cross-link was blocked by chromomycin A3. Together, these data support a model where Tax interacts directly with both CREB and the minor-groove viral CRE-flanking sequences to form a high-affinity binding site for the recruitment of CBP to the HTLV-1 promoter.
Distinct families of cis-acting RNA replication elements epsilon from hepatitis B viruses
Chen, Augustine; Brown, Chris
2012-01-01
The hepadnavirus encapsidation signal, epsilon (ε), is an RNA structure located at the 5′ end of the viral pregenomic RNA. It is essential for viral replication and functions in polymerase protein binding and priming. This structure could also have potential regulatory roles in controlling the expression of viral replicative proteins. In addition to its structure, the primary sequence of this RNA element has crucial functional roles in the viral lifecycle. Although the ε elements in hepadnaviruses share common critical functions, there are some significant differences in mammalian and avian hepadnaviruses, which include both sequence and structural variations. Here we present several covariance models for ε elements from the Hepadnaviridae. The model building included experimentally determined data from previous studies using chemical probing and NMR analysis. These models have sufficient similarity to comprise a clan. The clan has in common a highly conserved overall structure consisting of a lower-stem, bulge, upper-stem and apical-loop. The models differ in functionally critical regions—notably the two types of avian ε elements have a tetra-loop (UGUU) including a non-canonical UU base pair, while the hepatitis B virus (HBV) epsilon has a tri-loop (UGU). The avian epsilon elements have a less stable dynamic structure in the upper stem. Comparisons between these models and all other Rfam models, and searches of genomes, showed these structures are specific to the Hepadnaviridae. Two family models and the clan are available from the Rfam database. PMID:22418844
Metagenomic characterization of airborne viral DNA diversity in the near-surface atmosphere.
Whon, Tae Woong; Kim, Min-Soo; Roh, Seong Woon; Shin, Na-Ri; Lee, Hae-Won; Bae, Jin-Woo
2012-08-01
Airborne viruses are expected to be ubiquitous in the atmosphere but they still remain poorly understood. This study investigated the temporal and spatial dynamics of airborne viruses and their genotypic characteristics in air samples collected from three distinct land use types (a residential district [RD], a forest [FR], and an industrial complex [IC]) and from rainwater samples freshly precipitated at the RD site (RD-rain). Viral abundance exhibited a seasonal fluctuation in the range between 1.7 × 10(6) and 4.0 × 10(7) viruses m(-3), which increased from autumn to winter and decreased toward spring, but no significant spatial differences were observed. Temporal variations in viral abundance were inversely correlated with seasonal changes in temperature and absolute humidity. Metagenomic analysis of air viromes amplified by rolling-circle phi29 polymerase-based random hexamer priming indicated the dominance of plant-associated single-stranded DNA (ssDNA) geminivirus-related viruses, followed by animal-infecting circovirus-related sequences, with low numbers of nanoviruses and microphages-related genomes. Particularly, the majority of the geminivirus-related viruses were closely related to ssDNA mycoviruses that infect plant-pathogenic fungi. Phylogenetic analysis based on the replication initiator protein sequence indicated that the airborne ssDNA viruses were distantly related to known ssDNA viruses, suggesting that a high diversity of viruses were newly discovered. This research is the first to report the seasonality of airborne viruses and their genetic diversity, which enhances our understanding of viral ecology in temperate regions.
Wang, Jin; Dong, Hongping; Chionh, Yok Hian; McBee, Megan E.; Sirirungruang, Sasilada; Cunningham, Richard P.; Shi, Pei-Yong; Dedon, Peter C.
2016-01-01
The misincorporation of 2′-deoxyribonucleotides (dNs) into RNA has important implications for the function of non-coding RNAs, the translational fidelity of coding RNAs and the mutagenic evolution of viral RNA genomes. However, quantitative appreciation for the degree to which dN misincorporation occurs is limited by the lack of analytical tools. Here, we report a method to hydrolyze RNA to release 2′-deoxyribonucleotide-ribonucleotide pairs (dNrN) that are then quantified by chromatography-coupled mass spectrometry (LC-MS). Using this platform, we found misincorporated dNs occurring at 1 per 103 to 105 ribonucleotide (nt) in mRNA, rRNAs and tRNA in human cells, Escherichia coli, Saccharomyces cerevisiae and, most abundantly, in the RNA genome of dengue virus. The frequency of dNs varied widely among organisms and sequence contexts, and partly reflected the in vitro discrimination efficiencies of different RNA polymerases against 2′-deoxyribonucleoside 5′-triphosphates (dNTPs). Further, we demonstrate a strong link between dN frequencies in RNA and the balance of dNTPs and ribonucleoside 5′-triphosphates (rNTPs) in the cellular pool, with significant stress-induced variation of dN incorporation. Potential implications of dNs in RNA are discussed, including the possibilities of dN incorporation in RNA as a contributing factor in viral evolution and human disease, and as a host immune defense mechanism against viral infections. PMID:27365049
Metagenomic Characterization of Airborne Viral DNA Diversity in the Near-Surface Atmosphere
Whon, Tae Woong; Kim, Min-Soo; Roh, Seong Woon; Shin, Na-Ri; Lee, Hae-Won
2012-01-01
Airborne viruses are expected to be ubiquitous in the atmosphere but they still remain poorly understood. This study investigated the temporal and spatial dynamics of airborne viruses and their genotypic characteristics in air samples collected from three distinct land use types (a residential district [RD], a forest [FR], and an industrial complex [IC]) and from rainwater samples freshly precipitated at the RD site (RD-rain). Viral abundance exhibited a seasonal fluctuation in the range between 1.7 × 106 and 4.0 × 107 viruses m−3, which increased from autumn to winter and decreased toward spring, but no significant spatial differences were observed. Temporal variations in viral abundance were inversely correlated with seasonal changes in temperature and absolute humidity. Metagenomic analysis of air viromes amplified by rolling-circle phi29 polymerase-based random hexamer priming indicated the dominance of plant-associated single-stranded DNA (ssDNA) geminivirus-related viruses, followed by animal-infecting circovirus-related sequences, with low numbers of nanoviruses and microphages-related genomes. Particularly, the majority of the geminivirus-related viruses were closely related to ssDNA mycoviruses that infect plant-pathogenic fungi. Phylogenetic analysis based on the replication initiator protein sequence indicated that the airborne ssDNA viruses were distantly related to known ssDNA viruses, suggesting that a high diversity of viruses were newly discovered. This research is the first to report the seasonality of airborne viruses and their genetic diversity, which enhances our understanding of viral ecology in temperate regions. PMID:22623790
Leung, Preston; Eltahla, Auda A; Lloyd, Andrew R; Bull, Rowena A; Luciani, Fabio
2017-07-15
With the advent of affordable deep sequencing technologies, detection of low frequency variants within genetically diverse viral populations can now be achieved with unprecedented depth and efficiency. The high-resolution data provided by next generation sequencing technologies is currently recognised as the gold standard in estimation of viral diversity. In the analysis of rapidly mutating viruses, longitudinal deep sequencing datasets from viral genomes during individual infection episodes, as well as at the epidemiological level during outbreaks, now allow for more sophisticated analyses such as statistical estimates of the impact of complex mutation patterns on the evolution of the viral populations both within and between hosts. These analyses are revealing more accurate descriptions of the evolutionary dynamics that underpin the rapid adaptation of these viruses to the host response, and to drug therapies. This review assesses recent developments in methods and provide informative research examples using deep sequencing data generated from rapidly mutating viruses infecting humans, particularly hepatitis C virus (HCV), human immunodeficiency virus (HIV), Ebola virus and influenza virus, to understand the evolution of viral genomes and to explore the relationship between viral mutations and the host adaptive immune response. Finally, we discuss limitations in current technologies, and future directions that take advantage of publically available large deep sequencing datasets. Copyright © 2016 Elsevier B.V. All rights reserved.
Blanc, Hervé; Bordería, Antonio V.; Díaz, Gisell; Henningsson, Rasmus; Gonzalez, Daniel; Santana, Emidalys; Alvarez, Mayling; Castro, Osvaldo; Fontes, Magnus; Vignuzzi, Marco; Guzman, Maria G.
2016-01-01
ABSTRACT During the dengue virus type 3 (DENV-3) epidemic that occurred in Havana in 2001 to 2002, severe disease was associated with the infection sequence DENV-1 followed by DENV-3 (DENV-1/DENV-3), while the sequence DENV-2/DENV-3 was associated with mild/asymptomatic infections. To determine the role of the virus in the increasing severity demonstrated during the epidemic, serum samples collected at different time points were studied. A total of 22 full-length sequences were obtained using a deep-sequencing approach. Bayesian phylogenetic analysis of consensus sequences revealed that two DENV-3 lineages were circulating in Havana at that time, both grouped within genotype III. The predominant lineage is closely related to Peruvian and Ecuadorian strains, while the minor lineage is related to Venezuelan strains. According to consensus sequences, relatively few nonsynonymous mutations were observed; only one was fixed during the epidemic at position 4380 in the NS2B gene. Intrahost genetic analysis indicated that a significant minor population was selected and became predominant toward the end of the epidemic. In conclusion, greater variability was detected during the epidemic's progression in terms of significant minority variants, particularly in the nonstructural genes. An increasing trend of genetic diversity toward the end of the epidemic was observed only for synonymous variant allele rates, with higher variability in secondary cases. Remarkably, significant intrahost genetic variation was demonstrated within the same patient during the course of secondary infection with DENV-1/DENV-3, including changes in the structural proteins premembrane (PrM) and envelope (E). Therefore, the dynamic of evolving viral populations in the context of heterotypic antibodies could be related to the increasing clinical severity observed during the epidemic. IMPORTANCE Based on the evidence that DENV fitness is context dependent, our research has focused on the study of viral factors associated with intraepidemic increasing severity in a unique epidemiological setting. Here, we investigated the intrahost genetic diversity in acute human samples collected at different time points during the DENV-3 epidemic that occurred in Cuba in 2001 to 2002 using a deep-sequencing approach. We concluded that greater variability in significant minor populations occurred as the epidemic progressed, particularly in the nonstructural genes, with higher variability observed in secondary infection cases. Remarkably, for the first time significant intrahost genetic variation was demonstrated within the same patient during the course of secondary infection with DENV-1/DENV-3, including changes in structural proteins. These findings indicate that high-resolution approaches are needed to unravel molecular mechanisms involved in dengue pathogenesis. PMID:26889031
Lewis, Jo E.; Brameld, John M.; Hill, Phil; Barrett, Perry; Ebling, Francis J.P.; Jethwa, Preeti H.
2015-01-01
Introduction The viral 2A sequence has become an attractive alternative to the traditional internal ribosomal entry site (IRES) for simultaneous over-expression of two genes and in combination with recombinant adeno-associated viruses (rAAV) has been used to manipulate gene expression in vitro. New method To develop a rAAV construct in combination with the viral 2A sequence to allow long-term over-expression of the vgf gene and fluorescent marker gene for tracking of the transfected neurones in vivo. Results Transient transfection of the AAV plasmid containing the vgf gene, viral 2A sequence and eGFP into SH-SY5Y cells resulted in eGFP fluorescence comparable to a commercially available reporter construct. This increase in fluorescent cells was accompanied by an increase in VGF mRNA expression. Infusion of the rAAV vector containing the vgf gene, viral 2A sequence and eGFP resulted in eGFP fluorescence in the hypothalamus of both mice and Siberian hamsters, 32 weeks post infusion. In situ hybridisation confirmed that the location of VGF mRNA expression in the hypothalamus corresponded to the eGFP pattern of fluorescence. Comparison with old method The viral 2A sequence is much smaller than the traditional IRES and therefore allowed over-expression of the vgf gene with fluorescent tracking without compromising viral capacity. Conclusion The use of the viral 2A sequence in the AAV plasmid allowed the simultaneous expression of both genes in vitro. When used in combination with rAAV it resulted in long-term over-expression of both genes at equivalent locations in the hypothalamus of both Siberian hamsters and mice, without any adverse effects. PMID:26300182
Kazemian, Majid; Ren, Min; Lin, Jian-Xin; Liao, Wei; Spolski, Rosanne
2015-01-01
ABSTRACT Viruses are causally associated with a number of human malignancies. In this study, we sought to identify new virus-cancer associations by searching RNA sequencing data sets from >2,000 patients, encompassing 21 cancers from The Cancer Genome Atlas (TCGA), for the presence of viral sequences. In agreement with previous studies, we found human papillomavirus 16 (HPV16) and HPV18 in oropharyngeal cancer and hepatitis B and C viruses in liver cancer. Unexpectedly, however, we found HPV38, a cutaneous form of HPV associated with skin cancer, in 32 of 168 samples from endometrial cancer. In 12 of the HPV38-positive (HPV38+) samples, we observed at least one paired read that mapped to both human and HPV38 genomes, indicative of viral integration into the host DNA, something not previously demonstrated for HPV38. The expression levels of HPV38 transcripts were relatively low, and all 32 HPV38+ samples belonged to the same experimental batch of 40 samples, whereas none of the other 128 endometrial carcinoma samples were HPV38+, raising doubts about the significance of the HPV38 association. Moreover, the HPV38+ samples contained the same 10 novel single nucleotide variations (SNVs), leading us to hypothesize that one patient was infected with this new isolate of HPV38, which was integrated into his/her genome and may have cross-contaminated other TCGA samples within batch 228. Based on our analysis, we propose guidelines to examine the batch effect, virus expression level, and SNVs as part of next-generation sequencing (NGS) data analysis for evaluating the significance of viral/pathogen sequences in clinical samples. IMPORTANCE High-throughput RNA sequencing (RNA-Seq), followed by computational analysis, has vastly accelerated the identification of viral and other pathogenic sequences in clinical samples, but cross-contamination during the processing of the samples remain a major problem that can lead to erroneous conclusions. We found HPV38 sequences specifically present in RNA-Seq samples from endometrial cancer patients from TCGA, a virus not previously associated with this type of cancer. However, multiple lines of evidence suggest possible cross-contamination in these samples, which were processed together in the same batch. Despite this potential cross-contamination, our data indicate that we have detected a new isolate of HPV38 that appears to be integrated into the human genome. We also provide general guidelines for computational detection and interpretation of pathogen-disease associations. PMID:26085148
Kazemian, Majid; Ren, Min; Lin, Jian-Xin; Liao, Wei; Spolski, Rosanne; Leonard, Warren J
2015-09-01
Viruses are causally associated with a number of human malignancies. In this study, we sought to identify new virus-cancer associations by searching RNA sequencing data sets from >2,000 patients, encompassing 21 cancers from The Cancer Genome Atlas (TCGA), for the presence of viral sequences. In agreement with previous studies, we found human papillomavirus 16 (HPV16) and HPV18 in oropharyngeal cancer and hepatitis B and C viruses in liver cancer. Unexpectedly, however, we found HPV38, a cutaneous form of HPV associated with skin cancer, in 32 of 168 samples from endometrial cancer. In 12 of the HPV38-positive (HPV38(+)) samples, we observed at least one paired read that mapped to both human and HPV38 genomes, indicative of viral integration into the host DNA, something not previously demonstrated for HPV38. The expression levels of HPV38 transcripts were relatively low, and all 32 HPV38(+) samples belonged to the same experimental batch of 40 samples, whereas none of the other 128 endometrial carcinoma samples were HPV38(+), raising doubts about the significance of the HPV38 association. Moreover, the HPV38(+) samples contained the same 10 novel single nucleotide variations (SNVs), leading us to hypothesize that one patient was infected with this new isolate of HPV38, which was integrated into his/her genome and may have cross-contaminated other TCGA samples within batch 228. Based on our analysis, we propose guidelines to examine the batch effect, virus expression level, and SNVs as part of next-generation sequencing (NGS) data analysis for evaluating the significance of viral/pathogen sequences in clinical samples. High-throughput RNA sequencing (RNA-Seq), followed by computational analysis, has vastly accelerated the identification of viral and other pathogenic sequences in clinical samples, but cross-contamination during the processing of the samples remain a major problem that can lead to erroneous conclusions. We found HPV38 sequences specifically present in RNA-Seq samples from endometrial cancer patients from TCGA, a virus not previously associated with this type of cancer. However, multiple lines of evidence suggest possible cross-contamination in these samples, which were processed together in the same batch. Despite this potential cross-contamination, our data indicate that we have detected a new isolate of HPV38 that appears to be integrated into the human genome. We also provide general guidelines for computational detection and interpretation of pathogen-disease associations. Copyright © 2015, American Society for Microbiology. All Rights Reserved.
Mapping HLA-A2, -A3 and -B7 supertype-restricted T-cell epitopes in the ebolavirus proteome.
Lim, Wan Ching; Khan, Asif M
2018-01-19
Ebolavirus (EBOV) is responsible for one of the most fatal diseases encountered by mankind. Cellular T-cell responses have been implicated to be important in providing protection against the virus. Antigenic variation can result in viral escape from immune recognition. Mapping targets of immune responses among the sequence of viral proteins is, thus, an important first step towards understanding the immune responses to viral variants and can aid in the identification of vaccine targets. Herein, we performed a large-scale, proteome-wide mapping and diversity analyses of putative HLA supertype-restricted T-cell epitopes of Zaire ebolavirus (ZEBOV), the most pathogenic species among the EBOV family. All publicly available ZEBOV sequences (14,098) for each of the nine viral proteins were retrieved, removed of irrelevant and duplicate sequences, and aligned. The overall proteome diversity of the non-redundant sequences was studied by use of Shannon's entropy. The sequences were predicted, by use of the NetCTLpan server, for HLA-A2, -A3, and -B7 supertype-restricted epitopes, which are relevant to African and other ethnicities and provide for large (~86%) population coverage. The predicted epitopes were mapped to the alignment of each protein for analyses of antigenic sequence diversity and relevance to structure and function. The putative epitopes were validated by comparison with experimentally confirmed epitopes. ZEBOV proteome was generally conserved, with an average entropy of 0.16. The 185 HLA supertype-restricted T-cell epitopes predicted (82 (A2), 37 (A3) and 66 (B7)) mapped to 125 alignment positions and covered ~24% of the proteome length. Many of the epitopes showed a propensity to co-localize at select positions of the alignment. Thirty (30) of the mapped positions were completely conserved and may be attractive for vaccine design. The remaining (95) positions had one or more epitopes, with or without non-epitope variants. A significant number (24) of the putative epitopes matched reported experimentally validated HLA ligands/T-cell epitopes of A2, A3 and/or B7 supertype representative allele restrictions. The epitopes generally corresponded to functional motifs/domains and there was no correlation to localization on the protein 3D structure. These data and the epitope map provide important insights into the interaction between EBOV and the host immune system.
Hölzemer, Angelique; Thobakgale, Christina F; Jimenez Cruz, Camilo A; Garcia-Beltran, Wilfredo F; Carlson, Jonathan M; van Teijlingen, Nienke H; Mann, Jaclyn K; Jaggernath, Manjeetha; Kang, Seung-gu; Körner, Christian; Chung, Amy W; Schafer, Jamie L; Evans, David T; Alter, Galit; Walker, Bruce D; Goulder, Philip J; Carrington, Mary; Hartmann, Pia; Pertel, Thomas; Zhou, Ruhong; Ndung'u, Thumbi; Altfeld, Marcus
2015-11-01
Viruses can evade immune surveillance, but the underlying mechanisms are insufficiently understood. Here, we sought to understand the mechanisms by which natural killer (NK) cells recognize HIV-1-infected cells and how this virus can evade NK-cell-mediated immune pressure. Two sequence mutations in p24 Gag associated with the presence of specific KIR/HLA combined genotypes were identified in HIV-1 clade C viruses from a large cohort of infected, untreated individuals in South Africa (n = 392), suggesting viral escape from KIR+ NK cells through sequence variations within HLA class I-presented epitopes. One sequence polymorphism at position 303 of p24 Gag (TGag303V), selected for in infected individuals with both KIR2DL3 and HLA-C*03:04, enabled significantly better binding of the inhibitory KIR2DL3 receptor to HLA-C*03:04-expressing cells presenting this variant epitope compared to the wild-type epitope (wild-type mean 18.01 ± 10.45 standard deviation [SD] and variant mean 44.67 ± 14.42 SD, p = 0.002). Furthermore, activation of primary KIR2DL3+ NK cells from healthy donors in response to HLA-C*03:04+ target cells presenting the variant epitope was significantly reduced in comparison to cells presenting the wild-type sequence (wild-type mean 0.78 ± 0.07 standard error of the mean [SEM] and variant mean 0.63 ± 0.07 SEM, p = 0.012). Structural modeling and surface plasmon resonance of KIR/peptide/HLA interactions in the context of the different viral sequence variants studied supported these results. Future studies will be needed to assess processing and antigen presentation of the investigated HIV-1 epitope in natural infection, and the consequences for viral control. These data provide novel insights into how viruses can evade NK cell immunity through the selection of mutations in HLA-presented epitopes that enhance binding to inhibitory NK cell receptors. Better understanding of the mechanisms by which HIV-1 evades NK-cell-mediated immune pressure and the functional validation of a structural modeling approach will facilitate the development of novel targeted immune interventions to harness the antiviral activities of NK cells.
Tsai, Hsiang-Jung; Tseng, Chun-hsien; Chang, Poa-chun; Mei, Kai; Wang, Shih-Chi
2004-09-01
To understand the genetic variations between the field strains of waterfowl parvoviruses and their attenuated derivatives, we analyzed the complete nucleotide sequences of the viral protein 1 (VP1) genes of nine field strains and two vaccine strains of waterfowl parvoviruses. Sequence comparison of the VP1 proteins showed that these viruses could be divided into goose parvovirus (GPV) related and Muscovy duck parvovirus (MDPV) related groups. The amino acid difference between GPV- and MDPV-related groups ranged from 13.1% to 15.8%, and the most variable region resided in the N terminus of VP2. The vaccine strains of GPV and MDPV exhibited only 1.2% and 0.3% difference in amino acid when compared with their parental field strains, and most of these differences resided in residues 497-575 of VP1, suggesting that these residues might be important for the attenuation of GPV and MDPV. When the GPV strains isolated in 1982 (the strain 82-0308) and in 2001 (the strain 01-1001) were compared, only 0.3% difference in amino acid was found, while MDPV strains isolated in 1990 (the strain 90-0219) and 1997 (the strain 97-0104) showed only 0.4% difference in amino acid. The result indicates that the genome of waterfowl parvovirus had remained highly stable in the field.
Cornelissen, Marion; Gall, Astrid; Vink, Monique; Zorgdrager, Fokla; Binter, Špela; Edwards, Stephanie; Jurriaans, Suzanne; Bakker, Margreet; Ong, Swee Hoe; Gras, Luuk; van Sighem, Ard; Bezemer, Daniela; de Wolf, Frank; Reiss, Peter; Kellam, Paul; Berkhout, Ben; Fraser, Christophe; van der Kuyl, Antoinette C
2017-07-15
The BEEHIVE (Bridging the Evolution and Epidemiology of HIV in Europe) project aims to analyse nearly-complete viral genomes from >3000 HIV-1 infected Europeans using high-throughput deep sequencing techniques to investigate the virus genetic contribution to virulence. Following the development of a computational pipeline, including a new de novo assembler for RNA virus genomes, to generate larger contiguous sequences (contigs) from the abundance of short sequence reads that characterise the data, another area that determines genome sequencing success is the quality and quantity of the input RNA. A pilot experiment with 125 patient plasma samples was performed to investigate the optimal method for isolation of HIV-1 viral RNA for long amplicon genome sequencing. Manual isolation with the QIAamp Viral RNA Mini Kit (Qiagen) was superior over robotically extracted RNA using either the QIAcube robotic system, the mSample Preparation Systems RNA kit with automated extraction by the m2000sp system (Abbott Molecular), or the MagNA Pure 96 System in combination with the MagNA Pure 96 Instrument (Roche Diagnostics). We scored amplification of a set of four HIV-1 amplicons of ∼1.9, 3.6, 3.0 and 3.5kb, and subsequent recovery of near-complete viral genomes. Subsequently, 616 BEEHIVE patient samples were analysed to determine factors that influence successful amplification of the genome in four overlapping amplicons using the QIAamp Viral RNA Kit for viral RNA isolation. Both low plasma viral load and high sample age (stored before 1999) negatively influenced the amplification of viral amplicons >3kb. A plasma viral load of >100,000 copies/ml resulted in successful amplification of all four amplicons for 86% of the samples, this value dropped to only 46% for samples with viral loads of <20,000 copies/ml. Copyright © 2016 The Authors. Published by Elsevier B.V. All rights reserved.
Li, Linlin; Joseph, G. Victoria; Wang, Chunlin; Jones, Morris; Fellers, Gary M.; Kunz, Thomas H.; Delwart, Eric
2010-01-01
Bats are hosts to a variety of viruses capable of zoonotic transmissions. Because of increased contact between bats, humans, and other animal species, the possibility exists for further cross-species transmissions and ensuing disease outbreaks. We describe here full and partial viral genomes identified using metagenomics in the guano of bats from California and Texas. A total of 34% and 58% of 390,000 sequence reads from bat guano in California and Texas, respectively, were related to eukaryotic viruses, and the largest proportion of those infect insects, reflecting the diet of these insectivorous bats, including members of the viral families Dicistroviridae, Iflaviridae, Tetraviridae, and Nodaviridae and the subfamily Densovirinae. The second largest proportion of virus-related sequences infects plants and fungi, likely reflecting the diet of ingested insects, including members of the viral families Luteoviridae, Secoviridae, Tymoviridae, and Partitiviridae and the genus Sobemovirus. Bat guano viruses related to those infecting mammals comprised the third largest group, including members of the viral families Parvoviridae, Circoviridae, Picornaviridae, Adenoviridae, Poxviridae, Astroviridae, and Coronaviridae. No close relative of known human viral pathogens was identified in these bat populations. Phylogenetic analysis was used to clarify the relationship to known viral taxa of novel sequences detected in bat guano samples, showing that some guano viral sequences fall outside existing taxonomic groups. This initial characterization of the bat guano virome, the first metagenomic analysis of viruses in wild mammals using second-generation sequencing, therefore showed the presence of previously unidentified viral species, genera, and possibly families. Viral metagenomics is a useful tool for genetically characterizing viruses present in animals with the known capability of direct or indirect viral zoonosis to humans.
Li, Linlin; Victoria, Joseph G.; Wang, Chunlin; Jones, Morris; Fellers, Gary M.; Kunz, Thomas H.; Delwart, Eric
2010-01-01
Bats are hosts to a variety of viruses capable of zoonotic transmissions. Because of increased contact between bats, humans, and other animal species, the possibility exists for further cross-species transmissions and ensuing disease outbreaks. We describe here full and partial viral genomes identified using metagenomics in the guano of bats from California and Texas. A total of 34% and 58% of 390,000 sequence reads from bat guano in California and Texas, respectively, were related to eukaryotic viruses, and the largest proportion of those infect insects, reflecting the diet of these insectivorous bats, including members of the viral families Dicistroviridae, Iflaviridae, Tetraviridae, and Nodaviridae and the subfamily Densovirinae. The second largest proportion of virus-related sequences infects plants and fungi, likely reflecting the diet of ingested insects, including members of the viral families Luteoviridae, Secoviridae, Tymoviridae, and Partitiviridae and the genus Sobemovirus. Bat guano viruses related to those infecting mammals comprised the third largest group, including members of the viral families Parvoviridae, Circoviridae, Picornaviridae, Adenoviridae, Poxviridae, Astroviridae, and Coronaviridae. No close relative of known human viral pathogens was identified in these bat populations. Phylogenetic analysis was used to clarify the relationship to known viral taxa of novel sequences detected in bat guano samples, showing that some guano viral sequences fall outside existing taxonomic groups. This initial characterization of the bat guano virome, the first metagenomic analysis of viruses in wild mammals using second-generation sequencing, therefore showed the presence of previously unidentified viral species, genera, and possibly families. Viral metagenomics is a useful tool for genetically characterizing viruses present in animals with the known capability of direct or indirect viral zoonosis to humans. PMID:20463061
Li, Linlin; Victoria, Joseph G; Wang, Chunlin; Jones, Morris; Fellers, Gary M; Kunz, Thomas H; Delwart, Eric
2010-07-01
Bats are hosts to a variety of viruses capable of zoonotic transmissions. Because of increased contact between bats, humans, and other animal species, the possibility exists for further cross-species transmissions and ensuing disease outbreaks. We describe here full and partial viral genomes identified using metagenomics in the guano of bats from California and Texas. A total of 34% and 58% of 390,000 sequence reads from bat guano in California and Texas, respectively, were related to eukaryotic viruses, and the largest proportion of those infect insects, reflecting the diet of these insectivorous bats, including members of the viral families Dicistroviridae, Iflaviridae, Tetraviridae, and Nodaviridae and the subfamily Densovirinae. The second largest proportion of virus-related sequences infects plants and fungi, likely reflecting the diet of ingested insects, including members of the viral families Luteoviridae, Secoviridae, Tymoviridae, and Partitiviridae and the genus Sobemovirus. Bat guano viruses related to those infecting mammals comprised the third largest group, including members of the viral families Parvoviridae, Circoviridae, Picornaviridae, Adenoviridae, Poxviridae, Astroviridae, and Coronaviridae. No close relative of known human viral pathogens was identified in these bat populations. Phylogenetic analysis was used to clarify the relationship to known viral taxa of novel sequences detected in bat guano samples, showing that some guano viral sequences fall outside existing taxonomic groups. This initial characterization of the bat guano virome, the first metagenomic analysis of viruses in wild mammals using second-generation sequencing, therefore showed the presence of previously unidentified viral species, genera, and possibly families. Viral metagenomics is a useful tool for genetically characterizing viruses present in animals with the known capability of direct or indirect viral zoonosis to humans.
Ruan, Yi Jun; Wei, Chia Lin; Ee, Ai Ling; Vega, Vinsensius B; Thoreau, Herve; Su, Se Thoe Yun; Chia, Jer-Ming; Ng, Patrick; Chiu, Kuo Ping; Lim, Landri; Zhang, Tao; Peng, Chan Kwai; Lin, Ean Oon Lynette; Lee, Ng Mah; Yee, Sin Leo; Ng, Lisa F P; Chee, Ren Ee; Stanton, Lawrence W; Long, Philip M; Liu, Edison T
2003-05-24
The cause of severe acute respiratory syndrome (SARS) has been identified as a new coronavirus. Whole genome sequence analysis of various isolates might provide an indication of potential strain differences of this new virus. Moreover, mutation analysis will help to develop effective vaccines. We sequenced the entire SARS viral genome of cultured isolates from the index case (SIN2500) presenting in Singapore, from three primary contacts (SIN2774, SIN2748, and SIN2677), and one secondary contact (SIN2679). These sequences were compared with the isolates from Canada (TOR2), Hong Kong (CUHK-W1 and HKU39849), Hanoi (URBANI), Guangzhou (GZ01), and Beijing (BJ01, BJ02, BJ03, BJ04). We identified 129 sequence variations among the 14 isolates, with 16 recurrent variant sequences. Common variant sequences at four loci define two distinct genotypes of the SARS virus. One genotype was linked with infections originating in Hotel M in Hong Kong, the second contained isolates from Hong Kong, Guangzhou, and Beijing with no association with Hotel M (p<0.0001). Moreover, other common sequence variants further distinguished the geographical origins of the isolates, especially between Singapore and Beijing. Despite the recent onset of the SARS epidemic, genetic signatures are emerging that partition the worldwide SARS viral isolates into groups on the basis of contact source history and geography. These signatures can be used to trace sources of infection. In addition, a common variant associated with a non-conservative aminoacid change in the S1 region of the spike protein, suggests that immunological pressures might be starting to influence the evolution of the SARS virus in human populations.
Casillas, Rosario; Tabernero, David; Gregori, Josep; Belmonte, Irene; Cortese, Maria Francesca; González, Carolina; Riveiro-Barciela, Mar; López, Rosa Maria; Quer, Josep; Esteban, Rafael; Buti, Maria; Rodríguez-Frías, Francisco
2018-01-01
AIM To determine the variability/conservation of the domain of hepatitis B virus (HBV) preS1 region that interacts with sodium-taurocholate cotransporting polypeptide (hereafter, NTCP-interacting domain) and the prevalence of the rs2296651 polymorphism (S267F, NTCP variant) in a Spanish population. METHODS Serum samples from 246 individuals were included and divided into 3 groups: patients with chronic HBV infection (CHB) (n = 41, 73% Caucasians), patients with resolved HBV infection (n = 100, 100% Caucasians) and an HBV-uninfected control group (n = 105, 100% Caucasians). Variability/conservation of the amino acid (aa) sequences of the NTCP-interacting domain, (aa 2-48 in viral genotype D) and a highly conserved preS1 domain associated with virion morphogenesis (aa 92-103 in viral genotype D) were analyzed by next-generation sequencing and compared in 18 CHB patients with viremia > 4 log IU/mL. The rs2296651 polymorphism was determined in all individuals in all 3 groups using an in-house real-time PCR melting curve analysis. RESULTS The HBV preS1 NTCP-interacting domain showed a high degree of conservation among the examined viral genomes especially between aa 9 and 21 (in the genotype D consensus sequence). As compared with the virion morphogenesis domain, the NTCP-interacting domain had a smaller proportion of HBV genotype-unrelated changes comprising > 1% of the quasispecies (25.5% vs 31.8%), but a larger proportion of genotype-associated viral polymorphisms (34% vs 27.3%), according to consensus sequences from GenBank patterns of HBV genotypes A to H. Variation/conservation in both domains depended on viral genotype, with genotype C being the most highly conserved and genotype E the most variable (limited finding, only 2 genotype E included). Of note, proline residues were highly conserved in both domains, and serine residues showed changes only to threonine or tyrosine in the virion morphogenesis domain. The rs2296651 polymorphism was not detected in any participant. CONCLUSION In our CHB population, the NTCP-interacting domain was highly conserved, particularly the proline residues and essential amino acids related with the NTCP interaction, and the prevalence of rs2296651 was low/null. PMID:29456407
Epstein-Barr Virus Sequence Variation—Biology and Disease
Tzellos, Stelios; Farrell, Paul J.
2012-01-01
Some key questions in Epstein-Barr virus (EBV) biology center on whether naturally occurring sequence differences in the virus affect infection or EBV associated diseases. Understanding the pattern of EBV sequence variation is also important for possible development of EBV vaccines. At present EBV isolates worldwide can be grouped into Type 1 and Type 2, a classification based on the EBNA2 gene sequence. Type 1 EBV is the most prevalent worldwide but Type 2 is common in parts of Africa. Type 1 transforms human B cells into lymphoblastoid cell lines much more efficiently than Type 2 EBV. Molecular mechanisms that may account for this difference in cell transformation are now becoming clearer. Advances in sequencing technology will greatly increase the amount of whole EBV genome data for EBV isolated from different parts of the world. Study of regional variation of EBV strains independent of the Type 1/Type 2 classification and systematic investigation of the relationship between viral strains, infection and disease will become possible. The recent discovery that specific mutation of the EBV EBNA3B gene may be linked to development of diffuse large B cell lymphoma illustrates the importance that mutations in the virus genome may have in infection and human disease. PMID:25436768
Chutiwitoonchai, Nopporn; Kakisaka, Michinori; Yamada, Kazunori; Aida, Yoko
2014-01-01
The assembly of influenza virus progeny virions requires machinery that exports viral genomic ribonucleoproteins from the cell nucleus. Currently, seven nuclear export signal (NES) consensus sequences have been identified in different viral proteins, including NS1, NS2, M1, and NP. The present study examined the roles of viral NES consensus sequences and their significance in terms of viral replication and nuclear export. Mutation of the NP-NES3 consensus sequence resulted in a failure to rescue viruses using a reverse genetics approach, whereas mutation of the NS2-NES1 and NS2-NES2 sequences led to a strong reduction in viral replication kinetics compared with the wild-type sequence. While the viral replication kinetics for other NES mutant viruses were also lower than those of the wild-type, the difference was not so marked. Immunofluorescence analysis after transient expression of NP-NES3, NS2-NES1, or NS2-NES2 proteins in host cells showed that they accumulated in the cell nucleus. These results suggest that the NP-NES3 consensus sequence is mostly required for viral replication. Therefore, each of the hydrophobic (Φ) residues within this NES consensus sequence (Φ1, Φ2, Φ3, or Φ4) was mutated, and its viral replication and nuclear export function were analyzed. No viruses harboring NP-NES3 Φ2 or Φ3 mutants could be rescued. Consistent with this, the NP-NES3 Φ2 and Φ3 mutants showed reduced binding affinity with CRM1 in a pull-down assay, and both accumulated in the cell nucleus. Indeed, a nuclear export assay revealed that these mutant proteins showed lower nuclear export activity than the wild-type protein. Moreover, the Φ2 and Φ3 residues (along with other Φ residues) within the NP-NES3 consensus were highly conserved among different influenza A viruses, including human, avian, and swine. Taken together, these results suggest that the Φ2 and Φ3 residues within the NP-NES3 protein are important for its nuclear export function during viral replication.
Utachee, Piraporn; Jinnopat, Piyamat; Isarangkura-Na-Ayuthaya, Panasda; de Silva, Udayanga Chandimal; Nakamura, Shota; Siripanyaphinyo, Uamporn; Wichukchinda, Nuanjun; Tokunaga, Kenzo; Yasunaga, Teruo; Sawanpanyalert, Pathom; Ikuta, Kazuyoshi; Auwanit, Wattana; Kameoka, Masanori
2009-02-01
CRF01_AE is a major subtype of human immunodeficiency virus type 1 (HIV-1) circulating in Southeast Asia, including Thailand. HIV-1 env genes were amplified by polymerase chain reaction from blood samples of HIV-1-infected patients residing in Thailand in 2006, and cloned into the pNL4-3-derived reporter viral construct. Generated envelope protein (Env)-recombinant virus was examined for its infectivity, and then 35 infectious CRF01_AE Env-recombinant viruses were selected. Sequencing analysis revealed that the interclone variation of the deduced amino acid sequences was higher in CRF01_AE env genes isolated in 2006 than in those isolated in the early 1990s, suggesting that env gene variation has been increasing gradually among CRF01_AE viruses prevalent in Thailand. We also examined the characteristics of the deduced amino acid sequences of 35 CRF01_AE env genes. Our results may provide useful information to help in better understanding the genotype of env genes of CRF01_AE viruses currently circulating in Thailand.
SARS-CoV and Emergent Coronaviruses: Viral Determinants of Interspecies Transmission
Bolles, Meagan; Donaldson, Eric; Baric, Ralph
2011-01-01
Most new emerging viruses are derived from strains circulating in zoonotic reservoirs. Coronaviruses, which had an established potential for cross-species transmission within domesticated animals, suddenly became relevant with the unexpected emergence of the highly pathogenic human SARS-CoV strain from zoonotic reservoirs in 2002. SARS-CoV infected approximately 8000 people worldwide before public health measures halted the epidemic. Supported by robust time-ordered sequence variation, structural biology, well-characterized patient pools, and biological data, the emergence of SARS-CoV represents one of the best studied natural models of viral disease emergence from zoonotic sources. This review article summarizes previous and more recent advances into the molecular and structural characteristics, with particular emphasis on host-receptor interactions, that drove this remarkable virus disease outbreak in human populations. PMID:22180768
Getchell, Rodman G; Cornwell, Emily R; Bogdanowicz, Steven; Andrés, Jose; Batts, William N; Kurath, Gael; Breyta, Rachel; Choi, Joanna G; Farrell, John M; Bowser, Paul R
2017-11-21
Four viral hemorrhagic septicemia virus (VHSV) genotype IVb isolates were sequenced, their genetic variation explored, and comparative virulence assayed with experimental infections of northern pike Esox lucius fry. In addition to the type strain MI03, the complete 11183 bp genome of the first round goby Neogobius melanostomus isolate from the St. Lawrence River, and the 2013 and 2014 isolates from gizzard shad Dorosoma cepedianum die-offs in Irondequoit Bay, Lake Ontario and Dunkirk Harbor, Lake Erie were all deep sequenced on an Illumina platform. Mutations documented in the 11 yr since the MI03 index case from Lake St. Clair muskellunge Esox masquinongy showed 87 polymorphisms among the 4 isolates. Twenty-six mutations were non-synonymous and located at 18 different positions within the matrix protein, glycoprotein, non-virion protein, and RNA polymerase genes. The same 4 isolates were used to infect northern pike fry by a single 1 h bath exposure. Cumulative percent mortality varied from 42.5 to 62.5%. VHSV was detected in 57% (41/72) of the survivors at the end of the 21-d trial, suggesting that the virus was not rapidly cleared. Lesions were observed in many of the moribund and dead northern pike, such as hemorrhaging in the skin and fins, as well as hydrocephalus. Mean viral load measured from the trunk and visceral tissues of MI03-infected pike was significantly higher than the quantities detected in fish infected with the most recent isolates of genotype IVb, but there were no differences in cumulative mortality observed.
Rebrikov, Denis V; Bulina, Maria E; Bogdanova, Ekaterina A; Vagner, Loura L; Lukyanov, Sergey A
2002-01-01
Background Freshwater planarians are widely used as models for investigation of pattern formation and studies on genetic variation in populations. Despite extensive information on the biology and genetics of planaria, the occurrence and distribution of viruses in these animals remains an unexplored area of research. Results Using a combination of Suppression Subtractive Hybridization (SSH) and Mirror Orientation Selection (MOS), we compared the genomes of two strains of freshwater planarian, Girardia tigrina. The novel extrachromosomal DNA-containing virus-like element denoted PEVE (Planarian Extrachromosomal Virus-like Element) was identified in one planarian strain. The PEVE genome (about 7.5 kb) consists of two unique regions (Ul and Us) flanked by inverted repeats. Sequence analyses reveal that PEVE comprises two helicase-like sequences in the genome, of which the first is a homolog of a circoviral replication initiator protein (Rep), and the second is similar to the papillomavirus E1 helicase domain. PEVE genome exists in at least two variant forms with different arrangements of single-stranded and double-stranded DNA stretches that correspond to the Us and Ul regions. Using PCR analysis and whole-mount in situ hybridization, we characterized PEVE distribution and expression in the planarian body. Conclusions PEVE is the first viral element identified in free-living flatworms. This element differs from all known viruses and viral elements, and comprises two potential helicases that are homologous to proteins from distant viral phyla. PEVE is unevenly distributed in the worm body, and is detected in specific parenchyma cells. PMID:12065025
Association of coral algal symbionts with a diverse viral community responsive to heat shock.
Brüwer, Jan D; Agrawal, Shobhit; Liew, Yi Jin; Aranda, Manuel; Voolstra, Christian R
2017-08-17
Stony corals provide the structural foundation of coral reef ecosystems and are termed holobionts given they engage in symbioses, in particular with photosynthetic dinoflagellates of the genus Symbiodinium. Besides Symbiodinium, corals also engage with bacteria affecting metabolism, immunity, and resilience of the coral holobiont, but the role of associated viruses is largely unknown. In this regard, the increase of studies using RNA sequencing (RNA-Seq) to assess gene expression provides an opportunity to elucidate viral signatures encompassed within the data via careful delineation of sequence reads and their source of origin. Here, we re-analyzed an RNA-Seq dataset from a cultured coral symbiont (Symbiodinium microadriaticum, Clade A1) across four experimental treatments (control, cold shock, heat shock, dark shock) to characterize associated viral diversity, abundance, and gene expression. Our approach comprised the filtering and removal of host sequence reads, subsequent phylogenetic assignment of sequence reads of putative viral origin, and the assembly and analysis of differentially expressed viral genes. About 15.46% (123 million) of all sequence reads were non-host-related, of which <1% could be classified as archaea, bacteria, or virus. Of these, 18.78% were annotated as virus and comprised a diverse community consistent across experimental treatments. Further, non-host related sequence reads assembled into 56,064 contigs, including 4856 contigs of putative viral origin that featured 43 differentially expressed genes during heat shock. The differentially expressed genes included viral kinases, ubiquitin, and ankyrin repeat proteins (amongst others), which are suggested to help the virus proliferate and inhibit the algal host's antiviral response. Our results suggest that a diverse viral community is associated with coral algal endosymbionts of the genus Symbiodinium, which prompts further research on their ecological role in coral health and resilience.
RNA circularization reveals terminal sequence heterogeneity in a double-stranded RNA virus.
Widmer, G
1993-03-01
Double-stranded RNA viruses (dsRNA), termed LRV1, have been found in several strains of the protozoan parasite Leishmania. With the aim of constructing a full-length cDNA copy of the viral genome, including its terminal sequences, a protocol based on PCR amplification across the 3'-5' junction of circularized RNA was developed. This method proved to be applicable to dsRNA. It provided a relatively simple alternative to one-sided PCR, without loss of specificity inherent in the use of generic primers. LRV1 terminal nucleotide sequences obtained by this method showed a considerable variation in length, particularly at the 5' end of the positive strand, as well as the potential for forming 3' overhangs. The opposite genomic end terminates in 0, 1, or 2 TCA trinucleotide repeats. These results are compared with terminal sequences derived from one-sided PCR experiments.
Geisler, Christoph
2018-02-07
Adventitious viral contamination in cell substrates used for biologicals production is a major safety concern. A powerful new approach that can be used to identify adventitious viruses is a combination of bioinformatics tools with massively parallel sequencing technology. Typically, this involves mapping or BLASTN searching individual reads against viral nucleotide databases. Although extremely sensitive for known viruses, this approach can easily miss viruses that are too dissimilar to viruses in the database. Moreover, it is computationally intensive and requires reference cell genome databases. To avoid these drawbacks, we set out to develop an alternative approach. We reasoned that searching genome and transcriptome assemblies for adventitious viral contaminants using TBLASTN with a compact viral protein database covering extant viral diversity as the query could be fast and sensitive without a requirement for high performance computing hardware. We tested our approach on Spodoptera frugiperda Sf-RVN, a recently isolated insect cell line, to determine if it was contaminated with one or more adventitious viruses. We used Illumina reads to assemble the Sf-RVN genome and transcriptome and searched them for adventitious viral contaminants using TBLASTN with our viral protein database. We found no evidence of viral contamination, which was substantiated by the fact that our searches otherwise identified diverse sequences encoding virus-like proteins. These sequences included Maverick, R1 LINE, and errantivirus transposons, all of which are common in insect genomes. We also identified previously described as well as novel endogenous viral elements similar to ORFs encoded by diverse insect viruses. Our results demonstrate TBLASTN searching massively parallel sequencing (MPS) assemblies with a compact, manually curated viral protein database is more sensitive for adventitious virus detection than BLASTN, as we identified various sequences that encoded virus-like proteins, but had no similarity to viral sequences at the nucleotide level. Moreover, searches were fast without requiring high performance computing hardware. Our study also documents the enhanced biosafety profile of Sf-RVN as compared to other Sf cell lines, and supports the notion that Sf-RVN is highly suitable for the production of safe biologicals.
VirSorter: mining viral signal from microbial genomic data.
Roux, Simon; Enault, Francois; Hurwitz, Bonnie L; Sullivan, Matthew B
2015-01-01
Viruses of microbes impact all ecosystems where microbes drive key energy and substrate transformations including the oceans, humans and industrial fermenters. However, despite this recognized importance, our understanding of viral diversity and impacts remains limited by too few model systems and reference genomes. One way to fill these gaps in our knowledge of viral diversity is through the detection of viral signal in microbial genomic data. While multiple approaches have been developed and applied for the detection of prophages (viral genomes integrated in a microbial genome), new types of microbial genomic data are emerging that are more fragmented and larger scale, such as Single-cell Amplified Genomes (SAGs) of uncultivated organisms or genomic fragments assembled from metagenomic sequencing. Here, we present VirSorter, a tool designed to detect viral signal in these different types of microbial sequence data in both a reference-dependent and reference-independent manner, leveraging probabilistic models and extensive virome data to maximize detection of novel viruses. Performance testing shows that VirSorter's prophage prediction capability compares to that of available prophage predictors for complete genomes, but is superior in predicting viral sequences outside of a host genome (i.e., from extrachromosomal prophages, lytic infections, or partially assembled prophages). Furthermore, VirSorter outperforms existing tools for fragmented genomic and metagenomic datasets, and can identify viral signal in assembled sequence (contigs) as short as 3kb, while providing near-perfect identification (>95% Recall and 100% Precision) on contigs of at least 10kb. Because VirSorter scales to large datasets, it can also be used in "reverse" to more confidently identify viral sequence in viral metagenomes by sorting away cellular DNA whether derived from gene transfer agents, generalized transduction or contamination. Finally, VirSorter is made available through the iPlant Cyberinfrastructure that provides a web-based user interface interconnected with the required computing resources. VirSorter thus complements existing prophage prediction softwares to better leverage fragmented, SAG and metagenomic datasets in a way that will scale to modern sequencing. Given these features, VirSorter should enable the discovery of new viruses in microbial datasets, and further our understanding of uncultivated viral communities across diverse ecosystems.
VirSorter: mining viral signal from microbial genomic data
Roux, Simon; Enault, Francois; Hurwitz, Bonnie L.
2015-01-01
Viruses of microbes impact all ecosystems where microbes drive key energy and substrate transformations including the oceans, humans and industrial fermenters. However, despite this recognized importance, our understanding of viral diversity and impacts remains limited by too few model systems and reference genomes. One way to fill these gaps in our knowledge of viral diversity is through the detection of viral signal in microbial genomic data. While multiple approaches have been developed and applied for the detection of prophages (viral genomes integrated in a microbial genome), new types of microbial genomic data are emerging that are more fragmented and larger scale, such as Single-cell Amplified Genomes (SAGs) of uncultivated organisms or genomic fragments assembled from metagenomic sequencing. Here, we present VirSorter, a tool designed to detect viral signal in these different types of microbial sequence data in both a reference-dependent and reference-independent manner, leveraging probabilistic models and extensive virome data to maximize detection of novel viruses. Performance testing shows that VirSorter’s prophage prediction capability compares to that of available prophage predictors for complete genomes, but is superior in predicting viral sequences outside of a host genome (i.e., from extrachromosomal prophages, lytic infections, or partially assembled prophages). Furthermore, VirSorter outperforms existing tools for fragmented genomic and metagenomic datasets, and can identify viral signal in assembled sequence (contigs) as short as 3kb, while providing near-perfect identification (>95% Recall and 100% Precision) on contigs of at least 10kb. Because VirSorter scales to large datasets, it can also be used in “reverse” to more confidently identify viral sequence in viral metagenomes by sorting away cellular DNA whether derived from gene transfer agents, generalized transduction or contamination. Finally, VirSorter is made available through the iPlant Cyberinfrastructure that provides a web-based user interface interconnected with the required computing resources. VirSorter thus complements existing prophage prediction softwares to better leverage fragmented, SAG and metagenomic datasets in a way that will scale to modern sequencing. Given these features, VirSorter should enable the discovery of new viruses in microbial datasets, and further our understanding of uncultivated viral communities across diverse ecosystems. PMID:26038737
Horizontal acquisition of transposable elements and viral sequences: patterns and consequences.
Gilbert, Clément; Feschotte, Cédric
2018-04-01
It is becoming clear that most eukaryotic transposable elements (TEs) owe their evolutionary success in part to horizontal transfer events, which enable them to invade new species. Recent large-scale studies are beginning to unravel the mechanisms and ecological factors underlying this mode of transmission. Viruses are increasingly recognized as vectors in the process but also as a direct source of genetic material horizontally acquired by eukaryotic organisms. Because TEs and endogenous viruses are major catalysts of variation and innovation in genomes, we argue that horizontal inheritance has had a more profound impact in eukaryotic evolution than is commonly appreciated. To support this proposal, we compile a list of examples, including some previously unrecognized, whereby new host functions and phenotypes can be directly attributed to horizontally acquired TE or viral sequences. We predict that the number of examples will rapidly grow in the future as the prevalence of horizontal transfer in the life cycle of TEs becomes even more apparent, firmly establishing this form of non-Mendelian inheritance as a consequential facet of eukaryotic evolution. Copyright © 2018 Elsevier Ltd. All rights reserved.
Potential Links between Hepadnavirus and Bornavirus Sequences in the Host Genome and Cancer.
Honda, Tomoyuki
2017-01-01
Various viruses leave their sequences in the host genomes during infection. Such events occur mainly in retrovirus infection but also sometimes in DNA and non-retroviral RNA virus infections. If viral sequences are integrated into the genomes of germ line cells, the sequences can become inherited as endogenous viral elements (EVEs). The integration events of viral sequences may have oncogenic potential. Because proviral integrations of some retroviruses and/or reactivation of endogenous retroviruses are closely linked to cancers, viral insertions related to non-retroviral viruses also possibly contribute to cancer development. This article focuses on genomic viral sequences derived from two non-retroviral viruses, whose endogenization is already reported, and discusses their possible contributions to cancer. Viral insertions of hepatitis B virus play roles in the development of hepatocellular carcinoma. Endogenous bornavirus-like elements, the only non-retroviral RNA virus-related EVEs found in the human genome, may also be involved in cancer formation. In addition, the possible contribution of the interactions between viruses and retrotransposons, which seem to be a major driving force for generating EVEs related to non-retroviral RNA viruses, to cancers will be discussed. Future studies regarding the possible links described here may open a new avenue for the development of novel therapeutics for tumor virus-related cancers and/or provide novel insights into EVE functions.
Metagenomic characterization of viral communities in Goseong Bay, Korea
NASA Astrophysics Data System (ADS)
Hwang, Jinik; Park, So Yun; Park, Mirye; Lee, Sukchan; Jo, Yeonhwa; Cho, Won Kyong; Lee, Taek-Kyun
2016-12-01
In this study, seawater samples were collected from Goseong Bay, Korea in March 2014 and viral populations were examined by metagenomics assembly. Enrichment of marine viral particles using FeCl3 followed by next-generation sequencing produced numerous sequences. De novo assembly and BLAST search showed that most of the obtained contigs were unknown sequences and only 0.74% of sequences were associated with known viruses. As a result, 138 viruses, including bacteriophages (87%), viruses infecting algae and others (13%) were identified. The identified 138 viruses were divided into 11 orders, 14 families, 34 genera, and 133 species. The dominant viruses were Pelagibacter phage HTVC010P and Roseobacter phage SIO1. The viruses infecting algae, including the Ostreococcus species, accounted for 9.4% of total identified viruses. In addition, we identified pathogenic herpes viruses infecting fishes and giant viruses infecting parasitic acanthamoeba species. This is a comprehensive study to reveal the viral populations in the Goseong Bay using metagenomics. The information associated with the marine viral community in Goseong Bay, Korea will be useful for comparative analysis in other marine viral communities.
Viral dark matter and virus–host interactions resolved from publicly available microbial genomes
DOE Office of Scientific and Technical Information (OSTI.GOV)
Roux, Simon; Hallam, Steven J.; Woyke, Tanja
The ecological importance of viruses is now widely recognized, yet our limited knowledge of viral sequence space and virus–host interactions precludes accurate prediction of their roles and impacts. In this study, we mined publicly available bacterial and archaeal genomic data sets to identify 12,498 high-confidence viral genomes linked to their microbial hosts. These data augment public data sets 10-fold, provide first viral sequences for 13 new bacterial phyla including ecologically abundant phyla, and help taxonomically identify 7–38% of ‘unknown’ sequence space in viromes. Genome- and network-based classification was largely consistent with accepted viral taxonomy and suggested that (i) 264 newmore » viral genera were identified (doubling known genera) and (ii) cross-taxon genomic recombination is limited. Further analyses provided empirical data on extrachromosomal prophages and coinfection prevalences, as well as evaluation of in silico virus–host linkage predictions. Together these findings illustrate the value of mining viral signal from microbial genomes.« less
Viral dark matter and virus-host interactions resolved from publicly available microbial genomes.
Roux, Simon; Hallam, Steven J; Woyke, Tanja; Sullivan, Matthew B
2015-07-22
The ecological importance of viruses is now widely recognized, yet our limited knowledge of viral sequence space and virus-host interactions precludes accurate prediction of their roles and impacts. In this study, we mined publicly available bacterial and archaeal genomic data sets to identify 12,498 high-confidence viral genomes linked to their microbial hosts. These data augment public data sets 10-fold, provide first viral sequences for 13 new bacterial phyla including ecologically abundant phyla, and help taxonomically identify 7-38% of 'unknown' sequence space in viromes. Genome- and network-based classification was largely consistent with accepted viral taxonomy and suggested that (i) 264 new viral genera were identified (doubling known genera) and (ii) cross-taxon genomic recombination is limited. Further analyses provided empirical data on extrachromosomal prophages and coinfection prevalences, as well as evaluation of in silico virus-host linkage predictions. Together these findings illustrate the value of mining viral signal from microbial genomes.
Viral dark matter and virus–host interactions resolved from publicly available microbial genomes
Roux, Simon; Hallam, Steven J.; Woyke, Tanja; ...
2015-07-22
The ecological importance of viruses is now widely recognized, yet our limited knowledge of viral sequence space and virus–host interactions precludes accurate prediction of their roles and impacts. In this study, we mined publicly available bacterial and archaeal genomic data sets to identify 12,498 high-confidence viral genomes linked to their microbial hosts. These data augment public data sets 10-fold, provide first viral sequences for 13 new bacterial phyla including ecologically abundant phyla, and help taxonomically identify 7–38% of ‘unknown’ sequence space in viromes. Genome- and network-based classification was largely consistent with accepted viral taxonomy and suggested that (i) 264 newmore » viral genera were identified (doubling known genera) and (ii) cross-taxon genomic recombination is limited. Further analyses provided empirical data on extrachromosomal prophages and coinfection prevalences, as well as evaluation of in silico virus–host linkage predictions. Together these findings illustrate the value of mining viral signal from microbial genomes.« less
Lewis, Jo E; Brameld, John M; Hill, Phil; Barrett, Perry; Ebling, Francis J P; Jethwa, Preeti H
2015-12-30
The viral 2A sequence has become an attractive alternative to the traditional internal ribosomal entry site (IRES) for simultaneous over-expression of two genes and in combination with recombinant adeno-associated viruses (rAAV) has been used to manipulate gene expression in vitro. To develop a rAAV construct in combination with the viral 2A sequence to allow long-term over-expression of the vgf gene and fluorescent marker gene for tracking of the transfected neurones in vivo. Transient transfection of the AAV plasmid containing the vgf gene, viral 2A sequence and eGFP into SH-SY5Y cells resulted in eGFP fluorescence comparable to a commercially available reporter construct. This increase in fluorescent cells was accompanied by an increase in VGF mRNA expression. Infusion of the rAAV vector containing the vgf gene, viral 2A sequence and eGFP resulted in eGFP fluorescence in the hypothalamus of both mice and Siberian hamsters, 32 weeks post infusion. In situ hybridisation confirmed that the location of VGF mRNA expression in the hypothalamus corresponded to the eGFP pattern of fluorescence. The viral 2A sequence is much smaller than the traditional IRES and therefore allowed over-expression of the vgf gene with fluorescent tracking without compromising viral capacity. The use of the viral 2A sequence in the AAV plasmid allowed the simultaneous expression of both genes in vitro. When used in combination with rAAV it resulted in long-term over-expression of both genes at equivalent locations in the hypothalamus of both Siberian hamsters and mice, without any adverse effects. Copyright © 2015 The Authors. Published by Elsevier B.V. All rights reserved.
Fuentes-Pananá, Ezequiel M; Larios-Serrato, Violeta; Méndez-Tenorio, Alfonso; Morales-Sánchez, Abigail; Arias, Carlos F; Torres, Javier
2016-01-01
Gastric (GC) and breast (BrC) cancer are two of the most common and deadly tumours. Different lines of evidence suggest a possible causative role of viral infections for both GC and BrC. Wide genome sequencing (WGS) technologies allow searching for viral agents in tissues of patients with cancer. These technologies have already contributed to establish virus-cancer associations as well as to discovery new tumour viruses. The objective of this study was to document possible associations of viral infection with GC and BrC in Mexican patients. In order to gain idea about cost effective conditions of experimental sequencing, we first carried out an in silico simulation of WGS. The next-generation-platform IlluminaGallx was then used to sequence GC and BrC tumour samples. While we did not find viral sequences in tissues from BrC patients, multiple reads matching Epstein-Barr virus (EBV) sequences were found in GC tissues. An end-point polymerase chain reaction confirmed an enrichment of EBV sequences in one of the GC samples sequenced, validating the next-generation sequencing-bioinformatics pipeline. PMID:26910355
Fuentes-Pananá, Ezequiel M; Larios-Serrato, Violeta; Méndez-Tenorio, Alfonso; Morales-Sánchez, Abigail; Arias, Carlos F; Torres, Javier
2016-03-01
Gastric (GC) and breast (BrC) cancer are two of the most common and deadly tumours. Different lines of evidence suggest a possible causative role of viral infections for both GC and BrC. Wide genome sequencing (WGS) technologies allow searching for viral agents in tissues of patients with cancer. These technologies have already contributed to establish virus-cancer associations as well as to discovery new tumour viruses. The objective of this study was to document possible associations of viral infection with GC and BrC in Mexican patients. In order to gain idea about cost effective conditions of experimental sequencing, we first carried out an in silico simulation of WGS. The next-generation-platform IlluminaGallx was then used to sequence GC and BrC tumour samples. While we did not find viral sequences in tissues from BrC patients, multiple reads matching Epstein-Barr virus (EBV) sequences were found in GC tissues. An end-point polymerase chain reaction confirmed an enrichment of EBV sequences in one of the GC samples sequenced, validating the next-generation sequencing-bioinformatics pipeline.
Coffey, Lark L; Page, Brady L; Greninger, Alexander L; Herring, Belinda L; Russell, Richard C; Doggett, Stephen L; Haniotis, John; Wang, Chunlin; Deng, Xutao; Delwart, Eric L
2014-01-05
Viral metagenomics characterizes known and identifies unknown viruses based on sequence similarities to any previously sequenced viral genomes. A metagenomics approach was used to identify virus sequences in Australian mosquitoes causing cytopathic effects in inoculated mammalian cell cultures. Sequence comparisons revealed strains of Liao Ning virus (Reovirus, Seadornavirus), previously detected only in China, livestock-infecting Stretch Lagoon virus (Reovirus, Orbivirus), two novel dimarhabdoviruses, named Beaumont and North Creek viruses, and two novel orthobunyaviruses, named Murrumbidgee and Salt Ash viruses. The novel virus proteomes diverged by ≥ 50% relative to their closest previously genetically characterized viral relatives. Deep sequencing also generated genomes of Warrego and Wallal viruses, orbiviruses linked to kangaroo blindness, whose genomes had not been fully characterized. This study highlights viral metagenomics in concert with traditional arbovirus surveillance to characterize known and new arboviruses in field-collected mosquitoes. Follow-up epidemiological studies are required to determine whether the novel viruses infect humans. © 2013 Elsevier Inc. All rights reserved.
Huiet, L; Feldstein, P A; Tsai, J H; Falk, B W
1993-12-01
Primer extension analyses and a PCR-based cloning strategy were used to identify and characterize 5' nucleotide sequences on the maize stripe virus (MStV) RNA4 mRNA transcripts encoding the major noncapsid protein (NCP). Direct RNA sequence analysis by primer extension showed that the NCP mRNA transcripts had 10-15 nucleotides beyond the 5' terminus of the MStV RNA4 nucleotide sequence. MStV genomic RNAs isolated from ribonucleoprotein particles (RNPs) lacked the additional 5' nucleotides. cDNA clones representing the 5' region of the mRNA transcripts were constructed, and the nucleotide sequences of the 5' regions were determined for 16 clones. Each was found to have a distinct 10-15 nucleotide sequence immediately 5' of the MStV RNA4 sequence. Eleven of 16 clones had the correct MStV RNA4 5' nucleotide sequence, while five showed minor variations at or near the 5' most MStV RNA4 nucleotide. These characteristics show strong similarities to other viral mRNA transcripts which are synthesized by cap snatching.
Allison, Andrew B; Kohler, Dennis J; Ortega, Alicia; Hoover, Elizabeth A; Grove, Daniel M; Holmes, Edward C; Parrish, Colin R
2014-11-01
Canine parvovirus (CPV) emerged as a new pandemic pathogen of dogs in the 1970s and is closely related to feline panleukopenia virus (FPV), a parvovirus of cats and related carnivores. Although both viruses have wide host ranges, analysis of viral sequences recovered from different wild carnivore species, as shown here, demonstrated that>95% were derived from CPV-like viruses, suggesting that CPV is dominant in sylvatic cycles. Many viral sequences showed host-specific mutations in their capsid proteins, which were often close to sites known to control binding to the transferrin receptor (TfR), the host receptor for these carnivore parvoviruses, and which exhibited frequent parallel evolution. To further examine the process of host adaptation, we passaged parvoviruses with alternative backgrounds in cells from different carnivore hosts. Specific mutations were selected in several viruses and these differed depending on both the background of the virus and the host cells in which they were passaged. Strikingly, these in vitro mutations recapitulated many specific changes seen in viruses from natural populations, strongly suggesting they are host adaptive, and which were shown to result in fitness advantages over their parental virus. Comparison of the sequences of the transferrin receptors of the different carnivore species demonstrated that many mutations occurred in and around the apical domain where the virus binds, indicating that viral variants were likely selected through their fit to receptor structures. Some of the viruses accumulated high levels of variation upon passage in alternative hosts, while others could infect multiple different hosts with no or only a few additional mutations. Overall, these studies demonstrate that the evolutionary history of a virus, including how long it has been circulating and in which hosts, as well as its phylogenetic background, has a profound effect on determining viral host range.
Allison, Andrew B.; Kohler, Dennis J.; Ortega, Alicia; Hoover, Elizabeth A.; Grove, Daniel M.; Holmes, Edward C.; Parrish, Colin R.
2014-01-01
Canine parvovirus (CPV) emerged as a new pandemic pathogen of dogs in the 1970s and is closely related to feline panleukopenia virus (FPV), a parvovirus of cats and related carnivores. Although both viruses have wide host ranges, analysis of viral sequences recovered from different wild carnivore species, as shown here, demonstrated that >95% were derived from CPV-like viruses, suggesting that CPV is dominant in sylvatic cycles. Many viral sequences showed host-specific mutations in their capsid proteins, which were often close to sites known to control binding to the transferrin receptor (TfR), the host receptor for these carnivore parvoviruses, and which exhibited frequent parallel evolution. To further examine the process of host adaptation, we passaged parvoviruses with alternative backgrounds in cells from different carnivore hosts. Specific mutations were selected in several viruses and these differed depending on both the background of the virus and the host cells in which they were passaged. Strikingly, these in vitro mutations recapitulated many specific changes seen in viruses from natural populations, strongly suggesting they are host adaptive, and which were shown to result in fitness advantages over their parental virus. Comparison of the sequences of the transferrin receptors of the different carnivore species demonstrated that many mutations occurred in and around the apical domain where the virus binds, indicating that viral variants were likely selected through their fit to receptor structures. Some of the viruses accumulated high levels of variation upon passage in alternative hosts, while others could infect multiple different hosts with no or only a few additional mutations. Overall, these studies demonstrate that the evolutionary history of a virus, including how long it has been circulating and in which hosts, as well as its phylogenetic background, has a profound effect on determining viral host range. PMID:25375184
Luque, Daniel; Gómez-Blanco, Josué; Garriga, Damiá; Brilot, Axel F.; González, José M.; Havens, Wendy M.; Carrascosa, José L.; Trus, Benes L.; Verdaguer, Nuria; Ghabrial, Said A.; Castón, José R.
2014-01-01
Viruses evolve so rapidly that sequence-based comparison is not suitable for detecting relatedness among distant viruses. Structure-based comparisons suggest that evolution led to a small number of viral classes or lineages that can be grouped by capsid protein (CP) folds. Here, we report that the CP structure of the fungal dsRNA Penicillium chrysogenum virus (PcV) shows the progenitor fold of the dsRNA virus lineage and suggests a relationship between lineages. Cryo-EM structure at near-atomic resolution showed that the 982-aa PcV CP is formed by a repeated α-helical core, indicative of gene duplication despite lack of sequence similarity between the two halves. Superimposition of secondary structure elements identified a single “hotspot” at which variation is introduced by insertion of peptide segments. Structural comparison of PcV and other distantly related dsRNA viruses detected preferential insertion sites at which the complexity of the conserved α-helical core, made up of ancestral structural motifs that have acted as a skeleton, might have increased, leading to evolution of the highly varied current structures. Analyses of structural motifs only apparent after systematic structural comparisons indicated that the hallmark fold preserved in the dsRNA virus lineage shares a long (spinal) α-helix tangential to the capsid surface with the head-tailed phage and herpesvirus viral lineage. PMID:24821769
Small RNA Analysis in Sindbis Virus Infected Human HEK293 Cells
Dalmay, Tamas; Powell, Penny P.
2013-01-01
Introduction In contrast to the defence mechanism of RNA interference (RNAi) in plants and invertebrates, its role in the innate response to virus infection of mammals is a matter of debate. Since RNAi has a well-established role in controlling infection of the alphavirus Sindbis virus (SINV) in insects, we have used this virus to investigate the role of RNAi in SINV infection of human cells. Results SINV AR339 and TR339-GFP were adapted to grow in HEK293 cells. Deep sequencing of small RNAs (sRNAs) early in SINV infection (4 and 6 hpi) showed low abundance (0.8%) of viral sRNAs (vsRNAs), with no size, sequence or location specific patterns characteristic of Dicer products nor did they possess any discernible pattern to ascribe to a specific RNAi biogenesis pathway. This was supported by multiple variants for each sequence, and lack of hot spots along the viral genome sequence. The abundance of the best defined vsRNAs was below the limit of Northern blot detection. The adaptation of the virus to HEK293 cells showed little sequence changes compared to the reference; however, a SNP in E1 gene with a preference from G to C was found. Deep sequencing results showed little variation of expression of cellular microRNAs (miRNAs) at 4 and 6 hpi compared to uninfected cells. Twelve miRNAs exhibiting some minor differential expression by sequencing, showed no difference in expression by Northern blot analysis. Conclusions We show that, unlike SINV infection of invertebrates, generation of Dicer-dependent svRNAs and change in expression of cellular miRNAs were not detected as part of the Human response to SINV. PMID:24391886
Structure, sequence and expression of the hepatitis delta (δ) viral genome
NASA Astrophysics Data System (ADS)
Wang, Kang-Sheng; Choo, Qui-Lim; Weiner, Amy J.; Ou, Jing-Hsiung; Najarian, Richard C.; Thayer, Richard M.; Mullenbach, Guy T.; Denniston, Katherine J.; Gerin, John L.; Houghton, Michael
1986-10-01
Biochemical and electron microscopic data indicate that the human hepatitis δ viral agent contains a covalently closed circular and single-stranded RNA genome that has certain similarities with viroid-like agents from plants. The sequence of the viral genome (1,678 nucleotides) has been determined and an open reading frame within the complementary strand has been shown to encode an antigen that binds specifically to antisera from patients with chronic hepatitis δ viral infections.
Nouri, Shahideh; Salem, Nidá; Nigg, Jared C.
2015-01-01
ABSTRACT The Asian citrus psyllid, Diaphorina citri, is the natural vector of the causal agent of Huanglongbing (HLB), or citrus greening disease. Together; HLB and D. citri represent a major threat to world citrus production. As there is no cure for HLB, insect vector management is considered one strategy to help control the disease, and D. citri viruses might be useful. In this study, we used a metagenomic approach to analyze viral sequences associated with the global population of D. citri. By sequencing small RNAs and the transcriptome coupled with bioinformatics analysis, we showed that the virus-like sequences of D. citri are diverse. We identified novel viral sequences belonging to the picornavirus superfamily, the Reoviridae, Parvoviridae, and Bunyaviridae families, and an unclassified positive-sense single-stranded RNA virus. Moreover, a Wolbachia prophage-related sequence was identified. This is the first comprehensive survey to assess the viral community from worldwide populations of an agricultural insect pest. Our results provide valuable information on new putative viruses, some of which may have the potential to be used as biocontrol agents. IMPORTANCE Insects have the most species of all animals, and are hosts to, and vectors of, a great variety of known and unknown viruses. Some of these most likely have the potential to be important fundamental and/or practical resources. In this study, we used high-throughput next-generation sequencing (NGS) technology and bioinformatics analysis to identify putative viruses associated with Diaphorina citri, the Asian citrus psyllid. D. citri is the vector of the bacterium causing Huanglongbing (HLB), currently the most serious threat to citrus worldwide. Here, we report several novel viral sequences associated with D. citri. PMID:26676774
Nouri, Shahideh; Salem, Nidá; Nigg, Jared C; Falk, Bryce W
2015-12-16
The Asian citrus psyllid, Diaphorina citri, is the natural vector of the causal agent of Huanglongbing (HLB), or citrus greening disease. Together; HLB and D. citri represent a major threat to world citrus production. As there is no cure for HLB, insect vector management is considered one strategy to help control the disease, and D. citri viruses might be useful. In this study, we used a metagenomic approach to analyze viral sequences associated with the global population of D. citri. By sequencing small RNAs and the transcriptome coupled with bioinformatics analysis, we showed that the virus-like sequences of D. citri are diverse. We identified novel viral sequences belonging to the picornavirus superfamily, the Reoviridae, Parvoviridae, and Bunyaviridae families, and an unclassified positive-sense single-stranded RNA virus. Moreover, a Wolbachia prophage-related sequence was identified. This is the first comprehensive survey to assess the viral community from worldwide populations of an agricultural insect pest. Our results provide valuable information on new putative viruses, some of which may have the potential to be used as biocontrol agents. Insects have the most species of all animals, and are hosts to, and vectors of, a great variety of known and unknown viruses. Some of these most likely have the potential to be important fundamental and/or practical resources. In this study, we used high-throughput next-generation sequencing (NGS) technology and bioinformatics analysis to identify putative viruses associated with Diaphorina citri, the Asian citrus psyllid. D. citri is the vector of the bacterium causing Huanglongbing (HLB), currently the most serious threat to citrus worldwide. Here, we report several novel viral sequences associated with D. citri. Copyright © 2016, American Society for Microbiology. All Rights Reserved.
Barrero, Roberto A; Napier, Kathryn R; Cunnington, James; Liefting, Lia; Keenan, Sandi; Frampton, Rebekah A; Szabo, Tamas; Bulman, Simon; Hunter, Adam; Ward, Lisa; Whattam, Mark; Bellgard, Matthew I
2017-01-11
Detection and preventing entry of exotic viruses and viroids at the border is critical for protecting plant industries trade worldwide. Existing post entry quarantine screening protocols rely on time-consuming biological indicators and/or molecular assays that require knowledge of infecting viral pathogens. Plants have developed the ability to recognise and respond to viral infections through Dicer-like enzymes that cleave viral sequences into specific small RNA products. Many studies reported the use of a broad range of small RNAs encompassing the product sizes of several Dicer enzymes involved in distinct biological pathways. Here we optimise the assembly of viral sequences by using specific small RNA subsets. We sequenced the small RNA fractions of 21 plants held at quarantine glasshouse facilities in Australia and New Zealand. Benchmarking of several de novo assembler tools yielded SPAdes using a kmer of 19 to produce the best assembly outcomes. We also found that de novo assembly using 21-25 nt small RNAs can result in chimeric assemblies of viral sequences and plant host sequences. Such non-specific assemblies can be resolved by using 21-22 nt or 24 nt small RNAs subsets. Among the 21 selected samples, we identified contigs with sequence similarity to 18 viruses and 3 viroids in 13 samples. Most of the viruses were assembled using only 21-22 nt long virus-derived siRNAs (viRNAs), except for one Citrus endogenous pararetrovirus that was more efficiently assembled using 24 nt long viRNAs. All three viroids found in this study were fully assembled using either 21-22 nt or 24 nt viRNAs. Optimised analysis workflows were customised within the Yabi web-based analytical environment. We present a fully automated viral surveillance and diagnosis web-based bioinformatics toolkit that provides a flexible, user-friendly, robust and scalable interface for the discovery and diagnosis of viral pathogens. We have implemented an automated viral surveillance and diagnosis (VSD) bioinformatics toolkit that produces improved viruses and viroid sequence assemblies. The VSD toolkit provides several optimised and reusable workflows applicable to distinct viral pathogens. We envisage that this resource will facilitate the surveillance and diagnosis viral pathogens in plants, insects and invertebrates.
Li, Yan; Khalafalla, Abdelmalik Ibrahim; Paden, Clinton R; Yusof, Mohammed F; Eltahir, Yassir M; Al Hammadi, Zulaikha M; Tao, Ying; Queen, Krista; Hosani, Farida Al; Gerber, Susan I; Hall, Aron J; Al Muhairi, Salama; Tong, Suxiang
2017-01-01
Camels are known carriers for many viral pathogens, including Middle East respiratory syndrome coronavirus (MERS-CoV). It is likely that there are additional, as yet unidentified viruses in camels with the potential to cause disease in humans. In this study, we performed metagenomic sequencing analysis on nasopharyngeal swab samples from 108 MERS-CoV-positive dromedary camels from a live animal market in Abu Dhabi, United Arab Emirates. We obtained a total of 846.72 million high-quality reads from these nasopharyngeal swab samples, of which 2.88 million (0.34%) were related to viral sequences while 512.63 million (60.5%) and 50.87 million (6%) matched bacterial and eukaryotic sequences, respectively. Among the viral reads, sequences related to mammalian viruses from 13 genera in 10 viral families were identified, including Coronaviridae, Nairoviridae, Paramyxoviridae, Parvoviridae, Polyomaviridae, Papillomaviridae, Astroviridae, Picornaviridae, Poxviridae, and Genomoviridae. Some viral sequences belong to known camel or human viruses and others are from potentially novel camel viruses with only limited sequence similarity to virus sequences in GenBank. A total of five potentially novel virus species or strains were identified. Co-infection of at least two recently identified camel coronaviruses was detected in 92.6% of the camels in the study. This study provides a comprehensive survey of viruses in the virome of upper respiratory samples in camels that have extensive contact with the human population.
Investigating the viral ecology of global bee communities with high-throughput metagenomics.
Galbraith, David A; Fuller, Zachary L; Ray, Allyson M; Brockmann, Axel; Frazier, Maryann; Gikungu, Mary W; Martinez, J Francisco Iturralde; Kapheim, Karen M; Kerby, Jeffrey T; Kocher, Sarah D; Losyev, Oleksiy; Muli, Elliud; Patch, Harland M; Rosa, Cristina; Sakamoto, Joyce M; Stanley, Scott; Vaudo, Anthony D; Grozinger, Christina M
2018-06-11
Bee viral ecology is a fascinating emerging area of research: viruses exert a range of effects on their hosts, exacerbate impacts of other environmental stressors, and, importantly, are readily shared across multiple bee species in a community. However, our understanding of bee viral communities is limited, as it is primarily derived from studies of North American and European Apis mellifera populations. Here, we examined viruses in populations of A. mellifera and 11 other bee species from 9 countries, across 4 continents and Oceania. We developed a novel pipeline to rapidly and inexpensively screen for bee viruses. This pipeline includes purification of encapsulated RNA/DNA viruses, sequence-independent amplification, high throughput sequencing, integrated assembly of contigs, and filtering to identify contigs specifically corresponding to viral sequences. We identified sequences for (+)ssRNA, (-)ssRNA, dsRNA, and ssDNA viruses. Overall, we found 127 contigs corresponding to novel viruses (i.e. previously not observed in bees), with 27 represented by >0.1% of the reads in a given sample, and 7 contained an RdRp or replicase sequence which could be used for robust phylogenetic analysis. This study provides a sequence-independent pipeline for viral metagenomics analysis, and greatly expands our understanding of the diversity of viruses found in bee communities.
Unique variations of Epstein-Barr virus-encoded BARF1 gene in nasopharyngeal carcinoma biopsies.
Wang, Yun; Wang, Xiao-Feng; Sun, Zhi-Fu; Luo, Bing
2012-06-01
The Epstein-Barr virus (EBV) BamHI-A rightward frame 1 (BARF1) gene is frequently expressed in EBV-associated epithelial malignancies and involves in oncogenicity and immunomodulation. To characterize the variations of BARF1 gene in different populations, the sequences of BARF1 gene in Northern Chinese nasopharyngeal carcinoma (NPC), EBV-associated gastric carcinoma (EBVaGC) and healthy donors were analyzed. The correlation of BARF1 variation with polymorphisms of BamHI F fragment (type F and f variants) and EBV-coded viral interleukin-10 (vIL-10) gene (B95-8 and SPM patterns) was also explored. Two major subtypes of BARF1 gene, designated as B95-8 and V29A, were identified. B95-8 subtype had identical amino acid sequence to B95-8 and was the dominant subtype among the EBV isolates from Northern China. V29A subtype, with one consistent amino acid change at residue 29 (V→A) and several nucleotide changes, showed higher frequency in NPC cases (25.3%, 20/79) than in EBVaGC cases (0/45) or healthy donors (4.3%, 2/46) (NPC vs. EBVaGC: P=0.0001; NPC vs. healthy donor: P=0.004). A preferential linkage between BamHI F and BARF1/vIL-10 polymorphisms was found. Type f isolates was specially correlated with the V29A/SPM genotype in NPC isolates and type f/V29A/SPM was preferentially found in NPC. BARF1/c-fms homology domain, transforming domain and cytotoxic T lymphocyte (CTL) epitopes of BARF1 were highly conserved in most isolates, suggesting the important role of BARF1 in virus infection and the potential usefulness in EBV-targeting immunotherapy of EBV-associated tumors. The relatively higher prevalence of type f/V29A/SPM strains in NPC may also suggest the association between these variations in multiple viral genes and NPC. Copyright © 2012 Elsevier B.V. All rights reserved.
Shi, Liyu; Weng, Jianfeng; Liu, Changlin; Song, Xinyuan; Miao, Hongqin; Hao, Zhuanfang; Xie, Chuanxiao; Li, Mingshun; Zhang, Degui; Bai, Li; Pan, Guangtang; Li, Xinhai; Zhang, Shihuang
2013-04-01
Maize rough dwarf disease (MRDD, a viral disease) results in significant grain yield losses, while genetic basis of which is largely unknown. Based on comparative genomics, eukaryotic translation initiation factor 4E (eIF4E) was considered as a candidate gene for MRDD resistance, validation of which will help to understand the possible genetic mechanism of this disease. ZmeIF4E (orthologs of eIF4E gene in maize) encodes a protein of 218 amino acids, harboring five exons and no variation in the cDNA sequence is identified between the resistant inbred line, X178 and susceptible one, Ye478. ZmeIF4E expression was different in the two lines plants treated with three plant hormones, ethylene, salicylic acid, and jasmonates at V3 developmental stage, suggesting that ZmeIF4E is more likely to be involved in the regulation of defense gene expression and induction of local and systemic resistance. Moreover, four cis-acting elements related to plant defense responses, including DOFCOREZM, EECCRCAH1, GT1GAMSCAM4, and GT1CONSENSUS were detected in ZmeIF4E promoter for harboring sequence variation in the two lines. Association analysis with 163 inbred lines revealed that one SNP in EECCRCAH1 is significantly associated with CSI of MRDD in two environments, which explained 3.33 and 9.04 % of phenotypic variation, respectively. Meanwhile, one SNP in GT-1 motif was found to affect MRDD resistance only in one of the two environments, which explained 5.17 % of phenotypic variation. Collectively, regulatory motifs respectively harboring the two significant SNPs in ZmeIF4E promoter could be involved in the defense process of maize after viral infection. These results contribute to understand maize defense mechanisms against maize rough dwarf virus.
Immunity: Insect Immune Memory Goes Viral.
Ligoxygakis, Petros
2017-11-20
Adaptive memory in insect immunity has been controversial. In this issue, Andino and co-workers propose that acquisition of viral sequences in the host genome gives rise to anti-sense, anti-viral piRNAs. Such sequences can be regarded as both a genomic archive of past infections and as an armour of potential heritable memory. Copyright © 2017 Elsevier Ltd. All rights reserved.
Viral dark matter and virus–host interactions resolved from publicly available microbial genomes
Roux, Simon; Hallam, Steven J; Woyke, Tanja; Sullivan, Matthew B
2015-01-01
The ecological importance of viruses is now widely recognized, yet our limited knowledge of viral sequence space and virus–host interactions precludes accurate prediction of their roles and impacts. In this study, we mined publicly available bacterial and archaeal genomic data sets to identify 12,498 high-confidence viral genomes linked to their microbial hosts. These data augment public data sets 10-fold, provide first viral sequences for 13 new bacterial phyla including ecologically abundant phyla, and help taxonomically identify 7–38% of ‘unknown’ sequence space in viromes. Genome- and network-based classification was largely consistent with accepted viral taxonomy and suggested that (i) 264 new viral genera were identified (doubling known genera) and (ii) cross-taxon genomic recombination is limited. Further analyses provided empirical data on extrachromosomal prophages and coinfection prevalences, as well as evaluation of in silico virus–host linkage predictions. Together these findings illustrate the value of mining viral signal from microbial genomes. DOI: http://dx.doi.org/10.7554/eLife.08490.001 PMID:26200428
Probing the Structures of Viral RNA Regulatory Elements with SHAPE and Related Methodologies
Rausch, Jason W.; Sztuba-Solinska, Joanna; Le Grice, Stuart F. J.
2018-01-01
Viral RNAs were selected by evolution to possess maximum functionality in a minimal sequence. Depending on the classification of the virus and the type of RNA in question, viral RNAs must alternately be replicated, spliced, transcribed, transported from the nucleus into the cytoplasm, translated and/or packaged into nascent virions, and in most cases, provide the sequence and structural determinants to facilitate these processes. One consequence of this compact multifunctionality is that viral RNA structures can be exquisitely complex, often involving intermolecular interactions with RNA or protein, intramolecular interactions between sequence segments separated by several thousands of nucleotides, or specialized motifs such as pseudoknots or kissing loops. The fluidity of viral RNA structure can also present a challenge when attempting to characterize it, as genomic RNAs especially are likely to sample numerous conformations at various stages of the virus life cycle. Here we review advances in chemoenzymatic structure probing that have made it possible to address such challenges with respect to cis-acting elements, full-length viral genomes and long non-coding RNAs that play a major role in regulating viral gene expression. PMID:29375504
Rizk, Francine; Laverdure, Sylvain; d'Alençon, Emmanuelle; Bossin, Hervé; Dupressoir, Thierry
2018-01-01
The Lepidopteran ambidensovirus 1 isolated from Junonia coenia (hereafter JcDV) is an invertebrate parvovirus considered as a viral transduction vector as well as a potential tool for the biological control of insect pests. Previous works showed that JcDV-based circular plasmids experimentally integrate into insect cells genomic DNA. In order to approach the natural conditions of infection and possible integration, we generated linear JcDV- gfp based molecules which were transfected into non permissive Spodoptera frugiperda ( Sf9 ) cultured cells. Cells were monitored for the expression of green fluorescent protein (GFP) and DNA was analyzed for integration of transduced viral sequences. Non-structural protein modulation of the VP-gene cassette promoter activity was additionally assayed. We show that linear JcDV-derived molecules are capable of long term genomic integration and sustained transgene expression in Sf9 cells. As expected, only the deletion of both inverted terminal repeats (ITR) or the polyadenylation signals of NS and VP genes dramatically impairs the global transduction/expression efficiency. However, all the integrated viral sequences we characterized appear "scrambled" whatever the viral content of the transfected vector. Despite a strong GFP expression, we were unable to recover any full sequence of the original constructs and found rearranged viral and non-viral sequences as well. Cellular flanking sequences were identified as non-coding ones. On the other hand, the kinetics of GFP expression over time led us to investigate the apparent down-regulation by non-structural proteins of the VP-gene cassette promoter. Altogether, our results show that JcDV-derived sequences included in linear DNA molecules are able to drive efficiently the integration and expression of a foreign gene into the genome of insect cells, whatever their composition, provided that at least one ITR is present. However, the transfected sequences were extensively rearranged with cellular DNA during or after random integration in the host cell genome. Lastly, the non-structural proteins seem to participate in the regulation of p9 promoter activity rather than to the integration of viral sequences.
Ho, Daniel W H; Sze, Karen M F; Ng, Irene O L
2015-08-28
Viral integration into the human genome upon infection is an important risk factor for various human malignancies. We developed viral integration site detection tool called Virus-Clip, which makes use of information extracted from soft-clipped sequencing reads to identify exact positions of human and virus breakpoints of integration events. With initial read alignment to virus reference genome and streamlined procedures, Virus-Clip delivers a simple, fast and memory-efficient solution to viral integration site detection. Moreover, it can also automatically annotate the integration events with the corresponding affected human genes. Virus-Clip has been verified using whole-transcriptome sequencing data and its detection was validated to have satisfactory sensitivity and specificity. Marked advancement in performance was detected, compared to existing tools. It is applicable to versatile types of data including whole-genome sequencing, whole-transcriptome sequencing, and targeted sequencing. Virus-Clip is available at http://web.hku.hk/~dwhho/Virus-Clip.zip.
Borisenko, A S; Kotus, E V; Kaloshin, A A
2008-01-01
Significant number of scientific publications devoted to inhibition of viral replication by antisense RNA (asRNA) genes shows that this approach is useful for gene therapy of viral infections. To investigate the possibility of suppression of HTLV-1 virus reproduction by asRNA we constructed recombinant plasmids containing asRNA genes against U3 long terminal repeats region and X gene under the control of promoter of myeloproliferative sarcoma virus (MPSV) or without such promoter. Using stable calcium-phosphate transfection method with subsequent selection in the presence of G-418, RaHOS line-based cell clones carrying both asRNA genes and sequences able to bind HTLV-1 transactivator proteins (i.e. "traps" of viral transactivators, TVT) were obtained. Data from dot-hybridization analysis of viral RNA extracted from RaHOS cell clones showed that TVT sequences are able to suppress the viral RNA synthesis on 90% and asRNA against X gene synthesis--on 50%.
Metagenomic Analysis of Viral Communities in (Hado)Pelagic Sediments
Yoshida, Mitsuhiro; Takaki, Yoshihiro; Eitoku, Masamitsu; Nunoura, Takuro; Takai, Ken
2013-01-01
In this study, we analyzed viral metagenomes (viromes) in the sedimentary habitats of three geographically and geologically distinct (hado)pelagic environments in the northwest Pacific; the Izu-Ogasawara Trench (water depth = 9,760 m) (OG), the Challenger Deep in the Mariana Trench (10,325 m) (MA), and the forearc basin off the Shimokita Peninsula (1,181 m) (SH). Virus abundance ranged from 106 to 1011 viruses/cm3 of sediments (down to 30 cm below the seafloor [cmbsf]). We recovered viral DNA assemblages (viromes) from the (hado)pelagic sediment samples and obtained a total of 37,458, 39,882, and 70,882 sequence reads by 454 GS FLX Titanium pyrosequencing from the virome libraries of the OG, MA, and SH (hado)pelagic sediments, respectively. Only 24−30% of the sequence reads from each virome library exhibited significant similarities to the sequences deposited in the public nr protein database (E-value <10−3 in BLAST). Among the sequences identified as potential viral genes based on the BLAST search, 95−99% of the sequence reads in each library were related to genes from single-stranded DNA (ssDNA) viral families, including Microviridae, Circoviridae, and Geminiviridae. A relatively high abundance of sequences related to the genetic markers (major capsid protein [VP1] and replication protein [Rep]) of two ssDNA viral groups were also detected in these libraries, thereby revealing a high genotypic diversity of their viruses (833 genotypes for VP1 and 2,551 genotypes for Rep). A majority of the viral genes predicted from each library were classified into three ssDNA viral protein categories: Rep, VP1, and minor capsid protein. The deep-sea sedimentary viromes were distinct from the viromes obtained from the oceanic and fresh waters and marine eukaryotes, and thus, deep-sea sediments harbor novel viromes, including previously unidentified ssDNA viruses. PMID:23468952
Metagenomic analysis of viral communities in (hado)pelagic sediments.
Yoshida, Mitsuhiro; Takaki, Yoshihiro; Eitoku, Masamitsu; Nunoura, Takuro; Takai, Ken
2013-01-01
In this study, we analyzed viral metagenomes (viromes) in the sedimentary habitats of three geographically and geologically distinct (hado)pelagic environments in the northwest Pacific; the Izu-Ogasawara Trench (water depth = 9,760 m) (OG), the Challenger Deep in the Mariana Trench (10,325 m) (MA), and the forearc basin off the Shimokita Peninsula (1,181 m) (SH). Virus abundance ranged from 10(6) to 10(11) viruses/cm(3) of sediments (down to 30 cm below the seafloor [cmbsf]). We recovered viral DNA assemblages (viromes) from the (hado)pelagic sediment samples and obtained a total of 37,458, 39,882, and 70,882 sequence reads by 454 GS FLX Titanium pyrosequencing from the virome libraries of the OG, MA, and SH (hado)pelagic sediments, respectively. Only 24-30% of the sequence reads from each virome library exhibited significant similarities to the sequences deposited in the public nr protein database (E-value <10(-3) in BLAST). Among the sequences identified as potential viral genes based on the BLAST search, 95-99% of the sequence reads in each library were related to genes from single-stranded DNA (ssDNA) viral families, including Microviridae, Circoviridae, and Geminiviridae. A relatively high abundance of sequences related to the genetic markers (major capsid protein [VP1] and replication protein [Rep]) of two ssDNA viral groups were also detected in these libraries, thereby revealing a high genotypic diversity of their viruses (833 genotypes for VP1 and 2,551 genotypes for Rep). A majority of the viral genes predicted from each library were classified into three ssDNA viral protein categories: Rep, VP1, and minor capsid protein. The deep-sea sedimentary viromes were distinct from the viromes obtained from the oceanic and fresh waters and marine eukaryotes, and thus, deep-sea sediments harbor novel viromes, including previously unidentified ssDNA viruses.
Viral quasispecies inference from 454 pyrosequencing
2013-01-01
Background Many potentially life-threatening infectious viruses are highly mutable in nature. Characterizing the fittest variants within a quasispecies from infected patients is expected to allow unprecedented opportunities to investigate the relationship between quasispecies diversity and disease epidemiology. The advent of next-generation sequencing technologies has allowed the study of virus diversity with high-throughput sequencing, although these methods come with higher rates of errors which can artificially increase diversity. Results Here we introduce a novel computational approach that incorporates base quality scores from next-generation sequencers for reconstructing viral genome sequences that simultaneously infers the number of variants within a quasispecies that are present. Comparisons on simulated and clinical data on dengue virus suggest that the novel approach provides a more accurate inference of the underlying number of variants within the quasispecies, which is vital for clinical efforts in mapping the within-host viral diversity. Sequence alignments generated by our approach are also found to exhibit lower rates of error. Conclusions The ability to infer the viral quasispecies colony that is present within a human host provides the potential for a more accurate classification of the viral phenotype. Understanding the genomics of viruses will be relevant not just to studying how to control or even eradicate these viral infectious diseases, but also in learning about the innate protection in the human host against the viruses. PMID:24308284
Getchell, Rodman G.; Cornwell, Emily R.; Bogdanowicz, Steven; Andres, Jose; Batts, William N.; Kurath, Gael; Breyta, Rachel; Choi, Joanna G.; Farrell, John M.; Bowser, Paul R.
2017-01-01
Four viral hemorrhagic septicemia virus (VHSV) genotype IVb isolates were sequenced, their genetic variation explored, and comparative virulence assayed with experimental infections of northern pike Esox lucius fry. In addition to the type strain MI03, the complete 11183 bp genome of the first round goby Neogobius melanostomus isolate from the St. Lawrence River, and the 2013 and 2014 isolates from gizzard shad Dorosoma cepedianum die-offs in Irondequoit Bay, Lake Ontario and Dunkirk Harbor, Lake Erie were all deep sequenced on an Illumina platform. Mutations documented in the 11 yr since the MI03 index case from Lake St. Clair muskellunge Esox masquinongy showed 87 polymorphisms among the 4 isolates. Twenty-six mutations were non-synonymous and located at 18 different positions within the matrix protein, glycoprotein, non-virion protein, and RNA polymerase genes. The same 4 isolates were used to infect northern pike fry by a single 1 h bath exposure. Cumulative percent mortality varied from 42.5 to 62.5%. VHSV was detected in 57% (41/72) of the survivors at the end of the 21-d trial, suggesting that the virus was not rapidly cleared. Lesions were observed in many of the moribund and dead northern pike, such as hemorrhaging in the skin and fins, as well as hydrocephalus. Mean viral load measured from the trunk and visceral tissues of MI03-infected pike was significantly higher than the quantities detected in fish infected with the most recent isolates of genotype IVb, but there were no differences in cumulative mortality observed.
Laassri, Majid; Dragunsky, Eugenia; Enterline, Joan; Eremeeva, Tatiana; Ivanova, Olga; Lottenbach, Kathleen; Belshe, Robert; Chumakov, Konstantin
2005-01-01
Sabin strains of poliovirus used in the manufacture of oral poliovirus vaccine (OPV) are prone to genetic variations that occur during growth in cell cultures and the organisms of vaccine recipients. Such derivative viruses often have increased neurovirulence and transmissibility, and in some cases they can reestablish chains of transmission in human populations. Monitoring for vaccine-derived polioviruses is an important part of the worldwide campaign to eradicate poliomyelitis. Analysis of vaccine-derived polioviruses requires, as a first step, their isolation in cell cultures, which takes significant time and may yield viral stocks that are not fully representative of the strains present in the original sample. Here we demonstrate that full-length viral cDNA can be PCR amplified directly from stool samples and immediately subjected to genomic analysis by oligonucleotide microarray hybridization and nucleotide sequencing. Most fecal samples from healthy children who received OPV were found to contain variants of Sabin vaccine viruses. Sequence changes in the 5′ untranslated region were common, as were changes in the VP1-coding region, including changes in a major antigenic site. Analysis of stool samples taken from cases of acute flaccid paralysis revealed the presence of mixtures of recombinant polioviruses, in addition to the emergence of new sequence variants. Avoiding the need for cell culture isolation dramatically shortened the time needed for identification and analysis of vaccine-derived polioviruses and could be useful for preliminary screening of clinical samples. The amplified full-length viral cDNA can be archived and used to recover live virus for further virological studies. PMID:15956413
Amicarelli, Giulia; Adlerstein, Daniel; Shehi, Erlet; Wang, Fengfei; Makrigiorgos, G Mike
2006-10-01
Genotyping methods that reveal single-nucleotide differences are useful for a wide range of applications. We used digestion of 3-way DNA junctions in a novel technology, OneCutEventAmplificatioN (OCEAN) that allows sequence-specific signal generation and amplification. We combined OCEAN with peptide-nucleic-acid (PNA)-based variant enrichment to detect and simultaneously genotype v-Ki-ras2 Kirsten rat sarcoma viral oncogene homolog (KRAS) codon 12 sequence variants in human tissue specimens. We analyzed KRAS codon 12 sequence variants in 106 lung cancer surgical specimens. We conducted a PNA-PCR reaction that suppresses wild-type KRAS amplification and genotyped the product with a set of OCEAN reactions carried out in fluorescence microplate format. The isothermal OCEAN assay enabled a 3-way DNA junction to form between the specific target nucleic acid, a fluorescently labeled "amplifier", and an "anchor". The amplifier-anchor contact contains the recognition site for a restriction enzyme. Digestion produces a cleaved amplifier and generation of a fluorescent signal. The cleaved amplifier dissociates from the 3-way DNA junction, allowing a new amplifier to bind and propagate the reaction. The system detected and genotyped KRAS sequence variants down to approximately 0.3% variant-to-wild-type alleles. PNA-PCR/OCEAN had a concordance rate with PNA-PCR/sequencing of 93% to 98%, depending on the exact implementation. Concordance rate with restriction endonuclease-mediated selective-PCR/sequencing was 89%. OCEAN is a practical and low-cost novel technology for sequence-specific signal generation. Reliable analysis of KRAS sequence alterations in human specimens circumvents the requirement for sequencing. Application is expected in genotyping KRAS codon 12 sequence variants in surgical specimens or in bodily fluids, as well as single-base variations and sequence alterations in other genes.
Marston, D A; McElhinney, L M; Johnson, N; Müller, T; Conzelmann, K K; Tordo, N; Fooks, A R
2007-04-01
We report the first full-length genomic sequences for European bat lyssavirus type-1 (EBLV-1) and type-2 (EBLV-2). The EBLV-1 genomic sequence was derived from a virus isolated from a serotine bat in Hamburg, Germany, in 1968 and the EBLV-2 sequence was derived from a virus isolate from a human case of rabies that occurred in Scotland in 2002. A long-distance PCR strategy was used to amplify the open reading frames (ORFs), followed by standard and modified RACE (rapid amplification of cDNA ends) techniques to amplify the 3' and 5' ends. The lengths of each complete viral genome for EBLV-1 and EBLV-2 were 11 966 and 11 930 base pairs, respectively, and follow the standard rhabdovirus genome organization of five viral proteins. Comparison with other lyssavirus sequences demonstrates variation in degrees of homology, with the genomic termini showing a high degree of complementarity. The nucleoprotein was the most conserved, both intra- and intergenotypically, followed by the polymerase (L), matrix and glyco- proteins, with the phosphoprotein being the most variable. In addition, we have shown that the two EBLVs utilize a conserved transcription termination and polyadenylation (TTP) motif, approximately 50 nt upstream of the L gene start codon. All available lyssavirus sequences to date, with the exception of Pasteur virus (PV) and PV-derived isolates, use the second TTP site. This observation may explain differences in pathogenicity between lyssavirus strains, dependent on the length of the untranslated region, which might affect transcriptional activity and RNA stability.
The Spleen Is an HIV-1 Sanctuary During Combined Antiretroviral Therapy.
Nolan, David J; Rose, Rebecca; Rodriguez, Patricia H; Salemi, Marco; Singer, Elyse J; Lamers, Susanna L; McGrath, Michael S
2018-01-01
Combined antiretroviral therapy (cART) does not eradicate HIV, which persists for years and can re-establish replication if treatment is stopped. The current challenge is identifying those tissues harboring virus through cART. Here, we used HIV env-nef single genome sequencing and HIV gag droplet digital PCR (ddPCR) to survey 50 tissues from five subjects on cART with no detectable plasma viral load at death. The spleen most consistently contained multiple proviral and expressed sequences (4/5 participants). Spleen-derived HIV demonstrated two distinct phylogenetic patterns: multiple identical sequences, often from different tissues, as well as diverse viral sequences on long terminal branches. Our results suggested that ddPCR may overestimate the size of the tissue-based viral reservoir. The spleen, a lymphatic organ at the intersection of the immune and circulatory systems, may play a key role in viral persistence.
Emergence of a Novel Avian Pox Disease in British Tit Species
Lawson, Becki; Lachish, Shelly; Colvile, Katie M.; Durrant, Chris; Peck, Kirsi M.; Toms, Mike P.; Sheldon, Ben C.; Cunningham, Andrew A.
2012-01-01
Avian pox is a viral disease with a wide host range. In Great Britain, avian pox in birds of the Paridae family was first diagnosed in a great tit (Parus major) from south-east England in 2006. An increasing number of avian pox incidents in Paridae have been reported each year since, indicative of an emergent infection. Here, we utilise a database of opportunistic reports of garden bird mortality and morbidity to analyse spatial and temporal patterns of suspected avian pox throughout Great Britain, 2006–2010. Reports of affected Paridae (211 incidents) outnumbered reports in non-Paridae (91 incidents). The majority (90%) of Paridae incidents involved great tits. Paridae pox incidents were more likely to involve multiple individuals (77.3%) than were incidents in non-Paridae hosts (31.9%). Unlike the small wart-like lesions usually seen in non-Paridae with avian pox in Great Britain, lesions in Paridae were frequently large, often with an ulcerated surface and caseous core. Spatial analyses revealed strong clustering of suspected avian pox incidents involving Paridae hosts, but only weak, inconsistent clustering of incidents involving non-Paridae hosts. There was no spatial association between Paridae and non-Paridae incidents. We documented significant spatial spread of Paridae pox from an origin in south-east England; no spatial spread was evident for non-Paridae pox. For both host clades, there was an annual peak of reports in August/September. Sequencing of the avian poxvirus 4b core protein produced an identical viral sequence from each of 20 great tits tested from Great Britain. This sequence was identical to that from great tits from central Europe and Scandinavia. In contrast, sequence variation was evident amongst virus tested from 17 non-Paridae hosts of 5 species. Our findings show Paridae pox to be an emerging infectious disease in wild birds in Great Britain, apparently originating from viral incursion from central Europe or Scandinavia. PMID:23185231
Emergence of a novel avian pox disease in British tit species.
Lawson, Becki; Lachish, Shelly; Colvile, Katie M; Durrant, Chris; Peck, Kirsi M; Toms, Mike P; Sheldon, Ben C; Cunningham, Andrew A
2012-01-01
Avian pox is a viral disease with a wide host range. In Great Britain, avian pox in birds of the Paridae family was first diagnosed in a great tit (Parus major) from south-east England in 2006. An increasing number of avian pox incidents in Paridae have been reported each year since, indicative of an emergent infection. Here, we utilise a database of opportunistic reports of garden bird mortality and morbidity to analyse spatial and temporal patterns of suspected avian pox throughout Great Britain, 2006-2010. Reports of affected Paridae (211 incidents) outnumbered reports in non-Paridae (91 incidents). The majority (90%) of Paridae incidents involved great tits. Paridae pox incidents were more likely to involve multiple individuals (77.3%) than were incidents in non-Paridae hosts (31.9%). Unlike the small wart-like lesions usually seen in non-Paridae with avian pox in Great Britain, lesions in Paridae were frequently large, often with an ulcerated surface and caseous core. Spatial analyses revealed strong clustering of suspected avian pox incidents involving Paridae hosts, but only weak, inconsistent clustering of incidents involving non-Paridae hosts. There was no spatial association between Paridae and non-Paridae incidents. We documented significant spatial spread of Paridae pox from an origin in south-east England; no spatial spread was evident for non-Paridae pox. For both host clades, there was an annual peak of reports in August/September. Sequencing of the avian poxvirus 4b core protein produced an identical viral sequence from each of 20 great tits tested from Great Britain. This sequence was identical to that from great tits from central Europe and Scandinavia. In contrast, sequence variation was evident amongst virus tested from 17 non-Paridae hosts of 5 species. Our findings show Paridae pox to be an emerging infectious disease in wild birds in Great Britain, apparently originating from viral incursion from central Europe or Scandinavia.
2013-01-01
Background Deep sequencing of viruses isolated from infected hosts is an efficient way to measure population-genetic variation and can reveal patterns of dispersal and natural selection. In this study, we mined existing Illumina sequence reads to investigate single-nucleotide polymorphisms (SNPs) within two RNA viruses of the Western honey bee (Apis mellifera), deformed wing virus (DWV) and Israel acute paralysis virus (IAPV). All viral RNA was extracted from North American samples of honey bees or, in one case, the ectoparasitic mite Varroa destructor. Results Coverage depth was generally lower for IAPV than DWV, and marked gaps in coverage occurred in several narrow regions (< 50 bp) of IAPV. These coverage gaps occurred across sequencing runs and were virtually unchanged when reads were re-mapped with greater permissiveness (up to 8% divergence), suggesting a recurrent sequencing artifact rather than strain divergence. Consensus sequences of DWV for each sample showed little phylogenetic divergence, low nucleotide diversity, and strongly negative values of Fu and Li’s D statistic, suggesting a recent population bottleneck and/or purifying selection. The Kakugo strain of DWV fell outside of all other DWV sequences at 100% bootstrap support. IAPV consensus sequences supported the existence of multiple clades as had been previously reported, and Fu and Li’s D was closer to neutral expectation overall, although a sliding-window analysis identified a significantly positive D within the protease region, suggesting selection maintains diversity in that region. Within-sample mean diversity was comparable between the two viruses on average, although for both viruses there was substantial variation among samples in mean diversity at third codon positions and in the number of high-diversity sites. FST values were bimodal for DWV, likely reflecting neutral divergence in two low-diversity populations, whereas IAPV had several sites that were strong outliers with very low FST. Conclusions This initial survey of genetic variation within honey bee RNA viruses suggests future directions for studies examining the underlying causes of population-genetic structure in these economically important pathogens. PMID:23497218
Cornman, Robert Scott; Boncristiani, Humberto; Dainat, Benjamin; Chen, Yanping; vanEngelsdorp, Dennis; Weaver, Daniel; Evans, Jay D
2013-03-07
Deep sequencing of viruses isolated from infected hosts is an efficient way to measure population-genetic variation and can reveal patterns of dispersal and natural selection. In this study, we mined existing Illumina sequence reads to investigate single-nucleotide polymorphisms (SNPs) within two RNA viruses of the Western honey bee (Apis mellifera), deformed wing virus (DWV) and Israel acute paralysis virus (IAPV). All viral RNA was extracted from North American samples of honey bees or, in one case, the ectoparasitic mite Varroa destructor. Coverage depth was generally lower for IAPV than DWV, and marked gaps in coverage occurred in several narrow regions (< 50 bp) of IAPV. These coverage gaps occurred across sequencing runs and were virtually unchanged when reads were re-mapped with greater permissiveness (up to 8% divergence), suggesting a recurrent sequencing artifact rather than strain divergence. Consensus sequences of DWV for each sample showed little phylogenetic divergence, low nucleotide diversity, and strongly negative values of Fu and Li's D statistic, suggesting a recent population bottleneck and/or purifying selection. The Kakugo strain of DWV fell outside of all other DWV sequences at 100% bootstrap support. IAPV consensus sequences supported the existence of multiple clades as had been previously reported, and Fu and Li's D was closer to neutral expectation overall, although a sliding-window analysis identified a significantly positive D within the protease region, suggesting selection maintains diversity in that region. Within-sample mean diversity was comparable between the two viruses on average, although for both viruses there was substantial variation among samples in mean diversity at third codon positions and in the number of high-diversity sites. FST values were bimodal for DWV, likely reflecting neutral divergence in two low-diversity populations, whereas IAPV had several sites that were strong outliers with very low FST. This initial survey of genetic variation within honey bee RNA viruses suggests future directions for studies examining the underlying causes of population-genetic structure in these economically important pathogens.
Piontkivska, Helen; Matos, Luis F; Paul, Sinu; Scharfenberg, Brian; Farmerie, William G; Miyamoto, Michael M; Wayne, Marta L
2016-10-05
Sigma virus (DMelSV) is ubiquitous in natural populations of Drosophila melanogaster. Host-mediated, selective RNA editing of adenosines to inosines (ADAR) may contribute to control of viral infection by preventing transcripts from being transported into the cytoplasm or being translated accurately; or by increasing the viral genomic mutation rate. Previous PCR-based studies showed that ADAR mutations occur in DMelSV at low frequency. Here we use SOLiD TM deep sequencing of flies from a single host population from Athens, GA, USA to comprehensively evaluate patterns of sequence variation in DMelSV with respect to ADAR. GA dinucleotides, which are weak targets of ADAR, are strongly overrepresented in the positive strand of the virus, consistent with selection to generate ADAR resistance on this complement of the transient, double-stranded RNA intermediate in replication and transcription. Potential ADAR sites in a worldwide sample of viruses are more likely to be "resistant" if the sites do not vary among samples. Either variable sites are less constrained and hence are subject to weaker selection than conserved sites, or the variation is driven by ADAR. We also find evidence of mutations segregating within hosts, hereafter referred to as hypervariable sites. Some of these sites were variable only in one or two flies (i.e., rare); others were shared by four or even all five of the flies (i.e., common). Rare and common hypervariable sites were indistinguishable with respect to susceptibility to ADAR; however, polymorphism in rare sites were more likely to be consistent with the action of ADAR than in common ones, again suggesting that ADAR is deleterious to the virus. Thus, in DMelSV, host mutagenesis is constraining viral evolution both within and between hosts. © The Author 2016. Published by Oxford University Press on behalf of the Society for Molecular Biology and Evolution.
Murphy, B; Hillman, C; McDonnel, S
2014-01-22
Feline immunodeficiency virus (FIV)-infected cats enter a clinically asymptomatic phase during chronic infection. Despite the lack of overt clinical disease, the asymptomatic phase is characterized by persistent immunologic impairment. In the peripheral blood obtained from cats experimentally infected with FIV-C for approximately 5 years, we identified a persistent inversion of the CD4/CD8 ratio. We cloned and sequenced the FIV-C long terminal repeat containing the viral promoter from cells infected with the inoculating virus and from in vivo-derived peripheral blood mononuclear cells and CD4 T cells isolated at multiple time points throughout the asymptomatic phase. Relative to the inoculating virus, viral sequences amplified from cells isolated from all of the infected animals demonstrated multiple single nucleotide mutations and a short deletion within the viral U3, R and U5 regions. A transcriptionally inactivating proviral mutation in the U3 promoter AP-1 site was identified at multiple time points from all of the infected animals but not within cell-associated viral RNA. In contrast, no mutations were identified within the sequence of the viral dUTPase gene amplified from PBMC isolated at approximately 5 years post-infection relative to the inoculating sequence. The possible implications of these mutations to viral pathogenesis are discussed. Copyright © 2013 Elsevier B.V. All rights reserved.
Palma, Paolo; Zangari, Paola; Alteri, Claudia; Tchidjou, Hyppolite K; Manno, Emma Concetta; Liuzzi, Giuseppina; Perno, Carlo Federico; Rossi, Paolo; Bertoli, Ada; Bernardi, Stefania
2016-12-09
HIV genetic diversity implicates major challenges for the control of viral infection by the immune system and for the identification of an effective immunotherapeutic strategy. With the present case report we underline as HIV evolution could be effectively halted by early antiretroviral treatment (eART). Few cases supported this evidence due to the difficulty of performing amplification and sequencing analysis in long-term viral suppressed patients. Here, we reported the case of limited HIV-1 viral evolution over time in a successful early treated child. A perinatally HIV-1 infected infant was treated within 7 weeks of age with zidovudine, lamivudine, nevirapine and lopinavir/ritonavir. At antiretroviral treatment (ART) initiation HIV-1 viral load (VL) and CD4 percentage were >500,000 copies/ml and 35%, respectively. Plasma genotypic resistance test showed a wild-type virus. The child reached VL undetectability after 33 weeks of combination antiretroviral therapy (cART) since he maintained a stable VL <40copies/ml. After 116 weeks on ART we were able to perform amplification and sequencing assay on the plasma virus. At this time VL was <40 copies/ml and CD4 percentage was 40%. Again the genotypic resistance test revealed a wild-type virus. The phylogenetic analysis performed on the HIV-1 pol sequences of the mother and the child revealed that sequences clustered with C subtype reference strains and formed a monophyletic cluster distinct from the other C sequences included in the analysis (bootstrap value >90%). Any major evolutionary divergence was detected. eART limits the viral evolution avoiding the emergence of new viral variants. This result may have important implications in host immune control and may sustain the challenge search of new personalized immunotherapeutic approaches to achieve a prolonged viral remission.
Squires, R Burke; Pickett, Brett E; Das, Sajal; Scheuermann, Richard H
2014-12-01
In 2009 a novel pandemic H1N1 influenza virus (H1N1pdm09) emerged as the first official influenza pandemic of the 21st century. Early genomic sequence analysis pointed to the swine origin of the virus. Here we report a novel computational approach to determine the evolutionary trajectory of viral sequences that uses data-driven estimations of nucleotide substitution rates to track the gradual accumulation of observed sequence alterations over time. Phylogenetic analysis and multiple sequence alignments show that sequences belonging to the resulting evolutionary trajectory of the H1N1pdm09 lineage exhibit a gradual accumulation of sequence variations and tight temporal correlations in the topological structure of the phylogenetic trees. These results suggest that our evolutionary trajectory analysis (ETA) can more effectively pinpoint the evolutionary history of viruses, including the host and geographical location traversed by each segment, when compared against either BLAST or traditional phylogenetic analysis alone. Copyright © 2014 Elsevier B.V. All rights reserved.
Gimenez, Magalí Diana; Yañez-Santos, Anahí Mara; Paz, Rosalía Cristina; Quiroga, Mariana Paola; Marfil, Carlos Federico; Conci, Vilma Cecilia; García-Lampasona, Sandra Claudia
2016-01-01
This is the first report assessing epigenetic variation in garlic. High genetic and epigenetic polymorphism during in vitro culture was detected.Sequencing of MSAP fragments revealed homology with ESTs. Garlic (Allium sativum) is a worldwide crop of economic importance susceptible to viral infections that can cause significant yield losses. Meristem tissue culture is the most employed method to sanitize elite cultivars.Often the virus-free garlic plants obtained are multiplied in vitro (micro propagation). However, it was reported that micro-propagation frequently produces somaclonal variation at the phenotypic level, which is an undesirable trait when breeders are seeking to maintain varietal stability. We employed amplification fragment length polymorphism and methylation sensitive amplified polymorphism (MSAP) methodologies to assess genetic and epigenetic modifications in two culture systems: virus-free plants obtained by meristem culture followed by in vitro multiplication and field culture. Our results suggest that garlic exhibits genetic and epigenetic polymorphism under field growing conditions. However, during in vitro culture system both kinds of polymorphisms intensify indicating that this system induces somaclonal variation. Furthermore, while genetic changes accumulated along the time of in vitro culture, epigenetic polymorphism reached the major variation at 6 months and then stabilize, being demethylation and CG methylation the principal conversions.Cloning and sequencing differentially methylated MSAP fragments allowed us to identify coding and unknown sequences of A. sativum, including sequences belonging to LTR Gypsy retrotransposons. Together, our results highlight that main changes occur in the initial 6 months of micro propagation. For the best of our knowledge, this is the first report on epigenetic assessment in garlic.
Eltahir, Yassir M.; Al Hammadi, Zulaikha M.; Tao, Ying; Queen, Krista; Hosani, Farida Al; Gerber, Susan I.; Hall, Aron J.; Al Muhairi, Salama
2017-01-01
Camels are known carriers for many viral pathogens, including Middle East respiratory syndrome coronavirus (MERS-CoV). It is likely that there are additional, as yet unidentified viruses in camels with the potential to cause disease in humans. In this study, we performed metagenomic sequencing analysis on nasopharyngeal swab samples from 108 MERS-CoV-positive dromedary camels from a live animal market in Abu Dhabi, United Arab Emirates. We obtained a total of 846.72 million high-quality reads from these nasopharyngeal swab samples, of which 2.88 million (0.34%) were related to viral sequences while 512.63 million (60.5%) and 50.87 million (6%) matched bacterial and eukaryotic sequences, respectively. Among the viral reads, sequences related to mammalian viruses from 13 genera in 10 viral families were identified, including Coronaviridae, Nairoviridae, Paramyxoviridae, Parvoviridae, Polyomaviridae, Papillomaviridae, Astroviridae, Picornaviridae, Poxviridae, and Genomoviridae. Some viral sequences belong to known camel or human viruses and others are from potentially novel camel viruses with only limited sequence similarity to virus sequences in GenBank. A total of five potentially novel virus species or strains were identified. Co-infection of at least two recently identified camel coronaviruses was detected in 92.6% of the camels in the study. This study provides a comprehensive survey of viruses in the virome of upper respiratory samples in camels that have extensive contact with the human population. PMID:28902913
Viral Genome DataBase: storing and analyzing genes and proteins from complete viral genomes.
Hiscock, D; Upton, C
2000-05-01
The Viral Genome DataBase (VGDB) contains detailed information of the genes and predicted protein sequences from 15 completely sequenced genomes of large (&100 kb) viruses (2847 genes). The data that is stored includes DNA sequence, protein sequence, GenBank and user-entered notes, molecular weight (MW), isoelectric point (pI), amino acid content, A + T%, nucleotide frequency, dinucleotide frequency and codon use. The VGDB is a mySQL database with a user-friendly JAVA GUI. Results of queries can be easily sorted by any of the individual parameters. The software and additional figures and information are available at http://athena.bioc.uvic.ca/genomes/index.html .
Hou, Weiguo; Wang, Shang; Briggs, Brandon R; Li, Gaoyuan; Xie, Wei; Dong, Hailiang
2018-01-01
Myocyanophages, a group of viruses infecting cyanobacteria, are abundant and play important roles in elemental cycling. Here we investigated the particle-associated viral communities retained on 0.2 μm filters and in sediment samples (representing ancient cyanophage communities) from four ocean and three lake locations, using high-throughput sequencing and a newly designed primer pair targeting a gene fragment (∼145-bp in length) encoding the cyanophage gp23 major capsid protein (MCP). Diverse viral communities were detected in all samples. The fragments of 142-, 145-, and 148-bp in length were most abundant in the amplicons, and most sequences (>92%) belonged to cyanophages. Additionally, different sequencing depths resulted in different diversity estimates of the viral community. Operational taxonomic units obtained from deep sequencing of the MCP gene covered the majority of those obtained from shallow sequencing, suggesting that deep sequencing exhibited a more complete picture of cyanophage community than shallow sequencing. Our results also revealed a wide geographic distribution of marine myocyanophages, i.e., higher dissimilarities of the myocyanophage communities corresponded with the larger distances between the sampling sites. Collectively, this study suggests that the newly designed primer pair can be effectively used to study the community and diversity of myocyanophage from different environments, and the high-throughput sequencing represents a good method to understand viral diversity.
Hou, Weiguo; Wang, Shang; Briggs, Brandon R.; Li, Gaoyuan; Xie, Wei; Dong, Hailiang
2018-01-01
Myocyanophages, a group of viruses infecting cyanobacteria, are abundant and play important roles in elemental cycling. Here we investigated the particle-associated viral communities retained on 0.2 μm filters and in sediment samples (representing ancient cyanophage communities) from four ocean and three lake locations, using high-throughput sequencing and a newly designed primer pair targeting a gene fragment (∼145-bp in length) encoding the cyanophage gp23 major capsid protein (MCP). Diverse viral communities were detected in all samples. The fragments of 142-, 145-, and 148-bp in length were most abundant in the amplicons, and most sequences (>92%) belonged to cyanophages. Additionally, different sequencing depths resulted in different diversity estimates of the viral community. Operational taxonomic units obtained from deep sequencing of the MCP gene covered the majority of those obtained from shallow sequencing, suggesting that deep sequencing exhibited a more complete picture of cyanophage community than shallow sequencing. Our results also revealed a wide geographic distribution of marine myocyanophages, i.e., higher dissimilarities of the myocyanophage communities corresponded with the larger distances between the sampling sites. Collectively, this study suggests that the newly designed primer pair can be effectively used to study the community and diversity of myocyanophage from different environments, and the high-throughput sequencing represents a good method to understand viral diversity.
Smith, Richard H; Hallwirth, Claus V; Westerman, Michael; Hetherington, Nicola A; Tseng, Yu-Shan; Cecchini, Sylvain; Virag, Tamas; Ziegler, Mona-Larissa; Rogozin, Igor B; Koonin, Eugene V; Agbandje-McKenna, Mavis; Kotin, Robert M; Alexander, Ian E
2016-07-05
Germline endogenous viral elements (EVEs) genetically preserve viral nucleotide sequences useful to the study of viral evolution, gene mutation, and the phylogenetic relationships among host organisms. Here, we describe a lineage-specific, adeno-associated virus (AAV)-derived endogenous viral element (mAAV-EVE1) found within the germline of numerous closely related marsupial species. Molecular screening of a marsupial DNA panel indicated that mAAV-EVE1 occurs specifically within the marsupial suborder Macropodiformes (present-day kangaroos, wallabies, and related macropodoids), to the exclusion of other Diprotodontian lineages. Orthologous mAAV-EVE1 locus sequences from sixteen macropodoid species, representing a speciation history spanning an estimated 30 million years, facilitated compilation of an inferred ancestral sequence that recapitulates the genome of an ancient marsupial AAV that circulated among Australian metatherian fauna sometime during the late Eocene to early Oligocene. In silico gene reconstruction and molecular modelling indicate remarkable conservation of viral structure over a geologic timescale. Characterisation of AAV-EVE loci among disparate species affords insight into AAV evolution and, in the case of macropodoid species, may offer an additional genetic basis for assignment of phylogenetic relationships among the Macropodoidea. From an applied perspective, the identified AAV "fossils" provide novel capsid sequences for use in translational research and clinical applications.
Broad Surveys of DNA Viral Diversity Obtained through Viral Metagenomics of Mosquitoes
Ng, Terry Fei Fan; Willner, Dana L.; Lim, Yan Wei; Schmieder, Robert; Chau, Betty; Nilsson, Christina; Anthony, Simon; Ruan, Yijun; Rohwer, Forest; Breitbart, Mya
2011-01-01
Viruses are the most abundant and diverse genetic entities on Earth; however, broad surveys of viral diversity are hindered by the lack of a universal assay for viruses and the inability to sample a sufficient number of individual hosts. This study utilized vector-enabled metagenomics (VEM) to provide a snapshot of the diversity of DNA viruses present in three mosquito samples from San Diego, California. The majority of the sequences were novel, suggesting that the viral community in mosquitoes, as well as the animal and plant hosts they feed on, is highly diverse and largely uncharacterized. Each mosquito sample contained a distinct viral community. The mosquito viromes contained sequences related to a broad range of animal, plant, insect and bacterial viruses. Animal viruses identified included anelloviruses, circoviruses, herpesviruses, poxviruses, and papillomaviruses, which mosquitoes may have obtained from vertebrate hosts during blood feeding. Notably, sequences related to human papillomaviruses were identified in one of the mosquito samples. Sequences similar to plant viruses were identified in all mosquito viromes, which were potentially acquired through feeding on plant nectar. Numerous bacteriophages and insect viruses were also detected, including a novel densovirus likely infecting Culex erythrothorax. Through sampling insect vectors, VEM enables broad survey of viral diversity and has significantly increased our knowledge of the DNA viruses present in mosquitoes. PMID:21674005
Fernandez-Cassi, X; Timoneda, N; Gonzales-Gustavson, E; Abril, J F; Bofill-Mas, S; Girones, R
2017-09-18
Microbial food-borne diseases are still frequently reported despite the implementation of microbial quality legislation to improve food safety. Among all the microbial agents, viruses are the most important causative agents of food-borne outbreaks. The development and application of a new generation of sequencing techniques to test for viral contaminants in fresh produce is an unexplored field that allows for the study of the viral populations that might be transmitted by the fecal-oral route through the consumption of contaminated food. To advance this promising field, parsley was planted and grown under controlled conditions and irrigated using contaminated river water. Viruses polluting the irrigation water and the parsley leaves were studied by using metagenomics. To address possible contamination due to sample manipulation, library preparation, and other sources, parsley plants irrigated with nutritive solution were used as a negative control. In parallel, viruses present in the river water used for plant irrigation were analyzed using the same methodology. It was possible to assign viral taxons from 2.4 to 74.88% of the total reads sequenced depending on the sample. Most of the viral reads detected in the river water were related to the plant viral families Tymoviridae (66.13%) and Virgaviridae (14.45%) and the phage viral families Myoviridae (5.70%), Siphoviridae (5.06%), and Microviridae (2.89%). Less than 1% of the viral reads were related to viral families that infect humans, including members of the Adenoviridae, Reoviridae, Picornaviridae and Astroviridae families. On the surface of the parsley plants, most of the viral reads that were detected were assigned to the Dicistroviridae family (41.52%). Sequences related to important viral pathogens, such as the hepatitis E virus, several picornaviruses from species A and B as well as human sapoviruses and GIV noroviruses were detected. The high diversity of viral sequences found in the parsley plants suggests that irrigation on fecally-tainted food may have a role in the transmission of a wide diversity of viral families. This finding reinforces the idea that the best way to avoid food-borne viral diseases is to introduce good field irrigation and production practices. New strains have been identified that are related to the Picornaviridae and distantly related to the Hepeviridae family. However, the detection of a viral genome alone does not necessarily indicate there is a risk of infection or disease development. Thus, further investigation is crucial for correlating the detection of viral metagenomes in samples with the risk of infection. There is also an urgent need to develop new methods to improve the sensitivity of current Next Generation Sequencing (NGS) techniques in the food safety area. Copyright © 2017 Elsevier B.V. All rights reserved.
Computational clustering for viral reference proteomes
Chen, Chuming; Huang, Hongzhan; Mazumder, Raja; Natale, Darren A.; McGarvey, Peter B.; Zhang, Jian; Polson, Shawn W.; Wang, Yuqi; Wu, Cathy H.
2016-01-01
Motivation: The enormous number of redundant sequenced genomes has hindered efforts to analyze and functionally annotate proteins. As the taxonomy of viruses is not uniformly defined, viral proteomes pose special challenges in this regard. Grouping viruses based on the similarity of their proteins at proteome scale can normalize against potential taxonomic nomenclature anomalies. Results: We present Viral Reference Proteomes (Viral RPs), which are computed from complete virus proteomes within UniProtKB. Viral RPs based on 95, 75, 55, 35 and 15% co-membership in proteome similarity based clusters are provided. Comparison of our computational Viral RPs with UniProt’s curator-selected Reference Proteomes indicates that the two sets are consistent and complementary. Furthermore, each Viral RP represents a cluster of virus proteomes that was consistent with virus or host taxonomy. We provide BLASTP search and FTP download of Viral RP protein sequences, and a browser to facilitate the visualization of Viral RPs. Availability and implementation: http://proteininformationresource.org/rps/viruses/ Contact: chenc@udel.edu Supplementary information: Supplementary data are available at Bioinformatics online. PMID:27153712
Genome sequence diversity and clues to the evolution of variola (smallpox) virus.
Esposito, Joseph J; Sammons, Scott A; Frace, A Michael; Osborne, John D; Olsen-Rasmussen, Melissa; Zhang, Ming; Govil, Dhwani; Damon, Inger K; Kline, Richard; Laker, Miriam; Li, Yu; Smith, Geoffrey L; Meyer, Hermann; Leduc, James W; Wohlhueter, Robert M
2006-08-11
Comparative genomics of 45 epidemiologically varied variola virus isolates from the past 30 years of the smallpox era indicate low sequence diversity, suggesting that there is probably little difference in the isolates' functional gene content. Phylogenetic clustering inferred three clades coincident with their geographical origin and case-fatality rate; the latter implicated putative proteins that mediate viral virulence differences. Analysis of the viral linear DNA genome suggests that its evolution involved direct descent and DNA end-region recombination events. Knowing the sequences will help understand the viral proteome and improve diagnostic test precision, therapeutics, and systems for their assessment.
Purcell, Maureen K; Pearman-Gillman, Schuyler; Thompson, Rachel L; Gregg, Jacob L; Hart, Lucas M; Winton, James R; Emmenegger, Eveline J; Hershberger, Paul K
2016-07-01
Viral erythrocytic necrosis (VEN) is a disease of marine and anadromous fish that is caused by the erythrocytic necrosis virus (ENV), which was recently identified as a novel member of family Iridoviridae by next-generation sequencing. Phylogenetic analysis of the ENV DNA polymerase grouped ENV with other erythrocytic iridoviruses from snakes and lizards. In the present study, we identified the gene encoding the ENV major capsid protein (MCP) and developed a quantitative real-time PCR (qPCR) assay targeting this gene. Phylogenetic analysis of the MCP gene sequence supported the conclusion that ENV does not group with any of the currently described iridovirus genera. Because there is no information regarding genetic variation of the MCP gene across the reported host and geographic range for ENV, we also developed a second qPCR assay for a more conserved ATPase-like gene region. The MCP and ATPase qPCR assays demonstrated good analytical and diagnostic sensitivity and specificity based on samples from laboratory challenges of Pacific herring Clupea pallasii The qPCR assays had similar diagnostic sensitivity and specificity as light microscopy of stained blood smears for the presence of intraerythrocytic inclusion bodies. However, the qPCR assays may detect viral DNA early in infection prior to the formation of inclusion bodies. Both qPCR assays appear suitable for viral surveillance or as a confirmatory test for ENV in Pacific herring from the Salish Sea. © 2016 The Author(s).
Purcell, Maureen K.; Pearman-Gillman, Schuyler; Thompson, Rachel L.; Gregg, Jacob L.; Hart, Lucas M.; Winton, James R.; Emmenegger, Eveline J.; Hershberger, Paul K.
2016-01-01
Viral erythrocytic necrosis (VEN) is a disease of marine and anadromous fish that is caused by the erythrocytic necrosis virus (ENV), which was recently identified as a novel member of family Iridoviridae by next-generation sequencing. Phylogenetic analysis of the ENV DNA polymerase grouped ENV with other erythrocytic iridoviruses from snakes and lizards. In the present study, we identified the gene encoding the ENV major capsid protein (MCP) and developed a quantitative real-time PCR (qPCR) assay targeting this gene. Phylogenetic analysis of the MCP gene sequence supported the conclusion that ENV does not group with any of the currently described iridovirus genera. Because there is no information regarding genetic variation of the MCP gene across the reported host and geographic range for ENV, we also developed a second qPCR assay for a more conserved ATPase-like gene region. The MCP and ATPase qPCR assays demonstrated good analytical and diagnostic sensitivity and specificity based on samples from laboratory challenges of Pacific herring Clupea pallasii. The qPCR assays had similar diagnostic sensitivity and specificity as light microscopy of stained blood smears for the presence of intraerythrocytic inclusion bodies. However, the qPCR assays may detect viral DNA early in infection prior to the formation of inclusion bodies. Both qPCR assays appear suitable for viral surveillance or as a confirmatory test for ENV in Pacific herring from the Salish Sea.
Miyashita, Shuhei; Ishibashi, Kazuhiro; Kishino, Hirohisa; Ishikawa, Masayuki
2015-01-01
Recent studies on evolutionarily distant viral groups have shown that the number of viral genomes that establish cell infection after cell-to-cell transmission is unexpectedly small (1–20 genomes). This aspect of viral infection appears to be important for the adaptation and survival of viruses. To clarify how the number of viral genomes that establish cell infection is determined, we developed a simulation model of cell infection for tomato mosaic virus (ToMV), a positive-strand RNA virus. The model showed that stochastic processes that govern the replication or degradation of individual genomes result in the infection by a small number of genomes, while a large number of infectious genomes are introduced in the cell. It also predicted two interesting characteristics regarding cell infection patterns: stochastic variation among cells in the number of viral genomes that establish infection and stochastic inequality in the accumulation of their progenies in each cell. Both characteristics were validated experimentally by inoculating tobacco cells with a library of nucleotide sequence–tagged ToMV and analyzing the viral genomes that accumulated in each cell using a high-throughput sequencer. An additional simulation model revealed that these two characteristics enhance selection during tissue infection. The cell infection model also predicted a mechanism that enhances selection at the cellular level: a small difference in the replication abilities of coinfected variants results in a large difference in individual accumulation via the multiple-round formation of the replication complex (i.e., the replication machinery). Importantly, this predicted effect was observed in vivo. The cell infection model was robust to changes in the parameter values, suggesting that other viruses could adopt similar adaptation mechanisms. Taken together, these data reveal a comprehensive picture of viral infection processes including replication, cell-to-cell transmission, and evolution, which are based on the stochastic behavior of the viral genome molecules in each cell. PMID:25781391
Sarmady, Mahdi; Dampier, William; Tozeren, Aydin
2011-01-01
Virus proteins alter protein pathways of the host toward the synthesis of viral particles by breaking and making edges via binding to host proteins. In this study, we developed a computational approach to predict viral sequence hotspots for binding to host proteins based on sequences of viral and host proteins and literature-curated virus-host protein interactome data. We use a motif discovery algorithm repeatedly on collections of sequences of viral proteins and immediate binding partners of their host targets and choose only those motifs that are conserved on viral sequences and highly statistically enriched among binding partners of virus protein targeted host proteins. Our results match experimental data on binding sites of Nef to host proteins such as MAPK1, VAV1, LCK, HCK, HLA-A, CD4, FYN, and GNB2L1 with high statistical significance but is a poor predictor of Nef binding sites on highly flexible, hoop-like regions. Predicted hotspots recapture CD8 cell epitopes of HIV Nef highlighting their importance in modulating virus-host interactions. Host proteins potentially targeted or outcompeted by Nef appear crowding the T cell receptor, natural killer cell mediated cytotoxicity, and neurotrophin signaling pathways. Scanning of HIV Nef motifs on multiple alignments of hepatitis C protein NS5A produces results consistent with literature, indicating the potential value of the hotspot discovery in advancing our understanding of virus-host crosstalk. PMID:21738584
Defining the roles for Vpr in HIV-1-associated neuropathogenesis
James, Tony; Nonnemacher, Michael R.; Wigdahl, Brian; Krebs, Fred C.
2016-01-01
It is increasingly evident that the human immunodeficiency virus type 1 (HIV-1) viral protein R (Vpr) has a unique role in neuropathogenesis. Its ability to induce G2/M arrest coupled with its capacity to increase viral gene transcription gives it a unique role in sustaining viral replication and aiding in the establishment and maintenance of a systemic infection. The requirement of Vpr for HIV-1 infection and replication in cells of monocytic origin (a key lineage of cells involved in HIV-1 neuroinvasion) suggests an important role in establishing and sustaining infection in the central nervous system (CNS). Contributions of Vpr to neuropathogenesis can be expanded further through (i) naturally occurring HIV-1 sequence variation that results in functionally divergent Vpr variants; (ii) the dual activities of Vpr as a intracellular protein delivered and expressed during HIV-1 infection and as an extracellular protein that can act on neighboring, uninfected cells; (iii) cell type-dependent consequences of Vpr expression and exposure, including cell cycle arrest, metabolic dysregulation, and cytotoxicity; and (iv) the effects of Vpr on exosome-based intercellular communication in the CNS. Revealing the effects of this pleiotropic viral protein is an essential part of a greater understanding of HIV-1-associated pathogenesis and potential approaches to treating and preventing disease caused by HIV-1 infection. PMID:27056720
DOE Office of Scientific and Technical Information (OSTI.GOV)
Aw, Tiong Gim; Howe, Adina; Rose, Joan B.
2014-12-01
Genomic-based molecular techniques are emerging as powerful tools that allow a comprehensive characterization of water and wastewater microbiomes. Most recently, next generation sequencing (NGS) technologies which produce large amounts of sequence data are beginning to impact the field of environmental virology. In this study, NGS and bioinformatics have been employed for the direct detection and characterization of viruses in wastewater and of viruses isolated after cell culture. Viral particles were concentrated and purified from sewage samples by polyethylene glycol precipitation. Viral nucleic acid was extracted and randomly amplified prior to sequencing using Illumina technology, yielding a total of 18 millionmore » sequence reads. Most of the viral sequences detected could not be characterized, indicating the great viral diversity that is yet to be discovered. This sewage virome was dominated by bacteriophages and contained sequences related to known human pathogenic viruses such as adenoviruses (species B, C and F), polyomaviruses JC and BK and enteroviruses (type B). An array of other animal viruses was also found, suggesting unknown zoonotic viruses. This study demonstrated the feasibility of metagenomic approaches to characterize viruses in complex environmental water samples.« less
Cas9 specifies functional viral targets during CRISPR-Cas adaptation.
Heler, Robert; Samai, Poulami; Modell, Joshua W; Weiner, Catherine; Goldberg, Gregory W; Bikard, David; Marraffini, Luciano A
2015-03-12
Clustered regularly interspaced short palindromic repeat (CRISPR) loci and their associated (Cas) proteins provide adaptive immunity against viral infection in prokaryotes. Upon infection, short phage sequences known as spacers integrate between CRISPR repeats and are transcribed into small RNA molecules that guide the Cas9 nuclease to the viral targets (protospacers). Streptococcus pyogenes Cas9 cleavage of the viral genome requires the presence of a 5'-NGG-3' protospacer adjacent motif (PAM) sequence immediately downstream of the viral target. It is not known whether and how viral sequences flanked by the correct PAM are chosen as new spacers. Here we show that Cas9 selects functional spacers by recognizing their PAM during spacer acquisition. The replacement of cas9 with alleles that lack the PAM recognition motif or recognize an NGGNG PAM eliminated or changed PAM specificity during spacer acquisition, respectively. Cas9 associates with other proteins of the acquisition machinery (Cas1, Cas2 and Csn2), presumably to provide PAM-specificity to this process. These results establish a new function for Cas9 in the genesis of prokaryotic immunological memory.
Dorsch-Häsler, Karoline; Fisher, Paul B.; Weinstein, I. Bernard; Ginsberg, Harold S.
1980-01-01
The integration pattern of viral DNA was studied in a number of cell lines transformed by wild-type adenovirus type 5 (Ad5 WT) and two mutants of the DNA-binding protein gene, H5ts125 and H5ts107. The effect of chemical carcinogens on the integration of viral DNA was also investigated. Liquid hybridization (C0t) analyses showed that rat embryo cells transformed by Ad5 WT usually contained only the left-hand end of the viral genome, whereas cell lines transformed by H5ts125 or H5ts107 at either the semipermissive (36°C) or nonpermissive (39.5°C) temperature often contained one to five copies of all or most of the entire adenovirus genome. The arrangement of the integrated adenovirus DNA sequences was determined by cleavage of transformed cell DNA with restriction endonucleases XbaI, EcoRI, or HindIII followed by transfer of separated fragments to nitrocellulose paper and hybridization according to the technique of E. M. Southern (J. Mol. Biol. 98: 503-517, 1975). It was found that the adenovirus genome is integrated as a linear sequence covalently linked to host cell DNA; that the viral DNA is integrated into different host DNA sequences in each cell line studied; that in cell lines that contain multiple copies of the Ad5 genome the viral DNA sequences can be integrated in a single set of host cell DNA sequences and not as concatemers; and that chemical carcinogens do not alter the extent or pattern of viral DNA integration. Images PMID:6246266
de Andrade, Roberto R S; Vaslin, Maite F S
2014-03-07
Next-generation parallel sequencing (NGS) allows the identification of viral pathogens by sequencing the small RNAs of infected hosts. Thus, viral genomes may be assembled from host immune response products without prior virus enrichment, amplification or purification. However, mapping of the vast information obtained presents a bioinformatics challenge. In order to by pass the need of line command and basic bioinformatics knowledge, we develop a mapping software with a graphical interface to the assemblage of viral genomes from small RNA dataset obtained by NGS. SearchSmallRNA was developed in JAVA language version 7 using NetBeans IDE 7.1 software. The program also allows the analysis of the viral small interfering RNAs (vsRNAs) profile; providing an overview of the size distribution and other features of the vsRNAs produced in infected cells. The program performs comparisons between each read sequenced present in a library and a chosen reference genome. Reads showing Hamming distances smaller or equal to an allowed mismatched will be selected as positives and used to the assemblage of a long nucleotide genome sequence. In order to validate the software, distinct analysis using NGS dataset obtained from HIV and two plant viruses were used to reconstruct viral whole genomes. SearchSmallRNA program was able to reconstructed viral genomes using NGS of small RNA dataset with high degree of reliability so it will be a valuable tool for viruses sequencing and discovery. It is accessible and free to all research communities and has the advantage to have an easy-to-use graphical interface. SearchSmallRNA was written in Java and is freely available at http://www.microbiologia.ufrj.br/ssrna/.
2014-01-01
Background Next-generation parallel sequencing (NGS) allows the identification of viral pathogens by sequencing the small RNAs of infected hosts. Thus, viral genomes may be assembled from host immune response products without prior virus enrichment, amplification or purification. However, mapping of the vast information obtained presents a bioinformatics challenge. Methods In order to by pass the need of line command and basic bioinformatics knowledge, we develop a mapping software with a graphical interface to the assemblage of viral genomes from small RNA dataset obtained by NGS. SearchSmallRNA was developed in JAVA language version 7 using NetBeans IDE 7.1 software. The program also allows the analysis of the viral small interfering RNAs (vsRNAs) profile; providing an overview of the size distribution and other features of the vsRNAs produced in infected cells. Results The program performs comparisons between each read sequenced present in a library and a chosen reference genome. Reads showing Hamming distances smaller or equal to an allowed mismatched will be selected as positives and used to the assemblage of a long nucleotide genome sequence. In order to validate the software, distinct analysis using NGS dataset obtained from HIV and two plant viruses were used to reconstruct viral whole genomes. Conclusions SearchSmallRNA program was able to reconstructed viral genomes using NGS of small RNA dataset with high degree of reliability so it will be a valuable tool for viruses sequencing and discovery. It is accessible and free to all research communities and has the advantage to have an easy-to-use graphical interface. Availability and implementation SearchSmallRNA was written in Java and is freely available at http://www.microbiologia.ufrj.br/ssrna/. PMID:24607237
Quick, Joshua; Grubaugh, Nathan D; Pullan, Steven T; Claro, Ingra M; Smith, Andrew D; Gangavarapu, Karthik; Oliveira, Glenn; Robles-Sikisaka, Refugio; Rogers, Thomas F; Beutler, Nathan A; Burton, Dennis R; Lewis-Ximenez, Lia Laura; de Jesus, Jaqueline Goes; Giovanetti, Marta; Hill, Sarah C; Black, Allison; Bedford, Trevor; Carroll, Miles W; Nunes, Marcio; Alcantara, Luiz Carlos; Sabino, Ester C; Baylis, Sally A; Faria, Nuno R; Loose, Matthew; Simpson, Jared T; Pybus, Oliver G; Andersen, Kristian G; Loman, Nicholas J
2017-06-01
Genome sequencing has become a powerful tool for studying emerging infectious diseases; however, genome sequencing directly from clinical samples (i.e., without isolation and culture) remains challenging for viruses such as Zika, for which metagenomic sequencing methods may generate insufficient numbers of viral reads. Here we present a protocol for generating coding-sequence-complete genomes, comprising an online primer design tool, a novel multiplex PCR enrichment protocol, optimized library preparation methods for the portable MinION sequencer (Oxford Nanopore Technologies) and the Illumina range of instruments, and a bioinformatics pipeline for generating consensus sequences. The MinION protocol does not require an Internet connection for analysis, making it suitable for field applications with limited connectivity. Our method relies on multiplex PCR for targeted enrichment of viral genomes from samples containing as few as 50 genome copies per reaction. Viral consensus sequences can be achieved in 1-2 d by starting with clinical samples and following a simple laboratory workflow. This method has been successfully used by several groups studying Zika virus evolution and is facilitating an understanding of the spread of the virus in the Americas. The protocol can be used to sequence other viral genomes using the online Primal Scheme primer designer software. It is suitable for sequencing either RNA or DNA viruses in the field during outbreaks or as an inexpensive, convenient method for use in the lab.
Craigo, Jodi K.; Montelaro, Ronald C.
2013-01-01
Equine infectious anemia (EIA), identified in 1843 [1] as an infectious disease of horses and as a viral infection in 1904, remains a concern in veterinary medicine today. Equine infectious anemia virus (EIAV) has served as an animal model of HIV-1/AIDS research since the original identification of HIV. Similar to other lentiviruses, EIAV has a high propensity for genomic sequence and antigenic variation, principally in its envelope (Env) proteins. However, EIAV possesses a unique and dynamic disease presentation that has facilitated comprehensive analyses of the interactions between the evolving virus population, progressive host immune responses, and the definition of viral and host correlates of immune control and vaccine efficacy. Summarized here are key findings in EIAV that have provided important lessons toward understanding long term immune control of lentivirus infections and the parameters for development of an enduring broadly protective AIDS vaccine. PMID:24316675
A Pan-HIV Strategy for Complete Genome Sequencing
Yamaguchi, Julie; Alessandri-Gradt, Elodie; Tell, Robert W.; Brennan, Catherine A.
2015-01-01
Molecular surveillance is essential to monitor HIV diversity and track emerging strains. We have developed a universal library preparation method (HIV-SMART [i.e., switching mechanism at 5′ end of RNA transcript]) for next-generation sequencing that harnesses the specificity of HIV-directed priming to enable full genome characterization of all HIV-1 groups (M, N, O, and P) and HIV-2. Broad application of the HIV-SMART approach was demonstrated using a panel of diverse cell-cultured virus isolates. HIV-1 non-subtype B-infected clinical specimens from Cameroon were then used to optimize the protocol to sequence directly from plasma. When multiplexing 8 or more libraries per MiSeq run, full genome coverage at a median ∼2,000× depth was routinely obtained for either sample type. The method reproducibly generated the same consensus sequence, consistently identified viral sequence heterogeneity present in specimens, and at viral loads of ≤4.5 log copies/ml yielded sufficient coverage to permit strain classification. HIV-SMART provides an unparalleled opportunity to identify diverse HIV strains in patient specimens and to determine phylogenetic classification based on the entire viral genome. Easily adapted to sequence any RNA virus, this technology illustrates the utility of next-generation sequencing (NGS) for viral characterization and surveillance. PMID:26699702
Rapid Sequencing of Complete env Genes from Primary HIV-1 Samples.
Laird Smith, Melissa; Murrell, Ben; Eren, Kemal; Ignacio, Caroline; Landais, Elise; Weaver, Steven; Phung, Pham; Ludka, Colleen; Hepler, Lance; Caballero, Gemma; Pollner, Tristan; Guo, Yan; Richman, Douglas; Poignard, Pascal; Paxinos, Ellen E; Kosakovsky Pond, Sergei L; Smith, Davey M
2016-07-01
The ability to study rapidly evolving viral populations has been constrained by the read length of next-generation sequencing approaches and the sampling depth of single-genome amplification methods. Here, we develop and characterize a method using Pacific Biosciences' Single Molecule, Real-Time (SMRT®) sequencing technology to sequence multiple, intact full-length human immunodeficiency virus-1 env genes amplified from viral RNA populations circulating in blood, and provide computational tools for analyzing and visualizing these data.
Liu, Yang; Chiaromonte, Francesca; Ross, Howard; Malhotra, Raunaq; Elleder, Daniel; Poss, Mary
2015-06-30
Infection with feline immunodeficiency virus (FIV) causes an immunosuppressive disease whose consequences are less severe if cats are co-infected with an attenuated FIV strain (PLV). We use virus diversity measurements, which reflect replication ability and the virus response to various conditions, to test whether diversity of virulent FIV in lymphoid tissues is altered in the presence of PLV. Our data consisted of the 3' half of the FIV genome from three tissues of animals infected with FIV alone, or with FIV and PLV, sequenced by 454 technology. Since rare variants dominate virus populations, we had to carefully distinguish sequence variation from errors due to experimental protocols and sequencing. We considered an exponential-normal convolution model used for background correction of microarray data, and modified it to formulate an error correction approach for minor allele frequencies derived from high-throughput sequencing. Similar to accounting for over-dispersion in counts, this accounts for error-inflated variability in frequencies - and quite effectively reproduces empirically observed distributions. After obtaining error-corrected minor allele frequencies, we applied ANalysis Of VAriance (ANOVA) based on a linear mixed model and found that conserved sites and transition frequencies in FIV genes differ among tissues of dual and single infected cats. Furthermore, analysis of minor allele frequencies at individual FIV genome sites revealed 242 sites significantly affected by infection status (dual vs. single) or infection status by tissue interaction. All together, our results demonstrated a decrease in FIV diversity in bone marrow in the presence of PLV. Importantly, these effects were weakened or undetectable when error correction was performed with other approaches (thresholding of minor allele frequencies; probabilistic clustering of reads). We also queried the data for cytidine deaminase activity on the viral genome, which causes an asymmetric increase in G to A substitutions, but found no evidence for this host defense strategy. Our error correction approach for minor allele frequencies (more sensitive and computationally efficient than other algorithms) and our statistical treatment of variation (ANOVA) were critical for effective use of high-throughput sequencing data in understanding viral diversity. We found that co-infection with PLV shifts FIV diversity from bone marrow to lymph node and spleen.
Neill, John D; Newcomer, Benjamin W; Marley, Shonda D; Ridpath, Julia F; Givens, M Daniel
2012-08-06
Bovine viral diarrhea virus (BVDV) strains circulating in livestock herds show significant sequence variation. Conventional wisdom states that most sequence variation arises during acute infections in response to immune or other environmental pressures. A recent study showed that more nucleotide changes were introduced into the BVDV genomic RNA during the establishment of a single fetal persistent infection than following a series of acute infections of naïve cattle. However, it was not known if nucleotide changes were introduce when the virus crossed the placenta and infected the fetus or during the acute infection of the dam. The sequence of the open reading frame (ORF) from viruses isolated from four acutely infected pregnant heifers following exposure to persistently infected (PI) calves was compared to the sequences of the virus from the progenitor PI calf and the virus from the resulting progeny PI calf to determine when genetic change was introduced. This was compared to genetic change found in viruses isolated from a pregnant PI cow and its PI calf, and in three viruses isolated from acutely infected, non-pregnant cattle exposed to PI calves. Most genetic changes previously identified between the progenitor and progeny PI viruses were in place in the acute phase viruses isolated from the dams six days post-exposure to the progenitor PI calf. Additionally, each progeny PI virus had two to three unique nucleotide substitutions that were introduced in crossing the placenta and infection of the fetus. The nucleotide sequence of two acute phase viruses isolated from steers exposed to PI calves revealed that six and seven nucleotide changes were introduced during the acute infection. The sequence of the BVDV-2 virus isolated from an acute infection of a PI calf (BVDV-1a) co-housed with a BVDV-2 PI calf had ten nucleotides that were different from the progenitor PI virus. Finally, twenty nucleotide changes were identified in the PI virus of a calf born to a PI dam. These results demonstrate that nucleotide changes are introduced into the BVDV infecting pregnant cattle at rates of 2.3 to 8 fold higher then during the acute infection of non-pregnant animals.
Goya, Stephanie; Valinotto, Laura E; Tittarelli, Estefania; Rojo, Gabriel L; Nabaes Jodar, Mercedes S; Greninger, Alexander L; Zaiat, Jonathan J; Marti, Marcelo A; Mistchenko, Alicia S; Viegas, Mariana
2018-01-01
Over the last decade, the number of viral genome sequences deposited in available databases has grown exponentially. However, sequencing methodology vary widely and many published works have relied on viral enrichment by viral culture or nucleic acid amplification with specific primers rather than through unbiased techniques such as metagenomics. The genome of RNA viruses is highly variable and these enrichment methodologies may be difficult to achieve or may bias the results. In order to obtain genomic sequences of human respiratory syncytial virus (HRSV) from positive nasopharyngeal aspirates diverse methodologies were evaluated and compared. A total of 29 nearly complete and complete viral genomes were obtained. The best performance was achieved with a DNase I treatment to the RNA directly extracted from the nasopharyngeal aspirate (NPA), sequence-independent single-primer amplification (SISPA) and library preparation performed with Nextera XT DNA Library Prep Kit with manual normalization. An average of 633,789 and 1,674,845 filtered reads per library were obtained with MiSeq and NextSeq 500 platforms, respectively. The higher output of NextSeq 500 was accompanied by the increasing of duplicated reads percentage generated during SISPA (from an average of 1.5% duplicated viral reads in MiSeq to an average of 74% in NextSeq 500). HRSV genome recovery was not affected by the presence or absence of duplicated reads but the computational demand during the analysis was increased. Considering that only samples with viral load ≥ E+06 copies/ml NPA were tested, no correlation between sample viral loads and number of total filtered reads was observed, nor with the mapped viral reads. The HRSV genomes showed a mean coverage of 98.46% with the best methodology. In addition, genomes of human metapneumovirus (HMPV), human rhinovirus (HRV) and human parainfluenza virus types 1-3 (HPIV1-3) were also obtained with the selected optimal methodology.
Marine, Rachel L; Nasko, Daniel J; Wray, Jeffrey; Polson, Shawn W; Wommack, K Eric
2017-01-01
Chaperonins are protein-folding machinery found in all cellular life. Chaperonin genes have been documented within a few viruses, yet, surprisingly, analysis of metagenome sequence data indicated that chaperonin-carrying viruses are common and geographically widespread in marine ecosystems. Also unexpected was the discovery of viral chaperonin sequences related to thermosome proteins of archaea, indicating the presence of virioplankton populations infecting marine archaeal hosts. Virioplankton large subunit chaperonin sequences (GroELs) were divergent from bacterial sequences, indicating that viruses have carried this gene over long evolutionary time. Analysis of viral metagenome contigs indicated that: the order of large and small subunit genes was linked to the phylogeny of GroEL; both lytic and temperate phages may carry group I chaperonin genes; and viruses carrying a GroEL gene likely have large double-stranded DNA (dsDNA) genomes (>70 kb). Given these connections, it is likely that chaperonins are critical to the biology and ecology of virioplankton populations that carry these genes. Moreover, these discoveries raise the intriguing possibility that viral chaperonins may more broadly alter the structure and function of viral and cellular proteins in infected host cells. PMID:28731469
Marine, Rachel L; Nasko, Daniel J; Wray, Jeffrey; Polson, Shawn W; Wommack, K Eric
2017-11-01
Chaperonins are protein-folding machinery found in all cellular life. Chaperonin genes have been documented within a few viruses, yet, surprisingly, analysis of metagenome sequence data indicated that chaperonin-carrying viruses are common and geographically widespread in marine ecosystems. Also unexpected was the discovery of viral chaperonin sequences related to thermosome proteins of archaea, indicating the presence of virioplankton populations infecting marine archaeal hosts. Virioplankton large subunit chaperonin sequences (GroELs) were divergent from bacterial sequences, indicating that viruses have carried this gene over long evolutionary time. Analysis of viral metagenome contigs indicated that: the order of large and small subunit genes was linked to the phylogeny of GroEL; both lytic and temperate phages may carry group I chaperonin genes; and viruses carrying a GroEL gene likely have large double-stranded DNA (dsDNA) genomes (>70 kb). Given these connections, it is likely that chaperonins are critical to the biology and ecology of virioplankton populations that carry these genes. Moreover, these discoveries raise the intriguing possibility that viral chaperonins may more broadly alter the structure and function of viral and cellular proteins in infected host cells.
Lorenzetti, Mario Alejandro; Gutiérrez, Marina Inés; Altcheh, Jaime; Moscatelli, Guillermo; Moroni, Samanta; Chabay, Paola Andrea; Preciado, María Victoria
2009-11-01
Epstein-Barr virus genotypes can be distinguished by polymorphic variations in the genes encoding EBNA2, 3A, 3B, and 3C. The immediate early gene BZLF1 plays a key role in modulating the switch from latency to lytic replication and therefore enabling viral propagation. The aim of this study was to investigate and compare BZLF1 promoter sequence (Zp) variation in pediatric infectious mononucleosis (IM) and in pediatric EBV positive lymphoma biopsies. Zp was sequenced from peripheral blood mononuclear cells (PBMC) and throat swabs from 10 patients with IM at the time of diagnosis (D0) and during convalescence; and from 13 lymphoma biopsies. Zp - P and Zp - V3 variants were found in eight and one IM patients, as well as in five and six tumor biopsies, respectively. A correlation between viral genotype and Zp variant was found significant for Zp - V3 and EBV2 (P = 0.0002). One IM patient harbored two concomitant Zp variants. Regardless of anatomical compartment or stage of disease all IM patients displayed the same Zp variant along the course of the study. No new infections or adaptative selection of different variants was evidenced. A new Zp variant (Zp - V3 + 49) was described in two Hodgkin lymphomas, but not in IM. This is the first study to describe Zp variants compartmentalization in children with acute EBV infection and convalescence in a developing country; and comparing them with Zp variants in pediatric lymphomas from the same geographic area.
Hartard, C.; Rivet, R.; Banas, S.
2015-01-01
F-specific RNA bacteriophages (FRNAPH) have been widely studied as tools for evaluating fecal or viral pollution in water. It has also been proposed that they can be used to differentiate human from animal fecal contamination. While FRNAPH subgroup I (FRNAPH-I) and FRNAPH-IV are often associated with animal pollution, FRNAPH-II and -III prevail in human wastewater. However, this distribution is not absolute, and variable survival rates in these subgroups lead to misinterpretation of the original distribution. In this context, we studied FRNAPH distribution in urban wastewater and animal feces/wastewater. To increase the specificity, we partially sequenced the genomes of phages of urban and animal origins. The persistence of the genomes and infectivity were also studied, over time in wastewater and during treatment, for each subgroup. FRNAPH-I genome sequences did not show any specific urban or animal clusters to allow development of molecular tools for differentiation. They were the most resistant and as such may be used as fecal or viral indicators. FRNAPH-II's low prevalence and low sequence variability in animal stools, combined with specific clusters formed by urban strains, allowed differentiation between urban and animal pollution by using a specific reverse transcription-PCR (RT-PCR) method. The subgroup's resistance over time was comparable to that of FRNAPH-I, but its surface properties allowed higher elimination rates during activated-sludge treatment. FRNAPH-III's low sequence variability in animal wastewater and specific cluster formation by urban strains also allowed differentiation by using a specific RT-PCR method. Nevertheless, its low resistance restricted it to being used only for recent urban pollution detection. FRNAPH-IV was too rare to be used. PMID:26162878
Hartard, C; Rivet, R; Banas, S; Gantzer, C
2015-09-01
F-specific RNA bacteriophages (FRNAPH) have been widely studied as tools for evaluating fecal or viral pollution in water. It has also been proposed that they can be used to differentiate human from animal fecal contamination. While FRNAPH subgroup I (FRNAPH-I) and FRNAPH-IV are often associated with animal pollution, FRNAPH-II and -III prevail in human wastewater. However, this distribution is not absolute, and variable survival rates in these subgroups lead to misinterpretation of the original distribution. In this context, we studied FRNAPH distribution in urban wastewater and animal feces/wastewater. To increase the specificity, we partially sequenced the genomes of phages of urban and animal origins. The persistence of the genomes and infectivity were also studied, over time in wastewater and during treatment, for each subgroup. FRNAPH-I genome sequences did not show any specific urban or animal clusters to allow development of molecular tools for differentiation. They were the most resistant and as such may be used as fecal or viral indicators. FRNAPH-II's low prevalence and low sequence variability in animal stools, combined with specific clusters formed by urban strains, allowed differentiation between urban and animal pollution by using a specific reverse transcription-PCR (RT-PCR) method. The subgroup's resistance over time was comparable to that of FRNAPH-I, but its surface properties allowed higher elimination rates during activated-sludge treatment. FRNAPH-III's low sequence variability in animal wastewater and specific cluster formation by urban strains also allowed differentiation by using a specific RT-PCR method. Nevertheless, its low resistance restricted it to being used only for recent urban pollution detection. FRNAPH-IV was too rare to be used. Copyright © 2015, American Society for Microbiology. All Rights Reserved.
A comprehensive and quantitative exploration of thousands of viral genomes
Mahmoudabadi, Gita
2018-01-01
The complete assembly of viral genomes from metagenomic datasets (short genomic sequences gathered from environmental samples) has proven to be challenging, so there are significant blind spots when we view viral genomes through the lens of metagenomics. One approach to overcoming this problem is to leverage the thousands of complete viral genomes that are publicly available. Here we describe our efforts to assemble a comprehensive resource that provides a quantitative snapshot of viral genomic trends – such as gene density, noncoding percentage, and abundances of functional gene categories – across thousands of viral genomes. We have also developed a coarse-grained method for visualizing viral genome organization for hundreds of genomes at once, and have explored the extent of the overlap between bacterial and bacteriophage gene pools. Existing viral classification systems were developed prior to the sequencing era, so we present our analysis in a way that allows us to assess the utility of the different classification systems for capturing genomic trends. PMID:29624169
A comprehensive and quantitative exploration of thousands of viral genomes.
Mahmoudabadi, Gita; Phillips, Rob
2018-04-19
The complete assembly of viral genomes from metagenomic datasets (short genomic sequences gathered from environmental samples) has proven to be challenging, so there are significant blind spots when we view viral genomes through the lens of metagenomics. One approach to overcoming this problem is to leverage the thousands of complete viral genomes that are publicly available. Here we describe our efforts to assemble a comprehensive resource that provides a quantitative snapshot of viral genomic trends - such as gene density, noncoding percentage, and abundances of functional gene categories - across thousands of viral genomes. We have also developed a coarse-grained method for visualizing viral genome organization for hundreds of genomes at once, and have explored the extent of the overlap between bacterial and bacteriophage gene pools. Existing viral classification systems were developed prior to the sequencing era, so we present our analysis in a way that allows us to assess the utility of the different classification systems for capturing genomic trends. © 2018, Mahmoudabadi et al.
Li, Hui; Stoddard, Mark B; Wang, Shuyi; Giorgi, Elena E; Blair, Lily M; Learn, Gerald H; Hahn, Beatrice H; Alter, Harvey J; Busch, Michael P; Fierer, Daniel S; Ribeiro, Ruy M; Perelson, Alan S; Bhattacharya, Tanmoy; Shaw, George M
2016-01-01
Despite the recent development of highly effective anti-hepatitis C virus (HCV) drugs, the global burden of this pathogen remains immense. Control or eradication of HCV will likely require the broad application of antiviral drugs and development of an effective vaccine. A precise molecular identification of transmitted/founder (T/F) HCV genomes that lead to productive clinical infection could play a critical role in vaccine research, as it has for HIV-1. However, the replication schema of these two RNA viruses differ substantially, as do viral responses to innate and adaptive host defenses. These differences raise questions as to the certainty of T/F HCV genome inferences, particularly in cases where multiple closely related sequence lineages have been observed. To clarify these issues and distinguish between competing models of early HCV diversification, we examined seven cases of acute HCV infection in humans and chimpanzees, including three examples of virus transmission between linked donors and recipients. Using single-genome sequencing (SGS) of plasma vRNA, we found that inferred T/F sequences in recipients were identical to viral sequences in their respective donors. Early in infection, HCV genomes generally evolved according to a simple model of random evolution where the coalescent corresponded to the T/F sequence. Closely related sequence lineages could be explained by high multiplicity infection from a donor whose viral sequences had undergone a pretransmission bottleneck due to treatment, immune selection, or recent infection. These findings validate SGS, together with mathematical modeling and phylogenetic analysis, as a novel strategy to infer T/F HCV genome sequences. Despite the recent development of highly effective, interferon-sparing anti-hepatitis C virus (HCV) drugs, the global burden of this pathogen remains immense. Control or eradication of HCV will likely require the broad application of antiviral drugs and the development of an effective vaccine, which could be facilitated by a precise molecular identification of transmitted/founder (T/F) viral genomes and their progeny. We used single-genome sequencing to show that inferred HCV T/F sequences in recipients were identical to viral sequences in their respective donors and that viral genomes generally evolved early in infection according to a simple model of random sequence evolution. Altogether, the findings validate T/F genome inferences and illustrate how T/F sequence identification can illuminate studies of HCV transmission, immunopathogenesis, drug resistance development, and vaccine protection, including sieving effects on breakthrough virus strains. Copyright © 2015 Li et al.
A gyrovirus infecting a sea bird
Li, Linlin; Pesavento, Patricia A.; Gaynor, Anne M.; Duerr, Rebecca S.; Phan, Tung Gia; Zhang, Wen; Deng, Xutao
2015-01-01
We characterized the genome of a highly divergent gyrovirus (GyV8) in the spleen and uropygial gland tissues of a diseased northern fulmar (Fulmarus glacialis), a pelagic bird beached in San Francisco, California. No other exogenous viral sequences could be identified using viral metagenomics. The small circular DNA genome shared no significant nucleotide sequence identity, and only 38–42 % amino acid sequence identity in VP1, with any of the previously identified gyroviruses. GyV8 is the first member of the third major phylogenetic clade of this viral genus and the first gyrovirus detected in an avian species other than chicken. PMID:26036564
Koppstein, David; Ashour, Joseph; Bartel, David P.
2015-01-01
The influenza polymerase cleaves host RNAs ∼10–13 nucleotides downstream of their 5′ ends and uses this capped fragment to prime viral mRNA synthesis. To better understand this process of cap snatching, we used high-throughput sequencing to determine the 5′ ends of A/WSN/33 (H1N1) influenza mRNAs. The sequences provided clear evidence for nascent-chain realignment during transcription initiation and revealed a strong influence of the viral template on the frequency of realignment. After accounting for the extra nucleotides inserted through realignment, analysis of the capped fragments indicated that the different viral mRNAs were each prepended with a common set of sequences and that the polymerase often cleaved host RNAs after a purine and often primed transcription on a single base pair to either the terminal or penultimate residue of the viral template. We also developed a bioinformatic approach to identify the targeted host transcripts despite limited information content within snatched fragments and found that small nuclear RNAs and small nucleolar RNAs contributed the most abundant capped leaders. These results provide insight into the mechanism of viral transcription initiation and reveal the diversity of the cap-snatched repertoire, showing that noncoding transcripts as well as mRNAs are used to make influenza mRNAs. PMID:25901029
Host-Associated Metagenomics: A Guide to Generating Infectious RNA Viromes
Robert, Catherine; Pascalis, Hervé; Michelle, Caroline; Jardot, Priscilla; Charrel, Rémi; Raoult, Didier; Desnues, Christelle
2015-01-01
Background Metagenomic analyses have been widely used in the last decade to describe viral communities in various environments or to identify the etiology of human, animal, and plant pathologies. Here, we present a simple and standardized protocol that allows for the purification and sequencing of RNA viromes from complex biological samples with an important reduction of host DNA and RNA contaminants, while preserving the infectivity of viral particles. Principal Findings We evaluated different viral purification steps, random reverse transcriptions and sequence-independent amplifications of a pool of representative RNA viruses. Viruses remained infectious after the purification process. We then validated the protocol by sequencing the RNA virome of human body lice engorged in vitro with artificially contaminated human blood. The full genomes of the most abundant viruses absorbed by the lice during the blood meal were successfully sequenced. Interestingly, random amplifications differed in the genome coverage of segmented RNA viruses. Moreover, the majority of reads were taxonomically identified, and only 7–15% of all reads were classified as “unknown”, depending on the random amplification method. Conclusion The protocol reported here could easily be applied to generate RNA viral metagenomes from complex biological samples of different origins. Our protocol allows further virological characterizations of the described viral communities because it preserves the infectivity of viral particles and allows for the isolation of viruses. PMID:26431175
Moser, Lindsey A.; Ramirez-Carvajal, Lisbeth; Puri, Vinita; Pauszek, Steven J.; Matthews, Krystal; Dilley, Kari A.; Mullan, Clancy; McGraw, Jennifer; Khayat, Michael; Beeri, Karen; Yee, Anthony; Dugan, Vivien; Heise, Mark T.; Frieman, Matthew B.; Rodriguez, Luis L.; Bernard, Kristen A.; Wentworth, David E.
2016-01-01
ABSTRACT Several biosafety level 3 and/or 4 (BSL-3/4) pathogens are high-consequence, single-stranded RNA viruses, and their genomes, when introduced into permissive cells, are infectious. Moreover, many of these viruses are select agents (SAs), and their genomes are also considered SAs. For this reason, cDNAs and/or their derivatives must be tested to ensure the absence of infectious virus and/or viral RNA before transfer out of the BSL-3/4 and/or SA laboratory. This tremendously limits the capacity to conduct viral genomic research, particularly the application of next-generation sequencing (NGS). Here, we present a sequence-independent method to rapidly amplify viral genomic RNA while simultaneously abolishing both viral and genomic RNA infectivity across multiple single-stranded positive-sense RNA (ssRNA+) virus families. The process generates barcoded DNA amplicons that range in length from 300 to 1,000 bp, which cannot be used to rescue a virus and are stable to transport at room temperature. Our barcoding approach allows for up to 288 barcoded samples to be pooled into a single library and run across various NGS platforms without potential reconstitution of the viral genome. Our data demonstrate that this approach provides full-length genomic sequence information not only from high-titer virion preparations but it can also recover specific viral sequence from samples with limited starting material in the background of cellular RNA, and it can be used to identify pathogens from unknown samples. In summary, we describe a rapid, universal standard operating procedure that generates high-quality NGS libraries free of infectious virus and infectious viral RNA. IMPORTANCE This report establishes and validates a standard operating procedure (SOP) for select agents (SAs) and other biosafety level 3 and/or 4 (BSL-3/4) RNA viruses to rapidly generate noninfectious, barcoded cDNA amenable for next-generation sequencing (NGS). This eliminates the burden of testing all processed samples derived from high-consequence pathogens prior to transfer from high-containment laboratories to lower-containment facilities for sequencing. Our established protocol can be scaled up for high-throughput sequencing of hundreds of samples simultaneously, which can dramatically reduce the cost and effort required for NGS library construction. NGS data from this SOP can provide complete genome coverage from viral stocks and can also detect virus-specific reads from limited starting material. Our data suggest that the procedure can be implemented and easily validated by institutional biosafety committees across research laboratories. PMID:27822536
Ebolavirus comparative genomics
Jun, Se-Ran; Leuze, Michael R.; Nookaew, Intawat; Uberbacher, Edward C.; Land, Miriam; Zhang, Qian; Wanchai, Visanu; Chai, Juanjuan; Nielsen, Morten; Trolle, Thomas; Lund, Ole; Buzard, Gregory S.; Pedersen, Thomas D.; Wassenaar, Trudy M.; Ussery, David W.
2015-01-01
The 2014 Ebola outbreak in West Africa is the largest documented for this virus. To examine the dynamics of this genome, we compare more than 100 currently available ebolavirus genomes to each other and to other viral genomes. Based on oligomer frequency analysis, the family Filoviridae forms a distinct group from all other sequenced viral genomes. All filovirus genomes sequenced to date encode proteins with similar functions and gene order, although there is considerable divergence in sequences between the three genera Ebolavirus, Cuevavirus and Marburgvirus within the family Filoviridae. Whereas all ebolavirus genomes are quite similar (multiple sequences of the same strain are often identical), variation is most common in the intergenic regions and within specific areas of the genes encoding the glycoprotein (GP), nucleoprotein (NP) and polymerase (L). We predict regions that could contain epitope-binding sites, which might be good vaccine targets. This information, combined with glycosylation sites and experimentally determined epitopes, can identify the most promising regions for the development of therapeutic strategies. This manuscript has been authored by UT-Battelle, LLC under Contract No. DE-AC05-00OR22725 with the U.S. Department of Energy. The United States Government retains and the publisher, by accepting the article for publication, acknowledges that the United States Government retains a non-exclusive, paid-up, irrevocable, world-wide license to publish or reproduce the published form of this manuscript, or allow others to do so, for United States Government purposes. The Department of Energy will provide public access to these results of federally sponsored research in accordance with the DOE Public Access Plan (http://energy.gov/downloads/doe-public-access-plan). PMID:26175035
Misencik, Michael J.; Grubaugh, Nathan D.; Andreadis, Theodore G.; Ebel, Gregory D.
2016-01-01
Abstract The genus Flavivirus includes a number of newly recognized viruses that infect and replicate only within mosquitoes. To determine whether insect-specific flaviviruses (ISFs) may infect Culiseta (Cs.) melanura mosquitoes, we screened pools of field-collected mosquitoes for virus infection by RT-PCR targeting conserved regions of the NS5 gene. NS5 nucleotide sequences amplified from Cs. melanura pools were genetically similar to other ISFs and most closely matched Calbertado virus from Culex tarsalis, sharing 68.7% nucleotide and 76.1% amino acid sequence identity. The complete genome of one virus isolate was sequenced to reveal a primary open reading frame (ORF) encoding a viral polyprotein characteristic of the genus Flavivirus. Phylogenetic analysis showed that this virus represents a distinct evolutionary lineage that belongs to the classical ISF group. The virus was detected solely in Cs. melanura pools, occurred in sampled populations from Connecticut, New York, New Hampshire, and Maine, and infected both adult and larval stages of the mosquito. Maximum likelihood estimate infection rates (MLE-IR) were relatively stable in overwintering Cs. melanura larvae collected monthly from November of 2012 through May of 2013 (MLE-IR = 0.7–2.1/100 mosquitoes) and in host-seeking females collected weekly from June through October of 2013 (MLE-IR = 3.8–11.5/100 mosquitoes). Phylogenetic analysis of viral sequences revealed limited genetic variation that lacked obvious geographic structure among strains in the northeastern United States. This new virus is provisionally named Culiseta flavivirus on the basis of its host association with Cs. melanura. PMID:26807512
Functional Role of Infective Viral Particles on Metal Reduction
DOE Office of Scientific and Technical Information (OSTI.GOV)
Coates, John D.
2014-04-01
A proposed strategy for the remediation of uranium (U) contaminated sites was based on the immobilization of U by reducing the oxidized soluble U, U(VI), to form a reduced insoluble end product, U(IV). Previous studies identified Geobacter sp., including G. sulfurreducens and G. metallireducens, as predominant U(VI)-reducing bacteria under acetate-oxidizing and U(VI)-reducing conditions. Examination of the finished genome sequence annotation of the canonical metal reducing species Geobacter sulfurreducens strain PCA and G. metallireduceans strain GS-15 as well as the draft genome sequence of G. uraniumreducens strain Rf4 identified phage related proteins. In addition, the completed genome for Anaeromyxobacter dehalogenans andmore » the draft genome sequence of Desulfovibrio desulfuricans strain G20, two more model metal-reducing bacteria, also revealed phage related sequences. The presence of these gene sequences indicated that Geobacter spp., Anaeromyxobacter spp., and Desulfovibrio spp. are susceptible to viral infection. Furthermore, viral populations in soils and sedimentary environments in the order of 6.4×10{sup 6}–2.7×10{sup 10} VLP’s cm{sup -3} have been observed. In some cases, viral populations exceed bacterial populations in these environments suggesting that a relationship may exist between viruses and bacteria. Our preliminary screens of samples collected from the ESR FRC indicated that viral like particles were observed in significant numbers. The objective of this study was to investigate the potential functional role viruses play in metal reduction specifically Fe(III) and U(VI) reduction, the environmental parameters affecting viral infection of metal reducing bacteria, and the subsequent effects on U transport.« less
Kim, Kiyeon; Omori, Ryosuke; Ueno, Keisuke; Iida, Sayaka; Ito, Kimihito
2016-01-01
Understanding the evolutionary dynamics of influenza viruses is essential to control both avian and human influenza. Here, we analyze host-specific and segment-specific Tajima's D trends of influenza A virus through a systematic review using viral sequences registered in the National Center for Biotechnology Information. To avoid bias from viral population subdivision, viral sequences were stratified according to their sampling locations and sampling years. As a result, we obtained a total of 580 datasets each of which consists of nucleotide sequences of influenza A viruses isolated from a single population of hosts at a single sampling site within a single year. By analyzing nucleotide sequences in the datasets, we found that Tajima's D values of viral sequences were different depending on hosts and gene segments. Tajima's D values of viruses isolated from chicken and human samples showed negative, suggesting purifying selection or a rapid population growth of the viruses. The negative Tajima's D values in rapidly growing viral population were also observed in computer simulations. Tajima's D values of PB2, PB1, PA, NP, and M genes of the viruses circulating in wild mallards were close to zero, suggesting that these genes have undergone neutral selection in constant-sized population. On the other hand, Tajima's D values of HA and NA genes of these viruses were positive, indicating HA and NA have undergone balancing selection in wild mallards. Taken together, these results indicated the existence of unknown factors that maintain viral subtypes in wild mallards.
Genome signature analysis of thermal virus metagenomes reveals Archaea and thermophilic signatures
Pride, David T; Schoenfeld, Thomas
2008-01-01
Background Metagenomic analysis provides a rich source of biological information for otherwise intractable viral communities. However, study of viral metagenomes has been hampered by its nearly complete reliance on BLAST algorithms for identification of DNA sequences. We sought to develop algorithms for examination of viral metagenomes to identify the origin of sequences independent of BLAST algorithms. We chose viral metagenomes obtained from two hot springs, Bear Paw and Octopus, in Yellowstone National Park, as they represent simple microbial populations where comparatively large contigs were obtained. Thermal spring metagenomes have high proportions of sequences without significant Genbank homology, which has hampered identification of viruses and their linkage with hosts. To analyze each metagenome, we developed a method to classify DNA fragments using genome signature-based phylogenetic classification (GSPC), where metagenomic fragments are compared to a database of oligonucleotide signatures for all previously sequenced Bacteria, Archaea, and viruses. Results From both Bear Paw and Octopus hot springs, each assembled contig had more similarity to other metagenome contigs than to any sequenced microbial genome based on GSPC analysis, suggesting a genome signature common to each of these extreme environments. While viral metagenomes from Bear Paw and Octopus share some similarity, the genome signatures from each locale are largely unique. GSPC using a microbial database predicts most of the Octopus metagenome has archaeal signatures, while bacterial signatures predominate in Bear Paw; a finding consistent with those of Genbank BLAST. When using a viral database, the majority of the Octopus metagenome is predicted to belong to archaeal virus Families Globuloviridae and Fuselloviridae, while none of the Bear Paw metagenome is predicted to belong to archaeal viruses. As expected, when microbial and viral databases are combined, each of the Octopus and Bear Paw metagenomic contigs are predicted to belong to viruses rather than to any Bacteria or Archaea, consistent with the apparent viral origin of both metagenomes. Conclusion That BLAST searches identify no significant homologs for most metagenome contigs, while GSPC suggests their origin as archaeal viruses or bacteriophages, indicates GSPC provides a complementary approach in viral metagenomic analysis. PMID:18798991
Genome signature analysis of thermal virus metagenomes reveals Archaea and thermophilic signatures.
Pride, David T; Schoenfeld, Thomas
2008-09-17
Metagenomic analysis provides a rich source of biological information for otherwise intractable viral communities. However, study of viral metagenomes has been hampered by its nearly complete reliance on BLAST algorithms for identification of DNA sequences. We sought to develop algorithms for examination of viral metagenomes to identify the origin of sequences independent of BLAST algorithms. We chose viral metagenomes obtained from two hot springs, Bear Paw and Octopus, in Yellowstone National Park, as they represent simple microbial populations where comparatively large contigs were obtained. Thermal spring metagenomes have high proportions of sequences without significant Genbank homology, which has hampered identification of viruses and their linkage with hosts. To analyze each metagenome, we developed a method to classify DNA fragments using genome signature-based phylogenetic classification (GSPC), where metagenomic fragments are compared to a database of oligonucleotide signatures for all previously sequenced Bacteria, Archaea, and viruses. From both Bear Paw and Octopus hot springs, each assembled contig had more similarity to other metagenome contigs than to any sequenced microbial genome based on GSPC analysis, suggesting a genome signature common to each of these extreme environments. While viral metagenomes from Bear Paw and Octopus share some similarity, the genome signatures from each locale are largely unique. GSPC using a microbial database predicts most of the Octopus metagenome has archaeal signatures, while bacterial signatures predominate in Bear Paw; a finding consistent with those of Genbank BLAST. When using a viral database, the majority of the Octopus metagenome is predicted to belong to archaeal virus Families Globuloviridae and Fuselloviridae, while none of the Bear Paw metagenome is predicted to belong to archaeal viruses. As expected, when microbial and viral databases are combined, each of the Octopus and Bear Paw metagenomic contigs are predicted to belong to viruses rather than to any Bacteria or Archaea, consistent with the apparent viral origin of both metagenomes. That BLAST searches identify no significant homologs for most metagenome contigs, while GSPC suggests their origin as archaeal viruses or bacteriophages, indicates GSPC provides a complementary approach in viral metagenomic analysis.
Rapid Sequencing of Complete env Genes from Primary HIV-1 Samples
Eren, Kemal; Ignacio, Caroline; Landais, Elise; Weaver, Steven; Phung, Pham; Ludka, Colleen; Hepler, Lance; Caballero, Gemma; Pollner, Tristan; Guo, Yan; Richman, Douglas; Poignard, Pascal; Paxinos, Ellen E.; Kosakovsky Pond, Sergei L.
2016-01-01
Abstract The ability to study rapidly evolving viral populations has been constrained by the read length of next-generation sequencing approaches and the sampling depth of single-genome amplification methods. Here, we develop and characterize a method using Pacific Biosciences’ Single Molecule, Real-Time (SMRT®) sequencing technology to sequence multiple, intact full-length human immunodeficiency virus-1 env genes amplified from viral RNA populations circulating in blood, and provide computational tools for analyzing and visualizing these data. PMID:29492273
Sobel Leonard, Ashley; McClain, Micah T; Smith, Gavin J D; Wentworth, David E; Halpin, Rebecca A; Lin, Xudong; Ransier, Amy; Stockwell, Timothy B; Das, Suman R; Gilbert, Anthony S; Lambkin-Williams, Robert; Ginsburg, Geoffrey S; Woods, Christopher W; Koelle, Katia
2016-12-15
Knowledge of influenza virus evolution at the point of transmission and at the intrahost level remains limited, particularly for human hosts. Here, we analyze a unique viral data set of next-generation sequencing (NGS) samples generated from a human influenza challenge study wherein 17 healthy subjects were inoculated with cell- and egg-passaged virus. Nasal wash samples collected from 7 of these subjects were successfully deep sequenced. From these, we characterized changes in the subjects' viral populations during infection and identified differences between the virus in these samples and the viral stock used to inoculate the subjects. We first calculated pairwise genetic distances between the subjects' nasal wash samples, the viral stock, and the influenza virus A/Wisconsin/67/2005 (H3N2) reference strain used to generate the stock virus. These distances revealed that considerable viral evolution occurred at various points in the human challenge study. Further quantitative analyses indicated that (i) the viral stock contained genetic variants that originated and likely were selected for during the passaging process, (ii) direct intranasal inoculation with the viral stock resulted in a selective bottleneck that reduced nonsynonymous genetic diversity in the viral hemagglutinin and nucleoprotein, and (iii) intrahost viral evolution continued over the course of infection. These intrahost evolutionary dynamics were dominated by purifying selection. Our findings indicate that rapid viral evolution can occur during acute influenza infection in otherwise healthy human hosts when the founding population size of the virus is large, as is the case with direct intranasal inoculation. Influenza viruses circulating among humans are known to rapidly evolve over time. However, little is known about how influenza virus evolves across single transmission events and over the course of a single infection. To address these issues, we analyze influenza virus sequences from a human challenge experiment that initiated infection with a cell- and egg-passaged viral stock, which appeared to have adapted during its preparation. We find that the subjects' viral populations differ genetically from the viral stock, with subjects' viral populations having lower representation of the amino-acid-changing variants that arose during viral preparation. We also find that most of the viral evolution occurring over single infections is characterized by further decreases in the frequencies of these amino-acid-changing variants and that only limited intrahost genetic diversification through new mutations is apparent. Our findings indicate that influenza virus populations can undergo rapid genetic changes during acute human infections. Copyright © 2016 Sobel Leonard et al.
Shah, Jigna D; Baller, Joshua; Zhang, Ying; Silverstein, Kevin; Xing, Zheng; Cardona, Carol J
2014-12-01
RNA viruses have been associated with enteritis in poultry and have been isolated from diseased birds. The same viral agents have also been detected in healthy flocks bringing into question their role in health and disease. In order to understand better eukaryotic viruses in the gut, this project focused on evaluating alternative methods to purify and concentrate viral particles, which do not involve the use of density gradients, for generating viral metagenome data. In this study, the sequence outcomes of three tissue processing methods have been evaluated and a data analysis pipeline has been established for RNA viruses from the gastrointestinal tract. In addition, with the use of the best method and increased sequencing depth, a glimpse of the RNA viral community in the gastrointestinal tract of a clinically normal 5-week old turkey is presented. The viruses from the Reoviridae and Astroviridae families together accounted for 76.3% of total viruses identified. The rarefaction curve at the species level further indicated that majority of the species diversity was included with the increased sequencing depth, implying that viruses from other viral families were present in very low abundance. Copyright © 2014 Elsevier B.V. All rights reserved.
Multiplexing Short Primers for Viral Family PCR
DOE Office of Scientific and Technical Information (OSTI.GOV)
Gardner, S N; Hiddessen, A L; Hara, C A
We describe a Multiplex Primer Prediction (MPP) algorithm to build multiplex compatible primer sets for large, diverse, and unalignable sets of target sequences. The MPP algorithm is scalable to larger target sets than other available software, and it does not require a multiple sequence alignment. We applied it to questions in viral detection, and demonstrated that there are no universally conserved priming sequences among viruses and that it could require an unfeasibly large number of primers ({approx}3700 18-mers or {approx}2000 10-mers) to generate amplicons from all sequenced viruses. We then designed primer sets separately for each viral family, and formore » several diverse species such as foot-and-mouth disease virus, hemagglutinin and neuraminidase segments of influenza A virus, Norwalk virus, and HIV-1.« less
Bass, David; Moureau, Gregory; Tang, Shuoya; McAlister, Erica; Culverwell, C. Lorna; Glücksman, Edvard; Wang, Hui; Brown, T. David K.; Gould, Ernest A.; Harbach, Ralph E.; de Lamballerie, Xavier; Firth, Andrew E.
2013-01-01
We investigated whether small RNA (sRNA) sequenced from field-collected mosquitoes and chironomids (Diptera) can be used as a proxy signature of viral prevalence within a range of species and viral groups, using sRNAs sequenced from wild-caught specimens, to inform total RNA deep sequencing of samples of particular interest. Using this strategy, we sequenced from adult Anopheles maculipennis s.l. mosquitoes the apparently nearly complete genome of one previously undescribed virus related to chronic bee paralysis virus, and, from a pool of Ochlerotatus caspius and Oc. detritus mosquitoes, a nearly complete entomobirnavirus genome. We also reconstructed long sequences (1503-6557 nt) related to at least nine other viruses. Crucially, several of the sequences detected were reconstructed from host organisms highly divergent from those in which related viruses have been previously isolated or discovered. It is clear that viral transmission and maintenance cycles in nature are likely to be significantly more complex and taxonomically diverse than previously expected. PMID:24260463
Dacheux, Laurent; Cervantes-Gonzalez, Minerva; Guigon, Ghislaine; Thiberge, Jean-Michel; Vandenbogaert, Mathias; Maufrais, Corinne
2014-01-01
The prediction of viral zoonosis epidemics has become a major public health issue. A profound understanding of the viral population in key animal species acting as reservoirs represents an important step towards this goal. Bats harbor diverse viruses, some of which are of particular interest because they cause severe human diseases. However, little is known about the diversity of the global population of viruses found in bats (virome). We determined the viral diversity of five different French insectivorous bat species (nine specimens in total) in close contact with humans. Sequence-independent amplification, high-throughput sequencing with Illumina technology and a dedicated bioinformatics analysis pipeline were used on pooled tissues (brain, liver and lungs). Comparisons of the sequences of contigs and unassembled reads provided a global taxonomic distribution of virus-related sequences for each sample, highlighting differences both within and between bat species. Many viral families were present in these viromes, including viruses known to infect bacteria, plants/fungi, insects or vertebrates, the most relevant being those infecting mammals (Retroviridae, Herpesviridae, Bunyaviridae, Poxviridae, Flaviviridae, Reoviridae, Bornaviridae, Picobirnaviridae). In particular, we detected several new mammalian viruses, including rotaviruses, gammaretroviruses, bornaviruses and bunyaviruses with the identification of the first bat nairovirus. These observations demonstrate that bats naturally harbor viruses from many different families, most of which infect mammals. They may therefore constitute a major reservoir of viral diversity that should be analyzed carefully, to determine the role played by bats in the spread of zoonotic viral infections. PMID:24489870
The Fecal Viral Flora of Wild Rodents
Phan, Tung G.; Kapusinszky, Beatrix; Wang, Chunlin; Rose, Robert K.; Lipton, Howard L.; Delwart, Eric L.
2011-01-01
The frequent interactions of rodents with humans make them a common source of zoonotic infections. To obtain an initial unbiased measure of the viral diversity in the enteric tract of wild rodents we sequenced partially purified, randomly amplified viral RNA and DNA in the feces of 105 wild rodents (mouse, vole, and rat) collected in California and Virginia. We identified in decreasing frequency sequences related to the mammalian viruses families Circoviridae, Picobirnaviridae, Picornaviridae, Astroviridae, Parvoviridae, Papillomaviridae, Adenoviridae, and Coronaviridae. Seventeen small circular DNA genomes containing one or two replicase genes distantly related to the Circoviridae representing several potentially new viral families were characterized. In the Picornaviridae family two new candidate genera as well as a close genetic relative of the human pathogen Aichi virus were characterized. Fragments of the first mouse sapelovirus and picobirnaviruses were identified and the first murine astrovirus genome was characterized. A mouse papillomavirus genome and fragments of a novel adenovirus and adenovirus-associated virus were also sequenced. The next largest fraction of the rodent fecal virome was related to insect viruses of the Densoviridae, Iridoviridae, Polydnaviridae, Dicistroviriade, Bromoviridae, and Virgaviridae families followed by plant virus-related sequences in the Nanoviridae, Geminiviridae, Phycodnaviridae, Secoviridae, Partitiviridae, Tymoviridae, Alphaflexiviridae, and Tombusviridae families reflecting the largely insect and plant rodent diet. Phylogenetic analyses of full and partial viral genomes therefore revealed many previously unreported viral species, genera, and families. The close genetic similarities noted between some rodent and human viruses might reflect past zoonoses. This study increases our understanding of the viral diversity in wild rodents and highlights the large number of still uncharacterized viruses in mammals. PMID:21909269
Viral taxonomy needs a spring clean; its exploration era is over.
Gibbs, Adrian J
2013-08-09
The International Committee on Taxonomy of Viruses has recently changed its approved definition of a viral species, and also discontinued work on its database of virus descriptions. These events indicate that the exploration era of viral taxonomy has ended; over the past century the principles of viral taxonomy have been established, the tools for phylogenetic inference invented, and the ultimate discriminatory data required for taxonomy, namely gene sequences, are now readily available. Further changes would make viral taxonomy more informative. First, the status of a 'taxonomic species' with an italicized name should only be given to viruses that are specifically linked with a single 'type genomic sequence' like those in the NCBI Reference Sequence Database. Secondly all approved taxa should be predominately monophyletic, and uninformative higher taxa disendorsed. These are 'quality assurance' measures and would improve the value of viral nomenclature to its users. The ICTV should also promote the use of a public database, such as Wikipedia, to replace the ICTV database as a store of the primary metadata of individual viruses, and should publish abstracts of the ICTV Reports in that database, so that they are 'Open Access'.
Luft, F; Klaes, R; Nees, M; Dürst, M; Heilmann, V; Melsheimer, P; von Knebel Doeberitz, M
2001-04-01
Human papillomavirus (HPV) genomes usually persist as episomal molecules in HPV associated preneoplastic lesions whereas they are frequently integrated into the host cell genome in HPV-related cancers cells. This suggests that malignant conversion of HPV-infected epithelia is linked to recombination of cellular and viral sequences. Due to technical limitations, precise sequence information on viral-cellular junctions were obtained only for few cell lines and primary lesions. In order to facilitate the molecular analysis of genomic HPV integration, we established a ligation-mediated PCR assay for the detection of integrated papillomavirus sequences (DIPS-PCR). DIPS-PCR was initially used to amplify genomic viral-cellular junctions from HPV-associated cervical cancer cell lines (C4-I, C4-II, SW756, and HeLa) and HPV-immortalized keratinocyte lines (HPKIA, HPKII). In addition to junctions already reported in public data bases, various new fusion fragments were identified. Subsequently, 22 different viral-cellular junctions were amplified from 17 cervical carcinomas and 1 vulval intraepithelial neoplasia (VIN III). Sequence analysis of each junction revealed that the viral E1 open reading frame (ORF) was fused to cellular sequences in 20 of 22 (91%) cases. Chromosomal integration loci mapped to chromosomes 1 (2n), 2 (3n), 7 (2n), 8 (3n), 10 (1n), 14 (5n), 16 (1n), 17 (2n), and mitochondrial DNA (1n), suggesting random distribution of chromosomal integration sites. Precise sequence information obtained by DIPS-PCR was further used to monitor the monoclonal origin of 4 cervical cancers, 1 case of recurrent premalignant lesions and 1 lymph node metastasis. Therefore, DIPS-PCR might allow efficient therapy control and prediction of relapse in patients with HPV-associated anogenital cancers. Copyright 2001 Wiley-Liss, Inc.
Gallei, Andreas; Orlich, Michaela; Thiel, Heinz-Juergen; Becher, Paul
2005-01-01
Several studies have demonstrated that cytopathogenic (cp) pestivirus strains evolve from noncytopathogenic (noncp) viruses by nonhomologous RNA recombination. In addition, two recent reports showed the rapid emergence of noncp Bovine viral diarrhea virus (BVDV) after a few cell culture passages of cp BVDV strains by homologous recombination between identical duplicated viral sequences. To allow the identification of recombination sites from noncp BVDV strains that evolve from cp viruses, we constructed the cp BVDV strains CP442 and CP552. Both harbor duplicated viral sequences of different origin flanking the cellular insertion Nedd8*; the latter is a prerequisite for their cytopathogenicity. In contrast to the previous studies, isolation of noncp strains was possible only after extensive cell culture passages of CP442 and CP552. Sequence analysis of 15 isolated noncp BVDVs confirmed that all recombinant strains lack at least most of Nedd8*. Interestingly, only one strain resulted from homologous recombination while the other 14 strains were generated by nonhomologous recombination. Accordingly, our data suggest that the extent of sequence identity between participating sequences influences both frequency and mode (homologous versus nonhomologous) of RNA recombination in pestiviruses. Further analyses of the noncp recombinant strains revealed that a duplication of 14 codons in the BVDV nonstructural protein 4B (NS4B) gene does not interfere with efficient viral replication. Moreover, an insertion of viral sequences between the NS4A and NS4B genes was well tolerated. These findings thus led to the identification of two genomic loci which appear to be suited for the insertion of heterologous sequences into the genomes of pestiviruses and related viruses. PMID:16254361
Rotavirus I in feces of a cat with diarrhea.
Phan, Tung G; Leutenegger, Christian M; Chan, Roxanne; Delwart, Eric
2017-06-01
A divergent rotavirus I was detected using viral metagenomics in the feces of a cat with diarrhea. The eleven segments of rotavirus I strain Felis catus encoded non-structural and structural proteins with amino acid identities ranging from 25 to 79% to the only two currently sequenced members of that viral species both derived from canine feces. No other eukaryotic viral sequences nor bacterial and protozoan pathogens were detected in this fecal sample suggesting the involvement of rotavirus I in feline diarrhea.
Kodama, T; Mori, K; Kawahara, T; Ringler, D J; Desrosiers, R C
1993-01-01
One rhesus macaque displayed severe encephalomyelitis and another displayed severe enterocolitis following infection with molecularly cloned simian immunodeficiency virus (SIV) strain SIVmac239. Little or no free anti-SIV antibody developed in these two macaques, and they died relatively quickly (4 to 6 months) after infection. Manifestation of the tissue-specific disease in these macaques was associated with the emergence of variants with high replicative capacity for macrophages and primary infection of tissue macrophages. The nature of sequence variation in the central region (vif, vpr, and vpx), the env gene, and the nef long terminal repeat (LTR) region in brain, colon, and other tissues was examined to see whether specific genetic changes were associated with SIV replication in brain or gut. Sequence analysis revealed strong conservation of the intergenic central region, nef, and the LTR. However, analysis of env sequences in these two macaques and one other revealed significant, interesting patterns of sequence variation. (i) Changes in env that were found previously to contribute to the replicative ability of SIVmac for macrophages in culture were present in the tissues of these animals. (ii) The greatest variability was located in the regions between V1 and V2 and from "V3" through C3 in gp120, which are different in location from the variable regions observed previously in animals with strong antibody responses and long-term persistent infection. (iii) The predominant sequence change of D-->N at position 385 in C3 is most surprising, since this change in both SIV and human immunodeficiency virus type 1 has been associated with dramatically diminished affinity for CD4 and replication in vitro. (iv) The nature of sequence changes at some positions (146, 178, 345, 385, and "V3") suggests that viral replication in brain and gut may be facilitated by specific sequence changes in env in addition to those that impart a general ability to replicate well in macrophages. These results demonstrate that complex selective pressures, including immune responses and varying cell and tissue specificity, can influence the nature of sequence changes in env. Images PMID:8411355
Bull, Marta; Learn, Gerald; Genowati, Indira; McKernan, Jennifer; Hitti, Jane; Lockhart, David; Tapia, Kenneth; Holte, Sarah; Dragavon, Joan; Coombs, Robert; Mullins, James; Frenkel, Lisa
2009-09-22
Compartmentalization of HIV-1 between the genital tract and blood was noted in half of 57 women included in 12 studies primarily using cell-free virus. To further understand differences between genital tract and blood viruses of women with chronic HIV-1 infection cell-free and cell-associated virus populations were sequenced from these tissues, reasoning that integrated viral DNA includes variants archived from earlier in infection, and provides a greater array of genotypes for comparisons. Multiple sequences from single-genome-amplification of HIV-1 RNA and DNA from the genital tract and blood of each woman were compared in a cross-sectional study. Maximum likelihood phylogenies were evaluated for evidence of compartmentalization using four statistical tests. Genital tract and blood HIV-1 appears compartmentalized in 7/13 women by >/=2 statistical analyses. These subjects' phylograms were characterized by low diversity genital-specific viral clades interspersed between clades containing both genital and blood sequences. Many of the genital-specific clades contained monotypic HIV-1 sequences. In 2/7 women, HIV-1 populations were significantly compartmentalized across all four statistical tests; both had low diversity genital tract-only clades. Collapsing monotypic variants into a single sequence diminished the prevalence and extent of compartmentalization. Viral sequences did not demonstrate tissue-specific signature amino acid residues, differential immune selection, or co-receptor usage. In women with chronic HIV-1 infection multiple identical sequences suggest proliferation of HIV-1-infected cells, and low diversity tissue-specific phylogenetic clades are consistent with bursts of viral replication. These monotypic and tissue-specific viruses provide statistical support for compartmentalization of HIV-1 between the female genital tract and blood. However, the intermingling of these clades with clades comprised of both genital and blood sequences and the absence of tissue-specific genetic features suggests compartmentalization between blood and genital tract may be due to viral replication and proliferation of infected cells, and questions whether HIV-1 in the female genital tract is distinct from blood.
Geldenhuys, Marike; Mortlock, Marinda; Weyer, Jacqueline; Bezuidt, Oliver; Seamark, Ernest C J; Kearney, Teresa; Gleasner, Cheryl; Erkkila, Tracy H; Cui, Helen; Markotter, Wanda
2018-01-01
Species within the Neoromicia bat genus are abundant and widely distributed in Africa. It is common for these insectivorous bats to roost in anthropogenic structures in urban regions. Additionally, Neoromicia capensis have previously been identified as potential hosts for Middle East respiratory syndrome (MERS)-related coronaviruses. This study aimed to ascertain the gastrointestinal virome of these bats, as viruses excreted in fecal material or which may be replicating in rectal or intestinal tissues have the greatest opportunities of coming into contact with other hosts. Samples were collected in five regions of South Africa over eight years. Initial virome composition was determined by viral metagenomic sequencing by pooling samples and enriching for viral particles. Libraries were sequenced on the Illumina MiSeq and NextSeq500 platforms, producing a combined 37 million reads. Bioinformatics analysis of the high throughput sequencing data detected the full genome of a novel species of the Circoviridae family, and also identified sequence data from the Adenoviridae, Coronaviridae, Herpesviridae, Parvoviridae, Papillomaviridae, Phenuiviridae, and Picornaviridae families. Metagenomic sequencing data was insufficient to determine the viral diversity of certain families due to the fragmented coverage of genomes and lack of suitable sequencing depth, as some viruses were detected from the analysis of reads-data only. Follow up conventional PCR assays targeting conserved gene regions for the Adenoviridae, Coronaviridae, and Herpesviridae families were used to confirm metagenomic data and generate additional sequences to determine genetic diversity. The complete coding genome of a MERS-related coronavirus was recovered with additional amplicon sequencing on the MiSeq platform. The new genome shared 97.2% overall nucleotide identity to a previous Neoromicia-associated MERS-related virus, also from South Africa. Conventional PCR analysis detected diverse adenovirus and herpesvirus sequences that were widespread throughout Neoromicia populations in South Africa. Furthermore, similar adenovirus sequences were detected within these populations throughout several years. With the exception of the coronaviruses, the study represents the first report of sequence data from several viral families within a Southern African insectivorous bat genus; highlighting the need for continued investigations in this regard.
Geldenhuys, Marike; Mortlock, Marinda; Weyer, Jacqueline; Bezuidt, Oliver; Seamark, Ernest C. J.; Kearney, Teresa; Gleasner, Cheryl; Erkkila, Tracy H.; Cui, Helen; Markotter, Wanda
2018-01-01
Species within the Neoromicia bat genus are abundant and widely distributed in Africa. It is common for these insectivorous bats to roost in anthropogenic structures in urban regions. Additionally, Neoromicia capensis have previously been identified as potential hosts for Middle East respiratory syndrome (MERS)-related coronaviruses. This study aimed to ascertain the gastrointestinal virome of these bats, as viruses excreted in fecal material or which may be replicating in rectal or intestinal tissues have the greatest opportunities of coming into contact with other hosts. Samples were collected in five regions of South Africa over eight years. Initial virome composition was determined by viral metagenomic sequencing by pooling samples and enriching for viral particles. Libraries were sequenced on the Illumina MiSeq and NextSeq500 platforms, producing a combined 37 million reads. Bioinformatics analysis of the high throughput sequencing data detected the full genome of a novel species of the Circoviridae family, and also identified sequence data from the Adenoviridae, Coronaviridae, Herpesviridae, Parvoviridae, Papillomaviridae, Phenuiviridae, and Picornaviridae families. Metagenomic sequencing data was insufficient to determine the viral diversity of certain families due to the fragmented coverage of genomes and lack of suitable sequencing depth, as some viruses were detected from the analysis of reads-data only. Follow up conventional PCR assays targeting conserved gene regions for the Adenoviridae, Coronaviridae, and Herpesviridae families were used to confirm metagenomic data and generate additional sequences to determine genetic diversity. The complete coding genome of a MERS-related coronavirus was recovered with additional amplicon sequencing on the MiSeq platform. The new genome shared 97.2% overall nucleotide identity to a previous Neoromicia-associated MERS-related virus, also from South Africa. Conventional PCR analysis detected diverse adenovirus and herpesvirus sequences that were widespread throughout Neoromicia populations in South Africa. Furthermore, similar adenovirus sequences were detected within these populations throughout several years. With the exception of the coronaviruses, the study represents the first report of sequence data from several viral families within a Southern African insectivorous bat genus; highlighting the need for continued investigations in this regard. PMID:29579103
Real-time PCR for simultaneous detection and genotyping of bovine viral diarrhea virus.
Letellier, C; Kerkhofs, P
2003-12-01
Since two genotypes of bovine viral diarrhea viruses (BVDV) occur in Belgian herds, their differentiation is important for disease surveillance. A quantitative real-time PCR assay was developed to detect and classify bovine viral diarrhea viruses in genotype I and II. A pair of primers specific for highly conserved regions of the 5'UTR and two TaqMan probes were designed. The FAM and VIC-labeled probe sequences differed by three nucleotides, allowing the differentiation between genotype I and II. The assay detectability of genotype I and II real-time PCR assay was 1000 and 100 copies, respectively. Highly reproducible data were obtained as the coefficients of variation of threshold cycle values in inter-runs were less than 2.2%. The correct classification of genotype I and II viruses was assessed by using reference strains and characterized field isolates of both genotypes. The application to clinical diagnosis was evaluated on pooled blood samples by post run measurement of the FAM- and VIC-associated fluorescence. The 100% agreement with the conventional RT-PCR method confirmed that this new technique could be used for routine detection of persistently infected immunotolerant animals.
PCR Amplification Strategies towards full-length HIV-1 Genome sequencing.
Liu, Chao Chun; Ji, Hezhao
2018-06-26
The advent of next generation sequencing has enabled greater resolution of viral diversity and improved feasibility of full viral genome sequencing allowing routine HIV-1 full genome sequencing in both research and diagnostic settings. Regardless of the sequencing platform selected, successful PCR amplification of the HIV-1 genome is essential for sequencing template preparation. As such, full HIV-1 genome amplification is a crucial step in dictating the successful and reliable sequencing downstream. Here we reviewed existing PCR protocols leading to HIV-1 full genome sequencing. In addition to the discussion on basic considerations on relevant PCR design, the advantages as well as the pitfalls of published protocols were reviewed. Copyright© Bentham Science Publishers; For any queries, please email at epub@benthamscience.org.
Steel, Olivia; Kraberger, Simona; Sikorski, Alyssa; Young, Laura M; Catchpole, Ryan J; Stevens, Aaron J; Ladley, Jenny J; Coray, Dorien S; Stainton, Daisy; Dayaram, Anisha; Julian, Laurel; van Bysterveldt, Katherine; Varsani, Arvind
2016-09-01
In recent years, innovations in molecular techniques and sequencing technologies have resulted in a rapid expansion in the number of known viral sequences, in particular those with circular replication-associated protein (Rep)-encoding single-stranded (CRESS) DNA genomes. CRESS DNA viruses are present in the virome of many ecosystems and are known to infect a wide range of organisms. A large number of the recently identified CRESS DNA viruses cannot be classified into any known viral families, indicating that the current view of CRESS DNA viral sequence space is greatly underestimated. Animal faecal matter has proven to be a particularly useful source for sampling CRESS DNA viruses in an ecosystem, as it is cost-effective and non-invasive. In this study a viral metagenomic approach was used to explore the diversity of CRESS DNA viruses present in the faeces of domesticated and wild animals in New Zealand. Thirty-eight complete CRESS DNA viral genomes and two circular molecules (that may be defective molecules or single components of multicomponent genomes) were identified from forty-nine individual animal faecal samples. Based on shared genome organisations and sequence similarities, eighteen of the isolates were classified as gemycircularviruses and twelve isolates were classified as smacoviruses. The remaining eight isolates lack significant sequence similarity with any members of known CRESS DNA virus groups. This research adds significantly to our knowledge of CRESS DNA viral diversity in New Zealand, emphasising the prevalence of CRESS DNA viruses in nature, and reinforcing the suggestion that a large proportion of CRESS DNA viruses are yet to be identified. Copyright © 2016 Elsevier B.V. All rights reserved.
A metagenomic survey of viral abundance and diversity in mosquitoes from Hubei province.
Shi, Chenyan; Liu, Yi; Hu, Xiaomin; Xiong, Jinfeng; Zhang, Bo; Yuan, Zhiming
2015-01-01
Mosquitoes as one of the most common but important vectors have the potential to transmit or acquire a lot of viruses through biting, however viral flora in mosquitoes and its impact on mosquito-borne disease transmission has not been well investigated and evaluated. In this study, the metagenomic techniquehas been successfully employed in analyzing the abundance and diversity of viral community in three mosquito samples from Hubei, China. Among 92,304 reads produced through a run with 454 GS FLX system, 39% have high similarities with viral sequences belonging to identified bacterial, fungal, animal, plant and insect viruses, and 0.02% were classed into unidentified viral sequences, demonstrating high abundance and diversity of viruses in mosquitoes. Furthermore, two novel viruses in subfamily Densovirinae and family Dicistroviridae were identified, and six torque tenosus virus1 in family Anelloviridae, three porcine parvoviruses in subfamily Parvovirinae and a Culex tritaeniorhynchus rhabdovirus in Family Rhabdoviridae were preliminarily characterized. The viral metagenomic analysis offered us a deep insight into the viral population of mosquito which played an important role in viral initiative or passive transmission and evolution during the process.
Metavir 2: new tools for viral metagenome comparison and assembled virome analysis
2014-01-01
Background Metagenomics, based on culture-independent sequencing, is a well-fitted approach to provide insights into the composition, structure and dynamics of environmental viral communities. Following recent advances in sequencing technologies, new challenges arise for existing bioinformatic tools dedicated to viral metagenome (i.e. virome) analysis as (i) the number of viromes is rapidly growing and (ii) large genomic fragments can now be obtained by assembling the huge amount of sequence data generated for each metagenome. Results To face these challenges, a new version of Metavir was developed. First, all Metavir tools have been adapted to support comparative analysis of viromes in order to improve the analysis of multiple datasets. In addition to the sequence comparison previously provided, viromes can now be compared through their k-mer frequencies, their taxonomic compositions, recruitment plots and phylogenetic trees containing sequences from different datasets. Second, a new section has been specifically designed to handle assembled viromes made of thousands of large genomic fragments (i.e. contigs). This section includes an annotation pipeline for uploaded viral contigs (gene prediction, similarity search against reference viral genomes and protein domains) and an extensive comparison between contigs and reference genomes. Contigs and their annotations can be explored on the website through specifically developed dynamic genomic maps and interactive networks. Conclusions The new features of Metavir 2 allow users to explore and analyze viromes composed of raw reads or assembled fragments through a set of adapted tools and a user-friendly interface. PMID:24646187
Quick, Josh; Grubaugh, Nathan D; Pullan, Steven T; Claro, Ingra M; Smith, Andrew D; Gangavarapu, Karthik; Oliveira, Glenn; Robles-Sikisaka, Refugio; Rogers, Thomas F; Beutler, Nathan A; Burton, Dennis R; Lewis-Ximenez, Lia Laura; de Jesus, Jaqueline Goes; Giovanetti, Marta; Hill, Sarah; Black, Allison; Bedford, Trevor; Carroll, Miles W; Nunes, Marcio; Alcantara, Luiz Carlos; Sabino, Ester C; Baylis, Sally A; Faria, Nuno; Loose, Matthew; Simpson, Jared T; Pybus, Oliver G; Andersen, Kristian G; Loman, Nicholas J
2018-01-01
Genome sequencing has become a powerful tool for studying emerging infectious diseases; however, genome sequencing directly from clinical samples without isolation remains challenging for viruses such as Zika, where metagenomic sequencing methods may generate insufficient numbers of viral reads. Here we present a protocol for generating coding-sequence complete genomes comprising an online primer design tool, a novel multiplex PCR enrichment protocol, optimised library preparation methods for the portable MinION sequencer (Oxford Nanopore Technologies) and the Illumina range of instruments, and a bioinformatics pipeline for generating consensus sequences. The MinION protocol does not require an internet connection for analysis, making it suitable for field applications with limited connectivity. Our method relies on multiplex PCR for targeted enrichment of viral genomes from samples containing as few as 50 genome copies per reaction. Viral consensus sequences can be achieved starting with clinical samples in 1-2 days following a simple laboratory workflow. This method has been successfully used by several groups studying Zika virus evolution and is facilitating an understanding of the spread of the virus in the Americas. PMID:28538739
Variation and Evolution in the Glutamine-Rich Repeat Region of Drosophila Argonaute-2
Palmer, William H.; Obbard, Darren J.
2016-01-01
RNA interference pathways mediate biological processes through Argonaute-family proteins, which bind small RNAs as guides to silence complementary target nucleic acids . In insects and crustaceans Argonaute-2 silences viral nucleic acids, and therefore acts as a primary effector of innate antiviral immunity. Although the function of the major Argonaute-2 domains, which are conserved across most Argonaute-family proteins, are known, many invertebrate Argonaute-2 homologs contain a glutamine-rich repeat (GRR) region of unknown function at the N-terminus . Here we combine long-read amplicon sequencing of Drosophila Genetic Reference Panel (DGRP) lines with publicly available sequence data from many insect species to show that this region evolves extremely rapidly and is hyper-variable within species. We identify distinct GRR haplotype groups in Drosophila melanogaster, and suggest that one of these haplotype groups has recently risen to high frequency in a North American population. Finally, we use published data from genome-wide association studies of viral resistance in D. melanogaster to test whether GRR haplotypes are associated with survival after virus challenge. We find a marginally significant association with survival after challenge with Drosophila C Virus in the DGRP, but we were unable to replicate this finding using lines from the Drosophila Synthetic Population Resource panel. PMID:27317784
Mutation of HIV-1 genomes in a clinical population treated with the mutagenic nucleoside KP1461.
Mullins, James I; Heath, Laura; Hughes, James P; Kicha, Jessica; Styrchak, Sheila; Wong, Kim G; Rao, Ushnal; Hansen, Alexis; Harris, Kevin S; Laurent, Jean-Pierre; Li, Deyu; Simpson, Jeffrey H; Essigmann, John M; Loeb, Lawrence A; Parkins, Jeffrey
2011-01-14
The deoxycytidine analog KP1212, and its prodrug KP1461, are prototypes of a new class of antiretroviral drugs designed to increase viral mutation rates, with the goal of eventually causing the collapse of the viral population. Here we present an extensive analysis of viral sequences from HIV-1 infected volunteers from the first "mechanism validation" phase II clinical trial of a mutagenic base analog in which individuals previously treated with antiviral drugs received 1600 mg of KP1461 twice per day for 124 days. Plasma viral loads were not reduced, and overall levels of viral mutation were not increased during this short-term study, however, the mutation spectrum of HIV was altered. A large number (N = 105 per sample) of sequences were analyzed, each derived from individual HIV-1 RNA templates, after 0, 56 and 124 days of therapy from 10 treated and 10 untreated control individuals (>7.1 million base pairs of unique viral templates were sequenced). We found that private mutations, those not found in more than one viral sequence and likely to have occurred in the most recent rounds of replication, increased in treated individuals relative to controls after 56 (p = 0.038) and 124 (p = 0.002) days of drug treatment. The spectrum of mutations observed in the treated group showed an excess of A to G and G to A mutations (p = 0.01), and to a lesser extent T to C and C to T mutations (p = 0.09), as predicted by the mechanism of action of the drug. These results validate the proposed mechanism of action in humans and should spur development of this novel antiretroviral approach.
Mutation of HIV-1 Genomes in a Clinical Population Treated with the Mutagenic Nucleoside KP1461
Mullins, James I.; Heath, Laura; Hughes, James P.; Kicha, Jessica; Styrchak, Sheila; Wong, Kim G.; Rao, Ushnal; Hansen, Alexis; Harris, Kevin S.; Laurent, Jean-Pierre; Li, Deyu; Simpson, Jeffrey H.; Essigmann, John M.; Loeb, Lawrence A.; Parkins, Jeffrey
2011-01-01
The deoxycytidine analog KP1212, and its prodrug KP1461, are prototypes of a new class of antiretroviral drugs designed to increase viral mutation rates, with the goal of eventually causing the collapse of the viral population. Here we present an extensive analysis of viral sequences from HIV-1 infected volunteers from the first “mechanism validation” phase II clinical trial of a mutagenic base analog in which individuals previously treated with antiviral drugs received 1600 mg of KP1461 twice per day for 124 days. Plasma viral loads were not reduced, and overall levels of viral mutation were not increased during this short-term study, however, the mutation spectrum of HIV was altered. A large number (N = 105 per sample) of sequences were analyzed, each derived from individual HIV-1 RNA templates, after 0, 56 and 124 days of therapy from 10 treated and 10 untreated control individuals (>7.1 million base pairs of unique viral templates were sequenced). We found that private mutations, those not found in more than one viral sequence and likely to have occurred in the most recent rounds of replication, increased in treated individuals relative to controls after 56 (p = 0.038) and 124 (p = 0.002) days of drug treatment. The spectrum of mutations observed in the treated group showed an excess of A to G and G to A mutations (p = 0.01), and to a lesser extent T to C and C to T mutations (p = 0.09), as predicted by the mechanism of action of the drug. These results validate the proposed mechanism of action in humans and should spur development of this novel antiretroviral approach. PMID:21264288
Anderson, Tavis K; Laegreid, William W; Cerutti, Francesco; Osorio, Fernando A; Nelson, Eric A; Christopher-Hennings, Jane; Goldberg, Tony L
2012-06-15
The extraordinary genetic and antigenic variability of RNA viruses is arguably the greatest challenge to the development of broadly effective vaccines. No single viral variant can induce sufficiently broad immunity, and incorporating all known naturally circulating variants into one multivalent vaccine is not feasible. Furthermore, no objective strategies currently exist to select actual viral variants that should be included or excluded in polyvalent vaccines. To address this problem, we demonstrate a method based on graph theory that quantifies the relative importance of viral variants. We demonstrate our method through application to the envelope glycoprotein gene of a particularly diverse RNA virus of pigs: porcine reproductive and respiratory syndrome virus (PRRSV). Using distance matrices derived from sequence nucleotide difference, amino acid difference and evolutionary distance, we constructed viral networks and used common network statistics to assign each sequence an objective ranking of relative 'importance'. To validate our approach, we use an independent published algorithm to score our top-ranked wild-type variants for coverage of putative T-cell epitopes across the 9383 sequences in our dataset. Top-ranked viruses achieve significantly higher coverage than low-ranked viruses, and top-ranked viruses achieve nearly equal coverage as a synthetic mosaic protein constructed in silico from the same set of 9383 sequences. Our approach relies on the network structure of PRRSV but applies to any diverse RNA virus because it identifies subsets of viral variants that are most important to overall viral diversity. We suggest that this method, through the objective quantification of variant importance, provides criteria for choosing viral variants for further characterization, diagnostics, surveillance and ultimately polyvalent vaccine development.
Singh, Mini Pritam; Majumdar, Manasi; Thapa, Babu Ram; Gupta, Puneet Kumar; Khurana, Jasmine; Budhathoki, Bimal; Ratho, Radha Kanta
2015-02-01
Hepatitis A virus usually causes acute viral hepatitis (AVH) in the paediatric age group with a recent shift in age distribution and disease manifestations like acute liver failure (ALF). This has been attributed to mutations in 5'non-translated region (5'NTR) which affects the viral multiplication. The present study was aimed to carry out the molecular detection and phylogenetic analysis of hepatitis A virus strains circulating in north western India. Serum samples from in patients and those attending out patient department of Pediatric Gastroenterology in a tertiary care hospital in north India during 2007-2011 with clinically suspected AVH were tested for anti-hepatitis A virus (HAV) IgM antibodies. Acute phase serum samples were subjected to nested PCR targeting the 5'NTR region followed by sequencing of the representative strains. A total of 1334 samples were tested, 290 (21.7%) were positive for anti-HAV IgM antibody. Of these, 78 serum samples (< 7 days old) were subjected to PCR and 47.4% (37/78) samples showed the presence of HAV RNA. Children < 15 yr of age accounted for majority (94%) of cases with highest seropositivity during rainy season. Sequencing of 15 representative strains was carried out and the circulating genotype was found to be III A. The nucleotide sequences showed high homology among the strains with a variation ranging from 0.1-1 per cent over the years. An important substitution of G to A at 324 position was shown by both AVH and ALF strains. The cumulative substitution in AVH strains Vs ALF strains as compared to GBM, Indian and prototype strain in the 200-500 region of 5' NTR was comparable. Our results showed hepatitis A still a disease of children with III A as a circulating genotype in this region. The mutations at 5'NTR region warrant further analysis as these affect the structure of internal ribosomal entry site which is important for viral replication.
Zahraei, Bentolhoda; Hashemzadeh, Mohammad Sadegh; Najarasl, Mohammad; Zahiriyeganeh, Samaneh; Tat, Mahdi; Metanat, Maliheh; Sepehri Rad, Nahid; Khansari-Nejad, Behzad; Zafari, Ehsan; Sharti, Mojtaba; Dorostkar, Ruhollah
2016-01-01
The Crimean-Congo hemorrhagic fever (CCHF) virus causes severe disease in humans, with a high mortality rate. Since, there is no approved vaccine or specific treatment for CCHF, an early and accurate diagnosis, as well as reliable surveillance, is essential for case management and patient improvement. For this research, our aim was to evaluate the application of a novel SYBR Green based one-step real-time reverse-transcriptase polymerase chain reaction (rRT-PCR) assay for the in-house diagnosis of the CCHF virus. In this experimental study, the highly conserved S-region sequence of the CCHF viral genome was first adapted from GenBank, and the specific primers targeting this region were designed. Then, the viral RNA was extracted from 75 serum samples from different patients in eastern Iran. The sensitivity and specificity of the primers were also evaluated in positive serum samples previously confirmed to have the CCHF virus, by this one-step rRT-PCR assay, as well as a DNA sequencing analysis. From a total of 75 suspected serum samples, 42 were confirmed to be positive for CCHF virus, with no false-positives detected by the sequencing results. After 40 amplification cycles, the melting curve analysis revealed a mean melting temperature (Tm) of 86.5 ± 0.6°C (quite different from those of the primer-dimers), and the positive samples showed only a small variation in the parameters. In all of the positive samples, the predicted length of 420 bp was confirmed by electrophoresis. Moreover, the sensitivity test showed that this assay can detect less than 20 copies of viral RNA per reaction. This study showed that this novel one-step rRT-PCR assay is a rapid, reliable, repeatable, specific, sensitive, and simple tool for the detection of the CCHF virus.
Wymant, Chris; Colijn, Caroline; Danaviah, Siva; Essex, Max; Frost, Simon; Gall, Astrid; Gaseitsiwe, Simani; Grabowski, Mary K.; Gray, Ronald; Guindon, Stephane; von Haeseler, Arndt; Kaleebu, Pontiano; Kendall, Michelle; Kozlov, Alexey; Manasa, Justen; Minh, Bui Quang; Moyo, Sikhulile; Novitsky, Vlad; Nsubuga, Rebecca; Pillay, Sureshnee; Quinn, Thomas C.; Serwadda, David; Ssemwanga, Deogratius; Stamatakis, Alexandros; Trifinopoulos, Jana; Wawer, Maria; Brown, Andy Leigh; de Oliveira, Tulio; Kellam, Paul; Pillay, Deenan; Fraser, Christophe
2017-01-01
Abstract To characterize HIV-1 transmission dynamics in regions where the burden of HIV-1 is greatest, the “Phylogenetics and Networks for Generalised HIV Epidemics in Africa” consortium (PANGEA-HIV) is sequencing full-genome viral isolates from across sub-Saharan Africa. We report the first 3,985 PANGEA-HIV consensus sequences from four cohort sites (Rakai Community Cohort Study, n = 2,833; MRC/UVRI Uganda, n = 701; Mochudi Prevention Project, n = 359; Africa Health Research Institute Resistance Cohort, n = 92). Next-generation sequencing success rates varied: more than 80% of the viral genome from the gag to the nef genes could be determined for all sequences from South Africa, 75% of sequences from Mochudi, 60% of sequences from MRC/UVRI Uganda, and 22% of sequences from Rakai. Partial sequencing failure was primarily associated with low viral load, increased for amplicons closer to the 3′ end of the genome, was not associated with subtype diversity except HIV-1 subtype D, and remained significantly associated with sampling location after controlling for other factors. We assessed the impact of the missing data patterns in PANGEA-HIV sequences on phylogeny reconstruction in simulations. We found a threshold in terms of taxon sampling below which the patchy distribution of missing characters in next-generation sequences (NGS) has an excess negative impact on the accuracy of HIV-1 phylogeny reconstruction, which is attributable to tree reconstruction artifacts that accumulate when branches in viral trees are long. The large number of PANGEA-HIV sequences provides unprecedented opportunities for evaluating HIV-1 transmission dynamics across sub-Saharan Africa and identifying prevention opportunities. Molecular epidemiological analyses of these data must proceed cautiously because sequence sampling remains below the identified threshold and a considerable negative impact of missing characters on phylogeny reconstruction is expected. PMID:28540766
Ratmann, Oliver; Wymant, Chris; Colijn, Caroline; Danaviah, Siva; Essex, M; Frost, Simon D W; Gall, Astrid; Gaiseitsiwe, Simani; Grabowski, Mary; Gray, Ronald; Guindon, Stephane; von Haeseler, Arndt; Kaleebu, Pontiano; Kendall, Michelle; Kozlov, Alexey; Manasa, Justen; Minh, Bui Quang; Moyo, Sikhulile; Novitsky, Vladimir; Nsubuga, Rebecca; Pillay, Sureshnee; Quinn, Thomas C; Serwadda, David; Ssemwanga, Deogratius; Stamatakis, Alexandros; Trifinopoulos, Jana; Wawer, Maria; Leigh Brown, Andrew; de Oliveira, Tulio; Kellam, Paul; Pillay, Deenan; Fraser, Christophe
2017-05-25
To characterize HIV-1 transmission dynamics in regions where the burden of HIV-1 is greatest, the 'Phylogenetics and Networks for Generalised HIV Epidemics in Africa' consortium (PANGEA-HIV) is sequencing full-genome viral isolates from across sub-Saharan Africa. We report the first 3,985 PANGEA-HIV consensus sequences from four cohort sites (Rakai Community Cohort Study, n=2,833; MRC/UVRI Uganda, n=701; Mochudi Prevention Project, n=359; Africa Health Research Institute Resistance Cohort, n=92). Next-generation sequencing success rates varied: more than 80% of the viral genome from the gag to the nef genes could be determined for all sequences from South Africa, 75% of sequences from Mochudi, 60% of sequences from MRC/UVRI Uganda, and 22% of sequences from Rakai. Partial sequencing failure was primarily associated with low viral load, increased for amplicons closer to the 3' end of the genome, was not associated with subtype diversity except HIV-1 subtype D, and remained significantly associated with sampling location after controlling for other factors. We assessed the impact of the missing data patterns in PANGEA-HIV sequences on phylogeny reconstruction in simulations. We found a threshold in terms of taxon sampling below which the patchy distribution of missing characters in next-generation sequences has an excess negative impact on the accuracy of HIV-1 phylogeny reconstruction, which is attributable to tree reconstruction artifacts that accumulate when branches in viral trees are long. The large number of PANGEA-HIV sequences provides unprecedented opportunities for evaluating HIV-1 transmission dynamics across sub-Saharan Africa and identifying prevention opportunities. Molecular epidemiological analyses of these data must proceed cautiously because sequence sampling remains below the identified threshold and a considerable negative impact of missing characters on phylogeny reconstruction is expected.
Madi, Nada; Al-Nakib, Widad; Mustafa, Abu Salim; Habibi, Nazima
2018-03-01
A metagenomic approach based on target independent next-generation sequencing has become a known method for the detection of both known and novel viruses in clinical samples. This study aimed to use the metagenomic sequencing approach to characterize the viral diversity in respiratory samples from patients with respiratory tract infections. We have investigated 86 respiratory samples received from various hospitals in Kuwait between 2015 and 2016 for the diagnosis of respiratory tract infections. A metagenomic approach using the next-generation sequencer to characterize viruses was used. According to the metagenomic analysis, an average of 145, 019 reads were identified, and 2% of these reads were of viral origin. Also, metagenomic analysis of the viral sequences revealed many known respiratory viruses, which were detected in 30.2% of the clinical samples. Also, sequences of non-respiratory viruses were detected in 14% of the clinical samples, while sequences of non-human viruses were detected in 55.8% of the clinical samples. The average genome coverage of the viruses was 12% with the highest genome coverage of 99.2% for respiratory syncytial virus, and the lowest was 1% for torque teno midi virus 2. Our results showed 47.7% agreement between multiplex Real-Time PCR and metagenomics sequencing in the detection of respiratory viruses in the clinical samples. Though there are some difficulties in using this method to clinical samples such as specimen quality, these observations are indicative of the promising utility of the metagenomic sequencing approach for the identification of respiratory viruses in patients with respiratory tract infections. © 2017 Wiley Periodicals, Inc.
Tizioto, Polyana C; Kim, JaeWoo; Seabury, Christopher M; Schnabel, Robert D; Gershwin, Laurel J; Van Eenennaam, Alison L; Toaff-Rosenstein, Rachel; Neibergs, Holly L; Taylor, Jeremy F
2015-01-01
Susceptibility to bovine respiratory disease (BRD) is multi-factorial and is influenced by stress in conjunction with infection by both bacterial and viral pathogens. While vaccination is broadly used in an effort to prevent BRD, it is far from being fully protective and cases diagnosed from a combination of observed clinical signs without any attempt at identifying the causal pathogens are usually treated with antibiotics. Dairy and beef cattle losses from BRD are profound worldwide and genetic studies have now been initiated to elucidate host loci which underlie susceptibility with the objective of enabling molecular breeding to reduce disease prevalence. In this study, we employed RNA sequencing to examine the bronchial lymph node transcriptomes of controls and beef cattle which had individually been experimentally challenged with bovine respiratory syncytial virus, infectious bovine rhinotracheitis, bovine viral diarrhea virus, Pasteurella multocida, Mannheimia haemolytica or Mycoplasma bovis to identify the genes that are involved in the bovine immune response to infection. We found that 142 differentially expressed genes were located in previously described quantitative trait locus regions associated with risk of BRD. Mutations affecting the expression or amino acid composition of these genes may affect disease susceptibility and could be incorporated into molecular breeding programs. Genes involved in innate immunity were generally found to be differentially expressed between the control and pathogen-challenged animals suggesting that variation in these genes may lead to a heritability of susceptibility that is pathogen independent. However, we also found pathogen-specific expression profiles which suggest that host genetic variation for BRD susceptibility is pathogen dependent.
Genome variations associated with viral susceptibility and calcification in Emiliania huxleyi.
Kegel, Jessica U; John, Uwe; Valentin, Klaus; Frickenhaus, Stephan
2013-01-01
Emiliania huxleyi, a key player in the global carbon cycle is one of the best studied coccolithophores with respect to biogeochemical cycles, climatology, and host-virus interactions. Strains of E. huxleyi show phenotypic plasticity regarding growth behaviour, light-response, calcification, acidification, and virus susceptibility. This phenomenon is likely a consequence of genomic differences, or transcriptomic responses, to environmental conditions or threats such as viral infections. We used an E. huxleyi genome microarray based on the sequenced strain CCMP1516 (reference strain) to perform comparative genomic hybridizations (CGH) of 16 E. huxleyi strains of different geographic origin. We investigated the genomic diversity and plasticity and focused on the identification of genes related to virus susceptibility and coccolith production (calcification). Among the tested 31940 gene models a core genome of 14628 genes was identified by hybridization among 16 E. huxleyi strains. 224 probes were characterized as specific for the reference strain CCMP1516. Compared to the sequenced E. huxleyi strain CCMP1516 variation in gene content of up to 30 percent among strains was observed. Comparison of core and non-core transcripts sets in terms of annotated functions reveals a broad, almost equal functional coverage over all KOG-categories of both transcript sets within the whole annotated genome. Within the variable (non-core) genome we identified genes associated with virus susceptibility and calcification. Genes associated with virus susceptibility include a Bax inhibitor-1 protein, three LRR receptor-like protein kinases, and mitogen-activated protein kinase. Our list of transcripts associated with coccolith production will stimulate further research, e.g. by genetic manipulation. In particular, the V-type proton ATPase 16 kDa proteolipid subunit is proposed to be a plausible target gene for further calcification studies.
Genome Variations Associated with Viral Susceptibility and Calcification in Emiliania huxleyi
Kegel, Jessica U.; John, Uwe; Valentin, Klaus; Frickenhaus, Stephan
2013-01-01
Emiliania huxleyi, a key player in the global carbon cycle is one of the best studied coccolithophores with respect to biogeochemical cycles, climatology, and host-virus interactions. Strains of E. huxleyi show phenotypic plasticity regarding growth behaviour, light-response, calcification, acidification, and virus susceptibility. This phenomenon is likely a consequence of genomic differences, or transcriptomic responses, to environmental conditions or threats such as viral infections. We used an E. huxleyi genome microarray based on the sequenced strain CCMP1516 (reference strain) to perform comparative genomic hybridizations (CGH) of 16 E. huxleyi strains of different geographic origin. We investigated the genomic diversity and plasticity and focused on the identification of genes related to virus susceptibility and coccolith production (calcification). Among the tested 31940 gene models a core genome of 14628 genes was identified by hybridization among 16 E. huxleyi strains. 224 probes were characterized as specific for the reference strain CCMP1516. Compared to the sequenced E. huxleyi strain CCMP1516 variation in gene content of up to 30 percent among strains was observed. Comparison of core and non-core transcripts sets in terms of annotated functions reveals a broad, almost equal functional coverage over all KOG-categories of both transcript sets within the whole annotated genome. Within the variable (non-core) genome we identified genes associated with virus susceptibility and calcification. Genes associated with virus susceptibility include a Bax inhibitor-1 protein, three LRR receptor-like protein kinases, and mitogen-activated protein kinase. Our list of transcripts associated with coccolith production will stimulate further research, e.g. by genetic manipulation. In particular, the V-type proton ATPase 16 kDa proteolipid subunit is proposed to be a plausible target gene for further calcification studies. PMID:24260453
Error catastrophe and phase transition in the empirical fitness landscape of HIV
NASA Astrophysics Data System (ADS)
Hart, Gregory R.; Ferguson, Andrew L.
2015-03-01
We have translated clinical sequence databases of the p6 HIV protein into an empirical fitness landscape quantifying viral replicative capacity as a function of the amino acid sequence. We show that the viral population resides close to a phase transition in sequence space corresponding to an "error catastrophe" beyond which there is lethal accumulation of mutations. Our model predicts that the phase transition may be induced by drug therapies that elevate the mutation rate, or by forcing mutations at particular amino acids. Applying immune pressure to any combination of killer T-cell targets cannot induce the transition, providing a rationale for why the viral protein can exist close to the error catastrophe without sustaining fatal fitness penalties due to adaptive immunity.
Duda, Anja; Stange, Annett; Lüftenegger, Daniel; Stanke, Nicole; Westphal, Dana; Pietschmann, Thomas; Eastman, Scott W; Linial, Maxine L; Rethwilm, Axel; Lindemann, Dirk
2004-12-01
Analogous to cellular glycoproteins, viral envelope proteins contain N-terminal signal sequences responsible for targeting them to the secretory pathway. The prototype foamy virus (PFV) envelope (Env) shows a highly unusual biosynthesis. Its precursor protein has a type III membrane topology with both the N and C terminus located in the cytoplasm. Coexpression of FV glycoprotein and interaction of its leader peptide (LP) with the viral capsid is essential for viral particle budding and egress. Processing of PFV Env into the particle-associated LP, surface (SU), and transmembrane (TM) subunits occur posttranslationally during transport to the cell surface by yet-unidentified cellular proteases. Here we provide strong evidence that furin itself or a furin-like protease and not the signal peptidase complex is responsible for both processing events. N-terminal protein sequencing of the SU and TM subunits of purified PFV Env-immunoglobulin G immunoadhesin identified furin consensus sequences upstream of both cleavage sites. Mutagenesis analysis of two overlapping furin consensus sequences at the PFV LP/SU cleavage site in the wild-type protein confirmed the sequencing data and demonstrated utilization of only the first site. Fully processed SU was almost completely absent in viral particles of mutants having conserved arginine residues replaced by alanines in the first furin consensus sequence, but normal processing was observed upon mutation of the second motif. Although these mutants displayed a significant loss in infectivity as a result of reduced particle release, no correlation to processing inhibition was observed, since another mutant having normal LP/SU processing had a similar defect.
Hughes, Paul; Deng, Wenjie; Olson, Scott C; Coombs, Robert W; Chung, Michael H; Frenkel, Lisa M
2016-03-01
Accurate analysis of minor populations of drug-resistant HIV requires analysis of a sufficient number of viral templates. We assessed the effect of experimental conditions on the analysis of HIV pol 454 pyrosequences generated from plasma using (1) the "Insertion-deletion (indel) and Carry Forward Correction" (ICC) pipeline, which clusters sequence reads using a nonsubstitution approach and can correct for indels and carry forward errors, and (2) the "Primer Identification (ID)" method, which facilitates construction of a consensus sequence to correct for sequencing errors and allelic skewing. The Primer ID and ICC methods produced similar estimates of viral diversity, but differed in the number of sequence variants generated. Sequence preparation for ICC was comparably simple, but was limited by an inability to assess the number of templates analyzed and allelic skewing. The more costly Primer ID method corrected for allelic skewing and provided the number of viral templates analyzed, which revealed that amplifiable HIV templates varied across specimens and did not correlate with clinical viral load. This latter observation highlights the value of the Primer ID method, which by determining the number of templates amplified, enables more accurate assessment of minority species in the virus population, which may be relevant to prescribing effective antiretroviral therapy.
1987-10-13
after multiple passages in vivo and in vitro. J. Gen. Virol. 67, 1741- 1744. Sabin , A.B. (1985). Oral poliovirus vaccine : history of its development...IN (N NEW APPROACHES TO ATTENUATED HEPATITIS A VACCINE DEVELOPMENT: Q) CLONING AND SEQUENCING OF CELL-CULTURE ADAPTED VIRAL cDNA I ANNUAL REPORT...6ll02Bsl0 A 055 11. TITLE (Include Security Classification) New Approaches to Attenuated Hepatitis A Vaccine Development: Cloning and Sequencing of Cell
Redmond, Catherine J.; Dooley, Katharine E.; Fu, Haiqing; Gillison, Maura L.; Akagi, Keiko; Symer, David E.; Aladjem, Mirit I.
2018-01-01
Integration of human papillomavirus (HPV) genomes into cellular chromatin is common in HPV-associated cancers. Integration is random, and each site is unique depending on how and where the virus integrates. We recently showed that tandemly integrated HPV16 could result in the formation of a super-enhancer-like element that drives transcription of the viral oncogenes. Here, we characterize the chromatin landscape and genomic architecture of this integration locus to elucidate the mechanisms that promoted de novo super-enhancer formation. Using next-generation sequencing and molecular combing/fiber-FISH, we show that ~26 copies of HPV16 are integrated into an intergenic region of chromosome 2p23.2, interspersed with 25 kb of amplified, flanking cellular DNA. This interspersed, co-amplified viral-host pattern is frequent in HPV-associated cancers and here we designate it as Type III integration. An abundant viral-cellular fusion transcript encoding the viral E6/E7 oncogenes is expressed from the integration locus and the chromatin encompassing both the viral enhancer and a region in the adjacent amplified cellular sequences is strongly enriched in the super-enhancer markers H3K27ac and Brd4. Notably, the peak in the amplified cellular sequence corresponds to an epithelial-cell-type specific enhancer. Thus, HPV16 integration generated a super-enhancer-like element composed of tandem interspersed copies of the viral upstream regulatory region and a cellular enhancer, to drive high levels of oncogene expression. PMID:29364907
Cotten, Matthew; Oude Munnink, Bas; Canuti, Marta; Deijs, Martin; Watson, Simon J; Kellam, Paul; van der Hoek, Lia
2014-01-01
We have developed a full genome virus detection process that combines sensitive nucleic acid preparation optimised for virus identification in fecal material with Illumina MiSeq sequencing and a novel post-sequencing virus identification algorithm. Enriched viral nucleic acid was converted to double-stranded DNA and subjected to Illumina MiSeq sequencing. The resulting short reads were processed with a novel iterative Python algorithm SLIM for the identification of sequences with homology to known viruses. De novo assembly was then used to generate full viral genomes. The sensitivity of this process was demonstrated with a set of fecal samples from HIV-1 infected patients. A quantitative assessment of the mammalian, plant, and bacterial virus content of this compartment was generated and the deep sequencing data were sufficient to assembly 12 complete viral genomes from 6 virus families. The method detected high levels of enteropathic viruses that are normally controlled in healthy adults, but may be involved in the pathogenesis of HIV-1 infection and will provide a powerful tool for virus detection and for analyzing changes in the fecal virome associated with HIV-1 progression and pathogenesis.
Cotten, Matthew; Oude Munnink, Bas; Canuti, Marta; Deijs, Martin; Watson, Simon J.; Kellam, Paul; van der Hoek, Lia
2014-01-01
We have developed a full genome virus detection process that combines sensitive nucleic acid preparation optimised for virus identification in fecal material with Illumina MiSeq sequencing and a novel post-sequencing virus identification algorithm. Enriched viral nucleic acid was converted to double-stranded DNA and subjected to Illumina MiSeq sequencing. The resulting short reads were processed with a novel iterative Python algorithm SLIM for the identification of sequences with homology to known viruses. De novo assembly was then used to generate full viral genomes. The sensitivity of this process was demonstrated with a set of fecal samples from HIV-1 infected patients. A quantitative assessment of the mammalian, plant, and bacterial virus content of this compartment was generated and the deep sequencing data were sufficient to assembly 12 complete viral genomes from 6 virus families. The method detected high levels of enteropathic viruses that are normally controlled in healthy adults, but may be involved in the pathogenesis of HIV-1 infection and will provide a powerful tool for virus detection and for analyzing changes in the fecal virome associated with HIV-1 progression and pathogenesis. PMID:24695106
Polyomavirus BK non-coding control region rearrangements in health and disease.
Sharma, Preety M; Gupta, Gaurav; Vats, Abhay; Shapiro, Ron; Randhawa, Parmjeet S
2007-08-01
BK virus is an increasingly recognized pathogen in transplanted patients. DNA sequencing of this virus shows considerable genomic variability. To understand the clinical significance of rearrangements in the non-coding control region (NCCR) of BK virus (BKV), we report a meta-analysis of 507 sequences, including 40 sequences generated in our own laboratory, for associations between rearrangements and disease, tissue tropism, geographic origin, and viral genotype. NCCR rearrangements were less frequent in (a) asymptomatic BKV viruria compared to patients viral nephropathy (1.7% vs. 22.5%), and (b) viral genotype 1 compared to other genotypes (2.4% vs. 11.2%). Rearrangements were commoner in malignancy (78.6%), and Norwegians (45.7%), and less common in East Indians (0%), and Japanese (4.3%). A surprising number of rearranged sequences were reported from mononuclear cells of healthy subjects, whereas most plasma sequences were archetypal. This difference could not be related to potential recombinase activity in lymphocytes, as consensus recombination signal sequences could not be found in the NCCR region. NCCR rearrangements are neither required nor a sufficient condition to produce clinical disease. BKV nephropathy and hemorrhagic cystitis are not associated with any unique NCCR configuration or nucleotide sequence.
Using Signature Genes as Tools To Assess Environmental Viral Ecology and Diversity
Adriaenssens, Evelien M.
2014-01-01
Viruses (including bacteriophages) are the most abundant biological entities on the planet. As such, they are thought to have a major impact on all aspects of microbial community structure and function. Despite this critical role in ecosystem processes, the study of virus/phage diversity has lagged far behind parallel studies of the bacterial and eukaryotic kingdoms, largely due to the absence of any universal phylogenetic marker. Here we review the development and use of signature genes to investigate viral diversity, as a viable strategy for data sets of specific virus groups. Genes that have been used include those encoding structural proteins, such as portal protein, major capsid protein, and tail sheath protein, auxiliary metabolism genes, such as psbA, psbB, and phoH, and several polymerase genes. These marker genes have been used in combination with PCR-based fingerprinting and/or sequencing strategies to investigate spatial, temporal, and seasonal variations and diversity in a wide range of habitats. PMID:24837394
Wilker, Peter R.; Dinis, Jorge M.; Starrett, Gabriel; Imai, Masaki; Hatta, Masato; Nelson, Chase W.; O’Connor, David H.; Hughes, Austin L.; Neumann, Gabriele; Kawaoka, Yoshihiro; Friedrich, Thomas C.
2013-01-01
The emergence of human-transmissible H5N1 avian influenza viruses poses a major pandemic threat. H5N1 viruses are thought to be highly genetically diverse both among and within hosts, but the effects of this diversity on viral replication and transmission are poorly understood. Here we use deep sequencing to investigate the impact of within-host viral variation on adaptation and transmission of H5N1 viruses in ferrets. We show that although within-host genetic diversity in hemagglutinin (HA) increases during replication in inoculated ferrets, HA diversity is dramatically reduced upon respiratory droplet transmission, where infection is established by only 1–2 distinct HA segments from a diverse source virus population in transmitting animals. Moreover, minor HA variants present in as little as 5.9% of viruses within the source animal become dominant in ferrets infected via respiratory droplets. These findings demonstrate that selective pressures acting during influenza virus transmission among mammals impose a significant bottleneck. PMID:24149915
Viruses and miRNAs: More Friends than Foes.
Bruscella, Patrice; Bottini, Silvia; Baudesson, Camille; Pawlotsky, Jean-Michel; Feray, Cyrille; Trabucchi, Michele
2017-01-01
There is evidence that eukaryotic miRNAs (hereafter called host miRNAs) play a role in the replication and propagation of viruses. Expression or targeting of host miRNAs can be involved in cellular antiviral responses. Most times host miRNAs play a role in viral life-cycles and promote infection through complex regulatory pathways. miRNAs can also be encoded by a viral genome and be expressed in the host cell. Viral miRNAs can share common sequences with host miRNAs or have totally different sequences. They can regulate a variety of biological processes involved in viral infection, including apoptosis, evasion of the immune response, or modulation of viral life-cycle phases. Overall, virus/miRNA pathway interaction is defined by a plethora of complex mechanisms, though not yet fully understood. This article review summarizes recent advances and novel biological concepts related to the understanding of miRNA expression, control and function during viral infections. The article also discusses potential therapeutic applications of this particular host-pathogen interaction.
Viruses and miRNAs: More Friends than Foes
Bruscella, Patrice; Bottini, Silvia; Baudesson, Camille; Pawlotsky, Jean-Michel; Feray, Cyrille; Trabucchi, Michele
2017-01-01
There is evidence that eukaryotic miRNAs (hereafter called host miRNAs) play a role in the replication and propagation of viruses. Expression or targeting of host miRNAs can be involved in cellular antiviral responses. Most times host miRNAs play a role in viral life-cycles and promote infection through complex regulatory pathways. miRNAs can also be encoded by a viral genome and be expressed in the host cell. Viral miRNAs can share common sequences with host miRNAs or have totally different sequences. They can regulate a variety of biological processes involved in viral infection, including apoptosis, evasion of the immune response, or modulation of viral life-cycle phases. Overall, virus/miRNA pathway interaction is defined by a plethora of complex mechanisms, though not yet fully understood. This article review summarizes recent advances and novel biological concepts related to the understanding of miRNA expression, control and function during viral infections. The article also discusses potential therapeutic applications of this particular host–pathogen interaction. PMID:28555130
Ferns, R Bridget; Tarr, Alexander W; Hue, Stephane; Urbanowicz, Richard A; McClure, C Patrick; Gilson, Richard; Ball, Jonathan K; Nastouli, Eleni; Garson, Jeremy A; Pillay, Deenan
2016-05-01
HIV-1 infected patients who acquire HCV infection have higher rates of chronicity and liver disease progression than patients with HCV mono-infection. Understanding early events in this pathogenic process is important. We applied single genome sequencing of the E1 to NS3 regions and viral pseudotype neutralization assays to explore the consequences of viral quasispecies evolution from pre-seroconversion to chronicity in four co-infected individuals (mean follow up 566 days). We observed that one to three founder viruses were transmitted. Relatively low viral sequence diversity, possibly related to an impaired immune response, due to HIV infection was observed in three patients. However, the fourth patient, after an early purifying selection displayed increasing E2 sequence evolution, possibly related to being on suppressive antiretroviral therapy. Viral pseudotypes generated from HCV variants showed relative resistance to neutralization by autologous plasma but not to plasma collected from later time points, confirming ongoing virus escape from antibody neutralization. Copyright © 2016 Elsevier Inc. All rights reserved.
Kamboj, Atul; Hallwirth, Claus V; Alexander, Ian E; McCowage, Geoffrey B; Kramer, Belinda
2017-06-17
The analysis of viral vector genomic integration sites is an important component in assessing the safety and efficiency of patient treatment using gene therapy. Alongside this clinical application, integration site identification is a key step in the genetic mapping of viral elements in mutagenesis screens that aim to elucidate gene function. We have developed a UNIX-based vector integration site analysis pipeline (Ub-ISAP) that utilises a UNIX-based workflow for automated integration site identification and annotation of both single and paired-end sequencing reads. Reads that contain viral sequences of interest are selected and aligned to the host genome, and unique integration sites are then classified as transcription start site-proximal, intragenic or intergenic. Ub-ISAP provides a reliable and efficient pipeline to generate large datasets for assessing the safety and efficiency of integrating vectors in clinical settings, with broader applications in cancer research. Ub-ISAP is available as an open source software package at https://sourceforge.net/projects/ub-isap/ .
Chen, Sunlu; Zheng, Huizhen; Kishima, Yuji
2017-06-01
The interplay of different virus species in a host cell after infection can affect the adaptation of each virus. Endogenous viral elements, such as endogenous pararetroviruses (PRVs), have arisen from vertical inheritance of viral sequences integrated into host germline genomes. As viral genomic fossils, these sequences can thus serve as valuable paleogenomic data to study the long-term evolutionary dynamics of virus-virus interactions, but they have rarely been applied for this purpose. All extant PRVs have been considered autonomous species in their parasitic life cycle in host cells. Here, we provide evidence for multiple non-autonomous PRV species with structural defects in viral activity that have frequently infected ancient grass hosts and adapted through interplay between viruses. Our paleogenomic analyses using endogenous PRVs in grass genomes revealed that these non-autonomous PRV species have participated in interplay with autonomous PRVs in a possible commensal partnership, or, alternatively, with one another in a possible mutualistic partnership. These partnerships, which have been established by the sharing of noncoding regulatory sequences (NRSs) in intergenic regions between two partner viruses, have been further maintained and altered by the sequence homogenization of NRSs between partners. Strikingly, we found that frequent region-specific recombination, rather than mutation selection, is the main causative mechanism of NRS homogenization. Our results, obtained from ancient DNA records of viruses, suggest that adaptation of PRVs has occurred by concerted evolution of NRSs between different virus species in the same host. Our findings further imply that evaluation of within-host NRS interactions within and between populations of viral pathogens may be important.
A Metagenomic Survey of Viral Abundance and Diversity in Mosquitoes from Hubei Province
Shi, Chenyan; Liu, Yi; Hu, Xiaomin; Xiong, Jinfeng; Zhang, Bo; Yuan, Zhiming
2015-01-01
Mosquitoes as one of the most common but important vectors have the potential to transmit or acquire a lot of viruses through biting, however viral flora in mosquitoes and its impact on mosquito-borne disease transmission has not been well investigated and evaluated. In this study, the metagenomic techniquehas been successfully employed in analyzing the abundance and diversity of viral community in three mosquito samples from Hubei, China. Among 92,304 reads produced through a run with 454 GS FLX system, 39% have high similarities with viral sequences belonging to identified bacterial, fungal, animal, plant and insect viruses, and 0.02% were classed into unidentified viral sequences, demonstrating high abundance and diversity of viruses in mosquitoes. Furthermore, two novel viruses in subfamily Densovirinae and family Dicistroviridae were identified, and six torque tenosus virus1 in family Anelloviridae, three porcine parvoviruses in subfamily Parvovirinae and a Culex tritaeniorhynchus rhabdovirus in Family Rhabdoviridae were preliminarily characterized. The viral metagenomic analysis offered us a deep insight into the viral population of mosquito which played an important role in viral initiative or passive transmission and evolution during the process. PMID:26030271
Borozan, Ivan; Watt, Stuart; Ferretti, Vincent
2015-05-01
Alignment-based sequence similarity searches, while accurate for some type of sequences, can produce incorrect results when used on more divergent but functionally related sequences that have undergone the sequence rearrangements observed in many bacterial and viral genomes. Here, we propose a classification model that exploits the complementary nature of alignment-based and alignment-free similarity measures with the aim to improve the accuracy with which DNA and protein sequences are characterized. Our model classifies sequences using a combined sequence similarity score calculated by adaptively weighting the contribution of different sequence similarity measures. Weights are determined independently for each sequence in the test set and reflect the discriminatory ability of individual similarity measures in the training set. Because the similarity between some sequences is determined more accurately with one type of measure rather than another, our classifier allows different sets of weights to be associated with different sequences. Using five different similarity measures, we show that our model significantly improves the classification accuracy over the current composition- and alignment-based models, when predicting the taxonomic lineage for both short viral sequence fragments and complete viral sequences. We also show that our model can be used effectively for the classification of reads from a real metagenome dataset as well as protein sequences. All the datasets and the code used in this study are freely available at https://collaborators.oicr.on.ca/vferretti/borozan_csss/csss.html. ivan.borozan@gmail.com Supplementary data are available at Bioinformatics online. © The Author 2015. Published by Oxford University Press.
Borozan, Ivan; Watt, Stuart; Ferretti, Vincent
2015-01-01
Motivation: Alignment-based sequence similarity searches, while accurate for some type of sequences, can produce incorrect results when used on more divergent but functionally related sequences that have undergone the sequence rearrangements observed in many bacterial and viral genomes. Here, we propose a classification model that exploits the complementary nature of alignment-based and alignment-free similarity measures with the aim to improve the accuracy with which DNA and protein sequences are characterized. Results: Our model classifies sequences using a combined sequence similarity score calculated by adaptively weighting the contribution of different sequence similarity measures. Weights are determined independently for each sequence in the test set and reflect the discriminatory ability of individual similarity measures in the training set. Because the similarity between some sequences is determined more accurately with one type of measure rather than another, our classifier allows different sets of weights to be associated with different sequences. Using five different similarity measures, we show that our model significantly improves the classification accuracy over the current composition- and alignment-based models, when predicting the taxonomic lineage for both short viral sequence fragments and complete viral sequences. We also show that our model can be used effectively for the classification of reads from a real metagenome dataset as well as protein sequences. Availability and implementation: All the datasets and the code used in this study are freely available at https://collaborators.oicr.on.ca/vferretti/borozan_csss/csss.html. Contact: ivan.borozan@gmail.com Supplementary information: Supplementary data are available at Bioinformatics online. PMID:25573913
O'Flaherty, Brigid M; Li, Yan; Tao, Ying; Paden, Clinton R; Queen, Krista; Zhang, Jing; Dinwiddie, Darrell L; Gross, Stephen M; Schroth, Gary P; Tong, Suxiang
2018-06-01
Next generation sequencing (NGS) technologies have revolutionized the genomics field and are becoming more commonplace for identification of human infectious diseases. However, due to the low abundance of viral nucleic acids (NAs) in relation to host, viral identification using direct NGS technologies often lacks sufficient sensitivity. Here, we describe an approach based on two complementary enrichment strategies that significantly improves the sensitivity of NGS-based virus identification. To start, we developed two sets of DNA probes to enrich virus NAs associated with respiratory diseases. The first set of probes spans the genomes, allowing for identification of known viruses and full genome sequencing, while the second set targets regions conserved among viral families or genera, providing the ability to detect both known and potentially novel members of those virus groups. Efficiency of enrichment was assessed by NGS testing reference virus and clinical samples with known infection. We show significant improvement in viral identification using enriched NGS compared to unenriched NGS. Without enrichment, we observed an average of 0.3% targeted viral reads per sample. However, after enrichment, 50%-99% of the reads per sample were the targeted viral reads for both the reference isolates and clinical specimens using both probe sets. Importantly, dramatic improvements on genome coverage were also observed following virus-specific probe enrichment. The methods described here provide improved sensitivity for virus identification by NGS, allowing for a more comprehensive analysis of disease etiology. © 2018 O'Flaherty et al.; Published by Cold Spring Harbor Laboratory Press.
Deep Sequencing to Identify the Causes of Viral Encephalitis
Chan, Benjamin K.; Wilson, Theodore; Fischer, Kael F.; Kriesel, John D.
2014-01-01
Deep sequencing allows for a rapid, accurate characterization of microbial DNA and RNA sequences in many types of samples. Deep sequencing (also called next generation sequencing or NGS) is being developed to assist with the diagnosis of a wide variety of infectious diseases. In this study, seven frozen brain samples from deceased subjects with recent encephalitis were investigated. RNA from each sample was extracted, randomly reverse transcribed and sequenced. The sequence analysis was performed in a blinded fashion and confirmed with pathogen-specific PCR. This analysis successfully identified measles virus sequences in two brain samples and herpes simplex virus type-1 sequences in three brain samples. No pathogen was identified in the other two brain specimens. These results were concordant with pathogen-specific PCR and partially concordant with prior neuropathological examinations, demonstrating that deep sequencing can accurately identify viral infections in frozen brain tissue. PMID:24699691
Leda, Ana Rachel; Hunter, James; Oliveira, Ursula Castro; Azevedo, Inacio Junqueira; Sucupira, Maria Cecilia Araripe; Diaz, Ricardo Sobhie
2018-04-19
The presence of minority transmitted drug resistance mutations was assessed using ultra-deep sequencing and correlated with disease progression among recently HIV-1-infected individuals from Brazil. Samples at baseline during recent infection and 1 year after the establishment of the infection were analysed. Viral RNA and proviral DNA from 25 individuals were subjected to ultra-deep sequencing of the reverse transcriptase and protease regions of HIV-1. Viral strains carrying transmitted drug resistance mutations were detected in 9 out of the 25 patients, for all major antiretroviral classes, ranging from one to five mutations per patient. Ultra-deep sequencing detected strains with frequencies as low as 1.6% and only strains with frequencies >20% were detected by population plasma sequencing (three patients). Transmitted drug resistance strains with frequencies <14.8% did not persist upon established infection. The presence of transmitted drug resistance mutations was negatively correlated with the viral load and with CD4+ T cell count decay. Transmitted drug resistance mutations representing small percentages of the viral population do not persist during infection because they are negatively selected in the first year after HIV-1 seroconversion.
Application of viromics: a new approach to the understanding of viral infections in humans.
Ramamurthy, Mageshbabu; Sankar, Sathish; Kannangai, Rajesh; Nandagopal, Balaji; Sridharan, Gopalan
2017-12-01
This review is focused at exploring the strengths of modern technology driven data compiled in the areas of virus gene sequencing, virus protein structures and their implication to viral diagnosis and therapy. The information for virome analysis (viromics) is generated by the study of viral genomes (entire nucleotide sequence) and viral genes (coding for protein). Presently, the study of viral infectious diseases in terms of etiopathogenesis and development of newer therapeutics is undergoing rapid changes. Currently, viromics relies on deep sequencing, next generation sequencing (NGS) data and public domain databases like GenBank and unique virus specific databases. Two commonly used NGS platforms: Illumina and Ion Torrent, recommend maximum fragment lengths of about 300 and 400 nucleotides for analysis respectively. Direct detection of viruses in clinical samples is now evolving using these methods. Presently, there are a considerable number of good treatment options for HBV/HIV/HCV. These viruses however show development of drug resistance. The drug susceptibility regions of the genomes are sequenced and the prediction of drug resistance is now possible from 3 public domains available on the web. This has been made possible through advances in the technology with the advent of high throughput sequencing and meta-analysis through sophisticated and easy to use software and the use of high speed computers for bioinformatics. More recently NGS technology has been improved with single-molecule real-time sequencing. Here complete long reads can be obtained with less error overcoming a limitation of the NGS which is inherently prone to software anomalies that arise in the hands of personnel without adequate training. The development in understanding the viruses in terms of their genome, pathobiology, transcriptomics and molecular epidemiology constitutes viromics. It could be stated that these developments will bring about radical changes and advancement especially in the field of antiviral therapy and diagnostic virology.
An RNAi in silico approach to find an optimal shRNA cocktail against HIV-1
2010-01-01
Background HIV-1 can be inhibited by RNA interference in vitro through the expression of short hairpin RNAs (shRNAs) that target conserved genome sequences. In silico shRNA design for HIV has lacked a detailed study of virus variability constituting a possible breaking point in a clinical setting. We designed shRNAs against HIV-1 considering the variability observed in naïve and drug-resistant isolates available at public databases. Methods A Bioperl-based algorithm was developed to automatically scan multiple sequence alignments of HIV, while evaluating the possibility of identifying dominant and subdominant viral variants that could be used as efficient silencing molecules. Student t-test and Bonferroni Dunn correction test were used to assess statistical significance of our findings. Results Our in silico approach identified the most common viral variants within highly conserved genome regions, with a calculated free energy of ≥ -6.6 kcal/mol. This is crucial for strand loading to RISC complex and for a predicted silencing efficiency score, which could be used in combination for achieving over 90% silencing. Resistant and naïve isolate variability revealed that the most frequent shRNA per region targets a maximum of 85% of viral sequences. Adding more divergent sequences maintained this percentage. Specific sequence features that have been found to be related with higher silencing efficiency were hardly accomplished in conserved regions, even when lower entropy values correlated with better scores. We identified a conserved region among most HIV-1 genomes, which meets as many sequence features for efficient silencing. Conclusions HIV-1 variability is an obstacle to achieving absolute silencing using shRNAs designed against a consensus sequence, mainly because there are many functional viral variants. Our shRNA cocktail could be truly effective at silencing dominant and subdominant naïve viral variants. Additionally, resistant isolates might be targeted under specific antiretroviral selective pressure, but in both cases these should be tested exhaustively prior to clinical use. PMID:21172023
Lim, Chun Shen; Brown, Chris M
2017-01-01
Structured RNA elements may control virus replication, transcription and translation, and their distinct features are being exploited by novel antiviral strategies. Viral RNA elements continue to be discovered using combinations of experimental and computational analyses. However, the wealth of sequence data, notably from deep viral RNA sequencing, viromes, and metagenomes, necessitates computational approaches being used as an essential discovery tool. In this review, we describe practical approaches being used to discover functional RNA elements in viral genomes. In addition to success stories in new and emerging viruses, these approaches have revealed some surprising new features of well-studied viruses e.g., human immunodeficiency virus, hepatitis C virus, influenza, and dengue viruses. Some notable discoveries were facilitated by new comparative analyses of diverse viral genome alignments. Importantly, comparative approaches for finding RNA elements embedded in coding and non-coding regions differ. With the exponential growth of computer power we have progressed from stem-loop prediction on single sequences to cutting edge 3D prediction, and from command line to user friendly web interfaces. Despite these advances, many powerful, user friendly prediction tools and resources are underutilized by the virology community.
Lim, Chun Shen; Brown, Chris M.
2018-01-01
Structured RNA elements may control virus replication, transcription and translation, and their distinct features are being exploited by novel antiviral strategies. Viral RNA elements continue to be discovered using combinations of experimental and computational analyses. However, the wealth of sequence data, notably from deep viral RNA sequencing, viromes, and metagenomes, necessitates computational approaches being used as an essential discovery tool. In this review, we describe practical approaches being used to discover functional RNA elements in viral genomes. In addition to success stories in new and emerging viruses, these approaches have revealed some surprising new features of well-studied viruses e.g., human immunodeficiency virus, hepatitis C virus, influenza, and dengue viruses. Some notable discoveries were facilitated by new comparative analyses of diverse viral genome alignments. Importantly, comparative approaches for finding RNA elements embedded in coding and non-coding regions differ. With the exponential growth of computer power we have progressed from stem-loop prediction on single sequences to cutting edge 3D prediction, and from command line to user friendly web interfaces. Despite these advances, many powerful, user friendly prediction tools and resources are underutilized by the virology community. PMID:29354101
Random Amplification and Pyrosequencing for Identification of Novel Viral Genome Sequences
Hang, Jun; Forshey, Brett M.; Kochel, Tadeusz J.; Li, Tao; Solórzano, Víctor Fiestas; Halsey, Eric S.; Kuschner, Robert A.
2012-01-01
ssRNA viruses have high levels of genomic divergence, which can lead to difficulty in genomic characterization of new viruses using traditional PCR amplification and sequencing methods. In this study, random reverse transcription, anchored random PCR amplification, and high-throughput pyrosequencing were used to identify orthobunyavirus sequences from total RNA extracted from viral cultures of acute febrile illness specimens. Draft genome sequence for the orthobunyavirus L segment was assembled and sequentially extended using de novo assembly contigs from pyrosequencing reads and orthobunyavirus sequences in GenBank as guidance. Accuracy and continuous coverage were achieved by mapping all reads to the L segment draft sequence. Subsequently, RT-PCR and Sanger sequencing were used to complete the genome sequence. The complete L segment was found to be 6936 bases in length, encoding a 2248-aa putative RNA polymerase. The identified L segment was distinct from previously published South American orthobunyaviruses, sharing 63% and 54% identity at the nucleotide and amino acid level, respectively, with the complete Oropouche virus L segment and 73% and 81% identity at the nucleotide and amino acid level, respectively, with a partial Caraparu virus L segment. The result demonstrated the effectiveness of a sequence-independent amplification and next-generation sequencing approach for obtaining complete viral genomes from total nucleic acid extracts and its use in pathogen discovery. PMID:22468136
The polymorphisms of LCR, E6, and E7 of HPV-58 isolates in Yunnan, Southwest China.
Xi, Juemin; Chen, Junying; Xu, Miaoling; Yang, Hongying; Wen, Songjiao; Pan, Yue; Wang, Xiaodan; Ye, Chao; Qiu, Lijuan; Sun, Qiangming
2018-04-25
Variations in HPV LCR/E6/E7 have been shown to be associated with the viral persistence and cervical cancer development. So far, there are few reports about the polymorphisms of the HPV-58 LCR/E6/E7 sequences in Southwest China. This study aims to characterize the gene polymorphisms of the HPV-58 LCR/E6/E7 sequences in women of Southwest China, and assess the effects of variations on the immune recognition of viral E6 and E7 antigens. Twelve LCR/E6/E7 of the HPV-58 isolates were amplified and sequenced. A neighbor-joining phylogenetic tree was constructed by MEGA 7.0, followed by the secondary structure prediction of the related proteins using PSIPRED v3.3. The selection pressure acting on the HPV-58 E6 and E7 coding regions was estimated by Bayes empirical Bayes analysis of PAML 4.8. Meanwhile, the MHC class-I and II binding peptides were predicted by the ProPred-I server and ProPred server. The transcription factor binding sites in the HPV-58 LCR were analyzed using the JASPAR database. Twenty nine SNPs (20 in the LCR, 3 in the E6, 6 in the E7) were identified at 27 nucleotide sites across the HPV-58 LCR/E6/E7. From the most variable to the least variable, the nucleotide variations were LCR > E7 > E6. The combinations of all the SNPs resulted in 11 unique sequences, which were clustered into the A lineage (7 belong to A1, 2 belong to A2, and 2 belong to A3). An insertion (TGTCAGTTTCCT) was found between the nucleotide sites 7280 and 7281 in 2 variants, and a deletion (TTTAT) was found between 7429 and 7433 in 1 variant. The most common non-synonymous substitution V77A in the E7 was observed in the sequences encoding the α-helix. 63G in the E7 was determined to be the only one positively selected site in the HPV-58 E6/E7 sequences. Six non-synonymous amino acid substitutions (including S71F and K93 N in the E6, and T20I, G41R, G63S/D, and V77A in the E7) were affecting multiple putative epitopes for both CD4 + and CD8 + T-cells. In the LCR, C7265G and C7266T were the most variable sites and were the potential binding sites for the transcription factor SOX10. These results provide an insight into the intrinsic geographical relatedness and biological differences of the HPV-58 variants, and contribute to further research on the HPV-58 epidemiology, carcinogenesis, and therapeutic vaccine development.
BAsE-Seq: a method for obtaining long viral haplotypes from short sequence reads.
Hong, Lewis Z; Hong, Shuzhen; Wong, Han Teng; Aw, Pauline P K; Cheng, Yan; Wilm, Andreas; de Sessions, Paola F; Lim, Seng Gee; Nagarajan, Niranjan; Hibberd, Martin L; Quake, Stephen R; Burkholder, William F
2014-01-01
We present a method for obtaining long haplotypes, of over 3 kb in length, using a short-read sequencer, Barcode-directed Assembly for Extra-long Sequences (BAsE-Seq). BAsE-Seq relies on transposing a template-specific barcode onto random segments of the template molecule and assembling the barcoded short reads into complete haplotypes. We applied BAsE-Seq on mixed clones of hepatitis B virus and accurately identified haplotypes occurring at frequencies greater than or equal to 0.4%, with >99.9% specificity. Applying BAsE-Seq to a clinical sample, we obtained over 9,000 viral haplotypes, which provided an unprecedented view of hepatitis B virus population structure during chronic infection. BAsE-Seq is readily applicable for monitoring quasispecies evolution in viral diseases.
NASA Astrophysics Data System (ADS)
Champeimont, Raphaël; Laine, Elodie; Hu, Shuang-Wei; Penin, Francois; Carbone, Alessandra
2016-05-01
A novel computational approach of coevolution analysis allowed us to reconstruct the protein-protein interaction network of the Hepatitis C Virus (HCV) at the residue resolution. For the first time, coevolution analysis of an entire viral genome was realized, based on a limited set of protein sequences with high sequence identity within genotypes. The identified coevolving residues constitute highly relevant predictions of protein-protein interactions for further experimental identification of HCV protein complexes. The method can be used to analyse other viral genomes and to predict the associated protein interaction networks.
de Borba, Luana; Villordo, Sergio M; Iglesias, Nestor G; Filomatori, Claudia V; Gebhard, Leopoldo G; Gamarnik, Andrea V
2015-03-01
The dengue virus genome is a dynamic molecule that adopts different conformations in the infected cell. Here, using RNA folding predictions, chemical probing analysis, RNA binding assays, and functional studies, we identified new cis-acting elements present in the capsid coding sequence that facilitate cyclization of the viral RNA by hybridization with a sequence involved in a local dumbbell structure at the viral 3' untranslated region (UTR). The identified interaction differentially enhances viral replication in mosquito and mammalian cells. Copyright © 2015, American Society for Microbiology. All Rights Reserved.
Casartelli, Nicoletta; Di Matteo, Gigliola; Argentini, Claudio; Cancrini, Caterina; Bernardi, Stefania; Castelli, Guido; Scarlatti, Gabriella; Plebani, Anna; Rossi, Paolo; Doria, Margherita
2003-06-13
Evaluation of sequence evolution as well as structural defects and mutations of the human immunodeficiency virus-type 1 (HIV-1) nef gene in relation to disease progression in infected children. We examined a large number of nef alleles sequentially derived from perinatally HIV-1-infected children with different rates of disease progression: six non-progressors (NPs), four rapid progressors (RPs), and three slow progressors (SPs). Nef alleles (182 total) were isolated from patients' peripheral blood mononuclear cells (PBMCs), sequenced and analysed for their evolutionary pattern, frequency of mutations and occurrence of amino acid variations associated with different stages of disease. The evolution rate of the nef gene apparently correlated with CD4+ decline in all progression groups. Evidence for rapid viral turnover and positive selection for changes were found only in two SPs and two RPs respectively. In NPs, a higher proportion of disrupted sequences and mutations at various functional motifs were observed. Furthermore, NP-derived Nef proteins were often changed at residues localized in the folded core domain at cytotoxic T lymphocytes (CTL) epitopes (E(105), K(106), E(110), Y(132), K(164), and R(200)), while other residues outside the core domain are more often changed in RPs (A(43)) and SPs (N(173) and Y(214)). Our results suggest a link between nef gene functions and the progression rate in HIV-1-infected children. Moreover, non-progressor-associated variations in the core domain of Nef, together with the genetic analysis, suggest that nef gene evolution is shaped by an effective immune system in these patients.
Poon, Art F. Y; Kosakovsky Pond, Sergei L.; Bennett, Phil; Richman, Douglas D; Leigh Brown, Andrew J.; Frost, Simon D. W
2007-01-01
CD8+ cytotoxic T-lymphocytes (CTLs) perform a critical role in the immune control of viral infections, including those caused by human immunodeficiency virus type 1 (HIV-1) and hepatitis C virus (HCV). As a result, genetic variation at CTL epitopes is strongly influenced by host-specific selection for either escape from the immune response, or reversion due to the replicative costs of escape mutations in the absence of CTL recognition. Under strong CTL-mediated selection, codon positions within epitopes may immediately “toggle” in response to each host, such that genetic variation in the circulating virus population is shaped by rapid adaptation to immune variation in the host population. However, this hypothesis neglects the substantial genetic variation that accumulates in virus populations within hosts. Here, we evaluate this quantity for a large number of HIV-1– (n ≥ 3,000) and HCV-infected patients (n ≥ 2,600) by screening bulk RT-PCR sequences for sequencing “mixtures” (i.e., ambiguous nucleotides), which act as site-specific markers of genetic variation within each host. We find that nonsynonymous mixtures are abundant and significantly associated with codon positions under host-specific CTL selection, which should deplete within-host variation by driving the fixation of the favored variant. Using a simple model, we demonstrate that this apparently contradictory outcome can be explained by the transmission of unfavorable variants to new hosts before they are removed by selection, which occurs more frequently when selection and transmission occur on similar time scales. Consequently, the circulating virus population is shaped by the transmission rate and the disparity in selection intensities for escape or reversion as much as it is shaped by the immune diversity of the host population, with potentially serious implications for vaccine design. PMID:17397261
Tan, Yi; Hassan, Ferdaus; Schuster, Jennifer E.; Simenauer, Ari; Selvarangan, Rangaraj; Halpin, Rebecca A.; Lin, Xudong; Fedorova, Nadia; Stockwell, Timothy B.; Lam, Tommy Tsan-Yuk; Chappell, James D.; Hartert, Tina V.; Holmes, Edward C.
2015-01-01
ABSTRACT In August 2014, an outbreak of enterovirus D68 (EV-D68) occurred in North America, causing severe respiratory disease in children. Due to a lack of complete genome sequence data, there is only a limited understanding of the molecular evolution and epidemiology of EV-D68 during this outbreak, and it is uncertain whether the differing clinical manifestations of EV-D68 infection are associated with specific viral lineages. We developed a high-throughput complete genome sequencing pipeline for EV-D68 that produced a total of 59 complete genomes from respiratory samples with a 95% success rate, including 57 genomes from Kansas City, MO, collected during the 2014 outbreak. With these data in hand, we performed phylogenetic analyses of complete genome and VP1 capsid protein sequences. Notably, we observed considerable genetic diversity among EV-D68 isolates in Kansas City, manifest as phylogenetically distinct lineages, indicative of multiple introductions of this virus into the city. In addition, we identified an intersubclade recombination event within EV-D68, the first recombinant in this virus reported to date. Finally, we found no significant association between EV-D68 genetic variation, either lineages or individual mutations, and a variety of demographic and clinical variables, suggesting that host factors likely play a major role in determining disease severity. Overall, our study revealed the complex pattern of viral evolution within a single geographic locality during a single outbreak, which has implications for the design of effective intervention and prevention strategies. IMPORTANCE Until recently, EV-D68 was considered to be an uncommon human pathogen, associated with mild respiratory illness. However, in 2014 EV-D68 was responsible for more than 1,000 disease cases in North America, including severe respiratory illness in children and acute flaccid myelitis, raising concerns about its potential impact on public health. Despite the emergence of EV-D68, a lack of full-length genome sequences means that little is known about the molecular evolution of this virus within a single geographic locality during a single outbreak. Here, we doubled the number of publicly available complete genome sequences of EV-D68 by performing high-throughput next-generation sequencing, characterized the evolutionary history of this outbreak in detail, identified a recombination event, and investigated whether there was any correlation between the demographic and clinical characteristics of the patients and the viral variant that infected them. Overall, these results will help inform the design of intervention strategies for EV-D68. PMID:26656685
What factors determine the severity of hepatitis A-related acute liver failure?
Ajmera, V; Xia, G; Vaughan, G; Forbi, J C; Ganova-Raeva, L M; Khudyakov, Y; Opio, C K; Taylor, R; Restrepo, R; Munoz, S; Fontana, R J; Lee, W M
2011-07-01
The reason(s) that hepatitis A virus (HAV) infection may progress infrequently to acute liver failure are poorly understood. We examined host and viral factors in 29 consecutive adult patients with HAV-associated acute liver failure enrolled at 10 sites participating in the US ALF Study Group. Eighteen of twenty-four acute liver failure sera were PCR positive while six had no detectable virus. HAV genotype was determined using phylogenetic analysis and the full-length genome sequences of the HAV from a cute liver failure sera were compared to those from self-limited acute HAV cases selected from the CDC database. We found that rates of nucleotide substitution did not vary significantly between the liver failure and non-liver failure cases and there was no significant variation in amino acid sequences between the two groups. Four of 18 HAV isolates were sub-genotype IB, acquired from the same study site over a 3.5-year period. Sub-genotype IB was found more frequently among acute liver failure cases compared to the non-liver failure cases (chi-square test, P < 0.01). At another centre, a mother and her son presented with HAV and liver failure within 1 month of each other. Predictors of spontaneous survival included detectable serum HAV RNA, while age, gender, HAV genotype and nucleotide substitutions were not associated with outcome. The more frequent appearance of rapid viral clearance and its association with poor outcomes in acute liver failure as well as the finding of familial cases imply a possible host genetic predisposition that contributes to a fulminant course. Recurrent cases of the rare sub-genotype IB over several years at a single centre imply a community reservoir of infection and possible increased pathogenicity of certain infrequent viral genotypes. © 2010 Blackwell Publishing Ltd.
What factors determine the severity of hepatitis A-related acute liver failure?
Ajmera, V.; Xia, G.; Vaughan, G.; Forbi, J. C.; Ganova-Raeva, L. M.; Khudyakov, Y.; Opio, C. K.; Taylor, R.; Restrepo, R.; Munoz, S.; Fontana, R. J.; Lee, W. M.
2016-01-01
SUMMARY The reason(s) that hepatitis A virus (HAV) infection may progress infrequently to acute liver failure are poorly understood. We examined host and viral factors in 29 consecutive adult patients with HAV-associated acute liver failure enrolled at 10 sites participating in the US ALF Study Group. Eighteen of twenty-four acute liver failure sera were PCR positive while six had no detectable virus. HAV genotype was determined using phylogenetic analysis and the full-length genome sequences of the HAV from a cute liver failure sera were compared to those from self-limited acute HAV cases selected from the CDC database. We found that rates of nucleotide substitution did not vary significantly between the liver failure and non-liver failure cases and there was no significant variation in amino acid sequences between the two groups. Four of 18 HAV isolates were subgenotype IB, acquired from the same study site over a 3.5-year period. Sub-genotype IB was found more frequently among acute liver failure cases compared to the non-liver failure cases (chi-square test, P < 0.01). At another centre, a mother and her son presented with HAV and liver failure within 1 month of each other. Predictors of spontaneous survival included detectable serum HAV RNA, while age, gender, HAV genotype and nucleotide substitutions were not associated with outcome. The more frequent appearance of rapid viral clearance and its association with poor outcomes in acute liver failure as well as the finding of familial cases imply a possible host genetic predisposition that contributes to a fulminant course. Recurrent cases of the rare subgenotype IB over several years at a single centre imply a community reservoir of infection and possible increased pathogenicity of certain infrequent viral genotypes. PMID:21143345
Combelas, Nicolas; Holmblat, Barbara; Joffret, Marie-Line; Colbère-Garapin, Florence; Delpeyroux, Francis
2011-01-01
Genetic recombination in RNA viruses was discovered many years ago for poliovirus (PV), an enterovirus of the Picornaviridae family, and studied using PV or other picornaviruses as models. Recently, recombination was shown to be a general phenomenon between different types of enteroviruses of the same species. In particular, the interest for this mechanism of genetic plasticity was renewed with the emergence of pathogenic recombinant circulating vaccine-derived polioviruses (cVDPVs), which were implicated in poliomyelitis outbreaks in several regions of the world with insufficient vaccination coverage. Most of these cVDPVs had mosaic genomes constituted of mutated poliovaccine capsid sequences and part or all of the non-structural sequences from other human enteroviruses of species C (HEV-C), in particular coxsackie A viruses. A study in Madagascar showed that recombinant cVDPVs had been co-circulating in a small population of children with many different HEV-C types. This viral ecosystem showed a surprising and extensive biodiversity associated to several types and recombinant genotypes, indicating that intertypic genetic recombination was not only a mechanism of evolution for HEV-C, but an usual mode of genetic plasticity shaping viral diversity. Results suggested that recombination may be, in conjunction with mutations, implicated in the phenotypic diversity of enterovirus strains and in the emergence of new pathogenic strains. Nevertheless, little is known about the rules and mechanisms which govern genetic exchanges between HEV-C types, as well as about the importance of intertypic recombination in generating phenotypic variation. This review summarizes our current knowledge of the mechanisms of evolution of PV, in particular recombination events leading to the emergence of recombinant cVDPVs. PMID:21994791
Combelas, Nicolas; Holmblat, Barbara; Joffret, Marie-Line; Colbère-Garapin, Florence; Delpeyroux, Francis
2011-08-01
Genetic recombination in RNA viruses was discovered many years ago for poliovirus (PV), an enterovirus of the Picornaviridae family, and studied using PV or other picornaviruses as models. Recently, recombination was shown to be a general phenomenon between different types of enteroviruses of the same species. In particular, the interest for this mechanism of genetic plasticity was renewed with the emergence of pathogenic recombinant circulating vaccine-derived polioviruses (cVDPVs), which were implicated in poliomyelitis outbreaks in several regions of the world with insufficient vaccination coverage. Most of these cVDPVs had mosaic genomes constituted of mutated poliovaccine capsid sequences and part or all of the non-structural sequences from other human enteroviruses of species C (HEV-C), in particular coxsackie A viruses. A study in Madagascar showed that recombinant cVDPVs had been co-circulating in a small population of children with many different HEV-C types. This viral ecosystem showed a surprising and extensive biodiversity associated to several types and recombinant genotypes, indicating that intertypic genetic recombination was not only a mechanism of evolution for HEV-C, but an usual mode of genetic plasticity shaping viral diversity. Results suggested that recombination may be, in conjunction with mutations, implicated in the phenotypic diversity of enterovirus strains and in the emergence of new pathogenic strains. Nevertheless, little is known about the rules and mechanisms which govern genetic exchanges between HEV-C types, as well as about the importance of intertypic recombination in generating phenotypic variation. This review summarizes our current knowledge of the mechanisms of evolution of PV, in particular recombination events leading to the emergence of recombinant cVDPVs.
Russo, Alice G; Eden, John-Sebastian; Enosi Tuipulotu, Daniel; Shi, Mang; Selechnik, Daniel; Shine, Richard; Rollins, Lee Ann; Holmes, Edward C; White, Peter A
2018-06-13
Cane toads are a notorious invasive species, inhabiting over 1.2 million km 2 of Australia and threatening native biodiversity. Release of pathogenic cane toad viruses is one possible biocontrol strategy yet is currently hindered by the poorly-described cane toad virome. Metatranscriptomic analysis of 16 cane toad livers revealed the presence of a novel and full-length picornavirus, Rhimavirus A (RhiV-A), a member of a reptile and amphibian specific-cluster of the Picornaviridae basal to the Kobuvirus -like group. In the combined liver transcriptome, we also identified a complete genome sequence of a distinct epsilonretrovirus, R. marina endogenous retrovirus (RMERV). The recently sequenced cane toad genome contains eight complete RMERV proviruses, as well as 21 additional truncated insertions. The oldest full length RMERV provirus was estimated to have inserted 1.9 MYA. To screen for these viral sequences in additional toads, we analysed publicly available transcriptomes from six diverse Australian locations. RhiV-A transcripts were identified in toads sampled from three locations across 1,000 km of Australia, stretching to the current Western Australia (WA) invasion front, whilst RMERV transcripts were observed at all six sites. Lastly, we scanned the cane toad genome for non-retroviral endogenous viral elements, finding three sequences related to small DNA viruses in the family Circoviridae This shows ancestral circoviral infection with subsequent genomic integration. The identification of these current and past viral infections enriches our knowledge of the cane toad virome, an understanding of which will facilitate future work on infection and disease in this important invasive species. Importance Cane toads are poisonous amphibians which were introduced to Australia in 1935 for insect control. Since then, their population has increased dramatically, and they now threat many native Australian species. One potential method to control the population is to release a cane toad virus with high mortality, yet few cane toad viruses have been characterised. This study samples cane toads from different Australian locations and uses an RNA sequencing and computational approach to find new viruses. We report novel complete picornavirus and retrovirus sequences which were genetically similar to viruses infecting frogs, reptiles and fish. Using data generated in other studies, we show that these viral sequences are present in cane toads from distinct Australian locations. Three sequences related to circoviruses were also found in the toad genome. The identification of new viral sequences will aid future studies which investigate their prevalence and potential as agents for biocontrol. Copyright © 2018 American Society for Microbiology.
Yang, Teng-Chieh; Maluf, Nasib Karl
2012-02-21
Human adenovirus (Ad) is an icosahedral, double-stranded DNA virus. Viral DNA packaging refers to the process whereby the viral genome becomes encapsulated by the viral particle. In Ad, activation of the DNA packaging reaction requires at least three viral components: the IVa2 and L4-22K proteins and a section of DNA within the viral genome, called the packaging sequence. Previous studies have shown that the IVa2 and L4-22K proteins specifically bind to conserved elements within the packaging sequence and that these interactions are absolutely required for the observation of DNA packaging. However, the equilibrium mechanism for assembly of IVa2 and L4-22K onto the packaging sequence has not been determined. Here we characterize the assembly of the IVa2 and L4-22K proteins onto truncated packaging sequence DNA by analytical sedimentation velocity and equilibrium methods. At limiting concentrations of L4-22K, we observe a species with two IVa2 monomers and one L4-22K monomer bound to the DNA. In this species, the L4-22K monomer is promoting positive cooperative interactions between the two bound IVa2 monomers. As L4-22K levels are increased, we observe a species with one IVa2 monomer and three L4-22K monomers bound to the DNA. To explain this result, we propose a model in which L4-22K self-assembly on the DNA competes with IVa2 for positive heterocooperative interactions, destabilizing binding of the second IVa2 monomer. Thus, we propose that L4-22K levels control the extent of cooperativity observed between adjacently bound IVa2 monomers. We have also determined the hydrodynamic properties of all observed stoichiometric species; we observe that species with three L4-22K monomers bound have more extended conformations than species with a single L4-22K bound. We suggest this might reflect a molecular switch that controls insertion of the viral DNA into the capsid.
Castro-Prieto, Aines; Wachter, Bettina; Melzheimer, Joerg; Thalwitzer, Susanne; Hofer, Heribert; Sommer, Simone
2012-01-01
Background Genes under selection provide ecologically important information useful for conservation issues. Major histocompatibility complex (MHC) class I and II genes are essential for the immune defence against pathogens from intracellular (e.g. viruses) and extracellular (e.g. helminths) origins, respectively. Serosurvey studies in Namibian cheetahs (Acinonyx juabuts) revealed higher exposure to viral pathogens in individuals from north-central than east-central regions. Here we examined whether the observed differences in exposure to viruses influence the patterns of genetic variation and differentiation at MHC loci in 88 free-ranging Namibian cheetahs. Methodology/Principal Findings Genetic variation at MHC I and II loci was assessed through single-stranded conformation polymorphism (SSCP) analysis and sequencing. While the overall allelic diversity did not differ, we observed a high genetic differentiation at MHC class I loci between cheetahs from north-central and east-central Namibia. No such differentiation in MHC class II and neutral markers were found. Conclusions/Significance Our results suggest that MHC class I variation mirrors the variation in selection pressure imposed by viruses in free-ranging cheetahs across Namibian farmland. This is of high significance for future management and conservation programs of this species. PMID:23145096
Castro-Prieto, Aines; Wachter, Bettina; Melzheimer, Joerg; Thalwitzer, Susanne; Hofer, Heribert; Sommer, Simone
2012-01-01
Genes under selection provide ecologically important information useful for conservation issues. Major histocompatibility complex (MHC) class I and II genes are essential for the immune defence against pathogens from intracellular (e.g. viruses) and extracellular (e.g. helminths) origins, respectively. Serosurvey studies in Namibian cheetahs (Acinonyx juabuts) revealed higher exposure to viral pathogens in individuals from north-central than east-central regions. Here we examined whether the observed differences in exposure to viruses influence the patterns of genetic variation and differentiation at MHC loci in 88 free-ranging Namibian cheetahs. Genetic variation at MHC I and II loci was assessed through single-stranded conformation polymorphism (SSCP) analysis and sequencing. While the overall allelic diversity did not differ, we observed a high genetic differentiation at MHC class I loci between cheetahs from north-central and east-central Namibia. No such differentiation in MHC class II and neutral markers were found. Our results suggest that MHC class I variation mirrors the variation in selection pressure imposed by viruses in free-ranging cheetahs across Namibian farmland. This is of high significance for future management and conservation programs of this species.
CRISPR-Cas systems exploit viral DNA injection to establish and maintain adaptive immunity.
Modell, Joshua W; Jiang, Wenyan; Marraffini, Luciano A
2017-04-06
Clustered regularly interspaced short palindromic repeats (CRISPR)-Cas systems provide protection against viral and plasmid infection by capturing short DNA sequences from these invaders and integrating them into the CRISPR locus of the prokaryotic host. These sequences, known as spacers, are transcribed into short CRISPR RNA guides that specify the cleavage site of Cas nucleases in the genome of the invader. It is not known when spacer sequences are acquired during viral infection. Here, to investigate this, we tracked spacer acquisition in Staphylococcus aureus cells harbouring a type II CRISPR-Cas9 system after infection with the staphylococcal bacteriophage ϕ12. We found that new spacers were acquired immediately after infection preferentially from the cos site, the viral free DNA end that is first injected into the cell. Analysis of spacer acquisition after infection with mutant phages demonstrated that most spacers are acquired during DNA injection, but not during other stages of the viral cycle that produce free DNA ends, such as DNA replication or packaging. Finally, we showed that spacers acquired from early-injected genomic regions, which direct Cas9 cleavage of the viral DNA immediately after infection, provide better immunity than spacers acquired from late-injected regions. Our results reveal that CRISPR-Cas systems exploit the phage life cycle to generate a pattern of spacer acquisition that ensures a successful CRISPR immune response.
The Vaginal Eukaryotic DNA Virome and Preterm Birth.
Wylie, Kristine M; Wylie, Todd N; Cahill, Alison G; Macones, George A; Tuuli, Methodius G; Stout, Molly J
2018-05-05
Despite decades of attempts to link infectious agents to preterm birth, an exact causative microbe or community of microbes remains elusive. Culture-independent sequencing of vaginal bacterial communities demonstrates community characteristics are associated with preterm birth, although none are specific enough to apply clinically. Viruses are important components of the vaginal microbiome and have dynamic relationships with vaginal bacterial communities. We hypothesized that vaginal eukaryotic DNA viral communities (the "vaginal virome") either alone or in the context of bacterial communities are associated with preterm birth. The objective of this study was to use high-throughput sequencing to examine the vaginal eukaryotic DNA virome in a cohort of pregnant women and examine associations between vaginal community characteristics and preterm birth. This is a nested case-control study within a prospective cohort study of women with singleton pregnancies, not on supplemental progesterone, and without cervical cerclage in situ. Serial mid-vaginal swabs were obtained at routine prenatal visits. DNA was extracted, bacterial communities were characterized by 16S rRNA gene sequencing, and eukaryotic viral communities were characterized by enrichment of viral nucleic acid with the ViroCap targeted sequence capture panel followed by nucleic acid sequencing. Viral communities were analyzed according to presence/absence of viruses, diversity, dynamics over time, and association with bacterial community data obtained from the same specimens. Sixty subjects contributed 128 vaginal swabs longitudinally across pregnancy. Twenty-four patients delivered preterm. Participants were predominantly African-American (65%). Six families of eukaryotic DNA viruses were detected in the vaginal samples. At least 1 virus was detected in 80% of women. No specific virus or group of viruses was associated with preterm delivery. Higher viral richness was significantly associated with preterm delivery in the full group and in the African American subgroup (P=0.0005 and P=0.0003, respectively). Having both high bacterial diversity and high viral diversity in the first trimester was associated with the highest risk for preterm birth. Higher vaginal viral diversity is associated with preterm birth. Changes in vaginal virome diversity appear similar to changes in the vaginal bacterial microbiome over pregnancy, suggesting that underlying physiology of pregnancy may regulate both bacterial and viral communities. Copyright © 2018 Elsevier Inc. All rights reserved.
May, Jared; Johnson, Philip; Saleem, Huma
2017-01-01
ABSTRACT To maximize the coding potential of viral genomes, internal ribosome entry sites (IRES) can be used to bypass the traditional requirement of a 5′ cap and some/all of the associated translation initiation factors. Although viral IRES typically contain higher-order RNA structure, an unstructured sequence of about 84 nucleotides (nt) immediately upstream of the Turnip crinkle virus (TCV) coat protein (CP) open reading frame (ORF) has been found to promote internal expression of the CP from the genomic RNA (gRNA) both in vitro and in vivo. An absence of extensive RNA structure was predicted using RNA folding algorithms and confirmed by selective 2′-hydroxyl acylation analyzed by primer extension (SHAPE) RNA structure probing. Analysis of the IRES region in vitro by use of both the TCV gRNA and reporter constructs did not reveal any sequence-specific elements but rather suggested that an overall lack of structure was an important feature for IRES activity. The CP IRES is A-rich, independent of orientation, and strongly conserved among viruses in the same genus. The IRES was dependent on eIF4G, but not eIF4E, for activity. Low levels of CP accumulated in vivo in the absence of detectable TCV subgenomic RNAs, strongly suggesting that the IRES was active in the gRNA in vivo. Since the TCV CP also serves as the viral silencing suppressor, early translation of the CP from the viral gRNA is likely important for countering host defenses. Cellular mRNA IRES also lack extensive RNA structures or sequence conservation, suggesting that this viral IRES and cellular IRES may have similar strategies for internal translation initiation. IMPORTANCE Cap-independent translation is a common strategy among positive-sense, single-stranded RNA viruses for bypassing the host cell requirement of a 5′ cap structure. Viral IRES, in general, contain extensive secondary structure that is critical for activity. In contrast, we demonstrate that a region of viral RNA devoid of extensive secondary structure has IRES activity and produces low levels of viral coat protein in vitro and in vivo. Our findings may be applicable to cellular mRNA IRES that also have little or no sequences/structures in common. PMID:28179526
Virus versus Host Plant MicroRNAs: Who Determines the Outcome of the Interaction?
Maghuly, Fatemeh; Ramkat, Rose C.; Laimer, Margit
2014-01-01
Considering the importance of microRNAs (miRNAs) in the regulation of essential processes in plant pathogen interactions, it is not surprising that, while plant miRNA sequences counteract viral attack via antiviral RNA silencing, viruses in turn have developed antihost defense mechanisms blocking these RNA silencing pathways and establish a counter-defense. In the current study, computational and stem-loop Reverse Transcription – Polymerase Chain Reaction (RT-PCR) approaches were employed to a) predict and validate virus encoded mature miRNAs (miRs) in 39 DNA-A sequences of the bipartite genomes of African cassava mosaic virus (ACMV) and East African cassava mosaic virus-Uganda (EACMV-UG) isolates, b) determine whether virus encoded miRs/miRs* generated from the 5′/3′ harpin arms have the capacity to bind to genomic sequences of the host plants Jatropha or cassava and c) investigate whether plant encoded miR/miR* sequences have the potential to bind to the viral genomes. Different viral pre-miRNA hairpin sequences and viral miR/miR* length variants occurring as isomiRs were predicted in both viruses. These miRNAs were located in three Open Reading Frames (ORFs) and in the Intergenic Region (IR). Moreover, various target genes for miRNAs from both viruses were predicted and annotated in the host plant genomes indicating that they are involved in biotic response, metabolic pathways and transcription factors. Plant miRs/miRs* from conserved and highly expressed families were identified, which were shown to have potential targets in the genome of both begomoviruses, representing potential plant miRNAs mediating antiviral defense. This is the first assessment of predicted viral miRs/miRs* of ACMV and EACMV-UG and host plant miRNAs, providing a reference point for miRNA identification in pathogens and their hosts. These findings will improve the understanding of host- pathogen interaction pathways and the function of viral miRNAs in Euphorbiaceous crop plants. PMID:24896088
Using hidden Markov models and observed evolution to annotate viral genomes.
McCauley, Stephen; Hein, Jotun
2006-06-01
ssRNA (single stranded) viral genomes are generally constrained in length and utilize overlapping reading frames to maximally exploit the coding potential within the genome length restrictions. This overlapping coding phenomenon leads to complex evolutionary constraints operating on the genome. In regions which code for more than one protein, silent mutations in one reading frame generally have a protein coding effect in another. To maximize coding flexibility in all reading frames, overlapping regions are often compositionally biased towards amino acids which are 6-fold degenerate with respect to the 64 codon alphabet. Previous methodologies have used this fact in an ad hoc manner to look for overlapping genes by motif matching. In this paper differentiated nucleotide compositional patterns in overlapping regions are incorporated into a probabilistic hidden Markov model (HMM) framework which is used to annotate ssRNA viral genomes. This work focuses on single sequence annotation and applies an HMM framework to ssRNA viral annotation. A description of how the HMM is parameterized, whilst annotating within a missing data framework is given. A Phylogenetic HMM (Phylo-HMM) extension, as applied to 14 aligned HIV2 sequences is also presented. This evolutionary extension serves as an illustration of the potential of the Phylo-HMM framework for ssRNA viral genomic annotation. The single sequence annotation procedure (SSA) is applied to 14 different strains of the HIV2 virus. Further results on alternative ssRNA viral genomes are presented to illustrate more generally the performance of the method. The results of the SSA method are encouraging however there is still room for improvement, and since there is overwhelming evidence to indicate that comparative methods can improve coding sequence (CDS) annotation, the SSA method is extended to a Phylo-HMM to incorporate evolutionary information. The Phylo-HMM extension is applied to the same set of 14 HIV2 sequences which are pre-aligned. The performance improvement that results from including the evolutionary information in the analysis is illustrated.
DOE Office of Scientific and Technical Information (OSTI.GOV)
Weiss, S.B.
Our laboratory has explored the use of short DNA oligomers as targets for activated polycyclic aromatic hydrocarbons, such as benzo(a)pyrene diol epoxide (BPDE), in order to detect alterations in DNA sequence arrangement. In this model system, oligomers alkylated with (+)-BPDE are ligated into M13 viral DNA and used to transfect Escherichia coli. These cells are plated on agar, incubated at 37/sup 0/C, progeny viral clones are selected, amplified, and the viral DNAs isolated are sequenced at the site of oligomer insertion. We have devised a procedure for the preparation of unique duplex DNA oligomers such that the site of oligomermore » alkylation is specific for a single deoxynucleotide species in the two DNA strands. The procedure for oligomer assembly also allows us to vary the position of the alkylated residue in each of the two strands. Using our model system, the results obtained over the past year can be summarized as follows. When nonalkylated oligomer constructs are ligated into M13 viral DNA and used to transfect E. coli, no modifications in DNA sequence arrangement are detected in progeny viral DNAs. On the other hand, with oligomer constructs containing BP-adducts two major types of modifications in DNA sequence arrangement were observed: (1) large deletions, and (2) nonhomologous (illegitimate) recombinants. Both of these DNA modifications result in the complete removal of the oligomer insert. Transfection of E. coli that are recA/sup -/ does not alter these DNA modifications, therefore, it appears that the deletions and recombinants induced by the alkylated inserts are not under control of the RecA gene. As the distance between the alkylated residues in the duplex strands is increased, the number of recombinant events detected is reduced. In addition to the above types of DNA modifications, restoration of the original nucleotide sequence in the alkylated construct was also observed in progeny viral DNAs. 7 refs., 6 figs., 2 tabs.« less
Ansari, Israr-ul H.; Allen, Todd; Berical, Andrew; Stock, Peter G.; Barin, Burc; Striker, Rob
2013-01-01
Hepatitis C virus (HCV) replication is limited by cyclophilin inhibitors but it remains unclear how viral genetic variations influence susceptibility to cyclosporine (cyclosporine A, CsA), a cyclophilin inhibitor. In this study HCV from liver transplant patients was sequenced before and after CsA exposure. Phenotypic analysis of NS5A sequence was performed by using HCV sub genomic replicon to determine CsA susceptibility. The data indicates an atypical proline at position 328 in NS5A causes increases CsA sensitivity both in the context of genotype 1a and 1b residues. Point mutants mimicking other naturally occurring residues at this position also increased (Ala) or decreased (Arg) replicon sensitivity to CsA relative to the typical threonine (genotype 1a) or serine (genotype 1b) at this position. This work has implications for treatment of HCV by cyclophilin inhibitors. PMID:23290631
Ip, Hon S.; Wiley, Michael R.; Long, Renee; Gustavo, Palacios; Shearn-Bochsler, Valerie; Whitehouse, Chris A.
2014-01-01
Advances in massively parallel DNA sequencing platforms, commonly termed next-generation sequencing (NGS) technologies, have greatly reduced time, labor, and cost associated with DNA sequencing. Thus, NGS has become a routine tool for new viral pathogen discovery and will likely become the standard for routine laboratory diagnostics of infectious diseases in the near future. This study demonstrated the application of NGS for the rapid identification and characterization of a virus isolated from the brain of an endangered Mississippi sandhill crane. This bird was part of a population restoration effort and was found in an emaciated state several days after Hurricane Isaac passed over the refuge in Mississippi in 2012. Post-mortem examination had identified trichostrongyliasis as the possible cause of death, but because a virus with morphology consistent with a togavirus was isolated from the brain of the bird, an arboviral etiology was strongly suspected. Because individual molecular assays for several known arboviruses were negative, unbiased NGS by Illumina MiSeq was used to definitively identify and characterize the causative viral agent. Whole genome sequencing and phylogenetic analysis revealed the viral isolate to be the Highlands J virus, a known avian pathogen. This study demonstrates the use of unbiased NGS for the rapid detection and characterization of an unidentified viral pathogen and the application of this technology to wildlife disease diagnostics and conservation medicine.
Genetic diversity and evolutionary dynamics of Ebola virus in Sierra Leone.
Tong, Yi-Gang; Shi, Wei-Feng; Liu, Di; Qian, Jun; Liang, Long; Bo, Xiao-Chen; Liu, Jun; Ren, Hong-Guang; Fan, Hang; Ni, Ming; Sun, Yang; Jin, Yuan; Teng, Yue; Li, Zhen; Kargbo, David; Dafae, Foday; Kanu, Alex; Chen, Cheng-Chao; Lan, Zhi-Heng; Jiang, Hui; Luo, Yang; Lu, Hui-Jun; Zhang, Xiao-Guang; Yang, Fan; Hu, Yi; Cao, Yu-Xi; Deng, Yong-Qiang; Su, Hao-Xiang; Sun, Yu; Liu, Wen-Sen; Wang, Zhuang; Wang, Cheng-Yu; Bu, Zhao-Yang; Guo, Zhen-Dong; Zhang, Liu-Bo; Nie, Wei-Min; Bai, Chang-Qing; Sun, Chun-Hua; An, Xiao-Ping; Xu, Pei-Song; Zhang, Xiang-Li-Lan; Huang, Yong; Mi, Zhi-Qiang; Yu, Dong; Yao, Hong-Wu; Feng, Yong; Xia, Zhi-Ping; Zheng, Xue-Xing; Yang, Song-Tao; Lu, Bing; Jiang, Jia-Fu; Kargbo, Brima; He, Fu-Chu; Gao, George F; Cao, Wu-Chun
2015-08-06
A novel Ebola virus (EBOV) first identified in March 2014 has infected more than 25,000 people in West Africa, resulting in more than 10,000 deaths. Preliminary analyses of genome sequences of 81 EBOV collected from March to June 2014 from Guinea and Sierra Leone suggest that the 2014 EBOV originated from an independent transmission event from its natural reservoir followed by sustained human-to-human infections. It has been reported that the EBOV genome variation might have an effect on the efficacy of sequence-based virus detection and candidate therapeutics. However, only limited viral information has been available since July 2014, when the outbreak entered a rapid growth phase. Here we describe 175 full-length EBOV genome sequences from five severely stricken districts in Sierra Leone from 28 September to 11 November 2014. We found that the 2014 EBOV has become more phylogenetically and genetically diverse from July to November 2014, characterized by the emergence of multiple novel lineages. The substitution rate for the 2014 EBOV was estimated to be 1.23 × 10(-3) substitutions per site per year (95% highest posterior density interval, 1.04 × 10(-3) to 1.41 × 10(-3) substitutions per site per year), approximating to that observed between previous EBOV outbreaks. The sharp increase in genetic diversity of the 2014 EBOV warrants extensive EBOV surveillance in Sierra Leone, Guinea and Liberia to better understand the viral evolution and transmission dynamics of the ongoing outbreak. These data will facilitate the international efforts to develop vaccines and therapeutics.
Dynamics of Preferential Substrate Recognition in HIV-1 Protease: Redefining the Substrate Envelope
Özen, Ayşegül; Haliloğlu, Türkan; Schiffer, Celia A.
2011-01-01
HIV-1 protease (PR) permits viral maturation by processing the Gag and Gag-Pro-Pol polyproteins. Though HIV-1 PR inhibitors (PIs) are used in combination antiviral therapy, the emergence of drug resistance has limited their efficacy. The rapid evolution of HIV-1 necessitates the consideration of drug resistance in novel drug-design strategies. Drug-resistant HIV-1 PR variants, while no longer efficiently inhibited, continue to efficiently hydrolyze the natural viral substrates. Though highly diverse in sequence, the HIV-1 PR substrates bind in a conserved three-dimensional shape we defined as the “substrate envelope”. We previously showed that resistance mutations arise where PIs protrude beyond the substrate envelope, as these regions are crucial for drug binding but not for substrate recognition. Here, we extend this model by considering the role of protein dynamics in the interaction of HIV-1 PR with its substrates. Seven molecular dynamics simulations of PR-substrate complexes were performed to estimate the conformational flexibility of substrates in their complexes. Interdependency of the substrate-protease interactions may compensate for the variations in cleavage-site sequences, and explain how a diverse set of sequences can be recognized as substrates by the same enzyme. This diversity may be essential for regulating sequential processing of substrates. We also define a dynamic substrate envelope as a more accurate representation of PR-substrate interactions. This dynamic substrate envelope, described by a probability distribution function, is a powerful tool for drug design efforts targeting ensembles of resistant HIV-1 PR variants with the aim of developing drugs that are less susceptible to resistance. PMID:21762811
Croville, Guillaume; Soubies, Sébastien Mathieu; Barbieri, Johanna; Klopp, Christophe; Mariette, Jérôme; Bouchez, Olivier; Camus-Bouclainville, Christelle
2012-01-01
Adaptation of avian influenza viruses (AIVs) from waterfowl to domestic poultry with a deletion in the neuraminidase (NA) stalk has already been reported. The way the virus undergoes this evolution, however, is thus far unclear. We address this question using pyrosequencing of duck and turkey low-pathogenicity AIVs. Ducks and turkeys were sampled at the very beginning of an H6N1 outbreak, and turkeys were swabbed again 8 days later. NA stalk deletions were evidenced in turkeys by Sanger sequencing. To further investigate viral evolution, 454 pyrosequencing was performed: for each set of samples, up to 41,500 reads of ca. 400 bp were generated and aligned. Genetic polymorphisms between duck and turkey viruses were tracked on the whole genome. NA deletion was detected in less than 2% of reads in duck feces but in 100% of reads in turkey tracheal specimens collected at the same time. Further variations in length were observed in NA from turkeys 8 days later. Similarly, minority mutants emerged on the hemagglutinin (HA) gene, with substitutions mostly in the receptor binding site on the globular head. These critical changes suggest a strong evolutionary pressure in turkeys. The increasing performances of next-generation sequencing technologies should enable us to monitor the genomic diversity of avian influenza viruses and early emergence of potentially pathogenic variants within bird flocks. The present study, based on 454 pyrosequencing, suggests that NA deletion, an example of AIV adaptation from waterfowl to domestic poultry, occurs by selection rather than de novo emergence of viral mutants. PMID:22718944
Phylogeography Takes a Relaxed Random Walk in Continuous Space and Time
Lemey, Philippe; Rambaut, Andrew; Welch, John J.; Suchard, Marc A.
2010-01-01
Research aimed at understanding the geographic context of evolutionary histories is burgeoning across biological disciplines. Recent endeavors attempt to interpret contemporaneous genetic variation in the light of increasingly detailed geographical and environmental observations. Such interest has promoted the development of phylogeographic inference techniques that explicitly aim to integrate such heterogeneous data. One promising development involves reconstructing phylogeographic history on a continuous landscape. Here, we present a Bayesian statistical approach to infer continuous phylogeographic diffusion using random walk models while simultaneously reconstructing the evolutionary history in time from molecular sequence data. Moreover, by accommodating branch-specific variation in dispersal rates, we relax the most restrictive assumption of the standard Brownian diffusion process and demonstrate increased statistical efficiency in spatial reconstructions of overdispersed random walks by analyzing both simulated and real viral genetic data. We further illustrate how drawing inference about summary statistics from a fully specified stochastic process over both sequence evolution and spatial movement reveals important characteristics of a rabies epidemic. Together with recent advances in discrete phylogeographic inference, the continuous model developments furnish a flexible statistical framework for biogeographical reconstructions that is easily expanded upon to accommodate various landscape genetic features. PMID:20203288
Current Advances on Virus Discovery and Diagnostic Role of Viral Metagenomics in Aquatic Organisms
Munang'andu, Hetron M.; Mugimba, Kizito K.; Byarugaba, Denis K.; Mutoloki, Stephen; Evensen, Øystein
2017-01-01
The global expansion of the aquaculture industry has brought with it a corresponding increase of novel viruses infecting different aquatic organisms. These emerging viral pathogens have proved to be a challenge to the use of traditional cell-cultures and immunoassays for identification of new viruses especially in situations where the novel viruses are unculturable and no antibodies exist for their identification. Viral metagenomics has the potential to identify novel viruses without prior knowledge of their genomic sequence data and may provide a solution for the study of unculturable viruses. This review provides a synopsis on the contribution of viral metagenomics to the discovery of viruses infecting different aquatic organisms as well as its potential role in viral diagnostics. High throughput Next Generation sequencing (NGS) and library construction used in metagenomic projects have simplified the task of generating complete viral genomes unlike the challenge faced in traditional methods that use multiple primers targeted at different segments and VPs to generate the entire genome of a novel virus. In terms of diagnostics, studies carried out this far show that viral metagenomics has the potential to serve as a multifaceted tool able to study and identify etiological agents of single infections, co-infections, tissue tropism, profiling viral infections of different aquatic organisms, epidemiological monitoring of disease prevalence, evolutionary phylogenetic analyses, and the study of genomic diversity in quasispecies viruses. With sequencing technologies and bioinformatics analytical tools becoming cheaper and easier, we anticipate that metagenomics will soon become a routine tool for the discovery, study, and identification of novel pathogens including viruses to enable timely disease control for emerging diseases in aquaculture. PMID:28382024
2010-01-01
Background BamHI-A rightward frame-1 (BARF1) is a carcinoma-specific Epstein-Barr virus (EBV) encoded oncogene. Here we describe the BARF1 sequence diversity in nasopharyngeal carcinoma (NPC), other EBV-related diseases and Indonesian healthy EBV carriers in relation to EBV genotype, viral load and serology markers. Nasopharyngeal brushings from 56 NPC cases, blood or tissue from 15 other EBV-related disorders, spontaneous B cell lines (LCL) from 5 Indonesian healthy individuals and several prototype EBV isolates were analysed by PCR-direct sequencing. Results Most NPC isolates revealed specific BARF1 nucleotide changes compared to prototype B95-8 virus. At the protein level these mutations resulted in 3 main substitutions (V29A, W72G, H130R), which are not considered to cause gross tertiary structure alterations in the hexameric BARF1 protein. At least one amino acid conversion was detected in 80.3% of NPC samples compared to 33.3% of non-NPC samples (p < 0.001) and 40.0% of healthy LCLs (p = 0.074). NPC isolates also showed more frequent codon mutation than non-NPC samples. EBV strain typing revealed most isolates as EBV type 1. The viral load of either NPC or non-NPC samples was high, but only in non- NPC group it related to a particular BARF1 variant. Serology on NPC sera using IgA/EBNA-1 ELISA, IgA/VCA-p18 ELISA and immunoblot score showed no relation with BARF1 sequence diversity (p = 0.802, 0.382 and 0.058, respectively). NPC patients had variable antibody reactivity against purified hexameric NPC-derived BARF1 irrespective of the endogenous BARF1 sequence. Conclusion The sequence variation of BARF1 observed in Indonesian NPC patients and controls may reflect a natural selection of EBV strains unlikely to be predisposing to carcinogenesis. The conserved nature of BARF1 may reflect an important role in EBV (epithelial) persistence. PMID:20849661
Hutajulu, Susanna H; Hoebe, Eveline K; Verkuijlen, Sandra Awm; Fachiroh, Jajah; Hariwijanto, Bambang; Haryana, Sofia M; Stevens, Servi Jc; Greijer, Astrid E; Middeldorp, Jaap M
2010-09-19
BamHI-A rightward frame-1 (BARF1) is a carcinoma-specific Epstein-Barr virus (EBV) encoded oncogene. Here we describe the BARF1 sequence diversity in nasopharyngeal carcinoma (NPC), other EBV-related diseases and Indonesian healthy EBV carriers in relation to EBV genotype, viral load and serology markers. Nasopharyngeal brushings from 56 NPC cases, blood or tissue from 15 other EBV-related disorders, spontaneous B cell lines (LCL) from 5 Indonesian healthy individuals and several prototype EBV isolates were analysed by PCR-direct sequencing. Most NPC isolates revealed specific BARF1 nucleotide changes compared to prototype B95-8 virus. At the protein level these mutations resulted in 3 main substitutions (V29A, W72G, H130R), which are not considered to cause gross tertiary structure alterations in the hexameric BARF1 protein. At least one amino acid conversion was detected in 80.3% of NPC samples compared to 33.3% of non-NPC samples (p < 0.001) and 40.0% of healthy LCLs (p = 0.074). NPC isolates also showed more frequent codon mutation than non-NPC samples. EBV strain typing revealed most isolates as EBV type 1. The viral load of either NPC or non-NPC samples was high, but only in non- NPC group it related to a particular BARF1 variant. Serology on NPC sera using IgA/EBNA-1 ELISA, IgA/VCA-p18 ELISA and immunoblot score showed no relation with BARF1 sequence diversity (p = 0.802, 0.382 and 0.058, respectively). NPC patients had variable antibody reactivity against purified hexameric NPC-derived BARF1 irrespective of the endogenous BARF1 sequence. The sequence variation of BARF1 observed in Indonesian NPC patients and controls may reflect a natural selection of EBV strains unlikely to be predisposing to carcinogenesis. The conserved nature of BARF1 may reflect an important role in EBV (epithelial) persistence.
DeBoever, Christopher; Reid, Erin G.; Smith, Erin N.; Wang, Xiaoyun; Dumaop, Wilmar; Harismendy, Olivier; Carson, Dennis; Richman, Douglas; Masliah, Eliezer; Frazer, Kelly A.
2013-01-01
Primary central nervous system lymphomas (PCNSL) have a dramatically increased prevalence among persons living with AIDS and are known to be associated with human Epstein Barr virus (EBV) infection. Previous work suggests that in some cases, co-infection with other viruses may be important for PCNSL pathogenesis. Viral transcription in tumor samples can be measured using next generation transcriptome sequencing. We demonstrate the ability of transcriptome sequencing to identify viruses, characterize viral expression, and identify viral variants by sequencing four archived AIDS-related PCNSL tissue samples and analyzing raw sequencing reads. EBV was detected in all four PCNSL samples and cytomegalovirus (CMV), JC polyomavirus (JCV), and HIV were also discovered, consistent with clinical diagnoses. CMV was found to express three long non-coding RNAs recently reported as expressed during active infection. Single nucleotide variants were observed in each of the viruses observed and three indels were found in CMV. No viruses were found in several control tumor types including 32 diffuse large B-cell lymphoma samples. This study demonstrates the ability of next generation transcriptome sequencing to accurately identify viruses, including DNA viruses, in solid human cancer tissue samples. PMID:24023918
Yoshida, Naoto; Shimura, Hanako; Masuta, Chikara
2018-06-01
Allexiviruses are economically important garlic viruses that are involved in garlic mosaic diseases. In this study, we characterized the allexivirus cysteine-rich protein (CRP) gene located just downstream of the coat protein (CP) gene in the viral genome. We determined the nucleotide sequences of the CP and CRP genes from numerous allexivirus isolates and performed a phylogenetic analysis. According to the resulting phylogenetic tree, we found that allexiviruses were clearly divided into two major groups (group I and group II) based on the sequences of the CP and CRP genes. In addition, the allexiviruses in group II had distinct sequences just before the CRP gene, while group I isolates did not. The inserted sequence between the CP and CRP genes was partially complementary to garlic 18S rRNA. Using a potato virus X vector, we showed that the CRPs affected viral accumulation and symptom induction in Nicotiana benthamiana, suggesting that the allexivirus CRP is a pathogenicity determinant. We assume that the inserted sequences before the CRP gene may have been generated during viral evolution to alter the termination-reinitiation mechanism for coupled translation of CP and CRP.
NASA Astrophysics Data System (ADS)
Daly, R. A.; Mouser, P. J.; Trexler, R.; Wrighton, K. C.
2014-12-01
Despite a growing appreciation for the ecological role of viruses in marine and gut systems, little is known about their role in the terrestrial deep (> 2000 m) subsurface. We used assembly-based metagenomics to examine the viral component in fluids from hydraulically fractured Marcellus shale gas wells. Here we reconstructed microbial and viral genomes from samples collected 7, 82, and 328 days post fracturing. Viruses accounted for 4.14%, 0.92% and 0.59% of the sample reads that mapped to the assembly. We identified 6 complete, circularized viral genomes and an additional 92 viral contigs > 5 kb with a maximum contig size of 73.6 kb. A BLAST comparison to NCBI viral genomes revealed that 85% of viral contigs had significant hits to the viral order Caudovirales, with 43% of sequences belonging to the family Siphoviridae, 38% to Myoviridae, and 12% to Podoviridae. Enrichment of Caudovirales viruses was supported by a large number of predicted proteins characteristic of tailed viruses including terminases (TerL), tape measure, tail formation, and baseplate related proteins. The viral contigs included evidence of lytic and temperate lifestyles, with the 7 day sample having the greatest number of detected lytic viruses. Notably in this sample, the most abundant virus was lytic and its inferred host, a member of the Vibrionaceae, was not detected at later time points. Analyses of CRISPR sequences (a viral and foreign DNA immune system in bacteria and archaea), linked 18 viral contigs to hosts. CRISPR linkages increased through time and all bacterial and archaeal genomes recovered in the final time point had genes for CRISPR-mediated viral defense. The majority of CRISPR sequences linked phage genomes to several Halanaerobium strains, which are the dominant and persisting members of the community inferred to be responsible for carbon and sulfur cycling in these shales. Network analysis revealed that several viruses were present in the 82 and 328 day samples; this viral persistence is consistent with concomitant temporal stability in geochemistry and microbial community composition. Our findings suggest that after a disturbance (hydraulic fracturing) viral predation and host immunity is an important controller of microbial community structure, metabolism, and thus biogeochemical cycling in the deep subsurface.
Identifying predictors of time-inhomogeneous viral evolutionary processes.
Bielejec, Filip; Baele, Guy; Rodrigo, Allen G; Suchard, Marc A; Lemey, Philippe
2016-07-01
Various factors determine the rate at which mutations are generated and fixed in viral genomes. Viral evolutionary rates may vary over the course of a single persistent infection and can reflect changes in replication rates and selective dynamics. Dedicated statistical inference approaches are required to understand how the complex interplay of these processes shapes the genetic diversity and divergence in viral populations. Although evolutionary models accommodating a high degree of complexity can now be formalized, adequately informing these models by potentially sparse data, and assessing the association of the resulting estimates with external predictors, remains a major challenge. In this article, we present a novel Bayesian evolutionary inference method, which integrates multiple potential predictors and tests their association with variation in the absolute rates of synonymous and non-synonymous substitutions along the evolutionary history. We consider clinical and virological measures as predictors, but also changes in population size trajectories that are simultaneously inferred using coalescent modelling. We demonstrate the potential of our method in an application to within-host HIV-1 sequence data sampled throughout the infection of multiple patients. While analyses of individual patient populations lack statistical power, we detect significant evidence for an abrupt drop in non-synonymous rates in late stage infection and a more gradual increase in synonymous rates over the course of infection in a joint analysis across all patients. The former is predicted by the immune relaxation hypothesis while the latter may be in line with increasing replicative fitness during the asymptomatic stage.
Viral Diversity Threshold for Adaptive Immunity in Prokaryotes
Weinberger, Ariel D.; Wolf, Yuri I.; Lobkovsky, Alexander E.; Gilmore, Michael S.; Koonin, Eugene V.
2012-01-01
ABSTRACT Bacteria and archaea face continual onslaughts of rapidly diversifying viruses and plasmids. Many prokaryotes maintain adaptive immune systems known as clustered regularly interspaced short palindromic repeats (CRISPR) and CRISPR-associated genes (Cas). CRISPR-Cas systems are genomic sensors that serially acquire viral and plasmid DNA fragments (spacers) that are utilized to target and cleave matching viral and plasmid DNA in subsequent genomic invasions, offering critical immunological memory. Only 50% of sequenced bacteria possess CRISPR-Cas immunity, in contrast to over 90% of sequenced archaea. To probe why half of bacteria lack CRISPR-Cas immunity, we combined comparative genomics and mathematical modeling. Analysis of hundreds of diverse prokaryotic genomes shows that CRISPR-Cas systems are substantially more prevalent in thermophiles than in mesophiles. With sequenced bacteria disproportionately mesophilic and sequenced archaea mostly thermophilic, the presence of CRISPR-Cas appears to depend more on environmental temperature than on bacterial-archaeal taxonomy. Mutation rates are typically severalfold higher in mesophilic prokaryotes than in thermophilic prokaryotes. To quantitatively test whether accelerated viral mutation leads microbes to lose CRISPR-Cas systems, we developed a stochastic model of virus-CRISPR coevolution. The model competes CRISPR-Cas-positive (CRISPR-Cas+) prokaryotes against CRISPR-Cas-negative (CRISPR-Cas−) prokaryotes, continually weighing the antiviral benefits conferred by CRISPR-Cas immunity against its fitness costs. Tracking this cost-benefit analysis across parameter space reveals viral mutation rate thresholds beyond which CRISPR-Cas cannot provide sufficient immunity and is purged from host populations. These results offer a simple, testable viral diversity hypothesis to explain why mesophilic bacteria disproportionately lack CRISPR-Cas immunity. More generally, fundamental limits on the adaptability of biological sensors (Lamarckian evolution) are predicted. PMID:23221803
Gambley, C F; Geering, A D W; Steele, V; Thomas, J E
2008-01-01
A previously published partial sequence of pineapple bacilliform virus was shown to be from a retrotransposon (family Metaviridae) and not from a badnavirus as previously thought. Two newly discovered sequence groups isolated from pineapple were associated with bacilliform virions and were transmitted by mealybugs. Phylogenetic analyses indicated that they were members of new badnavirus species. A third caulimovirid sequence was also amplified from pineapple, but available evidence suggests that this DNA is not encapsidated, but more likely derived from an endogenous virus.
Piantadosi, Anne; Mukerji, Shibani S; Chitneni, Pooja; Cho, Tracey A; Cosimi, Lisa A; Hung, Deborah T; Goldberg, Marcia B; Sabeti, Pardis C; Kuritzkes, Daniel R; Grad, Yonatan H
2017-01-01
Enteroviruses cause a wide spectrum of clinical disease. In this study, we describe the case of a young man with orchitis and aseptic meningitis who was diagnosed with enterovirus infection. Using unbiased "metagenomic" massively parallel sequencing, we assembled a near-complete viral genome, the first use of this method for full-genome viral sequencing from cerebrospinal fluid. We found that the genome belonged to the subgroup echovirus 30, which is a common cause of aseptic meningitis but has not been previously reported to cause orchitis.
Discovery of Novel Viruses in Mosquitoes from the Zambezi Valley of Mozambique
Hayer, Juliette; Abilio, Ana Paula; Mulandane, Fernando Chanisso; Verner-Carlsson, Jenny; Falk, Kerstin I.; Fafetine, Jose M.; Berg, Mikael; Blomström, Anne-Lie
2016-01-01
Mosquitoes carry a wide variety of viruses that can cause vector-borne infectious diseases and affect both human and veterinary public health. Although Mozambique can be considered a hot spot for emerging infectious diseases due to factors such as a rich vector population and a close vector/human/wildlife interface, the viral flora in mosquitoes have not previously been investigated. In this study, viral metagenomics was employed to analyze the viral communities in Culex and Mansonia mosquitoes in the Zambezia province of Mozambique. Among the 1.7 and 2.6 million sequences produced from the Culex and Mansonia samples, respectively, 3269 and 983 reads were classified as viral sequences. Viruses belonging to the Flaviviridae, Rhabdoviridae and Iflaviridae families were detected, and different unclassified single- and double-stranded RNA viruses were also identified. A near complete genome of a flavivirus, tentatively named Cuacua virus, was obtained from the Mansonia mosquitoes. Phylogenetic analysis of this flavivirus, using the NS5 amino acid sequence, showed that it grouped with ‘insect-specific’ viruses and was most closely related to Nakiwogo virus previously identified in Uganda. Both mosquito genera had viral sequences related to Rhabdoviruses, and these were most closely related to Culex tritaeniorhynchus rhabdovirus (CTRV). The results from this study suggest that several viruses specific for insects belonging to, for example, the Flaviviridae and Rhabdoviridae families, as well as a number of unclassified RNA viruses, are present in mosquitoes in Mozambique. PMID:27682810
What can we learn about lyssavirus genomes using 454 sequencing?
Höper, Dirk; Finke, Stefan; Freuling, Conrad M; Hoffmann, Bernd; Beer, Martin
2012-01-01
The main task of the individual project number four"Whole genome sequencing, virus-host adaptation, and molecular epidemiological analyses of lyssaviruses "within the network" Lyssaviruses--a potential re-emerging public health threat" is to provide high quality complete genome sequences from lyssaviruses. These sequences are analysed in-depth with regard to the diversity of the viral populations as to both quasi-species and so-called defective interfering RNAs. Moreover, the sequence data will facilitate further epidemiological analyses, will provide insight into the evolution of lyssaviruses and will be the basis for the design of novel nucleic acid based diagnostics. The first results presented here indicate that not only high quality full-length lyssavirus genome sequences can be generated, but indeed efficient analysis of the viral population gets feasible.
Dahl, Viktor; Gisslen, Magnus; Hagberg, Lars; Peterson, Julia; Shao, Wei; Spudich, Serena; Price, Richard W.; Palmer, Sarah
2014-01-01
We sequenced the genome of human immunodeficiency virus type 1 (HIV-1) recovered from 70 cerebrospinal fluid (CSF) specimens and 29 plasma samples and corresponding samples obtained before treatment initiation from 17 subjects receiving suppressive therapy. More CSF sequences than plasma sequences were hypermutants. We determined CSF sequences and plasma sequences in specimens obtained from 2 subjects after treatment initiation. In one subject, we found genetically distinct CSF and plasma sequences, indicating that they came from HIV-1 from 2 different compartments, one potentially the central nervous system, during suppressive therapy. In addition, there was little evidence of viral evolution in the CSF during therapy, suggesting that continuous virus replication is not the major cause of viral persistence in the central nervous system. PMID:24338353
Dahl, Viktor; Gisslen, Magnus; Hagberg, Lars; Peterson, Julia; Shao, Wei; Spudich, Serena; Price, Richard W; Palmer, Sarah
2014-05-15
We sequenced the genome of human immunodeficiency virus type 1 (HIV-1) recovered from 70 cerebrospinal fluid (CSF) specimens and 29 plasma samples and corresponding samples obtained before treatment initiation from 17 subjects receiving suppressive therapy. More CSF sequences than plasma sequences were hypermutants. We determined CSF sequences and plasma sequences in specimens obtained from 2 subjects after treatment initiation. In one subject, we found genetically distinct CSF and plasma sequences, indicating that they came from HIV-1 from 2 different compartments, one potentially the central nervous system, during suppressive therapy. In addition, there was little evidence of viral evolution in the CSF during therapy, suggesting that continuous virus replication is not the major cause of viral persistence in the central nervous system.
Wang, Nidan; Li, Yijia; Han, Yang; Xie, Jing; Li, Taisheng
2017-06-01
The association between baseline human immunodeficiency virus (HIV) sequence diversity and HIV DNA decay after the initiation of antiretroviral therapy (ART) remains uncharacterized during the early stages of HIV infection. Samples were obtained from a cohort of 17 patients with early HIV infection (<6 months after infection) who initiated ART, and the C2V5 region of the HIV-1 envelope (env) gene was amplified via single genome amplification (SGA) to determine the peripheral plasma HIV quasispecies. We categorized HIV quasispecies into two groups according to baseline viral sequence genetic distance, which was determined by the Poisson-Fitter tool. Total HIV DNA in peripheral blood mononuclear cells (PBMCs), viral load, and T cell subsets were measured prior to and after the initiation of ART. The median SGA sequence number was 17 (range 6-28). At baseline, we identified 7 patients with homogeneous viral populations (designated the Homogeneous group) and 10 patients with heterogeneous viral populations (designated the Heterogeneous group) based on SGA sequences. Both groups exhibited similar HIV DNA decay rates during the first 6 months of ART (P > 0.99), but the Homogenous group experienced more prominent decay than the Heterogeneous group after 6 months (P = 0.037). The Heterogeneous group had higher CD4 cell counts after ART initiation; however, both groups had comparable recovery in terms of CD4/CD8 ratios and CD8 T cell activation levels. Viral population homogeneity upon the initiation of ART is associated with a decrease in HIV DNA levels during ART. J. Med. Virol. 89:982-988, 2017. © 2017 Wiley Periodicals, Inc. © 2017 Wiley Periodicals, Inc.
Metzgar, David; Myers, Christopher A.; Russell, Kevin L.; Faix, Dennis; Blair, Patrick J.; Brown, Jason; Vo, Scott; Swayne, David E.; Thomas, Colleen; Stenger, David A.; Lin, Baochuan; Malanoski, Anthony P.; Wang, Zheng; Blaney, Kate M.; Long, Nina C.; Schnur, Joel M.; Saad, Magdi D.; Borsuk, Lisa A.; Lichanska, Agnieszka M.; Lorence, Matthew C.; Weslowski, Brian; Schafer, Klaus O.; Tibbetts, Clark
2010-01-01
For more than four decades the cause of most type A influenza virus infections of humans has been attributed to only two viral subtypes, A/H1N1 or A/H3N2. In contrast, avian and other vertebrate species are a reservoir of type A influenza virus genome diversity, hosting strains representing at least 120 of 144 combinations of 16 viral hemagglutinin and 9 viral neuraminidase subtypes. Viral genome segment reassortments and mutations emerging within this reservoir may spawn new influenza virus strains as imminent epidemic or pandemic threats to human health and poultry production. Traditional methods to detect and differentiate influenza virus subtypes are either time-consuming and labor-intensive (culture-based) or remarkably insensitive (antibody-based). Molecular diagnostic assays based upon reverse transcriptase-polymerase chain reaction (RT-PCR) have short assay cycle time, and high analytical sensitivity and specificity. However, none of these diagnostic tests determine viral gene nucleotide sequences to distinguish strains and variants of a detected pathogen from one specimen to the next. Decision-quality, strain- and variant-specific pathogen gene sequence information may be critical for public health, infection control, surveillance, epidemiology, or medical/veterinary treatment planning. The Resequencing Pathogen Microarray (RPM-Flu) is a robust, highly multiplexed and target gene sequencing-based alternative to both traditional culture- or biomarker-based diagnostic tests. RPM-Flu is a single, simultaneous differential diagnostic assay for all subtype combinations of type A influenza viruses and for 30 other viral and bacterial pathogens that may cause influenza-like illness. These other pathogen targets of RPM-Flu may co-infect and compound the morbidity and/or mortality of patients with influenza. The informative specificity of a single RPM-Flu test represents specimen-specific viral gene sequences as determinants of virus type, A/HN subtype, virulence, host-range, and resistance to antiviral agents. PMID:20140251
Metzgar, David; Myers, Christopher A; Russell, Kevin L; Faix, Dennis; Blair, Patrick J; Brown, Jason; Vo, Scott; Swayne, David E; Thomas, Colleen; Stenger, David A; Lin, Baochuan; Malanoski, Anthony P; Wang, Zheng; Blaney, Kate M; Long, Nina C; Schnur, Joel M; Saad, Magdi D; Borsuk, Lisa A; Lichanska, Agnieszka M; Lorence, Matthew C; Weslowski, Brian; Schafer, Klaus O; Tibbetts, Clark
2010-02-03
For more than four decades the cause of most type A influenza virus infections of humans has been attributed to only two viral subtypes, A/H1N1 or A/H3N2. In contrast, avian and other vertebrate species are a reservoir of type A influenza virus genome diversity, hosting strains representing at least 120 of 144 combinations of 16 viral hemagglutinin and 9 viral neuraminidase subtypes. Viral genome segment reassortments and mutations emerging within this reservoir may spawn new influenza virus strains as imminent epidemic or pandemic threats to human health and poultry production. Traditional methods to detect and differentiate influenza virus subtypes are either time-consuming and labor-intensive (culture-based) or remarkably insensitive (antibody-based). Molecular diagnostic assays based upon reverse transcriptase-polymerase chain reaction (RT-PCR) have short assay cycle time, and high analytical sensitivity and specificity. However, none of these diagnostic tests determine viral gene nucleotide sequences to distinguish strains and variants of a detected pathogen from one specimen to the next. Decision-quality, strain- and variant-specific pathogen gene sequence information may be critical for public health, infection control, surveillance, epidemiology, or medical/veterinary treatment planning. The Resequencing Pathogen Microarray (RPM-Flu) is a robust, highly multiplexed and target gene sequencing-based alternative to both traditional culture- or biomarker-based diagnostic tests. RPM-Flu is a single, simultaneous differential diagnostic assay for all subtype combinations of type A influenza viruses and for 30 other viral and bacterial pathogens that may cause influenza-like illness. These other pathogen targets of RPM-Flu may co-infect and compound the morbidity and/or mortality of patients with influenza. The informative specificity of a single RPM-Flu test represents specimen-specific viral gene sequences as determinants of virus type, A/HN subtype, virulence, host-range, and resistance to antiviral agents.
Stoppani, Elena; Bassi, Ivan; Dotti, Silvia; Lizier, Michela; Ferrari, Maura; Lucchini, Franco
2015-08-01
Influenza A virus is the principal agent responsible of the respiratory tract's infections in humans. Every year, highly pathogenic and infectious strains with new antigenic assets appear, making ineffective vaccines so far developed. The discovery of RNA interference (RNAi) opened the way to the progress of new promising drugs against Influenza A virus and also to the introduction of disease resistance traits in genetically modified animals. In this paper, we show that Madin-Darby Canine Kidney (MDCK) cell line expressing short hairpin RNAs (shRNAs) cassette, designed on a specific conserved region of the nucleoprotein (NP) viral genome, can strongly inhibit the viral replication of four viral strains sharing the target sequence, reducing the viral mRNA respectively to 2.5×10(-4), 7.5×10(-5), 1.7×10(-3), 1.9×10(-4) compared to the control, as assessed by real-time PCR. Moreover, we demonstrate that during the challenge with a viral strain bearing a single mismatch on the target sequence, although a weaker inhibition is observed, viral mRNA is still lowered down to 1.2×10(-3) folds in the shRNA-expressing clone compared to the control, indicating a broad potential use of this approach. In addition, we developed a highly predictive and fast screening test of siRNA sequences based on dual-luciferase assay, useful for the in vitro prediction of the potential effect of viral inhibition. In conclusion, these findings reveal new siRNA sequences able to inhibit Influenza A virus replication and provide a basis for the development of siRNAs as prophylaxis and therapy for influenza infection both in humans and animals. Copyright © 2015 Elsevier B.V. All rights reserved.
HIV-1 low copy viral sequencing-A prototype assay.
Mellberg, Tomas; Krabbe, Jon; Gisslén, Magnus; Svennerholm, Bo
2016-01-01
In HIV-1 patients with low viral burden, sequencing is often problematic, yet important. This study presents a sensitive, sub-type independent system for sequencing of low level viremia. Sequencing data from 32 HIV-1 infected patients with low level viremia were collected longitudinally. A combination of ViroSeq® HIV-1 Genotyping System and an in-house nesting protocol was used. Eight sub-types were represented. The success-rate of amplification of both PR and RT in the same sample was 100% in samples with viral loads above 100 copies/ml. Below 100 copies/ml, this study managed to amplify both regions in 7/13 (54%) samples. The assays were able to amplify either PR or RT in all sub-types included but one sub-type A specimen. In conclusion, this study presents a promising, simple assay to increase the ability to perform HIV-1 resistance testing at low level viremia. This is a prototype assay and the method needs further testing to evaluate clinical performance.
Hang, Jun; Vento, Todd J; Norby, Erica A; Jarman, Richard G; Keiser, Paul B; Kuschner, Robert A; Binn, Leonard N
2017-08-01
Human adenoviruses (HAdV), in particular types 4 and 7, frequently cause acute respiratory disease (ARD) during basic military training. HAdV4 and HAdV7 vaccines reduced the ARD risk in U.S. military. It is important to identify other respiratory pathogens and assess their potential impact on military readiness. In 2002, during a period when the HAdV vaccines were not available, throat swabs were taken from trainees (n = 184) with respiratory infections at Fort Jackson, South Carolina. Viral etiology was investigated initially with viral culture and neutralization assay and recently in this study by sequencing the viral isolates. Viral culture and neutralization assays identified 90 HAdV4 isolates and 27 additional cultures that showed viral cytopathic effects (CPE), including some with picornavirus-like CPE. Next-generation sequencing confirmed these results and determined viral genotypes, including 77 HAdV4, 4 HAdV3, 1 HAdV2, 17 coxsackievirus A21 (CAV21), and 1 enterovirus D68. Two samples were positive for both HAdV4 and CAV21. The identified genotypes are phylogenetically close to but distinct from those found during other years or in other military/non-military sites. HAdV4 is the predominant respiratory pathogen in unvaccinated military trainee. HAdV4 has temporal and demographic variability. CAV21 is a significant respiratory pathogen and needs to be evaluated for its current significance in military basic trainees. © 2017 Wiley Periodicals, Inc.
Diversity of DNA and RNA Viruses in Indoor Air As Assessed via Metagenomic Sequencing.
Rosario, Karyna; Fierer, Noah; Miller, Shelly; Luongo, Julia; Breitbart, Mya
2018-02-06
Diverse bacterial and fungal communities inhabit human-occupied buildings and circulate in indoor air; however, viral diversity in these man-made environments remains largely unknown. Here we investigated DNA and RNA viruses circulating in the air of 12 university dormitory rooms by analyzing dust accumulated over a one-year period on heating, ventilation, and air conditioning (HVAC) filters. A metagenomic sequencing approach was used to determine the identity and diversity of viral particles extracted from the HVAC filters. We detected a broad diversity of viruses associated with a range of hosts, including animals, arthropods, bacteria, fungi, humans, plants, and protists, suggesting that disparate organisms can contribute to indoor airborne viral communities. Viral community composition and the distribution of human-infecting papillomaviruses and polyomaviruses were distinct in the different dormitory rooms, indicating that airborne viral communities are variable in human-occupied spaces and appear to reflect differential rates of viral shedding from room occupants. This work significantly expands the known airborne viral diversity found indoors, enabling the design of sensitive and quantitative assays to further investigate specific viruses of interest and providing new insight into the likely sources of viruses found in indoor air.
Alho, Olli-Pekka
2003-12-01
The objective was to assess the impact of ostial obstruction and anatomical variations on paranasal sinus functioning during viral colds with computed tomography (CT) in subjects with and without a history of sinusitis. Cross-sectional study. Twenty-three volunteers with a history of recurrent sinusitis and 25 subjects without such history who had an early (symptoms for 2-4 d) natural cold were examined by taking viral specimens and CT scans and recording symptoms. The pathological sinus changes in the CT scans were scored, and several paranasal bony anatomical variations recorded. Viral origin of the cold was identified in 32 (67%) subjects, similarly in the two groups. Ostiomeatal obstruction and anatomical variations were equally frequent in the subjects with and without a sinusitis history (17 of 23 vs. 17 of 25 for ostial obstruction and 17 of 23 vs. 20 of 25 for at least one variation, respectively). However, in the case of ostiomeatal obstruction the combined CT score of ethmoidal and maxillary sinuses was significantly higher in the subjects with a sinusitis history than in those without (mean +/- SD, 3.0 +/- 0.9 vs. 2.3 +/- 1.2 [P =.05, t test]). In the sinusitis-prone subjects, several variations were associated significantly with various pathological sinus CT changes (septal deviation, horizontally situated processus uncinatus, large concha bullosa, and laterally concave concha media), whereas in the control subjects, only the presence of Haller cells was related to sphenoidal sinus disease. Ostiomeatal complex obstruction and bony anatomical variations seem to have a greater impact on the functioning of paranasal sinuses during viral colds in sinusitis-prone subjects than in subjects without a sinusitis history. These differences may be associated with the increased risk of bacterial sinusitis.
Martínez-Torres, A. O.; Mosquera, M. M.; Sanz, J. C.; Ramos, B.; Echevarría, J. E.
2009-01-01
An outbreak of rubella affected 460 individuals in 2004 and 2005 in the community of Madrid, Spain. Most of the patients were nonvaccinated Latin American immigrants or Spanish males. This study presents the first data on rubella virus genotypes in Spain. Forty selected clinical samples (2 urine, 5 serum, 3 blood, 2 saliva, and 28 pharyngeal exudate samples) from 40 cases were collected. The 739-nucleotide sequence recommended by the World Health Organization obtained from viral RNA in these samples was analyzed by using the MEGA v4.0 software. Seventeen isolates were obtained from 40 clinical samples from the outbreak, including two isolated from congenital rubella syndrome cases. Only viral RNA of genotype 1j was detected in both isolates and clinical specimens. Two variations in amino acids, G253C and T394S, which are involved in neutralization epitopes arose during the outbreak, but apparently there was no positive selection of either of them. The origin of the outbreak remains unknown because of poor virologic surveillance in Latin America and the African countries neighboring Spain. On the other hand, this is the first report of this genotype in Europe. The few published sequences of genotype 1j indicate that it comes from Japan and the Philippines, but there are no epidemiological data supporting this as the origin of the Madrid outbreak. PMID:19020066
Kim, John E; Beckthold, Brenda; Chen, Zhaoxia; Mihowich, Jennifer; Malloch, Laurie; Gill, Michael John
2007-11-01
The presence of HIV-1 non-B subtypes is increasing worldwide. This poses challenges to commercial diagnostic and viral load (RNA) monitoring tests that are predominantly based on HIV-1 subtype B strains. Based on phylogenetic analysis of the gag, pol, and env gene regions, we describe the first HIV-1 H/J recombinant in Canada that presented divergent viral load values. DNA sequence analysis of the gag gene region further revealed that genetic diversity between this H/J recombinant and the primers and probes used in the bio-Merieux Nuclisens HIV-1 QT (Nuclisens) and Roche Amplicor Monitor HIV-1, v1.5 (Monitor) viral RNA assays can erroneously lead to undetectable viral load values. This observation appears to be more problematic in the Nuclisens assay. In light of increasing genetic diversity in HIV worldwide we recommend that DNA sequencing of HIV, especially in the gag gene region targeted by primers and probes used in molecular diagnostic and viral load tests, be incorporated into clinical monitoring practices.
Phage and Nucleocytoplasmic Large Viral Sequences Dominate Coral Viromes from the Arabian Gulf.
Mahmoud, Huda; Jose, Liny
2017-01-01
Corals that naturally thrive under extreme conditions are gaining increasing attention due to their importance as living models to understand the impact of global warming on world corals. Here, we present the first metagenomic study of viral communities in corals thriving in a thermally variable water body in which the temperature fluctuates between 11 and 39°C in different seasons. The viral assemblages of two of the most abundant massive ( Porites harrisoni ) and branching ( Acropora downingi ) corals in offshore and inshore reef systems in the northern Arabian Gulf were investigated. Samples were collected from five reef systems during summer, autumn and winter of 2011/2012. The two coral viromes contain 12 viral families, including 10 dsDNA viral families [Siphoviridae, Podoviridae, Myoviridae, Phycodnaviridae, Baculoviridae, Herpesviridae, Adenoviridae, Alloherpesviridae, Mimiviridae and one unclassified family], one-ssDNA viral family (Microviridae) and one RNA viral family (Retroviridae). Overall, sequences significantly similar to Podoviridae were the most abundant in the P. harrisoni and A. downingi viromes. Various morphological types of virus-like particles (VLPs) were confirmed in the healthy coral tissue by transmission electron microscopy, including large tailless VLPs and electron-dense core VLPs. Tailed bacteriophages were isolated from coral tissue using a plaque assay. Higher functional gene diversity was recorded in A. downingi than in P. harrisoni , and comparative metagenomics revealed that the Gulf viral assemblages are functionally distinct from Pacific Ocean coral viral communities.
Efficient error correction for next-generation sequencing of viral amplicons
2012-01-01
Background Next-generation sequencing allows the analysis of an unprecedented number of viral sequence variants from infected patients, presenting a novel opportunity for understanding virus evolution, drug resistance and immune escape. However, sequencing in bulk is error prone. Thus, the generated data require error identification and correction. Most error-correction methods to date are not optimized for amplicon analysis and assume that the error rate is randomly distributed. Recent quality assessment of amplicon sequences obtained using 454-sequencing showed that the error rate is strongly linked to the presence and size of homopolymers, position in the sequence and length of the amplicon. All these parameters are strongly sequence specific and should be incorporated into the calibration of error-correction algorithms designed for amplicon sequencing. Results In this paper, we present two new efficient error correction algorithms optimized for viral amplicons: (i) k-mer-based error correction (KEC) and (ii) empirical frequency threshold (ET). Both were compared to a previously published clustering algorithm (SHORAH), in order to evaluate their relative performance on 24 experimental datasets obtained by 454-sequencing of amplicons with known sequences. All three algorithms show similar accuracy in finding true haplotypes. However, KEC and ET were significantly more efficient than SHORAH in removing false haplotypes and estimating the frequency of true ones. Conclusions Both algorithms, KEC and ET, are highly suitable for rapid recovery of error-free haplotypes obtained by 454-sequencing of amplicons from heterogeneous viruses. The implementations of the algorithms and data sets used for their testing are available at: http://alan.cs.gsu.edu/NGS/?q=content/pyrosequencing-error-correction-algorithm PMID:22759430
Efficient error correction for next-generation sequencing of viral amplicons.
Skums, Pavel; Dimitrova, Zoya; Campo, David S; Vaughan, Gilberto; Rossi, Livia; Forbi, Joseph C; Yokosawa, Jonny; Zelikovsky, Alex; Khudyakov, Yury
2012-06-25
Next-generation sequencing allows the analysis of an unprecedented number of viral sequence variants from infected patients, presenting a novel opportunity for understanding virus evolution, drug resistance and immune escape. However, sequencing in bulk is error prone. Thus, the generated data require error identification and correction. Most error-correction methods to date are not optimized for amplicon analysis and assume that the error rate is randomly distributed. Recent quality assessment of amplicon sequences obtained using 454-sequencing showed that the error rate is strongly linked to the presence and size of homopolymers, position in the sequence and length of the amplicon. All these parameters are strongly sequence specific and should be incorporated into the calibration of error-correction algorithms designed for amplicon sequencing. In this paper, we present two new efficient error correction algorithms optimized for viral amplicons: (i) k-mer-based error correction (KEC) and (ii) empirical frequency threshold (ET). Both were compared to a previously published clustering algorithm (SHORAH), in order to evaluate their relative performance on 24 experimental datasets obtained by 454-sequencing of amplicons with known sequences. All three algorithms show similar accuracy in finding true haplotypes. However, KEC and ET were significantly more efficient than SHORAH in removing false haplotypes and estimating the frequency of true ones. Both algorithms, KEC and ET, are highly suitable for rapid recovery of error-free haplotypes obtained by 454-sequencing of amplicons from heterogeneous viruses.The implementations of the algorithms and data sets used for their testing are available at: http://alan.cs.gsu.edu/NGS/?q=content/pyrosequencing-error-correction-algorithm.
DOE Office of Scientific and Technical Information (OSTI.GOV)
Roux, Simon; Emerson, Joanne B.; Eloe-Fadrosh, Emiley A.
BackgroundViral metagenomics (viromics) is increasingly used to obtain uncultivated viral genomes, evaluate community diversity, and assess ecological hypotheses. While viromic experimental methods are relatively mature and widely accepted by the research community, robust bioinformatics standards remain to be established. Here we usedin silicomock viral communities to evaluate the viromic sequence-to-ecological-inference pipeline, including (i) read pre-processing and metagenome assembly, (ii) thresholds applied to estimate viral relative abundances based on read mapping to assembled contigs, and (iii) normalization methods applied to the matrix of viral relative abundances for alpha and beta diversity estimates. ResultsTools specifically designed for metagenomes, specifically metaSPAdes, MEGAHIT, andmore » IDBA-UD, were the most effective at assembling viromes. Read pre-processing, such as partitioning, had virtually no impact on assembly output, but may be useful when hardware is limited. Viral populations with 2–5 × coverage typically assembled well, whereas lesser coverage led to fragmented assembly. Strain heterogeneity within populations hampered assembly, especially when strains were closely related (average nucleotide identity, or ANI ≥97%) and when the most abundant strain represented <50% of the population. Viral community composition assessments based on read recruitment were generally accurate when the following thresholds for detection were applied: (i) ≥10 kb contig lengths to define populations, (ii) coverage defined from reads mapping at ≥90% identity, and (iii) ≥75% of contig length with ≥1 × coverage. Finally, although data are limited to the most abundant viruses in a community, alpha and beta diversity patterns were robustly estimated (±10%) when comparing samples of similar sequencing depth, but more divergent (up to 80%) when sequencing depth was uneven across the dataset. In the latter cases, the use of normalization methods specifically developed for metagenomes provided the best estimates. ConclusionsThese simulations provide benchmarks for selecting analysis cut-offs and establish that an optimized sample-to-ecological-inference viromics pipeline is robust for making ecological inferences from natural viral communities. Continued development to better accessing RNA, rare, and/or diverse viral populations and improved reference viral genome availability will alleviate many of viromics remaining limitations.« less
Roux, Simon; Emerson, Joanne B.; Eloe-Fadrosh, Emiley A.; ...
2017-09-21
BackgroundViral metagenomics (viromics) is increasingly used to obtain uncultivated viral genomes, evaluate community diversity, and assess ecological hypotheses. While viromic experimental methods are relatively mature and widely accepted by the research community, robust bioinformatics standards remain to be established. Here we usedin silicomock viral communities to evaluate the viromic sequence-to-ecological-inference pipeline, including (i) read pre-processing and metagenome assembly, (ii) thresholds applied to estimate viral relative abundances based on read mapping to assembled contigs, and (iii) normalization methods applied to the matrix of viral relative abundances for alpha and beta diversity estimates. ResultsTools specifically designed for metagenomes, specifically metaSPAdes, MEGAHIT, andmore » IDBA-UD, were the most effective at assembling viromes. Read pre-processing, such as partitioning, had virtually no impact on assembly output, but may be useful when hardware is limited. Viral populations with 2–5 × coverage typically assembled well, whereas lesser coverage led to fragmented assembly. Strain heterogeneity within populations hampered assembly, especially when strains were closely related (average nucleotide identity, or ANI ≥97%) and when the most abundant strain represented <50% of the population. Viral community composition assessments based on read recruitment were generally accurate when the following thresholds for detection were applied: (i) ≥10 kb contig lengths to define populations, (ii) coverage defined from reads mapping at ≥90% identity, and (iii) ≥75% of contig length with ≥1 × coverage. Finally, although data are limited to the most abundant viruses in a community, alpha and beta diversity patterns were robustly estimated (±10%) when comparing samples of similar sequencing depth, but more divergent (up to 80%) when sequencing depth was uneven across the dataset. In the latter cases, the use of normalization methods specifically developed for metagenomes provided the best estimates. ConclusionsThese simulations provide benchmarks for selecting analysis cut-offs and establish that an optimized sample-to-ecological-inference viromics pipeline is robust for making ecological inferences from natural viral communities. Continued development to better accessing RNA, rare, and/or diverse viral populations and improved reference viral genome availability will alleviate many of viromics remaining limitations.« less
Classification of viral zoonosis through receptor pattern analysis.
Bae, Se-Eun; Son, Hyeon Seok
2011-04-13
Viral zoonosis, the transmission of a virus from its primary vertebrate reservoir species to humans, requires ubiquitous cellular proteins known as receptor proteins. Zoonosis can occur not only through direct transmission from vertebrates to humans, but also through intermediate reservoirs or other environmental factors. Viruses can be categorized according to genotype (ssDNA, dsDNA, ssRNA and dsRNA viruses). Among them, the RNA viruses exhibit particularly high mutation rates and are especially problematic for this reason. Most zoonotic viruses are RNA viruses that change their envelope proteins to facilitate binding to various receptors of host species. In this study, we sought to predict zoonotic propensity through the analysis of receptor characteristics. We hypothesized that the major barrier to interspecies virus transmission is that receptor sequences vary among species--in other words, that the specific amino acid sequence of the receptor determines the ability of the viral envelope protein to attach to the cell. We analysed host-cell receptor sequences for their hydrophobicity/hydrophilicity characteristics. We then analysed these properties for similarities among receptors of different species and used a statistical discriminant analysis to predict the likelihood of transmission among species. This study is an attempt to predict zoonosis through simple computational analysis of receptor sequence differences. Our method may be useful in predicting the zoonotic potential of newly discovered viral strains.
APOBEC3D and APOBEC3F potently promote HIV-1 diversification and evolution in humanized mouse model.
Sato, Kei; Takeuchi, Junko S; Misawa, Naoko; Izumi, Taisuke; Kobayashi, Tomoko; Kimura, Yuichi; Iwami, Shingo; Takaori-Kondo, Akifumi; Hu, Wei-Shau; Aihara, Kazuyuki; Ito, Mamoru; An, Dong Sung; Pathak, Vinay K; Koyanagi, Yoshio
2014-10-01
Several APOBEC3 proteins, particularly APOBEC3D, APOBEC3F, and APOBEC3G, induce G-to-A hypermutations in HIV-1 genome, and abrogate viral replication in experimental systems, but their relative contributions to controlling viral replication and viral genetic variation in vivo have not been elucidated. On the other hand, an HIV-1-encoded protein, Vif, can degrade these APOBEC3 proteins via a ubiquitin/proteasome pathway. Although APOBEC3 proteins have been widely considered as potent restriction factors against HIV-1, it remains unclear which endogenous APOBEC3 protein(s) affect HIV-1 propagation in vivo. Here we use a humanized mouse model and HIV-1 with mutations in Vif motifs that are responsible for specific APOBEC3 interactions, DRMR/AAAA (4A) or YRHHY/AAAAA (5A), and demonstrate that endogenous APOBEC3D/F and APOBEC3G exert strong anti-HIV-1 activity in vivo. We also show that the growth kinetics of 4A HIV-1 negatively correlated with the expression level of APOBEC3F. Moreover, single genome sequencing analyses of viral RNA in plasma of infected mice reveal that 4A HIV-1 is specifically and significantly diversified. Furthermore, a mutated virus that is capable of using both CCR5 and CXCR4 as entry coreceptor is specifically detected in 4A HIV-1-infected mice. Taken together, our results demonstrate that APOBEC3D/F and APOBEC3G fundamentally work as restriction factors against HIV-1 in vivo, but at the same time, that APOBEC3D and APOBEC3F are capable of promoting viral diversification and evolution in vivo.
Nakagawa, So; Takahashi, Mahoko Ueda
2016-01-01
In mammals, approximately 10% of genome sequences correspond to endogenous viral elements (EVEs), which are derived from ancient viral infections of germ cells. Although most EVEs have been inactivated, some open reading frames (ORFs) of EVEs obtained functions in the hosts. However, EVE ORFs usually remain unannotated in the genomes, and no databases are available for EVE ORFs. To investigate the function and evolution of EVEs in mammalian genomes, we developed EVE ORF databases for 20 genomes of 19 mammalian species. A total of 736,771 non-overlapping EVE ORFs were identified and archived in a database named gEVE (http://geve.med.u-tokai.ac.jp). The gEVE database provides nucleotide and amino acid sequences, genomic loci and functional annotations of EVE ORFs for all 20 genomes. In analyzing RNA-seq data with the gEVE database, we successfully identified the expressed EVE genes, suggesting that the gEVE database facilitates studies of the genomic analyses of various mammalian species.Database URL: http://geve.med.u-tokai.ac.jp. © The Author(s) 2016. Published by Oxford University Press.
Nakagawa, So; Takahashi, Mahoko Ueda
2016-01-01
In mammals, approximately 10% of genome sequences correspond to endogenous viral elements (EVEs), which are derived from ancient viral infections of germ cells. Although most EVEs have been inactivated, some open reading frames (ORFs) of EVEs obtained functions in the hosts. However, EVE ORFs usually remain unannotated in the genomes, and no databases are available for EVE ORFs. To investigate the function and evolution of EVEs in mammalian genomes, we developed EVE ORF databases for 20 genomes of 19 mammalian species. A total of 736,771 non-overlapping EVE ORFs were identified and archived in a database named gEVE (http://geve.med.u-tokai.ac.jp). The gEVE database provides nucleotide and amino acid sequences, genomic loci and functional annotations of EVE ORFs for all 20 genomes. In analyzing RNA-seq data with the gEVE database, we successfully identified the expressed EVE genes, suggesting that the gEVE database facilitates studies of the genomic analyses of various mammalian species. Database URL: http://geve.med.u-tokai.ac.jp PMID:27242033
Detection of Merkel Cell Polyomavirus DNA in Serum Samples of Healthy Blood Donors
Mazzoni, Elisa; Rotondo, John C.; Marracino, Luisa; Selvatici, Rita; Bononi, Ilaria; Torreggiani, Elena; Touzé, Antoine; Martini, Fernanda; Tognon, Mauro G.
2017-01-01
Merkel cell polyomavirus (MCPyV) has been detected in 80% of Merkel cell carcinomas (MCC). In the host, the MCPyV reservoir remains elusive. MCPyV DNA sequences were revealed in blood donor buffy coats. In this study, MCPyV DNA sequences were investigated in the sera (n = 190) of healthy blood donors. Two MCPyV DNA sequences, coding for the viral oncoprotein large T antigen (LT), were investigated using polymerase chain reaction (PCR) methods and DNA sequencing. Circulating MCPyV sequences were detected in sera with a prevalence of 2.6% (5/190), at low-DNA viral load, which is in the range of 1–4 and 1–5 copies/μl by real-time PCR and droplet digital PCR, respectively. DNA sequencing carried out in the five MCPyV-positive samples indicated that the two MCPyV LT sequences which were analyzed belong to the MKL-1 strain. Circulating MCPyV LT sequences are present in blood donor sera. MCPyV-positive samples from blood donors could represent a potential vehicle for MCPyV infection in receivers, whereas an increase in viral load may occur with multiple blood transfusions. In certain patient conditions, such as immune-depression/suppression, additional disease or old age, transfusion of MCPyV-positive samples could be an additional risk factor for MCC onset. PMID:29238698
Hoffman, Brett; Li, Zhubing; Liu, Qiang
2015-08-01
Hepatitis C virus (HCV) non-structural protein 5A (NS5A) is essential for viral replication; however, its effect on HCV RNA translation remains controversial partially due to the use of reporters lacking the 3' UTR, where NS5A binds to the poly(U/UC) sequence. We investigated the role of NS5A in HCV translation using a monocistronic RNA containing a Renilla luciferase gene flanked by the HCV UTRs. We found that NS5A downregulated viral RNA translation in a dose-dependent manner. This downregulation required both the 5' and 3' UTRs of HCV because substitution of either sequence with the 5' and 3' UTRs of enterovirus 71 or a cap structure at the 5' end eliminated the effects of NS5A on translation. Translation of the HCV genomic RNA was also downregulated by NS5A. The inhibition of HCV translation by NS5A required the poly(U/UC) sequence in the 3' UTR as NS5A did not affect translation when it was deleted. In addition, we showed that, whilst the amphipathic α-helix of NS5A has no effect on viral translation, the three domains of NS5A can inhibit translation independently, also dependent on the presence of the poly(U/UC) sequence in the 3' UTR. These results suggested that NS5A downregulated HCV RNA translation through a mechanism involving the poly(U/UC) sequence in the 3' UTR.
Xia, Xia-Yu; Ge, Meng; Hsi, Jenny H; He, Xiang; Ruan, Yu-Hua; Wang, Zhi-Xin; Shao, Yi-Ming; Pan, Xian-Ming
2014-01-01
Accurate estimates of HIV-1 incidence are essential for monitoring epidemic trends and evaluating intervention efforts. However, the long asymptomatic stage of HIV-1 infection makes it difficult to effectively distinguish incident infections from chronic ones. Current incidence assays based on serology or viral sequence diversity are both still lacking in accuracy. In the present work, a sequence clustering based diversity (SCBD) assay was devised by utilizing the fact that viral sequences derived from each transmitted/founder (T/F) strain tend to cluster together at early stage, and that only the intra-cluster diversity is correlated with the time since HIV-1 infection. The dot-matrix pairwise alignment was used to eliminate the disproportional impact of insertion/deletions (indels) and recombination events, and so was the proportion of clusterable sequences (Pc) as an index to identify late chronic infections with declined viral genetic diversity. Tested on a dataset containing 398 incident and 163 chronic infection cases collected from the Los Alamos HIV database (last modified 2/8/2012), our SCBD method achieved 99.5% sensitivity and 98.8% specificity, with an overall accuracy of 99.3%. Further analysis and evaluation also suggested its performance was not affected by host factors such as the viral subtypes and transmission routes. The SCBD method demonstrated the potential of sequencing based techniques to become useful for identifying incident infections. Its use may be most advantageous for settings with low to moderate incidence relative to available resources. The online service is available at http://www.bioinfo.tsinghua.edu.cn:8080/SCBD/index.jsp.
Kosoltanapiwat, Nathamon; Yindee, Marnoch; Chavez, Irwin Fernandez; Leaungwutiwong, Pornsawan; Adisakwattana, Poom; Singhasivanon, Pratap; Thawornkuno, Charin; Thippornchai, Narin; Rungruengkitkun, Amporn; Soontorn, Juthamas; Pearsiriwuttipong, Sasipan
2016-01-25
Bovine enteroviruses (BEV) are members of the genus Enterovirus in the family Picornaviridae. They are predominantly isolated from cattle feces, but also are detected in feces of other animals, including goats and deer. These viruses are found in apparently healthy animals, as well as in animals with clinical signs and several studies reported recently suggest a potential role of BEV in causing disease in animals. In this study, we surveyed the presence of BEV in domestic and wild animals in Thailand, and assessed their genetic variability. Viral RNA was extracted from fecal samples of cattle, domestic goats, Indian bison (gaurs), and deer. The 5' untranslated region (5'UTR) was amplified by nested reverse transcription-polymerase chain reaction (RT-PCR) with primers specific to BEV 5'UTR. PCR products were sequenced and analyzed phylogenetically using the neighbor-joining algorithm to observe genetic variations in regions of the bovine and bovine-like enteroviral 5'UTR found in this study. BEV and BEV-like sequences were detected in the fecal samples of cattle (40/60, 67 %), gaurs (3/30, 10 %), and goats (11/46, 24 %). Phylogenetic analyses of the partial 5'UTR sequences indicated that different BEV variants (both EV-E and EV-F species) co-circulated in the domestic cattle, whereas the sequences from gaurs and goats clustered according to the animal species, suggesting that these viruses are host species-specific. Varieties of BEV and BEV-like 5'UTR sequences were detected in fecal samples from both domestic and wild animals. To our knowledge, this is the first report of the genetic variability of BEV in Thailand.
Antell, Gregory C.; Zhong, Wen; Kercher, Katherine; Passic, Shendra; Williams, Jean; Liu, Yucheng; James, Tony; Jacobson, Jeffrey M.; Szep, Zsofia
2017-01-01
Vpr is an HIV-1 accessory protein that plays numerous roles during viral replication, and some of which are cell type dependent. To test the hypothesis that HIV-1 tropism extends beyond the envelope into the vpr gene, studies were performed to identify the associations between coreceptor usage and Vpr variation in HIV-1-infected patients. Colinear HIV-1 Env-V3 and Vpr amino acid sequences were obtained from the LANL HIV-1 sequence database and from well-suppressed patients in the Drexel/Temple Medicine CNS AIDS Research and Eradication Study (CARES) Cohort. Genotypic classification of Env-V3 sequences as X4 (CXCR4-utilizing) or R5 (CCR5-utilizing) was used to group colinear Vpr sequences. To reveal the sequences associated with a specific coreceptor usage genotype, Vpr amino acid sequences were assessed for amino acid diversity and Jensen-Shannon divergence between the two groups. Five amino acid alphabets were used to comprehensively examine the impact of amino acid substitutions involving side chains with similar physiochemical properties. Positions 36, 37, 41, 89, and 96 of Vpr were characterized by statistically significant divergence across multiple alphabets when X4 and R5 sequence groups were compared. In addition, consensus amino acid switches were found at positions 37 and 41 in comparisons of the R5 and X4 sequence populations. These results suggest an evolutionary link between Vpr and gp120 in HIV-1-infected patients. PMID:28620613
Influenza A virus hemagglutinin glycosylation compensates for antibody escape fitness costs.
Kosik, Ivan; Ince, William L; Gentles, Lauren E; Oler, Andrew J; Kosikova, Martina; Angel, Matthew; Magadán, Javier G; Xie, Hang; Brooke, Christopher B; Yewdell, Jonathan W
2018-01-01
Rapid antigenic evolution enables the persistence of seasonal influenza A and B viruses in human populations despite widespread herd immunity. Understanding viral mechanisms that enable antigenic evolution is critical for designing durable vaccines and therapeutics. Here, we utilize the primerID method of error-correcting viral population sequencing to reveal an unexpected role for hemagglutinin (HA) glycosylation in compensating for fitness defects resulting from escape from anti-HA neutralizing antibodies. Antibody-free propagation following antigenic escape rapidly selected viruses with mutations that modulated receptor binding avidity through the addition of N-linked glycans to the HA globular domain. These findings expand our understanding of the viral mechanisms that maintain fitness during antigenic evolution to include glycan addition, and highlight the immense power of high-definition virus population sequencing to reveal novel viral adaptive mechanisms.
Finding and identifying the viral needle in the metagenomic haystack: trends and challenges
Soueidan, Hayssam; Schmitt, Louise-Amélie; Candresse, Thierry; Nikolski, Macha
2015-01-01
Collectively, viruses have the greatest genetic diversity on Earth, occupy extremely varied niches and are likely able to infect all living organisms. Viral infections are an important issue for human health and cause considerable economic losses when agriculturally important crops or husbandry animals are infected. The advent of metagenomics has provided a precious tool to study viruses by sampling them in natural environments and identifying the genomic composition of a sample. However, reaching a clear recognition and taxonomic assignment of the identified viruses has been hampered by the computational difficulty of these problems. In this perspective paper we examine the trends in current research for the identification of viral sequences in a metagenomic sample, pinpoint the intrinsic computational difficulties for the identification of novel viral sequences within metagenomic samples, and suggest possible avenues to overcome them. PMID:25610431
Shedding new light on viral photosynthesis.
Puxty, Richard J; Millard, Andrew D; Evans, David J; Scanlan, David J
2015-10-01
Viruses infecting the environmentally important marine cyanobacteria Prochlorococcus and Synechococcus encode 'auxiliary metabolic genes' (AMGs) involved in the light and dark reactions of photosynthesis. Here, we discuss progress on the inventory of such AMGs in the ever-increasing number of viral genome sequences as well as in metagenomic datasets. We contextualise these gene acquisitions with reference to a hypothesised fitness gain to the phage. We also report new evidence with regard to the sequence and predicted structural properties of viral petE genes encoding the soluble electron carrier plastocyanin. Viral copies of PetE exhibit extensive modifications to the N-terminal signal peptide and possess several novel residues in a region responsible for interaction with redox partners. We also highlight potential knowledge gaps in this field and discuss future opportunities to discover novel phage-host interactions involved in the photosynthetic process.
Can the HIV-1 splicing machinery be targeted for drug discovery?
Dlamini, Zodwa; Hull, Rodney
2017-01-01
HIV-1 is able to express multiple protein types and isoforms from a single 9 kb mRNA transcript. These proteins are also expressed at particular stages of viral development, and this is achieved through the control of alternative splicing and the export of these transcripts from the nucleus. The nuclear export is controlled by the HIV protein Rev being required to transport incompletely spliced and partially spliced mRNA from the nucleus where they are normally retained. This implies a close relationship between the control of alternate splicing and the nuclear export of mRNA in the control of HIV-1 viral proliferation. This review discusses both the processes. The specificity and regulation of splicing in HIV-1 is controlled by the use of specific splice sites as well as exonic splicing enhancer and exonic splicing silencer sequences. The use of these silencer and enhancer sequences is dependent on the serine arginine family of proteins as well as the heterogeneous nuclear ribonucleoprotein family of proteins that bind to these sequences and increase or decrease splicing. Since alternative splicing is such a critical factor in viral development, it presents itself as a promising drug target. This review aims to discuss the inhibition of splicing, which would stall viral development, as an anti-HIV therapeutic strategy. In this review, the most recent knowledge of splicing in human immunodeficiency viral development and the latest therapeutic strategies targeting human immunodeficiency viral splicing are discussed. PMID:28331370
Watanabe, Yoshiyuki; Yamamoto, Hiroyuki; Oikawa, Ritsuko; Toyota, Minoru; Yamamoto, Masakazu; Kokudo, Norihiro; Tanaka, Shinji; Arii, Shigeki; Yotsuyanagi, Hiroshi; Koike, Kazuhiko; Itoh, Fumio
2015-01-01
Integration of DNA viruses into the human genome plays an important role in various types of tumors, including hepatitis B virus (HBV)–related hepatocellular carcinoma. However, the molecular details and clinical impact of HBV integration on either human or HBV epigenomes are unknown. Here, we show that methylation of the integrated HBV DNA is related to the methylation status of the flanking human genome. We developed a next-generation sequencing-based method for structural methylation analysis of integrated viral genomes (denoted G-NaVI). This method is a novel approach that enables enrichment of viral fragments for sequencing using unique baits based on the sequence of the HBV genome. We detected integrated HBV sequences in the genome of the PLC/PRF/5 cell line and found variable levels of methylation within the integrated HBV genomes. Allele-specific methylation analysis revealed that the HBV genome often became significantly methylated when integrated into highly methylated host sites. After integration into unmethylated human genome regions such as promoters, however, the HBV DNA remains unmethylated and may eventually play an important role in tumorigenesis. The observed dynamic changes in DNA methylation of the host and viral genomes may functionally affect the biological behavior of HBV. These findings may impact public health given that millions of people worldwide are carriers of HBV. We also believe our assay will be a powerful tool to increase our understanding of the various types of DNA virus-associated tumorigenesis. PMID:25653310
A robust and cost-effective approach to sequence and analyze complete genomes of small RNA viruses
USDA-ARS?s Scientific Manuscript database
Background: Next-generation sequencing (NGS) allows ultra-deep sequencing of nucleic acids. The use of sequence-independent amplification of viral nucleic acids without utilization of target-specific primers provides advantages over traditional sequencing methods and allows detection of unsuspected ...
New parvovirus in child with unexplained diarrhea, Tunisia.
Phan, Tung G; Sdiri-Loulizi, Khira; Aouni, Mahjoub; Ambert-Balay, Katia; Pothier, Pierre; Deng, Xutao; Delwart, Eric
2014-11-01
A divergent parvovirus genome was the only eukaryotic viral sequence detected in feces of a Tunisian child with unexplained diarrhea. Tusavirus 1 shared 44% and 39% identity with the nonstructural protein 1 and viral protein 1, respectively, of the closest genome, Kilham rat parvovirus, indicating presence of a new human viral species in the Protoparvovirus genus.
New Parvovirus in Child with Unexplained Diarrhea, Tunisia
Phan, Tung G.; Sdiri-Loulizi, Khira; Aouni, Mahjoub; Ambert-Balay, Katia; Pothier, Pierre; Deng, Xutao
2014-01-01
A divergent parvovirus genome was the only eukaryotic viral sequence detected in feces of a Tunisian child with unexplained diarrhea. Tusavirus 1 shared 44% and 39% identity with the nonstructural protein 1 and viral protein 1, respectively, of the closest genome, Kilham rat parvovirus, indicating presence of a new human viral species in the Protoparvovirus genus. PMID:25340816
NASA Astrophysics Data System (ADS)
Emerson, J. B.; Brum, J. R.; Roux, S.; Bolduc, B.; Woodcroft, B. J.; Singleton, C. M.; Boyd, J. A.; Hodgkins, S. B.; Wilson, R.; Trubl, G. G.; Jang, H. B.; Crill, P. M.; Chanton, J.; Saleska, S. R.; Rich, V. I.; Tyson, G. W.; Sullivan, M. B.
2016-12-01
Methane and carbon dioxide emissions, which are under significant microbial control, provide positive feedbacks to climate change in thawing permafrost peatlands. Although viruses in marine systems have been shown to impact microbial ecology and biogeochemical cycling through host cell lysis, horizontal gene transfer, and auxiliary metabolic gene expression, viral ecology in permafrost and other soils remains virtually unstudied due to methodological challenges. Here, we identified viral sequences in 208 assembled bulk soil metagenomes derived from a permafrost thaw gradient in Stordalen Mire, northern Sweden, from 2010-2012. 2,048 viral populations were recovered, which genome- and network-based classification revealed to be largely novel, increasing known viral genera globally by 40%. Ecologically, viral communities differed significantly across the thaw gradient and by soil depth. Co-occurring microbial community composition, soil moisture, and pH were predictors of viral community composition, indicative of biological and biogeochemical feedbacks as permafrost thaws. Host prediction—achieved through clustered regularly interspaced short palindromic repeats (CRISPRs), tetranucleotide frequency patterns, and other sequence similarities to binned microbial population genomes—was able to link 38% of the viral populations to a microbial host. 5% of the implicated hosts were archaea, predominantly methanogens and ammonia-oxidizing Nitrososphaera, 45% were Acidobacteria or Verrucomicrobia (mostly predicted heterotrophic complex carbon degraders), and 21% were Proteobacteria, including methane oxidizers. Recovered viral genome fragments also contained auxiliary metabolic genes involved in carbon and nitrogen cycling. Together, these data reveal multiple levels of previously unknown viral contributions to biogeochemical cycling, including to carbon gas emissions, in peatland soils undergoing and contributing to climate change. This work represents a significant step towards understanding viral roles in microbially-mediated biogeochemical cycling in soil.
Yoshida, Mitsuhiro; Mochizuki, Tomohiro; Urayama, Syun-Ichi; Yoshida-Takashima, Yukari; Nishi, Shinro; Hirai, Miho; Nomaki, Hidetaka; Takaki, Yoshihiro; Nunoura, Takuro; Takai, Ken
2018-01-01
Previous studies on marine environmental virology have primarily focused on double-stranded DNA (dsDNA) viruses; however, it has recently been suggested that single-stranded DNA (ssDNA) viruses are more abundant in marine ecosystems. In this study, we performed a quantitative viral community DNA analysis to estimate the relative abundance and composition of both ssDNA and dsDNA viruses in offshore upper bathyal sediment from Tohoku, Japan (water depth = 500 m). The estimated dsDNA viral abundance ranged from 3 × 106 to 5 × 106 genome copies per cm3 sediment, showing values similar to the range of fluorescence-based direct virus counts. In contrast, the estimated ssDNA viral abundance ranged from 1 × 108 to 3 × 109 genome copies per cm3 sediment, thus providing an estimation that the ssDNA viral populations represent 96.3–99.8% of the benthic total DNA viral assemblages. In the ssDNA viral metagenome, most of the identified viral sequences were associated with ssDNA viral families such as Circoviridae and Microviridae. The principle components analysis of the ssDNA viral sequence components from the sedimentary ssDNA viral metagenomic libraries found that the different depth viral communities at the study site all exhibited similar profiles compared with deep-sea sediment ones at other reference sites. Our results suggested that deep-sea benthic ssDNA viruses have been significantly underestimated by conventional direct virus counts and that their contributions to deep-sea benthic microbial mortality and geochemical cycles should be further addressed by such a new quantitative approach. PMID:29467725
Dal Bosco, Daniela; Sinski, Iraci; Ritschel, Patrícia S; Camargo, Umberto A; Fajardo, Thor V M; Harakava, Ricardo; Quecini, Vera
2018-06-06
Increased tolerance to pathogens is an important goal in conventional and biotechnology-assisted grapevine breeding programs worldwide. Fungal and viral pathogens cause direct losses in berry production, but also affect the quality of the final products. Precision breeding strategies allow the introduction of resistance characters in elite cultivars, although the factors determining the plant's overall performance are not fully characterized. Grapevine plants expressing defense proteins, from fungal or plant origins, or of the coat protein gene of grapevine leafroll-associated virus 3 (GLRaV-3) were generated by Agrobacterium-mediated transformation of somatic embryos and shoot apical meristems. The responses of the transformed lines to pathogen challenges were investigated by biochemical, phytopathological and molecular methods. The expression of a Metarhizium anisopliae chitinase gene delayed pathogenesis and disease progression against the necrotrophic pathogen Botrytis cinerea. Modified lines expressing a Solanum nigrum osmotin-like protein also exhibited slower disease progression, but to a smaller extent. Grapevine lines carrying two hairpin-inducing constructs had lower GLRaV-3 titers when challenged by grafting, although disease symptoms and viral multiplication were detected. The levels of global genome methylation were determined for the genetically engineered lines, and correlation analyses demonstrated the association between higher levels of methylated DNA and larger portions of virus-derived sequences. Resistance expression was also negatively correlated with the contents of introduced viral sequences and genome methylation, indicating that the effectiveness of resistance strategies employing sequences of viral origin is subject to epigenetic regulation in grapevine.
Paldurai, Anandan; Kim, Shin-Hee; Nayak, Baibaswata; Xiao, Sa; Shive, Heather; Collins, Peter L.
2014-01-01
ABSTRACT Naturally occurring Newcastle disease virus (NDV) strains vary greatly in virulence. The presence of multibasic residues at the proteolytic cleavage site of the fusion (F) protein has been shown to be a primary determinant differentiating virulent versus avirulent strains. However, there is wide variation in virulence among virulent strains. There also are examples of incongruity between cleavage site sequence and virulence. These observations suggest that additional viral factors contribute to virulence. In this study, we evaluated the contribution of each viral gene to virulence individually and in different combinations by exchanging genes between velogenic (highly virulent) strain GB Texas (GBT) and mesogenic (moderately virulent) strain Beaudette C (BC). These two strains are phylogenetically closely related, and their F proteins contain identical cleavage site sequences, 112RRQKR↓F117. A total of 20 chimeric viruses were constructed and evaluated in vitro, in 1-day-old chicks, and in 2-week-old chickens. The results showed that both the envelope-associated and polymerase-associated proteins contribute to the difference in virulence between rBC and rGBT, with the envelope-associated proteins playing the greater role. The F protein was the major individual contributor and was sometimes augmented by the homologous M and HN proteins. The dramatic effect of F was independent of its cleavage site sequence since that was identical in the two strains. The polymerase L protein was the next major individual contributor and was sometimes augmented by the homologous N and P proteins. The leader and trailer regions did not appear to contribute to the difference in virulence between BC and GBT. IMPORTANCE This study is the first comprehensive and systematic study of NDV virulence and pathogenesis. Genetic exchanges between a mesogenic and a velogenic strain revealed that the fusion glycoprotein is the major virulence determinant regardless of the identical virulence protease cleavage site sequence present in both strains. The contribution of the large polymerase protein to NDV virulence is second only to that of the fusion glycoprotein. The identification of virulence determinants is of considerable importance, because of the potential to generate better live attenuated NDV vaccines. It may also be possible to apply these findings to other paramyxoviruses. PMID:24850737
Sequence features of viral and human Internal Ribosome Entry Sites predictive of their activity
Elias-Kirma, Shani; Nir, Ronit; Segal, Eran
2017-01-01
Translation of mRNAs through Internal Ribosome Entry Sites (IRESs) has emerged as a prominent mechanism of cellular and viral initiation. It supports cap-independent translation of select cellular genes under normal conditions, and in conditions when cap-dependent translation is inhibited. IRES structure and sequence are believed to be involved in this process. However due to the small number of IRESs known, there have been no systematic investigations of the determinants of IRES activity. With the recent discovery of thousands of novel IRESs in human and viruses, the next challenge is to decipher the sequence determinants of IRES activity. We present the first in-depth computational analysis of a large body of IRESs, exploring RNA sequence features predictive of IRES activity. We identified predictive k-mer features resembling IRES trans-acting factor (ITAF) binding motifs across human and viral IRESs, and found that their effect on expression depends on their sequence, number and position. Our results also suggest that the architecture of retroviral IRESs differs from that of other viruses, presumably due to their exposure to the nuclear environment. Finally, we measured IRES activity of synthetically designed sequences to confirm our prediction of increasing activity as a function of the number of short IRES elements. PMID:28922394
Oka, Tomoichiro; Doan, Yen Hai; Shimoike, Takashi; Haga, Kei; Takizawa, Takenori
2017-12-01
Sapoviruses (SaVs) are enteric viruses and have been detected in various mammals. They are divided into multiple genogroups and genotypes based on the entire major capsid protein (VP1) encoding region sequences. In this study, we determined the first complete genome sequences of two genogroup V, genotype 3 (GV.3) SaV strains detected from swine fecal samples, in combination with Illumina MiSeq sequencing of the libraries prepared from viral RNA and PCR products. The lengths of the viral genome (7494 nucleotides [nt] excluding polyA tail) and short 5'-untranslated region (14 nt) as well as two predicted open reading frames are similar to those of other SaVs. The amino acid differences between the two porcine SaVs are most frequent in the central region of the VP1-encoding region. A stem-loop structure which was predicted in the first 41 nt of the 5'-terminal region of GV.3 SaVs and the other available complete genome sequences of SaVs may have a critical role in viral genome replication. Our study provides complete genome sequences of rarely reported GV.3 SaV strains and highlights the common 5'-terminal genomic feature of SaVs detected from different mammalian species.
Ndunguru, Joseph; Taylor, Nigel J; Yadav, Jitender; Aly, Haytham; Legg, James P; Aveling, Terry; Thompson, Graham; Fauquet, Claude M
2005-05-18
Plant viral diseases present major constraints to crop production. Effective sampling of the viruses infecting plants is required to facilitate their molecular study and is essential for the development of crop protection and improvement programs. Retaining integrity of viral pathogens within sampled plant tissues is often a limiting factor in this process, most especially when sample sizes are large and when operating in developing counties and regions remote from laboratory facilities. FTA is a paper-based system designed to fix and store nucleic acids directly from fresh tissues pressed into the treated paper. We report here the use of FTA as an effective technology for sampling and retrieval of DNA and RNA viruses from plant tissues and their subsequent molecular analysis. DNA and RNA viruses were successfully recovered from leaf tissues of maize, cassava, tomato and tobacco pressed into FTA Classic Cards. Viral nucleic acids eluted from FTA cards were found to be suitable for diagnostic molecular analysis by PCR-based techniques and restriction analysis, and for cloning and nucleotide sequencing in a manner equivalent to that offered by tradition isolation methods. Efficacy of the technology was demonstrated both from sampled greenhouse-grown plants and from leaf presses taken from crop plants growing in farmer's fields in East Africa. In addition, FTA technology was shown to be suitable for recovery of viral-derived transgene sequences integrated into the plant genome. Results demonstrate that FTA is a practical, economical and sensitive method for sampling, storage and retrieval of viral pathogens and plant genomic sequences, when working under controlled conditions and in the field. Application of this technology has the potential to significantly increase ability to bring modern analytical techniques to bear on the viral pathogens infecting crop plants.
Egge, Elianne Sirnæs; Johannessen, Torill Vik; Andersen, Tom; Eikrem, Wenche; Bittner, Lucie; Larsen, Aud; Sandaa, Ruth-Anne; Edvardsen, Bente
2015-01-01
Microalgae in the division Haptophyta play key roles in the marine ecosystem and in global biogeochemical processes. Despite their ecological importance, knowledge on seasonal dynamics, community composition and abundance at the species level is limited due to their small cell size and few morphological features visible under the light microscope. Here, we present unique data on haptophyte seasonal diversity and dynamics from two annual cycles, with the taxonomic resolution and sampling depth obtained with high-throughput sequencing. From outer Oslofjorden, S Norway, nano- and picoplanktonic samples were collected monthly for 2 years, and the haptophytes targeted by amplification of RNA/cDNA with Haptophyta-specific 18S rDNA V4 primers. We obtained 156 operational taxonomic units (OTUs), from c. 400.000 454 pyrosequencing reads, after rigorous bioinformatic filtering and clustering at 99.5%. Most OTUs represented uncultured and/or not yet 18S rDNA-sequenced species. Haptophyte OTU richness and community composition exhibited high temporal variation and significant yearly periodicity. Richness was highest in September–October (autumn) and lowest in April–May (spring). Some taxa were detected all year, such as Chrysochromulina simplex, Emiliania huxleyi and Phaeocystis cordata, whereas most calcifying coccolithophores only appeared from summer to early winter. We also revealed the seasonal dynamics of OTUs representing putative novel classes (clades HAP-3–5) or orders (clades D, E, F). Season, light and temperature accounted for 29% of the variation in OTU composition. Residual variation may be related to biotic factors, such as competition and viral infection. This study provides new, in-depth knowledge on seasonal diversity and dynamics of haptophytes in North Atlantic coastal waters. PMID:25893259
Egge, Elianne Sirnaes; Johannessen, Torill Vik; Andersen, Tom; Eikrem, Wenche; Bittner, Lucie; Larsen, Aud; Sandaa, Ruth-Anne; Edvardsen, Bente
2015-06-01
Microalgae in the division Haptophyta play key roles in the marine ecosystem and in global biogeochemical processes. Despite their ecological importance, knowledge on seasonal dynamics, community composition and abundance at the species level is limited due to their small cell size and few morphological features visible under the light microscope. Here, we present unique data on haptophyte seasonal diversity and dynamics from two annual cycles, with the taxonomic resolution and sampling depth obtained with high-throughput sequencing. From outer Oslofjorden, S Norway, nano- and picoplanktonic samples were collected monthly for 2 years, and the haptophytes targeted by amplification of RNA/cDNA with Haptophyta-specific 18S rDNA V4 primers. We obtained 156 operational taxonomic units (OTUs), from c. 400.000 454 pyrosequencing reads, after rigorous bioinformatic filtering and clustering at 99.5%. Most OTUs represented uncultured and/or not yet 18S rDNA-sequenced species. Haptophyte OTU richness and community composition exhibited high temporal variation and significant yearly periodicity. Richness was highest in September-October (autumn) and lowest in April-May (spring). Some taxa were detected all year, such as Chrysochromulina simplex, Emiliania huxleyi and Phaeocystis cordata, whereas most calcifying coccolithophores only appeared from summer to early winter. We also revealed the seasonal dynamics of OTUs representing putative novel classes (clades HAP-3-5) or orders (clades D, E, F). Season, light and temperature accounted for 29% of the variation in OTU composition. Residual variation may be related to biotic factors, such as competition and viral infection. This study provides new, in-depth knowledge on seasonal diversity and dynamics of haptophytes in North Atlantic coastal waters. © 2015 The Authors. Molecular Ecology Published by John Wiley & Sons Ltd.
Fun, Axel; Leitner, Thomas; Vandekerckhove, Linos; Däumer, Martin; Thielen, Alexander; Buchholz, Bernd; Hoepelman, Andy I M; Gisolf, Elizabeth H; Schipper, Pauline J; Wensing, Annemarie M J; Nijhuis, Monique
2018-01-05
Emergence of resistance against integrase inhibitor raltegravir in human immunodeficiency virus type 1 (HIV-1) patients is generally associated with selection of one of three signature mutations: Y143C/R, Q148K/H/R or N155H, representing three distinct resistance pathways. The mechanisms that drive selection of a specific pathway are still poorly understood. We investigated the impact of the HIV-1 genetic background and population dynamics on the emergence of raltegravir resistance. Using deep sequencing we analyzed the integrase coding sequence (CDS) in longitudinal samples from five patients who initiated raltegravir plus optimized background therapy at viral loads > 5000 copies/ml. To investigate the role of the HIV-1 genetic background we created recombinant viruses containing the viral integrase coding region from pre-raltegravir samples from two patients in whom raltegravir resistance developed through different pathways. The in vitro selections performed with these recombinant viruses were designed to mimic natural population bottlenecks. Deep sequencing analysis of the viral integrase CDS revealed that the virological response to raltegravir containing therapy inversely correlated with the relative amount of unique sequence variants that emerged suggesting diversifying selection during drug pressure. In 4/5 patients multiple signature mutations representing different resistance pathways were observed. Interestingly, the resistant population can consist of a single resistant variant that completely dominates the population but also of multiple variants from different resistance pathways that coexist in the viral population. We also found evidence for increased diversification after stronger bottlenecks. In vitro selections with low viral titers, mimicking population bottlenecks, revealed that both recombinant viruses and HXB2 reference virus were able to select mutations from different resistance pathways, although typically only one resistance pathway emerged in each individual culture. The generation of a specific raltegravir resistant variant is not predisposed in the genetic background of the viral integrase CDS. Typically, in the early phases of therapy failure the sequence space is explored and multiple resistance pathways emerge and then compete for dominance which frequently results in a switch of the dominant population over time towards the fittest variant or even multiple variants of similar fitness that can coexist in the viral population.
Three closely related herpesviruses are associated with fibropapillomatosis in marine turtles
Quackenbush, S.L.; Work, Thierry M.; Balazs, George H.; Casey, Rufina N.; Rovnak, J.; Chaves, A.; duToit, L.; Baines, J.D.; Parrish, C.R.; Bowser, Paul R.; Casey, James W.
1998-01-01
Green turtle fibropapillomatosis is a neoplastic disease of increasingly significant threat to the survivability of this species. Degenerate PCR primers that target highly conserved regions of genes encoding herpesvirus DNA polymerases were used to amplify a DNA sequence from fibropapillomas and fibromas from Hawaiian and Florida green turtles. All of the tumors tested (n= 23) were found to harbor viral DNA, whereas no viral DNA was detected in skin biopsies from tumor-negative turtles. The tissue distribution of the green turtle herpesvirus appears to be generally limited to tumors where viral DNA was found to accumulate at approximately two to five copies per cell and is occasionally detected, only by PCR, in some tissues normally associated with tumor development. In addition, herpesviral DNA was detected in fibropapillomas from two loggerhead and four olive ridley turtles. Nucleotide sequencing of a 483-bp fragment of the turtle herpesvirus DNA polymerase gene determined that the Florida green turtle and loggerhead turtle sequences are identical and differ from the Hawaiian green turtle sequence by five nucleotide changes, which results in two amino acid substitutions. The olive ridley sequence differs from the Florida and Hawaiian green turtle sequences by 15 and 16 nucleotide changes, respectively, resulting in four amino acid substitutions, three of which are unique to the olive ridley sequence. Our data suggest that these closely related turtle herpesviruses are intimately involved in the genesis of fibropapillomatosis.
Recently Patented Viral Nucleotide Sequences and Generation of Virus-Derived Vaccines.
Venkataraman, Srividhya; Ahmad, Tauqeer; Haidar, Mounir A; Hefferon, Kathleen L
2017-01-01
With an increase in comprehension of the molecular biology of viruses, there has been a recent surge in the application of virus sequences and viral gene expression strategies towards the diagnosis and treatment of diseases. The scope of the patenting landscape has widened as a result and the current review discusses patents pertaining to live / attenuated viral vaccines. The vaccines addressed here have been developed by both conventional means as well as by the state-of-the-art genetic engineering techniques. This review also addresses the applications of these patents for clinical and biotechnological purposes. Copyright© Bentham Science Publishers; For any queries, please email at epub@benthamscience.org.
Sequential Bottlenecks Drive Viral Evolution in Early Acute Hepatitis C Virus Infection
McElroy, Kerensa; Gaudieri, Silvana; Pham, Son T.; Chopra, Abha; Cameron, Barbara; Maher, Lisa; Dore, Gregory J.; White, Peter A.; Lloyd, Andrew R.
2011-01-01
Hepatitis C is a pandemic human RNA virus, which commonly causes chronic infection and liver disease. The characterization of viral populations that successfully initiate infection, and also those that drive progression to chronicity is instrumental for understanding pathogenesis and vaccine design. A comprehensive and longitudinal analysis of the viral population was conducted in four subjects followed from very early acute infection to resolution of disease outcome. By means of next generation sequencing (NGS) and standard cloning/Sanger sequencing, genetic diversity and viral variants were quantified over the course of the infection at frequencies as low as 0.1%. Phylogenetic analysis of reassembled viral variants revealed acute infection was dominated by two sequential bottleneck events, irrespective of subsequent chronicity or clearance. The first bottleneck was associated with transmission, with one to two viral variants successfully establishing infection. The second occurred approximately 100 days post-infection, and was characterized by a decline in viral diversity. In the two subjects who developed chronic infection, this second bottleneck was followed by the emergence of a new viral population, which evolved from the founder variants via a selective sweep with fixation in a small number of mutated sites. The diversity at sites with non-synonymous mutation was higher in predicted cytotoxic T cell epitopes, suggesting immune-driven evolution. These results provide the first detailed analysis of early within-host evolution of HCV, indicating strong selective forces limit viral evolution in the acute phase of infection. PMID:21912520
Replication of a chronic hepatitis B virus genotype F1b construct.
Hernández, Sergio; Jiménez, Gustavo; Alarcón, Valentina; Prieto, Cristian; Muñoz, Francisca; Riquelme, Constanza; Venegas, Mauricio; Brahm, Javier; Loyola, Alejandra; Villanueva, Rodrigo A
2016-03-01
Genotype F is one of the less-studied genotypes of human hepatitis B virus, although it is widely distributed in regions of Central and South American. Our previous studies have shown that HBV genotype F is prevalent in Chile, and phylogenetic analysis of its full-length sequence amplified from the sera of chronically infected patients identified it as HBV subgenotype F1b. We have previously reported the full-length sequence of a HBV molecular clone obtained from a patient chronically infected with genotype F1b. In this report, we established a system to study HBV replication based on hepatoma cell lines transfected with full-length monomers of the HBV genome. Culture supernatants were analyzed after transfection and found to contain both HBsAg and HBeAg viral antigens. Consistently, fractionated cell extracts revealed the presence of viral replication, with both cytoplasmic and nuclear DNA intermediates. Analysis of HBV-transfected cells by indirect immunofluorescence or immunoelectron microscopy revealed the expression of viral antigens and cytoplasmic viral particles, respectively. To test the functionality of the ongoing viral replication further at the level of chromatinized cccDNA, transfected cells were treated with a histone deacetylase inhibitor, and this resulted in increased viral replication. This correlated with changes posttranslational modifications of histones at viral promoters. Thus, the development of this viral replication system for HBV genotype F will facilitate studies on the regulation of viral replication and the identification of new antiviral drugs.
Capul, Althea A; de la Torre, Juan Carlos; Buchmeier, Michael J
2011-04-01
Arenaviruses are negative-strand RNA viruses that cause human diseases such as lymphocytic choriomeningitis, Bolivian hemorrhagic fever, and Lassa hemorrhagic fever. No licensed vaccines exist, and current treatment is limited to ribavirin. The prototypic arenavirus, lymphocytic choriomeningitis virus (LCMV), is a model for dissecting virus-host interactions in persistent and acute disease. The RING finger protein Z has been identified as the driving force of arenaviral budding and acts as the viral matrix protein. While residues in Z required for viral budding have been described, residues that govern the Z matrix function(s) have yet to be fully elucidated. Because this matrix function is integral to viral assembly, we reasoned that this would be reflected in sequence conservation. Using sequence alignment, we identified several conserved residues in Z outside the RING and late domains. Nine residues were each mutated to alanine in Lassa fever virus Z. All of the mutations affected the expression of an LCMV minigenome and the infectivity of virus-like particles, but to greatly varying degrees. Interestingly, no mutations appeared to affect Z-mediated budding or association with viral GP. Our findings provide direct experimental evidence supporting a role for Z in the modulation of the activity of the viral ribonucleoprotein (RNP) complex and its packaging into mature infectious viral particles.
Vega Thurber, Rebecca L.; Barott, Katie L.; Hall, Dana; Liu, Hong; Rodriguez-Mueller, Beltran; Desnues, Christelle; Edwards, Robert A.; Haynes, Matthew; Angly, Florent E.; Wegley, Linda; Rohwer, Forest L.
2008-01-01
During the last several decades corals have been in decline and at least one-third of all coral species are now threatened with extinction. Coral disease has been a major contributor to this threat, but little is known about the responsible pathogens. To date most research has focused on bacterial and fungal diseases; however, viruses may also be important for coral health. Using a combination of empirical viral metagenomics and real-time PCR, we show that Porites compressa corals contain a suite of eukaryotic viruses, many related to the Herpesviridae. This coral-associated viral consortium was found to shift in response to abiotic stressors. In particular, when exposed to reduced pH, elevated nutrients, and thermal stress, the abundance of herpes-like viral sequences rapidly increased in 2 separate experiments. Herpes-like viral sequences were rarely detected in apparently healthy corals, but were abundant in a majority of stressed samples. In addition, surveys of the Nematostella and Hydra genomic projects demonstrate that even distantly related Cnidarians contain numerous herpes-like viral genes, likely as a result of latent or endogenous viral infection. These data support the hypotheses that corals experience viral infections, which are exacerbated by stress, and that herpes-like viruses are common in Cnidarians. PMID:19017800
Singh, Mini Pritam; Majumdar, Manasi; Thapa, Babu Ram; Gupta, Puneet Kumar; Khurana, Jasmine; Budhathoki, Bimal; Ratho, Radha Kanta
2015-01-01
Background & objectives: Hepatitis A virus usually causes acute viral hepatitis (AVH) in the paediatric age group with a recent shift in age distribution and disease manifestations like acute liver failure (ALF). This has been attributed to mutations in 5’non-translated region (5’NTR) which affects the viral multiplication. The present study was aimed to carry out the molecular detection and phylogenetic analysis of hepatitis A virus strains circulating in north western India. Methods: Serum samples from in patients and those attending out patient department of Pediatric Gastroenterology in a tertiary care hospital in north India during 2007-2011 with clinically suspected AVH were tested for anti-hepatitis A virus (HAV) IgM antibodies. Acute phase serum samples were subjected to nested PCR targeting the 5’NTR region followed by sequencing of the representative strains. Results: A total of 1334 samples were tested, 290 (21.7%) were positive for anti-HAV IgM antibody. Of these, 78 serum samples (< 7 days old) were subjected to PCR and 47.4% (37/78) samples showed the presence of HAV RNA. Children < 15 yr of age accounted for majority (94%) of cases with highest seropositivity during rainy season. Sequencing of 15 representative strains was carried out and the circulating genotype was found to be III A. The nucleotide sequences showed high homology among the strains with a variation ranging from 0.1-1 per cent over the years. An important substitution of G to A at 324 position was shown by both AVH and ALF strains. The cumulative substitution in AVH strains Vs ALF strains as compared to GBM, Indian and prototype strain in the 200-500 region of 5’ NTR was comparable. Interpretation & conclusion: Our results showed hepatitis A still a disease of children with III A as a circulating genotype in this region. The mutations at 5’NTR region warrant further analysis as these affect the structure of internal ribosomal entry site which is important for viral replication. PMID:25900957
Structure and Temporal Dynamics of Populations within Wheat Streak Mosaic Virus Isolates
Hall, Jeffrey S.; French, Roy; Morris, T. Jack; Stenger, Drake C.
2001-01-01
Variation within the Type and Sidney 81 strains of wheat streak mosaic virus was assessed by single-strand conformation polymorphism (SSCP) analysis and confirmed by nucleotide sequencing. Limiting-dilution subisolates (LDSIs) of each strain were evaluated for polymorphism in the P1, P3, NIa, and CP cistrons. Different SSCP patterns among LDSIs of a strain were associated with single-nucleotide substitutions. Sidney 81 LDSI-S10 was used as founding inoculum to establish three lineages each in wheat, corn, and barley. The P1, HC-Pro, P3, CI, NIa, NIb, and CP cistrons of LDSI-S10 and each lineage at passages 1, 3, 6, and 9 were evaluated for polymorphism. By passage 9, each lineage differed in consensus sequence from LDSI-S10. The majority of substitutions occurred within NIa and CP, although at least one change occurred in each cistron except HC-Pro and P3. Most consensus sequence changes among lineages were independent, with substitutions accumulating over time. However, LDSI-S10 bore a variant nucleotide (G6016) in NIa that was restored to A6016 in eight of nine lineages by passage 6. This near-global reversion is most easily explained by selection. Examination of nonconsensus variation revealed a pool of unique substitutions (singletons) that remained constant in frequency during passage, regardless of the host species examined. These results suggest that mutations arising by viral polymerase error are generated at a constant rate but that most newly generated mutants are sequestered in virions and do not serve as replication templates. Thus, a substantial fraction of variation generated is static and has yet to be tested for relative fitness. In contrast, nonsingleton variation increased upon passage, suggesting that some mutants do serve as replication templates and may become established in a population. Replicated mutants may or may not rise to prominence to become the consensus sequence in a lineage, with the fate of any particular mutant subject to selection and stochastic processes such as genetic drift and population growth factors. PMID:11581391
Dinu, Sorin; Calistru, Petre-Iacob; Ceauşu, Emanoil; Târdeil, Graţiela; Oprişan, Gabriela
2015-01-01
Although the European recommendations include the use of new antiviral drugs for the treatment of hepatitis C, in Romania the current treatment remains interferon plus ribavirin. First generation viral protease inhibitors (i.e. boceprevir, telaprevir), which have raised the chances of obtaining viral clearance in up to 70% of infection cases produced by genotype 1 isolates, have not been introduced yet as standard treatment in our country. The success of these new antivirals is limited by the occurrence and selection of resistance mutations during therapy. We set-up a molecular study aiming to detect any resistance mutations to boceprevir and telaprevir harbored by hepatitis C isolates infecting Romanian patients naïve to viral protease inhibitors. Since these new antivirals are efficient and approved for genotype 1 infection, viral samples were genotyped following a protocol previously developed by our research group. We analyzed by both population sequencing and molecular cloning and sequencing the NS3 protease region of hepatitis C virus isolates infecting patients which were not previously exposed to boceprevir and telaprevir. All the analyzed samples were subtype 1b and resembled the samples collected in recent years from Romanian patients. Molecular cloning followed by sequencing showed great intra-host diversity, which is known to represent the source of isolates with different resistance phenotypes. Both population sequencing and molecular cloning followed by clone sequencing revealed two boceprevir resistance mutations (T54S and V55A), respectively, a telaprevir resistance mutation (T54S) in the sequences obtained from a patient with chronic hepatitis C. To our knowledge, this is the first study indicating the existence of pre-treatment resistance mutations to boceprevir and telaprevir in hepatitis C virus isolates infecting Romanian patients.
The organisation and interviral homologies of genes at the 3' end of tobacco rattle virus RNA1
Boccara, Martine; Hamilton, William D. O.; Baulcombe, David C.
1986-01-01
The RNA1 of tobacco rattle virus (TRV) has been cloned as cDNA and the nucleotide sequence determined of 2 kb from the 3'-terminal region. The sequence contains three long open reading frames. One of these starts 5' of the cDNA and probably corresponds to the carboxy-terminal sequence of a 170-K protein encoded on RNA1. The deduced protein sequence from this reading frame shows homology with the putative replicases of tobacco mosaic virus (TMV) and tricornaviruses. The location of the second open reading frame, which encodes a 29-K polypeptide, was shown by Northern blot analysis to coincide with a 1.6-kb subgenomic RNA. The validity of this reading frame was confirmed by showing that the cDNA extending over this region could be transcribed and translated in vitro to produce a polypeptide of the predicted size which co-migrates in electrophoresis with a translation product of authentic viral RNA. The sequence of this 29-K polypeptide showed homology with two regions in the 30-K protein of TMV. This homology includes positions in the TMV 30-K protein where mutations have been identified which affect the transport of virus between cells. The third open reading frame encodes a potential 16-K protein and was shown by Northern blot hybridisation to be contained within the region of a 0.7-kb subgenomic RNA which is found in cellular RNA of infected cells but not virus particles. The many similarities between TRV and TMV in viral morphology, gene organisation and sequence suggest that these two viral groups may share a common viral ancestor. ImagesFig. 2.Fig. 3. PMID:16453668
Evolution of the viral hemorrhagic septicemia virus: divergence, selection and origin.
He, Mei; Yan, Xue-Chun; Liang, Yang; Sun, Xiao-Wen; Teng, Chun-Bo
2014-08-01
Viral hemorrhagic septicemia virus (VHSV) is an economically significant rhabdovirus that affects an increasing number of freshwater and marine fish species. Extensive studies have been conducted on the molecular epizootiology, genetic diversity, and phylogeny of VHSV. However, there are discrepancies between the reported estimates of the nucleotide substitution rate for the G gene and the divergence times for the genotypes. Herein, Bayesian coalescent analyses were conducted to the time-stamped entire coding sequences of the six VHSV genes. Rate estimates based on the G gene indicated that the marine genotypes/subtypes might not all evolve slower than their major European freshwater counterpart. Age calculations on the six genes revealed that the first bifurcation event of the analyzed isolates might have taken place within the last 300 years, which was much younger than previously thought. Selection analyses suggested that two codons of the G gene might be positively selected. Surveys of codon usage bias showed that the P, M and NV genes exhibited genotype-specific variations. Furthermore, we proposed that VHSV originated from the Pacific Northwest of North America. Copyright © 2014 Elsevier Inc. All rights reserved.
Differential replication of Foot-and-mouth disease viruses in mice determine lethality.
Cacciabue, Marco; García-Núñez, María Soledad; Delgado, Fernando; Currá, Anabella; Marrero, Rubén; Molinari, Paula; Rieder, Elizabeth; Carrillo, Elisa; Gismondi, María Inés
2017-09-01
Adult C57BL/6J mice have been used to study Foot-and-mouth disease virus (FMDV) biology. In this work, two variants of an FMDV A/Arg/01 strain exhibiting differential pathogenicity in adult mice were identified and characterized: a non-lethal virus (A01NL) caused mild signs of disease, whereas a lethal virus (A01L) caused death within 24-48h independently of the dose used. Both viruses caused a systemic infection with pathological changes in the exocrine pancreas. Virus A01L reached higher viral loads in plasma and organs of inoculated mice as well as increased replication in an ovine kidney cell line. Complete consensus sequences revealed 6 non-synonymous changes between A01L and A10NL genomes that might be linked to replication differences, as suggested by in silico prediction studies. Our results highlight the biological significance of discrete genomic variations and reinforce the usefulness of this animal model to study viral determinants of lethality. Copyright © 2017 Elsevier Inc. All rights reserved.
Genomic characterization and phylogenetic analysis of Zika virus circulating in the Americas.
Ye, Qing; Liu, Zhong-Yu; Han, Jian-Feng; Jiang, Tao; Li, Xiao-Feng; Qin, Cheng-Feng
2016-09-01
The rapid spread and potential link with birth defects have made Zika virus (ZIKV) a global public health problem. The virus was discovered 70years ago, yet the knowledge about its genomic structure and the genetic variations associated with current ZIKV explosive epidemics remains not fully understood. In this review, the genome organization, especially conserved terminal structures of ZIKV genome were characterized and compared with other mosquito-borne flaviviruses. It is suggested that major viral proteins of ZIKV share high structural and functional similarity with other known flaviviruses as shown by sequence comparison and prediction of functional motifs in viral proteins. Phylogenetic analysis demonstrated that all ZIKV strains circulating in the America form a unique clade within the Asian lineage. Furthermore, we identified a series of conserved amino acid residues that differentiate the Asian strains including the current circulating American strains from the ancient African strains. Overall, our findings provide an overview of ZIKV genome characterization and evolutionary dynamics in the Americas and point out critical clues for future virological and epidemiological studies. Copyright © 2016 Elsevier B.V. All rights reserved.
Defining Differential Genetic Signatures in CXCR4- and the CCR5-Utilizing HIV-1 Co-Linear Sequences
Aiamkitsumrit, Benjamas; Dampier, Will; Martin-Garcia, Julio; Nonnemacher, Michael R.; Pirrone, Vanessa; Ivanova, Tatyana; Zhong, Wen; Kilareski, Evelyn; Aldigun, Hazeez; Frantz, Brian; Rimbey, Matthew; Wojno, Adam; Passic, Shendra; Williams, Jean W.; Shah, Sonia; Blakey, Brandon; Parikh, Nirzari; Jacobson, Jeffrey M.; Moldover, Brian; Wigdahl, Brian
2014-01-01
The adaptation of human immunodeficiency virus type-1 (HIV-1) to an array of physiologic niches is advantaged by the plasticity of the viral genome, encoded proteins, and promoter. CXCR4-utilizing (X4) viruses preferentially, but not universally, infect CD4+ T cells, generating high levels of virus within activated HIV-1-infected T cells that can be detected in regional lymph nodes and peripheral blood. By comparison, the CCR5-utilizing (R5) viruses have a greater preference for cells of the monocyte-macrophage lineage; however, while R5 viruses also display a propensity to enter and replicate in T cells, they infect a smaller percentage of CD4+ T cells in comparison to X4 viruses. Additionally, R5 viruses have been associated with viral transmission and CNS disease and are also more prevalent during HIV-1 disease. Specific adaptive changes associated with X4 and R5 viruses were identified in co-linear viral sequences beyond the Env-V3. The in silico position-specific scoring matrix (PSSM) algorithm was used to define distinct groups of X4 and R5 sequences based solely on sequences in Env-V3. Bioinformatic tools were used to identify genetic signatures involving specific protein domains or long terminal repeat (LTR) transcription factor sites within co-linear viral protein R (Vpr), trans-activator of transcription (Tat), or LTR sequences that were preferentially associated with X4 or R5 Env-V3 sequences. A number of differential amino acid and nucleotide changes were identified across the co-linear Vpr, Tat, and LTR sequences, suggesting the presence of specific genetic signatures that preferentially associate with X4 or R5 viruses. Investigation of the genetic relatedness between X4 and R5 viruses utilizing phylogenetic analyses of complete sequences could not be used to definitively and uniquely identify groups of R5 or X4 sequences; in contrast, differences in the genetic diversities between X4 and R5 were readily identified within these co-linear sequences in HIV-1-infected patients. PMID:25265194
Bioinformatics prediction of siRNAs as potential antiviral agents against dengue viruses
Villegas-Rosales, Paula M; Méndez-Tenorio, Alfonso; Ortega-Soto, Elizabeth; Barrón, Blanca L
2012-01-01
Dengue virus (DENV 1-4) represents the major emerging arthropod-borne viral infection in the world. Currently, there is neither an available vaccine nor a specific treatment. Hence, there is a need of antiviral drugs for these viral infections; we describe the prediction of short interfering RNA (siRNA) as potential therapeutic agents against the four DENV serotypes. Our strategy was to carry out a series of multiple alignments using ClustalX program to find conserved sequences among the four DENV serotype genomes to obtain a consensus sequence for siRNAs design. A highly conserved sequence among the four DENV serotypes, located in the encoding sequence for NS4B and NS5 proteins was found. A total of 2,893 complete DENV genomes were downloaded from the NCBI, and after a depuration procedure to identify identical sequences, 220 complete DENV genomes were left. They were edited to select the NS4B and NS5 sequences, which were aligned to obtain a consensus sequence. Three different servers were used for siRNA design, and the resulting siRNAs were aligned to identify the most prevalent sequences. Three siRNAs were chosen, one targeted the genome region that codifies for NS4B protein and the other two; the region for NS5 protein. Predicted secondary structure for DENV genomes was used to demonstrate that the siRNAs were able to target the viral genome forming double stranded structures, necessary to activate the RNA silencing machinery. PMID:22829722
Li, Hu; Zhang, Li; Ren, Hong; Hu, Peng
2018-01-01
Viral diversity seems to predict treatment outcomes in certain viral infections. The aim of this study was to evaluate the association between baseline intra-patient viral diversity and hepatitis B surface antigen (HBsAg) decline following PEGylated interferon-alpha (Peg-IFN-α) therapy. Twenty-six HBeAg-positive patients who were treated with Peg-IFN-α were enrolled. Nested polymerase chain reaction (PCR), cloning, and sequencing of the hepatitis B virus S gene were performed on baseline samples, and normalized Shannon entropy (Sn) was calculated as a measure of small hepatitis B surface protein (SHBs) diversity. Multiple regression analysis was used to estimate the association between baseline Sn and HBsAg decline. Of the 26 patients enrolled in the study, 65.4% were male and 61.5% were infected with hepatitis B virus genotype B. The median HBsAg level at baseline was 4.5 log 10 IU/mL (interquartile range: 4.1-4.9) and declined to 3.0 log 10 IU/mL (interquartile range: 1.7-3.9) after 48 weeks of Peg-IFN-α treatment. In models adjusted for baseline alanine aminotransferase (ALT) and HBsAg, the adjusted coefficients (95% CI) for ΔHBsAg and relative percentage HBsAg decrease were -1.3 (-2.5, -0.2) log 10 IU/mL for higher SHBs diversity (Sn≥0.58) patients and -26.4% (-50.2%, -2.5%) for lower diversity (Sn<0.58) patients. Further analysis showed that the "a" determinant upstream flanking region and the first loop of the "a" determinant (nucleotides 341-359, 371-389, and 381-399) were the main sources of higher SHBs diversity. Baseline intra-patient SHBs diversity was inverse to HBsAg decline in HBeAg-positive chronic hepatitis B (CHB) patients receiving Peg-IFN-α monotherapy. Also, more sequence variations within the "a" determinant upstream flanking region and the first loop of the "a" determinant were the main sources of the higher SHBs diversity.
Bröer, Sonja; Hage, Elias; Käufer, Christopher; Gerhauser, Ingo; Anjum, Muneeb; Li, Lin; Baumgärtner, Wolfgang; Schulz, Thomas F; Löscher, Wolfgang
2017-03-01
Following intracerebral inoculation, the BeAn 8386 strain of Theiler's virus causes persistent infection and inflammatory demyelinating encephalomyelitis in the spinal cord of T-cell defective SJL/J mice, which is widely used as a model of multiple sclerosis. In contrast, C57BL/6 (B6) mice clear the virus and develop inflammation and lesions in the hippocampus, associated with acute and chronic seizures, representing a novel model of viral encephalitis-induced epilepsy. Here we characterize the geno- and phenotype of two naturally occurring variants of BeAn (BeAn-1 and BeAn-2) that can be used to further understand the viral and host factors involved in the neuropathogenesis in B6 and SJL/J mice. Next generation sequencing disclosed 15 single nucleotide differences between BeAn-1 and BeAn-2, of which 4 are coding changes and 3 are in the 5'-UTR (5'-untranslated region). The relatively minor variations in the nucleotide sequence of the two BeAn substrains led to marked differences in neurovirulence. In SJL/J mice, inflammatory demyelination in the spinal cord and its clinical consequences were significantly more marked following infection with BeAn-1 than with BeAn-2. Both BeAn substrains caused lymphocyte infiltration and increase of MAC3-positive cells in the hippocampus, but hippocampal damage and seizures were only observed in B6 mice. Seizures occurred in one third of BeAn-2 infected B6 mice, but not in BeAn-1 infected B6 mice. By comparing individual mice by receiver operating characteristic (ROC) curve analysis, the severity of hippocampal neurodegeneration and amount of MAC3-positive microglia/macrophages discriminated seizing from non-seizing B6 mice, whereas T-lymphocyte brain infiltration was not found to be a crucial factor. These data add novel evidence to the view that differential outcome of infection may be not invariably linked to a distinct viral burden but to a finely tuned balance between antiviral immune responses that although essential for host resistance can also contribute to immunopathology. Copyright © 2016. Published by Elsevier Inc.
Mechanisms of inhibition of viral replication in plants
DOE Office of Scientific and Technical Information (OSTI.GOV)
Not Available
1990-01-01
We have made a number of interesting observations of importance to the fields of virology and plant molecular biology. Topics include the genome of cucumber mosaic virus (CMV), recombination of the CMV genome, transgenic plants and viral movement genes, mapping resistance breakage sequences in the tomato mosaic virus (TMV) genome, and mapping pathogeneticity domains and viral RNA heterogeneity. 1 fig., 1 tab.
Human viral pathogens are pervasive in wastewater treatment center aerosols.
Brisebois, Evelyne; Veillette, Marc; Dion-Dupont, Vanessa; Lavoie, Jacques; Corbeil, Jacques; Culley, Alexander; Duchaine, Caroline
2018-05-01
Wastewater treatment center (WTC) workers may be vulnerable to diseases caused by viruses, such as the common cold, influenza and gastro-intestinal infections. Although there is a substantial body of literature characterizing the microbial community found in wastewater, only a few studies have characterized the viral component of WTC aerosols, despite the fact that most diseases affecting WTC workers are of viral origin and that some of these viruses are transmitted through the air. In this study, we evaluated in four WTCs the presence of 11 viral pathogens of particular concern in this milieu and used a metagenomic approach to characterize the total viral community in the air of one of those WTCs. The presence of viruses in aerosols in different locations of individual WTCs was evaluated and the results obtained with four commonly used air samplers were compared. We detected four of the eleven viruses tested, including human adenovirus (hAdV), rotavirus, hepatitis A virus (HAV) and Herpes Simplex virus type 1 (HSV1). The results of the metagenomic assay uncovered very few viral RNA sequences in WTC aerosols, however sequences from human DNA viruses were in much greater relative abundance. Copyright © 2017. Published by Elsevier B.V.
Experimental evidence that RNA recombination occurs in the Japanese encephalitis virus
DOE Office of Scientific and Technical Information (OSTI.GOV)
Chuang, C.-K.; Chen, W.-J., E-mail: wjchen@mail.cgu.edu.t; Department of Public Health and Parasitology, Chang Gung University, Kwei-San, Tao-Yuan 33332, Taiwan
2009-11-25
Due to the lack of a proofreading function and error-repairing ability of genomic RNA, accumulated mutations are known to be a force driving viral evolution in the genus Flavivirus, including the Japanese encephalitis (JE) virus. Based on sequencing data, RNA recombination was recently postulated to be another factor associated with genomic variations in these viruses. We herein provide experimental evidence to demonstrate the occurrence of RNA recombination in the JE virus using two local pure clones (T1P1-S1 and CJN-S1) respectively derived from the local strains, T1P1 and CJN. Based on results from a restriction fragment length polymorphism (RFLP) assay onmore » the C/preM junction comprising a fragment of 868 nucleotides (nt 10-877), the recombinant progeny virus was primarily formed in BHK-21 cells that had been co-infected with the two clones used in this study. Nine of 20 recombinant forms of the JE virus had a crossover in the nt 123-323 region. Sequencing data derived from these recombinants revealed that no nucleotide deletion or insertion occurred in this region favoring crossovers, indicating that precisely, not aberrantly, homologous recombination was involved. With site-directed mutagenesis, three stem-loop secondary structures were destabilized and re-stabilized in sequence, leading to changes in the frequency of recombination. This suggests that the conformation, not the free energy, of the secondary structure is important in modulating RNA recombination of the virus. It was concluded that because RNA recombination generates genetic diversity in the JE virus, this must be considered particularly in studies of viral evolution, epidemiology, and possible vaccine safety.« less
2014-01-01
Background Bovine respiratory syncytial virus (BRSV) is one of the major pathogens involved in the bovine respiratory disease (BRD) complex. The seroprevalence to BRSV in Norwegian cattle herds is high, but its role in epidemics of respiratory disease is unclear. The aims of the study were to investigate the etiological role of BRSV and other respiratory viruses in epidemics of BRD and to perform phylogenetic analysis of Norwegian BRSV strains. Results BRSV infection was detected either serologically and/or virologically in 18 (86%) of 21 outbreaks and in most cases as a single viral agent. When serology indicated that bovine coronavirus and/or bovine parainfluenza virus 3 were present, the number of BRSV positive animals in the herd was always higher, supporting the view of BRSV as the main pathogen. Sequencing of the G gene of BRSV positive samples showed that the current circulating Norwegian BRSVs belong to genetic subgroup II, along with other North European isolates. One isolate from an outbreak in Norway in 1976 was also investigated. This strain formed a separate branch in subgroup II, clearly different from the current Scandinavian sequences. The currently circulating BRSV could be divided into two different strains that were present in the same geographical area at the same time. The sequence variations between the two strains were in an antigenic important part of the G protein. Conclusion The results demonstrated that BRSV is the most important etiological agent of epidemics of BRD in Norway and that it often acts as the only viral agent. The phylogenetic analysis of the Norwegian strains of BRSV and several previously published isolates supported the theory of geographical and temporal clustering of BRSV. PMID:24423030
Liu, Jin; Shao, Luyao; Trang, Phong; Yang, Zhu; Reeves, Michael; Sun, Xu; Vu, Gia-Phong; Wang, Yu; Li, Hongjian; Zheng, Congyi; Lu, Sangwei; Liu, Fenyong
2016-06-09
An external guide sequence (EGS) is a RNA sequence which can interact with a target mRNA to form a tertiary structure like a pre-tRNA and recruit intracellular ribonuclease P (RNase P), a tRNA processing enzyme, to degrade target mRNA. Previously, an in vitro selection procedure has been used by us to engineer new EGSs that are more robust in inducing human RNase P to cleave their targeted mRNAs. In this study, we constructed EGSs from a variant to target the mRNA encoding herpes simplex virus 1 (HSV-1) major transcription regulator ICP4, which is essential for the expression of viral early and late genes and viral growth. The EGS variant induced human RNase P cleavage of ICP4 mRNA sequence 60 times better than the EGS generated from a natural pre-tRNA. A decrease of about 97% and 75% in the level of ICP4 gene expression and an inhibition of about 7,000- and 500-fold in viral growth were observed in HSV infected cells expressing the variant and the pre-tRNA-derived EGS, respectively. This study shows that engineered EGSs can inhibit HSV-1 gene expression and viral growth. Furthermore, these results demonstrate the potential for engineered EGS RNAs to be developed and used as anti-HSV therapeutics.
Viral Communities Associated with Human Pericardial Fluids in Idiopathic Pericarditis
Fancello, Laura; Monteil, Sonia; Popgeorgiev, Nikolay; Rivet, Romain; Gouriet, Frédérique; Fournier, Pierre-Edouard; Raoult, Didier; Desnues, Christelle
2014-01-01
Pericarditis is a common human disease defined by inflammation of the pericardium. Currently, 40% to 85% of pericarditis cases have no identified etiology. Most of these cases are thought to be caused by an infection of undetected, unsuspected or unknown viruses. In this work, we used a culture- and sequence-independent approach to investigate the viral DNA communities present in human pericardial fluids. Seven viral metagenomes were generated from the pericardial fluid of patients affected by pericarditis of unknown etiology and one metagenome was generated from the pericardial fluid of a sudden infant death case. As a positive control we generated one metagenome from the pericardial fluid of a patient affected by pericarditis caused by herpesvirus type 3. Furthermore, we used as negative controls a total of 6 pericardial fluids from 6 different individuals affected by pericarditis of non-infectious origin: 5 of them were sequenced as a unique pool and the remaining one was sequenced separately. The results showed a significant presence of torque teno viruses especially in one patient, while herpesviruses and papillomaviruses were present in the positive control. Co-infections by different genotypes of the same viral type (torque teno viruses) or different viruses (herpesviruses and papillomaviruses) were observed. Sequences related to bacteriophages infecting Staphylococcus, Enterobacteria, Streptococcus, Burkholderia and Pseudomonas were also detected in three patients. This study detected torque teno viruses and papillomaviruses, for the first time, in human pericardial fluids. PMID:24690743
Liu, Jin; Shao, Luyao; Trang, Phong; Yang, Zhu; Reeves, Michael; Sun, Xu; Vu, Gia-Phong; Wang, Yu; Li, Hongjian; Zheng, Congyi; Lu, Sangwei; Liu, Fenyong
2016-01-01
An external guide sequence (EGS) is a RNA sequence which can interact with a target mRNA to form a tertiary structure like a pre-tRNA and recruit intracellular ribonuclease P (RNase P), a tRNA processing enzyme, to degrade target mRNA. Previously, an in vitro selection procedure has been used by us to engineer new EGSs that are more robust in inducing human RNase P to cleave their targeted mRNAs. In this study, we constructed EGSs from a variant to target the mRNA encoding herpes simplex virus 1 (HSV-1) major transcription regulator ICP4, which is essential for the expression of viral early and late genes and viral growth. The EGS variant induced human RNase P cleavage of ICP4 mRNA sequence 60 times better than the EGS generated from a natural pre-tRNA. A decrease of about 97% and 75% in the level of ICP4 gene expression and an inhibition of about 7,000- and 500-fold in viral growth were observed in HSV infected cells expressing the variant and the pre-tRNA-derived EGS, respectively. This study shows that engineered EGSs can inhibit HSV-1 gene expression and viral growth. Furthermore, these results demonstrate the potential for engineered EGS RNAs to be developed and used as anti-HSV therapeutics. PMID:27279482
Discovering Deeply Divergent RNA Viruses in Existing Metatranscriptome Data with Machine Learning
NASA Astrophysics Data System (ADS)
Rivers, A. R.
2016-02-01
Most sampling of RNA viruses and phages has been directed toward a narrow range of hosts and environments. Several marine metagenomic studies have examined the RNA viral fraction in aquatic samples and found a number of picornaviruses and uncharacterized sequences. The lack of homology to known protein families has limited the discovery of new RNA viruses. We developed a computational method for identifying RNA viruses that relies on information in the codon transition probabilities of viral sequences to train a classifier. This approach does not rely on homology, but it has higher information content than other reference-free methods such as tetranucleotide frequency. Training and validation with RefSeq data gave true positive and true negative rates of 99.6% and 99.5% on the highly imbalanced validation sets (0.2% viruses) that, like the metatranscriptomes themselves, contain mostly non-viral sequences. To further test the method, a validation dataset of putative RNA virus genomes were identified in metatransciptomes by the presence of RNA dependent RNA polymerase, an essential gene for RNA viruses. The classifier successfully identified 99.4% of those contigs as viral. This approach is currently being extended to screen all metatranscriptome data sequenced at the DOE Joint Genome Institute, presently 4.5 Gb of assembled data from 504 public projects representing a wide range of marine, aquatic and terrestrial environments.
Zhou, Shuntai; Jones, Corbin; Mieczkowski, Piotr
2015-01-01
ABSTRACT Validating the sampling depth and reducing sequencing errors are critical for studies of viral populations using next-generation sequencing (NGS). We previously described the use of Primer ID to tag each viral RNA template with a block of degenerate nucleotides in the cDNA primer. We now show that low-abundance Primer IDs (offspring Primer IDs) are generated due to PCR/sequencing errors. These artifactual Primer IDs can be removed using a cutoff model for the number of reads required to make a template consensus sequence. We have modeled the fraction of sequences lost due to Primer ID resampling. For a typical sequencing run, less than 10% of the raw reads are lost to offspring Primer ID filtering and resampling. The remaining raw reads are used to correct for PCR resampling and sequencing errors. We also demonstrate that Primer ID reveals bias intrinsic to PCR, especially at low template input or utilization. cDNA synthesis and PCR convert ca. 20% of RNA templates into recoverable sequences, and 30-fold sequence coverage recovers most of these template sequences. We have directly measured the residual error rate to be around 1 in 10,000 nucleotides. We use this error rate and the Poisson distribution to define the cutoff to identify preexisting drug resistance mutations at low abundance in an HIV-infected subject. Collectively, these studies show that >90% of the raw sequence reads can be used to validate template sampling depth and to dramatically reduce the error rate in assessing a genetically diverse viral population using NGS. IMPORTANCE Although next-generation sequencing (NGS) has revolutionized sequencing strategies, it suffers from serious limitations in defining sequence heterogeneity in a genetically diverse population, such as HIV-1 due to PCR resampling and PCR/sequencing errors. The Primer ID approach reveals the true sampling depth and greatly reduces errors. Knowing the sampling depth allows the construction of a model of how to maximize the recovery of sequences from input templates and to reduce resampling of the Primer ID so that appropriate multiplexing can be included in the experimental design. With the defined sampling depth and measured error rate, we are able to assign cutoffs for the accurate detection of minority variants in viral populations. This approach allows the power of NGS to be realized without having to guess about sampling depth or to ignore the problem of PCR resampling, while also being able to correct most of the errors in the data set. PMID:26041299
Bing, Tiejun; Zhang, Suzhen; Liu, Xiaojuan; Liang, Zhibin; Shao, Peng; Zhang, Song; Qiao, Wentao; Tan, Juan
2016-06-30
Bovine foamy virus (BFV) encodes the transactivator BTas, which enhances viral gene transcription by binding to the long terminal repeat promoter and the internal promoter. In this study, we investigated the different replication capacities of two similar BFV full-length DNA clones, pBS-BFV-Y and pBS-BFV-B. Here, functional analysis of several chimeric clones revealed a major role for the C-terminal region of the viral genome in causing this difference. Furthermore, BTas-B, which is located in this C-terminal region, exhibited a 20-fold higher transactivation activity than BTas-Y. Sequence alignment showed that these two sequences differ only at amino acid 108, with BTas-B containing N108 and BTas-Y containing D108 at this position. Results of mutagenesis studies demonstrated that residue N108 is important for BTas binding to viral promoters. In addition, the N108D mutation in pBS-BFV-B reduced the viral replication capacity by about 1.5-fold. Our results suggest that residue N108 is important for BTas binding to BFV promoters and has a major role in BFV replication. These findings not only advances our understanding of the transactivation mechanism of BTas, but they also highlight the importance of certain sequence polymorphisms in modulating the replication capacity of isolated BFV clones.
Raw Sewage Harbors Diverse Viral Populations
Cantalupo, Paul G.; Calgua, Byron; Zhao, Guoyan; Hundesa, Ayalkibet; Wier, Adam D.; Katz, Josh P.; Grabe, Michael; Hendrix, Roger W.; Girones, Rosina; Wang, David; Pipas, James M.
2011-01-01
ABSTRACT At this time, about 3,000 different viruses are recognized, but metagenomic studies suggest that these viruses are a small fraction of the viruses that exist in nature. We have explored viral diversity by deep sequencing nucleic acids obtained from virion populations enriched from raw sewage. We identified 234 known viruses, including 17 that infect humans. Plant, insect, and algal viruses as well as bacteriophages were also present. These viruses represented 26 taxonomic families and included viruses with single-stranded DNA (ssDNA), double-stranded DNA (dsDNA), positive-sense ssRNA [ssRNA(+)], and dsRNA genomes. Novel viruses that could be placed in specific taxa represented 51 different families, making untreated wastewater the most diverse viral metagenome (genetic material recovered directly from environmental samples) examined thus far. However, the vast majority of sequence reads bore little or no sequence relation to known viruses and thus could not be placed into specific taxa. These results show that the vast majority of the viruses on Earth have not yet been characterized. Untreated wastewater provides a rich matrix for identifying novel viruses and for studying virus diversity. Importance At this time, virology is focused on the study of a relatively small number of viral species. Specific viruses are studied either because they are easily propagated in the laboratory or because they are associated with disease. The lack of knowledge of the size and characteristics of the viral universe and the diversity of viral genomes is a roadblock to understanding important issues, such as the origin of emerging pathogens and the extent of gene exchange among viruses. Untreated wastewater is an ideal system for assessing viral diversity because virion populations from large numbers of individuals are deposited and because raw sewage itself provides a rich environment for the growth of diverse host species and thus their viruses. These studies suggest that the viral universe is far more vast and diverse than previously suspected. PMID:21972239
Stepanauskas, Ramunas; Fergusson, Elizabeth A; Brown, Joseph; Poulton, Nicole J; Tupper, Ben; Labonté, Jessica M; Becraft, Eric D; Brown, Julia M; Pachiadaki, Maria G; Povilaitis, Tadas; Thompson, Brian P; Mascena, Corianna J; Bellows, Wendy K; Lubys, Arvydas
2017-07-20
Microbial single-cell genomics can be used to provide insights into the metabolic potential, interactions, and evolution of uncultured microorganisms. Here we present WGA-X, a method based on multiple displacement amplification of DNA that utilizes a thermostable mutant of the phi29 polymerase. WGA-X enhances genome recovery from individual microbial cells and viral particles while maintaining ease of use and scalability. The greatest improvements are observed when amplifying high G+C content templates, such as those belonging to the predominant bacteria in agricultural soils. By integrating WGA-X with calibrated index-cell sorting and high-throughput genomic sequencing, we are able to analyze genomic sequences and cell sizes of hundreds of individual, uncultured bacteria, archaea, protists, and viral particles, obtained directly from marine and soil samples, in a single experiment. This approach may find diverse applications in microbiology and in biomedical and forensic studies of humans and other multicellular organisms.Single-cell genomics can be used to study uncultured microorganisms. Here, Stepanauskas et al. present a method combining improved multiple displacement amplification and FACS, to obtain genomic sequences and cell size information from uncultivated microbial cells and viral particles in environmental samples.
Fluid spatial dynamics of West Nile virus in the USA: Rapid spread in a permissive host environment
Di Giallonardo , Francesca; Geoghegan, Jemma L.; Docherty, Douglas E.; McLean, Robert G.; Zody, Michael C.; Qu, James; Yang, Xiao; Birren, Bruce W.; Malboeuf, Christine M.; Newman, R.; Ip, Hon S.; Holmes, Edward C.
2016-01-01
The introduction of West Nile virus (WNV) into North America in 1999 is a classical example of viral emergence in a new environment, with its subsequent dispersion across the continent having a major impact on local bird populations. Despite the importance of this epizootic, the pattern, dynamics and determinants of WNV spread in its natural hosts remain uncertain. In particular, it is unclear whether the virus encountered major barriers to transmission, or spread in an unconstrained manner, and if specific viral lineages were favored over others indicative of intrinsic differences in fitness. To address these key questions in WNV evolution and ecology we sequenced the complete genomes of approximately 300 avian isolates sampled across the USA between 2001-2012. Phylogenetic analysis revealed a relatively ‘star-like' tree structure, indicative of explosive viral spread in US, although with some replacement of viral genotypes through time. These data are striking in that viral sequences exhibit relatively limited clustering according to geographic region, particularly for those viruses sampled from birds, and no strong phylogenetic association with well sampled avian species. The genome sequence data analysed here also contain relatively little evidence for adaptive evolution, particularly on structural proteins, suggesting that most viral lineages are of similar fitness, and that WNV is well adapted to the ecology of mosquito vectors and diverse avian hosts in the USA. In sum, the molecular evolution of WNV in North America depicts a largely unfettered expansion within a permissive host and geographic population with little evidence of major adaptive barriers.
The Fecal Virome of Pigs on a High-Density Farm ▿ †
Shan, Tongling; Li, Linlin; Simmonds, Peter; Wang, Chunlin; Moeser, Adam; Delwart, Eric
2011-01-01
Swine are an important source of proteins worldwide but are subject to frequent viral outbreaks and numerous infections capable of infecting humans. Modern farming conditions may also increase viral transmission and potential zoonotic spread. We describe here the metagenomics-derived virome in the feces of 24 healthy and 12 diarrheic piglets on a high-density farm. An average of 4.2 different mammalian viruses were shed by healthy piglets, reflecting a high level of asymptomatic infections. Diarrheic pigs shed an average of 5.4 different mammalian viruses. Ninety-nine percent of the viral sequences were related to the RNA virus families Picornaviridae, Astroviridae, Coronaviridae, and Caliciviridae, while 1% were related to the small DNA virus families Circoviridae, and Parvoviridae. Porcine RNA viruses identified, in order of decreasing number of sequence reads, consisted of kobuviruses, astroviruses, enteroviruses, sapoviruses, sapeloviruses, coronaviruses, bocaviruses, and teschoviruses. The near-full genomes of multiple novel species of porcine astroviruses and bocaviruses were generated and phylogenetically analyzed. Multiple small circular DNA genomes encoding replicase proteins plus two highly divergent members of the Picornavirales order were also characterized. The possible origin of these viral genomes from pig-infecting protozoans and nematodes, based on closest sequence similarities, is discussed. In summary, an unbiased survey of viruses in the feces of intensely farmed animals revealed frequent coinfections with a highly diverse set of viruses providing favorable conditions for viral recombination. Viral surveys of animals can readily document the circulation of known and new viruses, facilitating the detection of emerging viruses and prospective evaluation of their pathogenic and zoonotic potentials. PMID:21900163
Pan, Pinliang; Tao, Xiaoxia; Zhang, Qi; Xing, Wenge; Sun, Xianguang; Pei, Lijian; Jiang, Yan
2007-12-01
To investigate the correlation between three viral load assays for circulating recombinant form (CRF)_BC. Recent studies in HIV-1 molecular epidemiology, reveals that CRF_BC is the dominant subtype of HIV-1 virus in mainland China, representing over 45% of the HIV-1 infected population. The performances of nucleic acid sequence-based amplification (NASBA), branched DNA (bDNA) and reverse transcriptase polymerase chain reaction (RT-PCR) were compared for the HIV-1 viral load detection and quantitation of CRF_BC in China. Sixteen HIV-1 positive and three HIV-1 negative samples were collected. Sequencing of the positive samples in the gp41 region was conducted. The HIV-1 viral load values were determined using bDNA, RT-PCR and NASBA assays. Deming regression analysis with SPSS 12.0 (SPS Inc., Chicago, Illinois, USA) was performed for data analysis. Sequencing and phylogenetic analysis of env gene (gp41) region of the 16 HIV-1 positive clinical specimens from Guizhou Province in southwest China revealed the dominance of the subtype CRF_BC in that region. A good correlation of their viral load values was observed among three assays. Pearson's correlation between RT-PCR and bDNA is 0.969, Lg(VL)RT-PCR = 0.969 * Lg(VL)bDNA + 0.55; Pearson's correlation between RT-PCR and NASBA is 0.968, Lg(VL)RT-PCR = 0.968 * Lg(VL)NASBA + 0.937; Pearson's correlation between NASBA and bDNA is 0.980, Lg(VL)NASBA = 0.980 * Lg(VL)bDNA - 0.318. When testing with 3 different assays, RT-PCR, bDNA and NASBA, the group of 16 HIV-1 positive samples showed the viral load value was highest for RT-PCR, followed by bDNA then NASBA, which is consistent with the former results in subtype B. The three viral load assays are highly correlative for CRF_BC in China.
Muñoz-Alía, Miguel Ángel; Fernández-Muñoz, Rafael; Casasnovas, José María; Porras-Mansilla, Rebeca; Serrano-Pardo, Ángela; Pagán, Israel; Ordobás, María; Ramírez, Rosa; Celma, María Luisa
2015-01-22
Measles virus circulates endemically in African and Asian large urban populations, causing outbreaks worldwide in populations with up-to-95% immune protection. We studied the natural genetic variability of genotype B3.1 in a population with 95% vaccine coverage throughout an imported six month measles outbreak. From first pass viral isolates of 47 patients we performed direct sequencing of genomic cDNA. Whilst no variation from index case sequence occurred in the Nucleocapsid gene hyper-variable carboxy end, in the Hemagglutinin gene, main target for neutralizing antibodies, we observed gradual nucleotide divergence from index case along the outbreak (0% to 0.380%, average 0.138%) with the emergence of transient and persistent non-synonymous and synonymous mutations. Little or no variation was observed between the index and last outbreak cases in Phosphoprotein, Nucleocapsid, Matrix and Fusion genes. Most of the H non-synonymous mutations were mapped on the protein surface near antigenic and receptors binding sites. We estimated a MV-Hemagglutinin nucleotide substitution rate of 7.28 × 10-6 substitutions/site/day by a Bayesian phylogenetic analysis. The dN/dS analysis did not suggest significant immune or other selective pressures on the H gene during the outbreak. These results emphasize the usefulness of MV-H sequence analysis in measles epidemiological surveillance and elimination programs, and in detection of potentially emergence of measles virus neutralization-resistant mutants. Copyright © 2014 Elsevier B.V. All rights reserved.
CRISPR Spacer Arrays for Detection of Viral Signatures from Acidic Hot Springs
NASA Astrophysics Data System (ADS)
Snyder, J. C.; Bateson, M. M.; Suciu, D.; Young, M. J.
2010-04-01
Viruses are the most abundant life-like entities on the planet Earth. Using CRISPR spacer sequences, we have developed a microarray-based approach to detecting viral signatures in the acidic hot springs of Yellowstone.
Ho, Cynthia K. Y.; Raghwani, Jayna; Koekkoek, Sylvie; Liang, Richard H.; Van der Meer, Jan T. M.; Van Der Valk, Marc; De Jong, Menno; Pybus, Oliver G.
2016-01-01
ABSTRACT In contrast to other available next-generation sequencing platforms, PacBio single-molecule, real-time (SMRT) sequencing has the advantage of generating long reads albeit with a relatively higher error rate in unprocessed data. Using this platform, we longitudinally sampled and sequenced the hepatitis C virus (HCV) envelope genome region (1,680 nucleotides [nt]) from individuals belonging to a cluster of sexually transmitted cases. All five subjects were coinfected with HIV-1 and a closely related strain of HCV genotype 4d. In total, 50 samples were analyzed by using SMRT sequencing. By using 7 passes of circular consensus sequencing, the error rate was reduced to 0.37%, and the median number of sequences was 612 per sample. A further reduction of insertions was achieved by alignment against a sample-specific reference sequence. However, in vitro recombination during PCR amplification could not be excluded. Phylogenetic analysis supported close relationships among HCV sequences from the four male subjects and subsequent transmission from one subject to his female partner. Transmission was characterized by a strong genetic bottleneck. Viral genetic diversity was low during acute infection and increased upon progression to chronicity but subsequently fluctuated during chronic infection, caused by the alternate detection of distinct coexisting lineages. SMRT sequencing combines long reads with sufficient depth for many phylogenetic analyses and can therefore provide insights into within-host HCV evolutionary dynamics without the need for haplotype reconstruction using statistical algorithms. IMPORTANCE Next-generation sequencing has revolutionized the study of genetically variable RNA virus populations, but for phylogenetic and evolutionary analyses, longer sequences than those generated by most available platforms, while minimizing the intrinsic error rate, are desired. Here, we demonstrate for the first time that PacBio SMRT sequencing technology can be used to generate full-length HCV envelope sequences at the single-molecule level, providing a data set with large sequencing depth for the characterization of intrahost viral dynamics. The selection of consensus reads derived from at least 7 full circular consensus sequencing rounds significantly reduced the intrinsic high error rate of this method. We used this method to genetically characterize a unique transmission cluster of sexually transmitted HCV infections, providing insight into the distinct evolutionary pathways in each patient over time and identifying the transmission-associated genetic bottleneck as well as fluctuations in viral genetic diversity over time, accompanied by dynamic shifts in viral subpopulations. PMID:28077634
An approach for identification of unknown viruses using sequencing-by-hybridization.
Katoski, Sarah E; Meyer, Hermann; Ibrahim, Sofi
2015-09-01
Accurate identification of biological threat agents, especially RNA viruses, in clinical or environmental samples can be challenging because the concentration of viral genomic material in a given sample is usually low, viral genomic RNA is liable to degradation, and RNA viruses are extremely diverse. A two-tiered approach was used for initial identification, then full genomic characterization of 199 RNA viruses belonging to virus families Arenaviridae, Bunyaviridae, Filoviridae, Flaviviridae, and Togaviridae. A Sequencing-by-hybridization (SBH) microarray was used to tentatively identify a viral pathogen then, the identity is confirmed by guided next-generation sequencing (NGS). After optimization and evaluation of the SBH and NGS methodologies with various virus species and strains, the approach was used to test the ability to identify viruses in blinded samples. The SBH correctly identified two Ebola viruses in the blinded samples within 24 hr, and by using guided amplicon sequencing with 454 GS FLX, the identities of the viruses in both samples were confirmed. SBH provides at relatively low-cost screening of biological samples against a panel of viral pathogens that can be custom-designed on a microarray. Once the identity of virus is deduced from the highest hybridization signal on the SBH microarray, guided (amplicon) NGS sequencing can be used not only to confirm the identity of the virus but also to provide further information about the strain or isolate, including a potential genetic manipulation. This approach can be useful in situations where natural or deliberate biological threat incidents might occur and a rapid response is required. © 2015 Wiley Periodicals, Inc.
Virome Assembly and Annotation: A Surprise in the Namib Desert
Hesse, Uljana; van Heusden, Peter; Kirby, Bronwyn M.; Olonade, Israel; van Zyl, Leonardo J.; Trindade, Marla
2017-01-01
Sequencing, assembly, and annotation of environmental virome samples is challenging. Methodological biases and differences in species abundance result in fragmentary read coverage; sequence reconstruction is further complicated by the mosaic nature of viral genomes. In this paper, we focus on biocomputational aspects of virome analysis, emphasizing latent pitfalls in sequence annotation. Using simulated viromes that mimic environmental data challenges we assessed the performance of five assemblers (CLC-Workbench, IDBA-UD, SPAdes, RayMeta, ABySS). Individual analyses of relevant scaffold length fractions revealed shortcomings of some programs in reconstruction of viral genomes with excessive read coverage (IDBA-UD, RayMeta), and in accurate assembly of scaffolds ≥50 kb (SPAdes, RayMeta, ABySS). The CLC-Workbench assembler performed best in terms of genome recovery (including highly covered genomes) and correct reconstruction of large scaffolds; and was used to assemble a virome from a copper rich site in the Namib Desert. We found that scaffold network analysis and cluster-specific read reassembly improved reconstruction of sequences with excessive read coverage, and that strict data filtering for non-viral sequences prior to downstream analyses was essential. In this study we describe novel viral genomes identified in the Namib Desert copper site virome. Taxonomic affiliations of diverse proteins in the dataset and phylogenetic analyses of circovirus-like proteins indicated links to the marine habitat. Considering additional evidence from this dataset we hypothesize that viruses may have been carried from the Atlantic Ocean into the Namib Desert by fog and wind, highlighting the impact of the extended environment on an investigated niche in metagenome studies. PMID:28167933
Correa, Adrienne M. S.; Ainsworth, Tracy D.; Rosales, Stephanie M.; Thurber, Andrew R.; Butler, Christopher R.; Vega Thurber, Rebecca L.
2016-01-01
Previous studies of coral viruses have employed either microscopy or metagenomics, but few have attempted to comprehensively link the presence of a virus-like particle (VLP) to a genomic sequence. We conducted transmission electron microscopy imaging and virome analysis in tandem to characterize the most conspicuous viral types found within the dominant Pacific reef-building coral genus Acropora. Collections for this study inadvertently captured what we interpret as a natural outbreak of viral infection driven by aerial exposure of the reef flat coincident with heavy rainfall and concomitant mass bleaching. All experimental corals in this study had high titers of viral particles. Three of the dominant VLPs identified were observed in all tissue layers and budding out from the epidermis, including viruses that were ∼70, ∼120, and ∼150 nm in diameter; these VLPs all contained electron dense cores. These morphological traits are reminiscent of retroviruses, herpesviruses, and nucleocytoplasmic large DNA viruses (NCLDVs), respectively. Some 300–500 nm megavirus-like VLPs also were observed within and associated with dinoflagellate algal endosymbiont (Symbiodinium) cells. Abundant sequence similarities to a gammaretrovirus, herpesviruses, and members of the NCLDVs, based on a virome generated from five Acropora aspera colonies, corroborated these morphology-based identifications. Additionally sequence similarities to two diagnostic genes, a MutS and (based on re-annotation of sequences from another study) a DNA polymerase B gene, most closely resembled Pyramimonas orientalis virus, demonstrating the association of a cosmopolitan megavirus with Symbiodinium. We also identified several other virus-like particles in host tissues, along with sequences phylogenetically similar to circoviruses, phages, and filamentous viruses. This study suggests that viral outbreaks may be a common but previously undocumented component of natural bleaching events, particularly following repeated episodes of multiple environmental stressors. PMID:26941712
Genetic characterization of poxviruses in Camelus dromedarius in Ethiopia, 2011-2014.
Gelaye, Esayas; Achenbach, Jenna Elizabeth; Ayelet, Gelagay; Jenberie, Shiferaw; Yami, Martha; Grabherr, Reingard; Loitsch, Angelika; Diallo, Adama; Lamien, Charles Euloge
2016-10-01
Camelpox and camel contagious ecthyma are infectious viral diseases of camelids caused by camelpox virus (CMLV) and camel contagious ecthyma virus (CCEV), respectively. Even though, in Ethiopia, pox disease has been creating significant economic losses in camel production, little is known on the responsible pathogens and their genetic diversity. Thus, the present study aimed at isolation, identification and genetic characterization of the causative viruses. Accordingly, clinical case observations, infectious virus isolation, and molecular and phylogenetic analysis of poxviruses infecting camels in three regions and six districts in the country, Afar (Chifra), Oromia (Arero, Miyu and Yabello) and Somali (Gursum and Jijiga) between 2011 and 2014 were undertaken. The full hemagglutinin (HA) and partial A-type inclusion protein (ATIP) genes of CMLV and full major envelope protein (B2L) gene of CCEV of Ethiopian isolates were sequenced, analyzed and compared among each other and to foreign isolates. The viral isolation confirmed the presence of infectious poxviruses. The preliminary screening by PCR showed 27 CMLVs and 20 CCEVs. The sequence analyses showed that the HA and ATIP gene sequences are highly conserved within the local isolates of CMLVs, and formed a single cluster together with isolates from Somalia and Syria. Unlike CMLVs, the B2L gene analysis of Ethiopian CCEV showed few genetic variations. The phylogenetic analysis revealed three clusters of CCEV in Ethiopia with the isolates clustering according to their geographical origins. To our knowledge, this is the first report indicating the existence of CCEV in Ethiopia where camel contagious ecthyma was misdiagnosed as camelpox. Additionally, this study has also disclosed the existence of co-infections with CMLV and CCEV. A comprehensive characterization of poxviruses affecting camels in Ethiopia and the full genome sequencing of representative isolates are recommended to better understand the dynamics of pox diseases of camels and to assist in the implementation of more efficient control measures. Copyright © 2016 Elsevier B.V. All rights reserved.
Detection of Novel Sequences Related to African Swine Fever Virus in Human Serum and Sewage▿ †
Loh, Joy; Zhao, Guoyan; Presti, Rachel M.; Holtz, Lori R.; Finkbeiner, Stacy R.; Droit, Lindsay; Villasana, Zoilmar; Todd, Collin; Pipas, James M.; Calgua, Byron; Girones, Rosina; Wang, David; Virgin, Herbert W.
2009-01-01
The family Asfarviridae contains only a single virus species, African swine fever virus (ASFV). ASFV is a viral agent with significant economic impact due to its devastating effects on populations of domesticated pigs during outbreaks but has not been reported to infect humans. We report here the discovery of novel viral sequences in human serum and sewage which are clearly related to the asfarvirus family but highly divergent from ASFV. Detection of these sequences suggests that greater genetic diversity may exist among asfarviruses than previously thought and raises the possibility that human infection by asfarviruses may occur. PMID:19812170
Nuclear targeting of viral and non-viral DNA.
Chowdhury, E H
2009-07-01
The nuclear envelope presents a major barrier to transgene delivery and expression using a non-viral vector. Virus is capable of overcoming the barrier to deliver their genetic materials efficiently into the nucleus by virtue of the specialized protein components with the unique amino acid sequences recognizing cellular nuclear transport machinery. However, considering the safety issues in the clinical gene therapy for treating critical human diseases, non-viral systems are highly promising compared with their viral counterparts. This review summarizes the progress on exploring the nuclear traffic mechanisms for the prominent viral vectors and the technological innovations for the nuclear delivery of non-viral DNA by mimicking those natural processes evolved for the viruses as well as for many cellular proteins.
Viruses as Winners in the Game of Life.
Cobián Güemes, Ana Georgina; Youle, Merry; Cantú, Vito Adrian; Felts, Ben; Nulton, James; Rohwer, Forest
2016-09-29
Viruses are the most abundant and the most diverse life form. In this meta-analysis we estimate that there are 4.80×10 31 phages on Earth. Further, 97% of viruses are in soil and sediment-two underinvestigated biomes that combined account for only ∼2.5% of publicly available viral metagenomes. The majority of the most abundant viral sequences from all biomes are novel. Our analysis drawing on all publicly available viral metagenomes observed a mere 257,698 viral genotypes on Earth-an unrealistically low number-which attests to the current paucity of viral metagenomic data. Further advances in viral ecology and diversity call for a shift of attention to previously ignored major biomes and careful application of verified methods for viral metagenomic analysis.
Papuchon, Jennifer; Pinson, Patricia; Guidicelli, Gwenda-Line; Bellecave, Pantxika; Thomas, Réjean; LeBlanc, Roger; Reigadas, Sandrine; Taupin, Jean-Luc; Baril, Jean Guy; Routy, Jean Pierre; Wainberg, Mark; Fleury, Hervé
2014-01-01
In patients responding successfully to ART, the next therapeutic step is viral cure. An interesting strategy is antiviral vaccination, particularly involving CD8 T cell epitopes. However, attempts at vaccination are dependent on the immunogenetic background of individuals. The Provir/Latitude 45 project aims to investigate which CTL epitopes in proviral HIV-1 will be recognized by the immune system when HLA alleles are taken into consideration. A prior study (Papuchon et al, PLoS ONE 2013) showed that chronically-infected patients under successful ART exhibited variations of proviral CTL epitopes compared to a reference viral strain (HXB2) and that a generic vaccine may not be efficient. Here, we investigated viral and/or proviral CTL epitopes at different time points in recently infected individuals of the Canadian primary HIV infection cohort and assessed the affinity of these epitopes for HLA alleles during the study period. An analysis of the results confirms that it is not possible to fully predict which epitopes will be recognized by the HLA alleles of the patients if the reference sequences and epitopes are taken as the basis of simulation. Epitopes may be seen to vary in circulating RNA and proviral DNA. Despite this confirmation, the overall variability of the epitopes was low in these patients who are temporally close to primary infection.
Papuchon, Jennifer; Pinson, Patricia; Guidicelli, Gwenda-Line; Bellecave, Pantxika; Thomas, Réjean; LeBlanc, Roger; Reigadas, Sandrine; Taupin, Jean-Luc; Baril, Jean Guy; Routy, Jean Pierre; Wainberg, Mark; Fleury, Hervé
2014-01-01
In patients responding successfully to ART, the next therapeutic step is viral cure. An interesting strategy is antiviral vaccination, particularly involving CD8 T cell epitopes. However, attempts at vaccination are dependent on the immunogenetic background of individuals. The Provir/Latitude 45 project aims to investigate which CTL epitopes in proviral HIV-1 will be recognized by the immune system when HLA alleles are taken into consideration. A prior study (Papuchon et al, PLoS ONE 2013) showed that chronically-infected patients under successful ART exhibited variations of proviral CTL epitopes compared to a reference viral strain (HXB2) and that a generic vaccine may not be efficient. Here, we investigated viral and/or proviral CTL epitopes at different time points in recently infected individuals of the Canadian primary HIV infection cohort and assessed the affinity of these epitopes for HLA alleles during the study period. An analysis of the results confirms that it is not possible to fully predict which epitopes will be recognized by the HLA alleles of the patients if the reference sequences and epitopes are taken as the basis of simulation. Epitopes may be seen to vary in circulating RNA and proviral DNA. Despite this confirmation, the overall variability of the epitopes was low in these patients who are temporally close to primary infection. PMID:24964202
Tamarozzi, Elvira Regina; Giuliatti, Silvana
2018-01-09
Intrinsic disorder is very important in the biological function of several proteins, and is directly linked to their foldability during interaction with their targets. There is a close relationship between the intrinsically disordered proteins and the process of carcinogenesis involving viral pathogens. Among these pathogens, we have highlighted the human papillomavirus (HPV) in this study. HPV is currently among the most common sexually transmitted infections, besides being the cause of several types of cancer. HPVs are divided into two groups, called high- and low-risk, based on their oncogenic potential. The high-risk HPV E6 protein has been the target of much research, in seeking treatments against HPV, due to its direct involvement in the process of cell cycle control. To understand the role of intrinsic disorder of the viral proteins in the oncogenic potential of different HPV types, the structural characteristics of intrinsically disordered regions of high and low-risk HPV E6 proteins were analyzed. In silico analyses of primary sequences, prediction of tertiary structures, and analyses of molecular dynamics allowed the observation of the behavior of such disordered regions in these proteins, thereby proving a direct relationship of structural variation with the degree of oncogenicity of HPVs. The results obtained may contribute to the development of new therapies, targeting the E6 oncoprotein, for the treatment of HPV-associated diseases.
Integrating Phylodynamics and Epidemiology to Estimate Transmission Diversity in Viral Epidemics
Magiorkinis, Gkikas; Sypsa, Vana; Magiorkinis, Emmanouil; Paraskevis, Dimitrios; Katsoulidou, Antigoni; Belshaw, Robert; Fraser, Christophe; Pybus, Oliver George; Hatzakis, Angelos
2013-01-01
The epidemiology of chronic viral infections, such as those caused by Hepatitis C Virus (HCV) and Human Immunodeficiency Virus (HIV), is affected by the risk group structure of the infected population. Risk groups are defined by each of their members having acquired infection through a specific behavior. However, risk group definitions say little about the transmission potential of each infected individual. Variation in the number of secondary infections is extremely difficult to estimate for HCV and HIV but crucial in the design of efficient control interventions. Here we describe a novel method that combines epidemiological and population genetic approaches to estimate the variation in transmissibility of rapidly-evolving viral epidemics. We evaluate this method using a nationwide HCV epidemic and for the first time co-estimate viral generation times and superspreading events from a combination of molecular and epidemiological data. We anticipate that this integrated approach will form the basis of powerful tools for describing the transmission dynamics of chronic viral diseases, and for evaluating control strategies directed against them. PMID:23382662
Ellenbecker, Mary; St Goddard, Jeremy; Sundet, Alec; Lanchy, Jean-Marc; Raiford, Douglas; Lodmell, J Stephen
2015-10-01
Rift Valley fever virus (RVFV) is a potent human and livestock pathogen endemic to sub-Saharan Africa and the Arabian Peninsula that has potential to spread to other parts of the world. Although there is no proven effective and safe treatment for RVFV infections, a potential therapeutic target is the virally encoded nucleocapsid protein (N). During the course of infection, N binds to viral RNA, and perturbation of this interaction can inhibit viral replication. To gain insight into how N recognizes viral RNA specifically, we designed an algorithm that uses a distance matrix and multidimensional scaling to compare the predicted secondary structures of known N-binding RNAs, or aptamers, that were isolated and characterized in previous in vitro evolution experiment. These aptamers did not exhibit overt sequence or predicted structure similarity, so we employed bioinformatic methods to propose novel aptamers based on analysis and clustering of secondary structures. We screened and scored the predicted secondary structures of novel randomly generated RNA sequences in silico and selected several of these putative N-binding RNAs whose secondary structures were similar to those of known N-binding RNAs. We found that overall the in silico generated RNA sequences bound well to N in vitro. Furthermore, introduction of these RNAs into cells prior to infection with RVFV inhibited viral replication in cell culture. This proof of concept study demonstrates how the predictive power of bioinformatics and the empirical power of biochemistry can be jointly harnessed to discover, synthesize, and test new RNA sequences that bind tightly to RVFV N protein. The approach would be easily generalizable to other applications. Copyright © 2015 Elsevier Ltd. All rights reserved.
Virome comparisons in wild-diseased and healthy captive giant pandas.
Zhang, Wen; Yang, Shixing; Shan, Tongling; Hou, Rong; Liu, Zhijian; Li, Wang; Guo, Lianghua; Wang, Yan; Chen, Peng; Wang, Xiaochun; Feng, Feifei; Wang, Hua; Chen, Chao; Shen, Quan; Zhou, Chenglin; Hua, Xiuguo; Cui, Li; Deng, Xutao; Zhang, Zhihe; Qi, Dunwu; Delwart, Eric
2017-08-07
The giant panda (Ailuropoda melanoleuca) is a vulnerable mammal herbivore living wild in central China. Viral infections have become a potential threat to the health of these endangered animals, but limited information related to these infections is available. Using a viral metagenomic approach, we surveyed viruses in the feces, nasopharyngeal secretions, blood, and different tissues from a wild giant panda that died from an unknown disease, a healthy wild giant panda, and 46 healthy captive animals. The previously uncharacterized complete or near complete genomes of four viruses from three genera in Papillomaviridae family, six viruses in a proposed new Picornaviridae genus (Aimelvirus), two unclassified viruses related to posaviruses in Picornavirales order, 19 anelloviruses in four different clades of Anelloviridae family, four putative circoviruses, and 15 viruses belonging to the recently described Genomoviridae family were sequenced. Reflecting the diet of giant pandas, numerous insect virus sequences related to the families Iflaviridae, Dicistroviridae, Iridoviridae, Baculoviridae, Polydnaviridae, and subfamily Densovirinae and plant viruses sequences related to the families Tombusviridae, Partitiviridae, Secoviridae, Geminiviridae, Luteoviridae, Virgaviridae, and Rhabdoviridae; genus Umbravirus, Alphaflexiviridae, and Phycodnaviridae were also detected in fecal samples. A small number of insect virus sequences were also detected in the nasopharyngeal secretions of healthy giant pandas and lung tissues from the dead wild giant panda. Although the viral families present in the sick giant panda were also detected in the healthy ones, a higher proportion of papillomaviruses, picornaviruses, and anelloviruses reads were detected in the diseased panda. This viral survey increases our understanding of eukaryotic viruses in giant pandas and provides a baseline for comparison to viruses detected in future infectious disease outbreaks. The similar viral families detected in sick and healthy giant pandas indicate that these viruses result in commensal infections in most immuno-competent animals.
Morales, Lucia; Mateos-Gomez, Pedro A.; Capiscol, Carmen; del Palacio, Lorena; Sola, Isabel
2013-01-01
Preferential RNA packaging in coronaviruses involves the recognition of viral genomic RNA, a crucial process for viral particle morphogenesis mediated by RNA-specific sequences, known as packaging signals. An essential packaging signal component of transmissible gastroenteritis coronavirus (TGEV) has been further delimited to the first 598 nucleotides (nt) from the 5′ end of its RNA genome, by using recombinant viruses transcribing subgenomic mRNA that included potential packaging signals. The integrity of the entire sequence domain was necessary because deletion of any of the five structural motifs defined within this region abrogated specific packaging of this viral RNA. One of these RNA motifs was the stem-loop SL5, a highly conserved motif in coronaviruses located at nucleotide positions 106 to 136. Partial deletion or point mutations within this motif also abrogated packaging. Using TGEV-derived defective minigenomes replicated in trans by a helper virus, we have shown that TGEV RNA packaging is a replication-independent process. Furthermore, the last 494 nt of the genomic 3′ end were not essential for packaging, although this region increased packaging efficiency. TGEV RNA sequences identified as necessary for viral genome packaging were not sufficient to direct packaging of a heterologous sequence derived from the green fluorescent protein gene. These results indicated that TGEV genome packaging is a complex process involving many factors in addition to the identified RNA packaging signal. The identification of well-defined RNA motifs within the TGEV RNA genome that are essential for packaging will be useful for designing packaging-deficient biosafe coronavirus-derived vectors and providing new targets for antiviral therapies. PMID:23966403
Müller, M; Schnitzler, P; Koonin, E V; Darai, G
1995-05-01
Cytoplasmic DNA viruses encode a DNA-dependent RNA polymerase (DdRP) that is essential for transcription of viral genes. The amino acid sequences of the known largest subunits of DdRPs from different species contain highly conserved regions. Oligonucleotide primers, deduced from two conserved domains (RQP[T/S]LH and NADFDGDE) were used for detecting the corresponding gene of fish lymphocystis disease virus (FLCDV), a member of the family Iridoviridae, which replicates in the cytoplasm of infected cells of flatfish. The gene coding for the largest subunit of the DdRP was identified using a PCR-derived probe. The screening of the complete EcoRI gene library of the viral genome led to the identification of the gene locus of the largest subunit of the DdRP within the EcoRI DNA fragment B (12.4 kbp, 0.034 to 0.165 map units). The nucleotide sequence of a part (8334 bp) of the EcoRI DNA fragment B was determined and a large ORF on the lower strand (ATG = 5787; TAA = 2190) was detected which encodes a protein of 1199 amino acids. Comparison of the amino acid sequences of the largest subunits of the DdRP (RPO1) of FLCDV and Chilo iridescent virus (CIV) revealed a dramatic difference in their domain organization. Unlike the 1051 aa RPO1 of CIV, which lacks the C-terminal domain conserved in eukaryotic, eubacterial and other viral RNA polymerases, the 1199 aa RPO1 of FLCDV is fully collinear with its cellular and viral homologues. Despite this difference, comparative analysis of the amino acid sequences of viral and cellular RNA polymerases suggests a common origin for the largest RNA polymerase subunits of FLCDV and CIV.
Walline, Heather M; Komarck, Christine M; McHugh, Jonathan B; Tang, Alice L; Owen, John H; Teh, Bin T; McKean, Erin; Glover, Thomas; Graham, Martin P; Prince, Mark E; Chepeha, Douglas B; Chinn, Steven B; Ferris, Robert L; Gollin, Susanne M; Hoffmann, Thomas K; Bier, Henning; Brakenhoff, Ruud; Bradford, Carol R; Carey, Thomas E
2017-01-01
Background HPV-positive oropharyngeal cancer is generally associated with excellent response to therapy, but some HPV-positive tumors progress despite aggressive therapy. This study evaluates viral oncogene expression and viral integration sites in HPV16 and HPV18-positive squamous carcinoma cell lines. Methods E6-E7 alternate transcripts were assessed by RT-PCR. Detection of integrated papillomavirus sequences (DIPS-PCR) and sequencing identified viral insertion sites and affected host genes. Cellular gene expression was assessed across viral integration sites. Results All HPV-positive cell lines expressed alternate HPVE6/E7 splicing indicative of active viral oncogenesis. HPV integration occurred within cancer-related genes TP63, DCC, JAK1, TERT, ATR, ETV6, PGR, PTPRN2, and TMEM237 in 8 HNSCC lines but UM-SCC-105 and UM-GCC-1 had only intergenic integration. Conclusions HPV integration into cancer-related genes occurred in 7/9 HPV-positive cell lines and of these six were from tumors that progressed. HPV integration into cancer-related genes may be a secondary carcinogenic driver in HPV-driven tumors. PMID:28236344
Development of a PCR assay to detect papillomavirus infection in the snow leopard.
Mitsouras, Katherine; Faulhaber, Erica A; Hui, Gordon; Joslin, Janis O; Eng, Curtis; Barr, Margaret C; Irizarry, Kristopher Jl
2011-07-18
Papillomaviruses (PVs) are a group of small, non-encapsulated, species-specific DNA viruses that have been detected in a variety of mammalian and avian species including humans, canines and felines. PVs cause lesions in the skin and mucous membranes of the host and after persistent infection, a subset of PVs can cause tumors such as cervical malignancies and head and neck squamous cell carcinoma in humans. PVs from several species have been isolated and their genomes have been sequenced, thereby increasing our understanding of the mechanism of viral oncogenesis and allowing for the development of molecular assays for the detection of PV infection. In humans, molecular testing for PV DNA is used to identify patients with persistent infections at risk for developing cervical cancer. In felids, PVs have been isolated and sequenced from oral papillomatous lesions of several wild species including bobcats, Asian lions and snow leopards. Since a number of wild felids are endangered, PV associated disease is a concern and there is a need for molecular tools that can be used to further study papillomavirus in these species. We used the sequence of the snow leopard papillomavirus UuPV1 to develop a PCR strategy to amplify viral DNA from samples obtained from captive animals. We designed primer pairs that flank the E6 and E7 viral oncogenes and amplify two DNA fragments encompassing these genes. We detected viral DNA for E6 and E7 in genomic DNA isolated from saliva, but not in paired blood samples from snow leopards. We verified the identity of these PCR products by restriction digest and DNA sequencing. The sequences of the PCR products were 100% identical to the published UuPV1 genome sequence. We developed a PCR assay to detect papillomavirus in snow leopards and amplified viral DNA encompassing the E6 and E7 oncogenes specifically in the saliva of animals. This assay could be utilized for the molecular investigation of papillomavirus in snow leopards using saliva, thereby allowing the detection of the virus in the anatomical site where oral papillomatous lesions develop during later stages of infection and disease development.
Development of a PCR Assay to detect Papillomavirus Infection in the Snow Leopard
2011-01-01
Background Papillomaviruses (PVs) are a group of small, non-encapsulated, species-specific DNA viruses that have been detected in a variety of mammalian and avian species including humans, canines and felines. PVs cause lesions in the skin and mucous membranes of the host and after persistent infection, a subset of PVs can cause tumors such as cervical malignancies and head and neck squamous cell carcinoma in humans. PVs from several species have been isolated and their genomes have been sequenced, thereby increasing our understanding of the mechanism of viral oncogenesis and allowing for the development of molecular assays for the detection of PV infection. In humans, molecular testing for PV DNA is used to identify patients with persistent infections at risk for developing cervical cancer. In felids, PVs have been isolated and sequenced from oral papillomatous lesions of several wild species including bobcats, Asian lions and snow leopards. Since a number of wild felids are endangered, PV associated disease is a concern and there is a need for molecular tools that can be used to further study papillomavirus in these species. Results We used the sequence of the snow leopard papillomavirus UuPV1 to develop a PCR strategy to amplify viral DNA from samples obtained from captive animals. We designed primer pairs that flank the E6 and E7 viral oncogenes and amplify two DNA fragments encompassing these genes. We detected viral DNA for E6 and E7 in genomic DNA isolated from saliva, but not in paired blood samples from snow leopards. We verified the identity of these PCR products by restriction digest and DNA sequencing. The sequences of the PCR products were 100% identical to the published UuPV1 genome sequence. Conclusions We developed a PCR assay to detect papillomavirus in snow leopards and amplified viral DNA encompassing the E6 and E7 oncogenes specifically in the saliva of animals. This assay could be utilized for the molecular investigation of papillomavirus in snow leopards using saliva, thereby allowing the detection of the virus in the anatomical site where oral papillomatous lesions develop during later stages of infection and disease development. PMID:21767399
2013-01-01
Background Up to 20% of cancers worldwide are thought to be associated with microbial pathogens, including bacteria and viruses. The widely used methods of viral infection detection are usually limited to a few a priori suspected viruses in one cancer type. To our knowledge, there have not been many broad screening approaches to address this problem more comprehensively. Methods In this study, we performed a comprehensive screening for viruses in nine common cancers using a multistep computational approach. Tumor transcriptome and genome sequencing data were available from The Cancer Genome Atlas (TCGA). Nine hundred fifty eight primary tumors in nine common cancers with poor prognosis were screened against a non-redundant database of virus sequences. DNA sequences from normal matched tissue specimens were used as controls to test whether each virus is associated with tumors. Results We identified human papilloma virus type 18 (HPV-18) and four human herpes viruses (HHV) types 4, 5, 6B, and 8, also known as EBV, CMV, roseola virus, and KSHV, in colon, rectal, and stomach adenocarcinomas. In total, 59% of screened gastrointestinal adenocarcinomas (GIA) were positive for at least one virus: 26% for EBV, 21% for CMV, 7% for HHV-6B, and 20% for HPV-18. Over 20% of tumors were co-infected with multiple viruses. Two viruses (EBV and CMV) were statistically significantly associated with colorectal cancers when compared to the matched healthy tissues from the same individuals (p = 0.02 and 0.03, respectively). HPV-18 was not detected in DNA, and thus, no association testing was possible. Nevertheless, HPV-18 expression patterns suggest viral integration in the host genome, consistent with the potentially oncogenic nature of HPV-18 in colorectal adenocarcinomas. The estimated counts of viral copies were below one per cell for all identified viruses and approached the detection limit. Conclusions Our comprehensive screening for viruses in multiple cancer types using next-generation sequencing data clearly demonstrates the presence of viral sequences in GIA. EBV, CMV, and HPV-18 are potentially causal for GIA, although their oncogenic role is yet to be established. PMID:24279398
Cui, Hongguang
2016-01-01
ABSTRACT The potyviral RNA genome encodes two polyproteins that are proteolytically processed by three viral protease domains into 11 mature proteins. Extensive molecular studies have identified functions for the majority of the viral proteins. For example, 6K2, one of the two smallest potyviral proteins, is an integral membrane protein and induces the endoplasmic reticulum (ER)-originated replication vesicles that target the chloroplast for robust viral replication. However, the functional role of 6K1, the other smallest protein, remains uncharacterized. In this study, we developed a series of recombinant full-length viral cDNA clones derived from a Canadian Plum pox virus (PPV) isolate. We found that deletion of any of the short motifs of 6K1 (each of which ranged from 5 to 13 amino acids), most of the 6K1 sequence (but with the conserved sequence of the cleavage sites being retained), or all of the 6K1 sequence in the PPV infectious clone abolished viral replication. The trans expression of 6K1 or the cis expression of a dislocated 6K1 failed to rescue the loss-of-replication phenotype, suggesting the temporal and spatial requirement of 6K1 for viral replication. Disruption of the N- or C-terminal cleavage site of 6K1, which prevented the release of 6K1 from the polyprotein, either partially or completely inhibited viral replication, suggesting the functional importance of the mature 6K1. We further found that green fluorescent protein-tagged 6K1 formed punctate inclusions at the viral early infection stage and colocalized with chloroplast-bound viral replicase elements 6K2 and NIb. Taken together, our results suggest that 6K1 is required for viral replication and is an important viral element of the viral replication complex at the early infection stage. IMPORTANCE Potyviruses account for more than 30% of known plant viruses and consist of many agriculturally important viruses. The genomes of potyviruses encode two polyproteins that are proteolytically processed into 11 mature proteins, with the majority of them having been at least partially functionally characterized. However, the functional role of a small protein named 6K1 remains obscure. In this study, we showed that deletion of 6K1 or a short motif/region of 6K1 in the full-length cDNA clones of plum pox virus abolishes viral replication and that mutation of the N- or C-terminal cleavage sites of 6K1 to prevent its release from the polyprotein greatly attenuates or completely inhibits viral replication, suggesting its important role in potyviral infection. We report that 6K1 forms punctate structures and targets the replication vesicles in PPV-infected plant leaf cells at the early infection stage. Our data reveal that 6K1 is an important viral protein of the potyviral replication complex. PMID:26962227
Eden, John-Sebastian; Read, Andrew J.; Duckworth, Janine A.; Strive, Tanja
2015-01-01
To resolve the evolutionary history of rabbit hemorrhagic disease virus (RHDV), we performed a genomic analysis of the viral stocks imported and released as a biocontrol measure in Australia, as well as a global phylogenetic analysis. Importantly, conflicts were identified between the sequences determined here and those previously published that may have affected evolutionary rate estimates. By removing likely erroneous sequences, we show that RHDV emerged only shortly before its initial description in China. PMID:26378178
Herbeck, Joshua T.; Mittler, John E.; Gottlieb, Geoffrey S.; Mullins, James I.
2014-01-01
Trends in HIV virulence have been monitored since the start of the AIDS pandemic, as studying HIV virulence informs our understanding of HIV epidemiology and pathogenesis. Here, we model changes in HIV virulence as a strictly evolutionary process, using set point viral load (SPVL) as a proxy, to make inferences about empirical SPVL trends from longitudinal HIV cohorts. We develop an agent-based epidemic model based on HIV viral load dynamics. The model contains functions for viral load and transmission, SPVL and disease progression, viral load trajectories in multiple stages of infection, and the heritability of SPVL across transmissions. We find that HIV virulence evolves to an intermediate level that balances infectiousness with longer infected lifespans, resulting in an optimal SPVL∼4.75 log10 viral RNA copies/mL. Adaptive viral evolution may explain observed HIV virulence trends: our model produces SPVL trends with magnitudes that are broadly similar to empirical trends. With regard to variation among studies in empirical SPVL trends, results from our model suggest that variation may be explained by the specific epidemic context, e.g. the mean SPVL of the founding lineage or the age of the epidemic; or improvements in HIV screening and diagnosis that results in sampling biases. We also use our model to examine trends in community viral load, a population-level measure of HIV viral load that is thought to reflect a population's overall transmission potential. We find that community viral load evolves in association with SPVL, in the absence of prevention programs such as antiretroviral therapy, and that the mean community viral load is not necessarily a strong predictor of HIV incidence. PMID:24945322
Whole-Genome Sequencing and Variant Analysis of Human Papillomavirus 16 Infections.
van der Weele, Pascal; Meijer, Chris J L M; King, Audrey J
2017-10-01
Human papillomavirus (HPV) is a strongly conserved DNA virus, high-risk types of which can cause cervical cancer in persistent infections. The most common type found in HPV-attributable cancer is HPV16, which can be subdivided into four lineages (A to D) with different carcinogenic properties. Studies have shown HPV16 sequence diversity in different geographical areas, but only limited information is available regarding HPV16 diversity within a population, especially at the whole-genome level. We analyzed HPV16 major variant diversity and conservation in persistent infections and performed a single nucleotide polymorphism (SNP) comparison between persistent and clearing infections. Materials were obtained in the Netherlands from a cohort study with longitudinal follow-up for up to 3 years. Our analysis shows a remarkably large variant diversity in the population. Whole-genome sequences were obtained for 57 persistent and 59 clearing HPV16 infections, resulting in 109 unique variants. Interestingly, persistent infections were completely conserved through time. One reinfection event was identified where the initial and follow-up samples clustered differently. Non-A1/A2 variants seemed to clear preferentially ( P = 0.02). Our analysis shows that population-wide HPV16 sequence diversity is very large. In persistent infections, the HPV16 sequence was fully conserved. Sequencing can identify HPV16 reinfections, although occurrence is rare. SNP comparison identified no strongly acting effect of the viral genome affecting HPV16 infection clearance or persistence in up to 3 years of follow-up. These findings suggest the progression of an early HPV16 infection could be host related. IMPORTANCE Human papillomavirus 16 (HPV16) is the predominant type found in cervical cancer. Progression of initial infection to cervical cancer has been linked to sequence properties; however, knowledge of variants circulating in European populations, especially with longitudinal follow-up, is limited. By sequencing a number of infections with known follow-up for up to 3 years, we gained initial insights into the genetic diversity of HPV16 and the effects of the viral genome on the persistence of infections. A SNP comparison between sequences obtained from clearing and persistent infections did not identify strongly acting DNA variations responsible for these infection outcomes. In addition, we identified an HPV16 reinfection event where sequencing of initial and follow-up samples showed different HPV16 variants. Based on conventional genotyping, this infection would incorrectly be considered a persistent HPV16 infection. In the context of vaccine efficacy and monitoring studies, such infections could potentially cause reduced reported efficacy or efficiency. Copyright © 2017 van der Weele et al.
Improved bacteriophage genome data is necessary for integrating viral and bacterial ecology.
Bibby, Kyle
2014-02-01
The recent rise in "omics"-enabled approaches has lead to improved understanding in many areas of microbial ecology. However, despite the importance that viruses play in a broad microbial ecology context, viral ecology remains largely not integrated into high-throughput microbial ecology studies. A fundamental hindrance to the integration of viral ecology into omics-enabled microbial ecology studies is the lack of suitable reference bacteriophage genomes in reference databases-currently, only 0.001% of bacteriophage diversity is represented in genome sequence databases. This commentary serves to highlight this issue and to promote bacteriophage genome sequencing as a valuable scientific undertaking to both better understand bacteriophage diversity and move towards a more holistic view of microbial ecology.
Turina, Massimo; Ghignone, Stefano; Astolfi, Nausicaa; Silvestri, Alessandro; Bonfante, Paola; Lanfranco, Luisa
2018-02-02
Arbuscular Mycorrhizal Fungi (AMF) are key components of the plant microbiota. AMF genetic complexity is increased by the presence of endobacteria, which live inside many species. A further component of such complexity is the virome associated to AMF, whose knowledge is still very limited. Here, by exploiting transcriptomic data we describe the virome of Gigaspora margarita. A BLAST search for viral RNA-dependent RNA polymerases sequences allowed the identification of four mitoviruses, one Ourmia-like narnavirus, one Giardia-like virus, and two sequences related to Fusarium graminearum mycoviruses. Northern blot and RT-PCR confirmed the authenticity of all the sequences with the exception of the F. graminearum-related ones. All the mitoviruses are replicative and functional since both positive strand and negative strand RNA are present. The abundance of the viral RNA molecules is not regulated by the presence or absence of Candidatus Glomeribacter gigasporarum, the endobacterium hosted by G. margarita, with the exception of the Ourmia-like sequence which is absent in bacteria-cured spores. In addition, we report, for the first time, DNA fragments corresponding to mitovirus sequences associated to the presence of viral RNA. These sequences are not integrated in the mitochondrial DNA and preliminary evidence seems to exclude integration in the nuclear genome. © 2018 Society for Applied Microbiology and John Wiley & Sons Ltd.
Chang, Suhua; Zhang, Jiajie; Liao, Xiaoyun; Zhu, Xinxing; Wang, Dahai; Zhu, Jiang; Feng, Tao; Zhu, Baoli; Gao, George F; Wang, Jian; Yang, Huanming; Yu, Jun; Wang, Jing
2007-01-01
Frequent outbreaks of highly pathogenic avian influenza and the increasing data available for comparative analysis require a central database specialized in influenza viruses (IVs). We have established the Influenza Virus Database (IVDB) to integrate information and create an analysis platform for genetic, genomic, and phylogenetic studies of the virus. IVDB hosts complete genome sequences of influenza A virus generated by Beijing Institute of Genomics (BIG) and curates all other published IV sequences after expert annotation. Our Q-Filter system classifies and ranks all nucleotide sequences into seven categories according to sequence content and integrity. IVDB provides a series of tools and viewers for comparative analysis of the viral genomes, genes, genetic polymorphisms and phylogenetic relationships. A search system has been developed for users to retrieve a combination of different data types by setting search options. To facilitate analysis of global viral transmission and evolution, the IV Sequence Distribution Tool (IVDT) has been developed to display the worldwide geographic distribution of chosen viral genotypes and to couple genomic data with epidemiological data. The BLAST, multiple sequence alignment and phylogenetic analysis tools were integrated for online data analysis. Furthermore, IVDB offers instant access to pre-computed alignments and polymorphisms of IV genes and proteins, and presents the results as SNP distribution plots and minor allele distributions. IVDB is publicly available at http://influenza.genomics.org.cn.
Dimonte, Salvatore
2017-01-01
Antisense protein (ASP) is the new actor of viral life of Human Immunodeficiency Virus type 1 (HIV-1) although proposed above 20 years ago. The asp ORF is into complementary strand of the gp120/gp41 junction of env gene. The ASP biological role remains little known. Knowing the Env markers of viral tropism, a dataset of sequences (660 strains) was used to analyze the hypothetical ASP involvement in CCR5 (R5) and/or CXCR4 (X4) co-receptor interaction. Preliminarily, prevalence of ASP and gp120 V3 mutations was performed; following association among mutations were elaborate. The classical V3 tropic-signatures were confirmed, and 36 R5- and 22 X4-tropic ASP mutations were found. Moreover, by analyzing the ASP sequences, 36 out of 179 amino acid positions significantly associated with different co-receptor usage were found. Several statistically significant associations between gp120 V3 and ASP mutations were observed. The dendrogram showed the existence of a cluster associated with R5-usage and a large cluster associated with X4-usage. These results show that gp120 V3 and specific amino acid changes in ASP are associated together with CXCR4 and/or CCR5-usage. These findings implement previous observations on unclear ASP functions. J. Med. Virol. 89:112-122, 2017. © 2016 Wiley Periodicals, Inc. © 2016 Wiley Periodicals, Inc.
Global patterns in coronavirus diversity
Johnson, Christine K.; Greig, Denise J.; Kramer, Sarah; Che, Xiaoyu; Wells, Heather; Hicks, Allison L.; Joly, Damien O.; Wolfe, Nathan D.; Daszak, Peter; Karesh, William; Lipkin, W. I.; Morse, Stephen S.; Mazet, Jonna A. K.
2017-01-01
Abstract Since the emergence of Severe Acute Respiratory Syndrome Coronavirus (SARS-CoV) and Middle East Respiratory Syndrom Coronavirus (MERS-CoV) it has become increasingly clear that bats are important reservoirs of CoVs. Despite this, only 6% of all CoV sequences in GenBank are from bats. The remaining 94% largely consist of known pathogens of public health or agricultural significance, indicating that current research effort is heavily biased towards describing known diseases rather than the ‘pre-emergent’ diversity in bats. Our study addresses this critical gap, and focuses on resource poor countries where the risk of zoonotic emergence is believed to be highest. We surveyed the diversity of CoVs in multiple host taxa from twenty countries to explore the factors driving viral diversity at a global scale. We identified sequences representing 100 discrete phylogenetic clusters, ninety-one of which were found in bats, and used ecological and epidemiologic analyses to show that patterns of CoV diversity correlate with those of bat diversity. This cements bats as the major evolutionary reservoirs and ecological drivers of CoV diversity. Co-phylogenetic reconciliation analysis was also used to show that host switching has contributed to CoV evolution, and a preliminary analysis suggests that regional variation exists in the dynamics of this process. Overall our study represents a model for exploring global viral diversity and advances our fundamental understanding of CoV biodiversity and the potential risk factors associated with zoonotic emergence. PMID:28630747
Jesús, Torres; Rogelio, López; Abraham, Cetina; Uriel, López; J- Daniel, García; Alfonso, Méndez-Tenorio; Lilia, Barrón Blanca
2012-01-01
There are very few antiviral drugs available to fight viral infections and the appearance of viral strains resistant to these antivirals is not a rare event. Hence, the design of new antiviral drugs is important. We describe the prediction of peptides with antiviral activity (AVP) derived from the viral glycoproteins involved in the entrance of herpes simplex (HSV) and influenza A viruses into their host cells. It is known, that during this event viral glycoproteins suffer several conformational changes due to protein-protein interactions, which lead to membrane fusion between the viral envelope and the cellular membrane. Our hypothesis is that AVPs can be derived from these viral glycoproteins, specifically from regions highly conserved in amino acid sequences, which at the same time have the physicochemical properties of being highly exposed (antigenic), hydrophilic, flexible, and charged, since these properties are important for protein-protein interactions. For that, we separately analyzed the HSV glycoprotein H and B, and influenza A viruses hemagglutinin (HA), using several bioinformatics tools. A set of multiple alignments was carried out, to find the most conserved regions in the amino acid sequences. Then, the physicochemical properties indicated above were analyzed. We predicted several peptides 12-20 amino acid length which by docking analysis were able to interact with the fusion viral glycoproteins and thus may prevent conformational changes in them, blocking the viral infection. Our strategy to design AVPs seems to be very promising since the peptides were synthetized and their antiviral activities have produced very encouraging results. PMID:23144542
Gambelli, Lavinia; Cremers, Geert; Mesman, Rob; Guerrero, Simon; Dutilh, Bas E.; Jetten, Mike S. M.; Op den Camp, Huub J. M.; van Niftrik, Laura
2016-01-01
With its capacity for anaerobic methane oxidation and denitrification, the bacterium Methylomirabilis oxyfera plays an important role in natural ecosystems. Its unique physiology can be exploited for more sustainable wastewater treatment technologies. However, operational stability of full-scale bioreactors can experience setbacks due to, for example, bacteriophage blooms. By shaping microbial communities through mortality, horizontal gene transfer, and metabolic reprogramming, bacteriophages are important players in most ecosystems. Here, we analyzed an infected Methylomirabilis sp. bioreactor enrichment culture using (advanced) electron microscopy, viral metagenomics and bioinformatics. Electron micrographs revealed four different viral morphotypes, one of which was observed to infect Methylomirabilis cells. The infected cells contained densely packed ~55 nm icosahedral bacteriophage particles with a putative internal membrane. Various stages of virion assembly were observed. Moreover, during the bacteriophage replication, the host cytoplasmic membrane appeared extremely patchy, which suggests that the bacteriophages may use host bacterial lipids to build their own putative internal membrane. The viral metagenome contained 1.87 million base pairs of assembled viral sequences, from which five putative complete viral genomes were assembled and manually annotated. Using bioinformatics analyses, we could not identify which viral genome belonged to the Methylomirabilis- infecting bacteriophage, in part because the obtained viral genome sequences were novel and unique to this reactor system. Taken together these results show that new bacteriophages can be detected in anaerobic cultivation systems and that the effect of bacteriophages on the microbial community in these systems is a topic for further study. PMID:27877158
Verma, Anjali; Rajagopalan, Pavithra; Lotke, Rishikesh; Varghese, Rebu; Selvam, Deepak; Kundu, Tapas K.
2016-01-01
ABSTRACT Of the various genetic subtypes of human immunodeficiency virus types 1 and 2 (HIV-1 and HIV-2) and simian immunodeficiency virus (SIV), only in subtype C of HIV-1 is a genetically variant NF-κB binding site found at the core of the viral promoter in association with a subtype-specific Sp1III motif. How the subtype-associated variations in the core transcription factor binding sites (TFBS) influence gene expression from the viral promoter has not been examined previously. Using panels of infectious viral molecular clones, we demonstrate that subtype-specific NF-κB and Sp1III motifs have evolved for optimal gene expression, and neither of the motifs can be replaced by a corresponding TFBS variant. The variant NF-κB motif binds NF-κB with an affinity 2-fold higher than that of the generic NF-κB site. Importantly, in the context of an infectious virus, the subtype-specific Sp1III motif demonstrates a profound loss of function in association with the generic NF-κB motif. An additional substitution of the Sp1III motif fully restores viral replication, suggesting that the subtype C-specific Sp1III has evolved to function with the variant, but not generic, NF-κB motif. A change of only two base pairs in the central NF-κB motif completely suppresses viral transcription from the provirus and converts the promoter into heterochromatin refractory to tumor necrosis factor alpha (TNF-α) induction. The present work represents the first demonstration of functional incompatibility between an otherwise functional NF-κB motif and a unique Sp1 site in the context of an HIV-1 promoter. Our work provides important leads as to the evolution of the HIV-1 subtype C viral promoter with relevance for gene expression regulation and viral latency. IMPORTANCE Subtype-specific genetic variations provide a powerful tool to examine how these variations offer a replication advantage to specific viral subtypes, if any. Only in subtype C of HIV-1 are two genetically distinct transcription factor binding sites positioned at the most critical location of the viral promoter. Since a single promoter regulates viral gene expression, the promoter variations can play a critical role in determining the replication fitness of the viral strains. Our work for the first time provides a scientific explanation for the presence of a unique NF-κB binding motif in subtype C, a major HIV-1 genetic family responsible for half of the global HIV-1 infections. The results offer compelling evidence that the subtype C viral promoter not only is stronger but also is endowed with a qualitative gain-of-function advantage. The genetically variant NF-κB and the Sp1III motifs may be respond differently to specific cell signal pathways, and these mechanisms must be examined. PMID:27194770
Isakov, Ofer; Bordería, Antonio V; Golan, David; Hamenahem, Amir; Celniker, Gershon; Yoffe, Liron; Blanc, Hervé; Vignuzzi, Marco; Shomron, Noam
2015-07-01
The study of RNA virus populations is a challenging task. Each population of RNA virus is composed of a collection of different, yet related genomes often referred to as mutant spectra or quasispecies. Virologists using deep sequencing technologies face major obstacles when studying virus population dynamics, both experimentally and in natural settings due to the relatively high error rates of these technologies and the lack of high performance pipelines. In order to overcome these hurdles we developed a computational pipeline, termed ViVan (Viral Variance Analysis). ViVan is a complete pipeline facilitating the identification, characterization and comparison of sequence variance in deep sequenced virus populations. Applying ViVan on deep sequenced data obtained from samples that were previously characterized by more classical approaches, we uncovered novel and potentially crucial aspects of virus populations. With our experimental work, we illustrate how ViVan can be used for studies ranging from the more practical, detection of resistant mutations and effects of antiviral treatments, to the more theoretical temporal characterization of the population in evolutionary studies. Freely available on the web at http://www.vivanbioinfo.org : nshomron@post.tau.ac.il Supplementary data are available at Bioinformatics online. © The Author 2015. Published by Oxford University Press.
Davidsson, Marcus; Diaz-Fernandez, Paula; Schwich, Oliver D.; Torroba, Marcos; Wang, Gang; Björklund, Tomas
2016-01-01
Detailed characterization and mapping of oligonucleotide function in vivo is generally a very time consuming effort that only allows for hypothesis driven subsampling of the full sequence to be analysed. Recent advances in deep sequencing together with highly efficient parallel oligonucleotide synthesis and cloning techniques have, however, opened up for entirely new ways to map genetic function in vivo. Here we present a novel, optimized protocol for the generation of universally applicable, barcode labelled, plasmid libraries. The libraries are designed to enable the production of viral vector preparations assessing coding or non-coding RNA function in vivo. When generating high diversity libraries, it is a challenge to achieve efficient cloning, unambiguous barcoding and detailed characterization using low-cost sequencing technologies. With the presented protocol, diversity of above 3 million uniquely barcoded adeno-associated viral (AAV) plasmids can be achieved in a single reaction through a process achievable in any molecular biology laboratory. This approach opens up for a multitude of in vivo assessments from the evaluation of enhancer and promoter regions to the optimization of genome editing. The generated plasmid libraries are also useful for validation of sequencing clustering algorithms and we here validate the newly presented message passing clustering process named Starcode. PMID:27874090
The Lyssavirus glycoprotein: A key to cross-immunity.
Buthelezi, Sindisiwe G; Dirr, Heini W; Chakauya, Ereck; Chikwamba, Rachel; Martens, Lennart; Tsekoa, Tsepo L; Stoychev, Stoyan H; Vandermarliere, Elien
2016-11-01
Rabies is an acute viral encephalomyelitis in warm-blooded vertebrates, caused by viruses belonging to Rhabdovirus family and genus Lyssavirus. Although rabies is categorised as a neglected disease, the rabies virus (RABV) is the most studied amongst Lyssaviruses which show nearly identical infection patterns. In efforts to improving post-exposure prophylaxis, several anti-rabies monoclonal antibodies (mAbs) targeting the glycoprotein (G protein) sites I, II, III and G5 have been characterized. To explore cross-neutralization capacity of available mAbs and discover new possible B-cell epitopes, we have analyzed all available glycoprotein sequences from Lyssaviruses with a focus on sequence variation and conservation. This information was mapped on the structure of a representative G protein. We proposed several possible cross-neutralizing B-cell epitopes (GUVTTTF, WLRTV, REECLD and EHLVVEEL) in complement to the already well-characterized antigenic sites. The research could facilitate development of novel cross-reactive mAbs against RABV and even more broad, against possibly all Lyssavirus members. Copyright © 2016 Elsevier Inc. All rights reserved.
Quackenbush, S.L.; Casey, R.N.; Murcek, R.J.; Paul, T.A.; Work, Thierry M.; Limpus, C.J.; Chaves, A.; duToit, L.; Perez, J.V.; Aguirre, A.A.; Spraker, T.R.; Horrocks, J.A.; Vermeer, L.A.; Balazs, G.S.; Casey, J.W.
2001-01-01
Quantitative real-time PCR has been used to measure fibropapilloma-associated turtle herpesvirus (FPTHV) pol DNA loads in fibropapillomas, fibromas, and uninvolved tissues of green, loggerhead, and olive ridley turtles from Hawaii, Florida, Costa Rica, Australia, Mexico, and the West Indies. The viral DNA loads from tumors obtained from terminal animals were relatively homogenous (range 2a??20 copies/cell), whereas DNA copy numbers from biopsied tumors and skin of otherwise healthy turtles displayed a wide variation (range 0.001a??170 copies/cell) and may reflect the stage of tumor development. FPTHV DNA loads in tumors were 2.5a??4.5 logs higher than in uninvolved skin from the same animal regardless of geographic location, further implying a role for FPTHV in the etiology of fibropapillomatosis. Although FPTHV pol sequences amplified from tumors are highly related to each other, single signature amino acid substitutions distinguish the Australia/Hawaii, Mexico/Costa Rica, and Florida/Caribbean groups.
2018-01-01
ABSTRACT Primary infection with human cytomegalovirus (HCMV) results in a lifelong infection due to its ability to establish latent infection, with one characterized viral reservoir being hematopoietic cells. Although reactivation from latency causes serious disease in immunocompromised individuals, our molecular understanding of latency is limited. Here, we delineate viral gene expression during natural HCMV persistent infection by analyzing the massive transcriptome RNA sequencing (RNA-seq) atlas generated by the Genotype-Tissue Expression (GTEx) project. This systematic analysis reveals that HCMV persistence in vivo is prevalent in diverse tissues. Notably, we find only viral transcripts that resemble gene expression during various stages of lytic infection with no evidence of any highly restricted latency-associated viral gene expression program. To further define the transcriptional landscape during HCMV latent infection, we also used single-cell RNA-seq and a tractable experimental latency model. In contrast to some current views on latency, we also find no evidence for any highly restricted latency-associated viral gene expression program. Instead, we reveal that latency-associated gene expression largely mirrors a late lytic viral program, albeit at much lower levels of expression. Overall, our work has the potential to revolutionize our understanding of HCMV persistence and suggests that latency is governed mainly by quantitative changes, with a limited number of qualitative changes, in viral gene expression. PMID:29535194
Gao, Fei; Qu, Zehui; Li, Liwei; Yu, Lingxue; Jiang, Yifeng; Zhou, Yanjun; Yang, Shen; Zheng, Hao; Huang, Qinfeng; Tong, Wu; Tong, Guangzhi
2016-08-01
Porcine reproductive and respiratory syndrome virus (PRRSV) has a condensed single-stranded positive-sense RNA genome that contains several overlapping regions. The transcription regulatory sequence (TRS) is the important cis-acting element participating in PRRSV discontinuous transcription process. Based on reverse genetic system of type 2 highly pathogenic PRRSV cell-passage attenuated strain pHuN4-F112, firefly luciferase or Renilla luciferase genes were inserted between ORF1b and ORF2. An extra TRS6 was embedded behind the foreign luciferase genes. pA-Fluc and pA-Rluc were constructed and successfully rescued in MARC-145 cells. The phenotypical characteristics of the progeny virus were indistinguishable from those of vHuN4-F112 and were genetically stable for at least 25 cell passages. Mutant virus-infected cells were lysed at different time points to assess luciferase activities and measure foreign gene expression levels. The results showed identical variations in the luciferase activities of the recombinants in MARC-145 cells, indicating that they were suitable for monitoring viral propagation in PRRSV-permissive cell cultures. They were also used to infect pulmonary alveolar macrophages, which yielded similar variations in luciferase activities. Therefore, vA-Fluc and vA-Rluc present powerful new tools to monitor PRRSV propagation in both passaged and target cells. Copyright © 2016 Elsevier Ltd. All rights reserved.
Cao-Lormeau, Van-Mai; Lambrechts, Louis
2017-01-01
Abstract Like other pathogens with high mutation and replication rates, within-host dengue virus (DENV) populations evolve during infection of their main mosquito vector, Aedes aegypti. Within-host DENV evolution during transmission provides opportunities for adaptation and emergence of novel virus variants. Recent studies of DENV genetic diversity failed to detect convergent evolution of adaptive mutations in mosquito tissues such as midgut and salivary glands, suggesting that convergent positive selection is not a major driver of within-host DENV evolution in the vector. However, it is unknown whether this conclusion extends to the transmitted viral subpopulation because it is technically difficult to sequence DENV genomes in mosquito saliva. Here, we achieved DENV full-genome sequencing by pooling saliva samples collected non-sacrificially from 49 to 163 individual Ae. aegypti mosquitoes previously infected with one of two DENV-1 genotypes. We compared the transmitted viral subpopulations found in the pooled saliva samples collected in time series with the input viral population present in the infectious blood meal. In all pooled saliva samples examined, the full-genome consensus sequence of the input viral population was unchanged. Although the pooling strategy prevents analysis of individual saliva samples, our results demonstrate the lack of strong convergent positive selection during a single round of DENV transmission by Ae. aegypti. This finding reinforces the idea that genetic drift and purifying selection are the dominant evolutionary forces shaping within-host DENV genetic diversity during transmission by mosquitoes. PMID:29497564
The Viral Evolution Core within the AIDS and Cancer Virus Program will extract viral RNA/DNA from cell-free or cell-associated samples. Complementary (cDNA) will be generated as needed, and cDNA or DNA will be diluted to a single copy prior to nested
Genetic diversity of Trichomonas vaginalis clinical isolates from Henan province in central China.
Mao, Meng; Liu, Hui Li
2015-07-01
Trichomonas vaginalis is a flagellated protozoan parasite that infects the human urogenital tract, causing the most common non-viral, sexually transmitted disease worldwide. In this study, genetic variants of T. vaginalis were identified in Henan Province, China. Fragments of the small subunit of nuclear ribosomal RNA (18S rRNA) were amplified from 32 T. vaginalis isolates obtained from seven regions of Henan Province. Overall, 18 haplotypes were determined from the 18S rRNA sequences. Each sampled population and the total population displayed high haplotype diversity (Hd), accompanied by very low nucleotide diversity (Pi). In these molecular genetic variants, 91.58% genetic variation was derived from intra-regions. Phylogenetic analysis revealed no correlation between phylogeny and geographic distribution. Demographic analysis supported population expansion of T. vaginalis isolates from central China. Our findings showing moderate-to-high genetic variations in the 32 isolates of T. vaginalis provide useful knowledge for monitoring changes in parasite populations for the development of future control strategies.
Betz-Stablein, B. D.; Töpfer, A.; Littlejohn, M.; Yuen, L.; Colledge, D.; Sozzi, V.; Angus, P.; Thompson, A.; Revill, P.; Beerenwinkel, N.; Warner, N.
2016-01-01
ABSTRACT Chronic hepatitis B (CHB) is prevalent worldwide. The infectious agent, hepatitis B virus (HBV), replicates via an RNA intermediate and is error prone, leading to the rapid generation of closely related but not identical viral variants, including those that can escape host immune responses and antiviral treatments. The complexity of CHB can be further enhanced by the presence of HBV variants with large deletions in the genome generated via splicing (spHBV variants). Although spHBV variants are incapable of autonomous replication, their replication is rescued by wild-type HBV. spHBV variants have been shown to enhance wild-type virus replication, and their prevalence increases with liver disease progression. Single-molecule deep sequencing was performed on whole HBV genomes extracted from samples, including the liver explant, longitudinally collected from a subject with CHB over a 15-year period after liver transplantation. By employing novel bioinformatics methods, this analysis showed that the dynamics of the viral population across a period of changing treatment regimens was complex. The spHBV variants detected in the liver explant remained present posttransplantation, and a highly diverse novel spHBV population as well as variants with multiple deletions in the pre-S genes emerged. The identification of novel mutations outside the HBV reverse transcriptase gene that co-occurred with known drug resistance-associated mutations highlights the relevance of using full-genome deep sequencing and supports the hypothesis that drug resistance involves interactions across the full length of the HBV genome. IMPORTANCE Single-molecule sequencing allowed the characterization, in unprecedented detail, of the evolution of HBV populations and offered unique insights into the dynamics of defective and spHBV variants following liver transplantation and complex treatment regimens. This analysis also showed the rapid adaptation of HBV populations to treatment regimens with evolving drug resistance phenotypes and evidence of purifying selection across the whole genome. Finally, the new open-source bioinformatics tools with the capacity to easily identify potential spliced variants from deep sequencing data are freely available. PMID:27252524
Complexity and dynamics of HIV-1 chemokine receptor usage in a multidrug-resistant adolescent.
Cavarelli, Mariangela; Mainetti, Lara; Pignataro, Angela Rosa; Bigoloni, Alba; Tolazzi, Monica; Galli, Andrea; Nozza, Silvia; Castagna, Antonella; Sampaolo, Michela; Boeri, Enzo; Scarlatti, Gabriella
2014-12-01
Maraviroc (MVC) is licensed in clinical practice for patients with R5 virus and virological failure; however, in anecdotal reports, dual/mixed viruses were also inhibited. We retrospectively evaluated the evolution of HIV-1 coreceptor tropism in plasma and peripheral blood mononuclear cells (PBMCs) of an infected adolescent with a CCR5/CXCR4 Trofile profile who experienced an important but temporary immunological and virological response during a 16-month period of MVC-based therapy. Coreceptor usage of biological viral clones isolated from PBMCs was investigated in U87.CD4 cells expressing wild-type or chimeric CCR5 and CXCR4. Plasma and PBMC-derived viral clones were sequenced to predict coreceptor tropism using the geno2pheno algorithm from the V3 envelope sequence and pol gene-resistant mutations. From start to 8.5 months of MVC treatment only R5X4 viral clones were observed, whereas at 16 months the phenotype enlarged to also include R5 and X4 clones. Chimeric receptor usage suggested the preferential usage of the CXCR4 coreceptor by the R5X4 biological clones. According to phenotypic data, R5 viruses were susceptible, whereas R5X4 and X4 viruses were resistant to RANTES and MVC in vitro. Clones at 16 months, but not at baseline, showed an amino acidic resistance pattern in protease and reverse transcription genes, which, however, did not drive their tropisms. The geno2pheno algorithm predicted at baseline R5 viruses in plasma, and from 5.5 months throughout follow-up only CXCR4-using viruses. An extended methodological approach is needed to unravel the complexity of the phenotype and variation of viruses resident in the different compartments of an infected individual. The accurate evaluation of the proportion of residual R5 viruses may guide therapeutic intervention in highly experienced patients with limited therapeutic options.
Complexity and Dynamics of HIV-1 Chemokine Receptor Usage in a Multidrug-Resistant Adolescent
Mainetti, Lara; Pignataro, Angela Rosa; Bigoloni, Alba; Tolazzi, Monica; Galli, Andrea; Nozza, Silvia; Castagna, Antonella; Sampaolo, Michela; Boeri, Enzo; Scarlatti, Gabriella
2014-01-01
Abstract Maraviroc (MVC) is licensed in clinical practice for patients with R5 virus and virological failure; however, in anecdotal reports, dual/mixed viruses were also inhibited. We retrospectively evaluated the evolution of HIV-1 coreceptor tropism in plasma and peripheral blood mononuclear cells (PBMCs) of an infected adolescent with a CCR5/CXCR4 Trofile profile who experienced an important but temporary immunological and virological response during a 16-month period of MVC-based therapy. Coreceptor usage of biological viral clones isolated from PBMCs was investigated in U87.CD4 cells expressing wild-type or chimeric CCR5 and CXCR4. Plasma and PBMC-derived viral clones were sequenced to predict coreceptor tropism using the geno2pheno algorithm from the V3 envelope sequence and pol gene-resistant mutations. From start to 8.5 months of MVC treatment only R5X4 viral clones were observed, whereas at 16 months the phenotype enlarged to also include R5 and X4 clones. Chimeric receptor usage suggested the preferential usage of the CXCR4 coreceptor by the R5X4 biological clones. According to phenotypic data, R5 viruses were susceptible, whereas R5X4 and X4 viruses were resistant to RANTES and MVC in vitro. Clones at 16 months, but not at baseline, showed an amino acidic resistance pattern in protease and reverse transcription genes, which, however, did not drive their tropisms. The geno2pheno algorithm predicted at baseline R5 viruses in plasma, and from 5.5 months throughout follow-up only CXCR4-using viruses. An extended methodological approach is needed to unravel the complexity of the phenotype and variation of viruses resident in the different compartments of an infected individual. The accurate evaluation of the proportion of residual R5 viruses may guide therapeutic intervention in highly experienced patients with limited therapeutic options. PMID:25275490
Smirnova, Ekaterina; Firth, Andrew E; Miller, W Allen; Scheidecker, Danièle; Brault, Véronique; Reinbold, Catherine; Rakotondrafara, Aurélie M; Chung, Betty Y-W; Ziegler-Graff, Véronique
2015-05-01
Viruses in the family Luteoviridae have positive-sense RNA genomes of around 5.2 to 6.3 kb, and they are limited to the phloem in infected plants. The Luteovirus and Polerovirus genera include all but one virus in the Luteoviridae. They share a common gene block, which encodes the coat protein (ORF3), a movement protein (ORF4), and a carboxy-terminal extension to the coat protein (ORF5). These three proteins all have been reported to participate in the phloem-specific movement of the virus in plants. All three are translated from one subgenomic RNA, sgRNA1. Here, we report the discovery of a novel short ORF, termed ORF3a, encoded near the 5' end of sgRNA1. Initially, this ORF was predicted by statistical analysis of sequence variation in large sets of aligned viral sequences. ORF3a is positioned upstream of ORF3 and its translation initiates at a non-AUG codon. Functional analysis of the ORF3a protein, P3a, was conducted with Turnip yellows virus (TuYV), a polerovirus, for which translation of ORF3a begins at an ACG codon. ORF3a was translated from a transcript corresponding to sgRNA1 in vitro, and immunodetection assays confirmed expression of P3a in infected protoplasts and in agroinoculated plants. Mutations that prevent expression of P3a, or which overexpress P3a, did not affect TuYV replication in protoplasts or inoculated Arabidopsis thaliana leaves, but prevented virus systemic infection (long-distance movement) in plants. Expression of P3a from a separate viral or plasmid vector complemented movement of a TuYV mutant lacking ORF3a. Subcellular localization studies with fluorescent protein fusions revealed that P3a is targeted to the Golgi apparatus and plasmodesmata, supporting an essential role for P3a in viral movement.
Smirnova, Ekaterina; Firth, Andrew E.; Miller, W. Allen; Scheidecker, Danièle; Brault, Véronique; Reinbold, Catherine; Rakotondrafara, Aurélie M.; Chung, Betty Y.-W.; Ziegler-Graff, Véronique
2015-01-01
Viruses in the family Luteoviridae have positive-sense RNA genomes of around 5.2 to 6.3 kb, and they are limited to the phloem in infected plants. The Luteovirus and Polerovirus genera include all but one virus in the Luteoviridae. They share a common gene block, which encodes the coat protein (ORF3), a movement protein (ORF4), and a carboxy-terminal extension to the coat protein (ORF5). These three proteins all have been reported to participate in the phloem-specific movement of the virus in plants. All three are translated from one subgenomic RNA, sgRNA1. Here, we report the discovery of a novel short ORF, termed ORF3a, encoded near the 5’ end of sgRNA1. Initially, this ORF was predicted by statistical analysis of sequence variation in large sets of aligned viral sequences. ORF3a is positioned upstream of ORF3 and its translation initiates at a non-AUG codon. Functional analysis of the ORF3a protein, P3a, was conducted with Turnip yellows virus (TuYV), a polerovirus, for which translation of ORF3a begins at an ACG codon. ORF3a was translated from a transcript corresponding to sgRNA1 in vitro, and immunodetection assays confirmed expression of P3a in infected protoplasts and in agroinoculated plants. Mutations that prevent expression of P3a, or which overexpress P3a, did not affect TuYV replication in protoplasts or inoculated Arabidopsis thaliana leaves, but prevented virus systemic infection (long-distance movement) in plants. Expression of P3a from a separate viral or plasmid vector complemented movement of a TuYV mutant lacking ORF3a. Subcellular localization studies with fluorescent protein fusions revealed that P3a is targeted to the Golgi apparatus and plasmodesmata, supporting an essential role for P3a in viral movement. PMID:25946037
Chiu, Elliott S; Hoover, Edward A; VandeWoude, Sue
2018-01-10
Feline leukemia virus (FeLV) was the first feline retrovirus discovered, and is associated with multiple fatal disease syndromes in cats, including lymphoma. The original research conducted on FeLV employed classical virological techniques. As methods have evolved to allow FeLV genetic characterization, investigators have continued to unravel the molecular pathology associated with this fascinating agent. In this review, we discuss how FeLV classification, transmission, and disease-inducing potential have been defined sequentially by viral interference assays, Sanger sequencing, PCR, and next-generation sequencing. In particular, we highlight the influences of endogenous FeLV and host genetics that represent FeLV research opportunities on the near horizon.
Next-Generation Sequencing: a Diagnostic One-Stop Shop for Hepatitis C?
Poljak, Mario
2016-10-01
Before starting chronic hepatitis C treatment, the viral genotype/subtype has to be accurately determined and potentially coupled with drug resistance testing. Due to the high genetic variability of the hepatitis C virus, this can be a demanding task that can potentially be streamlined by viral whole-genome sequencing using next-generation sequencing as demonstrated by an article in this issue of the Journal of Clinical Microbiology by E. Thomson, C. L. C. Ip, A. Badhan, M. T. Christiansen, W. Adamson, et al. (J Clin Microbiol. 54:2455-2469, 2016, http://dx.doi.org/10.1128/JCM.00330-16). Copyright © 2016, American Society for Microbiology. All Rights Reserved.
Spancerniene, Ugne; Grigas, Juozas; Buitkuviene, Jurate; Zymantiene, Judita; Juozaitiene, Vida; Stankeviciute, Milda; Razukevicius, Dainius; Zienius, Dainius; Stankevicius, Arunas
2018-02-23
Hepatitis E virus (HEV) is one of the major causes of acute viral hepatitis worldwide. In Europe, food-borne zoonotic transmission of HEV genotype 3 has been associated with domestic pigs and wild boar. Controversial data are available on the circulation of the virus in animals that are used for human consumption, and to date, no gold standard has yet been defined for the diagnosis of HEV-associated hepatitis. To investigate the current HEV infection status in Lithuanian pigs and wild ungulates, the presence of viral RNA was analyzed by nested reverse transcription polymerase chain reaction (RT-nPCR) in randomly selected samples, and the viral RNA was subsequently genotyped. In total, 32.98 and 22.55% of the domestic pig samples were HEV-positive using RT-nPCR targeting the ORF1 and ORF2 fragments, respectively. Among ungulates, 25.94% of the wild boar samples, 22.58% of the roe deer samples, 6.67% of the red deer samples and 7.69% of the moose samples were positive for HEV RNA using primers targeting the ORF1 fragment. Using primers targeting the ORF2 fragment of the HEV genome, viral RNA was only detected in 17.03% of the wild boar samples and 12.90% of the roe deer samples. Phylogenetic analysis based on a 348-nucleotide-long region of the HEV ORF2 showed that all obtained sequences detected in Lithuanian domestic pigs and wildlife belonged to genotype 3. In this study, the sequences identified from pigs, wild boars and roe deer clustered within the 3i subtype reference sequences from the GenBank database. The sequences obtained from pig farms located in two different counties of Lithuania were of the HEV 3f subtype. The wild boar sequences clustered within subtypes 3i and 3h, clearly indicating that wild boars can harbor additional subtypes of HEV. For the first time, the ORF2 nucleotide sequences obtained from roe deer proved that HEV subtype 3i can be found in a novel host. The results of the viral prevalence and phylogenetic analyses clearly demonstrated viral infection in Lithuanian pigs and wild ungulates, thus highlighting a significant concern for zoonotic virus transmission through both the food chain and direct contact with animals. Unexpected HEV genotype 3 subtype diversity in Lithuania and neighboring countries revealed that further studies are necessary to understand the mode of HEV transmission between animals and humans in the Baltic States region.
Thompson, T.M.; Batts, W.N.; Faisal, M.; Bowser, P.; Casey, J.W.; Phillips, K.; Garver, K.A.; Winton, J.; Kurath, G.
2011-01-01
Viral hemorrhagic septicemia virus (VHSV) is a fish rhabdovirus that causes disease in a broad range of marine and freshwater hosts. The known geographic range includes the Northern Atlantic and Pacific Oceans, and recently it has invaded the Great Lakes region of North America. The goal of this work was to characterize genetic diversity of Great Lakes VHSV isolates at the early stage of this viral emergence by comparing a partial glycoprotein (G) gene sequence (669 nt) of 108 isolates collected from 2003 to 2009 from 31 species and at 37 sites. Phylogenetic analysis showed that all isolates fell into sub-lineage IVb within the major VHSV genetic group IV. Among these 108 isolates, genetic diversity was low, with a maximum of 1.05% within the 669 nt region. There were 11 unique sequences, designated vcG001 to vcG011. Two dominant sequence types, vcG001 and vcG002, accounted for 90% (97 of 108) of the isolates. The vcG001 isolates were most widespread. We saw no apparent association of sequence type with host or year of isolation, but we did note a spatial pattern, in which vcG002 isolates were more prevalent in the easternmost sub-regions, including inland New York state and the St. Lawrence Seaway. Different sequence types were found among isolates from single disease outbreaks, and mixtures of types were evident within 2 isolates from individual fish. Overall, the genetic diversity of VHSV in the Great Lakes region was found to be extremely low, consistent with an introduction of a new virus into a geographic region with previously naïve host populations.
Thompson, Tarin M; Batts, William N; Faisal, Mohamed; Bowser, Paul; Casey, James W; Phillips, Kenneth; Garver, Kyle A; Winton, James; Kurath, Gael
2011-08-29
Viral hemorrhagic septicemia virus (VHSV) is a fish rhabdovirus that causes disease in a broad range of marine and freshwater hosts. The known geographic range includes the Northern Atlantic and Pacific Oceans, and recently it has invaded the Great Lakes region of North America. The goal of this work was to characterize genetic diversity of Great Lakes VHSV isolates at the early stage of this viral emergence by comparing a partial glycoprotein (G) gene sequence (669 nt) of 108 isolates collected from 2003 to 2009 from 31 species and at 37 sites. Phylogenetic analysis showed that all isolates fell into sub-lineage IVb within the major VHSV genetic group IV. Among these 108 isolates, genetic diversity was low, with a maximum of 1.05% within the 669 nt region. There were 11 unique sequences, designated vcG001 to vcG011. Two dominant sequence types, vcG001 and vcG002, accounted for 90% (97 of 108) of the isolates. The vcG001 isolates were most widespread. We saw no apparent association of sequence type with host or year of isolation, but we did note a spatial pattern, in which vcG002 isolates were more prevalent in the easternmost sub-regions, including inland New York state and the St. Lawrence Seaway. Different sequence types were found among isolates from single disease outbreaks, and mixtures of types were evident within 2 isolates from individual fish. Overall, the genetic diversity of VHSV in the Great Lakes region was found to be extremely low, consistent with an introduction of a new virus into a geographic region with previously naive host populations.
Nadin-Davis, Susan A; Colville, Adam; Trewby, Hannah; Biek, Roman; Real, Leslie
2017-03-15
Raccoon rabies remains a serious public health problem throughout much of the eastern seaboard of North America due to the urban nature of the reservoir host and the many challenges inherent in multi-jurisdictional efforts to administer co-ordinated and comprehensive wildlife rabies control programmes. Better understanding of the mechanisms of spread of rabies virus can play a significant role in guiding such control efforts. To facilitate a detailed molecular epidemiological study of raccoon rabies virus movements across eastern North America, we developed a methodology to efficiently determine whole genome sequences of hundreds of viral samples. The workflow combines the generation of a limited number of overlapping amplicons covering the complete viral genome and use of high throughput sequencing technology. The value of this approach is demonstrated through a retrospective phylogenetic analysis of an outbreak of raccoon rabies which occurred in the province of Ontario between 1999 and 2005. As demonstrated by the number of single nucleotide polymorphisms detected, whole genome sequence data were far more effective than single gene sequences in discriminating between samples and this facilitated the generation of more robust and informative phylogenies that yielded insights into the spatio-temporal pattern of viral spread. With minor modification this approach could be applied to other rabies virus variants thereby facilitating greatly improved phylogenetic inference and thus better understanding of the spread of this serious zoonotic disease. Such information will inform the most appropriate strategies for rabies control in wildlife reservoirs. Crown Copyright © 2017. Published by Elsevier B.V. All rights reserved.
Breyta, Rachel; McKenney, Douglas; Tesfaye, Tarin; Ono, Kotaro; Kurath, Gael
2016-01-01
Surveillance and genetic typing of field isolates of a fish rhabdovirus, infectious hematopoietic necrosis virus (IHNV), has identified four dominant viral genotypes that were involved in serial viral emergence and displacement events in steelhead trout (Oncorhynchus mykiss) in western North America. To investigate drivers of these landscape-scale events, IHNV isolates designated 007, 111, 110, and 139, representing the four relevant genotypes, were compared for virulence and infectivity in controlled laboratory challenge studies in five relevant steelhead trout populations. Viral virulence was assessed as mortality using lethal dose estimates (LD50), survival kinetics, and proportional hazards analysis. A pattern of increasing virulence for isolates 007, 111, and 110 was consistent in all five host populations tested, and correlated with serial emergence and displacements in the virus-endemic lower Columbia River source region during 1980–2013. The fourth isolate, 139, did not have higher virulence than the previous isolate 110. However, the mG139M genotype displayed a conditional displacement phenotype in that it displaced type mG110M in coastal Washington, but not in the lower Columbia River region, indicating that factors other than evolution of higher viral virulence were involved in some displacement events. Viral infectivity, measured as infectious dose (ID50), did not correlate consistently with virulence or with viral emergence, and showed a narrow range of variation relative to the variation observed in virulence. Comparison among the five steelhead trout populations confirmed variation in resistance to IHNV, but correlations with previous history of virus exposure or with sites of viral emergence varied between IHNV source and sink regions. Overall, this study indicated increasing viral virulence over time as a potential driver for emergence and displacement events in the endemic Lower Columbia River source region where these IHNV genotypes originated, but not in adjacent sink regions.
Ndunguru, Joseph; Taylor, Nigel J; Yadav, Jitender; Aly, Haytham; Legg, James P; Aveling, Terry; Thompson, Graham; Fauquet, Claude M
2005-01-01
Background Plant viral diseases present major constraints to crop production. Effective sampling of the viruses infecting plants is required to facilitate their molecular study and is essential for the development of crop protection and improvement programs. Retaining integrity of viral pathogens within sampled plant tissues is often a limiting factor in this process, most especially when sample sizes are large and when operating in developing counties and regions remote from laboratory facilities. FTA is a paper-based system designed to fix and store nucleic acids directly from fresh tissues pressed into the treated paper. We report here the use of FTA as an effective technology for sampling and retrieval of DNA and RNA viruses from plant tissues and their subsequent molecular analysis. Results DNA and RNA viruses were successfully recovered from leaf tissues of maize, cassava, tomato and tobacco pressed into FTA® Classic Cards. Viral nucleic acids eluted from FTA cards were found to be suitable for diagnostic molecular analysis by PCR-based techniques and restriction analysis, and for cloning and nucleotide sequencing in a manner equivalent to that offered by tradition isolation methods. Efficacy of the technology was demonstrated both from sampled greenhouse-grown plants and from leaf presses taken from crop plants growing in farmer's fields in East Africa. In addition, FTA technology was shown to be suitable for recovery of viral-derived transgene sequences integrated into the plant genome. Conclusion Results demonstrate that FTA is a practical, economical and sensitive method for sampling, storage and retrieval of viral pathogens and plant genomic sequences, when working under controlled conditions and in the field. Application of this technology has the potential to significantly increase ability to bring modern analytical techniques to bear on the viral pathogens infecting crop plants. PMID:15904535
Grzela, Renata; Nusbaum, Julien; Fieulaine, Sonia; Lavecchia, Francesco; Bienvenut, Willy V; Dian, Cyril; Meinnel, Thierry; Giglione, Carmela
2017-09-08
Prokaryotic proteins must be deformylated before the removal of their first methionine. Peptide deformylase (PDF) is indispensable and guarantees this mechanism. Recent metagenomics studies revealed new idiosyncratic PDF forms as the most abundant family of viral sequences. Little is known regarding these viral PDFs, including the capacity of the corresponding encoded proteins to ensure deformylase activity. We provide here the first evidence that viral PDFs, including the shortest PDF identified to date, Vp16 PDF, display deformylase activity in vivo, despite the absence of the key ribosome-interacting C-terminal region. Moreover, characterization of phage Vp16 PDF underscores unexpected structural and molecular features with the C-terminal Isoleucine residue significantly contributing to deformylase activity both in vitro and in vivo. This residue fully compensates for the absence of the usual long C-domain. Taken together, these data elucidate an unexpected mechanism of enzyme natural evolution and adaptation within viral sequences.
Whitmer, Shannon L M; Albariño, César; Shepard, Samuel S; Dudas, Gytis; Sheth, Mili; Brown, Shelley C; Cannon, Deborah; Erickson, Bobbie R; Gibbons, Aridth; Schuh, Amy; Sealy, Tara; Ervin, Elizabeth; Frace, Mike; Uyeki, Timothy M; Nichol, Stuart T; Ströher, Ute
2016-10-15
Several patients with Ebola virus disease (EVD) managed in the United States have received ZMapp monoclonal antibodies, TKM-Ebola small interfering RNA, brincidofovir, and/or convalescent plasma as investigational therapeutics. To investigate whether treatment selected for Ebola virus (EBOV) mutations conferring resistance, viral sequencing was performed on RNA extracted from clinical blood specimens from patients with EVD following treatment, and putative viral targets were analyzed. We observed no major or minor EBOV mutations within regions targeted by therapeutics. This small subset of patients and clinical specimens suggests that evolution of resistance is not a direct consequence of antiviral treatment. As EVD antiviral treatments are introduced into wider use, it is essential that continuous viral full-genome surveillance is performed, to monitor for the emergence of escape mutations. Published by Oxford University Press for the Infectious Diseases Society of America 2016. This work is written by (a) US Government employee(s) and is in the public domain in the US.
Virus Database and Online Inquiry System Based on Natural Vectors.
Dong, Rui; Zheng, Hui; Tian, Kun; Yau, Shek-Chung; Mao, Weiguang; Yu, Wenping; Yin, Changchuan; Yu, Chenglong; He, Rong Lucy; Yang, Jie; Yau, Stephen St
2017-01-01
We construct a virus database called VirusDB (http://yaulab.math.tsinghua.edu.cn/VirusDB/) and an online inquiry system to serve people who are interested in viral classification and prediction. The database stores all viral genomes, their corresponding natural vectors, and the classification information of the single/multiple-segmented viral reference sequences downloaded from National Center for Biotechnology Information. The online inquiry system serves the purpose of computing natural vectors and their distances based on submitted genomes, providing an online interface for accessing and using the database for viral classification and prediction, and back-end processes for automatic and manual updating of database content to synchronize with GenBank. Submitted genomes data in FASTA format will be carried out and the prediction results with 5 closest neighbors and their classifications will be returned by email. Considering the one-to-one correspondence between sequence and natural vector, time efficiency, and high accuracy, natural vector is a significant advance compared with alignment methods, which makes VirusDB a useful database in further research.
Falk, L; Lindahl, T; Bjursell, G; Klein, G
1979-07-15
Herpesvirus papio (HVP) is an indigenous B-lymphotropic virus of baboons (Papio sp.) present in latent form in baboon lymphoblastoid cell lines. It shares cross-reacting viral capsid and early antigens with the Epstein-Barr virus (EBV), and HVP DNA and EBV DNA show partial sequence homology. EBV-specific complementary RNA was employed here as a probe to investigate the physical state of the HVP DNA component in baboon lymphoblastoid cells after fractionation of cellular DNA by density gradient centrifugation. Five virus-producing cultures contained both free and integrated HVP DNA sequences while one non-producing cell line had two or three viral genome equivalents per cell in an apparently integrated form. Further analysis of one virus-producing line showed that the free HVP DNA fraction was composed of both linear and circular viral DNA. Contour length measurements of HVP circular DNA molecules by electron microscopy revealed that they were similar in length to the EBV circular DNA present in human lymphoblastoid cells.
The Papillomavirus Episteme: a major update to the papillomavirus sequence database.
Van Doorslaer, Koenraad; Li, Zhiwen; Xirasagar, Sandhya; Maes, Piet; Kaminsky, David; Liou, David; Sun, Qiang; Kaur, Ramandeep; Huyen, Yentram; McBride, Alison A
2017-01-04
The Papillomavirus Episteme (PaVE) is a database of curated papillomavirus genomic sequences, accompanied by web-based sequence analysis tools. This update describes the addition of major new features. The papillomavirus genomes within PaVE have been further annotated, and now includes the major spliced mRNA transcripts. Viral genes and transcripts can be visualized on both linear and circular genome browsers. Evolutionary relationships among PaVE reference protein sequences can be analysed using multiple sequence alignments and phylogenetic trees. To assist in viral discovery, PaVE offers a typing tool; a simplified algorithm to determine whether a newly sequenced virus is novel. PaVE also now contains an image library containing gross clinical and histopathological images of papillomavirus infected lesions. Database URL: https://pave.niaid.nih.gov/. Published by Oxford University Press on behalf of Nucleic Acids Research 2016. This work is written by (a) US Government employee(s) and is in the public domain in the US.
Novitsky, Vlad; Moyo, Sikhulile; Lei, Quanhong; DeGruttola, Victor; Essex, M
2015-05-01
To improve the methodology of HIV cluster analysis, we addressed how analysis of HIV clustering is associated with parameters that can affect the outcome of viral clustering. The extent of HIV clustering and tree certainty was compared between 401 HIV-1C near full-length genome sequences and subgenomic regions retrieved from the LANL HIV Database. Sliding window analysis was based on 99 windows of 1,000 bp and 45 windows of 2,000 bp. Potential associations between the extent of HIV clustering and sequence length and the number of variable and informative sites were evaluated. The near full-length genome HIV sequences showed the highest extent of HIV clustering and the highest tree certainty. At the bootstrap threshold of 0.80 in maximum likelihood (ML) analysis, 58.9% of near full-length HIV-1C sequences but only 15.5% of partial pol sequences (ViroSeq) were found in clusters. Among HIV-1 structural genes, pol showed the highest extent of clustering (38.9% at a bootstrap threshold of 0.80), although it was significantly lower than in the near full-length genome sequences. The extent of HIV clustering was significantly higher for sliding windows of 2,000 bp than 1,000 bp. We found a strong association between the sequence length and proportion of HIV sequences in clusters, and a moderate association between the number of variable and informative sites and the proportion of HIV sequences in clusters. In HIV cluster analysis, the extent of detectable HIV clustering is directly associated with the length of viral sequences used, as well as the number of variable and informative sites. Near full-length genome sequences could provide the most informative HIV cluster analysis. Selected subgenomic regions with a high extent of HIV clustering and high tree certainty could also be considered as a second choice.
Novitsky, Vlad; Moyo, Sikhulile; Lei, Quanhong; DeGruttola, Victor
2015-01-01
Abstract To improve the methodology of HIV cluster analysis, we addressed how analysis of HIV clustering is associated with parameters that can affect the outcome of viral clustering. The extent of HIV clustering and tree certainty was compared between 401 HIV-1C near full-length genome sequences and subgenomic regions retrieved from the LANL HIV Database. Sliding window analysis was based on 99 windows of 1,000 bp and 45 windows of 2,000 bp. Potential associations between the extent of HIV clustering and sequence length and the number of variable and informative sites were evaluated. The near full-length genome HIV sequences showed the highest extent of HIV clustering and the highest tree certainty. At the bootstrap threshold of 0.80 in maximum likelihood (ML) analysis, 58.9% of near full-length HIV-1C sequences but only 15.5% of partial pol sequences (ViroSeq) were found in clusters. Among HIV-1 structural genes, pol showed the highest extent of clustering (38.9% at a bootstrap threshold of 0.80), although it was significantly lower than in the near full-length genome sequences. The extent of HIV clustering was significantly higher for sliding windows of 2,000 bp than 1,000 bp. We found a strong association between the sequence length and proportion of HIV sequences in clusters, and a moderate association between the number of variable and informative sites and the proportion of HIV sequences in clusters. In HIV cluster analysis, the extent of detectable HIV clustering is directly associated with the length of viral sequences used, as well as the number of variable and informative sites. Near full-length genome sequences could provide the most informative HIV cluster analysis. Selected subgenomic regions with a high extent of HIV clustering and high tree certainty could also be considered as a second choice. PMID:25560745
Applying phylogenetic analysis to viral livestock diseases: moving beyond molecular typing.
Olvera, Alex; Busquets, Núria; Cortey, Marti; de Deus, Nilsa; Ganges, Llilianne; Núñez, José Ignacio; Peralta, Bibiana; Toskano, Jennifer; Dolz, Roser
2010-05-01
Changes in livestock production systems in recent years have altered the presentation of many diseases resulting in the need for more sophisticated control measures. At the same time, new molecular assays have been developed to support the diagnosis of animal viral disease. Nucleotide sequences generated by these diagnostic techniques can be used in phylogenetic analysis to infer phenotypes by sequence homology and to perform molecular epidemiology studies. In this review, some key elements of phylogenetic analysis are highlighted, such as the selection of the appropriate neutral phylogenetic marker, the proper phylogenetic method and different techniques to test the reliability of the resulting tree. Examples are given of current and future applications of phylogenetic reconstructions in viral livestock diseases. Copyright 2009 Elsevier Ltd. All rights reserved.
Non-coding RNAs in virology: an RNA genomics approach.
Isaac, Christopher; Patel, Trushar R; Zovoilis, Athanasios
2018-04-01
Advances in sequencing technologies and bioinformatic analysis techniques have greatly improved our understanding of various classes of RNAs and their functions. Despite not coding for proteins, non-coding RNAs (ncRNAs) are emerging as essential biomolecules fundamental for cellular functions and cell survival. Interestingly, ncRNAs produced by viruses not only control the expression of viral genes, but also influence host cell regulation and circumvent host innate immune response. Correspondingly, ncRNAs produced by the host genome can play a key role in host-virus interactions. In this article, we will first discuss a number of types of viral and mammalian ncRNAs associated with viral infections. Subsequently, we also describe the new possibilities and opportunities that RNA genomics and next-generation sequencing technologies provide for studying ncRNAs in virology.
Cui, Hongguang; Wang, Aiming
2016-05-15
The potyviral RNA genome encodes two polyproteins that are proteolytically processed by three viral protease domains into 11 mature proteins. Extensive molecular studies have identified functions for the majority of the viral proteins. For example, 6K2, one of the two smallest potyviral proteins, is an integral membrane protein and induces the endoplasmic reticulum (ER)-originated replication vesicles that target the chloroplast for robust viral replication. However, the functional role of 6K1, the other smallest protein, remains uncharacterized. In this study, we developed a series of recombinant full-length viral cDNA clones derived from a Canadian Plum pox virus (PPV) isolate. We found that deletion of any of the short motifs of 6K1 (each of which ranged from 5 to 13 amino acids), most of the 6K1 sequence (but with the conserved sequence of the cleavage sites being retained), or all of the 6K1 sequence in the PPV infectious clone abolished viral replication. The trans expression of 6K1 or the cis expression of a dislocated 6K1 failed to rescue the loss-of-replication phenotype, suggesting the temporal and spatial requirement of 6K1 for viral replication. Disruption of the N- or C-terminal cleavage site of 6K1, which prevented the release of 6K1 from the polyprotein, either partially or completely inhibited viral replication, suggesting the functional importance of the mature 6K1. We further found that green fluorescent protein-tagged 6K1 formed punctate inclusions at the viral early infection stage and colocalized with chloroplast-bound viral replicase elements 6K2 and NIb. Taken together, our results suggest that 6K1 is required for viral replication and is an important viral element of the viral replication complex at the early infection stage. Potyviruses account for more than 30% of known plant viruses and consist of many agriculturally important viruses. The genomes of potyviruses encode two polyproteins that are proteolytically processed into 11 mature proteins, with the majority of them having been at least partially functionally characterized. However, the functional role of a small protein named 6K1 remains obscure. In this study, we showed that deletion of 6K1 or a short motif/region of 6K1 in the full-length cDNA clones of plum pox virus abolishes viral replication and that mutation of the N- or C-terminal cleavage sites of 6K1 to prevent its release from the polyprotein greatly attenuates or completely inhibits viral replication, suggesting its important role in potyviral infection. We report that 6K1 forms punctate structures and targets the replication vesicles in PPV-infected plant leaf cells at the early infection stage. Our data reveal that 6K1 is an important viral protein of the potyviral replication complex. Copyright © 2016, American Society for Microbiology. All Rights Reserved.
Simonen, Marja-Leena; Roivainen, Merja; Iber, Jane; Burns, Cara; Hovi, Tapani
2010-01-01
In 1984, a wild type 3 poliovirus (PV3/FIN84) spread all over Finland causing nine cases of paralytic poliomyelitis and one case of aseptic meningitis. The outbreak was ended in 1985 with an intensive vaccination campaign. By limited sequence comparison with previously isolated PV3 strains, closest relatives of PV3/FIN84 were found among strains circulating in the Mediterranean region. Now we wanted to reanalyse the relationships using approaches currently exploited in poliovirus surveillance. Cell lysates of 22 strains isolated during the outbreak and stored frozen were subjected to RT-PCR amplification in three genomic regions without prior subculture. Sequences of the entire VP1 coding region, 150 nucleotides in the VP1-2A junction, most of the 5' non-coding region, partial sequences of the 3D RNA polymerase coding region and partial 3' non-coding region were compared within the outbreak and with sequences available in data banks. In addition, complete nucleotide sequences were obtained for 2 strains isolated from two different cases of disease during the outbreak. The results confirmed the previously described wide intraepidemic variation of the strains, including amino acid substitutions in antigenic sites, as well as the likely Mediterranean region origin of the strains. Simplot and bootscanning analyses of the complete genomes indicated complicated evolutionary history of the non-capsid coding regions of the genome suggesting several recombinations with different HEV-C viruses in the past.
Sequence of retrovirus provirus resembles that of bacterial transposable elements
NASA Astrophysics Data System (ADS)
Shimotohno, Kunitada; Mizutani, Satoshi; Temin, Howard M.
1980-06-01
The nucleotide sequences of the terminal regions of an infectious integrated retrovirus cloned in the modified λ phage cloning vector Charon 4A have been elucidated. There is a 569-base pair direct repeat at both ends of the viral DNA. The cell-virus junctions at each end consist of a 5-base pair direct repeat of cell DNA next to a 3-base pair inverted repeat of viral DNA. This structure resembles that of a transposable element and is consistent with the protovirus hypothesis that retroviruses evolved from the cell genome.
Yuan, Ji; Cheung, Paul K M; Zhang, Huifang M; Chau, David; Yang, Decheng
2005-02-01
Coxsackievirus B3 (CVB3) is the most common causal agent of viral myocarditis, but existing drug therapies are of limited value. Application of small interfering RNA (siRNA) in knockdown of gene expression is an emerging technology in antiviral gene therapy. To investigate whether RNA interference (RNAi) can protect against CVB3 infection, we evaluated the effects of RNAi on viral replication in HeLa cells and murine cardiomyocytes by using five CVB3-specific siRNAs targeting distinct regions of the viral genome. The most effective one is siRNA-4, targeting the viral protease 2A, achieving a 92% inhibition of CVB3 replication. The specific RNAi effects could last at least 48 h, and cell viability assay revealed that 90% of siRNA-4-pretreated cells were still alive and lacked detectable viral protein expression 48 h postinfection. Moreover, administration of siRNAs after viral infection could also effectively inhibit viral replication, indicating its therapeutic potential. Further evaluation by combination found that no enhanced inhibitory effects were observed when siRNA-4 was cotransfected with each of the other four candidates. In mutational analysis of the mechanisms of siRNA action, we found that siRNA functions by targeting the positive strand of virus and requires a perfect sequence match in the central region of the target, but mismatches were more tolerated near the 3' end than the 5' end of the antisense strand. These findings reveal an effective target for CVB3 silencing and provide a new possibility for antiviral intervention.
Epstein-Barr virus strains and variations: Geographic or disease-specific variants?
Neves, Marco; Marinho-Dias, Joana; Ribeiro, Joana; Sousa, Hugo
2017-03-01
The Epstein-Barr Virus (EBV) is associated with the development of several diseases, including infectious mononucleosis (IM), Burkitt's Lymphoma (BL), Nasopharyngeal Carcinoma, and other neoplasias. The publication of EBV genome 1984 led to several studies regarding the identification of different viral strains. Currently, EBV is divided into EBV type 1 (B95-8 strain) and EBV type 2 (AG876 strain), also known as type A and type B, which have been distinguished based upon genetic differences in the Epstein-Barr nuclear antigens (EBNAs) sequence. Several other EBV strains have been described in the past 10 years considering variations on EBV genome, and many have attempted to clarify if these variations are ethnic or geographically correlated, or if they are disease related. Indeed, there is an increasing interest to describe possible specific disease associations, with emphasis on different malignancies. These studies aim to clarify if these variations are ethnic or geographically correlated, or if they are disease related, thus being important to characterize the epidemiologic genetic distribution of EBV strains on our population. Here, we review the current knowledge on the different EBV strains and variants and its association with different diseases. J. Med. Virol. 89:373-387, 2017. © 2016 Wiley Periodicals, Inc. © 2016 Wiley Periodicals, Inc.
Genetic diversity and epidemiology of infectious hematopoietic necrosis virus in Alaska
Emmenegger, E.G; Meyers, T.R.; Burton, T.O.; Kurath, G.
2000-01-01
Forty-two infectious hematopoietic necrosis virus (IHNV) isolates from Alaska were analyzed using the ribonuclease protection assay (RPA) and nucleotide sequencing. RPA analyses, utilizing 4 probes, N5, N3 (N gene), GF (G gene), and NV (NV gene), determined that the haplotypes of all 3 genes demonstrated a consistent spatial pattern. Virus isolates belonging to the most common haplotype groups were distributed throughout Alaska, whereas isolates in small haplotype groups were obtained from only 1 site (hatchery, lake, etc.). The temporal pattern of the GF haplotypes suggested a 'genetic acclimation' of the G gene, possibly due to positive selection on the glycoprotein. A pairwise comparison of the sequence data determined that the maximum nucleotide diversity of the isolates was 2.75% (10 mismatches) for the NV gene, and 1.99% (6 mismatches) for a 301 base pair region of the G gene, indicating that the genetic diversity of IHNV within Alaska is notably lower than in the more southern portions of the IHNV North American range. Phylogenetic analysis of representative Alaskan sequences and sequences of 12 previously characterized IHNV strains from Washington, Oregon, Idaho, California (USA) and British Columbia (Canada) distinguished the isolates into clusters that correlated with geographic origin and indicated that the Alaskan and British Columbia isolates may have a common viral ancestral lineage. Comparisons of multiple isolates from the same site provided epidemiological insights into viral transmission patterns and indicated that viral evolution, viral introduction, and genetic stasis were the mechanisms involved with IHN virus population dynamics in Alaska. The examples of genetic stasis and the overall low sequence heterogeneity of the Alaskan isolates suggested that they are evolutionarily constrained. This study establishes a baseline of genetic fingerprint patterns and sequence groups representing the genetic diversity of Alaskan IHNV isolates. This information could be used to determine the source of an IHN outbreak and to facilitate decisions in fisheries management of Alaskan salmonid stocks.
Stalder, Hanspeter; Hug, Corinne; Zanoni, Reto; Vogt, Hans-Rudolf; Peterhans, Ernst; Schweizer, Matthias; Bachofen, Claudia
2016-06-15
Pestiviruses infect a wide variety of animals of the order Artiodactyla, with bovine viral diarrhea virus (BVDV) being an economically important pathogen of livestock globally. BVDV is maintained in the cattle population by infecting fetuses early in gestation and, thus, by generating persistently infected (PI) animals that efficiently transmit the virus throughout their lifetime. In 2008, Switzerland started a national control campaign with the aim to eradicate BVDV from all bovines in the country by searching for and eliminating every PI cattle. Different from previous eradication programs, all animals of the entire population were tested for virus within one year, followed by testing each newborn calf in the subsequent four years. Overall, 3,855,814 animals were tested from 2008 through 2011, 20,553 of which returned an initial BVDV-positive result. We were able to obtain samples from at least 36% of all initially positive tested animals. We sequenced the 5' untranslated region (UTR) of more than 7400 pestiviral strains and compiled the sequence data in a database together with an array of information on the PI animals, among others, the location of the farm in which they were born, their dams, and the locations where the animals had lived. To our knowledge, this is the largest database combining viral sequences with animal data of an endemic viral disease. Using unique identification tags, the different datasets within the database were connected to run diverse molecular epidemiological analyses. The large sets of animal and sequence data made it possible to run analyses in both directions, i.e., starting from a likely epidemiological link, or starting from related sequences. We present the results of three epidemiological investigations in detail and a compilation of 122 individual investigations that show the usefulness of such a database in a country-wide BVD eradication program. Copyright © 2015 Elsevier B.V. All rights reserved.
Di Giallonardo, Francesca; Geoghegan, Jemma L; Docherty, Douglas E; McLean, Robert G; Zody, Michael C; Qu, James; Yang, Xiao; Birren, Bruce W; Malboeuf, Christine M; Newman, Ruchi M; Ip, Hon S; Holmes, Edward C
2016-01-15
The introduction of West Nile virus (WNV) into North America in 1999 is a classic example of viral emergence in a new environment, with its subsequent dispersion across the continent having a major impact on local bird populations. Despite the importance of this epizootic, the pattern, dynamics, and determinants of WNV spread in its natural hosts remain uncertain. In particular, it is unclear whether the virus encountered major barriers to transmission, or spread in an unconstrained manner, and if specific viral lineages were favored over others indicative of intrinsic differences in fitness. To address these key questions in WNV evolution and ecology, we sequenced the complete genomes of approximately 300 avian isolates sampled across the United States between 2001 and 2012. Phylogenetic analysis revealed a relatively star-like tree structure, indicative of explosive viral spread in the United States, although with some replacement of viral genotypes through time. These data are striking in that viral sequences exhibit relatively limited clustering according to geographic region, particularly for those viruses sampled from birds, and no strong phylogenetic association with well-sampled avian species. The genome sequence data analyzed here also contain relatively little evidence for adaptive evolution, particularly of structural proteins, suggesting that most viral lineages are of similar fitness and that WNV is well adapted to the ecology of mosquito vectors and diverse avian hosts in the United States. In sum, the molecular evolution of WNV in North America depicts a largely unfettered expansion within a permissive host and geographic population with little evidence of major adaptive barriers. How viruses spread in new host and geographic environments is central to understanding the emergence and evolution of novel infectious diseases and for predicting their likely impact. The emergence of the vector-borne West Nile virus (WNV) in North America in 1999 represents a classic example of this process. Using approximately 300 new viral genomes sampled from wild birds, we show that WNV experienced an explosive spread with little geographical or host constraints within birds and relatively low levels of adaptive evolution. From its introduction into the state of New York, WNV spread across the United States, reaching California and Florida within 4 years, a migration that is clearly reflected in our genomic sequence data, and with a general absence of distinct geographical clusters of bird viruses. However, some geographically distinct viral lineages were found to circulate in mosquitoes, likely reflecting their limited long-distance movement compared to avian species. Copyright © 2015, American Society for Microbiology. All Rights Reserved.
Mlotshwa, Mandla; Riou, Catherine; Chopera, Denis; de Assis Rosa, Debra; Ntale, Roman; Treunicht, Florette; Woodman, Zenda; Werner, Lise; van Loggerenberg, Francois; Mlisana, Koleka; Abdool Karim, Salim; Williamson, Carolyn; Gray, Clive M.
2010-01-01
Deciphering immune events during early stages of human immunodeficiency virus type 1 (HIV-1) infection is critical for understanding the course of disease. We characterized the hierarchy of HIV-1-specific T-cell gamma interferon (IFN-γ) enzyme-linked immunospot (ELISPOT) assay responses during acute subtype C infection in 53 individuals and associated temporal patterns of responses with disease progression in the first 12 months. There was a diverse pattern of T-cell recognition across the proteome, with the recognition of Nef being immunodominant as early as 3 weeks postinfection. Over the first 6 months, we found that there was a 23% chance of an increased response to Nef for every week postinfection (P = 0.0024), followed by a nonsignificant increase to Pol (4.6%) and Gag (3.2%). Responses to Env and regulatory proteins appeared to remain stable. Three temporal patterns of HIV-specific T-cell responses could be distinguished: persistent, lost, or new. The proportion of persistent T-cell responses was significantly lower (P = 0.0037) in individuals defined as rapid progressors than in those progressing slowly and who controlled viremia. Almost 90% of lost T-cell responses were coincidental with autologous viral epitope escape. Regression analysis between the time to fixed viral escape and lost T-cell responses (r = 0.61; P = 0.019) showed a mean delay of 14 weeks after viral escape. Collectively, T-cell epitope recognition is not a static event, and temporal patterns of IFN-γ-based responses exist. This is due partly to viral sequence variation but also to the recognition of invariant viral epitopes that leads to waves of persistent T-cell immunity, which appears to associate with slower disease progression in the first year of infection. PMID:20826686
BS-virus-finder: virus integration calling using bisulfite sequencing data.
Gao, Shengjie; Hu, Xuesong; Xu, Fengping; Gao, Changduo; Xiong, Kai; Zhao, Xiao; Chen, Haixiao; Zhao, Shancen; Wang, Mengyao; Fu, Dongke; Zhao, Xiaohui; Bai, Jie; Mao, Likai; Li, Bo; Wu, Song; Wang, Jian; Li, Shengbin; Yang, Huangming; Bolund, Lars; Pedersen, Christian N S
2018-01-01
DNA methylation plays a key role in the regulation of gene expression and carcinogenesis. Bisulfite sequencing studies mainly focus on calling single nucleotide polymorphism, different methylation region, and find allele-specific DNA methylation. Until now, only a few software tools have focused on virus integration using bisulfite sequencing data. We have developed a new and easy-to-use software tool, named BS-virus-finder (BSVF, RRID:SCR_015727), to detect viral integration breakpoints in whole human genomes. The tool is hosted at https://github.com/BGI-SZ/BSVF. BS-virus-finder demonstrates high sensitivity and specificity. It is useful in epigenetic studies and to reveal the relationship between viral integration and DNA methylation. BS-virus-finder is the first software tool to detect virus integration loci by using bisulfite sequencing data. © The Authors 2017. Published by Oxford University Press.
Sequencing Needs for Viral Diagnostics
DOE Office of Scientific and Technical Information (OSTI.GOV)
Gardner, S N; Lam, M; Mulakken, N J
2004-01-26
We built a system to guide decisions regarding the amount of genomic sequencing required to develop diagnostic DNA signatures, which are short sequences that are sufficient to uniquely identify a viral species. We used our existing DNA diagnostic signature prediction pipeline, which selects regions of a target species genome that are conserved among strains of the target (for reliability, to prevent false negatives) and unique relative to other species (for specificity, to avoid false positives). We performed simulations, based on existing sequence data, to assess the number of genome sequences of a target species and of close phylogenetic relatives (''nearmore » neighbors'') that are required to predict diagnostic signature regions that are conserved among strains of the target species and unique relative to other bacterial and viral species. For DNA viruses such as variola (smallpox), three target genomes provide sufficient guidance for selecting species-wide signatures. Three near neighbor genomes are critical for species specificity. In contrast, most RNA viruses require four target genomes and no near neighbor genomes, since lack of conservation among strains is more limiting than uniqueness. SARS and Ebola Zaire are exceptional, as additional target genomes currently do not improve predictions, but near neighbor sequences are urgently needed. Our results also indicate that double stranded DNA viruses are more conserved among strains than are RNA viruses, since in most cases there was at least one conserved signature candidate for the DNA viruses and zero conserved signature candidates for the RNA viruses.« less
Günthard, H F; Wong, J K; Ignacio, C C; Havlir, D V; Richman, D D
1998-07-01
The performance of the high-density oligonucleotide array methodology (GeneChip) in detecting drug resistance mutations in HIV-1 pol was compared with that of automated dideoxynucleotide sequencing (ABI) of clinical samples, viral stocks, and plasmid-derived NL4-3 clones. Sequences from 29 clinical samples (plasma RNA, n = 17; lymph node RNA, n = 5; lymph node DNA, n = 7) from 12 patients, from 6 viral stock RNA samples, and from 13 NL4-3 clones were generated by both methods. Editing was done independently by a different investigator for each method before comparing the sequences. In addition, NL4-3 wild type (WT) and mutants were mixed in varying concentrations and sequenced by both methods. Overall, a concordance of 99.1% was found for a total of 30,865 bases compared. The comparison of clinical samples (plasma RNA and lymph node RNA and DNA) showed a slightly lower match of base calls, 98.8% for 19,831 nucleotides compared (protease region, 99.5%, n = 8272; RT region, 98.3%, n = 11,316), than for viral stocks and NL4-3 clones (protease region, 99.8%; RT region, 99.5%). Artificial mixing experiments showed a bias toward calling wild-type bases by GeneChip. Discordant base calls are most likely due to differential detection of mixtures. The concordance between GeneChip and ABI was high and appeared dependent on the nature of the templates (directly amplified versus cloned) and the complexity of mixes.
Genome sequences of nine vesicular stomatitis virus isolates from South America
USDA-ARS?s Scientific Manuscript database
We report nine full-genome sequences of vesicular stomatitis virus obtrained by Illumina next-generation sequencing of RNA, isolated from either cattle epithelial suspensions or cell culture supernatants. Seven of these viral genomes belonged to the New Jersey serotype/species, clade III, while two...
Nelson, Patrick W; Gilchrist, Michael A; Coombs, Daniel; Hyman, James M; Perelson, Alan S
2004-09-01
Mathematical models of HIV-1 infection can help interpret drug treatment experiments and improve our understanding of the interplay between HIV-1 and the immune system. We develop and analyze an age- structured model of HIV-1 infection that allows for variations in the death rate of productively infected T cells and the production rate of viral particles as a function of the length of time a T cell has been infected. We show that this model is a generalization of the standard differential equation and of delay models previously used to describe HIV-1 infection, and provides a means for exploring fundamental issues of viral production and death. We show that the model has uninfected and infected steady states, linked by a transcritical bifurcation. We perform a local stability analysis of the nontrivial equilibrium solution and provide a general stability condition for models with age structure. We then use numerical methods to study solutions of our model focusing on the analysis of primary HIV infection. We show that the time to reach peak viral levels in the blood depends not only on initial conditions but also on the way in which viral production ramps up. If viral production ramps up slowly, we find that the time to peak viral load is delayed compared to results obtained using the standard (constant viral production) model of HIV infection. We find that data on viral load changing over time is insufficient to identify the functions specifying the dependence of the viral production rate or infected cell death rate on infected cell age. These functions must be determined through new quantitative experiments.
RNA Modulates the Interaction between Influenza A Virus NS1 and Human PABP1.
Arias-Mireles, Bryan H; de Rozieres, Cyrus M; Ly, Kevin; Joseph, Simpson
2018-05-25
Nonstructural protein 1 (NS1) is a multifunctional protein involved in preventing host-interferon response in influenza A virus (IAV). Previous studies have indicated that NS1 also stimulates the translation of viral mRNA by binding to conserved sequences in the viral 5'-UTR. Additionally, NS1 binds to poly(A) binding protein 1 (PABP1) and eukaryotic initiation factor 4G (eIF4G). The interaction of NS1 with the viral 5'-UTR, PABP1, and eIF4G has been suggested to specifically enhance the translation of viral mRNAs. In contrast, we report that NS1 does not directly bind to sequences in the viral 5'-UTR, indicating that NS1 is not responsible for providing the specificity to stimulate viral mRNA translation. We also monitored the interaction of NS1 with PABP1 using a new, quantitative FRET assay. Our data show that NS1 binds to PABP1 with high affinity; however, the binding of double-stranded RNA (dsRNA) to NS1 weakens the binding of NS1 to PABP1. Correspondingly, the binding of PABP1 to NS1 weakens the binding of NS1 to double-stranded RNA (dsRNA). In contrast, the affinity of PABP1 for binding to poly(A) RNA is not significantly changed by NS1. We propose that the modulation of NS1·PABP1 interaction by dsRNA may be important for the viral cycle.
Uncovering the Repertoire of Endogenous Flaviviral Elements in Aedes Mosquito Genomes
Suzuki, Yasutsugu; Frangeul, Lionel; Dickson, Laura B.; Blanc, Hervé; Verdier, Yann; Vinh, Joelle
2017-01-01
ABSTRACT Endogenous viral elements derived from nonretroviral RNA viruses have been described in various animal genomes. Whether they have a biological function, such as host immune protection against related viruses, is a field of intense study. Here, we investigated the repertoire of endogenous flaviviral elements (EFVEs) in Aedes mosquitoes, the vectors of arboviruses such as dengue and chikungunya viruses. Previous studies identified three EFVEs from Aedes albopictus cell lines and one from Aedes aegypti cell lines. However, an in-depth characterization of EFVEs in wild-type mosquito populations and individual mosquitoes in vivo has not been performed. We detected the full-length DNA sequence of the previously described EFVEs and their respective transcripts in several A. albopictus and A. aegypti populations from geographically distinct areas. However, EFVE-derived proteins were not detected by mass spectrometry. Using deep sequencing, we detected the production of PIWI-interacting RNA-like small RNAs, in an antisense orientation, targeting the EFVEs and their flanking regions in vivo. The EFVEs were integrated in repetitive regions of the mosquito genomes, and their flanking sequences varied among mosquito populations. We bioinformatically predicted several new EFVEs from a Vietnamese A. albopictus population and observed variation in the occurrence of those elements among mosquitoes. Phylogenetic analysis of an A. aegypti EFVE suggested that it integrated prior to the global expansion of the species and subsequently diverged among and within populations. The findings of this study together reveal the substantial structural and nucleotide diversity of flaviviral integrations in Aedes genomes. Unraveling this diversity will help to elucidate the potential biological function of these EFVEs. IMPORTANCE Endogenous viral elements (EVEs) are whole or partial viral sequences integrated in host genomes. Interestingly, some EVEs have important functions for host fitness and antiviral defense. Because mosquitoes also have EVEs in their genomes, characterizing these EVEs is a prerequisite for their potential use to manipulate the mosquito antiviral response. In the study described here, we focused on EVEs related to the Flavivirus genus, to which dengue and Zika viruses belong, in individual Aedes mosquitoes from geographically distinct areas. We show the existence in vivo of flaviviral EVEs previously identified in mosquito cell lines, and we detected new ones. We show that EVEs have evolved differently in each mosquito population. They produce transcripts and small RNAs but not proteins, suggesting a function at the RNA level. Our study uncovers the diverse repertoire of flaviviral EVEs in Aedes mosquito populations and contributes to an understanding of their role in the host antiviral system. PMID:28539440
Uncovering the Repertoire of Endogenous Flaviviral Elements in Aedes Mosquito Genomes.
Suzuki, Yasutsugu; Frangeul, Lionel; Dickson, Laura B; Blanc, Hervé; Verdier, Yann; Vinh, Joelle; Lambrechts, Louis; Saleh, Maria-Carla
2017-08-01
Endogenous viral elements derived from nonretroviral RNA viruses have been described in various animal genomes. Whether they have a biological function, such as host immune protection against related viruses, is a field of intense study. Here, we investigated the repertoire of endogenous flaviviral elements (EFVEs) in Aedes mosquitoes, the vectors of arboviruses such as dengue and chikungunya viruses. Previous studies identified three EFVEs from Aedes albopictus cell lines and one from Aedes aegypti cell lines. However, an in-depth characterization of EFVEs in wild-type mosquito populations and individual mosquitoes in vivo has not been performed. We detected the full-length DNA sequence of the previously described EFVEs and their respective transcripts in several A. albopictus and A. aegypti populations from geographically distinct areas. However, EFVE-derived proteins were not detected by mass spectrometry. Using deep sequencing, we detected the production of PIWI-interacting RNA-like small RNAs, in an antisense orientation, targeting the EFVEs and their flanking regions in vivo The EFVEs were integrated in repetitive regions of the mosquito genomes, and their flanking sequences varied among mosquito populations. We bioinformatically predicted several new EFVEs from a Vietnamese A. albopictus population and observed variation in the occurrence of those elements among mosquitoes. Phylogenetic analysis of an A. aegypti EFVE suggested that it integrated prior to the global expansion of the species and subsequently diverged among and within populations. The findings of this study together reveal the substantial structural and nucleotide diversity of flaviviral integrations in Aedes genomes. Unraveling this diversity will help to elucidate the potential biological function of these EFVEs. IMPORTANCE Endogenous viral elements (EVEs) are whole or partial viral sequences integrated in host genomes. Interestingly, some EVEs have important functions for host fitness and antiviral defense. Because mosquitoes also have EVEs in their genomes, characterizing these EVEs is a prerequisite for their potential use to manipulate the mosquito antiviral response. In the study described here, we focused on EVEs related to the Flavivirus genus, to which dengue and Zika viruses belong, in individual Aedes mosquitoes from geographically distinct areas. We show the existence in vivo of flaviviral EVEs previously identified in mosquito cell lines, and we detected new ones. We show that EVEs have evolved differently in each mosquito population. They produce transcripts and small RNAs but not proteins, suggesting a function at the RNA level. Our study uncovers the diverse repertoire of flaviviral EVEs in Aedes mosquito populations and contributes to an understanding of their role in the host antiviral system. Copyright © 2017 Suzuki et al.
Prospecting for viral natural enemies of the fire ant Solenopsis invicta in Argentina.
Valles, Steven M; Porter, Sanford D; Calcaterra, Luis A
2018-01-01
Metagenomics and next generation sequencing were employed to discover new virus natural enemies of the fire ant, Solenopsis invicta Buren in its native range (i.e., Formosa, Argentina) with the ultimate goal of testing and releasing new viral pathogens into U.S. S. invicta populations to provide natural, sustainable control of this ant. RNA was purified from worker ants from 182 S. invicta colonies, which was pooled into 4 groups according to location. A library was created from each group and sequenced using Illumina Miseq technology. After a series of winnowing methods to remove S. invicta genes, known S. invicta virus genes, and all other non-virus gene sequences, 61,944 unique singletons were identified with virus identity. These were assembled de novo yielding 171 contiguous sequences with significant identity to non-plant virus genes. Fifteen contiguous sequences exhibited very high expression rates and were detected in all four gene libraries. One contig (Contig_29) exhibited the highest expression level overall and across all four gene libraries. Random amplification of cDNA ends analyses expanded this contiguous sequence yielding a complete virus genome, which we have provisionally named Solenopsis invicta virus 5 (SINV-5). SINV-5 is a positive-sense, single-stranded RNA virus with genome characteristics consistent with insect-infecting viruses from the family Dicistroviridae. Moreover, the replicative genome strand of SINV-5 was detected in worker ants indicating that S. invicta serves as host for the virus. Many additional sequences were identified that are likely of viral origin. These sequences await further investigation to determine their origins and relationship with S. invicta. This study expands knowledge of the RNA virome diversity found within S. invicta populations.
Prospecting for viral natural enemies of the fire ant Solenopsis invicta in Argentina
Porter, Sanford D.; Calcaterra, Luis A.
2018-01-01
Metagenomics and next generation sequencing were employed to discover new virus natural enemies of the fire ant, Solenopsis invicta Buren in its native range (i.e., Formosa, Argentina) with the ultimate goal of testing and releasing new viral pathogens into U.S. S. invicta populations to provide natural, sustainable control of this ant. RNA was purified from worker ants from 182 S. invicta colonies, which was pooled into 4 groups according to location. A library was created from each group and sequenced using Illumina Miseq technology. After a series of winnowing methods to remove S. invicta genes, known S. invicta virus genes, and all other non-virus gene sequences, 61,944 unique singletons were identified with virus identity. These were assembled de novo yielding 171 contiguous sequences with significant identity to non-plant virus genes. Fifteen contiguous sequences exhibited very high expression rates and were detected in all four gene libraries. One contig (Contig_29) exhibited the highest expression level overall and across all four gene libraries. Random amplification of cDNA ends analyses expanded this contiguous sequence yielding a complete virus genome, which we have provisionally named Solenopsis invicta virus 5 (SINV-5). SINV-5 is a positive-sense, single-stranded RNA virus with genome characteristics consistent with insect-infecting viruses from the family Dicistroviridae. Moreover, the replicative genome strand of SINV-5 was detected in worker ants indicating that S. invicta serves as host for the virus. Many additional sequences were identified that are likely of viral origin. These sequences await further investigation to determine their origins and relationship with S. invicta. This study expands knowledge of the RNA virome diversity found within S. invicta populations. PMID:29466388
Circularization of the HIV-1 genome facilitates strand transfer during reverse transcription
Beerens, Nancy; Kjems, Jørgen
2010-01-01
Two obligatory DNA strand transfers take place during reverse transcription of a retroviral RNA genome. The first strand transfer involves a jump from the 5′ to the 3′ terminal repeat (R) region positioned at each end of the viral genome. The process depends on base pairing between the cDNA synthesized from the 5′ R region and the 3′ R RNA. The tertiary conformation of the viral RNA genome may facilitate strand transfer by juxtaposing the 5′ R and 3′ R sequences that are 9 kb apart in the linear sequence. In this study, RNA sequences involved in an interaction between the 5′ and 3′ ends of the HIV-1 genome were mapped by mutational analysis. This interaction appears to be mediated mainly by a sequence in the extreme 3′ end of the viral genome and in the gag open reading frame. Mutation of 3′ R sequences was found to inhibit the 5′–3′ interaction, which could be restored by a complementary mutation in the 5′ gag region. Furthermore, we find that circularization of the HIV-1 genome does not affect the initiation of reverse transcription, but stimulates the first strand transfer during reverse transcription in vitro, underscoring the functional importance of the interaction. PMID:20430859
Geisler, Christoph; Jarvis, Donald L
2016-07-01
Spodoptera frugiperda (Sf) cell lines are used to produce several biologicals for human and veterinary use. Recently, it was discovered that all tested Sf cell lines are persistently infected with Sf-rhabdovirus, a novel rhabdovirus. As part of an effort to search for other adventitious viruses, we searched the Sf cell genome and transcriptome for sequences related to Sf-rhabdovirus. To our surprise, we found intact Sf-rhabdovirus N- and P-like ORFs, and partial Sf-rhabdovirus G- and L-like ORFs. The transcribed and genomic sequences matched, indicating the transcripts were derived from the genomic sequences. These appear to be endogenous viral elements (EVEs), which result from the integration of partial viral genetic material into the host cell genome. It is theoretically impossible for the Sf-rhabdovirus-like EVEs to produce infectious virus particles as 1) they are disseminated across 4 genomic loci, 2) the G and L ORFs are incomplete, and 3) the M ORF is missing. Our finding of transcribed virus-like sequences in Sf cells underscores that MPS-based searches for adventitious viruses in cell substrates used to manufacture biologics should take into account both genomic and transcribed sequences to facilitate the identification of transcribed EVE's, and to avoid false positive detection of replication-competent adventitious viruses. Copyright © 2016 International Alliance for Biological Standardization. Published by Elsevier Ltd. All rights reserved.
Geisler, Christoph; Jarvis, Donald L.
2016-01-01
Spodoptera frugiperda (Sf) cell lines are used to produce several biologicals for human and veterinary use. Recently, it was discovered that all tested Sf cell lines are persistently infected with Sf-rhabdovirus, a novel rhabdovirus. As part of an effort to search for other adventitious viruses, we searched the Sf cell genome and transcriptome for sequences related to Sf-rhabdovirus. To our surprise, we found intact Sf-rhabdovirus N- and P-like ORFs, and partial Sf-rhabdovirus G- and L-like ORFs. The transcribed and genomic sequences matched, indicating the transcripts were derived from the genomic sequences. These appear to be endogenous viral elements (EVEs), which result from the integration of partial viral genetic material into the host cell genome. It is theoretically impossible for the Sf-rhabdovirus-like EVEs to produce infectious virus particles as 1) they are disseminated across 4 genomic loci, 2) the G and L ORFs are incomplete, and 3) the M ORF is missing. Our finding of transcribed virus-like sequences in Sf cells underscores that MPS-based searches for adventitious viruses in cell substrates used to manufacture biologics should take into account both genomic and transcribed sequences to facilitate the identification of transcribed EVE's, and to avoid false positive detection of replication-competent adventitious viruses. PMID:27236849
Marine, Rachel; McCarren, Coleen; Vorrasane, Vansay; Nasko, Dan; Crowgey, Erin; Polson, Shawn W; Wommack, K Eric
2014-01-30
Shotgun metagenomics has become an important tool for investigating the ecology of microorganisms. Underlying these investigations is the assumption that metagenome sequence data accurately estimates the census of microbial populations. Multiple displacement amplification (MDA) of microbial community DNA is often used in cases where it is difficult to obtain enough DNA for sequencing; however, MDA can result in amplification biases that may impact subsequent estimates of population census from metagenome data. Some have posited that pooling replicate MDA reactions negates these biases and restores the accuracy of population analyses. This assumption has not been empirically tested. Using mock viral communities, we examined the influence of pooling on population-scale analyses. In pooled and single reaction MDA treatments, sequence coverage of viral populations was highly variable and coverage patterns across viral genomes were nearly identical, indicating that initial priming biases were reproducible and that pooling did not alleviate biases. In contrast, control unamplified sequence libraries showed relatively even coverage across phage genomes. MDA should be avoided for metagenomic investigations that require quantitative estimates of microbial taxa and gene functional groups. While MDA is an indispensable technique in applications such as single-cell genomics, amplification biases cannot be overcome by combining replicate MDA reactions. Alternative library preparation techniques should be utilized for quantitative microbial ecology studies utilizing metagenomic sequencing approaches.
Rennick, Linda J; Duprex, W Paul; Rima, Bert K
2007-10-01
Transcription from morbillivirus genomes commences at a single promoter in the 3' non-coding terminus, with the six genes being transcribed sequentially. The 3' and 5' untranslated regions (UTRs) of the genes (mRNA sense), together with the intergenic trinucleotide spacer, comprise the non-coding sequences (NCS) of the virus and contain the conserved gene end and gene start signals, respectively. Bicistronic minigenomes containing transcription units (TUs) encoding autofluorescent reporter proteins separated by measles virus (MV) NCS were used to give a direct estimation of gene expression in single, living cells by assessing the relative amounts of each fluorescent protein in each cell. Initially, five minigenomes containing each of the MV NCS were generated. Assays were developed to determine the amount of each fluorescent protein in cells at both cell population and single-cell levels. This revealed significant variations in gene expression between cells expressing the same NCS-containing minigenome. The minigenome containing the M/F NCS produced significantly lower amounts of fluorescent protein from the second TU (TU2), compared with the other minigenomes. A minigenome with a truncated F 5' UTR had increased expression from TU2. This UTR is 524 nt longer than the other MV 5' UTRs. Insertions into the 5' UTR of the enhanced green fluorescent protein gene in the minigenome containing the N/P NCS showed that specific sequences, rather than just the additional length of F 5' UTR, govern this decreased expression from TU2.
Espy, Nicole; Pérez-Sautu, Unai; Ramírez de Arellano, Eva; Negredo, Anabel; Wiley, Michael R; Bavari, Sina; Díaz Menendez, Marta; Paz Sánchez-Seco, María; Palacios, Gustavo
2018-03-23
The use of ribavirin to treat infections of Crimean-Congo Hemorrhagic Fever virus (CCHFV) has been controversial based on uncertainties on its antiviral efficacy in clinical case studies. We studied the effect of ribavirin treatment on viral populations in a recent case by deep sequencing plasma samples taken from a CCHFV-infected patient before, during, and after a five-day regimen of ribavirin. CCHFV viral load dropped during ribavirin treatment and subclonal diversity (transitions) and indels increased in viral genomes during treatment. Although the results are based on a single case, these data demonstrate the mutagenic effect of ribavirin on CCHFV in vivo. (Word Count: 100).
Nucleic Acid-Based Approaches for Detection of Viral Hepatitis
Behzadi, Payam; Ranjbar, Reza; Alavian, Seyed Moayed
2014-01-01
Context: To determining suitable nucleic acid diagnostics for individual viral hepatitis agent, an extensive search using related keywords was done in major medical library and data were collected, categorized, and summarized in different sections. Results: Various types of molecular biology tools can be used to detect and quantify viral genomic elements and analyze the sequences. These molecular assays are proper technologies for rapidly detecting viral agents with high accuracy, high sensitivity, and high specificity. Nonetheless, the application of each diagnostic method is completely dependent on viral agent. Conclusions: Despite rapidity, automation, accuracy, cost-effectiveness, high sensitivity, and high specificity of molecular techniques, each type of molecular technology has its own advantages and disadvantages. PMID:25789132
Nacken, Wolfgang; Anhlan, Darisuren; Hrincius, Eike R; Mostafa, Ahmed; Wolff, Thorsten; Sadewasser, Anne; Pleschka, Stephan; Ehrhardt, Christina; Ludwig, Stephan
2014-08-01
A hallmark cell response to influenza A virus (IAV) infections is the phosphorylation and activation of c-jun N-terminal kinase (JNK). However, so far it is not fully clear which molecules are involved in the activation of JNK upon IAV infection. Here, we report that the transfection of influenza viral-RNA induces JNK in a retinoic acid-inducible gene I (RIG-I)-dependent manner. However, neither RIG-I-like receptors nor MyD88-dependent Toll-like receptors were found to be involved in the activation of JNK upon IAV infection. Viral JNK activation may be blocked by addition of cycloheximide and heat shock protein inhibitors during infection, suggesting that the expression of an IAV-encoded protein is responsible for JNK activation. Indeed, the overexpression of nonstructural protein 1 (NS1) of certain IAV subtypes activated JNK, whereas those of some other subtypes failed to activate JNK. Site-directed mutagenesis experiments using NS1 of the IAV H7N7, H5N1, and H3N2 subtypes identified the amino acid residue phenylalanine (F) at position 103 to be decisive for JNK activation. Cleavage- and polyadenylation-specific factor 30 (CPSF30), whose binding to NS1 is stabilized by the amino acids F103 and M106, is not involved in JNK activation. Conclusively, subtype-specific sequence variations in the IAV NS1 protein result in subtype-specific differences in JNK signaling upon IAV infection. Influenza A virus (IAV) infection leads to the activation or modulation of multiple signaling pathways. Here, we demonstrate for the first time that the c-jun N-terminal kinase (JNK), a long-known stress-activated mitogen-activated protein (MAP) kinase, is activated by RIG-I when cells are treated with IAV RNA. However, at the same time, nonstructural protein 1 (NS1) of IAV has an intrinsic JNK-activating property that is dependent on IAV subtype-specific amino acid variations around position 103. Our findings identify two different and independent pathways that result in the activation of JNK in the course of an IAV infection. Copyright © 2014, American Society for Microbiology. All Rights Reserved.
Nacken, Wolfgang; Anhlan, Darisuren; Hrincius, Eike R.; Mostafa, Ahmed; Wolff, Thorsten; Sadewasser, Anne; Pleschka, Stephan; Ehrhardt, Christina
2014-01-01
ABSTRACT A hallmark cell response to influenza A virus (IAV) infections is the phosphorylation and activation of c-jun N-terminal kinase (JNK). However, so far it is not fully clear which molecules are involved in the activation of JNK upon IAV infection. Here, we report that the transfection of influenza viral-RNA induces JNK in a retinoic acid-inducible gene I (RIG-I)-dependent manner. However, neither RIG-I-like receptors nor MyD88-dependent Toll-like receptors were found to be involved in the activation of JNK upon IAV infection. Viral JNK activation may be blocked by addition of cycloheximide and heat shock protein inhibitors during infection, suggesting that the expression of an IAV-encoded protein is responsible for JNK activation. Indeed, the overexpression of nonstructural protein 1 (NS1) of certain IAV subtypes activated JNK, whereas those of some other subtypes failed to activate JNK. Site-directed mutagenesis experiments using NS1 of the IAV H7N7, H5N1, and H3N2 subtypes identified the amino acid residue phenylalanine (F) at position 103 to be decisive for JNK activation. Cleavage- and polyadenylation-specific factor 30 (CPSF30), whose binding to NS1 is stabilized by the amino acids F103 and M106, is not involved in JNK activation. Conclusively, subtype-specific sequence variations in the IAV NS1 protein result in subtype-specific differences in JNK signaling upon IAV infection. IMPORTANCE Influenza A virus (IAV) infection leads to the activation or modulation of multiple signaling pathways. Here, we demonstrate for the first time that the c-jun N-terminal kinase (JNK), a long-known stress-activated mitogen-activated protein (MAP) kinase, is activated by RIG-I when cells are treated with IAV RNA. However, at the same time, nonstructural protein 1 (NS1) of IAV has an intrinsic JNK-activating property that is dependent on IAV subtype-specific amino acid variations around position 103. Our findings identify two different and independent pathways that result in the activation of JNK in the course of an IAV infection. PMID:24872593
Genetic structure of Culex erraticus populations across the Americas.
Mendenhall, Ian H; Bahl, Justin; Blum, Michael J; Wesson, Dawn M
2012-05-01
Culex erraticus (Dyar & Knab) is a potential competent vector for several arboviruses such as Eastern and Venezuelan equine encephalitis viruses and West Nile virus. It therefore may play a role in the maintenance and spread of viral populations in areas of concern, including the United States where it occurs in >33 states. However, little information is available on potential barriers to movement across the species' distribution. Here, we analyze genetic variation among Cx. erraticus collected from Colombia, Guatemala, and nine locations in the United States to better understand population structure and connectivity. Comparative sequence analysis of the second internal transcribed spacer and mitochondrial NADH dehydrogenase genes identified two major lineages of sampled populations. One lineage represented the central and eastern United States, whereas the other corresponded to Central America, South America, and the western United States. Hierarchical analysis of genetic variation provided further evidence of regional population structure, although the majority of genetic variation was found to reside within populations, suggestive of large population sizes. Although significant physical barriers such as the Chihuahuan Desert probably constrain the spread of Cx. erraticus, large population sizes and connectivity within regions remain important risk factors that probably contribute to the movement of arboviruses within and between these regions.
Frange, Pierre; Meyer, Laurence; Jung, Matthieu; Goujard, Cecile; Zucman, David; Abel, Sylvie; Hochedez, Patrick; Gousset, Marine; Gascuel, Olivier; Rouzioux, Christine; Chaix, Marie-Laure
2013-01-01
Objective Characterization of HIV-1 sequences in newly infected individuals is important for elucidating the mechanisms of viral sexual transmission. We report the identification of transmitted/founder viruses in eight pairs of HIV-1 sexually-infected patients enrolled at the time of primary infection (“recipients”) and their transmitting partners (“donors”). Methods Using a single genome-amplification approach, we compared quasispecies in donors and recipients on the basis of 316 and 376 C2V5 env sequences amplified from plasma viral RNA and PBMC-associated DNA, respectively. Results Both DNA and RNA sequences indicated very homogeneous viral populations in all recipients, suggesting transmission of a single variant, even in cases of recent sexually transmitted infections (STIs) in donors (n = 2) or recipients (n = 3). In all pairs, the transmitted/founder virus was derived from an infrequent variant population within the blood of the donor. The donor variant sequences most closely related to the recipient sequences were found in plasma samples in 3/8 cases and/or in PBMC samples in 6/8 cases. Although donors were exclusively (n = 4) or predominantly (n = 4) infected by CCR5-tropic (R5) strains, two recipients were infected with highly homogeneous CXCR4/dual-mixed-tropic (X4/DM) viral populations, identified in both DNA and RNA. The proportion of X4/DM quasispecies in donors was higher in cases of X4/DM than R5 HIV transmission (16.7–22.0% versus 0–2.6%), suggesting that X4/DM transmission may be associated with a threshold population of X4/DM circulating quasispecies in donors. Conclusions These suggest that a severe genetic bottleneck occurs during subtype B HIV-1 heterosexual and homosexual transmission. Sexually-transmitted/founder virus cannot be directly predicted by analysis of the donor’s quasispecies in plasma and/or PBMC. Additional studies are required to fully understand the traits that confer the capacity to transmit and establish infection, and determine the role of concomitant STIs in mitigating the genetic bottleneck in mucosal HIV transmission. PMID:23874894
Sedlackova, Lenka; Perkins, Keith D; Meyer, Julia; Strain, Anna K; Goldman, Oksana; Rice, Stephen A
2010-03-01
During productive herpes simplex virus type 1 (HSV-1) infection, a subset of viral delayed-early (DE) and late (L) genes require the immediate-early (IE) protein ICP27 for their expression. However, the cis-acting regulatory sequences in DE and L genes that mediate their specific induction by ICP27 are unknown. One viral L gene that is highly dependent on ICP27 is that encoding glycoprotein C (gC). We previously demonstrated that this gene is posttranscriptionally transactivated by ICP27 in a plasmid cotransfection assay. Based on our past results, we hypothesized that the gC gene possesses a cis-acting inhibitory sequence and that ICP27 overcomes the effects of this sequence to enable efficient gC expression. To test this model, we systematically deleted sequences from the body of the gC gene and tested the resulting constructs for expression. In so doing, we identified a 258-bp "silencing element" (SE) in the 5' portion of the gC coding region. When present, the SE inhibits gC mRNA accumulation from a transiently transfected gC gene, unless ICP27 is present. Moreover, the SE can be transferred to another HSV-1 gene, where it inhibits mRNA accumulation in the absence of ICP27 and confers high-level expression in the presence of ICP27. Thus, for the first time, an ICP27-responsive sequence has been identified in a physiologically relevant ICP27 target gene. To see if the SE functions during viral infection, we engineered HSV-1 recombinants that lack the SE, either in a wild-type (WT) or ICP27-null genetic background. In an ICP27-null background, deletion of the SE led to ICP27-independent expression of the gC gene, demonstrating that the SE functions during viral infection. Surprisingly, the ICP27-independent gC expression seen with the mutant occurred even in the absence of viral DNA synthesis, indicating that the SE helps to regulate the tight DNA replication-dependent expression of gC.
A first report and complete genome sequence of alfalfa enamovirus from Sudan
USDA-ARS?s Scientific Manuscript database
A full genome sequence of a viral pathogen, provisionally named alfalfa enamovirus 2 (AEV-2), was reconstructed from short reads obtained by Illumina RNA sequencing of alfalfa sample originating from Sudan. Ambiguous nucleotides in the resultant consensus assembly and identity of the predicted virus...
Assessment of FIV-C infection of cats as a function of treatment with the protease inhibitor, TL-3
de Rozières, Sohela; Swan, Christina H; Sheeter, Dennis A; Clingerman, Karen J; Lin, Ying-Chuan; Huitron-Resendiz, Salvador; Henriksen, Steven; Torbett, Bruce E; Elder, John H
2004-01-01
Background The protease inhibitor, TL-3, demonstrated broad efficacy in vitro against FIV, HIV and SIV (simian immunodeficiency virus), and exhibited very strong protective effects on early neurologic alterations in the CNS of FIV-PPR infected cats. In this study, we analyzed TL-3 efficacy using a highly pathogenic FIV-C isolate, which causes a severe acute phase immunodeficiency syndrome, with high early mortality rates. Results Twenty cats were infected with uncloned FIV-C and half were treated with TL-3 while the other half were left untreated. Two uninfected cats were used as controls. The general health and the immunological and virological status of the animals was monitored for eight weeks following infection. All infected animals became viremic independent of TL-3 treatment and seven of 20 FIV-C infected animals developed severe immunodepletive disease in conjunction with significantly (p ≤ 0.05) higher viral RNA loads as compared to asymptomatic animals. A marked and progressive increase in CD8+ T lymphocytes in animals surviving acute phase infection was noted, which was not evident in symptomatic animals (p ≤ 0.05). Average viral loads were lower in TL-3 treated animals and of the 6 animals requiring euthanasia, four were from the untreated cohort. At eight weeks post infection, half of the TL-3 treated animals and only one of six untreated animals had viral loads below detection limits. Analysis of protease genes in TL-3 treated animals with higher than average viral loads revealed sequence variations relative to wild type protease. In particular, one mutant, D105G, imparted 5-fold resistance against TL-3 relative to wild type protease. Conclusions The findings indicate that the protease inhibitor, TL-3, when administered orally as a monotherapy, did not prevent viremia in cats infected with high dose FIV-C. However, the modest lowering of viral loads with TL-3 treatment, the greater survival rate in symptomatic animals of the treated cohort, and the lower average viral load in TL-3 treated animals at eight weeks post infection is indicative of a therapeutic effect of the compound on virus infection. PMID:15555065
Assessment of FIV-C infection of cats as a function of treatment with the protease inhibitor, TL-3.
de Rozières, Sohela; Swan, Christina H; Sheeter, Dennis A; Clingerman, Karen J; Lin, Ying-Chuan; Huitron-Resendiz, Salvador; Henriksen, Steven; Torbett, Bruce E; Elder, John H
2004-11-19
The protease inhibitor, TL-3, demonstrated broad efficacy in vitro against FIV, HIV and SIV (simian immunodeficiency virus), and exhibited very strong protective effects on early neurologic alterations in the CNS of FIV-PPR infected cats. In this study, we analyzed TL-3 efficacy using a highly pathogenic FIV-C isolate, which causes a severe acute phase immunodeficiency syndrome, with high early mortality rates. Twenty cats were infected with uncloned FIV-C and half were treated with TL-3 while the other half were left untreated. Two uninfected cats were used as controls. The general health and the immunological and virological status of the animals was monitored for eight weeks following infection. All infected animals became viremic independent of TL-3 treatment and seven of 20 FIV-C infected animals developed severe immunodepletive disease in conjunction with significantly (p < or = 0.05) higher viral RNA loads as compared to asymptomatic animals. A marked and progressive increase in CD8+ T lymphocytes in animals surviving acute phase infection was noted, which was not evident in symptomatic animals (p < or = 0.05). Average viral loads were lower in TL-3 treated animals and of the 6 animals requiring euthanasia, four were from the untreated cohort. At eight weeks post infection, half of the TL-3 treated animals and only one of six untreated animals had viral loads below detection limits. Analysis of protease genes in TL-3 treated animals with higher than average viral loads revealed sequence variations relative to wild type protease. In particular, one mutant, D105G, imparted 5-fold resistance against TL-3 relative to wild type protease. The findings indicate that the protease inhibitor, TL-3, when administered orally as a monotherapy, did not prevent viremia in cats infected with high dose FIV-C. However, the modest lowering of viral loads with TL-3 treatment, the greater survival rate in symptomatic animals of the treated cohort, and the lower average viral load in TL-3 treated animals at eight weeks post infection is indicative of a therapeutic effect of the compound on virus infection.
Neuman, Benjamin W.; Stein, David A.; Kroeker, Andrew D.; Churchill, Michael J.; Kim, Alice M.; Kuhn, Peter; Dawson, Philip; Moulton, Hong M.; Bestwick, Richard K.; Iversen, Patrick L.; Buchmeier, Michael J.
2005-01-01
The recently emerged severe acute respiratory syndrome coronavirus (SARS-CoV) is a potent pathogen of humans and is capable of rapid global spread. Peptide-conjugated antisense morpholino oligomers (P-PMO) were designed to bind by base pairing to specific sequences in the SARS-CoV (Tor2 strain) genome. The P-PMO were tested for their capacity to inhibit production of infectious virus as well as to probe the function of conserved viral RNA motifs and secondary structures. Several virus-targeted P-PMO and a random-sequence control P-PMO showed low inhibitory activity against SARS coronavirus. Certain other virus-targeted P-PMO reduced virus-induced cytopathology and cell-to-cell spread as a consequence of decreasing viral amplification. Active P-PMO were effective when administered at any time prior to peak viral synthesis and exerted sustained antiviral effects while present in culture medium. P-PMO showed low nonspecific inhibitory activity against translation of nontargeted RNA or growth of the arenavirus lymphocytic choriomeningitis virus. Two P-PMO targeting the viral transcription-regulatory sequence (TRS) region in the 5′ untranslated region were the most effective inhibitors tested. After several viral passages in the presence of a TRS-targeted P-PMO, partially drug-resistant SARS-CoV mutants arose which contained three contiguous base point mutations at the binding site of a TRS-targeted P-PMO. Those partially resistant viruses grew more slowly and formed smaller plaques than wild-type SARS-CoV. These results suggest PMO compounds have powerful therapeutic and investigative potential toward coronavirus infection. PMID:16014928
The Chern-Simons Current in Systems of DNA-RNA Transcriptions
NASA Astrophysics Data System (ADS)
Capozziello, Salvatore; Pincak, Richard; Kanjamapornkul, Kabin; Saridakis, Emmanuel N.
2018-04-01
A Chern-Simons current, coming from ghost and anti-ghost fields of supersymmetry theory, can be used to define a spectrum of gene expression in new time series data where a spinor field, as alternative representation of a gene, is adopted instead of using the standard alphabet sequence of bases $A, T, C, G, U$. After a general discussion on the use of supersymmetry in biological systems, we give examples of the use of supersymmetry for living organism, discuss the codon and anti-codon ghost fields and develop an algebraic construction for the trash DNA, the DNA area which does not seem active in biological systems. As a general result, all hidden states of codon can be computed by Chern-Simons 3 forms. Finally, we plot a time series of genetic variations of viral glycoprotein gene and host T-cell receptor gene by using a gene tensor correlation network related to the Chern-Simons current. An empirical analysis of genetic shift, in host cell receptor genes with separated cluster of gene and genetic drift in viral gene, is obtained by using a tensor correlation plot over time series data derived as the empirical mode decomposition of Chern-Simons current.
The evolutionary dynamics of the lion Panthera leo revealed by host and viral population genomics.
Antunes, Agostinho; Troyer, Jennifer L; Roelke, Melody E; Pecon-Slattery, Jill; Packer, Craig; Winterbach, Christiaan; Winterbach, Hanlie; Hemson, Graham; Frank, Laurence; Stander, Philip; Siefert, Ludwig; Driciru, Margaret; Funston, Paul J; Alexander, Kathy A; Prager, Katherine C; Mills, Gus; Wildt, David; Bush, Mitch; O'Brien, Stephen J; Johnson, Warren E
2008-11-01
The lion Panthera leo is one of the world's most charismatic carnivores and is one of Africa's key predators. Here, we used a large dataset from 357 lions comprehending 1.13 megabases of sequence data and genotypes from 22 microsatellite loci to characterize its recent evolutionary history. Patterns of molecular genetic variation in multiple maternal (mtDNA), paternal (Y-chromosome), and biparental nuclear (nDNA) genetic markers were compared with patterns of sequence and subtype variation of the lion feline immunodeficiency virus (FIV(Ple)), a lentivirus analogous to human immunodeficiency virus (HIV). In spite of the ability of lions to disperse long distances, patterns of lion genetic diversity suggest substantial population subdivision (mtDNA Phi(ST) = 0.92; nDNA F(ST) = 0.18), and reduced gene flow, which, along with large differences in sero-prevalence of six distinct FIV(Ple) subtypes among lion populations, refute the hypothesis that African lions consist of a single panmictic population. Our results suggest that extant lion populations derive from several Pleistocene refugia in East and Southern Africa ( approximately 324,000-169,000 years ago), which expanded during the Late Pleistocene ( approximately 100,000 years ago) into Central and North Africa and into Asia. During the Pleistocene/Holocene transition ( approximately 14,000-7,000 years), another expansion occurred from southern refugia northwards towards East Africa, causing population interbreeding. In particular, lion and FIV(Ple) variation affirms that the large, well-studied lion population occupying the greater Serengeti Ecosystem is derived from three distinct populations that admixed recently.
The Evolutionary Dynamics of the Lion Panthera leo Revealed by Host and Viral Population Genomics
Antunes, Agostinho; Troyer, Jennifer L.; Roelke, Melody E.; Pecon-Slattery, Jill; Packer, Craig; Winterbach, Christiaan; Winterbach, Hanlie; Hemson, Graham; Frank, Laurence; Stander, Philip; Siefert, Ludwig; Driciru, Margaret; Funston, Paul J.; Alexander, Kathy A.; Prager, Katherine C.; Mills, Gus; Wildt, David; Bush, Mitch; O'Brien, Stephen J.; Johnson, Warren E.
2008-01-01
The lion Panthera leo is one of the world's most charismatic carnivores and is one of Africa's key predators. Here, we used a large dataset from 357 lions comprehending 1.13 megabases of sequence data and genotypes from 22 microsatellite loci to characterize its recent evolutionary history. Patterns of molecular genetic variation in multiple maternal (mtDNA), paternal (Y-chromosome), and biparental nuclear (nDNA) genetic markers were compared with patterns of sequence and subtype variation of the lion feline immunodeficiency virus (FIVPle), a lentivirus analogous to human immunodeficiency virus (HIV). In spite of the ability of lions to disperse long distances, patterns of lion genetic diversity suggest substantial population subdivision (mtDNA ΦST = 0.92; nDNA F ST = 0.18), and reduced gene flow, which, along with large differences in sero-prevalence of six distinct FIVPle subtypes among lion populations, refute the hypothesis that African lions consist of a single panmictic population. Our results suggest that extant lion populations derive from several Pleistocene refugia in East and Southern Africa (∼324,000–169,000 years ago), which expanded during the Late Pleistocene (∼100,000 years ago) into Central and North Africa and into Asia. During the Pleistocene/Holocene transition (∼14,000–7,000 years), another expansion occurred from southern refugia northwards towards East Africa, causing population interbreeding. In particular, lion and FIVPle variation affirms that the large, well-studied lion population occupying the greater Serengeti Ecosystem is derived from three distinct populations that admixed recently. PMID:18989457
Hughes, Joseph; Biek, Roman; Litster, Annette; Willett, Brian J.; Hosie, Margaret J.
2015-01-01
Analysing the evolution of feline immunodeficiency virus (FIV) at the intra-host level is important in order to address whether the diversity and composition of viral quasispecies affect disease progression. We examined the intra-host diversity and the evolutionary rates of the entire env and structural fragments of the env sequences obtained from sequential blood samples in 43 naturally infected domestic cats that displayed different clinical outcomes. We observed in the majority of cats that FIV env showed very low levels of intra-host diversity. We estimated that env evolved at a rate of 1.16×10−3 substitutions per site per year and demonstrated that recombinant sequences evolved faster than non-recombinant sequences. It was evident that the V3–V5 fragment of FIV env displayed higher evolutionary rates in healthy cats than in those with terminal illness. Our study provided the first evidence that the leader sequence of env, rather than the V3–V5 sequence, had the highest intra-host diversity and the highest evolutionary rate of all env fragments, consistent with this region being under a strong selective pressure for genetic variation. Overall, FIV env displayed relatively low intra-host diversity and evolved slowly in naturally infected cats. The maximum evolutionary rate was observed in the leader sequence of env. Although genetic stability is not necessarily a prerequisite for clinical stability, the higher genetic stability of FIV compared with human immunodeficiency virus might explain why many naturally infected cats do not progress rapidly to AIDS. PMID:25535323