DOE Office of Scientific and Technical Information (OSTI.GOV)
Lokareddy, Ravi K.; Sankhala, Rajeshwer S.; Roy, Ankoor
Tailed bacteriophages and herpesviruses assemble infectious particles via an empty precursor capsid (or ‘procapsid’) built by multiple copies of coat and scaffolding protein and by one dodecameric portal protein. Genome packaging triggers rearrangement of the coat protein and release of scaffolding protein, resulting in dramatic procapsid lattice expansion. Here, we provide structural evidence that the portal protein of the bacteriophage P22 exists in two distinct dodecameric conformations: an asymmetric assembly in the procapsid (PC-portal) that is competent for high affinity binding to the large terminase packaging protein, and a symmetric ring in the mature virion (MV-portal) that has negligible affinitymore » for the packaging motor. Modelling studies indicate the structure of PC-portal is incompatible with DNA coaxially spooled around the portal vertex, suggesting that newly packaged DNA triggers the switch from PC- to MV-conformation. Thus, we propose the signal for termination of ‘Headful Packaging’ is a DNA-dependent symmetrization of portal protein.« less
Population Dynamics of Viral Inactivation
NASA Astrophysics Data System (ADS)
Freeman, Krista; Li, Dong; Behrens, Manja; Streletzky, Kiril; Olsson, Ulf; Evilevitch, Alex
We have investigated the population dynamics of viral inactivation in vitrousing time-resolved cryo electron microscopy combined with light and X-ray scattering techniques. Using bacteriophage λ as a model system for pressurized double-stranded DNA viruses, we found that virions incubated with their cell receptor eject their genome in a stochastic triggering process. The triggering of DNA ejection occurs in a non synchronized manner after the receptor addition, resulting in an exponential decay of the number of genome-filled viruses with time. We have explored the characteristic time constant of this triggering process at different temperatures, salt conditions, and packaged genome lengths. Furthermore, using the temperature dependence we determined an activation energy for DNA ejections. The dependences of the time constant and activation energy on internal DNA pressure, affected by salt conditions and encapsidated genome length, suggest that the triggering process is directly dependent on the conformational state of the encapsidated DNA. The results of this work provide insight into how the in vivo kinetics of the spread of viral infection are influenced by intra- and extra cellular environmental conditions. This material is based upon work supported by the National Science Foundation Graduate Research Fellowship under Grant No. DGE-1252522.
Analysis pipelines and packages for Infinium HumanMethylation450 BeadChip (450k) data
Morris, Tiffany J.; Beck, Stephan
2015-01-01
The Illumina HumanMethylation450 BeadChip has become a popular platform for interrogating DNA methylation in epigenome-wide association studies (EWAS) and related projects as well as resource efforts such as the International Cancer Genome Consortium (ICGC) and the International Human Epigenome Consortium (IHEC). This has resulted in an exponential increase of 450k data in recent years and triggered the development of numerous integrated analysis pipelines and stand-alone packages. This review will introduce and discuss the currently most popular pipelines and packages and is particularly aimed at new 450k users. PMID:25233806
Analysis pipelines and packages for Infinium HumanMethylation450 BeadChip (450k) data.
Morris, Tiffany J; Beck, Stephan
2015-01-15
The Illumina HumanMethylation450 BeadChip has become a popular platform for interrogating DNA methylation in epigenome-wide association studies (EWAS) and related projects as well as resource efforts such as the International Cancer Genome Consortium (ICGC) and the International Human Epigenome Consortium (IHEC). This has resulted in an exponential increase of 450k data in recent years and triggered the development of numerous integrated analysis pipelines and stand-alone packages. This review will introduce and discuss the currently most popular pipelines and packages and is particularly aimed at new 450k users. Copyright © 2014 The Authors. Published by Elsevier Inc. All rights reserved.
On the Selective Packaging of Genomic RNA by HIV-1.
Comas-Garcia, Mauricio; Davis, Sean R; Rein, Alan
2016-09-12
Like other retroviruses, human immunodeficiency virus type 1 (HIV-1) selectively packages genomic RNA (gRNA) during virus assembly. However, in the absence of the gRNA, cellular messenger RNAs (mRNAs) are packaged. While the gRNA is selected because of its cis-acting packaging signal, the mechanism of this selection is not understood. The affinity of Gag (the viral structural protein) for cellular RNAs at physiological ionic strength is not much higher than that for the gRNA. However, binding to the gRNA is more salt-resistant, implying that it has a higher non-electrostatic component. We have previously studied the spacer 1 (SP1) region of Gag and showed that it can undergo a concentration-dependent conformational transition. We proposed that this transition represents the first step in assembly, i.e., the conversion of Gag to an assembly-ready state. To explain selective packaging of gRNA, we suggest here that binding of Gag to gRNA, with its high non-electrostatic component, triggers this conversion more readily than binding to other RNAs; thus we predict that a Gag-gRNA complex will nucleate particle assembly more efficiently than other Gag-RNA complexes. New data shows that among cellular mRNAs, those with long 3'-untranslated regions (UTR) are selectively packaged. It seems plausible that the 3'-UTR, a stretch of RNA not occupied by ribosomes, offers a favorable binding site for Gag.
Influence of Internal DNA Pressure on Stability and Infectivity of Phage λ
Bauer, D. W.; Evilevitch, A.
2016-01-01
Viruses must remain infectious while in harsh extracellular environments. An important aspect of viral particle stability for double-stranded DNA viruses is the energetically unfavorable state of the tightly confined DNA chain within the virus capsid creating pressures of tens of atmospheres. Here we study the influence of internal genome pressure on the thermal stability of viral particles. Using differential scanning calorimetry (DSC) to monitor genome loss upon heating, we find that internal pressure destabilizes the virion, resulting in a smaller activation energy barrier to trigger DNA release. These experiments are complemented by plaque assay and electron microscopy measurements to determine the influence of intra-capsid DNA pressure on the rates of viral infectivity loss. At higher temperatures (65 – 75 °C), failure to retain the packaged genome is the dominant mechanism of viral inactivation. Conversely, at lower temperatures (40 – 55 ºC), a separate inactivation mechanism dominates, which results in non-infectious particles that still retain their packaged DNA. Most significantly, both mechanisms of infectivity loss are directly influenced by internal DNA pressure, with higher pressure resulting in a more rapid rate of inactivation at all temperatures. PMID:26254570
Components of Adenovirus Genome Packaging
Ahi, Yadvinder S.; Mittal, Suresh K.
2016-01-01
Adenoviruses (AdVs) are icosahedral viruses with double-stranded DNA (dsDNA) genomes. Genome packaging in AdV is thought to be similar to that seen in dsDNA containing icosahedral bacteriophages and herpesviruses. Specific recognition of the AdV genome is mediated by a packaging domain located close to the left end of the viral genome and is mediated by the viral packaging machinery. Our understanding of the role of various components of the viral packaging machinery in AdV genome packaging has greatly advanced in recent years. Characterization of empty capsids assembled in the absence of one or more components involved in packaging, identification of the unique vertex, and demonstration of the role of IVa2, the putative packaging ATPase, in genome packaging have provided compelling evidence that AdVs follow a sequential assembly pathway. This review provides a detailed discussion on the functions of the various viral and cellular factors involved in AdV genome packaging. We conclude by briefly discussing the roles of the empty capsids, assembly intermediates, scaffolding proteins, portal vertex and DNA encapsidating enzymes in AdV assembly and packaging. PMID:27721809
Morales, Lucia; Mateos-Gomez, Pedro A.; Capiscol, Carmen; del Palacio, Lorena; Sola, Isabel
2013-01-01
Preferential RNA packaging in coronaviruses involves the recognition of viral genomic RNA, a crucial process for viral particle morphogenesis mediated by RNA-specific sequences, known as packaging signals. An essential packaging signal component of transmissible gastroenteritis coronavirus (TGEV) has been further delimited to the first 598 nucleotides (nt) from the 5′ end of its RNA genome, by using recombinant viruses transcribing subgenomic mRNA that included potential packaging signals. The integrity of the entire sequence domain was necessary because deletion of any of the five structural motifs defined within this region abrogated specific packaging of this viral RNA. One of these RNA motifs was the stem-loop SL5, a highly conserved motif in coronaviruses located at nucleotide positions 106 to 136. Partial deletion or point mutations within this motif also abrogated packaging. Using TGEV-derived defective minigenomes replicated in trans by a helper virus, we have shown that TGEV RNA packaging is a replication-independent process. Furthermore, the last 494 nt of the genomic 3′ end were not essential for packaging, although this region increased packaging efficiency. TGEV RNA sequences identified as necessary for viral genome packaging were not sufficient to direct packaging of a heterologous sequence derived from the green fluorescent protein gene. These results indicated that TGEV genome packaging is a complex process involving many factors in addition to the identified RNA packaging signal. The identification of well-defined RNA motifs within the TGEV RNA genome that are essential for packaging will be useful for designing packaging-deficient biosafe coronavirus-derived vectors and providing new targets for antiviral therapies. PMID:23966403
Dynamics of bacteriophage genome ejection in vitro and in vivo
NASA Astrophysics Data System (ADS)
Panja, Debabrata; Molineux, Ian J.
2010-12-01
Bacteriophages, phages for short, are viruses of bacteria. The majority of phages contain a double-stranded DNA genome packaged in a capsid at a density of ~500 mg ml-1. This high density requires substantial compression of the normal B-form helix, leading to the conjecture that DNA in mature phage virions is under significant pressure, and that pressure is used to eject the DNA during infection. A large number of theoretical, computer simulation and in vitro experimental studies surrounding this conjecture have revealed many—though often isolated and/or contradictory—aspects of packaged DNA. This prompts us to present a unified view of the statistical physics and thermodynamics of DNA packaged in phage capsids. We argue that the DNA in a mature phage is in a (meta)stable state, wherein electrostatic self-repulsion is balanced by curvature stress due to confinement in the capsid. We show that in addition to the osmotic pressure associated with the packaged DNA and its counterions, there are four different pressures within the capsid: pressure on the DNA, hydrostatic pressure, the pressure experienced by the capsid and the pressure associated with the chemical potential of DNA ejection. Significantly, we analyze the mechanism of force transmission in the packaged DNA and demonstrate that the pressure on DNA is not important for ejection. We derive equations showing a strong hydrostatic pressure difference across the capsid shell. We propose that when a phage is triggered to eject by interaction with its receptor in vitro, the (thermodynamic) incentive of water molecules to enter the phage capsid flushes the DNA out of the capsid. In vivo, the difference between the osmotic pressures in the bacterial cell cytoplasm and the culture medium similarly results in a water flow that drags the DNA out of the capsid and into the bacterial cell.
Yang, Qin; Maluf, Nasib Karl; Catalano, Carlos Enrique
2008-11-28
The developmental pathways for a variety of eukaryotic and prokaryotic double-stranded DNA viruses include packaging of viral DNA into a preformed procapsid structure, catalyzed by terminase enzymes and fueled by ATP hydrolysis. In most instances, a capsid expansion process accompanies DNA packaging, which significantly increases the volume of the capsid to accommodate the full-length viral genome. "Decoration" proteins add to the surface of the expanded capsid lattice, and the terminase motors tightly package DNA, generating up to approximately 20 atm of internal capsid pressure. Herein we describe biochemical studies on genome packaging using bacteriophage lambda as a model system. Kinetic analysis suggests that the packaging motor possesses at least four ATPase catalytic sites that act cooperatively to effect DNA translocation, and that the motor is highly processive. While not required for DNA translocation into the capsid, the phage lambda capsid decoration protein gpD is essential for the packaging of the penultimate 8-10 kb (15-20%) of the viral genome; virtually no DNA is packaged in the absence of gpD when large DNA substrates are used, most likely due to a loss of capsid structural integrity. Finally, we show that ATP hydrolysis is required to retain the genome in a packaged state subsequent to condensation within the capsid. Presumably, the packaging motor continues to "idle" at the genome end and to maintain a positive pressure towards the packaged state. Surprisingly, ADP, guanosine triphosphate, and the nonhydrolyzable ATP analog 5'-adenylyl-beta,gamma-imidodiphosphate (AMP-PNP) similarly stabilize the packaged viral genome despite the fact that they fail to support genome packaging. In contrast, the poorly hydrolyzed ATP analog ATP-gammaS only partially stabilizes the nucleocapsid, and a DNA is released in "quantized" steps. We interpret the ensemble of data to indicate that (i) the viral procapsid possesses a degree of plasticity that is required to accommodate the packaging of large DNA substrates; (ii) the gpD decoration protein is required to stabilize the fully expanded capsid; and (iii) nucleotides regulate high-affinity DNA binding interactions that are required to maintain DNA in the packaged state.
Ionic switch controls the DNA state in phage λ
Li, Dong; Liu, Ting; Zuo, Xiaobing; Li, Tao; Qiu, Xiangyun; Evilevitch, Alex
2015-01-01
We have recently found that DNA packaged in phage λ undergoes a disordering transition triggered by temperature, which results in increased genome mobility. This solid-to-fluid like DNA transition markedly increases the number of infectious λ particles facilitating infection. However, the structural transition strongly depends on temperature and ionic conditions in the surrounding medium. Using titration microcalorimetry combined with solution X-ray scattering, we mapped both energetic and structural changes associated with transition of the encapsidated λ-DNA. Packaged DNA needs to reach a critical stress level in order for transition to occur. We varied the stress on DNA in the capsid by changing the temperature, packaged DNA length and ionic conditions. We found striking evidence that the intracapsid DNA transition is ‘switched on’ at the ionic conditions mimicking those in vivo and also at the physiologic temperature of infection at 37°C. This ion regulated on-off switch of packaged DNA mobility in turn affects viral replication. These results suggest a remarkable adaptation of phage λ to the environment of its host bacteria in the human gut. The metastable DNA state in the capsid provides a new paradigm for the physical evolution of viruses. PMID:26092697
Ionic switch controls the DNA state in phage λ
Li, Dong; Liu, Ting; Zuo, Xiaobing; ...
2015-06-19
We have recently found that DNA packaged in phage λ undergoes a disordering transition triggered by temperature, which results in increased genome mobility. This solid-to-fluid like DNA transition markedly increases the number of infectious λ particles facilitating infection. However, the structural transition strongly depends on temperature and ionic conditions in the surrounding medium. Using titration microcalorimetry combined with solution X-ray scattering, we mapped both energetic and structural changes associated with transition of the encapsidated λ-DNA. Packaged DNA needs to reach a critical stress level in order for transition to occur. We varied the stress on DNA in the capsid bymore » changing the temperature, packaged DNA length and ionic conditions. We found striking evidence that the intracapsid DNA transition is ‘switched on’ at the ionic conditions mimicking those in vivo and also at the physiologic temperature of infection at 37°C. This ion regulated on-off switch of packaged DNA mobility in turn affects viral replication. The results suggest a remarkable adaptation of phage λ to the environment of its host bacteria in the human gut. The metastable DNA state in the capsid provides a new paradigm for the physical evolution of viruses.« less
Ionic switch controls the DNA state in phage λ
DOE Office of Scientific and Technical Information (OSTI.GOV)
Li, Dong; Liu, Ting; Zuo, Xiaobing
We have recently found that DNA packaged in phage λ undergoes a disordering transition triggered by temperature, which results in increased genome mobility. This solid-to-fluid like DNA transition markedly increases the number of infectious λ particles facilitating infection. However, the structural transition strongly depends on temperature and ionic conditions in the surrounding medium. Using titration microcalorimetry combined with solution X-ray scattering, we mapped both energetic and structural changes associated with transition of the encapsidated λ-DNA. Packaged DNA needs to reach a critical stress level in order for transition to occur. We varied the stress on DNA in the capsid bymore » changing the temperature, packaged DNA length and ionic conditions. We found striking evidence that the intracapsid DNA transition is ‘switched on’ at the ionic conditions mimicking those in vivo and also at the physiologic temperature of infection at 37°C. This ion regulated on-off switch of packaged DNA mobility in turn affects viral replication. The results suggest a remarkable adaptation of phage λ to the environment of its host bacteria in the human gut. The metastable DNA state in the capsid provides a new paradigm for the physical evolution of viruses.« less
GenomeGraphs: integrated genomic data visualization with R.
Durinck, Steffen; Bullard, James; Spellman, Paul T; Dudoit, Sandrine
2009-01-06
Biological studies involve a growing number of distinct high-throughput experiments to characterize samples of interest. There is a lack of methods to visualize these different genomic datasets in a versatile manner. In addition, genomic data analysis requires integrated visualization of experimental data along with constantly changing genomic annotation and statistical analyses. We developed GenomeGraphs, as an add-on software package for the statistical programming environment R, to facilitate integrated visualization of genomic datasets. GenomeGraphs uses the biomaRt package to perform on-line annotation queries to Ensembl and translates these to gene/transcript structures in viewports of the grid graphics package. This allows genomic annotation to be plotted together with experimental data. GenomeGraphs can also be used to plot custom annotation tracks in combination with different experimental data types together in one plot using the same genomic coordinate system. GenomeGraphs is a flexible and extensible software package which can be used to visualize a multitude of genomic datasets within the statistical programming environment R.
Revisiting the genome packaging in viruses with lessons from the "Giants".
Chelikani, Venkata; Ranjan, Tushar; Kondabagil, Kiran
2014-10-01
Genome encapsidation is an essential step in the life cycle of viruses. Viruses either use some of the most powerful ATP-dependent motors to compel the genetic material into the preformed capsid or make use of the positively charged proteins to bind and condense the negatively charged genome in an energy-independent manner. While the former is a hallmark of large DNA viruses, the latter is commonly seen in small DNA and RNA viruses. Discoveries of many complex giant viruses such as mimivirus, megavirus, pandoravirus, etc., belonging to the nucleo-cytoplasmic large DNA virus (NCLDV) superfamily have changed the perception of genome packaging in viruses. From what little we have understood so far, it seems that the genome packaging mechanism in NCLDVs has nothing in common with other well-characterized viral packaging systems such as the portal-terminase system or the energy-independent system. Recent findings suggest that in giant viruses, the genome segregation and packaging processes are more intricately coupled than those of other viral systems. Interestingly, giant viral packaging systems also seem to possess features that are analogous to bacterial and archaeal chromosome segregation. Although there is a lot of diversity in terms of host range, type of genome, and genome size among viruses, they all seem to use three major types of independent innovations to accomplish genome encapsidation. Here, we have made an attempt to comprehensively review all the known viral genome packaging systems, including the one that is operative in giant viruses, by proposing a simple and expanded classification system that divides the viral packaging systems into three large groups (types I-III) on the basis of the mechanism employed and the relatedness of the major packaging proteins. Known variants within each group have been further classified into subgroups to reflect their unique adaptations. Copyright © 2014 Elsevier Inc. All rights reserved.
Deep sequencing of foot-and-mouth disease virus reveals RNA sequences involved in genome packaging.
Logan, Grace; Newman, Joseph; Wright, Caroline F; Lasecka-Dykes, Lidia; Haydon, Daniel T; Cottam, Eleanor M; Tuthill, Tobias J
2017-10-18
Non-enveloped viruses protect their genomes by packaging them into an outer shell or capsid of virus-encoded proteins. Packaging and capsid assembly in RNA viruses can involve interactions between capsid proteins and secondary structures in the viral genome as exemplified by the RNA bacteriophage MS2 and as proposed for other RNA viruses of plants, animals and human. In the picornavirus family of non-enveloped RNA viruses, the requirements for genome packaging remain poorly understood. Here we show a novel and simple approach to identify predicted RNA secondary structures involved in genome packaging in the picornavirus foot-and-mouth disease virus (FMDV). By interrogating deep sequencing data generated from both packaged and unpackaged populations of RNA we have determined multiple regions of the genome with constrained variation in the packaged population. Predicted secondary structures of these regions revealed stem loops with conservation of structure and a common motif at the loop. Disruption of these features resulted in attenuation of virus growth in cell culture due to a reduction in assembly of mature virions. This study provides evidence for the involvement of predicted RNA structures in picornavirus packaging and offers a readily transferable methodology for identifying packaging requirements in many other viruses. Importance In order to transmit their genetic material to a new host, non-enveloped viruses must protect their genomes by packaging them into an outer shell or capsid of virus-encoded proteins. For many non-enveloped RNA viruses the requirements for this critical part of the viral life cycle remain poorly understood. We have identified RNA sequences involved in genome packaging of the picornavirus foot-and-mouth disease virus. This virus causes an economically devastating disease of livestock affecting both the developed and developing world. The experimental methods developed to carry out this work are novel, simple and transferable to the study of packaging signals in other RNA viruses. Improved understanding of RNA packaging may lead to novel vaccine approaches or targets for antiviral drugs with broad spectrum activity. Copyright © 2017 Logan et al.
Specificity of interactions among the DNA-packaging machine components of T4-related bacteriophages.
Gao, Song; Rao, Venigalla B
2011-02-04
Tailed bacteriophages use powerful molecular motors to package the viral genome into a preformed capsid. Packaging at a rate of up to ∼2000 bp/s and generating a power density twice that of an automobile engine, the phage T4 motor is the fastest and most powerful reported to date. Central to DNA packaging are dynamic interactions among the packaging components, capsid (gp23), portal (gp20), motor (gp17, large "terminase"), and regulator (gp16, small terminase), leading to precise orchestration of the packaging process, but the mechanisms are poorly understood. Here we analyzed the interactions between small and large terminases of T4-related phages. Our results show that the gp17 packaging ATPase is maximally stimulated by homologous, but not heterologous, gp16. Multiple interaction sites are identified in both gp16 and gp17. The specificity determinants in gp16 are clustered in the diverged N- and C-terminal domains (regions I-III). Swapping of diverged region(s), such as replacing C-terminal RB49 region III with that of T4, switched ATPase stimulation specificity. Two specificity regions, amino acids 37-52 and 290-315, are identified in or near the gp17-ATPase "transmission" subdomain II. gp16 binding at these sites might cause a conformational change positioning the ATPase-coupling residues into the catalytic pocket, triggering ATP hydrolysis. These results lead to a model in which multiple weak interactions between motor and regulator allow dynamic assembly and disassembly of various packaging complexes, depending on the functional state of the packaging machine. This might be a general mechanism for regulation of the phage packaging machine and other complex molecular machines.
Substrate interactions and promiscuity in a viral DNA packaging motor.
Aathavan, K; Politzer, Adam T; Kaplan, Ariel; Moffitt, Jeffrey R; Chemla, Yann R; Grimes, Shelley; Jardine, Paul J; Anderson, Dwight L; Bustamante, Carlos
2009-10-01
The ASCE (additional strand, conserved E) superfamily of proteins consists of structurally similar ATPases associated with diverse cellular activities involving metabolism and transport of proteins and nucleic acids in all forms of life. A subset of these enzymes consists of multimeric ringed pumps responsible for DNA transport in processes including genome packaging in adenoviruses, herpesviruses, poxviruses and tailed bacteriophages. Although their mechanism of mechanochemical conversion is beginning to be understood, little is known about how these motors engage their nucleic acid substrates. Questions remain as to whether the motors contact a single DNA element, such as a phosphate or a base, or whether contacts are distributed over several parts of the DNA. Furthermore, the role of these contacts in the mechanochemical cycle is unknown. Here we use the genome packaging motor of the Bacillus subtilis bacteriophage varphi29 (ref. 4) to address these questions. The full mechanochemical cycle of the motor, in which the ATPase is a pentameric-ring of gene product 16 (gp16), involves two phases-an ATP-loading dwell followed by a translocation burst of four 2.5-base-pair (bp) steps triggered by hydrolysis product release. By challenging the motor with a variety of modified DNA substrates, we show that during the dwell phase important contacts are made with adjacent phosphates every 10-bp on the 5'-3' strand in the direction of packaging. As well as providing stable, long-lived contacts, these phosphate interactions also regulate the chemical cycle. In contrast, during the burst phase, we find that DNA translocation is driven against large forces by extensive contacts, some of which are not specific to the chemical moieties of DNA. Such promiscuous, nonspecific contacts may reflect common translocase-substrate interactions for both the nucleic acid and protein translocases of the ASCE superfamily.
Substrate Interactions and Promiscuity in a Viral DNA Packaging Motor
Aathavan, K.; Politzer, Adam T.; Kaplan, Ariel; Moffitt, Jeffrey R.; Chemla, Yann R.; Grimes, Shelley; Jardine, Paul J.; Anderson, Dwight L.; Bustamante, Carlos
2009-01-01
The ASCE superfamily of proteins consists of structurally similar ATPases associated with diverse cellular activities involving metabolism and transport of proteins and nucleic acids in all forms of life1. A subset of these enzymes are multimeric ringed pumps responsible for DNA transport in processes including genome packaging in adenoviruses, herpesviruses, poxviruses, and tailed bacteriophages2. While their mechanism of mechanochemical conversion is beginning to be understood3, little is known about how these motors engage their nucleic acid substrates. Do motors contact a single DNA element, such as a phosphate or a base, or are contacts distributed over multiple parts of the DNA? In addition, what role do these contacts play in the mechanochemical cycle? Here we use the genome packaging motor of the Bacillus subtilis bacteriophage φ294 to address these questions. The full mechanochemical cycle of the motor, whose ATPase is a pentameric-ring5 of gene product 16, involves two phases-- an ATP loading dwell followed by a translocation burst of four 2.5-bp steps6 triggered by hydrolysis product release7. By challenging the motor with a variety of modified DNA substrates, we find that during the dwell phase important contacts are made with adjacent phosphates every 10-bp on the 5’-3’ strand in the direction of packaging. In addition to providing stable, long-lived contacts, these phosphate interactions also regulate the chemical cycle. In contrast, during the burst phase, we find that DNA translocation is driven against large forces by extensive contacts, some of which are not specific to the chemical moieties of DNA. Such promiscuous, non-specific contacts may reflect common translocase-substrate interactions for both the nucleic acid and protein translocases of the ASCE superfamily1. PMID:19794496
The Influenza A Virus PB2, PA, NP, and M Segments Play a Pivotal Role during Genome Packaging
Gao, Qinshan; Chou, Yi-Ying; Doğanay, Sultan; Vafabakhsh, Reza; Ha, Taekjip
2012-01-01
The genomes of influenza A viruses consist of eight negative-strand RNA segments. Recent studies suggest that influenza viruses are able to specifically package their segmented genomes into the progeny virions. Segment-specific packaging signals of influenza virus RNAs (vRNAs) are located in the 5′ and 3′ noncoding regions, as well as in the terminal regions, of the open reading frames. How these packaging signals function during genome packaging remains unclear. Previously, we generated a 7-segmented virus in which the hemagglutinin (HA) and neuraminidase (NA) segments of the influenza A/Puerto Rico/8/34 virus were replaced by a chimeric influenza C virus hemagglutinin/esterase/fusion (HEF) segment carrying the HA packaging sequences. The robust growth of the HEF virus suggested that the NA segment is not required for the packaging of other segments. In this study, in order to determine the roles of the other seven segments during influenza A virus genome assembly, we continued to use this HEF virus as a tool and analyzed the effects of replacing the packaging sequences of other segments with those of the NA segment. Our results showed that deleting the packaging signals of the PB1, HA, or NS segment had no effect on the growth of the HEF virus, while growth was greatly impaired when the packaging sequence of the PB2, PA, nucleoprotein (NP), or matrix (M) segment was removed. These results indicate that the PB2, PA, NP, and M segments play a more important role than the remaining four vRNAs during the genome-packaging process. PMID:22532680
Expressing Transgenes That Exceed the Packaging Capacity of Adeno-Associated Virus Capsids
Chamberlain, Kyle; Riyad, Jalish Mahmud; Weber, Thomas
2016-01-01
Recombinant adeno-associated virus vectors (rAAV) are being explored as gene delivery vehicles for the treatment of various inherited and acquired disorders. rAAVs are attractive vectors for several reasons: wild-type AAVs are nonpathogenic, and rAAVs can trigger long-term transgene expression even in the absence of genome integration—at least in postmitotic tissues. Moreover, rAAVs have a low immunogenic profile, and the various AAV serotypes and variants display broad but distinct tropisms. One limitation of rAAVs is that their genome-packaging capacity is only ∼5 kb. For most applications this is not of major concern because the median human protein size is 375 amino acids. Excluding the ITRs, for a protein of typical length, this allows the incorporation of ∼3.5 kb of DNA for the promoter, polyadenylation sequence, and other regulatory elements into a single AAV vector. Nonetheless, for certain diseases the packaging limit of AAV does not allow the delivery of a full-length therapeutic protein by a single AAV vector. Hence, approaches to overcome this limitation have become an important area of research for AAV gene therapy. Among the most promising approaches to overcome the limitation imposed by the packaging capacity of AAV is the use of dual-vector approaches, whereby a transgene is split across two separate AAV vectors. Coinfection of a cell with these two rAAVs will then—through a variety of mechanisms—result in the transcription of an assembled mRNA that could not be encoded by a single AAV vector because of the DNA packaging limits of AAV. The main purpose of this review is to assess the current literature with respect to dual-AAV-vector design, to highlight the effectiveness of the different methodologies and to briefly discuss future areas of research to improve the efficiency of dual-AAV-vector transduction. PMID:26757051
Deciphering the role of the Gag-Pol ribosomal frameshift signal in HIV-1 RNA genome packaging.
Nikolaitchik, Olga A; Hu, Wei-Shau
2014-04-01
A key step of retroviral replication is packaging of the viral RNA genome during virus assembly. Specific packaging is mediated by interactions between the viral protein Gag and elements in the viral RNA genome. In HIV-1, similar to most retroviruses, the packaging signal is located within the 5' untranslated region and extends into the gag-coding region. A recent study reported that a region including the Gag-Pol ribosomal frameshift signal plays an important role in HIV-1 RNA packaging; deletions or mutations that affect the RNA structure of this signal lead to drastic decreases (10- to 50-fold) in viral RNA packaging and virus titer. We examined here the role of the ribosomal frameshift signal in HIV-1 RNA packaging by studying the RNA packaging and virus titer in the context of proviruses. Three mutants with altered ribosomal frameshift signal, either through direct deletion of the signal, mutation of the 6U slippery sequence, or alterations of the secondary structure were examined. We found that RNAs from all three mutants were packaged efficiently, and they generate titers similar to that of a virus containing the wild-type ribosomal frameshift signal. We conclude that although the ribosomal frameshift signal plays an important role in regulating the replication cycle, this RNA element is not directly involved in regulating RNA encapsidation. To generate infectious viruses, HIV-1 must package viral RNA genome during virus assembly. The specific HIV-1 genome packaging is mediated by interactions between the structural protein Gag and elements near the 5' end of the viral RNA known as packaging signal. In this study, we examined whether the Gag-Pol ribosomal frameshift signal is important for HIV-1 RNA packaging as recently reported. Our results demonstrated that when Gag/Gag-Pol is supplied in trans, none of the tested ribosomal frameshift signal mutants has defects in RNA packaging or virus titer. These studies provide important information on how HIV-1 regulates its genome packaging and generate infectious viruses necessary for transmission to new hosts.
Deciphering the Role of the Gag-Pol Ribosomal Frameshift Signal in HIV-1 RNA Genome Packaging
Nikolaitchik, Olga A.
2014-01-01
ABSTRACT A key step of retroviral replication is packaging of the viral RNA genome during virus assembly. Specific packaging is mediated by interactions between the viral protein Gag and elements in the viral RNA genome. In HIV-1, similar to most retroviruses, the packaging signal is located within the 5′ untranslated region and extends into the gag-coding region. A recent study reported that a region including the Gag-Pol ribosomal frameshift signal plays an important role in HIV-1 RNA packaging; deletions or mutations that affect the RNA structure of this signal lead to drastic decreases (10- to 50-fold) in viral RNA packaging and virus titer. We examined here the role of the ribosomal frameshift signal in HIV-1 RNA packaging by studying the RNA packaging and virus titer in the context of proviruses. Three mutants with altered ribosomal frameshift signal, either through direct deletion of the signal, mutation of the 6U slippery sequence, or alterations of the secondary structure were examined. We found that RNAs from all three mutants were packaged efficiently, and they generate titers similar to that of a virus containing the wild-type ribosomal frameshift signal. We conclude that although the ribosomal frameshift signal plays an important role in regulating the replication cycle, this RNA element is not directly involved in regulating RNA encapsidation. IMPORTANCE To generate infectious viruses, HIV-1 must package viral RNA genome during virus assembly. The specific HIV-1 genome packaging is mediated by interactions between the structural protein Gag and elements near the 5′ end of the viral RNA known as packaging signal. In this study, we examined whether the Gag-Pol ribosomal frameshift signal is important for HIV-1 RNA packaging as recently reported. Our results demonstrated that when Gag/Gag-Pol is supplied in trans, none of the tested ribosomal frameshift signal mutants has defects in RNA packaging or virus titer. These studies provide important information on how HIV-1 regulates its genome packaging and generate infectious viruses necessary for transmission to new hosts. PMID:24453371
Gel, Bernat; Díez-Villanueva, Anna; Serra, Eduard; Buschbeck, Marcus; Peinado, Miguel A; Malinverni, Roberto
2016-01-15
Statistically assessing the relation between a set of genomic regions and other genomic features is a common challenging task in genomic and epigenomic analyses. Randomization based approaches implicitly take into account the complexity of the genome without the need of assuming an underlying statistical model. regioneR is an R package that implements a permutation test framework specifically designed to work with genomic regions. In addition to the predefined randomization and evaluation strategies, regioneR is fully customizable allowing the use of custom strategies to adapt it to specific questions. Finally, it also implements a novel function to evaluate the local specificity of the detected association. regioneR is an R package released under Artistic-2.0 License. The source code and documents are freely available through Bioconductor (http://www.bioconductor.org/packages/regioneR). rmalinverni@carrerasresearch.org. © The Author 2015. Published by Oxford University Press.
Small terminase couples viral DNA-binding to genome-packaging ATPase activity
Roy, Ankoor; Bhardwaj, Anshul; Datta, Pinaki; Lander, Gabriel C.; Cingolani, Gino
2012-01-01
SUMMARY Packaging of viral genomes into empty procapsids is powered by a large DNA-packaging motor. In most viruses, this machine is composed of a large (L) and a small (S) terminase subunit complexed with a dodecamer of portal protein. Here, we describe the 1.75 Å crystal structure of the bacteriophage P22 S-terminase in a nonameric conformation. The structure presents a central channel ~23 Å in diameter, sufficiently large to accommodate hydrated B-DNA. The last 23 residues of S-terminase are essential for binding to DNA and assembly to L-terminase. Upon binding to its own DNA, S-terminase functions as a specific activator of L-terminase ATPase activity. The DNA-dependent stimulation of ATPase activity thus rationalizes the exclusive specificity of genome-packaging motors for viral DNA in the crowd of host DNA, ensuring fidelity of packaging and avoiding wasteful ATP hydrolysis. This posits a model for DNA-dependent activation of genome-packaging motors of general interest in virology. PMID:22771211
GenomeDiagram: a python package for the visualization of large-scale genomic data.
Pritchard, Leighton; White, Jennifer A; Birch, Paul R J; Toth, Ian K
2006-03-01
We present GenomeDiagram, a flexible, open-source Python module for the visualization of large-scale genomic, comparative genomic and other data with reference to a single chromosome or other biological sequence. GenomeDiagram may be used to generate publication-quality vector graphics, rastered images and in-line streamed graphics for webpages. The package integrates with datatypes from the BioPython project, and is available for Windows, Linux and Mac OS X systems. GenomeDiagram is freely available as source code (under GNU Public License) at http://bioinf.scri.ac.uk/lp/programs.html, and requires Python 2.3 or higher, and recent versions of the ReportLab and BioPython packages. A user manual, example code and images are available at http://bioinf.scri.ac.uk/lp/programs.html.
Cryo-Electron Microscopy of Viruses Infecting Bacterium
NASA Astrophysics Data System (ADS)
Chiu, Wah
2010-03-01
Single particle cryo-EM can yield structures of infectious bacterial viruses with and without imposed icosahedral symmetry at subnanometer resolution. Reconstructions of infectious and empty phage particles show substantial differences in the portal vertex protein complex at one of the 12 pentameric vertices in the icosahedral virus particle through which the viral genomes are packaged or released. In addition, electron cryo-tomography of viruses during infecting its bacterial host cell displayed multiple conformations of the tail fiber of the virus. Our structural observations by single particle and tomographic reconstructions suggest a mechanism whereby the viral tail fibers, upon binding to the host cell, induce a cascade of structural alterations of the portal vertex protein complex that triggers DNA release.
Zheng, Guangyong; Xu, Yaochen; Zhang, Xiujun; Liu, Zhi-Ping; Wang, Zhuo; Chen, Luonan; Zhu, Xin-Guang
2016-12-23
A gene regulatory network (GRN) represents interactions of genes inside a cell or tissue, in which vertexes and edges stand for genes and their regulatory interactions respectively. Reconstruction of gene regulatory networks, in particular, genome-scale networks, is essential for comparative exploration of different species and mechanistic investigation of biological processes. Currently, most of network inference methods are computationally intensive, which are usually effective for small-scale tasks (e.g., networks with a few hundred genes), but are difficult to construct GRNs at genome-scale. Here, we present a software package for gene regulatory network reconstruction at a genomic level, in which gene interaction is measured by the conditional mutual information measurement using a parallel computing framework (so the package is named CMIP). The package is a greatly improved implementation of our previous PCA-CMI algorithm. In CMIP, we provide not only an automatic threshold determination method but also an effective parallel computing framework for network inference. Performance tests on benchmark datasets show that the accuracy of CMIP is comparable to most current network inference methods. Moreover, running tests on synthetic datasets demonstrate that CMIP can handle large datasets especially genome-wide datasets within an acceptable time period. In addition, successful application on a real genomic dataset confirms its practical applicability of the package. This new software package provides a powerful tool for genomic network reconstruction to biological community. The software can be accessed at http://www.picb.ac.cn/CMIP/ .
Influence of sequence and size of DNA on packaging efficiency of parvovirus MVM-based vectors.
Brandenburger, A; Coessens, E; El Bakkouri, K; Velu, T
1999-05-01
We have derived a vector from the autonomous parvovirus MVM(p), which expresses human IL-2 specifically in transformed cells (Russell et al., J. Virol 1992;66:2821-2828). Testing the therapeutic potential of these vectors in vivo requires high-titer stocks. Stocks with a titer of 10(9) can be obtained after concentration and purification (Avalosse et al., J. Virol. Methods 1996;62:179-183), but this method requires large culture volumes and cannot easily be scaled up. We wanted to increase the production of recombinant virus at the initial transfection step. Poor vector titers could be due to inadequate genome amplification or to inefficient packaging. Here we show that intracellular amplification of MVM vector genomes is not the limiting factor for vector production. Several vector genomes of different size and/or structure were amplified to an equal extent. Their amplification was also equivalent to that of a cotransfected wild-type genome. We did not observe any interference between vector and wild-type genomes at the level of DNA amplification. Despite equivalent genome amplification, vector titers varied greatly between the different genomes, presumably owing to differences in packaging efficiency. Genomes with a size close to 100% that of wild type were packaged most efficiently with loss of efficiency at lower and higher sizes. However, certain genomes of identical size showed different packaging efficiencies, illustrating the importance of the DNA sequence, and probably its structure.
Park, Byeonghyeok; Baek, Min-Jeong; Min, Byoungnam; Choi, In-Geol
2017-09-01
Genome annotation is a primary step in genomic research. To establish a light and portable prokaryotic genome annotation pipeline for use in individual laboratories, we developed a Shiny app package designated as "P-CAPS" (Prokaryotic Contig Annotation Pipeline Server). The package is composed of R and Python scripts that integrate publicly available annotation programs into a server application. P-CAPS is not only a browser-based interactive application but also a distributable Shiny app package that can be installed on any personal computer. The final annotation is provided in various standard formats and is summarized in an R markdown document. Annotation can be visualized and examined with a public genome browser. A benchmark test showed that the annotation quality and completeness of P-CAPS were reliable and compatible with those of currently available public pipelines.
Towards elucidation of the mechanism of biological nanomotors
NASA Astrophysics Data System (ADS)
Zhao, Zhengyi
Biological functions such as cell mitosis, bacterial binary fission, DNA replication or repair, homologous recombination, Holliday junction resolution, viral genome packaging, and cell entry all involve biomotor-driven DNA translocation. In the past, the ubiquitous biological nanomotors were classified into two categories: linear and rotation motors. In 2013, we discovered a third type of biomotor, revolving motor without rotation. The revolving motion is further found to be widespread among many biological systems. In addition, the detailed sequential action mechanism of the ATPase ring in the phi29 dsDNA packaging motor has been elucidated: ATP binding induces a conformational entropy alternation of ATPase to a high affinity toward dsDNA; ATP hydrolysis triggers another conformational entropy change in ATPase to a low DNA affinity, by which the dsDNA substrate is pushed toward an adjacent ATPase subunit. The subunit communication is regulated by an arginine finger that extends from one ATPase subunit to the adjacent unit, resulting in an asymmetrical hexameric organization. Continuation of this process promotes the movement and revolving of the dsDNA within the hexameric ATPase ring. Coordination of all the motor components facilitate the motion direction control of the viral DNA packaging motors, and make it unusually powerful and effective. KEYWORDS: Phi29 dsDNA Packaging Motor, Bio-nanomotor, RNA Nanotechnology, DNA Translocase, One-Way Revolving, ASCE Superfamily, AAA+ Superfamily.
Gherghe, Cristina; Lombo, Tania; Leonard, Christopher W.; Datta, Siddhartha A. K.; Bess, Julian W.; Gorelick, Robert J.; Rein, Alan; Weeks, Kevin M.
2010-01-01
All retroviral genomic RNAs contain a cis-acting packaging signal by which dimeric genomes are selectively packaged into nascent virions. However, it is not understood how Gag (the viral structural protein) interacts with these signals to package the genome with high selectivity. We probed the structure of murine leukemia virus RNA inside virus particles using SHAPE, a high-throughput RNA structure analysis technology. These experiments showed that NC (the nucleic acid binding domain derived from Gag) binds within the virus to the sequence UCUG-UR-UCUG. Recombinant Gag and NC proteins bound to this same RNA sequence in dimeric RNA in vitro; in all cases, interactions were strongest with the first U and final G in each UCUG element. The RNA structural context is critical: High-affinity binding requires base-paired regions flanking this motif, and two UCUG-UR-UCUG motifs are specifically exposed in the viral RNA dimer. Mutating the guanosine residues in these two motifs—only four nucleotides per genomic RNA—reduced packaging 100-fold, comparable to the level of nonspecific packaging. These results thus explain the selective packaging of dimeric RNA. This paradigm has implications for RNA recognition in general, illustrating how local context and RNA structure can create information-rich recognition signals from simple single-stranded sequence elements in large RNAs. PMID:20974908
Structural constraints in the packaging of bluetongue virus genomic segments
Burkhardt, Christiane; Sung, Po-Yu; Celma, Cristina C.
2014-01-01
The mechanism used by bluetongue virus (BTV) to ensure the sorting and packaging of its 10 genomic segments is still poorly understood. In this study, we investigated the packaging constraints for two BTV genomic segments from two different serotypes. Segment 4 (S4) of BTV serotype 9 was mutated sequentially and packaging of mutant ssRNAs was investigated by two newly developed RNA packaging assay systems, one in vivo and the other in vitro. Modelling of the mutated ssRNA followed by biochemical data analysis suggested that a conformational motif formed by interaction of the 5′ and 3′ ends of the molecule was necessary and sufficient for packaging. A similar structural signal was also identified in S8 of BTV serotype 1. Furthermore, the same conformational analysis of secondary structures for positive-sense ssRNAs was used to generate a chimeric segment that maintained the putative packaging motif but contained unrelated internal sequences. This chimeric segment was packaged successfully, confirming that the motif identified directs the correct packaging of the segment. PMID:24980574
Efficient analysis of large-scale genome-wide data with two R packages: bigstatsr and bigsnpr.
Privé, Florian; Aschard, Hugues; Ziyatdinov, Andrey; Blum, Michael G B
2017-03-30
Genome-wide datasets produced for association studies have dramatically increased in size over the past few years, with modern datasets commonly including millions of variants measured in dozens of thousands of individuals. This increase in data size is a major challenge severely slowing down genomic analyses, leading to some software becoming obsolete and researchers having limited access to diverse analysis tools. Here we present two R packages, bigstatsr and bigsnpr, allowing for the analysis of large scale genomic data to be performed within R. To address large data size, the packages use memory-mapping for accessing data matrices stored on disk instead of in RAM. To perform data pre-processing and data analysis, the packages integrate most of the tools that are commonly used, either through transparent system calls to existing software, or through updated or improved implementation of existing methods. In particular, the packages implement fast and accurate computations of principal component analysis and association studies, functions to remove SNPs in linkage disequilibrium and algorithms to learn polygenic risk scores on millions of SNPs. We illustrate applications of the two R packages by analyzing a case-control genomic dataset for celiac disease, performing an association study and computing Polygenic Risk Scores. Finally, we demonstrate the scalability of the R packages by analyzing a simulated genome-wide dataset including 500,000 individuals and 1 million markers on a single desktop computer. https://privefl.github.io/bigstatsr/ & https://privefl.github.io/bigsnpr/. florian.prive@univ-grenoble-alpes.fr & michael.blum@univ-grenoble-alpes.fr. Supplementary materials are available at Bioinformatics online.
snpGeneSets: An R Package for Genome-Wide Study Annotation
Mei, Hao; Li, Lianna; Jiang, Fan; Simino, Jeannette; Griswold, Michael; Mosley, Thomas; Liu, Shijian
2016-01-01
Genome-wide studies (GWS) of SNP associations and differential gene expressions have generated abundant results; next-generation sequencing technology has further boosted the number of variants and genes identified. Effective interpretation requires massive annotation and downstream analysis of these genome-wide results, a computationally challenging task. We developed the snpGeneSets package to simplify annotation and analysis of GWS results. Our package integrates local copies of knowledge bases for SNPs, genes, and gene sets, and implements wrapper functions in the R language to enable transparent access to low-level databases for efficient annotation of large genomic data. The package contains functions that execute three types of annotations: (1) genomic mapping annotation for SNPs and genes and functional annotation for gene sets; (2) bidirectional mapping between SNPs and genes, and genes and gene sets; and (3) calculation of gene effect measures from SNP associations and performance of gene set enrichment analyses to identify functional pathways. We applied snpGeneSets to type 2 diabetes (T2D) results from the NHGRI genome-wide association study (GWAS) catalog, a Finnish GWAS, and a genome-wide expression study (GWES). These studies demonstrate the usefulness of snpGeneSets for annotating and performing enrichment analysis of GWS results. The package is open-source, free, and can be downloaded at: https://www.umc.edu/biostats_software/. PMID:27807048
Drouin, Lauren M.; Lins, Bridget; Janssen, Maria; Bennett, Antonette; Chipman, Paul; McKenna, Robert; Chen, Weijun; Muzyczka, Nicholas; Cardone, Giovanni
2016-01-01
ABSTRACT The adeno-associated viruses (AAV) are promising therapeutic gene delivery vectors and better understanding of their capsid assembly and genome packaging mechanism is needed for improved vector production. Empty AAV capsids assemble in the nucleus prior to genome packaging by virally encoded Rep proteins. To elucidate the capsid determinants of this process, structural differences between wild-type (wt) AAV2 and a packaging deficient variant, AAV2-R432A, were examined using cryo-electron microscopy and three-dimensional image reconstruction both at an ∼5.0-Å resolution (medium) and also at 3.8- and 3.7-Å resolutions (high), respectively. The high resolution structures showed that removal of the arginine side chain in AAV2-R432A eliminated hydrogen bonding interactions, resulting in altered intramolecular and intermolecular interactions propagated from under the 3-fold axis toward the 5-fold channel. Consistent with these observations, differential scanning calorimetry showed an ∼10°C decrease in thermal stability for AAV2-R432A compared to wt-AAV2. In addition, the medium resolution structures revealed differences in the juxtaposition of the less ordered, N-terminal region of their capsid proteins, VP1/2/3. A structural rearrangement in AAV2-R432A repositioned the βA strand region under the icosahedral 2-fold axis rather than antiparallel to the βB strand, eliminating many intramolecular interactions. Thus, a single amino acid substitution can significantly alter the AAV capsid integrity to the extent of reducing its stability and possibly rendering it unable to tolerate the stress of genome packaging. Furthermore, the data show that the 2-, 3-, and 5-fold regions of the capsid contributed to producing the packaging defect and highlight a tight connection between the entire capsid in maintaining packaging efficiency. IMPORTANCE The mechanism of AAV genome packaging is still poorly understood, particularly with respect to the capsid determinants of the required capsid-Rep interaction. Understanding this mechanism may aid in the improvement of AAV packaging efficiency, which is currently ∼1:10 (10%) genome packaged to empty capsid in vector preparations. This report identifies regions of the AAV capsid that play roles in genome packaging and that may be important for Rep recognition. It also demonstrates the need to maintain capsid stability for the success of this process. This information is important for efforts to improve AAV genome packaging and will also inform the engineering of AAV capsid variants for improved tropism, specific tissue targeting, and host antibody escape by defining amino acids that cannot be altered without detriment to infectious vector production. PMID:27440903
Single-Molecule FISH Reveals Non-selective Packaging of Rift Valley Fever Virus Genome Segments
Wichgers Schreur, Paul J.; Kortekaas, Jeroen
2016-01-01
The bunyavirus genome comprises a small (S), medium (M), and large (L) RNA segment of negative polarity. Although genome segmentation confers evolutionary advantages by enabling genome reassortment events with related viruses, genome segmentation also complicates genome replication and packaging. Accumulating evidence suggests that genomes of viruses with eight or more genome segments are incorporated into virions by highly selective processes. Remarkably, little is known about the genome packaging process of the tri-segmented bunyaviruses. Here, we evaluated, by single-molecule RNA fluorescence in situ hybridization (FISH), the intracellular spatio-temporal distribution and replication kinetics of the Rift Valley fever virus (RVFV) genome and determined the segment composition of mature virions. The results reveal that the RVFV genome segments start to replicate near the site of infection before spreading and replicating throughout the cytoplasm followed by translocation to the virion assembly site at the Golgi network. Despite the average intracellular S, M and L genome segments approached a 1:1:1 ratio, major differences in genome segment ratios were observed among cells. We also observed a significant amount of cells lacking evidence of M-segment replication. Analysis of two-segmented replicons and four-segmented viruses subsequently confirmed the previous notion that Golgi recruitment is mediated by the Gn glycoprotein. The absence of colocalization of the different segments in the cytoplasm and the successful rescue of a tri-segmented variant with a codon shuffled M-segment suggested that inter-segment interactions are unlikely to drive the copackaging of the different segments into a single virion. The latter was confirmed by direct visualization of RNPs inside mature virions which showed that the majority of virions lack one or more genome segments. Altogether, this study suggests that RVFV genome packaging is a non-selective process. PMID:27548280
MAGNAMWAR: an R package for genome-wide association studies of bacterial orthologs.
Sexton, Corinne E; Smith, Hayden Z; Newell, Peter D; Douglas, Angela E; Chaston, John M
2018-06-01
Here we report on an R package for genome-wide association studies of orthologous genes in bacteria. Before using the software, orthologs from bacterial genomes or metagenomes are defined using local or online implementations of OrthoMCL. These presence-absence patterns are statistically associated with variation in user-collected phenotypes using the Mono-Associated GNotobiotic Animals Metagenome-Wide Association R package (MAGNAMWAR). Genotype-phenotype associations can be performed with several different statistical tests based on the type and distribution of the data. MAGNAMWAR is available on CRAN. john_chaston@byu.edu.
Ho, Michelle L; Adler, Benjamin A; Torre, Michael L; Silberg, Jonathan J; Suh, Junghae
2013-12-20
Adeno-associated virus (AAV) recombination can result in chimeric capsid protein subunits whose ability to assemble into an oligomeric capsid, package a genome, and transduce cells depends on the inheritance of sequence from different AAV parents. To develop quantitative design principles for guiding site-directed recombination of AAV capsids, we have examined how capsid structural perturbations predicted by the SCHEMA algorithm correlate with experimental measurements of disruption in seventeen chimeric capsid proteins. In our small chimera population, created by recombining AAV serotypes 2 and 4, we found that protection of viral genomes and cellular transduction were inversely related to calculated disruption of the capsid structure. Interestingly, however, we did not observe a correlation between genome packaging and calculated structural disruption; a majority of the chimeric capsid proteins formed at least partially assembled capsids and more than half packaged genomes, including those with the highest SCHEMA disruption. These results suggest that the sequence space accessed by recombination of divergent AAV serotypes is rich in capsid chimeras that assemble into 60-mer capsids and package viral genomes. Overall, the SCHEMA algorithm may be useful for delineating quantitative design principles to guide the creation of libraries enriched in genome-protecting virus nanoparticles that can effectively transduce cells. Such improvements to the virus design process may help advance not only gene therapy applications but also other bionanotechnologies dependent upon the development of viruses with new sequences and functions.
Ho, Michelle L.; Adler, Benjamin A.; Torre, Michael L.; Silberg, Jonathan J.; Suh, Junghae
2013-01-01
Adeno-associated virus (AAV) recombination can result in chimeric capsid protein subunits whose ability to assemble into an oligomeric capsid, package a genome, and transduce cells depends on the inheritance of sequence from different AAV parents. To develop quantitative design principles for guiding site-directed recombination of AAV capsids, we have examined how capsid structural perturbations predicted by the SCHEMA algorithm correlate with experimental measurements of disruption in seventeen chimeric capsid proteins. In our small chimera population, created by recombining AAV serotypes 2 and 4, we found that protection of viral genomes and cellular transduction were inversely related to calculated disruption of the capsid structure. Interestingly, however, we did not observe a correlation between genome packaging and calculated structural disruption; a majority of the chimeric capsid proteins formed at least partially assembled capsids and more than half packaged genomes, including those with the highest SCHEMA disruption. These results suggest that the sequence space accessed by recombination of divergent AAV serotypes is rich in capsid chimeras that assemble into 60-mer capsids and package viral genomes. Overall, the SCHEMA algorithm may be useful for delineating quantitative design principles to guide the creation of libraries enriched in genome-protecting virus nanoparticles that can effectively transduce cells. Such improvements to the virus design process may help advance not only gene therapy applications, but also other bionanotechnologies dependent upon the development of viruses with new sequences and functions. PMID:23899192
Interactions between HIV-1 Gag and Viral RNA Genome Enhance Virion Assembly.
Dilley, Kari A; Nikolaitchik, Olga A; Galli, Andrea; Burdick, Ryan C; Levine, Louis; Li, Kelvin; Rein, Alan; Pathak, Vinay K; Hu, Wei-Shau
2017-08-15
Most HIV-1 virions contain two copies of full-length viral RNA, indicating that genome packaging is efficient and tightly regulated. However, the structural protein Gag is the only component required for the assembly of noninfectious viruslike particles, and the viral RNA is dispensable in this process. The mechanism that allows HIV-1 to achieve such high efficiency of genome packaging when a packageable viral RNA is not required for virus assembly is currently unknown. In this report, we examined the role of HIV-1 RNA in virus assembly and found that packageable HIV-1 RNA enhances particle production when Gag is expressed at levels similar to those in cells containing one provirus. However, such enhancement is diminished when Gag is overexpressed, suggesting that the effects of viral RNA can be replaced by increased Gag concentration in cells. We also showed that the specific interactions between Gag and viral RNA are required for the enhancement of particle production. Taken together, these studies are consistent with our previous hypothesis that specific dimeric viral RNA-Gag interactions are the nucleation event of infectious virion assembly, ensuring that one RNA dimer is packaged into each nascent virion. These studies shed light on the mechanism by which HIV-1 achieves efficient genome packaging during virus assembly. IMPORTANCE Retrovirus assembly is a well-choreographed event, during which many viral and cellular components come together to generate infectious virions. The viral RNA genome carries the genetic information to new host cells, providing instructions to generate new virions, and therefore is essential for virion infectivity. In this report, we show that the specific interaction of the viral RNA genome with the structural protein Gag facilitates virion assembly and particle production. These findings resolve the conundrum that HIV-1 RNA is selectively packaged into virions with high efficiency despite being dispensable for virion assembly. Understanding the mechanism used by HIV-1 to ensure genome packaging provides significant insights into viral assembly and replication. Copyright © 2017 American Society for Microbiology.
Drouin, Lauren M; Lins, Bridget; Janssen, Maria; Bennett, Antonette; Chipman, Paul; McKenna, Robert; Chen, Weijun; Muzyczka, Nicholas; Cardone, Giovanni; Baker, Timothy S; Agbandje-McKenna, Mavis
2016-10-01
The adeno-associated viruses (AAV) are promising therapeutic gene delivery vectors and better understanding of their capsid assembly and genome packaging mechanism is needed for improved vector production. Empty AAV capsids assemble in the nucleus prior to genome packaging by virally encoded Rep proteins. To elucidate the capsid determinants of this process, structural differences between wild-type (wt) AAV2 and a packaging deficient variant, AAV2-R432A, were examined using cryo-electron microscopy and three-dimensional image reconstruction both at an ∼5.0-Å resolution (medium) and also at 3.8- and 3.7-Å resolutions (high), respectively. The high resolution structures showed that removal of the arginine side chain in AAV2-R432A eliminated hydrogen bonding interactions, resulting in altered intramolecular and intermolecular interactions propagated from under the 3-fold axis toward the 5-fold channel. Consistent with these observations, differential scanning calorimetry showed an ∼10°C decrease in thermal stability for AAV2-R432A compared to wt-AAV2. In addition, the medium resolution structures revealed differences in the juxtaposition of the less ordered, N-terminal region of their capsid proteins, VP1/2/3. A structural rearrangement in AAV2-R432A repositioned the βA strand region under the icosahedral 2-fold axis rather than antiparallel to the βB strand, eliminating many intramolecular interactions. Thus, a single amino acid substitution can significantly alter the AAV capsid integrity to the extent of reducing its stability and possibly rendering it unable to tolerate the stress of genome packaging. Furthermore, the data show that the 2-, 3-, and 5-fold regions of the capsid contributed to producing the packaging defect and highlight a tight connection between the entire capsid in maintaining packaging efficiency. The mechanism of AAV genome packaging is still poorly understood, particularly with respect to the capsid determinants of the required capsid-Rep interaction. Understanding this mechanism may aid in the improvement of AAV packaging efficiency, which is currently ∼1:10 (10%) genome packaged to empty capsid in vector preparations. This report identifies regions of the AAV capsid that play roles in genome packaging and that may be important for Rep recognition. It also demonstrates the need to maintain capsid stability for the success of this process. This information is important for efforts to improve AAV genome packaging and will also inform the engineering of AAV capsid variants for improved tropism, specific tissue targeting, and host antibody escape by defining amino acids that cannot be altered without detriment to infectious vector production. Copyright © 2016, American Society for Microbiology. All Rights Reserved.
An RNA Domain Imparts Specificity and Selectivity to a Viral DNA Packaging Motor
Zhao, Wei; Jardine, Paul J.
2015-01-01
ABSTRACT During assembly, double-stranded DNA viruses, including bacteriophages and herpesviruses, utilize a powerful molecular motor to package their genomic DNA into a preformed viral capsid. An integral component of the packaging motor in the Bacillus subtilis bacteriophage ϕ29 is a viral genome-encoded pentameric ring of RNA (prohead RNA [pRNA]). pRNA is a 174-base transcript comprised of two domains, domains I and II. Early studies initially isolated a 120-base form (domain I only) that retains high biological activity in vitro; hence, no function could be assigned to domain II. Here we define a role for this domain in the packaging process. DNA packaging using restriction digests of ϕ29 DNA showed that motors with the 174-base pRNA supported the correct polarity of DNA packaging, selectively packaging the DNA left end. In contrast, motors containing the 120-base pRNA had compromised specificity, packaging both left- and right-end fragments. The presence of domain II also provides selectivity in competition assays with genomes from related phages. Furthermore, motors with the 174-base pRNA were restrictive, in that they packaged only one DNA fragment into the head, whereas motors with the 120-base pRNA packaged several fragments into the head, indicating multiple initiation events. These results show that domain II imparts specificity and stringency to the motor during the packaging initiation events that precede DNA translocation. Heteromeric rings of pRNA demonstrated that one or two copies of domain II were sufficient to impart this selectivity/stringency. Although ϕ29 differs from other double-stranded DNA phages in having an RNA motor component, the function provided by pRNA is carried on the motor protein components in other phages. IMPORTANCE During virus assembly, genome packaging involves the delivery of newly synthesized viral nucleic acid into a protein shell. In the double-stranded DNA phages and herpesviruses, this is accomplished by a powerful molecular motor that translocates the viral DNA into a preformed viral shell. A key event in DNA packaging is recognition of the viral DNA among other nucleic acids in the host cell. Commonly, a DNA-binding protein mediates the interaction of viral DNA with the motor/head shell. Here we show that for the bacteriophage ϕ29, this essential step of genome recognition is mediated by a viral genome-encoded RNA rather than a protein. A domain of the prohead RNA (pRNA) imparts specificity and stringency to the motor by ensuring the correct orientation of DNA packaging and restricting initiation to a single event. Since this assembly step is unique to the virus, DNA packaging is a novel target for the development of antiviral drugs. PMID:26423956
An RNA Domain Imparts Specificity and Selectivity to a Viral DNA Packaging Motor.
Zhao, Wei; Jardine, Paul J; Grimes, Shelley
2015-12-01
During assembly, double-stranded DNA viruses, including bacteriophages and herpesviruses, utilize a powerful molecular motor to package their genomic DNA into a preformed viral capsid. An integral component of the packaging motor in the Bacillus subtilis bacteriophage ϕ29 is a viral genome-encoded pentameric ring of RNA (prohead RNA [pRNA]). pRNA is a 174-base transcript comprised of two domains, domains I and II. Early studies initially isolated a 120-base form (domain I only) that retains high biological activity in vitro; hence, no function could be assigned to domain II. Here we define a role for this domain in the packaging process. DNA packaging using restriction digests of ϕ29 DNA showed that motors with the 174-base pRNA supported the correct polarity of DNA packaging, selectively packaging the DNA left end. In contrast, motors containing the 120-base pRNA had compromised specificity, packaging both left- and right-end fragments. The presence of domain II also provides selectivity in competition assays with genomes from related phages. Furthermore, motors with the 174-base pRNA were restrictive, in that they packaged only one DNA fragment into the head, whereas motors with the 120-base pRNA packaged several fragments into the head, indicating multiple initiation events. These results show that domain II imparts specificity and stringency to the motor during the packaging initiation events that precede DNA translocation. Heteromeric rings of pRNA demonstrated that one or two copies of domain II were sufficient to impart this selectivity/stringency. Although ϕ29 differs from other double-stranded DNA phages in having an RNA motor component, the function provided by pRNA is carried on the motor protein components in other phages. During virus assembly, genome packaging involves the delivery of newly synthesized viral nucleic acid into a protein shell. In the double-stranded DNA phages and herpesviruses, this is accomplished by a powerful molecular motor that translocates the viral DNA into a preformed viral shell. A key event in DNA packaging is recognition of the viral DNA among other nucleic acids in the host cell. Commonly, a DNA-binding protein mediates the interaction of viral DNA with the motor/head shell. Here we show that for the bacteriophage ϕ29, this essential step of genome recognition is mediated by a viral genome-encoded RNA rather than a protein. A domain of the prohead RNA (pRNA) imparts specificity and stringency to the motor by ensuring the correct orientation of DNA packaging and restricting initiation to a single event. Since this assembly step is unique to the virus, DNA packaging is a novel target for the development of antiviral drugs. Copyright © 2015, American Society for Microbiology. All Rights Reserved.
Stewart, H.; Bingham, R.J.; White, S. J.; Dykeman, E. C.; Zothner, C.; Tuplin, A. K.; Stockley, P. G.; Twarock, R.; Harris, M.
2016-01-01
The specific packaging of the hepatitis C virus (HCV) genome is hypothesised to be driven by Core-RNA interactions. To identify the regions of the viral genome involved in this process, we used SELEX (systematic evolution of ligands by exponential enrichment) to identify RNA aptamers which bind specifically to Core in vitro. Comparison of these aptamers to multiple HCV genomes revealed the presence of a conserved terminal loop motif within short RNA stem-loop structures. We postulated that interactions of these motifs, as well as sub-motifs which were present in HCV genomes at statistically significant levels, with the Core protein may drive virion assembly. We mutated 8 of these predicted motifs within the HCV infectious molecular clone JFH-1, thereby producing a range of mutant viruses predicted to possess altered RNA secondary structures. RNA replication and viral titre were unaltered in viruses possessing only one mutated structure. However, infectivity titres were decreased in viruses possessing a higher number of mutated regions. This work thus identified multiple novel RNA motifs which appear to contribute to genome packaging. We suggest that these structures act as cooperative packaging signals to drive specific RNA encapsidation during HCV assembly. PMID:26972799
Tomo, Naoki; Goto, Toshiyuki; Morikawa, Yuko
2013-03-26
Yeast is recognized as a generally safe microorganism and is utilized for the production of pharmaceutical products, including vaccines. We previously showed that expression of human immunodeficiency virus type 1 (HIV-1) Gag protein in Saccharomyces cerevisiae spheroplasts released Gag virus-like particles (VLPs) extracellularly, suggesting that the production system could be used in vaccine development. In this study, we further establish HIV-1 genome packaging into Gag VLPs in a yeast cell system. The nearly full-length HIV-1 genome containing the entire 5' long terminal repeat, U3-R-U5, did not transcribe gag mRNA in yeast. Co-expression of HIV-1 Tat, a transcription activator, did not support the transcription. When the HIV-1 promoter U3 was replaced with the promoter for the yeast glyceraldehyde-3-phosphate dehydrogenase gene, gag mRNA transcription was restored, but no Gag protein expression was observed. Co-expression of HIV-1 Rev, a factor that facilitates nuclear export of gag mRNA, did not support the protein synthesis. Progressive deletions of R-U5 and its downstream stem-loop-rich region (SL) to the gag start ATG codon restored Gag protein expression, suggesting that a highly structured noncoding RNA generated from the R-U5-SL region had an inhibitory effect on gag mRNA translation. When a plasmid containing the HIV-1 genome with the R-U5-SL region was coexpressed with an expression plasmid for Gag protein, the HIV-1 genomic RNA was transcribed and incorporated into Gag VLPs formed by Gag protein assembly, indicative of the trans-packaging of HIV-1 genomic RNA into Gag VLPs in a yeast cell system. The concentration of HIV-1 genomic RNA in Gag VLPs released from yeast was approximately 500-fold higher than that in yeast cytoplasm. The deletion of R-U5 to the gag gene resulted in the failure of HIV-1 RNA packaging into Gag VLPs, indicating that the packaging signal of HIV-1 genomic RNA present in the R-U5 to gag region functions similarly in yeast cells. Our data indicate that selective trans-packaging of HIV-1 genomic RNA into Gag VLPs occurs in a yeast cell system, analogous to a mammalian cell system, suggesting that yeast may provide an alternative packaging system for lentiviral RNA.
Packaging of HCV-RNA into lentiviral vector
DOE Office of Scientific and Technical Information (OSTI.GOV)
Caval, Vincent; Piver, Eric; Service de Biochimie et Biologie Moleculaire, CHRU de Tours
2011-11-04
Highlights: Black-Right-Pointing-Pointer Description of HCV-RNA Core-D1 interactions. Black-Right-Pointing-Pointer In vivo evaluation of the packaging of HCV genome. Black-Right-Pointing-Pointer Determination of the role of the three basic sub-domains of D1. Black-Right-Pointing-Pointer Heterologous system involving HIV-1 vector particles to mobilise HCV genome. Black-Right-Pointing-Pointer Full length mobilisation of HCV genome and HCV-receptor-independent entry. -- Abstract: The advent of infectious molecular clones of Hepatitis C virus (HCV) has unlocked the understanding of HCV life cycle. However, packaging of the genomic RNA, which is crucial to generate infectious viral particles, remains poorly understood. Molecular interactions of the domain 1 (D1) of HCV Core protein andmore » HCV RNA have been described in vitro. Since compaction of genetic information within HCV genome has hampered conventional mutational approach to study packaging in vivo, we developed a novel heterologous system to evaluate the interactions between HCV RNA and Core D1. For this, we took advantage of the recruitment of Vpr fusion-proteins into HIV-1 particles. By fusing HCV Core D1 to Vpr we were able to package and transfer a HCV subgenomic replicon into a HIV-1 based lentiviral vector. We next examined how deletion mutants of basic sub-domains of Core D1 influenced HCV RNA recruitment. The results emphasized the crucial role of the first and third basic regions of D1 in packaging. Interestingly, the system described here allowed us to mobilise full-length JFH1 genome in CD81 defective cells, which are normally refractory to HCV infection. This finding paves the way to an evaluation of the replication capability of HCV in various cell types.« less
TCGA Workflow: Analyze cancer genomics and epigenomics data using Bioconductor packages
Bontempi, Gianluca; Ceccarelli, Michele; Noushmehr, Houtan
2016-01-01
Biotechnological advances in sequencing have led to an explosion of publicly available data via large international consortia such as The Cancer Genome Atlas (TCGA), The Encyclopedia of DNA Elements (ENCODE), and The NIH Roadmap Epigenomics Mapping Consortium (Roadmap). These projects have provided unprecedented opportunities to interrogate the epigenome of cultured cancer cell lines as well as normal and tumor tissues with high genomic resolution. The Bioconductor project offers more than 1,000 open-source software and statistical packages to analyze high-throughput genomic data. However, most packages are designed for specific data types (e.g. expression, epigenetics, genomics) and there is no one comprehensive tool that provides a complete integrative analysis of the resources and data provided by all three public projects. A need to create an integration of these different analyses was recently proposed. In this workflow, we provide a series of biologically focused integrative analyses of different molecular data. We describe how to download, process and prepare TCGA data and by harnessing several key Bioconductor packages, we describe how to extract biologically meaningful genomic and epigenomic data. Using Roadmap and ENCODE data, we provide a work plan to identify biologically relevant functional epigenomic elements associated with cancer. To illustrate our workflow, we analyzed two types of brain tumors: low-grade glioma (LGG) versus high-grade glioma (glioblastoma multiform or GBM). This workflow introduces the following Bioconductor packages: AnnotationHub, ChIPSeeker, ComplexHeatmap, pathview, ELMER, GAIA, MINET, RTCGAToolbox, TCGAbiolinks. PMID:28232861
TCGA Workflow: Analyze cancer genomics and epigenomics data using Bioconductor packages.
Silva, Tiago C; Colaprico, Antonio; Olsen, Catharina; D'Angelo, Fulvio; Bontempi, Gianluca; Ceccarelli, Michele; Noushmehr, Houtan
2016-01-01
Biotechnological advances in sequencing have led to an explosion of publicly available data via large international consortia such as The Cancer Genome Atlas (TCGA), The Encyclopedia of DNA Elements (ENCODE), and The NIH Roadmap Epigenomics Mapping Consortium (Roadmap). These projects have provided unprecedented opportunities to interrogate the epigenome of cultured cancer cell lines as well as normal and tumor tissues with high genomic resolution. The Bioconductor project offers more than 1,000 open-source software and statistical packages to analyze high-throughput genomic data. However, most packages are designed for specific data types (e.g. expression, epigenetics, genomics) and there is no one comprehensive tool that provides a complete integrative analysis of the resources and data provided by all three public projects. A need to create an integration of these different analyses was recently proposed. In this workflow, we provide a series of biologically focused integrative analyses of different molecular data. We describe how to download, process and prepare TCGA data and by harnessing several key Bioconductor packages, we describe how to extract biologically meaningful genomic and epigenomic data. Using Roadmap and ENCODE data, we provide a work plan to identify biologically relevant functional epigenomic elements associated with cancer. To illustrate our workflow, we analyzed two types of brain tumors: low-grade glioma (LGG) versus high-grade glioma (glioblastoma multiform or GBM). This workflow introduces the following Bioconductor packages: AnnotationHub, ChIPSeeker, ComplexHeatmap, pathview, ELMER, GAIA, MINET, RTCGAToolbox, TCGAbiolinks.
WhopGenome: high-speed access to whole-genome variation and sequence data in R.
Wittelsbürger, Ulrich; Pfeifer, Bastian; Lercher, Martin J
2015-02-01
The statistical programming language R has become a de facto standard for the analysis of many types of biological data, and is well suited for the rapid development of new algorithms. However, variant call data from population-scale resequencing projects are typically too large to be read and processed efficiently with R's built-in I/O capabilities. WhopGenome can efficiently read whole-genome variation data stored in the widely used variant call format (VCF) file format into several R data types. VCF files can be accessed either on local hard drives or on remote servers. WhopGenome can associate variants with annotations such as those available from the UCSC genome browser, and can accelerate the reading process by filtering loci according to user-defined criteria. WhopGenome can also read other Tabix-indexed files and create indices to allow fast selective access to FASTA-formatted sequence files. The WhopGenome R package is available on CRAN at http://cran.r-project.org/web/packages/WhopGenome/. A Bioconductor package has been submitted. lercher@cs.uni-duesseldorf.de. © The Author 2014. Published by Oxford University Press. All rights reserved. For Permissions, please e-mail: journals.permissions@oup.com.
Song, Yutong; Gorbatsevych, Oleksandr; Liu, Ying; Mugavero, JoAnn; Shen, Sam H; Ward, Charles B; Asare, Emmanuel; Jiang, Ping; Paul, Aniko V; Mueller, Steffen; Wimmer, Eckard
2017-10-10
Computer design and chemical synthesis generated viable variants of poliovirus type 1 (PV1), whose ORF (6,189 nucleotides) carried up to 1,297 "Max" mutations (excess of overrepresented synonymous codon pairs) or up to 2,104 "SD" mutations (randomly scrambled synonymous codons). "Min" variants (excess of underrepresented synonymous codon pairs) are nonviable except for P2 Min , a variant temperature-sensitive at 33 and 39.5 °C. Compared with WT PV1, P2 Min displayed a vastly reduced specific infectivity (si) (WT, 1 PFU/118 particles vs. P2 Min , 1 PFU/35,000 particles), a phenotype that will be discussed broadly. Si of haploid PV presents cellular infectivity of a single genotype. We performed a comprehensive analysis of sequence and structures of the PV genome to determine if evolutionary conserved cis-acting packaging signal(s) were preserved after recoding. We showed that conserved synonymous sites and/or local secondary structures that might play a role in determining packaging specificity do not survive codon pair recoding. This makes it unlikely that numerous "cryptic, sequence-degenerate, dispersed RNA packaging signals mapping along the entire viral genome" [Patel N, et al. (2017) Nat Microbiol 2:17098] play the critical role in poliovirus packaging specificity. Considering all available evidence, we propose a two-step assembly strategy for +ssRNA viruses: step I, acquisition of packaging specificity, either ( a ) by specific recognition between capsid protein(s) and replication proteins (poliovirus), or ( b ) by the high affinity interaction of a single RNA packaging signal (PS) with capsid protein(s) (most +ssRNA viruses so far studied); step II, cocondensation of genome/capsid precursors in which an array of hairpin structures plays a role in virion formation.
GUIDEseq: a bioconductor package to analyze GUIDE-Seq datasets for CRISPR-Cas nucleases.
Zhu, Lihua Julie; Lawrence, Michael; Gupta, Ankit; Pagès, Hervé; Kucukural, Alper; Garber, Manuel; Wolfe, Scot A
2017-05-15
Genome editing technologies developed around the CRISPR-Cas9 nuclease system have facilitated the investigation of a broad range of biological questions. These nucleases also hold tremendous promise for treating a variety of genetic disorders. In the context of their therapeutic application, it is important to identify the spectrum of genomic sequences that are cleaved by a candidate nuclease when programmed with a particular guide RNA, as well as the cleavage efficiency of these sites. Powerful new experimental approaches, such as GUIDE-seq, facilitate the sensitive, unbiased genome-wide detection of nuclease cleavage sites within the genome. Flexible bioinformatics analysis tools for processing GUIDE-seq data are needed. Here, we describe an open source, open development software suite, GUIDEseq, for GUIDE-seq data analysis and annotation as a Bioconductor package in R. The GUIDEseq package provides a flexible platform with more than 60 adjustable parameters for the analysis of datasets associated with custom nuclease applications. These parameters allow data analysis to be tailored to different nuclease platforms with different length and complexity in their guide and PAM recognition sequences or their DNA cleavage position. They also enable users to customize sequence aggregation criteria, and vary peak calling thresholds that can influence the number of potential off-target sites recovered. GUIDEseq also annotates potential off-target sites that overlap with genes based on genome annotation information, as these may be the most important off-target sites for further characterization. In addition, GUIDEseq enables the comparison and visualization of off-target site overlap between different datasets for a rapid comparison of different nuclease configurations or experimental conditions. For each identified off-target, the GUIDEseq package outputs mapped GUIDE-Seq read count as well as cleavage score from a user specified off-target cleavage score prediction algorithm permitting the identification of genomic sequences with unexpected cleavage activity. The GUIDEseq package enables analysis of GUIDE-data from various nuclease platforms for any species with a defined genomic sequence. This software package has been used successfully to analyze several GUIDE-seq datasets. The software, source code and documentation are freely available at http://www.bioconductor.org/packages/release/bioc/html/GUIDEseq.html .
MC-GenomeKey: a multicloud system for the detection and annotation of genomic variants.
Elshazly, Hatem; Souilmi, Yassine; Tonellato, Peter J; Wall, Dennis P; Abouelhoda, Mohamed
2017-01-20
Next Generation Genome sequencing techniques became affordable for massive sequencing efforts devoted to clinical characterization of human diseases. However, the cost of providing cloud-based data analysis of the mounting datasets remains a concerning bottleneck for providing cost-effective clinical services. To address this computational problem, it is important to optimize the variant analysis workflow and the used analysis tools to reduce the overall computational processing time, and concomitantly reduce the processing cost. Furthermore, it is important to capitalize on the use of the recent development in the cloud computing market, which have witnessed more providers competing in terms of products and prices. In this paper, we present a new package called MC-GenomeKey (Multi-Cloud GenomeKey) that efficiently executes the variant analysis workflow for detecting and annotating mutations using cloud resources from different commercial cloud providers. Our package supports Amazon, Google, and Azure clouds, as well as, any other cloud platform based on OpenStack. Our package allows different scenarios of execution with different levels of sophistication, up to the one where a workflow can be executed using a cluster whose nodes come from different clouds. MC-GenomeKey also supports scenarios to exploit the spot instance model of Amazon in combination with the use of other cloud platforms to provide significant cost reduction. To the best of our knowledge, this is the first solution that optimizes the execution of the workflow using computational resources from different cloud providers. MC-GenomeKey provides an efficient multicloud based solution to detect and annotate mutations. The package can run in different commercial cloud platforms, which enables the user to seize the best offers. The package also provides a reliable means to make use of the low-cost spot instance model of Amazon, as it provides an efficient solution to the sudden termination of spot machines as a result of a sudden price increase. The package has a web-interface and it is available for free for academic use.
Distinct DNA exit and packaging portals in the virus Acanthamoeba polyphaga mimivirus.
Zauberman, Nathan; Mutsafi, Yael; Halevy, Daniel Ben; Shimoni, Eyal; Klein, Eugenia; Xiao, Chuan; Sun, Siyang; Minsky, Abraham
2008-05-13
Icosahedral double-stranded DNA viruses use a single portal for genome delivery and packaging. The extensive structural similarity revealed by such portals in diverse viruses, as well as their invariable positioning at a unique icosahedral vertex, led to the consensus that a particular, highly conserved vertex-portal architecture is essential for viral DNA translocations. Here we present an exception to this paradigm by demonstrating that genome delivery and packaging in the virus Acanthamoeba polyphaga mimivirus occur through two distinct portals. By using high-resolution techniques, including electron tomography and cryo-scanning electron microscopy, we show that Mimivirus genome delivery entails a large-scale conformational change of the capsid, whereby five icosahedral faces open up. This opening, which occurs at a unique vertex of the capsid that we coined the "stargate", allows for the formation of a massive membrane conduit through which the viral DNA is released. A transient aperture centered at an icosahedral face distal to the DNA delivery site acts as a non-vertex DNA packaging portal. In conjunction with comparative genomic studies, our observations imply a viral packaging pathway akin to bacterial DNA segregation, which might be shared by diverse internal membrane-containing viruses.
Distinct DNA Exit and Packaging Portals in the Virus Acanthamoeba polyphaga mimivirus
Zauberman, Nathan; Mutsafi, Yael; Halevy, Daniel Ben; Shimoni, Eyal; Klein, Eugenia; Xiao, Chuan; Sun, Siyang; Minsky, Abraham
2008-01-01
Icosahedral double-stranded DNA viruses use a single portal for genome delivery and packaging. The extensive structural similarity revealed by such portals in diverse viruses, as well as their invariable positioning at a unique icosahedral vertex, led to the consensus that a particular, highly conserved vertex-portal architecture is essential for viral DNA translocations. Here we present an exception to this paradigm by demonstrating that genome delivery and packaging in the virus Acanthamoeba polyphaga mimivirus occur through two distinct portals. By using high-resolution techniques, including electron tomography and cryo-scanning electron microscopy, we show that Mimivirus genome delivery entails a large-scale conformational change of the capsid, whereby five icosahedral faces open up. This opening, which occurs at a unique vertex of the capsid that we coined the “stargate”, allows for the formation of a massive membrane conduit through which the viral DNA is released. A transient aperture centered at an icosahedral face distal to the DNA delivery site acts as a non-vertex DNA packaging portal. In conjunction with comparative genomic studies, our observations imply a viral packaging pathway akin to bacterial DNA segregation, which might be shared by diverse internal membrane–containing viruses. PMID:18479185
Experimental Approaches to Study Genome Packaging of Influenza A Viruses.
Isel, Catherine; Munier, Sandie; Naffakh, Nadia
2016-08-09
The genome of influenza A viruses (IAV) consists of eight single-stranded negative sense viral RNAs (vRNAs) encapsidated into viral ribonucleoproteins (vRNPs). It is now well established that genome packaging (i.e., the incorporation of a set of eight distinct vRNPs into budding viral particles), follows a specific pathway guided by segment-specific cis-acting packaging signals on each vRNA. However, the precise nature and function of the packaging signals, and the mechanisms underlying the assembly of vRNPs into sub-bundles in the cytoplasm and their selective packaging at the viral budding site, remain largely unknown. Here, we review the diverse and complementary methods currently being used to elucidate these aspects of the viral cycle. They range from conventional and competitive reverse genetics, single molecule imaging of vRNPs by fluorescence in situ hybridization (FISH) and high-resolution electron microscopy and tomography of budding viral particles, to solely in vitro approaches to investigate vRNA-vRNA interactions at the molecular level.
RNA Encapsidation and Packaging in the Phleboviruses
Hornak, Katherine E.; Lanchy, Jean-Marc; Lodmell, J. Stephen
2016-01-01
The Bunyaviridae represents the largest family of segmented RNA viruses, which infect a staggering diversity of plants, animals, and insects. Within the family Bunyaviridae, the Phlebovirus genus includes several important human and animal pathogens, including Rift Valley fever virus (RVFV), severe fever with thrombocytopenia syndrome virus (SFTSV), Uukuniemi virus (UUKV), and the sandfly fever viruses. The phleboviruses have small tripartite RNA genomes that encode a repertoire of 5–7 proteins. These few proteins accomplish the daunting task of recognizing and specifically packaging a tri-segment complement of viral genomic RNA in the midst of an abundance of host components. The critical nucleation events that eventually lead to virion production begin early on in the host cytoplasm as the first strands of nascent viral RNA (vRNA) are synthesized. The interaction between the vRNA and the viral nucleocapsid (N) protein effectively protects and masks the RNA from the host, and also forms the ribonucleoprotein (RNP) architecture that mediates downstream interactions and drives virion formation. Although the mechanism by which all three genomic counterparts are selectively co-packaged is not completely understood, we are beginning to understand the hierarchy of interactions that begins with N-RNA packaging and culminates in RNP packaging into new virus particles. In this review we focus on recent progress that highlights the molecular basis of RNA genome packaging in the phleboviruses. PMID:27428993
Dykeman, Eric C; Stockley, Peter G; Twarock, Reidun
2013-09-09
The current paradigm for assembly of single-stranded RNA viruses is based on a mechanism involving non-sequence-specific packaging of genomic RNA driven by electrostatic interactions. Recent experiments, however, provide compelling evidence for sequence specificity in this process both in vitro and in vivo. The existence of multiple RNA packaging signals (PSs) within viral genomes has been proposed, which facilitates assembly by binding coat proteins in such a way that they promote the protein-protein contacts needed to build the capsid. The binding energy from these interactions enables the confinement or compaction of the genomic RNAs. Identifying the nature of such PSs is crucial for a full understanding of assembly, which is an as yet untapped potential drug target for this important class of pathogens. Here, for two related bacterial viruses, we determine the sequences and locations of their PSs using Hamiltonian paths, a concept from graph theory, in combination with bioinformatics and structural studies. Their PSs have a common secondary structure motif but distinct consensus sequences and positions within the respective genomes. Despite these differences, the distributions of PSs in both viruses imply defined conformations for the packaged RNA genomes in contact with the protein shell in the capsid, consistent with a recent asymmetric structure determination of the MS2 virion. The PS distributions identified moreover imply a preferred, evolutionarily conserved assembly pathway with respect to the RNA sequence with potentially profound implications for other single-stranded RNA viruses known to have RNA PSs, including many animal and human pathogens. Copyright © 2013 Elsevier Ltd. All rights reserved.
Olova, Nelly; Krueger, Felix; Andrews, Simon; Oxley, David; Berrens, Rebecca V; Branco, Miguel R; Reik, Wolf
2018-03-15
Whole-genome bisulfite sequencing (WGBS) is becoming an increasingly accessible technique, used widely for both fundamental and disease-oriented research. Library preparation methods benefit from a variety of available kits, polymerases and bisulfite conversion protocols. Although some steps in the procedure, such as PCR amplification, are known to introduce biases, a systematic evaluation of biases in WGBS strategies is missing. We perform a comparative analysis of several commonly used pre- and post-bisulfite WGBS library preparation protocols for their performance and quality of sequencing outputs. Our results show that bisulfite conversion per se is the main trigger of pronounced sequencing biases, and PCR amplification builds on these underlying artefacts. The majority of standard library preparation methods yield a significantly biased sequence output and overestimate global methylation. Importantly, both absolute and relative methylation levels at specific genomic regions vary substantially between methods, with clear implications for DNA methylation studies. We show that amplification-free library preparation is the least biased approach for WGBS. In protocols with amplification, the choice of bisulfite conversion protocol or polymerase can significantly minimize artefacts. To aid with the quality assessment of existing WGBS datasets, we have integrated a bias diagnostic tool in the Bismark package and offer several approaches for consideration during the preparation and analysis of WGBS datasets.
pyGeno: A Python package for precision medicine and proteogenomics.
Daouda, Tariq; Perreault, Claude; Lemieux, Sébastien
2016-01-01
pyGeno is a Python package mainly intended for precision medicine applications that revolve around genomics and proteomics. It integrates reference sequences and annotations from Ensembl, genomic polymorphisms from the dbSNP database and data from next-gen sequencing into an easy to use, memory-efficient and fast framework, therefore allowing the user to easily explore subject-specific genomes and proteomes. Compared to a standalone program, pyGeno gives the user access to the complete expressivity of Python, a general programming language. Its range of application therefore encompasses both short scripts and large scale genome-wide studies.
pyGeno: A Python package for precision medicine and proteogenomics
Daouda, Tariq; Perreault, Claude; Lemieux, Sébastien
2016-01-01
pyGeno is a Python package mainly intended for precision medicine applications that revolve around genomics and proteomics. It integrates reference sequences and annotations from Ensembl, genomic polymorphisms from the dbSNP database and data from next-gen sequencing into an easy to use, memory-efficient and fast framework, therefore allowing the user to easily explore subject-specific genomes and proteomes. Compared to a standalone program, pyGeno gives the user access to the complete expressivity of Python, a general programming language. Its range of application therefore encompasses both short scripts and large scale genome-wide studies. PMID:27785359
Kaddis Maldonado, Rebecca J.; Parent, Leslie J.
2016-01-01
Infectious retrovirus particles contain two copies of unspliced viral RNA that serve as the viral genome. Unspliced retroviral RNA is transcribed in the nucleus by the host RNA polymerase II and has three potential fates: (1) it can be spliced into subgenomic messenger RNAs (mRNAs) for the translation of viral proteins; or it can remain unspliced to serve as either (2) the mRNA for the translation of Gag and Gag–Pol; or (3) the genomic RNA (gRNA) that is packaged into virions. The Gag structural protein recognizes and binds the unspliced viral RNA to select it as a genome, which is selected in preference to spliced viral RNAs and cellular RNAs. In this review, we summarize the current state of understanding about how retroviral packaging is orchestrated within the cell and explore potential new mechanisms based on recent discoveries in the field. We discuss the cis-acting elements in the unspliced viral RNA and the properties of the Gag protein that are required for their interaction. In addition, we discuss the role of host factors in influencing the fate of the newly transcribed viral RNA, current models for how retroviruses distinguish unspliced viral mRNA from viral genomic RNA, and the possible subcellular sites of genomic RNA dimerization and selection by Gag. Although this review centers primarily on the wealth of data available for the alpharetrovirus Rous sarcoma virus, in which a discrete RNA packaging sequence has been identified, we have also summarized the cis- and trans-acting factors as well as the mechanisms governing gRNA packaging of other retroviruses for comparison. PMID:27657110
Fajardo, Teodoro; Sung, Po-Yu; Roy, Polly
2015-01-01
Bluetongue virus (BTV) causes hemorrhagic disease in economically important livestock. The BTV genome is organized into ten discrete double-stranded RNA molecules (S1-S10) which have been suggested to follow a sequential packaging pathway from smallest to largest segment during virus capsid assembly. To substantiate and extend these studies, we have investigated the RNA sorting and packaging mechanisms with a new experimental approach using inhibitory oligonucleotides. Putative packaging signals present in the 3’untranslated regions of BTV segments were targeted by a number of nuclease resistant oligoribonucleotides (ORNs) and their effects on virus replication in cell culture were assessed. ORNs complementary to the 3’ UTR of BTV RNAs significantly inhibited virus replication without affecting protein synthesis. Same ORNs were found to inhibit complex formation when added to a novel RNA-RNA interaction assay which measured the formation of supramolecular complexes between and among different RNA segments. ORNs targeting the 3’UTR of BTV segment 10, the smallest RNA segment, were shown to be the most potent and deletions or substitution mutations of the targeted sequences diminished the RNA complexes and abolished the recovery of viable viruses using reverse genetics. Cell-free capsid assembly/RNA packaging assay also confirmed that the inhibitory ORNs could interfere with RNA packaging and further substitution mutations within the putative RNA packaging sequence have identified the recognition sequence concerned. Exchange of 3’UTR between segments have further demonstrated that RNA recognition was segment specific, most likely acting as part of the secondary structure of the entire genomic segment. Our data confirm that genome packaging in this segmented dsRNA virus occurs via the formation of supramolecular complexes formed by the interaction of specific sequences located in the 3’ UTRs. Additionally, the inhibition of packaging in-trans with inhibitory ORNs suggests this that interaction is a bona fide target for the design of compounds with antiviral activity. PMID:26646790
Fajardo, Teodoro; Sung, Po-Yu; Roy, Polly
2015-12-01
Bluetongue virus (BTV) causes hemorrhagic disease in economically important livestock. The BTV genome is organized into ten discrete double-stranded RNA molecules (S1-S10) which have been suggested to follow a sequential packaging pathway from smallest to largest segment during virus capsid assembly. To substantiate and extend these studies, we have investigated the RNA sorting and packaging mechanisms with a new experimental approach using inhibitory oligonucleotides. Putative packaging signals present in the 3'untranslated regions of BTV segments were targeted by a number of nuclease resistant oligoribonucleotides (ORNs) and their effects on virus replication in cell culture were assessed. ORNs complementary to the 3' UTR of BTV RNAs significantly inhibited virus replication without affecting protein synthesis. Same ORNs were found to inhibit complex formation when added to a novel RNA-RNA interaction assay which measured the formation of supramolecular complexes between and among different RNA segments. ORNs targeting the 3'UTR of BTV segment 10, the smallest RNA segment, were shown to be the most potent and deletions or substitution mutations of the targeted sequences diminished the RNA complexes and abolished the recovery of viable viruses using reverse genetics. Cell-free capsid assembly/RNA packaging assay also confirmed that the inhibitory ORNs could interfere with RNA packaging and further substitution mutations within the putative RNA packaging sequence have identified the recognition sequence concerned. Exchange of 3'UTR between segments have further demonstrated that RNA recognition was segment specific, most likely acting as part of the secondary structure of the entire genomic segment. Our data confirm that genome packaging in this segmented dsRNA virus occurs via the formation of supramolecular complexes formed by the interaction of specific sequences located in the 3' UTRs. Additionally, the inhibition of packaging in-trans with inhibitory ORNs suggests this that interaction is a bona fide target for the design of compounds with antiviral activity.
GAPIT: genome association and prediction integrated tool.
Lipka, Alexander E; Tian, Feng; Wang, Qishan; Peiffer, Jason; Li, Meng; Bradbury, Peter J; Gore, Michael A; Buckler, Edward S; Zhang, Zhiwu
2012-09-15
Software programs that conduct genome-wide association studies and genomic prediction and selection need to use methodologies that maximize statistical power, provide high prediction accuracy and run in a computationally efficient manner. We developed an R package called Genome Association and Prediction Integrated Tool (GAPIT) that implements advanced statistical methods including the compressed mixed linear model (CMLM) and CMLM-based genomic prediction and selection. The GAPIT package can handle large datasets in excess of 10 000 individuals and 1 million single-nucleotide polymorphisms with minimal computational time, while providing user-friendly access and concise tables and graphs to interpret results. http://www.maizegenetics.net/GAPIT. zhiwu.zhang@cornell.edu Supplementary data are available at Bioinformatics online.
Hsin, Wei-Chen; Chang, Chan-Hua; Chang, Chi-You; Peng, Wei-Hao; Chien, Chung-Liang; Chang, Ming-Fu; Chang, Shin C
2018-05-24
Middle East respiratory syndrome coronavirus (MERS-CoV) consists of a positive-sense, single-stranded RNA genome and four structural proteins: the spike, envelope, membrane, and nucleocapsid protein. The assembly of the viral genome into virus particles involves viral structural proteins and is believed to be mediated through recognition of specific sequences and RNA structures of the viral genome. A culture system for the production of MERS coronavirus-like particles (MERS VLPs) was determined and established by electron microscopy and the detection of coexpressed viral structural proteins. Using the VLP system, a 258-nucleotide RNA fragment, which spans nucleotides 19,712 to 19,969 of the MERS-CoV genome (designated PS258(19712-19969) ME ), was identified to function as a packaging signal. Assembly of the RNA packaging signal into MERS VLPs is dependent on the viral nucleocapsid protein. In addition, a 45-nucleotide stable stem-loop substructure of the PS258(19712-19969) ME interacted with both the N-terminal domain and the C-terminal domain of the viral nucleocapsid protein. Furthermore, a functional SARS-CoV RNA packaging signal failed to assemble into the MERS VLPs, which indicated virus-specific assembly of the RNA genome. A MERS-oV RNA packaging signal was identified by the detection of GFP expression following an incubation of MERS VLPs carrying the heterologous mRNA GFP-PS258(19712-19969) ME with virus permissive Huh7 cells. The MERS VLP system could help us in understanding virus infection and morphogenesis.
Lun, Aaron T.L.; Smyth, Gordon K.
2016-01-01
Chromatin immunoprecipitation with massively parallel sequencing (ChIP-seq) is widely used to identify binding sites for a target protein in the genome. An important scientific application is to identify changes in protein binding between different treatment conditions, i.e. to detect differential binding. This can reveal potential mechanisms through which changes in binding may contribute to the treatment effect. The csaw package provides a framework for the de novo detection of differentially bound genomic regions. It uses a window-based strategy to summarize read counts across the genome. It exploits existing statistical software to test for significant differences in each window. Finally, it clusters windows into regions for output and controls the false discovery rate properly over all detected regions. The csaw package can handle arbitrarily complex experimental designs involving biological replicates. It can be applied to both transcription factor and histone mark datasets, and, more generally, to any type of sequencing data measuring genomic coverage. csaw performs favorably against existing methods for de novo DB analyses on both simulated and real data. csaw is implemented as a R software package and is freely available from the open-source Bioconductor project. PMID:26578583
Cross- and Co-Packaging of Retroviral RNAs and Their Consequences
Ali, Lizna M.; Rizvi, Tahir A.; Mustafa, Farah
2016-01-01
Retroviruses belong to the family Retroviridae and are ribonucleoprotein (RNP) particles that contain a dimeric RNA genome. Retroviral particle assembly is a complex process, and how the virus is able to recognize and specifically capture the genomic RNA (gRNA) among millions of other cellular and spliced retroviral RNAs has been the subject of extensive investigation over the last two decades. The specificity towards RNA packaging requires higher order interactions of the retroviral gRNA with the structural Gag proteins. Moreover, several retroviruses have been shown to have the ability to cross-/co-package gRNA from other retroviruses, despite little sequence homology. This review will compare the determinants of gRNA encapsidation among different retroviruses, followed by an examination of our current understanding of the interaction between diverse viral genomes and heterologous proteins, leading to their cross-/co-packaging. Retroviruses are well-known serious animal and human pathogens, and such a cross-/co-packaging phenomenon could result in the generation of novel viral variants with unknown pathogenic potential. At the same time, however, an enhanced understanding of the molecular mechanisms involved in these specific interactions makes retroviruses an attractive target for anti-viral drugs, vaccines, and vectors for human gene therapy. PMID:27727192
Cross- and Co-Packaging of Retroviral RNAs and Their Consequences.
Ali, Lizna M; Rizvi, Tahir A; Mustafa, Farah
2016-10-11
Retroviruses belong to the family Retroviridae and are ribonucleoprotein (RNP) particles that contain a dimeric RNA genome. Retroviral particle assembly is a complex process, and how the virus is able to recognize and specifically capture the genomic RNA (gRNA) among millions of other cellular and spliced retroviral RNAs has been the subject of extensive investigation over the last two decades. The specificity towards RNA packaging requires higher order interactions of the retroviral gRNA with the structural Gag proteins. Moreover, several retroviruses have been shown to have the ability to cross-/co-package gRNA from other retroviruses, despite little sequence homology. This review will compare the determinants of gRNA encapsidation among different retroviruses, followed by an examination of our current understanding of the interaction between diverse viral genomes and heterologous proteins, leading to their cross-/co-packaging. Retroviruses are well-known serious animal and human pathogens, and such a cross-/co-packaging phenomenon could result in the generation of novel viral variants with unknown pathogenic potential. At the same time, however, an enhanced understanding of the molecular mechanisms involved in these specific interactions makes retroviruses an attractive target for anti-viral drugs, vaccines, and vectors for human gene therapy.
Kuo, Lili; Koetzner, Cheri A; Hurst, Kelley R; Masters, Paul S
2014-04-01
The coronavirus nucleocapsid (N) protein forms a helical ribonucleoprotein with the viral positive-strand RNA genome and binds to the principal constituent of the virion envelope, the membrane (M) protein, to facilitate assembly and budding. Besides these structural roles, N protein associates with a component of the replicase-transcriptase complex, nonstructural protein 3, at a critical early stage of infection. N protein has also been proposed to participate in the replication and selective packaging of genomic RNA and the transcription and translation of subgenomic mRNA. Coronavirus N proteins contain two structurally distinct RNA-binding domains, an unusual characteristic among RNA viruses. To probe the functions of these domains in the N protein of the model coronavirus mouse hepatitis virus (MHV), we constructed mutants in which each RNA-binding domain was replaced by its counterpart from the N protein of severe acute respiratory syndrome coronavirus (SARS-CoV). Mapping of revertants of the resulting chimeric viruses provided evidence for extensive intramolecular interactions between the two RNA-binding domains. Through analysis of viral RNA that was packaged into virions we identified the second of the two RNA-binding domains as a principal determinant of MHV packaging signal recognition. As expected, the interaction of N protein with M protein was not affected in either of the chimeric viruses. Moreover, the SARS-CoV N substitutions did not alter the fidelity of leader-body junction formation during subgenomic mRNA synthesis. These results more clearly delineate the functions of N protein and establish a basis for further exploration of the mechanism of genomic RNA packaging. This work describes the interactions of the two RNA-binding domains of the nucleocapsid protein of a model coronavirus, mouse hepatitis virus. The main finding is that the second of the two domains plays an essential role in recognizing the RNA structure that allows the selective packaging of genomic RNA into assembled virions.
Forces from the Portal Govern the Late-Stage DNA Transport in a Viral DNA Packaging Nanomotor.
Jing, Peng; Burris, Benjamin; Zhang, Rong
2016-07-12
In the Phi29 bacteriophage, the DNA packaging nanomotor packs its double-stranded DNA genome into the virus capsid. At the late stage of DNA packaging, the negatively charged genome is increasingly compacted at a higher density in the capsid with a higher internal pressure. During the process, two Donnan effects, osmotic pressure and Donnan equilibrium potentials, are significantly amplified, which, in turn, affect the channel activity of the portal protein, GP10, embedded in the semipermeable capsid shell. In the research, planar lipid bilayer experiments were used to study the channel activities of the viral protein. The Donnan effect on the conformational changes of the viral protein was discovered, indicating GP10 may not be a static channel at the late stage of DNA packaging. Due to the conformational changes, GP10 may generate electrostatic forces that govern the DNA transport. For the section of the genome DNA that remains outside of the connector channel, a strong repulsive force from the viral protein would be generated against the DNA entry; however, for the section of the genome DNA within the channel, the portal protein would become a Brownian motor, which adopts the flash Brownian ratchet mechanism to pump the DNA against the increasingly built-up internal pressure (up to 20 atm) in the capsid. Therefore, the DNA transport in the nanoscale viral channel at the late stage of DNA packaging could be a consequence of Brownian movement of the genomic DNA, which would be rectified and harnessed by the forces from the interior wall of the viral channel under the influence of the Donnan effect. Copyright © 2016 Biophysical Society. Published by Elsevier Inc. All rights reserved.
Oreshkova, Nadia; Moormann, Rob J. M.; Kortekaas, Jeroen
2014-01-01
ABSTRACT Bunyavirus genomes comprise a small (S), a medium (M), and a large (L) RNA segment of negative polarity. Although the untranslated regions have been shown to comprise signals required for transcription, replication, and encapsidation, the mechanisms that drive the packaging of at least one S, M, and L segment into a single virion to generate infectious virus are largely unknown. One of the most important members of the Bunyaviridae family that causes devastating disease in ruminants and occasionally humans is the Rift Valley fever virus (RVFV). We studied the flexibility of RVFV genome packaging by splitting the glycoprotein precursor gene, encoding the (NSm)GnGc polyprotein, into two individual genes encoding either (NSm)Gn or Gc. Using reverse genetics, six viruses with a segmented glycoprotein precursor gene were rescued, varying from a virus comprising two S-type segments in the absence of an M-type segment to a virus consisting of four segments (RVFV-4s), of which three are M-type. Despite that all virus variants were able to grow in mammalian cell lines, they were unable to spread efficiently in cells of mosquito origin. Moreover, in vivo studies demonstrated that RVFV-4s is unable to cause disseminated infection and disease in mice, even in the presence of the main virulence factor NSs, but induced a protective immune response against a lethal challenge with wild-type virus. In summary, splitting bunyavirus glycoprotein precursor genes provides new opportunities to study bunyavirus genome packaging and offers new methods to develop next-generation live-attenuated bunyavirus vaccines. IMPORTANCE Rift Valley fever virus (RVFV) causes devastating disease in ruminants and occasionally humans. Virions capable of productive infection comprise at least one copy of the small (S), medium (M), and large (L) RNA genome segments. The M segment encodes a glycoprotein precursor (GPC) protein that is cotranslationally cleaved into Gn and Gc, which are required for virus entry and fusion. We studied the flexibility of RVFV genome packaging and developed experimental live-attenuated vaccines by applying a unique strategy based on the splitting of the GnGc open reading frame. Several RVFV variants, varying from viruses comprising two S-type segments to viruses consisting of four segments (RVFV-4s), of which three are M-type, could be rescued and were shown to induce a rapid protective immune response. Altogether, the segmentation of bunyavirus GPCs provides a new method for studying bunyavirus genome packaging and facilitates the development of novel live-attenuated bunyavirus vaccines. PMID:25008937
Wichgers Schreur, Paul J; Oreshkova, Nadia; Moormann, Rob J M; Kortekaas, Jeroen
2014-09-01
Bunyavirus genomes comprise a small (S), a medium (M), and a large (L) RNA segment of negative polarity. Although the untranslated regions have been shown to comprise signals required for transcription, replication, and encapsidation, the mechanisms that drive the packaging of at least one S, M, and L segment into a single virion to generate infectious virus are largely unknown. One of the most important members of the Bunyaviridae family that causes devastating disease in ruminants and occasionally humans is the Rift Valley fever virus (RVFV). We studied the flexibility of RVFV genome packaging by splitting the glycoprotein precursor gene, encoding the (NSm)GnGc polyprotein, into two individual genes encoding either (NSm)Gn or Gc. Using reverse genetics, six viruses with a segmented glycoprotein precursor gene were rescued, varying from a virus comprising two S-type segments in the absence of an M-type segment to a virus consisting of four segments (RVFV-4s), of which three are M-type. Despite that all virus variants were able to grow in mammalian cell lines, they were unable to spread efficiently in cells of mosquito origin. Moreover, in vivo studies demonstrated that RVFV-4s is unable to cause disseminated infection and disease in mice, even in the presence of the main virulence factor NSs, but induced a protective immune response against a lethal challenge with wild-type virus. In summary, splitting bunyavirus glycoprotein precursor genes provides new opportunities to study bunyavirus genome packaging and offers new methods to develop next-generation live-attenuated bunyavirus vaccines. Rift Valley fever virus (RVFV) causes devastating disease in ruminants and occasionally humans. Virions capable of productive infection comprise at least one copy of the small (S), medium (M), and large (L) RNA genome segments. The M segment encodes a glycoprotein precursor (GPC) protein that is cotranslationally cleaved into Gn and Gc, which are required for virus entry and fusion. We studied the flexibility of RVFV genome packaging and developed experimental live-attenuated vaccines by applying a unique strategy based on the splitting of the GnGc open reading frame. Several RVFV variants, varying from viruses comprising two S-type segments to viruses consisting of four segments (RVFV-4s), of which three are M-type, could be rescued and were shown to induce a rapid protective immune response. Altogether, the segmentation of bunyavirus GPCs provides a new method for studying bunyavirus genome packaging and facilitates the development of novel live-attenuated bunyavirus vaccines. Copyright © 2014, American Society for Microbiology. All Rights Reserved.
Chelikani, Venkata; Ranjan, Tushar; Zade, Amrutraj; Shukla, Avi; Kondabagil, Kiran
2014-06-01
Genome packaging is a critical step in the virion assembly process. The putative ATP-driven genome packaging motor of Acanthamoeba polyphaga mimivirus (APMV) and other nucleocytoplasmic large DNA viruses (NCLDVs) is a distant ortholog of prokaryotic chromosome segregation motors, such as FtsK and HerA, rather than other viral packaging motors, such as large terminase. Intriguingly, APMV also encodes other components, i.e., three putative serine recombinases and a putative type II topoisomerase, all of which are essential for chromosome segregation in prokaryotes. Based on our analyses of these components and taking the limited available literature into account, here we propose for the first time a model for genome segregation and packaging in APMV that can possibly be extended to NCLDV subfamilies, except perhaps Poxviridae and Ascoviridae. This model might represent a unique variation of the prokaryotic system acquired and contrived by the large DNA viruses of eukaryotes. It is also consistent with previous observations that unicellular eukaryotes, such as amoebae, are melting pots for the advent of chimeric organisms with novel mechanisms. Extremely large viruses with DNA genomes infect a wide range of eukaryotes, from human beings to amoebae and from crocodiles to algae. These large DNA viruses, unlike their much smaller cousins, have the capability of making most of the protein components required for their multiplication. Once they infect the cell, these viruses set up viral replication centers, known as viral factories, to carry out their multiplication with very little help from the host. Our sequence analyses show that there is remarkable similarity between prokaryotes (bacteria and archaea) and large DNA viruses, such as mimivirus, vaccinia virus, and pandoravirus, in the way that they process their newly synthesized genetic material to make sure that only one copy of the complete genome is generated and is meticulously placed inside the newly synthesized viral particle. These findings have important evolutionary implications about the origin and evolution of large viruses.
Chelikani, Venkata; Ranjan, Tushar; Zade, Amrutraj; Shukla, Avi
2014-01-01
ABSTRACT Genome packaging is a critical step in the virion assembly process. The putative ATP-driven genome packaging motor of Acanthamoeba polyphaga mimivirus (APMV) and other nucleocytoplasmic large DNA viruses (NCLDVs) is a distant ortholog of prokaryotic chromosome segregation motors, such as FtsK and HerA, rather than other viral packaging motors, such as large terminase. Intriguingly, APMV also encodes other components, i.e., three putative serine recombinases and a putative type II topoisomerase, all of which are essential for chromosome segregation in prokaryotes. Based on our analyses of these components and taking the limited available literature into account, here we propose for the first time a model for genome segregation and packaging in APMV that can possibly be extended to NCLDV subfamilies, except perhaps Poxviridae and Ascoviridae. This model might represent a unique variation of the prokaryotic system acquired and contrived by the large DNA viruses of eukaryotes. It is also consistent with previous observations that unicellular eukaryotes, such as amoebae, are melting pots for the advent of chimeric organisms with novel mechanisms. IMPORTANCE Extremely large viruses with DNA genomes infect a wide range of eukaryotes, from human beings to amoebae and from crocodiles to algae. These large DNA viruses, unlike their much smaller cousins, have the capability of making most of the protein components required for their multiplication. Once they infect the cell, these viruses set up viral replication centers, known as viral factories, to carry out their multiplication with very little help from the host. Our sequence analyses show that there is remarkable similarity between prokaryotes (bacteria and archaea) and large DNA viruses, such as mimivirus, vaccinia virus, and pandoravirus, in the way that they process their newly synthesized genetic material to make sure that only one copy of the complete genome is generated and is meticulously placed inside the newly synthesized viral particle. These findings have important evolutionary implications about the origin and evolution of large viruses. PMID:24623441
Johnson, Reed F.; McCarthy, Sarah E.; Godlewski, Peter J.; Harty, Ronald N.
2006-01-01
The packaging of viral genomic RNA into nucleocapsids and subsequently into virions is not completely understood. Phosphoprotein (P) and nucleoprotein (NP) interactions link NP-RNA complexes with P-L (polymerase) complexes to form viral nucleocapsids. The nucleocapsid then interacts with the viral matrix protein, leading to specific packaging of the nucleocapsid into the virion. A mammalian two-hybrid assay and confocal microscopy were used to demonstrate that Ebola virus VP35 and VP40 interact and colocalize in transfected cells. VP35 was packaged into budding virus-like particles (VLPs) as observed by protease protection assays. Moreover, VP40 and VP35 were sufficient for packaging an Ebola virus minignome RNA into VLPs. Results from immunoprecipitation-reverse transcriptase PCR experiments suggest that VP35 confers specificity of the nucleocapsid for viral genomic RNA by direct VP35-RNA interactions. PMID:16698994
GenoMatrix: A Software Package for Pedigree-Based and Genomic Prediction Analyses on Complex Traits.
Nazarian, Alireza; Gezan, Salvador Alejandro
2016-07-01
Genomic and pedigree-based best linear unbiased prediction methodologies (G-BLUP and P-BLUP) have proven themselves efficient for partitioning the phenotypic variance of complex traits into its components, estimating the individuals' genetic merits, and predicting unobserved (or yet-to-be observed) phenotypes in many species and fields of study. The GenoMatrix software, presented here, is a user-friendly package to facilitate the process of using genome-wide marker data and parentage information for G-BLUP and P-BLUP analyses on complex traits. It provides users with a collection of applications which help them on a set of tasks from performing quality control on data to constructing and manipulating the genomic and pedigree-based relationship matrices and obtaining their inverses. Such matrices will be then used in downstream analyses by other statistical packages. The package also enables users to obtain predicted values for unobserved individuals based on the genetic values of observed related individuals. GenoMatrix is available to the research community as a Windows 64bit executable and can be downloaded free of charge at: http://compbio.ufl.edu/software/genomatrix/. © The American Genetic Association. 2016. All rights reserved. For permissions, please e-mail: journals.permissions@oup.com.
CMG-biotools, a free workbench for basic comparative microbial genomics.
Vesth, Tammi; Lagesen, Karin; Acar, Öncel; Ussery, David
2013-01-01
Today, there are more than a hundred times as many sequenced prokaryotic genomes than were present in the year 2000. The economical sequencing of genomic DNA has facilitated a whole new approach to microbial genomics. The real power of genomics is manifested through comparative genomics that can reveal strain specific characteristics, diversity within species and many other aspects. However, comparative genomics is a field not easily entered into by scientists with few computational skills. The CMG-biotools package is designed for microbiologists with limited knowledge of computational analysis and can be used to perform a number of analyses and comparisons of genomic data. The CMG-biotools system presents a stand-alone interface for comparative microbial genomics. The package is a customized operating system, based on Xubuntu 10.10, available through the open source Ubuntu project. The system can be installed on a virtual computer, allowing the user to run the system alongside any other operating system. Source codes for all programs are provided under GNU license, which makes it possible to transfer the programs to other systems if so desired. We here demonstrate the package by comparing and analyzing the diversity within the class Negativicutes, represented by 31 genomes including 10 genera. The analyses include 16S rRNA phylogeny, basic DNA and codon statistics, proteome comparisons using BLAST and graphical analyses of DNA structures. This paper shows the strength and diverse use of the CMG-biotools system. The system can be installed on a vide range of host operating systems and utilizes as much of the host computer as desired. It allows the user to compare multiple genomes, from various sources using standardized data formats and intuitive visualizations of results. The examples presented here clearly shows that users with limited computational experience can perform complicated analysis without much training.
i-ADHoRe 2.0: an improved tool to detect degenerated genomic homology using genomic profiles.
Simillion, Cedric; Janssens, Koen; Sterck, Lieven; Van de Peer, Yves
2008-01-01
i-ADHoRe is a software tool that combines gene content and gene order information of homologous genomic segments into profiles to detect highly degenerated homology relations within and between genomes. The new version offers, besides a significant increase in performance, several optimizations to the algorithm, most importantly to the profile alignment routine. As a result, the annotations of multiple genomes, or parts thereof, can be fed simultaneously into the program, after which it will report all regions of homology, both within and between genomes. The i-ADHoRe 2.0 package contains the C++ source code for the main program as well as various Perl scripts and a fully documented Perl API to facilitate post-processing. The software runs on any Linux- or -UNIX based platform. The package is freely available for academic users and can be downloaded from http://bioinformatics.psb.ugent.be/
Ins and Outs of Multipartite Positive-Strand RNA Plant Viruses: Packaging versus Systemic Spread
Dall’Ara, Mattia; Ratti, Claudio; Bouzoubaa, Salah E.; Gilmer, David
2016-01-01
Viruses possessing a non-segmented genome require a specific recognition of their nucleic acid to ensure its protection in a capsid. A similar feature exists for viruses having a segmented genome, usually consisting of viral genomic segments joined together into one viral entity. While this appears as a rule for animal viruses, the majority of segmented plant viruses package their genomic segments individually. To ensure a productive infection, all viral particles and thereby all segments have to be present in the same cell. Progression of the virus within the plant requires as well a concerted genome preservation to avoid loss of function. In this review, we will discuss the “life aspects” of chosen phytoviruses and argue for the existence of RNA-RNA interactions that drive the preservation of viral genome integrity while the virus progresses in the plant. PMID:27548199
USDA-ARS?s Scientific Manuscript database
Tomato Functional Genomics Database (TFGD; http://ted.bti.cornell.edu) provides a comprehensive systems biology resource to store, mine, analyze, visualize and integrate large-scale tomato functional genomics datasets. The database is expanded from the previously described Tomato Expression Database...
ChIPpeakAnno: a Bioconductor package to annotate ChIP-seq and ChIP-chip data
2010-01-01
Background Chromatin immunoprecipitation (ChIP) followed by high-throughput sequencing (ChIP-seq) or ChIP followed by genome tiling array analysis (ChIP-chip) have become standard technologies for genome-wide identification of DNA-binding protein target sites. A number of algorithms have been developed in parallel that allow identification of binding sites from ChIP-seq or ChIP-chip datasets and subsequent visualization in the University of California Santa Cruz (UCSC) Genome Browser as custom annotation tracks. However, summarizing these tracks can be a daunting task, particularly if there are a large number of binding sites or the binding sites are distributed widely across the genome. Results We have developed ChIPpeakAnno as a Bioconductor package within the statistical programming environment R to facilitate batch annotation of enriched peaks identified from ChIP-seq, ChIP-chip, cap analysis of gene expression (CAGE) or any experiments resulting in a large number of enriched genomic regions. The binding sites annotated with ChIPpeakAnno can be viewed easily as a table, a pie chart or plotted in histogram form, i.e., the distribution of distances to the nearest genes for each set of peaks. In addition, we have implemented functionalities for determining the significance of overlap between replicates or binding sites among transcription factors within a complex, and for drawing Venn diagrams to visualize the extent of the overlap between replicates. Furthermore, the package includes functionalities to retrieve sequences flanking putative binding sites for PCR amplification, cloning, or motif discovery, and to identify Gene Ontology (GO) terms associated with adjacent genes. Conclusions ChIPpeakAnno enables batch annotation of the binding sites identified from ChIP-seq, ChIP-chip, CAGE or any technology that results in a large number of enriched genomic regions within the statistical programming environment R. Allowing users to pass their own annotation data such as a different Chromatin immunoprecipitation (ChIP) preparation and a dataset from literature, or existing annotation packages, such as GenomicFeatures and BSgenome, provides flexibility. Tight integration to the biomaRt package enables up-to-date annotation retrieval from the BioMart database. PMID:20459804
compendiumdb: an R package for retrieval and storage of functional genomics data.
Nandal, Umesh K; van Kampen, Antoine H C; Moerland, Perry D
2016-09-15
Currently, the Gene Expression Omnibus (GEO) contains public data of over 1 million samples from more than 40 000 microarray-based functional genomics experiments. This provides a rich source of information for novel biological discoveries. However, unlocking this potential often requires retrieving and storing a large number of expression profiles from a wide range of different studies and platforms. The compendiumdb R package provides an environment for downloading functional genomics data from GEO, parsing the information into a local or remote database and interacting with the database using dedicated R functions, thus enabling seamless integration with other tools available in R/Bioconductor. The compendiumdb package is written in R, MySQL and Perl. Source code and binaries are available from CRAN (http://cran.r-project.org/web/packages/compendiumdb/) for all major platforms (Linux, MS Windows and OS X) under the GPLv3 license. p.d.moerland@amc.uva.nl Supplementary data are available at Bioinformatics online. © The Author 2016. Published by Oxford University Press. All rights reserved. For Permissions, please e-mail: journals.permissions@oup.com.
Peering down the barrel of a bacteriophage portal: the genome packaging and release valve in p22.
Tang, Jinghua; Lander, Gabriel C; Olia, Adam S; Olia, Adam; Li, Rui; Casjens, Sherwood; Prevelige, Peter; Cingolani, Gino; Baker, Timothy S; Johnson, John E
2011-04-13
The encapsidated genome in all double-strand DNA bacteriophages is packaged to liquid crystalline density through a unique vertex in the procapsid assembly intermediate, which has a portal protein dodecamer in place of five coat protein subunits. The portal orchestrates DNA packaging and exit, through a series of varying interactions with the scaffolding, terminase, and closure proteins. Here, we report an asymmetric cryoEM reconstruction of the entire P22 virion at 7.8 Å resolution. X-ray crystal structure models of the full-length portal and of the portal lacking 123 residues at the C terminus in complex with gene product 4 (Δ123portal-gp4) obtained by Olia et al. (2011) were fitted into this reconstruction. The interpreted density map revealed that the 150 Å, coiled-coil, barrel portion of the portal entraps the last DNA to be packaged and suggests a mechanism for head-full DNA signaling and transient stabilization of the genome during addition of closure proteins. Copyright © 2011 Elsevier Ltd. All rights reserved.
Genome packaging in EL and Lin68, two giant phiKZ-like bacteriophages of P. aeruginosa.
Sokolova, O S; Shaburova, O V; Pechnikova, E V; Shaytan, A K; Krylov, S V; Kiselev, N A; Krylov, V N
2014-11-01
A unique feature of the Pseudomonas aeruginosa giant phage phiKZ is its way of genome packaging onto a spool-like protein structure, the inner body. Until recently, no similar structures have been detected in other phages. We have studied DNA packaging in P. aeruginosa phages EL and Lin68 using cryo-electron microscopy and revealed the presence of inner bodies. The shape and positioning of the inner body and the density of the DNA packaging in EL are different from those found in phiKZ and Lin68. This internal organization explains how the shorter EL genome is packed into a large EL capsid, which has the same external dimensions as the capsids of phiKZ and Lin68. The similarity in the structural organization in EL and other phiKZ-like phages indicates that EL is phylogenetically related to other phiKZ-like phages, and, despite the lack of detectable DNA homology, EL, phiKZ, and Lin68 descend from a common ancestor. Copyright © 2014 Elsevier Inc. All rights reserved.
DOE Office of Scientific and Technical Information (OSTI.GOV)
Nam, Hyun-Joo; Gurda, Brittney L.; McKenna, Robert
2012-09-17
The single-stranded DNA (ssDNA) parvoviruses enter host cells through receptor-mediated endocytosis, and infection depends on processing in the early to late endosome as well as in the lysosome prior to nuclear entry for replication. However, the mechanisms of capsid endosomal processing, including the effects of low pH, are poorly understood. To gain insight into the structural transitions required for this essential step in infection, the crystal structures of empty and green fluorescent protein (GFP) gene-packaged adeno-associated virus serotype 8 (AAV8) have been determined at pH values of 6.0, 5.5, and 4.0 and then at pH 7.5 after incubation at pHmore » 4.0, mimicking the conditions encountered during endocytic trafficking. While the capsid viral protein (VP) topologies of all the structures were similar, significant amino acid side chain conformational rearrangements were observed on (i) the interior surface of the capsid under the icosahedral 3-fold axis near ordered nucleic acid density that was lost concomitant with the conformational change as pH was reduced and (ii) the exterior capsid surface close to the icosahedral 2-fold depression. The 3-fold change is consistent with DNA release from an ordering interaction on the inside surface of the capsid at low pH values and suggests transitions that likely trigger the capsid for genome uncoating. The surface change results in disruption of VP-VP interface interactions and a decrease in buried surface area between VP monomers. This disruption points to capsid destabilization which may (i) release VP1 amino acids for its phospholipase A2 function for endosomal escape and nuclear localization signals for nuclear targeting and (ii) trigger genome uncoating.« less
Packaging of Dinoroseobacter shibae DNA into Gene Transfer Agent Particles Is Not Random.
Tomasch, Jürgen; Wang, Hui; Hall, April T K; Patzelt, Diana; Preusse, Matthias; Petersen, Jörn; Brinkmann, Henner; Bunk, Boyke; Bhuju, Sabin; Jarek, Michael; Geffers, Robert; Lang, Andrew S; Wagner-Döbler, Irene
2018-01-01
Gene transfer agents (GTAs) are phage-like particles which contain a fragment of genomic DNA of the bacterial or archaeal producer and deliver this to a recipient cell. GTA gene clusters are present in the genomes of almost all marine Rhodobacteraceae (Roseobacters) and might be important contributors to horizontal gene transfer in the world's oceans. For all organisms studied so far, no obvious evidence of sequence specificity or other nonrandom process responsible for packaging genomic DNA into GTAs has been found. Here, we show that knock-out of an autoinducer synthase gene of Dinoroseobacter shibae resulted in overproduction and release of functional GTA particles (DsGTA). Next-generation sequencing of the 4.2-kb DNA fragments isolated from DsGTAs revealed that packaging was not random. DNA from low-GC conjugative plasmids but not from high-GC chromids was excluded from packaging. Seven chromosomal regions were strongly overrepresented in DNA isolated from DsGTA. These packaging peaks lacked identifiable conserved sequence motifs that might represent recognition sites for the GTA terminase complex. Low-GC regions of the chromosome, including the origin and terminus of replication, were underrepresented in DNA isolated from DsGTAs. DNA methylation reduced packaging frequency while the level of gene expression had no influence. Chromosomal regions found to be over- and underrepresented in DsGTA-DNA were regularly spaced. We propose that a "headful" type of packaging is initiated at the sites of coverage peaks and, after linearization of the chromosomal DNA, proceeds in both directions from the initiation site. GC-content, DNA-modifications, and chromatin structure might influence at which sides GTA packaging can be initiated. © The Author(s) 2018. Published by Oxford University Press on behalf of the Society for Molecular Biology and Evolution.
Packaging of Dinoroseobacter shibae DNA into Gene Transfer Agent Particles Is Not Random
Wang, Hui; Hall, April T K; Patzelt, Diana; Preusse, Matthias; Petersen, Jörn; Brinkmann, Henner; Bunk, Boyke; Bhuju, Sabin; Jarek, Michael; Geffers, Robert; Lang, Andrew S; Wagner-Döbler, Irene
2018-01-01
Abstract Gene transfer agents (GTAs) are phage-like particles which contain a fragment of genomic DNA of the bacterial or archaeal producer and deliver this to a recipient cell. GTA gene clusters are present in the genomes of almost all marine Rhodobacteraceae (Roseobacters) and might be important contributors to horizontal gene transfer in the world’s oceans. For all organisms studied so far, no obvious evidence of sequence specificity or other nonrandom process responsible for packaging genomic DNA into GTAs has been found. Here, we show that knock-out of an autoinducer synthase gene of Dinoroseobacter shibae resulted in overproduction and release of functional GTA particles (DsGTA). Next-generation sequencing of the 4.2-kb DNA fragments isolated from DsGTAs revealed that packaging was not random. DNA from low-GC conjugative plasmids but not from high-GC chromids was excluded from packaging. Seven chromosomal regions were strongly overrepresented in DNA isolated from DsGTA. These packaging peaks lacked identifiable conserved sequence motifs that might represent recognition sites for the GTA terminase complex. Low-GC regions of the chromosome, including the origin and terminus of replication, were underrepresented in DNA isolated from DsGTAs. DNA methylation reduced packaging frequency while the level of gene expression had no influence. Chromosomal regions found to be over- and underrepresented in DsGTA-DNA were regularly spaced. We propose that a “headful” type of packaging is initiated at the sites of coverage peaks and, after linearization of the chromosomal DNA, proceeds in both directions from the initiation site. GC-content, DNA-modifications, and chromatin structure might influence at which sides GTA packaging can be initiated. PMID:29325123
iPat: intelligent prediction and association tool for genomic research.
Chen, Chunpeng James; Zhang, Zhiwu
2018-06-01
The ultimate goal of genomic research is to effectively predict phenotypes from genotypes so that medical management can improve human health and molecular breeding can increase agricultural production. Genomic prediction or selection (GS) plays a complementary role to genome-wide association studies (GWAS), which is the primary method to identify genes underlying phenotypes. Unfortunately, most computing tools cannot perform data analyses for both GWAS and GS. Furthermore, the majority of these tools are executed through a command-line interface (CLI), which requires programming skills. Non-programmers struggle to use them efficiently because of the steep learning curves and zero tolerance for data formats and mistakes when inputting keywords and parameters. To address these problems, this study developed a software package, named the Intelligent Prediction and Association Tool (iPat), with a user-friendly graphical user interface. With iPat, GWAS or GS can be performed using a pointing device to simply drag and/or click on graphical elements to specify input data files, choose input parameters and select analytical models. Models available to users include those implemented in third party CLI packages such as GAPIT, PLINK, FarmCPU, BLINK, rrBLUP and BGLR. Users can choose any data format and conduct analyses with any of these packages. File conversions are automatically conducted for specified input data and selected packages. A GWAS-assisted genomic prediction method was implemented to perform genomic prediction using any GWAS method such as FarmCPU. iPat was written in Java for adaptation to multiple operating systems including Windows, Mac and Linux. The iPat executable file, user manual, tutorials and example datasets are freely available at http://zzlab.net/iPat. zhiwu.zhang@wsu.edu.
Fire behavior sensor package remote trigger design
Dan Jimenez; Jason Forthofer; James Reardon; Bret Butler
2007-01-01
Fire behavior characteristics (such as temperature, radiant and total heat flux, 2- and 3-dimensional velocities, and air flow) are extremely difficult to measure insitu. Although insitu sensor packages are capable of such measurements in realtime, it is also essential to acquire video documentation as a means of better understanding the fire behavior data recorded by...
Seo, Jang-Kyun; Kwon, Sun-Jung; Rao, A L N
2012-06-01
Genome packaging is functionally coupled to replication in RNA viruses pathogenic to humans (Poliovirus), insects (Flock house virus [FHV]), and plants (Brome mosaic virus [BMV]). However, the underlying mechanism is not fully understood. We have observed previously that in FHV and BMV, unlike ectopically expressed capsid protein (CP), packaging specificity results from RNA encapsidation by CP that has been translated from mRNA produced from replicating genomic RNA. Consequently, we hypothesize that a physical interaction with replicase increases the CP specificity for packaging viral RNAs. We tested this hypothesis by evaluating the molecular interaction between replicase protein and CP using a FHV-Nicotiana benthamiana system. Bimolecular fluorescence complementation in conjunction with fluorescent cellular protein markers and coimmunoprecipitation assays demonstrated that FHV replicase (protein A) and CP physically interact at the mitochondrial site of replication and that this interaction requires the N-proximal region from either amino acids 1 to 31 or amino acids 32 to 50 of the CP. In contrast to the mitochondrial localization of CP derived from FHV replication, ectopic expression displayed a characteristic punctate pattern on the endoplasmic reticulum (ER). This pattern was altered to relocalize the CP throughout the cytoplasm when the C-proximal hydrophobic domain was deleted. Analysis of the packaging phenotypes of the CP mutants defective either in protein A-CP interactions or ER localization suggested that synchronization between protein A-CP interaction and its subcellular localization is imperative to confer packaging specificity.
Seo, Jang-Kyun; Kwon, Sun-Jung
2012-01-01
Genome packaging is functionally coupled to replication in RNA viruses pathogenic to humans (Poliovirus), insects (Flock house virus [FHV]), and plants (Brome mosaic virus [BMV]). However, the underlying mechanism is not fully understood. We have observed previously that in FHV and BMV, unlike ectopically expressed capsid protein (CP), packaging specificity results from RNA encapsidation by CP that has been translated from mRNA produced from replicating genomic RNA. Consequently, we hypothesize that a physical interaction with replicase increases the CP specificity for packaging viral RNAs. We tested this hypothesis by evaluating the molecular interaction between replicase protein and CP using a FHV-Nicotiana benthamiana system. Bimolecular fluorescence complementation in conjunction with fluorescent cellular protein markers and coimmunoprecipitation assays demonstrated that FHV replicase (protein A) and CP physically interact at the mitochondrial site of replication and that this interaction requires the N-proximal region from either amino acids 1 to 31 or amino acids 32 to 50 of the CP. In contrast to the mitochondrial localization of CP derived from FHV replication, ectopic expression displayed a characteristic punctate pattern on the endoplasmic reticulum (ER). This pattern was altered to relocalize the CP throughout the cytoplasm when the C-proximal hydrophobic domain was deleted. Analysis of the packaging phenotypes of the CP mutants defective either in protein A-CP interactions or ER localization suggested that synchronization between protein A-CP interaction and its subcellular localization is imperative to confer packaging specificity. PMID:22438552
USDA-ARS?s Scientific Manuscript database
The ARS Microbial Genome Sequence Database (http://199.133.98.43), a web-based database server, was established utilizing the BIGSdb (Bacterial Isolate Genomics Sequence Database) software package, developed at Oxford University, as a tool to manage multi-locus sequence data for the family Streptomy...
Coordinates and intervals in graph-based reference genomes.
Rand, Knut D; Grytten, Ivar; Nederbragt, Alexander J; Storvik, Geir O; Glad, Ingrid K; Sandve, Geir K
2017-05-18
It has been proposed that future reference genomes should be graph structures in order to better represent the sequence diversity present in a species. However, there is currently no standard method to represent genomic intervals, such as the positions of genes or transcription factor binding sites, on graph-based reference genomes. We formalize offset-based coordinate systems on graph-based reference genomes and introduce methods for representing intervals on these reference structures. We show the advantage of our methods by representing genes on a graph-based representation of the newest assembly of the human genome (GRCh38) and its alternative loci for regions that are highly variable. More complex reference genomes, containing alternative loci, require methods to represent genomic data on these structures. Our proposed notation for genomic intervals makes it possible to fully utilize the alternative loci of the GRCh38 assembly and potential future graph-based reference genomes. We have made a Python package for representing such intervals on offset-based coordinate systems, available at https://github.com/uio-cels/offsetbasedgraph . An interactive web-tool using this Python package to visualize genes on a graph created from GRCh38 is available at https://github.com/uio-cels/genomicgraphcoords .
Genome packaging in EL and Lin68, two giant phiKZ-like bacteriophages of P. aeruginosa
DOE Office of Scientific and Technical Information (OSTI.GOV)
Sokolova, O.S., E-mail: sokolova@mail.bio.msu.ru; A.V. Shoubnikov Institute of Crystallography RAS, Moscow; Shaburova, O.V.
A unique feature of the Pseudomonas aeruginosa giant phage phiKZ is its way of genome packaging onto a spool-like protein structure, the inner body. Until recently, no similar structures have been detected in other phages. We have studied DNA packaging in P. aeruginosa phages EL and Lin68 using cryo-electron microscopy and revealed the presence of inner bodies. The shape and positioning of the inner body and the density of the DNA packaging in EL are different from those found in phiKZ and Lin68. This internal organization explains how the shorter EL genome is packed into a large EL capsid, whichmore » has the same external dimensions as the capsids of phiKZ and Lin68. The similarity in the structural organization in EL and other phiKZ-like phages indicates that EL is phylogenetically related to other phiKZ-like phages, and, despite the lack of detectable DNA homology, EL, phiKZ, and Lin68 descend from a common ancestor. - Highlights: • We performed a comparative structural study of giant P. aeruginosa phages: EL, Lin68 and phiKZ. • We revealed that the inner body is a common feature in giant phages. • The phage genome size correlates with the overall dimensions of the inner body.« less
A Thermodynamic Model for Genome Packaging in Hepatitis B Virus.
Kim, Jehoon; Wu, Jianzhong
2015-10-20
Understanding the fundamentals of genome packaging in viral capsids is important for finding effective antiviral strategies and for utilizing benign viral particles for gene therapy. While the structure of encapsidated genomic materials has been routinely characterized with experimental techniques such as cryo-electron microscopy and x-ray diffraction, much less is known about the molecular driving forces underlying genome assembly in an intracellular environment and its in vivo interactions with the capsid proteins. Here we study the thermodynamic basis of the pregenomic RNA encapsidation in human Hepatitis B virus in vivo using a coarse-grained molecular model that captures the essential components of nonspecific intermolecular interactions. The thermodynamic model is used to examine how the electrostatic interaction between the packaged RNA and the highly charged C-terminal domains (CTD) of capsid proteins regulate the nucleocapsid formation. The theoretical model predicts optimal RNA content in Hepatitis B virus nucleocapsids with different CTD lengths in good agreement with mutagenesis measurements, confirming the predominant role of electrostatic interactions and molecular excluded-volume effects in genome packaging. We find that the amount of encapsidated RNA is not linearly correlated with the net charge of CTD tails as suggested by earlier theoretical studies. Our thermodynamic analysis of the nucleocapsid structure and stability indicates that ∼10% of the CTD residues are free from complexation with RNA, resulting in partially exposed CTD tails. The thermodynamic model also predicts the free energy of complex formation between macromolecules, which corroborates experimental results for the impact of CTD truncation on the nucleocapsid stability. Copyright © 2015 Biophysical Society. Published by Elsevier Inc. All rights reserved.
DOE Office of Scientific and Technical Information (OSTI.GOV)
Sun, Siyang; Gao, Song; Kondabagil, Kiran
2012-04-04
Tailed DNA bacteriophages assemble empty procapsids that are subsequently filled with the viral genome by means of a DNA packaging machine situated at a special fivefold vertex. The packaging machine consists of a 'small terminase' and a 'large terminase' component. One of the functions of the small terminase is to initiate packaging of the viral genome, whereas the large terminase is responsible for the ATP-powered translocation of DNA. The small terminase subunit has three domains, an N-terminal DNA-binding domain, a central oligomerization domain, and a C-terminal domain for interacting with the large terminase. Here we report structures of the centralmore » domain in two different oligomerization states for a small terminase from the T4 family of phages. In addition, we report biochemical studies that establish the function for each of the small terminase domains. On the basis of the structural and biochemical information, we propose a model for DNA packaging initiation.« less
Volkova, Eugenia; Gorchakov, Rodion; Frolov, Ilya
2008-01-01
Alphaviruses are regarded as attractive systems for expression of heterologous genes and development of recombinant vaccines. Venezuelan equine encephalitis virus (VEE)-based vectors are particularly promising because of their specificity to lymphoid tissues and strong resistance to interferon. To improve understanding of the VEE genome packaging and optimize application of this virus as a vector, we analyzed in more detail the mechanism of packaging of the VEE-specific RNAs. The presence of the RNAs in the VEE particles during serial passaging in tissue culture was found to depend not only on the presence of packaging signal(s), but also on the ability of these RNAs to express in cis nsP1, nsP2 and nsP3 in the form of a P123 precursor. Packaging of VEE genomes into infectious virions was also found to be more efficient compared to that of Sindbis virus, in spite of lower levels of RNA replication and structural protein production. PMID:16239019
Borst, Eva Maria; Kleine-Albers, Jennifer; Gabaev, Ildar; Babić, Marina; Wagner, Karen; Binz, Anne; Degenhardt, Inga; Kalesse, Markus; Jonjić, Stipan; Bauerfeind, Rudolf
2013-01-01
Cleavage of human cytomegalovirus (HCMV) genomes as well as their packaging into capsids is an enzymatic process mediated by viral proteins and therefore a promising target for antiviral therapy. The HCMV proteins pUL56 and pUL89 form the terminase and play a central role in cleavage-packaging, but several additional viral proteins, including pUL51, had been suggested to contribute to this process, although they remain largely uncharacterized. To study the function of pUL51 in infected cells, we constructed HCMV mutants encoding epitope-tagged versions of pUL51 and used a conditionally replicating virus (HCMV-UL51-ddFKBP), in which pUL51 levels could be regulated by a synthetic ligand. In cells infected with HCMV-UL51-ddFKBP, viral DNA replication was not affected when pUL51 was knocked down. However, no unit-length genomes and no DNA-filled C capsids were found, indicating that cleavage of concatemeric HCMV DNA and genome packaging into capsids did not occur in the absence of pUL51. pUL51 was expressed mainly with late kinetics and was targeted to nuclear replication compartments, where it colocalized with pUL56 and pUL89. Upon pUL51 knockdown, pUL56 and pUL89 were no longer detectable in replication compartments, suggesting that pUL51 is needed for their correct subnuclear localization. Moreover, pUL51 was found in a complex with the terminase subunits pUL56 and pUL89. Our data provide evidence that pUL51 is crucial for HCMV genome cleavage-packaging and may represent a third component of the viral terminase complex. Interference with the interactions between the terminase subunits by antiviral drugs could be a strategy to disrupt the HCMV replication cycle. PMID:23175377
SL1 revisited: functional analysis of the structure and conformation of HIV-1 genome RNA.
Sakuragi, Sayuri; Yokoyama, Masaru; Shioda, Tatsuo; Sato, Hironori; Sakuragi, Jun-Ichi
2016-11-11
The dimer initiation site/dimer linkage sequence (DIS/DLS) region of HIV is located on the 5' end of the viral genome and suggested to form complex secondary/tertiary structures. Within this structure, stem-loop 1 (SL1) is believed to be most important and an essential key to dimerization, since the sequence and predicted secondary structure of SL1 are highly stable and conserved among various virus subtypes. In particular, a six-base palindromic sequence is always present at the hairpin loop of SL1 and the formation of kissing-loop structure at this position between the two strands of genomic RNA is suggested to trigger dimerization. Although the higher-order structure model of SL1 is well accepted and perhaps even undoubted lately, there could be stillroom for consideration to depict the functional SL1 structure while in vivo (in virion or cell). In this study, we performed several analyses to identify the nucleotides and/or basepairing within SL1 which are necessary for HIV-1 genome dimerization, encapsidation, recombination and infectivity. We unexpectedly found that some nucleotides that are believed to contribute the formation of the stem do not impact dimerization or infectivity. On the other hand, we found that one G-C basepair involved in stem formation may serve as an alternative dimer interactive site. We also report on our further investigation of the roles of the palindromic sequences on viral replication. Collectively, we aim to assemble a more-comprehensive functional map of SL1 on the HIV-1 viral life cycle. We discovered several possibilities for a novel structure of SL1 in HIV-1 DLS. The newly proposed structure model suggested that the hairpin loop of SL1 appeared larger, and genome dimerization process might consist of more complicated mechanism than previously understood. Further investigations would be still required to fully understand the genome packaging and dimerization of HIV.
Technical note: an R package for fitting sparse neural networks with application in animal breeding.
Wang, Yangfan; Mi, Xue; Rosa, Guilherme J M; Chen, Zhihui; Lin, Ping; Wang, Shi; Bao, Zhenmin
2018-05-04
Neural networks (NNs) have emerged as a new tool for genomic selection (GS) in animal breeding. However, the properties of NN used in GS for the prediction of phenotypic outcomes are not well characterized due to the problem of over-parameterization of NN and difficulties in using whole-genome marker sets as high-dimensional NN input. In this note, we have developed an R package called snnR that finds an optimal sparse structure of a NN by minimizing the square error subject to a penalty on the L1-norm of the parameters (weights and biases), therefore solving the problem of over-parameterization in NN. We have also tested some models fitted in the snnR package to demonstrate their feasibility and effectiveness to be used in several cases as examples. In comparison of snnR to the R package brnn (the Bayesian regularized single layer NNs), with both using the entries of a genotype matrix or a genomic relationship matrix as inputs, snnR has greatly improved the computational efficiency and the prediction ability for the GS in animal breeding because snnR implements a sparse NN with many hidden layers.
Sun, Meng; Grigsby, Iwen F; Gorelick, Robert J; Mansky, Louis M; Musier-Forsyth, Karin
2014-01-01
Retroviral RNA encapsidation involves a recognition event between genomic RNA (gRNA) and one or more domains in Gag. In HIV-1, the nucleocapsid (NC) domain is involved in gRNA packaging and displays robust nucleic acid (NA) binding and chaperone functions. In comparison, NC of human T-cell leukemia virus type 1 (HTLV-1), a deltaretrovirus, displays weaker NA binding and chaperone activity. Mutation of conserved charged residues in the deltaretrovirus bovine leukemia virus (BLV) matrix (MA) and NC domains affects virus replication and gRNA packaging efficiency. Based on these observations, we hypothesized that the MA domain may generally contribute to NA binding and genome encapsidation in deltaretroviruses. Here, we examined the interaction between HTLV-2 and HIV-1 MA proteins and various NAs in vitro. HTLV-2 MA displays higher NA binding affinity and better chaperone activity than HIV-1 MA. HTLV-2 MA also binds NAs with higher affinity than HTLV-2 NC and displays more robust chaperone function. Mutation of two basic residues in HTLV-2 MA α-helix II, previously implicated in BLV gRNA packaging, reduces NA binding affinity. HTLV-2 MA binds with high affinity and specificity to RNA derived from the putative packaging signal of HTLV-2 relative to nonspecific NA. Furthermore, an HIV-1 MA triple mutant designed to mimic the basic character of HTLV-2 MA α-helix II dramatically improves binding affinity and chaperone activity of HIV-1 MA in vitro and restores RNA packaging to a ΔNC HIV-1 variant in cell-based assays. Taken together, these results are consistent with a role for deltaretrovirus MA proteins in viral RNA packaging.
Orchestrating high-throughput genomic analysis with Bioconductor
Huber, Wolfgang; Carey, Vincent J.; Gentleman, Robert; Anders, Simon; Carlson, Marc; Carvalho, Benilton S.; Bravo, Hector Corrada; Davis, Sean; Gatto, Laurent; Girke, Thomas; Gottardo, Raphael; Hahne, Florian; Hansen, Kasper D.; Irizarry, Rafael A.; Lawrence, Michael; Love, Michael I.; MacDonald, James; Obenchain, Valerie; Oleś, Andrzej K.; Pagès, Hervé; Reyes, Alejandro; Shannon, Paul; Smyth, Gordon K.; Tenenbaum, Dan; Waldron, Levi; Morgan, Martin
2015-01-01
Bioconductor is an open-source, open-development software project for the analysis and comprehension of high-throughput data in genomics and molecular biology. The project aims to enable interdisciplinary research, collaboration and rapid development of scientific software. Based on the statistical programming language R, Bioconductor comprises 934 interoperable packages contributed by a large, diverse community of scientists. Packages cover a range of bioinformatic and statistical applications. They undergo formal initial review and continuous automated testing. We present an overview for prospective users and contributors. PMID:25633503
CMG-Biotools, a Free Workbench for Basic Comparative Microbial Genomics
Vesth, Tammi; Lagesen, Karin; Acar, Öncel; Ussery, David
2013-01-01
Background Today, there are more than a hundred times as many sequenced prokaryotic genomes than were present in the year 2000. The economical sequencing of genomic DNA has facilitated a whole new approach to microbial genomics. The real power of genomics is manifested through comparative genomics that can reveal strain specific characteristics, diversity within species and many other aspects. However, comparative genomics is a field not easily entered into by scientists with few computational skills. The CMG-biotools package is designed for microbiologists with limited knowledge of computational analysis and can be used to perform a number of analyses and comparisons of genomic data. Results The CMG-biotools system presents a stand-alone interface for comparative microbial genomics. The package is a customized operating system, based on Xubuntu 10.10, available through the open source Ubuntu project. The system can be installed on a virtual computer, allowing the user to run the system alongside any other operating system. Source codes for all programs are provided under GNU license, which makes it possible to transfer the programs to other systems if so desired. We here demonstrate the package by comparing and analyzing the diversity within the class Negativicutes, represented by 31 genomes including 10 genera. The analyses include 16S rRNA phylogeny, basic DNA and codon statistics, proteome comparisons using BLAST and graphical analyses of DNA structures. Conclusion This paper shows the strength and diverse use of the CMG-biotools system. The system can be installed on a vide range of host operating systems and utilizes as much of the host computer as desired. It allows the user to compare multiple genomes, from various sources using standardized data formats and intuitive visualizations of results. The examples presented here clearly shows that users with limited computational experience can perform complicated analysis without much training. PMID:23577086
Ostrovnaya, Irina; Seshan, Venkatraman E; Olshen, Adam B; Begg, Colin B
2011-06-15
If a cancer patient develops multiple tumors, it is sometimes impossible to determine whether these tumors are independent or clonal based solely on pathological characteristics. Investigators have studied how to improve this diagnostic challenge by comparing the presence of loss of heterozygosity (LOH) at selected genetic locations of tumor samples, or by comparing genomewide copy number array profiles. We have previously developed statistical methodology to compare such genomic profiles for an evidence of clonality. We assembled the software for these tests in a new R package called 'Clonality'. For LOH profiles, the package contains significance tests. The analysis of copy number profiles includes a likelihood ratio statistic and reference distribution, as well as an option to produce various plots that summarize the results. Bioconductor (http://bioconductor.org/packages/release/bioc/html/Clonality.html) and http://www.mskcc.org/mskcc/html/13287.cfm.
Kravatsky, Yuri V; Chechetkin, Vladimir R; Tchurikov, Nikolai A; Kravatskaya, Galina I
2015-02-01
The broad class of tasks in genetics and epigenetics can be reduced to the study of various features that are distributed over the genome (genome tracks). The rapid and efficient processing of the huge amount of data stored in the genome-scale databases cannot be achieved without the software packages based on the analytical criteria. However, strong inhomogeneity of genome tracks hampers the development of relevant statistics. We developed the criteria for the assessment of genome track inhomogeneity and correlations between two genome tracks. We also developed a software package, Genome Track Analyzer, based on this theory. The theory and software were tested on simulated data and were applied to the study of correlations between CpG islands and transcription start sites in the Homo sapiens genome, between profiles of protein-binding sites in chromosomes of Drosophila melanogaster, and between DNA double-strand breaks and histone marks in the H. sapiens genome. Significant correlations between transcription start sites on the forward and the reverse strands were observed in genomes of D. melanogaster, Caenorhabditis elegans, Mus musculus, H. sapiens, and Danio rerio. The observed correlations may be related to the regulation of gene expression in eukaryotes. Genome Track Analyzer is freely available at http://ancorr.eimb.ru/. © The Author 2015. Published by Oxford University Press on behalf of Kazusa DNA Research Institute.
O'Mahony, Patrick; Schäfer, Mike Steffen
2005-02-01
The essay compares German and Irish media coverage of human genome research in the year 2000, using qualitative and quantitative frame analysis of a print media corpus. Drawing from a media-theoretical account of science communication, the study examines four analytic dimensions: (1) the influence of global and national sources of discourse; (2) the nature of elaboration on important themes; (3) the extent of societal participation in discourse production; (4) the cultural conditions in which the discourse resonates. The analysis shows that a global discursive package, emphasizing claims of scientific achievement and medical progress, dominates media coverage in both countries. However, German coverage is more extensive and elaborate, and includes a wider range of participants. Irish coverage more often incorporates the global package without further elaboration. These finding indicate that the global package is 'localized' differently due to national patterns of interests, German participation in human genome research, traditions of media coverage, and the domestic resonance of the issue.
Mechanism for Coordinated RNA Packaging and Genome Replication by Rotavirus Polymerase VP1
DOE Office of Scientific and Technical Information (OSTI.GOV)
Lu, Xiaohui; McDonald, Sarah M.; Tortorici, M. Alejandra
2009-04-08
Rotavirus RNA-dependent RNA polymerase VP1 catalyzes RNA synthesis within a subviral particle. This activity depends on core shell protein VP2. A conserved sequence at the 3' end of plus-strand RNA templates is important for polymerase association and genome replication. We have determined the structure of VP1 at 2.9 {angstrom} resolution, as apoenzyme and in complex with RNA. The cage-like enzyme is similar to reovirus {lambda}3, with four tunnels leading to or from a central, catalytic cavity. A distinguishing characteristic of VP1 is specific recognition, by conserved features of the template-entry channel, of four bases, UGUG, in the conserved 3' sequence.more » Well-defined interactions with these bases position the RNA so that its 3' end overshoots the initiating register, producing a stable but catalytically inactive complex. We propose that specific 3' end recognition selects rotavirus RNA for packaging and that VP2 activates the autoinhibited VP1/RNA complex to coordinate packaging and genome replication.« less
The Systems Biology Markup Language (SBML) Level 3 Package: Flux Balance Constraints.
Olivier, Brett G; Bergmann, Frank T
2015-09-04
Constraint-based modeling is a well established modelling methodology used to analyze and study biological networks on both a medium and genome scale. Due to their large size, genome scale models are typically analysed using constraint-based optimization techniques. One widely used method is Flux Balance Analysis (FBA) which, for example, requires a modelling description to include: the definition of a stoichiometric matrix, an objective function and bounds on the values that fluxes can obtain at steady state. The Flux Balance Constraints (FBC) Package extends SBML Level 3 and provides a standardized format for the encoding, exchange and annotation of constraint-based models. It includes support for modelling concepts such as objective functions, flux bounds and model component annotation that facilitates reaction balancing. The FBC package establishes a base level for the unambiguous exchange of genome-scale, constraint-based models, that can be built upon by the community to meet future needs (e. g. by extending it to cover dynamic FBC models).
Gopal, Radhika; Venter, P. Arno; Schneemann, Anette
2014-01-01
Nodaviruses are icosahedral viruses with a bipartite, positive-sense RNA genome. The two RNAs are packaged into a single virion by a poorly understood mechanism. We chose two distantly related nodaviruses, Flock House virus and Nodamura virus, to explore formation of viral reassortants as a means to further understand genome recognition and encapsidation. In mixed infections, the viruses were incompatible at the level of RNA replication and their coat proteins segregated into separate populations of progeny particles. RNA packaging, on the other hand, was indiscriminate as all four viral RNAs were detectable in each progeny population. Consistent with the trans-encapsidation phenotype, fluorescence in situ hybridization of viral RNA revealed that the genomes of the two viruses co-localized throughout the cytoplasm. Our results imply that nodaviral RNAs lack rigorously defined packaging signals and that coencapsidation of the viral RNAs does not require a pair of cognate RNA1 and RNA2. PMID:24725955
The Systems Biology Markup Language (SBML) Level 3 Package: Flux Balance Constraints.
Olivier, Brett G; Bergmann, Frank T
2015-06-01
Constraint-based modeling is a well established modelling methodology used to analyze and study biological networks on both a medium and genome scale. Due to their large size, genome scale models are typically analysed using constraint-based optimization techniques. One widely used method is Flux Balance Analysis (FBA) which, for example, requires a modelling description to include: the definition of a stoichiometric matrix, an objective function and bounds on the values that fluxes can obtain at steady state. The Flux Balance Constraints (FBC) Package extends SBML Level 3 and provides a standardized format for the encoding, exchange and annotation of constraint-based models. It includes support for modelling concepts such as objective functions, flux bounds and model component annotation that facilitates reaction balancing. The FBC package establishes a base level for the unambiguous exchange of genome-scale, constraint-based models, that can be built upon by the community to meet future needs (e. g. by extending it to cover dynamic FBC models).
Padilla-Sanchez, Victor; Gao, Song; Kim, Hyung Rae; Kihara, Daisuke; Sun, Lei; Rossmann, Michael G; Rao, Venigalla B
2014-03-06
Tailed bacteriophages and herpesviruses consist of a structurally well conserved dodecameric portal at a special 5-fold vertex of the capsid. The portal plays critical roles in head assembly, genome packaging, neck/tail attachment, and genome ejection. Although the structures of portals from phages φ29, SPP1, and P22 have been determined, their mechanistic roles have not been well understood. Structural analysis of phage T4 portal (gp20) has been hampered because of its unusual interaction with the Escherichia coli inner membrane. Here, we predict atomic models for the T4 portal monomer and dodecamer, and we fit the dodecamer into the cryo-electron microscopy density of the phage portal vertex. The core structure, like that from other phages, is cone shaped with the wider end containing the "wing" and "crown" domains inside the phage head. A long "stem" encloses a central channel, and a narrow "stalk" protrudes outside the capsid. A biochemical approach was developed to analyze portal function by incorporating plasmid-expressed portal protein into phage heads and determining the effect of mutations on head assembly, DNA translocation, and virion production. We found that the protruding loops of the stalk domain are involved in assembling the DNA packaging motor. A loop that connects the stalk to the channel might be required for communication between the motor and the portal. The "tunnel" loops that project into the channel are essential for sealing the packaged head. These studies established that the portal is required throughout the DNA packaging process, with different domains participating at different stages of genome packaging. © 2013.
iGC-an integrated analysis package of gene expression and copy number alteration.
Lai, Yi-Pin; Wang, Liang-Bo; Wang, Wei-An; Lai, Liang-Chuan; Tsai, Mong-Hsun; Lu, Tzu-Pin; Chuang, Eric Y
2017-01-14
With the advancement in high-throughput technologies, researchers can simultaneously investigate gene expression and copy number alteration (CNA) data from individual patients at a lower cost. Traditional analysis methods analyze each type of data individually and integrate their results using Venn diagrams. Challenges arise, however, when the results are irreproducible and inconsistent across multiple platforms. To address these issues, one possible approach is to concurrently analyze both gene expression profiling and CNAs in the same individual. We have developed an open-source R/Bioconductor package (iGC). Multiple input formats are supported and users can define their own criteria for identifying differentially expressed genes driven by CNAs. The analysis of two real microarray datasets demonstrated that the CNA-driven genes identified by the iGC package showed significantly higher Pearson correlation coefficients with their gene expression levels and copy numbers than those genes located in a genomic region with CNA. Compared with the Venn diagram approach, the iGC package showed better performance. The iGC package is effective and useful for identifying CNA-driven genes. By simultaneously considering both comparative genomic and transcriptomic data, it can provide better understanding of biological and medical questions. The iGC package's source code and manual are freely available at https://www.bioconductor.org/packages/release/bioc/html/iGC.html .
From Cells to Virus Particles: Quantitative Methods to Monitor RNA Packaging
Ferrer, Mireia; Henriet, Simon; Chamontin, Célia; Lainé, Sébastien; Mougel, Marylène
2016-01-01
In cells, positive strand RNA viruses, such as Retroviridae, must selectively recognize their full-length RNA genome among abundant cellular RNAs to assemble and release particles. How viruses coordinate the intracellular trafficking of both RNA and protein components to the assembly sites of infectious particles at the cell surface remains a long-standing question. The mechanisms ensuring packaging of genomic RNA are essential for viral infectivity. Since RNA packaging impacts on several essential functions of retroviral replication such as RNA dimerization, translation and recombination events, there are many studies that require the determination of RNA packaging efficiency and/or RNA packaging ability. Studies of RNA encapsidation rely upon techniques for the identification and quantification of RNA species packaged by the virus. This review focuses on the different approaches available to monitor RNA packaging: Northern blot analysis, ribonuclease protection assay and quantitative reverse transcriptase-coupled polymerase chain reaction as well as the most recent RNA imaging and sequencing technologies. Advantages, disadvantages and limitations of these approaches will be discussed in order to help the investigator to choose the most appropriate technique. Although the review was written with the prototypic simple murine leukemia virus (MLV) and complex human immunodeficiency virus type 1 (HIV-1) in mind, the techniques were described in order to benefit to a larger community. PMID:27556480
Solid-to-fluid – like DNA transition in viruses facilitates infection
DOE Office of Scientific and Technical Information (OSTI.GOV)
Liu, Ting; Sae-Ueng, Udom; Li, Dong
2014-10-14
Releasing the packaged viral DNA into the host cell is an essential process to initiate viral infection. In many double-stranded DNA bacterial viruses and herpesviruses, the tightly packaged genome is hexagonally ordered and stressed in the protein shell, called the capsid. DNA condensed in this state inside viral capsids has been shown to be trapped in a glassy state, with restricted molecular motion in vitro. This limited intracapsid DNA mobility is caused by the sliding friction between closely packaged DNA strands, as a result of the repulsive interactions between the negative charges on the DNA helices. It had been unclearmore » how this rigid crystalline structure of the viral genome rapidly ejects from the capsid, reaching rates of 60,000 bp/s. Through a combination of single- molecule and bulk techniques, we determined how the structure and energy of the encapsidated DNA in phage λ regulates the mobility required for its ejection. Our data show that packaged λ -DNA undergoes a solid-to-fluid – like disordering transition as a function of temperature, resultin g locally in less densely packed DNA, reducing DNA – DNA repulsions. This p rocess leads to a sig- nificant increase in genome mobility or fluidity, which facilitates genome release at temperatures close to that of viral infection (37 °C), suggesting a remarkab le physical adaptation of bac- terial viruses to the environment of Escherichia coli cells in a human host.« less
mySyntenyPortal: an application package to construct websites for synteny block analysis.
Lee, Jongin; Lee, Daehwan; Sim, Mikang; Kwon, Daehong; Kim, Juyeon; Ko, Younhee; Kim, Jaebum
2018-06-05
Advances in sequencing technologies have facilitated large-scale comparative genomics based on whole genome sequencing. Constructing and investigating conserved genomic regions among multiple species (called synteny blocks) are essential in the comparative genomics. However, they require significant amounts of computational resources and time in addition to bioinformatics skills. Many web interfaces have been developed to make such tasks easier. However, these web interfaces cannot be customized for users who want to use their own set of genome sequences or definition of synteny blocks. To resolve this limitation, we present mySyntenyPortal, a stand-alone application package to construct websites for synteny block analyses by using users' own genome data. mySyntenyPortal provides both command line and web-based interfaces to build and manage websites for large-scale comparative genomic analyses. The websites can be also easily published and accessed by other users. To demonstrate the usability of mySyntenyPortal, we present an example study for building websites to compare genomes of three mammalian species (human, mouse, and cow) and show how they can be easily utilized to identify potential genes affected by genome rearrangements. mySyntenyPortal will contribute for extended comparative genomic analyses based on large-scale whole genome sequences by providing unique functionality to support the easy creation of interactive websites for synteny block analyses from user's own genome data.
Maya Miles, Douglas; Peñate, Xenia; Sanmartín Olmo, Trinidad; Jourquin, Frederic; Muñoz Centeno, Maria Cruz; Mendoza, Manuel; Simon, Marie-Noelle; Chavez, Sebastian; Geli, Vincent
2018-03-27
Whole-genome duplications (WGDs) have played a central role in the evolution of genomes and constitute an important source of genome instability in cancer. Here, we show in Saccharomyces cerevisiae that abnormal accumulations of histones are sufficient to induce WGDs. Our results link these WGDs to a reduced incorporation of the histone variant H2A.Z to chromatin. Moreover, we show that high levels of histones promote Swe1 WEE1 stabilisation thereby triggering the phosphorylation and inhibition of Cdc28 CDK1 through a mechanism different of the canonical DNA damage response. Our results link high levels of histones to a specific type of genome instability that is quite frequently observed in cancer and uncovers a new mechanism that might be able to respond to high levels of histones. © 2018, Maya Miles et al.
Peñate, Xenia; Sanmartín Olmo, Trinidad; Jourquin, Frederic; Muñoz Centeno, Maria Cruz; Mendoza, Manuel; Simon, Marie-Noelle; Chavez, Sebastian
2018-01-01
Whole-genome duplications (WGDs) have played a central role in the evolution of genomes and constitute an important source of genome instability in cancer. Here, we show in Saccharomyces cerevisiae that abnormal accumulations of histones are sufficient to induce WGDs. Our results link these WGDs to a reduced incorporation of the histone variant H2A.Z to chromatin. Moreover, we show that high levels of histones promote Swe1WEE1 stabilisation thereby triggering the phosphorylation and inhibition of Cdc28CDK1 through a mechanism different of the canonical DNA damage response. Our results link high levels of histones to a specific type of genome instability that is quite frequently observed in cancer and uncovers a new mechanism that might be able to respond to high levels of histones. PMID:29580382
Greenwald, William W; Li, He; Smith, Erin N; Benaglio, Paola; Nariai, Naoki; Frazer, Kelly A
2017-04-07
Genomic interaction studies use next-generation sequencing (NGS) to examine the interactions between two loci on the genome, with subsequent bioinformatics analyses typically including annotation, intersection, and merging of data from multiple experiments. While many file types and analysis tools exist for storing and manipulating single locus NGS data, there is currently no file standard or analysis tool suite for manipulating and storing paired-genomic-loci: the data type resulting from "genomic interaction" studies. As genomic interaction sequencing data are becoming prevalent, a standard file format and tools for working with these data conveniently and efficiently are needed. This article details a file standard and novel software tool suite for working with paired-genomic-loci data. We present the paired-genomic-loci (PGL) file standard for genomic-interactions data, and the accompanying analysis tool suite "pgltools": a cross platform, pypy compatible python package available both as an easy-to-use UNIX package, and as a python module, for integration into pipelines of paired-genomic-loci analyses. Pgltools is a freely available, open source tool suite for manipulating paired-genomic-loci data. Source code, an in-depth manual, and a tutorial are available publicly at www.github.com/billgreenwald/pgltools , and a python module of the operations can be installed from PyPI via the PyGLtools module.
Morgan, Martin; Anders, Simon; Lawrence, Michael; Aboyoun, Patrick; Pagès, Hervé; Gentleman, Robert
2009-01-01
Summary: ShortRead is a package for input, quality assessment, manipulation and output of high-throughput sequencing data. ShortRead is provided in the R and Bioconductor environments, allowing ready access to additional facilities for advanced statistical analysis, data transformation, visualization and integration with diverse genomic resources. Availability and Implementation: This package is implemented in R and available at the Bioconductor web site; the package contains a ‘vignette’ outlining typical work flows. Contact: mtmorgan@fhcrc.org PMID:19654119
Structure of Ljungan virus provides insight into genome packaging of this picornavirus
NASA Astrophysics Data System (ADS)
Zhu, Ling; Wang, Xiangxi; Ren, Jingshan; Porta, Claudine; Wenham, Hannah; Ekström, Jens-Ola; Panjwani, Anusha; Knowles, Nick J.; Kotecha, Abhay; Siebert, C. Alistair; Lindberg, A. Michael; Fry, Elizabeth E.; Rao, Zihe; Tuthill, Tobias J.; Stuart, David I.
2015-10-01
Picornaviruses are responsible for a range of human and animal diseases, but how their RNA genome is packaged remains poorly understood. A particularly poorly studied group within this family are those that lack the internal coat protein, VP4. Here we report the atomic structure of one such virus, Ljungan virus, the type member of the genus Parechovirus B, which has been linked to diabetes and myocarditis in humans. The 3.78-Å resolution cryo-electron microscopy structure shows remarkable features, including an extended VP1 C terminus, forming a major protuberance on the outer surface of the virus, and a basic motif at the N terminus of VP3, binding to which orders some 12% of the viral genome. This apparently charge-driven RNA attachment suggests that this branch of the picornaviruses uses a different mechanism of genome encapsidation, perhaps explored early in the evolution of picornaviruses.
GST-PRIME: an algorithm for genome-wide primer design.
Leister, Dario; Varotto, Claudio
2007-01-01
The profiling of mRNA expression based on DNA arrays has become a powerful tool to study genome-wide transcription of genes in a number of organisms. GST-PRIME is a software package created to facilitate large-scale primer design for the amplification of probes to be immobilized on arrays for transcriptome analyses, even though it can be also applied in low-throughput approaches. GST-PRIME allows highly efficient, direct amplification of gene-sequence tags (GSTs) from genomic DNA (gDNA), starting from annotated genome or transcript sequences. GST-PRIME provides a customer-friendly platform for automatic primer design, and despite the relative simplicity of the algorithm, experimental tests in the model plant species Arabidopsis thaliana confirmed the reliability of the software. This chapter describes the algorithm used for primer design, its input and output files, and the installation of the standalone package and its use.
Structure of Ljungan virus provides insight into genome packaging of this picornavirus.
Zhu, Ling; Wang, Xiangxi; Ren, Jingshan; Porta, Claudine; Wenham, Hannah; Ekström, Jens-Ola; Panjwani, Anusha; Knowles, Nick J; Kotecha, Abhay; Siebert, C Alistair; Lindberg, A Michael; Fry, Elizabeth E; Rao, Zihe; Tuthill, Tobias J; Stuart, David I
2015-10-08
Picornaviruses are responsible for a range of human and animal diseases, but how their RNA genome is packaged remains poorly understood. A particularly poorly studied group within this family are those that lack the internal coat protein, VP4. Here we report the atomic structure of one such virus, Ljungan virus, the type member of the genus Parechovirus B, which has been linked to diabetes and myocarditis in humans. The 3.78-Å resolution cryo-electron microscopy structure shows remarkable features, including an extended VP1 C terminus, forming a major protuberance on the outer surface of the virus, and a basic motif at the N terminus of VP3, binding to which orders some 12% of the viral genome. This apparently charge-driven RNA attachment suggests that this branch of the picornaviruses uses a different mechanism of genome encapsidation, perhaps explored early in the evolution of picornaviruses.
Padilla-Sanchez, Victor; Gao, Song; Kim, Hyung Rae; Kihara, Daisuke; Sun, Lei; Rossmann, Michael G.; Rao, Venigalla B.
2013-01-01
Tailed bacteriophages and herpesviruses consist of a structurally well conserved dodecameric portal at a special five-fold vertex of the capsid. The portal plays critical roles in head assembly, genome packaging, neck/tail attachment, and genome ejection. Although the structures of portals from phages φ29, SPP1 and P22 have been determined, their mechanistic roles have not been well understood. Structural analysis of phage T4 portal (gp20) has been hampered because of its unusual interaction with the E. coli inner membrane. Here, we predict atomic models for the T4 portal monomer and dodecamer, and fit the dodecamer into the cryoEM density of the phage portal vertex. The core structure, like that from other phages, is cone-shaped with the wider end containing the “wing” and “crown” domains inside the phage head. A long “stem” encloses a central channel, and a narrow “stalk” protrudes outside the capsid. A biochemical approach was developed to analyze portal function by incorporating plasmid-expressed portal protein into phage heads and determining the effect of mutations on head assembly, DNA translocation, and virion production. We found that the protruding loops of the stalk domain are involved in assembling the DNA packaging motor. A loop that connects the stalk to the channel might be required for communication between the motor and portal. The “tunnel” loops that project into the channel are essential for sealing the packaged head. These studies established that the portal is required throughout the DNA packaging process, with different domains participating at different stages of genome packaging. PMID:24126213
viRome: an R package for the visualization and analysis of viral small RNA sequence datasets.
Watson, Mick; Schnettler, Esther; Kohl, Alain
2013-08-01
RNA interference (RNAi) is known to play an important part in defence against viruses in a range of species. Second-generation sequencing technologies allow us to assay these systems and the small RNAs that play a key role with unprecedented depth. However, scientists need access to tools that can condense, analyse and display the resulting data. Here, we present viRome, a package for R that takes aligned sequence data and produces a range of essential plots and reports. viRome is released under the BSD license as a package for R available for both Windows and Linux http://virome.sf.net. Additional information and a tutorial is available on the ARK-Genomics website: http://www.ark-genomics.org/bioinformatics/virome. mick.watson@roslin.ed.ac.uk.
Blood leak alarm interference by hydoxocobalamin is hemodialysis machine dependent.
Sutter, M E; Clarke, M E; Cobb, J; Daubert, G P; Rathore, V S; Aston, L S; Poppenga, R H; Ford, J B; Owen, K P; Albertson, T E
2012-12-01
Hydroxocobalamin has been reported to interfere with the blood leak alarm on hemodialysis machines making it difficult to use this treatment modality after hydroxocobalamin infusion. The objective was to determine if this interference with hydroxocobalamin occurs across hemodialysis machines by different manufacturers. Additionally, we aimed to see if this represented a colorimetric interference alone or if it is the optical properties of hydroxocobalamin. Hydroxocobalamin was reconstituted per package insert. Food coloring was added to 0.9% saline to create the colors of the visual spectrum. Optical properties of absorbance and transmittance were measured. Hydroxocobalamin and the saline solutions were infused into the Fresenius 2008K™ and the Gambro Phoenix X36™ machines. Times were recorded from the start of the machine until the solution finished or the alarm triggered. When evaluating the Gambro Phoenix X36™ machine and dialysis circuit; the alarm did not trigger. In contrast, the blood leak alarm on the Fresenius 2008K™ machine was tripped by both the red solution and hydoxocobalamin infused per the package insert. The alarm stopped the machine between 128 and 132 seconds for the red solution and between 30 and 35 seconds with the hydroxocobalamin. Membranes of the circuits where the alarm tripped were examined and remained intact without blood. Results were validated on different machines with new circuits. Hydroxocobalamin infusion per package insert and the red saline solution prepared with Red Dye 40 both triggered the blood leak alarm and stopped the Fresenius 2008K™ machine. However, this was not true for the Gambro Phoenix X36™ machine as the alarm never triggered. The interference with the Fresenius 2008K™ appears colorimetric due to normal saline with Red Dye 40 triggering the alarm. We alert physicians to become familiar with the properties of individual dialysis machines prior to use of hydroxocobalamin. When facing difficulties with hemodialysis after the administration of hydroxocobalamin, consider attempting with a different manufactures machine or model if available or contact the manufacturer directly.
mRNA Molecules Containing Murine Leukemia Virus Packaging Signals Are Encapsidated as Dimers
Hibbert, Catherine S.; Mirro, Jane; Rein, Alan
2004-01-01
Prior work by others has shown that insertion of ψ (i.e., leader) sequences from the Moloney murine leukemia virus (MLV) genome into the 3′ untranslated region of a nonviral mRNA leads to the specific encapsidation of this RNA in MLV particles. We now report that these RNAs are, like genomic RNAs, encapsidated as dimers. These dimers have the same thermostability as MLV genomic RNA dimers; like them, these dimers are more stable if isolated from mature virions than from immature virions. We characterized encapsidated mRNAs containing deletions or truncations of MLV ψ or with ψ sequences from MLV-related acute transforming viruses. The results indicate that the dimeric linkage in genomic RNA can be completely attributed to the ψ region of the genome. While this conclusion agrees with earlier electron microscopic studies on mature MLV dimers, it is the first evidence as to the site of the linkage in immature dimers for any retrovirus. Since the Ψ+ mRNA is not encapsidated as well as genomic RNA, it is only present in a minority of virions. The fact that it is nevertheless dimeric argues strongly that two of these molecules are packaged into particles together. We also found that the kissing loop is unnecessary for this coencapsidation or for the stability of mature dimers but makes a major contribution to the stability of immature dimers. Our results are consistent with the hypothesis that the packaging signal involves a dimeric structure in which the RNAs are joined by intermolecular interactions between GACG loops. PMID:15452213
Gruber, Bernd; Unmack, Peter J; Berry, Oliver F; Georges, Arthur
2018-05-01
Although vast technological advances have been made and genetic software packages are growing in number, it is not a trivial task to analyse SNP data. We announce a new r package, dartr, enabling the analysis of single nucleotide polymorphism data for population genomic and phylogenomic applications. dartr provides user-friendly functions for data quality control and marker selection, and permits rigorous evaluations of conformation to Hardy-Weinberg equilibrium, gametic-phase disequilibrium and neutrality. The package reports standard descriptive statistics, permits exploration of patterns in the data through principal components analysis and conducts standard F-statistics, as well as basic phylogenetic analyses, population assignment, isolation by distance and exports data to a variety of commonly used downstream applications (e.g., newhybrids, faststructure and phylogeny applications) outside of the r environment. The package serves two main purposes: first, a user-friendly approach to lower the hurdle to analyse such data-therefore, the package comes with a detailed tutorial targeted to the r beginner to allow data analysis without requiring deep knowledge of r. Second, we use a single, well-established format-genlight from the adegenet package-as input for all our functions to avoid data reformatting. By strictly using the genlight format, we hope to facilitate this format as the de facto standard of future software developments and hence reduce the format jungle of genetic data sets. The dartr package is available via the r CRAN network and GitHub. © 2017 John Wiley & Sons Ltd.
Confrontation, Consolidation, and Recognition: The Oocyte’s Perspective on the Incoming Sperm
Miller, David
2015-01-01
From the oocyte’s perspective, the incoming sperm poses a significant challenge. Despite (usually) arising from a male of the same species, the sperm is a “foreign” body that may carry with it additional, undesirable factors such as transposable elements (mainly retroposons) into the egg. These factors can arise either during spermatogenesis or while the sperm is moving through the epididymis or the female genital tract. Furthermore, in addition to the paternal genome, the sperm also carries its own complex repertoire of RNAs into the egg that includes mRNAs, lncRNAs, and sncRNAs. Last, the paternal genome itself is efficiently packaged into a protamine (nucleo-toroid) and histone (nucleosome)-based chromatin scaffold within which much of the RNA is embedded. Taken together, the sperm delivers a far more complex package to the egg than was originally thought. Understanding this complexity, at both the compositional and structural level, depends largely on investigating sperm chromatin from both the genomic (DNA packaging) and epigenomic (RNA carriage and extant histone modifications) perspectives. Why this complexity has arisen and its likely purpose requires us to look more closely at what happens in the oocyte when the sperm gains entry and the processes that then take place preparing the paternal (and maternal) genomes for syngamy. PMID:25957313
Sedivy, Arthur; Subirats, Xavier; Kowalski, Heinrich; Köhler, Gottfried; Blaas, Dieter
2013-01-01
Upon infection, many RNA viruses reorganize their capsid for release of the genome into the host cell cytosol for replication. Often, this process is triggered by receptor binding and/or by the acidic environment in endosomes. In the genus Enterovirus, which includes more than 150 human rhinovirus (HRV) serotypes causing the common cold, there is persuasive evidence that the viral RNA exits single-stranded through channels formed in the protein shell. We have determined the time-dependent emergence of the RNA ends from HRV2 on incubation of virions at 56°C using hybridization with specific oligonucleotides and detection by fluorescence correlation spectroscopy. We report that psoralen UV crosslinking prevents complete RNA release, allowing for identification of the sequences remaining inside the capsid. We also present the structure of uncoating intermediates in which parts of the RNA are condensed and take the form of a rod that is directed roughly towards a two-fold icosahedral axis, the presumed RNA exit point. Taken together, in contrast to schemes frequently depicted in textbooks and reviews, our findings demonstrate that exit of the RNA starts from the 3′-end. This suggests that packaging also occurs in an ordered manner resulting in the 3′-poly-(A) tail becoming located close to a position of pore formation during conversion of the virion into a subviral particle. This directional genome release may be common to many icosahedral non-enveloped single-stranded RNA viruses. PMID:23592991
Mustafa, Farah; Vivet-Boudou, Valérie; Jabeen, Ayesha; Ali, Lizna M; Kalloush, Rawan M; Marquet, Roland; Rizvi, Tahir A
2018-06-21
Packaging the mouse mammary tumor virus (MMTV) genomic RNA (gRNA) requires the entire 5' untranslated region (UTR) in conjunction with the first 120 nucleotides of the gag gene. This region includes several palindromic (pal) sequence(s) and stable stem loops (SLs). Among these, stem loop 4 (SL4) adopts a bifurcated structure consisting of three stems, two apical loops, and an internal loop. Pal II, located in one of the apical loops, mediates gRNA dimerization, a process intricately linked to packaging. We thus hypothesized that the bifurcated SL4 structure could constitute the major gRNA packaging determinant. To test this hypothesis, the two apical loops and the flanking sequences forming the bifurcated SL4 were individually mutated. These mutations all had deleterious effects on gRNA packaging and propagation. Next, single and compensatory mutants were designed to destabilize then recreate the bifurcated SL4 structure. A structure-function analysis using bioinformatics predictions and RNA chemical probing revealed that mutations that led to the loss of the SL4 bifurcated structure abrogated RNA packaging and propagation, while compensatory mutations that recreated the native SL4 structure restored RNA packaging and propagation to wild type levels. Altogether, our results demonstrate that SL4 constitutes the principal packaging determinant of MMTV gRNA. Our findings further suggest that SL4 acts as a structural switch that can not only differentiate between RNA for translation versus packaging/dimerization, but its location also allows differentiation between spliced and unspliced RNAs during gRNA encapsidation.
Kainov, Denis E; Pirttimaa, Markus; Tuma, Roman; Butcher, Sarah J; Thomas, George J; Bamford, Dennis H; Makeyev, Eugene V
2003-11-28
Genomes of complex viruses have been demonstrated, in many cases, to be packaged into preformed empty capsids (procapsids). This reaction is performed by molecular motors translocating nucleic acid against the concentration gradient at the expense of NTP hydrolysis. At present, the molecular mechanisms of packaging remain elusive due to the complex nature of packaging motors. In the case of the double-stranded RNA bacteriophage phi 6 from the Cystoviridae family, packaging of single-stranded genomic precursors requires a hexameric NTPase, P4. In the present study, the purified P4 proteins from two other cystoviruses, phi 8 and phi 13, were characterized and compared with phi 6 P4. All three proteins are hexameric, single-stranded RNA-stimulated NTPases with alpha/beta folds. Using a direct motor assay, we found that phi 8 and phi 13 P4 hexamers translocate 5' to 3' along ssRNA, whereas the analogous activity of phi 6 P4 requires association with the procapsid. This difference is explained by the intrinsically high affinity of phi 8 and phi 13 P4s for nucleic acids. The unidirectional translocation results in RNA helicase activity. Thus, P4 proteins of Cystoviridae exhibit extensive similarity to hexameric helicases and are simple models for studying viral packaging motor mechanisms.
Treviño, Victor; Tamez-Pena, Jose
2017-06-15
The association of genomic alterations to outcomes in cancer is affected by a problem of unbalanced groups generated by the low frequency of alterations. For this, an R package (VALORATE) that estimates the null distribution and the P -value of the log-rank based on a recent reformulation is presented. For a given number of alterations that define the size of survival groups, the log-rank density is estimated by a weighted sum of conditional distributions depending on a co-occurrence term of mutations and events. The estimations are accurately accelerated by sampling across co-occurrences allowing the analysis of large genomic datasets in few minutes. In conclusion, the proposed VALORATE R package is a valuable tool for survival analysis. The R package is available in CRAN at https://cran.r-project.org and in http://bioinformatica.mty.itesm.mx/valorateR . vtrevino@itesm.mx. Supplementary data are available at Bioinformatics online. © The Author 2017. Published by Oxford University Press. All rights reserved. For Permissions, please e-mail: journals.permissions@oup.com
Inhibition of HIV-1 by a peptide ligand of the genomic RNA packaging signal Psi.
Dietz, Julia; Koch, Joachim; Kaur, Ajit; Raja, Chinnappan; Stein, Stefan; Grez, Manuel; Pustowka, Anette; Mensch, Sarah; Ferner, Jan; Möller, Lars; Bannert, Norbert; Tampé, Robert; Divita, Gilles; Mély, Yves; Schwalbe, Harald; Dietrich, Ursula
2008-05-01
The interaction of the nucleocapsid NCp7 of the human immunodeficiency virus type 1 (HIV-1) Gag polyprotein with the RNA packaging signal Psi ensures specific encapsidation of the dimeric full length viral genome into nascent virus particles. Being an essential step in the HIV-1 replication cycle, specific genome encapsidation represents a promising target for therapeutic intervention. We previously selected peptides binding to HIV-1 Psi-RNA or stem loops (SL) thereof by phage display. Herein, we describe synthesis of peptide variants of the consensus HWWPWW motif on membrane supports to optimize Psi-RNA binding. The optimized peptide, psi-pepB, was characterized in detail with respect to its conformation and binding properties for the SL3 of the Psi packaging signal by NMR and tryptophan fluorescence quenching. Functional analysis revealed that psi-pepB caused a strong reduction of virus release by infected cells as monitored by reduced transduction efficiencies, capsid p24 antigen levels, and electron microscopy. Thus, this peptide shows antiviral activity and could serve as a lead compound to develop new drugs targeting HIV-1.
Genome-wide regression and prediction with the BGLR statistical package.
Pérez, Paulino; de los Campos, Gustavo
2014-10-01
Many modern genomic data analyses require implementing regressions where the number of parameters (p, e.g., the number of marker effects) exceeds sample size (n). Implementing these large-p-with-small-n regressions poses several statistical and computational challenges, some of which can be confronted using Bayesian methods. This approach allows integrating various parametric and nonparametric shrinkage and variable selection procedures in a unified and consistent manner. The BGLR R-package implements a large collection of Bayesian regression models, including parametric variable selection and shrinkage methods and semiparametric procedures (Bayesian reproducing kernel Hilbert spaces regressions, RKHS). The software was originally developed for genomic applications; however, the methods implemented are useful for many nongenomic applications as well. The response can be continuous (censored or not) or categorical (either binary or ordinal). The algorithm is based on a Gibbs sampler with scalar updates and the implementation takes advantage of efficient compiled C and Fortran routines. In this article we describe the methods implemented in BGLR, present examples of the use of the package, and discuss practical issues emerging in real-data analysis. Copyright © 2014 by the Genetics Society of America.
USDA-ARS?s Scientific Manuscript database
Allopolyploidization is considered an essential evolutionary process in plants that could trigger genomic shock in allopolyploid genome through activation of transcription of retrotransposons, which may be important in plant evolution. Two retrotransposon-based markers, inter-retrotransposon amplifi...
Dayer, Mohammad Reza; Dayer, Mohammad Saaid; Rezatofighi, Seyedeh Elham
2015-04-01
The Crimean-Congo Hemorrhagic Fever (CCHF) is an infectious disease of high virulence and mortality caused by a negative sense RNA nairovirus. The genomic RNA of CCHFV is enwrapped by its nucleoprotein. Positively charged residues on CCHFV nucleoprotein provide multiple binding sites to facilitate genomic RNA encapsidation. In the present work, we investigated the mechanism underlying preferential packaging of the negative sense genomic RNA by CCHFV nucleoprotein in the presence of host cell RNAs during viral assembly. The work included genome sequence analyses for different families of negative and positive sense RNA viruses, using serial docking experiments and molecular dynamic simulations. Our results indicated that the main determinant parameter of the nucleoprotein binding affinity for negative sense RNA is the ratio of purine/pyrimidine in the RNA molecule. A negative sense RNA with a purine/pyrimidine ratio (>1) higher than that of a positive sense RNA (<1) exhibits higher affinity for the nucleoprotein. Our calculations revealed that a negative sense RNA expresses about 0.5 kJ/mol higher binding energy per nucleotide compared to a positive sense RNA. This energy difference produces a binding energy high enough to make the negative sense RNA, the preferred substrate for packaging by CCHFV nucleoprotein in the presence of cellular or complementary positive sense RNAs. The outcome of this study may contribute to ongoing researches on other viral diseases caused by negative sense RNA viruses such as Ebola virus which poses a security threat to all humanity.
Efficient population-scale variant analysis and prioritization with VAPr.
Birmingham, Amanda; Mark, Adam M; Mazzaferro, Carlo; Xu, Guorong; Fisch, Kathleen M
2018-04-06
With the growing availability of population-scale whole-exome and whole-genome sequencing, demand for reproducible, scalable variant analysis has spread within genomic research communities. To address this need, we introduce the Python package VAPr (Variant Analysis and Prioritization). VAPr leverages existing annotation tools ANNOVAR and MyVariant.info with MongoDB-based flexible storage and filtering functionality. It offers biologists and bioinformatics generalists easy-to-use and scalable analysis and prioritization of genomic variants from large cohort studies. VAPr is developed in Python and is available for free use and extension under the MIT License. An install package is available on PyPi at https://pypi.python.org/pypi/VAPr, while source code and extensive documentation are on GitHub at https://github.com/ucsd-ccbb/VAPr. kfisch@ucsd.edu.
D-GENIES: dot plot large genomes in an interactive, efficient and simple way.
Cabanettes, Floréal; Klopp, Christophe
2018-01-01
Dot plots are widely used to quickly compare sequence sets. They provide a synthetic similarity overview, highlighting repetitions, breaks and inversions. Different tools have been developed to easily generated genomic alignment dot plots, but they are often limited in the input sequence size. D-GENIES is a standalone and web application performing large genome alignments using minimap2 software package and generating interactive dot plots. It enables users to sort query sequences along the reference, zoom in the plot and download several image, alignment or sequence files. D-GENIES is an easy-to-install, open-source software package (GPL) developed in Python and JavaScript. The source code is available at https://github.com/genotoul-bioinfo/dgenies and it can be tested at http://dgenies.toulouse.inra.fr/.
Hilbert, Brendan J.; Hayes, Janelle A.; Stone, Nicholas P.; Xu, Rui-Gang
2017-01-01
Abstract Many viruses use a powerful terminase motor to pump their genome inside an empty procapsid shell during virus maturation. The large terminase (TerL) protein contains both enzymatic activities necessary for packaging in such viruses: the adenosine triphosphatase (ATPase) that powers DNA translocation and an endonuclease that cleaves the concatemeric genome at both initiation and completion of genome packaging. However, how TerL binds DNA during translocation and cleavage remains mysterious. Here we investigate DNA binding and cleavage using TerL from the thermophilic phage P74-26. We report the structure of the P74-26 TerL nuclease domain, which allows us to model DNA binding in the nuclease active site. We screened a large panel of TerL variants for defects in binding and DNA cleavage, revealing that the ATPase domain is the primary site for DNA binding, and is required for nuclease activity. The nuclease domain is dispensable for DNA binding but residues lining the active site guide DNA for cleavage. Kinetic analysis of DNA cleavage suggests flexible tethering of the nuclease domains during DNA cleavage. We propose that interactions with the procapsid during DNA translocation conformationally restrict the nuclease domain, inhibiting cleavage; TerL release from the capsid upon completion of packaging unlocks the nuclease domains to cleave DNA. PMID:28082398
Porcine circovirus: transcription and rolling-circle DNA replication
USDA-ARS?s Scientific Manuscript database
This review summarizes the molecular studies pertaining to porcine circovirus (PCV) transcription and DNA replication. The genome of PCV is circular, single-stranded DNA and contains 1759-1768 nucleotides. Both the genome-strand (packaged in the virus particle) and the complementary-strand (synthesi...
Ardin, Maude; Cahais, Vincent; Castells, Xavier; Bouaoun, Liacine; Byrnes, Graham; Herceg, Zdenko; Zavadil, Jiri; Olivier, Magali
2016-04-18
The nature of somatic mutations observed in human tumors at single gene or genome-wide levels can reveal information on past carcinogenic exposures and mutational processes contributing to tumor development. While large amounts of sequencing data are being generated, the associated analysis and interpretation of mutation patterns that may reveal clues about the natural history of cancer present complex and challenging tasks that require advanced bioinformatics skills. To make such analyses accessible to a wider community of researchers with no programming expertise, we have developed within the web-based user-friendly platform Galaxy a first-of-its-kind package called MutSpec. MutSpec includes a set of tools that perform variant annotation and use advanced statistics for the identification of mutation signatures present in cancer genomes and for comparing the obtained signatures with those published in the COSMIC database and other sources. MutSpec offers an accessible framework for building reproducible analysis pipelines, integrating existing methods and scripts developed in-house with publicly available R packages. MutSpec may be used to analyse data from whole-exome, whole-genome or targeted sequencing experiments performed on human or mouse genomes. Results are provided in various formats including rich graphical outputs. An example is presented to illustrate the package functionalities, the straightforward workflow analysis and the richness of the statistics and publication-grade graphics produced by the tool. MutSpec offers an easy-to-use graphical interface embedded in the popular Galaxy platform that can be used by researchers with limited programming or bioinformatics expertise to analyse mutation signatures present in cancer genomes. MutSpec can thus effectively assist in the discovery of complex mutational processes resulting from exogenous and endogenous carcinogenic insults.
Bacteriophage T4 capsid packaging and unpackaging of DNA and proteins.
Mullaney, Julienne M; Black, Lindsay W
2014-01-01
Bacteriophage T4 has proven itself readily amenable to phage-based DNA and protein packaging, expression, and display systems due to its physical resiliency and genomic flexibility. As a large dsDNA phage with dispensable internal proteins and dispensable outer capsid proteins it can be adapted to package both DNA and proteins of interest within the capsid and to display peptides and proteins externally on the capsid. A single 170 kb linear DNA, or single or multiple copies of shorter linear DNAs, of any sequence can be packaged by the large terminase subunit in vitro into protein-containing proheads and give full or partially full capsids. The prohead receptacles for DNA packaging can also display peptides or full-length proteins from capsid display proteins HOC and SOC. Our laboratory has also developed a protein expression, packaging, and processing (PEPP) system which we have found to have advantages over mammalian and bacterial cell systems, including high yield, increased stability, and simplified downstream processing. Proteins that we have produced by the phage PEPP platform include human HIV-1 protease, micrococcal endonuclease from Staphylococcus aureus, restriction endonuclease EcoRI, luciferase, human granulocyte colony stimulating factor (GCSF), green fluorescent protein (GFP), and the 99 amino acid C-terminus of amyloid precursor protein (APP). Difficult to produce proteins that are toxic in mammalian protein expression systems are easily produced, packaged, and processed with the PEPP platform. APP is one example of such a highly refractory protein that has been produced successfully. The methods below describe the procedures for in vitro packaging of proheads with DNA and for producing recombinant T4 phage that carry a gene of interest in the phage genome and produce and internally package the corresponding protein of interest.
Autonomous microexplosives subsurface tracing system final report.
DOE Office of Scientific and Technical Information (OSTI.GOV)
Engler, Bruce Phillip; Nogan, John; Melof, Brian Matthew
The objective of the autonomous micro-explosive subsurface tracing system is to image the location and geometry of hydraulically induced fractures in subsurface petroleum reservoirs. This system is based on the insertion of a swarm of autonomous micro-explosive packages during the fracturing process, with subsequent triggering of the energetic material to create an array of micro-seismic sources that can be detected and analyzed using existing seismic receiver arrays and analysis software. The project included investigations of energetic mixtures, triggering systems, package size and shape, and seismic output. Given the current absence of any technology capable of such high resolution mapping ofmore » subsurface structures, this technology has the potential for major impact on petroleum industry, which spends approximately $1 billion dollar per year on hydraulic fracturing operations in the United States alone.« less
Torrent, C; Gabus, C; Darlix, J L
1994-02-01
Retroviral genomes consist of two identical RNA molecules associated at their 5' ends by the dimer linkage structure located in the packaging element (Psi or E) necessary for RNA dimerization in vitro and packaging in vivo. In murine leukemia virus (MLV)-derived vectors designed for gene transfer, the Psi + sequence of 600 nucleotides directs the packaging of recombinant RNAs into MLV virions produced by helper cells. By using in vitro RNA dimerization as a screening system, a sequence of rat VL30 RNA located next to the 5' end of the Harvey mouse sarcoma virus genome and as small as 67 nucleotides was found to form stable dimeric RNA. In addition, a purine-rich sequence located at the 5' end of this VL30 RNA seems to be critical for RNA dimerization. When this VL30 element was extended by 107 nucleotides at its 3' end and inserted into an MLV-derived vector lacking MLV Psi +, it directed the efficient encapsidation of recombinant RNAs into MLV virions. Because this VL30 packaging signal is smaller and more efficient in packaging recombinant RNAs than the MLV Psi + and does not contain gag or glyco-gag coding sequences, its use in MLV-derived vectors should render even more unlikely recombinations which could generate replication-competent viruses. Therefore, utilization of the rat VL30 packaging sequence should improve the biological safety of MLV vectors for human gene transfer.
High-resolution mapping of chromatin packaging in mouse embryonic stem cells and sperm.
Carone, Benjamin R; Hung, Jui-Hung; Hainer, Sarah J; Chou, Min-Te; Carone, Dawn M; Weng, Zhiping; Fazzio, Thomas G; Rando, Oliver J
2014-07-14
Mammalian embryonic stem cells (ESCs) and sperm exhibit unusual chromatin packaging that plays important roles in cellular function. Here, we extend a recently developed technique, based on deep paired-end sequencing of lightly digested chromatin, to assess footprints of nucleosomes and other DNA-binding proteins genome-wide in murine ESCs and sperm. In ESCs, we recover well-characterized features of chromatin such as promoter nucleosome depletion and further identify widespread footprints of sequence-specific DNA-binding proteins such as CTCF, which we validate in knockdown studies. We document global differences in nuclease accessibility between ESCs and sperm, finding that the majority of histone retention in sperm preferentially occurs in large gene-poor genomic regions, with only a small subset of nucleosomes being retained over promoters of developmental regulators. Finally, we describe evidence that CTCF remains associated with the genome in mature sperm, where it could play a role in organizing the sperm genome. Copyright © 2014 Elsevier Inc. All rights reserved.
Robert, Marc-André; Lytvyn, Viktoria; Deforet, Francis; Gilbert, Rénald; Gaillet, Bruno
2017-01-01
Virus-like particles (VLPs) derived from retroviruses and lentiviruses can be used to deliver recombinant proteins without the fear of causing insertional mutagenesis to the host cell genome. In this study we evaluate the potential of an inducible lentiviral vector packaging cell line for VLP production. The Gag gene from HIV-1 was fused to a gene encoding a selected protein and it was transfected into the packaging cells. Three proteins served as model: the green fluorescent protein and two transcription factors-the cumate transactivator (cTA) of the inducible CR5 promoter and the human Krüppel-like factor 4 (KLF4). The sizes of the VLPs were 120-150 nm in diameter and they were resistant to freeze/thaw cycles. Protein delivery by the VLPs reached up to 100% efficacy in human cells and was well tolerated. Gag-cTA triggered up to 1100-fold gene activation of the reporter gene in comparison to the negative control. Protein engineering was required to detect Gag-KLF4 activity. Thus, insertion of the VP16 transactivation domain increased the activity of the VLPs by eightfold. An additional 2.4-fold enhancement was obtained by inserting nuclear export signal. In conclusion, our platform produced VLPs capable of efficient protein transfer, and it was shown that protein engineering can be used to improve the activity of the delivered proteins as well as VLP production.
2011-05-01
genome was determined and compared to simian and human herpesvirus genomes representing alpha-herpesvi- ruses, beta- herpesviruses and gamma-1 and...of JMRV Genome with Select Simian and Human Herpesvirus Genomes Showing Percent Nucleotide Sequence Identity Virus JMRV RRV KSHV HVS RhLCV EBV RhCMV...2 - Introduction Particular viruses, especially gama- herpesviruses , may act as a trigger of multiple sclerosis (MS) (Levin et
Yao, Youli; Danna, Cristian H.; Zemp, Franz J.; Titov, Viktor; Ciftci, Ozan Nazim; Przybylski, Roman; Ausubel, Frederick M.; Kovalchuk, Igor
2011-01-01
We have previously shown that local exposure of plants to stress results in a systemic increase in genome instability. Here, we show that UV-C–irradiated plants produce a volatile signal that triggers an increase in genome instability in neighboring nonirradiated Arabidopsis thaliana plants. This volatile signal is interspecific, as UV-C–irradiated Arabidopsis plants transmit genome destabilization to naive tobacco (Nicotiana tabacum) plants and vice versa. We report that plants exposed to the volatile hormones methyl salicylate (MeSA) or methyl jasmonate (MeJA) exhibit a similar level of genome destabilization as UV-C–irradiated plants. We also found that irradiated Arabidopsis plants produce MeSA and MeJA. The analysis of mutants impaired in the synthesis and/or response to salicylic acid (SA) and/or jasmonic acid showed that at least one other volatile compound besides MeSA and MeJA can communicate interplant genome instability. The NONEXPRESSOR OF PATHOGENESIS-RELATED GENES1 (npr1) mutant, defective in SA signaling, is impaired in both the production and the perception of the volatile signals, demonstrating a key role for NPR1 as a central regulator of genome stability. Finally, various forms of stress resulting in the formation of necrotic lesions also generate a volatile signal that leads to genomic instability. PMID:22028460
Olson, Erik D; Musier-Forsyth, Karin
2018-03-31
Retroviral Gag proteins are responsible for coordinating many aspects of virion assembly. Gag possesses two distinct nucleic acid binding domains, matrix (MA) and nucleocapsid (NC). One of the critical functions of Gag is to specifically recognize, bind, and package the retroviral genomic RNA (gRNA) into assembling virions. Gag interactions with cellular RNAs have also been shown to regulate aspects of assembly. Recent results have shed light on the role of MA and NC domain interactions with nucleic acids, and how they jointly function to ensure packaging of the retroviral gRNA. Here, we will review the literature regarding RNA interactions with NC, MA, as well as overall mechanisms employed by Gag to interact with RNA. The discussion focuses on human immunodeficiency virus type-1, but other retroviruses will also be discussed. A model is presented combining all of the available data summarizing the various factors and layers of selection Gag employs to ensure specific gRNA packaging and correct virion assembly. Copyright © 2018 Elsevier Ltd. All rights reserved.
Dover, John A; Burmeister, Alita R; Molineux, Ian J; Parent, Kristin N
2016-09-19
Genomic architecture is the framework within which genes and regulatory elements evolve and where specific constructs may constrain or potentiate particular adaptations. One such construct is evident in phages that use a headful packaging strategy that results in progeny phage heads packaged with DNA until full rather than encapsidating a simple unit-length genome. Here, we investigate the evolution of the headful packaging phage Sf6 in response to barriers that impede efficient phage adsorption to the host cell. Ten replicate populations evolved faster Sf6 life cycles by parallel mutations found in a phage lysis gene and/or by large, 1.2- to 4.0-kb deletions that remove a mobile genetic IS911 element present in the ancestral phage genome. The fastest life cycles were found in phages that acquired both mutations. No mutations were found in genes encoding phage structural proteins, which were a priori expected from the experimental design that imposed a challenge for phage adsorption by using a Shigella flexneri host lacking receptors preferred by Sf6. We used DNA sequencing, molecular approaches, and physiological experiments on 82 clonal isolates taken from all 10 populations to reveal the genetic basis of the faster Sf6 life cycle. The majority of our isolates acquired deletions in the phage genome. Our results suggest that deletions are adaptive and can influence the duration of the phage life cycle while acting in conjunction with other lysis time-determining point mutations. © The Author 2016. Published by Oxford University Press on behalf of the Society for Molecular Biology and Evolution.
BlueSNP: R package for highly scalable genome-wide association studies using Hadoop clusters.
Huang, Hailiang; Tata, Sandeep; Prill, Robert J
2013-01-01
Computational workloads for genome-wide association studies (GWAS) are growing in scale and complexity outpacing the capabilities of single-threaded software designed for personal computers. The BlueSNP R package implements GWAS statistical tests in the R programming language and executes the calculations across computer clusters configured with Apache Hadoop, a de facto standard framework for distributed data processing using the MapReduce formalism. BlueSNP makes computationally intensive analyses, such as estimating empirical p-values via data permutation, and searching for expression quantitative trait loci over thousands of genes, feasible for large genotype-phenotype datasets. http://github.com/ibm-bioinformatics/bluesnp
Chaturvedi, Sonali; Rao, A L N
2014-09-01
In Brome mosaic virus, it was hypothesized that a physical interaction between viral replicase and capsid protein (CP) is obligatory to confer genome packaging specificity. Here we tested this hypothesis by employing Bimolecular Fluorescent Complementation (BiFC) as a tool for evaluating protein-protein interactions in living cells. The efficacy of BiFC was validated by a known interaction between replicase protein 1a (p1a) and protein 2a (p2a) at the endoplasmic reticulum (ER) site of viral replication. Additionally, co-expression in planta of a bona fide pair of interacting protein partners of p1a and p2a had resulted in the assembly of a functional replicase. Subsequent BiFC assays in conjunction with mCherry labeled ER as a fluorescent cellular marker revealed that CP physically interacts with p2a, but not p1a, and this CP:p2a interaction occurs at the cytoplasmic phase of the ER. The significance of the CP:p2a interaction in BMV replication and genome packaging is discussed. Copyright © 2014 Elsevier Inc. All rights reserved.
Production of pseudoinfectious yellow fever virus with a two-component genome.
Shustov, Alexandr V; Mason, Peter W; Frolov, Ilya
2007-11-01
Application of genetically modified, deficient-in-replication flaviviruses that are incapable of developing productive, spreading infection is a promising means of designing safe and effective vaccines. Here we describe a two-component genome yellow fever virus (YFV) replication system in which each of the genomes encodes complete sets of nonstructural proteins that form the replication complex but expresses either only capsid or prM/E instead of the entire structural polyprotein. Upon delivery to the same cell, these genomes produce together all of the viral structural proteins, and cells release a combination of virions with both types of genomes packaged into separate particles. In tissue culture, this modified YFV can be further passaged at an escalating scale by using a high multiplicity of infection (MOI). However, at a low MOI, only one of the genomes is delivered into the cells, and infection cannot spread. The replicating prM/E-encoding genome produces extracellular E protein in the form of secreted subviral particles that are known to be an effective immunogen. The presented strategy of developing viruses defective in replication might be applied to other flaviviruses, and these two-component genome viruses can be useful for diagnostic or vaccine applications, including the delivery and expression of heterologous genes. In addition, the achieved separation of the capsid-coding sequence and the cyclization signal in the YFV genome provides a new means for studying the mechanism of the flavivirus packaging process.
Elliott, Richard M.
2014-01-01
Rift Valley fever virus (RVFV, family Bunyaviridae) is a mosquito-borne pathogen of both livestock and humans, found primarily in Sub-Saharan Africa and the Arabian Peninsula. The viral genome comprises two negative-sense (L and M segments) and one ambisense (S segment) RNAs that encode seven proteins. The S segment encodes the nucleocapsid (N) protein in the negative-sense and a nonstructural (NSs) protein in the positive-sense, though NSs cannot be translated directly from the S segment but rather from a specific subgenomic mRNA. Using reverse genetics we generated a virus, designated rMP12:S-Swap, in which the N protein is expressed from the NSs locus and NSs from the N locus within the genomic S RNA. In cells infected with rMP12:S-Swap NSs is expressed at higher levels with respect to N than in cells infected with the parental rMP12 virus. Despite NSs being the main interferon antagonist and determinant of virulence, growth of rMP12:S-Swap was attenuated in mammalian cells and gave a small plaque phenotype. The increased abundance of the NSs protein did not lead to faster inhibition of host cell protein synthesis or host cell transcription in infected mammalian cells. In cultured mosquito cells, however, infection with rMP12:S-Swap resulted in cell death rather than establishment of persistence as seen with rMP12. Finally, altering the composition of the S segment led to a differential packaging ratio of genomic to antigenomic RNA into rMP12:S-Swap virions. Our results highlight the plasticity of the RVFV genome and provide a useful experimental tool to investigate further the packaging mechanism of the segmented genome. PMID:24550727
Herpesvirus capsid assembly and DNA packaging
Heming, Jason D.; Conway, James F.; Homa, Fred L.
2017-01-01
Herpes simplex virus type I (HSV-1) is the causative agent of several pathologies ranging in severity from the common cold sore to life-threatening encephalitic infection. During productive lytic infection, over 80 viral proteins are expressed in a highly regulated manner, resulting in the replication of viral genomes and assembly of progeny virions. The virion of all herpesviruses consists of an external membrane envelope, a proteinaceous layer called the tegument, and an icosahedral capsid containing the double-stranded linear DNA genome. The capsid shell of HSV-1 is built from four structural proteins: a major capsid protein, VP5, which forms the capsomers (hexons and pentons), the triplex consisting of VP19C and VP23 found between the capsomers, and VP26 which binds to VP5 on hexons but not pentons. In addition, the dodecameric pUL6 portal complex occupies one of the 12 capsid vertices, and the capsid vertex specific component (CVSC), a heterotrimer complex of pUL17, pUL25 and pUL36 binds specifically to the triplexes adjacent to each penton. The capsid is assembled in the nucleus where the viral genome is packaged into newly assembled closed capsid shells. Cleavage and packaging of replicated, concatemeric viral DNA requires the seven viral proteins encoded by the UL6, UL15, UL17, UL25, UL28, UL32, and UL33 genes. Considerable advances have been made in understanding the structure of the herpesvirus capsid and the function of several of the DNA packaging proteins by applying biochemical, genetic, and structural techniques. This review is a summary of recent advances with respect to the structure of the HSV-1 virion capsid and what is known about the function of the seven packaging proteins and their interactions with each other and with the capsid shell. PMID:28528442
msgbsR: An R package for analysing methylation-sensitive restriction enzyme sequencing data.
Mayne, Benjamin T; Leemaqz, Shalem Y; Buckberry, Sam; Rodriguez Lopez, Carlos M; Roberts, Claire T; Bianco-Miotto, Tina; Breen, James
2018-02-01
Genotyping-by-sequencing (GBS) or restriction-site associated DNA marker sequencing (RAD-seq) is a practical and cost-effective method for analysing large genomes from high diversity species. This method of sequencing, coupled with methylation-sensitive enzymes (often referred to as methylation-sensitive restriction enzyme sequencing or MRE-seq), is an effective tool to study DNA methylation in parts of the genome that are inaccessible in other sequencing techniques or are not annotated in microarray technologies. Current software tools do not fulfil all methylation-sensitive restriction sequencing assays for determining differences in DNA methylation between samples. To fill this computational need, we present msgbsR, an R package that contains tools for the analysis of methylation-sensitive restriction enzyme sequencing experiments. msgbsR can be used to identify and quantify read counts at methylated sites directly from alignment files (BAM files) and enables verification of restriction enzyme cut sites with the correct recognition sequence of the individual enzyme. In addition, msgbsR assesses DNA methylation based on read coverage, similar to RNA sequencing experiments, rather than methylation proportion and is a useful tool in analysing differential methylation on large populations. The package is fully documented and available freely online as a Bioconductor package ( https://bioconductor.org/packages/release/bioc/html/msgbsR.html ).
Müllers, Erik; Uhlig, Tobias; Stirnnagel, Kristin; Fiebig, Uwe; Zentgraf, Hanswalter; Lindemann, Dirk
2011-02-01
Prototype foamy virus (PFV) Gag lacks the characteristic orthoretroviral Cys-His motifs that are essential for various steps of the orthoretroviral replication cycle, such as RNA packaging, reverse transcription, infectivity, integration, and viral assembly. Instead, it contains three glycine-arginine-rich boxes (GR boxes) in its C terminus that putatively represent a functional equivalent. We used a four-plasmid replication-deficient PFV vector system, with uncoupled RNA genome packaging and structural protein translation, to analyze the effects of deletion and various substitution mutations within each GR box on particle release, particle-associated protein composition, RNA packaging, DNA content, infectivity, particle morphology, and intracellular localization. The degree of viral particle release by all mutants was similar to that of the wild type. Only minimal effects on Pol encapsidation, exogenous reverse transcriptase (RT) activity, and genomic viral RNA packaging were observed. In contrast, particle-associated DNA content and infectivity were drastically reduced for all deletion mutants and were undetectable for all alanine substitution mutants. Furthermore, GR box I mutants had significant changes in particle morphology, and GR box II mutants lacked the typical nuclear localization pattern of PFV Gag. Finally, it could be shown that GR boxes I and III, but not GR box II, can functionally complement each other. It therefore appears that, similar to the orthoretroviral Cys-His motifs, the PFV Gag GR boxes are important for RNA encapsidation, genome reverse transcription, and virion infectivity as well as for particle morphogenesis.
Torrent, C; Gabus, C; Darlix, J L
1994-01-01
Retroviral genomes consist of two identical RNA molecules associated at their 5' ends by the dimer linkage structure located in the packaging element (Psi or E) necessary for RNA dimerization in vitro and packaging in vivo. In murine leukemia virus (MLV)-derived vectors designed for gene transfer, the Psi + sequence of 600 nucleotides directs the packaging of recombinant RNAs into MLV virions produced by helper cells. By using in vitro RNA dimerization as a screening system, a sequence of rat VL30 RNA located next to the 5' end of the Harvey mouse sarcoma virus genome and as small as 67 nucleotides was found to form stable dimeric RNA. In addition, a purine-rich sequence located at the 5' end of this VL30 RNA seems to be critical for RNA dimerization. When this VL30 element was extended by 107 nucleotides at its 3' end and inserted into an MLV-derived vector lacking MLV Psi +, it directed the efficient encapsidation of recombinant RNAs into MLV virions. Because this VL30 packaging signal is smaller and more efficient in packaging recombinant RNAs than the MLV Psi + and does not contain gag or glyco-gag coding sequences, its use in MLV-derived vectors should render even more unlikely recombinations which could generate replication-competent viruses. Therefore, utilization of the rat VL30 packaging sequence should improve the biological safety of MLV vectors for human gene transfer. Images PMID:8289369
Kalloush, Rawan M.; Vivet-Boudou, Valérie; Ali, Lizna M.; Mustafa, Farah; Marquet, Roland; Rizvi, Tahir A.
2016-01-01
MPMV has great potential for development as a vector for gene therapy. In this respect, precisely defining the sequences and structural motifs that are important for dimerization and packaging of its genomic RNA (gRNA) are of utmost importance. A distinguishing feature of the MPMV gRNA packaging signal is two phylogenetically conserved long-range interactions (LRIs) between U5 and gag complementary sequences, LRI-I and LRI-II. To test their biological significance in the MPMV life cycle, we introduced mutations into these structural motifs and tested their effects on MPMV gRNA packaging and propagation. Furthermore, we probed the structure of key mutants using SHAPE (selective 2′hydroxyl acylation analyzed by primer extension). Disrupting base-pairing of the LRIs affected gRNA packaging and propagation, demonstrating their significance to the MPMV life cycle. A double mutant restoring a heterologous LRI-I was fully functional, whereas a similar LRI-II mutant failed to restore gRNA packaging and propagation. These results demonstrate that while LRI-I acts at the structural level, maintaining base-pairing is not sufficient for LRI-II function. In addition, in vitro RNA dimerization assays indicated that the loss of RNA packaging in LRI mutants could not be attributed to the defects in dimerization. Our findings suggest that U5-gag LRIs play an important architectural role in maintaining the structure of the 5′ region of the MPMV gRNA, expanding the crucial role of LRIs to the nonlentiviral group of retroviruses. PMID:27095024
Dengue Virus Modulates the Unfolded Protein Response in a Time-dependent Manner*
Peña, José; Harris, Eva
2011-01-01
Flaviviruses, such as dengue virus (DENV), depend on the host endoplasmic reticulum for translation, replication, and packaging of their genomes. Here we report that DENV-2 infection modulates the unfolded protein response in a time-dependent manner. We show that early DENV-2 infection triggers and then suppresses PERK-mediated eIF2α phosphorylation and that in mid and late DENV-2 infection, the IRE1-XBP1 and ATF6 pathways are activated, respectively. Activation of IRE1-XBP1 correlated with induction of downstream targets GRP78, CHOP, and GADD34. Furthermore, induction of CHOP did not induce apoptotic markers, such as suppression of anti-apoptotic protein Bcl-2, activation of caspase-9 or caspase-3, and cleavage of poly(ADP-ribose) polymerase. Finally, we show that DENV-2 replication is affected in PERK−/− and IRE1−/− mouse embryo fibroblasts when compared with wild-type mouse embryo fibroblasts. These results demonstrate that time-dependent activation of the unfolded protein response by DENV-2 can override inhibition of translation, prevent apoptosis, and prolong the viral life cycle. PMID:21385877
fluff: exploratory analysis and visualization of high-throughput sequencing data
Georgiou, Georgios
2016-01-01
Summary. In this article we describe fluff, a software package that allows for simple exploration, clustering and visualization of high-throughput sequencing data mapped to a reference genome. The package contains three command-line tools to generate publication-quality figures in an uncomplicated manner using sensible defaults. Genome-wide data can be aggregated, clustered and visualized in a heatmap, according to different clustering methods. This includes a predefined setting to identify dynamic clusters between different conditions or developmental stages. Alternatively, clustered data can be visualized in a bandplot. Finally, fluff includes a tool to generate genomic profiles. As command-line tools, the fluff programs can easily be integrated into standard analysis pipelines. The installation is straightforward and documentation is available at http://fluff.readthedocs.org. Availability. fluff is implemented in Python and runs on Linux. The source code is freely available for download at https://github.com/simonvh/fluff. PMID:27547532
Mansilla, Sabrina F; Bertolin, Agustina P; Bergoglio, Valérie; Pillaire, Marie-Jeanne; González Besteiro, Marina A; Luzzani, Carlos; Miriuka, Santiago G; Hoffmann, Jean-Sébastien; Gottifredi, Vanesa
2016-01-01
The levels of the cyclin-dependent kinase (CDK) inhibitor p21 are low in S phase and insufficient to inhibit CDKs. We show here that endogenous p21, instead of being residual, it is functional and necessary to preserve the genomic stability of unstressed cells. p21depletion slows down nascent DNA elongation, triggers permanent replication defects and promotes the instability of hard-to-replicate genomic regions, namely common fragile sites (CFS). The p21’s PCNA interacting region (PIR), and not its CDK binding domain, is needed to prevent the replication defects and the genomic instability caused by p21 depletion. The alternative polymerase kappa is accountable for such defects as they were not observed after simultaneous depletion of both p21 and polymerase kappa. Hence, in CDK-independent manner, endogenous p21 prevents a type of genomic instability which is not triggered by endogenous DNA lesions but by a dysregulation in the DNA polymerase choice during genomic DNA synthesis. DOI: http://dx.doi.org/10.7554/eLife.18020.001 PMID:27740454
designGG: an R-package and web tool for the optimal design of genetical genomics experiments.
Li, Yang; Swertz, Morris A; Vera, Gonzalo; Fu, Jingyuan; Breitling, Rainer; Jansen, Ritsert C
2009-06-18
High-dimensional biomolecular profiling of genetically different individuals in one or more environmental conditions is an increasingly popular strategy for exploring the functioning of complex biological systems. The optimal design of such genetical genomics experiments in a cost-efficient and effective way is not trivial. This paper presents designGG, an R package for designing optimal genetical genomics experiments. A web implementation for designGG is available at http://gbic.biol.rug.nl/designGG. All software, including source code and documentation, is freely available. DesignGG allows users to intelligently select and allocate individuals to experimental units and conditions such as drug treatment. The user can maximize the power and resolution of detecting genetic, environmental and interaction effects in a genome-wide or local mode by giving more weight to genome regions of special interest, such as previously detected phenotypic quantitative trait loci. This will help to achieve high power and more accurate estimates of the effects of interesting factors, and thus yield a more reliable biological interpretation of data. DesignGG is applicable to linkage analysis of experimental crosses, e.g. recombinant inbred lines, as well as to association analysis of natural populations.
NASA Astrophysics Data System (ADS)
Migliori, Amy; Arya, Gaurav; Smith, Douglas E.
2012-10-01
Bacteriophage T4 is a double stranded DNA virus that infects E.coli by injecting the viral genome through the cellular wall of a host cell. The T4 genome must be ejected from the viral capsid with sufficient force to ensure infection. To generate high ejection forces, the genome is packaged to high density within the viral capsid. A DNA translocation motor, in which the protein gp17 hydrolyzes ATP and binds to the DNA, is responsible for translocating the genome into the capsid during viral maturation of T4. This motor generates forces in excess of 60 pN and packages DNA at rates exceeding 2000 base pairs/second (bp/s)1. Understanding these small yet powerful motors is important, as they have many potential applications. Though much is known about the activity of these motors from bulk and single molecule biophysical techniques, little is known about their detailed molecular mechanism. Recently, two structures of gp17 have been obtained: a high-resolution X-ray crystallographic structure showing a monomeric compacted form of the enzyme, and a cryo-electron microscopic structure of the extended form of gp17 in complex with actively packaging prohead complexes. Comparison of these two structures indicates several key differences, and a model has been proposed to explain the translocation action of the motor2. Key to this model are a set of residues forming ion pairs across two domains of the gp17 molecule that are proposed to be involved in force generation by causing the collapse of the extended form of gp17. Using a dual optical trap to measure the rates of DNA packaging and the generated forces, we present preliminary mutational data showing that these several of these ion pairs are important to motor function. We have also performed preliminary free energy calculations on the extended and collapsed state of gp17, to confirm that these interdomain ion pairs have large contributions to the change in free energy that occurs upon the collapse of gp17 during the proposed ratcheting mechanism.
Modular assembly of chimeric phi29 packaging RNAs that support DNA packaging.
Fang, Yun; Shu, Dan; Xiao, Feng; Guo, Peixuan; Qin, Peter Z
2008-08-08
The bacteriophage phi29 DNA packaging motor is a protein/RNA complex that can produce strong force to condense the linear-double-stranded DNA genome into a pre-formed protein capsid. The RNA component, called the packaging RNA (pRNA), utilizes magnesium-dependent inter-molecular base-pairing interactions to form ring-shaped complexes. The pRNA is a class of non-coding RNA, interacting with phi29 motor proteins to enable DNA packaging. Here, we report a two-piece chimeric pRNA construct that is fully competent in interacting with partner pRNA to form ring-shaped complexes, in packaging DNA via the motor, and in assembling infectious phi29 virions in vitro. This is the first example of a fully functional pRNA assembled using two non-covalently interacting fragments. The results support the notion of modular pRNA architecture in the phi29 packaging motor.
Modular assembly of chimeric phi29 packaging RNAs that support DNA packaging
Fang, Yun; Shu, Dan; Xiao, Feng; Guo, Peixuan; Qin, Peter Z.
2008-01-01
The bacteriophage phi29 DNA packaging motor is a protein/RNA complex that can produce strong force to condense the linear-double stranded DNA genome into a pre-formed protein capsid. The RNA component, called the packaging RNA (pRNA), utilizes magnesium-dependent intermolecular base-pairing interactions to form ring-shaped complexes. The pRNA is a class of non-coding RNA, interacting with phi29 motor proteins to enable DNA packaging. Here, we report a 2-piece chimeric pRNA construct that is fully competent in interacting with partner pRNA to form ring-shaped complexes, in packaging DNA via the motor, and in assembling infectious phi29 virions in vitro. This is the first example of a fully functional pRNA assembled using two non-covalently interacting fragments. The results support the notion of modular pRNA architecture in the phi29 packaging motor. PMID:18514064
Genomic Diversity of Phages Infecting Probiotic Strains of Lactobacillus paracasei
Rousseau, Geneviève M.; Capra, María L.; Quiberoni, Andrea; Tremblay, Denise M.; Labrie, Simon J.
2015-01-01
Strains of the Lactobacillus casei group have been extensively studied because some are used as probiotics in foods. Conversely, their phages have received much less attention. We analyzed the complete genome sequences of five L. paracasei temperate phages: CL1, CL2, iLp84, iLp1308, and iA2. Only phage iA2 could not replicate in an indicator strain. The genome lengths ranged from 34,155 bp (iA2) to 39,474 bp (CL1). Phages iA2 and iLp1308 (34,176 bp) possess the smallest genomes reported, thus far, for phages of the L. casei group. The GC contents of the five phage genomes ranged from 44.8 to 45.6%. As observed with many other phages, their genomes were organized as follows: genes coding for DNA packaging, morphogenesis, lysis, lysogeny, and replication. Phages CL1, CL2, and iLp1308 are highly related to each other. Phage iLp84 was also related to these three phages, but the similarities were limited to gene products involved in DNA packaging and structural proteins. Genomic fragments of phages CL1, CL2, iLp1308, and iLp84 were found in several genomes of L. casei strains. Prophage iA2 is unrelated to these four phages, but almost all of its genome was found in at least four L. casei strains. Overall, these phages are distinct from previously characterized Lactobacillus phages. Our results highlight the diversity of L. casei phages and indicate frequent DNA exchanges between phages and their hosts. PMID:26475105
Yang, Jian-Hua; Zhang, Xiao-Chen; Huang, Zhan-Peng; Zhou, Hui; Huang, Mian-Bo; Zhang, Shu; Chen, Yue-Qin; Qu, Liang-Hu
2006-01-01
Small nucleolar RNAs (snoRNAs) represent an abundant group of non-coding RNAs in eukaryotes. They can be divided into guide and orphan snoRNAs according to the presence or absence of antisense sequence to rRNAs or snRNAs. Current snoRNA-searching programs, which are essentially based on sequence complementarity to rRNAs or snRNAs, exist only for the screening of guide snoRNAs. In this study, we have developed an advanced computational package, snoSeeker, which includes CDseeker and ACAseeker programs, for the highly efficient and specific screening of both guide and orphan snoRNA genes in mammalian genomes. By using these programs, we have systematically scanned four human-mammal whole-genome alignment (WGA) sequences and identified 54 novel candidates including 26 orphan candidates as well as 266 known snoRNA genes. Eighteen novel snoRNAs were further experimentally confirmed with four snoRNAs exhibiting a tissue-specific or restricted expression pattern. The results of this study provide the most comprehensive listing of two families of snoRNA genes in the human genome till date.
GPFrontend and GPGraphics: graphical analysis tools for genetic association studies.
Uebe, Steffen; Pasutto, Francesca; Krumbiegel, Mandy; Schanze, Denny; Ekici, Arif B; Reis, André
2010-09-21
Most software packages for whole genome association studies are non-graphical, purely text based programs originally designed to run with UNIX-like operating systems. Graphical output is often not intended or supposed to be performed with other command line tools, e.g. gnuplot. Using the Microsoft .NET 2.0 platform and Visual Studio 2005, we have created a graphical software package to analyze data from microarray whole genome association studies, both for a DNA-pooling based approach as well as regular single sample data. Part of this package was made to integrate with GenePool 0.8.2, a previously existing software suite for GNU/Linux systems, which we have modified to run in a Microsoft Windows environment. Further modifications cause it to generate some additional data. This enables GenePool to interact with the .NET parts created by us. The programs we developed are GPFrontend, a graphical user interface and frontend to use GenePool and create metadata files for it, and GPGraphics, a program to further analyze and graphically evaluate output of different WGA analysis programs, among them also GenePool. Our programs enable regular MS Windows users without much experience in bioinformatics to easily visualize whole genome data from a variety of sources.
The Crystal Structure and RNA-Binding of an Orthomyxovirus Nucleoprotein
Zheng, Wenjie; Olson, John; Vakharia, Vikram; Tao, Yizhi Jane
2013-01-01
Genome packaging for viruses with segmented genomes is often a complex problem. This is particularly true for influenza viruses and other orthomyxoviruses, whose genome consists of multiple negative-sense RNAs encapsidated as ribonucleoprotein (RNP) complexes. To better understand the structural features of orthomyxovirus RNPs that allow them to be packaged, we determined the crystal structure of the nucleoprotein (NP) of a fish orthomyxovirus, the infectious salmon anemia virus (ISAV) (genus Isavirus). As the major protein component of the RNPs, ISAV-NP possesses a bi-lobular structure similar to the influenza virus NP. Because both RNA-free and RNA-bound ISAV NP forms stable dimers in solution, we were able to measure the NP RNA binding affinity as well as the stoichiometry using recombinant proteins and synthetic oligos. Our RNA binding analysis revealed that each ISAV-NP binds ∼12 nts of RNA, shorter than the 24–28 nts originally estimated for the influenza A virus NP based on population average. The 12-nt stoichiometry was further confirmed by results from electron microscopy and dynamic light scattering. Considering that RNPs of ISAV and the influenza viruses have similar morphologies and dimensions, our findings suggest that NP-free RNA may exist on orthomyxovirus RNPs, and selective RNP packaging may be accomplished through direct RNA-RNA interactions. PMID:24068932
iScreen: Image-Based High-Content RNAi Screening Analysis Tools.
Zhong, Rui; Dong, Xiaonan; Levine, Beth; Xie, Yang; Xiao, Guanghua
2015-09-01
High-throughput RNA interference (RNAi) screening has opened up a path to investigating functional genomics in a genome-wide pattern. However, such studies are often restricted to assays that have a single readout format. Recently, advanced image technologies have been coupled with high-throughput RNAi screening to develop high-content screening, in which one or more cell image(s), instead of a single readout, were generated from each well. This image-based high-content screening technology has led to genome-wide functional annotation in a wider spectrum of biological research studies, as well as in drug and target discovery, so that complex cellular phenotypes can be measured in a multiparametric format. Despite these advances, data analysis and visualization tools are still largely lacking for these types of experiments. Therefore, we developed iScreen (image-Based High-content RNAi Screening Analysis Tool), an R package for the statistical modeling and visualization of image-based high-content RNAi screening. Two case studies were used to demonstrate the capability and efficiency of the iScreen package. iScreen is available for download on CRAN (http://cran.cnr.berkeley.edu/web/packages/iScreen/index.html). The user manual is also available as a supplementary document. © 2014 Society for Laboratory Automation and Screening.
SNPassoc: an R package to perform whole genome association studies.
González, Juan R; Armengol, Lluís; Solé, Xavier; Guinó, Elisabet; Mercader, Josep M; Estivill, Xavier; Moreno, Víctor
2007-03-01
The popularization of large-scale genotyping projects has led to the widespread adoption of genetic association studies as the tool of choice in the search for single nucleotide polymorphisms (SNPs) underlying susceptibility to complex diseases. Although the analysis of individual SNPs is a relatively trivial task, when the number is large and multiple genetic models need to be explored it becomes necessary a tool to automate the analyses. In order to address this issue, we developed SNPassoc, an R package to carry out most common analyses in whole genome association studies. These analyses include descriptive statistics and exploratory analysis of missing values, calculation of Hardy-Weinberg equilibrium, analysis of association based on generalized linear models (either for quantitative or binary traits), and analysis of multiple SNPs (haplotype and epistasis analysis). Package SNPassoc is available at CRAN from http://cran.r-project.org. A tutorial is available on Bioinformatics online and in http://davinci.crg.es/estivill_lab/snpassoc.
HyDe: a Python Package for Genome-Scale Hybridization Detection.
Blischak, Paul D; Chifman, Julia; Wolfe, Andrea D; Kubatko, Laura S
2018-03-19
The analysis of hybridization and gene flow among closely related taxa is a common goal for researchers studying speciation and phylogeography. Many methods for hybridization detection use simple site pattern frequencies from observed genomic data and compare them to null models that predict an absence of gene flow. The theory underlying the detection of hybridization using these site pattern probabilities exploits the relationship between the coalescent process for gene trees within population trees and the process of mutation along the branches of the gene trees. For certain models, site patterns are predicted to occur in equal frequency (i.e., their difference is 0), producing a set of functions called phylogenetic invariants. In this paper we introduce HyDe, a software package for detecting hybridization using phylogenetic invariants arising under the coalescent model with hybridization. HyDe is written in Python, and can be used interactively or through the command line using pre-packaged scripts. We demonstrate the use of HyDe on simulated data, as well as on two empirical data sets from the literature. We focus in particular on identifying individual hybrids within population samples and on distinguishing between hybrid speciation and gene flow. HyDe is freely available as an open source Python package under the GNU GPL v3 on both GitHub (https://github.com/pblischak/HyDe) and the Python Package Index (PyPI: https://pypi.python.org/pypi/phyde).
The novel asymmetric entry intermediate of a picornavirus captured with nanodiscs
Lee, Hyunwook; Shingler, Kristin L.; Organtini, Lindsey J.; Ashley, Robert E.; Makhov, Alexander M.; Conway, James F.; Hafenstein, Susan
2016-01-01
Many nonenveloped viruses engage host receptors that initiate capsid conformational changes necessary for genome release. Structural studies on the mechanisms of picornavirus entry have relied on in vitro approaches of virus incubated at high temperatures or with excess receptor molecules to trigger the entry intermediate or A-particle. We have induced the coxsackievirus B3 entry intermediate by triggering the virus with full-length receptors embedded in lipid bilayer nanodiscs. These asymmetrically formed A-particles were reconstructed using cryo-electron microscopy and a direct electron detector. These first high-resolution structures of a picornavirus entry intermediate captured at a membrane with and without imposing icosahedral symmetry (3.9 and 7.8 Å, respectively) revealed a novel A-particle that is markedly different from the classical A-particles. The asymmetric receptor binding triggers minimal global capsid expansion but marked local conformational changes at the site of receptor interaction. In addition, viral proteins extrude from the capsid only at the site of extensive protein remodeling adjacent to the nanodisc. Thus, the binding of the receptor triggers formation of a unique site in preparation for genome release. PMID:27574701
DMRfinder: efficiently identifying differentially methylated regions from MethylC-seq data.
Gaspar, John M; Hart, Ronald P
2017-11-29
DNA methylation is an epigenetic modification that is studied at a single-base resolution with bisulfite treatment followed by high-throughput sequencing. After alignment of the sequence reads to a reference genome, methylation counts are analyzed to determine genomic regions that are differentially methylated between two or more biological conditions. Even though a variety of software packages is available for different aspects of the bioinformatics analysis, they often produce results that are biased or require excessive computational requirements. DMRfinder is a novel computational pipeline that identifies differentially methylated regions efficiently. Following alignment, DMRfinder extracts methylation counts and performs a modified single-linkage clustering of methylation sites into genomic regions. It then compares methylation levels using beta-binomial hierarchical modeling and Wald tests. Among its innovative attributes are the analyses of novel methylation sites and methylation linkage, as well as the simultaneous statistical analysis of multiple sample groups. To demonstrate its efficiency, DMRfinder is benchmarked against other computational approaches using a large published dataset. Contrasting two replicates of the same sample yielded minimal genomic regions with DMRfinder, whereas two alternative software packages reported a substantial number of false positives. Further analyses of biological samples revealed fundamental differences between DMRfinder and another software package, despite the fact that they utilize the same underlying statistical basis. For each step, DMRfinder completed the analysis in a fraction of the time required by other software. Among the computational approaches for identifying differentially methylated regions from high-throughput bisulfite sequencing datasets, DMRfinder is the first that integrates all the post-alignment steps in a single package. Compared to other software, DMRfinder is extremely efficient and unbiased in this process. DMRfinder is free and open-source software, available on GitHub ( github.com/jsh58/DMRfinder ); it is written in Python and R, and is supported on Linux.
Kalloush, Rawan M; Vivet-Boudou, Valérie; Ali, Lizna M; Mustafa, Farah; Marquet, Roland; Rizvi, Tahir A
2016-06-01
MPMV has great potential for development as a vector for gene therapy. In this respect, precisely defining the sequences and structural motifs that are important for dimerization and packaging of its genomic RNA (gRNA) are of utmost importance. A distinguishing feature of the MPMV gRNA packaging signal is two phylogenetically conserved long-range interactions (LRIs) between U5 and gag complementary sequences, LRI-I and LRI-II. To test their biological significance in the MPMV life cycle, we introduced mutations into these structural motifs and tested their effects on MPMV gRNA packaging and propagation. Furthermore, we probed the structure of key mutants using SHAPE (selective 2'hydroxyl acylation analyzed by primer extension). Disrupting base-pairing of the LRIs affected gRNA packaging and propagation, demonstrating their significance to the MPMV life cycle. A double mutant restoring a heterologous LRI-I was fully functional, whereas a similar LRI-II mutant failed to restore gRNA packaging and propagation. These results demonstrate that while LRI-I acts at the structural level, maintaining base-pairing is not sufficient for LRI-II function. In addition, in vitro RNA dimerization assays indicated that the loss of RNA packaging in LRI mutants could not be attributed to the defects in dimerization. Our findings suggest that U5-gag LRIs play an important architectural role in maintaining the structure of the 5' region of the MPMV gRNA, expanding the crucial role of LRIs to the nonlentiviral group of retroviruses. © 2016 Kalloush et al.; Published by Cold Spring Harbor Laboratory Press for the RNA Society.
Nair, Nidhi; Shoaib, Muhammad
2017-01-01
Genomic DNA is compacted into chromatin through packaging with histone and non-histone proteins. Importantly, DNA accessibility is dynamically regulated to ensure genome stability. This is exemplified in the response to DNA damage where chromatin relaxation near genomic lesions serves to promote access of relevant enzymes to specific DNA regions for signaling and repair. Furthermore, recent data highlight genome maintenance roles of chromatin through the regulation of endogenous DNA-templated processes including transcription and replication. Here, we review research that shows the importance of chromatin structure regulation in maintaining genome integrity by multiple mechanisms including facilitating DNA repair and directly suppressing endogenous DNA damage. PMID:28698521
Selective recruitment of nuclear factors to productively replicating herpes simplex virus genomes.
Dembowski, Jill A; DeLuca, Neal A
2015-05-01
Much of the HSV-1 life cycle is carried out in the cell nucleus, including the expression, replication, repair, and packaging of viral genomes. Viral proteins, as well as cellular factors, play essential roles in these processes. Isolation of proteins on nascent DNA (iPOND) was developed to label and purify cellular replication forks. We adapted aspects of this method to label viral genomes to both image, and purify replicating HSV-1 genomes for the identification of associated proteins. Many viral and cellular factors were enriched on viral genomes, including factors that mediate DNA replication, repair, chromatin remodeling, transcription, and RNA processing. As infection proceeded, packaging and structural components were enriched to a greater extent. Among the more abundant proteins that copurified with genomes were the viral transcription factor ICP4 and the replication protein ICP8. Furthermore, all seven viral replication proteins were enriched on viral genomes, along with cellular PCNA and topoisomerases, while other cellular replication proteins were not detected. The chromatin-remodeling complexes present on viral genomes included the INO80, SWI/SNF, NURD, and FACT complexes, which may prevent chromatinization of the genome. Consistent with this conclusion, histones were not readily recovered with purified viral genomes, and imaging studies revealed an underrepresentation of histones on viral genomes. RNA polymerase II, the mediator complex, TFIID, TFIIH, and several other transcriptional activators and repressors were also affinity purified with viral DNA. The presence of INO80, NURD, SWI/SNF, mediator, TFIID, and TFIIH components is consistent with previous studies in which these complexes copurified with ICP4. Therefore, ICP4 is likely involved in the recruitment of these key cellular chromatin remodeling and transcription factors to viral genomes. Taken together, iPOND is a valuable method for the study of viral genome dynamics during infection and provides a comprehensive view of how HSV-1 selectively utilizes cellular resources.
RNA secondary structures of the bacteriophage phi6 packaging regions.
Pirttimaa, M J; Bamford, D H
2000-06-01
Bacteriophage phi6 genome consists of three segments of double-stranded RNA. During maturation, single-stranded copies of these segments are packaged into preformed polymerase complex particles. Only phi6 RNA is packaged, and each particle contains only one copy of each segment. An in vitro packaging and replication assay has been developed for phi6, and the packaging signals (pac sites) have been mapped to the 5' ends of the RNA segments. In this study, we propose secondary structure models for the pac sites of phi6 single-stranded RNA segments. Our models accommodate data from structure-specific chemical modifications, free energy minimizations, and phylogenetic comparisons. Previously reported pac site deletion studies are also discussed. Each pac site possesses a unique architecture, that, however, contains common structural elements.
Role of DNA-DNA Interactions on the Structure and Thermodynamics of Bacteriophages Lambda and P4
Petrov, Anton S.; Harvey, Stephen C.
2010-01-01
Electrostatic interactions play an important role in both packaging of DNA inside bacteriophages and its release into bacterial cells. While at physiological conditions DNA strands repel each other, the presence of polyvalent cations such as spermine and spermidine in solutions leads to the formation of DNA condensates. In this study, we discuss packaging of DNA into bacteriophages P4 and Lambda under repulsive and attractive conditions using a coarse-grained model of DNA and capsids. Packaging under repulsive conditions leads to the appearance of the coaxial spooling conformations; DNA occupies all available space inside the capsid. Under the attractive potential both packed systems reveal toroidal conformations, leaving the central part of the capsids empty. We also present a detailed thermodynamic analysis of packaging and show that the forces required to pack the genomes in the presence of polyamines are significantly lower than those observed under repulsive conditions. The analysis reveals that in both the repulsive and attractive regimes the entropic penalty of DNA confinement has a significant non-negligible contribution into the total energy of packaging. Additionally we report the results of simulations of DNA condensation inside partially packed Lambda. We found that at low densities DNA behaves as free unconfined polymer and condenses into the toroidal structures; at higher densities rearrangement of the genome into toroids becomes hindered, and condensation results in the formation of non-equilibrium structures. In all cases packaging in a specific conformation occurs as a result of interplay between bending stresses experienced by the confined polymer and interactions between the strands. PMID:21074621
Company profile: Complete Genomics Inc.
Reid, Clifford
2011-02-01
Complete Genomics Inc. is a life sciences company that focuses on complete human genome sequencing. It is taking a completely different approach to DNA sequencing than other companies in the industry. Rather than building a general-purpose platform for sequencing all organisms and all applications, it has focused on a single application - complete human genome sequencing. The company's Complete Genomics Analysis Platform (CGA™ Platform) comprises an integrated package of biochemistry, instrumentation and software that sequences human genomes at the highest quality, lowest cost and largest scale available. Complete Genomics offers a turnkey service that enables customers to outsource their human genome sequencing to the company's genome sequencing center in Mountain View, CA, USA. Customers send in their DNA samples, the company does all the library preparation, DNA sequencing, assembly and variant analysis, and customers receive research-ready data that they can use for biological discovery.
Rizvi, Tahir A; Kenyon, Julia C; Ali, Jahabar; Aktar, Suriya J; Phillip, Pretty S; Ghazawi, Akela; Mustafa, Farah; Lever, Andrew M L
2010-10-15
The feline immunodeficiency virus (FIV) is a lentivirus that is related to human immunodeficiency virus (HIV), causing a similar pathology in cats. It is a potential small animal model for AIDS and the FIV-based vectors are also being pursued for human gene therapy. Previous studies have mapped the FIV packaging signal (ψ) to two or more discontinuous regions within the 5' 511 nt of the genomic RNA and structural analyses have determined its secondary structure. The 5' and 3' sequences within ψ region interact through extensive long-range interactions (LRIs), including a conserved heptanucleotide interaction between R/U5 and gag. Other secondary structural elements identified include a conserved 150 nt stem-loop (SL2) and a small palindromic stem-loop within gag open reading frame that might act as a viral dimerization initiation site. We have performed extensive mutational analysis of these sequences and structures and ascertained their importance in FIV packaging using a trans-complementation assay. Disrupting the conserved heptanucleotide LRI to prevent base pairing between R/U5 and gag reduced packaging by 2.8-5.5 fold. Restoration of pairing using an alternative, non-wild type (wt) LRI sequence restored RNA packaging and propagation to wt levels, suggesting that it is the structure of the LRI, rather than its sequence, that is important for FIV packaging. Disrupting the palindrome within gag reduced packaging by 1.5-3-fold, but substitution with a different palindromic sequence did not restore packaging completely, suggesting that the sequence of this region as well as its palindromic nature is important. Mutation of individual regions of SL2 did not have a pronounced effect on FIV packaging, suggesting that either it is the structure of SL2 as a whole that is necessary for optimal packaging, or that there is redundancy within this structure. The mutational analysis presented here has further validated the previously predicted RNA secondary structure of FIV ψ. Copyright © 2010 Elsevier Ltd. All rights reserved.
DNA bending-induced phase transition of encapsidated genome in phage λ
Lander, Gabriel C.; Johnson, John E.; Rau, Donald C.; Potter, Clinton S.; Carragher, Bridget; Evilevitch, Alex
2013-01-01
The DNA structure in phage capsids is determined by DNA–DNA interactions and bending energy. The effects of repulsive interactions on DNA interaxial distance were previously investigated, but not the effect of DNA bending on its structure in viral capsids. By varying packaged DNA length and through addition of spermine ions, we transform the interaction energy from net repulsive to net attractive. This allowed us to isolate the effect of bending on the resulting DNA structure. We used single particle cryo-electron microscopy reconstruction analysis to determine the interstrand spacing of double-stranded DNA encapsidated in phage λ capsids. The data reveal that stress and packing defects, both resulting from DNA bending in the capsid, are able to induce a long-range phase transition in the encapsidated DNA genome from a hexagonal to a cholesteric packing structure. This structural observation suggests significant changes in genome fluidity as a result of a phase transition affecting the rates of viral DNA ejection and packaging. PMID:23449219
Solid-to-fluid DNA transition inside HSV-1 capsid close to the temperature of infection
DOE Office of Scientific and Technical Information (OSTI.GOV)
Sae-Ueng, Udom; Li, Dong; Zuo, Xiaobing
2014-10-01
DNA in the human Herpes simplex virus type 1 (HSV-1) capsid is packaged to a tight density. This leads to tens of atmospheres of internal pressure responsible for the delivery of the herpes genome into the cell nucleus. In this study we show that, despite its liquid crystalline state inside the capsid, the DNA is fluid-like, which facilitates its ejection into the cell nucleus during infection. We found that the sliding friction between closely packaged DNA strands, caused by interstrand repulsive interactions, is reduced by the ionic environment of epithelial cells and neurons susceptible to herpes infection. However, variations inmore » the ionic conditions corresponding to neuronal activity can restrict DNA mobility in the capsid, making it more solid-like. This can inhibit intranuclear DNA release and interfere with viral replication. In addition, the temperature of the human host (37 °C) induces a disordering transition of the encapsidated herpes genome, which reduces interstrand interactions and provides genome mobility required for infection.« less
ssbio: a Python framework for structural systems biology.
Mih, Nathan; Brunk, Elizabeth; Chen, Ke; Catoiu, Edward; Sastry, Anand; Kavvas, Erol; Monk, Jonathan M; Zhang, Zhen; Palsson, Bernhard O
2018-06-15
Working with protein structures at the genome-scale has been challenging in a variety of ways. Here, we present ssbio, a Python package that provides a framework to easily work with structural information in the context of genome-scale network reconstructions, which can contain thousands of individual proteins. The ssbio package provides an automated pipeline to construct high quality genome-scale models with protein structures (GEM-PROs), wrappers to popular third-party programs to compute associated protein properties, and methods to visualize and annotate structures directly in Jupyter notebooks, thus lowering the barrier of linking 3D structural data with established systems workflows. ssbio is implemented in Python and available to download under the MIT license at http://github.com/SBRG/ssbio. Documentation and Jupyter notebook tutorials are available at http://ssbio.readthedocs.io/en/latest/. Interactive notebooks can be launched using Binder at https://mybinder.org/v2/gh/SBRG/ssbio/master?filepath=Binder.ipynb. Supplementary data are available at Bioinformatics online.
Hillig, Roman C; Urlinger, Stefanie; Fanghänel, Jörg; Brocks, Bodo; Haenel, Cornelia; Stark, Yvonne; Sülzle, Detlev; Svergun, Dmitri I; Baesler, Siegfried; Malawski, Guido; Moosmayer, Dieter; Menrad, Andreas; Schirner, Michael; Licha, Kai
2008-03-14
Molecular interactions between near-IR fluorescent probes and specific antibodies may be exploited to generate novel smart probes for diagnostic imaging. Using a new phage display technology, we developed such antibody Fab fragments with subnanomolar binding affinity for tetrasulfocyanine, a near-IR in vivo imaging agent. Unexpectedly, some Fabs induced redshifts of the dye absorption peak of up to 44 nm. This is the largest shift reported for a biological system so far. Crystal structure determination and absorption spectroscopy in the crystal in combination with microcalorimetry and small-angle X-ray scattering in solution revealed that the redshift is triggered by formation of a Fab dimer, with tetrasulfocyanine being buried in a fully closed protein cavity within the dimer interface. The derived principle of shifting the absorption peak of a symmetric dye via packaging within a Fab dimer interface may be transferred to other diagnostic fluorophores, opening the way towards smart imaging probes that change their wavelength upon interaction with an antibody.
USDA-ARS?s Scientific Manuscript database
Poultry products serve as the main source of Campylobacter jejuni subsp. jejuni (Cjj) infections in humans. Cjj infections are a leading cause of foodborne gastroenteritis and are a prevalent antecedent to Guillain-Barré syndrome (GBS). This study describes the genome of Cjj HS:19 strain RM1285 isol...
GANESH: software for customized annotation of genome regions.
Huntley, Derek; Hummerich, Holger; Smedley, Damian; Kittivoravitkul, Sasivimol; McCarthy, Mark; Little, Peter; Sergot, Marek
2003-09-01
GANESH is a software package designed to support the genetic analysis of regions of human and other genomes. It provides a set of components that may be assembled to construct a self-updating database of DNA sequence, mapping data, and annotations of possible genome features. Once one or more remote sources of data for the target region have been identified, all sequences for that region are downloaded, assimilated, and subjected to a (configurable) set of standard database-searching and genome-analysis packages. The results are stored in compressed form in a relational database, and are updated automatically on a regular schedule so that they are always immediately available in their most up-to-date versions. A Java front-end, executed as a stand alone application or web applet, provides a graphical interface for navigating the database and for viewing the annotations. There are facilities for importing and exporting data in the format of the Distributed Annotation System (DAS), enabling a GANESH database to be used as a component of a DAS configuration. The system has been used to construct databases for about a dozen regions of human chromosomes and for three regions of mouse chromosomes.
Lau, Cia-Hin; Suh, Yousin
2017-01-01
Adeno-associated virus (AAV) has shown promising therapeutic efficacy with a good safety profile in a wide range of animal models and human clinical trials. With the advent of clustered regulatory interspaced short palindromic repeat (CRISPR)-based genome-editing technologies, AAV provides one of the most suitable viral vectors to package, deliver, and express CRISPR components for targeted gene editing. Recent discoveries of smaller Cas9 orthologues have enabled the packaging of Cas9 nuclease and its chimeric guide RNA into a single AAV delivery vehicle for robust in vivo genome editing. Here, we discuss how the combined use of small Cas9 orthologues, tissue-specific minimal promoters, AAV serotypes, and different routes of administration has advanced the development of efficient and precise in vivo genome editing and comprehensively review the various AAV-CRISPR systems that have been effectively used in animals. We then discuss the clinical implications and potential strategies to overcome off-target effects, immunogenicity, and toxicity associated with CRISPR components and AAV delivery vehicles. Finally, we discuss ongoing non-viral-based ex vivo gene therapy clinical trials to underscore the current challenges and future prospects of CRISPR/Cas9 delivery for human therapeutics. PMID:29333255
Langevin Dynamics Simulations of Genome Packing in Bacteriophage
Forrey, Christopher; Muthukumar, M.
2006-01-01
We use Langevin dynamics simulations to study the process by which a coarse-grained DNA chain is packaged within an icosahedral container. We focus our inquiry on three areas of interest in viral packing: the evolving structure of the packaged DNA condensate; the packing velocity; and the internal buildup of energy and resultant forces. Each of these areas has been studied experimentally, and we find that we can qualitatively reproduce experimental results. However, our findings also suggest that the phage genome packing process is fundamentally different than that suggested by the inverse spool model. We suggest that packing in general does not proceed in the deterministic fashion of the inverse-spool model, but rather is stochastic in character. As the chain configuration becomes compressed within the capsid, the structure, energy, and packing velocity all become dependent upon polymer dynamics. That many observed features of the packing process are rooted in condensed-phase polymer dynamics suggests that statistical mechanics, rather than mechanics, should serve as the proper theoretical basis for genome packing. Finally we suggest that, as a result of an internal protein unique to bacteriophage T7, the T7 genome may be significantly more ordered than is true for bacteriophage in general. PMID:16617089
Langevin dynamics simulations of genome packing in bacteriophage.
Forrey, Christopher; Muthukumar, M
2006-07-01
We use Langevin dynamics simulations to study the process by which a coarse-grained DNA chain is packaged within an icosahedral container. We focus our inquiry on three areas of interest in viral packing: the evolving structure of the packaged DNA condensate; the packing velocity; and the internal buildup of energy and resultant forces. Each of these areas has been studied experimentally, and we find that we can qualitatively reproduce experimental results. However, our findings also suggest that the phage genome packing process is fundamentally different than that suggested by the inverse spool model. We suggest that packing in general does not proceed in the deterministic fashion of the inverse-spool model, but rather is stochastic in character. As the chain configuration becomes compressed within the capsid, the structure, energy, and packing velocity all become dependent upon polymer dynamics. That many observed features of the packing process are rooted in condensed-phase polymer dynamics suggests that statistical mechanics, rather than mechanics, should serve as the proper theoretical basis for genome packing. Finally we suggest that, as a result of an internal protein unique to bacteriophage T7, the T7 genome may be significantly more ordered than is true for bacteriophage in general.
Structural transitions in Cowpea chlorotic mottle virus (CCMV)
NASA Astrophysics Data System (ADS)
Liepold, Lars O.; Revis, Jennifer; Allen, Mark; Oltrogge, Luke; Young, Mark; Douglas, Trevor
2005-12-01
Viral capsids act as molecular containers for the encapsulation of genomic nucleic acid. These protein cages can also be used as constrained reaction vessels for packaging and entrapment of synthetic cargos. The icosahedral Cowpea chlorotic mottle virus (CCMV) is an excellent model for understanding the encapsulation and packaging of both genomic and synthetic materials. High-resolution structural information of the CCMV capsid has been invaluable for evaluating structure-function relationships in the assembled capsid but does not allow insight into the capsid dynamics. The dynamic nature of the CCMV capsid might play an important role in the biological function of the virus. The CCMV capsid undergoes a pH and metal ion dependent reversible structural transition where 60 separate pores in the capsid open or close, exposing the interior of the protein cage to the bulk medium. In addition, the highly basic N-terminal domain of the capsid, which is disordered in the crystal structure, plays a significant role in packaging the viral cargo. Interestingly, in limited proteolysis and mass spectrometry experiments the N-terminal domain is the first part of the subunit to be cleaved, confirming its dynamic nature. Based on our fundamental understanding of the capsid dynamics in CCMV, we have utilized these aspects to direct packaging of a range of synthetic materials including drugs and inorganic nanoparticles.
Smyth, Redmond P; Smith, Maureen R; Jousset, Anne-Caroline; Despons, Laurence; Laumond, Géraldine; Decoville, Thomas; Cattenoz, Pierre; Moog, Christiane; Jossinet, Fabrice; Mougel, Marylène; Paillart, Jean-Christophe; von Kleist, Max; Marquet, Roland
2018-05-18
Non-coding RNA regulatory elements are important for viral replication, making them promising targets for therapeutic intervention. However, regulatory RNA is challenging to detect and characterise using classical structure-function assays. Here, we present in cell Mutational Interference Mapping Experiment (in cell MIME) as a way to define RNA regulatory landscapes at single nucleotide resolution under native conditions. In cell MIME is based on (i) random mutation of an RNA target, (ii) expression of mutated RNA in cells, (iii) physical separation of RNA into functional and non-functional populations, and (iv) high-throughput sequencing to identify mutations affecting function. We used in cell MIME to define RNA elements within the 5' region of the HIV-1 genomic RNA (gRNA) that are important for viral replication in cells. We identified three distinct RNA motifs controlling intracellular gRNA production, and two distinct motifs required for gRNA packaging into virions. Our analysis reveals the 73AAUAAA78 polyadenylation motif within the 5' PolyA domain as a dual regulator of gRNA production and gRNA packaging, and demonstrates that a functional polyadenylation signal is required for viral packaging even though it negatively affects gRNA production.
Smith, Maureen R; Jousset, Anne-Caroline; Despons, Laurence; Laumond, Géraldine; Decoville, Thomas; Cattenoz, Pierre; Moog, Christiane; Jossinet, Fabrice; Mougel, Marylène; Paillart, Jean-Christophe
2018-01-01
Abstract Non-coding RNA regulatory elements are important for viral replication, making them promising targets for therapeutic intervention. However, regulatory RNA is challenging to detect and characterise using classical structure-function assays. Here, we present in cell Mutational Interference Mapping Experiment (in cell MIME) as a way to define RNA regulatory landscapes at single nucleotide resolution under native conditions. In cell MIME is based on (i) random mutation of an RNA target, (ii) expression of mutated RNA in cells, (iii) physical separation of RNA into functional and non-functional populations, and (iv) high-throughput sequencing to identify mutations affecting function. We used in cell MIME to define RNA elements within the 5′ region of the HIV-1 genomic RNA (gRNA) that are important for viral replication in cells. We identified three distinct RNA motifs controlling intracellular gRNA production, and two distinct motifs required for gRNA packaging into virions. Our analysis reveals the 73AAUAAA78 polyadenylation motif within the 5′ PolyA domain as a dual regulator of gRNA production and gRNA packaging, and demonstrates that a functional polyadenylation signal is required for viral packaging even though it negatively affects gRNA production. PMID:29514260
FunChIP: an R/Bioconductor package for functional classification of ChIP-seq shapes.
Parodi, Alice C L; Sangalli, Laura M; Vantini, Simone; Amati, Bruno; Secchi, Piercesare; Morelli, Marco J
2017-08-15
Chromatin Immunoprecipitation followed by sequencing (ChIP-seq) generates local accumulations of sequencing reads on the genome ("peaks"), which correspond to specific protein-DNA interactions or chromatin modifications. Peaks are detected by considering their total area above a background signal, usually neglecting their shapes, which instead may convey additional biological information. We present FunChIP, an R/Bioconductor package for clustering peaks according to a functional representation of their shapes: after approximating their profiles with cubic B-splines, FunChIP minimizes their functional distance and classifies the peaks applying a k-mean alignment and clustering algorithm. The whole pipeline is user-friendly and provides visualization functions for a quick inspection of the results. An application to the transcription factor Myc in 3T9 murine fibroblasts shows that clusters of peaks with different shapes are associated with different genomic locations and different transcriptional regulatory activity. The package is implemented in R and is available under Artistic Licence 2.0 from the Bioconductor website (http://bioconductor.org/packages/FunChIP). marco.morelli@iit.it. Supplementary data are available at Bioinformatics online. © The Author (2017). Published by Oxford University Press. All rights reserved. For Permissions, please email: journals.permissions@oup.com
Compact high voltage solid state switch
Glidden, Steven C.
2003-09-23
A compact, solid state, high voltage switch capable of high conduction current with a high rate of current risetime (high di/dt) that can be used to replace thyratrons in existing and new applications. The switch has multiple thyristors packaged in a single enclosure. Each thyristor has its own gate drive circuit that circuit obtains its energy from the energy that is being switched in the main circuit. The gate drives are triggered with a low voltage, low current pulse isolated by a small inexpensive transformer. The gate circuits can also be triggered with an optical signal, eliminating the trigger transformer altogether. This approach makes it easier to connect many thyristors in series to obtain the hold off voltages of greater than 80 kV.
Evidence that viral RNAs have evolved for efficient, two-stage packaging.
Borodavka, Alexander; Tuma, Roman; Stockley, Peter G
2012-09-25
Genome packaging is an essential step in virus replication and a potential drug target. Single-stranded RNA viruses have been thought to encapsidate their genomes by gradual co-assembly with capsid subunits. In contrast, using a single molecule fluorescence assay to monitor RNA conformation and virus assembly in real time, with two viruses from differing structural families, we have discovered that packaging is a two-stage process. Initially, the genomic RNAs undergo rapid and dramatic (approximately 20-30%) collapse of their solution conformations upon addition of cognate coat proteins. The collapse occurs with a substoichiometric ratio of coat protein subunits and is followed by a gradual increase in particle size, consistent with the recruitment of additional subunits to complete a growing capsid. Equivalently sized nonviral RNAs, including high copy potential in vivo competitor mRNAs, do not collapse. They do support particle assembly, however, but yield many aberrant structures in contrast to viral RNAs that make only capsids of the correct size. The collapse is specific to viral RNA fragments, implying that it depends on a series of specific RNA-protein interactions. For bacteriophage MS2, we have shown that collapse is driven by subsequent protein-protein interactions, consistent with the RNA-protein contacts occurring in defined spatial locations. Conformational collapse appears to be a distinct feature of viral RNA that has evolved to facilitate assembly. Aspects of this process mimic those seen in ribosome assembly.
Fei, Zhongyang; Guan, Chaoxu; Gao, Huijun; Zhongyang Fei; Chaoxu Guan; Huijun Gao; Fei, Zhongyang; Guan, Chaoxu; Gao, Huijun
2018-06-01
This paper is concerned with the exponential synchronization for master-slave chaotic delayed neural network with event trigger control scheme. The model is established on a network control framework, where both external disturbance and network-induced delay are taken into consideration. The desired aim is to synchronize the master and slave systems with limited communication capacity and network bandwidth. In order to save the network resource, we adopt a hybrid event trigger approach, which not only reduces the data package sending out, but also gets rid of the Zeno phenomenon. By using an appropriate Lyapunov functional, a sufficient criterion for the stability is proposed for the error system with extended ( , , )-dissipativity performance index. Moreover, hybrid event trigger scheme and controller are codesigned for network-based delayed neural network to guarantee the exponential synchronization between the master and slave systems. The effectiveness and potential of the proposed results are demonstrated through a numerical example.
Altruism, Empathy, and Sex Offender Treatment
ERIC Educational Resources Information Center
Ward, Tony; Durrant, Russil
2013-01-01
Treatment programs for serious offenders such as sex offenders typically include an empathy training component as part of a comprehensive intervention package. The reasons for doing so are partly based on research evidence indicating that social disconnection and relationship ruptures related to empathy failures often trigger offending, and also…
Metastable Polymers for On Demand Transient Electronic Packaging
2018-01-17
a triggerable polymer for engineering applications. 25 Approved for public release; distribution is unlimited. 6 REFERENCES (1) Aso, C.; Tagami, S...R. Advanced Materials 2014, 26, 7637. (4) Ito, H.; Willson, C. G. Polymer Engineering & Science 1983, 23, 1012. (5) Ito, H.; England, W. P.; Ueda, M
A novel packaging system for the generation of helper-free oncolytic MVM vector stocks.
Brandenburger, A; Russell, S
1996-10-01
MVM-based autonomous parvoviral vectors have been shown to target the expression of heterologous genes in neoplastic cells and are therefore of interest for cancer gene therapy. The traditional method for production of parvoviral vectors requires the cotransfection of vector and helper plasmids into MVM-permissive cell lines, but recombination between the cotransfected plasmids invariably gives rise to vector stocks that are heavily contaminated with wild-type MVM. Therefore, to minimise recombination between the vector and helper genomes we have utilised a cell line in which the MVM helper functions are expressed inducibly from a modified MVM genome that is stably integrated into the host cell chromosome. Using this MVM packaging cell line, we could reproducibly generate MVM vector stocks that contained no detectable helper virus.
LPmerge: an R package for merging genetic maps by linear programming.
Endelman, Jeffrey B; Plomion, Christophe
2014-06-01
Consensus genetic maps constructed from multiple populations are an important resource for both basic and applied research, including genome-wide association analysis, genome sequence assembly and studies of evolution. The LPmerge software uses linear programming to efficiently minimize the mean absolute error between the consensus map and the linkage maps from each population. This minimization is performed subject to linear inequality constraints that ensure the ordering of the markers in the linkage maps is preserved. When marker order is inconsistent between linkage maps, a minimum set of ordinal constraints is deleted to resolve the conflicts. LPmerge is on CRAN at http://cran.r-project.org/web/packages/LPmerge. © The Author 2014. Published by Oxford University Press. All rights reserved. For Permissions, please e-mail: journals.permissions@oup.com.
DOE Office of Scientific and Technical Information (OSTI.GOV)
Beck, Markus H.; Inman, Ross B.; Strand, Michael R.
2007-03-01
Polydnaviruses (PDVs) are distinguished by their unique association with parasitoid wasps and their segmented, double-stranded (ds) DNA genomes that are non-equimolar in abundance. Relatively little is actually known, however, about genome packaging or segment abundance of these viruses. Here, we conducted electron microscopy (EM) and real-time polymerase chain reaction (PCR) studies to characterize packaging and segment abundance of Microplitis demolitor bracovirus (MdBV). Like other PDVs, MdBV replicates in the ovaries of females where virions accumulate to form a suspension called calyx fluid. Wasps then inject a quantity of calyx fluid when ovipositing into hosts. The MdBV genome consists of 15more » segments that range from 3.6 (segment A) to 34.3 kb (segment O). EM analysis indicated that MdBV virions contain a single nucleocapsid that encapsidates one circular DNA of variable size. We developed a semi-quantitative real-time PCR assay using SYBR Green I. This assay indicated that five (J, O, H, N and B) segments of the MdBV genome accounted for more than 60% of the viral DNAs in calyx fluid. Estimates of relative segment abundance using our real-time PCR assay were also very similar to DNA size distributions determined from micrographs. Analysis of parasitized Pseudoplusia includens larvae indicated that copy number of MdBV segments C, B and J varied between hosts but their relative abundance within a host was virtually identical to their abundance in calyx fluid. Among-tissue assays indicated that each viral segment was most abundant in hemocytes and least abundant in salivary glands. However, the relative abundance of each segment to one another was similar in all tissues. We also found no clear relationship between MdBV segment and transcript abundance in hemocytes and fat body.« less
Software engineering the mixed model for genome-wide association studies on large samples.
Zhang, Zhiwu; Buckler, Edward S; Casstevens, Terry M; Bradbury, Peter J
2009-11-01
Mixed models improve the ability to detect phenotype-genotype associations in the presence of population stratification and multiple levels of relatedness in genome-wide association studies (GWAS), but for large data sets the resource consumption becomes impractical. At the same time, the sample size and number of markers used for GWAS is increasing dramatically, resulting in greater statistical power to detect those associations. The use of mixed models with increasingly large data sets depends on the availability of software for analyzing those models. While multiple software packages implement the mixed model method, no single package provides the best combination of fast computation, ability to handle large samples, flexible modeling and ease of use. Key elements of association analysis with mixed models are reviewed, including modeling phenotype-genotype associations using mixed models, population stratification, kinship and its estimation, variance component estimation, use of best linear unbiased predictors or residuals in place of raw phenotype, improving efficiency and software-user interaction. The available software packages are evaluated, and suggestions made for future software development.
Experimental comparison of forces resisting viral DNA packaging and driving DNA ejection
NASA Astrophysics Data System (ADS)
Keller, Nicholas; Berndsen, Zachary T.; Jardine, Paul J.; Smith, Douglas E.
2017-05-01
We compare forces resisting DNA packaging and forces driving DNA ejection in bacteriophage phi29 with theoretical predictions. Ejection of DNA from prohead-motor complexes is triggered by heating complexes after in vitro packaging and force is inferred from the suppression of ejection by applied osmotic pressure. Ejection force from 0 % to 80 % filling is found to be in quantitative agreement with predictions of a continuum mechanics model that assumes a repulsive DNA-DNA interaction potential based on DNA condensation studies and predicts an inverse-spool conformation. Force resisting DNA packaging from ˜80 % to 100 % filling inferred from optical tweezers studies is also consistent with the predictions of this model. The striking agreement with these two different measurements suggests that the overall energetics of DNA packaging is well described by the model. However, since electron microscopy studies of phi29 do not reveal a spool conformation, our findings suggest that the spool model overestimates the role of bending rigidity and underestimates the role of intrastrand repulsion. Below ˜80 % filling the inferred forces resisting packaging are unexpectedly lower than the inferred ejection forces, suggesting that in this filling range the forces are less accurately determined or strongly temperature dependent.
Experimental comparison of forces resisting viral DNA packaging and driving DNA ejection.
Keller, Nicholas; Berndsen, Zachary T; Jardine, Paul J; Smith, Douglas E
2017-05-01
We compare forces resisting DNA packaging and forces driving DNA ejection in bacteriophage phi29 with theoretical predictions. Ejection of DNA from prohead-motor complexes is triggered by heating complexes after in vitro packaging and force is inferred from the suppression of ejection by applied osmotic pressure. Ejection force from 0% to 80% filling is found to be in quantitative agreement with predictions of a continuum mechanics model that assumes a repulsive DNA-DNA interaction potential based on DNA condensation studies and predicts an inverse-spool conformation. Force resisting DNA packaging from ∼80% to 100% filling inferred from optical tweezers studies is also consistent with the predictions of this model. The striking agreement with these two different measurements suggests that the overall energetics of DNA packaging is well described by the model. However, since electron microscopy studies of phi29 do not reveal a spool conformation, our findings suggest that the spool model overestimates the role of bending rigidity and underestimates the role of intrastrand repulsion. Below ∼80% filling the inferred forces resisting packaging are unexpectedly lower than the inferred ejection forces, suggesting that in this filling range the forces are less accurately determined or strongly temperature dependent.
Bailén, Gloria; Guillén, Fabián; Castillo, Salvador; Serrano, María; Valero, Daniel; Martínez-Romero, Domingo
2006-03-22
Ethylene triggers the ripening process of tomato affecting the storage durability and shelf life (loss of quality) and inducing fruit decay. In this paper, an active packaging has been developed on the basis of the combination of modified atmosphere packaging (MAP) and the addition of granular-activated carbon (GAC) alone or impregnated with palladium as a catalyst (GAC-Pd). A steady-state atmosphere was 4 and 10 kPa for O2 and CO2 in control packages, while it was 8 and 7 kPa for O2 and CO2 in treated ones. The addition of GAC-Pd led to the lower ethylene accumulation inside packages, while the higher was obtained in controls. The parameters related to ripening showed that treated tomatoes exhibited a reduction in color evolution, softening, and weight loss, especially for GAC-Pd treatment. Moreover, these treatments were also effective in delaying tomato decay. After sensorial panel, tomatoes treated with GAC-Pd received the higher scores in terms of sweetness, firmness, juiciness, color, odor, and flavor. Results from the GC-MS analysis of the MAP headspace showed that 23 volatile compounds were identified in control packages, with these volatiles being significantly reduced in MAP-treated packages, which was correlated to the odor intensity detected by panelists after bag opening.
van Bel, Nikki; van der Velden, Yme; Bonnard, Damien; Le Rouzic, Erwann; Das, Atze T; Benarous, Richard; Berkhout, Ben
2014-01-01
The viral integrase (IN) is an essential protein for HIV-1 replication. IN inserts the viral dsDNA into the host chromosome, thereby aided by the cellular co-factor LEDGF/p75. Recently a new class of integrase inhibitors was described: allosteric IN inhibitors (ALLINIs). Although designed to interfere with the IN-LEDGF/p75 interaction to block HIV DNA integration during the early phase of HIV-1 replication, the major impact was surprisingly found on the process of virus maturation during the late phase, causing a reverse transcription defect upon infection of target cells. Virus particles produced in the presence of an ALLINI are misformed with the ribonucleoprotein located outside the virus core. Virus assembly and maturation are highly orchestrated and regulated processes in which several viral proteins and RNA molecules closely interact. It is therefore of interest to study whether ALLINIs have unpredicted pleiotropic effects on these RNA-related processes. We confirm that the ALLINI BI-D inhibits virus replication and that the produced virus is non-infectious. Furthermore, we show that the wild-type level of HIV-1 genomic RNA is packaged in virions and these genomes are in a dimeric state. The tRNAlys3 primer for reverse transcription was properly placed on this genomic RNA and could be extended ex vivo. In addition, the packaged reverse transcriptase enzyme was fully active when extracted from virions. As the RNA and enzyme components for reverse transcription are properly present in virions produced in the presence of BI-D, the inhibition of reverse transcription is likely to reflect the mislocalization of the components in the aberrant virus particle.
DOE Office of Scientific and Technical Information (OSTI.GOV)
Schaumberg, Andrew
The Omics Tools package provides several small trivial tools for work in genomics. This single portable package, the omics.jar file, is a toolbox that works in any Java-based environment, including PCs, Macs, and supercomputers. The number of tools is expected to grow. One tool (called cmsearch.hadoop or cmsearch.local), calls the external cmsearch program to predict non-coding RNA in a genome. The cmsearch program is part of the third-party Infernal package. Omics Tools does not contain Infernal. Infernal may be installed separately. The cmsearch.hadoop subtool requires Apache Hadoop and runs on a supercomputer, though cmsearch.local does not and runs on amore » server. Omics Tools does not contain Hadoop. Hadoop mat be installed separartely The other tools (cmgbk, cmgff, fastats, pal, randgrp, randgrpr, randsub) do not interface with third-party tools. Omics Tools is written in Java and Scala programming languages. Invoking the help command shows currently available tools, as shown below: schaumbe@gpint06:~/proj/omics$ java -jar omics.jar help Known commands are: cmgbk : compare cmsearch and GenBank Infernal hits cmgff : compare hits among two GFF (version 3) files cmsearch.hadoop : find Infernal hits in a genome, on your supercomputer cmsearch.local : find Infernal hits in a genome, on your workstation fastats : FASTA stats, e.g. # bases, GC content pal : stem-loop motif detection by palindromic sequence search (code stub) randgrp : random subsample without replacement, of groups randgrpr : random subsample with replacement, of groups (fast) randsub : random subsample without replacement, of file lines For more help regarding a particular command, use: java -jar omics.jar command help Usage: java -jar omics.jar command args« less
Zhang, Jianwei; Kudrna, Dave; Mu, Ting; Li, Weiming; Copetti, Dario; Yu, Yeisoo; Goicoechea, Jose Luis; Lei, Yang; Wing, Rod A
2016-10-15
Next generation sequencing technologies have revolutionized our ability to rapidly and affordably generate vast quantities of sequence data. Once generated, raw sequences are assembled into contigs or scaffolds. However, these assemblies are mostly fragmented and inaccurate at the whole genome scale, largely due to the inability to integrate additional informative datasets (e.g. physical, optical and genetic maps). To address this problem, we developed a semi-automated software tool-Genome Puzzle Master (GPM)-that enables the integration of additional genomic signposts to edit and build 'new-gen-assemblies' that result in high-quality 'annotation-ready' pseudomolecules. With GPM, loaded datasets can be connected to each other via their logical relationships which accomplishes tasks to 'group,' 'merge,' 'order and orient' sequences in a draft assembly. Manual editing can also be performed with a user-friendly graphical interface. Final pseudomolecules reflect a user's total data package and are available for long-term project management. GPM is a web-based pipeline and an important part of a Laboratory Information Management System (LIMS) which can be easily deployed on local servers for any genome research laboratory. The GPM (with LIMS) package is available at https://github.com/Jianwei-Zhang/LIMS CONTACTS: jzhang@mail.hzau.edu.cn or rwing@mail.arizona.eduSupplementary information: Supplementary data are available at Bioinformatics online. © The Author 2016. Published by Oxford University Press.
Hammoud, Saher Sue; Nix, David A; Hammoud, Ahmad O; Gibson, Mark; Cairns, Bradley R; Carrell, Douglas T
2011-09-01
The sperm chromatin of fertile men retains a small number of nucleosomes that are enriched at developmental gene promoters and imprinted gene loci. This unique chromatin packaging at certain gene promoters provides these genomic loci the ability to convey instructive epigenetic information to the zygote, potentially expanding the role and significance of the sperm epigenome in embryogenesis. We hypothesize that changes in chromatin packaging may be associated with poor reproductive outcome. Seven patients with reproductive dysfunction were recruited: three had unexplained poor embryogenesis during IVF and four were diagnosed with male infertility and previously shown to have altered protamination. Genome-wide analysis of the location of histones and histone modifications was analyzed by isolation and purification of DNA bound to histones and protamines. The histone-bound fraction of DNA was analyzed using high-throughput sequencing, both initially and following chromatin immunoprecipitation. The protamine-bound fraction was hybridized to agilent arrays. DNA methylation was examined using bisulfite sequencing. Unlike fertile men, five of seven infertile men had non-programmatic (randomly distributed) histone retention genome-wide. Interestingly, in contrast to the total histone pool, the localization of H3 Lysine 4 methylation (H3K4me) or H3 Lysine 27 methylation (H3K27me) was highly similar in the gametes of infertile men compared with fertile men. However, there was a reduction in the amount of H3K4me or H3K27me retained at developmental transcription factors and certain imprinted genes. Finally, the methylation status of candidate developmental promoters and imprinted loci were altered in a subset of the infertile men. This initial genome-wide analysis of epigenetic markings in the sperm of infertile men demonstrates differences in composition and epigenetic markings compared with fertile men, especially at certain imprinted and developmental loci. Although no single locus displays a complete change in chromatin packaging or DNA modification, the data suggest that moderate changes throughout the genome exist and may have a cumulative detrimental effect on fecundity.
DNA packaging in viral capsids with peptide arms.
Cao, Qianqian; Bachmann, Michael
2017-01-18
Strong chain rigidity and electrostatic self-repulsion of packed double-stranded DNA in viruses require a molecular motor to pull the DNA into the capsid. However, what is the role of electrostatic interactions between different charged components in the packaging process? Though various theories and computer simulation models were developed for the understanding of viral assembly and packaging dynamics of the genome, long-range electrostatic interactions and capsid structure have typically been neglected or oversimplified. By means of molecular dynamics simulations, we explore the effects of electrostatic interactions on the packaging dynamics of DNA based on a coarse-grained DNA and capsid model by explicitly including peptide arms (PAs), linked to the inner surface of the capsid, and counterions. Our results indicate that the electrostatic interactions between PAs, DNA, and counterions have a significant influence on the packaging dynamics. We also find that the packed DNA conformations are largely affected by the structure of the PA layer, but the packaging rate is insensitive to the layer structure.
Dostálková, Alžběta; Kaufman, Filip; Křížová, Ivana; Kultová, Anna; Strohalmová, Karolína; Hadravová, Romana; Ruml, Tomáš; Rumlová, Michaela
2018-05-15
In addition to specific RNA-binding zinc finger domains, the retroviral Gag polyprotein contains clusters of basic amino acid residues that are thought to support Gag-viral genomic RNA (gRNA) interactions. One of these clusters is the basic K 16 NK 18 EK 20 region, located upstream of the first zinc finger of the Mason-Pfizer monkey virus (M-PMV) nucleocapsid (NC) protein. To investigate the role of this basic region in the M-PMV life cycle, we used a combination of in vivo and in vitro methods to study a series of mutants in which the overall charge of this region was more positive (RNRER), more negative (AEAEA), or neutral (AAAAA). The mutations markedly affected gRNA incorporation and the onset of reverse transcription. The introduction of a more negative charge (AEAEA) significantly reduced the incorporation of M-PMV gRNA into nascent particles. Moreover, the assembly of immature particles of the AEAEA Gag mutant was relocated from the perinuclear region to the plasma membrane. In contrast, an enhancement of the basicity of this region of M-PMV NC (RNRER) caused a substantially more efficient incorporation of gRNA, subsequently resulting in an increase in M-PMV RNRER infectivity. Nevertheless, despite the larger amount of gRNA packaged by the RNRER mutant, the onset of reverse transcription was delayed in comparison to that of the wild type. Our data clearly show the requirement for certain positively charged amino acid residues upstream of the first zinc finger for proper gRNA incorporation, assembly of immature particles, and proceeding of reverse transcription. IMPORTANCE We identified a short sequence within the Gag polyprotein that, together with the zinc finger domains and the previously identified RKK motif, contributes to the packaging of genomic RNA (gRNA) of Mason-Pfizer monkey virus (M-PMV). Importantly, in addition to gRNA incorporation, this basic region (KNKEK) at the N terminus of the nucleocapsid protein is crucial for the onset of reverse transcription. Mutations that change the positive charge of the region to a negative one significantly reduced specific gRNA packaging. The assembly of immature particles of this mutant was reoriented from the perinuclear region to the plasma membrane. On the contrary, an enhancement of the basic character of this region increased both the efficiency of gRNA packaging and the infectivity of the virus. However, the onset of reverse transcription was delayed even in this mutant. In summary, the basic region in M-PMV Gag plays a key role in the packaging of genomic RNA and, consequently, in assembly and reverse transcription. Copyright © 2018 American Society for Microbiology.
methylPipe and compEpiTools: a suite of R packages for the integrative analysis of epigenomics data.
Kishore, Kamal; de Pretis, Stefano; Lister, Ryan; Morelli, Marco J; Bianchi, Valerio; Amati, Bruno; Ecker, Joseph R; Pelizzola, Mattia
2015-09-29
Numerous methods are available to profile several epigenetic marks, providing data with different genome coverage and resolution. Large epigenomic datasets are then generated, and often combined with other high-throughput data, including RNA-seq, ChIP-seq for transcription factors (TFs) binding and DNase-seq experiments. Despite the numerous computational tools covering specific steps in the analysis of large-scale epigenomics data, comprehensive software solutions for their integrative analysis are still missing. Multiple tools must be identified and combined to jointly analyze histone marks, TFs binding and other -omics data together with DNA methylation data, complicating the analysis of these data and their integration with publicly available datasets. To overcome the burden of integrating various data types with multiple tools, we developed two companion R/Bioconductor packages. The former, methylPipe, is tailored to the analysis of high- or low-resolution DNA methylomes in several species, accommodating (hydroxy-)methyl-cytosines in both CpG and non-CpG sequence context. The analysis of multiple whole-genome bisulfite sequencing experiments is supported, while maintaining the ability of integrating targeted genomic data. The latter, compEpiTools, seamlessly incorporates the results obtained with methylPipe and supports their integration with other epigenomics data. It provides a number of methods to score these data in regions of interest, leading to the identification of enhancers, lncRNAs, and RNAPII stalling/elongation dynamics. Moreover, it allows a fast and comprehensive annotation of the resulting genomic regions, and the association of the corresponding genes with non-redundant GeneOntology terms. Finally, the package includes a flexible method based on heatmaps for the integration of various data types, combining annotation tracks with continuous or categorical data tracks. methylPipe and compEpiTools provide a comprehensive Bioconductor-compliant solution for the integrative analysis of heterogeneous epigenomics data. These packages are instrumental in providing biologists with minimal R skills a complete toolkit facilitating the analysis of their own data, or in accelerating the analyses performed by more experienced bioinformaticians.
MIRA: An R package for DNA methylation-based inference of regulatory activity.
Lawson, John T; Tomazou, Eleni M; Bock, Christoph; Sheffield, Nathan C
2018-03-01
DNA methylation contains information about the regulatory state of the cell. MIRA aggregates genome-scale DNA methylation data into a DNA methylation profile for independent region sets with shared biological annotation. Using this profile, MIRA infers and scores the collective regulatory activity for each region set. MIRA facilitates regulatory analysis in situations where classical regulatory assays would be difficult and allows public sources of open chromatin and protein binding regions to be leveraged for novel insight into the regulatory state of DNA methylation datasets. R package available on Bioconductor: http://bioconductor.org/packages/release/bioc/html/MIRA.html. nsheffield@virginia.edu.
Berg, Ingrid L; Neumann, Rita; Lam, Kwan-Wood G; Sarbajna, Shriparna; Odenthal-Hesse, Linda; May, Celia A; Jeffreys, Alec J
2010-10-01
PRDM9 has recently been identified as a likely trans regulator of meiotic recombination hot spots in humans and mice. PRDM9 contains a zinc finger array that, in humans, can recognize a short sequence motif associated with hot spots, with binding to this motif possibly triggering hot-spot activity via chromatin remodeling. We now report that human genetic variation at the PRDM9 locus has a strong effect on sperm hot-spot activity, even at hot spots lacking the sequence motif. Subtle changes within the zinc finger array can create hot-spot nonactivating or enhancing variants and can even trigger the appearance of a new hot spot, suggesting that PRDM9 is a major global regulator of hot spots in humans. Variation at the PRDM9 locus also influences aspects of genome instability-specifically, a megabase-scale rearrangement underlying two genomic disorders as well as minisatellite instability-implicating PRDM9 as a risk factor for some pathological genome rearrangements.
Berg, Ingrid L.; Neumann, Rita; Lam, Kwan-Wood G.; Sarbajna, Shriparna; Odenthal-Hesse, Linda; May, Celia A.; Jeffreys, Alec J.
2011-01-01
PRDM9 has recently been identified as a likely trans-regulator of meiotic recombination hot spots in humans and mice1-3. The protein contains a zinc finger array that in humans can recognise a short sequence motif associated with hot spots4, with binding to this motif possibly triggering hot-spot activity via chromatin remodelling5. We now show that variation in the zinc finger array in humans has a profound effect on sperm hot-spot activity, even at hot spots lacking the sequence motif. Very subtle changes within the array can create hot-spot non-activating and enhancing alleles, and even trigger the appearance of a new hot spot. PRDM9 thus appears to be the preeminent global regulator of hot spots in humans. Variation at this locus also influences aspects of genome instability, specifically a megabase-scale rearrangement underlying two genomic disorders6 as well as minisatellite instability7, implicating PRDM9 as a risk factor for some pathological genome rearrangements. PMID:20818382
Feng, Hui; Beck, Jürgen; Nassal, Michael; Hu, Kang-hong
2011-01-01
Background The specific interaction between hepatitis B virus (HBV) polymerase (P protein) and the ε RNA stem-loop on pregenomic (pg) RNA is crucial for viral replication. It triggers both pgRNA packaging and reverse transcription and thus represents an attractive antiviral target. RNA decoys mimicking ε in P protein binding but not supporting replication might represent novel HBV inhibitors. However, because generation of recombinant enzymatically active HBV polymerase is notoriously difficult, such decoys have as yet not been identified. Methodology/Principal Findings Here we used a SELEX approach, based on a new in vitro reconstitution system exploiting a recombinant truncated HBV P protein (miniP), to identify potential ε decoys in two large ε RNA pools with randomized upper stem. Selection of strongly P protein binding RNAs correlated with an unexpected strong enrichment of A residues. Two aptamers, S6 and S9, displayed particularly high affinity and specificity for miniP in vitro, yet did not support viral replication when part of a complete HBV genome. Introducing S9 RNA into transiently HBV producing HepG2 cells strongly suppressed pgRNA packaging and DNA synthesis, indicating the S9 RNA can indeed act as an ε decoy that competitively inhibits P protein binding to the authentic ε signal on pgRNA. Conclusions/Significance This study demonstrates the first successful identification of human HBV ε aptamers by an in vitro SELEX approach. Effective suppression of HBV replication by the S9 aptamer provides proof-of-principle for the ability of ε decoy RNAs to interfere with viral P-ε complex formation and suggests that S9-like RNAs may further be developed into useful therapeutics against chronic hepatitis B. PMID:22125633
Identification of pathogen genomic variants through an integrated pipeline
2014-01-01
Background Whole-genome sequencing represents a powerful experimental tool for pathogen research. We present methods for the analysis of small eukaryotic genomes, including a streamlined system (called Platypus) for finding single nucleotide and copy number variants as well as recombination events. Results We have validated our pipeline using four sets of Plasmodium falciparum drug resistant data containing 26 clones from 3D7 and Dd2 background strains, identifying an average of 11 single nucleotide variants per clone. We also identify 8 copy number variants with contributions to resistance, and report for the first time that all analyzed amplification events are in tandem. Conclusions The Platypus pipeline provides malaria researchers with a powerful tool to analyze short read sequencing data. It provides an accurate way to detect SNVs using known software packages, and a novel methodology for detection of CNVs, though it does not currently support detection of small indels. We have validated that the pipeline detects known SNVs in a variety of samples while filtering out spurious data. We bundle the methods into a freely available package. PMID:24589256
A Mutation in UL15 of Herpes Simplex Virus 1 That Reduces Packaging of Cleaved Genomes▿
Yang, Kui; Wills, Elizabeth G.; Baines, Joel D.
2011-01-01
Herpesvirus genomic DNA is cleaved from concatemers that accumulate in infected cell nuclei. Genomic DNA is inserted into preassembled capsids through a unique portal vertex. Extensive analyses of viral mutants have indicated that intact capsids, the portal vertex, and all components of a tripartite terminase enzyme are required to both cleave and package viral DNA, suggesting that DNA cleavage and packaging are inextricably linked. Because the processes have not been functionally separable, it has been difficult to parse the roles of individual proteins in the DNA cleavage/packaging reaction. In the present study, a virus bearing the deletion of codons 400 to 420 of UL15, encoding a terminase component, was analyzed. This virus, designated vJB27, failed to replicate on noncomplementing cells but cleaved concatemeric DNA to ca. 35 to 98% of wild-type levels. No DNA cleavage was detected in cells infected with a UL15-null virus or a virus lacking UL15 codons 383 to 385, comprising a motif proposed to couple ATP hydrolysis to DNA translocation. The amount of vJB27 DNA protected from DNase I digestion was reduced compared to the wild-type virus by 6.5- to 200-fold, depending on the DNA fragment analyzed, thus indicating a profound defect in DNA packaging. Capsids containing viral DNA were not detected in vJB27-infected cells, as determined by electron microscopy. These data suggest that pUL15 plays an essential role in DNA translocation into the capsid and indicate that this function is separable from its role in DNA cleavage. PMID:21880766
Baig, Tayyba T.; Lanchy, Jean-Marc; Lodmell, J. Stephen
2009-01-01
The packaging signal (ψ) of human immunodeficiency virus type 2 (HIV-2) is present in the 5′ noncoding region of RNA and contains a 10-nucleotide palindrome (pal; 5′-392-GGAGUGCUCC) located upstream of the dimerization signal stem-loop 1 (SL1). pal has been shown to be functionally important in vitro and in vivo. We previously showed that the 3′ side of pal (GCUCC-3′) is involved in base-pairing interactions with a sequence downstream of SL1 to make an extended SL1, which is important for replication in vivo and the regulation of dimerization in vitro. However, the role of the 5′ side of pal (5′-GGAGU) was less clear. Here, we characterized this role using an in vivo SELEX approach. We produced a population of HIV-2 DNA genomes with random sequences within the 5′ side of pal and transfected these into COS-7 cells. Viruses from COS-7 cells were used to infect C8166 permissive cells. After several weeks of serial passage in C8166 cells, surviving viruses were sequenced. On the 5′ side of pal there was a striking convergence toward a GGRGN consensus sequence. Individual clones with consensus and nonconsensus sequences were tested in infectivity and packaging assays. Analysis of individuals that diverged from the consensus sequence showed normal viral RNA and protein synthesis but had replication defects and impaired RNA packaging. These findings clearly indicate that the GGRG motif is essential for viral replication and genomic RNA packaging. PMID:18971263
2013-01-01
Background Conventional luteal support packages are inadequate to facilitate a fresh transfer after GnRH agonist (GnRHa) trigger in patients at high risk of developing ovarian hyperstimulation syndrome (OHSS). By providing intensive luteal-phase support with oestradiol and progesterone satisfactory implantation rates can be sustained. The objective of this study was to assess the live-birth rate and incidence of OHSS after GnRHa trigger and intensive luteal steroid support compared to traditional hCG trigger and conventional luteal support in OHSS high risk Asian patients. Methods We conducted a retrospective cohort study of 363 women exposed to GnRHa triggering with intensive luteal support compared with 257 women exposed to conventional hCG triggering. Women at risk of OHSS were defined by ovarian response ≥15 follicles ≥12 mm on the day of the trigger. Results Live-birth rates were similar in both groups GnRHa vs hCG; 29.8% vs 29.2% (p = 0.69). One late onset severe OHSS case was observed in the GnRHa trigger group (0.3%) compared to 18 cases (7%) after hCG trigger. Conclusions GnRHa trigger combined with intensive luteal steroid support in this group of OHSS high risk Asian patients can facilitate fresh embryo transfer, however, in contrast to previous reports the occurrence of late onset OHSS was not completely eliminated. PMID:24369069
Genome-Assisted Prediction of Quantitative Traits Using the R Package sommer.
Covarrubias-Pazaran, Giovanny
2016-01-01
Most traits of agronomic importance are quantitative in nature, and genetic markers have been used for decades to dissect such traits. Recently, genomic selection has earned attention as next generation sequencing technologies became feasible for major and minor crops. Mixed models have become a key tool for fitting genomic selection models, but most current genomic selection software can only include a single variance component other than the error, making hybrid prediction using additive, dominance and epistatic effects unfeasible for species displaying heterotic effects. Moreover, Likelihood-based software for fitting mixed models with multiple random effects that allows the user to specify the variance-covariance structure of random effects has not been fully exploited. A new open-source R package called sommer is presented to facilitate the use of mixed models for genomic selection and hybrid prediction purposes using more than one variance component and allowing specification of covariance structures. The use of sommer for genomic prediction is demonstrated through several examples using maize and wheat genotypic and phenotypic data. At its core, the program contains three algorithms for estimating variance components: Average information (AI), Expectation-Maximization (EM) and Efficient Mixed Model Association (EMMA). Kernels for calculating the additive, dominance and epistatic relationship matrices are included, along with other useful functions for genomic analysis. Results from sommer were comparable to other software, but the analysis was faster than Bayesian counterparts in the magnitude of hours to days. In addition, ability to deal with missing data, combined with greater flexibility and speed than other REML-based software was achieved by putting together some of the most efficient algorithms to fit models in a gentle environment such as R.
In trans paired nicking triggers seamless genome editing without double-stranded DNA cutting.
Chen, Xiaoyu; Janssen, Josephine M; Liu, Jin; Maggio, Ignazio; 't Jong, Anke E J; Mikkers, Harald M M; Gonçalves, Manuel A F V
2017-09-22
Precise genome editing involves homologous recombination between donor DNA and chromosomal sequences subjected to double-stranded DNA breaks made by programmable nucleases. Ideally, genome editing should be efficient, specific, and accurate. However, besides constituting potential translocation-initiating lesions, double-stranded DNA breaks (targeted or otherwise) are mostly repaired through unpredictable and mutagenic non-homologous recombination processes. Here, we report that the coordinated formation of paired single-stranded DNA breaks, or nicks, at donor plasmids and chromosomal target sites by RNA-guided nucleases based on CRISPR-Cas9 components, triggers seamless homology-directed gene targeting of large genetic payloads in human cells, including pluripotent stem cells. Importantly, in addition to significantly reducing the mutagenicity of the genome modification procedure, this in trans paired nicking strategy achieves multiplexed, single-step, gene targeting, and yields higher frequencies of accurately edited cells when compared to the standard double-stranded DNA break-dependent approach.CRISPR-Cas9-based gene editing involves double-strand breaks at target sequences, which are often repaired by mutagenic non-homologous end-joining. Here the authors use Cas9 nickases to generate coordinated single-strand breaks in donor and target DNA for precise homology-directed gene editing.
Kakisaka, Michinori; Yamada, Kazunori; Yamaji-Hasegawa, Akiko; Kobayashi, Toshihide; Aida, Yoko
2016-09-01
To be incorporated into progeny virions, the viral genome must be transported to the inner leaflet of the plasma membrane (PM) and accumulate there. Some viruses utilize lipid components to assemble at the PM. For example, simian virus 40 (SV40) targets the ganglioside GM1 and human immunodeficiency virus type 1 (HIV-1) utilizes phosphatidylinositol (4,5) bisphosphate [PI(4,5)P2]. Recent studies clearly indicate that Rab11-mediated recycling endosomes are required for influenza A virus (IAV) trafficking of vRNPs to the PM but it remains unclear how IAV vRNP localized or accumulate underneath the PM for viral genome incorporation into progeny virions. In this study, we found that the second intrinsically disordered region (IDR2) of NP regulates two binding steps involved in viral genome packaging. First, IDR2 facilitates NP oligomer binding to viral RNA to form vRNP. Secondly, vRNP assemble by interacting with PI(4,5)P2 at the PM via IDR2. These findings suggest that PI(4,5)P2 functions as the determinant of vRNP accumulation at the PM. Copyright © 2016 Elsevier Inc. All rights reserved.
Using Kepler for Tool Integration in Microarray Analysis Workflows.
Gan, Zhuohui; Stowe, Jennifer C; Altintas, Ilkay; McCulloch, Andrew D; Zambon, Alexander C
Increasing numbers of genomic technologies are leading to massive amounts of genomic data, all of which requires complex analysis. More and more bioinformatics analysis tools are being developed by scientist to simplify these analyses. However, different pipelines have been developed using different software environments. This makes integrations of these diverse bioinformatics tools difficult. Kepler provides an open source environment to integrate these disparate packages. Using Kepler, we integrated several external tools including Bioconductor packages, AltAnalyze, a python-based open source tool, and R-based comparison tool to build an automated workflow to meta-analyze both online and local microarray data. The automated workflow connects the integrated tools seamlessly, delivers data flow between the tools smoothly, and hence improves efficiency and accuracy of complex data analyses. Our workflow exemplifies the usage of Kepler as a scientific workflow platform for bioinformatics pipelines.
Micro- and nanoscale devices for the investigation of epigenetics and chromatin dynamics
NASA Astrophysics Data System (ADS)
Aguilar, Carlos A.; Craighead, Harold G.
2013-10-01
Deoxyribonucleic acid (DNA) is the blueprint on which life is based and transmitted, but the way in which chromatin -- a dynamic complex of nucleic acids and proteins -- is packaged and behaves in the cellular nucleus has only begun to be investigated. Epigenetic modifications sit 'on top of' the genome and affect how DNA is compacted into chromatin and transcribed into ribonucleic acid (RNA). The packaging and modifications around the genome have been shown to exert significant influence on cellular behaviour and, in turn, human development and disease. However, conventional techniques for studying epigenetic or conformational modifications of chromosomes have inherent limitations and, therefore, new methods based on micro- and nanoscale devices have been sought. Here, we review the development of these devices and explore their use in the study of DNA modifications, chromatin modifications and higher-order chromatin structures.
Tebel, Katrin; Boldt, Vivien; Steininger, Anne; Port, Matthias; Ebert, Grit; Ullmann, Reinhard
2017-01-06
The analysis of DNA copy number variants (CNV) has increasing impact in the field of genetic diagnostics and research. However, the interpretation of CNV data derived from high resolution array CGH or NGS platforms is complicated by the considerable variability of the human genome. Therefore, tools for multidimensional data analysis and comparison of patient cohorts are needed to assist in the discrimination of clinically relevant CNVs from others. We developed GenomeCAT, a standalone Java application for the analysis and integrative visualization of CNVs. GenomeCAT is composed of three modules dedicated to the inspection of single cases, comparative analysis of multidimensional data and group comparisons aiming at the identification of recurrent aberrations in patients sharing the same phenotype, respectively. Its flexible import options ease the comparative analysis of own results derived from microarray or NGS platforms with data from literature or public depositories. Multidimensional data obtained from different experiment types can be merged into a common data matrix to enable common visualization and analysis. All results are stored in the integrated MySQL database, but can also be exported as tab delimited files for further statistical calculations in external programs. GenomeCAT offers a broad spectrum of visualization and analysis tools that assist in the evaluation of CNVs in the context of other experiment data and annotations. The use of GenomeCAT does not require any specialized computer skills. The various R packages implemented for data analysis are fully integrated into GenomeCATs graphical user interface and the installation process is supported by a wizard. The flexibility in terms of data import and export in combination with the ability to create a common data matrix makes the program also well suited as an interface between genomic data from heterogeneous sources and external software tools. Due to the modular architecture the functionality of GenomeCAT can be easily extended by further R packages or customized plug-ins to meet future requirements.
Altermann, Eric; Lu, Jingli; McCulloch, Alan
2017-01-01
Expert curated annotation remains one of the critical steps in achieving a reliable biological relevant annotation. Here we announce the release of GAMOLA2, a user friendly and comprehensive software package to process, annotate and curate draft and complete bacterial, archaeal, and viral genomes. GAMOLA2 represents a wrapping tool to combine gene model determination, functional Blast, COG, Pfam, and TIGRfam analyses with structural predictions including detection of tRNAs, rRNA genes, non-coding RNAs, signal protein cleavage sites, transmembrane helices, CRISPR repeats and vector sequence contaminations. GAMOLA2 has already been validated in a wide range of bacterial and archaeal genomes, and its modular concept allows easy addition of further functionality in future releases. A modified and adapted version of the Artemis Genome Viewer (Sanger Institute) has been developed to leverage the additional features and underlying information provided by the GAMOLA2 analysis, and is part of the software distribution. In addition to genome annotations, GAMOLA2 features, among others, supplemental modules that assist in the creation of custom Blast databases, annotation transfers between genome versions, and the preparation of Genbank files for submission via the NCBI Sequin tool. GAMOLA2 is intended to be run under a Linux environment, whereas the subsequent visualization and manual curation in Artemis is mobile and platform independent. The development of GAMOLA2 is ongoing and community driven. New functionality can easily be added upon user requests, ensuring that GAMOLA2 provides information relevant to microbiologists. The software is available free of charge for academic use. PMID:28386247
Altermann, Eric; Lu, Jingli; McCulloch, Alan
2017-01-01
Expert curated annotation remains one of the critical steps in achieving a reliable biological relevant annotation. Here we announce the release of GAMOLA2, a user friendly and comprehensive software package to process, annotate and curate draft and complete bacterial, archaeal, and viral genomes. GAMOLA2 represents a wrapping tool to combine gene model determination, functional Blast, COG, Pfam, and TIGRfam analyses with structural predictions including detection of tRNAs, rRNA genes, non-coding RNAs, signal protein cleavage sites, transmembrane helices, CRISPR repeats and vector sequence contaminations. GAMOLA2 has already been validated in a wide range of bacterial and archaeal genomes, and its modular concept allows easy addition of further functionality in future releases. A modified and adapted version of the Artemis Genome Viewer (Sanger Institute) has been developed to leverage the additional features and underlying information provided by the GAMOLA2 analysis, and is part of the software distribution. In addition to genome annotations, GAMOLA2 features, among others, supplemental modules that assist in the creation of custom Blast databases, annotation transfers between genome versions, and the preparation of Genbank files for submission via the NCBI Sequin tool. GAMOLA2 is intended to be run under a Linux environment, whereas the subsequent visualization and manual curation in Artemis is mobile and platform independent. The development of GAMOLA2 is ongoing and community driven. New functionality can easily be added upon user requests, ensuring that GAMOLA2 provides information relevant to microbiologists. The software is available free of charge for academic use.
Yang, Teng-Chieh; Maluf, Nasib Karl
2012-02-21
Human adenovirus (Ad) is an icosahedral, double-stranded DNA virus. Viral DNA packaging refers to the process whereby the viral genome becomes encapsulated by the viral particle. In Ad, activation of the DNA packaging reaction requires at least three viral components: the IVa2 and L4-22K proteins and a section of DNA within the viral genome, called the packaging sequence. Previous studies have shown that the IVa2 and L4-22K proteins specifically bind to conserved elements within the packaging sequence and that these interactions are absolutely required for the observation of DNA packaging. However, the equilibrium mechanism for assembly of IVa2 and L4-22K onto the packaging sequence has not been determined. Here we characterize the assembly of the IVa2 and L4-22K proteins onto truncated packaging sequence DNA by analytical sedimentation velocity and equilibrium methods. At limiting concentrations of L4-22K, we observe a species with two IVa2 monomers and one L4-22K monomer bound to the DNA. In this species, the L4-22K monomer is promoting positive cooperative interactions between the two bound IVa2 monomers. As L4-22K levels are increased, we observe a species with one IVa2 monomer and three L4-22K monomers bound to the DNA. To explain this result, we propose a model in which L4-22K self-assembly on the DNA competes with IVa2 for positive heterocooperative interactions, destabilizing binding of the second IVa2 monomer. Thus, we propose that L4-22K levels control the extent of cooperativity observed between adjacently bound IVa2 monomers. We have also determined the hydrodynamic properties of all observed stoichiometric species; we observe that species with three L4-22K monomers bound have more extended conformations than species with a single L4-22K bound. We suggest this might reflect a molecular switch that controls insertion of the viral DNA into the capsid.
Histone demethylase JARID1C inactivation triggers genomic instability in sporadic renal cancer
Rondinelli, Beatrice; Rosano, Dalia; Antonini, Elena; Frenquelli, Michela; Montanini, Laura; Huang, DaChuan; Segalla, Simona; Yoshihara, Kosuke; Amin, Samir B.; Lazarevic, Dejan; The, Bin Tean; Verhaak, Roel G.W.; Futreal, P. Andrew; Di Croce, Luciano; Chin, Lynda; Cittaro, Davide; Tonon, Giovanni
2015-01-01
Mutations in genes encoding chromatin-remodeling proteins are often identified in a variety of cancers. For example, the histone demethylase JARID1C is frequently inactivated in patients with clear cell renal cell carcinoma (ccRCC); however, it is largely unknown how JARID1C dysfunction promotes cancer. Here, we determined that JARID1C binds broadly to chromatin domains characterized by the trimethylation of lysine 9 (H3K9me3), which is a histone mark enriched in heterochromatin. Moreover, we found that JARID1C localizes on heterochromatin, is required for heterochromatin replication, and forms a complex with established players of heterochromatin assembly, including SUV39H1 and HP1α, as well as with proteins not previously associated with heterochromatin assembly, such as the cullin 4 (CUL4) complex adaptor protein DDB1. Transcription on heterochromatin is tightly suppressed to safeguard the genome, and in ccRCC cells, JARID1C inactivation led to the unrestrained expression of heterochromatic noncoding RNAs (ncRNAs) that in turn triggered genomic instability. Moreover, ccRCC patients harboring JARID1C mutations exhibited aberrant ncRNA expression and increased genomic rearrangements compared with ccRCC patients with tumors endowed with other genetic lesions. Together, these data suggest that inactivation of JARID1C in renal cancer leads to heterochromatin disruption, genomic rearrangement, and aggressive ccRCCs. Moreover, our results shed light on a mechanism that underlies genomic instability in sporadic cancers. PMID:26551685
D3GB: An Interactive Genome Browser for R, Python, and WordPress.
Barrios, David; Prieto, Carlos
2017-05-01
Genome browsers are useful not only for showing final results but also for improving analysis protocols, testing data quality, and generating result drafts. Its integration in analysis pipelines allows the optimization of parameters, which leads to better results. New developments that facilitate the creation and utilization of genome browsers could contribute to improving analysis results and supporting the quick visualization of genomic data. D3 Genome Browser is an interactive genome browser that can be easily integrated in analysis protocols and shared on the Web. It is distributed as an R package, a Python module, and a WordPress plugin to facilitate its integration in pipelines and the utilization of platform capabilities. It is compatible with popular data formats such as GenBank, GFF, BED, FASTA, and VCF, and enables the exploration of genomic data with a Web browser.
Genomic Flexibility of Human Endogenous Retrovirus Type K
Dube, Derek; Contreras-Galindo, Rafael; He, Shirley; King, Steven R.; Gonzalez-Hernandez, Marta J.; Gitlin, Scott D.; Kaplan, Mark H.
2014-01-01
ABSTRACT Human endogenous retrovirus type K (HERV-K) proviruses are scattered throughout the human genome, but as no infectious HERV-K virus has been detected to date, the mechanism by which these viruses replicated and populated the genome remains unresolved. Here, we provide evidence that, in addition to the RNA genomes that canonical retroviruses package, modern HERV-K viruses can contain reverse-transcribed DNA (RT-DNA) genomes. Indeed, reverse transcription of genomic HERV-K RNA into the DNA form is able to occur in three distinct times and locations: (i) in the virus-producing cell prior to viral release, yielding a DNA-containing extracellular virus particle similar to the spumaviruses; (ii) within the extracellular virus particle itself, transitioning from an RNA-containing particle to a DNA-containing particle; and (iii) after entry of the RNA-containing virus into the target cell, similar to canonical retroviruses, such as murine leukemia virus and HIV. Moreover, using a resuscitated HERV-K virus construct, we show that both viruses with RNA genomes and viruses with DNA genomes are capable of infecting target cells. This high level of genomic flexibility historically could have permitted these viruses to replicate in various host cell environments, potentially assisting in their many integration events and resulting in their high prevalence in the human genome. Moreover, the ability of modern HERV-K viruses to proceed through reverse transcription and package RT-DNA genomes suggests a higher level of replication competency than was previously understood, and it may be relevant in HERV-K-associated human diseases. IMPORTANCE Retroviral elements comprise at least 8% of the human genome. Of all the endogenous retroviruses, HERV-K viruses are the most intact and biologically active. While a modern infectious HERV-K has yet to be found, HERV-K activation has been associated with cancers, autoimmune diseases, and HIV-1 infection. Thus, determining how this virus family became such a prevalent member of our genome and what it is capable of in its current form are of the utmost importance. Here, we provide evidence that HERV-K viruses currently found in the human genome are able to proceed through reverse transcription and historically utilized a life cycle with a surprising degree of genomic flexibility in which both RNA- and DNA-containing viruses were capable of mediating infection. PMID:24920813
Metastable Packaging For Transient Electronics
2014-09-01
dated 16 Jan 09. Report contains color. 14. ABSTRACT Metastable polymeric materials were synthesized, formulated with additives and microcapsules ...photoacid generation, thermal activation, and mechanical rupture of acid-filled microcapsules -- were investigated. 15. SUBJECT TERMS transient...carbonate sulfone) (PVBCS)... 11 3.3 Thermal and Mechanical Triggered Transience of Electronic Devices via Embedded Microcapsules
Packaging and Unpackaging Knowledge in Mass Higher Education--A Knowledge Management Perspective
ERIC Educational Resources Information Center
Guzman, Gustavo; Trivelato, Luiz F.
2011-01-01
The progressive deployment of market-oriented regulatory frameworks in mass Higher Education Institutions (MHEI hereafter) triggered, in a wide variety of forms and degrees, the application of Knowledge Management principles in MHEI. This means the application of the knowledge "codification strategy", where the focus is on the economies of the…
Golsteijn, Laura; Menkveld, Rimousky; King, Henry; Schneider, Christine; Schowanek, Diederik; Nissen, Sascha
2015-01-01
A.I.S.E., the International Association for Soaps, Detergents and Maintenance Products, launched the 'A.I.S.E. Charter for Sustainable Cleaning' in Europe in 2005 to promote sustainability in the cleaning and maintenance products industry. This Charter is a proactive programme for translating the concept of sustainable innovation into reality and actions. Per product category, life cycle assessments (LCA) are used to set sustainability criteria that are ambitious, but also achievable by all market players. This paper presents and discusses LCAs of six household detergent product categories conducted for the Charter, i.e.: manual dishwashing detergents, powder and tablet laundry detergents, window glass trigger spray cleaners, bathroom trigger spray cleaners, acid toilet cleaners, and bleach toilet cleaners. Relevant impact categories are identified, as well as the life cycle stages with the largest contribution to the environmental impact. It was concluded that the variables that mainly drive the results (i.e. the environmental hotspots) for manual dishwashing detergents and laundry detergents were the water temperature, water consumption (for manual dishwashing detergents), product dosage (for laundry detergents), and the choice and amount of surfactant. By contrast, for bathroom trigger sprays, acid and bleach toilet cleaners, the driving factors were plastic packaging, transportation to retailer, and specific ingredients. Additionally, the type of surfactant was important for bleach toilet cleaners. For window glass trigger sprays, the driving factors were the plastic packaging and the type of surfactant, and the other ingredients were of less importance. A.I.S.E. used the results of the studies to establish sustainability criteria, the so-called 'Charter Advanced Sustainability Profiles', which led to improvements in the marketplace.
2011-01-01
Background Many plants have large and complex genomes with an abundance of repeated sequences. Many plants are also polyploid. Both of these attributes typify the genome architecture in the tribe Triticeae, whose members include economically important wheat, rye and barley. Large genome sizes, an abundance of repeated sequences, and polyploidy present challenges to genome-wide SNP discovery using next-generation sequencing (NGS) of total genomic DNA by making alignment and clustering of short reads generated by the NGS platforms difficult, particularly in the absence of a reference genome sequence. Results An annotation-based, genome-wide SNP discovery pipeline is reported using NGS data for large and complex genomes without a reference genome sequence. Roche 454 shotgun reads with low genome coverage of one genotype are annotated in order to distinguish single-copy sequences and repeat junctions from repetitive sequences and sequences shared by paralogous genes. Multiple genome equivalents of shotgun reads of another genotype generated with SOLiD or Solexa are then mapped to the annotated Roche 454 reads to identify putative SNPs. A pipeline program package, AGSNP, was developed and used for genome-wide SNP discovery in Aegilops tauschii-the diploid source of the wheat D genome, and with a genome size of 4.02 Gb, of which 90% is repetitive sequences. Genomic DNA of Ae. tauschii accession AL8/78 was sequenced with the Roche 454 NGS platform. Genomic DNA and cDNA of Ae. tauschii accession AS75 was sequenced primarily with SOLiD, although some Solexa and Roche 454 genomic sequences were also generated. A total of 195,631 putative SNPs were discovered in gene sequences, 155,580 putative SNPs were discovered in uncharacterized single-copy regions, and another 145,907 putative SNPs were discovered in repeat junctions. These SNPs were dispersed across the entire Ae. tauschii genome. To assess the false positive SNP discovery rate, DNA containing putative SNPs was amplified by PCR from AL8/78 and AS75 and resequenced with the ABI 3730 xl. In a sample of 302 randomly selected putative SNPs, 84.0% in gene regions, 88.0% in repeat junctions, and 81.3% in uncharacterized regions were validated. Conclusion An annotation-based genome-wide SNP discovery pipeline for NGS platforms was developed. The pipeline is suitable for SNP discovery in genomic libraries of complex genomes and does not require a reference genome sequence. The pipeline is applicable to all current NGS platforms, provided that at least one such platform generates relatively long reads. The pipeline package, AGSNP, and the discovered 497,118 Ae. tauschii SNPs can be accessed at (http://avena.pw.usda.gov/wheatD/agsnp.shtml). PMID:21266061
Zhang, Jianwei; Kudrna, Dave; Mu, Ting; Li, Weiming; Copetti, Dario; Yu, Yeisoo; Goicoechea, Jose Luis; Lei, Yang; Wing, Rod A.
2016-01-01
Abstract Motivation: Next generation sequencing technologies have revolutionized our ability to rapidly and affordably generate vast quantities of sequence data. Once generated, raw sequences are assembled into contigs or scaffolds. However, these assemblies are mostly fragmented and inaccurate at the whole genome scale, largely due to the inability to integrate additional informative datasets (e.g. physical, optical and genetic maps). To address this problem, we developed a semi-automated software tool—Genome Puzzle Master (GPM)—that enables the integration of additional genomic signposts to edit and build ‘new-gen-assemblies’ that result in high-quality ‘annotation-ready’ pseudomolecules. Results: With GPM, loaded datasets can be connected to each other via their logical relationships which accomplishes tasks to ‘group,’ ‘merge,’ ‘order and orient’ sequences in a draft assembly. Manual editing can also be performed with a user-friendly graphical interface. Final pseudomolecules reflect a user’s total data package and are available for long-term project management. GPM is a web-based pipeline and an important part of a Laboratory Information Management System (LIMS) which can be easily deployed on local servers for any genome research laboratory. Availability and Implementation: The GPM (with LIMS) package is available at https://github.com/Jianwei-Zhang/LIMS Contacts: jzhang@mail.hzau.edu.cn or rwing@mail.arizona.edu Supplementary information: Supplementary data are available at Bioinformatics online. PMID:27318200
Murphy, James; Klumpp, Jochen; Mahony, Jennifer; O'Connell-Motherway, Mary; Nauta, Arjen; van Sinderen, Douwe
2014-10-01
So-called 936-type phages are among the most frequently isolated phages in dairy facilities utilising Lactococcus lactis starter cultures. Despite extensive efforts to control phage proliferation and decades of research, these phages continue to negatively impact cheese production in terms of the final product quality and consequently, monetary return. Whole genome sequencing and in silico analysis of three 936-type phage genomes identified several putative (orphan) methyltransferase (MTase)-encoding genes located within the packaging and replication regions of the genome. Utilising SMRT sequencing, methylome analysis was performed on all three phages, allowing the identification of adenine modifications consistent with N-6 methyladenine sequence methylation, which in some cases could be attributed to these phage-encoded MTases. Heterologous gene expression revealed that M.Phi145I/M.Phi93I and M.Phi93DAM, encoded by genes located within the packaging module, provide protection against the restriction enzymes HphI and DpnII, respectively, representing the first functional MTases identified in members of 936-type phages. SMRT sequencing technology enabled the identification of the target motifs of MTases encoded by the genomes of three lytic 936-type phages and these MTases represent the first functional MTases identified in this species of phage. The presence of these MTase-encoding genes on 936-type phage genomes is assumed to represent an adaptive response to circumvent host encoded restriction-modification systems thereby increasing the fitness of the phages in a dynamic dairy environment.
[The application of genome editing in identification of plant gene function and crop breeding].
Zhou, Xiang-chun; Xing, Yong-zhong
2016-03-01
Plant genome can be modified via current biotechnology with high specificity and excellent efficiency. Zinc finger nucleases (ZFN), transcription activator-like effector nucleases (TALEN) and clustered regularly interspaced short palindromic repeats (CRISPR)/CRISPR-associated 9 (Cas9) system are the key engineered nucleases used in the genome editing. Genome editing techniques enable gene targeted mutagenesis, gene knock-out, gene insertion or replacement at the target sites during the endogenous DNA repair process, including non-homologous end joining (NHEJ) and homologous recombination (HR), triggered by the induction of DNA double-strand break (DSB). Genome editing has been successfully applied in the genome modification of diverse plant species, such as Arabidopsis thaliana, Oryza sativa, and Nicotiana tabacum. In this review, we summarize the application of genome editing in identification of plant gene function and crop breeding. Moreover, we also discuss the improving points of genome editing in crop precision genetic improvement for further study.
WebArray: an online platform for microarray data analysis
Xia, Xiaoqin; McClelland, Michael; Wang, Yipeng
2005-01-01
Background Many cutting-edge microarray analysis tools and algorithms, including commonly used limma and affy packages in Bioconductor, need sophisticated knowledge of mathematics, statistics and computer skills for implementation. Commercially available software can provide a user-friendly interface at considerable cost. To facilitate the use of these tools for microarray data analysis on an open platform we developed an online microarray data analysis platform, WebArray, for bench biologists to utilize these tools to explore data from single/dual color microarray experiments. Results The currently implemented functions were based on limma and affy package from Bioconductor, the spacings LOESS histogram (SPLOSH) method, PCA-assisted normalization method and genome mapping method. WebArray incorporates these packages and provides a user-friendly interface for accessing a wide range of key functions of limma and others, such as spot quality weight, background correction, graphical plotting, normalization, linear modeling, empirical bayes statistical analysis, false discovery rate (FDR) estimation, chromosomal mapping for genome comparison. Conclusion WebArray offers a convenient platform for bench biologists to access several cutting-edge microarray data analysis tools. The website is freely available at . It runs on a Linux server with Apache and MySQL. PMID:16371165
TCGAbiolinks: an R/Bioconductor package for integrative analysis of TCGA data
Colaprico, Antonio; Silva, Tiago C.; Olsen, Catharina; Garofano, Luciano; Cava, Claudia; Garolini, Davide; Sabedot, Thais S.; Malta, Tathiane M.; Pagnotta, Stefano M.; Castiglioni, Isabella; Ceccarelli, Michele; Bontempi, Gianluca; Noushmehr, Houtan
2016-01-01
The Cancer Genome Atlas (TCGA) research network has made public a large collection of clinical and molecular phenotypes of more than 10 000 tumor patients across 33 different tumor types. Using this cohort, TCGA has published over 20 marker papers detailing the genomic and epigenomic alterations associated with these tumor types. Although many important discoveries have been made by TCGA's research network, opportunities still exist to implement novel methods, thereby elucidating new biological pathways and diagnostic markers. However, mining the TCGA data presents several bioinformatics challenges, such as data retrieval and integration with clinical data and other molecular data types (e.g. RNA and DNA methylation). We developed an R/Bioconductor package called TCGAbiolinks to address these challenges and offer bioinformatics solutions by using a guided workflow to allow users to query, download and perform integrative analyses of TCGA data. We combined methods from computer science and statistics into the pipeline and incorporated methodologies developed in previous TCGA marker studies and in our own group. Using four different TCGA tumor types (Kidney, Brain, Breast and Colon) as examples, we provide case studies to illustrate examples of reproducibility, integrative analysis and utilization of different Bioconductor packages to advance and accelerate novel discoveries. PMID:26704973
DNA packaging and ejection forces in bacteriophage
Kindt, James; Tzlil, Shelly; Ben-Shaul, Avinoam; Gelbart, William M.
2001-01-01
We calculate the forces required to package (or, equivalently, acting to eject) DNA into (from) a bacteriophage capsid, as a function of the loaded (ejected) length, under conditions for which the DNA is either self-repelling or self-attracting. Through computer simulation and analytical theory, we find the loading force to increase more than 10-fold (to tens of piconewtons) during the final third of the loading process; correspondingly, the internal pressure drops 10-fold to a few atmospheres (matching the osmotic pressure in the cell) upon ejection of just a small fraction of the phage genome. We also determine an evolution of the arrangement of packaged DNA from toroidal to spool-like structures. PMID:11707588
Guo, Peixuan; Zhao, Zhengyi; Haak, Jeannie; Wang, Shaoying; Weitao, Tao
2014-01-01
Biomotors were once classified into two categories: linear motor and rotation motor. For decades, the viral DNA-packaging motor has been popularly believed to be a five-fold rotation motor. Recently, a third type of biomotor with revolution mechanism without rotation has been discovered. By analogy, rotation resembles the Earth rotating on its axis in a complete cycle every 24 hours, while revolution resembles the Earth revolving around the Sun one circle per 365 days (see animations http://nanobio.uky.edu/movie.html). The action of revolution that enables a motor free of coiling and torque has solved many puzzles and debates that have occurred throughout the history of viral DNA packaging motor studies. It also settles the discrepancies concerning the structure, stoichiometry, and functioning of DNA translocation motors. This review uses bacteriophages Phi29, HK97, SPP1, P22, T4, T7 as well as bacterial DNA translocase FtsK and SpoIIIE as examples to elucidate the puzzles. These motors use a ATPase, some of which have been confirmed to be a hexamer, to revolve around the dsDNA sequentially. ATP binding induces conformational change and possibly an entropy alteration in ATPase to a high affinity toward dsDNA; but ATP hydrolysis triggers another entropic and conformational change in ATPase to a low affinity for DNA, by which dsDNA is pushed toward an adjacent ATPase subunit. The rotation and revolution mechanisms can be distinguished by the size of channel: the channels of rotation motors are equal to or smaller than 2 nm, whereas channels of revolution motors are larger than 3 nm. Rotation motors use parallel threads to operate with a right-handed channel, while revolution motors use a left-handed channel to drive the right-handed DNA in an anti-parallel arrangement. Coordination of several vector factors in the same direction makes viral DNA-packaging motors unusually powerful and effective. Revolution mechanism avoids DNA coiling in translocating the lengthy genomic dsDNA helix could be advantage for cell replication such as bacterial binary fission and cell mitosis without the need for topoisomerase or helicase to consume additional energy. PMID:24913057
2017-01-01
Adenovirus (AdV) morphogenesis is a complex process, many aspects of which remain unclear. In particular, it is not settled where in the nucleus assembly and packaging occur, and whether these processes occur in a sequential or a concerted manner. Here we use immunofluorescence and immunoelectron microscopy (immunoEM) to trace packaging factors and structural proteins at late times post infection by either wildtype virus or a delayed packaging mutant. We show that representatives of all assembly factors are present in the previously recognized peripheral replicative zone, which therefore is the AdV assembly factory. Assembly intermediates and abortive products observed in this region favor a concurrent assembly and packaging model comprising two pathways, one for capsid proteins and another one for core components. Only when both pathways are coupled by correct interaction between packaging proteins and the genome is the viral particle produced. Decoupling generates accumulation of empty capsids and unpackaged cores. PMID:28448571
The solid state detector technology for picosecond laser ranging
NASA Technical Reports Server (NTRS)
Prochazka, Ivan
1993-01-01
We developed an all solid state laser ranging detector technology, which makes the goal of millimeter accuracy achievable. Our design and construction philosophy is to combine the techniques of single photon ranging, ultrashort laser pulses, and fast fixed threshold discrimination while avoiding any analog signal processing within the laser ranging chain. The all solid state laser ranging detector package consists of the START detector and the STOP solid state photon counting module. Both the detectors are working in an optically triggered avalanche switching regime. The optical signal is triggering an avalanche current buildup which results in the generation of a uniform, fast risetime output pulse.
Stockwell, David Christopher; Bisarya, Hema; Classen, David C; Kirkendall, Eric S; Lachman, Peter I; Matlow, Anne G; Tham, Eric; Hyman, Dan; Lehman, Samuel M; Searles, Elizabeth; Muething, Stephen E; Sharek, Paul J
2016-12-01
To have impact on reducing harm in pediatric inpatients, an efficient and reliable process for harm detection is needed. This work describes the first step toward the development of a pediatric all-cause harm measurement tool by recognized experts in the field. An international group of leaders in pediatric patient safety and informatics were charged with developing a comprehensive pediatric inpatient all-cause harm measurement tool using a modified Delphi technique. The process was conducted in 5 distinct steps: (1) literature review of triggers (elements from a medical record that assist in identifying patient harm) for inclusion; (2) translation of triggers to likely associated harm, improving the ability for expert prioritization; (3) 2 applications of a modified Delphi selection approach with consensus criteria using severity and frequency of harm as well as detectability of the associated trigger as criteria to rate each trigger and associated harm; (4) developing specific trigger logic and relevant values when applicable; and (5) final vetting of the entire trigger list for pilot testing. Literature and expert panel review identified 108 triggers and associated harms suitable for consideration (steps 1 and 2). This list was pared to 64 triggers and their associated harms after the first of the 2 independent expert reviews. The second independent expert review led to further refinement of the trigger package, resulting in 46 items for inclusion (step 3). Adding in specific trigger logic expanded the list. Final review and voting resulted in a list of 51 triggers (steps 4 and 5). Application of a modified Delphi method on an expert-constructed list of 108 triggers, focusing on severity and frequency of harms as well as detectability of triggers in an electronic medical record, resulted in a final list of 51 pediatric triggers. Pilot testing this list of pediatric triggers to identify all-cause harm for pediatric inpatients is the next step to establish the appropriateness of each trigger for inclusion in a global pediatric safety measurement tool.
Ebola virus VP24 interacts with NP to facilitate nucleocapsid assembly and genome packaging.
Banadyga, Logan; Hoenen, Thomas; Ambroggio, Xavier; Dunham, Eric; Groseth, Allison; Ebihara, Hideki
2017-08-09
Ebola virus causes devastating hemorrhagic fever outbreaks for which no approved therapeutic exists. The viral nucleocapsid, which is minimally composed of the proteins NP, VP35, and VP24, represents an attractive target for drug development; however, the molecular determinants that govern the interactions and functions of these three proteins are still unknown. Through a series of mutational analyses, in combination with biochemical and bioinformatics approaches, we identified a region on VP24 that was critical for its interaction with NP. Importantly, we demonstrated that the interaction between VP24 and NP was required for both nucleocapsid assembly and genome packaging. Not only does this study underscore the critical role that these proteins play in the viral replication cycle, but it also identifies a key interaction interface on VP24 that may serve as a novel target for antiviral therapeutic intervention.
Genomics-inspired discovery of natural products.
Winter, Jaclyn M; Behnken, Swantje; Hertweck, Christian
2011-02-01
The massive surge in genome sequencing projects has opened our eyes to the overlooked biosynthetic potential and metabolic diversity of microorganisms. While traditional approaches have been successful at identifying many useful therapeutic agents from these organisms, new tactics are needed in order to exploit their true biosynthetic potential. Several genomics-inspired strategies have been successful in unveiling new metabolites that were overlooked under standard fermentation and detection conditions. In addition, genome sequences have given us valuable insight for genetically engineering biosynthesis gene clusters that remain silent or are poorly expressed in the absence of a specific trigger. As more genome sequences are becoming available, we are noticing the emergence of underexplored or neglected organisms as alternative resources for new therapeutic agents. Copyright © 2010 Elsevier Ltd. All rights reserved.
Plant genome and transcriptome annotations: from misconceptions to simple solutions
Bolger, Marie E; Arsova, Borjana; Usadel, Björn
2018-01-01
Abstract Next-generation sequencing has triggered an explosion of available genomic and transcriptomic resources in the plant sciences. Although genome and transcriptome sequencing has become orders of magnitudes cheaper and more efficient, often the functional annotation process is lagging behind. This might be hampered by the lack of a comprehensive enumeration of simple-to-use tools available to the plant researcher. In this comprehensive review, we present (i) typical ontologies to be used in the plant sciences, (ii) useful databases and resources used for functional annotation, (iii) what to expect from an annotated plant genome, (iv) an automated annotation pipeline and (v) a recipe and reference chart outlining typical steps used to annotate plant genomes/transcriptomes using publicly available resources. PMID:28062412
Dissection of specific binding of HIV-1 Gag to the 'packaging signal' in viral RNA.
Comas-Garcia, Mauricio; Datta, Siddhartha Ak; Baker, Laura; Varma, Rajat; Gudla, Prabhakar R; Rein, Alan
2017-07-20
Selective packaging of HIV-1 genomic RNA (gRNA) requires the presence of a cis -acting RNA element called the 'packaging signal' (Ψ). However, the mechanism by which Ψ promotes selective packaging of the gRNA is not well understood. We used fluorescence correlation spectroscopy and quenching data to monitor the binding of recombinant HIV-1 Gag protein to Cy5-tagged 190-base RNAs. At physiological ionic strength, Gag binds with very similar, nanomolar affinities to both Ψ-containing and control RNAs. We challenged these interactions by adding excess competing tRNA; introducing mutations in Gag; or raising the ionic strength. These modifications all revealed high specificity for Ψ. This specificity is evidently obscured in physiological salt by non-specific, predominantly electrostatic interactions. This nonspecific activity was attenuated by mutations in the MA, CA, and NC domains, including CA mutations disrupting Gag-Gag interaction. We propose that gRNA is selectively packaged because binding to Ψ nucleates virion assembly with particular efficiency.
ATP Depletion Blocks Herpes Simplex Virus DNA Packaging and Capsid Maturation
Dasgupta, Anindya; Wilson, Duncan W.
1999-01-01
During herpes simplex virus (HSV) assembly, immature procapsids must expel their internal scaffold proteins, transform their outer shell to form mature polyhedrons, and become packaged with the viral double-stranded (ds) DNA genome. A large number of virally encoded proteins are required for successful completion of these events, but their molecular roles are poorly understood. By analogy with the dsDNA bacteriophage we reasoned that HSV DNA packaging might be an ATP-requiring process and tested this hypothesis by adding an ATP depletion cocktail to cells accumulating unpackaged procapsids due to the presence of a temperature-sensitive lesion in the HSV maturational protease UL26. Following return to permissive temperature, HSV capsids were found to be unable to package DNA, suggesting that this process is indeed ATP dependent. Surprisingly, however, the display of epitopes indicative of capsid maturation was also inhibited. We conclude that either formation of these epitopes directly requires ATP or capsid maturation is normally arrested by a proofreading mechanism until DNA packaging has been successfully completed. PMID:9971781
Suomalainen, Maarit; Zheng, Yueting; Boucke, Karin
2017-01-01
The Adenovirus (Ad) genome within the capsid is tightly associated with a virus-encoded, histone-like core protein—protein VII. Two other Ad core proteins, V and X/μ, also are located within the virion and are loosely associated with viral DNA. Core protein VII remains associated with the Ad genome during the early phase of infection. It is not known if naked Ad DNA is packaged into the capsid, as with dsDNA bacteriophage and herpesviruses, followed by the encapsidation of viral core proteins, or if a unique packaging mechanism exists with Ad where a DNA-protein complex is simultaneously packaged into the virion. The latter model would require an entirely new molecular mechanism for packaging compared to known viral packaging motors. We characterized a virus with a conditional knockout of core protein VII. Remarkably, virus particles were assembled efficiently in the absence of protein VII. No changes in protein composition were evident with VII−virus particles, including the abundance of core protein V, but changes in the proteolytic processing of some capsid proteins were evident. Virus particles that lack protein VII enter the cell, but incoming virions did not escape efficiently from endosomes. This greatly diminished all subsequent aspects of the infectious cycle. These results reveal that the Ad major core protein VII is not required to condense viral DNA within the capsid, but rather plays an unexpected role during virus maturation and the early stages of infection. These results establish a new paradigm pertaining to the Ad assembly mechanism and reveal a new and important role of protein VII in early stages of infection. PMID:28628648
Nucleic Acid Binding by Mason-Pfizer Monkey Virus CA Promotes Virus Assembly and Genome Packaging
Füzik, Tibor; Píchalová, Růžena; Schur, Florian K. M.; Strohalmová, Karolína; Křížová, Ivana; Hadravová, Romana; Rumlová, Michaela; Briggs, John A. G.
2016-01-01
ABSTRACT The Gag polyprotein of retroviruses drives immature virus assembly by forming hexameric protein lattices. The assembly is primarily mediated by protein-protein interactions between capsid (CA) domains and by interactions between nucleocapsid (NC) domains and RNA. Specific interactions between NC and the viral RNA are required for genome packaging. Previously reported cryoelectron microscopy analysis of immature Mason-Pfizer monkey virus (M-PMV) particles suggested that a basic region (residues RKK) in CA may serve as an additional binding site for nucleic acids. Here, we have introduced mutations into the RKK region in both bacterial and proviral M-PMV vectors and have assessed their impact on M-PMV assembly, structure, RNA binding, budding/release, nuclear trafficking, and infectivity using in vitro and in vivo systems. Our data indicate that the RKK region binds and structures nucleic acid that serves to promote virus particle assembly in the cytoplasm. Moreover, the RKK region appears to be important for recruitment of viral genomic RNA into Gag particles, and this function could be linked to changes in nuclear trafficking. Together these observations suggest that in M-PMV, direct interactions between CA and nucleic acid play important functions in the late stages of the viral life cycle. IMPORTANCE Assembly of retrovirus particles is driven by the Gag polyprotein, which can self-assemble to form virus particles and interact with RNA to recruit the viral genome into the particles. Generally, the capsid domains of Gag contribute to essential protein-protein interactions during assembly, while the nucleocapsid domain interacts with RNA. The interactions between the nucleocapsid domain and RNA are important both for identifying the genome and for self-assembly of Gag molecules. Here, we show that a region of basic residues in the capsid protein of the betaretrovirus Mason-Pfizer monkey virus (M-PMV) contributes to interaction of Gag with nucleic acid. This interaction appears to provide a critical scaffolding function that promotes assembly of virus particles in the cytoplasm. It is also crucial for packaging the viral genome and thus for infectivity. These data indicate that, surprisingly, interactions between the capsid domain and RNA play an important role in the assembly of M-PMV. PMID:26912613
Bartels, Daniela; Kespohl, Sebastian; Albaum, Stefan; Drüke, Tanja; Goesmann, Alexander; Herold, Julia; Kaiser, Olaf; Pühler, Alfred; Pfeiffer, Friedhelm; Raddatz, Günter; Stoye, Jens; Meyer, Folker; Schuster, Stephan C
2005-04-01
We provide the graphical tool BACCardI for the construction of virtual clone maps from standard assembler output files or BLAST based sequence comparisons. This new tool has been applied to numerous genome projects to solve various problems including (a) validation of whole genome shotgun assemblies, (b) support for contig ordering in the finishing phase of a genome project, and (c) intergenome comparison between related strains when only one of the strains has been sequenced and a large insert library is available for the other. The BACCardI software can seamlessly interact with various sequence assembly packages. Genomic assemblies generated from sequence information need to be validated by independent methods such as physical maps. The time-consuming task of building physical maps can be circumvented by virtual clone maps derived from read pair information of large insert libraries.
DOE Office of Scientific and Technical Information (OSTI.GOV)
Chaturvedi, Sonali; Rao, A.L.N., E-mail: arao@ucr.edu
2014-09-15
In Brome mosaic virus, it was hypothesized that a physical interaction between viral replicase and capsid protein (CP) is obligatory to confer genome packaging specificity. Here we tested this hypothesis by employing Bimolecular Fluorescent Complementation (BiFC) as a tool for evaluating protein–protein interactions in living cells. The efficacy of BiFC was validated by a known interaction between replicase protein 1a (p1a) and protein 2a (p2a) at the endoplasmic reticulum (ER) site of viral replication. Additionally, co-expression in planta of a bona fide pair of interacting protein partners of p1a and p2a had resulted in the assembly of a functional replicase.more » Subsequent BiFC assays in conjunction with mCherry labeled ER as a fluorescent cellular marker revealed that CP physically interacts with p2a, but not p1a, and this CP:p2a interaction occurs at the cytoplasmic phase of the ER. The significance of the CP:p2a interaction in BMV replication and genome packaging is discussed. - Highlights: • YFP fusion proteins of BMV p1a and p2a are biologically active. • Self-interaction was observed for p1a, p2a and CP. • CP interacts with p2a but not p1a. • Majority of reconstituted YFP resulting from bona fide fusion protein partners localized on ER.« less
microRNA-mediated R gene regulation: molecular scabbards for double-edged swords.
Deng, Yingtian; Liu, Minglei; Li, Xiaofei; Li, Feng
2018-02-01
Plant resistance (R) proteins are immune receptors that recognize pathogen effectors and trigger rapid defense responses, namely effector-triggered immunity. R protein-mediated pathogen resistance is usually race specific. During plant-pathogen coevolution, plant genomes accumulated large numbers of R genes. Even though plant R genes provide important natural resources for breeding disease-resistant crops, their presence in the plant genome comes at a cost. Misregulation of R genes leads to developmental defects, such as stunted growth and reduced fertility. In the past decade, many microRNAs (miRNAs) have been identified to target various R genes in plant genomes. miRNAs reduce R gene levels under normal conditions and allow induction of R gene expression under various stresses. For these reasons, we consider R genes to be double-edged "swords" and miRNAs as molecular "scabbards". In the present review, we summarize the contributions and potential problems of these "swords" and discuss the features and production of the "scabbards", as well as the mechanisms used to pull the "sword" from the "scabbard" when needed.
Role of osmotic and hydrostatic pressures in bacteriophage genome ejection
NASA Astrophysics Data System (ADS)
Lemay, Serge G.; Panja, Debabrata; Molineux, Ian J.
2013-02-01
A critical step in the bacteriophage life cycle is genome ejection into host bacteria. The ejection process for double-stranded DNA phages has been studied thoroughly in vitro, where after triggering with the cellular receptor the genome ejects into a buffer. The experimental data have been interpreted in terms of the decrease in free energy of the densely packed DNA associated with genome ejection. Here we detail a simple model of genome ejection in terms of the hydrostatic and osmotic pressures inside the phage, a bacterium, and a buffer solution or culture medium. We argue that the hydrodynamic flow associated with the water movement from the buffer solution into the phage capsid and further drainage into the bacterial cytoplasm, driven by the osmotic gradient between the bacterial cytoplasm and culture medium, provides an alternative mechanism for phage genome ejection in vivo; the mechanism is perfectly consistent with phage genome ejection in vitro.
Hattori, Hiroyoshi; Janky, Rekin's; Nietfeld, Wilfried; Aerts, Stein; Madan Babu, M; Venkitaraman, Ashok R
2014-01-01
The human DNA damage response (DDR) triggers profound changes in gene expression, whose nature and regulation remain uncertain. Although certain micro-(mi)RNA species including miR34, miR-18, miR-16 and miR-143 have been implicated in the DDR, there is as yet no comprehensive description of genome-wide changes in the expression of miRNAs triggered by DNA breakage in human cells. We have used next-generation sequencing (NGS), combined with rigorous integrative computational analyses, to describe genome-wide changes in the expression of miRNAs during the human DDR. The changes affect 150 of 1523 miRNAs known in miRBase v18 from 4-24 h after the induction of DNA breakage, in cell-type dependent patterns. The regulatory regions of the most-highly regulated miRNA species are enriched in conserved binding sites for p53. Indeed, genome-wide changes in miRNA expression during the DDR are markedly altered in TP53-/- cells compared to otherwise isogenic controls. The expression levels of certain damage-induced, p53-regulated miRNAs in cancer samples correlate with patient survival. Our work reveals genome-wide and cell type-specific alterations in miRNA expression during the human DDR, which are regulated by the tumor suppressor protein p53. These findings provide a genomic resource to identify new molecules and mechanisms involved in the DDR, and to examine their role in tumor suppression and the clinical outcome of cancer patients.
Huang, Chao-Li; Pu, Pei-Hua; Huang, Hao-Jen; Sung, Huang-Mo; Liaw, Hung-Jiun; Chen, Yi-Min; Chen, Chien-Ming; Huang, Ming-Ban; Osada, Naoki; Gojobori, Takashi; Pai, Tun-Wen; Chen, Yu-Tin; Hwang, Chi-Chuan; Chiang, Tzen-Yuh
2015-03-15
Comparative genomics provides insights into the diversification of bacterial species. Bacterial speciation usually takes place with lasting homologous recombination, which not only acts as a cohering force between diverging lineages but brings advantageous alleles favored by natural selection, and results in ecologically distinct species, e.g., frequent host shift in Xanthomonas pathogenic to various plants. Using whole-genome sequences, we examined the genetic divergence in Xanthomonas campestris that infected Brassicaceae, and X. citri, pathogenic to a wider host range. Genetic differentiation between two incipient races of X. citri pv. mangiferaeindicae was attributable to a DNA fragment introduced by phages. In contrast to most portions of the genome that had nearly equivalent levels of genetic divergence between subspecies as a result of the accumulation of point mutations, 10% of the core genome involving with homologous recombination contributed to the diversification in Xanthomonas, as revealed by the correlation between homologous recombination and genomic divergence. Interestingly, 179 genes were under positive selection; 98 (54.7%) of these genes were involved in homologous recombination, indicating that foreign genetic fragments may have caused the adaptive diversification, especially in lineages with nutritional transitions. Homologous recombination may have provided genetic materials for the natural selection, and host shifts likely triggered ecological adaptation in Xanthomonas. To a certain extent, we observed positive selection nevertheless contributed to ecological divergence beyond host shifting. Altogether, mediated with lasting gene flow, species formation in Xanthomonas was likely governed by natural selection that played a key role in helping the deviating populations to explore novel niches (hosts) or respond to environmental cues, subsequently triggering species diversification.
Stockley, Peter G; Twarock, Reidun; Bakker, Saskia E; Barker, Amy M; Borodavka, Alexander; Dykeman, Eric; Ford, Robert J; Pearson, Arwen R; Phillips, Simon E V; Ranson, Neil A; Tuma, Roman
2013-03-01
The formation of a protective protein container is an essential step in the life-cycle of most viruses. In the case of single-stranded (ss)RNA viruses, this step occurs in parallel with genome packaging in a co-assembly process. Previously, it had been thought that this process can be explained entirely by electrostatics. Inspired by recent single-molecule fluorescence experiments that recapitulate the RNA packaging specificity seen in vivo for two model viruses, we present an alternative theory, which recognizes the important cooperative roles played by RNA-coat protein interactions, at sites we have termed packaging signals. The hypothesis is that multiple copies of packaging signals, repeated according to capsid symmetry, aid formation of the required capsid protein conformers at defined positions, resulting in significantly enhanced assembly efficiency. The precise mechanistic roles of packaging signal interactions may vary between viruses, as we have demonstrated for MS2 and STNV. We quantify the impact of packaging signals on capsid assembly efficiency using a dodecahedral model system, showing that heterogeneous affinity distributions of packaging signals for capsid protein out-compete those of homogeneous affinities. These insights pave the way to a new anti-viral therapy, reducing capsid assembly efficiency by targeting of the vital roles of the packaging signals, and opens up new avenues for the efficient construction of protein nanocontainers in bionanotechnology.
Weiss, Scott T.
2014-01-01
Bayesian Networks (BN) have been a popular predictive modeling formalism in bioinformatics, but their application in modern genomics has been slowed by an inability to cleanly handle domains with mixed discrete and continuous variables. Existing free BN software packages either discretize continuous variables, which can lead to information loss, or do not include inference routines, which makes prediction with the BN impossible. We present CGBayesNets, a BN package focused around prediction of a clinical phenotype from mixed discrete and continuous variables, which fills these gaps. CGBayesNets implements Bayesian likelihood and inference algorithms for the conditional Gaussian Bayesian network (CGBNs) formalism, one appropriate for predicting an outcome of interest from, e.g., multimodal genomic data. We provide four different network learning algorithms, each making a different tradeoff between computational cost and network likelihood. CGBayesNets provides a full suite of functions for model exploration and verification, including cross validation, bootstrapping, and AUC manipulation. We highlight several results obtained previously with CGBayesNets, including predictive models of wood properties from tree genomics, leukemia subtype classification from mixed genomic data, and robust prediction of intensive care unit mortality outcomes from metabolomic profiles. We also provide detailed example analysis on public metabolomic and gene expression datasets. CGBayesNets is implemented in MATLAB and available as MATLAB source code, under an Open Source license and anonymous download at http://www.cgbayesnets.com. PMID:24922310
Forsman, Päivi; Alatossava, Tapani
1991-01-01
The genomes of four Lactobacillus delbrueckii subsp. lactis bacteriophages were characterized by restriction endonuclease mapping, Southern hybridization, and heteroduplex analysis. The phages were isolated from different cheese processing plants in Finland between 1950 and 1972. All four phages had a small isometric head and a long noncontractile tail. Two different types of genome (double-stranded DNA) organization existed among the different phages, the pac type and the cos type, corresponding to alternative types of phage DNA packaging. Three phages belonged to the pac type, and a fourth was a cos-type phage. The pac-type phages were genetically closely related. In the genomes of the pac-type phages, three putative insertion/deletions (0.7 to 0.8 kb, 1.0 kb, and 1.5 kb) and one other region (0.9 kb) containing clustered base substitutions were discovered and localized. At the phenotype level, three main differences were observed among the pac-type phages. These concerned two minor structural proteins and the efficiency of phage DNA packaging. The genomes of the pac-type phages showed only weak homology with that of the cos-type phage. Phage-related DNA, probably a defective prophage, was located in the chromosome of the host strain sensitive to the cos-type phage. This DNA exhibited homology under stringent conditions to the pac-type phages. Images PMID:16348513
McGeachie, Michael J; Chang, Hsun-Hsien; Weiss, Scott T
2014-06-01
Bayesian Networks (BN) have been a popular predictive modeling formalism in bioinformatics, but their application in modern genomics has been slowed by an inability to cleanly handle domains with mixed discrete and continuous variables. Existing free BN software packages either discretize continuous variables, which can lead to information loss, or do not include inference routines, which makes prediction with the BN impossible. We present CGBayesNets, a BN package focused around prediction of a clinical phenotype from mixed discrete and continuous variables, which fills these gaps. CGBayesNets implements Bayesian likelihood and inference algorithms for the conditional Gaussian Bayesian network (CGBNs) formalism, one appropriate for predicting an outcome of interest from, e.g., multimodal genomic data. We provide four different network learning algorithms, each making a different tradeoff between computational cost and network likelihood. CGBayesNets provides a full suite of functions for model exploration and verification, including cross validation, bootstrapping, and AUC manipulation. We highlight several results obtained previously with CGBayesNets, including predictive models of wood properties from tree genomics, leukemia subtype classification from mixed genomic data, and robust prediction of intensive care unit mortality outcomes from metabolomic profiles. We also provide detailed example analysis on public metabolomic and gene expression datasets. CGBayesNets is implemented in MATLAB and available as MATLAB source code, under an Open Source license and anonymous download at http://www.cgbayesnets.com.
Moscardini, Mila; Pistello, Mauro; Bendinelli, M; Ficheux, Damien; Miller, Jennifer T; Gabus, Caroline; Le Grice, Stuart F J; Surewicz, Witold K; Darlix, Jean-Luc
2002-04-19
All lentiviruses and oncoretroviruses examined so far encode a major nucleic-acid binding protein (nucleocapsid or NC* protein), approximately 2500 molecules of which coat the dimeric RNA genome. Studies on HIV-1 and MoMuLV using in vitro model systems and in vivo have shown that NC protein is required to chaperone viral RNA dimerization and packaging during virus assembly, and proviral DNA synthesis by reverse transcriptase (RT) during infection. The human cellular prion protein (PrP), thought to be the major component of the agent causing transmissible spongiform encephalopathies (TSE), was recently found to possess a strong affinity for nucleic acids and to exhibit chaperone properties very similar to HIV-1 NC protein in the HIV-1 context in vitro. Tight binding of PrP to nucleic acids is proposed to participate directly in the prion disease process. To extend our understanding of lentiviruses and of the unexpected nucleic acid chaperone properties of the human prion protein, we set up an in vitro system to investigate replication of the feline immunodeficiency virus (FIV), which is functionally and phylogenetically distant from HIV-1. The results show that in the FIV model system, NC protein chaperones viral RNA dimerization, primer tRNA(Lys,3) annealing to the genomic primer-binding site (PBS) and minus strand DNA synthesis by the homologous FIV RT. FIV NC protein is able to trigger specific viral DNA synthesis by inhibiting self-priming of reverse transcription. The human prion protein was found to mimic the properties of FIV NC with respect to primer tRNA annealing to the viral RNA and chaperoning minus strand DNA synthesis. Copyright 2002 Elsevier Science Ltd.
Call, Rosemary J.; Burlison, Jonathan D.; Robertson, Jennifer J.; Scott, Jeffrey R.; Baker, Donald K.; Rossi, Michael G.; Howard, Scott C.; Hoffman, James M.
2014-01-01
Objective To investigate the use of a trigger tool for adverse drug event (ADE) detection in a pediatric hospital specializing in oncology, hematology, and other catastrophic diseases. Study design A medication-based trigger tool package analyzed electronic health records from February 2009 to February 2013. Chart review determined whether an ADE precipitated the trigger. Severity was assigned to ADEs, and preventability was assessed. Preventable ADEs were compared with the hospital’s electronic voluntary event reporting system to identify whether these ADEs had been previously identified. The positive predictive values (PPVs) of the entire trigger tool and individual triggers were calculated to assess their accuracy to detect ADEs. Results Trigger occurrences (n=706) were detected in 390 patients from six medication triggers, 33 of which were ADEs (overall PPV = 16%). Hyaluronidase had the highest PPV (60%). Most ADEs were category E harm (temporary harm) per the National Coordinating Council for Medication Error Reporting and Prevention (NCC MERP) index. One event was category H harm (intervention to sustain life). Naloxone was associated with the most grade 4 ADEs per the Common Terminology Criteria for Adverse Events (CTCAE) v4.03. Twenty-one (64%) ADEs were preventable; 3 of which were submitted via the voluntary reporting system. Conclusion Most of the medication-based triggers yielded low PPVs. Refining the triggers based on patients’ characteristics and medication usage patterns could increase the PPVs and make them more useful for quality improvement. To efficiently detect ADEs, triggers must be revised to reflect specialized pediatric patient populations such as hematology and oncology patients. PMID:24768254
Call, Rosemary J; Burlison, Jonathan D; Robertson, Jennifer J; Scott, Jeffrey R; Baker, Donald K; Rossi, Michael G; Howard, Scott C; Hoffman, James M
2014-09-01
To investigate the use of a trigger tool for the detection of adverse drug events (ADE) in a pediatric hospital specializing in oncology, hematology, and other catastrophic diseases. A medication-based trigger tool package analyzed electronic health records from February 2009 to February 2013. Chart review determined whether an ADE precipitated the trigger. Severity was assigned to ADEs, and preventability was assessed. Preventable ADEs were compared with the hospital's electronic voluntary event reporting system to identify whether these ADEs had been previously identified. The positive predictive values (PPVs) of the entire trigger tool and individual triggers were calculated to assess their accuracy to detect ADEs. Trigger occurrences (n = 706) were detected in 390 patients from 6 medication triggers, 33 of which were ADEs (overall PPV = 16%). Hyaluronidase had the greatest PPV (60%). Most ADEs were category E harm (temporary harm) per the National Coordinating Council for Medication Error Reporting and Prevention index. One event was category H harm (intervention to sustain life). Naloxone was associated with the most grade 4 ADEs per the Common Terminology Criteria for Adverse Events v4.03. Twenty-one (64%) ADEs were preventable, 3 of which were submitted via the voluntary reporting system. Most of the medication-based triggers yielded low PPVs. Refining the triggers based on patients' characteristics and medication usage patterns could increase the PPVs and make them more useful for quality improvement. To efficiently detect ADEs, triggers must be revised to reflect specialized pediatric patient populations such as hematology and oncology patients. Copyright © 2014 Elsevier Inc. All rights reserved.
RNA structural constraints in the evolution of the influenza A virus genome NP segment
Gultyaev, Alexander P; Tsyganov-Bodounov, Anton; Spronken, Monique IJ; van der Kooij, Sander; Fouchier, Ron AM; Olsthoorn, René CL
2014-01-01
Conserved RNA secondary structures were predicted in the nucleoprotein (NP) segment of the influenza A virus genome using comparative sequence and structure analysis. A number of structural elements exhibiting nucleotide covariations were identified over the whole segment length, including protein-coding regions. Calculations of mutual information values at the paired nucleotide positions demonstrate that these structures impose considerable constraints on the virus genome evolution. Functional importance of a pseudoknot structure, predicted in the NP packaging signal region, was confirmed by plaque assays of the mutant viruses with disrupted structure and those with restored folding using compensatory substitutions. Possible functions of the conserved RNA folding patterns in the influenza A virus genome are discussed. PMID:25180940
Interaction of packaging motor with the polymerase complex of dsRNA bacteriophage
DOE Office of Scientific and Technical Information (OSTI.GOV)
Lisal, Jiri; Kainov, Denis E.; Lam, TuKiet T.
2006-07-20
Many viruses employ molecular motors to package their genomes into preformed empty capsids (procapsids). In dsRNA bacteriophages the packaging motor is a hexameric ATPase P4, which is an integral part of the multisubunit procapsid. Structural and biochemical studies revealed a plausible RNA-translocation mechanism for the isolated hexamer. However, little is known about the structure and regulation of the hexamer within the procapsid. Here we use hydrogen-deuterium exchange and mass spectrometry to delineate the interactions of the P4 hexamer with the bacteriophage phi12 procapsid. P4 associates with the procapsid via its C-terminal face. The interactions also stabilize subunit interfaces within themore » hexamer. The conformation of the virus-bound hexamer is more stable than the hexamer in solution, which is prone to spontaneous ring openings. We propose that the stabilization within the viral capsid increases the packaging processivity and confers selectivity during RNA loading.« less
chimeraviz: a tool for visualizing chimeric RNA.
Lågstad, Stian; Zhao, Sen; Hoff, Andreas M; Johannessen, Bjarne; Lingjærde, Ole Christian; Skotheim, Rolf I
2017-09-15
Advances in high-throughput RNA sequencing have enabled more efficient detection of fusion transcripts, but the technology and associated software used for fusion detection from sequencing data often yield a high false discovery rate. Good prioritization of the results is important, and this can be helped by a visualization framework that automatically integrates RNA data with known genomic features. Here we present chimeraviz , a Bioconductor package that automates the creation of chimeric RNA visualizations. The package supports input from nine different fusion-finder tools: deFuse, EricScript, InFusion, JAFFA, FusionCatcher, FusionMap, PRADA, SOAPfuse and STAR-FUSION. chimeraviz is an R package available via Bioconductor ( https://bioconductor.org/packages/release/bioc/html/chimeraviz.html ) under Artistic-2.0. Source code and support is available at GitHub ( https://github.com/stianlagstad/chimeraviz ). rolf.i.skotheim@rr-research.no. Supplementary data are available at Bioinformatics online. © The Author(s) 2017. Published by Oxford University Press.
Liu, Bin; Wu, Hao; Zhang, Deyuan; Wang, Xiaolong; Chou, Kuo-Chen
2017-02-21
To expedite the pace in conducting genome/proteome analysis, we have developed a Python package called Pse-Analysis. The powerful package can automatically complete the following five procedures: (1) sample feature extraction, (2) optimal parameter selection, (3) model training, (4) cross validation, and (5) evaluating prediction quality. All the work a user needs to do is to input a benchmark dataset along with the query biological sequences concerned. Based on the benchmark dataset, Pse-Analysis will automatically construct an ideal predictor, followed by yielding the predicted results for the submitted query samples. All the aforementioned tedious jobs can be automatically done by the computer. Moreover, the multiprocessing technique was adopted to enhance computational speed by about 6 folds. The Pse-Analysis Python package is freely accessible to the public at http://bioinformatics.hitsz.edu.cn/Pse-Analysis/, and can be directly run on Windows, Linux, and Unix.
Larsson, Daniel S D; van der Spoel, David
2012-07-10
The complete structure of the genomic material inside a virus capsid remains elusive, although a limited amount of symmetric nucleic acid can be resolved in the crystal structure of 17 icosahedral viruses. The negatively charged sugar-phosphate backbone of RNA and DNA as well as the large positive charge of the interior surface of the virus capsids suggest that electrostatic complementarity is an important factor in the packaging of the genomes in these viruses. To test how much packing information is encoded by the electrostatic and steric envelope of the capsid interior, we performed extensive all-atom molecular dynamics (MD) simulations of virus capsids with explicit water molecules and solvent ions. The model systems were two small plant viruses in which significant amounts of RNA has been observed by X-ray crystallography: satellite tobacco mosaic virus (STMV, 62% RNA visible) and satellite tobacco necrosis virus (STNV, 34% RNA visible). Simulations of half-capsids of these viruses with no RNA present revealed that the binding sites of RNA correlated well with regions populated by chloride ions, suggesting that it is possible to screen for the binding sites of nucleic acids by determining the equilibrium distribution of negative ions. By including the crystallographically resolved RNA in addition to ions, we predicted the localization of the unresolved RNA in the viruses. Both viruses showed a hot-spot for RNA binding at the 5-fold symmetry axis. The MD simulations were compared to predictions of the chloride density based on nonlinear Poisson-Boltzmann equation (PBE) calculations with mobile ions. Although the predictions are superficially similar, the PBE calculations overestimate the ion concentration close to the capsid surface and underestimate it far away, mainly because protein dynamics is not taken into account. Density maps from chloride screening can be used to aid in building atomic models of packaged virus genomes. Knowledge of the principles of genome packaging might be exploited for both antiviral therapy and technological applications.
Transcription as a Threat to Genome Integrity.
Gaillard, Hélène; Aguilera, Andrés
2016-06-02
Genomes undergo different types of sporadic alterations, including DNA damage, point mutations, and genome rearrangements, that constitute the basis for evolution. However, these changes may occur at high levels as a result of cell pathology and trigger genome instability, a hallmark of cancer and a number of genetic diseases. In the last two decades, evidence has accumulated that transcription constitutes an important natural source of DNA metabolic errors that can compromise the integrity of the genome. Transcription can create the conditions for high levels of mutations and recombination by its ability to open the DNA structure and remodel chromatin, making it more accessible to DNA insulting agents, and by its ability to become a barrier to DNA replication. Here we review the molecular basis of such events from a mechanistic perspective with particular emphasis on the role of transcription as a genome instability determinant.
HIV-1 Exploits a Dynamic Multi-aminoacyl-tRNA Synthetase Complex To Enhance Viral Replication.
Duchon, Alice A; St Gelais, Corine; Titkemeier, Nathan; Hatterschide, Joshua; Wu, Li; Musier-Forsyth, Karin
2017-11-01
A hallmark of retroviruses such as human immunodeficiency virus type 1 (HIV-1) is reverse transcription of genomic RNA to DNA, a process that is primed by cellular tRNAs. HIV-1 recruits human tRNA Lys3 to serve as the reverse transcription primer via an interaction between lysyl-tRNA synthetase (LysRS) and the HIV-1 Gag polyprotein. LysRS is normally sequestered in a multi-aminoacyl-tRNA synthetase complex (MSC). Previous studies demonstrated that components of the MSC can be mobilized in response to certain cellular stimuli, but how LysRS is redirected from the MSC to viral particles for packaging is unknown. Here, we show that upon HIV-1 infection, a free pool of non-MSC-associated LysRS is observed and partially relocalized to the nucleus. Heat inactivation of HIV-1 blocks nuclear localization of LysRS, but treatment with a reverse transcriptase inhibitor does not, suggesting that the trigger for relocalization occurs prior to reverse transcription. A reduction in HIV-1 infection is observed upon treatment with an inhibitor to mitogen-activated protein kinase that prevents phosphorylation of LysRS on Ser207, release of LysRS from the MSC, and nuclear localization. A phosphomimetic mutant of LysRS (S207D) that lacked the capability to aminoacylate tRNA Lys3 localized to the nucleus, rescued HIV-1 infectivity, and was packaged into virions. In contrast, a phosphoablative mutant (S207A) remained cytosolic and maintained full aminoacylation activity but failed to rescue infectivity and was not packaged. These findings suggest that HIV-1 takes advantage of the dynamic nature of the MSC to redirect and coopt cellular translation factors to enhance viral replication. IMPORTANCE Human tRNA Lys3 , the primer for reverse transcription, and LysRS are essential host factors packaged into HIV-1 virions. Previous studies found that tRNA Lys3 packaging depends on interactions between LysRS and HIV-1 Gag; however, many details regarding the mechanism of tRNA Lys3 and LysRS packaging remain unknown. LysRS is normally sequestered in a high-molecular-weight multi-aminoacyl-tRNA synthetase complex (MSC), restricting the pool of free LysRS-tRNA Lys Mounting evidence suggests that LysRS is released under a variety of stimuli to perform alternative functions within the cell. Here, we show that HIV-1 infection results in a free pool of LysRS that is relocalized to the nucleus of target cells. Blocking this pathway in HIV-1-producing cells resulted in less infectious progeny virions. Understanding the mechanism by which LysRS is recruited into the viral assembly pathway can be exploited for the development of specific and effective therapeutics targeting this nontranslational function. Copyright © 2017 American Society for Microbiology.
Parent, Kristin N.; Schrad, Jason R.; Cingolani, Gino
2018-01-01
The majority of viruses on Earth form capsids built by multiple copies of one or more types of a coat protein arranged with 532 symmetry, generating an icosahedral shell. This highly repetitive structure is ideal to closely pack identical protein subunits and to enclose the nucleic acid genomes. However, the icosahedral capsid is not merely a passive cage but undergoes dynamic events to promote packaging, maturation and the transfer of the viral genome into the host. These essential processes are often mediated by proteinaceous complexes that interrupt the shell’s icosahedral symmetry, providing a gateway through the capsid. In this review, we take an inventory of molecular structures observed either internally, or at the 5-fold vertices of icosahedral DNA viruses that infect bacteria, archea and eukaryotes. Taking advantage of the recent revolution in cryo-electron microscopy (cryo-EM) and building upon a wealth of crystallographic structures of individual components, we review the design principles of non-icosahedral structural components that interrupt icosahedral symmetry and discuss how these macromolecules play vital roles in genome packaging, ejection and host receptor-binding. PMID:29414851
Kaufman, Brett A.; Durisic, Nela; Mativetsky, Jeffrey M.; Costantino, Santiago; Hancock, Mark A.; Grutter, Peter
2007-01-01
Packaging DNA into condensed structures is integral to the transmission of genomes. The mammalian mitochondrial genome (mtDNA) is a high copy, maternally inherited genome in which mutations cause a variety of multisystem disorders. In all eukaryotic cells, multiple mtDNAs are packaged with protein into spheroid bodies called nucleoids, which are the fundamental units of mtDNA segregation. The mechanism of nucleoid formation, however, remains unknown. Here, we show that the mitochondrial transcription factor TFAM, an abundant and highly conserved High Mobility Group box protein, binds DNA cooperatively with nanomolar affinity as a homodimer and that it is capable of coordinating and fully compacting several DNA molecules together to form spheroid structures. We use noncontact atomic force microscopy, which achieves near cryo-electron microscope resolution, to reveal the structural details of protein–DNA compaction intermediates. The formation of these complexes involves the bending of the DNA backbone, and DNA loop formation, followed by the filling in of proximal available DNA sites until the DNA is compacted. These results indicate that TFAM alone is sufficient to organize mitochondrial chromatin and provide a mechanism for nucleoid formation. PMID:17581862
Package models and the information crisis of prebiotic evolution.
Silvestre, Daniel A M M; Fontanari, José F
2008-05-21
The coexistence between different types of templates has been the choice solution to the information crisis of prebiotic evolution, triggered by the finding that a single RNA-like template cannot carry enough information to code for any useful replicase. In principle, confining d distinct templates of length L in a package or protocell, whose survival depends on the coexistence of the templates it holds in, could resolve this crisis provided that d is made sufficiently large. Here we review the prototypical package model of Niesert et al. [1981. Origin of life between Scylla and Charybdis. J. Mol. Evol. 17, 348-353] which guarantees the greatest possible region of viability of the protocell population, and show that this model, and hence the entire package approach, does not resolve the information crisis. In particular, we show that the total information stored in a viable protocell (Ld) tends to a constant value that depends only on the spontaneous error rate per nucleotide of the template replication mechanism. As a result, an increase of d must be followed by a decrease of L, so that the net information gain is null.
EuCAP, a Eukaryotic Community Annotation Package, and its application to the rice genome
Thibaud-Nissen, Françoise; Campbell, Matthew; Hamilton, John P; Zhu, Wei; Buell, C Robin
2007-01-01
Background Despite the improvements of tools for automated annotation of genome sequences, manual curation at the structural and functional level can provide an increased level of refinement to genome annotation. The Institute for Genomic Research Rice Genome Annotation (hereafter named the Osa1 Genome Annotation) is the product of an automated pipeline and, for this reason, will benefit from the input of biologists with expertise in rice and/or particular gene families. Leveraging knowledge from a dispersed community of scientists is a demonstrated way of improving a genome annotation. This requires tools that facilitate 1) the submission of gene annotation to an annotation project, 2) the review of the submitted models by project annotators, and 3) the incorporation of the submitted models in the ongoing annotation effort. Results We have developed the Eukaryotic Community Annotation Package (EuCAP), an annotation tool, and have applied it to the rice genome. The primary level of curation by community annotators (CA) has been the annotation of gene families. Annotation can be submitted by email or through the EuCAP Web Tool. The CA models are aligned to the rice pseudomolecules and the coordinates of these alignments, along with functional annotation, are stored in the MySQL EuCAP Gene Model database. Web pages displaying the alignments of the CA models to the Osa1 Genome models are automatically generated from the EuCAP Gene Model database. The alignments are reviewed by the project annotators (PAs) in the context of experimental evidence. Upon approval by the PAs, the CA models, along with the corresponding functional annotations, are integrated into the Osa1 Genome Annotation. The CA annotations, grouped by family, are displayed on the Community Annotation pages of the project website , as well as in the Community Annotation track of the Genome Browser. Conclusion We have applied EuCAP to rice. As of July 2007, the structural and/or functional annotation of 1,094 genes representing 57 families have been deposited and integrated into the current gene set. All of the EuCAP components are open-source, thereby allowing the implementation of EuCAP for the annotation of other genomes. EuCAP is available at . PMID:17961238
Moszczynska, Anna; Burghardt, Kyle J.; Yu, Dongyue
2017-01-01
Short interspersed elements (SINEs) are typically silenced by DNA hypermethylation in somatic cells, but can retrotranspose in proliferating cells during adult neurogenesis. Hypomethylation caused by disease pathology or genotoxic stress leads to genomic instability of SINEs. The goal of the present investigation was to determine whether neurotoxic doses of binge or chronic methamphetamine (METH) trigger retrotransposition of the identifier (ID) element, a member of the rat SINE family, in the dentate gyrus genomic DNA. Adult male Sprague-Dawley rats were treated with saline or high doses of binge or chronic METH and sacrificed at three different time points thereafter. DNA methylation analysis, immunohistochemistry and next-generation sequencing (NGS) were performed on the dorsal dentate gyrus samples. Binge METH triggered hypomethylation, while chronic METH triggered hypermethylation of the CpG-2 site. Both METH regimens were associated with increased intensities in poly(A)-binding protein 1 (PABP1, a SINE regulatory protein)-like immunohistochemical staining in the dentate gyrus. The amplification of several ID element sequences was significantly higher in the chronic METH group than in the control group a week after METH, and they mapped to genes coding for proteins regulating cell growth and proliferation, transcription, protein function as well as for a variety of transporters. The results suggest that chronic METH induces ID element retrotransposition in the dorsal dentate gyrus and may affect hippocampal neurogenesis. PMID:28272323
Xue, Alexander T; Hickerson, Michael J
2017-11-01
Population genetic data from multiple taxa can address comparative phylogeographic questions about community-scale response to environmental shifts, and a useful strategy to this end is to employ hierarchical co-demographic models that directly test multi-taxa hypotheses within a single, unified analysis. This approach has been applied to classical phylogeographic data sets such as mitochondrial barcodes as well as reduced-genome polymorphism data sets that can yield 10,000s of SNPs, produced by emergent technologies such as RAD-seq and GBS. A strategy for the latter had been accomplished by adapting the site frequency spectrum to a novel summarization of population genomic data across multiple taxa called the aggregate site frequency spectrum (aSFS), which potentially can be deployed under various inferential frameworks including approximate Bayesian computation, random forest and composite likelihood optimization. Here, we introduce the r package multi-dice, a wrapper program that exploits existing simulation software for flexible execution of hierarchical model-based inference using the aSFS, which is derived from reduced genome data, as well as mitochondrial data. We validate several novel software features such as applying alternative inferential frameworks, enforcing a minimal threshold of time surrounding co-demographic pulses and specifying flexible hyperprior distributions. In sum, multi-dice provides comparative analysis within the familiar R environment while allowing a high degree of user customization, and will thus serve as a tool for comparative phylogeography and population genomics. © 2017 The Authors. Molecular Ecology Resources Published by John Wiley & Sons Ltd.
Sommerstein, Rami; Führer, Urs; Lo Priore, Elia; Casanova, Carlo; Meinel, Dominik M; Seth-Smith, Helena MB; Kronenberg, Andreas; Koch, Daniel; Senn, Laurence; Widmer, Andreas F; Egli, Adrian; Marschall, Jonas
2017-01-01
We describe an outbreak of Burkholderia stabilis associated with contaminated washing gloves, a commercially available Class I medical device. Triggered by an increase in Burkholderia cepacia complex (BCC) bacteremias and the detection of BCC in unopened packages of washing gloves, an ad hoc national outbreak committee comprising representatives of a public health organisation, a regulatory agency, and an expert association convened and commissioned an outbreak investigation. The investigation included retrospective case finding across Switzerland and whole genome sequencing (WGS) of isolates from cases and gloves. The investigation revealed that BCC were detected in clinical samples of 46 cases aged 17 to 91 years (33% females) from nine institutions between May 2015 and August 2016. Twenty-two isolates from case patients and 16 from washing gloves underwent WGS. All available outbreak isolates clustered within a span of < 19 differing alleles, while 13 unrelated clinical isolates differed by > 1,500 alleles. This BCC outbreak was rapidly identified, communicated, investigated and halted by an ad hoc collaboration of multiple stakeholders. WGS served as useful tool for confirming the source of the outbreak. This outbreak also highlights current regulatory limitations regarding Class I medical devices and the usefulness of a nationally coordinated outbreak response. PMID:29233255
An integrated workflow for analysis of ChIP-chip data.
Weigelt, Karin; Moehle, Christoph; Stempfl, Thomas; Weber, Bernhard; Langmann, Thomas
2008-08-01
Although ChIP-chip is a powerful tool for genome-wide discovery of transcription factor target genes, the steps involving raw data analysis, identification of promoters, and correlation with binding sites are still laborious processes. Therefore, we report an integrated workflow for the analysis of promoter tiling arrays with the Genomatix ChipInspector system. We compare this tool with open-source software packages to identify PU.1 regulated genes in mouse macrophages. Our results suggest that ChipInspector data analysis, comparative genomics for binding site prediction, and pathway/network modeling significantly facilitate and enhance whole-genome promoter profiling to reveal in vivo sites of transcription factor-DNA interactions.
Facies remolding in allochthonous chalk packages, Ekofisk and Albuskjell fields, North Sea
DOE Office of Scientific and Technical Information (OSTI.GOV)
Lutz, S.J.; Ekdale, A.A.
1990-05-01
The Ekofish and Albuskjell fields in the Central Graben of the North Sea produce hydrocarbons from resedimented chalk reservoirs. Although the allochthonous nature of chalk in these fields has been recognized, the correlations of, and association between, allochthonous units has not been described. Core analysis of the Tor Formation (Maastrichtian) and the Ekofish Formation (Danian) reveals that slump deposits have been remolded into debris flows, ooze flows, and turbidites. Packages of allochthonous sediment were deposited in slope and base-of-slope environments. Two kinds of allochthonous packages occur. One package, 1-3-m thick, consists of a basal debris flow overlain by an oozemore » flow. The other package, 10-20-m thick, contains three units: a basal debris flow, an intermediate slump, and an overlying turbidite. Deposition of each type of package probably resulted from a single triggering event. Lateral changes in facies (increased convolution and decreased clastic content) and in type of deposit (slump or debris flow to ooze flow) within the packages resulted from differing degrees of deformation as the packages moved downslope. An increase in occurrence and angularity of chalk intraclasts, and in thickness of slump units from the Albuskjell field eastward to the Ekofisk field, suggest that the graben-bounding Hidra fault zone (about 30 km away) is the source of the allochthonous deposits. Vertical changes in the type of allochthonous package (from debris and ooze flows upward to slumps and turbidites) reflect decreasing topographic relief along the fault escarpment as the graben filled. This model of vertical (basin shallowing) and lateral (downslope) facies changes allows correlation of allochthonous chalk units, which are excellent hydrocarbon reservoirs.« less
TESSIM: a simulator for the Athena-X-IFU
NASA Astrophysics Data System (ADS)
Wilms, J.; Smith, S. J.; Peille, P.; Ceballos, M. T.; Cobo, B.; Dauser, T.; Brand, T.; den Hartog, R. H.; Bandler, S. R.; de Plaa, J.; den Herder, J.-W. A.
2016-07-01
We present the design of tessim, a simulator for the physics of transition edge sensors developed in the framework of the Athena end to end simulation effort. Designed to represent the general behavior of transition edge sensors and to provide input for engineering and science studies for Athena, tessim implements a numerical solution of the linearized equations describing these devices. The simulation includes a model for the relevant noise sources and several implementations of possible trigger algorithms. Input and output of the software are standard FITS- files which can be visualized and processed using standard X-ray astronomical tool packages. Tessim is freely available as part of the SIXTE package (http://www.sternwarte.uni-erlangen.de/research/sixte/).
TESSIM: A Simulator for the Athena-X-IFU
NASA Technical Reports Server (NTRS)
Wilms, J.; Smith, S. J.; Peille, P.; Ceballos, M. T.; Cobo, B.; Dauser, T.; Brand, T.; Den Hartog, R. H.; Bandler, S. R.; De Plaa, J.;
2016-01-01
We present the design of tessim, a simulator for the physics of transition edge sensors developed in the framework of the Athena end to end simulation effort. Designed to represent the general behavior of transition edge sensors and to provide input for engineering and science studies for Athena, tessim implements a numerical solution of the linearized equations describing these devices. The simulation includes a model for the relevant noise sources and several implementations of possible trigger algorithms. Input and output of the software are standard FITS-les which can be visualized and processed using standard X-ray astronomical tool packages. Tessim is freely available as part of the SIXTE package (http:www.sternwarte.uni-erlangen.deresearchsixte).
Hamilton, John P; Neeno-Eckwall, Eric C; Adhikari, Bishwo N; Perna, Nicole T; Tisserat, Ned; Leach, Jan E; Lévesque, C André; Buell, C Robin
2011-01-01
The Comprehensive Phytopathogen Genomics Resource (CPGR) provides a web-based portal for plant pathologists and diagnosticians to view the genome and trancriptome sequence status of 806 bacterial, fungal, oomycete, nematode, viral and viroid plant pathogens. Tools are available to search and analyze annotated genome sequences of 74 bacterial, fungal and oomycete pathogens. Oomycete and fungal genomes are obtained directly from GenBank, whereas bacterial genome sequences are downloaded from the A Systematic Annotation Package (ASAP) database that provides curation of genomes using comparative approaches. Curated lists of bacterial genes relevant to pathogenicity and avirulence are also provided. The Plant Pathogen Transcript Assemblies Database provides annotated assemblies of the transcribed regions of 82 eukaryotic genomes from publicly available single pass Expressed Sequence Tags. Data-mining tools are provided along with tools to create candidate diagnostic markers, an emerging use for genomic sequence data in plant pathology. The Plant Pathogen Ribosomal DNA (rDNA) database is a resource for pathogens that lack genome or transcriptome data sets and contains 131 755 rDNA sequences from GenBank for 17 613 species identified as plant pathogens and related genera. Database URL: http://cpgr.plantbiology.msu.edu.
ERIC Educational Resources Information Center
Alderman, Lyn
2016-01-01
In Australia, a review of the higher education sector is usually triggered by a change in government leadership, followed by the development and implementation of the government's response in the form of a reform package to enact change. The aim of this study was to conduct an independent evaluation of a large-scale national government policy…
MethylMix 2.0: an R package for identifying DNA methylation genes. | Office of Cancer Genomics
DNA methylation is an important mechanism regulating gene transcription, and its role in carcinogenesis has been extensively studied. Hyper and hypomethylation of genes is a major mechanism of gene expression deregulation in a wide range of diseases. At the same time, high-throughput DNA methylation assays have been developed generating vast amounts of genome wide DNA methylation measurements. We developed MethylMix, an algorithm implemented in R to identify disease specific hyper and hypomethylated genes.
Prel, Anne; Caval, Vincent; Gayon, Régis; Ravassard, Philippe; Duthoit, Christine; Payen, Emmanuel; Maouche-Chretien, Leila; Creneguy, Alison; Nguyen, Tuan Huy; Martin, Nicolas; Piver, Eric; Sevrain, Raphaël; Lamouroux, Lucille; Leboulch, Philippe; Deschaseaux, Frédéric; Bouillé, Pascale; Sensébé, Luc; Pagès, Jean-Christophe
2015-01-01
RNA delivery is an attractive strategy to achieve transient gene expression in research projects and in cell- or gene-based therapies. Despite significant efforts investigating vector-directed RNA transfer, there is still a requirement for better efficiency of delivery to primary cells and in vivo. Retroviral platforms drive RNA delivery, yet retrovirus RNA-packaging constraints limit gene transfer to two genome-molecules per viral particle. To improve retroviral transfer, we designed a dimerization-independent MS2-driven RNA packaging system using MS2-Coat-retrovirus chimeras. The engineered chimeric particles promoted effective packaging of several types of RNAs and enabled efficient transfer of biologically active RNAs in various cell types, including human CD34+ and iPS cells. Systemic injection of high-titer particles led to gene expression in mouse liver and transferring Cre-recombinase mRNA in muscle permitted widespread editing at the ROSA26 locus. We could further show that the VLPs were able to activate an osteoblast differentiation pathway by delivering RUNX2- or DLX5-mRNA into primary human bone-marrow mesenchymal-stem cells. Thus, the novel chimeric MS2-lentiviral particles are a versatile tool for a wide range of applications including cellular-programming or genome-editing. PMID:26528487
NASA Technical Reports Server (NTRS)
Ahn, H. S.; Whitaker, Ann F. (Technical Monitor)
2001-01-01
The first flight of the Advanced Thin Ionization Calorimeter (ATIC) experiment from McMurdo, Antarctica lasted for 16 days, starting in December, 2000. The ATIC instrument consists of a fully active 320-crystal, 960-channel Bismuth Germanate (BGO) calorimeter, 202 scintillator strips in 3 hodoscopes interleaved with a graphite target, and a 4480-pixel silicon matrix charge detector. We have developed an Object Oriented data processing package based on ROOT. In this paper, we will describe the data processing scheme used in handling the accumulated 45 GB of flight data. We will also discuss trigger issues by comparing the measured energy-dependent trigger efficiency with its simulation and calibration issues by considering the time-dependence of housekeeping information, etc.
PharmacoGx: an R package for analysis of large pharmacogenomic datasets.
Smirnov, Petr; Safikhani, Zhaleh; El-Hachem, Nehme; Wang, Dong; She, Adrian; Olsen, Catharina; Freeman, Mark; Selby, Heather; Gendoo, Deena M A; Grossmann, Patrick; Beck, Andrew H; Aerts, Hugo J W L; Lupien, Mathieu; Goldenberg, Anna; Haibe-Kains, Benjamin
2016-04-15
Pharmacogenomics holds great promise for the development of biomarkers of drug response and the design of new therapeutic options, which are key challenges in precision medicine. However, such data are scattered and lack standards for efficient access and analysis, consequently preventing the realization of the full potential of pharmacogenomics. To address these issues, we implemented PharmacoGx, an easy-to-use, open source package for integrative analysis of multiple pharmacogenomic datasets. We demonstrate the utility of our package in comparing large drug sensitivity datasets, such as the Genomics of Drug Sensitivity in Cancer and the Cancer Cell Line Encyclopedia. Moreover, we show how to use our package to easily perform Connectivity Map analysis. With increasing availability of drug-related data, our package will open new avenues of research for meta-analysis of pharmacogenomic data. PharmacoGx is implemented in R and can be easily installed on any system. The package is available from CRAN and its source code is available from GitHub. bhaibeka@uhnresearch.ca or benjamin.haibe.kains@utoronto.ca Supplementary data are available at Bioinformatics online. © The Author 2015. Published by Oxford University Press. All rights reserved. For Permissions, please e-mail: journals.permissions@oup.com.
Analysis, annotation, and profiling of the oat seed transcriptome
USDA-ARS?s Scientific Manuscript database
Novel high-throughput next generation sequencing (NGS) technologies are providing opportunities to explore genomes and transcriptomes in a cost-effective manner. To construct a gene expression atlas of developing oat (Avena sativa) seeds, two software packages specifically designed for RNA-seq (Trin...
Hierarchical Scaffolding With Bambus
Pop, Mihai; Kosack, Daniel S.; Salzberg, Steven L.
2004-01-01
The output of a genome assembler generally comprises a collection of contiguous DNA sequences (contigs) whose relative placement along the genome is not defined. A procedure called scaffolding is commonly used to order and orient these contigs using paired read information. This ordering of contigs is an essential step when finishing and analyzing the data from a whole-genome shotgun project. Most recent assemblers include a scaffolding module; however, users have little control over the scaffolding algorithm or the information produced. We thus developed a general-purpose scaffolder, called Bambus, which affords users significant flexibility in controlling the scaffolding parameters. Bambus was used recently to scaffold the low-coverage draft dog genome data. Most significantly, Bambus enables the use of linking data other than that inferred from mate-pair information. For example, the sequence of a completed genome can be used to guide the scaffolding of a related organism. We present several applications of Bambus: support for finishing, comparative genomics, analysis of the haplotype structure of genomes, and scaffolding of a mammalian genome at low coverage. Bambus is available as an open-source package from our Web site. PMID:14707177
Hierarchical scaffolding with Bambus.
Pop, Mihai; Kosack, Daniel S; Salzberg, Steven L
2004-01-01
The output of a genome assembler generally comprises a collection of contiguous DNA sequences (contigs) whose relative placement along the genome is not defined. A procedure called scaffolding is commonly used to order and orient these contigs using paired read information. This ordering of contigs is an essential step when finishing and analyzing the data from a whole-genome shotgun project. Most recent assemblers include a scaffolding module; however, users have little control over the scaffolding algorithm or the information produced. We thus developed a general-purpose scaffolder, called Bambus, which affords users significant flexibility in controlling the scaffolding parameters. Bambus was used recently to scaffold the low-coverage draft dog genome data. Most significantly, Bambus enables the use of linking data other than that inferred from mate-pair information. For example, the sequence of a completed genome can be used to guide the scaffolding of a related organism. We present several applications of Bambus: support for finishing, comparative genomics, analysis of the haplotype structure of genomes, and scaffolding of a mammalian genome at low coverage. Bambus is available as an open-source package from our Web site.
Furman, Benjamin L. S.; Evans, Ben J.
2016-01-01
Sexual differentiation is fundamentally important for reproduction, yet the genetic triggers of this developmental process can vary, even between closely related species. Recent studies have uncovered, for example, variation in the genetic triggers for sexual differentiation within and between species of African clawed frogs (genus Xenopus). Here, we extend these discoveries by demonstrating that yet another sex determination system exists in Xenopus, specifically in the species Xenopus borealis. This system evolved recently in an ancestor of X. borealis that had the same sex determination system as X. laevis, a system which itself is newly evolved. Strikingly, the genomic region carrying the sex determination factor in X. borealis is homologous to that of therian mammals, including humans. Our results offer insights into how the genetic underpinnings of conserved phenotypes evolve, and suggest an important role for cooption of genetic building blocks with conserved developmental roles. PMID:27605520
Borrego, Belén; Rodríguez-Pulido, Miguel; Revilla, Concepción; Álvarez, Belén; Sobrino, Francisco; Domínguez, Javier; Sáiz, Margarita
2015-07-17
The innate immune system is the first line of defense against viral infections. Exploiting innate responses for antiviral, therapeutic and vaccine adjuvation strategies is being extensively explored. We have previously described, the ability of small in vitro RNA transcripts, mimicking the sequence and structure of different domains in the non-coding regions of the foot-and-mouth disease virus (FMDV) genome (ncRNAs), to trigger a potent and rapid innate immune response. These synthetic non-infectious molecules have proved to have a broad-range antiviral activity and to enhance the immunogenicity of an FMD inactivated vaccine in mice. Here, we have studied the involvement of pattern-recognition receptors (PRRs) in the ncRNA-induced innate response and analyzed the antiviral and cytokine profiles elicited in swine cultured cells, as well as peripheral blood mononuclear cells (PBMCs).
Li, Ruidong; Qu, Han; Wang, Shibo; Wei, Julong; Zhang, Le; Ma, Renyuan; Lu, Jianming; Zhu, Jianguo; Zhong, Wei-De; Jia, Zhenyu
2018-03-02
The large-scale multidimensional omics data in the Genomic Data Commons (GDC) provides opportunities to investigate the crosstalk among different RNA species and their regulatory mechanisms in cancers. Easy-to-use bioinformatics pipelines are needed to facilitate such studies. We have developed a user-friendly R/Bioconductor package, named GDCRNATools, for downloading, organizing, and analyzing RNA data in GDC with an emphasis on deciphering the lncRNA-mRNA related competing endogenous RNAs (ceRNAs) regulatory network in cancers. Many widely used bioinformatics tools and databases are utilized in our package. Users can easily pack preferred downstream analysis pipelines or integrate their own pipelines into the workflow. Interactive shiny web apps built in GDCRNATools greatly improve visualization of results from the analysis. GDCRNATools is an R/Bioconductor package that is freely available at Bioconductor (http://bioconductor.org/packages/devel/bioc/html/GDCRNATools.html). Detailed instructions, manual and example code are also available in Github (https://github.com/Jialab-UCR/GDCRNATools). arthur.jia@ucr.edu or zhongwd2009@live.cn or doctorzhujianguo@163.com.
ADaCGH: A Parallelized Web-Based Application and R Package for the Analysis of aCGH Data
Díaz-Uriarte, Ramón; Rueda, Oscar M.
2007-01-01
Background Copy number alterations (CNAs) in genomic DNA have been associated with complex human diseases, including cancer. One of the most common techniques to detect CNAs is array-based comparative genomic hybridization (aCGH). The availability of aCGH platforms and the need for identification of CNAs has resulted in a wealth of methodological studies. Methodology/Principal Findings ADaCGH is an R package and a web-based application for the analysis of aCGH data. It implements eight methods for detection of CNAs, gains and losses of genomic DNA, including all of the best performing ones from two recent reviews (CBS, GLAD, CGHseg, HMM). For improved speed, we use parallel computing (via MPI). Additional information (GO terms, PubMed citations, KEGG and Reactome pathways) is available for individual genes, and for sets of genes with altered copy numbers. Conclusions/Significance ADaCGH represents a qualitative increase in the standards of these types of applications: a) all of the best performing algorithms are included, not just one or two; b) we do not limit ourselves to providing a thin layer of CGI on top of existing BioConductor packages, but instead carefully use parallelization, examining different schemes, and are able to achieve significant decreases in user waiting time (factors up to 45×); c) we have added functionality not currently available in some methods, to adapt to recent recommendations (e.g., merging of segmentation results in wavelet-based and CGHseg algorithms); d) we incorporate redundancy, fault-tolerance and checkpointing, which are unique among web-based, parallelized applications; e) all of the code is available under open source licenses, allowing to build upon, copy, and adapt our code for other software projects. PMID:17710137
ADaCGH: A parallelized web-based application and R package for the analysis of aCGH data.
Díaz-Uriarte, Ramón; Rueda, Oscar M
2007-08-15
Copy number alterations (CNAs) in genomic DNA have been associated with complex human diseases, including cancer. One of the most common techniques to detect CNAs is array-based comparative genomic hybridization (aCGH). The availability of aCGH platforms and the need for identification of CNAs has resulted in a wealth of methodological studies. ADaCGH is an R package and a web-based application for the analysis of aCGH data. It implements eight methods for detection of CNAs, gains and losses of genomic DNA, including all of the best performing ones from two recent reviews (CBS, GLAD, CGHseg, HMM). For improved speed, we use parallel computing (via MPI). Additional information (GO terms, PubMed citations, KEGG and Reactome pathways) is available for individual genes, and for sets of genes with altered copy numbers. ADACGH represents a qualitative increase in the standards of these types of applications: a) all of the best performing algorithms are included, not just one or two; b) we do not limit ourselves to providing a thin layer of CGI on top of existing BioConductor packages, but instead carefully use parallelization, examining different schemes, and are able to achieve significant decreases in user waiting time (factors up to 45x); c) we have added functionality not currently available in some methods, to adapt to recent recommendations (e.g., merging of segmentation results in wavelet-based and CGHseg algorithms); d) we incorporate redundancy, fault-tolerance and checkpointing, which are unique among web-based, parallelized applications; e) all of the code is available under open source licenses, allowing to build upon, copy, and adapt our code for other software projects.
Meile, Lukas; Croll, Daniel; Brunner, Patrick C; Plissonneau, Clémence; Hartmann, Fanny E; McDonald, Bruce A; Sánchez-Vallet, Andrea
2018-04-25
Cultivar-strain specificity in the wheat-Zymoseptoria tritici pathosystem determines the infection outcome and is controlled by resistance genes on the host side, many of which have been identified. On the pathogen side, however, the molecular determinants of specificity remain largely unknown. We used genetic mapping, targeted gene disruption and allele swapping to characterise the recognition of the new avirulence factor Avr3D1. We then combined population genetic and comparative genomic analyses to characterise the evolutionary trajectory of Avr3D1. Avr3D1 is specifically recognised by wheat cultivars harbouring the Stb7 resistance gene, triggering a strong defence response without preventing pathogen infection and reproduction. Avr3D1 resides in a cluster of putative effector genes located in a genome region populated by independent transposable element insertions. The gene was present in all 132 investigated strains and is highly polymorphic, with 30 different protein variants identified. We demonstrated that specific amino acid substitutions in Avr3D1 led to evasion of recognition. These results demonstrate that quantitative resistance and gene-for-gene interactions are not mutually exclusive. Localising avirulence genes in highly plastic genomic regions probably facilitates accelerated evolution that enables escape from recognition by resistance proteins. © 2018 The Authors. New Phytologist © 2018 New Phytologist Trust.
Šuligoj, Tanja; Gregorini, Armando; Colomba, Mariastella; Ellis, H Julia; Ciclitira, Paul J
2013-12-01
Coeliac disease is a chronic small intestinal immune-mediated enteropathy triggered by dietary gluten in genetically predisposed individuals. Since it is unknown if all wheat varieties are equally toxic to coeliac patients seven Triticum accessions showing different origin (ancient/modern) and ploidy (di-, tetra- hexaploid) were studied. Selected strains of wheat were ancient Triticum monococcum precoce (AA genome) and Triticum speltoides (BB genome), accessions of Triticum turgidum durum (AABB genome) including two ancient (Graziella Ra and Kamut) and two modern (Senatore Cappelli and Svevo) durum strains of wheat and Triticum aestivum compactum (AABBDD genome). Small intestinal gluten-specific T-cell lines generated from 13 coeliac patients were tested with wheat accessions by proliferation assays. All strains of wheat independent of ploidy or ancient/modern origin triggered heterogeneous responses covering wide ranges of stimulation indices. Ancient strains of wheat, although previously suggested to be low or devoid of coeliac toxicity, should be tested for immunogenicity using gluten-specific T-cell lines from multiple coeliac patients rather than gluten-specific clones to assess their potential toxicity. Our findings provide further evidence for the need for a strict gluten-free diet in coeliac patients, including avoidance of ancient strains of wheat. Copyright © 2013 Elsevier Ltd and European Society for Clinical Nutrition and Metabolism. All rights reserved.
Vernick, Kenneth D.
2017-01-01
Metavisitor is a software package that allows biologists and clinicians without specialized bioinformatics expertise to detect and assemble viral genomes from deep sequence datasets. The package is composed of a set of modular bioinformatic tools and workflows that are implemented in the Galaxy framework. Using the graphical Galaxy workflow editor, users with minimal computational skills can use existing Metavisitor workflows or adapt them to suit specific needs by adding or modifying analysis modules. Metavisitor works with DNA, RNA or small RNA sequencing data over a range of read lengths and can use a combination of de novo and guided approaches to assemble genomes from sequencing reads. We show that the software has the potential for quick diagnosis as well as discovery of viruses from a vast array of organisms. Importantly, we provide here executable Metavisitor use cases, which increase the accessibility and transparency of the software, ultimately enabling biologists or clinicians to focus on biological or medical questions. PMID:28045932
Stanley, J; Townsend, R
1986-01-01
Intact recombinant DNAs containing single copies of either component of the cassava latent virus genome can elicit infection when mechanically inoculated to host plants in the presence of the appropriate second component. Characterisation of infectious mutant progeny viruses, by analysis of virus-specific supercoiled DNA intermediates, indicates that most if not all of the cloning vector has been deleted, achieved at least in some cases by intermolecular recombination in vivo between DNAs 1 and 2. Significant rearrangements within the intergenic region of DNA 2, predominantly external to the common region, can be tolerated without loss of infectivity suggesting a somewhat passive role in virus multiplication for the sequences in question. Although packaging constraints might impose limits on the amount of DNA within geminate particles, isolation of an infectious coat protein mutant defective in virion production suggests that packaging is not essential for systemic spread of the viral DNA. Images PMID:2875435
Software for the Integration of Multiomics Experiments in Bioconductor.
Ramos, Marcel; Schiffer, Lucas; Re, Angela; Azhar, Rimsha; Basunia, Azfar; Rodriguez, Carmen; Chan, Tiffany; Chapman, Phil; Davis, Sean R; Gomez-Cabrero, David; Culhane, Aedin C; Haibe-Kains, Benjamin; Hansen, Kasper D; Kodali, Hanish; Louis, Marie S; Mer, Arvind S; Riester, Markus; Morgan, Martin; Carey, Vince; Waldron, Levi
2017-11-01
Multiomics experiments are increasingly commonplace in biomedical research and add layers of complexity to experimental design, data integration, and analysis. R and Bioconductor provide a generic framework for statistical analysis and visualization, as well as specialized data classes for a variety of high-throughput data types, but methods are lacking for integrative analysis of multiomics experiments. The MultiAssayExperiment software package, implemented in R and leveraging Bioconductor software and design principles, provides for the coordinated representation of, storage of, and operation on multiple diverse genomics data. We provide the unrestricted multiple 'omics data for each cancer tissue in The Cancer Genome Atlas as ready-to-analyze MultiAssayExperiment objects and demonstrate in these and other datasets how the software simplifies data representation, statistical analysis, and visualization. The MultiAssayExperiment Bioconductor package reduces major obstacles to efficient, scalable, and reproducible statistical analysis of multiomics data and enhances data science applications of multiple omics datasets. Cancer Res; 77(21); e39-42. ©2017 AACR . ©2017 American Association for Cancer Research.
DOE Office of Scientific and Technical Information (OSTI.GOV)
Shustov, Alexandr V.; Frolov, Ilya, E-mail: ivfrolov@UAB.ed
In our previous studies, we have stated to build a new strategy for developing defective, pseudoinfectious flaviviruses (PIVs) and applying them as a new type of vaccine candidates. PIVs combined the efficiency of live vaccines with the safety of inactivated or subunit vaccines. The results of the present work demonstrate further development of chimeric PIVs encoding dengue virus 2 (DEN2V) glycoproteins and yellow fever virus (YFV)-derived replicative machinery as potential vaccine candidates. The newly designed PIVs have synergistically functioning mutations in the prM and NS2A proteins, which abolish processing of the latter proteins and make the defective viruses capable ofmore » producing either only noninfectious, immature and/or subviral DEN2V particles. The PIV genomes can be packaged to high titers into infectious virions in vitro using the NS1-deficient YFV helper RNAs, and both PIVs and helpers can then be passaged as two-component genome viruses at an escalating scale.« less
Genome assembly with in vitro proximity ligation data and whole-genome triplication in lettuce
Reyes-Chin-Wo, Sebastian; Wang, Zhiwen; Yang, Xinhua; Kozik, Alexander; Arikit, Siwaret; Song, Chi; Xia, Liangfeng; Froenicke, Lutz; Lavelle, Dean O.; Truco, María-José; Xia, Rui; Zhu, Shilin; Xu, Chunyan; Xu, Huaqin; Xu, Xun; Cox, Kyle; Korf, Ian; Meyers, Blake C.; Michelmore, Richard W.
2017-01-01
Lettuce (Lactuca sativa) is a major crop and a member of the large, highly successful Compositae family of flowering plants. Here we present a reference assembly for the species and family. This was generated using whole-genome shotgun Illumina reads plus in vitro proximity ligation data to create large superscaffolds; it was validated genetically and superscaffolds were oriented in genetic bins ordered along nine chromosomal pseudomolecules. We identify several genomic features that may have contributed to the success of the family, including genes encoding Cycloidea-like transcription factors, kinases, enzymes involved in rubber biosynthesis and disease resistance proteins that are expanded in the genome. We characterize 21 novel microRNAs, one of which may trigger phasiRNAs from numerous kinase transcripts. We provide evidence for a whole-genome triplication event specific but basal to the Compositae. We detect 26% of the genome in triplicated regions containing 30% of all genes that are enriched for regulatory sequences and depleted for genes involved in defence. PMID:28401891
A Guide for Industrial Mobilization
1989-03-01
packages; and cient, increased production controls may be needed. These actions include: i. Releasing machine tool trigger or- ders and increasing buys...710). the Department of Defense to maintain facili- 4. The National Defense Act authorizes: ties, machine tools , production equipment, and skilled...Defense Industrial Reserve Act pro- Room 3876, U.S. Departm nt of Commerce vides for the reserve of machine tools and other Washington, D.C. 20230 or
Fast randomization of large genomic datasets while preserving alteration counts.
Gobbi, Andrea; Iorio, Francesco; Dawson, Kevin J; Wedge, David C; Tamborero, David; Alexandrov, Ludmil B; Lopez-Bigas, Nuria; Garnett, Mathew J; Jurman, Giuseppe; Saez-Rodriguez, Julio
2014-09-01
Studying combinatorial patterns in cancer genomic datasets has recently emerged as a tool for identifying novel cancer driver networks. Approaches have been devised to quantify, for example, the tendency of a set of genes to be mutated in a 'mutually exclusive' manner. The significance of the proposed metrics is usually evaluated by computing P-values under appropriate null models. To this end, a Monte Carlo method (the switching-algorithm) is used to sample simulated datasets under a null model that preserves patient- and gene-wise mutation rates. In this method, a genomic dataset is represented as a bipartite network, to which Markov chain updates (switching-steps) are applied. These steps modify the network topology, and a minimal number of them must be executed to draw simulated datasets independently under the null model. This number has previously been deducted empirically to be a linear function of the total number of variants, making this process computationally expensive. We present a novel approximate lower bound for the number of switching-steps, derived analytically. Additionally, we have developed the R package BiRewire, including new efficient implementations of the switching-algorithm. We illustrate the performances of BiRewire by applying it to large real cancer genomics datasets. We report vast reductions in time requirement, with respect to existing implementations/bounds and equivalent P-value computations. Thus, we propose BiRewire to study statistical properties in genomic datasets, and other data that can be modeled as bipartite networks. BiRewire is available on BioConductor at http://www.bioconductor.org/packages/2.13/bioc/html/BiRewire.html. Supplementary data are available at Bioinformatics online. © The Author 2014. Published by Oxford University Press.
Jo, Kyuri; Kwon, Hawk-Bin; Kim, Sun
2014-06-01
Measuring expression levels of genes at the whole genome level can be useful for many purposes, especially for revealing biological pathways underlying specific phenotype conditions. When gene expression is measured over a time period, we have opportunities to understand how organisms react to stress conditions over time. Thus many biologists routinely measure whole genome level gene expressions at multiple time points. However, there are several technical difficulties for analyzing such whole genome expression data. In addition, these days gene expression data is often measured by using RNA-sequencing rather than microarray technologies and then analysis of expression data is much more complicated since the analysis process should start with mapping short reads and produce differentially activated pathways and also possibly interactions among pathways. In addition, many useful tools for analyzing microarray gene expression data are not applicable for the RNA-seq data. Thus a comprehensive package for analyzing time series transcriptome data is much needed. In this article, we present a comprehensive package, Time-series RNA-seq Analysis Package (TRAP), integrating all necessary tasks such as mapping short reads, measuring gene expression levels, finding differentially expressed genes (DEGs), clustering and pathway analysis for time-series data in a single environment. In addition to implementing useful algorithms that are not available for RNA-seq data, we extended existing pathway analysis methods, ORA and SPIA, for time series analysis and estimates statistical values for combined dataset by an advanced metric. TRAP also produces visual summary of pathway interactions. Gene expression change labeling, a practical clustering method used in TRAP, enables more accurate interpretation of the data when combined with pathway analysis. We applied our methods on a real dataset for the analysis of rice (Oryza sativa L. Japonica nipponbare) upon drought stress. The result showed that TRAP was able to detect pathways more accurately than several existing methods. TRAP is available at http://biohealth.snu.ac.kr/software/TRAP/. Copyright © 2014 Elsevier Inc. All rights reserved.
Power, Robert A; Cohen-Woods, Sarah; Ng, Mandy Y; Butler, Amy W; Craddock, Nick; Korszun, Ania; Jones, Lisa; Jones, Ian; Gill, Michael; Rice, John P; Maier, Wolfgang; Zobel, Astrid; Mors, Ole; Placentino, Anna; Rietschel, Marcella; Aitchison, Katherine J; Tozzi, Federica; Muglia, Pierandrea; Breen, Gerome; Farmer, Anne E; McGuffin, Peter; Lewis, Cathryn M; Uher, Rudolf
2013-09-01
Stressful life events are an established trigger for depression and may contribute to the heterogeneity within genome-wide association analyses. With depression cases showing an excess of exposure to stressful events compared to controls, there is difficulty in distinguishing between "true" cases and a "normal" response to a stressful environment. This potential contamination of cases, and that from genetically at risk controls that have not yet experienced environmental triggers for onset, may reduce the power of studies to detect causal variants. In the RADIANT sample of 3,690 European individuals, we used propensity score matching to pair cases and controls on exposure to stressful life events. In 805 case-control pairs matched on stressful life event, we tested the influence of 457,670 common genetic variants on the propensity to depression under comparable level of adversity with a sign test. While this analysis produced no significant findings after genome-wide correction for multiple testing, we outline a novel methodology and perspective for providing environmental context in genetic studies. We recommend contextualizing depression by incorporating environmental exposure into genome-wide analyses as a complementary approach to testing gene-environment interactions. Possible explanations for negative findings include a lack of statistical power due to small sample size and conditional effects, resulting from the low rate of adequate matching. Our findings underscore the importance of collecting information on environmental risk factors in studies of depression and other complex phenotypes, so that sufficient sample sizes are available to investigate their effect in genome-wide association analysis. Copyright © 2013 Wiley Periodicals, Inc.
Langefeld, Carl D; Comeau, Mary E; Ng, Maggie C Y; Guan, Meijian; Dimitrov, Latchezar; Mudgal, Poorva; Spainhour, Mitzie H; Julian, Bruce A; Edberg, Jeffrey C; Croker, Jennifer A; Divers, Jasmin; Hicks, Pamela J; Bowden, Donald W; Chan, Gary C; Ma, Lijun; Palmer, Nicholette D; Kimberly, Robert P; Freedman, Barry I
2018-06-06
African Americans carrying two apolipoprotein L1 gene (APOL1) renal risk variants have a high risk for nephropathy. However, only a minority develops end-stage renal disease (ESRD). Hence, modifying factors likely contribute to initiation of kidney disease such as endogenous (HIV infection) or exogenous (interferon treatment) environmental modifiers. In this report, genome-wide association studies and a meta-analysis were performed to identify novel loci for nondiabetic ESRD in African Americans and to detect genetic modifiers in APOL1-associated nephropathy. Two African American cohorts were analyzed, 1749 nondiabetic ESRD cases and 1136 controls from Wake Forest and 901 lupus nephritis (LN)-ESRD cases and 520 controls with systemic lupus erythematosus but lacking nephropathy from the LN-ESRD Consortium. Association analyses adjusting for APOL1 G1/G2 renal-risk variants were completed and stratified by APOL1 risk genotype status. Individual genome-wide association studies and meta-analysis results of all 2650 ESRD cases and 1656 controls did not detect significant genome-wide associations with ESRD beyond APOL1. Similarly, no single nucleotide polymorphism showed significant genome-wide evidence of an interaction with APOL1 risk variants. Thus, although variants with small individual effects cannot be ruled out and are likely to exist, our results suggest that APOL1-environment interactions may be of greater clinical importance in triggering nephropathy in African Americans than APOL1 interactions with other single nucleotide polymorphisms. Copyright © 2018 International Society of Nephrology. Published by Elsevier Inc. All rights reserved.
Le Thomas, Adrien; Stuwe, Evelyn; Li, Sisi; Marinov, Georgi; Rozhkov, Nikolay; Chen, Yung-Chia Ariel; Luo, Yicheng; Sachidanandam, Ravi; Toth, Katalin Fejes; Patel, Dinshaw; Aravin, Alexei A.
2014-01-01
Small noncoding RNAs that associate with Piwi proteins, called piRNAs, serve as guides for repression of diverse transposable elements in germ cells of metazoa. In Drosophila, the genomic regions that give rise to piRNAs, the so-called piRNA clusters, are transcribed to generate long precursor molecules that are processed into mature piRNAs. How genomic regions that give rise to piRNA precursor transcripts are differentiated from the rest of the genome and how these transcripts are specifically channeled into the piRNA biogenesis pathway are not known. We found that transgenerationally inherited piRNAs provide the critical trigger for piRNA production from homologous genomic regions in the next generation by two different mechanisms. First, inherited piRNAs enhance processing of homologous transcripts into mature piRNAs by initiating the ping-pong cycle in the cytoplasm. Second, inherited piRNAs induce installment of the histone 3 Lys9 trimethylation (H3K9me3) mark on genomic piRNA cluster sequences. The heterochromatin protein 1 (HP1) homolog Rhino binds to the H3K9me3 mark through its chromodomain and is enriched over piRNA clusters. Rhino recruits the piRNA biogenesis factor Cutoff to piRNA clusters and is required for efficient transcription of piRNA precursors. We propose that transgenerationally inherited piRNAs act as an epigenetic memory for identification of substrates for piRNA biogenesis on two levels: by inducing a permissive chromatin environment for piRNA precursor synthesis and by enhancing processing of these precursors. PMID:25085419
Burst-mode optical label processor with ultralow power consumption.
Ibrahim, Salah; Nakahara, Tatsushi; Ishikawa, Hiroshi; Takahashi, Ryo
2016-04-04
A novel label processor subsystem for 100-Gbps (25-Gbps × 4λs) burst-mode optical packets is developed, in which a highly energy-efficient method is pursued for extracting and interfacing the ultrafast packet-label to a CMOS-based processor where label recognition takes place. The method involves performing serial-to-parallel conversion for the label bits on a bit-by-bit basis by using an optoelectronic converter that is operated with a set of optical triggers generated in a burst-mode manner upon packet arrival. Here we present three key achievements that enabled a significant reduction in the total power consumption and latency of the whole subsystem; 1) based on a novel operation mechanism for providing amplification with bit-level selectivity, an optical trigger pulse generator, that consumes power for a very short duration upon packet arrival, is proposed and experimentally demonstrated, 2) the energy of optical triggers needed by the optoelectronic serial-to-parallel converter is reduced by utilizing a negative-polarity signal while employing an enhanced conversion scheme entitled the discharge-or-hold scheme, 3) the necessary optical trigger energy is further cut down by half by coupling the triggers through the chip's backside, whereas a novel lens-free packaging method is developed to enable a low-cost alignment process that works with simple visual observation.
Mechanisms of DNA Packaging by Large Double-Stranded DNA Viruses
Rao, Venigalla B.; Feiss, Michael
2016-01-01
Translocation of viral double-stranded DNA (dsDNA) into the icosahedral prohead shell is catalyzed by TerL, a motor protein that has ATPase, endonuclease, and translocase activities. TerL, following endonucleolytic cleavage of immature viral DNA concatemer recognized by TerS, assembles into a pentameric ring motor on the prohead’s portal vertex and uses ATP hydrolysis energy for DNA translocation. TerL’s N-terminal ATPase is connected by a hinge to the C-terminal endonuclease. Inchworm models propose that modest domain motions accompanying ATP hydrolysis are amplified, through changes in electrostatic interactions, into larger movements of the C-terminal domain bound to DNA. In phage φ29, four of the five TerL subunits sequentially hydrolyze ATP, each powering translocation of 2.5 bp. After one viral genome is encapsidated, the internal pressure signals termination of packaging and ejection of the motor. Current focus is on the structures of packaging complexes and the dynamics of TerL during DNA packaging, endonuclease regulation, and motor mechanics. PMID:26958920
Wang, Shaoying; Ji, Zhouxiang; Yan, Erfu; Haque, Farzin; Guo, Peixuan
2016-01-01
The DNA packaging motor of dsDNA bacterial viruses contains a head-tail connector with a channel for genome to enter during assembly and to exit during host infection. The DNA packaging motor of bacterial virus phi29 was recently reported to use the “One-way Revolution” mechanism for DNA packaging. This raises a question of how dsDNA is ejected during infection if the channel acts as a one-way inward valve. Here we report a three step conformational change of the portal channel that is common among DNA translocation motors of bacterial viruses T3, T4, SPP1, and phi29. The channels of these motors exercise three discrete steps of gating, as revealed by electrophysiological assays. It is proposed that the three step channel conformational changes occur during DNA entry process, resulting in a structural transition in preparation of DNA movement in the reverse direction during ejection. PMID:27181501
Reconstructing Past Admixture Processes from Local Genomic Ancestry Using Wavelet Transformation
Sanderson, Jean; Sudoyo, Herawati; Karafet, Tatiana M.; Hammer, Michael F.; Cox, Murray P.
2015-01-01
Admixture between long-separated populations is a defining feature of the genomes of many species. The mosaic block structure of admixed genomes can provide information about past contact events, including the time and extent of admixture. Here, we describe an improved wavelet-based technique that better characterizes ancestry block structure from observed genomic patterns. principal components analysis is first applied to genomic data to identify the primary population structure, followed by wavelet decomposition to develop a new characterization of local ancestry information along the chromosomes. For testing purposes, this method is applied to human genome-wide genotype data from Indonesia, as well as virtual genetic data generated using genome-scale sequential coalescent simulations under a wide range of admixture scenarios. Time of admixture is inferred using an approximate Bayesian computation framework, providing robust estimates of both admixture times and their associated levels of uncertainty. Crucially, we demonstrate that this revised wavelet approach, which we have released as the R package adwave, provides improved statistical power over existing wavelet-based techniques and can be used to address a broad range of admixture questions. PMID:25852078
Condezo, Gabriela N.; Marabini, Roberto; Ayora, Silvia; Carazo, José M.; Alba, Raúl; Chillón, Miguel
2015-01-01
ABSTRACT Adenovirus is one of the most complex icosahedral, nonenveloped viruses. Even after its structure was solved at near-atomic resolution by both cryo-electron microscopy and X-ray crystallography, the location of minor coat proteins is still a subject of debate. The elaborated capsid architecture is the product of a correspondingly complex assembly process, about which many aspects remain unknown. Genome encapsidation involves the concerted action of five virus proteins, and proteolytic processing by the virus protease is needed to prime the virion for sequential uncoating. Protein L1 52/55k is required for packaging, and multiple cleavages by the maturation protease facilitate its release from the nascent virion. Light-density particles are routinely produced in adenovirus infections and are thought to represent assembly intermediates. Here, we present the molecular and structural characterization of two different types of human adenovirus light particles produced by a mutant with delayed packaging. We show that these particles lack core polypeptide V but do not lack the density corresponding to this protein in the X-ray structure, thereby adding support to the adenovirus cryo-electron microscopy model. The two types of light particles present different degrees of proteolytic processing. Their structures provide the first glimpse of the organization of L1 52/55k protein inside the capsid shell and of how this organization changes upon partial maturation. Immature, full-length L1 52/55k is poised beneath the vertices to engage the virus genome. Upon proteolytic processing, L1 52/55k disengages from the capsid shell, facilitating genome release during uncoating. IMPORTANCE Adenoviruses have been extensively characterized as experimental systems in molecular biology, as human pathogens, and as therapeutic vectors. However, a clear picture of many aspects of their basic biology is still lacking. Two of these aspects are the location of minor coat proteins in the capsid and the molecular details of capsid assembly. Here, we provide evidence supporting one of the two current models for capsid architecture. We also show for the first time the location of the packaging protein L1 52/55k in particles lacking the virus genome and how this location changes during maturation. Our results contribute to clarifying standing questions in adenovirus capsid architecture and provide new details on the role of L1 52/55k protein in assembly. PMID:26178997
Histone Modification Associated with Initiation of DNA Replication | Center for Cancer Research
Before cells are able to divide, they must first duplicate their chromosomes accurately. DNA replication and packaging of DNA into chromosomes by histone proteins need to be coordinated by the cell to ensure proper transmission of genetic and epigenetic information to the next generation. Mammalian DNA replication begins at specific chromosomal sites, called replication origins, which are located throughout the genome. The replication origins are tightly regulated to start replication only once per cell division so that genomic stability is maintained and cancer development is prevented.
DNA Damage and Genomic Instability Induced by Inappropriate DNA Re-replication
2007-04-01
Conway, A., Lockhart, D. J., Davis, R. W., Brewer , B. J., and Fangman, W. L. (2001). Replication dynamics of the yeast genome. Science 294, 115–121... Brewer , B. J. (2001). An origin-deficient yeast artificial chromosome triggers a cell cycle checkpoint. Mol. Cell 7, 705–713. Vas, A., Mok, W., and...replication in yeast cells. We have demonstrated that re-replication induces a rapid and significant decrease in cell viability and a cellular DNA damage
Pellicer, Jaume; Kelly, Laura J; Leitch, Ilia J; Zomlefer, Wendy B; Fay, Michael F
2014-03-01
• Since the occurrence of giant genomes in angiosperms is restricted to just a few lineages, identifying where shifts towards genome obesity have occurred is essential for understanding the evolutionary mechanisms triggering this process. • Genome sizes were assessed using flow cytometry in 79 species and new chromosome numbers were obtained. Phylogenetically based statistical methods were applied to infer ancestral character reconstructions of chromosome numbers and nuclear DNA contents. • Melanthiaceae are the most diverse family in terms of genome size, with C-values ranging more than 230-fold. Our data confirmed that giant genomes are restricted to tribe Parideae, with most extant species in the family characterized by small genomes. Ancestral genome size reconstruction revealed that the most recent common ancestor (MRCA) for the family had a relatively small genome (1C = 5.37 pg). Chromosome losses and polyploidy are recovered as the main evolutionary mechanisms generating chromosome number change. • Genome evolution in Melanthiaceae has been characterized by a trend towards genome size reduction, with just one episode of dramatic DNA accumulation in Parideae. Such extreme contrasting profiles of genome size evolution illustrate the key role of transposable elements and chromosome rearrangements in driving the evolution of plant genomes. © 2013 The Authors. New Phytologist © 2013 New Phytologist Trust.
Dominguez, Luis A.; Yildirim, Battalgazi; Husker, Allen L.; Cochran, Elizabeth S.; Christensen, Carl; Cruz-Atienza, Victor M.
2015-01-01
Each volunteer computer monitors ground motion and communicates using the Berkeley Open Infrastructure for Network Computing (BOINC, Anderson, 2004). Using a standard short‐term average, long‐term average (STLA) algorithm (Earle and Shearer, 1994; Cochran, Lawrence, Christensen, Chung, 2009; Cochran, Lawrence, Christensen, and Jakka, 2009), volunteer computer and sensor systems detect abrupt changes in the acceleration recordings. Each time a possible trigger signal is declared, a small package of information containing sensor and ground‐motion information is streamed to one of the QCN servers (Chung et al., 2011). Trigger signals, correlated in space and time, are then processed by the QCN server to look for potential earthquakes.
Johnson, Matthew C; Sena-Velez, Marta; Washburn, Brian K; Platt, Georgia N; Lu, Stephen; Brewer, Tess E; Lynn, Jason S; Stroupe, M Elizabeth; Jones, Kathryn M
2017-12-01
Bacteriophages of nitrogen-fixing rhizobial bacteria are revealing a wealth of novel structures, diverse enzyme combinations and genomic features. Here we report the cryo-EM structure of the phage capsid at 4.9-5.7Å-resolution, the phage particle proteome, and the genome of the Sinorhizobium meliloti-infecting Podovirus ΦM5. This is the first structure of a phage with a capsid and capsid-associated structural proteins related to those of the LUZ24-like viruses that infect Pseudomonas aeruginosa. Like many other Podoviruses, ΦM5 is a T=7 icosahedron with a smooth capsid and short, relatively featureless tail. Nonetheless, this group is phylogenetically quite distinct from Podoviruses of the well-characterized T7, P22, and epsilon 15 supergroups. Structurally, a distinct bridge of density that appears unique to ΦM5 reaches down the body of the coat protein to the extended loop that interacts with the next monomer in a hexamer, perhaps stabilizing the mature capsid. Further, the predicted tail fibers of ΦM5 are quite different from those of enteric bacteria phages, but have domains in common with other rhizophages. Genomically, ΦM5 is highly mosaic. The ΦM5 genome is 44,005bp with 357bp direct terminal repeats (DTRs) and 58 unique ORFs. Surprisingly, the capsid structural module, the tail module, the DNA-packaging terminase, the DNA replication module and the integrase each appear to be from a different lineage. One of the most unusual features of ΦM5 is its terminase whose large subunit is quite different from previously-described short-DTR-generating packaging machines and does not fit into any of the established phylogenetic groups. Copyright © 2017 The Authors. Published by Elsevier Inc. All rights reserved.
Genomic Diversity in the Endosymbiotic Bacterium Rhizobium leguminosarum.
Sánchez-Cañizares, Carmen; Jorrín, Beatriz; Durán, David; Nadendla, Suvarna; Albareda, Marta; Rubio-Sanz, Laura; Lanza, Mónica; González-Guerrero, Manuel; Prieto, Rosa Isabel; Brito, Belén; Giglio, Michelle G; Rey, Luis; Ruiz-Argüeso, Tomás; Palacios, José M; Imperial, Juan
2018-01-24
Rhizobium leguminosarum bv. viciae is a soil α-proteobacterium that establishes a diazotrophic symbiosis with different legumes of the Fabeae tribe. The number of genome sequences from rhizobial strains available in public databases is constantly increasing, although complete, fully annotated genome structures from rhizobial genomes are scarce. In this work, we report and analyse the complete genome of R. leguminosarum bv. viciae UPM791. Whole genome sequencing can provide new insights into the genetic features contributing to symbiotically relevant processes such as bacterial adaptation to the rhizosphere, mechanisms for efficient competition with other bacteria, and the ability to establish a complex signalling dialogue with legumes, to enter the root without triggering plant defenses, and, ultimately, to fix nitrogen within the host. Comparison of the complete genome sequences of two strains of R. leguminosarum bv. viciae , 3841 and UPM791, highlights the existence of different symbiotic plasmids and a common core chromosome. Specific genomic traits, such as plasmid content or a distinctive regulation, define differential physiological capabilities of these endosymbionts. Among them, strain UPM791 presents unique adaptations for recycling the hydrogen generated in the nitrogen fixation process.
A minimal kinetic model for a viral DNA packaging machine.
Yang, Qin; Catalano, Carlos Enrique
2004-01-20
Terminase enzymes are common to both eukaryotic and prokaryotic double-stranded DNA viruses. These enzymes possess ATPase and nuclease activities that work in concert to "package" a viral genome into an empty procapsid, and it is likely that terminase enzymes from disparate viruses utilize a common packaging mechanism. Bacteriophage lambda terminase possesses a site-specific nuclease activity, a so-called helicase activity, a DNA translocase activity, and multiple ATPase catalytic sites that function to package viral DNA. Allosteric interactions between the multiple catalytic sites have been reported. This study probes these catalytic interactions using enzyme kinetic, photoaffinity labeling, and vanadate inhibition studies. The ensemble of data forms the basis for a minimal kinetic model for lambda terminase. The model incorporates an ADP-driven conformational reorganization of the terminase subunits assembled on viral DNA, which is central to the activation of a catalytically competent packaging machine. The proposed model provides a unifying mechanism for allosteric interaction between the multiple catalytic sites of the holoenzyme and explains much of the kinetic data in the literature. Given that similar packaging mechanisms have been proposed for viruses as dissimilar as lambda and the herpes viruses, the model may find general utility in our global understanding of the enzymology of virus assembly.
Shabani, Mahsa; Bezuidenhout, Louise; Borry, Pascal
2014-11-01
Introducing data sharing practices into the genomic research arena has challenged the current mechanisms established to protect rights of individuals and triggered policy considerations. To inform such policy deliberations, soliciting public and research participants' attitudes with respect to genomic data sharing is a necessity. The main electronic databases were searched in order to retrieve empirical studies, investigating the attitudes of research participants and the public towards genomic data sharing through public databases. In the 15 included studies, participants' attitudes towards genomic data sharing revealed the influence of a constellation of interrelated factors, including the personal perceptions of controllability and sensitivity of data, potential risks and benefits of data sharing at individual and social level and also governance level considerations. This analysis indicates that future policy responses and recruitment practices should be attentive to a wide variety of concerns in order to promote both responsible and progressive research.
Adelman, K; Salmon, B; Baines, J D
2001-03-13
The product of the herpes simplex virus type 1 U(L)28 gene is essential for cleavage of concatemeric viral DNA into genome-length units and packaging of this DNA into viral procapsids. To address the role of U(L)28 in this process, purified U(L)28 protein was assayed for the ability to recognize conserved herpesvirus DNA packaging sequences. We report that DNA fragments containing the pac1 DNA packaging motif can be induced by heat treatment to adopt novel DNA conformations that migrate faster than the corresponding duplex in nondenaturing gels. Surprisingly, these novel DNA structures are high-affinity substrates for U(L)28 protein binding, whereas double-stranded DNA of identical sequence composition is not recognized by U(L)28 protein. We demonstrate that only one strand of the pac1 motif is responsible for the formation of novel DNA structures that are bound tightly and specifically by U(L)28 protein. To determine the relevance of the observed U(L)28 protein-pac1 interaction to the cleavage and packaging process, we have analyzed the binding affinity of U(L)28 protein for pac1 mutants previously shown to be deficient in cleavage and packaging in vivo. Each of the pac1 mutants exhibited a decrease in DNA binding by U(L)28 protein that correlated directly with the reported reduction in cleavage and packaging efficiency, thereby supporting a role for the U(L)28 protein-pac1 interaction in vivo. These data therefore suggest that the formation of novel DNA structures by the pac1 motif confers added specificity on recognition of DNA packaging sequences by the U(L)28-encoded component of the herpesvirus cleavage and packaging machinery.
diffHic: a Bioconductor package to detect differential genomic interactions in Hi-C data.
Lun, Aaron T L; Smyth, Gordon K
2015-08-19
Chromatin conformation capture with high-throughput sequencing (Hi-C) is a technique that measures the in vivo intensity of interactions between all pairs of loci in the genome. Most conventional analyses of Hi-C data focus on the detection of statistically significant interactions. However, an alternative strategy involves identifying significant changes in the interaction intensity (i.e., differential interactions) between two or more biological conditions. This is more statistically rigorous and may provide more biologically relevant results. Here, we present the diffHic software package for the detection of differential interactions from Hi-C data. diffHic provides methods for read pair alignment and processing, counting into bin pairs, filtering out low-abundance events and normalization of trended or CNV-driven biases. It uses the statistical framework of the edgeR package to model biological variability and to test for significant differences between conditions. Several options for the visualization of results are also included. The use of diffHic is demonstrated with real Hi-C data sets. Performance against existing methods is also evaluated with simulated data. On real data, diffHic is able to successfully detect interactions with significant differences in intensity between biological conditions. It also compares favourably to existing software tools on simulated data sets. These results suggest that diffHic is a viable approach for differential analyses of Hi-C data.
missMethyl: an R package for analyzing data from Illumina's HumanMethylation450 platform.
Phipson, Belinda; Maksimovic, Jovana; Oshlack, Alicia
2016-01-15
DNA methylation is one of the most commonly studied epigenetic modifications due to its role in both disease and development. The Illumina HumanMethylation450 BeadChip is a cost-effective way to profile >450 000 CpGs across the human genome, making it a popular platform for profiling DNA methylation. Here we introduce missMethyl, an R package with a suite of tools for performing normalization, removal of unwanted variation in differential methylation analysis, differential variability testing and gene set analysis for the 450K array. missMethyl is an R package available from the Bioconductor project at www.bioconductor.org. alicia.oshlack@mcri.edu.au Supplementary data are available at Bioinformatics online. © The Author 2015. Published by Oxford University Press. All rights reserved. For Permissions, please e-mail: journals.permissions@oup.com.
A sequence-based survey of the complex structural organization of tumor genomes
DOE Office of Scientific and Technical Information (OSTI.GOV)
Collins, Colin; Raphael, Benjamin J.; Volik, Stanislav
2008-04-03
The genomes of many epithelial tumors exhibit extensive chromosomal rearrangements. All classes of genome rearrangements can be identified using End Sequencing Profiling (ESP), which relies on paired-end sequencing of cloned tumor genomes. In this study, brain, breast, ovary and prostate tumors along with three breast cancer cell lines were surveyed with ESP yielding the largest available collection of sequence-ready tumor genome breakpoints and providing evidence that some rearrangements may be recurrent. Sequencing and fluorescence in situ hybridization (FISH) confirmed translocations and complex tumor genome structures that include coamplification and packaging of disparate genomic loci with associated molecular heterogeneity. Comparison ofmore » the tumor genomes suggests recurrent rearrangements. Some are likely to be novel structural polymorphisms, whereas others may be bona fide somatic rearrangements. A recurrent fusion transcript in breast tumors and a constitutional fusion transcript resulting from a segmental duplication were identified. Analysis of end sequences for single nucleotide polymorphisms (SNPs) revealed candidate somatic mutations and an elevated rate of novel SNPs in an ovarian tumor. These results suggest that the genomes of many epithelial tumors may be far more dynamic and complex than previously appreciated and that genomic fusions including fusion transcripts and proteins may be common, possibly yielding tumor-specific biomarkers and therapeutic targets.« less
Large Scale Software Building with CMake in ATLAS
NASA Astrophysics Data System (ADS)
Elmsheuser, J.; Krasznahorkay, A.; Obreshkov, E.; Undrus, A.; ATLAS Collaboration
2017-10-01
The offline software of the ATLAS experiment at the Large Hadron Collider (LHC) serves as the platform for detector data reconstruction, simulation and analysis. It is also used in the detector’s trigger system to select LHC collision events during data taking. The ATLAS offline software consists of several million lines of C++ and Python code organized in a modular design of more than 2000 specialized packages. Because of different workflows, many stable numbered releases are in parallel production use. To accommodate specific workflow requests, software patches with modified libraries are distributed on top of existing software releases on a daily basis. The different ATLAS software applications also require a flexible build system that strongly supports unit and integration tests. Within the last year this build system was migrated to CMake. A CMake configuration has been developed that allows one to easily set up and build the above mentioned software packages. This also makes it possible to develop and test new and modified packages on top of existing releases. The system also allows one to detect and execute partial rebuilds of the release based on single package changes. The build system makes use of CPack for building RPM packages out of the software releases, and CTest for running unit and integration tests. We report on the migration and integration of the ATLAS software to CMake and show working examples of this large scale project in production.
DOE Office of Scientific and Technical Information (OSTI.GOV)
Szadkowski, Zbigniew
2015-07-01
The paper presents the first results from the trigger based on the Discrete Cosine Transform (DCT) operating in the new Front-End Boards with Cyclone V FPGA deployed in 8 test surface detectors in the Pierre Auger Engineering Array. The patterns of the ADC traces generated by very inclined showers were obtained from the Auger database and from the CORSIKA simulation package supported next by Offline reconstruction Auger platform which gives a predicted digitized signal profiles. Simulations for many variants of the initial angle of shower, initialization depth in the atmosphere, type of particle and its initial energy gave a boundarymore » of the DCT coefficients used next for the on-line pattern recognition in the FPGA. Preliminary results have proven a right approach. We registered several showers triggered by the DCT for 120 MSps and 160 MSps. (authors)« less
Desensitization of triggers and urge reprocessing for pathological gambling: a case series.
Bae, Hwallip; Han, Changwoo; Kim, Daeho
2015-03-01
This case series introduces the desensitization of triggers and urge reprocessing (DeTUR), as a promising adjunctive therapy in addition to comprehensive treatment package for pathological gambling. This addiction protocol of eye movement desensitization and reprocessing was delivered to four male inpatients admitted to a 10-week inpatient program for pathological gambling. The therapist gave three 60-min weekly sessions of the DeTUR using bilateral stimulation (horizontal eye movements or alternative tactile stimuli) focusing on the hierarchy of triggering situations and the urge to initiate gambling behaviors. After treatment, self-reported gambling symptoms, depression, anxiety, and impulsiveness were all improved, and all the participants reported satisfaction with the therapy. They were followed up for 6 months and all maintained their abstinence from gambling and their symptomatic improvements. Given the efficiency (i.e., brevity and efficacy) of the treatment, a controlled study to confirm the effects of the DeTUR on pathological gambling would be justified.
NASA Astrophysics Data System (ADS)
Szadkowski, Zbigniew; Wiedeński, Michał
2017-06-01
We present first results from a trigger based on the discrete cosine transform (DCT) operating in new front-end boards with a Cyclone V E field-programmable gate array (FPGA) deployed in seven test surface detectors in the Pierre Auger Test Array. The patterns of the ADC traces generated by very inclined showers (arriving at 70° to 90° from the vertical) were obtained from the Auger database and from the CORSIKA simulation package supported by the Auger OffLine event reconstruction platform that gives predicted digitized signal profiles. Simulations for many values of the initial cosmic ray angle of arrival, the shower initialization depth in the atmosphere, the type of particle, and its initial energy gave a boundary on the DCT coefficients used for the online pattern recognition in the FPGA. Preliminary results validated the approach used. We recorded several showers triggered by the DCT for 120 Msamples/s and 160 Msamples/s.
SOFIA: an R package for enhancing genetic visualization with Circos
USDA-ARS?s Scientific Manuscript database
Visualization of data from any stage of genetic and genomic research is one of the most useful approaches for detecting potential errors, ensuring accuracy and reproducibility, and presentation of the resulting data. Currently software such as Circos, ClicO FS, and RCircos, among others, provide too...
Comparative analysis of prophages in Streptococcus mutans genomes
Fu, Tiwei; Fan, Xiangyu; Long, Quanxin; Deng, Wanyan; Song, Jinlin
2017-01-01
Prophages have been considered genetic units that have an intimate association with novel phenotypic properties of bacterial hosts, such as pathogenicity and genomic variation. Little is known about the genetic information of prophages in the genome of Streptococcus mutans, a major pathogen of human dental caries. In this study, we identified 35 prophage-like elements in S. mutans genomes and performed a comparative genomic analysis. Comparative genomic and phylogenetic analyses of prophage sequences revealed that the prophages could be classified into three main large clusters: Cluster A, Cluster B, and Cluster C. The S. mutans prophages in each cluster were compared. The genomic sequences of phismuN66-1, phismuNLML9-1, and phismu24-1 all shared similarities with the previously reported S. mutans phages M102, M102AD, and ϕAPCM01. The genomes were organized into seven major gene clusters according to the putative functions of the predicted open reading frames: packaging and structural modules, integrase, host lysis modules, DNA replication/recombination modules, transcriptional regulatory modules, other protein modules, and hypothetical protein modules. Moreover, an integrase gene was only identified in phismuNLML9-1 prophages. PMID:29158986
Liu, Siyang; Huang, Shujia; Rao, Junhua; Ye, Weijian; Krogh, Anders; Wang, Jun
2015-01-01
Comprehensive recognition of genomic variation in one individual is important for understanding disease and developing personalized medication and treatment. Many tools based on DNA re-sequencing exist for identification of single nucleotide polymorphisms, small insertions and deletions (indels) as well as large deletions. However, these approaches consistently display a substantial bias against the recovery of complex structural variants and novel sequence in individual genomes and do not provide interpretation information such as the annotation of ancestral state and formation mechanism. We present a novel approach implemented in a single software package, AsmVar, to discover, genotype and characterize different forms of structural variation and novel sequence from population-scale de novo genome assemblies up to nucleotide resolution. Application of AsmVar to several human de novo genome assemblies captures a wide spectrum of structural variants and novel sequences present in the human population in high sensitivity and specificity. Our method provides a direct solution for investigating structural variants and novel sequences from de novo genome assemblies, facilitating the construction of population-scale pan-genomes. Our study also highlights the usefulness of the de novo assembly strategy for definition of genome structure.
Webb, Joseph A; Jones, Christopher P; Parent, Leslie J; Rouzina, Ioulia; Musier-Forsyth, Karin
2013-08-01
Despite the vast excess of cellular RNAs, precisely two copies of viral genomic RNA (gRNA) are selectively packaged into new human immunodeficiency type 1 (HIV-1) particles via specific interactions between the HIV-1 Gag and the gRNA psi (ψ) packaging signal. Gag consists of the matrix (MA), capsid, nucleocapsid (NC), and p6 domains. Binding of the Gag NC domain to ψ is necessary for gRNA packaging, but the mechanism by which Gag selectively interacts with ψ is unclear. Here, we investigate the binding of NC and Gag variants to an RNA derived from ψ (Psi RNA), as well as to a non-ψ region (TARPolyA). Binding was measured as a function of salt to obtain the effective charge (Zeff) and nonelectrostatic (i.e., specific) component of binding, Kd(1M). Gag binds to Psi RNA with a dramatically reduced Kd(1M) and lower Zeff relative to TARPolyA. NC, GagΔMA, and a dimerization mutant of Gag bind TARPolyA with reduced Zeff relative to WT Gag. Mutations involving the NC zinc finger motifs of Gag or changes to the G-rich NC-binding regions of Psi RNA significantly reduce the nonelectrostatic component of binding, leading to an increase in Zeff. These results show that Gag interacts with gRNA using different binding modes; both the NC and MA domains are bound to RNA in the case of TARPolyA, whereas binding to Psi RNA involves only the NC domain. Taken together, these results suggest a novel mechanism for selective gRNA encapsidation.
? PID output-feedback control under event-triggered protocol
NASA Astrophysics Data System (ADS)
Zhao, Di; Wang, Zidong; Ding, Derui; Wei, Guoliang; Alsaadi, Fuad E.
2018-07-01
This paper is concerned with the ? proportional-integral-derivative (PID) output-feedback control problem for a class of linear discrete-time systems under event-triggered protocols. The controller and the actuators are connected through a communication network of limited bandwidth, and an event-triggered communication mechanism is adopted to decide when a certain control signal should be transmitted to the respective actuator. Furthermore, a novel PID output-feedback controller is designed where the accumulative sum-loop (the counterpart to the integral-loop in the continues-time setting) operates on a limited time-window with hope to mitigate the effect from the past measurement data. The main objective of the problem under consideration is to design a desired PID controller such that the closed-loop system is exponentially stable and the prescribed ? disturbance rejection attenuation level is guaranteed under event-triggered protocols. By means of the Lyapunov stability theory combined with the orthogonal decomposition, sufficient conditions are established under which the addressed PID controller design problem is recast into a linear convex optimization one that can be easily solved via available software packages. Finally, a simulation example is exploited to illustrate the usefulness and effectiveness of the established control scheme.
Chaouiya, Claudine; Keating, Sarah M; Berenguier, Duncan; Naldi, Aurélien; Thieffry, Denis; van Iersel, Martijn P; Le Novère, Nicolas; Helikar, Tomáš
2015-09-04
Quantitative methods for modelling biological networks require an in-depth knowledge of the biochemical reactions and their stoichiometric and kinetic parameters. In many practical cases, this knowledge is missing. This has led to the development of several qualitative modelling methods using information such as, for example, gene expression data coming from functional genomic experiments. The SBML Level 3 Version 1 Core specification does not provide a mechanism for explicitly encoding qualitative models, but it does provide a mechanism for SBML packages to extend the Core specification and add additional syntactical constructs. The SBML Qualitative Models package for SBML Level 3 adds features so that qualitative models can be directly and explicitly encoded. The approach taken in this package is essentially based on the definition of regulatory or influence graphs. The SBML Qualitative Models package defines the structure and syntax necessary to describe qualitative models that associate discrete levels of activities with entity pools and the transitions between states that describe the processes involved. This is particularly suited to logical models (Boolean or multi-valued) and some classes of Petri net models can be encoded with the approach.
fastBMA: scalable network inference and transitive reduction.
Hung, Ling-Hong; Shi, Kaiyuan; Wu, Migao; Young, William Chad; Raftery, Adrian E; Yeung, Ka Yee
2017-10-01
Inferring genetic networks from genome-wide expression data is extremely demanding computationally. We have developed fastBMA, a distributed, parallel, and scalable implementation of Bayesian model averaging (BMA) for this purpose. fastBMA also includes a computationally efficient module for eliminating redundant indirect edges in the network by mapping the transitive reduction to an easily solved shortest-path problem. We evaluated the performance of fastBMA on synthetic data and experimental genome-wide time series yeast and human datasets. When using a single CPU core, fastBMA is up to 100 times faster than the next fastest method, LASSO, with increased accuracy. It is a memory-efficient, parallel, and distributed application that scales to human genome-wide expression data. A 10 000-gene regulation network can be obtained in a matter of hours using a 32-core cloud cluster (2 nodes of 16 cores). fastBMA is a significant improvement over its predecessor ScanBMA. It is more accurate and orders of magnitude faster than other fast network inference methods such as the 1 based on LASSO. The improved scalability allows it to calculate networks from genome scale data in a reasonable time frame. The transitive reduction method can improve accuracy in denser networks. fastBMA is available as code (M.I.T. license) from GitHub (https://github.com/lhhunghimself/fastBMA), as part of the updated networkBMA Bioconductor package (https://www.bioconductor.org/packages/release/bioc/html/networkBMA.html) and as ready-to-deploy Docker images (https://hub.docker.com/r/biodepot/fastbma/). © The Authors 2017. Published by Oxford University Press.
Xu, Yaomin; Guo, Xingyi; Sun, Jiayang; Zhao, Zhongming
2015-01-01
Motivation: Large-scale cancer genomic studies, such as The Cancer Genome Atlas (TCGA), have profiled multidimensional genomic data, including mutation and expression profiles on a variety of cancer cell types, to uncover the molecular mechanism of cancerogenesis. More than a hundred driver mutations have been characterized that confer the advantage of cell growth. However, how driver mutations regulate the transcriptome to affect cellular functions remains largely unexplored. Differential analysis of gene expression relative to a driver mutation on patient samples could provide us with new insights in understanding driver mutation dysregulation in tumor genome and developing personalized treatment strategies. Results: Here, we introduce the Snowball approach as a highly sensitive statistical analysis method to identify transcriptional signatures that are affected by a recurrent driver mutation. Snowball utilizes a resampling-based approach and combines a distance-based regression framework to assign a robust ranking index of genes based on their aggregated association with the presence of the mutation, and further selects the top significant genes for downstream data analyses or experiments. In our application of the Snowball approach to both synthesized and TCGA data, we demonstrated that it outperforms the standard methods and provides more accurate inferences to the functional effects and transcriptional dysregulation of driver mutations. Availability and implementation: R package and source code are available from CRAN at http://cran.r-project.org/web/packages/DESnowball, and also available at http://bioinfo.mc.vanderbilt.edu/DESnowball/. Contact: zhongming.zhao@vanderbilt.edu Supplementary information: Supplementary data are available at Bioinformatics online. PMID:25192743
Nambiar, Bindu; Cornell Sookdeo, Cathleen; Berthelette, Patricia; Jackson, Robert; Piraino, Susan; Burnham, Brenda; Nass, Shelley; Souza, David; O'Riordan, Catherine R; Vincent, Karen A; Cheng, Seng H; Armentano, Donna; Kyostio-Moore, Sirkka
2017-02-01
Several ongoing clinical studies are evaluating recombinant adeno-associated virus (rAAV) vectors as gene delivery vehicles for a variety of diseases. However, the production of vectors with genomes >4.7 kb is challenging, with vector preparations frequently containing truncated genomes. To determine whether the generation of oversized rAAVs can be improved using a producer cell-line (PCL) process, HeLaS3-cell lines harboring either a 5.1 or 5.4 kb rAAV vector genome encoding codon-optimized cDNA for human B-domain deleted Factor VIII (FVIII) were isolated. High-producing "masterwells" (MWs), defined as producing >50,000 vg/cell, were identified for each oversized vector. These MWs provided stable vector production for >20 passages. The quality and potency of the AAVrh8R/FVIII-5.1 and AAVrh8R/FVIII-5.4 vectors generated by the PCL method were then compared to those prepared via transient transfection (TXN). Southern and dot blot analyses demonstrated that both production methods resulted in packaging of heterogeneously sized genomes. However, the PCL-derived rAAV vector preparations contained some genomes >4.7 kb, whereas the majority of genomes generated by the TXN method were ≤4.7 kb. The PCL process reduced packaging of non-vector DNA for both the AAVrh8R/FVIII-5.1 and the AAVrh8R/FVIII-5.4 kb vector preparations. Furthermore, more DNA-containing viral particles were obtained for the AAVrh8R/FVIII-5.1 vector. In a mouse model of hemophilia A, animals administered a PCL-derived rAAV vector exhibited twofold higher plasma FVIII activity and increased levels of vector genomes in the liver than mice treated with vector produced via TXN did. Hence, the quality of oversized vectors prepared using the PCL method is greater than that of vectors generated using the TXN process, and importantly this improvement translates to enhanced performance in vivo.
DOE Office of Scientific and Technical Information (OSTI.GOV)
Beilstein, Frauke; Dreiseikelmann, Brigitte
2008-03-30
A group of 74 Aeromonas isolates from surface water of three ponds in Bielefeld, Germany was screened for prophage induction after UV irradiation. The phage {phi}O18P was induced from the Aeromonas media isolate O18. {phi}O18P belongs to the Myoviridae phage family. The complete nucleotide sequence of the double stranded DNA genome of bacteriophage {phi}O18P consists of 33,985 bp. The genome has 5' protruding cohesive ends of 16 bases. On the {phi}O18P genome 46 open reading frames (orfs) were identified which are organized in the modules integration and regulation, replication, head, packaging, tail and lysis. Additionally the phage DNA includes amore » methylase gene. Comparison of the genome architecture with those of other bacteriophages revealed significant similarities to the P2 phage family and especially to the prophages of Aeromonas salmonicida and the Vibrio cholerae phage K139.« less
Integrating genome assemblies with MAIA
Nijkamp, Jurgen; Winterbach, Wynand; van den Broek, Marcel; Daran, Jean-Marc; Reinders, Marcel; de Ridder, Dick
2010-01-01
Motivation: De novo assembly of a eukaryotic genome with next-generation sequencing data is still a challenging task. Over the past few years several assemblers have been developed, often suitable for one specific type of sequencing data. The number of known genomes is expanding rapidly, therefore it becomes possible to use multiple reference genomes for assembly projects. We introduce an assembly integrator that makes use of all available data, i.e. multiple de novo assemblies and mappings against multiple related genomes, by optimizing a weighted combination of criteria. Results: The developed algorithm was applied on the de novo sequencing of the Saccharomyces cerevisiae CEN.PK 113-7D strain. Using Solexa and 454 read data, two de novo and three comparative assemblies were constructed and subsequently integrated, yielding 29 contigs, covering more than 12 Mbp; a drastic improvement compared with the single assemblies. Availability: MAIA is available as a Matlab package and can be downloaded from http://bioinformatics.tudelft.nl Contact: j.f.nijkamp@tudelft.nl PMID:20823304
Athena X-IFU event reconstruction software: SIRENA
NASA Astrophysics Data System (ADS)
Ceballos, Maria Teresa; Cobo, Beatriz; Peille, Philippe; Wilms, Joern; Brand, Thorsten; Dauser, Thomas; Bandler, Simon; Smith, Stephen
2015-09-01
This contribution describes the status and technical details of the SIRENA package, the software currently in development to perform the on board event energy reconstruction for the Athena calorimeter X-IFU. This on board processing will be done in the X-IFU DRE unit and it will consist in an initial triggering of event pulses followed by an analysis (with the SIRENA package) to determine the energy content of such events.The current algorithm used by SIRENA is the optimal filtering technique (also used by ASTRO-H processor) although some other algorithms are also being tested.Here we present these studies and some preliminary results about the energy resolution of the instrument based on simulations done with the SIXTE simulator (http://www.sternwarte.uni-erlangen.de/research/sixte/) in which SIRENA is integrated.
Concise classification of the genomic porcine endogenous retroviral gamma1 load to defined lineages.
Klymiuk, Nikolai; Wolf, Eckhard; Aigner, Bernhard
2008-02-05
We investigated the infection history of porcine endogenous retroviruses (PERV) gamma1 by analyzing published env and LTR sequences. PERV sequences from various breeds, porcine cell lines and infected human primary cells were included in the study. We identified a considerable number of retroviral lineages indicating multiple independent colonization events of the porcine genome. A recent boost of the proviral load in an isolated pig herd and exclusive occurrence of distinct lineages in single studies indicated the ongoing colonization of the porcine genome with endogenous retroviruses. Retroviral recombination between co-packaged genomes was a general factor for PERV gamma1 diversity which indicated the simultaneous expression of different proviral loci over a period of time. In total, our detailed description of endogenous retroviral lineages is the prerequisite for breeding approaches to minimize the infectious potential of porcine tissues for the subsequent use in xenotransplantation.
Arloth, Janine; Bogdan, Ryan; Weber, Peter; Frishman, Goar; Menke, Andreas; Wagner, Klaus V.; Balsevich, Georgia; Schmidt, Mathias V.; Karbalai, Nazanin; Czamara, Darina; Altmann, Andre; Trümbach, Dietrich; Wurst, Wolfgang; Mehta, Divya; Uhr, Manfred; Klengel, Torsten; Erhardt, Angelika; Carey, Caitlin E.; Conley, Emily Drabant; Ripke, Stephan; Wray, Naomi R.; Lewis, Cathryn M.; Hamilton, Steven P.; Weissman, Myrna M.; Breen, Gerome; Byrne, Enda M.; Blackwood, Douglas H.R.; Boomsma, Dorret I.; Cichon, Sven; Heath, Andrew C.; Holsboer, Florian; Lucae, Susanne; Madden, Pamela A.F.; Martin, Nicholas G.; McGuffin, Peter; Muglia, Pierandrea; Noethen, Markus M.; Penninx, Brenda P.; Pergadia, Michele L.; Potash, James B.; Rietschel, Marcella; Lin, Danyu; Müller-Myhsok, Bertram; Shi, Jianxin; Steinberg, Stacy; Grabe, Hans J.; Lichtenstein, Paul; Magnusson, Patrik; Perlis, Roy H.; Preisig, Martin; Smoller, Jordan W.; Stefansson, Kari; Uher, Rudolf; Kutalik, Zoltan; Tansey, Katherine E.; Teumer, Alexander; Viktorin, Alexander; Barnes, Michael R.; Bettecken, Thomas; Binder, Elisabeth B.; Breuer, René; Castro, Victor M.; Churchill, Susanne E.; Coryell, William H.; Craddock, Nick; Craig, Ian W.; Czamara, Darina; De Geus, Eco J.; Degenhardt, Franziska; Farmer, Anne E.; Fava, Maurizio; Frank, Josef; Gainer, Vivian S.; Gallagher, Patience J.; Gordon, Scott D.; Goryachev, Sergey; Gross, Magdalena; Guipponi, Michel; Henders, Anjali K.; Herms, Stefan; Hickie, Ian B.; Hoefels, Susanne; Hoogendijk, Witte; Hottenga, Jouke Jan; Iosifescu, Dan V.; Ising, Marcus; Jones, Ian; Jones, Lisa; Jung-Ying, Tzeng; Knowles, James A.; Kohane, Isaac S.; Kohli, Martin A.; Korszun, Ania; Landen, Mikael; Lawson, William B.; Lewis, Glyn; MacIntyre, Donald; Maier, Wolfgang; Mattheisen, Manuel; McGrath, Patrick J.; McIntosh, Andrew; McLean, Alan; Middeldorp, Christel M.; Middleton, Lefkos; Montgomery, Grant M.; Murphy, Shawn N.; Nauck, Matthias; Nolen, Willem A.; Nyholt, Dale R.; O’Donovan, Michael; Oskarsson, Högni; Pedersen, Nancy; Scheftner, William A.; Schulz, Andrea; Schulze, Thomas G.; Shyn, Stanley I.; Sigurdsson, Engilbert; Slager, Susan L.; Smit, Johannes H.; Stefansson, Hreinn; Steffens, Michael; Thorgeirsson, Thorgeir; Tozzi, Federica; Treutlein, Jens; Uhr, Manfred; van den Oord, Edwin J.C.G.; Van Grootheest, Gerard; Völzke, Henry; Weilburg, Jeffrey B.; Willemsen, Gonneke; Zitman, Frans G.; Neale, Benjamin; Daly, Mark; Levinson, Douglas F.; Sullivan, Patrick F.; Ruepp, Andreas; Müller-Myhsok, Bertram; Hariri, Ahmad R.; Binder, Elisabeth B.
2015-01-01
Summary Depression risk is exacerbated by genetic factors and stress exposure; however, the biological mechanisms through which these factors interact to confer depression risk are poorly understood. One putative biological mechanism implicates variability in the ability of cortisol, released in response to stress, to trigger a cascade of adaptive genomic and non-genomic processes through glucocorticoid receptor (GR) activation. Here, we demonstrate that common genetic variants in long-range enhancer elements modulate the immediate transcriptional response to GR activation in human blood cells. These functional genetic variants increase risk for depression and co-heritable psychiatric disorders. Moreover, these risk variants are associated with inappropriate amygdala reactivity, a transdiagnostic psychiatric endophenotype and an important stress hormone response trigger. Network modeling and animal experiments suggest that these genetic differences in GR-induced transcriptional activation may mediate the risk for depression and other psychiatric disorders by altering a network of functionally related stress-sensitive genes in blood and brain. Video Abstract PMID:26050039
Sun, Dian Xing; Hu, Da Rong; Wu, Guang Hui; Hu, Xue Ling; Li, Juan; Fan, Gong Ren
2002-08-01
To explore the possibility of using HBV as a gene delivery vector, and to test the anti-HBV effects by intracellular combined expression of antisense RNA and dominant negative mutants of core protein. Full length of mutant HBV genome, which expresses core-partial P fusion protein and/or antisense RNA, was transfected into HepG2.2.15 cell lines. Positive clones were selected and mixed in respective groups with hygromycin in the culture medium. HBsAg and HBeAg, which exist in the culture medium, were tested by ELISA method. Intracellular HBc related HBV DNA was examined by dot blot hybridization. The existence of recombinant HBV virion in the culture medium was examined by PCR. Free of packaging signal, HBV genome, which express the HBV structural proteins including core, pol and preS/S proteins, was inserted into pCI-neo vector. HepG2 cell lines were employed to transfect with the construct. G418 selection was done at the concentration of 400mug/ml in the culture medium. The G418-resistant clones with the best expression of HBsAg and HBcAg were theoretically considered as packaging cell lines and propagated under the same conditions. It was transfected with plasmid pMEP-CPAS and then selected with G418 and hygromycin in the culture medium. The existence of recombinant HBV virion in the culture medium was examined by PCR. The mean inhibitory rates of HBsAg were 2.74% 3.83%, 40.08 2.05% (t=35.5, P<0.01), 66.54% 4.45% (t=42.3, P<0.01), and 73.68% 5.07% (t=51.9, P<0.01) in group 2.2.15-pMEP4, 2.2.15-CP, 2.2.15-SAS, and 2.2.15-CPAS, respectively. The mean inhibitory rates of HBeAg were 4.46% 4.25%, 52.86% 1.32% (t=36.2, P<0.01), 26.36% 1.69% (t=22.3, P<0.01), and 59.28% 2.10% (t=39.0, P<0.01), respectively. The inhibitory rates of HBc related HBV DNA were 0, 82.0%, 59.9%, and 96.6%, respectively. Recombinant HB virion was detectable in the culture medium of all the three treatment groups. G418-resistant HBV packaging cell line, which harbored an HBV mutant whose packaging signal had been deleted, was generated. Expression of HBsAg and HBcAg was detectable. Transfected with plasmid pMEP-CPAS, it was found to secrete recombinant HB virion and no wild-type HBV was detectable in the culture medium. It has stronger anti-HBV effects by combined expression of antisense RNA and dominant negative mutants than by individual expression of them. With the help of wild-type HBV, the modified HBV genome can form and secret HBV like particles, which provides evidence that the antiviral gene will be hepatotropic expression and the antiviral effects will be amplified. The packaging cell line can provide packaging for replication-defective HBV, but with low efficiency.
Detection of seal contamination in heat-sealed food packaging based on active infrared thermography
NASA Astrophysics Data System (ADS)
D'huys, Karlien; Saeys, Wouter; De Ketelaere, Bart
2015-05-01
In the food industry packaging is often applied to protect the product from the environment, assuring quality and safety throughout shelf life if properly performed. Packaging quality depends on the material used and the closure (seal). The material is selected based on the specific needs of the food product to be wrapped. However, proper closure of the package is often harder to achieve. One problem possibly jeopardizing seal quality is the presence of food particles between the seal. Seal contamination can cause a decreased seal strength and thus an increased packaging failure risk. It can also trigger the formation of microchannels through which air and microorganisms can enter and spoil the enclosed food. Therefore, early detection and removal of seal-contaminated packages from the production chain is essential. In this work, a pulsed-type active thermography method using the heat of the sealing bars as an excitation source was studied for detecting seal contamination. The cooling profile of contaminated seals was recorded. The detection performance of four processing methods (based on a single frame, a fit of the cooling profile, pulsed phase thermography and a matched filter) was compared. High resolution digital images served as a reference to quantify contamination. The lowest detection limit (equivalent diameter of 0.63 mm) and the lowest processing time (0.42 s per sample) were obtained for the method based on a single frame. Presumably, practical limitations in the recording stage prevented the added value of active thermography to be fully reflected in this application.
Ward, Ben J; van Oosterhout, Cock
2016-03-01
HYBRIDCHECK is a software package to visualize the recombination signal in large DNA sequence data set, and it can be used to analyse recombination, genetic introgression, hybridization and horizontal gene transfer. It can scan large (multiple kb) contigs and whole-genome sequences of three or more individuals. HYBRIDCHECK is written in the r software for OS X, Linux and Windows operating systems, and it has a simple graphical user interface. In addition, the r code can be readily incorporated in scripts and analysis pipelines. HYBRIDCHECK implements several ABBA-BABA tests and visualizes the effects of hybridization and the resulting mosaic-like genome structure in high-density graphics. The package also reports the following: (i) the breakpoint positions, (ii) the number of mutations in each introgressed block, (iii) the probability that the identified region is not caused by recombination and (iv) the estimated age of each recombination event. The divergence times between the donor and recombinant sequence are calculated using a JC, K80, F81, HKY or GTR correction, and the dating algorithm is exceedingly fast. By estimating the coalescence time of introgressed blocks, it is possible to distinguish between hybridization and incomplete lineage sorting. HYBRIDCHECK is libré software and it and its manual are free to download from http://ward9250.github.io/HybridCheck/. © 2015 John Wiley & Sons Ltd.
Probabilistic models of genetic variation in structured populations applied to global human studies.
Hao, Wei; Song, Minsun; Storey, John D
2016-03-01
Modern population genetics studies typically involve genome-wide genotyping of individuals from a diverse network of ancestries. An important problem is how to formulate and estimate probabilistic models of observed genotypes that account for complex population structure. The most prominent work on this problem has focused on estimating a model of admixture proportions of ancestral populations for each individual. Here, we instead focus on modeling variation of the genotypes without requiring a higher-level admixture interpretation. We formulate two general probabilistic models, and we propose computationally efficient algorithms to estimate them. First, we show how principal component analysis can be utilized to estimate a general model that includes the well-known Pritchard-Stephens-Donnelly admixture model as a special case. Noting some drawbacks of this approach, we introduce a new 'logistic factor analysis' framework that seeks to directly model the logit transformation of probabilities underlying observed genotypes in terms of latent variables that capture population structure. We demonstrate these advances on data from the Human Genome Diversity Panel and 1000 Genomes Project, where we are able to identify SNPs that are highly differentiated with respect to structure while making minimal modeling assumptions. A Bioconductor R package called lfa is available at http://www.bioconductor.org/packages/release/bioc/html/lfa.html jstorey@princeton.edu Supplementary data are available at Bioinformatics online. © The Author 2015. Published by Oxford University Press.
USDA-ARS?s Scientific Manuscript database
The ARS Culture Collection (NRRL) currently contains 7569 strains within the family Streptomycetaceae but 4368 of them have not been characterized to the species level. A gene sequence database using the Bacterial Isolate Genomic Sequence Database package (BIGSdb) (Jolley & Maiden, 2010) is availabl...
The Effects of Nucleosome Positioning and Chromatin Architecture on Transgene Expression
ERIC Educational Resources Information Center
Kempton, Colton E.
2017-01-01
Eukaryotes use proteins to carefully package and compact their genomes to fit into the nuclei of their individual cells. Nucleosomes are the primary level of compaction. Nucleosomes are formed when DNA wraps around an octamer of histone proteins and a nucleosome's position can limit access to genetic regulatory elements. Therefore, nucleosomes…
Polisher (conflicting versions 2.0.8 on IM Form, 1.0 on abstract)
DOE Office of Scientific and Technical Information (OSTI.GOV)
2008-09-18
Polisher is a software package designed to facilitate the error correction of an assembled genome using Illumia read data. The software addresses substandard regions by automatically correcting consensus errors and/or suggesting primer walking reactions to improve the quality of the bases. This is done by performing the following:...........
Genome-wide association study for lettuce cultivars with improved salad processing efficiency
USDA-ARS?s Scientific Manuscript database
Lettuce (Lactuca sativa L.) is widely used as the main ingredient of packaged leafy vegetable salads. Salad lettuce can have short shelf life, decaying as early as eight days after harvest and reducing the nutritional quality. Decayed lettuce is not marketable, produces extra waste, and results in t...
Hernandez-Ferrer, Carles; Quintela Garcia, Ines; Danielski, Katharina; Carracedo, Ángel; Pérez-Jurado, Luis A; González, Juan R
2015-05-20
The well-known Genome-Wide Association Studies (GWAS) had led to many scientific discoveries using SNP data. Even so, they were not able to explain the full heritability of complex diseases. Now, other structural variants like copy number variants or DNA inversions, either germ-line or in mosaicism events, are being studies. We present the R package affy2sv to pre-process Affymetrix CytoScan HD/750k array (also for Genome-Wide SNP 5.0/6.0 and Axiom) in structural variant studies. We illustrate the capabilities of affy2sv using two different complete pipelines on real data. The first one performing a GWAS and a mosaic alterations detection study, and the other detecting CNVs and performing an inversion calling. Both examples presented in the article show up how affy2sv can be used as part of more complex pipelines aimed to analyze Affymetrix SNP arrays data in genetic association studies, where different types of structural variants are considered.
GEMINI: Integrative Exploration of Genetic Variation and Genome Annotations
Paila, Umadevi; Chapman, Brad A.; Kirchner, Rory; Quinlan, Aaron R.
2013-01-01
Modern DNA sequencing technologies enable geneticists to rapidly identify genetic variation among many human genomes. However, isolating the minority of variants underlying disease remains an important, yet formidable challenge for medical genetics. We have developed GEMINI (GEnome MINIng), a flexible software package for exploring all forms of human genetic variation. Unlike existing tools, GEMINI integrates genetic variation with a diverse and adaptable set of genome annotations (e.g., dbSNP, ENCODE, UCSC, ClinVar, KEGG) into a unified database to facilitate interpretation and data exploration. Whereas other methods provide an inflexible set of variant filters or prioritization methods, GEMINI allows researchers to compose complex queries based on sample genotypes, inheritance patterns, and both pre-installed and custom genome annotations. GEMINI also provides methods for ad hoc queries and data exploration, a simple programming interface for custom analyses that leverage the underlying database, and both command line and graphical tools for common analyses. We demonstrate GEMINI's utility for exploring variation in personal genomes and family based genetic studies, and illustrate its ability to scale to studies involving thousands of human samples. GEMINI is designed for reproducibility and flexibility and our goal is to provide researchers with a standard framework for medical genomics. PMID:23874191
Wang, Qingguo; Jia, Peilin; Zhao, Zhongming
2015-01-01
Fueled by widespread applications of high-throughput next generation sequencing (NGS) technologies and urgent need to counter threats of pathogenic viruses, large-scale studies were conducted recently to investigate virus integration in host genomes (for example, human tumor genomes) that may cause carcinogenesis or other diseases. A limiting factor in these studies, however, is rapid virus evolution and resulting polymorphisms, which prevent reads from aligning readily to commonly used virus reference genomes, and, accordingly, make virus integration sites difficult to detect. Another confounding factor is host genomic instability as a result of virus insertions. To tackle these challenges and improve our capability to identify cryptic virus-host fusions, we present a new approach that detects Virus intEgration sites through iterative Reference SEquence customization (VERSE). To the best of our knowledge, VERSE is the first approach to improve detection through customizing reference genomes. Using 19 human tumors and cancer cell lines as test data, we demonstrated that VERSE substantially enhanced the sensitivity of virus integration site detection. VERSE is implemented in the open source package VirusFinder 2 that is available at http://bioinfo.mc.vanderbilt.edu/VirusFinder/.
esATAC: An Easy-to-use Systematic pipeline for ATAC-seq data analysis.
Wei, Zheng; Zhang, Wei; Fang, Huan; Li, Yanda; Wang, Xiaowo
2018-03-07
ATAC-seq is rapidly emerging as one of the major experimental approaches to probe chromatin accessibility genome-wide. Here, we present "esATAC", a highly integrated easy-to-use R/Bioconductor package, for systematic ATAC-seq data analysis. It covers essential steps for full analyzing procedure, including raw data processing, quality control and downstream statistical analysis such as peak calling, enrichment analysis and transcription factor footprinting. esATAC supports one command line execution for preset pipelines, and provides flexible interfaces for building customized pipelines. esATAC package is open source under the GPL-3.0 license. It is implemented in R and C ++. Source code and binaries for Linux, MAC OS X and Windows are available through Bioconductor https://www.bioconductor.org/packages/release/bioc/html/esATAC.html). xwwang@tsinghua.edu.cn. Supplementary data are available at Bioinformatics online.
Khramtsova, Ekaterina A; Stranger, Barbara E
2017-02-01
Over the last decade, genome-wide association studies (GWAS) have generated vast amounts of analysis results, requiring development of novel tools for data visualization. Quantile–quantile (QQ) plots and Manhattan plots are classical tools which have been utilized to visually summarize GWAS results and identify genetic variants significantly associated with traits of interest. However, static visualizations are limiting in the information that can be shown. Here, we present Assocplots, a Python package for viewing and exploring GWAS results not only using classic static Manhattan and QQ plots, but also through a dynamic extension which allows to interactively visualize the relationships between GWAS results from multiple cohorts or studies. The Assocplots package is open source and distributed under the MIT license via GitHub (https://github.com/khramts/assocplots) along with examples, documentation and installation instructions. ekhramts@medicine.bsd.uchicago.edu or bstranger@medicine.bsd.uchicago.edu
EXP-PAC: providing comparative analysis and storage of next generation gene expression data.
Church, Philip C; Goscinski, Andrzej; Lefèvre, Christophe
2012-07-01
Microarrays and more recently RNA sequencing has led to an increase in available gene expression data. How to manage and store this data is becoming a key issue. In response we have developed EXP-PAC, a web based software package for storage, management and analysis of gene expression and sequence data. Unique to this package is SQL based querying of gene expression data sets, distributed normalization of raw gene expression data and analysis of gene expression data across experiments and species. This package has been populated with lactation data in the international milk genomic consortium web portal (http://milkgenomics.org/). Source code is also available which can be hosted on a Windows, Linux or Mac APACHE server connected to a private or public network (http://mamsap.it.deakin.edu.au/~pcc/Release/EXP_PAC.html). Copyright © 2012 Elsevier Inc. All rights reserved.
Polymorphism of DNA conformation inside the bacteriophage capsid.
Leforestier, Amélie
2013-03-01
Double-stranded DNA bacteriophage genomes are packaged into their icosahedral capsids at the highest densities known so far (about 50 % w:v). How the molecule is folded at such density and how its conformation changes upon ejection or packaging are fascinating questions still largely open. We review cryo-TEM analyses of DNA conformation inside partially filled capsids as a function of the physico-chemical environment (ions, osmotic pressure, temperature). We show that there exists a wide variety of DNA conformations. Strikingly, the different observed structures can be described by some of the different models proposed over the years for DNA organisation inside bacteriophage capsids: either spool-like structures with axial or concentric symmetries, or liquid crystalline structures characterised by a DNA homogeneous density. The relevance of these conformations for the understanding of DNA folding and unfolding upon ejection and packaging in vivo is discussed.
Le Moëllic, Cathy; Ouvrard-Pascaud, Antoine; Capurro, Claudia; Cluzeaud, Francoise; Fay, Michel; Jaisser, Frederic; Farman, Nicolette; Blot-Chabaud, Marcel
2004-05-01
Effects of aldosterone on its target cells have long been considered to be mediated exclusively through the genomic pathway; however, evidence has been provided for rapid effects of the hormone that may involve nongenomic mechanisms. Whether an interaction exists between these two signaling pathways is not yet established. In this study, the authors show that aldosterone triggers both early nongenomic and late genomic increase in sodium transport in the RCCD(2) rat cortical collecting duct cell line. In these cells, the early (up to 2.5 h) aldosterone-induced increase in short-circuit current (Isc) is not blocked by the mineralocorticoid receptor (MR) antagonist RU26752, it does not require mRNA or protein synthesis, and it involves the PKCalpha signaling pathway. In addition, this early response is reproduced by aldosterone-BSA, which acts at the cell surface and presumably does not enter the cells (aldo-BSA is unable to trigger the late response). The authors also show that MR is rapidly phosphorylated on serine and threonine residues by aldosterone or aldosterone-BSA. In contrast, the late (4 to 24 h) aldosterone-induced increase in ion transport occurs through activation of the MR and requires mRNA and protein synthesis. Interestingly, nongenomic and genomic aldosterone actions appear to be interdependent. Blocking the PKCalpha pathway results in the inhibition of the late genomic response to aldosterone, as demonstrated by the suppression of aldosterone-induced increase in MR transactivation activity, alpha1 Na(+)/K(+)/ATPase mRNA, and Isc. These data suggest cross-talk between the nongenomic and genomic responses to aldosterone in renal cells and suggest that the aldosterone-MR mediated increase in mRNA/protein synthesis and ion transport depends, at least in part, upon PKCalpha activation. E-mail: marcel.blot-chabaud@pharmacie.univ-mrs.fr
Gene transfer agents: phage-like elements of genetic exchange
Lang, Andrew S.; Zhaxybayeva, Olga; Beatty, J. Thomas
2013-01-01
Horizontal gene transfer is important in the evolution of bacterial and archaeal genomes. An interesting genetic exchange process is carried out by diverse phage-like gene transfer agents (GTAs) that are found in a wide range of prokaryotes. Although GTAs resemble phages, they lack the hallmark capabilities that define typical phages, and they package random pieces of the producing cell’s genome. In this Review, we discuss the defining characteristics of the GTAs that have been identified to date, along with potential functions for these agents and the possible evolutionary forces that act on the genes involved in their production. PMID:22683880
Proteomic analysis of macrophage activated with salmonella lipopolysaccharide
USDA-ARS?s Scientific Manuscript database
Macrophages play pivotal role in immunity. They are activated by many pathogen derived molecules such as lipopolysaccharides (LPS) which trigger the production of various proteins and peptides that drive and resolve inflammation. There are numerous studies on the effect of LPS at the genome level bu...
Breast cancer: The translation of big genomic data to cancer precision medicine.
Low, Siew-Kee; Zembutsu, Hitoshi; Nakamura, Yusuke
2018-03-01
Cancer is a complex genetic disease that develops from the accumulation of genomic alterations in which germline variations predispose individuals to cancer and somatic alterations initiate and trigger the progression of cancer. For the past 2 decades, genomic research has advanced remarkably, evolving from single-gene to whole-genome screening by using genome-wide association study and next-generation sequencing that contributes to big genomic data. International collaborative efforts have contributed to curating these data to identify clinically significant alterations that could be used in clinical settings. Focusing on breast cancer, the present review summarizes the identification of genomic alterations with high-throughput screening as well as the use of genomic information in clinical trials that match cancer patients to therapies, which further leads to cancer precision medicine. Furthermore, cancer screening and monitoring were enhanced greatly by the use of liquid biopsies. With the growing data complexity and size, there is much anticipation in exploiting deep machine learning and artificial intelligence to curate integrative "-omics" data to refine the current medical practice to be applied in the near future. © 2017 The Authors. Cancer Science published by John Wiley & Sons Australia, Ltd on behalf of Japanese Cancer Association.
Producing genome structure populations with the dynamic and automated PGS software.
Hua, Nan; Tjong, Harianto; Shin, Hanjun; Gong, Ke; Zhou, Xianghong Jasmine; Alber, Frank
2018-05-01
Chromosome conformation capture technologies such as Hi-C are widely used to investigate the spatial organization of genomes. Because genome structures can vary considerably between individual cells of a population, interpreting ensemble-averaged Hi-C data can be challenging, in particular for long-range and interchromosomal interactions. We pioneered a probabilistic approach for the generation of a population of distinct diploid 3D genome structures consistent with all the chromatin-chromatin interaction probabilities from Hi-C experiments. Each structure in the population is a physical model of the genome in 3D. Analysis of these models yields new insights into the causes and the functional properties of the genome's organization in space and time. We provide a user-friendly software package, called PGS, which runs on local machines (for practice runs) and high-performance computing platforms. PGS takes a genome-wide Hi-C contact frequency matrix, along with information about genome segmentation, and produces an ensemble of 3D genome structures entirely consistent with the input. The software automatically generates an analysis report, and provides tools to extract and analyze the 3D coordinates of specific domains. Basic Linux command-line knowledge is sufficient for using this software. A typical running time of the pipeline is ∼3 d with 300 cores on a computer cluster to generate a population of 1,000 diploid genome structures at topological-associated domain (TAD)-level resolution.
Baumler, David J.; Banta, Lois M.; Hung, Kai F.; Schwarz, Jodi A.; Cabot, Eric L.; Glasner, Jeremy D.; Perna, Nicole T.
2012-01-01
Genomics and bioinformatics are topics of increasing interest in undergraduate biological science curricula. Many existing exercises focus on gene annotation and analysis of a single genome. In this paper, we present two educational modules designed to enable students to learn and apply fundamental concepts in comparative genomics using examples related to bacterial pathogenesis. Students first examine alignments of genomes of Escherichia coli O157:H7 strains isolated from three food-poisoning outbreaks using the multiple-genome alignment tool Mauve. Students investigate conservation of virulence factors using the Mauve viewer and by browsing annotations available at the A Systematic Annotation Package for Community Analysis of Genomes database. In the second module, students use an alignment of five Yersinia pestis genomes to analyze single-nucleotide polymorphisms of three genes to classify strains into biovar groups. Students are then given sequences of bacterial DNA amplified from the teeth of corpses from the first and second pandemics of the bubonic plague and asked to classify these new samples. Learning-assessment results reveal student improvement in self-efficacy and content knowledge, as well as students' ability to use BLAST to identify genomic islands and conduct analyses of virulence factors from E. coli O157:H7 or Y. pestis. Each of these educational modules offers educators new ready-to-implement resources for integrating comparative genomic topics into their curricula. PMID:22383620
Boylan, Brendan T; Moreira, Fernando R; Carlson, Tim W; Bernard, Kristen A
2017-02-01
Half of the human population is at risk of infection by an arthropod-borne virus. Many of these arboviruses, such as West Nile, dengue, and Zika viruses, infect humans by way of a bite from an infected mosquito. This infectious inoculum is insect cell-derived giving the virus particles distinct qualities not present in secondary infectious virus particles produced by infected vertebrate host cells. The insect cell-derived particles differ in the glycosylation of virus structural proteins and the lipid content of the envelope, as well as their induction of cytokines. Thus, in order to accurately mimic the inoculum delivered by arthropods, arboviruses should be derived from arthropod cells. Previous studies have packaged replicon genome in mammalian cells to produce replicon particles, which undergo only one round of infection, but no studies exist packaging replicon particles in mosquito cells. Here we optimized the packaging of West Nile virus replicon genome in mosquito cells and produced replicon particles at high concentration, allowing us to mimic mosquito cell-derived viral inoculum. These particles were mature with similar genome equivalents-to-infectious units as full-length West Nile virus. We then compared the mosquito cell-derived particles to mammalian cell-derived particles in mice. Both replicon particles infected skin at the inoculation site and the draining lymph node by 3 hours post-inoculation. The mammalian cell-derived replicon particles spread from the site of inoculation to the spleen and contralateral lymph nodes significantly more than the particles derived from mosquito cells. This in vivo difference in spread of West Nile replicons in the inoculum demonstrates the importance of using arthropod cell-derived particles to model early events in arboviral infection and highlights the value of these novel arthropod cell-derived replicon particles for studying the earliest virus-host interactions for arboviruses.
Svarovskaia, Evguenia S; Xu, Hongzhan; Mbisa, Jean L; Barr, Rebekah; Gorelick, Robert J; Ono, Akira; Freed, Eric O; Hu, Wei-Shau; Pathak, Vinay K
2004-08-20
Apolipoprotein B mRNA-editing enzyme-catalytic polypeptide-like 3G (APOBEC3G) is a host cytidine deaminase that is packaged into virions and confers resistance to retroviral infection. APOBEC3G deaminates deoxycytidines in minus strand DNA to deoxyuridines, resulting in G to A hypermutation and viral inactivation. Human immunodeficiency virus type 1 (HIV-1) virion infectivity factor counteracts the antiviral activity of APOBEC3G by inducing its proteosomal degradation and preventing virion incorporation. To elucidate the mechanism of viral suppression by APOBEC3G, we developed a sensitive cytidine deamination assay and analyzed APOBEC3G virion incorporation in a series of HIV-1 deletion mutants. Virus-like particles derived from constructs in which pol, env, and most of gag were deleted still contained high levels of cytidine deaminase activity; in addition, coimmunoprecipitation of APOBEC3G and HIV-1 Gag in the presence and absence of RNase A indicated that the two proteins do not interact directly but form an RNase-sensitive complex. Viral particles lacking HIV-1 genomic RNA which were generated from the gag-pol expression constructs pC-Help and pSYNGP packaged APOBEC3G at 30-40% of the wild-type level, indicating that interactions with viral RNA are not necessary for incorporation. In addition, viral particles produced from an nucleocapsid zinc finger mutant contained approximately 1% of the viral genomic RNA but approximately 30% of the cytidine deaminase activity. The reduction in APOBEC3G incorporation was equivalent to the reduction in the total RNA present in the nucleocapsid mutant virions. These results indicate that interactions with viral proteins or viral genomic RNA are not essential for APOBEC3G incorporation and suggest that APOBEC3G interactions with viral and nonviral RNAs that are packaged into viral particles are sufficient for APOBEC3G virion incorporation.
Kemppainen, Minna J.; Pardo, Alejandro G.
2010-01-01
Summary pSILBAγ silencing vector was constructed for efficient RNA silencing triggering in the model mycorrhizal fungus Laccaria bicolor. This cloning vector carries the Agaricus bisporus gpdII promoter, two multiple cloning sites separated by a L. bicolor nitrate reductase intron and the Aspergillus nidulans trpC terminator. pSILBAγ allows an easy oriented two‐step PCR cloning of hairpin sequences to be expressed in basidiomycetes. With one further cloning step into pHg, a pCAMBIA1300‐based binary vector carrying a hygromycin resistance cassette, the pHg/pSILBAγ plasmid is used for Agrobacterium‐mediated transformation. The pHg/pSILBAγ system results in predominantly single integrations of RNA silencing triggering T‐DNAs in the fungal genome and the integration sites of the transgenes can be resolved by plasmid rescue. pSILBAγ construct and two other pSILBA plasmid variants (pSILBA and pSILBAα) were evaluated for their capacity to silence Laccaria nitrate reductase gene. While all pSILBA variants tested resulted in up to 65–76% of transformants with reduced growth on nitrate, pSILBAγ produced the highest number (65%) of strongly affected fungal strains. The strongly silenced phenotype was shown to correlate with T‐DNA integration in transcriptionally active genomic sites. pHg/pSILBAγ was shown to produce T‐DNAs with minimum CpG methylation in transgene promoter regions which assures the maximum silencing trigger production in Laccaria. Methylation of the target endogene was only slight in RNA silencing triggered with constructs carrying an intronic spacer hairpin sequence. The silencing capacity of the pHg/pSILBAγ was further tested with Laccaria inositol‐1,4,5‐triphosphate 5‐phosphatase gene. Besides its use in silencing triggering, the herein described plasmid system can also be used for transgene expression in Laccaria. pHg/pSILBAγ silencing system is optimized for L. bicolor but it should be highly useful also for other homobasidiomycetes, group of fungi currently lacking molecular tools for RNA silencing. PMID:21255319
Dulmage, Keely A; Todor, Horia; Schmid, Amy K
2015-09-08
In all three domains of life, organisms use nonspecific DNA-binding proteins to compact and organize the genome as well as to regulate transcription on a global scale. Histone is the primary eukaryotic nucleoprotein, and its evolutionary roots can be traced to the archaea. However, not all archaea use this protein as the primary DNA-packaging component, raising questions regarding the role of histones in archaeal chromatin function. Here, quantitative phenotyping, transcriptomic, and proteomic assays were performed on deletion and overexpression mutants of the sole histone protein of the hypersaline-adapted haloarchaeal model organism Halobacterium salinarum. This protein is highly conserved among all sequenced haloarchaeal species and maintains hallmark residues required for eukaryotic histone functions. Surprisingly, despite this conservation at the sequence level, unlike in other archaea or eukaryotes, H. salinarum histone is required to regulate cell shape but is not necessary for survival. Genome-wide expression changes in histone deletion strains were global, significant but subtle in terms of fold change, bidirectional, and growth phase dependent. Mass spectrometric proteomic identification of proteins from chromatin enrichments yielded levels of histone and putative nucleoid-associated proteins similar to those of transcription factors, consistent with an open and transcriptionally active genome. Taken together, these data suggest that histone in H. salinarum plays a minor role in DNA compaction but important roles in growth-phase-dependent gene expression and regulation of cell shape. Histone function in haloarchaea more closely resembles a regulator of gene expression than a chromatin-organizing protein like canonical eukaryotic histone. Histones comprise the major protein component of eukaryotic chromatin and are required for both genome packaging and global regulation of expression. The current paradigm maintains that archaea whose genes encode histone also use these proteins to package DNA. In contrast, here we demonstrate that the sole histone encoded in the genome of the salt-adapted archaeon Halobacterium salinarum is both unessential and unlikely to be involved in DNA compaction despite conservation of residues important for eukaryotic histones. Rather, H. salinarum histone is required for global regulation of gene expression and cell shape. These data are consistent with the hypothesis that H. salinarum histone, strongly conserved across all other known salt-adapted archaea, serves a novel role in gene regulation and cell shape maintenance. Given that archaea possess the ancestral form of eukaryotic histone, this study has important implications for understanding the evolution of histone function. Copyright © 2015 Dulmage et al.
Library preparation and data analysis packages for rapid genome sequencing.
Pomraning, Kyle R; Smith, Kristina M; Bredeweg, Erin L; Connolly, Lanelle R; Phatale, Pallavi A; Freitag, Michael
2012-01-01
High-throughput sequencing (HTS) has quickly become a valuable tool for comparative genetics and genomics and is now regularly carried out in laboratories that are not connected to large sequencing centers. Here we describe an updated version of our protocol for constructing single- and paired-end Illumina sequencing libraries, beginning with purified genomic DNA. The present protocol can also be used for "multiplexing," i.e. the analysis of several samples in a single flowcell lane by generating "barcoded" or "indexed" Illumina sequencing libraries in a way that is independent from Illumina-supported methods. To analyze sequencing results, we suggest several independent approaches but end users should be aware that this is a quickly evolving field and that currently many alignment (or "mapping") and counting algorithms are being developed and tested.
PHYLUCE is a software package for the analysis of conserved genomic loci.
Faircloth, Brant C
2016-03-01
Targeted enrichment of conserved and ultraconserved genomic elements allows universal collection of phylogenomic data from hundreds of species at multiple time scales (<5 Ma to > 300 Ma). Prior to downstream inference, data from these types of targeted enrichment studies must undergo preprocessing to assemble contigs from sequence data; identify targeted, enriched loci from the off-target background data; align enriched contigs representing conserved loci to one another; and prepare and manipulate these alignments for subsequent phylogenomic inference. PHYLUCE is an efficient and easy-to-install software package that accomplishes these tasks across hundreds of taxa and thousands of enriched loci. PHYLUCE is written for Python 2.7. PHYLUCE is supported on OSX and Linux (RedHat/CentOS) operating systems. PHYLUCE source code is distributed under a BSD-style license from https://www.github.com/faircloth-lab/phyluce/ PHYLUCE is also available as a package (https://binstar.org/faircloth-lab/phyluce) for the Anaconda Python distribution that installs all dependencies, and users can request a PHYLUCE instance on iPlant Atmosphere (tag: phyluce). The software manual and a tutorial are available from http://phyluce.readthedocs.org/en/latest/ and test data are available from doi: 10.6084/m9.figshare.1284521. brant@faircloth-lab.org Supplementary data are available at Bioinformatics online. © The Author 2015. Published by Oxford University Press. All rights reserved. For Permissions, please e-mail: journals.permissions@oup.com.
Labenski, Verena; Suerth, Julia D; Barczak, Elke; Heckl, Dirk; Levy, Camille; Bernadin, Ornellie; Charpentier, Emmanuelle; Williams, David A; Fehse, Boris; Verhoeyen, Els; Schambach, Axel
2016-08-01
Primary human T lymphocytes represent an important cell population for adoptive immunotherapies, including chimeric-antigen and T-cell receptor applications, as they have the capability to eliminate non-self, virus-infected and tumor cells. Given the increasing numbers of clinical immunotherapy applications, the development of an optimal vector platform for genetic T lymphocyte engineering, which allows cost-effective high-quality vector productions, remains a critical goal. Alpharetroviral self-inactivating vectors (ARV) have several advantages compared to other vector platforms, including a more random genomic integration pattern and reduced likelihood for inducing aberrant splicing of integrated proviruses. We developed an ARV platform for the transduction of primary human T lymphocytes. We demonstrated functional transgene transfer using the clinically relevant herpes-simplex-virus thymidine kinase variant TK.007. Proof-of-concept of alpharetroviral-mediated T-lymphocyte engineering was shown in vitro and in a humanized transplantation model in vivo. Furthermore, we established a stable, human alpharetroviral packaging cell line in which we deleted the entry receptor (SLC1A5) for RD114/TR-pseudotyped ARVs to prevent superinfection and enhance genomic integrity of the packaging cell line and viral particles. We showed that superinfection can be entirely prevented, while maintaining high recombinant virus titers. Taken together, this resulted in an improved production platform representing an economic strategy for translating the promising features of ARVs for therapeutic T-lymphocyte engineering. Copyright © 2016 Elsevier Ltd. All rights reserved.
Ingemarsdotter, Carin K; Zeng, Jingwei; Long, Ziqi; Lever, Andrew M L; Kenyon, Julia C
2018-03-14
NSC260594, a quinolinium derivative from the NCI diversity set II compound library, was previously identified in a target-based assay as an inhibitor of the interaction between the HIV-1 (ψ) stem-loop 3 (SL3) RNA and Gag. This compound was shown to exhibit potent antiviral activity. Here, the effects of this compound on individual stages of the viral lifecycle were examined by qRT-PCR, ELISA and Western blot, to see if its actions were specific to the viral packaging stage. The structural effects of NSC260594 binding to the HIV-1 gRNA were also examined by SHAPE and dimerization assays. Treatment of cells with NSC260594 did not reduce the number of integration events of incoming virus, and treatment of virus producing cells did not affect the level of intracellular Gag protein or viral particle release as determined by immunoblot. However, NSC260594 reduced the incorporation of gRNA into virions by up to 82%, without affecting levels of gRNA inside the cell. This reduction in packaging correlated closely with the reduction in infectivity of the released viral particles. To establish the structural effects of NSC260594 on the HIV-1 gRNA, we performed SHAPE analyses to pinpoint RNA structural changes. NSC260594 had a stabilizing effect on the wild type RNA that was not confined to SL3, but that was propagated across the structure. A packaging mutant lacking SL3 did not show this effect. NSC260594 acts as a specific inhibitor of HIV-1 RNA packaging. No other viral functions are affected. Its action involves preventing the interaction of Gag with SL3 by stabilizing this small RNA stem-loop which then leads to stabilization of the global packaging signal region (psi or ψ). This confirms data, previously only shown in analyses of isolated SL3 oligonucleotides, that SL3 is structurally labile in the presence of Gag and that this is critical for the complete psi region to be able to adopt different conformations. Since replication is otherwise unaffected by NSC260594 the flexibility of SL3 appears to be a unique requirement for genome encapsidation and identifies this process as a highly specific drug target. This study is proof of principle that development of a new class of antiretroviral drugs that specifically target viral packaging by binding to the viral genomic RNA is achievable.
Synthetic biology approach for plant protection using dsRNA.
Niehl, Annette; Soininen, Marjukka; Poranen, Minna M; Heinlein, Manfred
2018-02-26
Pathogens induce severe damages on cultivated plants and represent a serious threat to global food security. Emerging strategies for crop protection involve the external treatment of plants with double-stranded (ds)RNA to trigger RNA interference. However, applying this technology in greenhouses and fields depends on dsRNA quality, stability and efficient large-scale production. Using components of the bacteriophage phi6, we engineered a stable and accurate in vivo dsRNA production system in Pseudomonas syringae bacteria. Unlike other in vitro or in vivo dsRNA production systems that rely on DNA transcription and postsynthetic alignment of single-stranded RNA molecules, the phi6 system is based on the replication of dsRNA by an RNA-dependent RNA polymerase, thus allowing production of high-quality, long dsRNA molecules. The phi6 replication complex was reprogrammed to multiply dsRNA sequences homologous to tobacco mosaic virus (TMV) by replacing the coding regions within two of the three phi6 genome segments with TMV sequences and introduction of these constructs into P. syringae together with the third phi6 segment, which encodes the components of the phi6 replication complex. The stable production of TMV dsRNA was achieved by combining all the three phi6 genome segments and by maintaining the natural dsRNA sizes and sequence elements required for efficient replication and packaging of the segments. The produced TMV-derived dsRNAs inhibited TMV propagation when applied to infected Nicotiana benthamiana plants. The established dsRNA production system enables the broad application of dsRNA molecules as an efficient, highly flexible, nontransgenic and environmentally friendly approach for protecting crops against viruses and other pathogens. © 2018 The Authors. Plant Biotechnology Journal published by Society for Experimental Biology and The Association of Applied Biologists and John Wiley & Sons Ltd.
Atrx promotes heterochromatin formation at retrotransposons
Sadic, Dennis; Schmidt, Katharina; Groh, Sophia; Kondofersky, Ivan; Ellwart, Joachim; Fuchs, Christiane; Theis, Fabian J; Schotta, Gunnar
2015-01-01
More than 50% of mammalian genomes consist of retrotransposon sequences. Silencing of retrotransposons by heterochromatin is essential to ensure genomic stability and transcriptional integrity. Here, we identified a short sequence element in intracisternal A particle (IAP) retrotransposons that is sufficient to trigger heterochromatin formation. We used this sequence in a genome-wide shRNA screen and identified the chromatin remodeler Atrx as a novel regulator of IAP silencing. Atrx binds to IAP elements and is necessary for efficient heterochromatin formation. In addition, Atrx facilitates a robust and largely inaccessible heterochromatin structure as Atrx knockout cells display increased chromatin accessibility at retrotransposons and non-repetitive heterochromatic loci. In summary, we demonstrate a direct role of Atrx in the establishment and robust maintenance of heterochromatin. PMID:26012739
Birkenbihl, Rainer P.; Kracher, Barbara; Roccaro, Mario
2017-01-01
During microbial-associated molecular pattern-triggered immunity (MTI), molecules derived from microbes are perceived by cell surface receptors and upon signaling to the nucleus initiate a massive transcriptional reprogramming critical to mount an appropriate host defense response. WRKY transcription factors play an important role in regulating these transcriptional processes. Here, we determined on a genome-wide scale the flg22-induced in vivo DNA binding dynamics of three of the most prominent WRKY factors, WRKY18, WRKY40, and WRKY33. The three WRKY factors each bound to more than 1000 gene loci predominantly at W-box elements, the known WRKY binding motif. Binding occurred mainly in the 500-bp promoter regions of these genes. Many of the targeted genes are involved in signal perception and transduction not only during MTI but also upon damage-associated molecular pattern-triggered immunity, providing a mechanistic link between these functionally interconnected basal defense pathways. Among the additional targets were genes involved in the production of indolic secondary metabolites and in modulating distinct plant hormone pathways. Importantly, among the targeted genes were numerous transcription factors, encoding predominantly ethylene response factors, active during early MTI, and WRKY factors, supporting the previously hypothesized existence of a WRKY subregulatory network. Transcriptional analysis revealed that WRKY18 and WRKY40 function redundantly as negative regulators of flg22-induced genes often to prevent exaggerated defense responses. PMID:28011690
Equalizer reduces SNP bias in Affymetrix microarrays.
Quigley, David
2015-07-30
Gene expression microarrays measure the levels of messenger ribonucleic acid (mRNA) in a sample using probe sequences that hybridize with transcribed regions. These probe sequences are designed using a reference genome for the relevant species. However, most model organisms and all humans have genomes that deviate from their reference. These variations, which include single nucleotide polymorphisms, insertions of additional nucleotides, and nucleotide deletions, can affect the microarray's performance. Genetic experiments comparing individuals bearing different population-associated single nucleotide polymorphisms that intersect microarray probes are therefore subject to systemic bias, as the reduction in binding efficiency due to a technical artifact is confounded with genetic differences between parental strains. This problem has been recognized for some time, and earlier methods of compensation have attempted to identify probes affected by genome variants using statistical models. These methods may require replicate microarray measurement of gene expression in the relevant tissue in inbred parental samples, which are not always available in model organisms and are never available in humans. By using sequence information for the genomes of organisms under investigation, potentially problematic probes can now be identified a priori. However, there is no published software tool that makes it easy to eliminate these probes from an annotation. I present equalizer, a software package that uses genome variant data to modify annotation files for the commonly used Affymetrix IVT and Gene/Exon platforms. These files can be used by any microarray normalization method for subsequent analysis. I demonstrate how use of equalizer on experiments mapping germline influence on gene expression in a genetic cross between two divergent mouse species and in human samples significantly reduces probe hybridization-induced bias, reducing false positive and false negative findings. The equalizer package reduces probe hybridization bias from experiments performed on the Affymetrix microarray platform, allowing accurate assessment of germline influence on gene expression.
Amarillas, Luis; Rubí-Rangel, Lucia; Chaidez, Cristobal; González-Robles, Arturo; Lightbourn-Rojas, Luis; León-Félix, Josefina
2017-01-01
Foodborne diseases are a serious and growing problem, and the incidence and prevalence of antimicrobial resistance among foodborne pathogens is reported to have increased. The emergence of antibiotic-resistant bacterial strains demands novel strategies to counteract this epidemic. In this regard, lytic bacteriophages have reemerged as an alternative for the control of pathogenic bacteria. However, the effective use of phages relies on appropriate biological and genomic characterization. In this study, we present the isolation and characterization of a novel bacteriophage named phiLLS, which has shown strong lytic activity against generic and multidrug-resistant Escherichia coli strains. Transmission electron microscopy of phiLLS morphology revealed that it belongs to the Siphoviridae family. Furthermore, this phage exhibited a relatively large burst size of 176 plaque-forming units per infected cell. Phage phiLLS significantly reduced the growth of E. coli under laboratory conditions. Analyses of restriction profiles showed the presence of submolar fragments, confirming that phiLLS is a pac-type phage. Phylogenetic analysis based on the amino acid sequence of large terminase subunits confirmed that this phage uses a headful packaging strategy to package their genome. Genomic sequencing and bioinformatic analysis showed that phiLLS is a novel bacteriophage that is most closely related to T5-like phages. In silico analysis indicated that the phiLLS genome consists of 107,263 bp (39.0 % GC content) encoding 160 putative ORFs, 16 tRNAs, several potential promoters and transcriptional terminators. Genome analysis suggests that the phage phiLLS is strictly lytic without carrying genes associated with virulence factors and/or potential immunoreactive allergen proteins. The bacteriophage isolated in this study has shown promising results in the biocontrol of bacterial growth under in vitro conditions, suggesting that it may prove useful as an alternative agent for the control of foodborne pathogens. However, further oral toxicity testing is needed to ensure the safety of phage use. PMID:28785246
Amarillas, Luis; Rubí-Rangel, Lucia; Chaidez, Cristobal; González-Robles, Arturo; Lightbourn-Rojas, Luis; León-Félix, Josefina
2017-01-01
Foodborne diseases are a serious and growing problem, and the incidence and prevalence of antimicrobial resistance among foodborne pathogens is reported to have increased. The emergence of antibiotic-resistant bacterial strains demands novel strategies to counteract this epidemic. In this regard, lytic bacteriophages have reemerged as an alternative for the control of pathogenic bacteria. However, the effective use of phages relies on appropriate biological and genomic characterization. In this study, we present the isolation and characterization of a novel bacteriophage named phiLLS, which has shown strong lytic activity against generic and multidrug-resistant Escherichia coli strains. Transmission electron microscopy of phiLLS morphology revealed that it belongs to the Siphoviridae family. Furthermore, this phage exhibited a relatively large burst size of 176 plaque-forming units per infected cell. Phage phiLLS significantly reduced the growth of E. coli under laboratory conditions. Analyses of restriction profiles showed the presence of submolar fragments, confirming that phiLLS is a pac -type phage. Phylogenetic analysis based on the amino acid sequence of large terminase subunits confirmed that this phage uses a headful packaging strategy to package their genome. Genomic sequencing and bioinformatic analysis showed that phiLLS is a novel bacteriophage that is most closely related to T5-like phages. In silico analysis indicated that the phiLLS genome consists of 107,263 bp (39.0 % GC content) encoding 160 putative ORFs, 16 tRNAs, several potential promoters and transcriptional terminators. Genome analysis suggests that the phage phiLLS is strictly lytic without carrying genes associated with virulence factors and/or potential immunoreactive allergen proteins. The bacteriophage isolated in this study has shown promising results in the biocontrol of bacterial growth under in vitro conditions, suggesting that it may prove useful as an alternative agent for the control of foodborne pathogens. However, further oral toxicity testing is needed to ensure the safety of phage use.
Selective Packaging of Host tRNA's by Murine Leukemia Virus Particles Does Not Require Genomic RNA
Levin, Judith G.; Seidman, J. G.
1979-01-01
The 4S RNA contained in RNA tumor virus particles consists of a selected population of host tRNA's. However, the mechanism by which virions select host tRNA's has not been elucidated. We have considered a model which specifies that 35S genomic RNA determines which tRNA's are to be encapsidated as well as the relative amounts of these tRNA's within the virion. The model was tested by comparing the free 4S RNA composition of normal murine leukemia virus (MuLV) particles and noninfectious virions from actinomycin D (ActD)-treated cells, which are deficient in genomic RNA (ActD virions). Viral 4S RNA was analyzed by two-dimensional polyacrylamide gel electrophoresis. Surprisingly, the patterns obtained for control and ActD 4S RNA were identical to each other and were clearly distinct from the cell 4S RNA pattern. The viral patterns had three prominent areas of radioactivity. One of the spots was identified on the basis of its oligonucleotide fingerprint as tRNA Pro, the primer for MuLV RNA-directed DNA synthesis. These results were obtained with two different MuLV strains, AKR and Moloney, each grown in SC-1 cells. The demonstration that ActD virions contain primer tRNA and in general exhibit the characteristic MuLV tRNA pattern rather than the complete representation of cell 4S RNA leads to the conclusion that genomic RNA is not the major determinant in selective packaging of host tRNA's. A possible role for one or more viral proteins, including reverse transcriptase, is suggested. Images PMID:219227
De Rocquigny, H; Gabus, C; Vincent, A; Fournié-Zaluski, M C; Roques, B; Darlix, J L
1992-01-01
The nucleocapsid (NC) of human immunodeficiency virus type 1 consists of a large number of NC protein molecules, probably wrapping the dimeric RNA genome within the virion inner core. NC protein is a gag-encoded product that contains two zinc fingers flanked by basic residues. In human immunodeficiency virus type 1 virions, NCp15 is ultimately processed into NCp7 and p6 proteins. During virion assembly the retroviral NC protein is necessary for core formation and genomic RNA encapsidation, which are essential for virus infectivity. In vitro NCp15 activates viral RNA dimerization, a process most probably linked in vivo to genomic RNA packaging, and replication primer tRNA(Lys,3) annealing to the initiation site of reverse transcription. To characterize the domains of human immunodeficiency virus type 1 NC protein necessary for its various functions, the 72-amino acid NCp7 and several derived peptides were synthesized in a pure form. We show here that synthetic NCp7 with or without the two zinc fingers has the RNA annealing activities of NCp15. Further deletions of the N-terminal 12 and C-terminal 8 amino acids, leading to a 27-residue peptide lacking the finger domains, have little or no effect on NC protein activity in vitro. However deletion of short sequences containing basic residues flanking the first finger leads to a complete loss of NC protein activity. It is proposed that the basic residues and the zinc fingers cooperate to select and package the genomic RNA in vivo. Inhibition of the viral RNA binding and annealing activities associated with the basic residues flanking the first zinc finger of NC protein could therefore be used as a model for the design of antiviral agents. Images PMID:1631144
De Rocquigny, H; Gabus, C; Vincent, A; Fournié-Zaluski, M C; Roques, B; Darlix, J L
1992-07-15
The nucleocapsid (NC) of human immunodeficiency virus type 1 consists of a large number of NC protein molecules, probably wrapping the dimeric RNA genome within the virion inner core. NC protein is a gag-encoded product that contains two zinc fingers flanked by basic residues. In human immunodeficiency virus type 1 virions, NCp15 is ultimately processed into NCp7 and p6 proteins. During virion assembly the retroviral NC protein is necessary for core formation and genomic RNA encapsidation, which are essential for virus infectivity. In vitro NCp15 activates viral RNA dimerization, a process most probably linked in vivo to genomic RNA packaging, and replication primer tRNA(Lys,3) annealing to the initiation site of reverse transcription. To characterize the domains of human immunodeficiency virus type 1 NC protein necessary for its various functions, the 72-amino acid NCp7 and several derived peptides were synthesized in a pure form. We show here that synthetic NCp7 with or without the two zinc fingers has the RNA annealing activities of NCp15. Further deletions of the N-terminal 12 and C-terminal 8 amino acids, leading to a 27-residue peptide lacking the finger domains, have little or no effect on NC protein activity in vitro. However deletion of short sequences containing basic residues flanking the first finger leads to a complete loss of NC protein activity. It is proposed that the basic residues and the zinc fingers cooperate to select and package the genomic RNA in vivo. Inhibition of the viral RNA binding and annealing activities associated with the basic residues flanking the first zinc finger of NC protein could therefore be used as a model for the design of antiviral agents.
Jacquin, Laval; Cao, Tuong-Vi; Ahmadi, Nourollah
2016-01-01
One objective of this study was to provide readers with a clear and unified understanding of parametric statistical and kernel methods, used for genomic prediction, and to compare some of these in the context of rice breeding for quantitative traits. Furthermore, another objective was to provide a simple and user-friendly R package, named KRMM, which allows users to perform RKHS regression with several kernels. After introducing the concept of regularized empirical risk minimization, the connections between well-known parametric and kernel methods such as Ridge regression [i.e., genomic best linear unbiased predictor (GBLUP)] and reproducing kernel Hilbert space (RKHS) regression were reviewed. Ridge regression was then reformulated so as to show and emphasize the advantage of the kernel "trick" concept, exploited by kernel methods in the context of epistatic genetic architectures, over parametric frameworks used by conventional methods. Some parametric and kernel methods; least absolute shrinkage and selection operator (LASSO), GBLUP, support vector machine regression (SVR) and RKHS regression were thereupon compared for their genomic predictive ability in the context of rice breeding using three real data sets. Among the compared methods, RKHS regression and SVR were often the most accurate methods for prediction followed by GBLUP and LASSO. An R function which allows users to perform RR-BLUP of marker effects, GBLUP and RKHS regression, with a Gaussian, Laplacian, polynomial or ANOVA kernel, in a reasonable computation time has been developed. Moreover, a modified version of this function, which allows users to tune kernels for RKHS regression, has also been developed and parallelized for HPC Linux clusters. The corresponding KRMM package and all scripts have been made publicly available.
Waardenberg, Ashley J; Basset, Samuel D; Bouveret, Romaric; Harvey, Richard P
2015-09-02
Gene ontology (GO) enrichment is commonly used for inferring biological meaning from systems biology experiments. However, determining differential GO and pathway enrichment between DNA-binding experiments or using the GO structure to classify experiments has received little attention. Herein, we present a bioinformatics tool, CompGO, for identifying Differentially Enriched Gene Ontologies, called DiEGOs, and pathways, through the use of a z-score derivation of log odds ratios, and visualizing these differences at GO and pathway level. Through public experimental data focused on the cardiac transcription factor NKX2-5, we illustrate the problems associated with comparing GO enrichments between experiments using a simple overlap approach. We have developed an R/Bioconductor package, CompGO, which implements a new statistic normally used in epidemiological studies for performing comparative GO analyses and visualizing comparisons from . BED data containing genomic coordinates as well as gene lists as inputs. We justify the statistic through inclusion of experimental data and compare to the commonly used overlap method. CompGO is freely available as a R/Bioconductor package enabling easy integration into existing pipelines and is available at: http://www.bioconductor.org/packages/release/bioc/html/CompGO.html packages/release/bioc/html/CompGO.html.
Lutz, Sharon M; Thwing, Annie; Schmiege, Sarah; Kroehl, Miranda; Baker, Christopher D; Starling, Anne P; Hokanson, John E; Ghosh, Debashis
2017-07-19
In mediation analysis if unmeasured confounding is present, the estimates for the direct and mediated effects may be over or under estimated. Most methods for the sensitivity analysis of unmeasured confounding in mediation have focused on the mediator-outcome relationship. The Umediation R package enables the user to simulate unmeasured confounding of the exposure-mediator, exposure-outcome, and mediator-outcome relationships in order to see how the results of the mediation analysis would change in the presence of unmeasured confounding. We apply the Umediation package to the Genetic Epidemiology of Chronic Obstructive Pulmonary Disease (COPDGene) study to examine the role of unmeasured confounding due to population stratification on the effect of a single nucleotide polymorphism (SNP) in the CHRNA5/3/B4 locus on pulmonary function decline as mediated by cigarette smoking. Umediation is a flexible R package that examines the role of unmeasured confounding in mediation analysis allowing for normally distributed or Bernoulli distributed exposures, outcomes, mediators, measured confounders, and unmeasured confounders. Umediation also accommodates multiple measured confounders, multiple unmeasured confounders, and allows for a mediator-exposure interaction on the outcome. Umediation is available as an R package at https://github.com/SharonLutz/Umediation A tutorial on how to install and use the Umediation package is available in the Additional file 1.
What’s the Damage? The Impact of Pathogens on Pathways that Maintain Host Genome Integrity
Weitzman, Matthew D.; Weitzman, Jonathan B.
2014-01-01
Maintaining genome integrity and transmission of intact genomes is critical for cellular, organismal, and species survival. Cells can detect damaged DNA, activate checkpoints, and either enable DNA repair or trigger apoptosis to eliminate the damaged cell. Aberrations in these mechanisms lead to somatic mutations and genetic instability, which are hallmarks of cancer. Considering the long history of host-microbe coevolution, an impact of microbial infection on host genome integrity is not unexpected, and emerging links between microbial infections and oncogenesis further reinforce this idea. In this review, we compare strategies employed by viruses, bacteria, and parasites to alter, subvert, or otherwise manipulate host DNA damage and repair pathways. We highlight how microbes contribute to tumorigenesis by directly inducing DNA damage, inactivating checkpoint controls, or manipulating repair processes. We also discuss indirect effects resulting from inflammatory responses, changes in cellular metabolism, nuclear architecture, and epigenome integrity, and the associated evolutionary tradeoffs. PMID:24629335
AE Recorder Characteristics and Development.
DOE Office of Scientific and Technical Information (OSTI.GOV)
Partridge, Michael E.; Curtis, Shane Keawe; McGrogan, David Paul
2016-11-01
The Anomalous Environment Recorder (AE Recorder) provides a robust data recording capability for multiple high-shock applications including earth penetrators. The AE Recorder, packaged as a 2.4" di ameter cylinder 3" tall, acquires 12 accelerometer, 2 auxiliary, and 6 discrete signal channels at 250k samples / second. Recording depth is 213 seconds plus 75ms of pre-trigger data. The mechanical, electrical, and firmware are described as well as support electro nics designed for the first use of the recorder.
2007-08-01
example, the topic of massively multiplayer online role playing games ( MMORPGs ). (A related class of first person multiplayer shooters includes...key and measurable indicators that require immediate action. They develop plans of action for these trigger points to avoid premature commitment of...recognize that the cause of the incident may be an accident or act of nature, or it may be either criminal or terrorist activity. They avoid the
Telomeres and replicative senescence: Is it only length that counts?
von Zglinicki, T
2001-07-26
Telomeres are well established as a major 'replicometer', counting the population doublings in primary human cell cultures and ultimately triggering replicative senescence. However, neither is the pace of this biological clock inert, nor is there a fixed threshold telomere length acting as the universal trigger of replicative senescence. The available data suggest that opening of the telomeric loop and unscheduled exposure of the single-stranded G-rich telomeric overhang might act like a semaphore to signal senescent cell cycle arrest. Short telomere length, telomeric single-strand breaks, low levels of loop-stabilizing proteins, or other factors may trigger this opening of the loop. Thus, both telomere shortening and the ultimate signalling into senescence are able to integrate different environmental and genetic factors, especially oxidative stress-mediated damage, which might otherwise become a thread to genomic stability.
Variation of HPV Subtypes with Focus on HPV-Infection and Cancer in the Head and Neck Region.
Wichmann, Gunnar
The human papillomavirus (HPV) comprises a heterogeneous group of double-strand DNA viruses with variable potential to infect human epithelial cells and trigger neoplastic transformation. Its 8 kb genome encodes proteins required for virus replication and self-organized formation of infectious particles but also for early proteins E6 and E7 able to trigger neoplastic transformation. E6 and E7 of high-risk (HR) HPV subtypes can bind to p53 or release E2F and abrogate replication control. Due to variable amino acid sequence (AAS) in the binding sites of E6 and E7 particular HR-HPV variants within subtypes are essentially heterogeneous in efficacy triggering neoplastic transformation and cancer development. This could explain differences in the clinical course of HPV-driven head and neck cancer.
Investigation of Self Triggered Cosmic Ray Detectors using Silicon Photomultiplier
NASA Astrophysics Data System (ADS)
Knox, Adrian; Niduaza, Rommel; Hernandez, Victor; Ruiz, Daniel; Ramos, Daniel; Fan, Sewan; Fatuzzo, Laura; Ritt, Stefan
2015-04-01
The silicon photomultiplier (SiPM) is a highly sensitive light detector capable of measuring single photons. It costs a fraction of the photomultiplier tube and operates slightly above the breakdown voltage. At this conference we describe our investigation of SiPM, the multipixel photon counters (MPPC) from Hamamatsu as readout detectors for plastic scintillators working for detecting cosmic ray particles. Our setup consists of scintillator sheets embedded with blue to green wavelength shifting fibers optically coupled to MPPCs to detect scintillating light. Four detector assemblies would be constructed and arranged to work in self triggered mode. Using custom matching tee boxes, the amplified MPPC signals are fed to discriminators with threshold set to give a reasonable coincidence count rate. Moreover, the detector waveforms are digitized using a 5 Giga Samples per second waveform digitizer, the DRS4, and triggered with the coincidence logic to capture the MPPC waveforms. Offline analysis of the digitized waveforms is accomplished using the CERN package PAW and results of our experiments and the data analysis would also be discussed. US Department of Education Title V Grant Number PO31S090007.
Jégu, Teddy; Aeby, Eric; Lee, Jeannie T
2017-06-01
Extensive 3D folding is required to package a genome into the tiny nuclear space, and this packaging must be compatible with proper gene expression. Thus, in the well-hierarchized nucleus, chromosomes occupy discrete territories and adopt specific 3D organizational structures that facilitate interactions between regulatory elements for gene expression. The mammalian X chromosome exemplifies this structure-function relationship. Recent studies have shown that, upon X-chromosome inactivation, active and inactive X chromosomes localize to different subnuclear positions and adopt distinct chromosomal architectures that reflect their activity states. Here, we review the roles of long non-coding RNAs, chromosomal organizational structures and the subnuclear localization of chromosomes as they relate to X-linked gene expression.
Functional requirements of the yellow fever virus capsid protein.
Patkar, Chinmay G; Jones, Christopher T; Chang, Yu-hsuan; Warrier, Ranjit; Kuhn, Richard J
2007-06-01
Although it is known that the flavivirus capsid protein is essential for genome packaging and formation of infectious particles, the minimal requirements of the dimeric capsid protein for virus assembly/disassembly have not been characterized. By use of a trans-packaging system that involved packaging a yellow fever virus (YFV) replicon into pseudo-infectious particles by supplying the YFV structural proteins using a Sindbis virus helper construct, the functional elements within the YFV capsid protein (YFC) were characterized. Various N- and C-terminal truncations, internal deletions, and point mutations of YFC were analyzed for their ability to package the YFV replicon. Consistent with previous reports on the tick-borne encephalitis virus capsid protein, YFC demonstrates remarkable functional flexibility. Nearly 40 residues of YFC could be removed from the N terminus while the ability to package replicon RNA was retained. Additionally, YFC containing a deletion of approximately 27 residues of the C terminus, including a complete deletion of C-terminal helix 4, was functional. Internal deletions encompassing the internal hydrophobic sequence in YFC were, in general, tolerated to a lesser extent. Site-directed mutagenesis of helix 4 residues predicted to be involved in intermonomeric interactions were also analyzed, and although single mutations did not affect packaging, a YFC with the double mutation of leucine 81 and valine 88 was nonfunctional. The effects of mutations in YFC on the viability of YFV infection were also analyzed, and these results were similar to those obtained using the replicon packaging system, thus underscoring the flexibility of YFC with respect to the requirements for its functioning.
BiGG: a Biochemical Genetic and Genomic knowledgebase of large scale metabolic reconstructions
2010-01-01
Background Genome-scale metabolic reconstructions under the Constraint Based Reconstruction and Analysis (COBRA) framework are valuable tools for analyzing the metabolic capabilities of organisms and interpreting experimental data. As the number of such reconstructions and analysis methods increases, there is a greater need for data uniformity and ease of distribution and use. Description We describe BiGG, a knowledgebase of Biochemically, Genetically and Genomically structured genome-scale metabolic network reconstructions. BiGG integrates several published genome-scale metabolic networks into one resource with standard nomenclature which allows components to be compared across different organisms. BiGG can be used to browse model content, visualize metabolic pathway maps, and export SBML files of the models for further analysis by external software packages. Users may follow links from BiGG to several external databases to obtain additional information on genes, proteins, reactions, metabolites and citations of interest. Conclusions BiGG addresses a need in the systems biology community to have access to high quality curated metabolic models and reconstructions. It is freely available for academic use at http://bigg.ucsd.edu. PMID:20426874
Han, Mira V; Thomas, Gregg W C; Lugo-Martinez, Jose; Hahn, Matthew W
2013-08-01
Current sequencing methods produce large amounts of data, but genome assemblies constructed from these data are often fragmented and incomplete. Incomplete and error-filled assemblies result in many annotation errors, especially in the number of genes present in a genome. This means that methods attempting to estimate rates of gene duplication and loss often will be misled by such errors and that rates of gene family evolution will be consistently overestimated. Here, we present a method that takes these errors into account, allowing one to accurately infer rates of gene gain and loss among genomes even with low assembly and annotation quality. The method is implemented in the newest version of the software package CAFE, along with several other novel features. We demonstrate the accuracy of the method with extensive simulations and reanalyze several previously published data sets. Our results show that errors in genome annotation do lead to higher inferred rates of gene gain and loss but that CAFE 3 sufficiently accounts for these errors to provide accurate estimates of important evolutionary parameters.
Chromosomes in a genome-wise order: evidence for metaphase architecture.
Weise, Anja; Bhatt, Samarth; Piaszinski, Katja; Kosyakova, Nadezda; Fan, Xiaobo; Altendorf-Hofmann, Annelore; Tanomtong, Alongklod; Chaveerach, Arunrat; de Cioffi, Marcelo Bello; de Oliveira, Edivaldo; Walther, Joachim-U; Liehr, Thomas; Chaudhuri, Jyoti P
2016-01-01
One fundamental finding of the last decade is that, besides the primary DNA sequence information there are several epigenetic "information-layers" like DNA-and histone modifications, chromatin packaging and, last but not least, the position of genes in the nucleus. We postulate that the functional genomic architecture is not restricted to the interphase of the cell cycle but can also be observed in the metaphase stage, when chromosomes are most condensed and microscopically visible. If so, it offers the unique opportunity to directly analyze the functional aspects of genomic architecture in different cells, species and diseases. Another aspect not directly accessible by molecular techniques is the genome merged from two different haploid parental genomes represented by the homologous chromosome sets. Our results show that there is not only a well-known and defined nuclear architecture in interphase but also in metaphase leading to a bilateral organization of the two haploid sets of chromosomes. Moreover, evidence is provided for the parental origin of the haploid grouping. From our findings we postulate an additional epigenetic information layer within the genome including the organization of homologous chromosomes and their parental origin which may now substantially change the landscape of genetics.
FACT is a sensor of DNA torsional stress in eukaryotic cells
Safina, Alfiya; Cheney, Peter; Pal, Mahadeb; Brodsky, Leonid; Ivanov, Alexander; Kirsanov, Kirill; Lesovaya, Ekaterina; Naberezhnov, Denis; Nesher, Elimelech; Koman, Igor; Wang, Dan; Wang, Jianming; Yakubovskaya, Marianna; Winkler, Duane
2017-01-01
Abstract Transitions of B-DNA to alternative DNA structures (ADS) can be triggered by negative torsional strain, which occurs during replication and transcription, and may lead to genomic instability. However, how ADS are recognized in cells is unclear. We found that the binding of candidate anticancer drug, curaxin, to cellular DNA results in uncoiling of nucleosomal DNA, accumulation of negative supercoiling and conversion of multiple regions of genomic DNA into left-handed Z-form. Histone chaperone FACT binds rapidly to the same regions via the SSRP1 subunit in curaxin-treated cells. In vitro binding of purified SSRP1 or its isolated CID domain to a methylated DNA fragment containing alternating purine/pyrimidines, which is prone to Z-DNA transition, is much stronger than to other types of DNA. We propose that FACT can recognize and bind Z-DNA or DNA in transition from a B to Z form. Binding of FACT to these genomic regions triggers a p53 response. Furthermore, FACT has been shown to bind to other types of ADS through a different structural domain, which also leads to p53 activation. Thus, we propose that FACT acts as a sensor of ADS formation in cells. Recognition of ADS by FACT followed by a p53 response may explain the role of FACT in DNA damage prevention. PMID:28082391
BeadArray Expression Analysis Using Bioconductor
Ritchie, Matthew E.; Dunning, Mark J.; Smith, Mike L.; Shi, Wei; Lynch, Andy G.
2011-01-01
Illumina whole-genome expression BeadArrays are a popular choice in gene profiling studies. Aside from the vendor-provided software tools for analyzing BeadArray expression data (GenomeStudio/BeadStudio), there exists a comprehensive set of open-source analysis tools in the Bioconductor project, many of which have been tailored to exploit the unique properties of this platform. In this article, we explore a number of these software packages and demonstrate how to perform a complete analysis of BeadArray data in various formats. The key steps of importing data, performing quality assessments, preprocessing, and annotation in the common setting of assessing differential expression in designed experiments will be covered. PMID:22144879
Genomic analysis of Staphylococcus phage Stau2 isolated from medical specimen.
Hsieh, Sue-Er; Tseng, Yi-Hsiung; Lo, Hsueh-Hsia; Chen, Shui-Tu; Wu, Cheng-Nan
2016-02-01
Stau2 is a lytic myophage of Staphylococcus aureus isolated from medical specimen. Exhibiting a broad host range against S. aureus clinical isolates, Stau2 is potentially useful for topical phage therapy or as an additive in food preservation. In this study, Stau2 was firstly revealed to possess a circularly permuted linear genome of 133,798 bp, with low G + C content, containing 146 open reading frames, but encoding no tRNA. The genome is organized into several modules containing genes for packaging, structural proteins, replication/transcription and host-cell-lysis, with the structural proteins and DNA polymerase modules being organized similarly to that in Twort-like phages of Staphylococcus. With the encoded DNA replication genes, Stau2 can possibly use its own system for replication. In addition, analysis in silico found several introns in seven genes, including those involved in DNA metabolism, packaging, and structure, while one of them (helicase gene) is experimentally confirmed to undergo splicing. Furthermore, phylogenetic analysis suggested Stau2 to be most closely related to Staphylococcus phages SA11 and Remus, members of Twort-like phages. The results of sodium dodecyl sulfate polyacrylamide gel electrophoresis showed 14 structural proteins of Stau2 and N-terminal sequencing identified three of them. Importantly, this phage does not encode any proteins which are known or suspected to be involved in toxicity, pathogenicity, or antibiotic resistance. Therefore, further investigations of feasible therapeutic application of Stau2 are needed.
GWAMA: software for genome-wide association meta-analysis.
Mägi, Reedik; Morris, Andrew P
2010-05-28
Despite the recent success of genome-wide association studies in identifying novel loci contributing effects to complex human traits, such as type 2 diabetes and obesity, much of the genetic component of variation in these phenotypes remains unexplained. One way to improving power to detect further novel loci is through meta-analysis of studies from the same population, increasing the sample size over any individual study. Although statistical software analysis packages incorporate routines for meta-analysis, they are ill equipped to meet the challenges of the scale and complexity of data generated in genome-wide association studies. We have developed flexible, open-source software for the meta-analysis of genome-wide association studies. The software incorporates a variety of error trapping facilities, and provides a range of meta-analysis summary statistics. The software is distributed with scripts that allow simple formatting of files containing the results of each association study and generate graphical summaries of genome-wide meta-analysis results. The GWAMA (Genome-Wide Association Meta-Analysis) software has been developed to perform meta-analysis of summary statistics generated from genome-wide association studies of dichotomous phenotypes or quantitative traits. Software with source files, documentation and example data files are freely available online at http://www.well.ox.ac.uk/GWAMA.
Perspectives on Genetic and Genomic Technologies in an Academic Medical Center: The Duke Experience
Katsanis, Sara Huston; Minear, Mollie A.; Vorderstrasse, Allison; Yang, Nancy; Reeves, Jason W.; Rakhra-Burris, Tejinder; Cook-Deegan, Robert; Ginsburg, Geoffrey S.; Simmons, Leigh Ann
2015-01-01
In this age of personalized medicine, genetic and genomic testing is expected to become instrumental in health care delivery, but little is known about its actual implementation in clinical practice. Methods. We surveyed Duke faculty and healthcare providers to examine the extent of genetic and genomic testing adoption. We assessed providers’ use of genetic and genomic testing options and indications in clinical practice, providers’ awareness of pharmacogenetic applications, and providers’ opinions on returning research-generated genetic test results to participants. Most clinician respondents currently use family history routinely in their clinical practice, but only 18 percent of clinicians use pharmacogenetics. Only two respondents correctly identified the number of drug package inserts with pharmacogenetic indications. We also found strong support for the return of genetic research results to participants. Our results demonstrate that while Duke healthcare providers are enthusiastic about genomic technologies, use of genomic tools outside of research has been limited. Respondents favor return of research-based genetic results to participants, but clinicians lack knowledge about pharmacogenetic applications. We identified challenges faced by this institution when implementing genetic and genomic testing into patient care that should inform a policy and education agenda to improve provider support and clinician-researcher partnerships. PMID:25854543
Ou, Hong-Yu; He, Xinyi; Harrison, Ewan M.; Kulasekara, Bridget R.; Thani, Ali Bin; Kadioglu, Aras; Lory, Stephen; Hinton, Jay C. D.; Barer, Michael R.; Rajakumar, Kumar
2007-01-01
MobilomeFINDER (http://mml.sjtu.edu.cn/MobilomeFINDER) is an interactive online tool that facilitates bacterial genomic island or ‘mobile genome’ (mobilome) discovery; it integrates the ArrayOme and tRNAcc software packages. ArrayOme utilizes a microarray-derived comparative genomic hybridization input data set to generate ‘inferred contigs’ produced by merging adjacent genes classified as ‘present’. Collectively these ‘fragments’ represent a hypothetical ‘microarray-visualized genome (MVG)’. ArrayOme permits recognition of discordances between physical genome and MVG sizes, thereby enabling identification of strains rich in microarray-elusive novel genes. Individual tRNAcc tools facilitate automated identification of genomic islands by comparative analysis of the contents and contexts of tRNA sites and other integration hotspots in closely related sequenced genomes. Accessory tools facilitate design of hotspot-flanking primers for in silico and/or wet-science-based interrogation of cognate loci in unsequenced strains and analysis of islands for features suggestive of foreign origins; island-specific and genome-contextual features are tabulated and represented in schematic and graphical forms. To date we have used MobilomeFINDER to analyse several Enterobacteriaceae, Pseudomonas aeruginosa and Streptococcus suis genomes. MobilomeFINDER enables high-throughput island identification and characterization through increased exploitation of emerging sequence data and PCR-based profiling of unsequenced test strains; subsequent targeted yeast recombination-based capture permits full-length sequencing and detailed functional studies of novel genomic islands. PMID:17537813
Fang, Hai; Knezevic, Bogdan; Burnham, Katie L; Knight, Julian C
2016-12-13
Biological interpretation of genomic summary data such as those resulting from genome-wide association studies (GWAS) and expression quantitative trait loci (eQTL) studies is one of the major bottlenecks in medical genomics research, calling for efficient and integrative tools to resolve this problem. We introduce eXploring Genomic Relations (XGR), an open source tool designed for enhanced interpretation of genomic summary data enabling downstream knowledge discovery. Targeting users of varying computational skills, XGR utilises prior biological knowledge and relationships in a highly integrated but easily accessible way to make user-input genomic summary datasets more interpretable. We show how by incorporating ontology, annotation, and systems biology network-driven approaches, XGR generates more informative results than conventional analyses. We apply XGR to GWAS and eQTL summary data to explore the genomic landscape of the activated innate immune response and common immunological diseases. We provide genomic evidence for a disease taxonomy supporting the concept of a disease spectrum from autoimmune to autoinflammatory disorders. We also show how XGR can define SNP-modulated gene networks and pathways that are shared and distinct between diseases, how it achieves functional, phenotypic and epigenomic annotations of genes and variants, and how it enables exploring annotation-based relationships between genetic variants. XGR provides a single integrated solution to enhance interpretation of genomic summary data for downstream biological discovery. XGR is released as both an R package and a web-app, freely available at http://galahad.well.ox.ac.uk/XGR .
The impact of Docker containers on the performance of genomic pipelines
Palumbo, Emilio; Chatzou, Maria; Prieto, Pablo; Heuer, Michael L.; Notredame, Cedric
2015-01-01
Genomic pipelines consist of several pieces of third party software and, because of their experimental nature, frequent changes and updates are commonly necessary thus raising serious deployment and reproducibility issues. Docker containers are emerging as a possible solution for many of these problems, as they allow the packaging of pipelines in an isolated and self-contained manner. This makes it easy to distribute and execute pipelines in a portable manner across a wide range of computing platforms. Thus, the question that arises is to what extent the use of Docker containers might affect the performance of these pipelines. Here we address this question and conclude that Docker containers have only a minor impact on the performance of common genomic pipelines, which is negligible when the executed jobs are long in terms of computational time. PMID:26421241
The impact of Docker containers on the performance of genomic pipelines.
Di Tommaso, Paolo; Palumbo, Emilio; Chatzou, Maria; Prieto, Pablo; Heuer, Michael L; Notredame, Cedric
2015-01-01
Genomic pipelines consist of several pieces of third party software and, because of their experimental nature, frequent changes and updates are commonly necessary thus raising serious deployment and reproducibility issues. Docker containers are emerging as a possible solution for many of these problems, as they allow the packaging of pipelines in an isolated and self-contained manner. This makes it easy to distribute and execute pipelines in a portable manner across a wide range of computing platforms. Thus, the question that arises is to what extent the use of Docker containers might affect the performance of these pipelines. Here we address this question and conclude that Docker containers have only a minor impact on the performance of common genomic pipelines, which is negligible when the executed jobs are long in terms of computational time.
Complete genomic sequence of the Lactobacillus temperate phage LF1.
Yoon, Bo Hyun; Chang, Hyo Ihl
2011-10-01
Bacteriophage LF1, a newly isolated temperate phage from a mitomycin-C-induced lysate of wild type Lactobacillus fermentum, was found to contain a double-strand DNA of 42,606 base pairs (bp) with a G+C content of 45%. Bioinformatic analysis of the phage genome revealed 57 putative open reading frames (ORFs). The predicted protein products of ORFs were determined and described. According to morphological analysis by transmission electron microscopy (TEM), LF1 has an isometric head and a non-contractile tail, indicating that it belongs to the family Siphoviridae. The temperate phage LF1 has a good genetic mosaic relationship with ΦPYB5 in the packaging module. To our knowledge, this is first report of genomic sequencing and characterization of temperate phage LF1 from wild-type L. fermentum isolated from Kimchi in Korea.
DOE Office of Scientific and Technical Information (OSTI.GOV)
McLoughlin, K.
2016-01-11
The overall aim of this project is to develop a software package, called MetaQuant, that can determine the constituents of a complex microbial sample and estimate their relative abundances by analysis of metagenomic sequencing data. The goal for Task 1 is to create a generative model describing the stochastic process underlying the creation of sequence read pairs in the data set. The stages in this generative process include the selection of a source genome sequence for each read pair, with probability dependent on its abundance in the sample. The other stages describe the evolution of the source genome from itsmore » nearest common ancestor with a reference genome, breakage of the source DNA into short fragments, and the errors in sequencing the ends of the fragments to produce read pairs.« less
Genome-scale CRISPR-Cas9 Knockout and Transcriptional Activation Screening
Joung, Julia; Konermann, Silvana; Gootenberg, Jonathan S.; Abudayyeh, Omar O.; Platt, Randall J.; Brigham, Mark D.; Sanjana, Neville E.; Zhang, Feng
2017-01-01
Forward genetic screens are powerful tools for the unbiased discovery and functional characterization of specific genetic elements associated with a phenotype of interest. Recently, the RNA-guided endonuclease Cas9 from the microbial CRISPR (clustered regularly interspaced short palindromic repeats) immune system has been adapted for genome-scale screening by combining Cas9 with pooled guide RNA libraries. Here we describe a protocol for genome-scale knockout and transcriptional activation screening using the CRISPR-Cas9 system. Custom- or ready-made guide RNA libraries are constructed and packaged into lentiviral vectors for delivery into cells for screening. As each screen is unique, we provide guidelines for determining screening parameters and maintaining sufficient coverage. To validate candidate genes identified from the screen, we further describe strategies for confirming the screening phenotype as well as genetic perturbation through analysis of indel rate and transcriptional activation. Beginning with library design, a genome-scale screen can be completed in 9–15 weeks followed by 4–5 weeks of validation. PMID:28333914
Lando, David; Stevens, Tim J; Basu, Srinjan; Laue, Ernest D
2018-01-01
Single-cell chromosome conformation capture approaches are revealing the extent of cell-to-cell variability in the organization and packaging of genomes. These single-cell methods, unlike their multi-cell counterparts, allow straightforward computation of realistic chromosome conformations that may be compared and combined with other, independent, techniques to study 3D structure. Here we discuss how single-cell Hi-C and subsequent 3D genome structure determination allows comparison with data from microscopy. We then carry out a systematic evaluation of recently published single-cell Hi-C datasets to establish a computational approach for the evaluation of single-cell Hi-C protocols. We show that the calculation of genome structures provides a useful tool for assessing the quality of single-cell Hi-C data because it requires a self-consistent network of interactions, relating to the underlying 3D conformation, with few errors, as well as sufficient longer-range cis- and trans-chromosomal contacts.
Cis-acting RNA elements in the Hepatitis C virus RNA genome
Sagan, Selena M.; Chahal, Jasmin; Sarnow, Peter
2017-01-01
Hepatitis C virus (HCV) infection is a rapidly increasing global health problem with an estimated 170 million people infected worldwide. HCV is a hepatotropic, positive-sense RNA virus of the family Flaviviridae. As a positive-sense RNA virus, the HCV genome itself must serve as a template for translation, replication and packaging. The viral RNA must therefore be a dynamic structure that is able to readily accommodate structural changes to expose different regions of the genome to viral and cellular proteins to carry out the HCV life cycle. The ∼9600 nucleotide viral genome contains a single long open reading frame flanked by 5′ and 3′ non-coding regions that contain cis-acting RNA elements important for viral translation, replication and stability. Additional cis-acting RNA elements have also been identified in the coding sequences as well as in the 3′ end of the negative-strand replicative intermediate. Herein, we provide an overview of the importance of these cis-acting RNA elements in the HCV life cycle. PMID:25576644
Genome-scale CRISPR-Cas9 knockout and transcriptional activation screening.
Joung, Julia; Konermann, Silvana; Gootenberg, Jonathan S; Abudayyeh, Omar O; Platt, Randall J; Brigham, Mark D; Sanjana, Neville E; Zhang, Feng
2017-04-01
Forward genetic screens are powerful tools for the unbiased discovery and functional characterization of specific genetic elements associated with a phenotype of interest. Recently, the RNA-guided endonuclease Cas9 from the microbial CRISPR (clustered regularly interspaced short palindromic repeats) immune system has been adapted for genome-scale screening by combining Cas9 with pooled guide RNA libraries. Here we describe a protocol for genome-scale knockout and transcriptional activation screening using the CRISPR-Cas9 system. Custom- or ready-made guide RNA libraries are constructed and packaged into lentiviral vectors for delivery into cells for screening. As each screen is unique, we provide guidelines for determining screening parameters and maintaining sufficient coverage. To validate candidate genes identified by the screen, we further describe strategies for confirming the screening phenotype, as well as genetic perturbation, through analysis of indel rate and transcriptional activation. Beginning with library design, a genome-scale screen can be completed in 9-15 weeks, followed by 4-5 weeks of validation.
Mechanisms for RNA capture by ssDNA viruses: grand theft RNA.
Stedman, Kenneth
2013-06-01
Viruses contain three common types of packaged genomes; double-stranded DNA (dsDNA), RNA (mostly single and occasionally double stranded) and single-stranded DNA (ssDNA). There are relatively straightforward explanations for the prevalence of viruses with dsDNA and RNA genomes, but the evolutionary basis for the apparent success of ssDNA viruses is less clear. The recent discovery of four ssDNA virus genomes that appear to have been formed by recombination between co-infecting RNA and ssDNA viruses, together with the high mutation rate of ssDNA viruses provide possible explanations. RNA-DNA recombination allows ssDNA viruses to access much broader sequence space than through nucleotide substitution and DNA-DNA recombination alone. Multiple non-exclusive mechanisms, all due to the unique replication of ssDNA viruses, are proposed for this unusual RNA capture. RNA capture provides an explanation for the evolutionary success of the ssDNA viruses and may help elucidate the mystery of integrated RNA viruses in viral and cellular DNA genomes.
Yoo, Eung Jae; Cajiao, Isabela; Kim, Jeong-Seon; Kimura, Atsushi P.; Zhang, Aiwen; Cooke, Nancy E.; Liebhaber, Stephen A.
2006-01-01
Random assortment within mammalian genomes juxtaposes genes with distinct expression profiles. This organization, along with the prevalence of long-range regulatory controls, generates a potential for aberrant transcriptional interactions. The human CD79b/GH locus contains six tightly linked genes with three mutually exclusive tissue specificities and interdigitated control elements. One consequence of this compact organization is that the pituitarycell-specific transcriptional events that activate hGH-N also trigger ectopic activation of CD79b. However, the B-cell-specific events that activate CD79b do not trigger reciprocal activation of hGH-N. Here we utilized DNase I hypersensitive site mapping, chromatin immunoprecipitation, and transgenic models to explore the basis for this asymmetric relationship. The results reveal tissue-specific patterns of chromatin structures and transcriptional controls at the CD79b/GH locus in B cells distinct from those in the pituitary gland and placenta. These three unique transcriptional environments suggest a set of corresponding gene expression pathways and transcriptional interactions that are likely to be found juxtaposed at multiple sites within the eukaryotic genome. PMID:16847312
Gut microbiome in type 1 diabetes: A comprehensive review.
Zheng, Peilin; Li, Zhixia; Zhou, Zhiguang
2018-06-21
Type 1 diabetes (T1D) is an autoimmune disease, which is characterized by the destruction of islet β cells in the pancreas triggered by genetic and environmental factors. In past decades, extensive familial and genome-wide association studies have revealed more than 50 risk loci in the genome. However, genetic susceptibility cannot explain the increased incidence of T1D worldwide, which is very likely attributed by the growing impact of environmental factors, especially gut microbiome. Recently, the role of gut microbiome in the pathogenesis of T1D have been uncovered by the increasing evidence from both human subjects and animal models, strongly indicating that gut microbiome might be a pivotal hub of T1D-triggering factors, especially environmental factors. In this review, we summarize the current etiological and mechanism studies of gut microbiome in T1D. A better understanding of the role of gut microbiome in T1D may provide us with powerful prognostic and therapeutic tools in the near future. This article is protected by copyright. All rights reserved.
2011-01-01
The completion of the Human Genome Project triggered a whole new field of genomic research which is likely to lead to new opportunities for the promotion of population health. As a result, the distinction between genetic and environmental diseases has faded. Presently, genomics and knowledge deriving from systems biology, epigenomics, integrative genomics or genome-environmental interactions give a better insight on the pathophysiology of common diseases. However, it is barely used in the prevention and management of diseases. Together with the boost in the amount of genetic association studies, this demands for appropriate public health actions. The field of Public Health Genomics analyses how genome-based knowledge and technologies can responsibly and effectively be integrated into health services and public policy for the benefit of population health. Environmental exposures interact with the genome to produce health information which may help explain inter-individual differences in health, or disease risk. However today, prospects for concrete applications remain distant. In addition, this information has not been translated into health practice yet. Therefore, evidence-based recommendations are few. The lack of population-based research hampers the evaluation of the impact of genomic applications. Public Health Genomics also evaluates the benefits and risks on a larger scale, including normative, legal, economic and social issues. These new developments are likely to affect all domains of public health and require rethinking the role of genomics in every condition of public health interest. This article aims at providing an introduction to the field of and the ideas behind Public Health Genomics. PMID:22958637
1989-08-01
report demonstrates how flavors (object-oriented programming in Franz is carried out via flavors. can be u>,d for this programming. Different approaches...data structures that are part of Franz LISP. A method is a procedure that is invoked by a message to a flavor instance. The method triggered depends...keywordize is a procedure used to intern the :set-op name into the keyword package so that the flavor features of Franz recognize this operation. An
DOE Office of Scientific and Technical Information (OSTI.GOV)
Evans, David Edward
A description of the development of the mc_runjob software package used to manage large scale computing tasks for the D0 Experiment at Fermilab is presented, along with a review of the Digital Front End Trigger electronics and the software used to control them. A tracking study is performed on detector data to determine that the D0 Experiment can detect charged B mesons, and that these results are in accordance with current results. B mesons are found by searching for the decay channel B ± → J / Ψ K ± .
The reduction of a ""safety catastrophic'' potential hazard: A case history
NASA Technical Reports Server (NTRS)
Jones, J. P.
1971-01-01
A worst case analysis is reported on the safety of time watch movements for triggering explosive packages on the lunar surface in an experiment to investigate physical lunar structural characteristics through induced seismic energy waves. Considered are the combined effects of low pressure, low temperature, lunar gravity, gear train error, and position. Control measures constitute a seal control cavity and design requirements to prevent overbanking in the mainspring torque curve. Thus, the potential hazard is reduced to safety negligible.
Gutiérrez-Sacristán, Alba; Guedj, Romain; Korodi, Gabor; Stedman, Jason; Furlong, Laura I; Patel, Chirag J; Kohane, Isaac S; Avillach, Paul
2018-04-15
In the era of big data and precision medicine, the number of databases containing clinical, environmental, self-reported and biochemical variables is increasing exponentially. Enabling the experts to focus on their research questions rather than on computational data management, access and analysis is one of the most significant challenges nowadays. We present Rcupcake, an R package that contains a variety of functions for leveraging different databases through the BD2K PIC-SURE RESTful API and facilitating its query, analysis and interpretation. The package offers a variety of analysis and visualization tools, including the study of the phenotype co-occurrence and prevalence, according to multiple layers of data, such as phenome, exposome or genome. The package is implemented in R and is available under Mozilla v2 license from GitHub (https://github.com/hms-dbmi/Rcupcake). Two reproducible case studies are also available (https://github.com/hms-dbmi/Rcupcake-case-studies/blob/master/SSCcaseStudy_v01.ipynb, https://github.com/hms-dbmi/Rcupcake-case-studies/blob/master/NHANEScaseStudy_v01.ipynb). paul_avillach@hms.harvard.edu. Supplementary data are available at Bioinformatics online.
Huntley, Melanie A; Larson, Jessica L; Chaivorapol, Christina; Becker, Gabriel; Lawrence, Michael; Hackney, Jason A; Kaminker, Joshua S
2013-12-15
It is common for computational analyses to generate large amounts of complex data that are difficult to process and share with collaborators. Standard methods are needed to transform such data into a more useful and intuitive format. We present ReportingTools, a Bioconductor package, that automatically recognizes and transforms the output of many common Bioconductor packages into rich, interactive, HTML-based reports. Reports are not generic, but have been individually designed to reflect content specific to the result type detected. Tabular output included in reports is sortable, filterable and searchable and contains context-relevant hyperlinks to external databases. Additionally, in-line graphics have been developed for specific analysis types and are embedded by default within table rows, providing a useful visual summary of underlying raw data. ReportingTools is highly flexible and reports can be easily customized for specific applications using the well-defined API. The ReportingTools package is implemented in R and available from Bioconductor (version ≥ 2.11) at the URL: http://bioconductor.org/packages/release/bioc/html/ReportingTools.html. Installation instructions and usage documentation can also be found at the above URL.
Energetics of genome ejection from phage revealed by isothermal titration calorimetry
NASA Astrophysics Data System (ADS)
Jeembaeva, Meerim; Jonsson, Bengt; Castelnovo, Martin; Evilevitch, Alex
2009-03-01
It has been experimentally shown that ejection of double-stranded DNA from phage is driven by internal pressure reaching tens of atmospheres. This internal pressure is partially responsible for delivery of DNA into the host cell. While several theoretical models and simulations nicely describe the experimental data of internal forces either resisting active packaging or equivalently favoring spontaneous ejection, there are no direct energy measurements available that would help to verify how quantitative these theories are. We performed direct measurements of the enthalpy responsible for DNA ejection from phage λ, using Isothermal Titration Calorimetry. The phage capsids were ``opened'' in vitro by titrating λ into a solution with LamB receptor and the enthalpy of DNA ejection process was measured. In his way, enthalpy stored in λ was determined as a function of packaged DNA length comparing wild-type phage λ (48.5 kb) with a shorter λ-DNA length mutant (37.7 kb). The temperature dependence of the ejection enthalpy was also investigated. The values obtained were in good agreement with existing models and provide a better understanding of ds- DNA packaging and release mechanisms in motor-packaged viruses (e.g., tailed bacteriophages, Herpes Simplex, and adenoviruses).
cit: hypothesis testing software for mediation analysis in genomic applications.
Millstein, Joshua; Chen, Gary K; Breton, Carrie V
2016-08-01
The challenges of successfully applying causal inference methods include: (i) satisfying underlying assumptions, (ii) limitations in data/models accommodated by the software and (iii) low power of common multiple testing approaches. The causal inference test (CIT) is based on hypothesis testing rather than estimation, allowing the testable assumptions to be evaluated in the determination of statistical significance. A user-friendly software package provides P-values and optionally permutation-based FDR estimates (q-values) for potential mediators. It can handle single and multiple binary and continuous instrumental variables, binary or continuous outcome variables and adjustment covariates. Also, the permutation-based FDR option provides a non-parametric implementation. Simulation studies demonstrate the validity of the cit package and show a substantial advantage of permutation-based FDR over other common multiple testing strategies. The cit open-source R package is freely available from the CRAN website (https://cran.r-project.org/web/packages/cit/index.html) with embedded C ++ code that utilizes the GNU Scientific Library, also freely available (http://www.gnu.org/software/gsl/). joshua.millstein@usc.edu Supplementary data are available at Bioinformatics online. © The Author 2016. Published by Oxford University Press. All rights reserved. For Permissions, please e-mail: journals.permissions@oup.com.
Gonzalez-Vasconcellos, Iria; Alonso-Rodríguez, Silvia; López-Baltar, Isidoro; Fernández, José Luis
2015-01-01
Telomeres, the DNA-protein complexes located at the end of linear eukaryotic chromosomes are essential for genome stability. Improper higher-order chromatin organization at the chromosome ends can give rise to telomeric recombination and genomic instability. We report the development of an assay to quantify differences in the condensation of telomeric chromatin, thereby offering new opportunities to study telomere biology and stability. We have combined a DNA nuclease digestion with a quantitative PCR (qPCR) assay of telomeric DNA, which we term the Telomere Chromatin Condensation Assay (TCCA). By quantifying the relative quantities of telomeric DNA that are progressively digested with the exonuclease Bal 31 the method can discriminate between different levels of telomeric chromatin condensation. The structural chromatin packaging at telomeres shielded against exonuclease digestion delivered an estimate, which we term Chromatin Protection Factor (CPF) that ranged from 1.7 to 2.3 fold greater than that present in unpacked DNA. The CPF was significantly decreased when cell cultures were incubated with the DNA hypomethylating agent 5-azacytidine, demonstrating the ability of the TCCA assay to discriminate between packaging levels of telomeric DNA. Copyright © 2014 Elsevier B.V. All rights reserved.
Pombo, Marina A; Zheng, Yi; Fernandez-Pozo, Noe; Dunham, Diane M; Fei, Zhangjun; Martin, Gregory B
2014-01-01
Plants have two related immune systems to defend themselves against pathogen attack. Initially,pattern-triggered immunity is activated upon recognition of microbe-associated molecular patterns by pattern recognition receptors. Pathogenic bacteria deliver effector proteins into the plant cell that interfere with this immune response and promote disease. However, some plants express resistance proteins that detect the presence of specific effectors leading to a robust defense response referred to as effector-triggered immunity. The interaction of tomato with Pseudomonas syringae pv. tomato is an established model system for understanding the molecular basis of these plant immune responses. We apply high-throughput RNA sequencing to this pathosystem to identify genes whose expression changes specifically during pattern-triggered or effector-triggered immunity. We then develop reporter genes for each of these responses that will enable characterization of the host response to the large collection of P. s. pv. tomato strains that express different combinations of effectors. Virus-induced gene silencing of 30 of the effector-triggered immunity-specific genes identifies Epk1 which encodes a predicted protein kinase from a family previously unknown to be involved in immunity. Knocked-down expression of Epk1 compromises effector-triggered immunity triggered by three bacterial effectors but not by effectors from non-bacterial pathogens. Epistasis experiments indicate that Epk1 acts upstream of effector-triggered immunity-associated MAP kinase signaling. Using RNA-seq technology we identify genes involved in specific immune responses. A functional genomics screen led to the discovery of Epk1, a novel predicted protein kinase required for plant defense activation upon recognition of three different bacterial effectors.
EggLib: processing, analysis and simulation tools for population genetics and genomics
2012-01-01
Background With the considerable growth of available nucleotide sequence data over the last decade, integrated and flexible analytical tools have become a necessity. In particular, in the field of population genetics, there is a strong need for automated and reliable procedures to conduct repeatable and rapid polymorphism analyses, coalescent simulations, data manipulation and estimation of demographic parameters under a variety of scenarios. Results In this context, we present EggLib (Evolutionary Genetics and Genomics Library), a flexible and powerful C++/Python software package providing efficient and easy to use computational tools for sequence data management and extensive population genetic analyses on nucleotide sequence data. EggLib is a multifaceted project involving several integrated modules: an underlying computationally efficient C++ library (which can be used independently in pure C++ applications); two C++ programs; a Python package providing, among other features, a high level Python interface to the C++ library; and the egglib script which provides direct access to pre-programmed Python applications. Conclusions EggLib has been designed aiming to be both efficient and easy to use. A wide array of methods are implemented, including file format conversion, sequence alignment edition, coalescent simulations, neutrality tests and estimation of demographic parameters by Approximate Bayesian Computation (ABC). Classes implementing different demographic scenarios for ABC analyses can easily be developed by the user and included to the package. EggLib source code is distributed freely under the GNU General Public License (GPL) from its website http://egglib.sourceforge.net/ where a full documentation and a manual can also be found and downloaded. PMID:22494792
Cao, Huojun; Amendt, Brad A
2016-11-01
Developmental dental anomalies are common forms of congenital defects. The molecular mechanisms of dental anomalies are poorly understood. Systematic approaches such as clustering genes based on similar expression patterns could identify novel genes involved in dental anomalies and provide a framework for understanding molecular regulatory mechanisms of these genes during tooth development (odontogenesis). A python package (pySAPC) of sparse affinity propagation clustering algorithm for large datasets was developed. Whole genome pair-wise similarity was calculated based on expression pattern similarity based on 45 microarrays of several stages during odontogenesis. pySAPC identified 743 gene clusters based on expression pattern similarity during mouse tooth development. Three clusters are significantly enriched for genes associated with dental anomalies (with FDR <0.1). The three clusters of genes have distinct expression patterns during odontogenesis. Clustering genes based on similar expression profiles recovered several known regulatory relationships for genes involved in odontogenesis, as well as many novel genes that may be involved with the same genetic pathways as genes that have already been shown to contribute to dental defects. By using sparse similarity matrix, pySAPC use much less memory and CPU time compared with the original affinity propagation program that uses a full similarity matrix. This python package will be useful for many applications where dataset(s) are too large to use full similarity matrix. This article is part of a Special Issue entitled "System Genetics" Guest Editor: Dr. Yudong Cai and Dr. Tao Huang. Copyright © 2016. Published by Elsevier B.V.
EggLib: processing, analysis and simulation tools for population genetics and genomics.
De Mita, Stéphane; Siol, Mathieu
2012-04-11
With the considerable growth of available nucleotide sequence data over the last decade, integrated and flexible analytical tools have become a necessity. In particular, in the field of population genetics, there is a strong need for automated and reliable procedures to conduct repeatable and rapid polymorphism analyses, coalescent simulations, data manipulation and estimation of demographic parameters under a variety of scenarios. In this context, we present EggLib (Evolutionary Genetics and Genomics Library), a flexible and powerful C++/Python software package providing efficient and easy to use computational tools for sequence data management and extensive population genetic analyses on nucleotide sequence data. EggLib is a multifaceted project involving several integrated modules: an underlying computationally efficient C++ library (which can be used independently in pure C++ applications); two C++ programs; a Python package providing, among other features, a high level Python interface to the C++ library; and the egglib script which provides direct access to pre-programmed Python applications. EggLib has been designed aiming to be both efficient and easy to use. A wide array of methods are implemented, including file format conversion, sequence alignment edition, coalescent simulations, neutrality tests and estimation of demographic parameters by Approximate Bayesian Computation (ABC). Classes implementing different demographic scenarios for ABC analyses can easily be developed by the user and included to the package. EggLib source code is distributed freely under the GNU General Public License (GPL) from its website http://egglib.sourceforge.net/ where a full documentation and a manual can also be found and downloaded.
Hart, Reece K; Rico, Rudolph; Hare, Emily; Garcia, John; Westbrook, Jody; Fusaro, Vincent A
2015-01-15
Biological sequence variants are commonly represented in scientific literature, clinical reports and databases of variation using the mutation nomenclature guidelines endorsed by the Human Genome Variation Society (HGVS). Despite the widespread use of the standard, no freely available and comprehensive programming libraries are available. Here we report an open-source and easy-to-use Python library that facilitates the parsing, manipulation, formatting and validation of variants according to the HGVS specification. The current implementation focuses on the subset of the HGVS recommendations that precisely describe sequence-level variation relevant to the application of high-throughput sequencing to clinical diagnostics. The package is released under the Apache 2.0 open-source license. Source code, documentation and issue tracking are available at http://bitbucket.org/hgvs/hgvs/. Python packages are available at PyPI (https://pypi.python.org/pypi/hgvs). Supplementary data are available at Bioinformatics online. © The Author 2014. Published by Oxford University Press.
Hart, Reece K.; Rico, Rudolph; Hare, Emily; Garcia, John; Westbrook, Jody; Fusaro, Vincent A.
2015-01-01
Summary: Biological sequence variants are commonly represented in scientific literature, clinical reports and databases of variation using the mutation nomenclature guidelines endorsed by the Human Genome Variation Society (HGVS). Despite the widespread use of the standard, no freely available and comprehensive programming libraries are available. Here we report an open-source and easy-to-use Python library that facilitates the parsing, manipulation, formatting and validation of variants according to the HGVS specification. The current implementation focuses on the subset of the HGVS recommendations that precisely describe sequence-level variation relevant to the application of high-throughput sequencing to clinical diagnostics. Availability and implementation: The package is released under the Apache 2.0 open-source license. Source code, documentation and issue tracking are available at http://bitbucket.org/hgvs/hgvs/. Python packages are available at PyPI (https://pypi.python.org/pypi/hgvs). Contact: reecehart@gmail.com Supplementary information: Supplementary data are available at Bioinformatics online. PMID:25273102
edgeR: a Bioconductor package for differential expression analysis of digital gene expression data.
Robinson, Mark D; McCarthy, Davis J; Smyth, Gordon K
2010-01-01
It is expected that emerging digital gene expression (DGE) technologies will overtake microarray technologies in the near future for many functional genomics applications. One of the fundamental data analysis tasks, especially for gene expression studies, involves determining whether there is evidence that counts for a transcript or exon are significantly different across experimental conditions. edgeR is a Bioconductor software package for examining differential expression of replicated count data. An overdispersed Poisson model is used to account for both biological and technical variability. Empirical Bayes methods are used to moderate the degree of overdispersion across transcripts, improving the reliability of inference. The methodology can be used even with the most minimal levels of replication, provided at least one phenotype or experimental condition is replicated. The software may have other applications beyond sequencing data, such as proteome peptide count data. The package is freely available under the LGPL licence from the Bioconductor web site (http://bioconductor.org).
TADtool: visual parameter identification for TAD-calling algorithms.
Kruse, Kai; Hug, Clemens B; Hernández-Rodríguez, Benjamín; Vaquerizas, Juan M
2016-10-15
Eukaryotic genomes are hierarchically organized into topologically associating domains (TADs). The computational identification of these domains and their associated properties critically depends on the choice of suitable parameters of TAD-calling algorithms. To reduce the element of trial-and-error in parameter selection, we have developed TADtool: an interactive plot to find robust TAD-calling parameters with immediate visual feedback. TADtool allows the direct export of TADs called with a chosen set of parameters for two of the most common TAD calling algorithms: directionality and insulation index. It can be used as an intuitive, standalone application or as a Python package for maximum flexibility. TADtool is available as a Python package from GitHub (https://github.com/vaquerizaslab/tadtool) or can be installed directly via PyPI, the Python package index (tadtool). kai.kruse@mpi-muenster.mpg.de, jmv@mpi-muenster.mpg.deSupplementary information: Supplementary data are available at Bioinformatics online. © The Author 2016. Published by Oxford University Press.
Singh, Amarjeet; Kanwar, Poonam; Pandey, Amita; Tyagi, Akhilesh K.; Sopory, Sudhir K.; Kapoor, Sanjay; Pandey, Girdhar K.
2013-01-01
Background Phospholipase C (PLC) is one of the major lipid hydrolysing enzymes, implicated in lipid mediated signaling. PLCs have been found to play a significant role in abiotic stress triggered signaling and developmental processes in various plant species. Genome wide identification and expression analysis have been carried out for this gene family in Arabidopsis, yet not much has been accomplished in crop plant rice. Methodology/Principal Findings An exhaustive in-silico exploration of rice genome using various online databases and tools resulted in the identification of nine PLC encoding genes. Based on sequence, motif and phylogenetic analysis rice PLC gene family could be divided into phosphatidylinositol-specific PLCs (PI-PLCs) and phosphatidylcholine- PLCs (PC-PLC or NPC) classes with four and five members, respectively. A comparative analysis revealed that PLCs are conserved in Arabidopsis (dicots) and rice (monocot) at gene structure and protein level but they might have evolved through a separate evolutionary path. Transcript profiling using gene chip microarray and quantitative RT-PCR showed that most of the PLC members expressed significantly and differentially under abiotic stresses (salt, cold and drought) and during various developmental stages with condition/stage specific and overlapping expression. This finding suggested an important role of different rice PLC members in abiotic stress triggered signaling and plant development, which was also supported by the presence of relevant cis-regulatory elements in their promoters. Sub-cellular localization of few selected PLC members in Nicotiana benthamiana and onion epidermal cells has provided a clue about their site of action and functional behaviour. Conclusion/Significance The genome wide identification, structural and expression analysis and knowledge of sub-cellular localization of PLC gene family envisage the functional characterization of these genes in crop plants in near future. PMID:23638098
Singh, Amarjeet; Kanwar, Poonam; Pandey, Amita; Tyagi, Akhilesh K; Sopory, Sudhir K; Kapoor, Sanjay; Pandey, Girdhar K
2013-01-01
Phospholipase C (PLC) is one of the major lipid hydrolysing enzymes, implicated in lipid mediated signaling. PLCs have been found to play a significant role in abiotic stress triggered signaling and developmental processes in various plant species. Genome wide identification and expression analysis have been carried out for this gene family in Arabidopsis, yet not much has been accomplished in crop plant rice. An exhaustive in-silico exploration of rice genome using various online databases and tools resulted in the identification of nine PLC encoding genes. Based on sequence, motif and phylogenetic analysis rice PLC gene family could be divided into phosphatidylinositol-specific PLCs (PI-PLCs) and phosphatidylcholine- PLCs (PC-PLC or NPC) classes with four and five members, respectively. A comparative analysis revealed that PLCs are conserved in Arabidopsis (dicots) and rice (monocot) at gene structure and protein level but they might have evolved through a separate evolutionary path. Transcript profiling using gene chip microarray and quantitative RT-PCR showed that most of the PLC members expressed significantly and differentially under abiotic stresses (salt, cold and drought) and during various developmental stages with condition/stage specific and overlapping expression. This finding suggested an important role of different rice PLC members in abiotic stress triggered signaling and plant development, which was also supported by the presence of relevant cis-regulatory elements in their promoters. Sub-cellular localization of few selected PLC members in Nicotiana benthamiana and onion epidermal cells has provided a clue about their site of action and functional behaviour. The genome wide identification, structural and expression analysis and knowledge of sub-cellular localization of PLC gene family envisage the functional characterization of these genes in crop plants in near future.
Novodvorska, Michaela; Stratford, Malcolm; Blythe, Martin J; Wilson, Raymond; Beniston, Richard G; Archer, David B
2016-09-01
The early stages of development of Aspergillus niger conidia during outgrowth were explored by combining genome-wide gene expression analysis (RNAseq), proteomics, Warburg manometry and uptake studies. Resting conidia suspended in water were demonstrated for the first time to be metabolically active as low levels of oxygen uptake and the generation of carbon dioxide were detected, suggesting that low-level respiratory metabolism occurs in conidia for maintenance. Upon triggering of spore germination, generation of CO2 increased dramatically. For a short period, which coincided with mobilisation of the intracellular polyol, trehalose, there was no increase in uptake of O2 indicating that trehalose was metabolised by fermentation. Data from genome-wide mRNA profiling showed the presence of transcripts associated with fermentative and respiratory metabolism in resting conidia. Following triggering of conidial outgrowth, there was a clear switch to respiration after 25min, confirmed by cyanide inhibition. No effect of SHAM, salicylhydroxamic acid, on respiration suggests electron flow via cytochrome c oxidase. Glucose entry into spores was not detectable before 1h after triggering germination. The impact of sorbic acid on germination was examined and we showed that it inhibits glucose uptake. O2 uptake was also inhibited, delaying the onset of respiration and extending the period of fermentation. In conclusion, we show that conidia suspended in water are not completely dormant and that conidial outgrowth involves fermentative metabolism that precedes respiration. Copyright © 2016. Published by Elsevier Inc.
Birkenbihl, Rainer P; Kracher, Barbara; Somssich, Imre E
2017-01-01
During microbial-associated molecular pattern-triggered immunity (MTI), molecules derived from microbes are perceived by cell surface receptors and upon signaling to the nucleus initiate a massive transcriptional reprogramming critical to mount an appropriate host defense response. WRKY transcription factors play an important role in regulating these transcriptional processes. Here, we determined on a genome-wide scale the flg22-induced in vivo DNA binding dynamics of three of the most prominent WRKY factors, WRKY18, WRKY40, and WRKY33. The three WRKY factors each bound to more than 1000 gene loci predominantly at W-box elements, the known WRKY binding motif. Binding occurred mainly in the 500-bp promoter regions of these genes. Many of the targeted genes are involved in signal perception and transduction not only during MTI but also upon damage-associated molecular pattern-triggered immunity, providing a mechanistic link between these functionally interconnected basal defense pathways. Among the additional targets were genes involved in the production of indolic secondary metabolites and in modulating distinct plant hormone pathways. Importantly, among the targeted genes were numerous transcription factors, encoding predominantly ethylene response factors, active during early MTI, and WRKY factors, supporting the previously hypothesized existence of a WRKY subregulatory network. Transcriptional analysis revealed that WRKY18 and WRKY40 function redundantly as negative regulators of flg22-induced genes often to prevent exaggerated defense responses. © 2016 American Society of Plant Biologists. All rights reserved.
Sun, Xun; Lu, You; Bish, Lawrence T; Calcedo, Roberto; Wilson, James M; Gao, Guangping
2010-06-01
Vectors based on several new adeno-associated viral (AAV) serotypes demonstrated strong hepatocyte tropism and transduction efficiency in both small- and large-animal models for liver-directed gene transfer. Efficiency of liver transduction by AAV vectors can be further improved in both murine and nonhuman primate (NHP) animals when the vector genomes are packaged in a self-complementary (sc) format. In an attempt to understand potential molecular mechanism(s) responsible for enhanced transduction efficiency of the sc vector in liver, we performed extensive molecular studies of genome structures of conventional single-stranded (ss) and sc AAV vectors from liver after AAV gene transfer in both mice and NHPs. These included treatment with exonucleases with specific substrate preferences, single-cutter restriction enzyme digestion and polarity-specific hybridization-based vector genome mapping, and bacteriophage phi29 DNA polymerase-mediated and double-stranded circular template-specific rescue of persisted circular genomes. In mouse liver, vector genomes of both genome formats seemed to persist primarily as episomal circular forms, but sc vectors converted into circular forms more rapidly and efficiently. However, the overall differences in vector genome abundance and structure in the liver between ss and sc vectors could not account for the remarkable differences in transduction. Molecular structures of persistent genomes of both ss and sc vectors were significantly more heterogeneous in macaque liver, with noticeable structural rearrangements that warrant further characterizations.
Sun, Xun; Lu, You; Bish, Lawrence T.; Calcedo, Roberto; Wilson, James M.
2010-01-01
Abstract Vectors based on several new adeno-associated viral (AAV) serotypes demonstrated strong hepatocyte tropism and transduction efficiency in both small- and large-animal models for liver-directed gene transfer. Efficiency of liver transduction by AAV vectors can be further improved in both murine and nonhuman primate (NHP) animals when the vector genomes are packaged in a self-complementary (sc) format. In an attempt to understand potential molecular mechanism(s) responsible for enhanced transduction efficiency of the sc vector in liver, we performed extensive molecular studies of genome structures of conventional single-stranded (ss) and sc AAV vectors from liver after AAV gene transfer in both mice and NHPs. These included treatment with exonucleases with specific substrate preferences, single-cutter restriction enzyme digestion and polarity-specific hybridization-based vector genome mapping, and bacteriophage ϕ29 DNA polymerase-mediated and double-stranded circular template-specific rescue of persisted circular genomes. In mouse liver, vector genomes of both genome formats seemed to persist primarily as episomal circular forms, but sc vectors converted into circular forms more rapidly and efficiently. However, the overall differences in vector genome abundance and structure in the liver between ss and sc vectors could not account for the remarkable differences in transduction. Molecular structures of persistent genomes of both ss and sc vectors were significantly more heterogeneous in macaque liver, with noticeable structural rearrangements that warrant further characterizations. PMID:20113166
Base-By-Base: single nucleotide-level analysis of whole viral genome alignments.
Brodie, Ryan; Smith, Alex J; Roper, Rachel L; Tcherepanov, Vasily; Upton, Chris
2004-07-14
With ever increasing numbers of closely related virus genomes being sequenced, it has become desirable to be able to compare two genomes at a level more detailed than gene content because two strains of an organism may share the same set of predicted genes but still differ in their pathogenicity profiles. For example, detailed comparison of multiple isolates of the smallpox virus genome (each approximately 200 kb, with 200 genes) is not feasible without new bioinformatics tools. A software package, Base-By-Base, has been developed that provides visualization tools to enable researchers to 1) rapidly identify and correct alignment errors in large, multiple genome alignments; and 2) generate tabular and graphical output of differences between the genomes at the nucleotide level. Base-By-Base uses detailed annotation information about the aligned genomes and can list each predicted gene with nucleotide differences, display whether variations occur within promoter regions or coding regions and whether these changes result in amino acid substitutions. Base-By-Base can connect to our mySQL database (Virus Orthologous Clusters; VOCs) to retrieve detailed annotation information about the aligned genomes or use information from text files. Base-By-Base enables users to quickly and easily compare large viral genomes; it highlights small differences that may be responsible for important phenotypic differences such as virulence. It is available via the Internet using Java Web Start and runs on Macintosh, PC and Linux operating systems with the Java 1.4 virtual machine.
Research advances on animal genetics in China in 2015.
Zhang, Bo; Chen, Xiao-fang; Huang, Xun; Yang, Xiao
2016-06-20
Chinese scientists have made significant achievements in the field of animal genetics in 2015. Incomplete statistics show that among all the publications of 2015 involving nematode (Caenorhabditis elegans), fly (Drosophila melanogaster), zebrafish (Danio rerio), African clawed frog (Xenopus) or mice (Mus musculus), about 1/5 publications are from China. Many innovative studies were published in high-impact international academic journals by Chinese scientists, including the identification of a putative magnetic receptor MagR, the genetic basis for the regulation of wing polyphenism in the insect brown planthopper (Nilaparvata lugens), DNA N 6 -methyladenine (6mA) modification in the Drosophila genome, a novel molecular mechanism regarding the dendritic spine pruning and maturation in the mammals, the mechanism for the CREB coactivator CRTC2 in the regulation of hepatic lipid metabolism, the control of systemic inflammation by neurotransmitter dopamine, the role of Gasdermin protein family in triggering pyroptosis, a parvalbumin-positive excitatory visual pathway to trigger fear responses in mice, etc. Chinese scientists have also made important contributions in genome editing via TALEN or CRISPR/Cas system. According to incomplete statistics, more than 1/5 of the publications related to genome editing in 2015 are from China, where a variety of animals with different approaches were targeted, ranging from the worm to primates. Particularly, CRISPR/Cas9-mediated gene editing in human tripronuclear zygotes was successfully achieved for the first time. China has been one of the leading countries in genome sequencing in recent years, and Chinese scientists reported the sequence and annotation of the genomes of several important animal species in 2015, including goose (Anser cygnoides), Schlegel's Japanese Gecko (Gekko japonicus), grass carp (Ctenopharyngodon idellus), large yellow croaker (Larimichthys crocea) and pig (Sus scrofa). They further analyzed the genome-wide genetic basis of the species-specific physiological and pathological characteristics as well as their adaptation to environmental conditions. In this review, we make a first attempt to summarize the research advances on animal genetics in China in 2015, with an emphasis on the achievements led by Chinese scientists and carried out in Chinese institutions. We will briefly discuss the significance of their research and contributions of Chinese scientists in animal genetics.
Stabilising the Herpes Simplex Virus capsid by DNA packaging
NASA Astrophysics Data System (ADS)
Wuite, Gijs; Radtke, Kerstin; Sodeik, Beate; Roos, Wouter
2009-03-01
Three different types of Herpes Simplex Virus type 1 (HSV-1) nuclear capsids can be distinguished, A, B and C capsids. These capsids types are, respectively, empty, contain scaffold proteins, or hold DNA. We investigate the physical properties of these three capsids by combining biochemical and nanoindentation techniques. Atomic Force Microscopy (AFM) experiments show that A and C capsids are mechanically indistinguishable whereas B capsids already break at much lower forces. By extracting the pentamers with 2.0 M GuHCl or 6.0 M Urea we demonstrate an increased flexibility of all three capsid types. Remarkably, the breaking force of the B capsids without pentamers does not change, while the modified A and C capsids show a large drop in their breaking force to approximately the value of the B capsids. This result indicates that upon DNA packaging a structural change at or near the pentamers occurs which mechanically reinforces the capsids structure. The reported binding of proteins UL17/UL25 to the pentamers of the A and C capsids seems the most likely candidate for such capsids strengthening. Finally, the data supports the view that initiation of DNA packaging triggers the maturation of HSV-1 capsids.
Micro- and nanoscale devices for investigation of epigenetics and chromatin dynamics
2014-01-01
DNA is the blueprint upon which life is based and transmitted, yet the manner in which chromatin, the dynamic complex of nucleic acids and proteins, is packaged and behaves within the cellular nucleus has only begun to be investigated. The packaging and modifications around the genome have been shown to exert significant influence on cellular behaviour and in turn, human development and disease. However, conventional techniques for studying epigenetic or conformational modifications of chromosomes have inherent limitations, and therefore, new methods based on micro- and nanoscale devices have been sought. Here, we review the development of these devices and explore their use in the study of DNA and chromatin modifications and higher order chromatin structure. PMID:24091454
Cloud prediction of protein structure and function with PredictProtein for Debian.
Kaján, László; Yachdav, Guy; Vicedo, Esmeralda; Steinegger, Martin; Mirdita, Milot; Angermüller, Christof; Böhm, Ariane; Domke, Simon; Ertl, Julia; Mertes, Christian; Reisinger, Eva; Staniewski, Cedric; Rost, Burkhard
2013-01-01
We report the release of PredictProtein for the Debian operating system and derivatives, such as Ubuntu, Bio-Linux, and Cloud BioLinux. The PredictProtein suite is available as a standard set of open source Debian packages. The release covers the most popular prediction methods from the Rost Lab, including methods for the prediction of secondary structure and solvent accessibility (profphd), nuclear localization signals (predictnls), and intrinsically disordered regions (norsnet). We also present two case studies that successfully utilize PredictProtein packages for high performance computing in the cloud: the first analyzes protein disorder for whole organisms, and the second analyzes the effect of all possible single sequence variants in protein coding regions of the human genome.
Cloud Prediction of Protein Structure and Function with PredictProtein for Debian
Kaján, László; Yachdav, Guy; Vicedo, Esmeralda; Steinegger, Martin; Mirdita, Milot; Angermüller, Christof; Böhm, Ariane; Domke, Simon; Ertl, Julia; Mertes, Christian; Reisinger, Eva; Rost, Burkhard
2013-01-01
We report the release of PredictProtein for the Debian operating system and derivatives, such as Ubuntu, Bio-Linux, and Cloud BioLinux. The PredictProtein suite is available as a standard set of open source Debian packages. The release covers the most popular prediction methods from the Rost Lab, including methods for the prediction of secondary structure and solvent accessibility (profphd), nuclear localization signals (predictnls), and intrinsically disordered regions (norsnet). We also present two case studies that successfully utilize PredictProtein packages for high performance computing in the cloud: the first analyzes protein disorder for whole organisms, and the second analyzes the effect of all possible single sequence variants in protein coding regions of the human genome. PMID:23971032
Xiao, Shijun; Li, Jiongtang; Ma, Fengshou; Fang, Lujing; Xu, Shuangbin; Chen, Wei; Wang, Zhi Yong
2015-09-03
Large yellow croaker (Larimichthys crocea) is an important commercial fish in China and East-Asia. The annual product of the species from the aqua-farming industry is about 90 thousand tons. In spite of its economic importance, genetic studies of economic traits and genomic selections of the species are hindered by the lack of genomic resources. Specifically, a whole-genome physical map of large yellow croaker is still missing. The traditional BAC-based fingerprint method is extremely time- and labour-consuming. Here we report the first genome map construction using the high-throughput whole-genome mapping technique by nanochannel arrays in BioNano Genomics Irys system. For an optimal marker density of ~10 per 100 kb, the nicking endonuclease Nt.BspQ1 was chosen for the genome map generation. 645,305 DNA molecules with a total length of ~112 Gb were labelled and detected, covering more than 160X of the large yellow croaker genome. Employing IrysView package and signature patterns in raw DNA molecules, a whole-genome map of large yellow croaker was assembled into 686 maps with a total length of 727 Mb, which was consistent with the estimated genome size. The N50 length of the whole-genome map, including 126 maps, was up to 1.7 Mb. The excellent hybrid alignment with large yellow croaker draft genome validated the consensus genome map assembly and highlighted a promising application of whole-genome mapping on draft genome sequence super-scaffolding. The genome map data of large yellow croaker are accessible on lycgenomics.jmu.edu.cn/pm. Using the state-of-the-art whole-genome mapping technique in Irys system, the first whole-genome map for large yellow croaker has been constructed and thus highly facilitates the ongoing genomic and evolutionary studies for the species. To our knowledge, this is the first public report on genome map construction by the whole-genome mapping for aquatic-organisms. Our study demonstrates a promising application of the whole-genome mapping on genome maps construction for other non-model organisms in a fast and reliable manner.
Progress of CRISPR-Cas Based Genome Editing in Photosynthetic Microbes.
Naduthodi, Mihris Ibnu Saleem; Barbosa, Maria J; van der Oost, John
2018-02-03
The carbon footprint caused by unsustainable development and its environmental and economic impact has become a major concern in the past few decades. Photosynthetic microbes such as microalgae and cyanobacteria are capable of accumulating value-added compounds from carbon dioxide, and have been regarded as environmentally friendly alternatives to reduce the usage of fossil fuels, thereby contributing to reducing the carbon footprint. This light-driven generation of green chemicals and biofuels has triggered the research for metabolic engineering of these photosynthetic microbes. CRISPR-Cas systems are successfully implemented across a wide range of prokaryotic and eukaryotic species for efficient genome editing. However, the inception of this genome editing tool in microalgal and cyanobacterial species took off rather slowly due to various complications. In this review, we elaborate on the established CRISPR-Cas based genome editing in various microalgal and cyanobacterial species. The complications associated with CRISPR-Cas based genome editing in these species are addressed along with possible strategies to overcome these issues. It is anticipated that in the near future this will result in improving and expanding the microalgal and cyanobacterial genome engineering toolbox. © 2018 The Authors. Biotechnology Journal Published by Wiley-VCH Verlag GmbH & Co. KGaA.
Canver, Matthew C.; Bauer, Daniel E.; Dass, Abhishek; Yien, Yvette Y.; Chung, Jacky; Masuda, Takeshi; Maeda, Takahiro; Paw, Barry H.; Orkin, Stuart H.
2014-01-01
The clustered regularly interspaced palindromic repeats (CRISPR)/CRISPR-associated (Cas) 9 nuclease system has provided a powerful tool for genome engineering. Double strand breaks may trigger nonhomologous end joining repair, leading to frameshift mutations, or homology-directed repair using an extrachromosomal template. Alternatively, genomic deletions may be produced by a pair of double strand breaks. The efficiency of CRISPR/Cas9-mediated genomic deletions has not been systematically explored. Here, we present a methodology for the production of deletions in mammalian cells, ranging from 1.3 kb to greater than 1 Mb. We observed a high frequency of intended genomic deletions. Nondeleted alleles are nonetheless often edited with inversions or small insertion/deletions produced at CRISPR recognition sites. Deleted alleles also typically include small insertion/deletions at predicted deletion junctions. We retrieved cells with biallelic deletion at a frequency exceeding that of probabilistic expectation. We demonstrate an inverse relationship between deletion frequency and deletion size. This work suggests that CRISPR/Cas9 is a robust system to produce a spectrum of genomic deletions to allow investigation of genes and genetic elements. PMID:24907273
The orbital TUS detector simulation
NASA Astrophysics Data System (ADS)
Grinyuk, A.; Grebenyuk, V.; Khrenov, B.; Klimov, P.; Lavrova, M.; Panasyuk, M.; Sharakin, S.; Shirokov, A.; Tkachenko, A.; Tkachev, L.; Yashin, I.
2017-04-01
The TUS space experiment is aimed at studying energy and arrival distribution of UHECR at E > 7 × 1019 eV by using the data of EAS fluorescent radiation in atmosphere. The TUS mission was launched at the end of April 2016 on board the dedicated ;Lomonosov; satellite. The TUSSIM software package has been developed to simulate performance of the TUS detector for the Fresnel mirror optical parameters, the light concentrator of the photo detector, the front end and trigger electronics. Trigger efficiency crucially depends on the background level which varies in a wide range: from 0.2 × 106 to 15 × 106 ph/(m2 μ s sr) at moonless and full moon nights respectively. The TUSSIM algorithms are described and the expected TUS statistics is presented for 5 years of data collection from the 500 km solar-synchronized orbit with allowance for the variability of the background light intensity during the space flight.
Dynamic evolution of alpha-gliadin prolamin gene family in homeologous genomes of hexaploid wheat
USDA-ARS?s Scientific Manuscript database
Bread wheat is an allohexaploid species containing the three closely related A, B, and D subgenomes. Homeologous Gli-2 loci located on chromosomes 6A, 6B and 6D encode complex groups of alpha-gliadin seed storage proteins that contribute to the functional properties of wheat flour, but also trigger ...
Molecular evolution of an Avirulence Homolog (Avh) gene subfamily in Phytophthora ramorum
GossErica M.; Caroline M. Press; Niklaus J. Grünwald
2008-01-01
Pathogen effectors can serve a virulence function on behalf of the pathogen or trigger a rapid defense response in resistant hosts. Sequencing of the Phytophthora ramorum genome and subsequent analysis identified a diverse superfamily of approximately 350 genes that are homologous to the four known avirulence genes in plant pathogenic oomycetes and...
USDA-ARS?s Scientific Manuscript database
The genetic tractability of the Hessian fly (HF, Mayetiola destructor) provides an opportunity to investigate the mechanisms insects use to induce plant gall formation. Here we demonstrate that capacity using the newly sequenced HF genome to identify the gene (vH24) that elicits the effector-trigger...
USDA-ARS?s Scientific Manuscript database
Lygus hesperus females exhibit a post-mating behavioral switch that triggers increased egg laying and decreased sexual interest. In Drosophila melanogaster, post-mating changes in behavior are controlled by sex peptide (SP) and the sex peptide receptor (DmSPR). SPR is present in most insect genome...
Genome Evolution Due to Allopolyploidization in Wheat
Feldman, Moshe; Levy, Avraham A.
2012-01-01
The wheat group has evolved through allopolyploidization, namely, through hybridization among species from the plant genera Aegilops and Triticum followed by genome doubling. This speciation process has been associated with ecogeographical expansion and with domestication. In the past few decades, we have searched for explanations for this impressive success. Our studies attempted to probe the bases for the wide genetic variation characterizing these species, which accounts for their great adaptability and colonizing ability. Central to our work was the investigation of how allopolyploidization alters genome structure and expression. We found in wheat that allopolyploidy accelerated genome evolution in two ways: (1) it triggered rapid genome alterations through the instantaneous generation of a variety of cardinal genetic and epigenetic changes (which we termed “revolutionary” changes), and (2) it facilitated sporadic genomic changes throughout the species’ evolution (i.e., evolutionary changes), which are not attainable at the diploid level. Our major findings in natural and synthetic allopolyploid wheat indicate that these alterations have led to the cytological and genetic diploidization of the allopolyploids. These genetic and epigenetic changes reflect the dynamic structural and functional plasticity of the allopolyploid wheat genome. The significance of this plasticity for the successful establishment of wheat allopolyploids, in nature and under domestication, is discussed. PMID:23135324
Thakur, Shalabh; Guttman, David S
2016-06-30
Comparative analysis of whole genome sequence data from closely related prokaryotic species or strains is becoming an increasingly important and accessible approach for addressing both fundamental and applied biological questions. While there are number of excellent tools developed for performing this task, most scale poorly when faced with hundreds of genome sequences, and many require extensive manual curation. We have developed a de-novo genome analysis pipeline (DeNoGAP) for the automated, iterative and high-throughput analysis of data from comparative genomics projects involving hundreds of whole genome sequences. The pipeline is designed to perform reference-assisted and de novo gene prediction, homolog protein family assignment, ortholog prediction, functional annotation, and pan-genome analysis using a range of proven tools and databases. While most existing methods scale quadratically with the number of genomes since they rely on pairwise comparisons among predicted protein sequences, DeNoGAP scales linearly since the homology assignment is based on iteratively refined hidden Markov models. This iterative clustering strategy enables DeNoGAP to handle a very large number of genomes using minimal computational resources. Moreover, the modular structure of the pipeline permits easy updates as new analysis programs become available. DeNoGAP integrates bioinformatics tools and databases for comparative analysis of a large number of genomes. The pipeline offers tools and algorithms for annotation and analysis of completed and draft genome sequences. The pipeline is developed using Perl, BioPerl and SQLite on Ubuntu Linux version 12.04 LTS. Currently, the software package accompanies script for automated installation of necessary external programs on Ubuntu Linux; however, the pipeline should be also compatible with other Linux and Unix systems after necessary external programs are installed. DeNoGAP is freely available at https://sourceforge.net/projects/denogap/ .
Advances and Challenges in Genomic Selection for Disease Resistance.
Poland, Jesse; Rutkoski, Jessica
2016-08-04
Breeding for disease resistance is a central focus of plant breeding programs, as any successful variety must have the complete package of high yield, disease resistance, agronomic performance, and end-use quality. With the need to accelerate the development of improved varieties, genomics-assisted breeding is becoming an important tool in breeding programs. With marker-assisted selection, there has been success in breeding for disease resistance; however, much of this work and research has focused on identifying, mapping, and selecting for major resistance genes that tend to be highly effective but vulnerable to breakdown with rapid changes in pathogen races. In contrast, breeding for minor-gene quantitative resistance tends to produce more durable varieties but is a more challenging breeding objective. As the genetic architecture of resistance shifts from single major R genes to a diffused architecture of many minor genes, the best approach for molecular breeding will shift from marker-assisted selection to genomic selection. Genomics-assisted breeding for quantitative resistance will therefore necessitate whole-genome prediction models and selection methodology as implemented for classical complex traits such as yield. Here, we examine multiple case studies testing whole-genome prediction models and genomic selection for disease resistance. In general, whole-genome models for disease resistance can produce prediction accuracy suitable for application in breeding. These models also largely outperform multiple linear regression as would be applied in marker-assisted selection. With the implementation of genomic selection for yield and other agronomic traits, whole-genome marker profiles will be available for the entire set of breeding lines, enabling genomic selection for disease at no additional direct cost. In this context, the scope of implementing genomics selection for disease resistance, and specifically for quantitative resistance and quarantined pathogens, becomes a tractable and powerful approach in breeding programs.
Self-Assembly of Measles Virus Nucleocapsid-like Particles: Kinetics and RNA Sequence Dependence.
Milles, Sigrid; Jensen, Malene Ringkjøbing; Communie, Guillaume; Maurin, Damien; Schoehn, Guy; Ruigrok, Rob W H; Blackledge, Martin
2016-08-01
Measles virus RNA genomes are packaged into helical nucleocapsids (NCs), comprising thousands of nucleo-proteins (N) that bind the entire genome. N-RNA provides the template for replication and transcription by the viral polymerase and is a promising target for viral inhibition. Elucidation of mechanisms regulating this process has been severely hampered by the inability to controllably assemble NCs. Here, we demonstrate self-organization of N into NC-like particles in vitro upon addition of RNA, providing a simple and versatile tool for investigating assembly. Real-time NMR and fluorescence spectroscopy reveals biphasic assembly kinetics. Remarkably, assembly depends strongly on the RNA-sequence, with the genomic 5' end and poly-Adenine sequences assembling efficiently, while sequences such as poly-Uracil are incompetent for NC formation. This observation has important consequences for understanding the assembly process. © 2016 The Authors. Published by Wiley-VCH Verlag GmbH & Co. KGaA.
Quality control and conduct of genome-wide association meta-analyses.
Winkler, Thomas W; Day, Felix R; Croteau-Chonka, Damien C; Wood, Andrew R; Locke, Adam E; Mägi, Reedik; Ferreira, Teresa; Fall, Tove; Graff, Mariaelisa; Justice, Anne E; Luan, Jian'an; Gustafsson, Stefan; Randall, Joshua C; Vedantam, Sailaja; Workalemahu, Tsegaselassie; Kilpeläinen, Tuomas O; Scherag, André; Esko, Tonu; Kutalik, Zoltán; Heid, Iris M; Loos, Ruth J F
2014-05-01
Rigorous organization and quality control (QC) are necessary to facilitate successful genome-wide association meta-analyses (GWAMAs) of statistics aggregated across multiple genome-wide association studies. This protocol provides guidelines for (i) organizational aspects of GWAMAs, and for (ii) QC at the study file level, the meta-level across studies and the meta-analysis output level. Real-world examples highlight issues experienced and solutions developed by the GIANT Consortium that has conducted meta-analyses including data from 125 studies comprising more than 330,000 individuals. We provide a general protocol for conducting GWAMAs and carrying out QC to minimize errors and to guarantee maximum use of the data. We also include details for the use of a powerful and flexible software package called EasyQC. Precise timings will be greatly influenced by consortium size. For consortia of comparable size to the GIANT Consortium, this protocol takes a minimum of about 10 months to complete.
Chaparro, Cristian; Gayraud, Thomas; de Souza, Rogerio Fernandes; Domingues, Douglas Silva; Akaffou, Sélastique; Laforga Vanzela, Andre Luis; de Kochko, Alexandre; Rigoreau, Michel; Crouzillat, Dominique; Hamon, Serge; Hamon, Perla; Guyot, Romain
2015-01-01
A novel structure of nonautonomous long terminal repeat (LTR) retrotransposons called terminal repeat with GAG domain (TR-GAG) has been described in plants, both in monocotyledonous, dicotyledonous and basal angiosperm genomes. TR-GAGs are relatively short elements in length (<4 kb) showing the typical features of LTR-retrotransposons. However, they carry only one open reading frame coding for the GAG precursor protein involved for instance in transposition, the assembly, and the packaging of the element into the virus-like particle. GAG precursors show similarities with both Copia and Gypsy GAG proteins, suggesting evolutionary relationships of TR-GAG elements with both families. Despite the lack of the enzymatic machinery required for their mobility, strong evidences suggest that TR-GAGs are still active. TR-GAGs represent ubiquitous nonautonomous structures that could be involved in the molecular diversities of plant genomes. PMID:25573958
Quality control and conduct of genome-wide association meta-analyses
Winkler, Thomas W; Day, Felix R; Croteau-Chonka, Damien C; Wood, Andrew R; Locke, Adam E; Mägi, Reedik; Ferreira, Teresa; Fall, Tove; Graff, Mariaelisa; Justice, Anne E; Luan, Jian'an; Gustafsson, Stefan; Randall, Joshua C; Vedantam, Sailaja; Workalemahu, Tsegaselassie; Kilpeläinen, Tuomas O; Scherag, André; Esko, Tonu; Kutalik, Zoltán; Heid, Iris M; Loos, Ruth JF
2014-01-01
Rigorous organization and quality control (QC) are necessary to facilitate successful genome-wide association meta-analyses (GWAMAs) of statistics aggregated across multiple genome-wide association studies. This protocol provides guidelines for [1] organizational aspects of GWAMAs, and for [2] QC at the study file level, the meta-level across studies, and the meta-analysis output level. Real–world examples highlight issues experienced and solutions developed by the GIANT Consortium that has conducted meta-analyses including data from 125 studies comprising more than 330,000 individuals. We provide a general protocol for conducting GWAMAs and carrying out QC to minimize errors and to guarantee maximum use of the data. We also include details for use of a powerful and flexible software package called EasyQC. For consortia of comparable size to the GIANT consortium, the present protocol takes a minimum of about 10 months to complete. PMID:24762786
Dynamics of HIV-1 RNA Near the Plasma Membrane during Virus Assembly.
Sardo, Luca; Hatch, Steven C; Chen, Jianbo; Nikolaitchik, Olga; Burdick, Ryan C; Chen, De; Westlake, Christopher J; Lockett, Stephen; Pathak, Vinay K; Hu, Wei-Shau
2015-11-01
To increase our understanding of the events that lead to HIV-1 genome packaging, we examined the dynamics of viral RNA and Gag-RNA interactions near the plasma membrane by using total internal reflection fluorescence microscopy. We labeled HIV-1 RNA with a photoconvertible Eos protein via an RNA-binding protein that recognizes stem-loop sequences engineered into the viral genome. Near-UV light exposure causes an irreversible structural change in Eos and alters its emitted fluorescence from green to red. We studied the dynamics of HIV-1 RNA by photoconverting Eos near the plasma membrane, and we monitored the population of photoconverted red-Eos-labeled RNA signals over time. We found that in the absence of Gag, most of the HIV-1 RNAs stayed near the plasma membrane transiently, for a few minutes. The presence of Gag significantly increased the time that RNAs stayed near the plasma membrane: most of the RNAs were still detected after 30 min. We then quantified the proportion of HIV-1 RNAs near the plasma membrane that were packaged into assembling viral complexes. By tagging Gag with blue fluorescent protein, we observed that only a portion, ∼13 to 34%, of the HIV-1 RNAs that reached the membrane were recruited into assembling particles in an hour, and the frequency of HIV-1 RNA packaging varied with the Gag expression level. Our studies reveal the HIV-1 RNA dynamics on the plasma membrane and the efficiency of RNA recruitment and provide insights into the events leading to the generation of infectious HIV-1 virions. Nascent HIV-1 particles assemble on plasma membranes. During the assembly process, HIV-1 RNA genomes must be encapsidated into viral complexes to generate infectious particles. To gain insights into the RNA packaging and virus assembly mechanisms, we labeled and monitored the HIV-1 RNA signals near the plasma membrane. Our results showed that most of the HIV-1 RNAs stayed near the plasma membrane for only a few minutes in the absence of Gag, whereas most HIV-1 RNAs stayed at the plasma membrane for 15 to 60 min in the presence of Gag. Our results also demonstrated that only a small proportion of the HIV-1 RNAs, approximately 1/10 to 1/3 of the RNAs that reached the plasma membrane, was incorporated into viral protein complexes. These studies determined the dynamics of HIV-1 RNA on the plasma membrane and obtained temporal information on RNA-Gag interactions that lead to RNA encapsidation. Copyright © 2015, American Society for Microbiology. All Rights Reserved.
Santpere, Gabriel; Darre, Fleur; Blanco, Soledad; Alcami, Antonio; Villoslada, Pablo; Mar Albà, M; Navarro, Arcadi
2014-04-01
Most people in the world (∼90%) are infected by the Epstein-Barr virus (EBV), which establishes itself permanently in B cells. Infection by EBV is related to a number of diseases including infectious mononucleosis, multiple sclerosis, and different types of cancer. So far, only seven complete EBV strains have been described, all of them coming from donors presenting EBV-related diseases. To perform a detailed comparative genomic analysis of EBV including, for the first time, EBV strains derived from healthy individuals, we reconstructed EBV sequences infecting lymphoblastoid cell lines (LCLs) from the 1000 Genomes Project. As strain B95-8 was used to transform B cells to obtain LCLs, it is always present, but a specific deletion in its genome sets it apart from natural EBV strains. After studying hundreds of individuals, we determined the presence of natural EBV in at least 10 of them and obtained a set of variants specific to wild-type EBV. By mapping the natural EBV reads into the EBV reference genome (NC007605), we constructed nearly complete wild-type viral genomes from three individuals. Adding them to the five disease-derived EBV genomic sequences available in the literature, we performed an in-depth comparative genomic analysis. We found that latency genes harbor more nucleotide diversity than lytic genes and that six out of nine latency-related genes, as well as other genes involved in viral attachment and entry into host cells, packaging, and the capsid, present the molecular signature of accelerated protein evolution rates, suggesting rapid host-parasite coevolution.
Lateral gene exchanges shape the genomes of amoeba-resisting microorganisms.
Bertelli, Claire; Greub, Gilbert
2012-01-01
Based on Darwin's concept of the tree of life, vertical inheritance was thought to be dominant, and mutations, deletions, and duplication were streaming the genomes of living organisms. In the current genomic era, increasing data indicated that both vertical and lateral gene inheritance interact in space and time to trigger genome evolution, particularly among microorganisms sharing a given ecological niche. As a paradigm to their diversity and their survival in a variety of cell types, intracellular microorganisms, and notably intracellular bacteria, were considered as less prone to lateral genetic exchanges. Such specialized microorganisms generally have a smaller gene repertoire because they do rely on their host's factors for some basic regulatory and metabolic functions. Here we review events of lateral gene transfer (LGT) that illustrate the genetic exchanges among intra-amoebal microorganisms or between the microorganism and its amoebal host. We tentatively investigate the functions of laterally transferred genes in the light of the interaction with their host as they should confer a selective advantage and success to the amoeba-resisting microorganisms (ARMs).
Salojärvi, Jarkko; Smolander, Olli-Pekka; Nieminen, Kaisa; Rajaraman, Sitaram; Safronov, Omid; Safdari, Pezhman; Lamminmäki, Airi; Immanen, Juha; Lan, Tianying; Tanskanen, Jaakko; Rastas, Pasi; Amiryousefi, Ali; Jayaprakash, Balamuralikrishna; Kammonen, Juhana I; Hagqvist, Risto; Eswaran, Gugan; Ahonen, Viivi Helena; Serra, Juan Alonso; Asiegbu, Fred O; de Dios Barajas-Lopez, Juan; Blande, Daniel; Blokhina, Olga; Blomster, Tiina; Broholm, Suvi; Brosché, Mikael; Cui, Fuqiang; Dardick, Chris; Ehonen, Sanna E; Elomaa, Paula; Escamez, Sacha; Fagerstedt, Kurt V; Fujii, Hiroaki; Gauthier, Adrien; Gollan, Peter J; Halimaa, Pauliina; Heino, Pekka I; Himanen, Kristiina; Hollender, Courtney; Kangasjärvi, Saijaliisa; Kauppinen, Leila; Kelleher, Colin T; Kontunen-Soppela, Sari; Koskinen, J Patrik; Kovalchuk, Andriy; Kärenlampi, Sirpa O; Kärkönen, Anna K; Lim, Kean-Jin; Leppälä, Johanna; Macpherson, Lee; Mikola, Juha; Mouhu, Katriina; Mähönen, Ari Pekka; Niinemets, Ülo; Oksanen, Elina; Overmyer, Kirk; Palva, E Tapio; Pazouki, Leila; Pennanen, Ville; Puhakainen, Tuula; Poczai, Péter; Possen, Boy J H M; Punkkinen, Matleena; Rahikainen, Moona M; Rousi, Matti; Ruonala, Raili; van der Schoot, Christiaan; Shapiguzov, Alexey; Sierla, Maija; Sipilä, Timo P; Sutela, Suvi; Teeri, Teemu H; Tervahauta, Arja I; Vaattovaara, Aleksia; Vahala, Jorma; Vetchinnikova, Lidia; Welling, Annikki; Wrzaczek, Michael; Xu, Enjun; Paulin, Lars G; Schulman, Alan H; Lascoux, Martin; Albert, Victor A; Auvinen, Petri; Helariutta, Ykä; Kangasjärvi, Jaakko
2017-06-01
Silver birch (Betula pendula) is a pioneer boreal tree that can be induced to flower within 1 year. Its rapid life cycle, small (440-Mb) genome, and advanced germplasm resources make birch an attractive model for forest biotechnology. We assembled and chromosomally anchored the nuclear genome of an inbred B. pendula individual. Gene duplicates from the paleohexaploid event were enriched for transcriptional regulation, whereas tandem duplicates were overrepresented by environmental responses. Population resequencing of 80 individuals showed effective population size crashes at major points of climatic upheaval. Selective sweeps were enriched among polyploid duplicates encoding key developmental and physiological triggering functions, suggesting that local adaptation has tuned the timing of and cross-talk between fundamental plant processes. Variation around the tightly-linked light response genes PHYC and FRS10 correlated with latitude and longitude and temperature, and with precipitation for PHYC. Similar associations characterized the growth-promoting cytokinin response regulator ARR1, and the wood development genes KAK and MED5A.
Malcher-Lopes, Renato; Franco, Alier; Tasker, Jeffrey G.
2008-01-01
Glucocorticoids are capable of exerting both genomic and non-genomic actions in target cells of multiple tissues, including the brain, which trigger an array of electrophysiological, metabolic, secretory and inflammatory regulatory responses. Here, we have attempted to show how glucocorticoids may generate a rapid anti-inflammatory response by promoting arachidonic acid-derived endocannabinoid biosynthesis. According to our hypothesized model, non-genomic action of glucocorticoids results in the global shift of membrane lipid metabolism, subverting metabolic pathways toward the synthesis of the anti-inflammatory endocannabinoids, anandamide (AEA) and 2-arachidonoyl-glycerol (2-AG), and away from arachidonic acid production. Post-transcriptional inhibition of cyclooxygenase-2 (COX2) synthesis by glucocorticoids assists this mechanism by suppressing the synthesis of pro-inflammatory prostaglandins as well as endocannabinoid-derived prostanoids. In the central nervous system (CNS) this may represent a major neuroprotective system, which may cross-talk with leptin signaling in the hypothalamus allowing for the coordination between energy homeostasis and the inflammatory response. PMID:18295199
Inference of Ancestral Recombination Graphs through Topological Data Analysis
Cámara, Pablo G.; Levine, Arnold J.; Rabadán, Raúl
2016-01-01
The recent explosion of genomic data has underscored the need for interpretable and comprehensive analyses that can capture complex phylogenetic relationships within and across species. Recombination, reassortment and horizontal gene transfer constitute examples of pervasive biological phenomena that cannot be captured by tree-like representations. Starting from hundreds of genomes, we are interested in the reconstruction of potential evolutionary histories leading to the observed data. Ancestral recombination graphs represent potential histories that explicitly accommodate recombination and mutation events across orthologous genomes. However, they are computationally costly to reconstruct, usually being infeasible for more than few tens of genomes. Recently, Topological Data Analysis (TDA) methods have been proposed as robust and scalable methods that can capture the genetic scale and frequency of recombination. We build upon previous TDA developments for detecting and quantifying recombination, and present a novel framework that can be applied to hundreds of genomes and can be interpreted in terms of minimal histories of mutation and recombination events, quantifying the scales and identifying the genomic locations of recombinations. We implement this framework in a software package, called TARGet, and apply it to several examples, including small migration between different populations, human recombination, and horizontal evolution in finches inhabiting the Galápagos Islands. PMID:27532298
Push back to respond better: regulatory inhibition of the DNA double-strand break response.
Panier, Stephanie; Durocher, Daniel
2013-10-01
Single DNA lesions such as DNA double-strand breaks (DSBs) can cause cell death or trigger genome rearrangements that have oncogenic potential, and so the pathways that mend and signal DNA damage must be highly sensitive but, at the same time, selective and reversible. When initiated, boundaries must be set to restrict the DSB response to the site of the lesion. The integration of positive and, crucially, negative control points involving post-translational modifications such as phosphorylation, ubiquitylation and acetylation is key for building fast, effective responses to DNA damage and for mitigating the impact of DNA lesions on genome integrity.
Comai, Luca; Maheshwari, Shamoni; Marimuthu, Mohan P A
2017-04-01
Plant centromeres, which are determined epigenetically by centromeric histone 3 (CENH3) have revealed surprising structural diversity, ranging from the canonical monocentric seen in vertebrates, to polycentric, and holocentric. Normally stable, centromeres can change position over evolutionary times or upon genomic stress, such as when chromosomes are broken. At the DNA level, centromeres can be based on single copy DNA or more commonly on repeats. Rapid evolution of centromeric sequences and of CENH3 protein remains a mystery, as evidence of co-adaptation is lacking. Epigenetic differences between parents can trigger uniparental centromere failure and genome elimination, contributing to postzygotic hybridization barriers.. Copyright © 2017 Elsevier Ltd. All rights reserved.
Histone Variant Regulates DNA Repair via Chromatin Condensation | Center for Cancer Research
Activating the appropriate DNA repair pathway is essential for maintaining the stability of the genome after a break in both strands of DNA. How a pathway is selected, however, is not well understood. Since these double strand breaks (DSBs) occur while DNA is packaged as chromatin, changes in its organization are necessary for repair to take place. Numerous alterations have
A greedy, graph-based algorithm for the alignment of multiple homologous gene lists.
Fostier, Jan; Proost, Sebastian; Dhoedt, Bart; Saeys, Yvan; Demeester, Piet; Van de Peer, Yves; Vandepoele, Klaas
2011-03-15
Many comparative genomics studies rely on the correct identification of homologous genomic regions using accurate alignment tools. In such case, the alphabet of the input sequences consists of complete genes, rather than nucleotides or amino acids. As optimal multiple sequence alignment is computationally impractical, a progressive alignment strategy is often employed. However, such an approach is susceptible to the propagation of alignment errors in early pairwise alignment steps, especially when dealing with strongly diverged genomic regions. In this article, we present a novel accurate and efficient greedy, graph-based algorithm for the alignment of multiple homologous genomic segments, represented as ordered gene lists. Based on provable properties of the graph structure, several heuristics are developed to resolve local alignment conflicts that occur due to gene duplication and/or rearrangement events on the different genomic segments. The performance of the algorithm is assessed by comparing the alignment results of homologous genomic segments in Arabidopsis thaliana to those obtained by using both a progressive alignment method and an earlier graph-based implementation. Especially for datasets that contain strongly diverged segments, the proposed method achieves a substantially higher alignment accuracy, and proves to be sufficiently fast for large datasets including a few dozens of eukaryotic genomes. http://bioinformatics.psb.ugent.be/software. The algorithm is implemented as a part of the i-ADHoRe 3.0 package.
van den Broek, Evert; van Lieshout, Stef; Rausch, Christian; Ylstra, Bauke; van de Wiel, Mark A; Meijer, Gerrit A; Fijneman, Remond J A; Abeln, Sanne
2016-01-01
Development of cancer is driven by somatic alterations, including numerical and structural chromosomal aberrations. Currently, several computational methods are available and are widely applied to detect numerical copy number aberrations (CNAs) of chromosomal segments in tumor genomes. However, there is lack of computational methods that systematically detect structural chromosomal aberrations by virtue of the genomic location of CNA-associated chromosomal breaks and identify genes that appear non-randomly affected by chromosomal breakpoints across (large) series of tumor samples. 'GeneBreak' is developed to systematically identify genes recurrently affected by the genomic location of chromosomal CNA-associated breaks by a genome-wide approach, which can be applied to DNA copy number data obtained by array-Comparative Genomic Hybridization (CGH) or by (low-pass) whole genome sequencing (WGS). First, 'GeneBreak' collects the genomic locations of chromosomal CNA-associated breaks that were previously pinpointed by the segmentation algorithm that was applied to obtain CNA profiles. Next, a tailored annotation approach for breakpoint-to-gene mapping is implemented. Finally, dedicated cohort-based statistics is incorporated with correction for covariates that influence the probability to be a breakpoint gene. In addition, multiple testing correction is integrated to reveal recurrent breakpoint events. This easy-to-use algorithm, 'GeneBreak', is implemented in R ( www.cran.r-project.org ) and is available from Bioconductor ( www.bioconductor.org/packages/release/bioc/html/GeneBreak.html ).
Characterization of the genome of the dairy Lactobacillus helveticus bacteriophage {Phi}AQ113.
Zago, Miriam; Scaltriti, Erika; Rossetti, Lia; Guffanti, Alessandro; Armiento, Angelarita; Fornasari, Maria Emanuela; Grolli, Stefano; Carminati, Domenico; Brini, Elena; Pavan, Paolo; Felsani, Armando; D'Urzo, Annalisa; Moles, Anna; Claude, Jean-Baptiste; Grandori, Rita; Ramoni, Roberto; Giraffa, Giorgio
2013-08-01
The complete genomic sequence of the dairy Lactobacillus helveticus bacteriophage ΦAQ113 was determined. Phage ΦAQ113 is a Myoviridae bacteriophage with an isometric capsid and a contractile tail. The final assembled consensus sequence revealed a linear, circularly permuted, double-stranded DNA genome with a size of 36,566 bp and a G+C content of 37%. Fifty-six open reading frames (ORFs) were predicted, and a putative function was assigned to approximately 90% of them. The ΦAQ113 genome shows functionally related genes clustered together in a genome structure composed of modules for DNA replication/regulation, DNA packaging, head and tail morphogenesis, cell lysis, and lysogeny. The identification of genes involved in the establishment of lysogeny indicates that it may have originated as a temperate phage, even if it was isolated from natural cheese whey starters as a virulent phage, because it is able to propagate in a sensitive host strain. Additionally, we discovered that the ΦAQ113 phage genome is closely related to Lactobacillus gasseri phage KC5a and Lactobacillus johnsonii phage Lj771 genomes. The phylogenetic similarities between L. helveticus phage ΦAQ113 and two phages that belong to gut species confirm a possible common ancestral origin and support the increasing consideration of L. helveticus as a health-promoting organism.
Characterization of the Genome of the Dairy Lactobacillus helveticus Bacteriophage ΦAQ113
Scaltriti, Erika; Rossetti, Lia; Guffanti, Alessandro; Armiento, Angelarita; Fornasari, Maria Emanuela; Grolli, Stefano; Carminati, Domenico; Brini, Elena; Pavan, Paolo; Felsani, Armando; D'Urzo, Annalisa; Moles, Anna; Claude, Jean-Baptiste; Grandori, Rita; Ramoni, Roberto; Giraffa, Giorgio
2013-01-01
The complete genomic sequence of the dairy Lactobacillus helveticus bacteriophage ΦAQ113 was determined. Phage ΦAQ113 is a Myoviridae bacteriophage with an isometric capsid and a contractile tail. The final assembled consensus sequence revealed a linear, circularly permuted, double-stranded DNA genome with a size of 36,566 bp and a G+C content of 37%. Fifty-six open reading frames (ORFs) were predicted, and a putative function was assigned to approximately 90% of them. The ΦAQ113 genome shows functionally related genes clustered together in a genome structure composed of modules for DNA replication/regulation, DNA packaging, head and tail morphogenesis, cell lysis, and lysogeny. The identification of genes involved in the establishment of lysogeny indicates that it may have originated as a temperate phage, even if it was isolated from natural cheese whey starters as a virulent phage, because it is able to propagate in a sensitive host strain. Additionally, we discovered that the ΦAQ113 phage genome is closely related to Lactobacillus gasseri phage KC5a and Lactobacillus johnsonii phage Lj771 genomes. The phylogenetic similarities between L. helveticus phage ΦAQ113 and two phages that belong to gut species confirm a possible common ancestral origin and support the increasing consideration of L. helveticus as a health-promoting organism. PMID:23728811
Martin, Stanton L; Blackmon, Barbara P; Rajagopalan, Ravi; Houfek, Thomas D; Sceeles, Robert G; Denn, Sheila O; Mitchell, Thomas K; Brown, Douglas E; Wing, Rod A; Dean, Ralph A
2002-01-01
We have created a federated database for genome studies of Magnaporthe grisea, the causal agent of rice blast disease, by integrating end sequence data from BAC clones, genetic marker data and BAC contig assembly data. A library of 9216 BAC clones providing >25-fold coverage of the entire genome was end sequenced and fingerprinted by HindIII digestion. The Image/FPC software package was then used to generate an assembly of 188 contigs covering >95% of the genome. The database contains the results of this assembly integrated with hybridization data of genetic markers to the BAC library. AceDB was used for the core database engine and a MySQL relational database, populated with numerical representations of BAC clones within FPC contigs, was used to create appropriately scaled images. The database is being used to facilitate sequencing efforts. The database also allows researchers mapping known genes or other sequences of interest, rapid and easy access to the fundamental organization of the M.grisea genome. This database, MagnaportheDB, can be accessed on the web at http://www.cals.ncsu.edu/fungal_genomics/mgdatabase/int.htm.
Gur, Ruben C; Irani, Farzin; Seligman, Sarah; Calkins, Monica E; Richard, Jan; Gur, Raquel E
2011-08-01
Genomics has been revolutionizing medicine over the past decade by offering mechanistic insights into disease processes and engendering the age of "individualized medicine." Because of the sheer number of measures generated by gene sequencing methods, genomics requires "Big Science" where large datasets on genes are analyzed in reference to electronic medical record data. This revolution has largely bypassed the behavioral neurosciences, mainly because of the paucity of behavioral data in medical records and the labor-intensity of available neuropsychological assessment methods. We describe the development and implementation of an efficient neuroscience-based computerized battery, coupled with a computerized clinical assessment procedure. This assessment package has been applied to a genomic study of 10,000 children aged 8-21, of whom 1000 also undergo neuroimaging. Results from the first 3000 participants indicate sensitivity to neurodevelopmental trajectories. Sex differences were evident, with females outperforming males in memory and social cognition domains, while for spatial processing males were more accurate and faster, and they were faster on simple motor tasks. The study illustrates what will hopefully become a major component of the work of clinical and research neuropsychologists as invaluable participants in the dawning age of Big Science neuropsychological genomics.
Zhang, Ke; Zhang, Li-Jie; Fang, Ya-Hong; Jin, Xin-Na; Qi, Lei; Wu, Xue-Chang; Zheng, Dao-Qiong
2016-03-01
Genomic structural variation (GSV) is a ubiquitous phenomenon observed in the genomes of Saccharomyces cerevisiae strains with different genetic backgrounds; however, the physiological and phenotypic effects of GSV are not well understood. Here, we first revealed the genetic characteristics of a widely used industrial S. cerevisiae strain, ZTW1, by whole genome sequencing. ZTW1 was identified as an aneuploidy strain and a large-scale GSV was observed in the ZTW1 genome compared with the genome of a diploid strain YJS329. These GSV events led to copy number variations (CNVs) in many chromosomal segments as well as one whole chromosome in the ZTW1 genome. Changes in the DNA dosage of certain functional genes directly affected their expression levels and the resultant ZTW1 phenotypes. Moreover, CNVs of large chromosomal regions triggered an aneuploidy stress in ZTW1. This stress decreased the proliferation ability and tolerance of ZTW1 to various stresses, while aneuploidy response stress may also provide some benefits to the fermentation performance of the yeast, including increased fermentation rates and decreased byproduct generation. This work reveals genomic characters of the bioethanol S. cerevisiae strain ZTW1 and suggests that GSV is an important kind of mutation that changes the traits of industrial S. cerevisiae strains. © FEMS 2016. All rights reserved. For permissions, please e-mail: journals.permissions@oup.com.
NKp44 expression, phylogenesis and function in non-human primate NK cells
De Maria, Andrea; Ugolotti, Elisabetta; Rutjens, Erik; Mazza, Stefania; Radic, Luana; Faravelli, Alessandro; Koopman, Gerrit; Di Marco, Eddi; Costa, Paola; Ensoli, Barbara; Cafaro, Aurelio; Mingari, Maria Cristina; Moretta, Lorenzo; Heeney, Jonathan
2009-01-01
Molecular and functional characterization of the natural cytotoxicity receptor (NCR) NKp44 in species other than Homo sapiens has been elusive, so far. Here, we provide complete phenotypic, molecular and functional characterization for NKp44 triggering receptor on Pan troglodytes NK cells, the closest human relative, and the analysis of NKp44-genomic locus and transcription in Macaca fascicularis. Similar to H. sapiens, NKp44 expression is detectable on chimpanzee NK cells only upon activation. However, basal NKp44 transcription is 5-fold higher in chimpanzees with lower differential increases upon cell activation compared with humans. Upon activation, an overall 12-fold lower NKp44 gene expression is observed in P. troglodytes compared with H. sapiens NK cells with only a slight reduction in NKp44 surface expression. Functional analysis of ‘in vitro’ activated purified NK cells confirms the NKp44 triggering potential compared with other major NCRs. These findings suggest the presence of a post-transcriptional regulation that evolved differently in H. sapiens. Analysis of cynomolgus NKp44-genomic sequence and transcription pattern showed very low levels of transcription with occurrence of out-of-frame transcripts and no surface expression. The present comparative analysis suggests that NKp44-genomic organization appears during macaque speciation, with considerable evolution of its transcriptional and post-transcriptional tuning. Thus, NKp44 may represent an NCR being only recently emerged during speciation, acquiring functional relevance only in non-human primates closest to H. sapiens. PMID:19147838
Structural Characterization of H-1 Parvovirus: Comparison of Infectious Virions to Empty Capsids
Halder, Sujata; Nam, Hyun-Joo; Govindasamy, Lakshmanan; Vogel, Michèle; Dinsart, Christiane; Salomé, Nathalie; McKenna, Robert
2013-01-01
The structure of single-stranded DNA (ssDNA) packaging H-1 parvovirus (H-1PV), which is being developed as an antitumor gene delivery vector, has been determined for wild-type (wt) virions and noninfectious (empty) capsids to 2.7- and 3.2-Å resolution, respectively, using X-ray crystallography. The capsid viral protein (VP) structure consists of an α-helix and an eight-stranded anti-parallel β-barrel with large loop regions between the strands. The β-barrel and loops form the capsid core and surface, respectively. In the wt structure, 600 nucleotides are ordered in an interior DNA binding pocket of the capsid. This accounts for ∼12% of the H-1PV genome. The wt structure is identical to the empty capsid structure, except for side chain conformation variations at the nucleotide binding pocket. Comparison of the H-1PV nucleotides to those observed in canine parvovirus and minute virus of mice, two members of the genus Parvovirus, showed both similarity in structure and analogous interactions. This observation suggests a functional role, such as in capsid stability and/or ssDNA genome recognition for encapsulation. The VP structure differs from those of other parvoviruses in surface loop regions that control receptor binding, tissue tropism, pathogenicity, and antibody recognition, including VP sequences reported to determine tumor cell tropism for oncotropic rodent parvoviruses. These structures of H-1PV provide insight into structural features that dictate capsid stabilization following genome packaging and three-dimensional information applicable for rational design of tumor-targeted recombinant gene delivery vectors. PMID:23449783
NASA Astrophysics Data System (ADS)
Boese, C. M.; Townend, J.; Chamberlain, C. J.; Warren-Smith, E.
2016-12-01
Microseismicity recorded since 2008 by the Southern Alps Microseismicity Borehole Array (SAMBA) and other predominantly short-period seismic networks deployed in the central Southern Alps, New Zealand, reveals distinctive patterns of triggering in response to regional seismicity (magnitudes larger than 5, epicentral distances of 100-500 km). Using matched-filter detection methods implemented in the EQcorrscan package (Chamberlain et al., in prep.), we analyze microseismicity occurring in several geographically distinct swarms in order to examine the responses of specific microearthquake sources to earthquakes of different sizes occurring at different distances and azimuths. The swarms exhibit complex responses to regional seismicity which reveal that microearthquake triggering in these cases involves a combination of extrinsic factors (related to the dynamic stresses produced by the regional earthquake) and intrinsic factors (controlled by the local state of stress and possibly by hydrogeological processes). We find also that the microearthquakes detected by individual templates have Gutenberg-Richter magnitude-frequency characteristics. Since the detected events, by design, have very similar hypocentres and focal mechanisms, the observed scaling pertains to a restricted set of fault planes.
ATACseqQC: a Bioconductor package for post-alignment quality assessment of ATAC-seq data.
Ou, Jianhong; Liu, Haibo; Yu, Jun; Kelliher, Michelle A; Castilla, Lucio H; Lawson, Nathan D; Zhu, Lihua Julie
2018-03-01
ATAC-seq (Assays for Transposase-Accessible Chromatin using sequencing) is a recently developed technique for genome-wide analysis of chromatin accessibility. Compared to earlier methods for assaying chromatin accessibility, ATAC-seq is faster and easier to perform, does not require cross-linking, has higher signal to noise ratio, and can be performed on small cell numbers. However, to ensure a successful ATAC-seq experiment, step-by-step quality assurance processes, including both wet lab quality control and in silico quality assessment, are essential. While several tools have been developed or adopted for assessing read quality, identifying nucleosome occupancy and accessible regions from ATAC-seq data, none of the tools provide a comprehensive set of functionalities for preprocessing and quality assessment of aligned ATAC-seq datasets. We have developed a Bioconductor package, ATACseqQC, for easily generating various diagnostic plots to help researchers quickly assess the quality of their ATAC-seq data. In addition, this package contains functions to preprocess aligned ATAC-seq data for subsequent peak calling. Here we demonstrate the utilities of our package using 25 publicly available ATAC-seq datasets from four studies. We also provide guidelines on what the diagnostic plots should look like for an ideal ATAC-seq dataset. This software package has been used successfully for preprocessing and assessing several in-house and public ATAC-seq datasets. Diagnostic plots generated by this package will facilitate the quality assessment of ATAC-seq data, and help researchers to evaluate their own ATAC-seq experiments as well as select high-quality ATAC-seq datasets from public repositories such as GEO to avoid generating hypotheses or drawing conclusions from low-quality ATAC-seq experiments. The software, source code, and documentation are freely available as a Bioconductor package at https://bioconductor.org/packages/release/bioc/html/ATACseqQC.html .
Altools: a user friendly NGS data analyser.
Camiolo, Salvatore; Sablok, Gaurav; Porceddu, Andrea
2016-02-17
Genotyping by re-sequencing has become a standard approach to estimate single nucleotide polymorphism (SNP) diversity, haplotype structure and the biodiversity and has been defined as an efficient approach to address geographical population genomics of several model species. To access core SNPs and insertion/deletion polymorphisms (indels), and to infer the phyletic patterns of speciation, most such approaches map short reads to the reference genome. Variant calling is important to establish patterns of genome-wide association studies (GWAS) for quantitative trait loci (QTLs), and to determine the population and haplotype structure based on SNPs, thus allowing content-dependent trait and evolutionary analysis. Several tools have been developed to investigate such polymorphisms as well as more complex genomic rearrangements such as copy number variations, presence/absence variations and large deletions. The programs available for this purpose have different strengths (e.g. accuracy, sensitivity and specificity) and weaknesses (e.g. low computation speed, complex installation procedure and absence of a user-friendly interface). Here we introduce Altools, a software package that is easy to install and use, which allows the precise detection of polymorphisms and structural variations. Altools uses the BWA/SAMtools/VarScan pipeline to call SNPs and indels, and the dnaCopy algorithm to achieve genome segmentation according to local coverage differences in order to identify copy number variations. It also uses insert size information from the alignment of paired-end reads and detects potential large deletions. A double mapping approach (BWA/BLASTn) identifies precise breakpoints while ensuring rapid elaboration. Finally, Altools implements several processes that yield deeper insight into the genes affected by the detected polymorphisms. Altools was used to analyse both simulated and real next-generation sequencing (NGS) data and performed satisfactorily in terms of positive predictive values, sensitivity, the identification of large deletion breakpoints and copy number detection. Altools is fast, reliable and easy to use for the mining of NGS data. The software package also attempts to link identified polymorphisms and structural variants to their biological functions thus providing more valuable information than similar tools.
Yang, Kui; Dang, Xiaoqun; Baines, Joel D
2017-10-15
Monomeric herpesvirus DNA is cleaved from concatemers and inserted into preformed capsids through the actions of the viral terminase. The terminase of herpes simplex virus (HSV) is composed of three subunits encoded by U L 15, U L 28, and U L 33. The U L 33-encoded protein (pU L 33) interacts with pU L 28, but its precise role in the DNA cleavage and packaging reaction is unclear. To investigate the function of pU L 33, we generated a panel of recombinant viruses with either deletions or substitutions in the most conserved regions of U L 33 using a bacterial artificial chromosome system. Deletion of 11 amino acids (residues 50 to 60 or residues 110 to 120) precluded viral replication, whereas the truncation of the last 10 amino acids from the pU L 33 C terminus did not affect viral replication or the interaction of pU L 33 with pU L 28. Mutations that replaced the lysine at codon 110 and the arginine at codon 111 with alanine codons failed to replicate, and the pU L 33 mutant interacted with pU L 28 less efficiently. Interestingly, genomic termini of the large (L) and small (S) components were detected readily in cells infected with these mutants, indicating that concatemeric DNA was cleaved efficiently. However, the release of monomeric genomes as assessed by pulsed-field gel electrophoresis was greatly diminished, and DNA-containing capsids were not observed. These results suggest that pU L 33 is necessary for one of the two viral DNA cleavage events required to release individual genomes from concatemeric viral DNA. IMPORTANCE This paper shows a role for pU L 33 in one of the two DNA cleavage events required to release monomeric genomes from concatemeric viral DNA. This is the first time that such a phenotype has been observed and is the first identification of a function of this protein relevant to DNA packaging other than its interaction with other terminase components. Copyright © 2017 Yang et al.
PanACEA: a bioinformatics tool for the exploration and visualization of bacterial pan-chromosomes.
Clarke, Thomas H; Brinkac, Lauren M; Inman, Jason M; Sutton, Granger; Fouts, Derrick E
2018-06-27
Bacterial pan-genomes, comprised of conserved and variable genes across multiple sequenced bacterial genomes, allow for identification of genomic regions that are phylogenetically discriminating or functionally important. Pan-genomes consist of large amounts of data, which can restrict researchers ability to locate and analyze these regions. Multiple software packages are available to visualize pan-genomes, but currently their ability to address these concerns are limited by using only pre-computed data sets, prioritizing core over variable gene clusters, or by not accounting for pan-chromosome positioning in the viewer. We introduce PanACEA (Pan-genome Atlas with Chromosome Explorer and Analyzer), which utilizes locally-computed interactive web-pages to view ordered pan-genome data. It consists of multi-tiered, hierarchical display pages that extend from pan-chromosomes to both core and variable regions to single genes. Regions and genes are functionally annotated to allow for rapid searching and visual identification of regions of interest with the option that user-supplied genomic phylogenies and metadata can be incorporated. PanACEA's memory and time requirements are within the capacities of standard laptops. The capability of PanACEA as a research tool is demonstrated by highlighting a variable region important in differentiating strains of Enterobacter hormaechei. PanACEA can rapidly translate the results of pan-chromosome programs into an intuitive and interactive visual representation. It will empower researchers to visually explore and identify regions of the pan-chromosome that are most biologically interesting, and to obtain publication quality images of these regions.
Pre-dispersal predation effect on seed packaging strategies and seed viability.
DeSoto, Lucía; Tutor, David; Torices, Rubén; Rodríguez-Echeverría, Susana; Nabais, Cristina
2016-01-01
An increased understanding of intraspecific seed packaging (i.e. seed size/number strategy) variation across different environments may improve current knowledge of the ecological forces that drive seed evolution in plants. In particular, pre-dispersal seed predation may influence seed packaging strategies, triggering a reduction of the resources allocated to undamaged seeds within the preyed fruits. Assessing plant reactions to pre-dispersal seed predation is crucial to a better understanding of predation effects, but the response of plants to arthropod attacks remains unexplored. We have assessed the effect of cone predation on the size and viability of undamaged seeds in populations of Juniperus thurifera with contrasting seed packaging strategies, namely, North African populations with single-large-seeded cones and South European populations with multi-small-seeded cones. Our results show that the incidence of predation was lower on the single-large-seeded African cones than on the multi-small-seeded European ones. Seeds from non-preyed cones were also larger and had a higher germination success than uneaten seeds from preyed cones, but only in populations with multi-seeded cones and in cones attacked by Trisetacus sp., suggesting a differential plastic response to predation. It is possible that pre-dispersal seed predation has been a strong selective pressure in European populations with high cone predation rates, being a process which maintains multi-small-seeded cones and empty seeds as a strategy to save some seeds from predation. Conversely, pre-dispersal predation might not have a strong effect in the African populations with single-large-seeded cones characterized by seed germination and filling rates higher than those in the European populations. Our results indicate that differences in pre-dispersal seed predators and predation levels may affect both selection on and intraspecific variation in seed packaging.
Usability study of clinical exome analysis software: top lessons learned and recommendations.
Shyr, Casper; Kushniruk, Andre; Wasserman, Wyeth W
2014-10-01
New DNA sequencing technologies have revolutionized the search for genetic disruptions. Targeted sequencing of all protein coding regions of the genome, called exome analysis, is actively used in research-oriented genetics clinics, with the transition to exomes as a standard procedure underway. This transition is challenging; identification of potentially causal mutation(s) amongst ∼10(6) variants requires specialized computation in combination with expert assessment. This study analyzes the usability of user interfaces for clinical exome analysis software. There are two study objectives: (1) To ascertain the key features of successful user interfaces for clinical exome analysis software based on the perspective of expert clinical geneticists, (2) To assess user-system interactions in order to reveal strengths and weaknesses of existing software, inform future design, and accelerate the clinical uptake of exome analysis. Surveys, interviews, and cognitive task analysis were performed for the assessment of two next-generation exome sequence analysis software packages. The subjects included ten clinical geneticists who interacted with the software packages using the "think aloud" method. Subjects' interactions with the software were recorded in their clinical office within an urban research and teaching hospital. All major user interface events (from the user interactions with the packages) were time-stamped and annotated with coding categories to identify usability issues in order to characterize desired features and deficiencies in the user experience. We detected 193 usability issues, the majority of which concern interface layout and navigation, and the resolution of reports. Our study highlights gaps in specific software features typical within exome analysis. The clinicians perform best when the flow of the system is structured into well-defined yet customizable layers for incorporation within the clinical workflow. The results highlight opportunities to dramatically accelerate clinician analysis and interpretation of patient genomic data. We present the first application of usability methods to evaluate software interfaces in the context of exome analysis. Our results highlight how the study of user responses can lead to identification of usability issues and challenges and reveal software reengineering opportunities for improving clinical next-generation sequencing analysis. While the evaluation focused on two distinctive software tools, the results are general and should inform active and future software development for genome analysis software. As large-scale genome analysis becomes increasingly common in healthcare, it is critical that efficient and effective software interfaces are provided to accelerate clinical adoption of the technology. Implications for improved design of such applications are discussed. Copyright © 2014 The Authors. Published by Elsevier Inc. All rights reserved.
Quality changes of cuttlefish stored under various atmosphere modifications and vacuum packaging.
Bouletis, Achilleas D; Arvanitoyannis, Ioannis S; Hadjichristodoulou, Christos; Neofitou, Christos; Parlapani, Foteini F; Gkagtzis, Dimitrios C
2016-06-01
Seafood preservation and its shelf life prolongation are two of the main issues in the seafood industry. As a result, and in view of market globalization, research has been triggered in this direction by applying several techniques such as modified atmosphere packaging (MAP), vacuum packaging (VP) and active packaging (AP). However, seafood such as octopus, cuttlefish and others have not been thoroughly investigated up to now. The aim of this research was to determine the optimal conditions of modified atmosphere under which cuttlefish storage time and consequently shelf life time could be prolonged without endangering consumer safety. It was found that cuttlefish shelf life reached 2, 2, 4, 8 and 8 days for control, VP, MAP 1, MAP 2 and MAP 3 (20% CO2 -80% N2 , 50% CO2 -50% N2 and 70% CO2 -30% N2 for MAP 1, 2 and 3, respectively) samples, respectively, judging by their sensorial attributes. Elevated CO2 levels had a strong microbiostatic effect, whereas storage under vacuum did not offer significant advantages. All physicochemical attributes of MAP-treated samples were better preserved compared to control. Application of high CO2 atmospheres such as MAP 2 and MAP 3 proved to be an effective strategy toward preserving the characteristics and prolonging the shelf life of fresh cuttlefish and thereby improving its potential in the market. © 2015 Society of Chemical Industry. © 2015 Society of Chemical Industry.
Drosten, C.; Seifried, E.; Roth, W. K.
2001-01-01
Screening of blood donors for human immunodeficiency virus type 1 (HIV-1) infection by PCR permits the earlier diagnosis of HIV-1 infection compared with that by serologic assays. We have established a high-throughput reverse transcription (RT)-PCR assay based on 5′-nuclease PCR. By in-tube detection of HIV-1 RNA with a fluorogenic probe, the 5′-nuclease PCR technology (TaqMan PCR) eliminates the risk of carryover contamination, a major problem in PCR testing. We outline the development and evaluation of the PCR assay from a technical point of view. A one-step RT-PCR that targets the gag genes of all known HIV-1 group M isolates was developed. An internal control RNA detectable with a heterologous 5′-nuclease probe was derived from the viral target cDNA and was packaged into MS2 coliphages (Armored RNA). Because the RNA was protected against digestion with RNase, it could be spiked into patient plasma to control the complete sample preparation and amplification process. The assay detected 831 HIV-1 type B genome equivalents per ml of native plasma (95% confidence interval [CI], 759 to 936 HIV-1 B genome equivalents per ml) with a ≥95% probability of a positive result, as determined by probit regression analysis. A detection limit of 1,195 genome equivalents per ml of (individual) donor plasma (95% CI, 1,014 to 1,470 genome equivalents per ml of plasma pooled from individuals) was achieved when 96 samples were pooled and enriched by centrifugation. Up to 4,000 plasma samples per PCR run were tested in a 3-month trial period. Although data from the present pilot feasibility study will have to be complemented by a large clinical validation study, the assay is a promising approach to the high-throughput screening of blood donors and is the first noncommercial test for high-throughput screening for HIV-1. PMID:11724836
Combining clinical and genomics queries using i2b2 – Three methods
Murphy, Shawn N.; Avillach, Paul; Bellazzi, Riccardo; Phillips, Lori; Gabetta, Matteo; Eran, Alal; McDuffie, Michael T.; Kohane, Isaac S.
2017-01-01
We are fortunate to be living in an era of twin biomedical data surges: a burgeoning representation of human phenotypes in the medical records of our healthcare systems, and high-throughput sequencing making rapid technological advances. The difficulty representing genomic data and its annotations has almost by itself led to the recognition of a biomedical “Big Data” challenge, and the complexity of healthcare data only compounds the problem to the point that coherent representation of both systems on the same platform seems insuperably difficult. We investigated the capability for complex, integrative genomic and clinical queries to be supported in the Informatics for Integrating Biology and the Bedside (i2b2) translational software package. Three different data integration approaches were developed: The first is based on Sequence Ontology, the second is based on the tranSMART engine, and the third on CouchDB. These novel methods for representing and querying complex genomic and clinical data on the i2b2 platform are available today for advancing precision medicine. PMID:28388645
The Impact of Chromatin Dynamics on Cas9-Mediated Genome Editing in Human Cells.
Daer, René M; Cutts, Josh P; Brafman, David A; Haynes, Karmella A
2017-03-17
In order to efficiently edit eukaryotic genomes, it is critical to test the impact of chromatin dynamics on CRISPR/Cas9 function and develop strategies to adapt the system to eukaryotic contexts. So far, research has extensively characterized the relationship between the CRISPR endonuclease Cas9 and the composition of the RNA-DNA duplex that mediates the system's precision. Evidence suggests that chromatin modifications and DNA packaging can block eukaryotic genome editing by custom-built DNA endonucleases like Cas9; however, the underlying mechanism of Cas9 inhibition is unclear. Here, we demonstrate that closed, gene-silencing-associated chromatin is a mechanism for the interference of Cas9-mediated DNA editing. Our assays use a transgenic cell line with a drug-inducible switch to control chromatin states (open and closed) at a single genomic locus. We show that closed chromatin inhibits binding and editing at specific target sites and that artificial reversal of the silenced state restores editing efficiency. These results provide new insights to improve Cas9-mediated editing in human and other mammalian cells.
genipe: an automated genome-wide imputation pipeline with automatic reporting and statistical tools.
Lemieux Perreault, Louis-Philippe; Legault, Marc-André; Asselin, Géraldine; Dubé, Marie-Pierre
2016-12-01
Genotype imputation is now commonly performed following genome-wide genotyping experiments. Imputation increases the density of analyzed genotypes in the dataset, enabling fine-mapping across the genome. However, the process of imputation using the most recent publicly available reference datasets can require considerable computation power and the management of hundreds of large intermediate files. We have developed genipe, a complete genome-wide imputation pipeline which includes automatic reporting, imputed data indexing and management, and a suite of statistical tests for imputed data commonly used in genetic epidemiology (Sequence Kernel Association Test, Cox proportional hazards for survival analysis, and linear mixed models for repeated measurements in longitudinal studies). The genipe package is an open source Python software and is freely available for non-commercial use (CC BY-NC 4.0) at https://github.com/pgxcentre/genipe Documentation and tutorials are available at http://pgxcentre.github.io/genipe CONTACT: louis-philippe.lemieux.perreault@statgen.org or marie-pierre.dube@statgen.orgSupplementary information: Supplementary data are available at Bioinformatics online. © The Author 2016. Published by Oxford University Press.
Alam, Tanfis I; Rao, Venigalla B
2008-03-07
Translocation of double-stranded DNA into a preformed capsid by tailed bacteriophages is driven by powerful motors assembled at the special portal vertex. The motor is thought to drive processive cycles of DNA binding, movement, and release to package the viral genome. In phage T4, there is evidence that the large terminase protein, gene product 17 (gp17), assembles into a multisubunit motor and translocates DNA by an inchworm mechanism. gp17 consists of two domains; an N-terminal ATPase domain (amino acids 1-360) that powers translocation of DNA, and a C-terminal nuclease domain (amino acids 361-610) that cuts concatemeric DNA to generate a headful-size viral genome. While the functional motifs of ATPase and nuclease have been well defined and the ATPase atomic structure has been solved, the DNA binding motif(s) responsible for viral DNA recognition, cutting, and translocation are unknown. Here we report the first evidence for the presence of a double-stranded DNA binding activity in the gp17 ATPase domain. Binding to DNA is sensitive to Mg(2+) and salt, but not the type of DNA used. DNA fragments as short as 20 bp can bind to the ATPase but preferential binding was observed to DNA greater than 1 kb. A high molecular weight ATPase-DNA complex was isolated by gel filtration, suggesting oligomerization of ATPase following DNA interaction. DNA binding was not observed with the full-length gp17, or the C-terminal nuclease domain. The small terminase protein, gp16, inhibited DNA binding, which was further accentuated by ATP. The presence of a DNA binding site in the ATPase domain and its binding properties implicate a role in the DNA packaging mechanism.
USDA-ARS?s Scientific Manuscript database
Shiga toxin producing Escherichia coli (STEC) represent a continuing threat to the Nation’s food supply and public health. Shiga toxin genes (stx) are encoded in lambda-like bacteriophages whose genome is inserted into the bacterial DNA. Environmental stress can trigger bacteriophage replication a...
Ni, Julie Z.; Grate, Leslie; Donohue, John Paul; Preston, Christine; Nobida, Naomi; O’Brien, Georgeann; Shiue, Lily; Clark, Tyson A.; Blume, John E.; Ares, Manuel
2007-01-01
Many alternative splicing events create RNAs with premature stop codons, suggesting that alternative splicing coupled with nonsense-mediated decay (AS-NMD) may regulate gene expression post-transcriptionally. We tested this idea in mice by blocking NMD and measuring changes in isoform representation using splicing-sensitive microarrays. We found a striking class of highly conserved stop codon-containing exons whose inclusion renders the transcript sensitive to NMD. A genomic search for additional examples identified >50 such exons in genes with a variety of functions. These exons are unusually frequent in genes that encode splicing activators and are unexpectedly enriched in the so-called “ultraconserved” elements in the mammalian lineage. Further analysis show that NMD of mRNAs for splicing activators such as SR proteins is triggered by splicing activation events, whereas NMD of the mRNAs for negatively acting hnRNP proteins is triggered by splicing repression, a polarity consistent with widespread homeostatic control of splicing regulator gene expression. We suggest that the extreme genomic conservation surrounding these regulatory splicing events within splicing factor genes demonstrates the evolutionary importance of maintaining tightly tuned homeostasis of RNA-binding protein levels in the vertebrate cell. PMID:17369403
Damgaard, Rasmus; Rasmussen, Mats; Buus, Peter; Mulhall, Brian; Guazzo, Dana Morton
2013-01-01
In Part 1 of this three-part research series, a leak test performed using high-voltage leak detection (HVLD) technology, also referred to as an electrical conductivity and capacitance leak test, was developed and validated for container-closure integrity verification of a small-volume laminate plastic bag containing an aqueous solution for injection. The sterile parenteral product is the rapid-acting insulin analogue, insulin aspart (NovoRapid®/NovoLog®, by Novo Nordisk A/S, Bagsværd, Denmark). The aseptically filled and sealed package is designed to preserve product sterility through expiry. Method development and validation work incorporated positive control packages with a single hole laser-drilled through the laminate film of each bag. A unique HVLD method characterized by specific high-voltage and potentiometer set points was established for testing bags positioned in each of three possible orientations as they are conveyed through the instrument's test zone in each of two possible directions-resulting in a total of six different test method options. Validation study results successfully demonstrated the ability of all six methods to accurately and reliably detect those packages with laser-drilled holes from 2.5-11.2 μm in nominal diameter. Part 2 of this series will further explore HVLD test results as a function of package seal and product storage variables. The final Part 3 will report the impact of HVLD exposure on product physico-chemical stability. In this Part 1 of a three-part research series, a leak test method based on electrical conductivity and capacitance, called high voltage leak detection (HVLD), was used to find leaks in small plastic bags filled with an insulin pharmaceutical solution for human injection by Novo Nordisk A/S (Bagsværd, Denmark). To perform the test, the package is electrically grounded while being conveyed past an electrode linked to a high-voltage, low-amperage transformer. The instrument measures the current that passes from the transformer to the electrode, through the packaged product and along the package walls, to the ground. Plastic packages without defect are relatively nonconductive and yield a low voltage reading; a leaking package with electrically conductive solution located in or near the leak triggers a spike in voltage reading. Test methods were optimized and validated, enabling the detection of leaking packages with holes as small as 2.5 μm in diameter. Part 2 of this series will further explore HVLD test results as a function of package seal and product storage variables. The final Part 3 will report the impact of HVLD exposure on product stability.
Quiles-Puchalt, Nuria; Tormo-Más, María Ángeles; Campoy, Susana; Toledo-Arana, Alejandro; Monedero, Vicente; Lasa, Íñigo; Novick, Richard P.; Christie, Gail E.; Penadés, José R.
2013-01-01
The propagation of bacteriophages and other mobile genetic elements requires exploitation of the phage mechanisms involved in virion assembly and DNA packaging. Here, we identified and characterized four different families of phage-encoded proteins that function as activators required for transcription of the late operons (morphogenetic and lysis genes) in a large group of phages infecting Gram-positive bacteria. These regulators constitute a super-family of proteins, here named late transcriptional regulators (Ltr), which share common structural, biochemical and functional characteristics and are unique to this group of phages. They are all small basic proteins, encoded by genes present at the end of the early gene cluster in their respective phage genomes and expressed under cI repressor control. To control expression of the late operon, the Ltr proteins bind to a DNA repeat region situated upstream of the terS gene, activating its transcription. This involves the C-terminal part of the Ltr proteins, which control specificity for the DNA repeat region. Finally, we show that the Ltr proteins are the only phage-encoded proteins required for the activation of the packaging and lysis modules. In summary, we provide evidence that phage packaging and lysis is a conserved mechanism in Siphoviridae infecting a wide variety of Gram-positive bacteria. PMID:23771138
Donzé, O; Spahr, P F
1992-01-01
The Rous sarcoma virus (RSV) RNA leader sequence carries three open reading frames (uORFs) upstream of the AUG initiator of the gag gene. We studied, in vivo, the role of these uORFs by changing two or three nucleotides of the three AUGs or by deleting the first uORF. Our results show that (i) unlike most previously characterized uORFs, which decrease translation, the first uORF (AUG1) of RSV acts as an enhancer of translation, since absence of the first AUG decreased translation; AUG3 also modulates translation, probably by interfering with scanning ribosomes as described for other upstream ORFs, and mutation of AUG2 had no effect on translation. (ii) Mutation of each of the upstream AUGs lowered the infectivity of progeny virions. (iii) Unexpectedly, mutation of AUG1 and/or AUG3 dramatically reduced RNA packaging by 50-to 100-fold, unlike mutation of AUG2 which did not alter RNA packaging efficiency. Additional mutants in the vicinity of uORF1 and uORF3 were constructed in order to elucidate the mechanism by which uORFs affect RNA packaging: a translation model requiring uORFs 1 and 3, and involving ribosome pausing at AUG 3 is discussed. Images PMID:1327749
López-Lastra, Marcelo; Ulrici, Sandrine; Gabus, Caroline; Darlix, Jean-Luc
1999-01-01
Mouse virus-like 30S RNAs (VL30m) constitute a family of retrotransposons, present at 100 to 200 copies, dispersed in the mouse genome. They display little sequence homology to Moloney murine leukemia virus (MoMLV), do not encode virus-like proteins, and have not been implicated in retroviral carcinogenesis. However, VL30 RNAs are efficiently packaged into MLV particles that are propagated in cell culture. In this study, we addressed whether the 5′ region of VL30m could replace the 5′ leader of MoMLV functionally in a recombinant vector construct. Our data confirm that the putative packaging sequence of VL30 is located within the 5′ region (nucleotides 362 to 1149 with respect to the cap structure) and that it can replace the packaging sequence of MoMLV. We also show that VL30m contains an internal ribosome entry segment (IRES) in the 5′ region, as do MoMLV, Friend murine leukemia virus, Harvey murine sarcoma virus, and avian reticuloendotheliosis virus type A. Our data show that both the packaging and IRES functions of the 5′ region of VL30m RNA can be efficiently used to develop retrotransposon-based vectors. PMID:10482590