Sample records for precursor genomic segment

  1. Identification of a precursor genomic segment that provided a sequence unique to glycophorin B and E genes

    DOE Office of Scientific and Technical Information (OSTI.GOV)

    Onda, M.; Kudo, S.; Fukuda, M.

    Human glycophorin A, B, and E (GPA, GPB, and GPE) genes belong to a gene family located at the long arm of chromosome 4. These three genes are homologous from the 5'-flanking sequence to the Alu sequence, which is 1 kb downstream from the exon encoding the transmembrane domain. Analysis of the Alu sequence and flanking direct repeat sequences suggested that the GPA gene most closely resembles the ancestral gene, whereas the GPB and GPE gene arose by homologous recombination within the Alu sequence, acquiring 3' sequences from an unrelated precursor genomic segment. Here the authors describe the identification ofmore » this putative precursor genomic segment. A human genomic library was screened by using the sequence of the 3' region of the GPB gene as a probe. The genomic clones isolated were found to contain an Alu sequence that appeared to be involved in the recombination. Downstream from the Alu sequence, the nucleotide sequence of the precursor genomic segment is almost identical to that of the GPB or GPE gene. In contrast, the upstream sequence of the genomic segment differs entirely from that of the GPA, GPB, and GPE genes. Conservation of the direct repeats flanking the Alu sequence of the genomic segment strongly suggests that the sequence of this genomic segment has been maintained during evolution. This identified genomic segment was found to reside downstream from the GPA gene by both gene mapping and in situ chromosomal localization. The precursor genomic segment was also identified in the orangutan genome, which is known to lack GPB and GPE genes. These results indicate that one of the duplicated ancestral glycophorin genes acquired a unique 3' sequence by unequal crossing-over through its Alu sequence and the further downstream Alu sequence present in the duplicated gene. Further duplication and divergence of this gene yielded the GPB and GPE genes. 37 refs., 5 figs.« less

  2. Creation of Rift Valley Fever Viruses with Four-Segmented Genomes Reveals Flexibility in Bunyavirus Genome Packaging

    PubMed Central

    Oreshkova, Nadia; Moormann, Rob J. M.; Kortekaas, Jeroen

    2014-01-01

    ABSTRACT Bunyavirus genomes comprise a small (S), a medium (M), and a large (L) RNA segment of negative polarity. Although the untranslated regions have been shown to comprise signals required for transcription, replication, and encapsidation, the mechanisms that drive the packaging of at least one S, M, and L segment into a single virion to generate infectious virus are largely unknown. One of the most important members of the Bunyaviridae family that causes devastating disease in ruminants and occasionally humans is the Rift Valley fever virus (RVFV). We studied the flexibility of RVFV genome packaging by splitting the glycoprotein precursor gene, encoding the (NSm)GnGc polyprotein, into two individual genes encoding either (NSm)Gn or Gc. Using reverse genetics, six viruses with a segmented glycoprotein precursor gene were rescued, varying from a virus comprising two S-type segments in the absence of an M-type segment to a virus consisting of four segments (RVFV-4s), of which three are M-type. Despite that all virus variants were able to grow in mammalian cell lines, they were unable to spread efficiently in cells of mosquito origin. Moreover, in vivo studies demonstrated that RVFV-4s is unable to cause disseminated infection and disease in mice, even in the presence of the main virulence factor NSs, but induced a protective immune response against a lethal challenge with wild-type virus. In summary, splitting bunyavirus glycoprotein precursor genes provides new opportunities to study bunyavirus genome packaging and offers new methods to develop next-generation live-attenuated bunyavirus vaccines. IMPORTANCE Rift Valley fever virus (RVFV) causes devastating disease in ruminants and occasionally humans. Virions capable of productive infection comprise at least one copy of the small (S), medium (M), and large (L) RNA genome segments. The M segment encodes a glycoprotein precursor (GPC) protein that is cotranslationally cleaved into Gn and Gc, which are required for virus entry and fusion. We studied the flexibility of RVFV genome packaging and developed experimental live-attenuated vaccines by applying a unique strategy based on the splitting of the GnGc open reading frame. Several RVFV variants, varying from viruses comprising two S-type segments to viruses consisting of four segments (RVFV-4s), of which three are M-type, could be rescued and were shown to induce a rapid protective immune response. Altogether, the segmentation of bunyavirus GPCs provides a new method for studying bunyavirus genome packaging and facilitates the development of novel live-attenuated bunyavirus vaccines. PMID:25008937

  3. Creation of Rift Valley fever viruses with four-segmented genomes reveals flexibility in bunyavirus genome packaging.

    PubMed

    Wichgers Schreur, Paul J; Oreshkova, Nadia; Moormann, Rob J M; Kortekaas, Jeroen

    2014-09-01

    Bunyavirus genomes comprise a small (S), a medium (M), and a large (L) RNA segment of negative polarity. Although the untranslated regions have been shown to comprise signals required for transcription, replication, and encapsidation, the mechanisms that drive the packaging of at least one S, M, and L segment into a single virion to generate infectious virus are largely unknown. One of the most important members of the Bunyaviridae family that causes devastating disease in ruminants and occasionally humans is the Rift Valley fever virus (RVFV). We studied the flexibility of RVFV genome packaging by splitting the glycoprotein precursor gene, encoding the (NSm)GnGc polyprotein, into two individual genes encoding either (NSm)Gn or Gc. Using reverse genetics, six viruses with a segmented glycoprotein precursor gene were rescued, varying from a virus comprising two S-type segments in the absence of an M-type segment to a virus consisting of four segments (RVFV-4s), of which three are M-type. Despite that all virus variants were able to grow in mammalian cell lines, they were unable to spread efficiently in cells of mosquito origin. Moreover, in vivo studies demonstrated that RVFV-4s is unable to cause disseminated infection and disease in mice, even in the presence of the main virulence factor NSs, but induced a protective immune response against a lethal challenge with wild-type virus. In summary, splitting bunyavirus glycoprotein precursor genes provides new opportunities to study bunyavirus genome packaging and offers new methods to develop next-generation live-attenuated bunyavirus vaccines. Rift Valley fever virus (RVFV) causes devastating disease in ruminants and occasionally humans. Virions capable of productive infection comprise at least one copy of the small (S), medium (M), and large (L) RNA genome segments. The M segment encodes a glycoprotein precursor (GPC) protein that is cotranslationally cleaved into Gn and Gc, which are required for virus entry and fusion. We studied the flexibility of RVFV genome packaging and developed experimental live-attenuated vaccines by applying a unique strategy based on the splitting of the GnGc open reading frame. Several RVFV variants, varying from viruses comprising two S-type segments to viruses consisting of four segments (RVFV-4s), of which three are M-type, could be rescued and were shown to induce a rapid protective immune response. Altogether, the segmentation of bunyavirus GPCs provides a new method for studying bunyavirus genome packaging and facilitates the development of novel live-attenuated bunyavirus vaccines. Copyright © 2014, American Society for Microbiology. All Rights Reserved.

  4. Stability of local secondary structure determines selectivity of viral RNA chaperones.

    PubMed

    Bravo, Jack P K; Borodavka, Alexander; Barth, Anders; Calabrese, Antonio N; Mojzes, Peter; Cockburn, Joseph J B; Lamb, Don C; Tuma, Roman

    2018-05-18

    To maintain genome integrity, segmented double-stranded RNA viruses of the Reoviridae family must accurately select and package a complete set of up to a dozen distinct genomic RNAs. It is thought that the high fidelity segmented genome assembly involves multiple sequence-specific RNA-RNA interactions between single-stranded RNA segment precursors. These are mediated by virus-encoded non-structural proteins with RNA chaperone-like activities, such as rotavirus (RV) NSP2 and avian reovirus σNS. Here, we compared the abilities of NSP2 and σNS to mediate sequence-specific interactions between RV genomic segment precursors. Despite their similar activities, NSP2 successfully promotes inter-segment association, while σNS fails to do so. To understand the mechanisms underlying such selectivity in promoting inter-molecular duplex formation, we compared RNA-binding and helix-unwinding activities of both proteins. We demonstrate that octameric NSP2 binds structured RNAs with high affinity, resulting in efficient intramolecular RNA helix disruption. Hexameric σNS oligomerizes into an octamer that binds two RNAs, yet it exhibits only limited RNA-unwinding activity compared to NSP2. Thus, the formation of intersegment RNA-RNA interactions is governed by both helix-unwinding capacity of the chaperones and stability of RNA structure. We propose that this protein-mediated RNA selection mechanism may underpin the high fidelity assembly of multi-segmented RNA genomes in Reoviridae.

  5. Creation of a Recombinant Rift Valley Fever Virus with a Two-Segmented Genome ▿ †

    PubMed Central

    Brennan, Benjamin; Welch, Stephen R.; McLees, Angela; Elliott, Richard M.

    2011-01-01

    Rift Valley fever virus (RVFV; family Bunyaviridae) is a clinically important, mosquito-borne pathogen of both livestock and humans, which is found mainly in sub-Saharan Africa and the Arabian Peninsula. RVFV has a trisegmented single-stranded RNA (ssRNA) genome. The L and M segments are negative sense and encode the L protein (viral polymerase) on the L segment and the virion glycoproteins Gn and Gc as well as two other proteins, NSm and 78K, on the M segment. The S segment uses an ambisense coding strategy to express the nucleocapsid protein, N, and the nonstructural protein, NSs. Both the NSs and NSm proteins are dispensable for virus growth in tissue culture. Using reverse genetics, we generated a recombinant virus, designated r2segMP12, containing a two-segmented genome in which the NSs coding sequence was replaced with that for the Gn and Gc precursor. Thus, r2segMP12 lacks an M segment, and although it was attenuated in comparison to the three-segmented parental virus in both mammalian and insect cell cultures, it was genetically stable over multiple passages. We further show that the virus can stably maintain an M-like RNA segment encoding the enhanced green fluorescent protein gene. The implications of these findings for RVFV genome packaging and the potential to develop multivalent live-attenuated vaccines are discussed. PMID:21795328

  6. Self-Association of Lymphocytic Choriomeningitis Virus Nucleoprotein Is Mediated by Its N-Terminal Region and Is Not Required for Its Anti-Interferon Function

    PubMed Central

    Ortiz-Riaño, Emilio; Cheng, Benson Yee Hin

    2012-01-01

    Arenaviruses have a bisegmented, negative-strand RNA genome. Both the large (L) and small (S) genome segments use an ambisense coding strategy to direct the synthesis of two viral proteins. The L segment encodes the virus polymerase (L protein) and the matrix Z protein, whereas the S segment encodes the nucleoprotein (NP) and the glycoprotein precursor (GPC). NPs are the most abundant viral protein in infected cells and virions and encapsidate genomic RNA species to form an NP-RNA complex that, together with the virus L polymerase, forms the virus ribonucleoprotein (RNP) core capable of directing both replication and transcription of the viral genome. RNP formation predicts a self-association property of NPs. Here we document self-association (homotypic interaction) of the NP of the prototypic arenavirus lymphocytic choriomeningitis virus (LCMV), as well as those of the hemorrhagic fever (HF) arenaviruses Lassa virus (LASV) and Machupo virus (MACV). We also show heterotypic interaction between NPs from both closely (LCMV and LASV) and distantly (LCMV and MACV) genetically related arenaviruses. LCMV NP self-association was dependent on the presence of single-stranded RNA and mediated by an N-terminal region of the NP that did not overlap with the previously described C-terminal NP domain involved in either counteracting the host type I interferon response or interacting with LCMV Z. PMID:22258244

  7. Long-term surveillance of H7 influenza viruses in American wild aquatic birds: are the H7N3 influenza viruses in wild birds the precursors of highly pathogenic strains in domestic poultry?

    PubMed Central

    Krauss, Scott; Stucker, Karla M; Schobel, Seth A; Danner, Angela; Friedman, Kimberly; Knowles, James P; Kayali, Ghazi; Niles, Lawrence J; Dey, Amanda D; Raven, Garnet; Pryor, Paul; Lin, Xudong; Das, Suman R; Stockwell, Timothy B; Wentworth, David E; Webster, Robert G

    2015-01-01

    The emergence of influenza A virus (IAV) in domestic avian species and associated transmissions to mammals is unpredictable. In the Americas, the H7 IAVs are of particular concern, and there have been four separate outbreaks of highly pathogenic (HP) H7N3 in domestic poultry in North and South America between 2002 and 2012, with occasional spillover into humans. Here, we use long-term IAV surveillance in North American shorebirds at Delaware Bay, USA, from 1985 to 2012 and in ducks in Alberta, Canada, from 1976 to 2012 to determine which hemagglutinin (HA)–neuraminidase (NA) combinations predominated in Anseriformes (ducks) and Charadriiformes (shorebirds) and whether there is concordance between peaks of H7 prevalence and transmission in wild aquatic birds and the emergence of H7 IAVs in poultry and humans. Whole-genome sequencing supported phylogenetic and genomic constellation analyses to determine whether HP IAVs emerge in the context of specific internal gene segment sequences. Phylogenetic analysis of whole-genome sequences of the H7N3 influenza viruses from wild birds and HP H7N3 outbreaks in the Americas indicate that each HP outbreak was an independent emergence event and that the low pathogenic (LP) avian influenza precursors were most likely from dabbling ducks. The different polybasic cleavage sites in the four HP outbreaks support independent origins. At the 95% nucleotide percent identity-level phylogenetic analysis showed that the wild duck HA, PB1, and M sequences clustered with the poultry and human outbreak sequences. The genomic constellation analysis strongly suggests that gene segments/virus flow from wild birds to domestic poultry. PMID:26954883

  8. Four-segmented Rift Valley fever virus induces sterile immunity in sheep after a single vaccination.

    PubMed

    Wichgers Schreur, Paul J; Kant, Jet; van Keulen, Lucien; Moormann, Rob J M; Kortekaas, Jeroen

    2015-03-17

    Rift Valley fever virus (RVFV), a mosquito-borne virus in the Bunyaviridae family, causes recurrent outbreaks with severe disease in ruminants and occasionally humans. The virus comprises a segmented genome consisting of a small (S), medium (M) and large (L) RNA segment of negative polarity. The M-segment encodes a glycoprotein precursor (GPC) protein that is co-translationally cleaved into Gn and Gc, which are required for virus entry and fusion. Recently we developed a four-segmented RVFV (RVFV-4s) by splitting the M-genome segment, and used this virus to study RVFV genome packaging. Here we evaluated the potential of a RVFV-4s variant lacking the NSs gene (4s-ΔNSs) to induce protective immunity in sheep. Groups of seven lambs were either mock-vaccinated or vaccinated with 10(5) or 10(6) tissue culture infective dose (TCID50) of 4s-ΔNSs via the intramuscular (IM) or subcutaneous (SC) route. Three weeks post-vaccination all lambs were challenged with wild-type RVFV. Mock-vaccinated lambs developed high fever and high viremia within 2 days post-challenge and three animals eventually succumbed to the infection. In contrast, none of the 4s-ΔNSs vaccinated animals developed clinical signs during the course of the experiment. Vaccination with 10(5) TCID50 via the IM route provided sterile immunity, whereas a 10(6) dose was required to induce sterile immunity via SC vaccination. Protection was strongly correlated with the presence of RVFV neutralizing antibodies. This study shows that 4s-ΔNSs is able to induce sterile immunity in the natural target species after a single vaccination, preferably administrated via the IM route. Copyright © 2015 Elsevier Ltd. All rights reserved.

  9. Complete genome sequence of a Watermelon silver mottle virus isolate from China.

    PubMed

    Rao, Xueqin; Wu, Zhuyan; Li, Yuan

    2013-06-01

    The complete genome of a Watermelon silver mottle virus (WSMoV) (genus Tospovirus, family Bunyaviridae) isolate (WSMoV-GZ) from Guangdong province, China was sequenced. The genomes of WSMoV-GZ contained 3,603, 4,909, and 8,914 nt of small (S), medium (M), and large (L) RNA segments, respectively, and had a genomic organization characteristic of members of the genus Tospovirus. The amino acid sequence of the nucleocapsid (N) protein, S RNA-encoded nonstructural (NSs) protein, M RNA-encoded nonstructural (NSm) protein, Gn/Gc glycoprotein precursor, and RNA-dependent RNA polymerase (RdRp) protein showed 94.3-97.5 % identity with those of other WSMoV isolates. Phylogenetic analysis showed that the N protein of WSMoV-GZ was clustered together with those of the WSMoV isolates. The full sequence of WSMoV-GZ provides a reference genome for comparison with other tospoviruses.

  10. Unusual DNA Structures Associated With Germline Genetic Activity in Caenorhabditis elegans

    PubMed Central

    Fire, Andrew; Alcazar, Rosa; Tan, Frederick

    2006-01-01

    We describe a surprising long-range periodicity that underlies a substantial fraction of C. elegans genomic sequence. Extended segments (up to several hundred nucleotides) of the C. elegans genome show a strong bias toward occurrence of AA/TT dinucleotides along one face of the helix while little or no such constraint is evident on the opposite helical face. Segments with this characteristic periodicity are highly overrepresented in intron sequences and are associated with a large fraction of genes with known germline expression in C. elegans. In addition to altering the path and flexibility of DNA in vitro, sequences of this character have been shown by others to constrain DNA∷nucleosome interactions, potentially producing a structure that could resist the assembly of highly ordered (phased) nucleosome arrays that have been proposed as a precursor to heterochromatin. We propose a number of ways that the periodic occurrence of An/Tn clusters could reflect evolution and function of genes that express in the germ cell lineage of C. elegans. PMID:16648589

  11. Genomic analysis reveals Nairobi sheep disease virus to be highly diverse and present in both Africa, and in India in the form of the Ganjam virus variant.

    PubMed

    Yadav, Pragya D; Vincent, Martin J; Khristova, Marina; Kale, Charuta; Nichol, Stuart T; Mishra, Akhilesh C; Mourya, Devendra T

    2011-07-01

    Nairobi sheep disease (NSD) virus, the prototype tick-borne virus of the genus Nairovirus, family Bunyaviridae is associated with acute hemorrhagic gastroenteritis in sheep and goats in East and Central Africa. The closely related Ganjam virus found in India is associated with febrile illness in humans and disease in livestock. The complete S, M and L segment sequences of Ganjam and NSD virus and partial sequence analysis of Ganjam viral RNA genome S, M and L segments encoding regions (396 bp, 701 bp and 425 bp) of the viral nucleocapsid (N), glycoprotein precursor (GPC) and L polymerase (L) proteins, respectively, was carried out for multiple Ganjam virus isolates obtained from 1954 to 2002 and from various regions of India. M segments of NSD and Ganjam virus encode a large ORF for the glycoprotein precursor (GPC), (1627 and 1624 amino acids in length, respectively) and their L segments encode a very large L polymerase (3991 amino acids). The complete S, M and L segments of NSD and Ganjam viruses were more closely related to one another than to other characterized nairoviruses, and no evidence of reassortment was found. However, the NSD and Ganjam virus complete M segment differed by 22.90% and 14.70%, for nucleotide and amino acid respectively, and the complete L segment nucleotide and protein differing by 9.90% and 2.70%, respectively among themselves. Ganjam and NSD virus, complete S segment differed by 9.40-10.40% and 3.2-4.10 for nucleotide and proteins while among Ganjam viruses 0.0-6.20% and 0.0-1.4%, variation was found for nucleotide and amino acids. Ganjam virus isolates differed by up to 17% and 11% at the nucleotide level for the partial S and L gene fragments, respectively, with less variation observed at the deduced amino acid level (10.5 and 2%, S and L, respectively). However, the virus partial M gene fragment (which encodes the hypervariable mucin-like domain) of these viruses differed by as much as 56% at the nucleotide level. Phylogenetic analysis of partial sequence differences suggests considerable mixing and movement of Ganjam virus strains within India, with no clear relationship between genetic lineages and virus geographic origin or year of isolation. Surprisingly, NSD virus does not represent a distinct lineage, but appears as a variant with other Ganjam virus among NSD virus group. Copyright © 2011 Elsevier B.V. All rights reserved.

  12. Single-Molecule FISH Reveals Non-selective Packaging of Rift Valley Fever Virus Genome Segments

    PubMed Central

    Wichgers Schreur, Paul J.; Kortekaas, Jeroen

    2016-01-01

    The bunyavirus genome comprises a small (S), medium (M), and large (L) RNA segment of negative polarity. Although genome segmentation confers evolutionary advantages by enabling genome reassortment events with related viruses, genome segmentation also complicates genome replication and packaging. Accumulating evidence suggests that genomes of viruses with eight or more genome segments are incorporated into virions by highly selective processes. Remarkably, little is known about the genome packaging process of the tri-segmented bunyaviruses. Here, we evaluated, by single-molecule RNA fluorescence in situ hybridization (FISH), the intracellular spatio-temporal distribution and replication kinetics of the Rift Valley fever virus (RVFV) genome and determined the segment composition of mature virions. The results reveal that the RVFV genome segments start to replicate near the site of infection before spreading and replicating throughout the cytoplasm followed by translocation to the virion assembly site at the Golgi network. Despite the average intracellular S, M and L genome segments approached a 1:1:1 ratio, major differences in genome segment ratios were observed among cells. We also observed a significant amount of cells lacking evidence of M-segment replication. Analysis of two-segmented replicons and four-segmented viruses subsequently confirmed the previous notion that Golgi recruitment is mediated by the Gn glycoprotein. The absence of colocalization of the different segments in the cytoplasm and the successful rescue of a tri-segmented variant with a codon shuffled M-segment suggested that inter-segment interactions are unlikely to drive the copackaging of the different segments into a single virion. The latter was confirmed by direct visualization of RNPs inside mature virions which showed that the majority of virions lack one or more genome segments. Altogether, this study suggests that RVFV genome packaging is a non-selective process. PMID:27548280

  13. Human adenovirus serotype 12 virion precursors pMu and pVI are cleaved at amino-terminal and carboxy-terminal sites that conform to the adenovirus 2 endoproteinase cleavage consensus sequence.

    PubMed

    Freimuth, P; Anderson, C W

    1993-03-01

    The sequence of a 1158-base pair fragment of the human adenovirus serotype 12 (Ad12) genome was determined. This segment encodes the precursors for virion components Mu and VI. Both Ad12 precursors contain two sequences that conform to a consensus sequence motif for cleavage by the endoproteinase of adenovirus 2 (Ad2). Analysis of the amino terminus of VI and of the peptide fragments found in Ad12 virions demonstrated that these sites are cleaved during Ad12 maturation. This observation suggests that the recognition motif for adenovirus endoproteinases is highly conserved among human serotypes. The adenovirus 2 endoproteinase polypeptide requires additional co-factors for activity (C. W. Anderson, Protein Expression Purif., 1993, 4, 8-15). Synthetic Ad12 or Ad2 pVI carboxy-terminal peptides each permitted efficient cleavage of an artificial endoproteinase substrate by recombinant Ad2 endoproteinase polypeptide.

  14. The Avian-Origin PB1 Gene Segment Facilitated Replication and Transmissibility of the H3N2/1968 Pandemic Influenza Virus

    PubMed Central

    Wendel, Isabel; Rubbenstroth, Dennis; Doedt, Jennifer; Kochs, Georg; Wilhelm, Jochen; Staeheli, Peter; Klenk, Hans-Dieter

    2015-01-01

    ABSTRACT The H2N2/1957 and H3N2/1968 pandemic influenza viruses emerged via the exchange of genomic RNA segments between human and avian viruses. The avian hemagglutinin (HA) allowed the hybrid viruses to escape preexisting immunity in the human population. Both pandemic viruses further received the PB1 gene segment from the avian parent (Y. Kawaoka, S. Krauss, and R. G. Webster, J Virol 63:4603–4608, 1989), but the biological significance of this observation was not understood. To assess whether the avian-origin PB1 segment provided pandemic viruses with some selective advantage, either on its own or via cooperation with the homologous HA segment, we modeled by reverse genetics the reassortment event that led to the emergence of the H3N2/1968 pandemic virus. Using seasonal H2N2 virus A/California/1/66 (Cal) as a surrogate precursor human virus and pandemic virus A/Hong Kong/1/68 (H3N2) (HK) as a source of avian-derived PB1 and HA gene segments, we generated four reassortant recombinant viruses and compared pairs of viruses which differed solely by the origin of PB1. Replacement of the PB1 segment of Cal by PB1 of HK facilitated viral polymerase activity, replication efficiency in human cells, and contact transmission in guinea pigs. A combination of PB1 and HA segments of HK did not enhance replicative fitness of the reassortant virus compared with the single-gene PB1 reassortant. Our data suggest that the avian PB1 segment of the 1968 pandemic virus served to enhance viral growth and transmissibility, likely by enhancing activity of the viral polymerase complex. IMPORTANCE Despite the high impact of influenza pandemics on human health, some mechanisms underlying the emergence of pandemic influenza viruses still are poorly understood. Thus, it was unclear why both H2N2/1957 and H3N2/1968 reassortant pandemic viruses contained, in addition to the avian HA, the PB1 gene segment of the avian parent. Here, we addressed this long-standing question by modeling the emergence of the H3N2/1968 virus from its putative human and avian precursors. We show that the avian PB1 segment increased activity of the viral polymerase and facilitated viral replication. Our results suggest that in addition to the acquisition of antigenically novel HA (i.e., antigenic shift), enhanced viral polymerase activity is required for the emergence of pandemic influenza viruses from their seasonal human precursors. PMID:25631088

  15. The avian-origin PB1 gene segment facilitated replication and transmissibility of the H3N2/1968 pandemic influenza virus.

    PubMed

    Wendel, Isabel; Rubbenstroth, Dennis; Doedt, Jennifer; Kochs, Georg; Wilhelm, Jochen; Staeheli, Peter; Klenk, Hans-Dieter; Matrosovich, Mikhail

    2015-04-01

    The H2N2/1957 and H3N2/1968 pandemic influenza viruses emerged via the exchange of genomic RNA segments between human and avian viruses. The avian hemagglutinin (HA) allowed the hybrid viruses to escape preexisting immunity in the human population. Both pandemic viruses further received the PB1 gene segment from the avian parent (Y. Kawaoka, S. Krauss, and R. G. Webster, J Virol 63:4603-4608, 1989), but the biological significance of this observation was not understood. To assess whether the avian-origin PB1 segment provided pandemic viruses with some selective advantage, either on its own or via cooperation with the homologous HA segment, we modeled by reverse genetics the reassortment event that led to the emergence of the H3N2/1968 pandemic virus. Using seasonal H2N2 virus A/California/1/66 (Cal) as a surrogate precursor human virus and pandemic virus A/Hong Kong/1/68 (H3N2) (HK) as a source of avian-derived PB1 and HA gene segments, we generated four reassortant recombinant viruses and compared pairs of viruses which differed solely by the origin of PB1. Replacement of the PB1 segment of Cal by PB1 of HK facilitated viral polymerase activity, replication efficiency in human cells, and contact transmission in guinea pigs. A combination of PB1 and HA segments of HK did not enhance replicative fitness of the reassortant virus compared with the single-gene PB1 reassortant. Our data suggest that the avian PB1 segment of the 1968 pandemic virus served to enhance viral growth and transmissibility, likely by enhancing activity of the viral polymerase complex. Despite the high impact of influenza pandemics on human health, some mechanisms underlying the emergence of pandemic influenza viruses still are poorly understood. Thus, it was unclear why both H2N2/1957 and H3N2/1968 reassortant pandemic viruses contained, in addition to the avian HA, the PB1 gene segment of the avian parent. Here, we addressed this long-standing question by modeling the emergence of the H3N2/1968 virus from its putative human and avian precursors. We show that the avian PB1 segment increased activity of the viral polymerase and facilitated viral replication. Our results suggest that in addition to the acquisition of antigenically novel HA (i.e., antigenic shift), enhanced viral polymerase activity is required for the emergence of pandemic influenza viruses from their seasonal human precursors. Copyright © 2015, American Society for Microbiology. All Rights Reserved.

  16. Novel pedigree analysis implicates DNA repair and chromatin remodeling in multiple myeloma risk

    PubMed Central

    Curtin, Karen; Rajamanickam, Venkatesh; Jayabalan, David; Atanackovic, Djordje; Rajkumar, S. Vincent; Kumar, Shaji; Slager, Susan; Galia, Perrine; Demangel, Delphine; Salama, Mohamed; Joseph, Vijai; Lipkin, Steven M.; Dumontet, Charles; Vachon, Celine M.

    2018-01-01

    The high-risk pedigree (HRP) design is an established strategy to discover rare, highly-penetrant, Mendelian-like causal variants. Its success, however, in complex traits has been modest, largely due to challenges of genetic heterogeneity and complex inheritance models. We describe a HRP strategy that addresses intra-familial heterogeneity, and identifies inherited segments important for mapping regulatory risk. We apply this new Shared Genomic Segment (SGS) method in 11 extended, Utah, multiple myeloma (MM) HRPs, and subsequent exome sequencing in SGS regions of interest in 1063 MM / MGUS (monoclonal gammopathy of undetermined significance–a precursor to MM) cases and 964 controls from a jointly-called collaborative resource, including cases from the initial 11 HRPs. One genome-wide significant 1.8 Mb shared segment was found at 6q16. Exome sequencing in this region revealed predicted deleterious variants in USP45 (p.Gln691* and p.Gln621Glu), a gene known to influence DNA repair through endonuclease regulation. Additionally, a 1.2 Mb segment at 1p36.11 is inherited in two Utah HRPs, with coding variants identified in ARID1A (p.Ser90Gly and p.Met890Val), a key gene in the SWI/SNF chromatin remodeling complex. Our results provide compelling statistical and genetic evidence for segregating risk variants for MM. In addition, we demonstrate a novel strategy to use large HRPs for risk-variant discovery more generally in complex traits. PMID:29389935

  17. Novel pedigree analysis implicates DNA repair and chromatin remodeling in multiple myeloma risk.

    PubMed

    Waller, Rosalie G; Darlington, Todd M; Wei, Xiaomu; Madsen, Michael J; Thomas, Alun; Curtin, Karen; Coon, Hilary; Rajamanickam, Venkatesh; Musinsky, Justin; Jayabalan, David; Atanackovic, Djordje; Rajkumar, S Vincent; Kumar, Shaji; Slager, Susan; Middha, Mridu; Galia, Perrine; Demangel, Delphine; Salama, Mohamed; Joseph, Vijai; McKay, James; Offit, Kenneth; Klein, Robert J; Lipkin, Steven M; Dumontet, Charles; Vachon, Celine M; Camp, Nicola J

    2018-02-01

    The high-risk pedigree (HRP) design is an established strategy to discover rare, highly-penetrant, Mendelian-like causal variants. Its success, however, in complex traits has been modest, largely due to challenges of genetic heterogeneity and complex inheritance models. We describe a HRP strategy that addresses intra-familial heterogeneity, and identifies inherited segments important for mapping regulatory risk. We apply this new Shared Genomic Segment (SGS) method in 11 extended, Utah, multiple myeloma (MM) HRPs, and subsequent exome sequencing in SGS regions of interest in 1063 MM / MGUS (monoclonal gammopathy of undetermined significance-a precursor to MM) cases and 964 controls from a jointly-called collaborative resource, including cases from the initial 11 HRPs. One genome-wide significant 1.8 Mb shared segment was found at 6q16. Exome sequencing in this region revealed predicted deleterious variants in USP45 (p.Gln691* and p.Gln621Glu), a gene known to influence DNA repair through endonuclease regulation. Additionally, a 1.2 Mb segment at 1p36.11 is inherited in two Utah HRPs, with coding variants identified in ARID1A (p.Ser90Gly and p.Met890Val), a key gene in the SWI/SNF chromatin remodeling complex. Our results provide compelling statistical and genetic evidence for segregating risk variants for MM. In addition, we demonstrate a novel strategy to use large HRPs for risk-variant discovery more generally in complex traits.

  18. Assignment of simian rotavirus SA11 temperature-sensitive mutant groups B and E to genome segments

    DOE Office of Scientific and Technical Information (OSTI.GOV)

    Gombold, J.L.; Estes, M.K.; Ramig, R.F.

    1985-05-01

    Recombinant (reassortant) viruses were selected from crosses between temperature-sensitive (ts) mutants of simian rotavirus SA11 and wild-type human rotavirus Wa. The double-stranded genome RNAs of the reassortants were examined by electrophoresis in Tris-glycine-buffered polyacrylamide gels and by dot hybridization with a cloned DNA probe for genome segment 2. Analysis of replacements of genome segments in the reassortants allowed construction of a map correlating genome segments providing functions interchangeable between SA11 and Wa. The reassortants revealed a functional correspondence in order of increasing electrophoretic mobility of genome segments. Analysis of the parental origin of genome segments in ts+ SA11/Wa reassortants derivedmore » from the crosses SA11 tsB(339) X Wa and SA11 tsE(1400) X Wa revealed that the group B lesion of tsB(339) was located on genome segment 3 and the group E lesion of tsE(1400) was on segment 8.« less

  19. Markov models of genome segmentation

    NASA Astrophysics Data System (ADS)

    Thakur, Vivek; Azad, Rajeev K.; Ramaswamy, Ram

    2007-01-01

    We introduce Markov models for segmentation of symbolic sequences, extending a segmentation procedure based on the Jensen-Shannon divergence that has been introduced earlier. Higher-order Markov models are more sensitive to the details of local patterns and in application to genome analysis, this makes it possible to segment a sequence at positions that are biologically meaningful. We show the advantage of higher-order Markov-model-based segmentation procedures in detecting compositional inhomogeneity in chimeric DNA sequences constructed from genomes of diverse species, and in application to the E. coli K12 genome, boundaries of genomic islands, cryptic prophages, and horizontally acquired regions are accurately identified.

  20. Laser ablation of persistent twist cells in Drosophila: muscle precursor fate is not segmentally restricted

    NASA Technical Reports Server (NTRS)

    Farrell, E. R.; Keshishian, H.

    1999-01-01

    In Drosophila the precursors of the adult musculature arise during embryogenesis. These precursor cells have been termed Persistent Twist Cells (PTCs), as they continue to express the transcription factor Twist after that gene ceases expression elsewhere in the mesoderm. In the larval abdomen, the PTCs are associated with peripheral nerves in stereotypic ventral, dorsal, and lateral clusters, which give rise, respectively, to the ventral, dorsal, and lateral muscle fiber groups of the adult. We tested the developmental potential of the PTCs by using a microbeam laser to ablate specific clusters in larvae. We found that the ablation of a single segmental PTC cluster does not usually result in the deletion of the corresponding adult fibers of that segment. Instead, normal or near normal numbers of adult fibers can form after the ablation. Examination of pupae following ablation showed that migrating PTCs from adjacent segments are able to invade the affected segment, replenishing the ablated cells. However, the ablation of homologous PTCs in multiple segments does result in the deletion of the corresponding adult muscle fibers. These data indicate that the PTCs in an abdominal segment can contribute to the formation of muscle fibers in adjacent abdominal segments, and thus are not inherently restricted to the formation of muscle fibers within their segment of origin.

  1. Characterization and Construction of Functional cDNA Clones of Pariacoto Virus, the First Alphanodavirus Isolated outside Australasia

    PubMed Central

    Johnson, Karyn N.; Zeddam, Jean-Louis; Ball, L. Andrew

    2000-01-01

    Pariacoto virus (PaV) was recently isolated in Peru from the Southern armyworm (Spodoptera eridania). PaV particles are isometric, nonenveloped, and about 30 nm in diameter. The virus has a bipartite RNA genome and a single major capsid protein with a molecular mass of 39.0 kDa, features that support its classification as a Nodavirus. As such, PaV is the first Alphanodavirus to have been isolated from outside Australasia. Here we report that PaV replicates in wax moth larvae and that PaV genomic RNAs replicate when transfected into cultured baby hamster kidney cells. The complete nucleotide sequences of both segments of the bipartite RNA genome were determined. The larger genome segment, RNA1, is 3,011 nucleotides long and contains a 973-amino-acid open reading frame (ORF) encoding protein A, the viral contribution to the RNA replicase. During replication, a 414-nucleotide long subgenomic RNA (RNA3) is synthesized which is coterminal with the 3′ end of RNA1. RNA3 contains a small ORF which could encode a protein of 90 amino acids similar to the B2 protein of other alphanodaviruses. RNA2 contains 1,311 nucleotides and encodes the 401 amino acids of the capsid protein precursor α. The amino acid sequences of the PaV capsid protein and the replicase subunit share 41 and 26% identity with homologous proteins of Flock house virus, the best characterized of the alphanodaviruses. These and other sequence comparisons indicate that PaV is evolutionarily the most distant of the alphanodaviruses described to date, consistent with its novel geographic origin. Although the PaV capsid precursor is cleaved into the two mature capsid proteins β and γ, the amino acid sequence at the cleavage site, which is Asn/Ala in all other alphanodaviruses, is Asn/Ser in PaV. To facilitate the investigation of PaV replication in cultured cells, we constructed plasmids that transcribed full-length PaV RNAs with authentic 5′ and 3′ termini. Transcription of these plasmids in cells recreated the replication of PaV RNA1 and RNA2, synthesis of subgenomic RNA3, and translation of viral proteins A and α. PMID:10799587

  2. Characterization and construction of functional cDNA clones of Pariacoto virus, the first Alphanodavirus isolated outside Australasia.

    PubMed

    Johnson, K N; Zeddam, J L; Ball, L A

    2000-06-01

    Pariacoto virus (PaV) was recently isolated in Peru from the Southern armyworm (Spodoptera eridania). PaV particles are isometric, nonenveloped, and about 30 nm in diameter. The virus has a bipartite RNA genome and a single major capsid protein with a molecular mass of 39.0 kDa, features that support its classification as a Nodavirus. As such, PaV is the first Alphanodavirus to have been isolated from outside Australasia. Here we report that PaV replicates in wax moth larvae and that PaV genomic RNAs replicate when transfected into cultured baby hamster kidney cells. The complete nucleotide sequences of both segments of the bipartite RNA genome were determined. The larger genome segment, RNA1, is 3,011 nucleotides long and contains a 973-amino-acid open reading frame (ORF) encoding protein A, the viral contribution to the RNA replicase. During replication, a 414-nucleotide long subgenomic RNA (RNA3) is synthesized which is coterminal with the 3' end of RNA1. RNA3 contains a small ORF which could encode a protein of 90 amino acids similar to the B2 protein of other alphanodaviruses. RNA2 contains 1,311 nucleotides and encodes the 401 amino acids of the capsid protein precursor alpha. The amino acid sequences of the PaV capsid protein and the replicase subunit share 41 and 26% identity with homologous proteins of Flock house virus, the best characterized of the alphanodaviruses. These and other sequence comparisons indicate that PaV is evolutionarily the most distant of the alphanodaviruses described to date, consistent with its novel geographic origin. Although the PaV capsid precursor is cleaved into the two mature capsid proteins beta and gamma, the amino acid sequence at the cleavage site, which is Asn/Ala in all other alphanodaviruses, is Asn/Ser in PaV. To facilitate the investigation of PaV replication in cultured cells, we constructed plasmids that transcribed full-length PaV RNAs with authentic 5' and 3' termini. Transcription of these plasmids in cells recreated the replication of PaV RNA1 and RNA2, synthesis of subgenomic RNA3, and translation of viral proteins A and alpha.

  3. Ins and Outs of Multipartite Positive-Strand RNA Plant Viruses: Packaging versus Systemic Spread

    PubMed Central

    Dall’Ara, Mattia; Ratti, Claudio; Bouzoubaa, Salah E.; Gilmer, David

    2016-01-01

    Viruses possessing a non-segmented genome require a specific recognition of their nucleic acid to ensure its protection in a capsid. A similar feature exists for viruses having a segmented genome, usually consisting of viral genomic segments joined together into one viral entity. While this appears as a rule for animal viruses, the majority of segmented plant viruses package their genomic segments individually. To ensure a productive infection, all viral particles and thereby all segments have to be present in the same cell. Progression of the virus within the plant requires as well a concerted genome preservation to avoid loss of function. In this review, we will discuss the “life aspects” of chosen phytoviruses and argue for the existence of RNA-RNA interactions that drive the preservation of viral genome integrity while the virus progresses in the plant. PMID:27548199

  4. Exploration of sequence space as the basis of viral RNA genome segmentation.

    PubMed

    Moreno, Elena; Ojosnegros, Samuel; García-Arriaza, Juan; Escarmís, Cristina; Domingo, Esteban; Perales, Celia

    2014-05-06

    The mechanisms of viral RNA genome segmentation are unknown. On extensive passage of foot-and-mouth disease virus in baby hamster kidney-21 cells, the virus accumulated multiple point mutations and underwent a transition akin to genome segmentation. The standard single RNA genome molecule was replaced by genomes harboring internal in-frame deletions affecting the L- or capsid-coding region. These genomes were infectious and killed cells by complementation. Here we show that the point mutations in the nonstructural protein-coding region (P2, P3) that accumulated in the standard genome before segmentation increased the relative fitness of the segmented version relative to the standard genome. Fitness increase was documented by intracellular expression of virus-coded proteins and infectious progeny production by RNAs with the internal deletions placed in the sequence context of the parental and evolved genome. The complementation activity involved several viral proteins, one of them being the leader proteinase L. Thus, a history of genetic drift with accumulation of point mutations was needed to allow a major variation in the structure of a viral genome. Thus, exploration of sequence space by a viral genome (in this case an unsegmented RNA) can reach a point of the space in which a totally different genome structure (in this case, a segmented RNA) is favored over the form that performed the exploration.

  5. The Influenza A Virus PB2, PA, NP, and M Segments Play a Pivotal Role during Genome Packaging

    PubMed Central

    Gao, Qinshan; Chou, Yi-Ying; Doğanay, Sultan; Vafabakhsh, Reza; Ha, Taekjip

    2012-01-01

    The genomes of influenza A viruses consist of eight negative-strand RNA segments. Recent studies suggest that influenza viruses are able to specifically package their segmented genomes into the progeny virions. Segment-specific packaging signals of influenza virus RNAs (vRNAs) are located in the 5′ and 3′ noncoding regions, as well as in the terminal regions, of the open reading frames. How these packaging signals function during genome packaging remains unclear. Previously, we generated a 7-segmented virus in which the hemagglutinin (HA) and neuraminidase (NA) segments of the influenza A/Puerto Rico/8/34 virus were replaced by a chimeric influenza C virus hemagglutinin/esterase/fusion (HEF) segment carrying the HA packaging sequences. The robust growth of the HEF virus suggested that the NA segment is not required for the packaging of other segments. In this study, in order to determine the roles of the other seven segments during influenza A virus genome assembly, we continued to use this HEF virus as a tool and analyzed the effects of replacing the packaging sequences of other segments with those of the NA segment. Our results showed that deleting the packaging signals of the PB1, HA, or NS segment had no effect on the growth of the HEF virus, while growth was greatly impaired when the packaging sequence of the PB2, PA, nucleoprotein (NP), or matrix (M) segment was removed. These results indicate that the PB2, PA, NP, and M segments play a more important role than the remaining four vRNAs during the genome-packaging process. PMID:22532680

  6. Assessing the Robustness of Complete Bacterial Genome Segmentations

    NASA Astrophysics Data System (ADS)

    Devillers, Hugo; Chiapello, Hélène; Schbath, Sophie; El Karoui, Meriem

    Comparison of closely related bacterial genomes has revealed the presence of highly conserved sequences forming a "backbone" that is interrupted by numerous, less conserved, DNA fragments. Segmentation of bacterial genomes into backbone and variable regions is particularly useful to investigate bacterial genome evolution. Several software tools have been designed to compare complete bacterial chromosomes and a few online databases store pre-computed genome comparisons. However, very few statistical methods are available to evaluate the reliability of these software tools and to compare the results obtained with them. To fill this gap, we have developed two local scores to measure the robustness of bacterial genome segmentations. Our method uses a simulation procedure based on random perturbations of the compared genomes. The scores presented in this paper are simple to implement and our results show that they allow to discriminate easily between robust and non-robust bacterial genome segmentations when using aligners such as MAUVE and MGA.

  7. Complete Coding Genome Sequence for Mogiana Tick Virus, a Jingmenvirus Isolated from Ticks in Brazil

    DTIC Science & Technology

    2017-05-04

    and capable of infecting a wide range of animal hosts (1–5). Here, we report the complete coding genome sequence (i.e., only missing portions of...segmented nature of the genome was not under- stood. Therefore, only the two genome segments with detectable sequence homolo- gies to flaviviruses were...originally reported (2). We revisited the data set of Maruyama et al. (2) and assembled the complete coding sequences for all four genome segments. We

  8. Asymmetric histone modifications between the original and derived loci of human segmental duplications

    PubMed Central

    Zheng, Deyou

    2008-01-01

    Background Sequencing and annotation of several mammalian genomes have revealed that segmental duplications are a common architectural feature of primate genomes; in fact, about 5% of the human genome is composed of large blocks of interspersed segmental duplications. These segmental duplications have been implicated in genomic copy-number variation, gene novelty, and various genomic disorders. However, the molecular processes involved in the evolution and regulation of duplicated sequences remain largely unexplored. Results In this study, the profile of about 20 histone modifications within human segmental duplications was characterized using high-resolution, genome-wide data derived from a ChIP-Seq study. The analysis demonstrates that derivative loci of segmental duplications often differ significantly from the original with respect to many histone methylations. Further investigation showed that genes are present three times more frequently in the original than in the derivative, whereas pseudogenes exhibit the opposite trend. These asymmetries tend to increase with the age of segmental duplications. The uneven distribution of genes and pseudogenes does not, however, fully account for the asymmetry in the profile of histone modifications. Conclusion The first systematic analysis of histone modifications between segmental duplications demonstrates that two seemingly 'identical' genomic copies are distinct in their epigenomic properties. Results here suggest that local chromatin environments may be implicated in the discrimination of derived copies of segmental duplications from their originals, leading to a biased pseudogenization of the new duplicates. The data also indicate that further exploration of the interactions between histone modification and sequence degeneration is necessary in order to understand the divergence of duplicated sequences. PMID:18598352

  9. A tick-borne segmented RNA virus contains genome segments derived from unsegmented viral ancestors

    PubMed Central

    Qin, Xin-Cheng; Shi, Mang; Tian, Jun-Hua; Lin, Xian-Dan; Gao, Dong-Ya; He, Jin-Rong; Wang, Jian-Bo; Li, Ci-Xiu; Kang, Yan-Jun; Yu, Bin; Zhou, Dun-Jin; Xu, Jianguo; Plyusnin, Alexander; Holmes, Edward C.; Zhang, Yong-Zhen

    2014-01-01

    Although segmented and unsegmented RNA viruses are commonplace, the evolutionary links between these two very different forms of genome organization are unclear. We report the discovery and characterization of a tick-borne virus—Jingmen tick virus (JMTV)—that reveals an unexpected connection between segmented and unsegmented RNA viruses. The JMTV genome comprises four segments, two of which are related to the nonstructural protein genes of the genus Flavivirus (family Flaviviridae), whereas the remaining segments are unique to this virus, have no known homologs, and contain a number of features indicative of structural protein genes. Remarkably, homology searching revealed that sequences related to JMTV were present in the cDNA library from Toxocara canis (dog roundworm; Nematoda), and that shared strong sequence and structural resemblances. Epidemiological studies showed that JMTV is distributed in tick populations across China, especially Rhipicephalus and Haemaphysalis spp., and experiences frequent host-switching and genomic reassortment. To our knowledge, JMTV is the first example of a segmented RNA virus with a genome derived in part from unsegmented viral ancestors. PMID:24753611

  10. Pax-3 expression in segmental mesoderm marks early stages in myogenic cell specification.

    PubMed

    Williams, B A; Ordahl, C P

    1994-04-01

    Specification of the myogenic lineage begins prior to gastrulation and culminates in the emergence of determined myogenic precursor cells from the somites. The myoD family (MDF) of transcriptional activators controls late step(s) in myogenic specification that are closely followed by terminal muscle differentiation. Genes expressed in myogenic specification at stages earlier than MDFs are unknown. The Pax-3 gene is expressed in all the cells of the caudal segmental plate, the early mesoderm compartment that contains the precursors of skeletal muscle. As somites form from the segmental plate and mature, Pax-3 expression is progressively modulated. Beginning at the time of segmentation, Pax-3 becomes repressed in the ventral half of the somite, leaving Pax-3 expression only in the dermomyotome. Subsequently, differential modulation of Pax-3 expression levels delineates the medial and lateral halves of the dermomyotome, which contain precursors of axial (back) muscle and limb muscle, respectively. Pax-3 expression is then repressed as dermomyotome-derived cells activate MDFs. Quail-chick chimera and ablation experiments confirmed that the migratory precursors of limb muscle continue to express Pax-3 during migration. Since limb muscle precursors do not activate MDFs until 2 days after they leave the somite, Pax-3 represents the first molecular marker for this migratory cell population. A null mutation of the mouse Pax-3 gene, Splotch, produces major disruptions in early limb muscle development (Franz, T., Kothary, R., Surani, M. A. H., Halata, Z. and Grim, M. (1993) Anat. Embryol. 187, 153-160; Goulding, M., Lumsden, A. and Paquette, A. (1994) Development 120, 957-971). We conclude, therefore, that Pax-3 gene expression in the paraxial mesoderm marks earlier stages in myogenic specification than MDFs and plays a crucial role in the specification and/or migration of limb myogenic precursors.

  11. Segmental Duplications and Copy-Number Variation in the Human Genome

    PubMed Central

    Sharp, Andrew J. ; Locke, Devin P. ; McGrath, Sean D. ; Cheng, Ze ; Bailey, Jeffrey A. ; Vallente, Rhea U. ; Pertz, Lisa M. ; Clark, Royden A. ; Schwartz, Stuart ; Segraves, Rick ; Oseroff, Vanessa V. ; Albertson, Donna G. ; Pinkel, Daniel ; Eichler, Evan E. 

    2005-01-01

    The human genome contains numerous blocks of highly homologous duplicated sequence. This higher-order architecture provides a substrate for recombination and recurrent chromosomal rearrangement associated with genomic disease. However, an assessment of the role of segmental duplications in normal variation has not yet been made. On the basis of the duplication architecture of the human genome, we defined a set of 130 potential rearrangement hotspots and constructed a targeted bacterial artificial chromosome (BAC) microarray (with 2,194 BACs) to assess copy-number variation in these regions by array comparative genomic hybridization. Using our segmental duplication BAC microarray, we screened a panel of 47 normal individuals, who represented populations from four continents, and we identified 119 regions of copy-number polymorphism (CNP), 73 of which were previously unreported. We observed an equal frequency of duplications and deletions, as well as a 4-fold enrichment of CNPs within hotspot regions, compared with control BACs (P < .000001), which suggests that segmental duplications are a major catalyst of large-scale variation in the human genome. Importantly, segmental duplications themselves were also significantly enriched >4-fold within regions of CNP. Almost without exception, CNPs were not confined to a single population, suggesting that these either are recurrent events, having occurred independently in multiple founders, or were present in early human populations. Our study demonstrates that segmental duplications define hotspots of chromosomal rearrangement, likely acting as mediators of normal variation as well as genomic disease, and it suggests that the consideration of genomic architecture can significantly improve the ascertainment of large-scale rearrangements. Our specialized segmental duplication BAC microarray and associated database of structural polymorphisms will provide an important resource for the future characterization of human genomic disorders. PMID:15918152

  12. Transgenerationally inherited piRNAs trigger piRNA biogenesis by changing the chromatin of piRNA clusters and inducing precursor processing

    PubMed Central

    Le Thomas, Adrien; Stuwe, Evelyn; Li, Sisi; Marinov, Georgi; Rozhkov, Nikolay; Chen, Yung-Chia Ariel; Luo, Yicheng; Sachidanandam, Ravi; Toth, Katalin Fejes; Patel, Dinshaw; Aravin, Alexei A.

    2014-01-01

    Small noncoding RNAs that associate with Piwi proteins, called piRNAs, serve as guides for repression of diverse transposable elements in germ cells of metazoa. In Drosophila, the genomic regions that give rise to piRNAs, the so-called piRNA clusters, are transcribed to generate long precursor molecules that are processed into mature piRNAs. How genomic regions that give rise to piRNA precursor transcripts are differentiated from the rest of the genome and how these transcripts are specifically channeled into the piRNA biogenesis pathway are not known. We found that transgenerationally inherited piRNAs provide the critical trigger for piRNA production from homologous genomic regions in the next generation by two different mechanisms. First, inherited piRNAs enhance processing of homologous transcripts into mature piRNAs by initiating the ping-pong cycle in the cytoplasm. Second, inherited piRNAs induce installment of the histone 3 Lys9 trimethylation (H3K9me3) mark on genomic piRNA cluster sequences. The heterochromatin protein 1 (HP1) homolog Rhino binds to the H3K9me3 mark through its chromodomain and is enriched over piRNA clusters. Rhino recruits the piRNA biogenesis factor Cutoff to piRNA clusters and is required for efficient transcription of piRNA precursors. We propose that transgenerationally inherited piRNAs act as an epigenetic memory for identification of substrates for piRNA biogenesis on two levels: by inducing a permissive chromatin environment for piRNA precursor synthesis and by enhancing processing of these precursors. PMID:25085419

  13. Mutational Analysis of the Rift Valley Fever Virus Glycoprotein Precursor Proteins for Gn Protein Expression

    PubMed Central

    Phoenix, Inaia; Lokugamage, Nandadeva; Nishiyama, Shoko; Ikegami, Tetsuro

    2016-01-01

    The Rift Valley fever virus (RVFV) M-segment encodes the 78 kD, NSm, Gn, and Gc proteins. The 1st AUG generates the 78 kD-Gc precursor, the 2nd AUG generates the NSm-Gn-Gc precursor, and the 3rd AUG makes the NSm’-Gn-Gc precursor. To understand biological changes due to abolishment of the precursors, we quantitatively measured Gn secretion using a reporter assay, in which a Gaussia luciferase (gLuc) protein is fused to the RVFV M-segment pre-Gn region. Using the reporter assay, the relative expression of Gn/gLuc fusion proteins was analyzed among various AUG mutants. The reporter assay showed efficient secretion of Gn/gLuc protein from the precursor made from the 2nd AUG, while the removal of the untranslated region upstream of the 2nd AUG (AUG2-M) increased the secretion of the Gn/gLuc protein. Subsequently, recombinant MP-12 strains encoding mutations in the pre-Gn region were rescued, and virological phenotypes were characterized. Recombinant MP-12 encoding the AUG2-M mutation replicated slightly less efficiently than the control, indicating that viral replication is further influenced by the biological processes occurring after Gn expression, rather than the Gn abundance. This study showed that, not only the abolishment of AUG, but also the truncation of viral UTR, affects the expression of Gn protein by the RVFV M-segment. PMID:27231931

  14. Mutational Analysis of the Rift Valley Fever Virus Glycoprotein Precursor Proteins for Gn Protein Expression.

    PubMed

    Phoenix, Inaia; Lokugamage, Nandadeva; Nishiyama, Shoko; Ikegami, Tetsuro

    2016-05-24

    The Rift Valley fever virus (RVFV) M-segment encodes the 78 kD, NSm, Gn, and Gc proteins. The 1st AUG generates the 78 kD-Gc precursor, the 2nd AUG generates the NSm-Gn-Gc precursor, and the 3rd AUG makes the NSm'-Gn-Gc precursor. To understand biological changes due to abolishment of the precursors, we quantitatively measured Gn secretion using a reporter assay, in which a Gaussia luciferase (gLuc) protein is fused to the RVFV M-segment pre-Gn region. Using the reporter assay, the relative expression of Gn/gLuc fusion proteins was analyzed among various AUG mutants. The reporter assay showed efficient secretion of Gn/gLuc protein from the precursor made from the 2nd AUG, while the removal of the untranslated region upstream of the 2nd AUG (AUG2-M) increased the secretion of the Gn/gLuc protein. Subsequently, recombinant MP-12 strains encoding mutations in the pre-Gn region were rescued, and virological phenotypes were characterized. Recombinant MP-12 encoding the AUG2-M mutation replicated slightly less efficiently than the control, indicating that viral replication is further influenced by the biological processes occurring after Gn expression, rather than the Gn abundance. This study showed that, not only the abolishment of AUG, but also the truncation of viral UTR, affects the expression of Gn protein by the RVFV M-segment.

  15. Origin of amphibian and avian chromosomes by fission, fusion, and retention of ancestral chromosomes

    PubMed Central

    Voss, Stephen R.; Kump, D. Kevin; Putta, Srikrishna; Pauly, Nathan; Reynolds, Anna; Henry, Rema J.; Basa, Saritha; Walker, John A.; Smith, Jeramiah J.

    2011-01-01

    Amphibian genomes differ greatly in DNA content and chromosome size, morphology, and number. Investigations of this diversity are needed to identify mechanisms that have shaped the evolution of vertebrate genomes. We used comparative mapping to investigate the organization of genes in the Mexican axolotl (Ambystoma mexicanum), a species that presents relatively few chromosomes (n = 14) and a gigantic genome (>20 pg/N). We show extensive conservation of synteny between Ambystoma, chicken, and human, and a positive correlation between the length of conserved segments and genome size. Ambystoma segments are estimated to be four to 51 times longer than homologous human and chicken segments. Strikingly, genes demarking the structures of 28 chicken chromosomes are ordered among linkage groups defining the Ambystoma genome, and we show that these same chromosomal segments are also conserved in a distantly related anuran amphibian (Xenopus tropicalis). Using linkage relationships from the amphibian maps, we predict that three chicken chromosomes originated by fusion, nine to 14 originated by fission, and 12–17 evolved directly from ancestral tetrapod chromosomes. We further show that some ancestral segments were fused prior to the divergence of salamanders and anurans, while others fused independently and randomly as chromosome numbers were reduced in lineages leading to Ambystoma and Xenopus. The maintenance of gene order relationships between chromosomal segments that have greatly expanded and contracted in salamander and chicken genomes, respectively, suggests selection to maintain synteny relationships and/or extremely low rates of chromosomal rearrangement. Overall, the results demonstrate the value of data from diverse, amphibian genomes in studies of vertebrate genome evolution. PMID:21482624

  16. Heuristic Bayesian segmentation for discovery of coexpressed genes within genomic regions.

    PubMed

    Pehkonen, Petri; Wong, Garry; Törönen, Petri

    2010-01-01

    Segmentation aims to separate homogeneous areas from the sequential data, and plays a central role in data mining. It has applications ranging from finance to molecular biology, where bioinformatics tasks such as genome data analysis are active application fields. In this paper, we present a novel application of segmentation in locating genomic regions with coexpressed genes. We aim at automated discovery of such regions without requirement for user-given parameters. In order to perform the segmentation within a reasonable time, we use heuristics. Most of the heuristic segmentation algorithms require some decision on the number of segments. This is usually accomplished by using asymptotic model selection methods like the Bayesian information criterion. Such methods are based on some simplification, which can limit their usage. In this paper, we propose a Bayesian model selection to choose the most proper result from heuristic segmentation. Our Bayesian model presents a simple prior for the segmentation solutions with various segment numbers and a modified Dirichlet prior for modeling multinomial data. We show with various artificial data sets in our benchmark system that our model selection criterion has the best overall performance. The application of our method in yeast cell-cycle gene expression data reveals potential active and passive regions of the genome.

  17. Partial structure of the phylloxin gene from the giant monkey frog, Phyllomedusa bicolor: parallel cloning of precursor cDNA and genomic DNA from lyophilized skin secretion.

    PubMed

    Chen, Tianbao; Gagliardo, Ron; Walker, Brian; Zhou, Mei; Shaw, Chris

    2005-12-01

    Phylloxin is a novel prototype antimicrobial peptide from the skin of Phyllomedusa bicolor. Here, we describe parallel identification and sequencing of phylloxin precursor transcript (mRNA) and partial gene structure (genomic DNA) from the same sample of lyophilized skin secretion using our recently-described cloning technique. The open-reading frame of the phylloxin precursor was identical in nucleotide sequence to that previously reported and alignment with the nucleotide sequence derived from genomic DNA indicated the presence of a 175 bp intron located in a near identical position to that found in the dermaseptins. The highly-conserved structural organization of skin secretion peptide genes in P. bicolor can thus be extended to include that encoding phylloxin (plx). These data further reinforce our assertion that application of the described methodology can provide robust genomic/transcriptomic/peptidomic data without the need for specimen sacrifice.

  18. MOSAIC: an online database dedicated to the comparative genomics of bacterial strains at the intra-species level.

    PubMed

    Chiapello, Hélène; Gendrault, Annie; Caron, Christophe; Blum, Jérome; Petit, Marie-Agnès; El Karoui, Meriem

    2008-11-27

    The recent availability of complete sequences for numerous closely related bacterial genomes opens up new challenges in comparative genomics. Several methods have been developed to align complete genomes at the nucleotide level but their use and the biological interpretation of results are not straightforward. It is therefore necessary to develop new resources to access, analyze, and visualize genome comparisons. Here we present recent developments on MOSAIC, a generalist comparative bacterial genome database. This database provides the bacteriologist community with easy access to comparisons of complete bacterial genomes at the intra-species level. The strategy we developed for comparison allows us to define two types of regions in bacterial genomes: backbone segments (i.e., regions conserved in all compared strains) and variable segments (i.e., regions that are either specific to or variable in one of the aligned genomes). Definition of these segments at the nucleotide level allows precise comparative and evolutionary analyses of both coding and non-coding regions of bacterial genomes. Such work is easily performed using the MOSAIC Web interface, which allows browsing and graphical visualization of genome comparisons. The MOSAIC database now includes 493 pairwise comparisons and 35 multiple maximal comparisons representing 78 bacterial species. Genome conserved regions (backbones) and variable segments are presented in various formats for further analysis. A graphical interface allows visualization of aligned genomes and functional annotations. The MOSAIC database is available online at http://genome.jouy.inra.fr/mosaic.

  19. Microplitis demolitor bracovirus genome segments vary in abundance and are individually packaged in virions

    DOE Office of Scientific and Technical Information (OSTI.GOV)

    Beck, Markus H.; Inman, Ross B.; Strand, Michael R.

    2007-03-01

    Polydnaviruses (PDVs) are distinguished by their unique association with parasitoid wasps and their segmented, double-stranded (ds) DNA genomes that are non-equimolar in abundance. Relatively little is actually known, however, about genome packaging or segment abundance of these viruses. Here, we conducted electron microscopy (EM) and real-time polymerase chain reaction (PCR) studies to characterize packaging and segment abundance of Microplitis demolitor bracovirus (MdBV). Like other PDVs, MdBV replicates in the ovaries of females where virions accumulate to form a suspension called calyx fluid. Wasps then inject a quantity of calyx fluid when ovipositing into hosts. The MdBV genome consists of 15more » segments that range from 3.6 (segment A) to 34.3 kb (segment O). EM analysis indicated that MdBV virions contain a single nucleocapsid that encapsidates one circular DNA of variable size. We developed a semi-quantitative real-time PCR assay using SYBR Green I. This assay indicated that five (J, O, H, N and B) segments of the MdBV genome accounted for more than 60% of the viral DNAs in calyx fluid. Estimates of relative segment abundance using our real-time PCR assay were also very similar to DNA size distributions determined from micrographs. Analysis of parasitized Pseudoplusia includens larvae indicated that copy number of MdBV segments C, B and J varied between hosts but their relative abundance within a host was virtually identical to their abundance in calyx fluid. Among-tissue assays indicated that each viral segment was most abundant in hemocytes and least abundant in salivary glands. However, the relative abundance of each segment to one another was similar in all tissues. We also found no clear relationship between MdBV segment and transcript abundance in hemocytes and fat body.« less

  20. Complete genome sequence and phylogenetic analyses of an aquabirnavirus isolated from a diseased marbled eel culture in Taiwan.

    PubMed

    Wen, Chiu-Ming

    2017-08-01

    An aquabirnavirus was isolated from diseased marbled eels (Anguilla marmorata; MEIPNV1310) with gill haemorrhages and associated mortality. Its genome segment sequences were obtained through next-generation sequencing and compared with published aquabirnavirus sequences. The results indicated that the genome sequence of MEIPNV1310 contains segment A (3099 nucleotides) and segment B (2789 nucleotides). Phylogenetic analysis showed that MEIPNV1310 is closely related to the infectious pancreatic necrosis Ab strain within genogroup II. This genome sequence is beneficial for studying the geographic distribution and evolution of aquabirnaviruses.

  1. Random Amplification and Pyrosequencing for Identification of Novel Viral Genome Sequences

    PubMed Central

    Hang, Jun; Forshey, Brett M.; Kochel, Tadeusz J.; Li, Tao; Solórzano, Víctor Fiestas; Halsey, Eric S.; Kuschner, Robert A.

    2012-01-01

    ssRNA viruses have high levels of genomic divergence, which can lead to difficulty in genomic characterization of new viruses using traditional PCR amplification and sequencing methods. In this study, random reverse transcription, anchored random PCR amplification, and high-throughput pyrosequencing were used to identify orthobunyavirus sequences from total RNA extracted from viral cultures of acute febrile illness specimens. Draft genome sequence for the orthobunyavirus L segment was assembled and sequentially extended using de novo assembly contigs from pyrosequencing reads and orthobunyavirus sequences in GenBank as guidance. Accuracy and continuous coverage were achieved by mapping all reads to the L segment draft sequence. Subsequently, RT-PCR and Sanger sequencing were used to complete the genome sequence. The complete L segment was found to be 6936 bases in length, encoding a 2248-aa putative RNA polymerase. The identified L segment was distinct from previously published South American orthobunyaviruses, sharing 63% and 54% identity at the nucleotide and amino acid level, respectively, with the complete Oropouche virus L segment and 73% and 81% identity at the nucleotide and amino acid level, respectively, with a partial Caraparu virus L segment. The result demonstrated the effectiveness of a sequence-independent amplification and next-generation sequencing approach for obtaining complete viral genomes from total nucleic acid extracts and its use in pathogen discovery. PMID:22468136

  2. A compositional segmentation of the human mitochondrial genome is related to heterogeneities in the guanine mutation rate

    PubMed Central

    Samuels, David C.; Boys, Richard J.; Henderson, Daniel A.; Chinnery, Patrick F.

    2003-01-01

    We applied a hidden Markov model segmentation method to the human mitochondrial genome to identify patterns in the sequence, to compare these patterns to the gene structure of mtDNA and to see whether these patterns reveal additional characteristics important for our understanding of genome evolution, structure and function. Our analysis identified three segmentation categories based upon the sequence transition probabilities. Category 2 segments corresponded to the tRNA and rRNA genes, with a greater strand-symmetry in these segments. Category 1 and 3 segments covered the protein- coding genes and almost all of the non-coding D-loop. Compared to category 1, the mtDNA segments assigned to category 3 had much lower guanine abundance. A comparison to two independent databases of mitochondrial mutations and polymorphisms showed that the high substitution rate of guanine in human mtDNA is largest in the category 3 segments. Analysis of synonymous mutations showed the same pattern. This suggests that this heterogeneity in the mutation rate is partly independent of respiratory chain function and is a direct property of the genome sequence itself. This has important implications for our understanding of mtDNA evolution and its use as a ‘molecular clock’ to determine the rate of population and species divergence. PMID:14530452

  3. Coding Complete Genome for the Mogiana Tick Virus, a Jingmenvirus Isolated from Ticks in Brazil

    DTIC Science & Technology

    2017-05-04

    sequences for all four genome segments. We downloaded the raw Illumina sequence reads from the NCBI Short Read Archive (GenBank...MGTV genome segments through sequence similarity (BLASTN) to the published genome of Jingmen tick virus (JMTV) isolate SY84 (GenBank: KJ001579-KJ001582...2014. Standards for sequencing viral genomes in the era of high-throughput sequencing . MBio 5:e01360–14. 8. Bankevich A, Nurk S, Antipov

  4. Non-essential viral proteins of orbiviruses are essential for vector-borne spread by midges

    USDA-ARS?s Scientific Manuscript database

    Members of the Reoviridae family are non-enveloped multi-layered viruses with a double stranded RNA genome consisting of 9-12 genome segments. The Orbivirus genus contains vector borne virus species with 10 genome segments such as bluetongue virus (BTV) with about 30 serotypes, and African horse sic...

  5. Modeling heterogeneous (co)variances from adjacent-SNP groups improves genomic prediction for milk protein composition traits.

    PubMed

    Gebreyesus, Grum; Lund, Mogens S; Buitenhuis, Bart; Bovenhuis, Henk; Poulsen, Nina A; Janss, Luc G

    2017-12-05

    Accurate genomic prediction requires a large reference population, which is problematic for traits that are expensive to measure. Traits related to milk protein composition are not routinely recorded due to costly procedures and are considered to be controlled by a few quantitative trait loci of large effect. The amount of variation explained may vary between regions leading to heterogeneous (co)variance patterns across the genome. Genomic prediction models that can efficiently take such heterogeneity of (co)variances into account can result in improved prediction reliability. In this study, we developed and implemented novel univariate and bivariate Bayesian prediction models, based on estimates of heterogeneous (co)variances for genome segments (BayesAS). Available data consisted of milk protein composition traits measured on cows and de-regressed proofs of total protein yield derived for bulls. Single-nucleotide polymorphisms (SNPs), from 50K SNP arrays, were grouped into non-overlapping genome segments. A segment was defined as one SNP, or a group of 50, 100, or 200 adjacent SNPs, or one chromosome, or the whole genome. Traditional univariate and bivariate genomic best linear unbiased prediction (GBLUP) models were also run for comparison. Reliabilities were calculated through a resampling strategy and using deterministic formula. BayesAS models improved prediction reliability for most of the traits compared to GBLUP models and this gain depended on segment size and genetic architecture of the traits. The gain in prediction reliability was especially marked for the protein composition traits β-CN, κ-CN and β-LG, for which prediction reliabilities were improved by 49 percentage points on average using the MT-BayesAS model with a 100-SNP segment size compared to the bivariate GBLUP. Prediction reliabilities were highest with the BayesAS model that uses a 100-SNP segment size. The bivariate versions of our BayesAS models resulted in extra gains of up to 6% in prediction reliability compared to the univariate versions. Substantial improvement in prediction reliability was possible for most of the traits related to milk protein composition using our novel BayesAS models. Grouping adjacent SNPs into segments provided enhanced information to estimate parameters and allowing the segments to have different (co)variances helped disentangle heterogeneous (co)variances across the genome.

  6. Evaluation of a Phylogenetic Marker Based on Genomic Segment B of Infectious Bursal Disease Virus: Facilitating a Feasible Incorporation of this Segment to the Molecular Epidemiology Studies for this Viral Agent.

    PubMed

    Alfonso-Morales, Abdulahi; Rios, Liliam; Martínez-Pérez, Orlando; Dolz, Roser; Valle, Rosa; Perera, Carmen L; Bertran, Kateri; Frías, Maria T; Ganges, Llilianne; Díaz de Arce, Heidy; Majó, Natàlia; Núñez, José I; Pérez, Lester J

    2015-01-01

    Infectious bursal disease (IBD) is a highly contagious and acute viral disease, which has caused high mortality rates in birds and considerable economic losses in different parts of the world for more than two decades and it still represents a considerable threat to poultry. The current study was designed to rigorously measure the reliability of a phylogenetic marker included into segment B. This marker can facilitate molecular epidemiology studies, incorporating this segment of the viral genome, to better explain the links between emergence, spreading and maintenance of the very virulent IBD virus (vvIBDV) strains worldwide. Sequences of the segment B gene from IBDV strains isolated from diverse geographic locations were obtained from the GenBank Database; Cuban sequences were obtained in the current work. A phylogenetic marker named B-marker was assessed by different phylogenetic principles such as saturation of substitution, phylogenetic noise and high consistency. This last parameter is based on the ability of B-marker to reconstruct the same topology as the complete segment B of the viral genome. From the results obtained from B-marker, demographic history for both main lineages of IBDV regarding segment B was performed by Bayesian skyline plot analysis. Phylogenetic analysis for both segments of IBDV genome was also performed, revealing the presence of a natural reassortant strain with segment A from vvIBDV strains and segment B from non-vvIBDV strains within Cuban IBDV population. This study contributes to a better understanding of the emergence of vvIBDV strains, describing molecular epidemiology of IBDV using the state-of-the-art methodology concerning phylogenetic reconstruction. This study also revealed the presence of a novel natural reassorted strain as possible manifest of change in the genetic structure and stability of the vvIBDV strains. Therefore, it highlights the need to obtain information about both genome segments of IBDV for molecular epidemiology studies.

  7. Comparative genomics approach to detecting split-coding regions in a low-coverage genome: lessons from the chimaera Callorhinchus milii (Holocephali, Chondrichthyes).

    PubMed

    Dessimoz, Christophe; Zoller, Stefan; Manousaki, Tereza; Qiu, Huan; Meyer, Axel; Kuraku, Shigehiro

    2011-09-01

    Recent development of deep sequencing technologies has facilitated de novo genome sequencing projects, now conducted even by individual laboratories. However, this will yield more and more genome sequences that are not well assembled, and will hinder thorough annotation when no closely related reference genome is available. One of the challenging issues is the identification of protein-coding sequences split into multiple unassembled genomic segments, which can confound orthology assignment and various laboratory experiments requiring the identification of individual genes. In this study, using the genome of a cartilaginous fish, Callorhinchus milii, as test case, we performed gene prediction using a model specifically trained for this genome. We implemented an algorithm, designated ESPRIT, to identify possible linkages between multiple protein-coding portions derived from a single genomic locus split into multiple unassembled genomic segments. We developed a validation framework based on an artificially fragmented human genome, improvements between early and recent mouse genome assemblies, comparison with experimentally validated sequences from GenBank, and phylogenetic analyses. Our strategy provided insights into practical solutions for efficient annotation of only partially sequenced (low-coverage) genomes. To our knowledge, our study is the first formulation of a method to link unassembled genomic segments based on proteomes of relatively distantly related species as references.

  8. Comparative genomics approach to detecting split-coding regions in a low-coverage genome: lessons from the chimaera Callorhinchus milii (Holocephali, Chondrichthyes)

    PubMed Central

    Zoller, Stefan; Manousaki, Tereza; Qiu, Huan; Meyer, Axel; Kuraku, Shigehiro

    2011-01-01

    Recent development of deep sequencing technologies has facilitated de novo genome sequencing projects, now conducted even by individual laboratories. However, this will yield more and more genome sequences that are not well assembled, and will hinder thorough annotation when no closely related reference genome is available. One of the challenging issues is the identification of protein-coding sequences split into multiple unassembled genomic segments, which can confound orthology assignment and various laboratory experiments requiring the identification of individual genes. In this study, using the genome of a cartilaginous fish, Callorhinchus milii, as test case, we performed gene prediction using a model specifically trained for this genome. We implemented an algorithm, designated ESPRIT, to identify possible linkages between multiple protein-coding portions derived from a single genomic locus split into multiple unassembled genomic segments. We developed a validation framework based on an artificially fragmented human genome, improvements between early and recent mouse genome assemblies, comparison with experimentally validated sequences from GenBank, and phylogenetic analyses. Our strategy provided insights into practical solutions for efficient annotation of only partially sequenced (low-coverage) genomes. To our knowledge, our study is the first formulation of a method to link unassembled genomic segments based on proteomes of relatively distantly related species as references. PMID:21712341

  9. Detection and correction of false segmental duplications caused by genome mis-assembly

    PubMed Central

    2010-01-01

    Diploid genomes with divergent chromosomes present special problems for assembly software as two copies of especially polymorphic regions may be mistakenly constructed, creating the appearance of a recent segmental duplication. We developed a method for identifying such false duplications and applied it to four vertebrate genomes. For each genome, we corrected mis-assemblies, improved estimates of the amount of duplicated sequence, and recovered polymorphisms between the sequenced chromosomes. PMID:20219098

  10. The Consequences of Reconfiguring the Ambisense S Genome Segment of Rift Valley Fever Virus on Viral Replication in Mammalian and Mosquito Cells and for Genome Packaging

    PubMed Central

    Elliott, Richard M.

    2014-01-01

    Rift Valley fever virus (RVFV, family Bunyaviridae) is a mosquito-borne pathogen of both livestock and humans, found primarily in Sub-Saharan Africa and the Arabian Peninsula. The viral genome comprises two negative-sense (L and M segments) and one ambisense (S segment) RNAs that encode seven proteins. The S segment encodes the nucleocapsid (N) protein in the negative-sense and a nonstructural (NSs) protein in the positive-sense, though NSs cannot be translated directly from the S segment but rather from a specific subgenomic mRNA. Using reverse genetics we generated a virus, designated rMP12:S-Swap, in which the N protein is expressed from the NSs locus and NSs from the N locus within the genomic S RNA. In cells infected with rMP12:S-Swap NSs is expressed at higher levels with respect to N than in cells infected with the parental rMP12 virus. Despite NSs being the main interferon antagonist and determinant of virulence, growth of rMP12:S-Swap was attenuated in mammalian cells and gave a small plaque phenotype. The increased abundance of the NSs protein did not lead to faster inhibition of host cell protein synthesis or host cell transcription in infected mammalian cells. In cultured mosquito cells, however, infection with rMP12:S-Swap resulted in cell death rather than establishment of persistence as seen with rMP12. Finally, altering the composition of the S segment led to a differential packaging ratio of genomic to antigenomic RNA into rMP12:S-Swap virions. Our results highlight the plasticity of the RVFV genome and provide a useful experimental tool to investigate further the packaging mechanism of the segmented genome. PMID:24550727

  11. A Python Analytical Pipeline to Identify Prohormone Precursors and Predict Prohormone Cleavage Sites

    PubMed Central

    Southey, Bruce R.; Sweedler, Jonathan V.; Rodriguez-Zas, Sandra L.

    2008-01-01

    Neuropeptides and hormones are signaling molecules that support cell–cell communication in the central nervous system. Experimentally characterizing neuropeptides requires significant efforts because of the complex and variable processing of prohormone precursor proteins into neuropeptides and hormones. We demonstrate the power and flexibility of the Python language to develop components of an bioinformatic analytical pipeline to identify precursors from genomic data and to predict cleavage as these precursors are en route to the final bioactive peptides. We identified 75 precursors in the rhesus genome, predicted cleavage sites using support vector machines and compared the rhesus predictions to putative assignments based on homology to human sequences. The correct classification rate of cleavage using the support vector machines was over 97% for both human and rhesus data sets. The functionality of Python has been important to develop and maintain NeuroPred (http://neuroproteomics.scs.uiuc.edu/neuropred.html), a user-centered web application for the neuroscience community that provides cleavage site prediction from a wide range of models, precision and accuracy statistics, post-translational modifications, and the molecular mass of potential peptides. The combined results illustrate the suitability of the Python language to implement an all-inclusive bioinformatics approach to predict neuropeptides that encompasses a large number of interdependent steps, from scanning genomes for precursor genes to identification of potential bioactive neuropeptides. PMID:19169350

  12. DNA Precursor Metabolism and Mitochondrial Genome Stability

    DTIC Science & Technology

    2003-04-01

    mitochondrial DNA replication , to learn how the pool sizes are regulated, and to understand how perturbations of normal dNTP metabolism within the...mitochondria raises the possibility, however unlikely, that it is serving a function in addition to its role in DNA replication . The literature on non-DNA...is below since many authors do not follow the 200 word limit 14. SUBJECT TERMS Mitochondria, Genome stability, DNA precursors, Mitochondrial DNA

  13. The infinite sites model of genome evolution.

    PubMed

    Ma, Jian; Ratan, Aakrosh; Raney, Brian J; Suh, Bernard B; Miller, Webb; Haussler, David

    2008-09-23

    We formalize the problem of recovering the evolutionary history of a set of genomes that are related to an unseen common ancestor genome by operations of speciation, deletion, insertion, duplication, and rearrangement of segments of bases. The problem is examined in the limit as the number of bases in each genome goes to infinity. In this limit, the chromosomes are represented by continuous circles or line segments. For such an infinite-sites model, we present a polynomial-time algorithm to find the most parsimonious evolutionary history of any set of related present-day genomes.

  14. First Complete Squash leaf curl China virus Genomic Segment DNA-A Sequence from East Timor

    PubMed Central

    Maina, Solomon; Edwards, Owain R.; de Almeida, Luis; Ximenes, Abel

    2017-01-01

    ABSTRACT We present here the first complete Squash leaf curl China virus (SLCCV) genomic segment DNA-A sequence from East Timor. It was isolated from a pumpkin plant. When compared with 15 complete SLCCV DNA-A genome sequences from other world regions, it most resembled the Malaysian isolate MC1 sequence. PMID:28619789

  15. Genomic reassortment of influenza A virus in North American swine, 1998–2011

    PubMed Central

    Detmer, Susan E.; Wentworth, David E.; Tan, Yi; Schwartzbard, Aaron; Halpin, Rebecca A.; Stockwell, Timothy B.; Lin, Xudong; Vincent, Amy L.; Gramer, Marie R.; Holmes, Edward C.

    2012-01-01

    Revealing the frequency and determinants of reassortment among RNA genome segments is fundamental to understanding basic aspects of the biology and evolution of the influenza virus. To estimate the extent of genomic reassortment in influenza viruses circulating in North American swine, we performed a phylogenetic analysis of 139 whole-genome viral sequences sampled during 1998–2011 and representing seven antigenically distinct viral lineages. The highest amounts of reassortment were detected between the H3 and the internal gene segments (PB2, PB1, PA, NP, M and NS), while the lowest reassortment frequencies were observed among the H1γ, H1pdm and neuraminidase segments, particularly N1. Less reassortment was observed among specific haemagglutinin–neuraminidase combinations that were more prevalent in swine, suggesting that some genome constellations may be evolutionarily more stable. PMID:22993190

  16. Structural constraints in the packaging of bluetongue virus genomic segments

    PubMed Central

    Burkhardt, Christiane; Sung, Po-Yu; Celma, Cristina C.

    2014-01-01

    The mechanism used by bluetongue virus (BTV) to ensure the sorting and packaging of its 10 genomic segments is still poorly understood. In this study, we investigated the packaging constraints for two BTV genomic segments from two different serotypes. Segment 4 (S4) of BTV serotype 9 was mutated sequentially and packaging of mutant ssRNAs was investigated by two newly developed RNA packaging assay systems, one in vivo and the other in vitro. Modelling of the mutated ssRNA followed by biochemical data analysis suggested that a conformational motif formed by interaction of the 5′ and 3′ ends of the molecule was necessary and sufficient for packaging. A similar structural signal was also identified in S8 of BTV serotype 1. Furthermore, the same conformational analysis of secondary structures for positive-sense ssRNAs was used to generate a chimeric segment that maintained the putative packaging motif but contained unrelated internal sequences. This chimeric segment was packaged successfully, confirming that the motif identified directs the correct packaging of the segment. PMID:24980574

  17. SeeGH--a software tool for visualization of whole genome array comparative genomic hybridization data.

    PubMed

    Chi, Bryan; DeLeeuw, Ronald J; Coe, Bradley P; MacAulay, Calum; Lam, Wan L

    2004-02-09

    Array comparative genomic hybridization (CGH) is a technique which detects copy number differences in DNA segments. Complete sequencing of the human genome and the development of an array representing a tiling set of tens of thousands of DNA segments spanning the entire human genome has made high resolution copy number analysis throughout the genome possible. Since array CGH provides signal ratio for each DNA segment, visualization would require the reassembly of individual data points into chromosome profiles. We have developed a visualization tool for displaying whole genome array CGH data in the context of chromosomal location. SeeGH is an application that translates spot signal ratio data from array CGH experiments to displays of high resolution chromosome profiles. Data is imported from a simple tab delimited text file obtained from standard microarray image analysis software. SeeGH processes the signal ratio data and graphically displays it in a conventional CGH karyotype diagram with the added features of magnification and DNA segment annotation. In this process, SeeGH imports the data into a database, calculates the average ratio and standard deviation for each replicate spot, and links them to chromosome regions for graphical display. Once the data is displayed, users have the option of hiding or flagging DNA segments based on user defined criteria, and retrieve annotation information such as clone name, NCBI sequence accession number, ratio, base pair position on the chromosome, and standard deviation. SeeGH represents a novel software tool used to view and analyze array CGH data. The software gives users the ability to view the data in an overall genomic view as well as magnify specific chromosomal regions facilitating the precise localization of genetic alterations. SeeGH is easily installed and runs on Microsoft Windows 2000 or later environments.

  18. The Encapsidated Genome of Microplitis demolitor Bracovirus Integrates into the Host Pseudoplusia includens ▿ ‡

    PubMed Central

    Beck, Markus H.; Zhang, Shu; Bitra, Kavita; Burke, Gaelen R.; Strand, Michael R.

    2011-01-01

    Polydnaviruses (PDVs) are symbionts of parasitoid wasps that function as gene delivery vehicles in the insects (hosts) that the wasps parasitize. PDVs persist in wasps as integrated proviruses but are packaged as circularized and segmented double-stranded DNAs into the virions that wasps inject into hosts. In contrast, little is known about how PDV genomic DNAs persist in host cells. Microplitis demolitor carries Microplitis demolitor bracovirus (MdBV) and parasitizes the host Pseudoplusia includens. MdBV infects primarily host hemocytes and also infects a hemocyte-derived cell line from P. includens called CiE1 cells. Here we report that all 15 genomic segments of the MdBV encapsidated genome exhibited long-term persistence in CiE1 cells. Most MdBV genes expressed in hemocytes were persistently expressed in CiE1 cells, including members of the glc gene family whose products transformed CiE1 cells into a suspension culture. PCR-based integration assays combined with cloning and sequencing of host-virus junctions confirmed that genomic segments J and C persisted in CiE1 cells by integration. These genomic DNAs also rapidly integrated into parasitized P. includens. Sequence analysis of wasp-viral junction clones showed that the integration of proviral segments in M. demolitor was associated with a wasp excision/integration motif (WIM) known from other bracoviruses. However, integration into host cells occurred in association with a previously unknown domain that we named the host integration motif (HIM). The presence of HIMs in most MdBV genomic DNAs suggests that the integration of each genomic segment into host cells occurs through a shared mechanism. PMID:21880747

  19. RNA structural constraints in the evolution of the influenza A virus genome NP segment

    PubMed Central

    Gultyaev, Alexander P; Tsyganov-Bodounov, Anton; Spronken, Monique IJ; van der Kooij, Sander; Fouchier, Ron AM; Olsthoorn, René CL

    2014-01-01

    Conserved RNA secondary structures were predicted in the nucleoprotein (NP) segment of the influenza A virus genome using comparative sequence and structure analysis. A number of structural elements exhibiting nucleotide covariations were identified over the whole segment length, including protein-coding regions. Calculations of mutual information values at the paired nucleotide positions demonstrate that these structures impose considerable constraints on the virus genome evolution. Functional importance of a pseudoknot structure, predicted in the NP packaging signal region, was confirmed by plaque assays of the mutant viruses with disrupted structure and those with restored folding using compensatory substitutions. Possible functions of the conserved RNA folding patterns in the influenza A virus genome are discussed. PMID:25180940

  20. Evaluation of a Phylogenetic Marker Based on Genomic Segment B of Infectious Bursal Disease Virus: Facilitating a Feasible Incorporation of this Segment to the Molecular Epidemiology Studies for this Viral Agent

    PubMed Central

    Martínez-Pérez, Orlando; Dolz, Roser; Valle, Rosa; Perera, Carmen L.; Bertran, Kateri; Frías, Maria T.; Ganges, Llilianne; Díaz de Arce, Heidy; Majó, Natàlia; Núñez, José I.; Pérez, Lester J.

    2015-01-01

    Background Infectious bursal disease (IBD) is a highly contagious and acute viral disease, which has caused high mortality rates in birds and considerable economic losses in different parts of the world for more than two decades and it still represents a considerable threat to poultry. The current study was designed to rigorously measure the reliability of a phylogenetic marker included into segment B. This marker can facilitate molecular epidemiology studies, incorporating this segment of the viral genome, to better explain the links between emergence, spreading and maintenance of the very virulent IBD virus (vvIBDV) strains worldwide. Methodology/Principal Findings Sequences of the segment B gene from IBDV strains isolated from diverse geographic locations were obtained from the GenBank Database; Cuban sequences were obtained in the current work. A phylogenetic marker named B-marker was assessed by different phylogenetic principles such as saturation of substitution, phylogenetic noise and high consistency. This last parameter is based on the ability of B-marker to reconstruct the same topology as the complete segment B of the viral genome. From the results obtained from B-marker, demographic history for both main lineages of IBDV regarding segment B was performed by Bayesian skyline plot analysis. Phylogenetic analysis for both segments of IBDV genome was also performed, revealing the presence of a natural reassortant strain with segment A from vvIBDV strains and segment B from non-vvIBDV strains within Cuban IBDV population. Conclusions/Significance This study contributes to a better understanding of the emergence of vvIBDV strains, describing molecular epidemiology of IBDV using the state-of-the-art methodology concerning phylogenetic reconstruction. This study also revealed the presence of a novel natural reassorted strain as possible manifest of change in the genetic structure and stability of the vvIBDV strains. Therefore, it highlights the need to obtain information about both genome segments of IBDV for molecular epidemiology studies. PMID:25946336

  1. Chemical synthesis of the precursor molecule of the Aequorea green fluorescent protein, subsequent folding, and development of fluorescence

    PubMed Central

    Nishiuchi, Yuji; Inui, Tatsuya; Nishio, Hideki; Bódi, József; Kimura, Terutoshi; Tsuji, Frederick I.; Sakakibara, Shumpei

    1998-01-01

    The present paper describes the total chemical synthesis of the precursor molecule of the Aequorea green fluorescent protein (GFP). The molecule is made up of 238 amino acid residues in a single polypeptide chain and is nonfluorescent. To carry out the synthesis, a procedure, first described in 1981 for the synthesis of complex peptides, was used. The procedure is based on performing segment condensation reactions in solution while providing maximum protection to the segment. The effectiveness of the procedure has been demonstrated by the synthesis of various biologically active peptides and small proteins, such as human angiogenin, a 123-residue protein analogue of ribonuclease A, human midkine, a 121-residue protein, and pleiotrophin, a 136-residue protein analogue of midkine. The GFP precursor molecule was synthesized from 26 fully protected segments in solution, and the final 238-residue peptide was treated with anhydrous hydrogen fluoride to obtain the precursor molecule of GFP containing two Cys(acetamidomethyl) residues. After removal of the acetamidomethyl groups, the product was dissolved in 0.1 M Tris⋅HCl buffer (pH 8.0) in the presence of DTT. After several hours at room temperature, the solution began to emit a green fluorescence (λmax = 509 nm) under near-UV light. Both fluorescence excitation and fluorescence emission spectra were measured and were found to have the same shape and maxima as those reported for native GFP. The present results demonstrate the utility of the segment condensation procedure in synthesizing large protein molecules such as GFP. The result also provides evidence that the formation of the chromophore in GFP is not dependent on any external cofactor. PMID:9811837

  2. Molecular cytogenetic and genomic analyses reveal new insights into the origin of the wheat B genome.

    PubMed

    Zhang, Wei; Zhang, Mingyi; Zhu, Xianwen; Cao, Yaping; Sun, Qing; Ma, Guojia; Chao, Shiaoman; Yan, Changhui; Xu, Steven S; Cai, Xiwen

    2018-02-01

    This work pinpointed the goatgrass chromosomal segment in the wheat B genome using modern cytogenetic and genomic technologies, and provided novel insights into the origin of the wheat B genome. Wheat is a typical allopolyploid with three homoeologous subgenomes (A, B, and D). The donors of the subgenomes A and D had been identified, but not for the subgenome B. The goatgrass Aegilops speltoides (genome SS) has been controversially considered a possible candidate for the donor of the wheat B genome. However, the relationship of the Ae. speltoides S genome with the wheat B genome remains largely obscure. The present study assessed the homology of the B and S genomes using an integrative cytogenetic and genomic approach, and revealed the contribution of Ae. speltoides to the origin of the wheat B genome. We discovered noticeable homology between wheat chromosome 1B and Ae. speltoides chromosome 1S, but not between other chromosomes in the B and S genomes. An Ae. speltoides-originated segment spanning a genomic region of approximately 10.46 Mb was detected on the long arm of wheat chromosome 1B (1BL). The Ae. speltoides-originated segment on 1BL was found to co-evolve with the rest of the B genome. Evidently, Ae. speltoides had been involved in the origin of the wheat B genome, but should not be considered an exclusive donor of this genome. The wheat B genome might have a polyphyletic origin with multiple ancestors involved, including Ae. speltoides. These novel findings will facilitate genome studies in wheat and other polyploids.

  3. A greedy, graph-based algorithm for the alignment of multiple homologous gene lists.

    PubMed

    Fostier, Jan; Proost, Sebastian; Dhoedt, Bart; Saeys, Yvan; Demeester, Piet; Van de Peer, Yves; Vandepoele, Klaas

    2011-03-15

    Many comparative genomics studies rely on the correct identification of homologous genomic regions using accurate alignment tools. In such case, the alphabet of the input sequences consists of complete genes, rather than nucleotides or amino acids. As optimal multiple sequence alignment is computationally impractical, a progressive alignment strategy is often employed. However, such an approach is susceptible to the propagation of alignment errors in early pairwise alignment steps, especially when dealing with strongly diverged genomic regions. In this article, we present a novel accurate and efficient greedy, graph-based algorithm for the alignment of multiple homologous genomic segments, represented as ordered gene lists. Based on provable properties of the graph structure, several heuristics are developed to resolve local alignment conflicts that occur due to gene duplication and/or rearrangement events on the different genomic segments. The performance of the algorithm is assessed by comparing the alignment results of homologous genomic segments in Arabidopsis thaliana to those obtained by using both a progressive alignment method and an earlier graph-based implementation. Especially for datasets that contain strongly diverged segments, the proposed method achieves a substantially higher alignment accuracy, and proves to be sufficiently fast for large datasets including a few dozens of eukaryotic genomes. http://bioinformatics.psb.ugent.be/software. The algorithm is implemented as a part of the i-ADHoRe 3.0 package.

  4. DOE Office of Scientific and Technical Information (OSTI.GOV)

    Rorsman, F.; Bywater, M.; Knott, T.J.

    The human platelet-derived growth factor (PDGF) A-chain locus was characterized by restriction endonuclease analysis, and the nucleotide sequence of its exons was determined. Seven exons were identified, spanning approximately 22 kilobase pairs of genomic DNA. Alternative exon usage, identified by cDNA cloning, occurs in a human glioblastoma cell line and may give rise to two types of A-chain precursors with different C termini. The exon-intron arrangement was similar to that of the PDGF B-chain/sis locus and seemed to divide the precursor proteins into functional domains. Southern blot analysis of genomic DNA showed that a single PDGF A-chain gene was presentmore » in the human genome.« less

  5. Using Markov chains of nucleotide sequences as a possible precursor to predict functional roles of human genome: a case study on inactive chromatin regions.

    PubMed

    Lee, K-E; Lee, E-J; Park, H-S

    2016-08-30

    Recent advances in computational epigenetics have provided new opportunities to evaluate n-gram probabilistic language models. In this paper, we describe a systematic genome-wide approach for predicting functional roles in inactive chromatin regions by using a sequence-based Markovian chromatin map of the human genome. We demonstrate that Markov chains of sequences can be used as a precursor to predict functional roles in heterochromatin regions and provide an example comparing two publicly available chromatin annotations of large-scale epigenomics projects: ENCODE project consortium and Roadmap Epigenomics consortium.

  6. Chromosomal Evolution and Patterns of Introgression in Helianthus

    PubMed Central

    Barb, Jessica G.; Bowers, John E.; Renaut, Sebastien; Rey, Juan I.; Knapp, Steven J.; Rieseberg, Loren H.; Burke, John M.

    2014-01-01

    Knowledge of the nature and extent of karyotypic differences between species provides insight into the evolutionary history of the genomes in question and, in the case of closely related species, the potential for genetic exchange between taxa. We constructed high-density genetic maps of the silverleaf sunflower (Helianthus argophyllus) and Algodones Dune sunflower (H. niveus ssp. tephrodes) genomes and compared them to a consensus map of cultivated sunflower (H. annuus) to identify chromosomal rearrangements between species. The genetic maps of H. argophyllus and H. niveus ssp. tephrodes included 17 linkage groups each and spanned 1337 and 1478 cM, respectively. Comparative analyses revealed greater divergence between H. annuus and H. niveus ssp. tephrodes (13 inverted segments, 18 translocated segments) than between H. annuus and H. argophyllus (10 inverted segments, 8 translocated segments), consistent with their known phylogenetic relationships. Marker order was conserved across much of the genome, with 83 and 64% of the H. argophyllus and H. niveus ssp. tephrodes genomes, respectively, being syntenic with H. annuus. Population genomic analyses between H. annuus and H. argophyllus, which are sympatric across a portion of the natural range of H. annuus, revealed significantly elevated genetic structure in rearranged portions of the genome, indicating that such rearrangements are associated with restricted gene flow between these two species. PMID:24770331

  7. The Mitochondrial Genome and a 60-kb Nuclear DNA Segment from Naegleria fowleri, the Causative Agent of Primary Amoebic Meningoencephalitis

    PubMed Central

    Herman, Emily K.; Greninger, Alexander L.; Visvesvara, Govinda S.; Marciano-Cabral, Francine; Dacks, Joel B.; Chiu, Charles Y.

    2013-01-01

    Naegleria fowleri is a unicellular eukaryote causing primary amoebic meningoencephalitis, a neuropathic disease killing 99% of those infected, usually within 7–14 days. N. fowleri is found globally in regions including the US and Australia. The genome of the related non-pathogenic species Naegleria gruberi has been sequenced, but the genetic basis for N. fowleri pathogenicity is unclear. To generate such insight, we sequenced and assembled the mitochondrial genome and a 60-kb segment of nuclear genome from N. fowleri. The mitochondrial genome is highly similar to its counterpart in N. gruberi in gene complement and organization, while distinct lack of synteny is observed for the nuclear segments. Even in this short (60-kb) segment, we identified examples of potential factors for pathogenesis, including ten novel N. fowleri-specific genes. We also identified a homologue of cathepsin B; proteases proposed to be involved in the pathogenesis of diverse eukaryotic pathogens, including N. fowleri. Finally, we demonstrate a likely case of horizontal gene transfer between N. fowleri and two unrelated amoebae, one of which causes granulomatous amoebic encephalitis. This initial look into the N. fowleri nuclear genome has revealed several examples of potential pathogenesis factors, improving our understanding of a neglected pathogen of increasing global importance. PMID:23360210

  8. BAC sequencing using pooled methods.

    PubMed

    Saski, Christopher A; Feltus, F Alex; Parida, Laxmi; Haiminen, Niina

    2015-01-01

    Shotgun sequencing and assembly of a large, complex genome can be both expensive and challenging to accurately reconstruct the true genome sequence. Repetitive DNA arrays, paralogous sequences, polyploidy, and heterozygosity are main factors that plague de novo genome sequencing projects that typically result in highly fragmented assemblies and are difficult to extract biological meaning. Targeted, sub-genomic sequencing offers complexity reduction by removing distal segments of the genome and a systematic mechanism for exploring prioritized genomic content through BAC sequencing. If one isolates and sequences the genome fraction that encodes the relevant biological information, then it is possible to reduce overall sequencing costs and efforts that target a genomic segment. This chapter describes the sub-genome assembly protocol for an organism based upon a BAC tiling path derived from a genome-scale physical map or from fine mapping using BACs to target sub-genomic regions. Methods that are described include BAC isolation and mapping, DNA sequencing, and sequence assembly.

  9. Tempo and Mode in the Molecular Evolution of Influenza C

    PubMed Central

    Gatherer, Derek

    2010-01-01

    Abstract: Influenza C contributes to economic damage caused by working days lost through absence or inefficiency and may occasionally cause an acute respiratory illness in a paediatric setting. All Influenza C sequences from the NCBI Influenza Virus Resource were examined to determine the date of the most recent common ancestor (t-MRCA), the average nucleotide substitution rate, and the location of residues under positive selection, for each of the seven genome segments of this virus. The segment with the deepest phylogeny was found to be segment 4, encoding the haemagglutinin-esterase protein (HE) with mean t-MRCA at 1890 of the common era (AD), at a 95% highest posterior density (HPD) of 1857-1924 AD. Other genome segments have slightly more recent common ancestors, ranging from mean t-MRCAs of 1916 AD (HPD 1891-1937) for segment 7, encoding the two non-structural proteins (NS) to 1944 AD (HPD 1940-1948) for segment 2 encoding the type 1 basic polymerase (PB1). On the basis of the Bayesian analysis a reclassification of lineages within genome segments is proposed. Some evidence for positive selection was found in the receptor-binding domain of the haemagglutinin-esterase protein. However, average ω (omega) values ranged from 0.05 for polymerase basic protein 2 (PB2) to 0.38 for non-structural protein 2 (NS2), suggesting that strong to moderate purifying selection is the main trend. Characteristic combinations of segment lineages were identified (genome constellations) and shown to have a relatively short life-span before being broken up by reassortment. PMID:21127722

  10. Genome Duplication and Gene Loss Affect the Evolution of Heat Shock Transcription Factor Genes in Legumes

    PubMed Central

    Jin, Jing; Jin, Xiaolei; Jiang, Haiyang; Yan, Hanwei; Cheng, Beijiu

    2014-01-01

    Whole-genome duplication events (polyploidy events) and gene loss events have played important roles in the evolution of legumes. Here we show that the vast majority of Hsf gene duplications resulted from whole genome duplication events rather than tandem duplication, and significant differences in gene retention exist between species. By searching for intraspecies gene colinearity (microsynteny) and dating the age distributions of duplicated genes, we found that genome duplications accounted for 42 of 46 Hsf-containing segments in Glycine max, while paired segments were rarely identified in Lotus japonicas, Medicago truncatula and Cajanus cajan. However, by comparing interspecies microsynteny, we determined that the great majority of Hsf-containing segments in Lotus japonicas, Medicago truncatula and Cajanus cajan show extensive conservation with the duplicated regions of Glycine max. These segments formed 17 groups of orthologous segments. These results suggest that these regions shared ancient genome duplication with Hsf genes in Glycine max, but more than half of the copies of these genes were lost. On the other hand, the Glycine max Hsf gene family retained approximately 75% and 84% of duplicated genes produced from the ancient genome duplication and recent Glycine-specific genome duplication, respectively. Continuous purifying selection has played a key role in the maintenance of Hsf genes in Glycine max. Expression analysis of the Hsf genes in Lotus japonicus revealed their putative involvement in multiple tissue-/developmental stages and responses to various abiotic stimuli. This study traces the evolution of Hsf genes in legume species and demonstrates that the rates of gene gain and loss are far from equilibrium in different species. PMID:25047803

  11. Organizational differences between cytoplasmic male sterile and male fertile Brassica mitochondrial genomes are confined to a single transposed locus.

    PubMed Central

    L'Homme, Y; Brown, G G

    1993-01-01

    Comparison of the physical maps of male fertile (cam) and male sterile (pol) mitochondrial genomes of Brassica napus indicates that structural differences between the two mtDNAs are confined to a region immediately upstream of the atp6 gene. Relative to cam mtDNA, pol mtDNA possesses a 4.5 kb segment at this locus that includes a chimeric gene that is cotranscribed with atp6 and lacks an approximately 1kb region located upstream of the cam atp6 gene. The 4.5 kb pol segment is present and similarly organized in the mitochondrial genome of the common nap B.napus cytoplasm; however, the nap and pol DNA regions flanking this segment are different and the nap sequences are not expressed. The 4.5 kb CMS-associated pol segment has thus apparently undergone transposition during the evolution of the nap and pol cytoplasms and has been lost in the cam genome subsequent to the pol-cam divergence. This 4.5 kb segment comprises the single DNA region that is expressed differently in fertile, pol CMS and fertility restored pol cytoplasm plants. The finding that this locus is part of the single mtDNA region organized differently in the fertile and male sterile mitochondrial genomes provides strong support for the view that it specifies the pol CMS trait. Images PMID:8388101

  12. Automatic Segmentation of High-Throughput RNAi Fluorescent Cellular Images

    PubMed Central

    Yan, Pingkum; Zhou, Xiaobo; Shah, Mubarak; Wong, Stephen T. C.

    2010-01-01

    High-throughput genome-wide RNA interference (RNAi) screening is emerging as an essential tool to assist biologists in understanding complex cellular processes. The large number of images produced in each study make manual analysis intractable; hence, automatic cellular image analysis becomes an urgent need, where segmentation is the first and one of the most important steps. In this paper, a fully automatic method for segmentation of cells from genome-wide RNAi screening images is proposed. Nuclei are first extracted from the DNA channel by using a modified watershed algorithm. Cells are then extracted by modeling the interaction between them as well as combining both gradient and region information in the Actin and Rac channels. A new energy functional is formulated based on a novel interaction model for segmenting tightly clustered cells with significant intensity variance and specific phenotypes. The energy functional is minimized by using a multiphase level set method, which leads to a highly effective cell segmentation method. Promising experimental results demonstrate that automatic segmentation of high-throughput genome-wide multichannel screening can be achieved by using the proposed method, which may also be extended to other multichannel image segmentation problems. PMID:18270043

  13. Whole-Genome Sequencing for Optimized Patient Management

    PubMed Central

    Bainbridge, Matthew N.; Wiszniewski, Wojciech; Murdock, David R.; Friedman, Jennifer; Gonzaga-Jauregui, Claudia; Newsham, Irene; Reid, Jeffrey G.; Fink, John K.; Morgan, Margaret B.; Gingras, Marie-Claude; Muzny, Donna M.; Hoang, Linh D.; Yousaf, Shahed; Lupski, James R.; Gibbs, Richard A.

    2012-01-01

    Whole-genome sequencing of patient DNA can facilitate diagnosis of a disease, but its potential for guiding treatment has been under-realized. We interrogated the complete genome sequences of a 14-year-old fraternal twin pair diagnosed with dopa (3,4-dihydroxyphenylalanine)–responsive dystonia (DRD; Mendelian Inheritance in Man #128230). DRD is a genetically heterogeneous and clinically complex movement disorder that is usually treated with l-dopa, a precursor of the neurotransmitter dopamine. Whole-genome sequencing identified compound heterozygous mutations in the SPR gene encoding sepiapterin reductase. Disruption of SPR causes a decrease in tetrahydrobiopterin, a cofactor required for the hydroxylase enzymes that synthesize the neurotransmitters dopamine and serotonin. Supplementation of l-dopa therapy with 5-hydroxytryptophan, a serotonin precursor, resulted in clinical improvements in both twins. PMID:21677200

  14. CID-miRNA: A web server for prediction of novel miRNA precursors in human genome

    DOE Office of Scientific and Technical Information (OSTI.GOV)

    Tyagi, Sonika; Vaz, Candida; Gupta, Vipin

    2008-08-08

    microRNAs (miRNA) are a class of non-protein coding functional RNAs that are thought to regulate expression of target genes by direct interaction with mRNAs. miRNAs have been identified through both experimental and computational methods in a variety of eukaryotic organisms. Though these approaches have been partially successful, there is a need to develop more tools for detection of these RNAs as they are also thought to be present in abundance in many genomes. In this report we describe a tool and a web server, named CID-miRNA, for identification of miRNA precursors in a given DNA sequence, utilising secondary structure-based filteringmore » systems and an algorithm based on stochastic context free grammar trained on human miRNAs. CID-miRNA analyses a given sequence using a web interface, for presence of putative miRNA precursors and the generated output lists all the potential regions that can form miRNA-like structures. It can also scan large genomic sequences for the presence of potential miRNA precursors in its stand-alone form. The web server can be accessed at (http://mirna.jnu.ac.in/cidmirna/)« less

  15. Assessment of Recombination in the S-segment Genome of Crimean-Congo Hemorrhagic Fever Virus in Iran.

    PubMed

    Chinikar, Sadegh; Shah-Hosseini, Nariman; Bouzari, Saeid; Shokrgozar, Mohammad Ali; Mostafavi, Ehsan; Jalali, Tahmineh; Khakifirouz, Sahar; Groschup, Martin H; Niedrig, Matthias

    2016-03-01

    Crimean-Congo Hemorrhagic Fever Virus (CCHFV) belongs to genus Nairovirus and family Bunyaviridae. The main aim of this study was to investigate the extent of recombination in S-segment genome of CCHFV in Iran. Samples were isolated from Iranian patients and those available in GenBank, and analyzed by phylogenetic and bootscan methods. Through comparison of the phylogenetic trees based on full length sequences and partial fragments in the S-segment genome of CCHFV, genetic switch was evident, due to recombination event. Moreover, evidence of multiple recombination events was detected in query isolates when bootscan analysis was used by SimPlot software. Switch of different genomic regions between different strains by recombination could contribute to CCHFV diversification and evolution. The occurrence of recombination in CCHFV has a critical impact on epidemiological investigations and vaccine design.

  16. Speed congenics: accelerated genome recovery using genetic markers.

    PubMed

    Visscher, P M

    1999-08-01

    Genetic markers throughout the genome can be used to speed up 'recovery' of the recipient genome in the backcrossing phase of the construction of a congenic strain. The prediction of the genomic proportion during backcrossing depends on the assumptions regarding the distribution of chromosome segments, the population structure, the marker spacing and the selection strategy. In this study simulation was used to investigate the rate of recovery of the recipient genome for a mouse, Drosophila and Arabidopsis genome. It was shown that an incorrect assumption of a binomial distribution of chromosome segments, and failing to take account of a reduction in variance in genomic proportion due to selection, can lead to a downward bias of up to two generations in the estimation of the number of generations required for the formation of a congenic strain.

  17. The mitochondrial genome and a 60-kb nuclear DNA segment from Naegleria fowleri, the causative agent of primary amoebic meningoencephalitis.

    PubMed

    Herman, Emily K; Greninger, Alexander L; Visvesvara, Govinda S; Marciano-Cabral, Francine; Dacks, Joel B; Chiu, Charles Y

    2013-01-01

    Naegleria fowleri is a unicellular eukaryote causing primary amoebic meningoencephalitis, a neuropathic disease killing 99% of those infected, usually within 7-14 days. Naegleria fowleri is found globally in regions including the US and Australia. The genome of the related nonpathogenic species Naegleria gruberi has been sequenced, but the genetic basis for N. fowleri pathogenicity is unclear. To generate such insight, we sequenced and assembled the mitochondrial genome and a 60-kb segment of nuclear genome from N. fowleri. The mitochondrial genome is highly similar to its counterpart in N. gruberi in gene complement and organization, while distinct lack of synteny is observed for the nuclear segments. Even in this short (60-kb) segment, we identified examples of potential factors for pathogenesis, including ten novel N. fowleri-specific genes. We also identified a homolog of cathepsin B; proteases proposed to be involved in the pathogenesis of diverse eukaryotic pathogens, including N. fowleri. Finally, we demonstrate a likely case of horizontal gene transfer between N. fowleri and two unrelated amoebae, one of which causes granulomatous amoebic encephalitis. This initial look into the N. fowleri nuclear genome has revealed several examples of potential pathogenesis factors, improving our understanding of a neglected pathogen of increasing global importance. © 2013 The Author(s) Journal of Eukaryotic Microbiology © 2013 International Society of Protistologists.

  18. [Comparative analysis of variable regions in the genomes of variola virus].

    PubMed

    Babkin, I V; Nepomniashchikh, T S; Maksiutov, R A; Gutorov, V V; Babkina, I N; Shchelkunov, S N

    2008-01-01

    Nucleotide sequences of two extended segments of the terminal variable regions in variola virus genome were determined. The size of the left segment was 13.5 kbp and of the right, 10.5 kbp. Totally, over 540 kbp were sequenced for 22 variola virus strains. The conducted phylogenetic analysis and the data published earlier allowed us to find the interrelations between 70 variola virus isolates, the character of their clustering, and the degree of intergroup and intragroup variations of the clusters of variola virus strains. The most polymorphic loci of the genome segments studied were determined. It was demonstrated that that these loci are localized to either noncoding genome regions or to the regions of destroyed open reading frames, characteristic of the ancestor virus. These loci are promising for development of the strategy for genotyping variola virus strains. Analysis of recombination using various methods demonstrated that, with the only exception, no statistically significant recombinational events in the genomes of variola virus strains studied were detectable.

  19. Detecting the borders between coding and non-coding DNA regions in prokaryotes based on recursive segmentation and nucleotide doublets statistics

    PubMed Central

    2012-01-01

    Background Detecting the borders between coding and non-coding regions is an essential step in the genome annotation. And information entropy measures are useful for describing the signals in genome sequence. However, the accuracies of previous methods of finding borders based on entropy segmentation method still need to be improved. Methods In this study, we first applied a new recursive entropic segmentation method on DNA sequences to get preliminary significant cuts. A 22-symbol alphabet is used to capture the differential composition of nucleotide doublets and stop codon patterns along three phases in both DNA strands. This process requires no prior training datasets. Results Comparing with the previous segmentation methods, the experimental results on three bacteria genomes, Rickettsia prowazekii, Borrelia burgdorferi and E.coli, show that our approach improves the accuracy for finding the borders between coding and non-coding regions in DNA sequences. Conclusions This paper presents a new segmentation method in prokaryotes based on Jensen-Rényi divergence with a 22-symbol alphabet. For three bacteria genomes, comparing to A12_JR method, our method raised the accuracy of finding the borders between protein coding and non-coding regions in DNA sequences. PMID:23282225

  20. Recommendations for the classification of group A rotaviruses using all 11 genomic RNA segments.

    PubMed

    Matthijnssens, Jelle; Ciarlet, Max; Rahman, Mustafizur; Attoui, Houssam; Bányai, Krisztián; Estes, Mary K; Gentsch, Jon R; Iturriza-Gómara, Miren; Kirkwood, Carl D; Martella, Vito; Mertens, Peter P C; Nakagomi, Osamu; Patton, John T; Ruggeri, Franco M; Saif, Linda J; Santos, Norma; Steyer, Andrej; Taniguchi, Koki; Desselberger, Ulrich; Van Ranst, Marc

    2008-01-01

    Recently, a classification system was proposed for rotaviruses in which all the 11 genomic RNA segments are used (Matthijnssens et al. in J Virol 82:3204-3219, 2008). Based on nucleotide identity cut-off percentages, different genotypes were defined for each genome segment. A nomenclature for the comparison of complete rotavirus genomes was considered in which the notations Gx-P[x]-Ix-Rx-Cx-Mx-Ax-Nx-Tx-Ex-Hx are used for the VP7-VP4-VP6-VP1-VP2-VP3-NSP1-NSP2-NSP3-NSP4-NSP5/6 encoding genes, respectively. This classification system is an extension of the previously applied genotype-based system which made use of the rotavirus gene segments encoding VP4, VP7, VP6, and NSP4. In order to assign rotavirus strains to one of the established genotypes or a new genotype, a standard procedure is proposed in this report. As more human and animal rotavirus genomes will be completely sequenced, new genotypes for each of the 11 gene segments may be identified. A Rotavirus Classification Working Group (RCWG) including specialists in molecular virology, infectious diseases, epidemiology, and public health was formed, which can assist in the appropriate delineation of new genotypes, thus avoiding duplications and helping minimize errors. Scientists discovering a potentially new rotavirus genotype for any of the 11 gene segments are invited to send the novel sequence to the RCWG, where the sequence will be analyzed, and a new nomenclature will be advised as appropriate. The RCWG will update the list of classified strains regularly and make this accessible on a website. Close collaboration with the Study Group Reoviridae of the International Committee on the Taxonomy of Viruses will be maintained.

  1. The origins and impact of primate segmental duplications.

    PubMed

    Marques-Bonet, Tomas; Girirajan, Santhosh; Eichler, Evan E

    2009-10-01

    Duplicated sequences are substrates for the emergence of new genes and are an important source of genetic instability associated with rare and common diseases. Analyses of primate genomes have shown an increase in the proportion of interspersed segmental duplications (SDs) within the genomes of humans and great apes. This contrasts with other mammalian genomes that seem to have their recently duplicated sequences organized in a tandem configuration. In this review, we focus on the mechanistic origin and impact of this difference with respect to evolution, genetic diversity and primate phenotype. Although many genomes will be sequenced in the future, resolution of this aspect of genomic architecture still requires high quality sequences and detailed analyses.

  2. Assessment of Recombination in the S-segment Genome of Crimean-Congo Hemorrhagic Fever Virus in Iran

    PubMed Central

    Chinikar, Sadegh; Shah-Hosseini, Nariman; Bouzari, Saeid; Shokrgozar, Mohammad Ali; Mostafavi, Ehsan; Jalali, Tahmineh; Khakifirouz, Sahar; Groschup, Martin H; Niedrig, Matthias

    2016-01-01

    Background: Crimean-Congo Hemorrhagic Fever Virus (CCHFV) belongs to genus Nairovirus and family Bunyaviridae. The main aim of this study was to investigate the extent of recombination in S-segment genome of CCHFV in Iran. Methods: Samples were isolated from Iranian patients and those available in GenBank, and analyzed by phylogenetic and bootscan methods. Results: Through comparison of the phylogenetic trees based on full length sequences and partial fragments in the S-segment genome of CCHFV, genetic switch was evident, due to recombination event. Moreover, evidence of multiple recombination events was detected in query isolates when bootscan analysis was used by SimPlot software. Conclusion: Switch of different genomic regions between different strains by recombination could contribute to CCHFV diversification and evolution. The occurrence of recombination in CCHFV has a critical impact on epidemiological investigations and vaccine design. PMID:27047968

  3. Precursors of Professionalism in Senior-Level Undergraduate Business Students and the Implications of These Precursors for Business Education and the Profession

    ERIC Educational Resources Information Center

    Nino, Lana Sami

    2012-01-01

    Understanding the professional identity of senior-level undergraduate business students may shed light on the rampant unethical acts of business managers in industry. Business education is the largest segment of undergraduate majors, constituting more than 20% of students in four-year institutions, year after year. To explain the professional…

  4. An efficient and high fidelity method for amplification, cloning and sequencing of complete tospovirus genomic RNA segments

    USDA-ARS?s Scientific Manuscript database

    Amplification and sequencing of the complete M- and S-RNA segments of Tomato spotted wilt virus and Impatiens necrotic spot virus as a single fragment is useful for whole genome sequencing of tospoviruses co-infecting a single host plant. It avoids issues associated with overlapping amplicon-based ...

  5. The diversity of sequence and chromosomal distribution of new transposable element-related segments in the rye genome revealed by FISH and lineage annotation

    USDA-ARS?s Scientific Manuscript database

    The rye genome features a high percentage of repetitive elements, especially transposable elements (TEs). However, studies about the constitution and organization of TEs on rye chromosomes are limited. In this study, 97 unique TE segments were isolated and characterized; 50 TE segmemts showed varyin...

  6. Sequence analysis of the PIP5K locus in Eimeria maxima provides further evidence for eimerian genome plasticity and segmental organization.

    PubMed

    Song, B K; Pan, M Z; Lau, Y L; Wan, K L

    2014-07-29

    Commercial flocks infected by Eimeria species parasites, including Eimeria maxima, have an increased risk of developing clinical or subclinical coccidiosis; an intestinal enteritis associated with increased mortality rates in poultry. Currently, infection control is largely based on chemotherapy or live vaccines; however, drug resistance is common and vaccines are relatively expensive. The development of new cost-effective intervention measures will benefit from unraveling the complex genetic mechanisms that underlie host-parasite interactions, including the identification and characterization of genes encoding proteins such as phosphatidylinositol 4-phosphate 5-kinase (PIP5K). We previously identified a PIP5K coding sequence within the E. maxima genome. In this study, we analyzed two bacterial artificial chromosome clones presenting a ~145-kb E. maxima (Weybridge strain) genomic region spanning the PIP5K gene locus. Sequence analysis revealed that ~95% of the simple sequence repeats detected were located within regions comparable to the previously described feature-rich segments of the Eimeria tenella genome. Comparative sequence analysis with the orthologous E. maxima (Houghton strain) region revealed a moderate level of conserved synteny. Unique segmental organizations and telomere-like repeats were also observed in both genomes. A number of incomplete transposable elements were detected and further scrutiny of these elements in both orthologous segments revealed interesting nesting events, which may play a role in facilitating genome plasticity in E. maxima. The current analysis provides more detailed information about the genome organization of E. maxima and may help to reveal genotypic differences that are important for expression of traits related to pathogenicity and virulence.

  7. Delineating slowly and rapidly evolving fractions of the Drosophila genome.

    PubMed

    Keith, Jonathan M; Adams, Peter; Stephen, Stuart; Mattick, John S

    2008-05-01

    Evolutionary conservation is an important indicator of function and a major component of bioinformatic methods to identify non-protein-coding genes. We present a new Bayesian method for segmenting pairwise alignments of eukaryotic genomes while simultaneously classifying segments into slowly and rapidly evolving fractions. We also describe an information criterion similar to the Akaike Information Criterion (AIC) for determining the number of classes. Working with pairwise alignments enables detection of differences in conservation patterns among closely related species. We analyzed three whole-genome and three partial-genome pairwise alignments among eight Drosophila species. Three distinct classes of conservation level were detected. Sequences comprising the most slowly evolving component were consistent across a range of species pairs, and constituted approximately 62-66% of the D. melanogaster genome. Almost all (>90%) of the aligned protein-coding sequence is in this fraction, suggesting much of it (comprising the majority of the Drosophila genome, including approximately 56% of non-protein-coding sequences) is functional. The size and content of the most rapidly evolving component was species dependent, and varied from 1.6% to 4.8%. This fraction is also enriched for protein-coding sequence (while containing significant amounts of non-protein-coding sequence), suggesting it is under positive selection. We also classified segments according to conservation and GC content simultaneously. This analysis identified numerous sub-classes of those identified on the basis of conservation alone, but was nevertheless consistent with that classification. Software, data, and results available at www.maths.qut.edu.au/-keithj/. Genomic segments comprising the conservation classes available in BED format.

  8. Disruption of Specific RNA-RNA Interactions in a Double-Stranded RNA Virus Inhibits Genome Packaging and Virus Infectivity

    PubMed Central

    Fajardo, Teodoro; Sung, Po-Yu; Roy, Polly

    2015-01-01

    Bluetongue virus (BTV) causes hemorrhagic disease in economically important livestock. The BTV genome is organized into ten discrete double-stranded RNA molecules (S1-S10) which have been suggested to follow a sequential packaging pathway from smallest to largest segment during virus capsid assembly. To substantiate and extend these studies, we have investigated the RNA sorting and packaging mechanisms with a new experimental approach using inhibitory oligonucleotides. Putative packaging signals present in the 3’untranslated regions of BTV segments were targeted by a number of nuclease resistant oligoribonucleotides (ORNs) and their effects on virus replication in cell culture were assessed. ORNs complementary to the 3’ UTR of BTV RNAs significantly inhibited virus replication without affecting protein synthesis. Same ORNs were found to inhibit complex formation when added to a novel RNA-RNA interaction assay which measured the formation of supramolecular complexes between and among different RNA segments. ORNs targeting the 3’UTR of BTV segment 10, the smallest RNA segment, were shown to be the most potent and deletions or substitution mutations of the targeted sequences diminished the RNA complexes and abolished the recovery of viable viruses using reverse genetics. Cell-free capsid assembly/RNA packaging assay also confirmed that the inhibitory ORNs could interfere with RNA packaging and further substitution mutations within the putative RNA packaging sequence have identified the recognition sequence concerned. Exchange of 3’UTR between segments have further demonstrated that RNA recognition was segment specific, most likely acting as part of the secondary structure of the entire genomic segment. Our data confirm that genome packaging in this segmented dsRNA virus occurs via the formation of supramolecular complexes formed by the interaction of specific sequences located in the 3’ UTRs. Additionally, the inhibition of packaging in-trans with inhibitory ORNs suggests this that interaction is a bona fide target for the design of compounds with antiviral activity. PMID:26646790

  9. Disruption of Specific RNA-RNA Interactions in a Double-Stranded RNA Virus Inhibits Genome Packaging and Virus Infectivity.

    PubMed

    Fajardo, Teodoro; Sung, Po-Yu; Roy, Polly

    2015-12-01

    Bluetongue virus (BTV) causes hemorrhagic disease in economically important livestock. The BTV genome is organized into ten discrete double-stranded RNA molecules (S1-S10) which have been suggested to follow a sequential packaging pathway from smallest to largest segment during virus capsid assembly. To substantiate and extend these studies, we have investigated the RNA sorting and packaging mechanisms with a new experimental approach using inhibitory oligonucleotides. Putative packaging signals present in the 3'untranslated regions of BTV segments were targeted by a number of nuclease resistant oligoribonucleotides (ORNs) and their effects on virus replication in cell culture were assessed. ORNs complementary to the 3' UTR of BTV RNAs significantly inhibited virus replication without affecting protein synthesis. Same ORNs were found to inhibit complex formation when added to a novel RNA-RNA interaction assay which measured the formation of supramolecular complexes between and among different RNA segments. ORNs targeting the 3'UTR of BTV segment 10, the smallest RNA segment, were shown to be the most potent and deletions or substitution mutations of the targeted sequences diminished the RNA complexes and abolished the recovery of viable viruses using reverse genetics. Cell-free capsid assembly/RNA packaging assay also confirmed that the inhibitory ORNs could interfere with RNA packaging and further substitution mutations within the putative RNA packaging sequence have identified the recognition sequence concerned. Exchange of 3'UTR between segments have further demonstrated that RNA recognition was segment specific, most likely acting as part of the secondary structure of the entire genomic segment. Our data confirm that genome packaging in this segmented dsRNA virus occurs via the formation of supramolecular complexes formed by the interaction of specific sequences located in the 3' UTRs. Additionally, the inhibition of packaging in-trans with inhibitory ORNs suggests this that interaction is a bona fide target for the design of compounds with antiviral activity.

  10. Pressure for Pattern-Specific Intertypic Recombination between Sabin Polioviruses: Evolutionary Implications.

    PubMed

    Korotkova, Ekaterina; Laassri, Majid; Zagorodnyaya, Tatiana; Petrovskaya, Svetlana; Rodionova, Elvira; Cherkasova, Elena; Gmyl, Anatoly; Ivanova, Olga E; Eremeeva, Tatyana P; Lipskaya, Galina Y; Agol, Vadim I; Chumakov, Konstantin

    2017-11-22

    Complete genomic sequences of a non-redundant set of 70 recombinants between three serotypes of attenuated Sabin polioviruses as well as location (based on partial sequencing) of crossover sites of 28 additional recombinants were determined and compared with the previously published data. It is demonstrated that the genomes of Sabin viruses contain distinct strain-specific segments that are eliminated by recombination. The presumed low fitness of these segments could be linked to mutations acquired upon derivation of the vaccine strains and/or may have been present in wild-type parents of Sabin viruses. These "weak" segments contribute to the propensity of these viruses to recombine with each other and with other enteroviruses as well as determine the choice of crossover sites. The knowledge of location of such segments opens additional possibilities for the design of more genetically stable and/or more attenuated variants, i.e., candidates for new oral polio vaccines. The results also suggest that the genome of wild polioviruses, and, by generalization, of other RNA viruses, may harbor hidden low-fitness segments that can be readily eliminated only by recombination.

  11. A segmentation/clustering model for the analysis of array CGH data.

    PubMed

    Picard, F; Robin, S; Lebarbier, E; Daudin, J-J

    2007-09-01

    Microarray-CGH (comparative genomic hybridization) experiments are used to detect and map chromosomal imbalances. A CGH profile can be viewed as a succession of segments that represent homogeneous regions in the genome whose representative sequences share the same relative copy number on average. Segmentation methods constitute a natural framework for the analysis, but they do not provide a biological status for the detected segments. We propose a new model for this segmentation/clustering problem, combining a segmentation model with a mixture model. We present a new hybrid algorithm called dynamic programming-expectation maximization (DP-EM) to estimate the parameters of the model by maximum likelihood. This algorithm combines DP and the EM algorithm. We also propose a model selection heuristic to select the number of clusters and the number of segments. An example of our procedure is presented, based on publicly available data sets. We compare our method to segmentation methods and to hidden Markov models, and we show that the new segmentation/clustering model is a promising alternative that can be applied in the more general context of signal processing.

  12. Inheritance of Trans Chromosomal Methylation patterns from Arabidopsis F1 hybrids

    PubMed Central

    Greaves, Ian K.; Groszmann, Michael; Wang, Aihua; Peacock, W. James; Dennis, Elizabeth S.

    2014-01-01

    Hybridization in plants leads to transinteractions between the parental genomes and epigenomes that can result in changes to both 24 nt siRNA and cytosine methylation (mC) levels in the hybrid. In Arabidopsis the principle processes altering the hybrid methylome are Trans Chromosomal Methylation (TCM) and Trans Chromosomal deMethylation (TCdM) in which the mC pattern of a genomic segment attains the same mC pattern of the corresponding segment on the other parental chromosome. We examined two loci that undergo TCM/TCdM in the Arabidopsis C24/Landsberg erecta (Ler) F1 hybrids, which show patterns of inheritance dependent on the properties of the particular donor and recipient chromosomal segments. At At1g64790 the TCM- and TCdM-derived mC patterns are maintained in the F2 generation but are transmitted in outcrosses or backcrosses only by the C24 genomic segment. At a region between and adjacent to At3g43340 and At3g43350, the originally unmethylated Ler genomic segment receives the C24 mC pattern in the F1, which is then maintained in backcross plants independent of the presence of the parental C24 segment. In backcrosses to an unmethylated Ler allele, the newly methylated F1 Ler segment may act as a TCM source in a process comparable to paramutation in maize. TCM-derived mC patterns are associated with reduced expression of both At3g43340 and At3g43350 in F1 and F2 plants, providing support for such events influencing the transcriptome. The inheritance of the F1 mC patterns and the segregation of other genetic and epigenetic determinants may contribute to the reduced hybrid vigor in the F2 and subsequent generations. PMID:24449910

  13. Inheritance of Trans Chromosomal Methylation patterns from Arabidopsis F1 hybrids.

    PubMed

    Greaves, Ian K; Groszmann, Michael; Wang, Aihua; Peacock, W James; Dennis, Elizabeth S

    2014-02-04

    Hybridization in plants leads to transinteractions between the parental genomes and epigenomes that can result in changes to both 24 nt siRNA and cytosine methylation ((m)C) levels in the hybrid. In Arabidopsis the principle processes altering the hybrid methylome are Trans Chromosomal Methylation (TCM) and Trans Chromosomal deMethylation (TCdM) in which the (m)C pattern of a genomic segment attains the same (m)C pattern of the corresponding segment on the other parental chromosome. We examined two loci that undergo TCM/TCdM in the Arabidopsis C24/Landsberg erecta (Ler) F1 hybrids, which show patterns of inheritance dependent on the properties of the particular donor and recipient chromosomal segments. At At1g64790 the TCM- and TCdM-derived (m)C patterns are maintained in the F2 generation but are transmitted in outcrosses or backcrosses only by the C24 genomic segment. At a region between and adjacent to At3g43340 and At3g43350, the originally unmethylated Ler genomic segment receives the C24 (m)C pattern in the F1, which is then maintained in backcross plants independent of the presence of the parental C24 segment. In backcrosses to an unmethylated Ler allele, the newly methylated F1 Ler segment may act as a TCM source in a process comparable to paramutation in maize. TCM-derived (m)C patterns are associated with reduced expression of both At3g43340 and At3g43350 in F1 and F2 plants, providing support for such events influencing the transcriptome. The inheritance of the F1 (m)C patterns and the segregation of other genetic and epigenetic determinants may contribute to the reduced hybrid vigor in the F2 and subsequent generations.

  14. Potential for La Crosse virus segment reassortment in nature

    PubMed Central

    Reese, Sara M; Blitvich, Bradley J; Blair, Carol D; Geske, Dave; Beaty, Barry J; Black, William C

    2008-01-01

    The evolutionary success of La Crosse virus (LACV, family Bunyaviridae) is due to its ability to adapt to changing conditions through intramolecular genetic changes and segment reassortment. Vertical transmission of LACV in mosquitoes increases the potential for segment reassortment. Studies were conducted to determine if segment reassortment was occurring in naturally infected Aedes triseriatus from Wisconsin and Minnesota in 2000, 2004, 2006 and 2007. Mosquito eggs were collected from various sites in Wisconsin and Minnesota. They were reared in the laboratory and adults were tested for LACV antigen by immunofluorescence assay. RNA was isolated from the abdomen of infected mosquitoes and portions of the small (S), medium (M) and large (L) viral genome segments were amplified by RT-PCR and sequenced. Overall, the viral sequences from 40 infected mosquitoes and 5 virus isolates were analyzed. Phylogenetic and linkage disequilibrium analyses revealed that approximately 25% of infected mosquitoes and viruses contained reassorted genome segments, suggesting that LACV segment reassortment is frequent in nature. PMID:19114023

  15. Segmenting the human genome based on states of neutral genetic divergence.

    PubMed

    Kuruppumullage Don, Prabhani; Ananda, Guruprasad; Chiaromonte, Francesca; Makova, Kateryna D

    2013-09-03

    Many studies have demonstrated that divergence levels generated by different mutation types vary and covary across the human genome. To improve our still-incomplete understanding of the mechanistic basis of this phenomenon, we analyze several mutation types simultaneously, anchoring their variation to specific regions of the genome. Using hidden Markov models on insertion, deletion, nucleotide substitution, and microsatellite divergence estimates inferred from human-orangutan alignments of neutrally evolving genomic sequences, we segment the human genome into regions corresponding to different divergence states--each uniquely characterized by specific combinations of divergence levels. We then parsed the mutagenic contributions of various biochemical processes associating divergence states with a broad range of genomic landscape features. We find that high divergence states inhabit guanine- and cytosine (GC)-rich, highly recombining subtelomeric regions; low divergence states cover inner parts of autosomes; chromosome X forms its own state with lowest divergence; and a state of elevated microsatellite mutability is interspersed across the genome. These general trends are mirrored in human diversity data from the 1000 Genomes Project, and departures from them highlight the evolutionary history of primate chromosomes. We also find that genes and noncoding functional marks [annotations from the Encyclopedia of DNA Elements (ENCODE)] are concentrated in high divergence states. Our results provide a powerful tool for biomedical data analysis: segmentations can be used to screen personal genome variants--including those associated with cancer and other diseases--and to improve computational predictions of noncoding functional elements.

  16. Reconstituted TOM core complex and Tim9/Tim10 complex of mitochondria are sufficient for translocation of the ADP/ATP carrier across membranes.

    PubMed

    Vasiljev, Andreja; Ahting, Uwe; Nargang, Frank E; Go, Nancy E; Habib, Shukry J; Kozany, Christian; Panneels, Valérie; Sinning, Irmgard; Prokisch, Holger; Neupert, Walter; Nussberger, Stephan; Rapaport, Doron

    2004-03-01

    Precursor proteins of the solute carrier family and of channel forming Tim components are imported into mitochondria in two main steps. First, they are translocated through the TOM complex in the outer membrane, a process assisted by the Tim9/Tim10 complex. They are passed on to the TIM22 complex, which facilitates their insertion into the inner membrane. In the present study, we have analyzed the function of the Tim9/Tim10 complex in the translocation of substrates across the outer membrane of mitochondria. The purified TOM core complex was reconstituted into lipid vesicles in which purified Tim9/Tim10 complex was entrapped. The precursor of the ADP/ATP carrier (AAC) was found to be translocated across the membrane of such lipid vesicles. Thus, these components are sufficient for translocation of AAC precursor across the outer membrane. Peptide libraries covering various substrate proteins were used to identify segments that are bound by Tim9/Tim10 complex upon translocation through the TOM complex. The patterns of binding sites on the substrate proteins suggest a mechanism by which portions of membrane-spanning segments together with flanking hydrophilic segments are recognized and bound by the Tim9/Tim10 complex as they emerge from the TOM complex into the intermembrane space.

  17. The distribution of genome shared identical by descent for a pair of full sibs by means of the continuous time Markov chain

    NASA Astrophysics Data System (ADS)

    Julie, Hongki; Pasaribu, Udjianna S.; Pancoro, Adi

    2015-12-01

    This paper will allow Markov Chain's application in genome shared identical by descent by two individual at full sibs model. The full sibs model was a continuous time Markov Chain with three state. In the full sibs model, we look for the cumulative distribution function of the number of sub segment which have 2 IBD haplotypes from a segment of the chromosome which the length is t Morgan and the cumulative distribution function of the number of sub segment which have at least 1 IBD haplotypes from a segment of the chromosome which the length is t Morgan. This cumulative distribution function will be developed by the moment generating function.

  18. Simultaneous and Sequential MS/MS Scan Combinations and Permutations in a Linear Quadrupole Ion Trap.

    PubMed

    Snyder, Dalton T; Szalwinski, Lucas J; Cooks, R Graham

    2017-10-17

    Methods of performing precursor ion scans as well as neutral loss scans in a single linear quadrupole ion trap have recently been described. In this paper we report methodology for performing permutations of MS/MS scan modes, that is, ordered combinations of precursor, product, and neutral loss scans following a single ion injection event. Only particular permutations are allowed; the sequences demonstrated here are (1) multiple precursor ion scans, (2) precursor ion scans followed by a single neutral loss scan, (3) precursor ion scans followed by product ion scans, and (4) segmented neutral loss scans. (5) The common product ion scan can be performed earlier in these sequences, under certain conditions. Simultaneous scans can also be performed. These include multiple precursor ion scans, precursor ion scans with an accompanying neutral loss scan, and multiple neutral loss scans. We argue that the new capability to perform complex simultaneous and sequential MS n operations on single ion populations represents a significant step in increasing the selectivity of mass spectrometry.

  19. MSuPDA: A Memory Efficient Algorithm for Sequence Alignment.

    PubMed

    Khan, Mohammad Ibrahim; Kamal, Md Sarwar; Chowdhury, Linkon

    2016-03-01

    Space complexity is a million dollar question in DNA sequence alignments. In this regard, memory saving under pushdown automata can help to reduce the occupied spaces in computer memory. Our proposed process is that anchor seed (AS) will be selected from given data set of nucleotide base pairs for local sequence alignment. Quick splitting techniques will separate the AS from all the DNA genome segments. Selected AS will be placed to pushdown automata's (PDA) input unit. Whole DNA genome segments will be placed into PDA's stack. AS from input unit will be matched with the DNA genome segments from stack of PDA. Match, mismatch and indel of nucleotides will be popped from the stack under the control unit of pushdown automata. During the POP operation on stack, it will free the memory cell occupied by the nucleotide base pair.

  20. Whole-genome analyses of DS-1-like human G2P[4] and G8P[4] rotavirus strains from Eastern, Western and Southern Africa

    PubMed Central

    Nyaga, Martin M.; Stucker, Karla M.; Esona, Mathew D.; Jere, Khuzwayo C.; Mwinyi, Bakari; Shonhai, Annie; Tsolenyanu, Enyonam; Mulindwa, Augustine; Chibumbya, Julia N.; Adolfine, Hokororo; Halpin, Rebecca A.; Roy, Sunando; Stockwell, Timothy B.; Berejena, Chipo; Seheri, Mapaseka L.; Mwenda, Jason M.; Steele, A. Duncan; Wentworth, David E.

    2018-01-01

    Group A rotaviruses (RVAs) with distinct G and P genotype combinations have been reported globally. We report the genome composition and possible origin of seven G8P[4] and five G2P[4] human RVA strains based on the genetic evolution of all 11 genome segments at the nucleotide level. Twelve RVA ELISA positive stool samples collected in the representative countries of Eastern, Southern and West Africa during the 2007–2012 surveillance seasons were subjected to sequencing using the Ion Torrent PGM and Illumina MiSeq platforms. A reference-based assembly was performed using CLC Bio’s clc_ref_assemble_long program, and full-genome consensus sequences were obtained. With the exception of the neutralising antigen, VP7, all study strains exhibited the DS-1-like genome constellation (P[4]-I2-R2-C2-M2-A2-N2-T2-E2-H2) and clustered phylogenetically with reference strains having a DS-1-like genetic backbone. Comparison of the nucleotide and amino acid sequences with selected global cognate genome segments revealed nucleotide and amino acid sequence identities of 81.7–100 % and 90.6–100 %, respectively, with NSP4 gene segment showing the most diversity among the strains. Bayesian analyses of all gene sequences to estimate the time of divergence of the lineage indicated that divergence times ranged from 16 to 44 years, except for the NSP4 gene where the lineage seemed to arise in the more distant past at an estimated 203 years ago. However, the long-term effects of changes found within the NSP4 genome segment should be further explored, and thus we recommend continued whole-genome analyses from larger sample sets to determine the evolutionary mechanisms of the DS-1-like strains collected in Africa. PMID:24952422

  1. Errant processing and structural alterations of genomes present in a varicella-zoster virus vaccine.

    PubMed Central

    Vlazny, D A; Hyman, R W

    1985-01-01

    Five minority populations of aberrant, varicella-zoster virus (VZV)-derived genomes were identified among the encapsidated DNAs obtained from the nuclear and cytoplasmic fractions of an in vitro infection initiated with a lyophilized sample of the BIKEN VZV vaccine (strain Oka). These were (i) VZV genomes, present within nuclear but not cytoplasmic viral capsids, which had been cleaved at a specific site within the short segment and which were, therefore, 3.15 megadaltons (approximately 4% of the VZV genome length) short of full length; (ii) highly deleted, repetitive VZV genomes which contained the errant cleavage site but not the usual VZV genome terminal sequences; (iii) VZV genomes into which multiples of 1 through 5 defective genome repeat units had been inserted into a homologous site; (iv) VZV genomes with additions of 0.1 or 0.18 megadaltons of DNA at both the terminal and internal ends of the short segment; and (v) VZV DNA which had lost the HindIII restriction site at map position 0.11. Images PMID:2993670

  2. Studies on cattle genomic structural variation provide insights into ruminant speciation and adaptation

    USDA-ARS?s Scientific Manuscript database

    Genomic structural variations, including segmental duplications (SD) and copy number variations (CNV), contribute significantly to individual health and disease in primates and rodents. As a part of the bovine genome annotation effort, we performed the first genome-wide analysis of SD in cattle usin...

  3. Comparative ruminant genomics highlights segmental duplication and mobile element insertion diversity

    USDA-ARS?s Scientific Manuscript database

    We have expanded upon a previously reported comparative genomics approach using a read-depth (JaRMs) and a hybrid read-pair, split-read (RAPTR-SV) copy number variation (CNV) detection method that uses read alignments to the cattle reference genome in order to identify species-specific genomic rearr...

  4. Self-organizing approach for meta-genomes.

    PubMed

    Zhu, Jianfeng; Zheng, Wei-Mou

    2014-12-01

    We extend the self-organizing approach for annotation of a bacterial genome to analyze the raw sequencing data of the human gut metagenome without sequence assembling. The original approach divides the genomic sequence of a bacterium into non-overlapping segments of equal length and assigns to each segment one of seven 'phases', among which one is for the noncoding regions, three for the direct coding regions to indicate the three possible codon positions of the segment starting site, and three for the reverse coding regions. The noncoding phase and the six coding phases are described by two frequency tables of the 64 triplet types or 'codon usages'. A set of codon usages can be used to update the phase assignment and vice versa. An iteration after an initialization leads to a convergent phase assignment to give an annotation of the genome. In the extension of the approach to a metagenome, we consider a mixture model of a number of categories described by different codon usages. The Illumina Genome Analyzer sequencing data of the total DNA from faecal samples are then examined to understand the diversity of the human gut microbiome. Copyright © 2014 Elsevier Ltd. All rights reserved.

  5. Identification of residual leukemic cells by flow cytometry in childhood B-cell precursor acute lymphoblastic leukemia: verification of leukemic state by flow-sorting and molecular/cytogenetic methods.

    PubMed

    Øbro, Nina F; Ryder, Lars P; Madsen, Hans O; Andersen, Mette K; Lausen, Birgitte; Hasle, Henrik; Schmiegelow, Kjeld; Marquart, Hanne V

    2012-01-01

    Reduction in minimal residual disease, measured by real-time quantitative PCR or flow cytometry, predicts prognosis in childhood B-cell precursor acute lymphoblastic leukemia. We explored whether cells reported as minimal residual disease by flow cytometry represent the malignant clone harboring clone-specific genomic markers (53 follow-up bone marrow samples from 28 children with B-cell precursor acute lymphoblastic leukemia). Cell populations (presumed leukemic and non-leukemic) were flow-sorted during standard flow cytometry-based minimal residual disease monitoring and explored by PCR and/or fluorescence in situ hybridization. We found good concordance between flow cytometry and genomic analyses in the individual flow-sorted leukemic (93% true positive) and normal (93% true negative) cell populations. Four cases with discrepant results had plausible explanations (e.g. partly informative immunophenotype and antigen modulation) that highlight important methodological pitfalls. These findings demonstrate that with sufficient experience, flow cytometry is reliable for minimal residual disease monitoring in B-cell precursor acute lymphoblastic leukemia, although rare cases require supplementary PCR-based monitoring.

  6. Segment-Wise Genome-Wide Association Analysis Identifies a Candidate Region Associated with Schizophrenia in Three Independent Samples

    PubMed Central

    Rietschel, Marcella; Mattheisen, Manuel; Breuer, René; Schulze, Thomas G.; Nöthen, Markus M.; Levinson, Douglas; Shi, Jianxin; Gejman, Pablo V.; Cichon, Sven; Ophoff, Roel A.

    2012-01-01

    Recent studies suggest that variation in complex disorders (e.g., schizophrenia) is explained by a large number of genetic variants with small effect size (Odds Ratio∼1.05–1.1). The statistical power to detect these genetic variants in Genome Wide Association (GWA) studies with large numbers of cases and controls (∼15,000) is still low. As it will be difficult to further increase sample size, we decided to explore an alternative method for analyzing GWA data in a study of schizophrenia, dramatically reducing the number of statistical tests. The underlying hypothesis was that at least some of the genetic variants related to a common outcome are collocated in segments of chromosomes at a wider scale than single genes. Our approach was therefore to study the association between relatively large segments of DNA and disease status. An association test was performed for each SNP and the number of nominally significant tests in a segment was counted. We then performed a permutation-based binomial test to determine whether this region contained significantly more nominally significant SNPs than expected under the null hypothesis of no association, taking linkage into account. Genome Wide Association data of three independent schizophrenia case/control cohorts with European ancestry (Dutch, German, and US) using segments of DNA with variable length (2 to 32 Mbp) was analyzed. Using this approach we identified a region at chromosome 5q23.3-q31.3 (128–160 Mbp) that was significantly enriched with nominally associated SNPs in three independent case-control samples. We conclude that considering relatively wide segments of chromosomes may reveal reliable relationships between the genome and schizophrenia, suggesting novel methodological possibilities as well as raising theoretical questions. PMID:22723893

  7. PhyloFlu, a DNA microarray for determining the phylogenetic origin of influenza A virus gene segments and the genomic fingerprint of viral strains.

    PubMed

    Paulin, Luis F; de los D Soto-Del Río, María; Sánchez, Iván; Hernández, Jesús; Gutiérrez-Ríos, Rosa M; López-Martínez, Irma; Wong-Chew, Rosa M; Parissi-Crivelli, Aurora; Isa, P; López, Susana; Arias, Carlos F

    2014-03-01

    Recent evidence suggests that most influenza A virus gene segments can contribute to the pathogenicity of the virus. In this regard, the hemagglutinin (HA) subtype of the circulating strains has been closely surveyed, but the reassortment of internal gene segments is usually not monitored as a potential source of an increased pathogenicity. In this work, an oligonucleotide DNA microarray (PhyloFlu) designed to determine the phylogenetic origins of the eight segments of the influenza virus genome was constructed and validated. Clades were defined for each segment and also for the 16 HA and 9 neuraminidase (NA) subtypes. Viral genetic material was amplified by reverse transcription-PCR (RT-PCR) with primers specific to the conserved 5' and 3' ends of the influenza A virus genes, followed by PCR amplification with random primers and Cy3 labeling. The microarray unambiguously determined the clades for all eight influenza virus genes in 74% (28/38) of the samples. The microarray was validated with reference strains from different animal origins, as well as from human, swine, and avian viruses from field or clinical samples. In most cases, the phylogenetic clade of each segment defined its animal host of origin. The genomic fingerprint deduced by the combined information of the individual clades allowed for the determination of the time and place that strains with the same genomic pattern were previously reported. PhyloFlu is useful for characterizing and surveying the genetic diversity and variation of animal viruses circulating in different environmental niches and for obtaining a more detailed surveillance and follow up of reassortant events that can potentially modify virus pathogenicity.

  8. Evolutionary force of AT-rich repeats to trap genomic and episomal DNAs into the rice genome: lessons from endogenous pararetrovirus.

    PubMed

    Liu, Ruifang; Koyanagi, Kanako O; Chen, Sunlu; Kishima, Yuji

    2012-12-01

    In plant genomes, the incorporation of DNA segments is not a common method of artificial gene transfer. Nevertheless, various segments of pararetroviruses have been found in plant genomes in recent decades. The rice genome contains a number of segments of endogenous rice tungro bacilliform virus-like sequences (ERTBVs), many of which are present between AT dinucleotide repeats (ATrs). Comparison of genomic sequences between two closely related rice subspecies, japonica and indica, allowed us to verify the preferential insertion of ERTBVs into ATrs. In addition to ERTBVs, the comparative analyses showed that ATrs occasionally incorporate repeat sequences including transposable elements, and a wide range of other sequences. Besides the known genomic sequences, the insertion sequences also represented DNAs of unclear origins together with ERTBVs, suggesting that ATrs have integrated episomal DNAs that would have been suspended in the nucleus. Such insertion DNAs might be trapped by ATrs in the genome in a host-dependent manner. Conversely, other simple mono- and dinucleotide sequence repeats (SSR) were less frequently involved in insertion events relative to ATrs. Therefore, ATrs could be regarded as hot spots of double-strand breaks that induce non-homologous end joining. The insertions within ATrs occasionally generated new gene-related sequences or involved structural modifications of existing genes. Likewise, in a comparison between Arabidopsis thaliana and Arabidopsis lyrata, the insertions preferred ATrs to other SSRs. Therefore ATrs in plant genomes could be considered as genomic dumping sites that have trapped various DNA molecules and may have exerted a powerful evolutionary force. © 2012 The Authors. The Plant Journal © 2012 Blackwell Publishing Ltd.

  9. Characterization of a polyketide synthase in Aspergillus niger whose product is a precursor for both dihydroxynaphthalene (DHN) melanin and naphtho-γ-pyrone.

    DOE Office of Scientific and Technical Information (OSTI.GOV)

    Chiang, Yi Ming; Meyer, Kristen M; Praseuth, Michael

    2010-12-06

    The genome sequencing of the fungus Aspergillus niger, an industrial workhorse, uncovered a large cache of genes encoding enzymes thought to be involved in the production of secondary metabolites yet to be identified. Identification and structural characterization of many of these predicted secondary metabolites are hampered by their low concentration relative to the known A. niger metabolites such as the naphtho-γ-pyrone family of polyketides. We deleted a nonreducing PKS gene in A. niger strain ATCC 11414, a daughter strain of A. niger ATCC strain 1015 whose genome was sequenced by the DOE Joint Genome Institute. This PKS encoding gene ismore » a predicted ortholog of alb1 from Aspergillus fumigatus which is responsible for production of YWA1, a precursor of fungal DHN melanin. Our results show that the A. niger alb1 PKS is responsible for the production of the polyketide precursor for DHN melanin biosynthesis. Deletion of alb1 elimnates the production of major metabolites, naphtho-γ-pyrones. The generation of an A. niger strain devoid of naphtho-γ-pyrones will greatly facilitate the elucidation of cryptic biosynthetic pathways in this organism.« less

  10. Characterization of Urtica dioica agglutinin isolectins and the encoding gene family.

    PubMed

    Does, M P; Ng, D K; Dekker, H L; Peumans, W J; Houterman, P M; Van Damme, E J; Cornelissen, B J

    1999-01-01

    Urtica dioica agglutinin (UDA) has previously been found in roots and rhizomes of stinging nettles as a mixture of UDA-isolectins. Protein and cDNA sequencing have shown that mature UDA is composed of two hevein domains and is processed from a precursor protein. The precursor contains a signal peptide, two in-tandem hevein domains, a hinge region and a carboxyl-terminal chitinase domain. Genomic fragments encoding precursors for UDA-isolectins have been amplified by five independent polymerase chain reactions on genomic DNA from stinging nettle ecotype Weerselo. One amplified gene was completely sequenced. As compared to the published cDNA sequence, the genomic sequence contains, besides two basepair substitutions, two introns located at the same positions as in other plant chitinases. By partial sequence analysis of 40 amplified genes, 16 different genes were identified which encode seven putative UDA-isolectins. The deduced amino acid sequences share 78.9-98.9% identity. In extracts of roots and rhizomes of stinging nettle ecotype Weerselo six out of these seven isolectins were detected by mass spectrometry. One of them is an acidic form, which has not been identified before. Our results demonstrate that UDA is encoded by a large gene family.

  11. ICTV Virus Taxonomy Profile: Chrysoviridae.

    PubMed

    Ghabrial, Said A; Castón, José R; Coutts, Robert H A; Hillman, Bradley I; Jiang, Daohong; Kim, Dae-Hyun; Moriyama, Hiromitsu; Ictv Report Consortium

    2018-01-01

    The Chrysoviridae is a family of small, isometric, non-enveloped viruses (40 nm in diameter) with segmented dsRNA genomes (typically four segments). The genome segments are individually encapsidated and together comprise 11.5-12.8 kbp. The single genus Chrysovirus includes nine species. Chrysoviruses lack an extracellular phase to their life cycle; they are transmitted via intracellular routes within an individual during hyphal growth, in asexual or sexual spores, or between individuals via hyphal anastomosis. There are no known natural vectors for chrysoviruses. This is a summary of the International Committee on Taxonomy of Viruses (ICTV) Report on the taxonomy of the Chrysoviridae, which is available at www.ictv.global/report/chrysoviridae.

  12. MSuPDA: A memory efficient algorithm for sequence alignment.

    PubMed

    Khan, Mohammad Ibrahim; Kamal, Md Sarwar; Chowdhury, Linkon

    2015-01-16

    Space complexity is a million dollar question in DNA sequence alignments. In this regards, MSuPDA (Memory Saving under Pushdown Automata) can help to reduce the occupied spaces in computer memory. Our proposed process is that Anchor Seed (AS) will be selected from given data set of Nucleotides base pairs for local sequence alignment. Quick Splitting (QS) techniques will separate the Anchor Seed from all the DNA genome segments. Selected Anchor Seed will be placed to pushdown Automata's (PDA) input unit. Whole DNA genome segments will be placed into PDA's stack. Anchor Seed from input unit will be matched with the DNA genome segments from stack of PDA. Whatever matches, mismatches or Indel, of Nucleotides will be POP from the stack under the control of control unit of Pushdown Automata. During the POP operation on stack it will free the memory cell occupied by the Nucleotide base pair.

  13. Hox gene control of segment-specific bristle patterns in Drosophila

    PubMed Central

    Rozowski, Marion; Akam, Michael

    2002-01-01

    Hox genes specify the different morphologies of segments along the anteroposterior axis of animals. How they control complex segment morphologies is not well understood. We have studied how the Hox gene Ultrabithorax (Ubx) controls specific differences between the bristle patterns of the second and third thoracic segments (T2 and T3) of Drosophila melanogaster. We find that Ubx blocks the development of two particular bristles on T3 at different points in sensory organ development. For the apical bristle, a precursor is singled out and undergoes a first division in both the second and third legs, but in the third leg further differentiation of the second-order precursors is blocked. For the posterior sternopleural bristle, development on T3 ceases after proneural cluster initiation. Analysis of the temporal requirement for Ubx shows that in both cases Ubx function is required shortly before bristle development is blocked. We suggest that interactions between Ubx and the bristle patterning hierarchy have evolved independently on many occasions, affecting different molecular steps. The effects of Ubx on bristle development are highly dependent on the context of other patterning information. Suppression of bristle development or changes in bristle morphology in response to endogenous and ectopic Ubx expression are limited to bristles at specific locations. PMID:12000797

  14. Genetic Diversity of Crimean Congo Hemorrhagic Fever Virus Strains from Iran

    PubMed Central

    Chinikar, Sadegh; Bouzari, Saeid; Shokrgozar, Mohammad Ali; Mostafavi, Ehsan; Jalali, Tahmineh; Khakifirouz, Sahar; Nowotny, Norbert; Fooks, Anthony R.; Shah-Hosseini, Nariman

    2016-01-01

    Background: Crimean Congo hemorrhagic fever virus (CCHFV) is a member of the Bunyaviridae family and Nairovirus genus. It has a negative-sense, single stranded RNA genome approximately 19.2 kb, containing the Small, Medium, and Large segments. CCHFVs are relatively divergent in their genome sequence and grouped in seven distinct clades based on S-segment sequence analysis and six clades based on M-segment sequences. Our aim was to obtain new insights into the molecular epidemiology of CCHFV in Iran. Methods: We analyzed partial and complete nucleotide sequences of the S and M segments derived from 50 Iranian patients. The extracted RNA was amplified using one-step RT-PCR and then sequenced. The sequences were analyzed using Mega5 software. Results: Phylogenetic analysis of partial S segment sequences demonstrated that clade IV-(Asia 1), clade IV-(Asia 2) and clade V-(Europe) accounted for 80 %, 4 % and 14 % of the circulating genomic variants of CCHFV in Iran respectively. However, one of the Iranian strains (Iran-Kerman/22) was associated with none of other sequences and formed a new clade (VII). The phylogenetic analysis of complete S-segment nucleotide sequences from selected Iranian CCHFV strains complemented with representative strains from GenBank revealed similar topology as partial sequences with eight major clusters. A partial M segment phylogeny positioned the Iranian strains in either association with clade III (Asia-Africa) or clade V (Europe). Conclusion: The phylogenetic analysis revealed subtle links between distant geographic locations, which we propose might originate either from international livestock trade or from long-distance carriage of CCHFV by infected ticks via bird migration. PMID:27308271

  15. Full-Genome Sequencing as a Basis for Molecular Epidemiology Studies of Bluetongue Virus in India

    PubMed Central

    Maan, Sushila; Maan, Narender S.; Belaganahalli, Manjunatha N.; Rao, Pavuluri Panduranga; Singh, Karam Pal; Hemadri, Divakar; Putty, Kalyani; Kumar, Aman; Batra, Kanisht; Krishnajyothi, Yadlapati; Chandel, Bharat S.; Reddy, G. Hanmanth; Nomikou, Kyriaki; Reddy, Yella Narasimha; Attoui, Houssam; Hegde, Nagendra R.; Mertens, Peter P. C.

    2015-01-01

    Since 1998 there have been significant changes in the global distribution of bluetongue virus (BTV). Ten previously exotic BTV serotypes have been detected in Europe, causing severe disease outbreaks in naïve ruminant populations. Previously exotic BTV serotypes were also identified in the USA, Israel, Australia and India. BTV is transmitted by biting midges (Culicoides spp.) and changes in the distribution of vector species, climate change, increased international travel and trade are thought to have contributed to these events. Thirteen BTV serotypes have been isolated in India since first reports of the disease in the country during 1964. Efficient methods for preparation of viral dsRNA and cDNA synthesis, have facilitated full-genome sequencing of BTV strains from the region. These studies introduce a new approach for BTV characterization, based on full-genome sequencing and phylogenetic analyses, facilitating the identification of BTV serotype, topotype and reassortant strains. Phylogenetic analyses show that most of the equivalent genome-segments of Indian BTV strains are closely related, clustering within a major eastern BTV ‘topotype’. However, genome-segment 5 (Seg-5) encoding NS1, from multiple post 1982 Indian isolates, originated from a western BTV topotype. All ten genome-segments of BTV-2 isolates (IND2003/01, IND2003/02 and IND2003/03) are closely related (>99% identity) to a South African BTV-2 vaccine-strain (western topotype). Similarly BTV-10 isolates (IND2003/06; IND2005/04) show >99% identity in all genome segments, to the prototype BTV-10 (CA-8) strain from the USA. These data suggest repeated introductions of western BTV field and/or vaccine-strains into India, potentially linked to animal or vector-insect movements, or unauthorised use of ‘live’ South African or American BTV-vaccines in the country. The data presented will help improve nucleic acid based diagnostics for Indian serotypes/topotypes, as part of control strategies. PMID:26121128

  16. Segmental duplications: evolution and impact among the current Lepidoptera genomes.

    PubMed

    Zhao, Qian; Ma, Dongna; Vasseur, Liette; You, Minsheng

    2017-07-06

    Structural variation among genomes is now viewed to be as important as single nucleoid polymorphisms in influencing the phenotype and evolution of a species. Segmental duplication (SD) is defined as segments of DNA with homologous sequence. Here, we performed a systematic analysis of segmental duplications (SDs) among five lepidopteran reference genomes (Plutella xylostella, Danaus plexippus, Bombyx mori, Manduca sexta and Heliconius melpomene) to understand their potential impact on the evolution of these species. We find that the SDs content differed substantially among species, ranging from 1.2% of the genome in B. mori to 15.2% in H. melpomene. Most SDs formed very high identity (similarity higher than 90%) blocks but had very few large blocks. Comparative analysis showed that most of the SDs arose after the divergence of each linage and we found that P. xylostella and H. melpomene showed more duplications than other species, suggesting they might be able to tolerate extensive levels of variation in their genomes. Conserved ancestral and species specific SD events were assessed, revealing multiple examples of the gain, loss or maintenance of SDs over time. SDs content analysis showed that most of the genes embedded in SDs regions belonged to species-specific SDs ("Unique" SDs). Functional analysis of these genes suggested their potential roles in the lineage-specific evolution. SDs and flanking regions often contained transposable elements (TEs) and this association suggested some involvement in SDs formation. Further studies on comparison of gene expression level between SDs and non-SDs showed that the expression level of genes embedded in SDs was significantly lower, suggesting that structure changes in the genomes are involved in gene expression differences in species. The results showed that most of the SDs were "unique SDs", which originated after species formation. Functional analysis suggested that SDs might play different roles in different species. Our results provide a valuable resource beyond the genetic mutation to explore the genome structure for future Lepidoptera research.

  17. Terminal-Repeat Retrotransposons with GAG Domain in Plant Genomes: A New Testimony on the Complex World of Transposable Elements

    PubMed Central

    Chaparro, Cristian; Gayraud, Thomas; de Souza, Rogerio Fernandes; Domingues, Douglas Silva; Akaffou, Sélastique; Laforga Vanzela, Andre Luis; de Kochko, Alexandre; Rigoreau, Michel; Crouzillat, Dominique; Hamon, Serge; Hamon, Perla; Guyot, Romain

    2015-01-01

    A novel structure of nonautonomous long terminal repeat (LTR) retrotransposons called terminal repeat with GAG domain (TR-GAG) has been described in plants, both in monocotyledonous, dicotyledonous and basal angiosperm genomes. TR-GAGs are relatively short elements in length (<4 kb) showing the typical features of LTR-retrotransposons. However, they carry only one open reading frame coding for the GAG precursor protein involved for instance in transposition, the assembly, and the packaging of the element into the virus-like particle. GAG precursors show similarities with both Copia and Gypsy GAG proteins, suggesting evolutionary relationships of TR-GAG elements with both families. Despite the lack of the enzymatic machinery required for their mobility, strong evidences suggest that TR-GAGs are still active. TR-GAGs represent ubiquitous nonautonomous structures that could be involved in the molecular diversities of plant genomes. PMID:25573958

  18. Conserved intergenic sequences revealed by CTAG-profiling in Salmonella: thermodynamic modeling for function prediction

    NASA Astrophysics Data System (ADS)

    Tang, Le; Zhu, Songling; Mastriani, Emilio; Fang, Xin; Zhou, Yu-Jie; Li, Yong-Guo; Johnston, Randal N.; Guo, Zheng; Liu, Gui-Rong; Liu, Shu-Lin

    2017-03-01

    Highly conserved short sequences help identify functional genomic regions and facilitate genomic annotation. We used Salmonella as the model to search the genome for evolutionarily conserved regions and focused on the tetranucleotide sequence CTAG for its potentially important functions. In Salmonella, CTAG is highly conserved across the lineages and large numbers of CTAG-containing short sequences fall in intergenic regions, strongly indicating their biological importance. Computer modeling demonstrated stable stem-loop structures in some of the CTAG-containing intergenic regions, and substitution of a nucleotide of the CTAG sequence would radically rearrange the free energy and disrupt the structure. The postulated degeneration of CTAG takes distinct patterns among Salmonella lineages and provides novel information about genomic divergence and evolution of these bacterial pathogens. Comparison of the vertically and horizontally transmitted genomic segments showed different CTAG distribution landscapes, with the genome amelioration process to remove CTAG taking place inward from both terminals of the horizontally acquired segment.

  19. Translocations of chromosome end-segments and facultative heterochromatin promote meiotic ring formation in evening primroses.

    PubMed

    Golczyk, Hieronim; Massouh, Amid; Greiner, Stephan

    2014-03-01

    Due to reciprocal chromosomal translocations, many species of Oenothera (evening primrose) form permanent multichromosomal meiotic rings. However, regular bivalent pairing is also observed. Chiasmata are restricted to chromosomal ends, which makes homologous recombination virtually undetectable. Genetic diversity is achieved by changing linkage relations of chromosomes in rings and bivalents via hybridization and reciprocal translocations. Although the structural prerequisite for this system is enigmatic, whole-arm translocations are widely assumed to be the mechanistic driving force. We demonstrate that this prerequisite is genome compartmentation into two epigenetically defined chromatin fractions. The first one facultatively condenses in cycling cells into chromocenters negative both for histone H3 dimethylated at lysine 4 and for C-banding, and forms huge condensed middle chromosome regions on prophase chromosomes. Remarkably, it decondenses in differentiating cells. The second fraction is euchromatin confined to distal chromosome segments, positive for histone H3 lysine 4 dimethylation and for histone H3 lysine 27 trimethylation. The end-segments are deprived of canonical telomeres but capped with constitutive heterochromatin. This genomic organization promotes translocation breakpoints between the two chromatin fractions, thus facilitating exchanges of end-segments. We challenge the whole-arm translocation hypothesis by demonstrating why reciprocal translocations of chromosomal end-segments should strongly promote meiotic rings and evolution toward permanent translocation heterozygosity. Reshuffled end-segments, each possessing a major crossover hot spot, can furthermore explain meiotic compatibility between genomes with different translocation histories.

  20. Translocations of Chromosome End-Segments and Facultative Heterochromatin Promote Meiotic Ring Formation in Evening Primroses[W][OPEN

    PubMed Central

    Golczyk, Hieronim; Massouh, Amid; Greiner, Stephan

    2014-01-01

    Due to reciprocal chromosomal translocations, many species of Oenothera (evening primrose) form permanent multichromosomal meiotic rings. However, regular bivalent pairing is also observed. Chiasmata are restricted to chromosomal ends, which makes homologous recombination virtually undetectable. Genetic diversity is achieved by changing linkage relations of chromosomes in rings and bivalents via hybridization and reciprocal translocations. Although the structural prerequisite for this system is enigmatic, whole-arm translocations are widely assumed to be the mechanistic driving force. We demonstrate that this prerequisite is genome compartmentation into two epigenetically defined chromatin fractions. The first one facultatively condenses in cycling cells into chromocenters negative both for histone H3 dimethylated at lysine 4 and for C-banding, and forms huge condensed middle chromosome regions on prophase chromosomes. Remarkably, it decondenses in differentiating cells. The second fraction is euchromatin confined to distal chromosome segments, positive for histone H3 lysine 4 dimethylation and for histone H3 lysine 27 trimethylation. The end-segments are deprived of canonical telomeres but capped with constitutive heterochromatin. This genomic organization promotes translocation breakpoints between the two chromatin fractions, thus facilitating exchanges of end-segments. We challenge the whole-arm translocation hypothesis by demonstrating why reciprocal translocations of chromosomal end-segments should strongly promote meiotic rings and evolution toward permanent translocation heterozygosity. Reshuffled end-segments, each possessing a major crossover hot spot, can furthermore explain meiotic compatibility between genomes with different translocation histories. PMID:24681616

  1. Nucleotide sequence of a cluster of early and late genes in a conserved segment of the vaccinia virus genome.

    PubMed Central

    Plucienniczak, A; Schroeder, E; Zettlmeissl, G; Streeck, R E

    1985-01-01

    The nucleotide sequence of a 7.6 kb vaccinia DNA segment from a genomic region conserved among different orthopox virus has been determined. This segment contains a tight cluster of 12 partly overlapping open reading frames most of which can be correlated with previously identified early and late proteins and mRNAs. Regulatory signals used by vaccinia virus have been studied. Presumptive promoter regions are rich in A, T and carry the consensus sequences TATA and AATAA spaced at 20-24 base pairs. Tandem repeats of a CTATTC consensus sequence are proposed to be involved in the termination of early transcription. PMID:2987815

  2. FluReF, an automated flu virus reassortment finder based on phylogenetic trees.

    PubMed

    Yurovsky, Alisa; Moret, Bernard M E

    2011-01-01

    Reassortments are events in the evolution of the genome of influenza (flu), whereby segments of the genome are exchanged between different strains. As reassortments have been implicated in major human pandemics of the last century, their identification has become a health priority. While such identification can be done "by hand" on a small dataset, researchers and health authorities are building up enormous databases of genomic sequences for every flu strain, so that it is imperative to develop automated identification methods. However, current methods are limited to pairwise segment comparisons. We present FluReF, a fully automated flu virus reassortment finder. FluReF is inspired by the visual approach to reassortment identification and uses the reconstructed phylogenetic trees of the individual segments and of the full genome. We also present a simple flu evolution simulator, based on the current, source-sink, hypothesis for flu cycles. On synthetic datasets produced by our simulator, FluReF, tuned for a 0% false positive rate, yielded false negative rates of less than 10%. FluReF corroborated two new reassortments identified by visual analysis of 75 Human H3N2 New York flu strains from 2005-2008 and gave partial verification of reassortments found using another bioinformatics method. FluReF finds reassortments by a bottom-up search of the full-genome and segment-based phylogenetic trees for candidate clades--groups of one or more sampled viruses that are separated from the other variants from the same season. Candidate clades in each tree are tested to guarantee confidence values, using the lengths of key edges as well as other tree parameters; clades with reassortments must have validated incongruencies among segment trees. FluReF demonstrates robustness of prediction for geographically and temporally expanded datasets, and is not limited to finding reassortments with previously collected sequences. The complete source code is available from http://lcbb.epfl.ch/software.html.

  3. Impacts of Chromatin States and Long-Range Genomic Segments on Aging and DNA Methylation

    PubMed Central

    Sun, Dan; Yi, Soojin V.

    2015-01-01

    Understanding the fundamental dynamics of epigenome variation during normal aging is critical for elucidating key epigenetic alterations that affect development, cell differentiation and diseases. Advances in the field of aging and DNA methylation strongly support the aging epigenetic drift model. Although this model aligns with previous studies, the role of other epigenetic marks, such as histone modification, as well as the impact of sampling specific CpGs, must be evaluated. Ultimately, it is crucial to investigate how all CpGs in the human genome change their methylation with aging in their specific genomic and epigenomic contexts. Here, we analyze whole genome bisulfite sequencing DNA methylation maps of brain frontal cortex from individuals of diverse ages. Comparisons with blood data reveal tissue-specific patterns of epigenetic drift. By integrating chromatin state information, divergent degrees and directions of aging-associated methylation in different genomic regions are revealed. Whole genome bisulfite sequencing data also open a new door to investigate whether adjacent CpG sites exhibit coordinated DNA methylation changes with aging. We identified significant ‘aging-segments’, which are clusters of nearby CpGs that respond to aging by similar DNA methylation changes. These segments not only capture previously identified aging-CpGs but also include specific functional categories of genes with implications on epigenetic regulation of aging. For example, genes associated with development are highly enriched in positive aging segments, which are gradually hyper-methylated with aging. On the other hand, regions that are gradually hypo-methylated with aging (‘negative aging segments’) in the brain harbor genes involved in metabolism and protein ubiquitination. Given the importance of protein ubiquitination in proteome homeostasis of aging brains and neurodegenerative disorders, our finding suggests the significance of epigenetic regulation of this posttranslational modification pathway in the aging brain. Utilizing aging segments rather than individual CpGs will provide more comprehensive genomic and epigenomic contexts to understand the intricate associations between genomic neighborhoods and developmental and aging processes. These results complement the aging epigenetic drift model and provide new insights. PMID:26091484

  4. Short segment search method for phylogenetic analysis using nested sliding windows

    NASA Astrophysics Data System (ADS)

    Iskandar, A. A.; Bustamam, A.; Trimarsanto, H.

    2017-10-01

    To analyze phylogenetics in Bioinformatics, coding DNA sequences (CDS) segment is needed for maximal accuracy. However, analysis by CDS cost a lot of time and money, so a short representative segment by CDS, which is envelope protein segment or non-structural 3 (NS3) segment is necessary. After sliding window is implemented, a better short segment than envelope protein segment and NS3 is found. This paper will discuss a mathematical method to analyze sequences using nested sliding window to find a short segment which is representative for the whole genome. The result shows that our method can find a short segment which more representative about 6.57% in topological view to CDS segment than an Envelope segment or NS3 segment.

  5. Breast Cancer Diagnostics Based on Spatial Genome Organization

    DTIC Science & Technology

    2012-07-01

    using an already established imaging tool, called NMFA-FLO (Nuclei Manual and FISH automatic). In order to achieve accurate segmentation of nuclei...in tissue we used an artificial neuronal network (ANN)-based supervised pattern recognition approach to screen out well segmented nuclei, after image ... segmentation used to process images for automated nuclear segmentation . Part a) has been adapted from [15] and b) from [16]. Figure 4. Comparison of

  6. Reassortment between Influenza B Lineages and the Emergence of a Coadapted PB1–PB2–HA Gene Complex

    PubMed Central

    Dudas, Gytis; Bedford, Trevor; Lycett, Samantha; Rambaut, Andrew

    2015-01-01

    Influenza B viruses make a considerable contribution to morbidity attributed to seasonal influenza. Currently circulating influenza B isolates are known to belong to two antigenically distinct lineages referred to as B/Victoria and B/Yamagata. Frequent exchange of genomic segments of these two lineages has been noted in the past, but the observed patterns of reassortment have not been formalized in detail. We investigate interlineage reassortments by comparing phylogenetic trees across genomic segments. Our analyses indicate that of the eight segments of influenza B viruses only segments coding for polymerase basic 1 and 2 (PB1 and PB2) and hemagglutinin (HA) proteins have maintained separate Victoria and Yamagata lineages and that currently circulating strains possess PB1, PB2, and HA segments derived entirely from one or the other lineage; other segments have repeatedly reassorted between lineages thereby reducing genetic diversity. We argue that this difference between segments is due to selection against reassortant viruses with mixed-lineage PB1, PB2, and HA segments. Given sufficient time and continued recruitment to the reassortment-isolated PB1–PB2–HA gene complex, we expect influenza B viruses to eventually undergo sympatric speciation. PMID:25323575

  7. GenomeVista

    DOE Office of Scientific and Technical Information (OSTI.GOV)

    Poliakov, Alexander; Couronne, Olivier

    2002-11-04

    Aligning large vertebrate genomes that are structurally complex poses a variety of problems not encountered on smaller scales. Such genomes are rich in repetitive elements and contain multiple segmental duplications, which increases the difficulty of identifying true orthologous SNA segments in alignments. The sizes of the sequences make many alignment algorithms designed for comparing single proteins extremely inefficient when processing large genomic intervals. We integrated both local and global alignment tools and developed a suite of programs for automatically aligning large vertebrate genomes and identifying conserved non-coding regions in the alignments. Our method uses the BLAT local alignment program tomore » find anchors on the base genome to identify regions of possible homology for a query sequence. These regions are postprocessed to find the best candidates which are then globally aligned using the AVID global alignment program. In the last step conserved non-coding segments are identified using VISTA. Our methods are fast and the resulting alignments exhibit a high degree of sensitivity, covering more than 90% of known coding exons in the human genome. The GenomeVISTA software is a suite of Perl programs that is built on a MySQL database platform. The scheduler gets control data from the database, builds a queve of jobs, and dispatches them to a PC cluster for execution. The main program, running on each node of the cluster, processes individual sequences. A Perl library acts as an interface between the database and the above programs. The use of a separate library allows the programs to function independently of the database schema. The library also improves on the standard Perl MySQL database interfere package by providing auto-reconnect functionality and improved error handling.« less

  8. Structure, function and dynamics in adenovirus maturation

    DOE Office of Scientific and Technical Information (OSTI.GOV)

    Mangel, Walter F.; San Martín, Carmen

    2014-11-21

    Here we review the current knowledge on maturation of adenovirus, a non-enveloped icosahedral eukaryotic virus. The adenovirus dsDNA genome fills the capsid in complex with a large amount of histone-like viral proteins, forming the core. Maturation involves proteolytic cleavage of several capsid and core precursor proteins by the viral protease (AVP). AVP uses a peptide cleaved from one of its targets as a “molecular sled” to slide on the viral genome and reach its substrates, in a remarkable example of one-dimensional chemistry. Immature adenovirus containing the precursor proteins lacks infectivity because of its inability to uncoat. The immature core ismore » more compact and stable than the mature one, due to the condensing action of unprocessed core polypeptides; shell precursors underpin the vertex region and the connections between capsid and core. Maturation makes the virion metastable, priming it for stepwise uncoating by facilitating vertex release and loosening the condensed genome and its attachment to the icosahedral shell. The packaging scaffold protein L1 52/55k is also a substrate for AVP. Proteolytic processing of L1 52/55k disrupts its interactions with other virion components, providing a mechanism for its removal during maturation. In conclusion, possible roles for maturation of the terminal protein are discussed.« less

  9. Differential transferability of EST-SSR primers developed from diploid species Pseudoroegneria spicata, Thinopyrum bessarabicum, and Th. elongatum

    USDA-ARS?s Scientific Manuscript database

    Simple sequence repeat technology based on expressed sequence tag (EST-SSR) is a useful genomic tool for genome mapping, characterizing plant species relationships, elucidating genome evolution, and tracing genes on alien chromosome segments. EST-SSR primers developed from three perennial diploid T...

  10. Genome resilience and prevalence of segmental duplications following fast neutron irradiation of soybean

    USDA-ARS?s Scientific Manuscript database

    Fast neutron radiation has been used as a mutagen to develop extensive mutant collections. However, the genome-wide structural consequences of fast neutron radiation are not well understood. Here, we examine the genome-wide structural variants observed among 264 soybean (Glycine max (L.) Merrill) pl...

  11. Modeling the relaxation of internal DNA segments during genome mapping in nanochannels.

    PubMed

    Jain, Aashish; Sheats, Julian; Reifenberger, Jeffrey G; Cao, Han; Dorfman, Kevin D

    2016-09-01

    We have developed a multi-scale model describing the dynamics of internal segments of DNA in nanochannels used for genome mapping. In addition to the channel geometry, the model takes as its inputs the DNA properties in free solution (persistence length, effective width, molecular weight, and segmental hydrodynamic radius) and buffer properties (temperature and viscosity). Using pruned-enriched Rosenbluth simulations of a discrete wormlike chain model with circa 10 base pair resolution and a numerical solution for the hydrodynamic interactions in confinement, we convert these experimentally available inputs into the necessary parameters for a one-dimensional, Rouse-like model of the confined chain. The resulting coarse-grained model resolves the DNA at a length scale of approximately 6 kilobase pairs in the absence of any global hairpin folds, and is readily studied using a normal-mode analysis or Brownian dynamics simulations. The Rouse-like model successfully reproduces both the trends and order of magnitude of the relaxation time of the distance between labeled segments of DNA obtained in experiments. The model also provides insights that are not readily accessible from experiments, such as the role of the molecular weight of the DNA and location of the labeled segments that impact the statistical models used to construct genome maps from data acquired in nanochannels. The multi-scale approach used here, while focused towards a technologically relevant scenario, is readily adapted to other channel sizes and polymers.

  12. A novel chlorophyll a/b binding (Cab) protein gene from petunia which encodes the lower molecular weight Cab precursor protein.

    PubMed

    Stayton, M M; Black, M; Bedbrook, J; Dunsmuir, P

    1986-12-22

    The 16 petunia Cab genes which have been characterized are all closely related at the nucleotide sequence level and they encode Cab precursor polypeptides which are similar in sequence and length. Here we describe a novel petunia Cab gene which encodes a unique Cab precursor protein. This protein is a member of the smallest class of Cab precursor proteins for which no gene has previously been assigned in petunia or any other species. The features of this Cab precursor protein are that it is shorter by 2-3 amino acids than the formerly characterized Cab precursors, its transit peptide sequence is unrelated, and the mature polypeptide is significantly diverged at the functionally important N terminus from other petunia Cab proteins. Gene structure also discriminates this gene which is the only intron containing Cab gene in petunia genomic DNA.

  13. Timing Embryo Segmentation: Dynamics and Regulatory Mechanisms of the Vertebrate Segmentation Clock

    PubMed Central

    Resende, Tatiana P.; Andrade, Raquel P.; Palmeirim, Isabel

    2014-01-01

    All vertebrate species present a segmented body, easily observed in the vertebrate column and its associated components, which provides a high degree of motility to the adult body and efficient protection of the internal organs. The sequential formation of the segmented precursors of the vertebral column during embryonic development, the somites, is governed by an oscillating genetic network, the somitogenesis molecular clock. Herein, we provide an overview of the molecular clock operating during somite formation and its underlying molecular regulatory mechanisms. Human congenital vertebral malformations have been associated with perturbations in these oscillatory mechanisms. Thus, a better comprehension of the molecular mechanisms regulating somite formation is required in order to fully understand the origin of human skeletal malformations. PMID:24895605

  14. Transposition-mediated DNA re-replication in maize

    PubMed Central

    Zhang, Jianbo; Zuo, Tao; Wang, Dafang; Peterson, Thomas

    2014-01-01

    Every DNA segment in a eukaryotic genome normally replicates once and only once per cell cycle to maintain genome stability. We show here that this restriction can be bypassed through alternative transposition, a transposition reaction that utilizes the termini of two separate, nearby transposable elements (TEs). Our results suggest that alternative transposition during S phase can induce re-replication of the TEs and their flanking sequences. The DNA re-replication can spontaneously abort to generate double-strand breaks, which can be repaired to generate Composite Insertions composed of transposon termini flanking segmental duplications of various lengths. These results show how alternative transposition coupled with DNA replication and repair can significantly alter genome structure and may have contributed to rapid genome evolution in maize and possibly other eukaryotes. DOI: http://dx.doi.org/10.7554/eLife.03724.001 PMID:25406063

  15. Characterization of a Novel Orthomyxo-like Virus Causing Mass Die-Offs of Tilapia

    PubMed Central

    Bacharach, Eran; Mishra, Nischay; Briese, Thomas; Zody, Michael C.; Kembou Tsofack, Japhette Esther; Zamostiano, Rachel; Berkowitz, Asaf; Ng, James; Nitido, Adam; Corvelo, André; Toussaint, Nora C.; Abel Nielsen, Sandra Cathrine; Hornig, Mady; Del Pozo, Jorge; Bloom, Toby; Ferguson, Hugh

    2016-01-01

    ABSTRACT Tilapia are an important global food source due to their omnivorous diet, tolerance for high-density aquaculture, and relative disease resistance. Since 2009, tilapia aquaculture has been threatened by mass die-offs in farmed fish in Israel and Ecuador. Here we report evidence implicating a novel orthomyxo-like virus in these outbreaks. The tilapia lake virus (TiLV) has a 10-segment, negative-sense RNA genome. The largest segment, segment 1, contains an open reading frame with weak sequence homology to the influenza C virus PB1 subunit. The other nine segments showed no homology to other viruses but have conserved, complementary sequences at their 5′ and 3′ termini, consistent with the genome organization found in other orthomyxoviruses. In situ hybridization indicates TiLV replication and transcription at sites of pathology in the liver and central nervous system of tilapia with disease. PMID:27048802

  16. Isolation of a Novel Fusogenic Orthoreovirus from Eucampsipoda africana Bat Flies in South Africa

    PubMed Central

    Jansen van Vuren, Petrus; Wiley, Michael; Palacios, Gustavo; Storm, Nadia; McCulloch, Stewart; Markotter, Wanda; Birkhead, Monica; Kemp, Alan; Paweska, Janusz T.

    2016-01-01

    We report on the isolation of a novel fusogenic orthoreovirus from bat flies (Eucampsipoda africana) associated with Egyptian fruit bats (Rousettus aegyptiacus) collected in South Africa. Complete sequences of the ten dsRNA genome segments of the virus, tentatively named Mahlapitsi virus (MAHLV), were determined. Phylogenetic analysis places this virus into a distinct clade with Baboon orthoreovirus, Bush viper reovirus and the bat-associated Broome virus. All genome segments of MAHLV contain a 5' terminal sequence (5'-GGUCA) that is unique to all currently described viruses of the genus. The smallest genome segment is bicistronic encoding for a 14 kDa protein similar to p14 membrane fusion protein of Bush viper reovirus and an 18 kDa protein similar to p16 non-structural protein of Baboon orthoreovirus. This is the first report on isolation of an orthoreovirus from an arthropod host associated with bats, and phylogenetic and sequence data suggests that MAHLV constitutes a new species within the Orthoreovirus genus. PMID:27011199

  17. CTCF and Cohesin in Genome Folding and Transcriptional Gene Regulation.

    PubMed

    Merkenschlager, Matthias; Nora, Elphège P

    2016-08-31

    Genome function, replication, integrity, and propagation rely on the dynamic structural organization of chromosomes during the cell cycle. Genome folding in interphase provides regulatory segmentation for appropriate transcriptional control, facilitates ordered genome replication, and contributes to genome integrity by limiting illegitimate recombination. Here, we review recent high-resolution chromosome conformation capture and functional studies that have informed models of the spatial and regulatory compartmentalization of mammalian genomes, and discuss mechanistic models for how CTCF and cohesin control the functional architecture of mammalian chromosomes.

  18. Genomics Literacy: Implications for Teaching Students with a Range of Special Needs

    ERIC Educational Resources Information Center

    Rafter, Mary; Gillies, Robyn M.

    2018-01-01

    Recent developments in genomic-based knowledge is challenging educators to learn more about the early precursors of various difficulties children experience in learning and how they can use this information to identify preventative strategies or strategies that minimise their effect. The purpose of this article is to provide a brief outline of…

  19. Full Genome Sequence of Giant Panda Rotavirus Strain CH-1

    PubMed Central

    Guo, Ling; Yang, Shaolin; Wang, Chengdong; Chen, Shijie; Yang, Xiaonong; Hou, Rong; Quan, Zifang; Hao, Zhongxiang

    2013-01-01

    We report here the complete genomic sequence of the giant panda rotavirus strain CH-1. This work is the first to document the complete genomic sequence (segments 1 to 11) of the CH-1 strain, which offers an effective platform for providing authentic research experiences to novice scientists. PMID:23469354

  20. Nucleotide sequence of the gag gene and gag-pol junction of feline leukemia virus.

    PubMed Central

    Laprevotte, I; Hampe, A; Sherr, C J; Galibert, F

    1984-01-01

    The nucleotide sequence of the gag gene of feline leukemia virus and its flanking sequences were determined and compared with the corresponding sequences of two strains of feline sarcoma virus and with that of the Moloney strain of murine leukemia virus. A high degree of nucleotide sequence homology between the feline leukemia virus and murine leukemia virus gag genes was observed, suggesting that retroviruses of domestic cats and laboratory mice have a common, proximal evolutionary progenitor. The predicted structure of the complete feline leukemia virus gag gene precursor suggests that the translation of nonglycosylated and glycosylated gag gene polypeptides is initiated at two different AUG codons. These initiator codons fall in the same reading frame and are separated by a 222-base-pair segment which encodes an amino terminal signal peptide. The nucleotide sequence predicts the order of amino acids in each of the individual gag-coded proteins (p15, p12, p30, p10), all of which derive from the gag gene precursor. Stable stem-and-loop secondary structures are proposed for two regions of viral RNA. The first falls within sequences at the 5' end of the viral genome, together with adjacent palindromic sequences which may play a role in dimer linkage of RNA subunits. The second includes coding sequences at the gag-pol junction and is proposed to be involved in translation of the pol gene product. Sequence analysis of the latter region shows that the gag and pol genes are translated in different reading frames. Classical consensus splice donor and acceptor sequences could not be localized to regions which would permit synthesis of the expected gag-pol precursor protein. Alternatively, we suggest that the pol gene product (RNA-dependent DNA polymerase) could be translated by a frameshift suppressing mechanism which could involve cleavage modification of stems and loops in a manner similar to that observed in tRNA processing. PMID:6328019

  1. Sinu Virus, a Novel and divergent Orthomyxovirus Related to Members of the Genus Thogotovirus, Isolated from Mosquitoes in Colombia

    PubMed Central

    Contreras-Gutiérrez, María Angélica; Nunes, Marcio R.T.; Guzman, Hilda; Uribe, Sandra; Gómez, Juan Carlos Gallego; Vasco, Juan David Suaza; Cardoso, Jedson F.; Popov, Vsevolod L.; Widen, Steven G.; Wood, Thomas G.; Vasilakis, Nikos; Tesh, Robert B.

    2016-01-01

    The genome and structural organization of a novel insect-specific orthomyxovirus, designated Sinu virus, is described. Sinu virus (SINUV) was isolated in cultures of C6/36 cells from a pool of mosquitoes collected in northwestern Colombia. The virus has six negative-sense ssRNA segments. Genetic analysis of each segment demonstrated the presence of six distinct ORFs encoding the following genes: PB2 (Segment 1), PB1, (Segment 2), PA protein (Segment 3), envelope GP gene (Segment 4), the NP (Segment 5), and M-like gene (Segment 6). Phylogenetically, SINUV appears to be most closed related to viruses in the genus Thogotovirus. PMID:27936462

  2. Genome Partitioner: A web tool for multi-level partitioning of large-scale DNA constructs for synthetic biology applications.

    PubMed

    Christen, Matthias; Del Medico, Luca; Christen, Heinz; Christen, Beat

    2017-01-01

    Recent advances in lower-cost DNA synthesis techniques have enabled new innovations in the field of synthetic biology. Still, efficient design and higher-order assembly of genome-scale DNA constructs remains a labor-intensive process. Given the complexity, computer assisted design tools that fragment large DNA sequences into fabricable DNA blocks are needed to pave the way towards streamlined assembly of biological systems. Here, we present the Genome Partitioner software implemented as a web-based interface that permits multi-level partitioning of genome-scale DNA designs. Without the need for specialized computing skills, biologists can submit their DNA designs to a fully automated pipeline that generates the optimal retrosynthetic route for higher-order DNA assembly. To test the algorithm, we partitioned a 783 kb Caulobacter crescentus genome design. We validated the partitioning strategy by assembling a 20 kb test segment encompassing a difficult to synthesize DNA sequence. Successful assembly from 1 kb subblocks into the 20 kb segment highlights the effectiveness of the Genome Partitioner for reducing synthesis costs and timelines for higher-order DNA assembly. The Genome Partitioner is broadly applicable to translate DNA designs into ready to order sequences that can be assembled with standardized protocols, thus offering new opportunities to harness the diversity of microbial genomes for synthetic biology applications. The Genome Partitioner web tool can be accessed at https://christenlab.ethz.ch/GenomePartitioner.

  3. Flexibility and symmetry of prokaryotic genome rearrangement reveal lineage-associated core-gene-defined genome organizational frameworks.

    PubMed

    Kang, Yu; Gu, Chaohao; Yuan, Lina; Wang, Yue; Zhu, Yanmin; Li, Xinna; Luo, Qibin; Xiao, Jingfa; Jiang, Daquan; Qian, Minping; Ahmed Khan, Aftab; Chen, Fei; Zhang, Zhang; Yu, Jun

    2014-11-25

    The prokaryotic pangenome partitions genes into core and dispensable genes. The order of core genes, albeit assumed to be stable under selection in general, is frequently interrupted by horizontal gene transfer and rearrangement, but how a core-gene-defined genome maintains its stability or flexibility remains to be investigated. Based on data from 30 species, including 425 genomes from six phyla, we grouped core genes into syntenic blocks in the context of a pangenome according to their stability across multiple isolates. A subset of the core genes, often species specific and lineage associated, formed a core-gene-defined genome organizational framework (cGOF). Such cGOFs are either single segmental (one-third of the species analyzed) or multisegmental (the rest). Multisegment cGOFs were further classified into symmetric or asymmetric according to segment orientations toward the origin-terminus axis. The cGOFs in Gram-positive species are exclusively symmetric and often reversible in orientation, as opposed to those of the Gram-negative bacteria, which are all asymmetric and irreversible. Meanwhile, all species showing strong strand-biased gene distribution contain symmetric cGOFs and often specific DnaE (α subunit of DNA polymerase III) isoforms. Furthermore, functional evaluations revealed that cGOF genes are hub associated with regard to cellular activities, and the stability of cGOF provides efficient indexes for scaffold orientation as demonstrated by assembling virtual and empirical genome drafts. cGOFs show species specificity, and the symmetry of multisegmental cGOFs is conserved among taxa and constrained by DNA polymerase-centric strand-biased gene distribution. The definition of species-specific cGOFs provides powerful guidance for genome assembly and other structure-based analysis. Prokaryotic genomes are frequently interrupted by horizontal gene transfer (HGT) and rearrangement. To know whether there is a set of genes not only conserved in position among isolates but also functionally essential for a given species and to further evaluate the stability or flexibility of such genome structures across lineages are of importance. Based on a large number of multi-isolate pangenomic data, our analysis reveals that a subset of core genes is organized into a core-gene-defined genome organizational framework, or cGOF. Furthermore, the lineage-associated cGOFs among Gram-positive and Gram-negative bacteria behave differently: the former, composed of 2 to 4 segments, have their fragments symmetrically rearranged around the origin-terminus axis, whereas the latter show more complex segmentation and are partitioned asymmetrically into chromosomal structures. The definition of cGOFs provides new insights into prokaryotic genome organization and efficient guidance for genome assembly and analysis. Copyright © 2014 Kang et al.

  4. Phylogenetic appearance of Neuropeptide S precursor proteins in tetrapods

    PubMed Central

    Reinscheid, Rainer K.

    2007-01-01

    Sleep and emotional behavior are two hallmarks of vertebrate animal behavior, implying that specialized neuronal circuits and dedicated neurochemical messengers may have been developed during evolution to regulate such complex behaviors. Neuropeptide S (NPS) is a newly identified peptide transmitter that activates a typical G protein-coupled receptor. Central administration of NPS produces profound arousal, enhances wakefulness and suppresses all stages of sleep. In addition, NPS can alleviate behavioral responses to stress by producing anxiolytic-like effects. A bioinformatic analysis of current genome databases revealed that the NPS peptide precursor gene is present in all vertebrates with the exception of fish. A high level of sequence conservation, especially of aminoterminal structures was detected, indicating stringent requirements for agonist-induced receptor activation. Duplication of the NPS precursor gene was only found in one out of two marsupial species with sufficient genome coverage (Monodelphis domestica; opossum), indicating that the duplicated opossum NPS sequence might have arisen as an isolated event. Pharmacological analysis of both Monodelphis NPS peptides revealed that only the closely related NPS peptide retained agonistic activity at NPS receptors. The duplicated precursor might be either a pseudogene or could have evolved different receptor selectivity. Together, these data show that NPS is a relatively recent gene in vertebrate evolution whose appearance might coincide with its specialized physiological functions in terrestrial vertebrates. PMID:17293003

  5. Superior ab initio identification, annotation and characterisation of TEs and segmental duplications from genome assemblies.

    PubMed

    Zeng, Lu; Kortschak, R Daniel; Raison, Joy M; Bertozzi, Terry; Adelson, David L

    2018-01-01

    Transposable Elements (TEs) are mobile DNA sequences that make up significant fractions of amniote genomes. However, they are difficult to detect and annotate ab initio because of their variable features, lengths and clade-specific variants. We have addressed this problem by refining and developing a Comprehensive ab initio Repeat Pipeline (CARP) to identify and cluster TEs and other repetitive sequences in genome assemblies. The pipeline begins with a pairwise alignment using krishna, a custom aligner. Single linkage clustering is then carried out to produce families of repetitive elements. Consensus sequences are then filtered for protein coding genes and then annotated using Repbase and a custom library of retrovirus and reverse transcriptase sequences. This process yields three types of family: fully annotated, partially annotated and unannotated. Fully annotated families reflect recently diverged/young known TEs present in Repbase. The remaining two types of families contain a mixture of novel TEs and segmental duplications. These can be resolved by aligning these consensus sequences back to the genome to assess copy number vs. length distribution. Our pipeline has three significant advantages compared to other methods for ab initio repeat identification: 1) we generate not only consensus sequences, but keep the genomic intervals for the original aligned sequences, allowing straightforward analysis of evolutionary dynamics, 2) consensus sequences represent low-divergence, recently/currently active TE families, 3) segmental duplications are annotated as a useful by-product. We have compared our ab initio repeat annotations for 7 genome assemblies to other methods and demonstrate that CARP compares favourably with RepeatModeler, the most widely used repeat annotation package.

  6. Superior ab initio identification, annotation and characterisation of TEs and segmental duplications from genome assemblies

    PubMed Central

    Zeng, Lu; Kortschak, R. Daniel; Raison, Joy M.

    2018-01-01

    Transposable Elements (TEs) are mobile DNA sequences that make up significant fractions of amniote genomes. However, they are difficult to detect and annotate ab initio because of their variable features, lengths and clade-specific variants. We have addressed this problem by refining and developing a Comprehensive ab initio Repeat Pipeline (CARP) to identify and cluster TEs and other repetitive sequences in genome assemblies. The pipeline begins with a pairwise alignment using krishna, a custom aligner. Single linkage clustering is then carried out to produce families of repetitive elements. Consensus sequences are then filtered for protein coding genes and then annotated using Repbase and a custom library of retrovirus and reverse transcriptase sequences. This process yields three types of family: fully annotated, partially annotated and unannotated. Fully annotated families reflect recently diverged/young known TEs present in Repbase. The remaining two types of families contain a mixture of novel TEs and segmental duplications. These can be resolved by aligning these consensus sequences back to the genome to assess copy number vs. length distribution. Our pipeline has three significant advantages compared to other methods for ab initio repeat identification: 1) we generate not only consensus sequences, but keep the genomic intervals for the original aligned sequences, allowing straightforward analysis of evolutionary dynamics, 2) consensus sequences represent low-divergence, recently/currently active TE families, 3) segmental duplications are annotated as a useful by-product. We have compared our ab initio repeat annotations for 7 genome assemblies to other methods and demonstrate that CARP compares favourably with RepeatModeler, the most widely used repeat annotation package. PMID:29538441

  7. Common position of indels that cause deviations from canonical genome organization in different measles virus strains.

    PubMed

    Ivancic-Jelecki, Jelena; Slovic, Anamarija; Šantak, Maja; Tešović, Goran; Forcic, Dubravko

    2016-07-29

    The canonical genome organization of measles virus (MV) is characterized by total size of 15 894 nucleotides (nts) and defined length of every genomic region, both coding and non-coding. Only rarely have reports of strains possessing non-canonical genomic properties (possessing indels, with or without the change of total genome length) been published. The observed mutations are mutually compensatory in a sense that the total genome length remains polyhexameric. Although programmed and highly precise pseudo-templated nucleotide additions during transcription are inherent to polymerases of all viruses belonging to family Paramyxoviridae, a similar mechanism that would serve to non-randomly correct genome length, if an indel has occurred during replication, has so far not been described in the context of a complete virus genome. We compiled all complete MV genomic sequences (64 in total) available in open access sequence databases. Multiple sequence comparisons and phylogenetic analyses were performed with the aim of exploring whether non-recombinant and non-evolutionary linked measles strains that show deviations from canonical genome organization possess a common genetic characteristic. In 11 MV sequences we detected deviations from canonical genome organization due to short indels located within homopolymeric stretches or next to them. In nine out of 11 identified non-canonical MV sequences, a common feature was observed: one mutation, either an insertion or a deletion, was located in a 28 nts long region in F gene 5' untranslated region (positions 5051-5078 in genomic cDNA of canonical strains). This segment is composed of five tandemly linked homopolymeric stretches, its consensus sequence is G6-7C7-8A6-7G1-3C5-6. Although none of the mononucleotide repeats within this segment has fixed length, the total number of nts in canonical strains is always 28. These nine non-canonical strains, as well as the tenth (not mutated in 5051-5078 segment), can be grouped in three clusters, based on their passage histories/epidemiological data/genetic similarities. There are no indications that the 3 clusters are evolutionary linked, other than the fact that they all belong to clade D. A common narrow genomic region was found to be mutated in different, non-related, wild type strains suggesting that this region might have a function in non-random genome length corrections occurring during MV replication.

  8. Complete genome sequence of a novel aquareovirus that infects the endangered fountain darter, Etheostoma fonticola

    USGS Publications Warehouse

    Iwanowicz, Luke R.; Iwanowicz, Deborah; Adams, Cynthia; Lewis, Teresa D.; Brandt, Thomas M.; Cornman, Robert S.; Sanders, Lakyn R.

    2016-01-01

    Here, we report the complete genome of a novel aquareovirus isolated from clinically normal fountain darters, Etheostoma fonticola, inhabiting the San Marcos River, Texas, USA. The complete genome consists of 23,958 bp consisting of 11 segments that range from 783 bp (S11) to 3,866 bp (S1).

  9. Complete Genome Sequence of a Novel Aquareovirus That Infects the Endangered Fountain Darter, Etheostoma fonticola

    PubMed Central

    Adams, Cynthia R.; Lewis, Teresa D.; Brandt, Thomas M.; Sanders, Lakyn

    2016-01-01

    Here, we report the complete genome of a novel aquareovirus isolated from clinically normal fountain darters, Etheostoma fonticola, inhabiting the San Marcos River, Texas, USA. The complete genome consists of 23,958 bp consisting of 11 segments that range from 783 bp (S11) to 3,866 bp (S1). PMID:28007856

  10. Detection of novel genomic aberrations in anaplastic astrocytomas by GTG-banding, SKY, locus-specific FISH, and high density SNP-array.

    PubMed

    Holland, Heidrun; Ahnert, Peter; Koschny, Ronald; Kirsten, Holger; Bauer, Manfred; Schober, Ralf; Meixensberger, Jürgen; Fritzsch, Dominik; Krupp, Wolfgang

    2012-06-15

    Astrocytomas represent the largest and most common subgroup of brain tumors. Anaplastic astrocytoma (WHO grade III) may arise from low-grade diffuse astrocytoma (WHO grade II) or as primary tumors without any precursor lesion. Comprehensive analyses of anaplastic astrocytomas combining both cytogenetic and molecular cytogenetic techniques are rare. Therefore, we analyzed genomic alterations of five anaplastic astrocytomas using high-density single nucleotide polymorphism arrays combined with GTG-banding and FISH-techniques. By cytogenetics, we found 169 structural chromosomal aberrations most frequently involving chromosomes 1, 2, 3, 4, 10, and 12, including two not previously described alterations, a nonreciprocal translocation t(3;11)(p12;q13), and one interstitial chromosomal deletion del(2)(q21q31). Additionally, we detected previously not documented loss of heterozygosity (LOH) without copy number changes in 4/5 anaplastic astrocytomas on chromosome regions 5q11.2, 5q22.1, 6q21, 7q21.11, 7q31.33, 8q11.22, 14q21.1, 17q21.31, and 17q22, suggesting segmental uniparental disomy (UPD), applying high-density single nucleotide polymorphism arrays. UPDs are currently considered to play an important role in the initiation and progression of different malignancies. The significance of previously not described genetic alterations in anaplastic astrocytomas presented here needs to be confirmed in a larger series. Copyright © 2012 Elsevier GmbH. All rights reserved.

  11. IBD Sharing between Africans, Neandertals, and Denisovans

    PubMed Central

    Povysil, Gundula

    2016-01-01

    Interbreeding between ancestors of humans and other hominins outside of Africa has been studied intensively, while their common history within Africa still lacks proper attention. However, shedding light on human evolution in this time period about which little is known, is essential for understanding subsequent events outside of Africa. We investigate the genetic relationships of humans, Neandertals, and Denisovans by identifying very short DNA segments in the 1000 Genomes Phase 3 data that these hominins share identical by descent (IBD). By focusing on low frequency and rare variants, we identify very short IBD segments with high confidence. These segments reveal events from a very distant past because shorter IBD segments are presumably older than longer ones. We extracted two types of very old IBD segments that are not only shared among humans, but also with Neandertals and/or Denisovans. The first type contains longer segments that are found primarily in Asians and Europeans where more segments are found in South Asians than in East Asians for both Neandertal and Denisovan. These longer segments indicate complex admixture events outside of Africa. The second type consists of shorter segments that are shared mainly by Africans and therefore may indicate events involving ancestors of humans and other ancient hominins within Africa. Our results from the autosomes are further supported by an analysis of chromosome X, on which segments that are shared by Africans and match the Neandertal and/or Denisovan genome were even more prominent. Our results indicate that interbreeding with other hominins was a common feature of human evolution starting already long before ancestors of modern humans left Africa. PMID:28158547

  12. mir-125a-5p-mediated Regulation of Lfng is Essential for the Avian Segmentation Clock

    PubMed Central

    Riley, Maurisa F.; Bochter, Matthew S.; Wahi, Kanu; Nuovo, Gerard J.; Cole, Susan E.

    2013-01-01

    Summary Somites are embryonic precursors of the axial skeleton and skeletal muscles, and establish the segmental vertebrate body plan. Somitogenesis is controlled in part by a segmentation clock that requires oscillatory expression of genes including Lunatic fringe (Lfng). Oscillatory genes must be tightly regulated both at the transcriptional and post-transcriptional levels for proper clock function. Here we demonstrate that microRNA-mediated regulation of Lfng is essential for proper segmentation during chick somitogenesis. We find that mir-125a-5p targets evolutionarily conserved sequences in the Lfng 3′UTR, and that preventing interactions between mir-125a-5p and Lfng transcripts in vivo causes abnormal segmentation and perturbs clock activity. This provides strong evidence that miRNAs function in the post-transcriptional regulation of oscillatory genes in the segmentation clock. Further, this demonstrates that the relatively subtle effects of miRNAs on target genes can have broad effects in developmental situations that have critical requirements for tight post-transcriptional regulation. PMID:23484856

  13. Novel approach for identification of influenza virus host range and zoonotic transmissible sequences by determination of host-related associative positions in viral genome segments.

    PubMed

    Kargarfard, Fatemeh; Sami, Ashkan; Mohammadi-Dehcheshmeh, Manijeh; Ebrahimie, Esmaeil

    2016-11-16

    Recent (2013 and 2009) zoonotic transmission of avian or porcine influenza to humans highlights an increase in host range by evading species barriers. Gene reassortment or antigenic shift between viruses from two or more hosts can generate a new life-threatening virus when the new shuffled virus is no longer recognized by antibodies existing within human populations. There is no large scale study to help understand the underlying mechanisms of host transmission. Furthermore, there is no clear understanding of how different segments of the influenza genome contribute in the final determination of host range. To obtain insight into the rules underpinning host range determination, various supervised machine learning algorithms were employed to mine reassortment changes in different viral segments in a range of hosts. Our multi-host dataset contained whole segments of 674 influenza strains organized into three host categories: avian, human, and swine. Some of the sequences were assigned to multiple hosts. In point of fact, the datasets are a form of multi-labeled dataset and we utilized a multi-label learning method to identify discriminative sequence sites. Then algorithms such as CBA, Ripper, and decision tree were applied to extract informative and descriptive association rules for each viral protein segment. We found informative rules in all segments that are common within the same host class but varied between different hosts. For example, for infection of an avian host, HA14V and NS1230S were the most important discriminative and combinatorial positions. Host range identification is facilitated by high support combined rules in this study. Our major goal was to detect discriminative genomic positions that were able to identify multi host viruses, because such viruses are likely to cause pandemic or disastrous epidemics.

  14. Biological and immunological characterization of a simian rotavirus SA11 variant with an altered genome segment 4.

    PubMed

    Burns, J W; Chen, D; Estes, M K; Ramig, R F

    1989-04-01

    We have studied a variant virus isolated from a stock of SA11 virus (H. G. Pereira, R. S. Azeredo, A. M. Fialho, and M. N. P. Vidal, 1984, J. Gen. Virol. 65, 815-818). This virus, designated 4F, was initially identified by its faster electrophoretic mobility for genome segment 4. The variant was analyzed to determine if the altered electrophoretic mobility of genome segment 4 could be correlated with phenotypic changes. Comparison of our standard laboratory SA11 virus (clone 3) with the 4F variant showed the following: (i) The 4F variant possesses a viral hemagglutinin (VP4) with a higher apparent molecular weight than clone 3. (ii) The 4F variant produces large plaques when assayed in vitro, as compared to clone 3. (iii) The 4F variant produces plaques in the absence of proteolytic enzymes, whereas clone 3 does not. (iv) The 4F variant reacts with serotype-specific neutralizing monoclonal antibodies to VP7, but fails to react with several neutralizing anti-VP4 monoclonal antibodies generated to SA11 clone 3. (v) The 4F variant grows to a higher titer and is more stable than clone 3. (vi) The 4F variant produces a VP4 that appears to be more susceptible to cleavage by trypsin than is the VP4 of clone 3. Further analyses with the 4F variant may lead to an understanding of the molecular basis for these altered phenotypes that appear to be related, at least in part, to the product of genome segment 4.

  15. Pyrolytic carbon membranes containing silica: morphological approach on gas transport behavior

    NASA Astrophysics Data System (ADS)

    Park, Ho Bum; Lee, Sun Yong; Lee, Young Moo

    2005-04-01

    Pyrolytic carbon membrane containing silica (C-SiO 2) is a new-class material for gas separation, and in the present work we will deal with it in view of the morphological changes arising from the difference in the molecular structure of the polymeric precursors. The silica embedded carbon membranes were fabricated by a predetermined pyrolysis step using imide-siloxane copolymers (PISs) that was synthesized from benzophenone tetracarboxylic dianhydrides (BTDA), 4,4'-oxydianiline (ODA), and amine-terminated polydimethylsiloxane (PDMS). To induce different morphologies at the same chemical composition, the copolymers were prepared using one-step (preferentially a random segmented copolymer) sand two-step polymerization (a block segmented copolymer) methods. The polymeric precursors and their pyrolytic C-SiO 2 membranes were analyzed using thermal analysis, atomic force microscopy, and transmission electron microscopy, etc. It was found that the C-SiO 2 membrane derived from the random PIS copolymer showed a micro-structure containing small well-dispersed silica domains, whereas the C-SiO 2 membrane from the block PIS copolymer exhibited a micro-structure containing large silica domains in the continuous carbon matrix. Eventually, the gas transport through these C-SiO 2 membranes was significantly affected by the morphological changes of the polymeric precursors.

  16. Building the Vertebrate Spine

    NASA Astrophysics Data System (ADS)

    Pourquié, Olivier

    2008-03-01

    The vertebrate body can be subdivided along the antero-posterior (AP) axis into repeated structures called segments. This periodic pattern is established during embryogenesis by the somitogenesis process. Somites are generated in a rhythmic fashion from the paraxial mesoderm and subsequently differentiate to give rise to the vertebrae and skeletal muscles of the body. Somite formation involves an oscillator-the segmentation clock-whose periodic signal is converted into the periodic array of somite boundaries. This clock drives the dynamic expression of cyclic genes in the presomitic mesoderm and requires Notch and Wnt signaling. Microarray studies of the mouse presomitic mesoderm transcriptome reveal that the segmentation clock drives the periodic expression of a large network of cyclic genes involved in cell signaling. Mutually exclusive activation of the Notch/FGF and Wnt pathways during each cycle suggests that coordinated regulation of these three pathways underlies the clock oscillator. In humans, mutations in the genes associated to the function of this oscillator such as Dll3 or Lunatic Fringe result in abnormal segmentation of the vertebral column such as those seen in congenital scoliosis. Whereas the segmentation clock is thought to set the pace of vertebrate segmentation, the translation of this pulsation into the reiterated arrangement of segment boundaries along the AP axis involves dynamic gradients of FGF and Wnt signaling. The FGF signaling gradient is established based on an unusual mechanism involving mRNA decay which provides an efficient means to couple the spatio-temporal activation of segmentation to the posterior elongation of the embryo. Another striking aspect of somite production is the strict bilateral symmetry of the process. Retinoic acid was shown to control aspects of this coordination by buffering destabilizing effects from the embryonic left-right machinery. Defects in this embryonic program controlling vertebral symmetry might lead to scoliosis in humans. Finally, the subsequent regional differentiation of the precursors of the vertebrae is controlled by Hox genes, whose collinear expression controls both gastrulation of somite precursors and their subsequent patterning into region-specific types of structures. Therefore somite development provides an outstanding paradigm to study patterning and differentiation in vertebrate embryos.

  17. Extensive structural variations between mitochondrial genomes of CMS and normal peppers (Capsicum annuum L.) revealed by complete nucleotide sequencing.

    PubMed

    Jo, Yeong Deuk; Choi, Yoomi; Kim, Dong-Hwan; Kim, Byung-Dong; Kang, Byoung-Cheorl

    2014-07-04

    Cytoplasmic male sterility (CMS) is an inability to produce functional pollen that is caused by mutation of the mitochondrial genome. Comparative analyses of mitochondrial genomes of lines with and without CMS in several species have revealed structural differences between genomes, including extensive rearrangements caused by recombination. However, the mitochondrial genome structure and the DNA rearrangements that may be related to CMS have not been characterized in Capsicum spp. We obtained the complete mitochondrial genome sequences of the pepper CMS line FS4401 (507,452 bp) and the fertile line Jeju (511,530 bp). Comparative analysis between mitochondrial genomes of peppers and tobacco that are included in Solanaceae revealed extensive DNA rearrangements and poor conservation in non-coding DNA. In comparison between pepper lines, FS4401 and Jeju mitochondrial DNAs contained the same complement of protein coding genes except for one additional copy of an atp6 gene (ψatp6-2) in FS4401. In terms of genome structure, we found eighteen syntenic blocks in the two mitochondrial genomes, which have been rearranged in each genome. By contrast, sequences between syntenic blocks, which were specific to each line, accounted for 30,380 and 17,847 bp in FS4401 and Jeju, respectively. The previously-reported CMS candidate genes, orf507 and ψatp6-2, were located on the edges of the largest sequence segments that were specific to FS4401. In this region, large number of small sequence segments which were absent or found on different locations in Jeju mitochondrial genome were combined together. The incorporation of repeats and overlapping of connected sequence segments by a few nucleotides implied that extensive rearrangements by homologous recombination might be involved in evolution of this region. Further analysis using mtDNA pairs from other plant species revealed common features of DNA regions around CMS-associated genes. Although large portion of sequence context was shared by mitochondrial genomes of CMS and male-fertile pepper lines, extensive genome rearrangements were detected. CMS candidate genes located on the edges of highly-rearranged CMS-specific DNA regions and near to repeat sequences. These characteristics were detected among CMS-associated genes in other species, implying a common mechanism might be involved in the evolution of CMS-associated genes.

  18. Concentration of acrylamide in a polyacrylamide gel affects VP4 gene coding assignment of group A equine rotavirus strains with P[12] specificity

    PubMed Central

    2010-01-01

    Background It is universally acknowledged that genome segment 4 of group A rotavirus, the major etiologic agent of severe diarrhea in infants and neonatal farm animals, encodes outer capsid neutralization and protective antigen VP4. Results To determine which genome segment of three group A equine rotavirus strains (H-2, FI-14 and FI-23) with P[12] specificity encodes the VP4, we analyzed dsRNAs of strains H-2, FI-14 and FI-23 as well as their reassortants by polyacrylamide gel electrophoresis (PAGE) at varying concentrations of acrylamide. The relative position of the VP4 gene of the three equine P[12] strains varied (either genome segment 3 or 4) depending upon the concentration of acrylamide. The VP4 gene bearing P[3], P[4], P[6], P[7], P[8] or P[18] specificity did not exhibit this phenomenon when the PAGE running conditions were varied. Conclusions The concentration of acrylamide in a PAGE gel affected VP4 gene coding assignment of equine rotavirus strains bearing P[12] specificity. PMID:20573245

  19. Comparing genomes with rearrangements and segmental duplications.

    PubMed

    Shao, Mingfu; Moret, Bernard M E

    2015-06-15

    Large-scale evolutionary events such as genomic rearrange.ments and segmental duplications form an important part of the evolution of genomes and are widely studied from both biological and computational perspectives. A basic computational problem is to infer these events in the evolutionary history for given modern genomes, a task for which many algorithms have been proposed under various constraints. Algorithms that can handle both rearrangements and content-modifying events such as duplications and losses remain few and limited in their applicability. We study the comparison of two genomes under a model including general rearrangements (through double-cut-and-join) and segmental duplications. We formulate the comparison as an optimization problem and describe an exact algorithm to solve it by using an integer linear program. We also devise a sufficient condition and an efficient algorithm to identify optimal substructures, which can simplify the problem while preserving optimality. Using the optimal substructures with the integer linear program (ILP) formulation yields a practical and exact algorithm to solve the problem. We then apply our algorithm to assign in-paralogs and orthologs (a necessary step in handling duplications) and compare its performance with that of the state-of-the-art method MSOAR, using both simulations and real data. On simulated datasets, our method outperforms MSOAR by a significant margin, and on five well-annotated species, MSOAR achieves high accuracy, yet our method performs slightly better on each of the 10 pairwise comparisons. http://lcbb.epfl.ch/softwares/coser. © The Author 2015. Published by Oxford University Press.

  20. Genomic regulatory blocks encompass multiple neighboring genes and maintain conserved synteny in vertebrates

    PubMed Central

    Kikuta, Hiroshi; Laplante, Mary; Navratilova, Pavla; Komisarczuk, Anna Z.; Engström, Pär G.; Fredman, David; Akalin, Altuna; Caccamo, Mario; Sealy, Ian; Howe, Kerstin; Ghislain, Julien; Pezeron, Guillaume; Mourrain, Philippe; Ellingsen, Staale; Oates, Andrew C.; Thisse, Christine; Thisse, Bernard; Foucher, Isabelle; Adolf, Birgit; Geling, Andrea; Lenhard, Boris; Becker, Thomas S.

    2007-01-01

    We report evidence for a mechanism for the maintenance of long-range conserved synteny across vertebrate genomes. We found the largest mammal-teleost conserved chromosomal segments to be spanned by highly conserved noncoding elements (HCNEs), their developmental regulatory target genes, and phylogenetically and functionally unrelated “bystander” genes. Bystander genes are not specifically under the control of the regulatory elements that drive the target genes and are expressed in patterns that are different from those of the target genes. Reporter insertions distal to zebrafish developmental regulatory genes pax6.1/2, rx3, id1, and fgf8 and miRNA genes mirn9-1 and mirn9-5 recapitulate the expression patterns of these genes even if located inside or beyond bystander genes, suggesting that the regulatory domain of a developmental regulatory gene can extend into and beyond adjacent transcriptional units. We termed these chromosomal segments genomic regulatory blocks (GRBs). After whole genome duplication in teleosts, GRBs, including HCNEs and target genes, were often maintained in both copies, while bystander genes were typically lost from one GRB, strongly suggesting that evolutionary pressure acts to keep the single-copy GRBs of higher vertebrates intact. We show that loss of bystander genes and other mutational events suffered by duplicated GRBs in teleost genomes permits target gene identification and HCNE/target gene assignment. These findings explain the absence of evolutionary breakpoints from large vertebrate chromosomal segments and will aid in the recognition of position effect mutations within human GRBs. PMID:17387144

  1. Comparative analysis of isodisomic and heterodisomic segments in cases with maternal uniparental disomy 14 suggests more than one imprinted region.

    PubMed

    Kotzot, D

    2001-09-01

    The results of molecular investigations of 21 cases with complete or segmental maternal uniparental disomy (UPD) 14 published in the literature were compared with respect to isodisomic and heterodisomic segments. The aim of the study was to find hints toward imprinted regions other than the recently defined imprinted segment 14q32. Three regions with no isodisomic molecular marker were found. The most distal of these regions located on 14q32.12 and 14q32.13 supports the hypothesis of genomic imprinting as the cause of the maternal UPD 14 phenotype by synteny to the maternally imprinted region on mouse distal chromosome 12 and correlation with the recently defined imprinting cluster on human chromosome 14q32. The other two heterodisomic areas located on 14q11.2-->14q12 and 14q21.1-->14q31.2 are hints toward one or more additional regions of genomic imprinting on human chromosome 14.

  2. Complete Genome Sequence of a Novel Aquareovirus That Infects the Endangered Fountain Darter, Etheostoma fonticola.

    PubMed

    Iwanowicz, Luke R; Iwanowicz, Deborah D; Adams, Cynthia R; Lewis, Teresa D; Brandt, Thomas M; Cornman, Robert S; Sanders, Lakyn

    2016-12-22

    Here, we report the complete genome of a novel aquareovirus isolated from clinically normal fountain darters, Etheostoma fonticola, inhabiting the San Marcos River, Texas, USA. The complete genome consists of 23,958 bp consisting of 11 segments that range from 783 bp (S11) to 3,866 bp (S1). Copyright © 2016 Iwanowicz et al.

  3. Standard operating procedure for calculating genome-to-genome distances based on high-scoring segment pairs.

    PubMed

    Auch, Alexander F; Klenk, Hans-Peter; Göker, Markus

    2010-01-28

    DNA-DNA hybridization (DDH) is a widely applied wet-lab technique to obtain an estimate of the overall similarity between the genomes of two organisms. To base the species concept for prokaryotes ultimately on DDH was chosen by microbiologists as a pragmatic approach for deciding about the recognition of novel species, but also allowed a relatively high degree of standardization compared to other areas of taxonomy. However, DDH is tedious and error-prone and first and foremost cannot be used to incrementally establish a comparative database. Recent studies have shown that in-silico methods for the comparison of genome sequences can be used to replace DDH. Considering the ongoing rapid technological progress of sequencing methods, genome-based prokaryote taxonomy is coming into reach. However, calculating distances between genomes is dependent on multiple choices for software and program settings. We here provide an overview over the modifications that can be applied to distance methods based in high-scoring segment pairs (HSPs) or maximally unique matches (MUMs) and that need to be documented. General recommendations on determining HSPs using BLAST or other algorithms are also provided. As a reference implementation, we introduce the GGDC web server (http://ggdc.gbdp.org).

  4. Genome Partitioner: A web tool for multi-level partitioning of large-scale DNA constructs for synthetic biology applications

    PubMed Central

    Del Medico, Luca; Christen, Heinz; Christen, Beat

    2017-01-01

    Recent advances in lower-cost DNA synthesis techniques have enabled new innovations in the field of synthetic biology. Still, efficient design and higher-order assembly of genome-scale DNA constructs remains a labor-intensive process. Given the complexity, computer assisted design tools that fragment large DNA sequences into fabricable DNA blocks are needed to pave the way towards streamlined assembly of biological systems. Here, we present the Genome Partitioner software implemented as a web-based interface that permits multi-level partitioning of genome-scale DNA designs. Without the need for specialized computing skills, biologists can submit their DNA designs to a fully automated pipeline that generates the optimal retrosynthetic route for higher-order DNA assembly. To test the algorithm, we partitioned a 783 kb Caulobacter crescentus genome design. We validated the partitioning strategy by assembling a 20 kb test segment encompassing a difficult to synthesize DNA sequence. Successful assembly from 1 kb subblocks into the 20 kb segment highlights the effectiveness of the Genome Partitioner for reducing synthesis costs and timelines for higher-order DNA assembly. The Genome Partitioner is broadly applicable to translate DNA designs into ready to order sequences that can be assembled with standardized protocols, thus offering new opportunities to harness the diversity of microbial genomes for synthetic biology applications. The Genome Partitioner web tool can be accessed at https://christenlab.ethz.ch/GenomePartitioner. PMID:28531174

  5. Position-based scanning for comparative genomics and identification of genetic islands in Haemophilus influenzae type b.

    PubMed

    Bergman, Nicholas H; Akerley, Brian J

    2003-03-01

    Bacteria exhibit extensive genetic heterogeneity within species. In many cases, these differences account for virulence properties unique to specific strains. Several such loci have been discovered in the genome of the type b serotype of Haemophilus influenzae, a human pathogen able to cause meningitis, pneumonia, and septicemia. Here we report application of a PCR-based scanning procedure to compare the genome of a virulent type b (Hib) strain with that of the laboratory-passaged Rd KW20 strain for which a complete genome sequence is available. We have identified seven DNA segments or H. influenzae genetic islands (HiGIs) present in the type b genome and absent from the Rd genome. These segments vary in size and content and show signs of horizontal gene transfer in that their percent G+C content differs from that of the rest of the H. influenzae genome, they contain genes similar to those found on phages or other mobile elements, or they are flanked by DNA repeats. Several of these loci represent potential pathogenicity islands, because they contain genes likely to mediate interactions with the host. These newly identified genetic islands provide areas of investigation into both the evolution and pathogenesis of H. influenzae. In addition, the genome scanning approach developed to identify these islands provides a rapid means to compare the genomes of phenotypically diverse bacterial strains once the genome sequence of one representative strain has been determined.

  6. Genetic and Epigenetic Changes in Oilseed Rape (Brassica napus L.) Extracted from Intergeneric Allopolyploid and Additions with Orychophragmus.

    PubMed

    Gautam, Mayank; Dang, Yanwei; Ge, Xianhong; Shao, Yujiao; Li, Zaiyun

    2016-01-01

    Allopolyploidization with the merger of the genomes from different species has been shown to be associated with genetic and epigenetic changes. But the maintenance of such alterations related to one parental species after the genome is extracted from the allopolyploid remains to be detected. In this study, the genome of Brassica napus L. (2n = 38, genomes AACC) was extracted from its intergeneric allohexaploid (2n = 62, genomes AACCOO) with another crucifer Orychophragmus violaceus (2n = 24, genome OO), by backcrossing and development of alien addition lines. B. napus-type plants identified in the self-pollinated progenies of nine monosomic additions were analyzed by the methods of amplified fragment length polymorphism, sequence-specific amplified polymorphism, and methylation-sensitive amplified polymorphism. They showed modifications to certain extents in genomic components (loss and gain of DNA segments and transposons, introgression of alien DNA segments) and DNA methylation, compared with B. napus donor. The significant differences in the changes between the B. napus types extracted from these additions likely resulted from the different effects of individual alien chromosomes. Particularly, the additions which harbored the O. violaceus chromosome carrying dominant rRNA genes over those of B. napus tended to result in the development of plants which showed fewer changes, suggesting a role of the expression levels of alien rRNA genes in genomic stability. These results provided new cues for the genetic alterations in one parental genome that are maintained even after the genome becomes independent.

  7. Genetic and Epigenetic Changes in Oilseed Rape (Brassica napus L.) Extracted from Intergeneric Allopolyploid and Additions with Orychophragmus

    PubMed Central

    Gautam, Mayank; Dang, Yanwei; Ge, Xianhong; Shao, Yujiao; Li, Zaiyun

    2016-01-01

    Allopolyploidization with the merger of the genomes from different species has been shown to be associated with genetic and epigenetic changes. But the maintenance of such alterations related to one parental species after the genome is extracted from the allopolyploid remains to be detected. In this study, the genome of Brassica napus L. (2n = 38, genomes AACC) was extracted from its intergeneric allohexaploid (2n = 62, genomes AACCOO) with another crucifer Orychophragmus violaceus (2n = 24, genome OO), by backcrossing and development of alien addition lines. B. napus-type plants identified in the self-pollinated progenies of nine monosomic additions were analyzed by the methods of amplified fragment length polymorphism, sequence-specific amplified polymorphism, and methylation-sensitive amplified polymorphism. They showed modifications to certain extents in genomic components (loss and gain of DNA segments and transposons, introgression of alien DNA segments) and DNA methylation, compared with B. napus donor. The significant differences in the changes between the B. napus types extracted from these additions likely resulted from the different effects of individual alien chromosomes. Particularly, the additions which harbored the O. violaceus chromosome carrying dominant rRNA genes over those of B. napus tended to result in the development of plants which showed fewer changes, suggesting a role of the expression levels of alien rRNA genes in genomic stability. These results provided new cues for the genetic alterations in one parental genome that are maintained even after the genome becomes independent. PMID:27148282

  8. Sequencing an Ashkenazi reference panel supports population-targeted personal genomics and illuminates Jewish and European origins

    PubMed Central

    Carmi, Shai; Hui, Ken Y.; Kochav, Ethan; Liu, Xinmin; Xue, James; Grady, Fillan; Guha, Saurav; Upadhyay, Kinnari; Ben-Avraham, Dan; Mukherjee, Semanti; Bowen, B. Monica; Thomas, Tinu; Vijai, Joseph; Cruts, Marc; Froyen, Guy; Lambrechts, Diether; Plaisance, Stéphane; Van Broeckhoven, Christine; Van Damme, Philip; Van Marck, Herwig; Barzilai, Nir; Darvasi, Ariel; Offit, Kenneth; Bressman, Susan; Ozelius, Laurie J.; Peter, Inga; Cho, Judy H.; Ostrer, Harry; Atzmon, Gil; Clark, Lorraine N.; Lencz, Todd; Pe’er, Itsik

    2014-01-01

    The Ashkenazi Jewish (AJ) population is a genetic isolate close to European and Middle Eastern groups, with genetic diversity patterns conducive to disease mapping. Here we report high-depth sequencing of 128 complete genomes of AJ controls. Compared with European samples, our AJ panel has 47% more novel variants per genome and is eightfold more effective at filtering benign variants out of AJ clinical genomes. Our panel improves imputation accuracy for AJ SNP arrays by 28%, and covers at least one haplotype in ≈67% of any AJ genome with long, identical-by-descent segments. Reconstruction of recent AJ history from such segments confirms a recent bottleneck of merely ≈350 individuals. Modelling of ancient histories for AJ and European populations using their joint allele frequency spectrum determines AJ to be an even admixture of European and likely Middle Eastern origins. We date the split between the two ancestral populations to ≈12–25 Kyr, suggesting a predominantly Near Eastern source for the repopulation of Europe after the Last Glacial Maximum. PMID:25203624

  9. Identification of a European interserotypic reassortant strain of infectious bursal disease virus.

    PubMed

    Soubies, Sébastien M; Courtillon, Céline; Briand, François-Xavier; Queguiner-Leroux, Maryline; Courtois, David; Amelot, Michel; Grousson, Karine; Morillon, Paul; Herin, Jean-Bernard; Eterradossi, Nicolas

    2017-02-01

    Infectious bursal disease virus (IBDV, family Birnaviridae) is a bi-segmented double-stranded RNA virus for which two serotypes are described. Serotype 1 replicates in the bursa of Fabricius and causes an immunosuppressive and potentially fatal disease in young chickens. Serotype 2 is apathogenic in poultry species. Up to now, only one natural event of interserotypic reassortment has been described after the introduction of very virulent IBDV (vvIBDV) in the USA in 2009, resulting in an IBDV strain with its segment A related to vvIBDV and its segment B related to US serotype 2 strain OH. Here, we present the first European isolate illustrative of interserotypic reassortment. The reassorting isolate, named 100056, exhibits a genomic segment A typical of current European vvIBDV but a segment B close to European serotype 2 viruses, supporting an origin distinct from US strains. When inoculated into SPF chickens, isolate 100056 induced mild clinical signs in the absence of mortality but caused a severe bursal atrophy, which strongly suggests an immunosuppressive potential. These results illustrate that interserotypic reassortment is another mechanism that can create IBDV strains with a modified acute pathogenicity. As a consequence, and for a more precise inference of the possible phenotype, care should be taken that the molecular identification of IBDV strains is targeted to both genome segments.

  10. Genome-wide association study in 79,366 European-ancestry individuals informs the genetic architecture of 25-hydroxyvitamin D levels

    USDA-ARS?s Scientific Manuscript database

    Vitamin D is a steroid hormone precursor that is associated with a range of human traits and diseases. Previous GWAS of serum 25-hydroxyvitamin D concentrations have identified four genome-wide significant loci (GC, NADSYN1/DHCR7, CYP2R1, CYP24A1). In this study, we expand the previous SUNLIGHT Cons...

  11. Molecular characterization of African orthobunyaviruses.

    PubMed

    Yandoko, E Nakouné; Gribaldo, S; Finance, C; Le Faou, A; Rihn, B H

    2007-06-01

    The genus Orthobunyavirus is composed of segmented, negative-sense RNA viruses that are responsible for mild to severe human diseases. To date, no molecular studies of bunyaviruses in the genus Orthobunyavirus from central Africa have been reported, and their classification relies on serological testing. Four new primer pairs for RT-PCR amplification and sequencing of the complete genomic small (S) RNA segments of 10 orthobunyaviruses isolated from the Central African Republic and pertaining to five different serogroups have been designed and evaluated. Phylogenetic analysis showed that these 10 viruses belong to the Bunyamwera serogroup. The S segment sequences differ from those of the Bunyamwera virus reference strain by 5-15 % at the nucleotide level, and both overlapping reading frames, encoding the nucleocapsid (N) and non-structural (NS) proteins, were evident in sequenced genomes. This study should improve diagnosis and surveillance of African bunyaviruses.

  12. Proteomic analysis of urine in patients with intestinal segments transposed into the urinary tract.

    PubMed

    Nabi, Ghulam; N'Dow, James; Hasan, Tahseen S; Booth, Ian R; Cash, Phil

    2005-04-01

    Intestinal segments are used to replace or reconstruct the urinary bladder when it has become dysfunctional or develops life-threatening disease such as cancer. The quality of life in patients with intestinal segments used to either enlarge or completely replace the native bladder is adversely affected by recurrent urinary tract infections, excessive mucus production and the occasional development of malignancy. At present, there is no reliable method of predicting or noninvasively monitoring these patients for the development of these complications. The characterisation of proteins secreted into urine from the transposed intestinal segments could serve as important indicators of these clinical complications. Urine is an ideal source of material in which to search for biomarkers, since it bathes the affected tissues and can be obtained relatively easily by noninvasive methods. The urinary proteome of patients with intestinal segments transposed into the urinary tract is unknown and we present the first global description of the urinary protein profile in these patients. Sample preparation is a critical step in achieving accurate and reliable data. We describe a method to prepare urinary proteins that was compatible with their subsequent analysis using two-dimensional polyacrylamide gel electrophoresis. This method helped to overcome some of the technical problems encountered in analysing urine from this patient cohort. The method was used to analyse urinary proteins recovered from five healthy controls and ten patients with intestinal segments transposed into the urinary tract. Four low molecular weight proteins were found to be present in nine out of ten for the patient group but for none of the healthy controls. The four proteins were identified as lithostathine-1 alpha precursor, pancreatitis associated protein-1 precursor, liver fatty acid binding protein and testis expressed protein-12. The role of these proteins as potential biomarkers of intestinal cell activity within the reconstructed bladder is discussed.

  13. Whole genome sequencing identifies influenza A H3N2 transmission and offers superior resolution to classical typing methods.

    PubMed

    Meinel, Dominik M; Heinzinger, Susanne; Eberle, Ute; Ackermann, Nikolaus; Schönberger, Katharina; Sing, Andreas

    2018-02-01

    Influenza with its annual epidemic waves is a major cause of morbidity and mortality worldwide. However, only little whole genome data are available regarding the molecular epidemiology promoting our understanding of viral spread in human populations. We implemented a RT-PCR strategy starting from patient material to generate influenza A whole genome sequences for molecular epidemiological surveillance. Samples were obtained within the Bavarian Influenza Sentinel. The complete influenza virus genome was amplified by a one-tube multiplex RT-PCR and sequenced on an Illumina MiSeq. We report whole genomic sequences for 50 influenza A H3N2 viruses, which was the predominating virus in the season 2014/15, directly from patient specimens. The dataset included random samples from Bavaria (Germany) throughout the influenza season and samples from three suspected transmission clusters. We identified the outbreak samples based on sequence identity. Whole genome sequencing (WGS) was superior in resolution compared to analysis of single segments or partial segment analysis. Additionally, we detected manifestation of substantial amounts of viral quasispecies in several patients, carrying mutations varying from the dominant virus in each patient. Our rapid whole genome sequencing approach for influenza A virus shows that WGS can effectively be used to detect and understand outbreaks in large communities. Additionally, the genomic data provide in-depth details about the circulating virus within one season.

  14. In situ structures of the segmented genome and RNA polymerase complex inside a dsRNA virus

    NASA Astrophysics Data System (ADS)

    Zhang, Xing; Ding, Ke; Yu, Xuekui; Chang, Winston; Sun, Jingchen; Hong Zhou, Z.

    2015-11-01

    Viruses in the Reoviridae, like the triple-shelled human rotavirus and the single-shelled insect cytoplasmic polyhedrosis virus (CPV), all package a genome of segmented double-stranded RNAs (dsRNAs) inside the viral capsid and carry out endogenous messenger RNA synthesis through a transcriptional enzyme complex (TEC). By direct electron-counting cryoelectron microscopy and asymmetric reconstruction, we have determined the organization of the dsRNA genome inside quiescent CPV (q-CPV) and the in situ atomic structures of TEC within CPV in both quiescent and transcribing (t-CPV) states. We show that the ten segmented dsRNAs in CPV are organized with ten TECs in a specific, non-symmetric manner, with each dsRNA segment attached directly to a TEC. The TEC consists of two extensively interacting subunits: an RNA-dependent RNA polymerase (RdRP) and an NTPase VP4. We find that the bracelet domain of RdRP undergoes marked conformational change when q-CPV is converted to t-CPV, leading to formation of the RNA template entry channel and access to the polymerase active site. An amino-terminal helix from each of two subunits of the capsid shell protein (CSP) interacts with VP4 and RdRP. These findings establish the link between sensing of environmental cues by the external proteins and activation of endogenous RNA transcription by the TEC inside the virus.

  15. The Statistical Segment Length of DNA: Opportunities for Biomechanical Modeling in Polymer Physics and Next-Generation Genomics.

    PubMed

    Dorfman, Kevin D

    2018-02-01

    The development of bright bisintercalating dyes for deoxyribonucleic acid (DNA) in the 1990s, most notably YOYO-1, revolutionized the field of polymer physics in the ensuing years. These dyes, in conjunction with modern molecular biology techniques, permit the facile observation of polymer dynamics via fluorescence microscopy and thus direct tests of different theories of polymer dynamics. At the same time, they have played a key role in advancing an emerging next-generation method known as genome mapping in nanochannels. The effect of intercalation on the bending energy of DNA as embodied by a change in its statistical segment length (or, alternatively, its persistence length) has been the subject of significant controversy. The precise value of the statistical segment length is critical for the proper interpretation of polymer physics experiments and controls the phenomena underlying the aforementioned genomics technology. In this perspective, we briefly review the model of DNA as a wormlike chain and a trio of methods (light scattering, optical or magnetic tweezers, and atomic force microscopy (AFM)) that have been used to determine the statistical segment length of DNA. We then outline the disagreement in the literature over the role of bisintercalation on the bending energy of DNA, and how a multiscale biomechanical approach could provide an important model for this scientifically and technologically relevant problem.

  16. Characterization of the complete genome segments from BmCPV-SZ, a novel Bombyx mori cypovirus 1 isolate.

    PubMed

    Cao, Guangli; Meng, Xiangkun; Xue, Renyu; Zhu, Yuexiong; Zhang, Xiaorong; Pan, Zhonghua; Zheng, Xiaojian; Gong, Chengliang

    2012-07-01

    A novel Bombyx mori cypovirus 1 isolated from infected silkworm larvae and tentatively assigned as Bombyx mori cypovirus 1 isolate Suzhou (BmCPV-SZ). The complete nucleotide sequences of genomic segments S1-S10 from BmCPV-SZ were determined. All segments possessed a single open reading frame; however, bioinformatic evidence suggested a short overlapping coding sequence in S1. Each BmCPV-SZ segment possessed the conserved terminal sequences AGUAA and GUUAGCC at the 5' and 3' ends, respectively. The conserved A/G at the -3 position in relation to the AUG codon could be found in the BmCPV-SZ genome, and it was postulated that this conserved A/G may be the most important nucleotide for efficient translation initiation in cypoviruses (CPVs). Examination of the putative amino acid sequences encoded by BmCPV-SZ revealed some characteristic motifs. Homology searches showed that viral structural proteins VP1, VP3, and VP4 had localized homologies with proteins of Rice ragged stunt virus , a member of the genus Oryzavirus within the family Reoviridae. A phylogenetic tree based on RNA-dependent RNA polymerase sequences demonstrated that CPV is more closely related to Rice ragged stunt virus and Aedes pseudoscutellaris reovirus than to other members of Reoviridae, suggesting that they may have originated from common ancestors.

  17. Four-segmented Rift Valley fever virus-based vaccines can be applied safely in ewes during pregnancy.

    PubMed

    Wichgers Schreur, Paul J; van Keulen, Lucien; Kant, Jet; Kortekaas, Jeroen

    2017-05-25

    Rift Valley fever virus (RVFV) causes severe and recurrent outbreaks on the African continent and the Arabian Peninsula and continues to expand its habitat. This mosquito-borne virus, belonging to the genus Phlebovirus of the family Bunyaviridae contains a tri-segmented negative-strand RNA genome. Previously, we developed four-segmented RVFV (RVFV-4s) variants by splitting the M-genome segment into two M-type segments each encoding one of the structural glycoproteins; Gn or Gc. Vaccination/challenge experiments with mice and lambs subsequently showed that RVFV-4s induces protective immunity against wild-type virus infection after a single administration. To demonstrate the unprecedented safety of RVFV-4s, we here report that the virus does not cause encephalitis after intranasal inoculation of mice. A study with pregnant ewes subsequently revealed that RVFV-4s does not cause viremia and does not cross the ovine placental barrier, as evidenced by the absence of teratogenic effects and virus in the blood and organs of the fetuses. Altogether, these results show that the RVFV-4s vaccine virus can be applied safely in pregnant ewes. Copyright © 2017 Elsevier Ltd. All rights reserved.

  18. Individualized cattle copy number and segmental duplication maps using next generation sequencing

    USDA-ARS?s Scientific Manuscript database

    Copy Number Variations (CNVs) affect a wide range of phenotypic traits; however, CNVs in or near segmental duplication regions are often intractable. Using a read depth approach based on next generation sequencing, we examined genome-wide copy number differences among five taurine (three Angus, one ...

  19. A comprehensive list of cloned human DNA sequences

    PubMed Central

    Schmidtke, Jörg; Cooper, David N.

    1987-01-01

    A list of DNA sequences cloned from the human genome is presented. Intended as a guide to clone availability, this list includes published reports of cDNA, genomic and synthetic clones comprising gene and pseudogene sequences, uncharacterised DNA segments and repetitive DNA elements. PMID:3575113

  20. A comprehensive list of cloned human DNA sequences

    PubMed Central

    Schmidtke, Jörg; Cooper, David N.

    1990-01-01

    A list of DNA sequences cloned from the human genome is presented. Intended as a guide to clone availability, this list includes published reports of cDNA, genomic and synthetic clones comprising gene and pseudogene sequences, uncharacterised DNA segments and repetitive DNA elements. PMID:2333227

  1. A comprehensive list of cloned human DNA sequences

    PubMed Central

    Schmidtke, Jörg; Cooper, David N.

    1988-01-01

    A list of DNA sequences cloned from the human genome is presented. Intended as a guide to clone availability, this list includes published reports of cDNA, genomic and synthetic clones comprising gene and pseudogene sequences, uncharacterised DNA segments and repetitive DNA elements. PMID:3368330

  2. A comprehensive list of cloned human DNA sequences

    PubMed Central

    Schmidtke, Jörg; Cooper, David N.

    1989-01-01

    A list of DNA sequences cloned from the human genome is presented. Intended as a guide to clone availability, this list includes published reports of cDNA, genomic and synthetic clones comprising gene and pseudogene sequences, uncharacterised DNA segments and repetitive DNA elements. PMID:2654889

  3. Sequence and Analysis of the Tomato JOINTLESS Locus1

    PubMed Central

    Mao, Long; Begum, Dilara; Goff, Stephen A.; Wing, Rod A.

    2001-01-01

    A 119-kb bacterial artificial chromosome from the JOINTLESS locus on the tomato (Lycopersicon esculentum) chromosome 11 contained 15 putative genes. Repetitive sequences in this region include one copia-like LTR retrotransposon, 13 simple sequence repeats, three copies of a novel type III foldback transposon, and four putative short DNA repeats. Database searches showed that the foldback transposon and the short DNA repeats seemed to be associated preferably with genes. The predicted tomato genes were compared with the complete Arabidopsis genome. Eleven out of 15 tomato open reading frames were found to be colinear with segments on five Arabidopsis bacterial artificial chromosome/P1-derived artificial chromosome clones. The synteny patterns, however, did not reveal duplicated segments in Arabidopsis, where over half of the genome is duplicated. Our analysis indicated that the microsynteny between the tomato and Arabidopsis genomes was still conserved at a very small scale but was complicated by the large number of gene families in the Arabidopsis genome. PMID:11457984

  4. Genetic Characterization of the Tick-Borne Orbiviruses

    PubMed Central

    Belaganahalli, Manjunatha N.; Maan, Sushila; Maan, Narender S.; Brownlie, Joe; Tesh, Robert; Attoui, Houssam; Mertens, Peter P. C.

    2015-01-01

    The International Committee for Taxonomy of Viruses (ICTV) recognizes four species of tick-borne orbiviruses (TBOs): Chenuda virus, Chobar Gorge virus, Wad Medani virus and Great Island virus (genus Orbivirus, family Reoviridae). Nucleotide (nt) and amino acid (aa) sequence comparisons provide a basis for orbivirus detection and classification, however full genome sequence data were only available for the Great Island virus species. We report representative genome-sequences for the three other TBO species (virus isolates: Chenuda virus (CNUV); Chobar Gorge virus (CGV) and Wad Medani virus (WMV)). Phylogenetic comparisons show that TBOs cluster separately from insect-borne orbiviruses (IBOs). CNUV, CGV, WMV and GIV share low level aa/nt identities with other orbiviruses, in ‘conserved’ Pol, T2 and T13 proteins/genes, identifying them as four distinct virus-species. The TBO genome segment encoding cell attachment, outer capsid protein 1 (OC1), is approximately half the size of the equivalent segment from insect-borne orbiviruses, helping to explain why tick-borne orbiviruses have a ~1 kb smaller genome. PMID:25928203

  5. Genetic characterization of the tick-borne orbiviruses.

    PubMed

    Belaganahalli, Manjunatha N; Maan, Sushila; Maan, Narender S; Brownlie, Joe; Tesh, Robert; Attoui, Houssam; Mertens, Peter P C

    2015-04-28

    The International Committee for Taxonomy of Viruses (ICTV) recognizes four species of tick-borne orbiviruses (TBOs): Chenuda virus, Chobar Gorge virus, Wad Medani virus and Great Island virus (genus Orbivirus, family Reoviridae). Nucleotide (nt) and amino acid (aa) sequence comparisons provide a basis for orbivirus detection and classification, however full genome sequence data were only available for the Great Island virus species. We report representative genome-sequences for the three other TBO species (virus isolates: Chenuda virus (CNUV); Chobar Gorge virus (CGV) and Wad Medani virus (WMV)). Phylogenetic comparisons show that TBOs cluster separately from insect-borne orbiviruses (IBOs). CNUV, CGV, WMV and GIV share low level aa/nt identities with other orbiviruses, in 'conserved' Pol, T2 and T13 proteins/genes, identifying them as four distinct virus-species. The TBO genome segment encoding cell attachment, outer capsid protein 1 (OC1), is approximately half the size of the equivalent segment from insect-borne orbiviruses, helping to explain why tick-borne orbiviruses have a ~1 kb smaller genome.

  6. The genome of Th17 cell-inducing segmented filamentous bacteria reveals extensive auxotrophy and adaptations to the intestinal environment

    PubMed Central

    Sczesnak, Andrew; Segata, Nicola; Qin, Xiang; Gevers, Dirk; Petrosino, Joseph F.; Huttenhower, Curtis; Littman, Dan R.; Ivanov, Ivaylo I.

    2011-01-01

    Summary Perturbations of the composition of the symbiotic intestinal microbiota can have profound consequences for host metabolism and immunity. In mice, segmented filamentous bacteria (SFB) direct the accumulation of potentially pro-inflammatory Th17 cells in the intestinal lamina propria. We present the genome sequence of SFB isolated from mono-colonized mice, which classifies SFB phylogenetically as a unique member of Clostridiales with a highly reduced genome. Annotation analysis demonstrates that SFB depends on its environment for amino acids and essential nutrients and may utilize host and dietary glycans for carbon, nitrogen, and energy. Comparative analyses reveal that SFB is functionally related to members of the genus Clostridium and several pathogenic or commensal “minimal” genera, including Finegoldia, Mycoplasma, Borrelia, and Phytoplasma. However, SFB is functionally distinct from all 1,200 examined genomes, indicating a gene complement representing biology relatively unique to its role as a gut commensal closely tied to host metabolism and immunity. PMID:21925113

  7. Targetable kinase-activating lesions in Ph-like acute lymphoblastic leukemia | Office of Cancer Genomics

    Cancer.gov

    Publication Abstract:  Philadelphia chromosome-like acute lymphoblastic leukemia (Ph-like ALL) is characterized by a gene-expression profile similar to that of BCR-ABL1-positive ALL, alterations of lymphoid transcription factor genes, and a poor outcome. The frequency and spectrum of genetic alterations in Ph-like ALL and its responsiveness to tyrosine kinase inhibition are undefined, especially in adolescents and adults. We performed genomic profiling of 1725 patients with precursor B-cell ALL and detailed genomic analysis of 154 patients with Ph-like ALL.

  8. Segmental Duplications in Euchromatic Regions of Human Chromosome 5: A Source of Evolutionary Instability and Transcriptional Innovation

    PubMed Central

    Courseaux, Anouk; Richard, Florence; Grosgeorge, Josiane; Ortola, Christine; Viale, Agnes; Turc-Carel, Claude; Dutrillaux, Bernard; Gaudray, Patrick; Nahon, Jean-Louis

    2003-01-01

    Recent analyses of the structure of pericentromeric and subtelomeric regions have revealed that these particular regions of human chromosomes are often composed of blocks of duplicated genomic segments that have been associated with rapid evolutionary turnover among the genomes of closely related primates. In the present study, we show that euchromatic regions of human chromosome 5—5p14, 5p13, 5q13, 5q15–5q21—also display such an accumulation of segmental duplications. The structure, organization and evolution of those primate-specific sequences were studied in detail by combining in silico and comparative FISH analyses on human, chimpanzee, gorilla, orangutang, macaca, and capuchin chromosomes. Our results lend support to a two-step model of transposition duplication in the euchromatic regions, with a founder insertional event at the time of divergence between Platyrrhini and Catarrhini (25–35 million years ago) and an apparent burst of inter- and intrachromosomal duplications in the Hominidae lineage. Furthermore, phylogenetic analysis suggests that the chronology and, likely, molecular mechanisms, differ regarding the region of primary insertion—euchromatic versus pericentromeric regions. Lastly, we show that as their counterparts located near the heterochromatic region, the euchromatic segmental duplications have consistently reshaped their region of insertion during primate evolution, creating putative mosaic genes, and they are obvious candidates for causing ectopic rearrangements that have contributed to evolutionary/genomic instability. [Supplemental material is available online at www.genome.org. The following individuals kindly provided reagents, samples, or unpublished information as indicated in the paper: D. Le Paslier, A. McKenzie, J. Melki, C. Sargent, J. Scharf and S. Selig.] PMID:12618367

  9. Identification of the Genome Segments of Bluetongue Virus Serotype 26 (Isolate KUW2010/02) that Restrict Replication in a Culicoides sonorensis Cell Line (KC Cells).

    PubMed

    Pullinger, Gillian D; Guimerà Busquets, Marc; Nomikou, Kyriaki; Boyce, Mark; Attoui, Houssam; Mertens, Peter P

    2016-01-01

    Bluetongue virus (BTV) can infect most ruminant species and is usually transmitted by adult, vector-competent biting midges (Culicoides spp.). Infection with BTV can cause severe clinical signs and can be fatal, particularly in naïve sheep and some deer species. Although 24 distinct BTV serotypes were recognized for several decades, additional 'types' have recently been identified, including BTV-25 (from Switzerland), BTV-26 (from Kuwait) and BTV-27 from France (Corsica). Although BTV-25 has failed to grow in either insect or mammalian cell cultures, BTV-26 (isolate KUW2010/02), which can be transmitted horizontally between goats in the absence of vector insects, does not replicate in a Culicoides sonorensis cell line (KC cells) but can be propagated in mammalian cells (BSR cells). The BTV genome consists of ten segments of linear dsRNA. Mono-reassortant viruses were generated by reverse-genetics, each one containing a single BTV-26 genome segment in a BTV-1 genetic-background. However, attempts to recover a mono-reassortant containing genome-segment 2 (Seg-2) of BTV-26 (encoding VP2), were unsuccessful but a triple-reassortant was successfully generated containing Seg-2, Seg-6 and Seg-7 (encoding VP5 and VP7 respectively) of BTV-26. Reassortants were recovered and most replicated well in mammalian cells (BSR cells). However, mono-reassortants containing Seg-1 or Seg-3 of BTV-26 (encoding VP1, or VP3 respectively) and the triple reassortant failed to replicate, while a mono-reassortant containing Seg-7 of BTV-26 only replicated slowly in KC cells.

  10. Characterization and Evolution of Conserved MicroRNA through Duplication Events in Date Palm (Phoenix dactylifera)

    PubMed Central

    Yang, Yaodong; Mason, Annaliese S.; Lei, Xintao; Ma, Zilong

    2013-01-01

    MicroRNAs (miRNAs) are important regulators of gene expression at the post-transcriptional level in a wide range of species. Highly conserved miRNAs regulate ancestral transcription factors common to all plants, and control important basic processes such as cell division and meristem function. We selected 21 conserved miRNA families to analyze the distribution and maintenance of miRNAs. Recently, the first genome sequence in Palmaceae was released: date palm (Phoenix dactylifera). We conducted a systematic miRNA analysis in date palm, computationally identifying and characterizing the distribution and duplication of conserved miRNAs in this species compared to other published plant genomes. A total of 81 miRNAs belonging to 18 miRNA families were identified in date palm. The majority of miRNAs in date palm and seven other well-studied plant species were located in intergenic regions and located 4 to 5 kb away from the nearest protein-coding genes. Sequence comparison showed that 67% of date palm miRNA members were present in duplicated segments, and that 135 pairs of miRNA-containing segments were duplicated in Arabidopsis, tomato, orange, rice, apple, poplar and soybean with a high similarity of non coding sequences between duplicated segments, indicating genomic duplication was a major force for expansion of conserved miRNAs. Duplicated miRNA pairs in date palm showed divergence in pre-miRNA sequence and in number of promoters, implying that these duplicated pairs may have undergone divergent evolution. Comparisons between date palm and the seven other plant species for the gain/loss of miR167 loci in an ancient segment shared between monocots and dicots suggested that these conserved miRNAs were highly influenced by and diverged as a result of genomic duplication events. PMID:23951162

  11. Characterization and evolution of conserved MicroRNA through duplication events in date palm (Phoenix dactylifera).

    PubMed

    Xiao, Yong; Xia, Wei; Yang, Yaodong; Mason, Annaliese S; Lei, Xintao; Ma, Zilong

    2013-01-01

    MicroRNAs (miRNAs) are important regulators of gene expression at the post-transcriptional level in a wide range of species. Highly conserved miRNAs regulate ancestral transcription factors common to all plants, and control important basic processes such as cell division and meristem function. We selected 21 conserved miRNA families to analyze the distribution and maintenance of miRNAs. Recently, the first genome sequence in Palmaceae was released: date palm (Phoenix dactylifera). We conducted a systematic miRNA analysis in date palm, computationally identifying and characterizing the distribution and duplication of conserved miRNAs in this species compared to other published plant genomes. A total of 81 miRNAs belonging to 18 miRNA families were identified in date palm. The majority of miRNAs in date palm and seven other well-studied plant species were located in intergenic regions and located 4 to 5 kb away from the nearest protein-coding genes. Sequence comparison showed that 67% of date palm miRNA members were present in duplicated segments, and that 135 pairs of miRNA-containing segments were duplicated in Arabidopsis, tomato, orange, rice, apple, poplar and soybean with a high similarity of non coding sequences between duplicated segments, indicating genomic duplication was a major force for expansion of conserved miRNAs. Duplicated miRNA pairs in date palm showed divergence in pre-miRNA sequence and in number of promoters, implying that these duplicated pairs may have undergone divergent evolution. Comparisons between date palm and the seven other plant species for the gain/loss of miR167 loci in an ancient segment shared between monocots and dicots suggested that these conserved miRNAs were highly influenced by and diverged as a result of genomic duplication events.

  12. Both Genome Segments Contribute to the Pathogenicity of Very Virulent Infectious Bursal Disease Virus

    PubMed Central

    Escaffre, Olivier; Le Nouën, Cyril; Amelot, Michel; Ambroggio, Xavier; Ogden, Kristen M.; Guionie, Olivier; Toquin, Didier; Müller, Hermann; Islam, Mohammed R.

    2013-01-01

    Infectious bursal disease virus (IBDV) causes an economically significant disease of chickens worldwide. Very virulent IBDV (vvIBDV) strains have emerged and induce as much as 60% mortality. The molecular basis for vvIBDV pathogenicity is not understood, and the relative contributions of the two genome segments, A and B, to this phenomenon are not known. Isolate 94432 has been shown previously to be genetically related to vvIBDVs but exhibits atypical antigenicity and does not cause mortality. Here the full-length genome of 94432 was determined, and a reverse genetics system was established. The molecular clone was rescued and exhibited the same antigenicity and reduced pathogenicity as isolate 94432. Genetically modified viruses derived from 94432, whose vvIBDV consensus nucleotide sequence was restored in segment A and/or B, were produced, and their pathogenicity was assessed in specific-pathogen-free chickens. We found that a valine (position 321) that modifies the most exposed part of the capsid protein VP2 critically modified the antigenicity and partially reduced the pathogenicity of 94432. However, a threonine (position 276) located in the finger domain of the virus polymerase (VP1) contributed even more significantly to attenuation. This threonine is partially exposed in a hydrophobic groove on the VP1 surface, suggesting possible interactions between VP1 and another, as yet unidentified molecule at this amino acid position. The restored vvIBDV-like pathogenicity was associated with increased replication and lesions in the thymus and spleen. These results demonstrate that both genome segments influence vvIBDV pathogenicity and may provide new targets for the attenuation of vvIBDVs. PMID:23269788

  13. Rearrangement of Influenza Virus Spliced Segments for the Development of Live-Attenuated Vaccines

    PubMed Central

    Nogales, Aitor; DeDiego, Marta L.; Topham, David J.

    2016-01-01

    ABSTRACT Influenza viral infections represent a serious public health problem, with influenza virus causing a contagious respiratory disease which is most effectively prevented through vaccination. Segments 7 (M) and 8 (NS) of the influenza virus genome encode mRNA transcripts that are alternatively spliced to express two different viral proteins. This study describes the generation, using reverse genetics, of three different recombinant influenza A/Puerto Rico/8/1934 (PR8) H1N1 viruses containing M or NS viral segments individually or modified M or NS viral segments combined in which the overlapping open reading frames of matrix 1 (M1)/M2 for the modified M segment and the open reading frames of nonstructural protein 1 (NS1)/nuclear export protein (NEP) for the modified NS segment were split by using the porcine teschovirus 1 (PTV-1) 2A autoproteolytic cleavage site. Viruses with an M split segment were impaired in replication at nonpermissive high temperatures, whereas high viral titers could be obtained at permissive low temperatures (33°C). Furthermore, viruses containing the M split segment were highly attenuated in vivo, while they retained their immunogenicity and provided protection against a lethal challenge with wild-type PR8. These results indicate that influenza viruses can be effectively attenuated by the rearrangement of spliced segments and that such attenuated viruses represent an excellent option as safe, immunogenic, and protective live-attenuated vaccines. Moreover, this is the first time in which an influenza virus containing a restructured M segment has been described. Reorganization of the M segment to encode M1 and M2 from two separate, nonoverlapping, independent open reading frames represents a useful tool to independently study mutations in the M1 and M2 viral proteins without affecting the other viral M product. IMPORTANCE Vaccination represents our best therapeutic option against influenza viral infections. However, the efficacy of current influenza vaccines is suboptimal, and novel approaches are necessary for the prevention of disease caused by this important human respiratory pathogen. In this work, we describe a novel approach to generate safer and more efficient live-attenuated influenza virus vaccines (LAIVs) based on recombinant viruses whose genomes encode nonoverlapping and independent M1/M2 (split M segment [Ms]) or both M1/M2 and NS1/NEP (Ms and split NS segment [NSs]) open reading frames. Viruses containing a modified M segment were highly attenuated in mice but were able to confer, upon a single intranasal immunization, complete protection against a lethal homologous challenge with wild-type virus. Notably, the protection efficacy conferred by our viruses with split M segments was better than that conferred by the current temperature-sensitive LAIV. Altogether, these results open a new avenue for the development of safer and more protective LAIVs on the basis of the reorganization of spliced viral RNA segments in the genome. PMID:27122587

  14. Library Resources for Bac End Sequencing. Final Technical Report

    DOE Office of Scientific and Technical Information (OSTI.GOV)

    Pieter J. de Jong

    2000-10-01

    Studies directed towards the specific aims outlined for this research award are summarized. The RPCI II Human Bac Library has been expanded by the addition of 6.9-fold genomic coverage. This segment has been generated from a MBOI partial digest of the same anonymous donor DNA used for the rest of the library. A new cloning vector, pTARBAC1, has been constructed and used in the construction of RPCI-II segment 5. This new cloning vector provides a new strategy in identifying targeted genomic regions and will greatly facilitate a large-scale analysis for positional cloning. A new maleCS7BC/6J mouse BAC library has beenmore » constructed. RPCI-23 contain 576 plates (approx 210,000 clones) and represents approximately 11-fold coverage of the mouse genome.« less

  15. Superpixel guided active contour segmentation of retinal layers in OCT volumes

    NASA Astrophysics Data System (ADS)

    Bai, Fangliang; Gibson, Stuart J.; Marques, Manuel J.; Podoleanu, Adrian

    2018-03-01

    Retinal OCT image segmentation is a precursor to subsequent medical diagnosis by a clinician or machine learning algorithm. In the last decade, many algorithms have been proposed to detect retinal layer boundaries and simplify the image representation. Inspired by the recent success of superpixel methods for pre-processing natural images, we present a novel framework for segmentation of retinal layers in OCT volume data. In our framework, the region of interest (e.g. the fovea) is located using an adaptive-curve method. The cell layer boundaries are then robustly detected firstly using 1D superpixels, applied to A-scans, and then fitting active contours in B-scan images. Thereafter the 3D cell layer surfaces are efficiently segmented from the volume data. The framework was tested on healthy eye data and we show that it is capable of segmenting up to 12 layers. The experimental results imply the effectiveness of proposed method and indicate its robustness to low image resolution and intrinsic speckle noise.

  16. Comparative Genomics of the Ectomycorrhizal Sister Species Rhizopogon vinicolor and Rhizopogon vesiculosus (Basidiomycota: Boletales) Reveals a Divergence of the Mating Type B Locus

    PubMed Central

    Mujic, Alija Bajro; Kuo, Alan; Tritt, Andrew; Lipzen, Anna; Chen, Cindy; Johnson, Jenifer; Sharma, Aditi; Barry, Kerrie; Grigoriev, Igor V.; Spatafora, Joseph W.

    2017-01-01

    Divergence of breeding system plays an important role in fungal speciation. Ectomycorrhizal fungi, however, pose a challenge for the study of reproductive biology because most cannot be mated under laboratory conditions. To overcome this barrier, we sequenced the draft genomes of the ectomycorrhizal sister species Rhizopogon vinicolor Smith and Zeller and R. vesiculosus Smith and Zeller (Basidiomycota, Boletales)—the first genomes available for Basidiomycota truffles—and characterized gene content and organization surrounding their mating type loci. Both species possess a pair of homeodomain transcription factor homologs at the mating type A-locus as well as pheromone receptor and pheromone precursor homologs at the mating type B-locus. Comparison of Rhizopogon genomes with genomes from Boletales, Agaricales, and Polyporales revealed synteny of the A-locus region within Boletales, but several genomic rearrangements across orders. Our findings suggest correlation between gene content at the B-locus region and breeding system in Boletales with tetrapolar species possessing more diverse gene content than bipolar species. Rhizopogon vinicolor possesses a greater number of B-locus pheromone receptor and precursor genes than R. vesiculosus, as well as a pair of isoprenyl cysteine methyltransferase genes flanking the B-locus compared to a single copy in R. vesiculosus. Examination of dikaryotic single nucleotide polymorphisms within genomes revealed greater heterozygosity in R. vinicolor, consistent with increased rates of outcrossing. Both species possess the components of a heterothallic breeding system with R. vinicolor possessing a B-locus region structure consistent with tetrapolar Boletales and R. vesiculosus possessing a B-locus region structure intermediate between bipolar and tetrapolar Boletales. PMID:28450370

  17. Transcriptome Analysis of Human Immune Responses Following Live Vaccine Strain (LVS) Francisella Tularensis Vaccination

    DTIC Science & Technology

    2007-03-08

    with CD3D 50848 PAR1/UBE3A Prader–Willi syndrome chromosome region 1, GMCSFRalpha precursor, IL3Ralpha precursor (CD123) Brain development...intervention programs justifiable? Emerg. Infect. Dis. 3, 83–94. iebel, U., Kindler , B., Pepperkok, R., 2004. ‘Harvester’: a fast meta search engine of human...protein resources. Bioinformatics 20, 1962–1963. iebel, U., Kindler , B., Pepperkok, R., 2005. Bioinformatic “Harvester”: a search engine for genome

  18. I-SceI-mediated double-strand break does not increase the frequency of homologous recombination at the Dct locus in mouse embryonic stem cells.

    PubMed

    Fenina, Myriam; Simon-Chazottes, Dominique; Vandormael-Pournin, Sandrine; Soueid, Jihane; Langa, Francina; Cohen-Tannoudji, Michel; Bernard, Bruno A; Panthier, Jean-Jacques

    2012-01-01

    Targeted induction of double-strand breaks (DSBs) at natural endogenous loci was shown to increase the rate of gene replacement by homologous recombination in mouse embryonic stem cells. The gene encoding dopachrome tautomerase (Dct) is specifically expressed in melanocytes and their precursors. To construct a genetic tool allowing the replacement of Dct gene by any gene of interest, we generated an embryonic stem cell line carrying the recognition site for the yeast I-SceI meganuclease embedded in the Dct genomic segment. The embryonic stem cell line was electroporated with an I-SceI expression plasmid, and a template for the DSB-repair process that carried sequence homologies to the Dct target. The I-SceI meganuclease was indeed able to introduce a DSB at the Dct locus in live embryonic stem cells. However, the level of gene targeting was not improved by the DSB induction, indicating a limited capacity of I-SceI to mediate homologous recombination at the Dct locus. These data suggest that homologous recombination by meganuclease-induced DSB may be locus dependent in mammalian cells.

  19. Complete genome sequences of two Staphylococcus aureus ST5 isolates from California, USA

    USDA-ARS?s Scientific Manuscript database

    Staphylococcus aureus is a bacteria that can cause disease in humans and animals. S. aureus bacteria can transfer or exchange segments of genetic material with other bacteria. These segments are known as mobile genetic elements and in some instances they can encode for factors that increase the abil...

  20. Draft genome sequences of 14 Staphylococcus aureus ST5 isolates from California, USA

    USDA-ARS?s Scientific Manuscript database

    Staphylococcus aureus is a bacteria that can cause disease in humans and animals. S. aureus bacteria can transfer or exchange segments of genetic material with other bacteria. These segments are known as mobile genetic elements and in some instances they can encode for factors that increase the abil...

  1. Non-canonical ribosomal DNA segments in the human genome, and nucleoli functioning.

    PubMed

    Kupriyanova, Natalia S; Netchvolodov, Kirill K; Sadova, Anastasia A; Cherepanova, Marina D; Ryskov, Alexei P

    2015-11-10

    Ribosomal DNA (rDNA) in the human genome is represented by tandem repeats of 43 kb nucleotide sequences that form nucleoli organizers (NORs) on each of five pairs of acrocentric chromosomes. RDNA-similar segments of different lengths are also present on (NOR)(-) chromosomes. Many of these segments contain nucleotide substitutions, supplementary microsatellite clusters, and extended deletions. Recently, it was shown that, in addition to ribosome biogenesis, nucleoli exhibit additional functions, such as cell-cycle regulation and response to stresses. In particular, several stress-inducible loci located in the ribosomal intergenic spacer (rIGS) produce stimuli-specific noncoding nucleolus RNAs. By mapping the 5'/3' ends of the rIGS segments scattered throughout (NOR)(-) chromosomes, we discovered that the bonds in the rIGS that were most often susceptible to disruption in the rIGS were adjacent to, or overlapped with stimuli-specific inducible loci. This suggests the interconnection of the two phenomena - nucleoli functioning and the scattering of rDNA-like sequences on (NOR)(-) chromosomes. Copyright © 2015 Elsevier B.V. All rights reserved.

  2. Copy number variation of individual cattle genomes using next-generation sequencing

    USDA-ARS?s Scientific Manuscript database

    Copy number variations (CNVs) affect a wide range of phenotypic traits; however, CNVs in or near segmental duplication regions are often intractable. Using a read depth approach based on next-generation sequencing, we examined genome-wide copy number differences among five taurine (three Angus, one ...

  3. Copy number variation of individual cattle genomes using next-generation sequencing

    USDA-ARS?s Scientific Manuscript database

    Copy Number Variations (CNVs) affect a wide range of phenotypic traits; however, CNVs in or near segmental duplication regions are often difficult to track. Using a read depth approach based on next generation sequencing, we examined genome-wide copy number differences among five taurine (three Angu...

  4. Revealing misassembled segments in the bovine reference genome by high resolution linkage disequilibrium scan

    USDA-ARS?s Scientific Manuscript database

    Misassembly signatures, created by shuffling the order of sequences while assembling a genome, can be easily seen by analyzing the unexpected behaviour of the linkage disequilibrium (LD) decay. A heuristic process was proposed to identify those misassembly signatures and presented the ones found in ...

  5. Tracking of wild allele introgressions in a peanut chromosome segment substitution line population

    USDA-ARS?s Scientific Manuscript database

    Cultivated peanut arose from the hybridization of the diploids Arachis duranensis (A genome progenitor) and Arachis ipaensis (B genome progenitor), followed by spontaneous chromosome doubling to yield the current allotetraploid state (AABB; 2n=4x=40). This genetic heritage, short period since polyp...

  6. Constraint factor graph cut-based active contour method for automated cellular image segmentation in RNAi screening.

    PubMed

    Chen, C; Li, H; Zhou, X; Wong, S T C

    2008-05-01

    Image-based, high throughput genome-wide RNA interference (RNAi) experiments are increasingly carried out to facilitate the understanding of gene functions in intricate biological processes. Automated screening of such experiments generates a large number of images with great variations in image quality, which makes manual analysis unreasonably time-consuming. Therefore, effective techniques for automatic image analysis are urgently needed, in which segmentation is one of the most important steps. This paper proposes a fully automatic method for cells segmentation in genome-wide RNAi screening images. The method consists of two steps: nuclei and cytoplasm segmentation. Nuclei are extracted and labelled to initialize cytoplasm segmentation. Since the quality of RNAi image is rather poor, a novel scale-adaptive steerable filter is designed to enhance the image in order to extract long and thin protrusions on the spiky cells. Then, constraint factor GCBAC method and morphological algorithms are combined to be an integrated method to segment tight clustered cells. Compared with the results obtained by using seeded watershed and the ground truth, that is, manual labelling results by experts in RNAi screening data, our method achieves higher accuracy. Compared with active contour methods, our method consumes much less time. The positive results indicate that the proposed method can be applied in automatic image analysis of multi-channel image screening data.

  7. GeneBreak: detection of recurrent DNA copy number aberration-associated chromosomal breakpoints within genes.

    PubMed

    van den Broek, Evert; van Lieshout, Stef; Rausch, Christian; Ylstra, Bauke; van de Wiel, Mark A; Meijer, Gerrit A; Fijneman, Remond J A; Abeln, Sanne

    2016-01-01

    Development of cancer is driven by somatic alterations, including numerical and structural chromosomal aberrations. Currently, several computational methods are available and are widely applied to detect numerical copy number aberrations (CNAs) of chromosomal segments in tumor genomes. However, there is lack of computational methods that systematically detect structural chromosomal aberrations by virtue of the genomic location of CNA-associated chromosomal breaks and identify genes that appear non-randomly affected by chromosomal breakpoints across (large) series of tumor samples. 'GeneBreak' is developed to systematically identify genes recurrently affected by the genomic location of chromosomal CNA-associated breaks by a genome-wide approach, which can be applied to DNA copy number data obtained by array-Comparative Genomic Hybridization (CGH) or by (low-pass) whole genome sequencing (WGS). First, 'GeneBreak' collects the genomic locations of chromosomal CNA-associated breaks that were previously pinpointed by the segmentation algorithm that was applied to obtain CNA profiles. Next, a tailored annotation approach for breakpoint-to-gene mapping is implemented. Finally, dedicated cohort-based statistics is incorporated with correction for covariates that influence the probability to be a breakpoint gene. In addition, multiple testing correction is integrated to reveal recurrent breakpoint events. This easy-to-use algorithm, 'GeneBreak', is implemented in R ( www.cran.r-project.org ) and is available from Bioconductor ( www.bioconductor.org/packages/release/bioc/html/GeneBreak.html ).

  8. Towards the delineation of the ancestral eutherian genome organization: comparative genome maps of human and the African elephant (Loxodonta africana) generated by chromosome painting.

    PubMed Central

    Frönicke, Lutz; Wienberg, Johannes; Stone, Gary; Adams, Lisa; Stanyon, Roscoe

    2003-01-01

    This study presents a whole-genome comparison of human and a representative of the Afrotherian clade, the African elephant, generated by reciprocal Zoo-FISH. An analysis of Afrotheria genomes is of special interest, because recent DNA sequence comparisons identify them as the oldest placental mammalian clade. Complete sets of whole-chromosome specific painting probes for the African elephant and human were constructed by degenerate oligonucleotide-primed PCR amplification of flow-sorted chromosomes. Comparative genome maps are presented based on their hybridization patterns. These maps show that the elephant has a moderately rearranged chromosome complement when compared to humans. The human paint probes identified 53 evolutionary conserved segments on the 27 autosomal elephant chromosomes and the X chromosome. Reciprocal experiments with elephant probes delineated 68 conserved segments in the human genome. The comparison with a recent aardvark and elephant Zoo-FISH study delineates new chromosomal traits which link the two Afrotherian species phylogenetically. In the absence of any morphological evidence the chromosome painting data offer the first non-DNA sequence support for an Afrotherian clade. The comparative human and elephant genome maps provide new insights into the karyotype organization of the proto-afrotherian, the ancestor of extant placental mammals, which most probably consisted of 2n=46 chromosomes. PMID:12965023

  9. Underlying mechanisms for syntrophic metabolism of essential enzyme cofactors in microbial communities

    PubMed Central

    Romine, Margaret F; Rodionov, Dmitry A; Maezato, Yukari; Osterman, Andrei L; Nelson, William C

    2017-01-01

    Many microorganisms are unable to synthesize essential B vitamin-related enzyme cofactors de novo. The underlying mechanisms by which such microbes survive in multi-species communities are largely unknown. We previously reported the near-complete genome sequence of two ~18-member unicyanobacterial microbial consortia that maintain stable membership on defined medium lacking vitamins. Here we have used genome analysis and growth studies on isolates derived from the consortia to reconstruct pathways for biogenesis of eight essential cofactors and predict cofactor usage and precursor exchange in these communities. Our analyses revealed that all but the two Halomonas and cyanobacterial community members were auxotrophic for at least one cofactor. We also observed a mosaic distribution of salvage routes for a variety of cofactor precursors, including those produced by photolysis. Potentially bidirectional transporters were observed to be preferentially in prototrophs, suggesting a mechanism for controlled precursor release. Furthermore, we found that Halomonas sp. do not require cobalamin nor control its synthesis, supporting the hypothesis that they overproduce and export vitamins. Collectively, these observations suggest that the consortia rely on syntrophic metabolism of cofactors as a survival strategy for optimization of metabolic exchange within a shared pool of micronutrients. PMID:28186498

  10. Russian Doll Genes and Complex Chromosome Rearrangements in Oxytricha trifallax

    PubMed Central

    Braun, Jasper; Nabergall, Lukas; Neme, Rafik; Landweber, Laura F.; Saito, Masahico; Jonoska, Nataša

    2018-01-01

    Ciliates have two different types of nuclei per cell, with one acting as a somatic, transcriptionally active nucleus (macronucleus; abbr. MAC) and another serving as a germline nucleus (micronucleus; abbr. MIC). Furthermore, Oxytricha trifallax undergoes extensive genome rearrangements during sexual conjugation and post-zygotic development of daughter cells. These rearrangements are necessary because the precursor MIC loci are often both fragmented and scrambled, with respect to the corresponding MAC loci. Such genome architectures are remarkably tolerant of encrypted MIC loci, because RNA-guided processes during MAC development reorganize the gene fragments in the correct order to resemble the parental MAC sequence. Here, we describe the germline organization of several nested and highly scrambled genes in Oxytricha trifallax. These include cases with multiple layers of nesting, plus highly interleaved or tangled precursor loci that appear to deviate from previously described patterns. We present mathematical methods to measure the degree of nesting between precursor MIC loci, and revisit a method for a mathematical description of scrambling. After applying these methods to the chromosome rearrangement maps of O. trifallax we describe cases of nested arrangements with up to five layers of embedded genes, as well as the most scrambled loci in O. trifallax. PMID:29545465

  11. Detection of Low Temperature Volcanogenic Thermal Anomalies with ASTER

    NASA Astrophysics Data System (ADS)

    Pieri, D. C.; Baxter, S.

    2009-12-01

    Predicting volcanic eruptions is a thorny problem, as volcanoes typically exhibit idiosyncratic waxing and/or waning pre-eruption emission, geodetic, and seismic behavior. It is no surprise that increasing our accuracy and precision in eruption prediction depends on assessing the time-progressions of all relevant precursor geophysical, geochemical, and geological phenomena, and on more frequently observing volcanoes when they become restless. The ASTER instrument on the NASA Terra Earth Observing System satellite in low earth orbit provides important capabilities in the area of detection of volcanogenic anomalies such as thermal precursors and increased passive gas emissions. Its unique high spatial resolution multi-spectral thermal IR imaging data (90m/pixel; 5 bands in the 8-12um region), bore-sighted with visible and near-IR imaging data, and combined with off-nadir pointing and stereo-photogrammetric capabilities make ASTER a potentially important volcanic precursor detection tool. We are utilizing the JPL ASTER Volcano Archive (http://ava.jpl.nasa.gov) to systematically examine 80,000+ ASTER volcano images to analyze (a) thermal emission baseline behavior for over 1500 volcanoes worldwide, (b) the form and magnitude of time-dependent thermal emission variability for these volcanoes, and (c) the spatio-temporal limits of detection of pre-eruption temporal changes in thermal emission in the context of eruption precursor behavior. We are creating and analyzing a catalog of the magnitude, frequency, and distribution of volcano thermal signatures worldwide as observed from ASTER since 2000 at 90m/pixel. Of particular interest as eruption precursors are small low contrast thermal anomalies of low apparent absolute temperature (e.g., melt-water lakes, fumaroles, geysers, grossly sub-pixel hotspots), for which the signal-to-noise ratio may be marginal (e.g., scene confusion due to clouds, water and water vapor, fumarolic emissions, variegated ground emissivity, and their combinations). To systematically detect such intrinsically difficult anomalies within our large archive, we are exploring a four step approach: (a) the recursive application of a GPU-accelerated, edge-preserving bilateral filter prepares a thermal image by removing noise and fine detail; (b) the resulting stylized filtered image is segmented by a path-independent region-growing algorithm, (c) the resulting segments are fused based on thermal affinity, and (d) fused segments are subjected to thermal and geographical tests for hotspot detection and classification, to eliminate false alarms or non-volcanogenic anomalies. We will discuss our progress in creating the general thermal anomaly catalog as well as algorithm approach and results. This work was carried out at the Jet Propulsion Laboratory of the California Institute of Technology under contract to NASA.

  12. Accumulation of point mutations and reassortment of genomic RNA segments are involved in the microevolution of Puumala hantavirus in a bank vole (Myodes glareolus) population.

    PubMed

    Razzauti, Maria; Plyusnina, Angelina; Henttonen, Heikki; Plyusnin, Alexander

    2008-07-01

    The genetic diversity of Puumala hantavirus (PUUV) was studied in a local population of its natural host, the bank vole (Myodes glareolus). The trapping area (2.5 x 2.5 km) at Konnevesi, Central Finland, included 14 trapping sites, at least 500 m apart; altogether, 147 voles were captured during May and October 2005. Partial sequences of the S, M and L viral genome segments were recovered from 40 animals. Seven, 12 and 17 variants were detected for the S, M and L sequences, respectively; these represent new wild-type PUUV strains that belong to the Finnish genetic lineage. The genetic diversity of PUUV strains from Konnevesi was 0.2-4.9 % for the S segment, 0.2-4.8 % for the M segment and 0.2-9.7 % for the L segment. Most nucleotide substitutions were synonymous and most deduced amino acid substitutions were conservative, probably due to strong stabilizing selection operating at the protein level. Based on both sequence markers and phylogenetic clustering, the S, M and L sequences could be assigned to two groups, 'A' and 'B'. Notably, not all bank voles carried S, M and L sequences belonging to the same group, i.e. S(A)M(A)L(A) or S(B)M(B)L(B). A substantial proportion (8/40, 20 %) of the newly characterized PUUV strains possessed reassortant genomes such as S(B)M(A)L(A), S(A)M(B)L(B) or S(B)M(A)L(B). These results suggest that at least some of the PUUV reassortants are viable and can survive in the presence of their parental strains.

  13. Identification of a Recurrent Microdeletion at 17q23.1q23.2 Flanked by Segmental Duplications Associated with Heart Defects and Limb Abnormalities

    PubMed Central

    Ballif, Blake C.; Theisen, Aaron; Rosenfeld, Jill A.; Traylor, Ryan N.; Gastier-Foster, Julie; Thrush, Devon Lamb; Astbury, Caroline; Bartholomew, Dennis; McBride, Kim L.; Pyatt, Robert E.; Shane, Kate; Smith, Wendy E.; Banks, Valerie; Gallentine, William B.; Brock, Pamela; Rudd, M. Katharine; Adam, Margaret P.; Keene, Julia A.; Phillips, John A.; Pfotenhauer, Jean P.; Gowans, Gordon C.; Stankiewicz, Pawel; Bejjani, Bassem A.; Shaffer, Lisa G.

    2010-01-01

    Segmental duplications, which comprise ∼5%–10% of the human genome, are known to mediate medically relevant deletions, duplications, and inversions through nonallelic homologous recombination (NAHR) and have been suggested to be hot spots in chromosome evolution and human genomic instability. We report seven individuals with microdeletions at 17q23.1q23.2, identified by microarray-based comparative genomic hybridization (aCGH). Six of the seven deletions are ∼2.2 Mb in size and flanked by large segmental duplications of >98% sequence identity and in the same orientation. One of the deletions is ∼2.8 Mb in size and is flanked on the distal side by a segmental duplication, whereas the proximal breakpoint falls between segmental duplications. These characteristics suggest that NAHR mediated six out of seven of these rearrangements. These individuals have common features, including mild to moderate developmental delay (particularly speech delay), microcephaly, postnatal growth retardation, heart defects, and hand, foot, and limb abnormalities. Although all individuals had at least mild dysmorphic facial features, there was no characteristic constellation of features that would elicit clinical suspicion of a specific disorder. The identification of common clinical features suggests that microdeletions at 17q23.1q23.2 constitute a novel syndrome. Furthermore, the inclusion in the minimal deletion region of TBX2 and TBX4, transcription factors belonging to a family of genes implicated in a variety of developmental pathways including those of heart and limb, suggests that these genes may play an important role in the phenotype of this emerging syndrome. PMID:20206336

  14. Complex alternative splicing of acetylcholinesterase transcripts in Torpedo electric organ; primary structure of the precursor of the glycolipid-anchored dimeric form.

    PubMed Central

    Sikorav, J L; Duval, N; Anselmet, A; Bon, S; Krejci, E; Legay, C; Osterlund, M; Reimund, B; Massoulié, J

    1988-01-01

    In this paper, we show the existence of alternative splicing in the 3' region of the coding sequence of Torpedo acetylcholinesterase (AChE). We describe two cDNA structures which both diverge from the previously described coding sequence of the catalytic subunit of asymmetric (A) forms (Schumacher et al., 1986; Sikorav et al., 1987). They both contain a coding sequence followed by a non-coding sequence and a poly(A) stretch. Both of these structures were shown to exist in poly(A)+ RNAs, by S1 mapping experiments. The divergent region encoded by the first sequence corresponds to the precursor of the globular dimeric form (G2a), since it contains the expected C-terminal amino acids, Ala-Cys. These amino acids are followed by a 29 amino acid extension which contains a hydrophobic segment and must be replaced by a glycolipid in the mature protein. Analyses of intact G2a AChE showed that the common domain of the protein contains intersubunit disulphide bonds. The divergent region of the second type of cDNA consists of an adjacent genomic sequence, which is removed as an intron in A and Ga mRNAs, but may encode a distinct, less abundant catalytic subunit. The structures of the cDNA clones indicate that they are derived from minor mRNAs, shorter than the three major transcripts which have been described previously (14.5, 10.5 and 5.5 kb). Oligonucleotide probes specific for the asymmetric and globular terminal regions hybridize with the three major transcripts, indicating that their size is determined by 3'-untranslated regions which are not related to the differential splicing leading to A and Ga forms. Images PMID:3181125

  15. Transcriptome sequencing of Atlantic salmon (Salmo salar L.) notochord prior to development of the vertebrae provides clues to regulation of positional fate, chordoblast lineage and mineralisation.

    PubMed

    Wang, Shou; Furmanek, Tomasz; Kryvi, Harald; Krossøy, Christel; Totland, Geir K; Grotmol, Sindre; Wargelius, Anna

    2014-02-19

    In teleosts such as Atlantic salmon (Salmo salar L.), segmentation and subsequent mineralisation of the notochord during embryonic stages are essential for normal vertebrae formation. However, the molecular mechanisms leading to segmentation and mineralisation of the notochord are poorly understood. The aim of this study was to identify genes/pathways acting in gradients over time and along the anterior-posterior axis during notochord segmentation and immediately prior to mineralisation of the vertebral bodies in Atlantic salmon. Notochord samples were collected from unsegmented, pre-segmented and segmented developmental stages. In each stage, the cellular core of the notochord was cut into three pieces along the longitudinal axis (anterior, mid, posterior). RNA was sequenced (22 million pair-end 100 bp/ library) and mapped to the salmon genome. 66569 transcripts were predicted and 55775 were annotated. In order to identify possible gradients leading to segmentation of the notochord, all 71 notochord-expressed hox genes were investigated, most of them displaying a typical anterior-posterior expression pattern along the notochord axis. The clustering of hox genes revealed a pattern that could be related to notochord segmentation. We further investigated how mineralisation is initiated in the notochord, and several factors related to chondrogenic lineage were identified (sox9, sox5, sox6, tgfb3, ihhb and col2a1), suggesting a cartilage-like character of the notochord. KEGG analysis of differentially expressed genes between stages revealed down-regulation of pathways associated with ECM, cell division, metabolism and development at onset of notochord segmentation. This implies that inhibitory signals produce segmentation of the notochord. One such potential inhibitory signal was identified, col11a2, which was detected in segments of non-mineralising notochord. An incomplete salmon genome was successfully used to analyse RNA-seq data from the cellular core of the Atlantic salmon notochord. In transcriptome we found; hox gene patterns possibly linked to segmentation; down-regulation of pathways in the notochord at onset of segmentation; segmented expression of col11a2 in non-mineralised segments of the notochord; and a chondroblast-like footprint in the notochord.

  16. Complete Genomic Sequence and Comparative Analysis of the Genome Segments of Sweet Potato Chlorotic Stunt Virus in China

    PubMed Central

    Qin, Yanhong; Wang, Li; Zhang, Zhenchen; Qiao, Qi; Zhang, Desheng; Tian, Yuting; Wang, Shuang; Wang, Yongjiang; Yan, Zhaoling

    2014-01-01

    Background Sweet potato chlorotic stunt virus (family Closteroviridae, genus Crinivirus) features a large bipartite, single-stranded, positive-sense RNA genome. To date, only three complete genomic sequences of SPCSV can be accessed through GenBank. SPCSV was first detected from China in 2011, only partial genomic sequences have been determined in the country. No report on the complete genomic sequence and genome structure of Chinese SPCSV isolates or the genetic relation between isolates from China and other countries is available. Methodology/Principal Findings The complete genomic sequences of five isolates from different areas in China were characterized. This study is the first to report the complete genome sequences of SPCSV from whitefly vectors. Genome structure analysis showed that isolates of WA and EA strains from China have the same coding protein as isolates Can181-9 and m2-47, respectively. Twenty cp genes and four RNA1 partial segments were sequenced and analyzed, and the nucleotide identities of complete genomic, cp, and RNA1 partial sequences were determined. Results indicated high conservation among strains and significant differences between WA and EA strains. Genetic analysis demonstrated that, except for isolates from Guangdong Province, SPCSVs from other areas belong to the WA strain. Genome organization analysis showed that the isolates in this study lack the p22 gene. Conclusions/Significance We presented the complete genome sequences of SPCSV in China. Comparison of nucleotide identities and genome structures between these isolates and previously reported isolates showed slight differences. The nucleotide identities of different SPCSV isolates showed high conservation among strains and significant differences between strains. All nine isolates in this study lacked p22 gene. WA strains were more extensively distributed than EA strains in China. These data provide important insights into the molecular variation and genomic structure of SPCSV in China as well as genetic relationships among isolates from China and other countries. PMID:25170926

  17. A new cryptic virus belonging to the family Partitiviridae was found in watermelon co-infected with Melon necrotic spot virus.

    PubMed

    Sela, Noa; Lachman, Oded; Reingold, Victoria; Dombrovsky, Aviv

    2013-10-01

    A novel virus was detected in watermelon plants (Citrullus lanatus Thunb.) infected with Melon necrotic spot virus (MNSV) using SOLiD next-generation sequence analysis. In addition to the expected MSNV genome, two double-stranded RNA (dsRNA) segments of 1,312 and 1,118 bp were also identified and sequenced from the purified virus preparations. These two dsRNA segments encode two putative partitivirus-related proteins, an RNA-dependent RNA polymerase (RdRP) and a capsid protein, which were sequenced. Genomic-sequence analysis and analysis of phylogenetic relationships indicate that these two dsRNAs together make up the genome of a novel Partitivirus. This virus was found to be closely related to the Pepper cryptic virus 1 and Raphanus sativus cryptic virus. It is suggested that this novel virus putatively named Citrullus lanatus cryptic virus be considered as a new member of the family Partitiviridae.

  18. Long-range correlations and charge transport properties of DNA sequences

    NASA Astrophysics Data System (ADS)

    Liu, Xiao-liang; Ren, Yi; Xie, Qiong-tao; Deng, Chao-sheng; Xu, Hui

    2010-04-01

    By using Hurst's analysis and transfer approach, the rescaled range functions and Hurst exponents of human chromosome 22 and enterobacteria phage lambda DNA sequences are investigated and the transmission coefficients, Landauer resistances and Lyapunov coefficients of finite segments based on above genomic DNA sequences are calculated. In a comparison with quasiperiodic and random artificial DNA sequences, we find that λ-DNA exhibits anticorrelation behavior characterized by a Hurst exponent 0.5

  19. Piscine reovirus: Genomic and molecular phylogenetic analysis from farmed and wild salmonids collected on the Canada/US Pacific Coast

    USGS Publications Warehouse

    Siah, Ahmed; Morrison, Diane B.; Fringuelli, Elena; Savage, Paul S.; Richmond, Zina; Purcell, Maureen K.; Johns, Robert; Johnson, Stewart C.; Sakasida, Sonja M.

    2015-01-01

    Piscine reovirus (PRV) is a double stranded non-enveloped RNA virus detected in farmed and wild salmonids. This study examined the phylogenetic relationships among different PRV sequence types present in samples from salmonids in Western Canada and the US, including Alaska (US), British Columbia (Canada) and Washington State (US). Tissues testing positive for PRV were partially sequenced for segment S1, producing 71 sequences that grouped into 10 unique sequence types. Sequence analysis revealed no identifiable geographical or temporal variation among the sequence types. Identical sequence types were found in fish sampled in 2001, 2005 and 2014. In addition, PRV positive samples from fish derived from Alaska, British Columbia and Washington State share identical sequence types. Comparative analysis of the phylogenetic tree indicated that Canada/US Pacific Northwest sequences formed a subgroup with some Norwegian sequence types (group II), distinct from other Norwegian and Chilean sequences (groups I, III and IV). Representative PRV positive samples from farmed and wild fish in British Columbia and Washington State were subjected to genome sequencing using next generation sequencing methods. Individual analysis of each of the 10 partial segments indicated that the Canadian and US PRV sequence types clustered separately from available whole genome sequences of some Norwegian and Chilean sequences for all segments except the segment S4. In summary, PRV was genetically homogenous over a large geographic distance (Alaska to Washington State), and the sequence types were relatively stable over a 13 year period.

  20. Piscine Reovirus: Genomic and Molecular Phylogenetic Analysis from Farmed and Wild Salmonids Collected on the Canada/US Pacific Coast

    PubMed Central

    Siah, Ahmed; Morrison, Diane B.; Fringuelli, Elena; Savage, Paul; Richmond, Zina; Johns, Robert; Purcell, Maureen K.; Johnson, Stewart C.; Saksida, Sonja M.

    2015-01-01

    Piscine reovirus (PRV) is a double stranded non-enveloped RNA virus detected in farmed and wild salmonids. This study examined the phylogenetic relationships among different PRV sequence types present in samples from salmonids in Western Canada and the US, including Alaska (US), British Columbia (Canada) and Washington State (US). Tissues testing positive for PRV were partially sequenced for segment S1, producing 71 sequences that grouped into 10 unique sequence types. Sequence analysis revealed no identifiable geographical or temporal variation among the sequence types. Identical sequence types were found in fish sampled in 2001, 2005 and 2014. In addition, PRV positive samples from fish derived from Alaska, British Columbia and Washington State share identical sequence types. Comparative analysis of the phylogenetic tree indicated that Canada/US Pacific Northwest sequences formed a subgroup with some Norwegian sequence types (group II), distinct from other Norwegian and Chilean sequences (groups I, III and IV). Representative PRV positive samples from farmed and wild fish in British Columbia and Washington State were subjected to genome sequencing using next generation sequencing methods. Individual analysis of each of the 10 partial segments indicated that the Canadian and US PRV sequence types clustered separately from available whole genome sequences of some Norwegian and Chilean sequences for all segments except the segment S4. In summary, PRV was genetically homogenous over a large geographic distance (Alaska to Washington State), and the sequence types were relatively stable over a 13 year period. PMID:26536673

  1. Characterization of a Novel Orthomyxo-like Virus Causing Mass Die-Offs of Tilapia.

    PubMed

    Bacharach, Eran; Mishra, Nischay; Briese, Thomas; Zody, Michael C; Kembou Tsofack, Japhette Esther; Zamostiano, Rachel; Berkowitz, Asaf; Ng, James; Nitido, Adam; Corvelo, André; Toussaint, Nora C; Abel Nielsen, Sandra Cathrine; Hornig, Mady; Del Pozo, Jorge; Bloom, Toby; Ferguson, Hugh; Eldar, Avi; Lipkin, W Ian

    2016-04-05

    Tilapia are an important global food source due to their omnivorous diet, tolerance for high-density aquaculture, and relative disease resistance. Since 2009, tilapia aquaculture has been threatened by mass die-offs in farmed fish in Israel and Ecuador. Here we report evidence implicating a novel orthomyxo-like virus in these outbreaks. The tilapia lake virus (TiLV) has a 10-segment, negative-sense RNA genome. The largest segment, segment 1, contains an open reading frame with weak sequence homology to the influenza C virus PB1 subunit. The other nine segments showed no homology to other viruses but have conserved, complementary sequences at their 5' and 3' termini, consistent with the genome organization found in other orthomyxoviruses. In situ hybridization indicates TiLV replication and transcription at sites of pathology in the liver and central nervous system of tilapia with disease. The economic impact of worldwide trade in tilapia is estimated at $7.5 billion U.S. dollars (USD) annually. The infectious agent implicated in mass tilapia die-offs in two continents poses a threat to the global tilapia industry, which not only provides inexpensive dietary protein but also is a major employer in the developing world. Here we report characterization of the causative agent as a novel orthomyxo-like virus, tilapia lake virus (TiLV). We also describe complete genomic and protein sequences that will facilitate TiLV detection and containment and enable vaccine development. Copyright © 2016 Bacharach et al.

  2. Complete genomic sequence of an infectious pancreatic necrosis virus isolated from rainbow trout (Oncorhynchus mykiss) in China.

    PubMed

    Ji, Feng; Zhao, Jing-Zhuang; Liu, Miao; Lu, Tong-Yan; Liu, Hong-Bai; Yin, Jiasheng; Xu, Li-Ming

    2017-04-01

    Infectious pancreatic necrosis (IPN) is a significant disease of farmed salmonids resulting in direct economic losses due to high mortality in China. However, no gene sequence of any Chinese infectious pancreatic necrosis virus (IPNV) isolates was available. In the study, moribund rainbow trout fry samples were collected during an outbreak of IPN in Yunnan province of southwest China in 2013. An IPNV was isolated and tentatively named ChRtm213. We determined the full genome sequence of the IPNV ChRtm213 and compared it with previously identified IPNV sequences worldwide. The sequences of different structural and non-structural protein genes were compared to those of other aquatic birnaviruses sequenced to date. The results indicated that the complete genome sequence of ChRtm213 strain contains a segment A (3099 nucleotides) coding a polyprotein VP2-VP4-VP3, and a segment B (2789 nucleotides) coding a RNA-dependent RNA polymerase VP1. The phylogenetic analyses showed that ChRtm213 strain fell within genogroup 1, serotype A9 (Jasper), having similarities of 96.3% (segment A) and 97.3% (segment B) with the IPNV strain AM98 from Japan. The results suggest that the Chinese IPNV isolate has relative closer relationship with Japanese IPNV strains. The sequence of ChRtm213 was the first gene sequence of IPNV isolates in China. This study provided a robust reference for diagnosis and/or control of IPNV prevalent in China.

  3. Complete genome sequence of Streptomyces venezuelae ATCC 15439, a promising cell factory for production of secondary metabolites.

    PubMed

    Song, Ju Yeon; Yoo, Young Ji; Lim, Si-Kyu; Cha, Sun Ho; Kim, Ji-Eun; Roe, Jung-Hye; Kim, Jihyun F; Yoon, Yeo Joon

    2016-02-10

    Streptomyces venezuelae ATCC 15439, which produces 12- and 14-membered ring macrolide antibiotics, is a platform strain for heterologous expression of secondary metabolites. Its 9.05-Mb genome sequence revealed an abundance of genes involved in the biosynthesis of secondary metabolites and their precursors, which should be useful for the production of bioactive compounds. Copyright © 2015 Elsevier B.V. All rights reserved.

  4. Drosophila Hox and Sex-Determination Genes Control Segment Elimination through EGFR and extramacrochetae Activity

    PubMed Central

    Foronda, David; Martín, Paloma; Sánchez-Herrero, Ernesto

    2012-01-01

    The formation or suppression of particular structures is a major change occurring in development and evolution. One example of such change is the absence of the seventh abdominal segment (A7) in Drosophila males. We show here that there is a down-regulation of EGFR activity and fewer histoblasts in the male A7 in early pupae. If this activity is elevated, cell number increases and a small segment develops in the adult. At later pupal stages, the remaining precursors of the A7 are extruded under the epithelium. This extrusion requires the up-regulation of the HLH protein Extramacrochetae and correlates with high levels of spaghetti-squash, the gene encoding the regulatory light chain of the non-muscle myosin II. The Hox gene Abdominal-B controls both the down-regulation of spitz, a ligand of the EGFR pathway, and the up-regulation of extramacrochetae, and also regulates the transcription of the sex-determining gene doublesex. The male Doublesex protein, in turn, controls extramacrochetae and spaghetti-squash expression. In females, the EGFR pathway is also down-regulated in the A7 but extramacrochetae and spaghetti-squash are not up-regulated and extrusion of precursor cells is almost absent. Our results show the complex orchestration of cellular and genetic events that lead to this important sexually dimorphic character change. PMID:22912593

  5. miRNAFold: a web server for fast miRNA precursor prediction in genomes.

    PubMed

    Tav, Christophe; Tempel, Sébastien; Poligny, Laurent; Tahi, Fariza

    2016-07-08

    Computational methods are required for prediction of non-coding RNAs (ncRNAs), which are involved in many biological processes, especially at post-transcriptional level. Among these ncRNAs, miRNAs have been largely studied and biologists need efficient and fast tools for their identification. In particular, ab initio methods are usually required when predicting novel miRNAs. Here we present a web server dedicated for miRNA precursors identification at a large scale in genomes. It is based on an algorithm called miRNAFold that allows predicting miRNA hairpin structures quickly with high sensitivity. miRNAFold is implemented as a web server with an intuitive and user-friendly interface, as well as a standalone version. The web server is freely available at: http://EvryRNA.ibisc.univ-evry.fr/miRNAFold. © The Author(s) 2016. Published by Oxford University Press on behalf of Nucleic Acids Research.

  6. Sequence Segmentation with changeptGUI.

    PubMed

    Tasker, Edward; Keith, Jonathan M

    2017-01-01

    Many biological sequences have a segmental structure that can provide valuable clues to their content, structure, and function. The program changept is a tool for investigating the segmental structure of a sequence, and can also be applied to multiple sequences in parallel to identify a common segmental structure, thus providing a method for integrating multiple data types to identify functional elements in genomes. In the previous edition of this book, a command line interface for changept is described. Here we present a graphical user interface for this package, called changeptGUI. This interface also includes tools for pre- and post-processing of data and results to facilitate investigation of the number and characteristics of segment classes.

  7. [Comparative chromosome painting shows the red panda (Ailurus fulgens) has a highly conserved karyotype].

    PubMed

    Tian, Ying; Nie, Wen-Hui; Wang, Jin-Huan; Yang, Yun-Fei; Yang, Feng-Tang

    2002-02-01

    We have established a comparative chromosome map between red panda (Ailurus fulgens, 2n = 36) and dog by chromosome painting with biotin-labelled chromosome-specific probes of the dog. Dog probes specific for the 38 automates delineated 71 homologous segments in the metaphase chromosomes of red panda. Of the 38 autosomal paints, 18 probes each delineated one homologous segment in red panda genome, while the other 20 ones each detected two to five homologous segments. The dog X chromosome-specific paint delineated the whole X chromosome of the red panda. The results indicate that at least 28 fissions (breaks), 49 fusions and 4 inversions were needed to "convert" the dog karyotype to that of the red panda, suggesting that extensive chromosome rearrangements differentiate the karyotypes of red panda and dog. Based on the established comparative chromosome homologies of dog and domestic cat, we could infer that there were 26 segments of conserved synteny between red panda and domestic cat. Comparative analysis of the distribution patterns of conserved segments defined by dog paints in red panda and domestic cat genomes revealed at least 2 cryptic inversions in two large chromosomal regions of conserved synteny between red panda and domestic cat. The karyotype of red panda shows high degree of homology with that of domestic cat.

  8. Discovery of a widely distributed toxin biosynthetic gene cluster

    PubMed Central

    Lee, Shaun W.; Mitchell, Douglas A.; Markley, Andrew L.; Hensler, Mary E.; Gonzalez, David; Wohlrab, Aaron; Dorrestein, Pieter C.; Nizet, Victor; Dixon, Jack E.

    2008-01-01

    Bacteriocins represent a large family of ribosomally produced peptide antibiotics. Here we describe the discovery of a widely conserved biosynthetic gene cluster for the synthesis of thiazole and oxazole heterocycles on ribosomally produced peptides. These clusters encode a toxin precursor and all necessary proteins for toxin maturation and export. Using the toxin precursor peptide and heterocycle-forming synthetase proteins from the human pathogen Streptococcus pyogenes, we demonstrate the in vitro reconstitution of streptolysin S activity. We provide evidence that the synthetase enzymes, as predicted from our bioinformatics analysis, introduce heterocycles onto precursor peptides, thereby providing molecular insight into the chemical structure of streptolysin S. Furthermore, our studies reveal that the synthetase exhibits relaxed substrate specificity and modifies toxin precursors from both related and distant species. Given our findings, it is likely that the discovery of similar peptidic toxins will rapidly expand to existing and emerging genomes. PMID:18375757

  9. Identification of Novel Genomic Islands in Liverpool Epidemic Strain of Pseudomonas aeruginosa Using Segmentation and Clustering

    PubMed Central

    Jani, Mehul; Mathee, Kalai; Azad, Rajeev K.

    2016-01-01

    Pseudomonas aeruginosa is an opportunistic pathogen implicated in a myriad of infections and a leading pathogen responsible for mortality in patients with cystic fibrosis (CF). Horizontal transfers of genes among the microorganisms living within CF patients have led to highly virulent and multi-drug resistant strains such as the Liverpool epidemic strain of P. aeruginosa, namely the LESB58 strain that has the propensity to acquire virulence and antibiotic resistance genes. Often these genes are acquired in large clusters, referred to as “genomic islands (GIs).” To decipher GIs and understand their contributions to the evolution of virulence and antibiotic resistance in P. aeruginosa LESB58, we utilized a recursive segmentation and clustering procedure, presented here as a genome-mining tool, “GEMINI.” GEMINI was validated on experimentally verified islands in the LESB58 strain before examining its potential to decipher novel islands. Of the 6062 genes in P. aeruginosa LESB58, 596 genes were identified to be resident on 20 GIs of which 12 have not been previously reported. Comparative genomics provided evidence in support of our novel predictions. Furthermore, GEMINI unraveled the mosaic structure of islands that are composed of segments of likely different evolutionary origins, and demonstrated its ability to identify potential strain biomarkers. These newly found islands likely have contributed to the hyper-virulence and multidrug resistance of the Liverpool epidemic strain of P. aeruginosa. PMID:27536294

  10. Genomic segments RNA1 and RNA2 of Prunus necrotic ringspot virus codetermine viral pathogenicity to adapt to alternating natural Prunus hosts.

    PubMed

    Cui, Hongguang; Hong, Ni; Wang, Guoping; Wang, Aiming

    2013-05-01

    Prunus necrotic ringspot virus (PNRSV) affects Prunus fruit production worldwide. To date, numerous PNRSV isolates with diverse pathological properties have been documented. To study the pathogenicity of PNRSV, which directly or indirectly determines the economic losses of infected fruit trees, we have recently sequenced the complete genome of peach isolate Pch12 and cherry isolate Chr3, belonging to the pathogenically aggressive PV32 group and mild PV96 group, respectively. Here, we constructed the Chr3- and Pch12-derived full-length cDNA clones that were infectious in the experimental host cucumber and their respective natural Prunus hosts. Pch12-derived clones induced much more severe symptoms than Chr3 in cucumber, and the pathogenicity discrepancy between Chr3 and Pch12 was associated with virus accumulation. By reassortment of genomic segments, swapping of partial genomic segments, and site-directed mutagenesis, we identified the 3' terminal nucleotide sequence (1C region) in RNA1 and amino acid K at residue 279 in RNA2-encoded P2 as the severe virulence determinants in Pch12. Gain-of-function experiments demonstrated that both the 1C region and K279 of Pch12 were required for severe virulence and high levels of viral accumulation. Our results suggest that PNRSV RNA1 and RNA2 codetermine viral pathogenicity to adapt to alternating natural Prunus hosts, likely through mediating viral accumulation.

  11. Rift Valley fever virus incorporates the 78kDa glycoprotein into virions matured in C6/36 2 mosquito cells

    USDA-ARS?s Scientific Manuscript database

    Rift Valley fever virus (RVFV), genus Phlebovirus, family Bunyaviridae is a zoonotic arthropod-borne virus able to transition between distant host species, causing potentially severe disease in humans and ruminants. Viral proteins are encoded by three genomic segments, with the medium M segment codi...

  12. Pooled-BAC sequencing of a black pod resistance region (cBPQTL12) in T. cacao

    USDA-ARS?s Scientific Manuscript database

    Whole genome sequencing (WGS) is an expensive and technically challenging endeavor. An alternative to WGS is to sequence specific chromosomal segments of biological interest (e.g. a QTL interval). This method is cheaper than WGS and reduces the risk of misassembly from distal parts of the genome. Us...

  13. Genome Wide DNA Methylation Profiles Provide Clues to the Origin and Pathogenesis of Germ Cell Tumors

    PubMed Central

    Rijlaarsdam, Martin A.; Tax, David M. J.; Gillis, Ad J. M.; Dorssers, Lambert C. J.; Koestler, Devin C.; de Ridder, Jeroen; Looijenga, Leendert H. J.

    2015-01-01

    The cell of origin of the five subtypes (I-V) of germ cell tumors (GCTs) are assumed to be germ cells from different maturation stages. This is (potentially) reflected in their methylation status as fetal maturing primordial germ cells are globally demethylated during migration from the yolk sac to the gonad. Imprinted regions are erased in the gonad and later become uniparentally imprinted according to fetal sex. Here, 91 GCTs (type I-IV) and four cell lines were profiled (Illumina’s HumanMethylation450BeadChip). Data was pre-processed controlling for cross hybridization, SNPs, detection rate, probe-type bias and batch effects. The annotation was extended, covering snRNAs/microRNAs, repeat elements and imprinted regions. A Hidden Markov Model-based genome segmentation was devised to identify differentially methylated genomic regions. Methylation profiles allowed for separation of clusters of non-seminomas (type II), seminomas/dysgerminomas (type II), spermatocytic seminomas (type III) and teratomas/dermoid cysts (type I/IV). The seminomas, dysgerminomas and spermatocytic seminomas were globally hypomethylated, in line with previous reports and their demethylated precursor. Differential methylation and imprinting status between subtypes reflected their presumed cell of origin. Ovarian type I teratomas and dermoid cysts showed (partial) sex specific uniparental maternal imprinting. The spermatocytic seminomas showed uniparental paternal imprinting while testicular teratomas exhibited partial imprinting erasure. Somatic imprinting in type II GCTs might indicate a cell of origin after global demethylation but before imprinting erasure. This is earlier than previously described, but agrees with the totipotent/embryonic stem cell like potential of type II GCTs and their rare extra-gonadal localization. The results support the common origin of the type I teratomas and show strong similarity between ovarian type I teratomas and dermoid cysts. In conclusion, we identified specific and global methylation differences between GCT subtypes, providing insight into their developmental timing and underlying developmental biology. Data and extended annotation are deposited at GEO (GSE58538 and GPL18809). PMID:25859847

  14. Perilobar nephrogenic rests are non-obligate molecular genetic precursor lesions of IGF2-associated Wilms tumours

    PubMed Central

    Vuononvirta, Raisa; Sebire, Neil J.; Dallosso, Anthony R.; Reis-Filho, Jorge S.; Williams, Richard D.; Mackay, Alan; Fenwick, Kerry; Grigoriadis, Anita; Ashworth, Alan; Pritchard-Jones, Kathy; Brown, Keith W.; Vujanic, Gordan M.; Jones, Chris

    2009-01-01

    Purpose: Perilobar nephrogenic rests (PLNRs) are abnormally persistent foci of embryonal immature blastema that have been associated with dysregulation at the 11p15 locus by genetic/epigenetic means, and are thought to be precursor lesions of Wilms tumour. The precise genomic events are, however, largely unknown. Experimental Design: We used arrayCGH to analyse a series of 50 PLNRs and 25 corresponding Wilms tumours characterised for 11p15 genetic/epigenetic alterations and IGF2 expression. Results: The genomic profiles of PLNRs could be subdivided into three categories: those with no copy number changes (22/50, 44%), those with single, whole chromosome alterations (8/50, 16%), and those with multiple gains/losses (20/50, 40%). The most frequent aberrations included 1p- (7/50, 14%) +18 (6/50, 12%), +13 (5/50, 10%) and +12 (3/50, 6%). For the majority (19/25, 76%) of cases, the rest harboured a subset of the copy number changes in the associated Wilms tumour. We identified a temporal order of genomic changes which occur during the IGF2/PLNR pathway of Wilms tumorigenesis, with large scale chromosomal alterations such as 1p-, +12, +13 and +18 regarded as ‘early’ events. In some of the cases (24%), the PLNRs harboured large-scale copy number changes not observed in the concurrent Wilms tumour, including +10p, +14q and +18. Conclusions: These data suggest that although the evidence for PLNRs as precursors is compelling, not all lesions must necessarily undergo malignant transformation. PMID:19047088

  15. Neuropeptides encoded by the genomes of the Akoya pearl oyster Pinctata fucata and Pacific oyster Crassostrea gigas: a bioinformatic and peptidomic survey.

    PubMed

    Stewart, Michael J; Favrel, Pascal; Rotgans, Bronwyn A; Wang, Tianfang; Zhao, Min; Sohail, Manzar; O'Connor, Wayne A; Elizur, Abigail; Henry, Joel; Cummins, Scott F

    2014-10-02

    Oysters impart significant socio-ecological benefits from primary production of food supply, to estuarine ecosystems via reduction of water column nutrients, plankton and seston biomass. Little though is known at the molecular level of what genes are responsible for how oysters reproduce, filter nutrients, survive stressful physiological events and form reef communities. Neuropeptides represent a diverse class of chemical messengers, instrumental in orchestrating these complex physiological events in other species. By a combination of in silico data mining and peptide analysis of ganglia, 74 putative neuropeptide genes were identified from genome and transcriptome databases of the Akoya pearl oyster, Pinctata fucata and the Pacific oyster, Crassostrea gigas, encoding precursors for over 300 predicted bioactive peptide products, including three newly identified neuropeptide precursors PFGx8amide, RxIamide and Wx3Yamide. Our findings also include a gene for the gonadotropin-releasing hormone (GnRH) and two egg-laying hormones (ELH) which were identified from both oysters. Multiple sequence alignments and phylogenetic analysis supports similar global organization of these mature peptides. Computer-based peptide modeling of the molecular tertiary structures of ELH highlights the structural homologies within ELH family, which may facilitate ELH activity leading to the release of gametes. Our analysis demonstrates that oysters possess conserved molluscan neuropeptide domains and overall precursor organization whilst highlighting many previously unrecognized bivalve idiosyncrasies. This genomic analysis provides a solid foundation from which further studies aimed at the functional characterization of these molluscan neuropeptides can be conducted to further stimulate advances in understanding the ecology and cultivation of oysters.

  16. Genomic inbreeding estimation in small populations: evaluation of runs of homozygosity in three local dairy cattle breeds.

    PubMed

    Mastrangelo, S; Tolone, M; Di Gerlando, R; Fontanesi, L; Sardina, M T; Portolano, B

    2016-05-01

    In the local breeds with small population size, one of the most important problems is the increase of inbreeding coefficient (F). High levels of inbreeding lead to reduced genetic diversity and inbreeding depression. The availability of high-density single nucleotide polymorphism (SNP) arrays has facilitated the quantification of F by genomic markers in farm animals. Runs of homozygosity (ROH) are contiguous lengths of homozygous genotypes and represent an estimate of the degree of autozygosity at genome-wide level. The current study aims to quantify the genomic F derived from ROH (F ROH) in three local dairy cattle breeds. F ROH values were compared with F estimated from the genomic relationship matrix (F GRM), based on the difference between observed v. expected number of homozygous genotypes (F HOM) and the genomic homozygosity of individual i (F MOL i ). The molecular coancestry coefficient (f MOL ij ) between individuals i and j was also estimated. Individuals of Cinisara (71), Modicana (72) and Reggiana (168) were genotyped with the 50K v2 Illumina BeadChip. Genotypes from 96 animals of Italian Holstein cattle breed were also included in the analysis. We used a definition of ROH as tracts of homozygous genotypes that were >4 Mb. Among breeds, 3661 ROH were identified. Modicana showed the highest mean number of ROH per individual and the highest value of F ROH, whereas Reggiana showed the lowest ones. Differences among breeds existed for the ROH lengths. The individuals of Italian Holstein showed high number of short ROH segments, related to ancient consanguinity. Similar results showed the Reggiana with some extreme animals with segments covering 400 Mb and more of genome. Modicana and Cinisara showed similar results between them with the total length of ROH characterized by the presence of large segments. High correlation was found between F HOM and F ROH ranged from 0.83 in Reggiana to 0.95 in Cinisara and Modicana. The correlations among F ROH and other estimated F coefficients were generally lower ranged from 0.45 (F MOL i -F ROH) in Cinisara to 0.17 (F GRM-F ROH) in Modicana. On the basis of our results, recent inbreeding was observed in local breeds, considering that 16 Mb segments are expected to present inbreeding up to three generations ago. Our results showed the necessity of implementing conservation programs to control the rise of inbreeding and coancestry in the three Italian local dairy cattle breeds.

  17. Relations between Shannon entropy and genome order index in segmenting DNA sequences.

    PubMed

    Zhang, Yi

    2009-04-01

    Shannon entropy H and genome order index S are used in segmenting DNA sequences. Zhang [Phys. Rev. E 72, 041917 (2005)] found that the two schemes are equivalent when a DNA sequence is converted to a binary sequence of S (strong H bond) and W (weak H bond). They left the mathematical proof to mathematicians who are interested in this issue. In this paper, a possible mathematical explanation is given. Moreover, we find that Chargaff parity rule 2 is the necessary condition of the equivalence, and the equivalence disappears when a DNA sequence is regarded as a four-symbol sequence. At last, we propose that S-2(-H) may be related to species evolution.

  18. Extreme-Scale De Novo Genome Assembly

    DOE Office of Scientific and Technical Information (OSTI.GOV)

    Georganas, Evangelos; Hofmeyr, Steven; Egan, Rob

    De novo whole genome assembly reconstructs genomic sequence from short, overlapping, and potentially erroneous DNA segments and is one of the most important computations in modern genomics. This work presents HipMER, a high-quality end-to-end de novo assembler designed for extreme scale analysis, via efficient parallelization of the Meraculous code. Genome assembly software has many components, each of which stresses different components of a computer system. This chapter explains the computational challenges involved in each step of the HipMer pipeline, the key distributed data structures, and communication costs in detail. We present performance results of assembling the human genome and themore » large hexaploid wheat genome on large supercomputers up to tens of thousands of cores.« less

  19. Genome Sequence of Lactobacillus saerimneri 30a (Formerly Lactobacillus sp. Strain 30a), a Reference Lactic Acid Bacterium Strain Producing Biogenic Amines

    PubMed Central

    Romano, Andrea; Trip, Hein; Campbell-Sills, Hugo; Bouchez, Olivier; Sherman, David; Lolkema, Juke S.

    2013-01-01

    Lactobacillus sp. strain 30a (Lactobacillus saerimneri) produces the biogenic amines histamine, putrescine, and cadaverine by decarboxylating their amino acid precursors. We report its draft genome sequence (1,634,278 bases, 42.6% G+C content) and the principal findings from its annotation, which might shed light onto the enzymatic machineries that are involved in its production of biogenic amines. PMID:23405290

  20. Establishment of Hox vertebral identities in the embryonic spine precursors

    PubMed Central

    Iimura, Tadahiro; Denans, Nicolas; Pourquié, Olivier

    2012-01-01

    Summary The vertebrate spine exhibits two striking characteristics. The first one is the periodic arrangement of its elements – the vertebrae – along the antero-posterior axis. This segmented organization is the result of somitogenesis, which takes place during organogenesis. The segmentation machinery involves a molecular oscillator – the segmentation clock – which delivers a periodic signal controlling somite production. During embryonic axis elongation, this signal is displaced posteriorly by a system of traveling signaling gradients – the wavefront – which depends on the Wnt, FGF and retinoic acid pathways. The other characteristic feature of the spine is the subdivision of groups of vertebrae into anatomical domains, such as the cervical, thoracic, lumbar, sacral and caudal regions. This axial regionalization is controlled by a set of transcription factors called Hox genes. Hox genes exhibit nested expression domains in the somites which reflect their linear arrangement along the chromosomes– a property termed colinearity. The colinear disposition of Hox genes expression domains provides a blueprint for the regionalization of the future vertebral territories of the spine. In amniotes, Hox genes are activated in the somite precursors of the epiblast in a temporal colinear sequence and they were proposed to control their progressive ingression into the nascent paraxial mesoderm. Consequently, the positioning of the expression domains of Hox genes along the antero-posterior axis is largely controlled by the timing of Hox activation during gastrulation. Positioning of the somitic Hox domains is subsequently refined through a cross talk with the segmentation machinery in the presomitic mesoderm. In this review, we focus on our current understanding of the embryonic mechanisms that establish vertebral identities during vertebrate development. PMID:19651306

  1. Major Chromosomal Rearrangements Distinguish Willow and Poplar After the Ancestral “Salicoid” Genome Duplication

    PubMed Central

    Hou, Jing; Ye, Ning; Dong, Zhongyuan; Lu, Mengzhu; Li, Laigeng; Yin, Tongming

    2016-01-01

    Populus (poplar) and Salix (willow) are sister genera in the Salicaceae family. In both lineages extant species are predominantly diploid. Genome analysis previously revealed that the two lineages originated from a common tetraploid ancestor. In this study, we conducted a syntenic comparison of the corresponding 19 chromosome members of the poplar and willow genomes. Our observations revealed that almost every chromosomal segment had a parallel paralogous segment elsewhere in the genomes, and the two lineages shared a similar syntenic pinwheel pattern for most of the chromosomes, which indicated that the two lineages diverged after the genome reorganization in the common progenitor. The pinwheel patterns showed distinct differences for two chromosome pairs in each lineage. Further analysis detected two major interchromosomal rearrangements that distinguished the karyotypes of willow and poplar. Chromosome I of willow was a conjunction of poplar chromosome XVI and the lower portion of poplar chromosome I, whereas willow chromosome XVI corresponded to the upper portion of poplar chromosome I. Scientists have suggested that Populus is evolutionarily more primitive than Salix. Therefore, we propose that, after the “salicoid” duplication event, fission and fusion of the ancestral chromosomes first give rise to the diploid progenitor of extant Populus species. During the evolutionary process, fission and fusion of poplar chromosomes I and XVI subsequently give rise to the progenitor of extant Salix species. This study contributes to an improved understanding of genome divergence after ancient genome duplication in closely related lineages of higher plants. PMID:27352946

  2. PLINK: A Tool Set for Whole-Genome Association and Population-Based Linkage Analyses

    PubMed Central

    Purcell, Shaun ; Neale, Benjamin ; Todd-Brown, Kathe ; Thomas, Lori ; Ferreira, Manuel A. R. ; Bender, David ; Maller, Julian ; Sklar, Pamela ; de Bakker, Paul I. W. ; Daly, Mark J. ; Sham, Pak C. 

    2007-01-01

    Whole-genome association studies (WGAS) bring new computational, as well as analytic, challenges to researchers. Many existing genetic-analysis tools are not designed to handle such large data sets in a convenient manner and do not necessarily exploit the new opportunities that whole-genome data bring. To address these issues, we developed PLINK, an open-source C/C++ WGAS tool set. With PLINK, large data sets comprising hundreds of thousands of markers genotyped for thousands of individuals can be rapidly manipulated and analyzed in their entirety. As well as providing tools to make the basic analytic steps computationally efficient, PLINK also supports some novel approaches to whole-genome data that take advantage of whole-genome coverage. We introduce PLINK and describe the five main domains of function: data management, summary statistics, population stratification, association analysis, and identity-by-descent estimation. In particular, we focus on the estimation and use of identity-by-state and identity-by-descent information in the context of population-based whole-genome studies. This information can be used to detect and correct for population stratification and to identify extended chromosomal segments that are shared identical by descent between very distantly related individuals. Analysis of the patterns of segmental sharing has the potential to map disease loci that contain multiple rare variants in a population-based linkage analysis. PMID:17701901

  3. Segtor: Rapid Annotation of Genomic Coordinates and Single Nucleotide Variations Using Segment Trees

    PubMed Central

    Renaud, Gabriel; Neves, Pedro; Folador, Edson Luiz; Ferreira, Carlos Gil; Passetti, Fabio

    2011-01-01

    Various research projects often involve determining the relative position of genomic coordinates, intervals, single nucleotide variations (SNVs), insertions, deletions and translocations with respect to genes and their potential impact on protein translation. Due to the tremendous increase in throughput brought by the use of next-generation sequencing, investigators are routinely faced with the need to annotate very large datasets. We present Segtor, a tool to annotate large sets of genomic coordinates, intervals, SNVs, indels and translocations. Our tool uses segment trees built using the start and end coordinates of the genomic features the user wishes to use instead of storing them in a database management system. The software also produces annotation statistics to allow users to visualize how many coordinates were found within various portions of genes. Our system currently can be made to work with any species available on the UCSC Genome Browser. Segtor is a suitable tool for groups, especially those with limited access to programmers or with interest to analyze large amounts of individual genomes, who wish to determine the relative position of very large sets of mapped reads and subsequently annotate observed mutations between the reads and the reference. Segtor (http://lbbc.inca.gov.br/segtor/) is an open-source tool that can be freely downloaded for non-profit use. We also provide a web interface for testing purposes. PMID:22069465

  4. Detection of uncommon G3P[3] rotavirus A (RVA) strain in rat possessing a human RVA-like VP6 and a novel NSP2 genotype.

    PubMed

    Ianiro, Giovanni; Di Bartolo, Ilaria; De Sabato, Luca; Pampiglione, Guglielmo; Ruggeri, Franco M; Ostanello, Fabio

    2017-09-01

    Rotavirus is one of the leading causes of acute gastroenteritis in infants and young children. RVAs infect not only humans but also a wide range of mammals including rats, which represent a reservoir of several other zoonotic pathogens. Due to the segmented nature of the RVA genome, animal RVA strains can easily adapt to the human host by reassortment with co-infecting human viruses. This study aims to detect and characterize RVA in the intestinal content of Italian sinantropic rats (Rattus rattus). Out of 40 samples examined following molecular approach, one resulted positive for RVA. The molecular characterization of VP1-4, 6 and 7, and NSP1-5 genes by sequencing revealed the genomic constellation G3-P[3]-I1-R11-C11-M10-A22-N18-T14-E18-H13. This uncommon genomic combination includes: the VP1-4,VP7, the NSP1, 3, 4 and 5 gene segments, closely related to those of RVA from rodents, the N18 novel genotype established for the NSP2 gene segment and the human Wa-like VP6 gene, suggesting interspecies reassortment. Copyright © 2017 Elsevier B.V. All rights reserved.

  5. Genes encoding Xenopus laevis Ig L chains: Implications for the evolution of [kappa] and [lambda] chains

    DOE Office of Scientific and Technical Information (OSTI.GOV)

    Zezza, D.J.; Stewart, S.E.; Steiner, L.A.

    1992-12-15

    Xenopus laevis Ig contain two distinct types of L chains, designated [rho] or L1 and [sigma] or L2. The authors have analyzed Xenopus genomic DNA by Southern blotting with cDNA probes specific for L1 V and C regions. Many fragments hybridized to the V probe, but only one or two fragments hybridized to the C probe. Corresponding C, J, and V gene segments were identified on clones isolated from a genomic library prepared from the same DNA. One clone contains a C gene segment separated from a J gene segment by an intron of 3.4 kb. The J and Cmore » gene segments are nearly identical in sequence to cDNA clones analyzed previously. The C segment is somewhat more similar and the J segment considerably more similar in sequence to the corresponding segments of mammalian [kappa] chains than to those of mammalian [lambda] chains. Upstream of the J segment is a typical recombination signal sequence with a spacer of 23 bp, as in J[kappa]. A second clone from the library contains four V gene segments, separated by 2.1 to 3.6 kb. Two of these, V1 and V3, have the expected structural and regulatory features of V genes, and are very similar in sequence to each other and to mammalian V[kappa]. A third gene segment, V2, resembles V1 and V3 in its coding region and nearby 5[prime]-flanking region, but diverges in sequence 5[prime] to position [minus]95 with loss of the octamer promoter element. The fourth V-like segment is similar to the others at the 3[prime]-end, but upstream of codon 64 bears no resemblance in sequence to any Ig V region. All four V segments have typical recombination signal sequences with 12-bp spacers at their 3[prime]-ends, as in V[kappa]. Taken together, the data suggest that Xenopus L1 L chain genes are members of the [kappa] gene family. 80 refs., 9 figs.« less

  6. Characterization of the novel antifungal protein PgAFP and the encoding gene of Penicillium chrysogenum.

    PubMed

    Rodríguez-Martín, Andrea; Acosta, Raquel; Liddell, Susan; Núñez, Félix; Benito, M José; Asensio, Miguel A

    2010-04-01

    The strain RP42C from Penicillium chrysogenum produces a small protein PgAFP that inhibits the growth of some toxigenic molds. The molecular mass of the protein determined by electrospray ionization mass spectrometry (ESI-MS) was 6 494Da. PgAFP showed a cationic character with an estimated pI value of 9.22. Upon chemical and enzymatic treatments of PgAFP, no evidence for N- or O-glycosylations was obtained. Five partial sequences of PgAFP were obtained by Edman degradation and by ESI-MS/MS after trypsin and chymotrypsin digestions. Using degenerate primers from these peptide sequences, a segment of 70bp was amplified by PCR from pgafp gene. 5'- and 3'-ends of pgafp were obtained by RACE-PCR with gene-specific primers designed from the 70bp segment. The complete pgafp sequence of 404bp was obtained using primers designed from 5'- and 3'-ends. Comparison of genomic and cDNA sequences revealed a 279bp coding region interrupted by two introns of 63 and 62bp. The precursor of the antifungal protein consists of 92 amino acids and appears to be processed to the mature 58 amino acids PgAFP. The deduced amino acid sequence of the mature protein shares 79% identity to the antifungal protein Anafp from Aspergillus niger. PgAFP is a new protein that belongs to the group of small, cysteine-rich, and basic proteins with antifungal activity produced by ascomycetes. Given that P. chrysogenum is regarded as safe mold commonly found in foods, PgAFP may be useful to prevent growth of toxigenic molds in food and agricultural products. Copyright (c) 2009 Elsevier Inc. All rights reserved.

  7. Characterization of an internal ribosomal entry segment within the 5' leader of avian reticuloendotheliosis virus type A RNA and development of novel MLV-REV-based retroviral vectors.

    PubMed

    López-Lastra, M; Gabus, C; Darlix, J L

    1997-11-01

    The murine leukemia virus (MLV)-related type C viruses constitute a major class of retroviruses that includes numerous endogenous and exogenous mammalian viruses and the related avian spleen necrosis virus (SNV). The MLV-related viruses possess a long and multifunctional 5' untranslated leader involved in key steps of the viral life cycle--splicing, translation, RNA dimerization, encapsidation, and reverse transcription. Recent studies have shown that the 5' leader of Friend murine leukemia virus and Moloney murine leukemia virus can direct cap independent translation of gag precursor proteins (Berlioz et al., 1995; Vagner et al., 1995b). These data, together with structural homology studies (Koning et al., 1992), prompted us to undertake a search for new internal ribosome entry segment (IRES) of retroviral origin. Here we describe an IRES element within the 5' leader of avian reticuloendotheliosis virus type A (REV-A) genomic RNA. Data show that the REV-A 5' IRES element maps downstream of the packaging/dimerization (E/DLS) sequence (Watanabe and Temin, 1982; Darlix et al., 1992) and the minimal IRES sequence appears to be within a 129 nt fragment (nucleotides 452-580) of the 5' leader, immediately upstream of the gag AUG codon. The REV-A IRES has been successfully utilized in the construction of novel high titer MLV-based retroviral vectors, containing one or more IRES elements of retroviral origin. These retroviral constructs, which represent a starting point for the design of novel vectors suitable for gene therapy, are also of interest as a model system of internal translation initiation and its possible regulation during development, cancer, or virus infection.

  8. Migration, Integration and Maturation of Photoreceptor Precursors Following Transplantation in the Mouse Retina

    PubMed Central

    Warre-Cornish, Katherine; Barber, Amanda C.; Sowden, Jane C.; Ali, Robin R.

    2014-01-01

    Retinal degeneration leading to loss of photoreceptors is a major cause of untreatable blindness. Recent research has yielded definitive evidence for restoration of vision following the transplantation of rod photoreceptors in murine models of blindness, while advances in stem cell biology have enabled the generation of transplantable photoreceptors from embryonic stem cells. Importantly, the amount of visual function restored is dependent upon the number of photoreceptors that migrate correctly into the recipient retina. The developmental stage of the donor cells is important for their ability to migrate; they must be immature photoreceptor precursors. Little is known about how and when donor cell migration, integration, and maturation occurs. Here, we have performed a comprehensive histological analysis of the 6-week period following rod transplantation in mice. Donor cells migrate predominately as single entities during the first week undergoing a stereotyped sequence of morphological changes in their translocation from the site of transplantation, through the interphotoreceptor matrix and into the recipient retina. This includes initial polarization toward the outer nuclear layer (ONL), followed by formation of an apical attachment and rudimentary segment during migration into the ONL. Strikingly, acquisition of a nuclear architecture typical of mature rods was accelerated compared with normal development and a feature of migrating cells. Once within the ONL, precursors formed synaptic-like structures and outer segments in accordance with normal maturation. The restoration of visual function mediated by transplanted photoreceptors correlated with the later expression of rod α-transducin, achieving maximal function by 5 weeks. PMID:24328605

  9. Identifying uniformly mutated segments within repeats.

    PubMed

    Sahinalp, S Cenk; Eichler, Evan; Goldberg, Paul; Berenbrink, Petra; Friedetzky, Tom; Ergun, Funda

    2004-12-01

    Given a long string of characters from a constant size alphabet we present an algorithm to determine whether its characters have been generated by a single i.i.d. random source. More specifically, consider all possible n-coin models for generating a binary string S, where each bit of S is generated via an independent toss of one of the n coins in the model. The choice of which coin to toss is decided by a random walk on the set of coins where the probability of a coin change is much lower than the probability of using the same coin repeatedly. We present a procedure to evaluate the likelihood of a n-coin model for given S, subject a uniform prior distribution over the parameters of the model (that represent mutation rates and probabilities of copying events). In the absence of detailed prior knowledge of these parameters, the algorithm can be used to determine whether the a posteriori probability for n=1 is higher than for any other n>1. Our algorithm runs in time O(l4logl), where l is the length of S, through a dynamic programming approach which exploits the assumed convexity of the a posteriori probability for n. Our test can be used in the analysis of long alignments between pairs of genomic sequences in a number of ways. For example, functional regions in genome sequences exhibit much lower mutation rates than non-functional regions. Because our test provides means for determining variations in the mutation rate, it may be used to distinguish functional regions from non-functional ones. Another application is in determining whether two highly similar, thus evolutionarily related, genome segments are the result of a single copy event or of a complex series of copy events. This is particularly an issue in evolutionary studies of genome regions rich with repeat segments (especially tandemly repeated segments).

  10. BAC-pool sequencing and analysis of large segments of A12 and D12 homoeologous chromosomes in Upland cotton

    USDA-ARS?s Scientific Manuscript database

    New and emerging next generation sequencing technologies have reduced sequencing costs, but there is room for additional approaches that can be applied to complex polyploid plant genomes. Large (about 2.5GB) and highly repetitive tetraploid genome of G. hirsutum is still cost-intensive with traditi...

  11. Sorting cancer karyotypes using double-cut-and-joins, duplications and deletions.

    PubMed

    Zeira, Ron; Shamir, Ron

    2018-05-03

    Problems of genome rearrangement are central in both evolution and cancer research. Most genome rearrangement models assume that the genome contains a single copy of each gene and the only changes in the genome are structural, i.e., reordering of segments. In contrast, tumor genomes also undergo numerical changes such as deletions and duplications, and thus the number of copies of genes varies. Dealing with unequal gene content is a very challenging task, addressed by few algorithms to date. More realistic models are needed to help trace genome evolution during tumorigenesis. Here we present a model for the evolution of genomes with multiple gene copies using the operation types double-cut-and-joins, duplications and deletions. The events supported by the model are reversals, translocations, tandem duplications, segmental deletions, and chromosomal amplifications and deletions, covering most types of structural and numerical changes observed in tumor samples. Our goal is to find a series of operations of minimum length that transform one karyotype into the other. We show that the problem is NP-hard and give an integer linear programming formulation that solves the problem exactly under some mild assumptions. We test our method on simulated genomes and on ovarian cancer genomes. Our study advances the state of the art in two ways: It allows a broader set of operations than extant models, thus being more realistic, and it is the first study attempting to reconstruct the full sequence of structural and numerical events during cancer evolution. Code and data are available in https://github.com/Shamir-Lab/Sorting-Cancer-Karyotypes. ronzeira@post.tau.ac.il, rshamir@tau.ac.il. Supplementary data are available at Bioinformatics online.

  12. Genomic features of intertypic recombinant sabin poliovirus strains excreted by primary vaccinees.

    PubMed

    Cuervo, N S; Guillot, S; Romanenkova, N; Combiescu, M; Aubert-Combiescu, A; Seghier, M; Caro, V; Crainic, R; Delpeyroux, F

    2001-07-01

    The trivalent oral poliomyelitis vaccine (OPV) contains three different poliovirus serotypes. It use therefore creates particularly favorable conditions for mixed infection of gut cells, and indeed intertypic vaccine-derived recombinants (VdRec) have been frequently found in patients with vaccine-associated paralytic poliomyelitis. Nevertheless, there have not been extensive searches for VdRec in healthy vaccinees following immunization with OPV. To determine the incidence of VdRec and their excretion kinetics in primary vaccinees, and to establish the general genomic features of the corresponding recombinant genomes, we characterized poliovirus isolates excreted by vaccinees following primary immunization with OPV. Isolates were collected from 67 children 2 to 60 days following vaccination. Recombinant strains were identified by multiple restriction fragment length polymorphism assays. The localization of junction sites in recombinant genomes was also determined. VdRec excreted by vaccinees were first detected 2 to 4 days after vaccination. The highest rate of recombinants was on day 14. The frequency of VdRec depends strongly on the serotype of the analyzed isolates (2, 53, and 79% of recombinant strains in the last-excreted type 1, 2, and 3 isolates, respectively). Particular associations of genomic segments were preferred in the recombinant genomes, and recombination junctions were found in the genomic region encoding the nonstructural proteins. Recombination junctions generally clustered in particular subgenomic regions that were dependent on the serotype of the isolate and/or on the associations of genomic segments in recombinants. Thus, VdRec are frequently excreted by vaccinees, and the poliovirus replication machinery requirements or selection factors appear to act in vivo to shape the features of the recombinant genomes.

  13. Evolutionary characterization of Ty3/gypsy-like LTR retrotransposons in the parasitic cestode Echinococcus granulosus.

    PubMed

    Bae, Young-An

    2016-11-01

    Cyclophyllidean cestodes including Echinococcus granulosus have a smaller genome and show characteristics such as loss of the gut, a segmented body plan, and accelerated growth rate in hosts compared with other tissue-invading helminths. In an effort to address the molecular mechanism relevant to genome shrinkage, the evolutionary status of long-terminal-repeat (LTR) retrotransposons, which are known as the most potent genomic modulators, was investigated in the E. granulosus draft genome. A majority of the E. granulosus LTR retrotransposons were classified into a novel characteristic clade, named Saci-2, of the Ty3/gypsy family, while the remaining elements belonged to the CsRn1 clade of identical family. Their nucleotide sequences were heavily corrupted by frequent base substitutions and segmental losses. The ceased mobile activity of the major retrotransposons and the following intrinsic DNA loss in their inactive progenies might have contributed to decrease in genome size. Apart from the degenerate copies, a gag gene originating from a CsRn1-like element exhibited substantial evidences suggesting its domestication including a preserved coding profile and transcriptional activity, the presence of syntenic orthologues in cestodes, and selective pressure acting on the gene. To my knowledge, the endogenized gag gene is reported for the first time in invertebrates, though its biological function remains elusive.

  14. Immunoglobulin Genomics in the Guinea Pig (Cavia porcellus)

    PubMed Central

    Guo, Yongchen; Bao, Yonghua; Meng, Qingwen; Hu, Xiaoxiang; Meng, Qingyong; Ren, Liming; Li, Ning; Zhao, Yaofeng

    2012-01-01

    In science, the guinea pig is known as one of the gold standards for modeling human disease. It is especially important as a molecular and cellular biology model for studying the human immune system, as its immunological genes are more similar to human genes than are those of mice. The utility of the guinea pig as a model organism can be further enhanced by further characterization of the genes encoding components of the immune system. Here, we report the genomic organization of the guinea pig immunoglobulin (Ig) heavy and light chain genes. The guinea pig IgH locus is located in genomic scaffolds 54 and 75, and spans approximately 6,480 kb. 507 VH segments (94 potentially functional genes and 413 pseudogenes), 41 DH segments, six JH segments, four constant region genes (μ, γ, ε, and α), and one reverse δ remnant fragment were identified within the two scaffolds. Many VH pseudogenes were found within the guinea pig, and likely constituted a potential donor pool for gene conversion during evolution. The Igκ locus mapped to a 4,029 kb region of scaffold 37 and 24 is composed of 349 Vκ (111 potentially functional genes and 238 pseudogenes), three Jκ and one Cκ genes. The Igλ locus spans 1,642 kb in scaffold 4 and consists of 142 Vλ (58 potentially functional genes and 84 pseudogenes) and 11 Jλ -Cλ clusters. Phylogenetic analysis suggested the guinea pig’s large germline VH gene segments appear to form limited gene families. Therefore, this species may generate antibody diversity via a gene conversion-like mechanism associated with its pseudogene reserves. PMID:22761756

  15. Transcriptome sequencing of Atlantic salmon (Salmo salar L.) notochord prior to development of the vertebrae provides clues to regulation of positional fate, chordoblast lineage and mineralisation

    PubMed Central

    2014-01-01

    Background In teleosts such as Atlantic salmon (Salmo salar L.), segmentation and subsequent mineralisation of the notochord during embryonic stages are essential for normal vertebrae formation. However, the molecular mechanisms leading to segmentation and mineralisation of the notochord are poorly understood. The aim of this study was to identify genes/pathways acting in gradients over time and along the anterior-posterior axis during notochord segmentation and immediately prior to mineralisation of the vertebral bodies in Atlantic salmon. Results Notochord samples were collected from unsegmented, pre-segmented and segmented developmental stages. In each stage, the cellular core of the notochord was cut into three pieces along the longitudinal axis (anterior, mid, posterior). RNA was sequenced (22 million pair-end 100 bp/ library) and mapped to the salmon genome. 66569 transcripts were predicted and 55775 were annotated. In order to identify possible gradients leading to segmentation of the notochord, all 71 notochord-expressed hox genes were investigated, most of them displaying a typical anterior-posterior expression pattern along the notochord axis. The clustering of hox genes revealed a pattern that could be related to notochord segmentation. We further investigated how mineralisation is initiated in the notochord, and several factors related to chondrogenic lineage were identified (sox9, sox5, sox6, tgfb3, ihhb and col2a1), suggesting a cartilage-like character of the notochord. KEGG analysis of differentially expressed genes between stages revealed down-regulation of pathways associated with ECM, cell division, metabolism and development at onset of notochord segmentation. This implies that inhibitory signals produce segmentation of the notochord. One such potential inhibitory signal was identified, col11a2, which was detected in segments of non-mineralising notochord. Conclusions An incomplete salmon genome was successfully used to analyse RNA-seq data from the cellular core of the Atlantic salmon notochord. In transcriptome we found; hox gene patterns possibly linked to segmentation; down-regulation of pathways in the notochord at onset of segmentation; segmented expression of col11a2 in non-mineralised segments of the notochord; and a chondroblast-like footprint in the notochord. PMID:24548379

  16. Transformation-associated recombination (TAR) cloning for genomics studies and synthetic biology

    PubMed Central

    Kouprina, Natalay; Larionov, Vladimir

    2016-01-01

    Transformation-associated recombination (TAR) cloning represents a unique tool for isolation and manipulation of large DNA molecules. The technique exploits a high level of homologous recombination in the yeast Sacharomyces cerevisiae. So far, TAR cloning is the only method available to selectively recover chromosomal segments up to 300 kb in length from complex and simple genomes. In addition, TAR cloning allows the assembly and cloning of entire microbe genomes up to several Mb as well as engineering of large metabolic pathways. In this review, we summarize applications of TAR cloning for functional/structural genomics and synthetic biology. PMID:27116033

  17. The genome phylogeny of domestic cat, red panda and five mustelid species revealed by comparative chromosome painting and G-banding.

    PubMed

    Nie, Wenhui; Wang, Jinhuan; O'Brien, Patricia C M; Fu, Beiyuan; Ying, Tian; Ferguson-Smith, Malcolm A; Yang, Fengtang

    2002-01-01

    Genome-wide homology maps among stone marten (Martes foina, 2n = 38), domestic cat (Felis catus, 2n = 38), American mink (Mustela vison, 2n = 30), yellow-throated marten (Martes flavigula, 2n = 40), Old World badger (Meles meles, 2n = 44), ferret badger (Melogale moschata, 2n = 38) and red panda (Ailurus fulgens, 2n = 36) have been established by cross-species chromosome painting with a complete set of stone marten probes. In total, 18 stone marten autosomal probes reveal 20, 19, 21, 18 and 21 pairs of homologous chromosomal segments in the respective genomes of American mink, yellow-throated marten. Old World badger, ferret badger and red panda. Reciprocal painting between stone marten and cat delineated 21 pairs of homologous segments shared in both stone marten and cat genomes. The chromosomal painting results indicate that most chromosomes of these species are highly conserved and show one-to-one correspondence with stone marten and cat chromosomes or chromosomal arms, and that only a few interchromosomal rearrangements (Robertsonian fusions and fissions) have occurred during species radiation. By comparing the distribution patterns of conserved chromosomal segments in both these species and the putative ancestral carnivore karyotype, we have reconstructed the pathway of karyotype evolution of these species from the putative 2n = 42 ancestral carnivore karyotype. Our results support a close phylogenetic relationship between the red panda and mustelids. The homology data presented in these maps will allow us to transfer the cat gene mapping data to other unmapped carnivore species.

  18. Horizontally transferred genes in the genome of Pacific white shrimp, Litopenaeus vannamei

    PubMed Central

    2013-01-01

    Background In recent years, as the development of next-generation sequencing technology, a growing number of genes have been reported as being horizontally transferred from prokaryotes to eukaryotes, most of them involving arthropods. As a member of the phylum Arthropoda, the Pacific white shrimp Litopenaeus vannamei has to adapt to the complex water environments with various symbiotic or parasitic microorganisms, which provide a platform for horizontal gene transfer (HGT). Results In this study, we analyzed the genome-wide HGT events in L. vannamei. Through homology search and phylogenetic analysis, followed by experimental PCR confirmation, 14 genes with HGT event were identified: 12 of them were transferred from bacteria and two from fungi. Structure analysis of these genes showed that the introns of the two fungi-originated genes were substituted by shrimp DNA fragment, two genes transferred from bacteria had shrimp specific introns inserted in them. Furthermore, around other three bacteria-originated genes, there were three large DNA segments inserted into the shrimp genome. One segment was a transposon that fully transferred, and the other two segments contained only coding regions of bacteria. Functional prediction of these 14 genes showed that 6 of them might be related to energy metabolism, and 4 others related to defense of the organism. Conclusions HGT events from bacteria or fungi were happened in the genome of L. vannamei, and these horizontally transferred genes can be transcribed in shrimp. This is the first time to report the existence of horizontally transferred genes in shrimp. Importantly, most of these genes are exposed to a negative selection pressure and appeared to be functional. PMID:23914989

  19. The evolution of neuropeptide signalling: insights from echinoderms.

    PubMed

    Semmens, Dean C; Elphick, Maurice R

    2017-09-01

    Neuropeptides are evolutionarily ancient mediators of neuronal signalling that regulate a wide range of physiological processes and behaviours in animals. Neuropeptide signalling has been investigated extensively in vertebrates and protostomian invertebrates, which include the ecdysozoans Drosophila melanogaster (Phylum Arthropoda) and Caenorhabditis elegans (Phylum Nematoda). However, until recently, an understanding of evolutionary relationships between neuropeptide signalling systems in vertebrates and protostomes has been impaired by a lack of genome/transcriptome sequence data from non-ecdysozoan invertebrates. The echinoderms-a deuterostomian phylum that includes sea urchins, sea cucumbers and starfish-have been particularly important in providing new insights into neuropeptide evolution. Sequencing of the genome of the sea urchin Strongylocentrotus purpuratus (Class Echinoidea) enabled discovery of (i) the first invertebrate thyrotropin-releasing hormone-type precursor, (ii) the first deuterostomian pedal peptide/orcokinin-type precursors and (iii) NG peptides-the 'missing link' between neuropeptide S in tetrapod vertebrates and crustacean cardioactive peptide in protostomes. More recently, sequencing of the neural transcriptome of the starfish Asterias rubens (Class Asteroidea) enabled identification of 40 neuropeptide precursors, including the first kisspeptin and melanin-concentrating hormone-type precursors to be identified outside of the chordates. Furthermore, the characterization of a corazonin-type neuropeptide signalling system in A. rubens has provided important new insights into the evolution of gonadotropin-releasing hormone-related neuropeptides. Looking forward, the discovery of multiple neuropeptide signalling systems in echinoderms provides opportunities to investigate how these systems are used to regulate physiological and behavioural processes in the unique context of a decentralized, pentaradial bauplan. © The Author 2017. Published by Oxford University Press.

  20. Peptidomics of Neuropeptidergic Tissues of the Tsetse Fly Glossina morsitans morsitans

    NASA Astrophysics Data System (ADS)

    Caers, Jelle; Boonen, Kurt; Van Den Abbeele, Jan; Van Rompay, Liesbeth; Schoofs, Liliane; Van Hiel, Matthias B.

    2015-12-01

    Neuropeptides and peptide hormones are essential signaling molecules that regulate nearly all physiological processes. The recent release of the tsetse fly genome allowed the construction of a detailed in silico neuropeptide database (International Glossina Genome Consortium, Science 344, 380-386 (2014)), as well as an in-depth mass spectrometric analysis of the most important neuropeptidergic tissues of this medically and economically important insect species. Mass spectrometric confirmation of predicted peptides is a vital step in the functional characterization of neuropeptides, as in vivo peptides can be modified, cleaved, or even mispredicted. Using a nanoscale reversed phase liquid chromatography coupled to a Q Exactive Orbitrap mass spectrometer, we detected 51 putative bioactive neuropeptides encoded by 19 precursors: adipokinetic hormone (AKH) I and II, allatostatin A and B, capability/pyrokinin (capa/PK), corazonin, calcitonin-like diuretic hormone (CT/DH), FMRFamide, hugin, leucokinin, myosuppressin, natalisin, neuropeptide-like precursor (NPLP) 1, orcokinin, pigment dispersing factor (PDF), RYamide, SIFamide, short neuropeptide F (sNPF) and tachykinin. In addition, propeptides, truncated and spacer peptides derived from seven additional precursors were found, and include the precursors of allatostatin C, crustacean cardioactive peptide, corticotropin releasing factor-like diuretic hormone (CRF/DH), ecdysis triggering hormone (ETH), ion transport peptide (ITP), neuropeptide F, and proctolin, respectively. The majority of the identified neuropeptides are present in the central nervous system, with only a limited number of peptides in the corpora cardiaca-corpora allata and midgut. Owing to the large number of identified peptides, this study can be used as a reference for comparative studies in other insects.

  1. Comparative genomics reveals insights into avian genome evolution and adaptation

    PubMed Central

    Zhang, Guojie; Li, Cai; Li, Qiye; Li, Bo; Larkin, Denis M.; Lee, Chul; Storz, Jay F.; Antunes, Agostinho; Greenwold, Matthew J.; Meredith, Robert W.; Ödeen, Anders; Cui, Jie; Zhou, Qi; Xu, Luohao; Pan, Hailin; Wang, Zongji; Jin, Lijun; Zhang, Pei; Hu, Haofu; Yang, Wei; Hu, Jiang; Xiao, Jin; Yang, Zhikai; Liu, Yang; Xie, Qiaolin; Yu, Hao; Lian, Jinmin; Wen, Ping; Zhang, Fang; Li, Hui; Zeng, Yongli; Xiong, Zijun; Liu, Shiping; Zhou, Long; Huang, Zhiyong; An, Na; Wang, Jie; Zheng, Qiumei; Xiong, Yingqi; Wang, Guangbiao; Wang, Bo; Wang, Jingjing; Fan, Yu; da Fonseca, Rute R.; Alfaro-Núñez, Alonzo; Schubert, Mikkel; Orlando, Ludovic; Mourier, Tobias; Howard, Jason T.; Ganapathy, Ganeshkumar; Pfenning, Andreas; Whitney, Osceola; Rivas, Miriam V.; Hara, Erina; Smith, Julia; Farré, Marta; Narayan, Jitendra; Slavov, Gancho; Romanov, Michael N; Borges, Rui; Machado, João Paulo; Khan, Imran; Springer, Mark S.; Gatesy, John; Hoffmann, Federico G.; Opazo, Juan C.; Håstad, Olle; Sawyer, Roger H.; Kim, Heebal; Kim, Kyu-Won; Kim, Hyeon Jeong; Cho, Seoae; Li, Ning; Huang, Yinhua; Bruford, Michael W.; Zhan, Xiangjiang; Dixon, Andrew; Bertelsen, Mads F.; Derryberry, Elizabeth; Warren, Wesley; Wilson, Richard K; Li, Shengbin; Ray, David A.; Green, Richard E.; O’Brien, Stephen J.; Griffin, Darren; Johnson, Warren E.; Haussler, David; Ryder, Oliver A.; Willerslev, Eske; Graves, Gary R.; Alström, Per; Fjeldså, Jon; Mindell, David P.; Edwards, Scott V.; Braun, Edward L.; Rahbek, Carsten; Burt, David W.; Houde, Peter; Zhang, Yong; Yang, Huanming; Wang, Jian; Jarvis, Erich D.; Gilbert, M. Thomas P.; Wang, Jun

    2015-01-01

    Birds are the most species-rich class of tetrapod vertebrates and have wide relevance across many research fields. We explored bird macroevolution using full genomes from 48 avian species representing all major extant clades. The avian genome is principally characterized by its constrained size, which predominantly arose because of lineage-specific erosion of repetitive elements, large segmental deletions, and gene loss. Avian genomes furthermore show a remarkably high degree of evolutionary stasis at the levels of nucleotide sequence, gene synteny, and chromosomal structure. Despite this pattern of conservation, we detected many non-neutral evolutionary changes in protein-coding genes and noncoding regions. These analyses reveal that pan-avian genomic diversity covaries with adaptations to different lifestyles and convergent evolution of traits. PMID:25504712

  2. Distribution of genome shared identical by descent by two individuals in grandparent-type relationship.

    PubMed Central

    Stefanov, V T

    2000-01-01

    A methodology is introduced for numerical evaluation, with any given accuracy, of the cumulative probabilities of the proportion of genome shared identical by descent (IBD) on chromosome segments by two individuals in a grandparent-type relationship. Programs are provided in the popular software package Maple for rapidly implementing such evaluations in the cases of grandchild-grandparent and great-grandchild-great-grandparent relationships. Our results can be used to identify chromosomal segments that may contain disease genes. Also, exact P values in significance testing for resemblance of either a grandparent with a grandchild or a great-grandparent with a great-grandchild can be calculated. The genomic continuum model, with Haldane's model for the crossover process, is assumed. This is the model that has been used recently in the genetics literature devoted to IBD calculations. Our methodology is based on viewing the model as a special exponential family and elaborating on recent research results for such families. PMID:11063711

  3. The structure and evolution of angiosperm nuclear genomes.

    PubMed

    Bennetzen, J L

    1998-04-01

    Despite several decades of investigation, the organization of angiosperm genomes remained largely unknown until very recently. Data describing the sequence composition of large segments of genomes, covering hundreds of kilobases of contiguous sequence, have only become available in the past two years. Recent results indicate commonalities in the characteristics of many plant genomes, including in the structure of chromosomal components like telomeres and centromeres, and in the order and content of genes. Major differences between angiosperms have been associated mainly with repetitive DNAs, both gene families and mobile elements. Intriguing new studies have begun to characterize the dynamic three-dimensional structures of chromosomes and chromatin, and the relationship between genome structure and co-ordinated gene function.

  4. Exon trapping: a genetic screen to identify candidate transcribed sequences in cloned mammalian genomic DNA.

    PubMed

    Duyk, G M; Kim, S W; Myers, R M; Cox, D R

    1990-11-01

    Identification and recovery of transcribed sequences from cloned mammalian genomic DNA remains an important problem in isolating genes on the basis of their chromosomal location. We have developed a strategy that facilitates the recovery of exons from random pieces of cloned genomic DNA. The basis of this "exon trapping" strategy is that, during a retroviral life cycle, genomic sequences of nonviral origin are correctly spliced and may be recovered as a cDNA copy of the introduced segment. By using this genetic assay for cis-acting sequences required for RNA splicing, we have screened approximately 20 kilobase pairs of cloned genomic DNA and have recovered all four predicted exons.

  5. Exon trapping: a genetic screen to identify candidate transcribed sequences in cloned mammalian genomic DNA.

    PubMed Central

    Duyk, G M; Kim, S W; Myers, R M; Cox, D R

    1990-01-01

    Identification and recovery of transcribed sequences from cloned mammalian genomic DNA remains an important problem in isolating genes on the basis of their chromosomal location. We have developed a strategy that facilitates the recovery of exons from random pieces of cloned genomic DNA. The basis of this "exon trapping" strategy is that, during a retroviral life cycle, genomic sequences of nonviral origin are correctly spliced and may be recovered as a cDNA copy of the introduced segment. By using this genetic assay for cis-acting sequences required for RNA splicing, we have screened approximately 20 kilobase pairs of cloned genomic DNA and have recovered all four predicted exons. PMID:2247475

  6. Draft Genome Sequences of Mycobacterium setense Type Strain DSM-45070 and the Nonpathogenic Strain Manresensis, Isolated from the Bank of the Cardener River in Manresa, Catalonia, Spain

    PubMed Central

    Vilaplana, Cristina; Velasco, Juan; Pluvinet, Raquel; Santín, Sheila; Prat, Cristina; Julián, Esther; Alcaide, Fernando; Comas, Iñaki; Sumoy, Lauro; Cardona, Pere-Joan

    2015-01-01

    We present here the draft genome sequences of two Mycobacterium setense strains. One of them corresponds to the M. setense type strain DSM-45070, originally isolated from a patient with a posttraumatic chronic skin abscess. The other one corresponds to the nonpathogenic M. setense strain Manresensis, isolated from the Cardener River crossing Manresa, Catalonia, Spain. A comparative genomic analysis shows a smaller genome size and fewer genes in M. setense strain Manresensis relative to those of the type strain, and it shows the genome segments unique to each strain. PMID:25657273

  7. RNA secondary structures of the bacteriophage phi6 packaging regions.

    PubMed

    Pirttimaa, M J; Bamford, D H

    2000-06-01

    Bacteriophage phi6 genome consists of three segments of double-stranded RNA. During maturation, single-stranded copies of these segments are packaged into preformed polymerase complex particles. Only phi6 RNA is packaged, and each particle contains only one copy of each segment. An in vitro packaging and replication assay has been developed for phi6, and the packaging signals (pac sites) have been mapped to the 5' ends of the RNA segments. In this study, we propose secondary structure models for the pac sites of phi6 single-stranded RNA segments. Our models accommodate data from structure-specific chemical modifications, free energy minimizations, and phylogenetic comparisons. Previously reported pac site deletion studies are also discussed. Each pac site possesses a unique architecture, that, however, contains common structural elements.

  8. Extensive concerted evolution of rice paralogs and the road to regaining independence.

    PubMed

    Wang, Xiyin; Tang, Haibao; Bowers, John E; Feltus, Frank A; Paterson, Andrew H

    2007-11-01

    Many genes duplicated by whole-genome duplications (WGDs) are more similar to one another than expected. We investigated whether concerted evolution through conversion and crossing over, well-known to affect tandem gene clusters, also affects dispersed paralogs. Genome sequences for two Oryza subspecies reveal appreciable gene conversion in the approximately 0.4 MY since their divergence, with a gradual progression toward independent evolution of older paralogs. Since divergence from subspecies indica, approximately 8% of japonica paralogs produced 5-7 MYA on chromosomes 11 and 12 have been affected by gene conversion and several reciprocal exchanges of chromosomal segments, while approximately 70-MY-old "paleologs" resulting from a genome duplication (GD) show much less conversion. Sequence similarity analysis in proximal gene clusters also suggests more conversion between younger paralogs. About 8% of paleologs may have been converted since rice-sorghum divergence approximately 41 MYA. Domain-encoding sequences are more frequently converted than nondomain sequences, suggesting a sort of circularity--that sequences conserved by selection may be further conserved by relatively frequent conversion. The higher level of concerted evolution in the 5-7 MY-old segmental duplication may reflect the behavior of many genomes within the first few million years after duplication or polyploidization.

  9. High-Resolution Patterns of Meiotic Recombination across the Human Major Histocompatibility Complex

    PubMed Central

    Cullen, Michael; Perfetto, Stephen P.; Klitz, William; Nelson, George; Carrington, Mary

    2002-01-01

    Definitive characteristics of meiotic recombination events over large (i.e., >1 Mb) segments of the human genome remain obscure, yet they are essential for establishing the haplotypic structure of the genome and for efficient mapping of complex traits. We present a high-resolution map of recombination at the kilobase level across a 3.3-Mb interval encompassing the major histocompatibility complex (MHC). Genotyping of 20,031 single sperm from 12 individuals resulted in the identification and fine mapping of 325 recombinant chromosomes within genomic intervals as small as 7 kb. Several principal characteristics of recombination in this region were observed: (1) rates of recombination can differ significantly between individuals; (2) intense hot spots of recombination occur at least every 0.8 Mb but are not necessarily evenly spaced; (3) distribution in the location of recombination events can differ significantly among individuals; (4) between hot spots, low levels of recombination occur fairly evenly across 100-kb segments, suggesting the presence of warm spots of recombination; and (5) specific sequence motifs associate significantly with recombination distribution. These data provide a plausible model for recombination patterns of the human genome overall. PMID:12297984

  10. Tracing the evolutionary history of the pandemic group A streptococcal M1T1 clone

    PubMed Central

    Maamary, Peter G.; Ben Zakour, Nouri L.; Cole, Jason N.; Hollands, Andrew; Aziz, Ramy K.; Barnett, Timothy C.; Cork, Amanda J.; Henningham, Anna; Sanderson-Smith, Martina; McArthur, Jason D.; Venturini, Carola; Gillen, Christine M.; Kirk, Joshua K.; Johnson, Dwight R.; Taylor, William L.; Kaplan, Edward L.; Kotb, Malak; Nizet, Victor; Beatson, Scott A.; Walker, Mark J.

    2012-01-01

    The past 50 years has witnessed the emergence of new viral and bacterial pathogens with global effect on human health. The hyperinvasive group A Streptococcus (GAS) M1T1 clone, first detected in the mid-1980s in the United States, has since disseminated worldwide and remains a major cause of severe invasive human infections. Although much is understood regarding the capacity of this pathogen to cause disease, much less is known of the precise evolutionary events selecting for its emergence. We used high-throughput technologies to sequence a World Health Organization strain collection of serotype M1 GAS and reconstructed its phylogeny based on the analysis of core genome single-nucleotide polymorphisms. We demonstrate that acquisition of a 36-kb genome segment from serotype M12 GAS and the bacteriophage-encoded DNase Sda1 led to increased virulence of the M1T1 precursor and occurred relatively early in the molecular evolutionary history of this strain. The more recent acquisition of the phage-encoded superantigen SpeA is likely to have provided selection advantage for the global dissemination of the M1T1 clone. This study provides an exemplar for the evolution and emergence of virulent clones from microbial populations existing commensally or causing only superficial infection.—Maamary, P. G., Ben Zakour, N. L., Cole, J. N., Hollands, A., Aziz, R. K., Barnett, T. C., Cork, A. J., Henningham, A., Sanderson-Smith, M., McArthur, J. D., Venturini, C., Gillen, C. M., Kirk, J. K., Johnson, D. R., Taylor, W. L., Kaplan, E. L., Kotb, M., Nizet, V., Beatson, S. A., Walker, M. J. Tracing the evolutionary history of the pandemic group A streptococcal M1T1 clone. PMID:22878963

  11. Rearrangements of mycoreovirus 1 S1, S2 and S3 induced by the multifunctional protein p29 encoded by the prototypic hypovirus Cryphonectria hypovirus 1 strain EP713.

    PubMed

    Tanaka, Toru; Sun, Liying; Tsutani, Kouhei; Suzuki, Nobuhiro

    2011-08-01

    Mycoreovirus 1 (MyRV1), a member of the family Reoviridae possessing a genome consisting of 11 dsRNA segments (S1-S11), infects the chestnut blight fungus and reduces its virulence (hypovirulence). Studies have previously demonstrated reproducible induction of intragenic rearrangements of MyRV1 S6 (S6L: almost full-length duplication) and S10 (S10ss: internal deletion of three-quarters of the ORF), mediated by the multifunctional protein p29 encoded by the prototype hypovirus, Cryphonectria hypovirus 1 (CHV1) strain EP713, of the family Hypoviridae with ssRNA genomes. The current study showed that CHV1 p29 also induced rearrangements of the three largest MyRV1 segments, S1, S2 and S3, which encode structural proteins. These rearranged segments involved in-frame extensions of almost two-thirds of the ORFs (S1L, S2L and S3L, respectively), which is rare for a reovirus rearrangement. MyRV1 variants carrying S1L, S2L or S3L always contained S10ss (MyRV1/S1L+S10ss2, MyRV1/S2L+S10ss2 or MyRV1/S3L+S10ss2). Levels of mRNAs for the rearranged and co-existing unaltered genome segments in fungal colonies infected with each of the MyRV1 variants appeared to be comparable to those for the corresponding normal segments in wild-type MyRV1-infected colonies, suggesting that the rearranged segments were fully competent for packaging and transcription. Protein products of the rearranged segments were detectable in fungal colonies infected with S2L MyRV1/S2L+S10ss2 and S3L MyRV1/S3L+S10ss2, whilst S1L-encoded protein remained undetectable. S1L, S2L and S3L were associated with enhancement of the aerial hyphae growth rate. This study has provided additional examples of MyRV1 intragenic rearrangements induced by p29, and suggests that normal S1, S2 and S3 are required for the symptoms caused by MyRV1.

  12. HapFABIA: Identification of very short segments of identity by descent characterized by rare variants in large sequencing data

    PubMed Central

    Hochreiter, Sepp

    2013-01-01

    Identity by descent (IBD) can be reliably detected for long shared DNA segments, which are found in related individuals. However, many studies contain cohorts of unrelated individuals that share only short IBD segments. New sequencing technologies facilitate identification of short IBD segments through rare variants, which convey more information on IBD than common variants. Current IBD detection methods, however, are not designed to use rare variants for the detection of short IBD segments. Short IBD segments reveal genetic structures at high resolution. Therefore, they can help to improve imputation and phasing, to increase genotyping accuracy for low-coverage sequencing and to increase the power of association studies. Since short IBD segments are further assumed to be old, they can shed light on the evolutionary history of humans. We propose HapFABIA, a computational method that applies biclustering to identify very short IBD segments characterized by rare variants. HapFABIA is designed to detect short IBD segments in genotype data that were obtained from next-generation sequencing, but can also be applied to DNA microarray data. Especially in next-generation sequencing data, HapFABIA exploits rare variants for IBD detection. HapFABIA significantly outperformed competing algorithms at detecting short IBD segments on artificial and simulated data with rare variants. HapFABIA identified 160 588 different short IBD segments characterized by rare variants with a median length of 23 kb (mean 24 kb) in data for chromosome 1 of the 1000 Genomes Project. These short IBD segments contain 752 000 single nucleotide variants (SNVs), which account for 39% of the rare variants and 23.5% of all variants. The vast majority—152 000 IBD segments—are shared by Africans, while only 19 000 and 11 000 are shared by Europeans and Asians, respectively. IBD segments that match the Denisova or the Neandertal genome are found significantly more often in Asians and Europeans but also, in some cases exclusively, in Africans. The lengths of IBD segments and their sharing between continental populations indicate that many short IBD segments from chromosome 1 existed before humans migrated out of Africa. Thus, rare variants that tag these short IBD segments predate human migration from Africa. The software package HapFABIA is available from Bioconductor. All data sets, result files and programs for data simulation, preprocessing and evaluation are supplied at http://www.bioinf.jku.at/research/short-IBD. PMID:24174545

  13. Single haplotype assembly of the human genome from a hydatidiform mole.

    PubMed

    Steinberg, Karyn Meltz; Schneider, Valerie A; Graves-Lindsay, Tina A; Fulton, Robert S; Agarwala, Richa; Huddleston, John; Shiryev, Sergey A; Morgulis, Aleksandr; Surti, Urvashi; Warren, Wesley C; Church, Deanna M; Eichler, Evan E; Wilson, Richard K

    2014-12-01

    A complete reference assembly is essential for accurately interpreting individual genomes and associating variation with phenotypes. While the current human reference genome sequence is of very high quality, gaps and misassemblies remain due to biological and technical complexities. Large repetitive sequences and complex allelic diversity are the two main drivers of assembly error. Although increasing the length of sequence reads and library fragments can improve assembly, even the longest available reads do not resolve all regions. In order to overcome the issue of allelic diversity, we used genomic DNA from an essentially haploid hydatidiform mole, CHM1. We utilized several resources from this DNA including a set of end-sequenced and indexed BAC clones and 100× Illumina whole-genome shotgun (WGS) sequence coverage. We used the WGS sequence and the GRCh37 reference assembly to create an assembly of the CHM1 genome. We subsequently incorporated 382 finished BAC clone sequences to generate a draft assembly, CHM1_1.1 (NCBI AssemblyDB GCA_000306695.2). Analysis of gene, repetitive element, and segmental duplication content show this assembly to be of excellent quality and contiguity. However, comparison to assembly-independent resources, such as BAC clone end sequences and PacBio long reads, indicate misassembled regions. Most of these regions are enriched for structural variation and segmental duplication, and can be resolved in the future. This publicly available assembly will be integrated into the Genome Reference Consortium curation framework for further improvement, with the ultimate goal being a completely finished gap-free assembly. © 2014 Steinberg et al.; Published by Cold Spring Harbor Laboratory Press.

  14. Single haplotype assembly of the human genome from a hydatidiform mole

    PubMed Central

    Steinberg, Karyn Meltz; Schneider, Valerie A.; Graves-Lindsay, Tina A.; Fulton, Robert S.; Agarwala, Richa; Huddleston, John; Shiryev, Sergey A.; Morgulis, Aleksandr; Surti, Urvashi; Warren, Wesley C.; Church, Deanna M.; Eichler, Evan E.; Wilson, Richard K.

    2014-01-01

    A complete reference assembly is essential for accurately interpreting individual genomes and associating variation with phenotypes. While the current human reference genome sequence is of very high quality, gaps and misassemblies remain due to biological and technical complexities. Large repetitive sequences and complex allelic diversity are the two main drivers of assembly error. Although increasing the length of sequence reads and library fragments can improve assembly, even the longest available reads do not resolve all regions. In order to overcome the issue of allelic diversity, we used genomic DNA from an essentially haploid hydatidiform mole, CHM1. We utilized several resources from this DNA including a set of end-sequenced and indexed BAC clones and 100× Illumina whole-genome shotgun (WGS) sequence coverage. We used the WGS sequence and the GRCh37 reference assembly to create an assembly of the CHM1 genome. We subsequently incorporated 382 finished BAC clone sequences to generate a draft assembly, CHM1_1.1 (NCBI AssemblyDB GCA_000306695.2). Analysis of gene, repetitive element, and segmental duplication content show this assembly to be of excellent quality and contiguity. However, comparison to assembly-independent resources, such as BAC clone end sequences and PacBio long reads, indicate misassembled regions. Most of these regions are enriched for structural variation and segmental duplication, and can be resolved in the future. This publicly available assembly will be integrated into the Genome Reference Consortium curation framework for further improvement, with the ultimate goal being a completely finished gap-free assembly. PMID:25373144

  15. Entropic Profiler – detection of conservation in genomes using information theory

    PubMed Central

    Fernandes, Francisco; Freitas, Ana T; Almeida, Jonas S; Vinga, Susana

    2009-01-01

    Background In the last decades, with the successive availability of whole genome sequences, many research efforts have been made to mathematically model DNA. Entropic Profiles (EP) were proposed recently as a new measure of continuous entropy of genome sequences. EP represent local information plots related to DNA randomness and are based on information theory and statistical concepts. They express the weighed relative abundance of motifs for each position in genomes. Their study is very relevant because under or over-representation segments are often associated with significant biological meaning. Findings The Entropic Profiler application here presented is a new tool designed to detect and extract under and over-represented DNA segments in genomes by using EP. It allows its computation in a very efficient way by recurring to improved algorithms and data structures, which include modified suffix trees. Available through a web interface and as downloadable source code, it allows to study positions and to search for motifs inside the whole sequence or within a specified range. DNA sequences can be entered from different sources, including FASTA files, pre-loaded examples or resuming a previously saved work. Besides the EP value plots, p-values and z-scores for each motif are also computed, along with the Chaos Game Representation of the sequence. Conclusion EP are directly related with the statistical significance of motifs and can be considered as a new method to extract and classify significant regions in genomes and estimate local scales in DNA. The present implementation establishes an efficient and useful tool for whole genome analysis. PMID:19416538

  16. Expansion of the receptor-like kinase/Pelle gene family and receptor-like proteins in Arabidopsis.

    PubMed

    Shiu, Shin Han; Bleecker, Anthony B

    2003-06-01

    Receptor-like kinases (RLKs) are a family of transmembrane proteins with versatile N-terminal extracellular domains and C-terminal intracellular kinases. They control a wide range of physiological responses in plants and belong to one of the largest gene families in the Arabidopsis genome with more than 600 members. Interestingly, this gene family constitutes 60% of all kinases in Arabidopsis and accounts for nearly all transmembrane kinases in Arabidopsis. Analysis of four fungal, six metazoan, and two Plasmodium sp. genomes indicates that the family was represented in all but fungal genomes, indicating an ancient origin for the family with a more recent expansion only in the plant lineages. The RLK/Pelle family can be divided into several subfamilies based on three independent criteria: the phylogeny based on kinase domain sequences, the extracellular domain identities, and intron locations and phases. A large number of receptor-like proteins (RLPs) resembling the extracellular domains of RLKs are also found in the Arabidopsis genome. However, not all RLK subfamilies have corresponding RLPs. Several RLK/Pelle subfamilies have undergone differential expansions. More than 33% of the RLK/Pelle members are found in tandem clusters, substantially higher than the genome average. In addition, 470 of the RLK/Pelle family members are located within the segmentally duplicated regions in the Arabidopsis genome and 268 of them have a close relative in the corresponding regions. Therefore, tandem duplications and segmental/whole-genome duplications represent two of the major mechanisms for the expansion of the RLK/Pelle family in Arabidopsis.

  17. A Genealogical Look at Shared Ancestry on the X Chromosome.

    PubMed

    Buffalo, Vince; Mount, Stephen M; Coop, Graham

    2016-09-01

    Close relatives can share large segments of their genome identical by descent (IBD) that can be identified in genome-wide polymorphism data sets. There are a range of methods to use these IBD segments to identify relatives and estimate their relationship. These methods have focused on sharing on the autosomes, as they provide a rich source of information about genealogical relationships. We hope to learn additional information about recent ancestry through shared IBD segments on the X chromosome, but currently lack the theoretical framework to use this information fully. Here, we fill this gap by developing probability distributions for the number and length of X chromosome segments shared IBD between an individual and an ancestor k generations back, as well as between half- and full-cousin relationships. Due to the inheritance pattern of the X and the fact that X homologous recombination occurs only in females (outside of the pseudoautosomal regions), the number of females along a genealogical lineage is a key quantity for understanding the number and length of the IBD segments shared among relatives. When inferring relationships among individuals, the number of female ancestors along a genealogical lineage will often be unknown. Therefore, our IBD segment length and number distributions marginalize over this unknown number of recombinational meioses through a distribution of recombinational meioses we derive. By using Bayes' theorem to invert these distributions, we can estimate the number of female ancestors between two relatives, giving us details about the genealogical relations between individuals not possible with autosomal data alone. Copyright © 2016 by the Genetics Society of America.

  18. Two pheromone precursor genes are transcriptionally expressed in the homothallic ascomycete Sordaria macrospora.

    PubMed

    Pöggeler, S

    2000-06-01

    In order to analyze the involvement of pheromones in cell recognition and mating in a homothallic fungus, two putative pheromone precursor genes, named ppg1 and ppg2, were isolated from a genomic library of Sordaria macrospora. The ppg1 gene is predicted to encode a precursor pheromone that is processed by a Kex2-like protease to yield a pheromone that is structurally similar to the alpha-factor of the yeast Saccharomyces cerevisiae. The ppg2 gene encodes a 24-amino-acid polypeptide that contains a putative farnesylated and carboxy methylated C-terminal cysteine residue. The sequences of the predicted pheromones display strong structural similarity to those encoded by putative pheromones of heterothallic filamentous ascomycetes. Both genes are expressed during the life cycle of S. macrospora. This is the first description of pheromone precursor genes encoded by a homothallic fungus. Southern-hybridization experiments indicated that ppg1 and ppg2 homologues are also present in other homothallic ascomycetes.

  19. Identification of Isopentenol Biosynthetic Genes from Bacillus subtilis by a Screening Method Based on Isoprenoid Precursor Toxicity▿

    PubMed Central

    Withers, Sydnor T.; Gottlieb, Shayin S.; Lieu, Bonny; Newman, Jack D.; Keasling, Jay D.

    2007-01-01

    We have developed a novel method to clone terpene synthase genes. This method relies on the inherent toxicity of the prenyl diphosphate precursors to terpenes, which resulted in a reduced-growth phenotype. When these precursors were consumed by a terpene synthase, normal growth was restored. We have demonstrated that this method is capable of enriching a population of engineered Escherichia coli for those clones that express the sesquiterpene-producing amorphadiene synthase. In addition, we enriched a library of genomic DNA from the isoprene-producing bacterium Bacillus subtilis strain 6051 in E. coli engineered to produce elevated levels of isopentenyl diphosphate and dimethylallyl diphosphate. The selection resulted in the discovery of two genes (yhfR and nudF) whose protein products acted directly on the prenyl diphosphate precursors and produced isopentenol. Expression of nudF in E. coli engineered with the mevalonate-based isopentenyl pyrophosphate biosynthetic pathway resulted in the production of isopentenol. PMID:17693564

  20. Discovery of human inversion polymorphisms by comparative analysis of human and chimpanzee DNA sequence assemblies.

    PubMed

    Feuk, Lars; MacDonald, Jeffrey R; Tang, Terence; Carson, Andrew R; Li, Martin; Rao, Girish; Khaja, Razi; Scherer, Stephen W

    2005-10-01

    With a draft genome-sequence assembly for the chimpanzee available, it is now possible to perform genome-wide analyses to identify, at a submicroscopic level, structural rearrangements that have occurred between chimpanzees and humans. The goal of this study was to investigate chromosomal regions that are inverted between the chimpanzee and human genomes. Using the net alignments for the builds of the human and chimpanzee genome assemblies, we identified a total of 1,576 putative regions of inverted orientation, covering more than 154 mega-bases of DNA. The DNA segments are distributed throughout the genome and range from 23 base pairs to 62 mega-bases in length. For the 66 inversions more than 25 kilobases (kb) in length, 75% were flanked on one or both sides by (often unrelated) segmental duplications. Using PCR and fluorescence in situ hybridization we experimentally validated 23 of 27 (85%) semi-randomly chosen regions; the largest novel inversion confirmed was 4.3 mega-bases at human Chromosome 7p14. Gorilla was used as an out-group to assign ancestral status to the variants. All experimentally validated inversion regions were then assayed against a panel of human samples and three of the 23 (13%) regions were found to be polymorphic in the human genome. These polymorphic inversions include 730 kb (at 7p22), 13 kb (at 7q11), and 1 kb (at 16q24) fragments with a 5%, 30%, and 48% minor allele frequency, respectively. Our results suggest that inversions are an important source of variation in primate genome evolution. The finding of at least three novel inversion polymorphisms in humans indicates this type of structural variation may be a more common feature of our genome than previously realized.

  1. Genetic Diversity of the Ordinary Strain of Potato virus Y (PVY) and Origin of Recombinant PVY Strains

    PubMed Central

    Karasev, Alexander V.; Hu, Xiaojun; Brown, Celeste J.; Kerlan, Camille; Nikolaeva, Olga V.; Crosslin, James M.; Gray, Stewart M.

    2011-01-01

    The ordinary strain of Potato virus Y (PVY), PVYO, causes mild mosaic in tobacco and induces necrosis and severe stunting in potato cultivars carrying the Ny gene. A novel substrain of PVYO was recently reported, PVYO-O5, which is spreading in the United States and is distinguished from other PVYO isolates serologically (i.e., reacting to the otherwise PVYN-specific monoclonal antibody 1F5). To characterize this new PVYO-O5 subgroup and address possible reasons for its continued spread, we conducted a molecular study of PVYO and PVYO-O5 isolates from a North American collection of PVY through whole-genome sequencing and phylogenetic analysis. In all, 44 PVYO isolates were sequenced, including 31 from the previously defined PVYO-O5 group, and subjected to whole-genome analysis. PVYO-O5 isolates formed a separate lineage within the PVYO genome cluster in the whole-genome phylogenetic tree and represented a novel evolutionary lineage of PVY from potato. On the other hand, the PVYO sequences separated into at least two distinct lineages on the whole-genome phylogenetic tree. To shed light on the origin of the three most common PVY recombinants, a more detailed phylogenetic analysis of a sequence fragment, nucleotides 2,406 to 5,821, that is present in all recombinant and nonrecombinant PVYO genomes was conducted. The analysis revealed that PVYN:O and PVYN-Wi recombinants acquired their PVYO segments from two separate PVYO lineages, whereas the PVYNTN recombinant acquired its PVYO segment from the same lineage as PVYN:O. These data suggest that PVYN:O and PVYN-Wi recombinants originated from two separate recombination events involving two different PVYO parental genomes, whereas the PVYNTN recombinants likely originated from the PVYN:O genome via additional recombination events. PMID:21675922

  2. Transcriptomic identification of starfish neuropeptide precursors yields new insights into neuropeptide evolution

    PubMed Central

    Semmens, Dean C.; Mirabeau, Olivier; Moghul, Ismail; Pancholi, Mahesh R.; Wurm, Yannick; Elphick, Maurice R.

    2016-01-01

    Neuropeptides are evolutionarily ancient mediators of neuronal signalling in nervous systems. With recent advances in genomics/transcriptomics, an increasingly wide range of species has become accessible for molecular analysis. The deuterostomian invertebrates are of particular interest in this regard because they occupy an ‘intermediate' position in animal phylogeny, bridging the gap between the well-studied model protostomian invertebrates (e.g. Drosophila melanogaster, Caenorhabditis elegans) and the vertebrates. Here we have identified 40 neuropeptide precursors in the starfish Asterias rubens, a deuterostomian invertebrate from the phylum Echinodermata. Importantly, these include kisspeptin-type and melanin-concentrating hormone-type precursors, which are the first to be discovered in a non-chordate species. Starfish tachykinin-type, somatostatin-type, pigment-dispersing factor-type and corticotropin-releasing hormone-type precursors are the first to be discovered in the echinoderm/ambulacrarian clade of the animal kingdom. Other precursors identified include vasopressin/oxytocin-type, gonadotropin-releasing hormone-type, thyrotropin-releasing hormone-type, calcitonin-type, cholecystokinin/gastrin-type, orexin-type, luqin-type, pedal peptide/orcokinin-type, glycoprotein hormone-type, bursicon-type, relaxin-type and insulin-like growth factor-type precursors. This is the most comprehensive identification of neuropeptide precursor proteins in an echinoderm to date, yielding new insights into the evolution of neuropeptide signalling systems. Furthermore, these data provide a basis for experimental analysis of neuropeptide function in the unique context of the decentralized, pentaradial echinoderm bauplan. PMID:26865025

  3. Whole-Genome Analysis of a Novel Fish Reovirus (MsReV) Discloses Aquareovirus Genomic Structure Relationship with Host in Saline Environments.

    PubMed

    Chen, Zhong-Yuan; Gao, Xiao-Chan; Zhang, Qi-Ya

    2015-08-03

    Aquareoviruses are serious pathogens of aquatic animals. Here, genome characterization and functional gene analysis of a novel aquareovirus, largemouth bass Micropterus salmoides reovirus (MsReV), was described. It comprises 11 dsRNA segments (S1-S11) covering 24,024 bp, and encodes 12 putative proteins including the inclusion forming-related protein NS87 and the fusion-associated small transmembrane (FAST) protein NS22. The function of NS22 was confirmed by expression in fish cells. Subsequently, MsReV was compared with two representative aquareoviruses, saltwater fish turbot Scophthalmus maximus reovirus (SMReV) and freshwater fish grass carp reovirus strain 109 (GCReV-109). MsReV NS87 and NS22 genes have the same structure and function with those of SMReV, whereas GCReV-109 is either missing the coiled-coil region in NS79 or the gene-encoding NS22. Significant similarities are also revealed among equivalent genome segments between MsReV and SMReV, but a difference is found between MsReV and GCReV-109. Furthermore, phylogenetic analysis showed that 13 aquareoviruses could be divided into freshwater and saline environments subgroups, and MsReV was closely related to SMReV in saline environments. Consequently, these viruses from hosts in saline environments have more genomic structural similarities than the viruses from hosts in freshwater. This is the first study of the relationships between aquareovirus genomic structure and their host environments.

  4. Discovery of novel representatives of bilaterian neuropeptide families and reconstruction of neuropeptide precursor evolution in ophiuroid echinoderms

    PubMed Central

    Abylkassimova, Nikara; Hugall, Andrew F.; O'Hara, Timothy D.; Elphick, Maurice R.

    2017-01-01

    Neuropeptides are a diverse class of intercellular signalling molecules that mediate neuronal regulation of many physiological and behavioural processes. Recent advances in genome/transcriptome sequencing are enabling identification of neuropeptide precursor proteins in species from a growing variety of animal taxa, providing new insights into the evolution of neuropeptide signalling. Here, detailed analysis of transcriptome sequence data from three brittle star species, Ophionotus victoriae, Amphiura filiformis and Ophiopsila aranea, has enabled the first comprehensive identification of neuropeptide precursors in the class Ophiuroidea of the phylum Echinodermata. Representatives of over 30 bilaterian neuropeptide precursor families were identified, some of which occur as paralogues. Furthermore, homologues of endothelin/CCHamide, eclosion hormone, neuropeptide-F/Y and nucleobinin/nesfatin were discovered here in a deuterostome/echinoderm for the first time. The majority of ophiuroid neuropeptide precursors contain a single copy of a neuropeptide, but several precursors comprise multiple copies of identical or non-identical, but structurally related, neuropeptides. Here, we performed an unprecedented investigation of the evolution of neuropeptide copy number over a period of approximately 270 Myr by analysing sequence data from over 50 ophiuroid species, with reference to a robust phylogeny. Our analysis indicates that the composition of neuropeptide ‘cocktails’ is functionally important, but with plasticity over long evolutionary time scales. PMID:28878039

  5. Genomic Sequences of Australian Bluetongue Virus Prototype Serotypes Reveal Global Relationships and Possible Routes of Entry into Australia

    PubMed Central

    Bulach, Dieter M.; Amos-Ritchie, Rachel; Adams, Mathew M.; Walker, Peter J.; Weir, Richard

    2012-01-01

    Bluetongue virus (BTV) is transmitted by biting midges (Culicoides spp.). It causes disease mainly in sheep and occasionally in cattle and other species. BTV has spread into northern Europe, causing disease in sheep and cattle. The introduction of new serotypes, changes in vector species, and climate change have contributed to these changes. Ten BTV serotypes have been isolated in Australia without apparent associated disease. Simplified methods for preferential isolation of double-stranded RNA (dsRNA) and template preparation enabled high-throughput sequencing of the 10 genome segments of all Australian BTV prototype serotypes. Phylogenetic analysis reinforced the Western and Eastern topotypes previously characterized but revealed unique features of several Australian BTVs. Many of the Australian BTV genome segments (Seg-) were closely related, clustering together within the Eastern topotypes. A novel Australian topotype for Seg-5 (NS1) was identified, with taxa spread across several serotypes and over time. Seg-1, -2, -3, -4, -6, -7, -9, and -10 of BTV_2_AUS_2008 were most closely related to the cognate segments of viruses from Taiwan and Asia and not other Australian viruses, supporting the conclusion that BTV_2 entered Australia recently. The Australian BTV_15_AUS_1982 prototype was revealed to be unusual among the Australian BTV isolates, with Seg-3 and -8 distantly related to other BTV sequences from all serotypes. PMID:22514341

  6. Fitness cost of reassortment in human influenza.

    PubMed

    Villa, Mara; Lässig, Michael

    2017-11-01

    Reassortment, which is the exchange of genome sequence between viruses co-infecting a host cell, plays an important role in the evolution of segmented viruses. In the human influenza virus, reassortment happens most frequently between co-existing variants within the same lineage. This process breaks genetic linkage and fitness correlations between viral genome segments, but the resulting net effect on viral fitness has remained unclear. In this paper, we determine rate and average selective effect of reassortment processes in the human influenza lineage A/H3N2. For the surface proteins hemagglutinin and neuraminidase, reassortant variants with a mean distance of at least 3 nucleotides to their parent strains get established at a rate of about 10-2 in units of the neutral point mutation rate. Our inference is based on a new method to map reassortment events from joint genealogies of multiple genome segments, which is tested by extensive simulations. We show that intra-lineage reassortment processes are, on average, under substantial negative selection that increases in strength with increasing sequence distance between the parent strains. The deleterious effects of reassortment manifest themselves in two ways: there are fewer reassortment events than expected from a null model of neutral reassortment, and reassortant strains have fewer descendants than their non-reassortant counterparts. Our results suggest that influenza evolves under ubiquitous epistasis across proteins, which produces fitness barriers against reassortment even between co-circulating strains within one lineage.

  7. Estimating reassortment rates in co-circulating Eurasian swine influenza viruses

    PubMed Central

    Baillie, G.; Coulter, E.; Bhatt, S.; Kellam, P.; McCauley, J. W.; Wood, J. L. N.; Brown, I. H.; Pybus, O. G.; Leigh Brown, A. J.

    2012-01-01

    Swine have often been considered as a mixing vessel for different influenza strains. In order to assess their role in more detail, we undertook a retrospective sequencing study to detect and characterize the reassortants present in European swine and to estimate the rate of reassortment between H1N1, H1N2 and H3N2 subtypes with Eurasian (avian-like) internal protein-coding segments. We analysed 69 newly obtained whole genome sequences of subtypes H1N1–H3N2 from swine influenza viruses sampled between 1982 and 2008, using Illumina and 454 platforms. Analyses of these genomes, together with previously published genomes, revealed a large monophyletic clade of Eurasian swine-lineage polymerase segments containing H1N1, H1N2 and H3N2 subtypes. We subsequently examined reassortments between the haemagglutinin and neuraminidase segments and estimated the reassortment rates between lineages using a recently developed evolutionary analysis method. High rates of reassortment between H1N2 and H1N1 Eurasian swine lineages were detected in European strains, with an average of one reassortment every 2–3 years. This rapid reassortment results from co-circulating lineages in swine, and in consequence we should expect further reassortments between currently circulating swine strains and the recent swine-origin H1N1v pandemic strain. PMID:22971819

  8. Immunoglobulin genomics in the guinea pig (Cavia porcellus).

    PubMed

    Guo, Yongchen; Bao, Yonghua; Meng, Qingwen; Hu, Xiaoxiang; Meng, Qingyong; Ren, Liming; Li, Ning; Zhao, Yaofeng

    2012-01-01

    In science, the guinea pig is known as one of the gold standards for modeling human disease. It is especially important as a molecular and cellular biology model for studying the human immune system, as its immunological genes are more similar to human genes than are those of mice. The utility of the guinea pig as a model organism can be further enhanced by further characterization of the genes encoding components of the immune system. Here, we report the genomic organization of the guinea pig immunoglobulin (Ig) heavy and light chain genes. The guinea pig IgH locus is located in genomic scaffolds 54 and 75, and spans approximately 6,480 kb. 507 V(H) segments (94 potentially functional genes and 413 pseudogenes), 41 D(H) segments, six J(H) segments, four constant region genes (μ, γ, ε, and α), and one reverse δ remnant fragment were identified within the two scaffolds. Many V(H) pseudogenes were found within the guinea pig, and likely constituted a potential donor pool for gene conversion during evolution. The Igκ locus mapped to a 4,029 kb region of scaffold 37 and 24 is composed of 349 V(κ) (111 potentially functional genes and 238 pseudogenes), three J(κ) and one C(κ) genes. The Igλ locus spans 1,642 kb in scaffold 4 and consists of 142 V(λ) (58 potentially functional genes and 84 pseudogenes) and 11 J(λ) -C(λ) clusters. Phylogenetic analysis suggested the guinea pig's large germline V(H) gene segments appear to form limited gene families. Therefore, this species may generate antibody diversity via a gene conversion-like mechanism associated with its pseudogene reserves.

  9. Human-Specific Duplication and Mosaic Transcripts: The Recent Paralogous Structure of Chromosome 22

    PubMed Central

    Bailey, Jeffrey A. ; Yavor, Amy M. ; Viggiano, Luigi ; Misceo, Doriana ; Horvath, Juliann E. ; Archidiacono, Nicoletta ; Schwartz, Stuart ; Rocchi, Mariano ; Eichler, Evan E. 

    2002-01-01

    In recent decades, comparative chromosomal banding, chromosome painting, and gene-order studies have shown strong conservation of gross chromosome structure and gene order in mammals. However, findings from the human genome sequence suggest an unprecedented degree of recent (<35 million years ago) segmental duplication. This dynamism of segmental duplications has important implications in disease and evolution. Here we present a chromosome-wide view of the structure and evolution of the most highly homologous duplications (⩾1 kb and ⩾90%) on chromosome 22. Overall, 10.8% (3.7/33.8 Mb) of chromosome 22 is duplicated, with an average sequence identity of 95.4%. To organize the duplications into tractable units, intron-exon structure and well-defined duplication boundaries were used to define 78 duplicated modules (minimally shared evolutionary segments) with 157 copies on chromosome 22. Analysis of these modules provides evidence for the creation or modification of 11 novel transcripts. Comparative FISH analyses of human, chimpanzee, gorilla, orangutan, and macaque reveal qualitative and quantitative differences in the distribution of these duplications—consistent with their recent origin. Several duplications appear to be human specific, including a ∼400-kb duplication (99.4%–99.8% sequence identity) that transposed from chromosome 14 to the most proximal pericentromeric region of chromosome 22. Experimental and in silico data further support a pericentromeric gradient of duplications where the most recent duplications transpose adjacent to the centromere. Taken together, these data suggest that segmental duplications have been an ongoing process of primate genome evolution, contributing to recent gene innovation and the dynamic transformation of genome architecture within and among closely related species. PMID:11731936

  10. Copy number variation at the 7q11.23 segmental duplications is a susceptibility factor for the Williams-Beuren syndrome deletion

    PubMed Central

    Cuscó, Ivon; Corominas, Roser; Bayés, Mònica; Flores, Raquel; Rivera-Brugués, Núria; Campuzano, Victoria; Pérez-Jurado, Luis A.

    2008-01-01

    Large copy number variants (CNVs) have been recently found as structural polymorphisms of the human genome of still unknown biological significance. CNVs are significantly enriched in regions with segmental duplications or low-copy repeats (LCRs). Williams-Beuren syndrome (WBS) is a neurodevelopmental disorder caused by a heterozygous deletion of contiguous genes at 7q11.23 mediated by nonallelic homologous recombination (NAHR) between large flanking LCRs and facilitated by a structural variant of the region, a ∼2-Mb paracentric inversion present in 20%–25% of WBS-transmitting progenitors. We now report that eight out of 180 (4.44%) WBS-transmitting progenitors are carriers of a CNV, displaying a chromosome with large deletion of LCRs. The prevalence of this CNV among control individuals and non-transmitting progenitors is much lower (1%, n = 600), thus indicating that it is a predisposing factor for the WBS deletion (odds ratio 4.6-fold, P = 0.002). LCR duplications were found in 2.22% of WBS-transmitting progenitors but also in 1.16% of controls, which implies a non–statistically significant increase in WBS-transmitting progenitors. We have characterized the organization and breakpoints of these CNVs, encompassing ∼100–300 kb of genomic DNA and containing several pseudogenes but no functional genes. Additional structural variants of the region have also been defined, all generated by NAHR between different blocks of segmental duplications. Our data further illustrate the highly dynamic structure of regions rich in segmental duplications, such as the WBS locus, and indicate that large CNVs can act as susceptibility alleles for disease-associated genomic rearrangements in the progeny. PMID:18292220

  11. Differentially regulated splice variants and systems biology analysis of Kaposi's sarcoma-associated herpesvirus-infected lymphatic endothelial cells.

    PubMed

    Chang, Ting-Yu; Wu, Yu-Hsuan; Cheng, Cheng-Chung; Wang, Hsei-Wei

    2011-09-01

    Alternative RNA splicing greatly increases proteome diversity, and the possibility of studying genome-wide alternative splicing (AS) events becomes available with the advent of high-throughput genomics tools devoted to this issue. Kaposi's sarcoma associated herpesvirus (KSHV) is the etiological agent of KS, a tumor of lymphatic endothelial cell (LEC) lineage, but little is known about the AS variations induced by KSHV. We analyzed KSHV-controlled AS using high-density microarrays capable of detecting all exons in the human genome. Splicing variants and altered exon-intron usage in infected LEC were found, and these correlated with protein domain modification. The different 3'-UTR used in new transcripts also help isoforms to escape microRNA-mediated surveillance. Exome-level analysis further revealed information that cannot be disclosed using classical gene-level profiling: a significant exon usage difference existed between LEC and CD34(+) precursor cells, and KSHV infection resulted in LEC-to-precursor, dedifferentiation-like exon level reprogramming. Our results demonstrate the application of exon arrays in systems biology research, and suggest the regulatory effects of AS in endothelial cells are far more complex than previously observed. This extra layer of molecular diversity helps to account for various aspects of endothelial biology, KSHV life cycle and disease pathogenesis that until now have been unexplored.

  12. Three-Dimensional Genome Organization and Function in Drosophila

    PubMed Central

    Schwartz, Yuri B.; Cavalli, Giacomo

    2017-01-01

    Understanding how the metazoan genome is used during development and cell differentiation is one of the major challenges in the postgenomic era. Early studies in Drosophila suggested that three-dimensional (3D) chromosome organization plays important regulatory roles in this process and recent technological advances started to reveal connections at the molecular level. Here we will consider general features of the architectural organization of the Drosophila genome, providing historical perspective and insights from recent work. We will compare the linear and spatial segmentation of the fly genome and focus on the two key regulators of genome architecture: insulator components and Polycomb group proteins. With its unique set of genetic tools and a compact, well annotated genome, Drosophila is poised to remain a model system of choice for rapid progress in understanding principles of genome organization and to serve as a proving ground for development of 3D genome-engineering techniques. PMID:28049701

  13. Complete genome sequence of 'Mycobacterium neoaurum' NRRL B-3805, an androstenedione (AD) producer for industrial biotransformation of sterols.

    PubMed

    Rodríguez-García, Antonio; Fernández-Alegre, Estela; Morales, Alejandro; Sola-Landa, Alberto; Lorraine, Jess; Macdonald, Sandy; Dovbnya, Dmitry; Smith, Margaret C M; Donova, Marina; Barreiro, Carlos

    2016-04-20

    Microbial bioconversion of sterols into high value steroid precursors, such as 4-androstene-3,17-dione (AD), is an industrial challenge. Genes and enzymes involved in sterol degradation have been proposed, although the complete pathway is not yet known. The genome sequencing of the AD producer strain 'Mycobacterium neoaurum' NRRL B-3805 (formerly Mycobacterium sp. NRRL B-3805) will serve to elucidate the critical steps for industrial processes and will provide the basis for further genetic engineering. The genome comprises a circular chromosome (5 421 338bp), is devoid of plasmids and contains 4844 protein-coding genes. Copyright © 2016 Elsevier B.V. All rights reserved.

  14. Somatic POLE exonuclease domain mutations are early events in sporadic endometrial and colorectal carcinogenesis, determining driver mutational landscape, clonal neoantigen burden and immune response.

    PubMed

    Temko, Daniel; Van Gool, Inge C; Rayner, Emily; Glaire, Mark; Makino, Seiko; Brown, Matthew; Chegwidden, Laura; Palles, Claire; Depreeuw, Jeroen; Beggs, Andrew; Stathopoulou, Chaido; Mason, John; Baker, Ann-Marie; Williams, Marc; Cerundolo, Vincenzo; Rei, Margarida; Taylor, Jenny C; Schuh, Anna; Ahmed, Ahmed; Amant, Frédéric; Lambrechts, Diether; Smit, Vincent Thbm; Bosse, Tjalling; Graham, Trevor A; Church, David N; Tomlinson, Ian

    2018-03-31

    Genomic instability, which is a hallmark of cancer, is generally thought to occur in the middle to late stages of tumourigenesis, following the acquisition of permissive molecular aberrations such as TP53 mutation or whole genome doubling. Tumours with somatic POLE exonuclease domain mutations are notable for their extreme genomic instability (their mutation burden is among the highest in human cancer), distinct mutational signature, lymphocytic infiltrate, and excellent prognosis. To what extent these characteristics are determined by the timing of POLE mutations in oncogenesis is unknown. Here, we have shown that pathogenic POLE mutations are detectable in non-malignant precursors of endometrial and colorectal cancer. Using genome and exome sequencing, we found that multiple driver mutations in POLE-mutant cancers show the characteristic POLE mutational signature, including those in genes conventionally regarded as initiators of tumourigenesis. In POLE-mutant cancers, the proportion of monoclonal predicted neoantigens was similar to that in other cancers, but the absolute number was much greater. We also found that the prominent CD8 + T-cell infiltrate present in POLE-mutant cancers was evident in their precursor lesions. Collectively, these data indicate that somatic POLE mutations are early, quite possibly initiating, events in the endometrial and colorectal cancers in which they occur. The resulting early onset of genomic instability may account for the striking immune response and excellent prognosis of these tumours, as well as their early presentation. © 2018 The Authors. The Journal of Pathology published by John Wiley & Sons Ltd on behalf of Pathological Society of Great Britain and Ireland. © 2018 The Authors. The Journal of Pathology published by John Wiley & Sons Ltd on behalf of Pathological Society of Great Britain and Ireland.

  15. A conserved segmental duplication within ELA.

    PubMed

    Brinkmeyer-Langford, C L; Murphy, W J; Childers, C P; Skow, L C

    2010-12-01

    The assembled genomic sequence of the horse major histocompatibility complex (MHC) (equine lymphocyte antigen, ELA) is very similar to the homologous human HLA, with the notable exception of a large segmental duplication at the boundary of ELA class I and class III that is absent in HLA. The segmental duplication consists of a ∼ 710 kb region of at least 11 repeated blocks: 10 blocks each contain an MHC class I-like sequence and the helicase domain portion of a BAT1-like sequence, and the remaining unit contains the full-length BAT1 gene. Similar genomic features were found in other Perissodactyls, indicating an ancient origin, which is consistent with phylogenetic analyses. Reverse-transcriptase PCR (RT-PCR) of mRNA from peripheral white blood cells of healthy and chronically or acutely infected horses detected transcription from predicted open reading frames in several of the duplicated blocks. This duplication is not present in the sequenced MHCs of most other mammals, although a similar feature at the same relative position is present in the feline MHC (FLA). Striking sequence conservation throughout Perissodactyl evolution is consistent with a functional role for at least some of the genes included within this segmental duplication. © 2010 The Authors, Journal compilation © 2010 Stichting International Foundation for Animal Genetics.

  16. Partitioning of genetic variation between regulatory and coding gene segments: the predominance of software variation in genes encoding introvert proteins.

    PubMed

    Mitchison, A

    1997-01-01

    In considering genetic variation in eukaryotes, a fundamental distinction can be made between variation in regulatory (software) and coding (hardware) gene segments. For quantitative traits the bulk of variation, particularly that near the population mean, appears to reside in regulatory segments. The main exceptions to this rule concern proteins which handle extrinsic substances, here termed extrovert proteins. The immune system includes an unusually large proportion of this exceptional category, but even so its chief source of variation may well be polymorphism in regulatory gene segments. The main evidence for this view emerges from genome scanning for quantitative trait loci (QTL), which in the case of the immune system points to a major contribution of pro-inflammatory cytokine genes. Further support comes from sequencing of major histocompatibility complex (Mhc) class II promoters, where a high level of polymorphism has been detected. These Mhc promoters appear to act, in part at least, by gating the back-signal from T cells into antigen-presenting cells. Both these forms of polymorphism are likely to be sustained by the need for flexibility in the immune response. Future work on promoter polymorphism is likely to benefit from the input from genome informatics.

  17. Photoluminescence Segmentation within Individual Hexagonal Monolayer Tungsten Disulfide Domains Grown by Chemical Vapor Deposition.

    PubMed

    Sheng, Yuewen; Wang, Xiaochen; Fujisawa, Kazunori; Ying, Siqi; Elias, Ana Laura; Lin, Zhong; Xu, Wenshuo; Zhou, Yingqiu; Korsunsky, Alexander M; Bhaskaran, Harish; Terrones, Mauricio; Warner, Jamie H

    2017-05-03

    We show that hexagonal domains of monolayer tungsten disulfide (WS 2 ) grown by chemical vapor deposition (CVD) with powder precursors can have discrete segmentation in their photoluminescence (PL) emission intensity, forming symmetric patterns with alternating bright and dark regions. Two-dimensional maps of the PL reveal significant reduction within the segments associated with the longest sides of the hexagonal domains. Analysis of the PL spectra shows differences in the exciton to trion ratio, indicating variations in the exciton recombination dynamics. Monolayers of WS 2 hexagonal islands transferred to new substrates still exhibit this PL segmentation, ruling out local strain in the regions as the dominant cause. High-power laser irradiation causes preferential degradation of the bright segments by sulfur removal, indicating the presence of a more defective region that is higher in oxidative reactivity. Atomic force microscopy (AFM) images of topography and amplitude modes show uniform thickness of the WS 2 domains and no signs of segmentation. However, AFM phase maps do show the same segmentation of the domain as the PL maps and indicate that it is caused by some kind of structural difference that we could not clearly identify. These results provide important insights into the spatially varying properties of these CVD-grown transition metal dichalcogenide materials, which may be important for their effective implementation in fast photo sensors and optical switches.

  18. spiel ohne grenzen/pou2 is required for zebrafish hindbrain segmentation.

    PubMed

    Hauptmann, Giselbert; Belting, Heinz-Georg; Wolke, Uta; Lunde, Karen; Söll, Iris; Abdelilah-Seyfried, Salim; Prince, Victoria; Driever, Wolfgang

    2002-04-01

    Segmentation of the vertebrate hindbrain leads to the formation of a series of rhombomeres with distinct identities. In mouse, Krox20 and kreisler play important roles in specifying distinct rhombomeres and in controlling segmental identity by directly regulating rhombomere-specific expression of Hox genes. We show that spiel ohne grenzen (spg) zebrafish mutants develop rhombomeric territories that are abnormal in both size and shape. Rhombomere boundaries are malpositioned or absent and the segmental pattern of neuronal differentiation is perturbed. Segment-specific expression of hoxa2, hoxb2 and hoxb3 is severely affected during initial stages of hindbrain development in spg mutants and the establishment of krx20 (Krox20 ortholog) and valentino (val; kreisler ortholog) expression is impaired. spg mutants carry loss-of-function mutations in the pou2 gene. pou2 is expressed at high levels in the hindbrain primordium of wild-type embryos prior to activation of krx20 and val. Widespread overexpression of Pou2 can rescue the segmental krx20 and val domains in spg mutants, but does not induce ectopic expression of these genes. This suggests that spg/pou2 acts in a permissive manner and is essential for normal expression of krx20 and val. We propose that spg/pou2 is an essential component of the regulatory cascade controlling hindbrain segmentation and acts before krx20 and val in the establishment of rhombomere precursor territories.

  19. Position-dependent effects of polylysine on Sec protein transport.

    PubMed

    Liang, Fu-Cheng; Bageshwar, Umesh K; Musser, Siegfried M

    2012-04-13

    The bacterial Sec protein translocation system catalyzes the transport of unfolded precursor proteins across the cytoplasmic membrane. Using a recently developed real time fluorescence-based transport assay, the effects of the number and distribution of positive charges on the transport time and transport efficiency of proOmpA were examined. As expected, an increase in the number of lysine residues generally increased transport time and decreased transport efficiency. However, the observed effects were highly dependent on the polylysine position in the mature domain. In addition, a string of consecutive positive charges generally had a more significant effect on transport time and efficiency than separating the charges into two or more charged segments. Thirty positive charges distributed throughout the mature domain resulted in effects similar to 10 consecutive charges near the N terminus of the mature domain. These data support a model in which the local effects of positive charge on the translocation kinetics dominate over total thermodynamic constraints. The rapid translocation kinetics of some highly charged proOmpA mutants suggest that the charge is partially shielded from the electric field gradient during transport, possibly by the co-migration of counter ions. The transport times of precursors with multiple positively charged sequences, or "pause sites," were fairly well predicted by a local effect model. However, the kinetic profile predicted by this local effect model was not observed. Instead, the transport kinetics observed for precursors with multiple polylysine segments support a model in which translocation through the SecYEG pore is not the rate-limiting step of transport.

  20. Position-dependent Effects of Polylysine on Sec Protein Transport*

    PubMed Central

    Liang, Fu-Cheng; Bageshwar, Umesh K.; Musser, Siegfried M.

    2012-01-01

    The bacterial Sec protein translocation system catalyzes the transport of unfolded precursor proteins across the cytoplasmic membrane. Using a recently developed real time fluorescence-based transport assay, the effects of the number and distribution of positive charges on the transport time and transport efficiency of proOmpA were examined. As expected, an increase in the number of lysine residues generally increased transport time and decreased transport efficiency. However, the observed effects were highly dependent on the polylysine position in the mature domain. In addition, a string of consecutive positive charges generally had a more significant effect on transport time and efficiency than separating the charges into two or more charged segments. Thirty positive charges distributed throughout the mature domain resulted in effects similar to 10 consecutive charges near the N terminus of the mature domain. These data support a model in which the local effects of positive charge on the translocation kinetics dominate over total thermodynamic constraints. The rapid translocation kinetics of some highly charged proOmpA mutants suggest that the charge is partially shielded from the electric field gradient during transport, possibly by the co-migration of counter ions. The transport times of precursors with multiple positively charged sequences, or “pause sites,” were fairly well predicted by a local effect model. However, the kinetic profile predicted by this local effect model was not observed. Instead, the transport kinetics observed for precursors with multiple polylysine segments support a model in which translocation through the SecYEG pore is not the rate-limiting step of transport. PMID:22367204

  1. Comparative genomics reveals insights into avian genome evolution and adaptation.

    PubMed

    Zhang, Guojie; Li, Cai; Li, Qiye; Li, Bo; Larkin, Denis M; Lee, Chul; Storz, Jay F; Antunes, Agostinho; Greenwold, Matthew J; Meredith, Robert W; Ödeen, Anders; Cui, Jie; Zhou, Qi; Xu, Luohao; Pan, Hailin; Wang, Zongji; Jin, Lijun; Zhang, Pei; Hu, Haofu; Yang, Wei; Hu, Jiang; Xiao, Jin; Yang, Zhikai; Liu, Yang; Xie, Qiaolin; Yu, Hao; Lian, Jinmin; Wen, Ping; Zhang, Fang; Li, Hui; Zeng, Yongli; Xiong, Zijun; Liu, Shiping; Zhou, Long; Huang, Zhiyong; An, Na; Wang, Jie; Zheng, Qiumei; Xiong, Yingqi; Wang, Guangbiao; Wang, Bo; Wang, Jingjing; Fan, Yu; da Fonseca, Rute R; Alfaro-Núñez, Alonzo; Schubert, Mikkel; Orlando, Ludovic; Mourier, Tobias; Howard, Jason T; Ganapathy, Ganeshkumar; Pfenning, Andreas; Whitney, Osceola; Rivas, Miriam V; Hara, Erina; Smith, Julia; Farré, Marta; Narayan, Jitendra; Slavov, Gancho; Romanov, Michael N; Borges, Rui; Machado, João Paulo; Khan, Imran; Springer, Mark S; Gatesy, John; Hoffmann, Federico G; Opazo, Juan C; Håstad, Olle; Sawyer, Roger H; Kim, Heebal; Kim, Kyu-Won; Kim, Hyeon Jeong; Cho, Seoae; Li, Ning; Huang, Yinhua; Bruford, Michael W; Zhan, Xiangjiang; Dixon, Andrew; Bertelsen, Mads F; Derryberry, Elizabeth; Warren, Wesley; Wilson, Richard K; Li, Shengbin; Ray, David A; Green, Richard E; O'Brien, Stephen J; Griffin, Darren; Johnson, Warren E; Haussler, David; Ryder, Oliver A; Willerslev, Eske; Graves, Gary R; Alström, Per; Fjeldså, Jon; Mindell, David P; Edwards, Scott V; Braun, Edward L; Rahbek, Carsten; Burt, David W; Houde, Peter; Zhang, Yong; Yang, Huanming; Wang, Jian; Jarvis, Erich D; Gilbert, M Thomas P; Wang, Jun

    2014-12-12

    Birds are the most species-rich class of tetrapod vertebrates and have wide relevance across many research fields. We explored bird macroevolution using full genomes from 48 avian species representing all major extant clades. The avian genome is principally characterized by its constrained size, which predominantly arose because of lineage-specific erosion of repetitive elements, large segmental deletions, and gene loss. Avian genomes furthermore show a remarkably high degree of evolutionary stasis at the levels of nucleotide sequence, gene synteny, and chromosomal structure. Despite this pattern of conservation, we detected many non-neutral evolutionary changes in protein-coding genes and noncoding regions. These analyses reveal that pan-avian genomic diversity covaries with adaptations to different lifestyles and convergent evolution of traits. Copyright © 2014, American Association for the Advancement of Science.

  2. Homoallylglycine residues are superior precursors to orthogonally modified thioether containing polypeptides.

    PubMed

    Perlin, Pesach; Gharakhanian, Eric G; Deming, Timothy J

    2018-06-12

    Homoallylglycine N-carboxyanhydride, Hag NCA, monomers were synthesized and used to prepare polypeptides containing Hag segments with controllable lengths of up to 245 repeats. Poly(l-homoallylglycine), GHA, was found to adopt an α-helical conformation, which provided good solubility in organic solvents and allowed high yield functionalization of its alkene side-chains via radical promoted addition of thiols. The conformations of these derivatives were shown to be switchable between α-helical and disordered states in aqueous media using thioether alkylation or oxidation reactions. Incorporation of GHA segments into block copolymers with poly(l-methionine), M, segments provided a means to orthogonally modify thioether side-chains different ways in separate copolypeptide domains. This approach allows preparation of functional polypeptides containing discrete domains of oxidized and alkylated thioether containing residues, where chain conformation and functionality of each domain can be independently modified.

  3. Complete genome sequence of the first bluetongue virus serotype 7 isolate from China: evidence for entry of African-lineage strains and reassortment between the introduced and native strains.

    PubMed

    Yang, Heng; Lv, Minna; Sun, Minfei; Lin, Liqin; Kou, Meilin; Gao, Lin; Liao, Defang; Xiong, Heli; He, Yuwen; Li, Huachun

    2016-01-01

    Bluetongue virus (BTV) mainly infects sheep but can be transmitted to other domestic and wild ruminants, resulting in a considerable financial burden and trade restriction. Our understanding of the origin, movement, and distribution of BTV has been hindered by the fact that this virus has a segmented genome with the possibility of reassortment, the existence of 27 identified serotypes, and a lack of complete sequences of viruses isolated from different parts of the world. BTV serotype 7 is one of the prevalent BTV serotypes in Asia. Nonetheless, no complete genomic sequence of an Asian isolate of this serotype is available. In an effort to understand the molecular epidemiology of BTV infection in China, for the first time, we report here the complete genome sequence of a BTV serotype 7 strain, GDST008, which was isolated in 2014 in China. This sequence also represents the first complete genome sequence of a BTV serotype 7 from Asia and the third one in the world. Sequence analysis suggests that GDST008 consists of segments from BTV viruses of African lineage as well as those from China. Together, these results improve our understanding of the origin, emergence/re-emergence, and movement of BTV and thus can be applied in the development of vaccines and diagnostics.

  4. Genomics and its role in Cancer Risk Assessment

    EPA Science Inventory

    The traditional risk assessment paradigm is based on exposure - dose - response. The individual is exposed to a chemical or other stressor at some dose and a response in the organism or tissue is elicited. Though precursor events such as taret cell proliferation may be used as ...

  5. Genomics of Escherichia and Shigella

    NASA Astrophysics Data System (ADS)

    Perna, Nicole T.

    The laboratory workhorse Escherichia coli K-12 is among the most intensively studied living organisms on earth, and this single strain serves as the model system behind much of our understanding of prokaryotic molecular biology. Dense genome sequencing and recent insightful comparative analyses are making the species E. coli, as a whole, an emerging system for studying prokaryotic population genetics and the relationship between system-scale, or genome-scale, molecular evolution and complex traits like host range and pathogenic potential. Genomic perspective has revealed a coherent but dynamic species united by intraspecific gene flow via homologous lateral or horizontal transfer and differentiated by content flux mediated by acquisition of DNA segments from interspecies transfers.

  6. Industrialization and the increasing risk of genome instability in developing countries: nutrigenomics as a promising antidote.

    PubMed

    Anetor, J I

    2010-12-01

    Increased reliance on chemicals in the industrializing developing countries places new demands on them, as they have limited resources to adequately regulate exposure to these chemicals. Majority of the chemicals cause mutation in DNA among others. The consequences of increased exposure to chemicals on the genome and their mitigation by Nutrigenomics, a science concerned with the prevention of genome damage by nutritional factors is poorly recognized in these countries. Growing evidence indicates that genome instability in the absence of overt exposure to genotoxicants is a sensitive marker of nutritional deficiency. Therefore, the increasing prevalence of chemicals in these countries which contribute to genome disturbances and the widespread nutritional deficiency, at least double the risk of genome instability.Environmental pollutants such polychlorobiphenyls, metal fumes, and fly ash, common in these countries are known to increase urinary level of 8-hydroxy deoxyguanosine (8-OHdG), a marker of oxidative DNA damage, precursor of genome instability.Increasing evidence emphasizes the importance of zinc in both genetic stability and function. Zinc deficiency has been linked with oxidative stress, DNA damage and impairment of repair mechanisms as well as risk of cancer. Zinc plays an important role in vitamin A metabolism from which the retinoids are derived. Zinc is also an important component of the p53 protein, a DNA damage sensor which prevents genetic lesions contributing to genome instability.Zinc deficiency ranks among the top 10 leading causes of death in developing countries. A large proportion of the population in these countries ingests less than 50% of the RDA for Zn.This makes this genome protective nutrient among others grossly inadequate. Folate now also recognized for its role in genome stability, is among the nutrients frequently cited as critical to genome stability. Folate deficiency of sub- clinical degree is common. Reduced folate intake causes as much genome damage as that induced by exposure to a high dose of ionizing radiation. Even moderate folate deficiency causes very severe damage to the genome in the general population. All these accentuate the susceptibility of populations in these nations to environmental toxic assault requiring preventive measures employing the science of Nutrigenomics, probably augmented with adaptive response pathways such as the Nrf2 signaling pathway. Human populations in developing countries are increasingly exposed to a diverse array of industrial chemicals, which adversely modify the genome, the precursor of many diseases especially cancer. Nutrigenomics encompasses nutritional factors that protect the genome from damage and is a promising new field that can be exploited, perhaps augmented with the Nrf2 signaling pathway with international collaboration in these nations as an antidote to chemical-induced genome instability.

  7. Genome sequence analysis of CsRV1: a pathogenic reovirus that infects the blue crab Callinectes sapidus across its trans-hemispheric range

    USDA-ARS?s Scientific Manuscript database

    The blue crab, Callinectes sapidus (Rathbun 1896), which is a commercially important trophic link in coastal ecosystems of the western Atlantic, is infected in both North and South America by C. sapidus Reovirus 1 (CsRV1), a double stranded RNA virus. The 12 genome segments of a North American strai...

  8. Characterization of a highly polymorphic region 5′ to JH in the human immunoglobulin heavy chain

    PubMed Central

    Silva, Alcino J.; Johnson, John P.; White, Raymond L.

    1987-01-01

    A cloned DNA segment 1.25 kilobases (kb) upstream from the joining segments of the human heavy chain immunoglobulin gene revealed extensive polymorphic variation at this locus, and the polymorphic pattern was stably transmitted to the next generation. Genomic restriction analysis showed that the polymorphism was caused by insertions/deletions within an MspI/BamHI fragment. Sequencing of one allele, 848 base pairs (bp) long, revealed eleven 50-base-pair tandem repeats. A second allele, 648 bp long, was cloned from a human genomic cosmid library, sequenced, and found to contain four fewer repeats than the first allele. A survey of 186 chromosomes from unrelated individuals of primarily northern European descent revealed at least six alleles. Images PMID:2884636

  9. Mapping of Leaf Rust Resistance Genes and Molecular Characterization of the 2NS/2AS Translocation in the Wheat Cultivar Jagger.

    PubMed

    Xue, Shulin; Kolmer, James A; Wang, Shuwen; Yan, Liuling

    2018-05-31

    Winter wheat cultivar 'Jagger' was recently found to have an alien chromosomal segment 2NS that has Lr37 , a gene conferring resistance against leaf rust caused by Puccinia triticina The objective of this study was to map and characterize the gene(s) for seedling leaf rust resistance in Jagger. The recombinant inbred line (RIL) population of Jagger × '2174' was inoculated with leaf rust pathogen THBJG and BBBDB, and evaluated for infection type (IT) response. A major quantitative trait locus (QTL) for THBJG and BBBDB was coincidently mapped to chromosome arm 2AS, and the QTL accounted for 56.6-66.2% of total phenotypic variation in infection type (IT) response to THBJG, and 72.1-86.9% to BBBDB. The causal gene for resistance to these rust races was mapped to the 2NS segment in Jagger. The 2NS segment was located in a region of approximately 27.8 Mb starting from the telomere of chromosome arm 2AS, based on the sequences of the A genome in tetraploid wheat. The Lr17a gene on chromosome arm 2AS was delimited to 3.1 Mb in the genomic region, which was orthologous to the 2NS segment. Therefore, the Lr37 gene in the 2NS segment can be pyramided with other effective resistance genes, rather than Lr17a in wheat, to improve resistance to rust diseases. Copyright © 2018 Xue et al.

  10. New treatments for influenza.

    PubMed

    Barik, Sailen

    2012-09-13

    Influenza has a long history of causing morbidity and mortality in the human population through routine seasonal spread and global pandemics. The high mutation rate of the RNA genome of the influenza virus, combined with assortment of its multiple genomic segments, promote antigenic diversity and new subtypes, allowing the virus to evade vaccines and become resistant to antiviral drugs. There is thus a continuing need for new anti-influenza therapy using novel targets and creative strategies. In this review, we summarize prospective future therapeutic regimens based on recent molecular and genomic discoveries.

  11. Loss of p19Arf in a Rag1−/− B-cell precursor population initiates acute B-lymphoblastic leukemia

    PubMed Central

    Hauer, Julia; Mullighan, Charles; Morillon, Estelle; Wang, Gary; Bruneau, Julie; Brousse, Nicole; Lelorc'h, Marc; Romana, Serge; Boudil, Amine; Tiedau, Daniela; Kracker, Sven; Bushmann, Frederic D.; Borkhardt, Arndt; Fischer, Alain; Hacein-Bey-Abina, Salima

    2011-01-01

    In human B-acute lymphoblastic leukemia (B-ALL), RAG1-induced genomic alterations are important for disease progression. However, given that biallelic loss of the RAG1 locus is observed in a subset of cases, RAG1's role in the development of B-ALL remains unclear. We chose a p19Arf−/−Rag1−/− mouse model to confirm the previously published results concerning the contribution of CDKN2A (p19ARF /INK4a) and RAG1 copy number alterations in precursor B cells to the initiation and/or progression to B-acute lymphoblastic leukemia (B-ALL). In this murine model, we identified a new, Rag1-independent leukemia-initiating mechanism originating from a Sca1+CD19+ precursor cell population and showed that Notch1 expression accelerates the cells' self-renewal capacity in vitro. In human RAG1-deficient BM, a similar CD34+CD19+ population expressed p19ARF. These findings suggest that combined loss of p19Arf and Rag1 results in B-cell precursor leukemia in mice and may contribute to the progression of precursor B-ALL in humans. PMID:21622646

  12. Evolution of ESA's SSA Conjunction Prediction Service

    NASA Astrophysics Data System (ADS)

    Escobar, D.; Sancho, A. Tirado, J.; Agueda, A.; Martin, L.; Luque, F.; Fletcher, E.; Navarro, V.

    2013-08-01

    This paper presents the recent evolution of ESA's SSA Conjunction Prediction Service (CPS) as a result of an on-going activity in the Space Surveillance and Tracking (SST) Segment of ESA's Space Situational Awareness (SSA) Programme. The CPS is one of a number of precursor services being developed as part of the SST segment. It has been implemented as a service to provide external users with web-based access to conjunction information and designed with a service-oriented architecture. The paper encompasses the following topics: service functionality enhancements, integration with a live objects catalogue, all vs. all analyses supporting an operational concept based on low and high fidelity screenings, and finally conjunction detection and probability algorithms.

  13. From 20th century metabolic wall charts to 21st century systems biology: database of mammalian metabolic enzymes

    PubMed Central

    Corcoran, Callan C.; Grady, Cameron R.; Pisitkun, Trairak; Parulekar, Jaya

    2017-01-01

    The organization of the mammalian genome into gene subsets corresponding to specific functional classes has provided key tools for systems biology research. Here, we have created a web-accessible resource called the Mammalian Metabolic Enzyme Database (https://hpcwebapps.cit.nih.gov/ESBL/Database/MetabolicEnzymes/MetabolicEnzymeDatabase.html) keyed to the biochemical reactions represented on iconic metabolic pathway wall charts created in the previous century. Overall, we have mapped 1,647 genes to these pathways, representing ~7 percent of the protein-coding genome. To illustrate the use of the database, we apply it to the area of kidney physiology. In so doing, we have created an additional database (Database of Metabolic Enzymes in Kidney Tubule Segments: https://hpcwebapps.cit.nih.gov/ESBL/Database/MetabolicEnzymes/), mapping mRNA abundance measurements (mined from RNA-Seq studies) for all metabolic enzymes to each of 14 renal tubule segments. We carry out bioinformatics analysis of the enzyme expression pattern among renal tubule segments and mine various data sources to identify vasopressin-regulated metabolic enzymes in the renal collecting duct. PMID:27974320

  14. Identification of an miRNA candidate reflects the possible significance of transcribed microsatellites in the hairpin precursors of black pepper.

    PubMed

    Joy, Nisha; Soniya, Eppurathu Vasudevan

    2012-06-01

    Plant miRNAs (18-24nt) are generated by the RNase III-type Dicer endonuclease from the endogenous hairpin precursors ('pre-miRNAs') with significant regulatory functions. The transcribed regions display a higher frequency of microsatellites, when compared to other regions of the genomic DNA. Simple sequence repeats (SSRs) resulting from replication slippage occurring in transcripts affect the expression of genes. The available experimental evidence for the incidence of SSRs in the miRNA precursors is limited. Considering the potential significance of SSRs in the miRNA genes, we carried out a preliminary analysis to verify the presence of SSRs in the pri-miRNAs of black pepper (Piper nigrum L.). We isolated a (CT) dinucleotide SSR bearing transcript using SMART strategy. The transcript was predicted to be a 'pri-miRNA candidate' with Dicer sites based on miRNA prediction tools and MFOLD structural predictions. The presence of this 'miRNA candidate' was confirmed by real-time TaqMan assays. The upstream sequence of the 'miRNA candidate' by genome walking when subjected to PlantCARE showed the presence of certain promoter elements, and the deduced amino acid showed significant similarity with NAP1 gene, which affects the transcription of many genes. Moreover the hairpin-like precursor overlapped the neighbouring NAP1 gene. In silico analysis revealed distinct putative functions for the 'miRNA candidate', of which majority were related to growth. Hence, we assume that this 'miRNA candidate' may get activated during transcription of NAP gene, thereby regulating the expression of many genes involved in developmental processes.

  15. Cell-type-specific enrichment of risk-associated regulatory elements at ovarian cancer susceptibility loci.

    PubMed

    Coetzee, Simon G; Shen, Howard C; Hazelett, Dennis J; Lawrenson, Kate; Kuchenbaecker, Karoline; Tyrer, Jonathan; Rhie, Suhn K; Levanon, Keren; Karst, Alison; Drapkin, Ronny; Ramus, Susan J; Couch, Fergus J; Offit, Kenneth; Chenevix-Trench, Georgia; Monteiro, Alvaro N A; Antoniou, Antonis; Freedman, Matthew; Coetzee, Gerhard A; Pharoah, Paul D P; Noushmehr, Houtan; Gayther, Simon A

    2015-07-01

    Understanding the regulatory landscape of the human genome is a central question in complex trait genetics. Most single-nucleotide polymorphisms (SNPs) associated with cancer risk lie in non-protein-coding regions, implicating regulatory DNA elements as functional targets of susceptibility variants. Here, we describe genome-wide annotation of regions of open chromatin and histone modification in fallopian tube and ovarian surface epithelial cells (FTSECs, OSECs), the debated cellular origins of high-grade serous ovarian cancers (HGSOCs) and in endometriosis epithelial cells (EECs), the likely precursor of clear cell ovarian carcinomas (CCOCs). The regulatory architecture of these cell types was compared with normal human mammary epithelial cells and LNCaP prostate cancer cells. We observed similar positional patterns of global enhancer signatures across the three different ovarian cancer precursor cell types, and evidence of tissue-specific regulatory signatures compared to non-gynecological cell types. We found significant enrichment for risk-associated SNPs intersecting regulatory biofeatures at 17 known HGSOC susceptibility loci in FTSECs (P = 3.8 × 10(-30)), OSECs (P = 2.4 × 10(-23)) and HMECs (P = 6.7 × 10(-15)) but not for EECs (P = 0.45) or LNCaP cells (P = 0.88). Hierarchical clustering of risk SNPs conditioned on the six different cell types indicates FTSECs and OSECs are highly related (96% of samples using multi-scale bootstrapping) suggesting both cell types may be precursors of HGSOC. These data represent the first description of regulatory catalogues of normal precursor cells for different ovarian cancer subtypes, and provide unique insights into the tissue specific regulatory variation with respect to the likely functional targets of germline genetic susceptibility variants for ovarian cancer. © The Author 2015. Published by Oxford University Press. All rights reserved. For Permissions, please email: journals.permissions@oup.com.

  16. Implications of segment mismatch for influenza A virus evolution

    PubMed Central

    White, Maria C.; Lowen, Anice C.

    2018-01-01

    Influenza A virus (IAV) is an RNA virus with a segmented genome. These viral properties allow for the rapid evolution of IAV under selective pressure, due to mutation occurring from error-prone replication and the exchange of gene segments within a co-infected cell, termed reassortment. Both mutation and reassortment give rise to genetic diversity, but constraints shape their impact on viral evolution: just as most mutations are deleterious, most reassortment events result in genetic incompatibilities. The phenomenon of segment mismatch encompasses both RNA- and protein-based incompatibilities between co-infecting viruses and results in the production of progeny viruses with fitness defects. Segment mismatch is an important determining factor of the outcomes of mixed IAV infections and has been addressed in multiple risk assessment studies undertaken to date. However, due to the complexity of genetic interactions among the eight viral gene segments, our understanding of segment mismatch and its underlying mechanisms remain incomplete. Here, we summarize current knowledge regarding segment mismatch and discuss the implications of this phenomenon for IAV reassortment and diversity. PMID:29244017

  17. Both UDP N-acetylglucosamine pyrophosphorylases of Tribolium castaneum are critical for molting, survival, and fecundity

    USDA-ARS?s Scientific Manuscript database

    A bioinformatics search of the genome of the red flour beetle, Tribolium castaneum, resulted in the identification of two genes encoding proteins closely related to UDP-N-acetylglucosamine pyrophosphorylases (UAP), which provide the activated precursor, UDP-N-acetylglucosamine, for the synthesis of ...

  18. Exponential decay of GC content detected by strand-symmetric substitution rates influences the evolution of isochore structure.

    PubMed

    Karro, J E; Peifer, M; Hardison, R C; Kollmann, M; von Grünberg, H H

    2008-02-01

    The distribution of guanine and cytosine nucleotides throughout a genome, or the GC content, is associated with numerous features in mammals; understanding the pattern and evolutionary history of GC content is crucial to our efforts to annotate the genome. The local GC content is decaying toward an equilibrium point, but the causes and rates of this decay, as well as the value of the equilibrium point, remain topics of debate. By comparing the results of 2 methods for estimating local substitution rates, we identify 620 Mb of the human genome in which the rates of the various types of nucleotide substitutions are the same on both strands. These strand-symmetric regions show an exponential decay of local GC content at a pace determined by local substitution rates. DNA segments subjected to higher rates experience disproportionately accelerated decay and are AT rich, whereas segments subjected to lower rates decay more slowly and are GC rich. Although we are unable to draw any conclusions about causal factors, the results support the hypothesis proposed by Khelifi A, Meunier J, Duret L, and Mouchiroud D (2006. GC content evolution of the human and mouse genomes: insights from the study of processed pseudogenes in regions of different recombination rates. J Mol Evol. 62:745-752.) that the isochore structure has been reshaped over time. If rate variation were a determining factor, then the current isochore structure of mammalian genomes could result from the local differences in substitution rates. We predict that under current conditions strand-symmetric portions of the human genome will stabilize at an average GC content of 30% (considerably less than the current 42%), thus confirming that the human genome has not yet reached equilibrium.

  19. The Complete Genome Sequence of Herpesvirus Papio 2 (Cercopithecine Herpesvirus 16) Shows Evidence of Recombination Events among Various Progenitor Herpesviruses†

    PubMed Central

    Tyler, Shaun D.; Severini, Alberto

    2006-01-01

    We have sequenced the entire genome of herpesvirus papio 2 (HVP-2; Cercopithecine herpesvirus 16) strain X313, a baboon herpesvirus with close homology to other primate alphaherpesviruses, such as SA8, monkey B virus, and herpes simplex virus (HSV) type 1 and type 2. The genome of HVP-2 is 156,487 bp in length, with an overall GC content of 76.5%. The genome organization is identical to that of the other members of the genus Simplexvirus, with a long and a short unique region, each bordered by inverted repeats which end with an “a” sequence. All of the open reading frames detected in this genome were homologous and colinear with those of SA8 and B virus. The HSV gene RL1 (γ134.5; neurovirulence factor) is not present in HVP-2, as is the case for SA8 and B virus. The HVP-2 genome is 85% homologous to its closest relative, SA8. However, segment-by-segment bootstrap analysis of the genome revealed at least two regions that display closer homology to the corresponding sequences of B virus. The first region comprises the UL41 to UL44 genes, and the second region is located within the UL36 gene. We hypothesize that this localized and defined shift in homology is due to recombination events between an SA8-like progenitor of HVP-2 and a herpesvirus species more closely related to the B virus. Since some of the genes involved in these putative recombination events are determinants of virulence, a comparative analysis of their function may provide insight into the pathogenic mechanism of simplexviruses. PMID:16414998

  20. The complete genome sequence of herpesvirus papio 2 (Cercopithecine herpesvirus 16) shows evidence of recombination events among various progenitor herpesviruses.

    PubMed

    Tyler, Shaun D; Severini, Alberto

    2006-02-01

    We have sequenced the entire genome of herpesvirus papio 2 (HVP-2; Cercopithecine herpesvirus 16) strain X313, a baboon herpesvirus with close homology to other primate alphaherpesviruses, such as SA8, monkey B virus, and herpes simplex virus (HSV) type 1 and type 2. The genome of HVP-2 is 156,487 bp in length, with an overall GC content of 76.5%. The genome organization is identical to that of the other members of the genus Simplexvirus, with a long and a short unique region, each bordered by inverted repeats which end with an "a" sequence. All of the open reading frames detected in this genome were homologous and colinear with those of SA8 and B virus. The HSV gene RL1 (gamma(1)34.5; neurovirulence factor) is not present in HVP-2, as is the case for SA8 and B virus. The HVP-2 genome is 85% homologous to its closest relative, SA8. However, segment-by-segment bootstrap analysis of the genome revealed at least two regions that display closer homology to the corresponding sequences of B virus. The first region comprises the UL41 to UL44 genes, and the second region is located within the UL36 gene. We hypothesize that this localized and defined shift in homology is due to recombination events between an SA8-like progenitor of HVP-2 and a herpesvirus species more closely related to the B virus. Since some of the genes involved in these putative recombination events are determinants of virulence, a comparative analysis of their function may provide insight into the pathogenic mechanism of simplexviruses.

  1. Comparative analysis of the L, M, and S RNA segments of Crimean-Congo haemorrhagic fever virus isolates from southern Africa.

    PubMed

    Goedhals, Dominique; Bester, Phillip A; Paweska, Janusz T; Swanepoel, Robert; Burt, Felicity J

    2015-05-01

    Crimean-Congo haemorrhagic fever virus (CCHFV) is a member of the Bunyaviridae family with a tripartite, negative sense RNA genome. This study used predictive software to analyse the L (large), M (medium), and S (small) segments of 14 southern African CCHFV isolates. The OTU-like cysteine protease domain and the RdRp domain of the L segment are highly conserved among southern African CCHFV isolates. The M segment encodes the structural glycoproteins, GN and GC, and the non-structural glycoproteins which are post-translationally cleaved at highly conserved furin and subtilase SKI-1 cleavage sites. All of the sites previously identified were shown to be conserved among southern African CCHFV isolates. The heavily O-glycosylated N-terminal variable mucin-like domain of the M segment shows the highest sequence variability of the CCHFV proteins. Five transmembrane domains are predicted in the M segment polyprotein resulting in three regions internal to and three regions external to the membrane across the G(N), NS(M) and G(C) glycoproteins. The corroboration of conserved genome domains and sequence identity among geographically diverse isolates may assist in the identification of protein function and pathogenic mechanisms, as well as the identification of potential targets for antiviral therapy and vaccine design. As detailed functional studies are lacking for many of the CCHFV proteins, identification of functional domains by prediction of protein structure, and identification of amino acid level similarity to functionally characterised proteins of related viruses or viruses with similar pathogenic mechanisms are a necessary step for selection of areas for further study. © 2015 Wiley Periodicals, Inc.

  2. Genomic characterization of H14 subtype influenza A viruses in New World waterfowl and experimental infectivity in mallards Anas platyrhynchos

    USGS Publications Warehouse

    Ramey, Andy M.; Poulson, Rebecca L.; Gonzalez-Reiche, Ana S.; Perez, Daniel R.; Stalknecht, David E.; Brown, Justin D.

    2014-01-01

    Recent repeated isolation of H14 hemagglutinin subtype influenza A viruses (IAVs) in the New World waterfowl provides evidence to suggest that host and/or geographic ranges for viruses of this subtype may be expanding. In this study, we used genomic analyses to gain inference on the origin and evolution of H14 viruses in New World waterfowl and conducted an experimental challenge study in mallards (Anas platyrhynchos) to evaluate pathogenicity, viral replication, and transmissibility of a representative viral strain in a natural host species. Genomic characterization of H14 subtype IAVs isolated from New World waterfowl, including three isolates sequenced specifically for this study, revealed high nucleotide identity among individual gene segments (e.g. ≥95% shared identity among H14 HA gene segments). In contrast, lower shared identity was observed among internal gene segments. Furthermore, multiple neuraminidase subtypes were observed for H14 IAVs isolated in the New World. Gene segments of H14 viruses isolated after 2010 shared ancestral genetic lineages with IAVs isolated from wild birds throughout North America. Thus, genomic characterization provided evidence for viral evolution in New World waterfowl through genetic drift and genetic shift since purported introduction from Eurasia. In the challenge study, no clinical disease or lesions were observed among mallards experimentally inoculated with A/blue-winged teal/Texas/AI13-1028/2013(H14N5) or exposed via contact with infected birds. Titers of viral shedding for mallards challenged with the H14N5 IAV were highest at two days post-inoculation (DPI); however shedding was detected up to nine DPI using cloacal swabs. The distribution of viral antigen among mallards infected with H14N5 IAV was largely restricted to enterocytes lining the villi in the lower intestinal tract and in the epithelium of the bursa of Fabricius. Characterization of the infectivity of A/blue-winged teal/Texas/AI13-1028/2013(H14N5) in mallards provides support for similarities in viral replication and shedding as compared to previously described waterfowl-adapted, low pathogenic IAV strains in ducks.

  3. Proflavine sensitivity of RNA processing in isolated nuclei.

    PubMed Central

    Yannarell, A; Niemann, M; Schumm, D E; Webb, T E

    1977-01-01

    The intercalating agent proflavine inhibits the processing and subsequent release of preformed messenger RNA and ribosomal RNA from isolated liver nuclei to surrogate cytoplasm. The direct effect of proflavine on these processes, as monitored in a reconstituted cell-free system, supports the theory that base-paired segments (i.e. hairpin loops) in the precursor RNA's are involved as recognition sites in nuclear RNA processing. PMID:866181

  4. NGFFFamide and echinotocin: structurally unrelated myoactive neuropeptides derived from neurophysin-containing precursors in sea urchins.

    PubMed

    Elphick, Maurice R; Rowe, Matthew L

    2009-04-01

    The myoactive neuropeptide NGIWYamide was originally isolated from the holothurian (sea cucumber) Apostichopus japonicus but there is evidence that NGIWYamide-like peptides also occur in other echinoderms. Here we report the discovery of a gene in the sea urchin Strongylocentrotus purpuratus that encodes two copies of an NGIWYamide-like peptide: Asn-Gly-Phe-Phe-Phe-(NH(2)) or NGFFFamide. Interestingly, the C-terminal region of the NGFFFamide precursor shares sequence similarity with neurophysins, carrier proteins hitherto uniquely associated with precursors of vasopressin/oxytocin-like neuropeptides. Thus, the NGFFFamide precursor is the first neurophysin-containing neuropeptide precursor to be discovered that does not contain a vasopressin/oxytocin-like peptide. However, it remains to be determined whether neurophysin acts as a carrier protein for NGFFFamide. The S. purpuratus genome also contains a gene encoding a precursor comprising a neurophysin polypeptide and 'echinotocin' (CFISNCPKGamide) - the first vasopressin/oxytocin-like peptide to be identified in an echinoderm. Therefore, in S. purpuratus there are two genes encoding precursors that have a neurophysin domain but which encode neuropeptides that are structurally unrelated. Furthermore, both NGFFFamide and echinotocin cause contraction of tube foot and oesophagus preparations from the sea urchin Echinus esculentus, consistent with the myoactivity of NGIWYamide in sea cucumbers and the myoactivity of vasopressin/oxytocin-like peptides in other animal phyla. Presumably the NGFFFamide precursor acquired its neurophysin domain following partial or complete duplication of a gene encoding a vasopressin/oxytocin-like peptide, but it remains to be determined when in evolutionary history this occurred.

  5. Evolution of H3N2v viruses in North American swine and humans, 2009-2011

    USDA-ARS?s Scientific Manuscript database

    Novel H3N2 influenza viruses (H3N2v) containing seven genome segments from swine-lineage triple reassortant H3N2 viruses and a 2009 pandemic H1N1 (H1N1pdm09) matrix protein segment (pM) have been isolated from 12 humans in the United States between August – December 2011. To understand the evolution...

  6. Strategies for the Segmentation of Subcutaneous Vascular Patterns in Thermographic Images

    NASA Astrophysics Data System (ADS)

    Chan, Eric K. Y.; Pearce, John A.

    1989-05-01

    Computer-assisted segmentation of vascular patterns in thermographic images provides the clinician with graphic outlines of thermally significant subcutaneous blood vessels. Segmentation strategies compared here consist of image smoothing protocols followed by thresholding and zero-crossing edge detectors. Median prefiltering followed by the Frei-Chen algorithm gave the most reproducible results, with an execution time of 143 seconds for 256 X 256 images. The Laplacian of Gaussian operator was not suitable due to streak artifacts in the thermographic imaging system. This computerized process may be adopted in a fast paced clinical environment to aid in the diagnosis and assessment of peripheral circulatory diseases, Raynaud's Disease3, phlebitis, varicose veins, as well as diseases of the autonomic nervous system. The same methodology may be applied to enhance the appearance of abnormal breast vascular patterns, and hence serve as an adjunct to mammography in the diagnosis of breast cancer. The automatically segmented vascular patterns, which have a hand drawn appearance, may also be used as a data reduction precursor to higher level pattern analysis and classification tasks.

  7. Extensive Concerted Evolution of Rice Paralogs and the Road to Regaining Independence

    PubMed Central

    Wang, Xiyin; Tang, Haibao; Bowers, John E.; Feltus, Frank A.; Paterson, Andrew H.

    2007-01-01

    Many genes duplicated by whole-genome duplications (WGDs) are more similar to one another than expected. We investigated whether concerted evolution through conversion and crossing over, well-known to affect tandem gene clusters, also affects dispersed paralogs. Genome sequences for two Oryza subspecies reveal appreciable gene conversion in the ∼0.4 MY since their divergence, with a gradual progression toward independent evolution of older paralogs. Since divergence from subspecies indica, ∼8% of japonica paralogs produced 5–7 MYA on chromosomes 11 and 12 have been affected by gene conversion and several reciprocal exchanges of chromosomal segments, while ∼70-MY-old “paleologs” resulting from a genome duplication (GD) show much less conversion. Sequence similarity analysis in proximal gene clusters also suggests more conversion between younger paralogs. About 8% of paleologs may have been converted since rice–sorghum divergence ∼41 MYA. Domain-encoding sequences are more frequently converted than nondomain sequences, suggesting a sort of circularity—that sequences conserved by selection may be further conserved by relatively frequent conversion. The higher level of concerted evolution in the 5–7 MY-old segmental duplication may reflect the behavior of many genomes within the first few million years after duplication or polyploidization. PMID:18039882

  8. Transposon-like properties of the major, long repetitive sequence family in the genome of Physarum polycephalum

    PubMed Central

    Pearston, Douglas H.; Gordon, Mairi; Hardman, Norman

    1985-01-01

    A family of long, highly-repetitive sequences, referred to previously as `HpaII-repeats', dominates the genome of the eukaryotic slime mould Physarum polycephalum. These sequences are found exclusively in scrambled clusters. They account for about one-half of the total complement of repetitive DNA in Physarum, and represent the major sequence component found in hypermethylated, 20-50 kb segments of Physarum genomic DNA that fail to be cleaved using the restriction endonuclease HpaII. The structure of this abundant repetitive element was investigated by analysing cloned segments derived from the hypermethylated genomic DNA compartment. We show that the `HpaII-repeat' forms part of a larger repetitive DNA structure, ∼8.6 kb in length, with several structural features in common with recognised eukaryotic transposable genetic elements. Scrambled clusters of the sequence probably arise as a result of transposition-like events, during which the element preferentially recombines in either orientation with target sites located in other copies of the same repeated sequence. The target sites for transposition/recombination are not related in sequence but in all cases studied they are potentially capable of promoting the formation of small `cruciforms' or `Z-DNA' structures which might be recognised during the recombination process. ImagesFig. 3.Fig. 4. PMID:16453652

  9. Isolation and Complete Genome Sequencing of Bluetongue Virus Serotype 12 from India.

    PubMed

    Rao, P P; Reddy, Y V; Hegde, N R

    2015-10-01

    Bluetongue virus (BTV) causes disease mainly in sheep, but can be transmitted via other domestic and wild ruminants, resulting in pecuniary burden and trade restrictions. Segmented genome with the possibility of reassortment, existence of 26 serotypes, geographical restriction in the distribution of many of the serotypes, use of live attenuated vaccines and the lack of complete sequences of viruses isolated from several parts of the globe have complicated our understanding of the origin, movement and distribution of BTV. Recent efforts in genome sequencing of several strains have helped in better comprehending BTV epidemiology. In an effort to contribute to the genetic epidemiology of BTV in India, we report the isolation and complete genome sequencing of a BTV serotype 12 virus (designated NMO1). This is the first BTV-12 isolated from India and the second BTV-12 to be sequenced worldwide. The analysis of sequences of this virus suggests that NMO1 derived its segments from viruses belonging to western topotype viruses, as well as those from South-East Asia and India. The results have implications for understanding the origin, emergence/re-emergence and movement of BTV as well as for the development of vaccines and diagnostics based on robust epidemiological data. © 2013 Blackwell Verlag GmbH.

  10. Influenza A virus evolution and spatio-temporal dynamics in Eurasian wild birds: a phylogenetic and phylogeographical study of whole-genome sequence data

    PubMed Central

    Lewis, Nicola S.; Verhagen, Josanne H.; Javakhishvili, Zurab; Russell, Colin A.; Lexmond, Pascal; Westgeest, Kim B.; Bestebroer, Theo M.; Halpin, Rebecca A.; Lin, Xudong; Ransier, Amy; Fedorova, Nadia B.; Stockwell, Timothy B.; Latorre-Margalef, Neus; Olsen, Björn; Smith, Gavin; Bahl, Justin; Wentworth, David E.; Waldenström, Jonas; Fouchier, Ron A. M.

    2015-01-01

    Low pathogenic avian influenza A viruses (IAVs) have a natural host reservoir in wild waterbirds and the potential to spread to other host species. Here, we investigated the evolutionary, spatial and temporal dynamics of avian IAVs in Eurasian wild birds. We used whole-genome sequences collected as part of an intensive long-term Eurasian wild bird surveillance study, and combined this genetic data with temporal and spatial information to explore the virus evolutionary dynamics. Frequent reassortment and co-circulating lineages were observed for all eight genomic RNA segments over time. There was no apparent species-specific effect on the diversity of the avian IAVs. There was a spatial and temporal relationship between the Eurasian sequences and significant viral migration of avian IAVs from West Eurasia towards Central Eurasia. The observed viral migration patterns differed between segments. Furthermore, we discuss the challenges faced when analysing these surveillance and sequence data, and the caveats to be borne in mind when drawing conclusions from the apparent results of such analyses. PMID:25904147

  11. Tectono-stratigraphic evolution of normal fault zones: Thal Fault Zone, Suez Rift, Egypt

    NASA Astrophysics Data System (ADS)

    Leppard, Christopher William

    The evolution of linkage of normal fault populations to form continuous, basin bounding normal fault zones is recognised as an important control on the stratigraphic evolution of rift-basins. This project aims to investigate the temporal and spatial evolution of normal fault populations and associated syn-rift deposits from the initiation of early-formed, isolated normal faults (rift-initiation) to the development of a through-going fault zone (rift-climax) by documenting the tectono-stratigraphic evolution of the Sarbut EI Gamal segment of the exceptionally well-exposed Thai fault zone, Suez Rift, Egypt. A number of dated stratal surfaces mapped around the syn-rift depocentre of the Sarbut El Gamal segment allow constraints to be placed on the timing and style of deformation, and the spatial variability of facies along this segment of the fault zone. Data collected indicates that during the first 3.5 My of rifting the structural style was characterised by numerous, closely spaced, short (< 3 km), low displacement (< 200 m) synthetic and antithetic normal faults within 1 - 2 km of the present-day fault segment trace, accommodating surface deformation associated with the development of a fault propagation monocline above the buried, pre-cursor strands of the Sarbut El Gamal fault segment. The progressive localisation of displacement onto the fault segment during rift-climax resulted in the development of a major, surface-breaking fault 3.5 - 5 My after the onset of rifting and is recorded by the death of early-formed synthetic and antithetic faults up-section, and thickening of syn-rift strata towards the fault segment. The influence of intrabasinal highs at the tips of the Sarbut EI Gamal fault segment on the pre-rift sub-crop level, combined with observations from the early-formed structures and coeval deposits suggest that the overall length of the fault segment was fixed from an early stage. The fault segment is interpreted to have grown through rapid lateral propagation and early linkage of the precursor fault strands at depth before the fault segment broke surface, followed by the accumulation of displacement on the linked fault segment with minimal lateral propagation. This style of fault growth contrasts conventional fault growth models by which growth occurs through incremental increases in both displacement and length through time. The evolution of normal fault populations and fault zones exerts a first- order control on basin physiography and sediment supply, and therefore, the architecture and distribution of coeval syn-rift stratigraphy. The early syn-rift continental, Abu Zenima Formation, to shallow marine, Nukhul Formation show a pronounced westward increase in thickness controlled by the series of synthetic and antithetic faults up to 3 km west of present day Thai fault. The orientation of these faults controlled the location of fluvial conglomerates, sandstones and mudstones that shifted to the topographic lows created. The progressive localisation of displacement onto the Sarbut El Gamal fault segment during rift-climax resulted in an overall change in basin geometry. Accelerated subsidence rates led to sedimentation rates being outpaced by subsidence resulting in the development of a marine, sediment-starved, underfilled hangingwall depocentre characterised by slope-to-basinal depositional environments, with a laterally continuous slope apron in the immediate hangingwall, and point-sourced submarine fans. Controls on the spatial distribution, three dimensional architecture, and facies stacking patterns of coeval syn-rift deposits are identified as: I) structural style of the evolution and linkage of normal fault populations, ii) basin physiography, iii) evolution of drainage catchments, iv) bedrock lithology, and v) variations in sea/lake level.

  12. [Clonal association of flat epithelial atypia and tubular breast cancer].

    PubMed

    Aulmann, S; Elsawaf, Z; Penzel, R; Schirmacher, P; Sinn, H P

    2008-11-01

    Flat epithelial atypia (FEA) of the breast has recently gained attention as a possible precursor lesion of highly differentiated breast cancer. Especially tubular carcinomas, with which FEA shares cytological features, often occur in close proximity to each other. To examine a possible clonal relationship, we analysed mutations of the highly variable region of the mitochondrial genome in a series of tubular carcinomas, associated FEA and normal glands. Multiple sequence alignment showed identical mtDNA mutations in approximately 50% of paired FEA and tumour samples, indicative of a clonal relationship. Our data indicate a possible precursor role of FEA in the development of tubular breast cancer.

  13. Genomic evolution and chemoresistance in germ-cell tumours.

    PubMed

    Taylor-Weiner, Amaro; Zack, Travis; O'Donnell, Elizabeth; Guerriero, Jennifer L; Bernard, Brandon; Reddy, Anita; Han, G Celine; AlDubayan, Saud; Amin-Mansour, Ali; Schumacher, Steven E; Litchfield, Kevin; Turnbull, Clare; Gabriel, Stacey; Beroukhim, Rameen; Getz, Gad; Carter, Scott L; Hirsch, Michelle S; Letai, Anthony; Sweeney, Christopher; Van Allen, Eliezer M

    2016-11-30

    Germ-cell tumours (GCTs) are derived from germ cells and occur most frequently in the testes. GCTs are histologically heterogeneous and distinctly curable with chemotherapy. Gains of chromosome arm 12p and aneuploidy are nearly universal in GCTs, but specific somatic genomic features driving tumour initiation, chemosensitivity and progression are incompletely characterized. Here, using clinical whole-exome and transcriptome sequencing of precursor, primary (testicular and mediastinal) and chemoresistant metastatic human GCTs, we show that the primary somatic feature of GCTs is highly recurrent chromosome arm level amplifications and reciprocal deletions (reciprocal loss of heterozygosity), variations that are significantly enriched in GCTs compared to 19 other cancer types. These tumours also acquire KRAS mutations during the development from precursor to primary disease, and primary testicular GCTs (TGCTs) are uniformly wild type for TP53. In addition, by functional measurement of apoptotic signalling (BH3 profiling) of fresh tumour and adjacent tissue, we find that primary TGCTs have high mitochondrial priming that facilitates chemotherapy-induced apoptosis. Finally, by phylogenetic analysis of serial TGCTs that emerge with chemotherapy resistance, we show how TGCTs gain additional reciprocal loss of heterozygosity and that this is associated with loss of pluripotency markers (NANOG and POU5F1) in chemoresistant teratomas or transformed carcinomas. Our results demonstrate the distinct genomic features underlying the origins of this disease and associated with the chemosensitivity phenotype, as well as the rare progression to chemoresistance. These results identify the convergence of cancer genomics, mitochondrial priming and GCT evolution, and may provide insights into chemosensitivity and resistance in other cancers.

  14. ASFinder: a tool for genome-wide identification of alternatively splicing transcripts from EST-derived sequences.

    PubMed

    Min, Xiang Jia

    2013-01-01

    Expressed Sequence Tags (ESTs) are a rich resource for identifying Alternatively Splicing (AS) genes. The ASFinder webserver is designed to identify AS isoforms from EST-derived sequences. Two approaches are implemented in ASFinder. If no genomic sequences are provided, the server performs a local BLASTN to identify AS isoforms from ESTs having both ends aligned but an internal segment unaligned. Otherwise, ASFinder uses SIM4 to map ESTs to the genome, then the overlapping ESTs that are mapped to the same genomic locus and have internal variable exon/intron boundaries are identified as AS isoforms. The tool is available at http://proteomics.ysu.edu/tools/ASFinder.html.

  15. Pathogenicity Island-Directed Transfer of Unlinked Chromosomal Virulence Genes

    PubMed Central

    Chen, John; Ram, Geeta; Penadés, José R.; Brown, Stuart; Novick, Richard P.

    2014-01-01

    Summary In recent decades, the notorious pathogen Staphylococcus aureus has become progressively more contagious, more virulent and more resistant to antibiotics. This implies a rather dynamic evolutionary capability, representing a remarkable level of genomic plasticity, most probably maintained by horizontal gene transfer. Here we report that the staphylococcal pathogenicity islands have a dual role in gene transfer: they not only mediate their own transfer, but they can independently direct the transfer of unlinked chromosomal segments containing virulence genes. While transfer of the island itself requires specific helper phages, transfer of unlinked chromosomal segments does not, so that potentially any pac-type phage will serve. These results reveal that SaPIs can increase the horizontal exchange of accessory genes associated with disease, and may shape pathogen genomes beyond the confines of their attachment sites. PMID:25498143

  16. Fitness cost of reassortment in human influenza

    PubMed Central

    Lässig, Michael

    2017-01-01

    Reassortment, which is the exchange of genome sequence between viruses co-infecting a host cell, plays an important role in the evolution of segmented viruses. In the human influenza virus, reassortment happens most frequently between co-existing variants within the same lineage. This process breaks genetic linkage and fitness correlations between viral genome segments, but the resulting net effect on viral fitness has remained unclear. In this paper, we determine rate and average selective effect of reassortment processes in the human influenza lineage A/H3N2. For the surface proteins hemagglutinin and neuraminidase, reassortant variants with a mean distance of at least 3 nucleotides to their parent strains get established at a rate of about 10−2 in units of the neutral point mutation rate. Our inference is based on a new method to map reassortment events from joint genealogies of multiple genome segments, which is tested by extensive simulations. We show that intra-lineage reassortment processes are, on average, under substantial negative selection that increases in strength with increasing sequence distance between the parent strains. The deleterious effects of reassortment manifest themselves in two ways: there are fewer reassortment events than expected from a null model of neutral reassortment, and reassortant strains have fewer descendants than their non-reassortant counterparts. Our results suggest that influenza evolves under ubiquitous epistasis across proteins, which produces fitness barriers against reassortment even between co-circulating strains within one lineage. PMID:29112968

  17. Dynamic evolution at pericentromeres.

    PubMed

    Hall, Anne E; Kettler, Gregory C; Preuss, Daphne

    2006-03-01

    Pericentromeres are exceptional genomic regions: in animals they contain extensive segmental duplications implicated in gene creation, and in plants they sustain rearrangements and insertions uncommon in euchromatin. To examine the mechanisms and patterns of plant pericentromere evolution, we compared pericentromere sequence from four Brassicaceae species separated by <15 million years (Myr). This flowering plant family is ideal for studying relationships between genome reorganization and pericentromere evolution-its members have undergone recent polyploidization and hybridization, with close relatives changing in genome size and chromosome number. Through sequence and hybridization analyses, we examined regions from Arabidopsis arenosa, Capsella rubella, and Olimarabidopsis pumila that are homologous to Arabidopsis thaliana pericentromeres (peri-CENs) III and V, and used FISH to demonstrate they have been maintained near centromere satellite arrays in each species. Sequence analysis revealed a set of highly conserved genes, yet we discovered substantial differences in intergenic length and species-specific changes in sequence content and gene density. We discovered that A. thaliana has undergone recent, significant expansions within its pericentromeres, in some cases measuring hundreds of kilobases; these findings are in marked contrast to euchromatic segments in these species that exhibit only minor length changes. While plant pericentromeres do contain some duplications, we did not find evidence of extensive segmental duplications, as has been documented in primates. Our data support a model in which plant pericentromeres may experience selective pressures distinct from euchromatin, tolerating rapid, dynamic changes in structure and sequence content, including large insertions of mobile elements, 5S rDNA arrays and pseudogenes.

  18. iCopyDAV: Integrated platform for copy number variations—Detection, annotation and visualization

    PubMed Central

    Vogeti, Sriharsha

    2018-01-01

    Discovery of copy number variations (CNVs), a major category of structural variations, have dramatically changed our understanding of differences between individuals and provide an alternate paradigm for the genetic basis of human diseases. CNVs include both copy gain and copy loss events and their detection genome-wide is now possible using high-throughput, low-cost next generation sequencing (NGS) methods. However, accurate detection of CNVs from NGS data is not straightforward due to non-uniform coverage of reads resulting from various systemic biases. We have developed an integrated platform, iCopyDAV, to handle some of these issues in CNV detection in whole genome NGS data. It has a modular framework comprising five major modules: data pre-treatment, segmentation, variant calling, annotation and visualization. An important feature of iCopyDAV is the functional annotation module that enables the user to identify and prioritize CNVs encompassing various functional elements, genomic features and disease-associations. Parallelization of the segmentation algorithms makes the iCopyDAV platform even accessible on a desktop. Here we show the effect of sequencing coverage, read length, bin size, data pre-treatment and segmentation approaches on accurate detection of the complete spectrum of CNVs. Performance of iCopyDAV is evaluated on both simulated data and real data for different sequencing depths. It is an open-source integrated pipeline available at https://github.com/vogetihrsh/icopydav and as Docker’s image at http://bioinf.iiit.ac.in/icopydav/. PMID:29621297

  19. Widespread Recombination, Reassortment, and Transmission of Unbalanced Compound Viral Genotypes in Natural Arenavirus Infections

    PubMed Central

    Stenglein, Mark D.; Jacobson, Elliott R.; Chang, Li-Wen; Sanders, Chris; Hawkins, Michelle G.; Guzman, David S-M.; Drazenovich, Tracy; Dunker, Freeland; Kamaka, Elizabeth K.; Fisher, Debbie; Reavill, Drury R.; Meola, Linda F.; Levens, Gregory; DeRisi, Joseph L.

    2015-01-01

    Arenaviruses are one of the largest families of human hemorrhagic fever viruses and are known to infect both mammals and snakes. Arenaviruses package a large (L) and small (S) genome segment in their virions. For segmented RNA viruses like these, novel genotypes can be generated through mutation, recombination, and reassortment. Although it is believed that an ancient recombination event led to the emergence of a new lineage of mammalian arenaviruses, neither recombination nor reassortment has been definitively documented in natural arenavirus infections. Here, we used metagenomic sequencing to survey the viral diversity present in captive arenavirus-infected snakes. From 48 infected animals, we determined the complete or near complete sequence of 210 genome segments that grouped into 23 L and 11 S genotypes. The majority of snakes were multiply infected, with up to 4 distinct S and 11 distinct L segment genotypes in individual animals. This S/L imbalance was typical: in all cases intrahost L segment genotypes outnumbered S genotypes, and a particular S segment genotype dominated in individual animals and at a population level. We corroborated sequencing results by qRT-PCR and virus isolation, and isolates replicated as ensembles in culture. Numerous instances of recombination and reassortment were detected, including recombinant segments with unusual organizations featuring 2 intergenic regions and superfluous content, which were capable of stable replication and transmission despite their atypical structures. Overall, this represents intrahost diversity of an extent and form that goes well beyond what has been observed for arenaviruses or for viruses in general. This diversity can be plausibly attributed to the captive intermingling of sub-clinically infected wild-caught snakes. Thus, beyond providing a unique opportunity to study arenavirus evolution and adaptation, these findings allow the investigation of unintended anthropogenic impacts on viral ecology, diversity, and disease potential. PMID:25993603

  20. Z-DNA-induced super-transport of energy within genomes

    NASA Astrophysics Data System (ADS)

    Kulish, Vladimir V.; Heng, Li; Dröge, Peter

    2007-10-01

    Spontaneous transitions of genomic DNA segments from right-handed B-DNA into the left-handed, high-energy Z conformation are unstable within topologically relaxed DNA molecules, such as mammalian chromosomes. Here we show, from direct application of the principles of statistical physics with a promoter region in the mouse genome as a representative example, that the life span for this alternate DNA conformation may be much smaller than the characteristic time of thermal fluctuations that cause the B-to-Z transition. Surprisingly, such a short existence of Z-DNA is important because it can be responsible for super-transport of energy within a genome. This type of energy transport can be utilized by a cell to communicate information about the state of particular chromatin domains within chromosomes or as a buffer against genome instability.

  1. The genome sequence of taurine cattle: a window to ruminant biology and evolution.

    PubMed

    Elsik, Christine G; Tellam, Ross L; Worley, Kim C; Gibbs, Richard A; Muzny, Donna M; Weinstock, George M; Adelson, David L; Eichler, Evan E; Elnitski, Laura; Guigó, Roderic; Hamernik, Debora L; Kappes, Steve M; Lewin, Harris A; Lynn, David J; Nicholas, Frank W; Reymond, Alexandre; Rijnkels, Monique; Skow, Loren C; Zdobnov, Evgeny M; Schook, Lawrence; Womack, James; Alioto, Tyler; Antonarakis, Stylianos E; Astashyn, Alex; Chapple, Charles E; Chen, Hsiu-Chuan; Chrast, Jacqueline; Câmara, Francisco; Ermolaeva, Olga; Henrichsen, Charlotte N; Hlavina, Wratko; Kapustin, Yuri; Kiryutin, Boris; Kitts, Paul; Kokocinski, Felix; Landrum, Melissa; Maglott, Donna; Pruitt, Kim; Sapojnikov, Victor; Searle, Stephen M; Solovyev, Victor; Souvorov, Alexandre; Ucla, Catherine; Wyss, Carine; Anzola, Juan M; Gerlach, Daniel; Elhaik, Eran; Graur, Dan; Reese, Justin T; Edgar, Robert C; McEwan, John C; Payne, Gemma M; Raison, Joy M; Junier, Thomas; Kriventseva, Evgenia V; Eyras, Eduardo; Plass, Mireya; Donthu, Ravikiran; Larkin, Denis M; Reecy, James; Yang, Mary Q; Chen, Lin; Cheng, Ze; Chitko-McKown, Carol G; Liu, George E; Matukumalli, Lakshmi K; Song, Jiuzhou; Zhu, Bin; Bradley, Daniel G; Brinkman, Fiona S L; Lau, Lilian P L; Whiteside, Matthew D; Walker, Angela; Wheeler, Thomas T; Casey, Theresa; German, J Bruce; Lemay, Danielle G; Maqbool, Nauman J; Molenaar, Adrian J; Seo, Seongwon; Stothard, Paul; Baldwin, Cynthia L; Baxter, Rebecca; Brinkmeyer-Langford, Candice L; Brown, Wendy C; Childers, Christopher P; Connelley, Timothy; Ellis, Shirley A; Fritz, Krista; Glass, Elizabeth J; Herzig, Carolyn T A; Iivanainen, Antti; Lahmers, Kevin K; Bennett, Anna K; Dickens, C Michael; Gilbert, James G R; Hagen, Darren E; Salih, Hanni; Aerts, Jan; Caetano, Alexandre R; Dalrymple, Brian; Garcia, Jose Fernando; Gill, Clare A; Hiendleder, Stefan G; Memili, Erdogan; Spurlock, Diane; Williams, John L; Alexander, Lee; Brownstein, Michael J; Guan, Leluo; Holt, Robert A; Jones, Steven J M; Marra, Marco A; Moore, Richard; Moore, Stephen S; Roberts, Andy; Taniguchi, Masaaki; Waterman, Richard C; Chacko, Joseph; Chandrabose, Mimi M; Cree, Andy; Dao, Marvin Diep; Dinh, Huyen H; Gabisi, Ramatu Ayiesha; Hines, Sandra; Hume, Jennifer; Jhangiani, Shalini N; Joshi, Vandita; Kovar, Christie L; Lewis, Lora R; Liu, Yih-Shin; Lopez, John; Morgan, Margaret B; Nguyen, Ngoc Bich; Okwuonu, Geoffrey O; Ruiz, San Juana; Santibanez, Jireh; Wright, Rita A; Buhay, Christian; Ding, Yan; Dugan-Rocha, Shannon; Herdandez, Judith; Holder, Michael; Sabo, Aniko; Egan, Amy; Goodell, Jason; Wilczek-Boney, Katarzyna; Fowler, Gerald R; Hitchens, Matthew Edward; Lozado, Ryan J; Moen, Charles; Steffen, David; Warren, James T; Zhang, Jingkun; Chiu, Readman; Schein, Jacqueline E; Durbin, K James; Havlak, Paul; Jiang, Huaiyang; Liu, Yue; Qin, Xiang; Ren, Yanru; Shen, Yufeng; Song, Henry; Bell, Stephanie Nicole; Davis, Clay; Johnson, Angela Jolivet; Lee, Sandra; Nazareth, Lynne V; Patel, Bella Mayurkumar; Pu, Ling-Ling; Vattathil, Selina; Williams, Rex Lee; Curry, Stacey; Hamilton, Cerissa; Sodergren, Erica; Wheeler, David A; Barris, Wes; Bennett, Gary L; Eggen, André; Green, Ronnie D; Harhay, Gregory P; Hobbs, Matthew; Jann, Oliver; Keele, John W; Kent, Matthew P; Lien, Sigbjørn; McKay, Stephanie D; McWilliam, Sean; Ratnakumar, Abhirami; Schnabel, Robert D; Smith, Timothy; Snelling, Warren M; Sonstegard, Tad S; Stone, Roger T; Sugimoto, Yoshikazu; Takasuga, Akiko; Taylor, Jeremy F; Van Tassell, Curtis P; Macneil, Michael D; Abatepaulo, Antonio R R; Abbey, Colette A; Ahola, Virpi; Almeida, Iassudara G; Amadio, Ariel F; Anatriello, Elen; Bahadue, Suria M; Biase, Fernando H; Boldt, Clayton R; Carroll, Jeffery A; Carvalho, Wanessa A; Cervelatti, Eliane P; Chacko, Elsa; Chapin, Jennifer E; Cheng, Ye; Choi, Jungwoo; Colley, Adam J; de Campos, Tatiana A; De Donato, Marcos; Santos, Isabel K F de Miranda; de Oliveira, Carlo J F; Deobald, Heather; Devinoy, Eve; Donohue, Kaitlin E; Dovc, Peter; Eberlein, Annett; Fitzsimmons, Carolyn J; Franzin, Alessandra M; Garcia, Gustavo R; Genini, Sem; Gladney, Cody J; Grant, Jason R; Greaser, Marion L; Green, Jonathan A; Hadsell, Darryl L; Hakimov, Hatam A; Halgren, Rob; Harrow, Jennifer L; Hart, Elizabeth A; Hastings, Nicola; Hernandez, Marta; Hu, Zhi-Liang; Ingham, Aaron; Iso-Touru, Terhi; Jamis, Catherine; Jensen, Kirsty; Kapetis, Dimos; Kerr, Tovah; Khalil, Sari S; Khatib, Hasan; Kolbehdari, Davood; Kumar, Charu G; Kumar, Dinesh; Leach, Richard; Lee, Justin C-M; Li, Changxi; Logan, Krystin M; Malinverni, Roberto; Marques, Elisa; Martin, William F; Martins, Natalia F; Maruyama, Sandra R; Mazza, Raffaele; McLean, Kim L; Medrano, Juan F; Moreno, Barbara T; Moré, Daniela D; Muntean, Carl T; Nandakumar, Hari P; Nogueira, Marcelo F G; Olsaker, Ingrid; Pant, Sameer D; Panzitta, Francesca; Pastor, Rosemeire C P; Poli, Mario A; Poslusny, Nathan; Rachagani, Satyanarayana; Ranganathan, Shoba; Razpet, Andrej; Riggs, Penny K; Rincon, Gonzalo; Rodriguez-Osorio, Nelida; Rodriguez-Zas, Sandra L; Romero, Natasha E; Rosenwald, Anne; Sando, Lillian; Schmutz, Sheila M; Shen, Libing; Sherman, Laura; Southey, Bruce R; Lutzow, Ylva Strandberg; Sweedler, Jonathan V; Tammen, Imke; Telugu, Bhanu Prakash V L; Urbanski, Jennifer M; Utsunomiya, Yuri T; Verschoor, Chris P; Waardenberg, Ashley J; Wang, Zhiquan; Ward, Robert; Weikard, Rosemarie; Welsh, Thomas H; White, Stephen N; Wilming, Laurens G; Wunderlich, Kris R; Yang, Jianqi; Zhao, Feng-Qi

    2009-04-24

    To understand the biology and evolution of ruminants, the cattle genome was sequenced to about sevenfold coverage. The cattle genome contains a minimum of 22,000 genes, with a core set of 14,345 orthologs shared among seven mammalian species of which 1217 are absent or undetected in noneutherian (marsupial or monotreme) genomes. Cattle-specific evolutionary breakpoint regions in chromosomes have a higher density of segmental duplications, enrichment of repetitive elements, and species-specific variations in genes associated with lactation and immune responsiveness. Genes involved in metabolism are generally highly conserved, although five metabolic genes are deleted or extensively diverged from their human orthologs. The cattle genome sequence thus provides a resource for understanding mammalian evolution and accelerating livestock genetic improvement for milk and meat production.

  2. Stochastic Modeling based on Dictionary Approach for the Generation of Daily Precipitation Occurrences

    NASA Astrophysics Data System (ADS)

    Panu, U. S.; Ng, W.; Rasmussen, P. F.

    2009-12-01

    The modeling of weather states (i.e., precipitation occurrences) is critical when the historical data are not long enough for the desired analysis. Stochastic models (e.g., Markov Chain and Alternating Renewal Process (ARP)) of the precipitation occurrence processes generally assume the existence of short-term temporal-dependency between the neighboring states while implying the existence of long-term independency (randomness) of states in precipitation records. Existing temporal-dependent models for the generation of precipitation occurrences are restricted either by the fixed-length memory (e.g., the order of a Markov chain model), or by the reining states in segments (e.g., persistency of homogenous states within dry/wet-spell lengths of an ARP). The modeling of variable segment lengths and states could be an arduous task and a flexible modeling approach is required for the preservation of various segmented patterns of precipitation data series. An innovative Dictionary approach has been developed in the field of genome pattern recognition for the identification of frequently occurring genome segments in DNA sequences. The genome segments delineate the biologically meaningful ``words" (i.e., segments with a specific patterns in a series of discrete states) that can be jointly modeled with variable lengths and states. A meaningful “word”, in hydrology, can be referred to a segment of precipitation occurrence comprising of wet or dry states. Such flexibility would provide a unique advantage over the traditional stochastic models for the generation of precipitation occurrences. Three stochastic models, namely, the alternating renewal process using Geometric distribution, the second-order Markov chain model, and the Dictionary approach have been assessed to evaluate their efficacy for the generation of daily precipitation sequences. Comparisons involved three guiding principles namely (i) the ability of models to preserve the short-term temporal-dependency in data through the concepts of autocorrelation, average mutual information, and Hurst exponent, (ii) the ability of models to preserve the persistency within the homogenous dry/wet weather states through analysis of dry/wet-spell lengths between the observed and generated data, and (iii) the ability to assesses the goodness-of-fit of models through the likelihood estimates (i.e., AIC and BIC). Past 30 years of observed daily precipitation records from 10 Canadian meteorological stations were utilized for comparative analyses of the three models. In general, the Markov chain model performed well. The remainders of the models were found to be competitive from one another depending upon the scope and purpose of the comparison. Although the Markov chain model has a certain advantage in the generation of daily precipitation occurrences, the structural flexibility offered by the Dictionary approach in modeling the varied segment lengths of heterogeneous weather states provides a distinct and powerful advantage in the generation of precipitation sequences.

  3. Genomic and phylogenetic evidence that Maize rough dwarf and Rice black-streaked dwarf fijiviruses should be classified as different geographic strains of a single species.

    PubMed

    Xie, L; Lv, M-F; Yang, J; Chen, J-P; Zhang, H-M

    Maize rough dwarf disease (MRDD) has long been known as one of the most devastating viral diseases of maize worldwide and is caused by single or complex infection by four fijiviruses: Maize rough dwarf virus (MRDV) in Europe and the Middle East, Mal de Rio Cuarto virus (MRCV) in South America, rice black-streaked dwarf virus (RBSDV), and Southern rice black-streaked dwarf virus (SRBSDV or Rice black-streaked dwarf virus 2, RBSDV-2) in East Asia. These are currently classified as four distinct species in the genus Fijivirus, family Reoviridae, but their taxonomic status has been questioned. To help resolve this, the nucleotide sequences of the ten genomic segments of an Italian isolate of MRDV have been determined, providing the first complete genomic sequence of this virus. Its genome has 29144 nucleotides and is similar in organization to those of RBSDV, SRBSDV, and MRCV. The 13 ORFs always share highest identities (81.3-97.2%) with the corresponding ORFs of RBSDV and phylogenetic analyses of the different genome segments and ORFs all confirm that MRDV clusters most closely with RBSDV and that MRCV and SRBSDV are slightly more distantly related. The results suggest that MRDV and RBSDV should be classified as different geographic strains of the same virus species and we suggest the name cereal black-streaked dwarf fijivirus (CBSDV) for consideration.

  4. An Exact Algorithm to Compute the Double-Cut-and-Join Distance for Genomes with Duplicate Genes.

    PubMed

    Shao, Mingfu; Lin, Yu; Moret, Bernard M E

    2015-05-01

    Computing the edit distance between two genomes is a basic problem in the study of genome evolution. The double-cut-and-join (DCJ) model has formed the basis for most algorithmic research on rearrangements over the last few years. The edit distance under the DCJ model can be computed in linear time for genomes without duplicate genes, while the problem becomes NP-hard in the presence of duplicate genes. In this article, we propose an integer linear programming (ILP) formulation to compute the DCJ distance between two genomes with duplicate genes. We also provide an efficient preprocessing approach to simplify the ILP formulation while preserving optimality. Comparison on simulated genomes demonstrates that our method outperforms MSOAR in computing the edit distance, especially when the genomes contain long duplicated segments. We also apply our method to assign orthologous gene pairs among human, mouse, and rat genomes, where once again our method outperforms MSOAR.

  5. [Analysis of genomic copy number variations in two unrelated neonates with 8p deletion and duplication associated with congenital heart disease].

    PubMed

    Mei, Mei; Yang, Lin; Zhan, Guodong; Wang, Huijun; Ma, Duan; Zhou, Wenhao; Huang, Guoying

    2014-06-01

    To screen for genomic copy number variations (CNVs) in two unrelated neonates with multiple congenital abnormalities using Affymetrix SNP chip and try to find the critical region associated with congenital heart disease. Two neonates were tested for genomic copy number variations by using Cytogenetic SNP chip.Rare CNVs with potential clinical significance were selected of which deletion segments' size was larger than 50 kb and duplication segments' size was larger than 150 kb based on the analysis of ChAs software, without false positive CNVs and segments of normal population. The identified CNVs were compared with those of the cases in DECIPHER and ISCA databases. Eleven rare CNVs with size from 546.6-27 892 kb were identified in the 2 neonates. The deletion region and size of case 1 were 8p23.3-p23.1 (387 912-11 506 771 bp) and 11.1 Mb respectively, the duplication region and size of case 1 were 8p23.1-p11.1 (11 508 387-43 321 279 bp) and 31.8 Mb respectively. The deletion region and size of case 2 were 8p23.3-p23.1 (46 385-7 809 878 bp) and 7.8 Mb respectively, the duplication region and size of case 2 were 8p23.1-p11.21 (12 260 914-40 917 092 bp) and 28.7 Mb respectively. The comparison with Decipher and ISCA databases supported previous viewpoint that 8p23.1 had been associated with congenital heart disease and the region between 7 809 878-11 506 771 bp may play a role in the severe cardiac defects associated with 8p23.1 deletions. Case 1 had serious cardiac abnormalities whose GATA4 was located in the duplication segment and the copy number increased while SOX7 was located in the deletion segment and the copy number decreased. The region between 7 809 878-11 506 771 bp in 8p23.1 is associated with heart defects and copy number variants of SOX7 and GATA4 may result in congenital heart disease.

  6. Plant MicroRNA Prediction by Supervised Machine Learning Using C5.0 Decision Trees.

    PubMed

    Williams, Philip H; Eyles, Rod; Weiller, Georg

    2012-01-01

    MicroRNAs (miRNAs) are nonprotein coding RNAs between 20 and 22 nucleotides long that attenuate protein production. Different types of sequence data are being investigated for novel miRNAs, including genomic and transcriptomic sequences. A variety of machine learning methods have successfully predicted miRNA precursors, mature miRNAs, and other nonprotein coding sequences. MirTools, mirDeep2, and miRanalyzer require "read count" to be included with the input sequences, which restricts their use to deep-sequencing data. Our aim was to train a predictor using a cross-section of different species to accurately predict miRNAs outside the training set. We wanted a system that did not require read-count for prediction and could therefore be applied to short sequences extracted from genomic, EST, or RNA-seq sources. A miRNA-predictive decision-tree model has been developed by supervised machine learning. It only requires that the corresponding genome or transcriptome is available within a sequence window that includes the precursor candidate so that the required sequence features can be collected. Some of the most critical features for training the predictor are the miRNA:miRNA(∗) duplex energy and the number of mismatches in the duplex. We present a cross-species plant miRNA predictor with 84.08% sensitivity and 98.53% specificity based on rigorous testing by leave-one-out validation.

  7. The NAD+ precursor nicotinic acid improves genomic integrity in human peripheral blood mononuclear cells after X-irradiation.

    PubMed

    Weidele, Kathrin; Beneke, Sascha; Bürkle, Alexander

    2017-04-01

    NAD + is an essential cofactor for enzymes catalyzing redox-reactions as well as an electron carrier in energy metabolism. Aside from this, NAD + consuming enzymes like poly(ADP-ribose) polymerases and sirtuins are important regulators involved in chromatin-restructuring processes during repair and epigenetics/transcriptional adaption. In order to replenish cellular NAD + levels after cleavage, synthesis starts from precursors such as nicotinamide, nicotinamide riboside or nicotinic acid to match the need for this essential molecule. In the present study, we investigated the impact of supplementation with nicotinic acid on resting and proliferating human mononuclear blood cells with a focus on DNA damage and repair processes. We observed that nicotinic acid supplementation increased NAD + levels as well as DNA repair efficiency and enhanced genomic stability evaluated by micronucleus test after x-ray treatment. Interestingly, resting cells displayed lower basal levels of DNA breaks compared to proliferating cells, but break-induction rates were identical. Despite similar levels of p53 protein upregulation after irradiation, higher NAD + concentrations led to reduced acetylation of this protein, suggesting enhanced SIRT1 activity. Our data reveal that even in normal primary human cells cellular NAD + levels may be limiting under conditions of genotoxic stress and that boosting the NAD + system with nicotinic acid can improve genomic stability. Copyright © 2017 Elsevier B.V. All rights reserved.

  8. Engineering improved bio-jet fuel tolerance in Escherichia coli using a transgenic library from the hydrocarbon-degrader Marinobacter aquaeolei.

    PubMed

    Tomko, Timothy A; Dunlop, Mary J

    2015-01-01

    Recent metabolic engineering efforts have generated microorganisms that can produce biofuels, including bio-jet fuels, however these fuels are often toxic to cells, limiting production yields. There are natural examples of microorganisms that have evolved mechanisms for tolerating hydrocarbon-rich environments, such as those that thrive near natural oil seeps and in oil-polluted waters. Using genomic DNA from the hydrocarbon-degrading microbe Marinobacter aquaeolei, we constructed a transgenic library that we expressed in Escherichia coli. We exposed cells to inhibitory levels of pinene, a monoterpene that can serve as a jet fuel precursor with chemical properties similar to existing tactical fuels. Using a sequential strategy with a fosmid library followed by a plasmid library, we were able to isolate a region of DNA from the M. aquaeolei genome that conferred pinene tolerance when expressed in E. coli. We determined that a single gene, yceI, was responsible for the tolerance improvements. Overexpression of this gene placed no additional burden on the host. We also tested tolerance to other monoterpenes and showed that yceI selectively improves tolerance. The genomes of hydrocarbon-tolerant microbes represent a rich resource for tolerance engineering. Using a transgenic library, we were able to identify a single gene that improves E. coli's tolerance to the bio-jet fuel precursor pinene.

  9. Genome-wide network analysis of Wnt signaling in three pediatric cancers

    NASA Astrophysics Data System (ADS)

    Bao, Ju; Lee, Ho-Jin; Zheng, Jie J.

    2013-10-01

    Genomic structural alteration is common in pediatric cancers, and analysis of data generated by the Pediatric Cancer Genome Project reveals such tumor-related alterations in many Wnt signaling-associated genes. Most pediatric cancers are thought to arise within developing tissues that undergo substantial expansion during early organ formation, growth and maturation, and Wnt signaling plays an important role in this development. We examined three pediatric tumors--medullobastoma, early T-cell precursor acute lymphoblastic leukemia, and retinoblastoma--that show multiple genomic structural variations within Wnt signaling pathways. We mathematically modeled this pathway to investigate the effects of cancer-related structural variations on Wnt signaling. Surprisingly, we found that an outcome measure of canonical Wnt signaling was consistently similar in matched cancer cells and normal cells, even in the context of different cancers, different mutations, and different Wnt-related genes. Our results suggest that the cancer cells maintain a normal level of Wnt signaling by developing multiple mutations.

  10. Genome Sequence of “Candidatus Walczuchella monophlebidarum” the Flavobacterial Endosymbiont of Llaveia axin axin (Hemiptera: Coccoidea: Monophlebidae)

    PubMed Central

    Rosas-Pérez, Tania; Rosenblueth, Mónica; Rincón-Rosales, Reiner; Mora, Jaime; Martínez-Romero, Esperanza

    2014-01-01

    Scale insects (Hemiptera: Coccoidae) constitute a very diverse group of sap-feeding insects with a large diversity of symbiotic associations with bacteria. Here, we present the complete genome sequence, metabolic reconstruction, and comparative genomics of the flavobacterial endosymbiont of the giant scale insect Llaveia axin axin. The gene repertoire of its 309,299 bp genome was similar to that of other flavobacterial insect endosymbionts though not syntenic. According to its genetic content, essential amino acid biosynthesis is likely to be the flavobacterial endosymbiont's principal contribution to the symbiotic association with its insect host. We also report the presence of a γ-proteobacterial symbiont that may be involved in waste nitrogen recycling and also has amino acid biosynthetic capabilities that may provide metabolic precursors to the flavobacterial endosymbiont. We propose “Candidatus Walczuchella monophlebidarum” as the name of the flavobacterial endosymbiont of insects from the Monophlebidae family. PMID:24610838

  11. Draft Genome Sequences from a Novel Clade of Bacillus cereus Sensu Lato Strains, Isolated from the International Space Station.

    PubMed

    Venkateswaran, Kasthuri; Checinska Sielaff, Aleksandra; Ratnayake, Shashikala; Pope, Robert K; Blank, Thomas E; Stepanov, Victor G; Fox, George E; van Tongeren, Sandra P; Torres, Clinton; Allen, Jonathan; Jaing, Crystal; Pierson, Duane; Perry, Jay; Koren, Sergey; Phillippy, Adam; Klubnik, Joy; Treangen, Todd J; Rosovitz, M J; Bergman, Nicholas H

    2017-08-10

    The draft genome sequences of six Bacillus strains, isolated from the International Space Station and belonging to the Bacillus anthracis - B. cereus - B. thuringiensis group, are presented here. These strains were isolated from the Japanese Experiment Module (one strain), U.S. Harmony Node 2 (three strains), and Russian Segment Zvezda Module (two strains). Copyright © 2017 Venkateswaran et al.

  12. Targeted isolation, sequence assembly and characterization of two white spruce (Picea glauca) BAC clones for terpenoid synthase and cytochrome P450 genes involved in conifer defence reveal insights into a conifer genome

    PubMed Central

    2009-01-01

    Background Conifers are a large group of gymnosperm trees which are separated from the angiosperms by more than 300 million years of independent evolution. Conifer genomes are extremely large and contain considerable amounts of repetitive DNA. Currently, conifer sequence resources exist predominantly as expressed sequence tags (ESTs) and full-length (FL)cDNAs. There is no genome sequence available for a conifer or any other gymnosperm. Conifer defence-related genes often group into large families with closely related members. The goals of this study are to assess the feasibility of targeted isolation and sequence assembly of conifer BAC clones containing specific genes from two large gene families, and to characterize large segments of genomic DNA sequence for the first time from a conifer. Results We used a PCR-based approach to identify BAC clones for two target genes, a terpene synthase (3-carene synthase; 3CAR) and a cytochrome P450 (CYP720B4) from a non-arrayed genomic BAC library of white spruce (Picea glauca). Shotgun genomic fragments isolated from the BAC clones were sequenced to a depth of 15.6- and 16.0-fold coverage, respectively. Assembly and manual curation yielded sequence scaffolds of 172 kbp (3CAR) and 94 kbp (CYP720B4) long. Inspection of the genomic sequences revealed the intron-exon structures, the putative promoter regions and putative cis-regulatory elements of these genes. Sequences related to transposable elements (TEs), high complexity repeats and simple repeats were prevalent and comprised approximately 40% of the sequenced genomic DNA. An in silico simulation of the effect of sequencing depth on the quality of the sequence assembly provides direction for future efforts of conifer genome sequencing. Conclusion We report the first targeted cloning, sequencing, assembly, and annotation of large segments of genomic DNA from a conifer. We demonstrate that genomic BAC clones for individual members of multi-member gene families can be isolated in a gene-specific fashion. The results of the present work provide important new information about the structure and content of conifer genomic DNA that will guide future efforts to sequence and assemble conifer genomes. PMID:19656416

  13. Targeted isolation, sequence assembly and characterization of two white spruce (Picea glauca) BAC clones for terpenoid synthase and cytochrome P450 genes involved in conifer defence reveal insights into a conifer genome.

    PubMed

    Hamberger, Björn; Hall, Dawn; Yuen, Mack; Oddy, Claire; Hamberger, Britta; Keeling, Christopher I; Ritland, Carol; Ritland, Kermit; Bohlmann, Jörg

    2009-08-06

    Conifers are a large group of gymnosperm trees which are separated from the angiosperms by more than 300 million years of independent evolution. Conifer genomes are extremely large and contain considerable amounts of repetitive DNA. Currently, conifer sequence resources exist predominantly as expressed sequence tags (ESTs) and full-length (FL)cDNAs. There is no genome sequence available for a conifer or any other gymnosperm. Conifer defence-related genes often group into large families with closely related members. The goals of this study are to assess the feasibility of targeted isolation and sequence assembly of conifer BAC clones containing specific genes from two large gene families, and to characterize large segments of genomic DNA sequence for the first time from a conifer. We used a PCR-based approach to identify BAC clones for two target genes, a terpene synthase (3-carene synthase; 3CAR) and a cytochrome P450 (CYP720B4) from a non-arrayed genomic BAC library of white spruce (Picea glauca). Shotgun genomic fragments isolated from the BAC clones were sequenced to a depth of 15.6- and 16.0-fold coverage, respectively. Assembly and manual curation yielded sequence scaffolds of 172 kbp (3CAR) and 94 kbp (CYP720B4) long. Inspection of the genomic sequences revealed the intron-exon structures, the putative promoter regions and putative cis-regulatory elements of these genes. Sequences related to transposable elements (TEs), high complexity repeats and simple repeats were prevalent and comprised approximately 40% of the sequenced genomic DNA. An in silico simulation of the effect of sequencing depth on the quality of the sequence assembly provides direction for future efforts of conifer genome sequencing. We report the first targeted cloning, sequencing, assembly, and annotation of large segments of genomic DNA from a conifer. We demonstrate that genomic BAC clones for individual members of multi-member gene families can be isolated in a gene-specific fashion. The results of the present work provide important new information about the structure and content of conifer genomic DNA that will guide future efforts to sequence and assemble conifer genomes.

  14. i-ADHoRe 2.0: an improved tool to detect degenerated genomic homology using genomic profiles.

    PubMed

    Simillion, Cedric; Janssens, Koen; Sterck, Lieven; Van de Peer, Yves

    2008-01-01

    i-ADHoRe is a software tool that combines gene content and gene order information of homologous genomic segments into profiles to detect highly degenerated homology relations within and between genomes. The new version offers, besides a significant increase in performance, several optimizations to the algorithm, most importantly to the profile alignment routine. As a result, the annotations of multiple genomes, or parts thereof, can be fed simultaneously into the program, after which it will report all regions of homology, both within and between genomes. The i-ADHoRe 2.0 package contains the C++ source code for the main program as well as various Perl scripts and a fully documented Perl API to facilitate post-processing. The software runs on any Linux- or -UNIX based platform. The package is freely available for academic users and can be downloaded from http://bioinformatics.psb.ugent.be/

  15. An overview on genome organization of marine organisms.

    PubMed

    Costantini, Maria

    2015-12-01

    In this review we will concentrate on some general genome features of marine organisms and their evolution, ranging from vertebrate to invertebrates until unicellular organisms. Before genome sequencing, the ultracentrifugation in CsCl led to high resolution of mammalian DNA (without seeing at the sequence). The analytical profile of human DNA showed that the vertebrate genome is a mosaic of isochores, typically megabase-size DNA segments that belong in a small number of families characterized by different GC levels. The recent availability of a number of fully sequenced genomes allowed mapping very precisely the isochores, based on DNA sequences. Since isochores are tightly linked to biological properties such as gene density, replication timing and recombination, the new level of detail provided by the isochore map helped the understanding of genome structure, function and evolution. This led the current level of knowledge and to further insights. Copyright © 2015. Published by Elsevier B.V.

  16. Relaxation dynamics of internal segments of DNA chains in nanochannels

    NASA Astrophysics Data System (ADS)

    Jain, Aashish; Muralidhar, Abhiram; Dorfman, Kevin; Dorfman Group Team

    We will present relaxation dynamics of internal segments of a DNA chain confined in nanochannel. The results have direct application in genome mapping technology, where long DNA molecules containing sequence-specific fluorescent probes are passed through an array of nanochannels to linearize them, and then the distances between these probes (the so-called ``DNA barcode'') are measured. The relaxation dynamics of internal segments set the experimental error due to dynamic fluctuations. We developed a multi-scale simulation algorithm, combining a Pruned-Enriched Rosenbluth Method (PERM) simulation of a discrete wormlike chain model with hard spheres with Brownian dynamics (BD) simulations of a bead-spring chain. Realistic parameters such as the bead friction coefficient and spring force law parameters are obtained from PERM simulations and then mapped onto the bead-spring model. The BD simulations are carried out to obtain the extension autocorrelation functions of various segments, which furnish their relaxation times. Interestingly, we find that (i) corner segments relax faster than the center segments and (ii) relaxation times of corner segments do not depend on the contour length of DNA chain, whereas the relaxation times of center segments increase linearly with DNA chain size.

  17. Segmentation of time series with long-range fractal correlations.

    PubMed

    Bernaola-Galván, P; Oliver, J L; Hackenberg, M; Coronado, A V; Ivanov, P Ch; Carpena, P

    2012-06-01

    Segmentation is a standard method of data analysis to identify change-points dividing a nonstationary time series into homogeneous segments. However, for long-range fractal correlated series, most of the segmentation techniques detect spurious change-points which are simply due to the heterogeneities induced by the correlations and not to real nonstationarities. To avoid this oversegmentation, we present a segmentation algorithm which takes as a reference for homogeneity, instead of a random i.i.d. series, a correlated series modeled by a fractional noise with the same degree of correlations as the series to be segmented. We apply our algorithm to artificial series with long-range correlations and show that it systematically detects only the change-points produced by real nonstationarities and not those created by the correlations of the signal. Further, we apply the method to the sequence of the long arm of human chromosome 21, which is known to have long-range fractal correlations. We obtain only three segments that clearly correspond to the three regions of different G + C composition revealed by means of a multi-scale wavelet plot. Similar results have been obtained when segmenting all human chromosome sequences, showing the existence of previously unknown huge compositional superstructures in the human genome.

  18. De novo transcriptome sequencing reveals a considerable bias in the incidence of simple sequence repeats towards the downstream of 'Pre-miRNAs' of black pepper.

    PubMed

    Joy, Nisha; Asha, Srinivasan; Mallika, Vijayan; Soniya, Eppurathu Vasudevan

    2013-01-01

    Next generation sequencing has an advantageon transformational development of species with limited available sequence data as it helps to decode the genome and transcriptome. We carried out the de novo sequencing using illuminaHiSeq™ 2000 to generate the first leaf transcriptome of black pepper (Piper nigrum L.), an important spice variety native to South India and also grown in other tropical regions. Despite the economic and biochemical importance of pepper, a scientifically rigorous study at the molecular level is far from complete due to lack of sufficient sequence information and cytological complexity of its genome. The 55 million raw reads obtained, when assembled using Trinity program generated 2,23,386 contigs and 1,28,157 unigenes. Reports suggest that the repeat-rich genomic regions give rise to small non-coding functional RNAs. MicroRNAs (miRNAs) are the most abundant type of non-coding regulatory RNAs. In spite of the widespread research on miRNAs, little is known about the hair-pin precursors of miRNAs bearing Simple Sequence Repeats (SSRs). We used the array of transcripts generated, for the in silico prediction and detection of '43 pre-miRNA candidates bearing different types of SSR motifs'. The analysis identified 3913 different types of SSR motifs with an average of one SSR per 3.04 MB of thetranscriptome. About 0.033% of the transcriptome constituted 'pre-miRNA candidates bearing SSRs'. The abundance, type and distribution of SSR motifs studied across the hair-pin miRNA precursors, showed a significant bias in the position of SSRs towards the downstream of predicted 'pre-miRNA candidates'. The catalogue of transcripts identified, together with the demonstration of reliable existence of SSRs in the miRNA precursors, permits future opportunities for understanding the genetic mechanism of black pepper and likely functions of 'tandem repeats' in miRNAs.

  19. De novo Transcriptome Sequencing Reveals a Considerable Bias in the Incidence of Simple Sequence Repeats towards the Downstream of ‘Pre-miRNAs’ of Black Pepper

    PubMed Central

    Joy, Nisha; Asha, Srinivasan; Mallika, Vijayan; Soniya, Eppurathu Vasudevan

    2013-01-01

    Next generation sequencing has an advantageon transformational development of species with limited available sequence data as it helps to decode the genome and transcriptome. We carried out the de novo sequencing using illuminaHiSeq™ 2000 to generate the first leaf transcriptome of black pepper (Piper nigrum L.), an important spice variety native to South India and also grown in other tropical regions. Despite the economic and biochemical importance of pepper, a scientifically rigorous study at the molecular level is far from complete due to lack of sufficient sequence information and cytological complexity of its genome. The 55 million raw reads obtained, when assembled using Trinity program generated 2,23,386 contigs and 1,28,157 unigenes. Reports suggest that the repeat-rich genomic regions give rise to small non-coding functional RNAs. MicroRNAs (miRNAs) are the most abundant type of non-coding regulatory RNAs. In spite of the widespread research on miRNAs, little is known about the hair-pin precursors of miRNAs bearing Simple Sequence Repeats (SSRs). We used the array of transcripts generated, for the in silico prediction and detection of ‘43 pre-miRNA candidates bearing different types of SSR motifs’. The analysis identified 3913 different types of SSR motifs with an average of one SSR per 3.04 MB of thetranscriptome. About 0.033% of the transcriptome constituted ‘pre-miRNA candidates bearing SSRs’. The abundance, type and distribution of SSR motifs studied across the hair-pin miRNA precursors, showed a significant bias in the position of SSRs towards the downstream of predicted ‘pre-miRNA candidates’. The catalogue of transcripts identified, together with the demonstration of reliable existence of SSRs in the miRNA precursors, permits future opportunities for understanding the genetic mechanism of black pepper and likely functions of ‘tandem repeats’ in miRNAs. PMID:23469176

  20. Biosynthetic studies on clavulanic acid: its biopathway and stereochemical course

    DOE Office of Scientific and Technical Information (OSTI.GOV)

    Mao, S.S.

    A degradative analysis allowed determination of the stereochemistry at C-9 of clavulanic acid produced by Streptomyces clavuigerus. An over-all inversion of configuration from the C/sub 5/-unit precursor ornithine was observed. The diastereomeric (1R,2R)- and (1S,2R)-(1-/sup 3/H)-glycerols were separately synthesized and administered. Complementary results demonstrated an overall retention of configuration paralleling cysteine incorporation in the biosynthesis of penicillin. 3-Hydroxyornithine, a potential precursor to clavulanic acid, was prepared by a 1,3-dipolar addition of a nitrone and vinylglycine. However, 3-hydroxyornithine was not taken up by the organism and this possible intermediate could not be shown to be a specific precursor to clavulanic acid.more » (2-/sup 3/H)-L-Ornithine displays a preferential incorporation relative to D-ornithine. An epimerization by a one-base mechanism is suggested by the retention of half the tritium activity. ..beta..-Alanine, a potential precursor of the ..beta..-lactam segment was examined and shown not to play a direct role in the biosynthesis. Further, 3-hydroxypropionyl-ornithine, a parallel amide to the tripeptide intermediate in penicillin biosynthesis, was not incorporated into clavulanic acid. The role of 3-hydroxypropionate and glycerol were examined in both starch and triglyceride fermentation media.« less

  1. CoCoNUT: an efficient system for the comparison and analysis of genomes

    PubMed Central

    2008-01-01

    Background Comparative genomics is the analysis and comparison of genomes from different species. This area of research is driven by the large number of sequenced genomes and heavily relies on efficient algorithms and software to perform pairwise and multiple genome comparisons. Results Most of the software tools available are tailored for one specific task. In contrast, we have developed a novel system CoCoNUT (Computational Comparative geNomics Utility Toolkit) that allows solving several different tasks in a unified framework: (1) finding regions of high similarity among multiple genomic sequences and aligning them, (2) comparing two draft or multi-chromosomal genomes, (3) locating large segmental duplications in large genomic sequences, and (4) mapping cDNA/EST to genomic sequences. Conclusion CoCoNUT is competitive with other software tools w.r.t. the quality of the results. The use of state of the art algorithms and data structures allows CoCoNUT to solve comparative genomics tasks more efficiently than previous tools. With the improved user interface (including an interactive visualization component), CoCoNUT provides a unified, versatile, and easy-to-use software tool for large scale studies in comparative genomics. PMID:19014477

  2. Identification and characterization of the reptilian GnRH-II gene in the leopard gecko, Eublepharis macularius, and its evolutionary considerations.

    PubMed

    Ikemoto, Tadahiro; Park, Min Kyun

    2003-10-16

    To elucidate the molecular phylogeny and evolution of a particular peptide, one must analyze not the limited primary amino acid sequences of the low molecular weight mature polypeptide, but rather the sequences of the corresponding precursors from various species. Of all the structural variants of gonadotropin-releasing hormone (GnRH), GnRH-II (chicken GnRH-II, or cGnRH-II) is remarkably conserved without any sequence substitutions among vertebrates, but its precursor sequences vary considerably. We have identified and characterized the full-length complementary DNA (cDNA) encoding the GnRH-II precursor and determined its genomic structure, consisting of four exons and three introns, in a reptilian species, the leopard gecko Eublepharis macularius. This is the first report about the GnRH-II precursor cDNA/gene from reptiles. The deduced leopard gecko prepro-GnRH-II polypeptide had the highest identities with the corresponding polypeptides of amphibians. The GnRH-II precursor mRNA was detected in more than half of the tissues and organs examined. This widespread expression is consistent with the previous findings in several species, though the roles of GnRH outside the hypothalamus-pituitary-gonadal axis remain largely unknown. Molecular phylogenetic analysis combined with sequence comparison showed that the leopard gecko is more similar to fishes and amphibians than to eutherian mammals with respect to the GnRH-II precursor sequence. These results strongly suggest that the divergence of the GnRH-II precursor sequences seen in eutherian mammals may have occurred along with amniote evolution.

  3. Plant STAND P-loop NTPases: a current perspective of genome distribution, evolution, and function : Plant STAND P-loop NTPases: genomic organization, evolution, and molecular mechanism models contribute broadly to plant pathogen defense.

    PubMed

    Arya, Preeti; Acharya, Vishal

    2018-02-01

    STAND P-loop NTPase is the common weapon used by plant and other organisms from all three kingdoms of life to defend themselves against pathogen invasion. The purpose of this study is to review comprehensively the latest finding of plant STAND P-loop NTPase related to their genomic distribution, evolution, and their mechanism of action. Earlier, the plant STAND P-loop NTPase known to be comprised of only NBS-LRRs/AP-ATPase/NB-ARC ATPase. However, recent finding suggests that genome of early green plants comprised of two types of STAND P-loop NTPases: (1) mammalian NACHT NTPases and (2) NBS-LRRs. Moreover, YchF (unconventional G protein and members of P-loop NTPase) subfamily has been reported to be exceptionally involved in biotic stress (in case of Oryza sativa), thereby a novel member of STAND P-loop NTPase in green plants. The lineage-specific expansion and genome duplication events are responsible for abundance of plant STAND P-loop NTPases; where "moderate tandem and low segmental duplication" trajectory followed in majority of plant species with few exception (equal contribution of tandem and segmental duplication). Since the past decades, systematic research is being investigated into NBS-LRR function supported the direct recognition of pathogen or pathogen effectors by the latest models proposed via 'integrated decoy' or 'sensor domains' model. Here, we integrate the recently published findings together with the previous literature on the genomic distribution, evolution, and distinct models proposed for functional molecular mechanism of plant STAND P-loop NTPases.

  4. Genome Engineering in Bacillus anthracis Using Cre Recombinase

    PubMed Central

    Pomerantsev, Andrei P.; Sitaraman, Ramakrishnan; Galloway, Craig R.; Kivovich, Violetta; Leppla, Stephen H.

    2006-01-01

    Genome engineering is a powerful method for the study of bacterial virulence. With the availability of the complete genomic sequence of Bacillus anthracis, it is now possible to inactivate or delete selected genes of interest. However, many current methods for disrupting or deleting more than one gene require use of multiple antibiotic resistance determinants. In this report we used an approach that temporarily inserts an antibiotic resistance marker into a selected region of the genome and subsequently removes it, leaving the target region (a single gene or a larger genomic segment) permanently mutated. For this purpose, a spectinomycin resistance cassette flanked by bacteriophage P1 loxP sites oriented as direct repeats was inserted within a selected gene. After identification of strains having the spectinomycin cassette inserted by a double-crossover event, a thermo-sensitive plasmid expressing Cre recombinase was introduced at the permissive temperature. Cre recombinase action at the loxP sites excised the spectinomycin marker, leaving a single loxP site within the targeted gene or genomic segment. The Cre-expressing plasmid was then removed by growth at the restrictive temperature. The procedure could then be repeated to mutate additional genes. In this way, we sequentially mutated two pairs of genes: pepM and spo0A, and mcrB and mrr. Furthermore, loxP sites introduced at distant genes could be recombined by Cre recombinase to cause deletion of large intervening regions. In this way, we deleted the capBCAD region of the pXO2 plasmid and the entire 30 kb of chromosomal DNA between the mcrB and mrr genes, and in the latter case we found that the 32 intervening open reading frames were not essential to growth. PMID:16369025

  5. Solving the problem of Trans-Genomic Query with alignment tables.

    PubMed

    Parker, Douglass Stott; Hsiao, Ruey-Lung; Xing, Yi; Resch, Alissa M; Lee, Christopher J

    2008-01-01

    The trans-genomic query (TGQ) problem--enabling the free query of biological information, even across genomes--is a central challenge facing bioinformatics. Solutions to this problem can alter the nature of the field, moving it beyond the jungle of data integration and expanding the number and scope of questions that can be answered. An alignment table is a binary relationship on locations (sequence segments). An important special case of alignment tables are hit tables ? tables of pairs of highly similar segments produced by alignment tools like BLAST. However, alignment tables also include general binary relationships, and can represent any useful connection between sequence locations. They can be curated, and provide a high-quality queryable backbone of connections between biological information. Alignment tables thus can be a natural foundation for TGQ, as they permit a central part of the TGQ problem to be reduced to purely technical problems involving tables of locations.Key challenges in implementing alignment tables include efficient representation and indexing of sequence locations. We define a location datatype that can be incorporated naturally into common off-the-shelf database systems. We also describe an implementation of alignment tables in BLASTGRES, an extension of the open-source POSTGRESQL database system that provides indexing and operators on locations required for querying alignment tables. This paper also reviews several successful large-scale applications of alignment tables for Trans-Genomic Query. Tables with millions of alignments have been used in queries about alternative splicing, an area of genomic analysis concerning the way in which a single gene can yield multiple transcripts. Comparative genomics is a large potential application area for TGQ and alignment tables.

  6. Orphan and gene related CpG Islands follow power-law-like distributions in several genomes: evidence of function-related and taxonomy-related modes of distribution.

    PubMed

    Tsiagkas, Giannis; Nikolaou, Christoforos; Almirantis, Yannis

    2014-12-01

    CpG Islands (CGIs) are compositionally defined short genomic stretches, which have been studied in the human, mouse, chicken and later in several other genomes. Initially, they were assigned the role of transcriptional regulation of protein-coding genes, especially the house-keeping ones, while more recently there is found evidence that they are involved in several other functions as well, which might include regulation of the expression of RNA genes, DNA replication etc. Here, an investigation of their distributional characteristics in a variety of genomes is undertaken for both whole CGI populations as well as for CGI subsets that lie away from known genes (gene-unrelated or "orphan" CGIs). In both cases power-law-like linearity in double logarithmic scale is found. An evolutionary model, initially put forward for the explanation of a similar pattern found in gene populations is implemented. It includes segmental duplication events and eliminations of most of the duplicated CGIs, while a moderate rate of non-duplicated CGI eliminations is also applied in some cases. Simulations reproduce all the main features of the observed inter-CGI chromosomal size distributions. Our results on power-law-like linearity found in orphan CGI populations suggest that the observed distributional pattern is independent of the analogous pattern that protein coding segments were reported to follow. The power-law-like patterns in the genomic distributions of CGIs described herein are found to be compatible with several other features of the composition, abundance or functional role of CGIs reported in the current literature across several genomes, on the basis of the proposed evolutionary model. Copyright © 2014 Elsevier Ltd. All rights reserved.

  7. Genetic transformation of the fungal pathogen responsible for rice blast disease

    PubMed Central

    Parsons, Kenneth A.; Chumley, Forrest G.; Valent, Barbara

    1987-01-01

    The analysis of complex genetic determinants that control the ability of a fungus to colonize its host has been impaired by the lack of sophisticated genetic tools for characterizing important pathogens. We have developed a system for the genetic transformation of Magnaporthe grisea, the causal agent of rice blast disease, to overcome this limitation. A M. grisea arginine auxotroph was shown to contain a mutation (arg3-12) that abolishes ornithine carbamoyltransferase activity. M. grisea strains that contain arg3-12 were used as recipients in transformation experiments with plasmid pMA2, which carries the ArgB+ gene from Aspergillus nidulans. Stable prototrophic transformants arose at a frequency of about 35 per microgram of plasmid DNA. Integration of single or multiple plasmid copies occurred at a single site in the genome of each transformant; rearrangements were often created during integration. When M. grisea genomic segments were incorporated into pMA2, the presence of any one of five different M. grisea segments did not greatly affect the efficiency of transformation. Integration via homologous recombination occurred when the donor plasmid was linearized by cleaving at a unique restriction site within the M. grisea segment. Images PMID:16593854

  8. Advancing The Cancer Genome Atlas glioma MRI collections with expert segmentation labels and radiomic features

    PubMed Central

    Bakas, Spyridon; Akbari, Hamed; Sotiras, Aristeidis; Bilello, Michel; Rozycki, Martin; Kirby, Justin S.; Freymann, John B.; Farahani, Keyvan; Davatzikos, Christos

    2017-01-01

    Gliomas belong to a group of central nervous system tumors, and consist of various sub-regions. Gold standard labeling of these sub-regions in radiographic imaging is essential for both clinical and computational studies, including radiomic and radiogenomic analyses. Towards this end, we release segmentation labels and radiomic features for all pre-operative multimodal magnetic resonance imaging (MRI) (n=243) of the multi-institutional glioma collections of The Cancer Genome Atlas (TCGA), publicly available in The Cancer Imaging Archive (TCIA). Pre-operative scans were identified in both glioblastoma (TCGA-GBM, n=135) and low-grade-glioma (TCGA-LGG, n=108) collections via radiological assessment. The glioma sub-region labels were produced by an automated state-of-the-art method and manually revised by an expert board-certified neuroradiologist. An extensive panel of radiomic features was extracted based on the manually-revised labels. This set of labels and features should enable i) direct utilization of the TCGA/TCIA glioma collections towards repeatable, reproducible and comparative quantitative studies leading to new predictive, prognostic, and diagnostic assessments, as well as ii) performance evaluation of computer-aided segmentation methods, and comparison to our state-of-the-art method. PMID:28872634

  9. From 20th century metabolic wall charts to 21st century systems biology: database of mammalian metabolic enzymes.

    PubMed

    Corcoran, Callan C; Grady, Cameron R; Pisitkun, Trairak; Parulekar, Jaya; Knepper, Mark A

    2017-03-01

    The organization of the mammalian genome into gene subsets corresponding to specific functional classes has provided key tools for systems biology research. Here, we have created a web-accessible resource called the Mammalian Metabolic Enzyme Database ( https://hpcwebapps.cit.nih.gov/ESBL/Database/MetabolicEnzymes/MetabolicEnzymeDatabase.html) keyed to the biochemical reactions represented on iconic metabolic pathway wall charts created in the previous century. Overall, we have mapped 1,647 genes to these pathways, representing ~7 percent of the protein-coding genome. To illustrate the use of the database, we apply it to the area of kidney physiology. In so doing, we have created an additional database ( Database of Metabolic Enzymes in Kidney Tubule Segments: https://hpcwebapps.cit.nih.gov/ESBL/Database/MetabolicEnzymes/), mapping mRNA abundance measurements (mined from RNA-Seq studies) for all metabolic enzymes to each of 14 renal tubule segments. We carry out bioinformatics analysis of the enzyme expression pattern among renal tubule segments and mine various data sources to identify vasopressin-regulated metabolic enzymes in the renal collecting duct. Copyright © 2017 the American Physiological Society.

  10. Sequence diversity of wheat mosaic virus isolates.

    PubMed

    Stewart, Lucy R

    2016-02-02

    Wheat mosaic virus (WMoV), transmitted by eriophyid wheat curl mites (Aceria tosichella) is the causal agent of High Plains disease in wheat and maize. WMoV and other members of the genus Emaravirus evaded thorough molecular characterization for many years due to the experimental challenges of mite transmission and manipulating multisegmented negative sense RNA genomes. Recently, the complete genome sequence of a Nebraska isolate of WMoV revealed eight segments, plus a variant sequence of the nucleocapsid protein-encoding segment. Here, near-complete and partial consensus sequences of five more WMoV isolates are reported and compared to the Nebraska isolate: an Ohio maize isolate (GG1), a Kansas barley isolate (KS7), and three Ohio wheat isolates (H1, K1, W1). Results show two distinct groups of WMoV isolates: Ohio wheat isolate RNA segments had 84% or lower nucleotide sequence identity to the NE isolate, whereas GG1 and KS7 had 98% or higher nucleotide sequence identity to the NE isolate. Knowledge of the sequence variability of WMoV isolates is a step toward understanding virus biology, and potentially explaining observed biological variation. Published by Elsevier B.V.

  11. Direct-to-consumer personalized genomic testing

    PubMed Central

    Bloss, Cinnamon S.; Darst, Burcu F.; Topol, Eric J.; Schork, Nicholas J.

    2011-01-01

    Over the past 18 months, there have been notable developments in the direct-to-consumer (DTC) genomic testing arena, in particular with regard to issues surrounding governmental regulation in the USA. While commentaries continue to proliferate on this topic, actual empirical research remains relatively scant. In terms of DTC genomic testing for disease susceptibility, most of the research has centered on uptake, perceptions and attitudes toward testing among health care professionals and consumers. Only a few available studies have examined actual behavioral response among consumers, and we are not aware of any studies that have examined response to DTC genetic testing for ancestry or for drug response. We propose that further research in this area is desperately needed, despite challenges in designing appropriate studies given the rapid pace at which the field is evolving. Ultimately, DTC genomic testing for common markers and conditions is only a precursor to the eventual cost-effectiveness and wide availability of whole genome sequencing of individuals, although it remains unclear whether DTC genomic information will still be attainable. Either way, however, current knowledge needs to be extended and enhanced with respect to the delivery, impact and use of increasingly accurate and comprehensive individualized genomic data. PMID:21828075

  12. Enzyme-triggered cargo release from methionine sulfoxide containing copolypeptide vesicles.

    PubMed

    Rodriguez, April R; Kramer, Jessica R; Deming, Timothy J

    2013-10-14

    We have developed a facile, scalable method for preparation of enzyme-responsive copolypeptide vesicles that requires no protecting groups or expensive components. We designed amphiphilic copolypeptides containing segments of water-soluble methionine sulfoxide, M(O), residues that were prepared by synthesis of a fully hydrophobic precursor diblock copolypeptide, poly(l-methionine)65-b-poly(L-leucine0.5-stat-L-phenylalanine0.5)20, M65(L0.5/F0.5)20, followed by its direct oxidation in water to give the amphiphilic M(O) derivative, M(O)65(L0.5/F0.5)20. Assembly of M(O)65(L0.5/F0.5)20 in water gave vesicles with average diameters of a few micrometers that could then be extruded to nanoscale diameters. The M(O) segments in the vesicles were found to be substrates for reductase enzymes, which regenerated hydrophobic M segments and resulted in a change in supramolecular morphology that caused vesicle disruption and release of cargos.

  13. One-pot synthesis of MWW zeolite nanosheets using a rationally designed organic structure-directing agent

    DOE PAGES

    Luo, Helen Y.; Michaelis, Vladimir K.; Hodges, Sydney; ...

    2015-07-22

    A new material MIT-1 comprised of delaminated MWW zeolite nanosheets is synthesized in one-pot using a rationally designed organic structure-directing agent (OSDA). The OSDA is comprised of a hydrophilic head segment that resembles the OSDA used to synthesize the zeolite precursor MCM22(P), a hydrophobic tail segment that resembles the swelling agent used to swell MCM22(P), and a di-quaternary ammonium linker that connects both segments. MIT-1 features high crystallinity and surface areas exceeding 500 m 2g -1, and can be synthesized over a wide synthesis window that includes Si/Al ratios ranging from 13 to 67. Characterization data reveal high mesoporosity andmore » acid strength with no detectable amorphous silica phases. In conclusion, compared to MCM-22 and MCM-56, MIT-1 shows a three-fold increase in catalytic activity for the Friedel-Crafts alkylation of benzene with benzyl alcohol.« less

  14. Experimental analysis of control mechanisms in somite segmentation in avian embryos. II. Reduction of material in the gastrula stages of the chick.

    PubMed

    Bellairs, R; Veini, M

    1984-02-01

    A new theory of control of somite segmentation in chick embryos is proposed. This supposses that tiny clusters of already programmed cells are present throughout the presumptive somite area at stage 4, but that in order to fulfill their destiny they probably depend on the addition of further cells from the primitive streak. Evidence is based on the two groups of experiments: a) Experiments involving transection across the primitive streak at various stages, (which results in a 'tail' which possesses mesodermal derivatives) and across the segmental plate (which results in a 'tail' lacking mesodermal derivatives). b) Experiments in which parts of embryos have been explanted with or without their primitive streak. It is suggested that the initial clusters of pre-programmed cells move further and further posteriorly, developing into somitomeres (the precursors of true somites) only as they receive re-inforcements from the primitive streak or, ultimately, from the tail bud.

  15. Substitutions of short heterologous DNA segments of intragenomic or extragenomic origins produce clustered genomic polymorphisms

    PubMed Central

    Harms, Klaus; Lunnan, Asbjørn; Hülter, Nils; Mourier, Tobias; Vinner, Lasse; Andam, Cheryl P.; Marttinen, Pekka; Fridholm, Helena; Hansen, Anders Johannes; Hanage, William P.; Nielsen, Kaare Magne; Willerslev, Eske; Johnsen, Pål Jarle

    2016-01-01

    In a screen for unexplained mutation events we identified a previously unrecognized mechanism generating clustered DNA polymorphisms such as microindels and cumulative SNPs. The mechanism, short-patch double illegitimate recombination (SPDIR), facilitates short single-stranded DNA molecules to invade and replace genomic DNA through two joint illegitimate recombination events. SPDIR is controlled by key components of the cellular genome maintenance machinery in the gram-negative bacterium Acinetobacter baylyi. The source DNA is primarily intragenomic but can also be acquired through horizontal gene transfer. The DNA replacements are nonreciprocal and locus independent. Bioinformatic approaches reveal occurrence of SPDIR events in the gram-positive human pathogen Streptococcus pneumoniae and in the human genome. PMID:27956618

  16. Complete Genome Sequence of an Isolate of Bluetongue Virus Serotype 2, Demonstrating Circulation of a Western Topotype in Southern India

    PubMed Central

    Maan, Narender S.; Maan, Sushila; Guimera, Marc; Pullinger, Gillian; Singh, Karam Pal; Nomikou, Kyriaki; Belaganahalli, Manjunatha N.

    2012-01-01

    Bluetongue virus serotype 2 (IND2003/02) was isolated in Tiruneveli City, Tamil Nadu State, India, and is stored in the Orbivirus Reference Collection at the Institute for Animal Health, Pirbright, United Kingdom. The entire genome of this isolate was sequenced, showing that it is composed of a total of 19,203 bp (all 10 genome segments). This is the first report of the entire genome sequence of a western strain of BTV-2 isolated in India, indicating that this virus has been introduced and is circulating in the region. These data will aid in the development of diagnostics and molecular epidemiology studies of BTV-2 in the subcontinent. PMID:22492927

  17. Digital genotyping of avian influenza viruses of H7 subtype detected in central Europe in 2007-2011.

    PubMed

    Nagy, Alexander; Cerníková, Lenka; Křivda, Vlastimil; Horníčková, Jitka

    2012-05-01

    The objective of our study was to provide a genotype analysis of H7N7 and H7N9 influenza A viruses (IAV) and infer their relationships to co-circulating non-H7 IAV genomes. The H7N7 strains were collected in central Europe (Hungary-1, Czech Republic-1, Slovenia-1 and Poland-4) and the H7N9 in the Czech Republic and Spain between 2007 and 2011. Hand in hand with this effort, a novel IAV genotype visualization approach called digital genotyping was developed. This approach relies on phylogenetic data summarization and transformation into a pixel array called a segment identity matrix. The digital genotyping revealed a complicated genetic interplay between the H7 and co-circulating non-H7 IAV genotypes. At the H7 IAV level the most obvious relationships were observed between one Polish H7N7/446/09 and Czech H7N7/11 viruses which, despite the special and temporal distance of 800 km and 15 months, retained at least 6/8 genome segments. Close relationships were also observed between the Czech H7N9, Polish and Slovenian H7N7 on one hand and Hungarian and Slovenian H7N7 isolates on the other. In addition the former genomes exhibited close interplays with the Czech H6N2/09 and H11N9/10-like viruses. The Czech and Spanish H7N9 genomes were completely different and 6/8 of the Czech H7N9-like segments were traced to either the Czech H3N8/07, H11N9/09 and Polish H7N7/09-like viruses. The results of digital genotyping correlated with the previous observations obtained on the Polish H7N7 isolates. As was demonstrated, the digital genotyping provides a well-arranged and easily interpretable output and may serve as an alternative genotyping tool useful for handling and analysing even a large panel of IAV genomes. Copyright © 2012 Elsevier B.V. All rights reserved.

  18. Genomic copy number variants: evidence for association with antibody response to anthrax vaccine adsorbed.

    PubMed

    Falola, Michael I; Wiener, Howard W; Wineinger, Nathan E; Cutter, Gary R; Kimberly, Robert P; Edberg, Jeffrey C; Arnett, Donna K; Kaslow, Richard A; Tang, Jianming; Shrestha, Sadeep

    2013-01-01

    Anthrax and its etiologic agent remain a biological threat. Anthrax vaccine is highly effective, but vaccine-induced IgG antibody responses vary widely following required doses of vaccinations. Such variation can be related to genetic factors, especially genomic copy number variants (CNVs) that are known to be enriched among genes with immunologic function. We have tested this hypothesis in two study populations from a clinical trial of anthrax vaccination. We performed CNV-based genome-wide association analyses separately on 794 European Americans and 200 African-Americans. Antibodies to protective antigen were measured at week 8 (early response) and week 30 (peak response) using an enzyme-linked immunosorbent assay. We used DNA microarray data (Affymetrix 6.0) and two CNV detection algorithms, hidden markov model (PennCNV) and circular binary segmentation (GeneSpring) to determine CNVs in all individuals. Multivariable regression analyses were used to identify CNV-specific associations after adjusting for relevant non-genetic covariates. Within the 22 autosomal chromosomes, 2,943 non-overlapping CNV regions were detected by both algorithms. Genomic insertions containing HLA-DRB5, DRB1 and DQA1/DRA genes in the major histocompatibility complex (MHC) region (chromosome 6p21.3) were moderately associated with elevated early antibody response (β = 0.14, p = 1.78×10(-3)) among European Americans, and the strongest association was observed between peak antibody response and a segmental insertion on chromosome 1, containing NBPF4, NBPF5, STXMP3, CLCC1, and GPSM2 genes (β = 1.66, p = 6.06×10(-5)). For African-Americans, segmental deletions spanning PRR20, PCDH17 and PCH68 genes on chromosome 13 were associated with elevated early antibody production (β = 0.18, p = 4.47×10(-5)). Population-specific findings aside, one genomic insertion on chromosome 17 (containing NSF, ARL17 and LRRC37A genes) was associated with elevated peak antibody response in both populations. Multiple CNV regions, including the one consisting of MHC genes that is consistent with earlier research, can be important to humoral immune responses to anthrax vaccine adsorbed.

  19. Low-pathogenic avian influenza virus A/turkey/Ontario/6213/1966 (H5N1) is the progenitor of highly pathogenic A/turkey/Ontario/7732/1966 (H5N9)

    PubMed Central

    Ping, Jihui; Selman, Mohammed; Tyler, Shaun; Forbes, Nicole; Keleta, Liya

    2012-01-01

    The first confirmed outbreak of highly pathogenic avian influenza (HPAI) virus infections in North America was caused by A/turkey/Ontario/7732/1966 (H5N9); however, the phylogeny of this virus is largely unknown. This study performed genomic sequence analysis of 11 avian influenza isolates from 1956 to 1979 for comparison with A/turkey/Ontario/7732/1966 (H5N9). Phylogenetic and genetic analyses included these viruses in combination with all known full-genome sequences of avian viruses isolated before 1981. It was shown that a low-pathogenic avian influenza virus, A/turkey/Ontario/6213/1966 (H5N1), that had been isolated 3 months previously, was the closest known genetic relative with six genome segments of common lineage encoding the polymerase subunits PB2, PB1 and PA, nucleoprotein (NP), haemagglutinin (HA) and non-structural (NS) proteins. The lineages of these genome segments included reassortment with other North American turkey viruses that were all rooted in North American wild waterfowl with the HA gene originating from the H5N2 serotype. The phylogenies demonstrated adaptation from North American wild birds to turkeys with the possible involvement of domestic waterfowl. The turkey isolate, A/turkey/Wisconsin/1968 (H5N9), was the second most closely related poultry isolate to A/turkey/Ontario/7732/1966 (H5N9), possessing five common lineage genome segments (PB2, PB1, PA, HA and neuraminidase). The A/turkey/Ontario/6213/1966 (H5N1) virus was more virulent than A/turkey/Wisconsin/68 (H5N9) for chicken embryos and mice, indicating a greater biological similarity to A/turkey/Ontario/7732/1966 (H5N9). Thus, A/turkey/Ontario/6213/1966 (H5N1) was identified as the closest known ancestral relative of HPAI A/turkey/Ontario/7732/1966 (H5N9), which will serve as a useful reference virus for characterizing the early genetic and biological properties associated with the emergence of pathogenic avian influenza strains. PMID:22592261

  20. Crop to wild introgression in lettuce: following the fate of crop genome segments in backcross populations

    PubMed Central

    2012-01-01

    Background After crop-wild hybridization, some of the crop genomic segments may become established in wild populations through selfing of the hybrids or through backcrosses to the wild parent. This constitutes a possible route through which crop (trans)genes could become established in natural populations. The likelihood of introgression of transgenes will not only be determined by fitness effects from the transgene itself but also by the crop genes linked to it. Although lettuce is generally regarded as self-pollinating, outbreeding does occur at a low frequency. Backcrossing to wild lettuce is a likely pathway to introgression along with selfing, due to the high frequency of wild individuals relative to the rarely occurring crop-wild hybrids. To test the effect of backcrossing on the vigour of inter-specific hybrids, Lactuca serriola, the closest wild relative of cultivated lettuce, was crossed with L. sativa and the F1 hybrid was backcrossed to L. serriola to generate BC1 and BC2 populations. Experiments were conducted on progeny from selfed plants of the backcrossing families (BC1S1 and BC2S1). Plant vigour of these two backcrossing populations was determined in the greenhouse under non-stress and abiotic stress conditions (salinity, drought, and nutrient deficiency). Results Despite the decreasing contribution of crop genomic blocks in the backcross populations, the BC1S1 and BC2S1 hybrids were characterized by a substantial genetic variation under both non-stress and stress conditions. Hybrids were identified that performed equally or better than the wild genotypes, indicating that two backcrossing events did not eliminate the effect of the crop genomic segments that contributed to the vigour of the BC1 and BC2 hybrids. QTLs for plant vigour under non-stress and the various stress conditions were detected in the two populations with positive as well as negative effects from the crop. Conclusion As it was shown that the crop contributed QTLs with either a positive or a negative effect on plant vigour, we hypothesize that genomic regions exist where transgenes could preferentially be located in order to mitigate their persistence in natural populations through genetic hitchhiking. PMID:22448748

  1. Crop to wild introgression in lettuce: following the fate of crop genome segments in backcross populations.

    PubMed

    Uwimana, Brigitte; Smulders, Marinus J M; Hooftman, Danny A P; Hartman, Yorike; van Tienderen, Peter H; Jansen, Johannes; McHale, Leah K; Michelmore, Richard W; Visser, Richard G F; van de Wiel, Clemens C M

    2012-03-26

    After crop-wild hybridization, some of the crop genomic segments may become established in wild populations through selfing of the hybrids or through backcrosses to the wild parent. This constitutes a possible route through which crop (trans)genes could become established in natural populations. The likelihood of introgression of transgenes will not only be determined by fitness effects from the transgene itself but also by the crop genes linked to it. Although lettuce is generally regarded as self-pollinating, outbreeding does occur at a low frequency. Backcrossing to wild lettuce is a likely pathway to introgression along with selfing, due to the high frequency of wild individuals relative to the rarely occurring crop-wild hybrids. To test the effect of backcrossing on the vigour of inter-specific hybrids, Lactuca serriola, the closest wild relative of cultivated lettuce, was crossed with L. sativa and the F(1) hybrid was backcrossed to L. serriola to generate BC(1) and BC(2) populations. Experiments were conducted on progeny from selfed plants of the backcrossing families (BC(1)S(1) and BC(2)S(1)). Plant vigour of these two backcrossing populations was determined in the greenhouse under non-stress and abiotic stress conditions (salinity, drought, and nutrient deficiency). Despite the decreasing contribution of crop genomic blocks in the backcross populations, the BC(1)S(1) and BC(2)S(1) hybrids were characterized by a substantial genetic variation under both non-stress and stress conditions. Hybrids were identified that performed equally or better than the wild genotypes, indicating that two backcrossing events did not eliminate the effect of the crop genomic segments that contributed to the vigour of the BC(1) and BC(2) hybrids. QTLs for plant vigour under non-stress and the various stress conditions were detected in the two populations with positive as well as negative effects from the crop. As it was shown that the crop contributed QTLs with either a positive or a negative effect on plant vigour, we hypothesize that genomic regions exist where transgenes could preferentially be located in order to mitigate their persistence in natural populations through genetic hitchhiking.

  2. Immunohistochemical localization of beta-amyloid precursor protein sequences in Alzheimer and normal brain tissue by light and electron microscopy.

    PubMed

    McGeer, P L; Akiyama, H; Kawamata, T; Yamada, T; Walker, D G; Ishii, T

    1992-03-01

    Immunohistochemical staining with antibodies directed against four segments of the amyloid precursor protein (APP) was studied by light and electron microscopy in normal and Alzheimer (AD) brain tissue. The segments according to the Kang et al. sequence were: 18-38 (T97); 527-540 (R36); 597-620 (1-24 of beta-amyloid protein [BAP], R17); and 681-695 (R37) (Kang et al. [1987]: Nature 325:733-736). The antibodies recognized full length APP in Western blots of extracts of APP transfected cells. They stained cytoplasmic granules in some pyramidal neurons in normal appearing tissue from control and AD cases. In AD affected tissue, the antibodies to amino terminal sections of APP stained tangled neurons and neuropil threads, and intensely stained dystrophic neurites in senile plaques. By electron microscopy, this staining was localized to abnormal filaments. The antibody to the carboxy terminal segment failed to stain neurofibrillary tangles or neuropil threads; it did stain some neurites with globular swellings. It also stained globular and elongated deposits in senile plaque areas. The antibody against the BAP intensely stained extracellular material in senile plaques and diffuse deposits. By electron microscopy, the antibodies all stained intramicroglial deposits. Some of the extracellular and intracellular BAP-positive deposits were fibrillary. Communication between intramicroglial and extracellular fibrils was detected in plaque areas. These data suggest the following sequence of events. APP is normally concentrated in intraneuronal granules. In AD, it accumulates in damaged neuronal fibers. The amino terminal portion binds to abnormal neurofilaments. Major fragments of APP are phagocytosed and processed by microglia with the BAP portion being preserved. The preserved BAP is then extruded and accumulates in extracellular tissue.

  3. The complete genomic sequence of egg drop syndrome virus strain AAV-2.

    PubMed

    Jin, Q; Zeng, L; Yang, F; Li, M; Hou, Y

    1999-12-01

    In the search for the genome of egg drop syndrome virus (EDSV-76) Chinese strain AAV-2, part of restriction endonuclease physical map is analyzed, the complete genomic library is organized. On basis of this, the complete genome nucleotide sequences (32 838 bp in length, including terminal structures) are determined. The data analysis shows: compared with the other Adenoviruses, strain AAV-2 has more disparity on genomic structure and the distribution of open reading frame (ORF). There are no clear E1, E3 and E4 regions in AAV-2 genome. Two segments located at both ends of genome (1.1 kb and 8.3 kb in length respectively) have no homology with the other adenovirus genomes. In addition, strain AAV-2 genome lacks ORFs encoding ElA, pV and pIX, which are common ORFs encoding early, lately proteins in Adenovirus. This reveals differences between EDSA-76, the sole standard strain of group III Avian Adenoviruses, and the other Avian Adenoviruses for the first time. It will help the search for Avian Adenovirus and will also help the search of all Adenoviruses.

  4. Inverse Symmetry in Complete Genomes and Whole-Genome Inverse Duplication

    PubMed Central

    Kong, Sing-Guan; Fan, Wen-Lang; Chen, Hong-Da; Hsu, Zi-Ting; Zhou, Nengji; Zheng, Bo; Lee, Hoong-Chien

    2009-01-01

    The cause of symmetry is usually subtle, and its study often leads to a deeper understanding of the bearer of the symmetry. To gain insight into the dynamics driving the growth and evolution of genomes, we conducted a comprehensive study of textual symmetries in 786 complete chromosomes. We focused on symmetry based on our belief that, in spite of their extreme diversity, genomes must share common dynamical principles and mechanisms that drive their growth and evolution, and that the most robust footprints of such dynamics are symmetry related. We found that while complement and reverse symmetries are essentially absent in genomic sequences, inverse–complement plus reverse–symmetry is prevalent in complex patterns in most chromosomes, a vast majority of which have near maximum global inverse symmetry. We also discovered relations that can quantitatively account for the long observed but unexplained phenomenon of -mer skews in genomes. Our results suggest segmental and whole-genome inverse duplications are important mechanisms in genome growth and evolution, probably because they are efficient means by which the genome can exploit its double-stranded structure to enrich its code-inventory. PMID:19898631

  5. Genetic Structure of Avian Influenza Viruses from Ducks of the Atlantic Flyway of North America

    PubMed Central

    Huang, Yanyan; Wille, Michelle; Dobbin, Ashley; Walzthöni, Natasha M.; Robertson, Gregory J.; Ojkic, Davor; Whitney, Hugh; Lang, Andrew S.

    2014-01-01

    Wild birds, including waterfowl such as ducks, are reservoir hosts of influenza A viruses. Despite the increased number of avian influenza virus (AIV) genome sequences available, our understanding of AIV genetic structure and transmission through space and time in waterfowl in North America is still limited. In particular, AIVs in ducks of the Atlantic flyway of North America have not been thoroughly investigated. To begin to address this gap, we analyzed 109 AIV genome sequences from ducks in the Atlantic flyway to determine their genetic structure and to document the extent of gene flow in the context of sequences from other locations and other avian and mammalian host groups. The analyses included 25 AIVs from ducks from Newfoundland, Canada, from 2008–2011 and 84 available reference duck AIVs from the Atlantic flyway from 2006–2011. A vast diversity of viral genes and genomes was identified in the 109 viruses. The genetic structure differed amongst the 8 viral segments with predominant single lineages found for the PB2, PB1 and M segments, increased diversity found for the PA, NP and NS segments (2, 3 and 3 lineages, respectively), and the highest diversity found for the HA and NA segments (12 and 9 lineages, respectively). Identification of inter-hemispheric transmissions was rare with only 2% of the genes of Eurasian origin. Virus transmission between ducks and other bird groups was investigated, with 57.3% of the genes having highly similar (≥99% nucleotide identity) genes detected in birds other than ducks. Transmission between North American flyways has been frequent and 75.8% of the genes were highly similar to genes found in other North American flyways. However, the duck AIV genes did display spatial distribution bias, which was demonstrated by the different population sizes of specific viral genes in one or two neighbouring flyways compared to more distant flyways. PMID:24498009

  6. Genomic signal analysis of pathogen variability

    NASA Astrophysics Data System (ADS)

    Cristea, Paul Dan

    2006-02-01

    The paper presents results in the study of pathogen variability by using genomic signals. The conversion of symbolic nucleotide sequences into digital signals offers the possibility to apply signal processing methods to the analysis of genomic data. The method is particularly well suited to characterize small size genomic sequences, such as those found in viruses and bacteria, being a promising tool in tracking the variability of pathogens, especially in the context of developing drug resistance. The paper is based on data downloaded from GenBank [32], and comprises results on the variability of the eight segments of the influenza type A, subtype H5N1, virus genome, and of the Hemagglutinin (HA) gene, for the H1, H2, H3, H4, H5 and H16 types. Data from human and avian virus isolates are used.

  7. The Genome Sequence of Taurine Cattle: A window to ruminant biology and evolution

    PubMed Central

    Elsik, Christine G.; Tellam, Ross L.; Worley, Kim C.

    2010-01-01

    To understand the biology and evolution of ruminants, the cattle genome was sequenced to ∼7× coverage. The cattle genome contains a minimum of 22,000 genes, with a core set of 14,345 orthologs shared among seven mammalian species of which 1,217 are absent or undetected in non-eutherian (marsupial or monotreme) genomes. Cattle-specific evolutionary breakpoint regions in chromosomes have a higher density of segmental duplications, enrichment of repetitive elements, and species-specific variations in genes associated with lactation and immune responsiveness. Genes involved in metabolism are generally highly conserved, although five metabolic genes are deleted or extensively diverged from their human orthologs. The cattle genome sequence thus provides an enabling resource for understanding mammalian evolution and accelerating livestock genetic improvement for milk and meat production. PMID:19390049

  8. Genome-wide identification of WRKY transcription factors in kiwifruit (Actinidia spp.) and analysis of WRKY expression in responses to biotic and abiotic stresses.

    PubMed

    Jing, Zhaobin; Liu, Zhande

    2018-04-01

    As one of the largest transcriptional factor families in plants, WRKY transcription factors play important roles in various biotic and abiotic stress responses. To date, WRKY genes in kiwifruit (Actinidia spp.) remain poorly understood. In our study, o total of 97 AcWRKY genes have been identified in the kiwifruit genome. An overview of these AcWRKY genes is analyzed, including the phylogenetic relationships, exon-intron structures, synteny and expression profiles. The 97 AcWRKY genes were divided into three groups based on the conserved WRKY domain. Synteny analysis indicated that segmental duplication events contributed to the expansion of the kiwifruit AcWRKY family. In addition, the synteny analysis between kiwifruit and Arabidopsis suggested that some of the AcWRKY genes were derived from common ancestors before the divergence of these two species. Conserved motifs outside the AcWRKY domain may reflect their functional conservation. Genome-wide segmental and tandem duplication were found, which may contribute to the expansion of AcWRKY genes. Furthermore, the analysis of selected AcWRKY genes showed a variety of expression patterns in five different organs as well as during biotic and abiotic stresses. The genome-wide identification and characterization of kiwifruit WRKY transcription factors provides insight into the evolutionary history and is a useful resource for further functional analyses of kiwifruit.

  9. RNA Encapsidation and Packaging in the Phleboviruses

    PubMed Central

    Hornak, Katherine E.; Lanchy, Jean-Marc; Lodmell, J. Stephen

    2016-01-01

    The Bunyaviridae represents the largest family of segmented RNA viruses, which infect a staggering diversity of plants, animals, and insects. Within the family Bunyaviridae, the Phlebovirus genus includes several important human and animal pathogens, including Rift Valley fever virus (RVFV), severe fever with thrombocytopenia syndrome virus (SFTSV), Uukuniemi virus (UUKV), and the sandfly fever viruses. The phleboviruses have small tripartite RNA genomes that encode a repertoire of 5–7 proteins. These few proteins accomplish the daunting task of recognizing and specifically packaging a tri-segment complement of viral genomic RNA in the midst of an abundance of host components. The critical nucleation events that eventually lead to virion production begin early on in the host cytoplasm as the first strands of nascent viral RNA (vRNA) are synthesized. The interaction between the vRNA and the viral nucleocapsid (N) protein effectively protects and masks the RNA from the host, and also forms the ribonucleoprotein (RNP) architecture that mediates downstream interactions and drives virion formation. Although the mechanism by which all three genomic counterparts are selectively co-packaged is not completely understood, we are beginning to understand the hierarchy of interactions that begins with N-RNA packaging and culminates in RNP packaging into new virus particles. In this review we focus on recent progress that highlights the molecular basis of RNA genome packaging in the phleboviruses. PMID:27428993

  10. Segmental allotetraploidy and allelic interactions in buffelgrass (Pennisetum ciliare (L.) Link syn. Cenchrus ciliaris L.) as revealed by genome mapping.

    PubMed

    Jessup, R W; Burson, B L; Burow, O; Wang, Y W; Chang, C; Li, Z; Paterson, A H; Hussey, M A

    2003-04-01

    Linkage analyses increasingly complement cytological and traditional plant breeding techniques by providing valuable information regarding genome organization and transmission genetics of complex polyploid species. This study reports a genome map of buffelgrass (Pennisetum ciliare (L.) Link syn. Cenchrus ciliaris L.). Maternal and paternal maps were constructed with restriction fragment length polymorphisms (RFLPs) segregating in 87 F1 progeny from an intraspecific cross between two heterozygous genotypes. A survey of 862 heterologous cDNAs and gDNAs from across the Poaceae, as well as 443 buffelgrass cDNAs, yielded 100 and 360 polymorphic probes, respectively. The maternal map included 322 RFLPs, 47 linkage groups, and 3464 cM, whereas the paternal map contained 245 RFLPs, 42 linkage groups, and 2757 cM. Approximately 70 to 80% of the buffelgrass genome was covered, and the average marker spacing was 10.8 and 11.3 cM on the respective maps. Preferential pairing was indicated between many linkage groups, which supports cytological reports that buffelgrass is a segmental allotetraploid. More preferential pairing (disomy) was found in the maternal than paternal parent across linkage groups (55 vs. 38%) and loci (48 vs. 15%). Comparison of interval lengths in 15 allelic bridges indicated significantly less meiotic recombination in paternal gametes. Allelic interactions were detected in four regions of the maternal map and were absent in the paternal map.

  11. Automatic pelvis segmentation from x-ray images of a mouse model

    NASA Astrophysics Data System (ADS)

    Al Okashi, Omar M.; Du, Hongbo; Al-Assam, Hisham

    2017-05-01

    The automatic detection and quantification of skeletal structures has a variety of different applications for biological research. Accurate segmentation of the pelvis from X-ray images of mice in a high-throughput project such as the Mouse Genomes Project not only saves time and cost but also helps achieving an unbiased quantitative analysis within the phenotyping pipeline. This paper proposes an automatic solution for pelvis segmentation based on structural and orientation properties of the pelvis in X-ray images. The solution consists of three stages including pre-processing image to extract pelvis area, initial pelvis mask preparation and final pelvis segmentation. Experimental results on a set of 100 X-ray images showed consistent performance of the algorithm. The automated solution overcomes the weaknesses of a manual annotation procedure where intra- and inter-observer variations cannot be avoided.

  12. The aromatic amino acid hydroxylase genes AAH1 and AAH2 in Toxoplasma gondii contribute to transmission in the cat

    USDA-ARS?s Scientific Manuscript database

    The Toxoplasma gondii genome contains two aromatic amino acid hydroxylase genes, AAH1 and AAH2, which encode proteins that produce L-DOPA, which can serve as a precursor of catecholamine neurotransmitters. It has been suggested that this pathway elevates host dopamine levels thus making infected rod...

  13. Long-Term Acoustic Real-Time Sensor for Polar Areas (LARA)

    DTIC Science & Technology

    2015-09-30

    segment in the northeast Pacific Ocean. Both areas have seafloor volcanic eruptions forecast for the near future, and the LARA moorings will allow us...time monitoring of deep-ocean seismic and volcanic activity (e.g., Dziak et al., 2012) - especially in areas where SOSUS coverage no longer exists...precursors and magma ascent before the April 2011 eruption at Axial Seamount. Nature Geoscience, 5, pp. 478-482. Klatt, O., Boebel, O., and Fahrbach, E

  14. An Inhibitory Motif on the 5’UTR of Several Rotavirus Genome Segments Affects Protein Expression and Reverse Genetics Strategies

    PubMed Central

    Papa, Guido; Eichwald, Catherine; Burrone, Oscar R.

    2016-01-01

    Rotavirus genome consists of eleven segments of dsRNA, each encoding one single protein. Viral mRNAs contain an open reading frame (ORF) flanked by relatively short untranslated regions (UTRs), whose role in the viral cycle remains elusive. Here we investigated the role of 5’UTRs in T7 polymerase-driven cDNAs expression in uninfected cells. The 5’UTRs of eight genome segments (gs3, gs5-6, gs7-11) of the simian SA11 strain showed a strong inhibitory effect on the expression of viral proteins. Decreased protein expression was due to both compromised transcription and translation and was independent of the ORF and the 3’UTR sequences. Analysis of several mutants of the 21-nucleotide long 5’UTR of gs 11 defined an inhibitory motif (IM) represented by its primary sequence rather than its secondary structure. IM was mapped to the 5’ terminal 6-nucleotide long pyrimidine-rich tract 5’-GGY(U/A)UY-3’. The 5’ terminal position within the mRNA was shown to be essentially required, as inhibitory activity was lost when IM was moved to an internal position. We identified two mutations (insertion of a G upstream the 5’UTR and the U to A mutation of the fifth nucleotide of IM) that render IM non-functional and increase the transcription and translation rate to levels that could considerably improve the efficiency of virus helper-free reverse genetics strategies. PMID:27846320

  15. The most conserved genome segments for life detection on Earth and other planets.

    PubMed

    Isenbarger, Thomas A; Carr, Christopher E; Johnson, Sarah Stewart; Finney, Michael; Church, George M; Gilbert, Walter; Zuber, Maria T; Ruvkun, Gary

    2008-12-01

    On Earth, very simple but powerful methods to detect and classify broad taxa of life by the polymerase chain reaction (PCR) are now standard practice. Using DNA primers corresponding to the 16S ribosomal RNA gene, one can survey a sample from any environment for its microbial inhabitants. Due to massive meteoritic exchange between Earth and Mars (as well as other planets), a reasonable case can be made for life on Mars or other planets to be related to life on Earth. In this case, the supremely sensitive technologies used to study life on Earth, including in extreme environments, can be applied to the search for life on other planets. Though the 16S gene has become the standard for life detection on Earth, no genome comparisons have established that the ribosomal genes are, in fact, the most conserved DNA segments across the kingdoms of life. We present here a computational comparison of full genomes from 13 diverse organisms from the Archaea, Bacteria, and Eucarya to identify genetic sequences conserved across the widest divisions of life. Our results identify the 16S and 23S ribosomal RNA genes as well as other universally conserved nucleotide sequences in genes encoding particular classes of transfer RNAs and within the nucleotide binding domains of ABC transporters as the most conserved DNA sequence segments across phylogeny. This set of sequences defines a core set of DNA regions that have changed the least over billions of years of evolution and provides a means to identify and classify divergent life, including ancestrally related life on other planets.

  16. Region VI of cauliflower mosaic virus encodes a host range determinant.

    PubMed Central

    Schoelz, J; Shepherd, R J; Daubert, S

    1986-01-01

    A domain of cauliflower mosaic virus (CaMV) which controls systemic spread in two solanaceous hosts (Datura stramonium and Nicotiana bigelovii) was mapped to the first half of open reading frame 6. Whereas ordinary strains of CaMV are unable to infect solanaceous species except to replicate locally in inoculated leaves, a new CaMV strain (D4) induces chlorotic local lesions and systemically infects both D. stramonium and N. bigelovii. To determine which portion of the CaMV genome controls systemic spread of the virus in solanaceous hosts, nine recombinant genomes constructed between D4 and two ordinary strains of the virus were tested for their ability to infect solanaceous hosts. A 496-base-pair DNA segment comprising the first half of open reading frame 6 specified the type of local lesions and systemic spread of the virus in solanaceous hosts. Exchange of this segment of the genome between strains of CaMV converted a compatible host reaction to an incompatible (hypersensitive) one in response to infection. This suggests that the gene VI protein interacts with the plant to suppress hypersensitivity, the normal response of solanaceous hosts to CaMV infection. Images PMID:3785205

  17. Spliced DNA Sequences in the Paramecium Germline: Their Properties and Evolutionary Potential

    PubMed Central

    Catania, Francesco; McGrath, Casey L.; Doak, Thomas G.; Lynch, Michael

    2013-01-01

    Despite playing a crucial role in germline-soma differentiation, the evolutionary significance of developmentally regulated genome rearrangements (DRGRs) has received scant attention. An example of DRGR is DNA splicing, a process that removes segments of DNA interrupting genic and/or intergenic sequences. Perhaps, best known for shaping immune-system genes in vertebrates, DNA splicing plays a central role in the life of ciliated protozoa, where thousands of germline DNA segments are eliminated after sexual reproduction to regenerate a functional somatic genome. Here, we identify and chronicle the properties of 5,286 sequences that putatively undergo DNA splicing (i.e., internal eliminated sequences [IESs]) across the genomes of three closely related species of the ciliate Paramecium (P. tetraurelia, P. biaurelia, and P. sexaurelia). The study reveals that these putative IESs share several physical characteristics. Although our results are consistent with excision events being largely conserved between species, episodes of differential IES retention/excision occur, may have a recent origin, and frequently involve coding regions. Our findings indicate interconversion between somatic—often coding—DNA sequences and noncoding IESs, and provide insights into the role of DNA splicing in creating potentially functional genetic innovation. PMID:23737328

  18. Genetic and Phylogenetic Characterization of Tataguine and Witwatersrand Viruses and Other Orthobunyaviruses of the Anopheles A, Capim, Guamá, Koongol, Mapputta, Tete, and Turlock Serogroups

    PubMed Central

    Shchetinin, Alexey M.; Lvov, Dmitry K.; Deriabin, Petr G.; Botikov, Andrey G.; Gitelman, Asya K.; Kuhn, Jens H.; Alkhovsky, Sergey V.

    2015-01-01

    The family Bunyaviridae has more than 530 members that are distributed among five genera or remain to be classified. The genus Orthobunyavirus is the most diverse bunyaviral genus with more than 220 viruses that have been assigned to more than 18 serogroups based on serological cross-reactions and limited molecular-biological characterization. Sequence information for all three orthobunyaviral genome segments is only available for viruses belonging to the Bunyamwera, Bwamba/Pongola, California encephalitis, Gamboa, Group C, Mapputta, Nyando, and Simbu serogroups. Here we present coding-complete sequences for all three genome segments of 15 orthobunyaviruses belonging to the Anopheles A, Capim, Guamá, Kongool, Tete, and Turlock serogroups, and of two unclassified bunyaviruses previously not known to be orthobunyaviruses (Tataguine and Witwatersrand viruses). Using those sequence data, we established the most comprehensive phylogeny of the Orthobunyavirus genus to date, now covering 15 serogroups. Our results emphasize the high genetic diversity of orthobunyaviruses and reveal that the presence of the small nonstructural protein (NSs)-encoding open reading frame is not as common in orthobunyavirus genomes as previously thought. PMID:26610546

  19. Near Full-Length Identification of a Novel HIV-1 CRF01_AE/B/C Recombinant in Northern Myanmar.

    PubMed

    Zhou, Yan-Heng; Chen, Xin; Liang, Yue-Bo; Pang, Wei; Qin, Wei-Hong; Zhang, Chiyu; Zheng, Yong-Tang

    2015-08-01

    The Myanmar-China border appears to be the "hot spot" region for the occurrence of HIV-1 recombination. The majority of the previous analyses of HIV-1 recombination were based on partial genomic sequences, which obviously cannot reflect the reality of the genetic diversity of HIV-1 in this area well. Here, we present a near full-length characterization of a novel HIV-1 CRF01_AE/B/C recombinant isolated from a long-distance truck driver in Northern Myanmar. It is the first description of a near full-length genomic sequence in Myanmar since 2003, and might be one of the most complicated HIV-1 chimeras ever detected in Myanmar, containing four CRF01_AE, six B segments, and five C segments separated by 14 breakpoints throughout its genome. The discovery and characterization of this new CRF01_AE/B/C recombinant indicate that intersubtype recombination is ongoing in Myanmar, continuously generating new forms of HIV-1. More work based on near full-length sequence analyses is urgently needed to better understand the genetic diversity of HIV-1 in these regions.

  20. Recapitulating X-Linked Juvenile Retinoschisis in Mouse Model by Knock-In Patient-Specific Novel Mutation.

    PubMed

    Chen, Ding; Xu, Tao; Tu, Mengjun; Xu, Jinlin; Zhou, Chenchen; Cheng, Lulu; Yang, Ruqing; Yang, Tanchu; Zheng, Weiwei; He, Xiubin; Deng, Ruzhi; Ge, Xianglian; Li, Jin; Song, Zongming; Zhao, Junzhao; Gu, Feng

    2017-01-01

    X-linked juvenile retinoschisis (XLRS) is a retinal disease caused by mutations in the gene encoding retinoschisin (RS1), which leads to a significant proportion of visual impairment and blindness. To develop personalized genome editing based gene therapy, knock-in animal disease models that have the exact mutation identified in the patients is extremely crucial, and that the way which genome editing in knock-in animals could be easily transferred to the patients. Here we recruited a family diagnosed with XLRS and identified the causative mutation ( RS1 , p.Y65X), then a knock-in mouse model harboring this disease-causative mutation was generated via TALEN (transcription activator-like effector nucleases). We found that the b-wave amplitude of the ERG of the RS1 -KI mice was significantly decreased. Moreover, we observed that the structure of retina in RS1 -KI mice has become disordered, including the disarray of inner nuclear layer and outer nuclear layer, chaos of outer plexiform layer, decreased inner segments of photoreceptor and the loss of outer segments. The novel knock-in mice ( RS1 -KI) harboring patient-specific mutation will be valuable for development of treatment via genome editing mediated gene correction.

  1. Ancestral synteny shared between distantly-related plant species from the asterid (Coffea canephora and Solanum Sp.) and rosid (Vitis vinifera) clades

    PubMed Central

    2012-01-01

    Background Coffee trees (Rubiaceae) and tomato (Solanaceae) belong to the Asterid clade, while grapevine (Vitaceae) belongs to the Rosid clade. Coffee and tomato separated from grapevine 125 million years ago, while coffee and tomato diverged 83-89 million years ago. These long periods of divergent evolution should have permitted the genomes to reorganize significantly. So far, very few comparative mappings have been performed between very distantly related species belonging to different clades. We report the first multiple comparison between species from Asterid and Rosid clades, to examine both macro-and microsynteny relationships. Results Thanks to a set of 867 COSII markers, macrosynteny was detected between coffee, tomato and grapevine. While coffee and tomato genomes share 318 orthologous markers and 27 conserved syntenic segments (CSSs), coffee and grapevine also share a similar number of syntenic markers and CSSs: 299 and 29 respectively. Despite large genome macrostructure reorganization, several large chromosome segments showed outstanding macrosynteny shedding new insights into chromosome evolution between Asterids and Rosids. We also analyzed a sequence of 174 kb containing the ovate gene, conserved in a syntenic block between coffee, tomato and grapevine that showed a high-level of microstructure conservation. A higher level of conservation was observed between coffee and grapevine, both woody and long life-cycle plants, than between coffee and tomato. Out of 16 coffee genes of this syntenic segment, 7 and 14 showed complete synteny between coffee and tomato or grapevine, respectively. Conclusions These results show that significant conservation is found between distantly related species from the Asterid (Coffea canephora and Solanum sp.) and Rosid (Vitis vinifera) clades, at the genome macrostructure and microstructure levels. At the ovate locus, conservation did not decline in relation to increasing phylogenetic distance, suggesting that the time factor alone does not explain divergences. Our results are considerably useful for syntenic studies between supposedly remote species for the isolation of important genes for agronomy. PMID:22433423

  2. Evolutionary dynamics of human rotaviruses: balancing reassortment with preferred genome constellations.

    PubMed

    McDonald, Sarah M; Matthijnssens, Jelle; McAllen, John K; Hine, Erin; Overton, Larry; Wang, Shiliang; Lemey, Philippe; Zeller, Mark; Van Ranst, Marc; Spiro, David J; Patton, John T

    2009-10-01

    Group A human rotaviruses (RVs) are a major cause of severe gastroenteritis in infants and young children. Yet, aside from the genes encoding serotype antigens (VP7; G-type and VP4; P-type), little is known about the genetic make-up of emerging and endemic human RV strains. To gain insight into the diversity and evolution of RVs circulating at a single location over a period of time, we sequenced the eleven-segmented, double-stranded RNA genomes of fifty-one G3P[8] strains collected from 1974 to 1991 at Children's Hospital National Medical Center, Washington, D. C. During this period, G1P[8] strains typically dominated, comprising on average 56% of RV infections each year in hospitalized children. A notable exception was in the 1976 and 1991 winter seasons when the incidence of G1P[8] infections decreased dramatically, a trend that correlated with a significant increase in G3P[8] infections. Our sequence analysis indicates that the 1976 season was characterized by the presence of several genetically distinct, co-circulating clades of G3P[8] viruses, which contained minor but significant differences in their encoded proteins. These 1976 lineages did not readily exchange gene segments with each other, but instead remained stable over the course of the season. In contrast, the 1991 season contained a single major clade, whose genome constellation was similar to one of the 1976 clades. The 1991 clade may have gained a fitness advantage after reassorting with as of yet unidentified RV strain(s). This study reveals for the first time that genetically distinct RV clades of the same G/P-type can co-circulate and cause disease. The findings from this study also suggest that, although gene segment exchange occurs, most reassortant strains are replaced over time by lineages with preferred genome constellations. Elucidation of the selective pressures that favor maintenance of RVs with certain sets of genes may be necessary to anticipate future vaccine needs.

  3. Evolutionary Dynamics of Human Rotaviruses: Balancing Reassortment with Preferred Genome Constellations

    PubMed Central

    McDonald, Sarah M.; Matthijnssens, Jelle; McAllen, John K.; Hine, Erin; Overton, Larry; Wang, Shiliang; Lemey, Philippe; Zeller, Mark; Van Ranst, Marc; Spiro, David J.; Patton, John T.

    2009-01-01

    Group A human rotaviruses (RVs) are a major cause of severe gastroenteritis in infants and young children. Yet, aside from the genes encoding serotype antigens (VP7; G-type and VP4; P-type), little is known about the genetic make-up of emerging and endemic human RV strains. To gain insight into the diversity and evolution of RVs circulating at a single location over a period of time, we sequenced the eleven-segmented, double-stranded RNA genomes of fifty-one G3P[8] strains collected from 1974 to 1991 at Children's Hospital National Medical Center, Washington, D. C. During this period, G1P[8] strains typically dominated, comprising on average 56% of RV infections each year in hospitalized children. A notable exception was in the 1976 and 1991 winter seasons when the incidence of G1P[8] infections decreased dramatically, a trend that correlated with a significant increase in G3P[8] infections. Our sequence analysis indicates that the 1976 season was characterized by the presence of several genetically distinct, co-circulating clades of G3P[8] viruses, which contained minor but significant differences in their encoded proteins. These 1976 lineages did not readily exchange gene segments with each other, but instead remained stable over the course of the season. In contrast, the 1991 season contained a single major clade, whose genome constellation was similar to one of the 1976 clades. The 1991 clade may have gained a fitness advantage after reassorting with as of yet unidentified RV strain(s). This study reveals for the first time that genetically distinct RV clades of the same G/P-type can co-circulate and cause disease. The findings from this study also suggest that, although gene segment exchange occurs, most reassortant strains are replaced over time by lineages with preferred genome constellations. Elucidation of the selective pressures that favor maintenance of RVs with certain sets of genes may be necessary to anticipate future vaccine needs. PMID:19851457

  4. The primary structures of two yeast enolase genes. Homology between the 5' noncoding flanking regions of yeast enolase and glyceraldehyde-3-phosphate dehydrogenase genes.

    PubMed

    Holland, M J; Holland, J P; Thill, G P; Jackson, K A

    1981-02-10

    Segments of yeast genomic DNA containing two enolase structural genes have been isolated by subculture cloning procedures using a cDNA hybridization probe synthesized from purified yeast enolase mRNA. Based on restriction endonuclease and transcriptional maps of these two segments of yeast DNA, each hybrid plasmid contains a region of extensive nucleotide sequence homology which forms hybrids with the cDNA probe. The DNA sequences which flank this homologous region in the two hybrid plasmids are nonhomologous indicating that these sequences are nontandemly repeated in the yeast genome. The complete nucleotide sequence of the coding as well as the flanking noncoding regions of these genes has been determined. The amino acid sequence predicted from one reading frame of both structural genes is extremely similar to that determined for yeast enolase (Chin, C. C. Q., Brewer, J. M., Eckard, E., and Wold, F. (1981) J. Biol. Chem. 256, 1370-1376), confirming that these isolated structural genes encode yeast enolase. The nucleotide sequences of the coding regions of the genes are approximately 95% homologous, and neither gene contains an intervening sequence. Codon utilization in the enolase genes follows the same biased pattern previously described for two yeast glyceraldehyde-3-phosphate dehydrogenase structural genes (Holland, J. P., and Holland, M. J. (1980) J. Biol. Chem. 255, 2596-2605). DNA blotting analysis confirmed that the isolated segments of yeast DNA are colinear with yeast genomic DNA and that there are two nontandemly repeated enolase genes per haploid yeast genome. The noncoding portions of the two enolase genes adjacent to the initiation and termination codons are approximately 70% homologous and contain sequences thought to be involved in the synthesis and processing messenger RNA. Finally there are regions of extensive homology between the two enolase structural genes and two yeast glyceraldehyde-3-phosphate dehydrogenase structural genes within the 5- noncoding portions of these glycolytic genes.

  5. Proviruses with Long-Term Stable Expression Accumulate in Transcriptionally Active Chromatin Close to the Gene Regulatory Elements: Comparison of ASLV-, HIV- and MLV-Derived Vectors

    PubMed Central

    Miklík, Dalibor; Šenigl, Filip; Hejnar, Jiří

    2018-01-01

    Individual groups of retroviruses and retroviral vectors differ in their integration site preference and interaction with the host genome. Hence, immediately after infection genome-wide distribution of integrated proviruses is non-random. During long-term in vitro or persistent in vivo infection, the genomic position and chromatin environment of the provirus affects its transcriptional activity. Thus, a selection of long-term stably expressed proviruses and elimination of proviruses, which have been gradually silenced by epigenetic mechanisms, helps in the identification of genomic compartments permissive for proviral transcription. We compare here the extent and time course of provirus silencing in single cell clones of the K562 human myeloid lymphoblastoma cell line that have been infected with retroviral reporter vectors derived from avian sarcoma/leukosis virus (ASLV), human immunodeficiency virus type 1 (HIV) and murine leukaemia virus (MLV). While MLV proviruses remain transcriptionally active, ASLV proviruses are prone to rapid silencing. The HIV provirus displays gradual silencing only after an extended time period in culture. The analysis of integration sites of long-term stably expressed proviruses shows a strong bias for some genomic features—especially integration close to the transcription start sites of active transcription units. Furthermore, complex analysis of histone modifications enriched at the site of integration points to the accumulation of proviruses of all three groups in gene regulatory segments, particularly close to the enhancer loci. We conclude that the proximity to active regulatory chromatin segments correlates with stable provirus expression in various retroviral species. PMID:29517993

  6. On the inversion-indel distance

    PubMed Central

    2013-01-01

    Background The inversion distance, that is the distance between two unichromosomal genomes with the same content allowing only inversions of DNA segments, can be computed thanks to a pioneering approach of Hannenhalli and Pevzner in 1995. In 2000, El-Mabrouk extended the inversion model to allow the comparison of unichromosomal genomes with unequal contents, thus insertions and deletions of DNA segments besides inversions. However, an exact algorithm was presented only for the case in which we have insertions alone and no deletion (or vice versa), while a heuristic was provided for the symmetric case, that allows both insertions and deletions and is called the inversion-indel distance. In 2005, Yancopoulos, Attie and Friedberg started a new branch of research by introducing the generic double cut and join (DCJ) operation, that can represent several genome rearrangements (including inversions). Among others, the DCJ model gave rise to two important results. First, it has been shown that the inversion distance can be computed in a simpler way with the help of the DCJ operation. Second, the DCJ operation originated the DCJ-indel distance, that allows the comparison of genomes with unequal contents, considering DCJ, insertions and deletions, and can be computed in linear time. Results In the present work we put these two results together to solve an open problem, showing that, when the graph that represents the relation between the two compared genomes has no bad components, the inversion-indel distance is equal to the DCJ-indel distance. We also give a lower and an upper bound for the inversion-indel distance in the presence of bad components. PMID:24564182

  7. Reovirus infections

    USDA-ARS?s Scientific Manuscript database

    Avian reoviruses (ARV) are widespread worldwide and may infect turkeys, chickens and other avian species, including domestic waterfowl and game birds. The virus is non-enveloped double-stranded RNA, therefore is environmentally stable and due to its segmented genome can generate variants easily. A...

  8. Adaptive evolution during the establishment of European avian-like H1N1 influenza A virus in swine.

    PubMed

    Joseph, Udayan; Vijaykrishna, Dhanasekaran; Smith, Gavin J D; Su, Yvonne C F

    2018-04-01

    An H1N1 subtype influenza A virus with all eight gene segments derived from wild birds (including mallards), ducks and chickens, caused severe disease outbreaks in swine populations in Europe beginning in 1979 and successfully adapted to form the European avian-like swine (EA-swine) influenza lineage. Genes of the EA-swine lineage that are clearly segregated from its closest avian relatives continue to circulate in swine populations globally and represent a unique opportunity to study the adaptive process of an avian-to-mammalian cross-species transmission. Here, we used a relaxed molecular clock model to test whether the EA-swine virus originated through the introduction of a single avian ancestor as an entire genome, followed by an analysis of host-specific selection pressures among different gene segments. Our data indicated independent introduction of gene segments via transmission of avian viruses into swine followed by reassortment events that occurred at least 1-4 years prior to the EA-swine outbreak. All EA-swine gene segments exhibit greater selection pressure than avian viruses, reflecting both adaptive pressures and relaxed selective constraints that are associated with host switching. Notably, we identified key amino acid mutations in the viral surface proteins (H1 and N1) that play a role in adaptation to new hosts. Following the establishment of EA-swine lineage, we observed an increased frequency of intrasubtype reassortment of segments compared to the earlier strains that has been associated with adaptive amino acid replacements, disease severity and vaccine escape. Taken together, our study provides key insights into the adaptive changes in viral genomes following the transmission of avian influenza viruses to swine and the early establishment of the EA-swine lineage.

  9. Initiation and Along-Axis Segmentation of Seaward-Dipping Volcanic Sequences Captured in Afar

    NASA Astrophysics Data System (ADS)

    Ebinger, C.; Wolfenden, E.; Yirgu, G.; Keir, D.

    2003-12-01

    The Afar triple junction zone provides a unique opportunity to examine the early development of magmatic margins, as respective limbs of the triple junction capture different stages of the breakup process. Initial rifting in the southernmost Red Sea occurred concurrent with, or soon after flood basaltic magmatism at ~31 Ma in the Ethiopia-Yemen plume province, whereas the northern part of the Main Ethiopian rift initiated after 12 Ma. Both rift systems initiated with the development of high-angle border fault systems bounding broad basins, but 8-10 My after rifting we see riftward migration of strain from the western border fault to narrow zones of increasingly more basaltic magmatism. These localised zones of faulting and volcanism (magmatic segments) show a segmentation independent of the border fault segmentation. The much older, more evolved magmatic segments in the southern Red Sea, where not onlapped by Pliocene-Recent sedimentary strata, dip steeply riftward and define a regional eastward flexure into transitional oceanic crust, as indicated by gravity models constrained by seismic refraction and receiver function data. The southern Red Sea magmatic segments have been abandoned in Pliocene-Recent triple junction reorganisations, whereas the process of seaward-dipping volcanic sequence emplacement is ongoing in the seismically and volcanically active Main Ethiopian rift. Field, remote sensing, gravity, and seismicity data from the Main Ethiopian and southern Red Sea rifts indicate that seaward-dipping volcanic sequences initiate in moderately stretched continental crust above a narrow zone of dike-intrusion. Our comparison of active and ancient magmatic segments show that they are the precursors to seaward-dipping volcanic sequences analogous to those seen on passive continental margins, and provides insights into the initiation of along-axis segmentation of seafloor-spreading centers.

  10. Introductions and Evolution of Human-Origin Seasonal Influenza A Viruses in Multinational Swine Populations

    PubMed Central

    Wentworth, David E.; Culhane, Marie R.; Vincent, Amy L.; Viboud, Cecile; LaPointe, Matthew P.; Lin, Xudong; Holmes, Edward C.; Detmer, Susan E.

    2014-01-01

    ABSTRACT The capacity of influenza A viruses to cross species barriers presents a continual threat to human and animal health. Knowledge of the human-swine interface is particularly important for understanding how viruses with pandemic potential evolve in swine hosts. We sequenced the genomes of 141 influenza viruses collected from North American swine during 2002 to 2011 and identified a swine virus that possessed all eight genome segments of human seasonal A/H3N2 virus origin. A molecular clock analysis indicates that this virus—A/sw/Saskatchewan/02903/2009(H3N2)—has likely circulated undetected in swine for at least 7 years. For historical context, we performed a comprehensive phylogenetic analysis of an additional 1,404 whole-genome sequences from swine influenza A viruses collected globally during 1931 to 2013. Human-to-swine transmission occurred frequently over this time period, with 20 discrete introductions of human seasonal influenza A viruses showing sustained onward transmission in swine for at least 1 year since 1965. Notably, human-origin hemagglutinin (H1 and H3) and neuraminidase (particularly N2) segments were detected in swine at a much higher rate than the six internal gene segments, suggesting an association between the acquisition of swine-origin internal genes via reassortment and the adaptation of human influenza viruses to new swine hosts. Further understanding of the fitness constraints on the adaptation of human viruses to swine, and vice versa, at a genomic level is central to understanding the complex multihost ecology of influenza and the disease threats that swine and humans pose to each other. IMPORTANCE The swine origin of the 2009 A/H1N1 pandemic virus underscored the importance of understanding how influenza A virus evolves in these animals hosts. While the importance of reassortment in generating genetically diverse influenza viruses in swine is well documented, the role of human-to-swine transmission has not been as intensively studied. Through a large-scale sequencing effort, we identified a novel influenza virus of wholly human origin that has been circulating undetected in swine for at least 7 years. In addition, we demonstrate that human-to-swine transmission has occurred frequently on a global scale over the past decades but that there is little persistence of human virus internal gene segments in swine. PMID:24965467

  11. Further delineation of nonhomologous-based recombination and evidence for subtelomeric segmental duplications in 1p36 rearrangements.

    PubMed

    D'Angelo, Carla S; Gajecka, Marzena; Kim, Chong A; Gentles, Andrew J; Glotzbach, Caron D; Shaffer, Lisa G; Koiffmann, Célia P

    2009-06-01

    The mechanisms involved in the formation of subtelomeric rearrangements are now beginning to be elucidated. Breakpoint sequencing analysis of 1p36 rearrangements has made important contributions to this line of inquiry. Despite the unique architecture of segmental duplications inherent to human subtelomeres, no common mechanism has been identified thus far and different nonexclusive recombination-repair mechanisms seem to predominate. In order to gain further insights into the mechanisms of chromosome breakage, repair, and stabilization mediating subtelomeric rearrangements in humans, we investigated the constitutional rearrangements of 1p36. Cloning of the breakpoint junctions in a complex rearrangement and three non-reciprocal translocations revealed similarities at the junctions, such as microhomology of up to three nucleotides, along with no significant sequence identity in close proximity to the breakpoint regions. All the breakpoints appeared to be unique and their occurrence was limited to non-repetitive, unique DNA sequences. Several recombination- or cleavage-associated motifs that may promote non-homologous recombination were observed in close proximity to the junctions. We conclude that NHEJ is likely the mechanism of DNA repair that generates these rearrangements. Additionally, two apparently pure terminal deletions were also investigated, and the refinement of the breakpoint regions identified two distinct genomic intervals ~25-kb apart, each containing a series of 1p36 specific segmental duplications with 90-98% identity. Segmental duplications can serve as substrates for ectopic homologous recombination or stimulate genomic rearrangements.

  12. Genome-wide identification of conserved intronic non-coding sequences using a Bayesian segmentation approach.

    PubMed

    Algama, Manjula; Tasker, Edward; Williams, Caitlin; Parslow, Adam C; Bryson-Richardson, Robert J; Keith, Jonathan M

    2017-03-27

    Computational identification of non-coding RNAs (ncRNAs) is a challenging problem. We describe a genome-wide analysis using Bayesian segmentation to identify intronic elements highly conserved between three evolutionarily distant vertebrate species: human, mouse and zebrafish. We investigate the extent to which these elements include ncRNAs (or conserved domains of ncRNAs) and regulatory sequences. We identified 655 deeply conserved intronic sequences in a genome-wide analysis. We also performed a pathway-focussed analysis on genes involved in muscle development, detecting 27 intronic elements, of which 22 were not detected in the genome-wide analysis. At least 87% of the genome-wide and 70% of the pathway-focussed elements have existing annotations indicative of conserved RNA secondary structure. The expression of 26 of the pathway-focused elements was examined using RT-PCR, providing confirmation that they include expressed ncRNAs. Consistent with previous studies, these elements are significantly over-represented in the introns of transcription factors. This study demonstrates a novel, highly effective, Bayesian approach to identifying conserved non-coding sequences. Our results complement previous findings that these sequences are enriched in transcription factors. However, in contrast to previous studies which suggest the majority of conserved sequences are regulatory factor binding sites, the majority of conserved sequences identified using our approach contain evidence of conserved RNA secondary structures, and our laboratory results suggest most are expressed. Functional roles at DNA and RNA levels are not mutually exclusive, and many of our elements possess evidence of both. Moreover, ncRNAs play roles in transcriptional and post-transcriptional regulation, and this may contribute to the over-representation of these elements in introns of transcription factors. We attribute the higher sensitivity of the pathway-focussed analysis compared to the genome-wide analysis to improved alignment quality, suggesting that enhanced genomic alignments may reveal many more conserved intronic sequences.

  13. Cooperativity among Short Amyloid Stretches in Long Amyloidogenic Sequences

    PubMed Central

    He, Zhisong; Shi, Xiaohe; Feng, Kaiyan; Ma, Buyong; Cai, Yu-Dong

    2012-01-01

    Amyloid fibrillar aggregates of polypeptides are associated with many neurodegenerative diseases. Short peptide segments in protein sequences may trigger aggregation. Identifying these stretches and examining their behavior in longer protein segments is critical for understanding these diseases and obtaining potential therapies. In this study, we combined machine learning and structure-based energy evaluation to examine and predict amyloidogenic segments. Our feature selection method discovered that windows consisting of long amino acid segments of ∼30 residues, instead of the commonly used short hexapeptides, provided the highest accuracy. Weighted contributions of an amino acid at each position in a 27 residue window revealed three cooperative regions of short stretch, resemble the β-strand-turn-β-strand motif in A-βpeptide amyloid and β-solenoid structure of HET-s(218–289) prion (C). Using an in-house energy evaluation algorithm, the interaction energy between two short stretches in long segment is computed and incorporated as an additional feature. The algorithm successfully predicted and classified amyloid segments with an overall accuracy of 75%. Our study revealed that genome-wide amyloid segments are not only dependent on short high propensity stretches, but also on nearby residues. PMID:22761773

  14. Organizational heterogeneity of vertebrate genomes.

    PubMed

    Frenkel, Svetlana; Kirzhner, Valery; Korol, Abraham

    2012-01-01

    Genomes of higher eukaryotes are mosaics of segments with various structural, functional, and evolutionary properties. The availability of whole-genome sequences allows the investigation of their structure as "texts" using different statistical and computational methods. One such method, referred to as Compositional Spectra (CS) analysis, is based on scoring the occurrences of fixed-length oligonucleotides (k-mers) in the target DNA sequence. CS analysis allows generating species- or region-specific characteristics of the genome, regardless of their length and the presence of coding DNA. In this study, we consider the heterogeneity of vertebrate genomes as a joint effect of regional variation in sequence organization superimposed on the differences in nucleotide composition. We estimated compositional and organizational heterogeneity of genome and chromosome sequences separately and found that both heterogeneity types vary widely among genomes as well as among chromosomes in all investigated taxonomic groups. The high correspondence of heterogeneity scores obtained on three genome fractions, coding, repetitive, and the remaining part of the noncoding DNA (the genome dark matter--GDM) allows the assumption that CS-heterogeneity may have functional relevance to genome regulation. Of special interest for such interpretation is the fact that natural GDM sequences display the highest deviation from the corresponding reshuffled sequences.

  15. Elucidating the triplicated ancestral genome structure of radish based on chromosome-level comparison with the Brassica genomes.

    PubMed

    Jeong, Young-Min; Kim, Namshin; Ahn, Byung Ohg; Oh, Mijin; Chung, Won-Hyong; Chung, Hee; Jeong, Seongmun; Lim, Ki-Byung; Hwang, Yoon-Jung; Kim, Goon-Bo; Baek, Seunghoon; Choi, Sang-Bong; Hyung, Dae-Jin; Lee, Seung-Won; Sohn, Seong-Han; Kwon, Soo-Jin; Jin, Mina; Seol, Young-Joo; Chae, Won Byoung; Choi, Keun Jin; Park, Beom-Seok; Yu, Hee-Ju; Mun, Jeong-Hwan

    2016-07-01

    This study presents a chromosome-scale draft genome sequence of radish that is assembled into nine chromosomal pseudomolecules. A comprehensive comparative genome analysis with the Brassica genomes provides genomic evidences on the evolution of the mesohexaploid radish genome. Radish (Raphanus sativus L.) is an agronomically important root vegetable crop and its origin and phylogenetic position in the tribe Brassiceae is controversial. Here we present a comprehensive analysis of the radish genome based on the chromosome sequences of R. sativus cv. WK10039. The radish genome was sequenced and assembled into 426.2 Mb spanning >98 % of the gene space, of which 344.0 Mb were integrated into nine chromosome pseudomolecules. Approximately 36 % of the genome was repetitive sequences and 46,514 protein-coding genes were predicted and annotated. Comparative mapping of the tPCK-like ancestral genome revealed that the radish genome has intermediate characteristics between the Brassica A/C and B genomes in the triplicated segments, suggesting an internal origin from the genus Brassica. The evolutionary characteristics shared between radish and other Brassica species provided genomic evidences that the current form of nine chromosomes in radish was rearranged from the chromosomes of hexaploid progenitor. Overall, this study provides a chromosome-scale draft genome sequence of radish as well as novel insight into evolution of the mesohexaploid genomes in the tribe Brassiceae.

  16. Xenopus microRNA genes are predominantly located within introns and are differentially expressed in adult frog tissues via post-transcriptional regulation

    PubMed Central

    Tang, Guo-Qing; Maxwell, E. Stuart

    2008-01-01

    The amphibian Xenopus provides a model organism for investigating microRNA expression during vertebrate embryogenesis and development. Searching available Xenopus genome databases using known human pre-miRNAs as query sequences, more than 300 genes encoding 142 Xenopus tropicalis miRNAs were identified. Analysis of Xenopus tropicalis miRNA genes revealed a predominate positioning within introns of protein-coding and nonprotein-coding RNA Pol II-transcribed genes. MiRNA genes were also located in pre-mRNA exons and positioned intergenically between known protein-coding genes. Many miRNA species were found in multiple locations and in more than one genomic context. MiRNA genes were also clustered throughout the genome, indicating the potential for the cotranscription and coordinate expression of miRNAs located in a given cluster. Northern blot analysis confirmed the expression of many identified miRNAs in both X. tropicalis and X. laevis. Comparison of X. tropicalis and X. laevis blots revealed comparable expression profiles, although several miRNAs exhibited species-specific expression in different tissues. More detailed analysis revealed that for some miRNAs, the tissue-specific expression profile of the pri-miRNA precursor was distinctly different from that of the mature miRNA profile. Differential miRNA precursor processing in both the nucleus and cytoplasm was implicated in the observed tissue-specific differences. These observations indicated that post-transcriptional processing plays an important role in regulating miRNA expression in the amphibian Xenopus. PMID:18032731

  17. Segmentation of time series with long-range fractal correlations

    PubMed Central

    Bernaola-Galván, P.; Oliver, J.L.; Hackenberg, M.; Coronado, A.V.; Ivanov, P.Ch.; Carpena, P.

    2012-01-01

    Segmentation is a standard method of data analysis to identify change-points dividing a nonstationary time series into homogeneous segments. However, for long-range fractal correlated series, most of the segmentation techniques detect spurious change-points which are simply due to the heterogeneities induced by the correlations and not to real nonstationarities. To avoid this oversegmentation, we present a segmentation algorithm which takes as a reference for homogeneity, instead of a random i.i.d. series, a correlated series modeled by a fractional noise with the same degree of correlations as the series to be segmented. We apply our algorithm to artificial series with long-range correlations and show that it systematically detects only the change-points produced by real nonstationarities and not those created by the correlations of the signal. Further, we apply the method to the sequence of the long arm of human chromosome 21, which is known to have long-range fractal correlations. We obtain only three segments that clearly correspond to the three regions of different G + C composition revealed by means of a multi-scale wavelet plot. Similar results have been obtained when segmenting all human chromosome sequences, showing the existence of previously unknown huge compositional superstructures in the human genome. PMID:23645997

  18. Mechanisms of alternative splicing regulation: insights from molecular and genomics approaches

    PubMed Central

    Chen, Mo; Manley, James L.

    2010-01-01

    Alternative splicing of mRNA precursors provides an important means of genetic control and is a crucial step in the expression of most genes. Alternative splicing markedly affects human development, and its misregulation underlies many human diseases. Although the mechanisms of alternative splicing have been studied extensively, until the past few years we had not begun to realize fully the diversity and complexity of alternative splicing regulation by an intricate protein–RNA network. Great progress has been made by studying individual transcripts and through genome-wide approaches, which together provide a better picture of the mechanistic regulation of alternative pre-mRNA splicing. PMID:19773805

  19. The Rosa genome provides new insights into the domestication of modern roses.

    PubMed

    Raymond, Olivier; Gouzy, Jérôme; Just, Jérémy; Badouin, Hélène; Verdenaud, Marion; Lemainque, Arnaud; Vergne, Philippe; Moja, Sandrine; Choisne, Nathalie; Pont, Caroline; Carrère, Sébastien; Caissard, Jean-Claude; Couloux, Arnaud; Cottret, Ludovic; Aury, Jean-Marc; Szécsi, Judit; Latrasse, David; Madoui, Mohammed-Amin; François, Léa; Fu, Xiaopeng; Yang, Shu-Hua; Dubois, Annick; Piola, Florence; Larrieu, Antoine; Perez, Magali; Labadie, Karine; Perrier, Lauriane; Govetto, Benjamin; Labrousse, Yoan; Villand, Priscilla; Bardoux, Claudia; Boltz, Véronique; Lopez-Roques, Céline; Heitzler, Pascal; Vernoux, Teva; Vandenbussche, Michiel; Quesneville, Hadi; Boualem, Adnane; Bendahmane, Abdelhafid; Liu, Chang; Le Bris, Manuel; Salse, Jérôme; Baudino, Sylvie; Benhamed, Moussa; Wincker, Patrick; Bendahmane, Mohammed

    2018-06-01

    Roses have high cultural and economic importance as ornamental plants and in the perfume industry. We report the rose whole-genome sequencing and assembly and resequencing of major genotypes that contributed to rose domestication. We generated a homozygous genotype from a heterozygous diploid modern rose progenitor, Rosa chinensis 'Old Blush'. Using single-molecule real-time sequencing and a meta-assembly approach, we obtained one of the most comprehensive plant genomes to date. Diversity analyses highlighted the mosaic origin of 'La France', one of the first hybrids combining the growth vigor of European species and the recurrent blooming of Chinese species. Genomic segments of Chinese ancestry identified new candidate genes for recurrent blooming. Reconstructing regulatory and secondary metabolism pathways allowed us to propose a model of interconnected regulation of scent and flower color. This genome provides a foundation for understanding the mechanisms governing rose traits and should accelerate improvement in roses, Rosaceae and ornamentals.

  20. Exploring the virome of diseased horses

    PubMed Central

    Li, Linlin; Giannitti, Federico; Low, Jason; Keyes, Casey; Ullmann, Leila S.; Deng, Xutao; Aleman, Monica; Pesavento, Patricia A.; Pusterla, Nicola

    2015-01-01

    Metagenomics was used to characterize viral genomes in clinical specimens of horses with various organ-specific diseases of unknown aetiology. A novel parvovirus as well as a previously described hepacivirus closely related to human hepatitis C virus and equid herpesvirus 2 were identified in the cerebrospinal fluid of horses with neurological signs. Four co-infecting picobirnaviruses, including an unusual genome with fused RNA segments, and a divergent anellovirus were found in the plasma of two febrile horses. A novel cyclovirus genome was characterized from the nasal secretion of another febrile animal. Lastly, a small circular DNA genome with a Rep gene, from a virus we called kirkovirus, was identified in the liver and spleen of a horse with fatal idiopathic hepatopathy. This study expands the number of viruses found in horses, and characterizes their genomes to assist future epidemiological studies of their transmission and potential association with various equine diseases. PMID:26044792

  1. Comparative Genomics of Mycobacteria: Some Answers, Yet More New Questions

    PubMed Central

    Behr, Marcel A.

    2015-01-01

    Comparative genomic studies permit a genus-level perspective on the distinction between environmental mycobacteria and Mycobacterium tuberculosis, as well as a species-level assessment of genetic variability within M. tuberculosis. Both of these strata of evolutionary analysis serve to generate hypotheses regarding the genomic basis of M. tuberculosis virulence. In contrasting lessons from macroevolutionary study and microevolutionary study, one can form predictions about which segments of the genome are likely to be essential for or dispensable for the pathogenesis of tuberculosis. Although some of these predictions have been experimentally verified, notable exceptions challenge the direct link between these virulence factors and the capacity of M. tuberculosis to successfully cause disease and propagate between human hosts. These unexpected findings serve as the stimulus for further studies, using genomic comparisons and other approaches, to better define the remarkable success of this recalcitrant pathogen. PMID:25395374

  2. Artificial selection increased body weight but induced increase of runs of homozygosity in Hanwoo cattle

    PubMed Central

    Kim, Kwondo; Jung, Jaehoon; Caetano-Anollés, Kelsey; Sung, Samsun; Yoo, DongAhn; Choi, Bong-Hwan; Kim, Hyung-Chul; Jeong, Jin-Young; Cho, Yong-Min; Park, Eung-Woo; Choi, Tae-Jeong; Park, Byoungho; Lim, Dajeong

    2018-01-01

    Artificial selection has been demonstrated to have a rapid and significant effect on the phenotype and genome of an organism. However, most previous studies on artificial selection have focused solely on genomic sequences modified by artificial selection or genomic sequences associated with a specific trait. In this study, we generated whole genome sequencing data of 126 cattle under artificial selection, and 24,973,862 single nucleotide variants to investigate the relationship among artificial selection, genomic sequences and trait. Using runs of homozygosity detected by the variants, we showed increase of inbreeding for decades, and at the same time demonstrated a little influence of recent inbreeding on body weight. Also, we could identify ~0.2 Mb runs of homozygosity segment which may be created by recent artificial selection. This approach may aid in development of genetic markers directly influenced by artificial selection, and provide insight into the process of artificial selection. PMID:29561881

  3. A Simple Measure of the Dynamics of Segmented Genomes: An Application to Influenza

    NASA Astrophysics Data System (ADS)

    Aris-Brosou, Stéphane

    The severity of influenza epidemics, which can potentially become a pandemic, has been very difficult to predict. However, past efforts were focusing on gene-by-gene approaches, while it is acknowledged that the whole genome dynamics contribute to the severity of an epidemic. Here, putting this rationale into action, I describe a simple measure of the amount of reassortment that affects influenza at a genomic scale during a particular year. The analysis of 530 complete genomes of the H1N1 subtype, sampled over eleven years, shows that the proposed measure explains 58% of the variance in the prevalence of H1 influenza in the US population. The proposed measure, denoted nRF, could therefore improve influenza surveillance programs at a minimal cost.

  4. Chromosomal duplications in bacteria, fruit flies, and humans

    DOE Office of Scientific and Technical Information (OSTI.GOV)

    Lupski, J.R.; Weinstock, G.M.; Roth, J.R.

    1996-01-01

    Tandem duplication of chromosomal segments has been recognized as a frequent mutational mechanism in several genetic model systems. In bacteria, fruit flies, and humans, duplications form by similar molecular mechanisms and appear to be important in genome evolution. 80 refs.

  5. Molecular cloning and expression of a gene for a factor which stabilizes formation of inhibitor-mitochondrial ATPase complex from Saccharomyces cerevisiae.

    PubMed

    Akashi, A; Yoshida, Y; Nakagoshi, H; Kuroki, K; Hashimoto, T; Tagawa, K; Imamoto, F

    1988-10-01

    Stabilizing factor, a 9 kDa protein, stabilizes and facilitates formation of the complex between mitochondrial ATP synthase and its intrinsic inhibitor protein. A clone containing the gene encoding the 9 kDa protein was selected from a yeast genomic library to determine the structure of its precursor protein. As deduced from the nucleotide sequence, the precursor of the yeast 9 kDa stabilizing factor contains 86 amino acid residues and has a molecular weight of 10,062. From the predicted sequence we infer that the stabilizing factor precursor contains a presequence of 23 amino acid residues at its amino terminus. We also used S1 mapping to determine the initiation site of transcription under glucose-repressed or derepressed conditions. These experiments suggest that transcription of this gene starts at three different sites and that only one of them is not affected by the presence of glucose.

  6. RNAi Mediated curcin precursor gene silencing in Jatropha (Jatropha curcas L.).

    PubMed

    Patade, Vikas Yadav; Khatri, Deepti; Kumar, Kamal; Grover, Atul; Kumari, Maya; Gupta, Sanjay Mohan; Kumar, Devender; Nasim, Mohammed

    2014-07-01

    Curcin, a type I ribosomal inhibiting protein-RIP, encoded by curcin precursor gene, is a phytotoxin present in Jatropha (Jatropha curcas L.). Here, we report designing of RNAi construct for the curcin precursor gene and further its genetic transformation of Jatropha to reduce its transcript expression. Curcin precursor gene was first cloned from Jatropha strain DARL-2 and part of the gene sequence was cloned in sense and antisense orientation separated by an intron sequence in plant expression binary vector pRI101 AN. The construction of the RNAi vector was confirmed by double digestion and nucleotide sequencing. The vector was then mobilized into Agrobacterium tumefaciens strain GV 3101 and used for tissue culture independent in planta transformation protocol optimized for Jatropha. Germinating seeds were injured with a needle before infection with Agrobacterium and then transferred to sterilized sand medium. The seedlings were grown for 90 days and genomic DNA was isolated from leaves for transgenic confirmation based on real time PCR with NPT II specific dual labeled probe. Result of the transgenic confirmation analysis revealed presence of the gene silencing construct in ten out of 30 tested seedlings. Further, quantitative transcript expression analysis of the curcin precursor gene revealed reduction in the transcript abundance by more than 98% to undetectable level. The transgenic plants are being grown in containment for further studies on reduction in curcin protein content in Jatropha seeds.

  7. Integrase inhibitor reversal dynamics indicate unintegrated HIV-1 dna initiate de novo integration.

    PubMed

    Thierry, Sylvain; Munir, Soundasse; Thierry, Eloïse; Subra, Frédéric; Leh, Hervé; Zamborlini, Alessia; Saenz, Dyana; Levy, David N; Lesbats, Paul; Saïb, Ali; Parissi, Vincent; Poeschla, Eric; Deprez, Eric; Delelis, Olivier

    2015-03-12

    Genomic integration, an obligate step in the HIV-1 replication cycle, is blocked by the integrase inhibitor raltegravir. A consequence is an excess of unintegrated viral DNA genomes, which undergo intramolecular ligation and accumulate as 2-LTR circles. These circularized genomes are also reliably observed in vivo in the absence of antiviral therapy and they persist in non-dividing cells. However, they have long been considered as dead-end products that are not precursors to integration and further viral propagation. Here, we show that raltegravir action is reversible and that unintegrated viral DNA is integrated in the host cell genome after raltegravir removal leading to HIV-1 replication. Using quantitative PCR approach, we analyzed the consequences of reversing prolonged raltegravir-induced integration blocks. We observed, after RAL removal, a decrease of 2-LTR circles and a transient increase of linear DNA that is subsequently integrated in the host cell genome and fuel new cycles of viral replication. Our data highly suggest that 2-LTR circles can be used as a reserve supply of genomes for proviral integration highlighting their potential role in the overall HIV-1 replication cycle.

  8. Dietary nitrogen alters codon bias and genome composition in parasitic microorganisms.

    PubMed

    Seward, Emily A; Kelly, Steven

    2016-11-15

    Genomes are composed of long strings of nucleotide monomers (A, C, G and T) that are either scavenged from the organism's environment or built from metabolic precursors. The biosynthesis of each nucleotide differs in atomic requirements with different nucleotides requiring different quantities of nitrogen atoms. However, the impact of the relative availability of dietary nitrogen on genome composition and codon bias is poorly understood. Here we show that differential nitrogen availability, due to differences in environment and dietary inputs, is a major determinant of genome nucleotide composition and synonymous codon use in both bacterial and eukaryotic microorganisms. Specifically, low nitrogen availability species use nucleotides that require fewer nitrogen atoms to encode the same genes compared to high nitrogen availability species. Furthermore, we provide a novel selection-mutation framework for the evaluation of the impact of metabolism on gene sequence evolution and show that it is possible to predict the metabolic inputs of related organisms from an analysis of the raw nucleotide sequence of their genes. Taken together, these results reveal a previously hidden relationship between cellular metabolism and genome evolution and provide new insight into how genome sequence evolution can be influenced by adaptation to different diets and environments.

  9. Molecular evolution of the neurohypophysial hormone precursors in mammals: Comparative genomics reveals novel mammalian oxytocin and vasopressin analogues.

    PubMed

    Wallis, Michael

    2012-11-01

    Among vertebrates the neurohypophysial hormones show considerable variation. However, in eutherian mammals they have been considered rather conserved, with arginine vasopressin (AVP) and oxytocin (OT) in all species except pig and some relatives, where lysine vasopressin replaces AVP. The availability of genomic data for a wide range of mammals makes it possible to assess whether these peptides and their precursors may be more variable in Eutheria than previously suspected. A survey of these data confirms that AVP and OT occur in most eutherians, but with exceptions. In a New-World monkey (marmoset, Callithrix jacchus) and in tree shrew (Tupaia belangeri), Pro(8)OT replaces OT, confirming a recent report for these species. In armadillo (Dasypus novemcinctus) Leu(3)OT replaces OT, while in tenrec (Echinops telfairi) Thr(4)AVP replaces AVP. In these two species there is also evidence for additional genes/pseudogenes, encoding much-modified forms of AVP, but in most other eutherian species there is no evidence for additional neurohypophysial hormone genes. Evolutionary analysis shows that sequences of eutherian neurohypophysial hormone precursors are generally strongly conserved, particularly those regions encoding active peptide and neurophysin. The close association between OT and VP genes has led to frequent gene conversion of sequences encoding neurophysins. A monotreme, platypus (Ornithorhynchus anatinus) has genes for OT and AVP, organized tail-to-tail as in eutherians, but in marsupials 3-4 genes are present for neurohypophysial hormones, organized tail-to-head as in lower vertebrates. Copyright © 2012 Elsevier Inc. All rights reserved.

  10. Structural organization of poliovirus RNA replication is mediated by viral proteins of the P2 genomic region

    DOE Office of Scientific and Technical Information (OSTI.GOV)

    Bienz, K.; Egger, D.; Troxler, M.

    1990-03-01

    Transcriptionally active replication complexes bound to smooth membrane vesicles were isolated from poliovirus-infected cells. In electron microscopic, negatively stained preparations, the replication complex appeared as an irregularly shaped, oblong structure attached to several virus-induced vesicles of a rosettelike arrangement. Electron microscopic immunocytochemistry of such preparations demonstrated that the poliovirus replication complex contains the proteins coded by the P2 genomic region (P2 proteins) in a membrane-associated form. In addition, the P2 proteins are also associated with viral RNA, and they can be cross-linked to viral RNA by UV irradiation. Guanidine hydrochloride prevented the P2 proteins from becoming membrane bound but didmore » not change their association with viral RNA. The findings allow the conclusion that the protein 2C or 2C-containing precursor(s) is responsible for the attachment of the viral RNA to the vesicular membrane and for the spatial organization of the replication complex necessary for its proper functioning in viral transcription. A model for the structure of the viral replication complex and for the function of the 2C-containing P2 protein(s) and the vesicular membranes is proposed.« less

  11. Developmental Design of Synthetic Bacterial Architectures by Morphogenetic Engineering.

    PubMed

    Pascalie, Jonathan; Potier, Martin; Kowaliw, Taras; Giavitto, Jean-Louis; Michel, Olivier; Spicher, Antoine; Doursat, René

    2016-08-19

    Synthetic biology is an emerging scientific field that promotes the standardized manufacturing of biological components without natural equivalents. Its goal is to create artificial living systems that can meet various needs in health care or energy domains. While most works are focused on the individual bacterium as a chemical reactor, our project, SynBioTIC, addresses a novel and more complex challenge: shape engineering; that is, the redesign of natural morphogenesis toward a new kind of developmental 3D printing. Potential applications include organ growth, natural computing in biocircuits, or future vegetal houses. To create in silico multicellular organisms that exhibit specific shapes, we construe their development as an iterative process combining fundamental collective phenomena such as homeostasis, patterning, segmentation, and limb growth. Our numerical experiments rely on the existing Escherichia coli simulator Gro, a physicochemical computation platform offering reaction-diffusion and collision dynamics solvers. The synthetic bioware of our model executes a set of rules, or genome, in each cell. Cells can differentiate into several predefined types associated with specific actions (divide, emit signal, detect signal, die). Transitions between types are triggered by conditions involving internal and external sensors that detect various protein levels inside and around the cell. Indirect communication between bacteria is relayed by morphogen diffusion and the mechanical constraints of 2D packing. Starting from a single bacterium, the overall architecture emerges in a purely endogenous fashion through a series of developmental stages, inlcuding proliferation, differentiation, morphogen diffusion, and synchronization. The genome can be parametrized to control the growth and features of appendages individually. As exemplified by the L and T shapes that we obtain, certain precursor cells can be inhibited while others can create limbs of varying size (divergence of the homology). Such morphogenetic phenotypes open the way to more complex shapes made of a recursive array of core bodies and limbs and, most importantly, to an evolutionary developmental exploration of unplanned functional forms.

  12. Granada virus: a natural phlebovirus reassortant of the sandfly fever Naples serocomplex with low seroprevalence in humans.

    PubMed

    Collao, Ximena; Palacios, Gustavo; de Ory, Fernando; Sanbonmatsu, Sara; Pérez-Ruiz, Mercedes; Navarro, José María; Molina, Ricardo; Hutchison, Stephen K; Lipkin, W Ian; Tenorio, Antonio; Sánchez-Seco, María Paz

    2010-10-01

    A new member of the phlebovirus genus, tentatively named Granada virus, was detected in sandflies collected in Spain. By showing the presence of specific neutralizing antibodies in human serum collected in Granada, we show that Granada virus infects humans. The analysis of the complete genome of Granada virus revealed that this agent is likely to be a natural reassortant of the recently described Massilia virus (donor of the long and short segments) with a yet unidentified phlebovirus (donor of the medium segment).

  13. Granada Virus: a Natural Phlebovirus Reassortant of the Sandfly Fever Naples Serocomplex with Low Seroprevalence in Humans

    PubMed Central

    Collao, Ximena; Palacios, Gustavo; de Ory, Fernando; Sanbonmatsu, Sara; Pérez-Ruiz, Mercedes; Navarro, José María; Molina, Ricardo; Hutchison, Stephen K.; Lipkin, Ian W.; Tenorio, Antonio; Sánchez-Seco, María Paz

    2010-01-01

    A new member of the phlebovirus genus, tentatively named Granada virus, was detected in sandflies collected in Spain. By showing the presence of specific neutralizing antibodies in human serum collected in Granada, we show that Granada virus infects humans. The analysis of the complete genome of Granada virus revealed that this agent is likely to be a natural reassortant of the recently described Massilia virus (donor of the long and short segments) with a yet unidentified phlebovirus (donor of the medium segment). PMID:20889862

  14. A new genome-mining tool redefines the lasso peptide biosynthetic landscape

    PubMed Central

    Tietz, Jonathan I.; Schwalen, Christopher J.; Patel, Parth S.; Maxson, Tucker; Blair, Patricia M.; Tai, Hua-Chia; Zakai, Uzma I.; Mitchell, Douglas A.

    2016-01-01

    Ribosomally synthesized and post-translationally modified peptide (RiPP) natural products are attractive for genome-driven discovery and re-engineering, but limitations in bioinformatic methods and exponentially increasing genomic data make large-scale mining difficult. We report RODEO (Rapid ORF Description and Evaluation Online), which combines hidden Markov model-based analysis, heuristic scoring, and machine learning to identify biosynthetic gene clusters and predict RiPP precursor peptides. We initially focused on lasso peptides, which display intriguing physiochemical properties and bioactivities, but their hypervariability renders them challenging prospects for automated mining. Our approach yielded the most comprehensive mapping of lasso peptide space, revealing >1,300 compounds. We characterized the structures and bioactivities of six lasso peptides, prioritized based on predicted structural novelty, including an unprecedented handcuff-like topology and another with a citrulline modification exceptionally rare among bacteria. These combined insights significantly expand the knowledge of lasso peptides, and more broadly, provide a framework for future genome-mining efforts. PMID:28244986

  15. The consequences of chromosomal aneuploidy on the transcriptome of cancer cells☆

    PubMed Central

    Ried, Thomas; Hu, Yue; Difilippantonio, Michael J.; Ghadimi, B. Michael; Grade, Marian; Camps, Jordi

    2016-01-01

    Chromosomal aneuploidies are a defining feature of carcinomas, i.e., tumors of epithelial origin. Such aneuploidies result in tumor specific genomic copy number alterations. The patterns of genomic imbalances are tumor specific, and to a certain extent specific for defined stages of tumor development. Genomic imbalances occur already in premalignant precursor lesions, i.e., before the transition to invasive disease, and their distribution is maintained in metastases, and in cell lines derived from primary tumors. These observations are consistent with the interpretation that tumor specific genomic imbalances are drivers of malignant transformation. Naturally, this precipitates the question of how such imbalances influence the expression of resident genes. A number of laboratories have systematically integrated copy number alterations with gene expression changes in primary tumors and metastases, cell lines, and experimental models of aneuploidy to address the question as to whether genomic imbalances deregulate the expression of one or few key genes, or rather affect the cancer transcriptome more globally. The majority of these studies showed that gene expression levels follow genomic copy number. Therefore, gross genomic copy number changes, including aneuploidies of entire chromosome arms and chromosomes, result in a massive deregulation of the transcriptome of cancer cells. This article is part of a Special Issue entitled: Chromatin in time and space. PMID:22426433

  16. Structural Genomics: Correlation Blocks, Population Structure, and Genome Architecture

    PubMed Central

    Hu, Xin-Sheng; Yeh, Francis C.; Wang, Zhiquan

    2011-01-01

    An integration of the pattern of genome-wide inter-site associations with evolutionary forces is important for gaining insights into the genomic evolution in natural or artificial populations. Here, we assess the inter-site correlation blocks and their distributions along chromosomes. A correlation block is broadly termed as the DNA segment within which strong correlations exist between genetic diversities at any two sites. We bring together the population genetic structure and the genomic diversity structure that have been independently built on different scales and synthesize the existing theories and methods for characterizing genomic structure at the population level. We discuss how population structure could shape correlation blocks and their patterns within and between populations. Effects of evolutionary forces (selection, migration, genetic drift, and mutation) on the pattern of genome-wide correlation blocks are discussed. In eukaryote organisms, we briefly discuss the associations between the pattern of correlation blocks and genome assembly features in eukaryote organisms, including the impacts of multigene family, the perturbation of transposable elements, and the repetitive nongenic sequences and GC-rich isochores. Our reviews suggest that the observable pattern of correlation blocks can refine our understanding of the ecological and evolutionary processes underlying the genomic evolution at the population level. PMID:21886455

  17. Chemo-sensitivity in a panel of B-cell precursor acute lymphoblastic leukemia cell lines, YCUB series, derived from children.

    PubMed

    Goto, Hiroaki; Naruto, Takuya; Tanoshima, Reo; Kato, Hiromi; Yokosuka, Tomoko; Yanagimachi, Masakatsu; Fujii, Hisaki; Yokota, Shumpei; Komine, Hiromi

    2009-10-01

    Sensitivity to 10 anticancer drugs was evaluated in 6 childhood B-cell precursor acute lymphoblastic leukemia (BCP-ALL) cell lines. Authenticity of newly established cell lines was confirmed by genomic fingerprinting. The line YCUB-5R established at relapse was more resistant to 4-hydroperoxy-cyclophosphamide, cytarabine, L-asparaginase, topotecan, fludarabine, and etoposide than YCUB-5 from the same patient at diagnosis. Of the drugs tested, etoposide and SN-38 (irinotecan) showed highest efficacy in the panel, with 50% growth inhibition at 0.22-1.8 microg/ml and 0.57-3.6 ng/ml, respectively. This cell line panel offers an in vitro model for the development of new therapies for childhood BCP-ALL.

  18. The PLUTO plastidial nucleobase transporter also transports the thiamin precursor hydroxymethylpyrimidine

    PubMed Central

    Beaudoin, Guillaume A.W.; Johnson, Timothy S.; Hanson, Andrew D.

    2018-01-01

    In plants, the hydroxymethylpyrimidine (HMP) and thiazole precursors of thiamin are synthesized and coupled together to form thiamin in plastids. Mutants unable to form HMP can be rescued by exogenous HMP, implying the presence of HMP transporters in the plasma membrane and plastids. Analysis of bacterial genomes revealed a transporter gene that is chromosomally clustered with thiamin biosynthesis and salvage genes. Its closest Arabidopsis homolog, the plastidic nucleobase transporter (PLUTO), is co-expressed with several thiamin biosynthetic enzymes. Heterologous expression of PLUTO in Escherichia coli or Saccharomyces cerevisiae increased sensitivity to a toxic HMP analog, and disrupting PLUTO in an HMP-requiring Arabidopsis line reduced root growth at low HMP concentrations. These data implicate PLUTO in plastidial transport and salvage of HMP. PMID:29507060

  19. CNV-CH: A Convex Hull Based Segmentation Approach to Detect Copy Number Variations (CNV) Using Next-Generation Sequencing Data

    PubMed Central

    De, Rajat K.

    2015-01-01

    Copy number variation (CNV) is a form of structural alteration in the mammalian DNA sequence, which are associated with many complex neurological diseases as well as cancer. The development of next generation sequencing (NGS) technology provides us a new dimension towards detection of genomic locations with copy number variations. Here we develop an algorithm for detecting CNVs, which is based on depth of coverage data generated by NGS technology. In this work, we have used a novel way to represent the read count data as a two dimensional geometrical point. A key aspect of detecting the regions with CNVs, is to devise a proper segmentation algorithm that will distinguish the genomic locations having a significant difference in read count data. We have designed a new segmentation approach in this context, using convex hull algorithm on the geometrical representation of read count data. To our knowledge, most algorithms have used a single distribution model of read count data, but here in our approach, we have considered the read count data to follow two different distribution models independently, which adds to the robustness of detection of CNVs. In addition, our algorithm calls CNVs based on the multiple sample analysis approach resulting in a low false discovery rate with high precision. PMID:26291322

  20. CNV-CH: A Convex Hull Based Segmentation Approach to Detect Copy Number Variations (CNV) Using Next-Generation Sequencing Data.

    PubMed

    Sinha, Rituparna; Samaddar, Sandip; De, Rajat K

    2015-01-01

    Copy number variation (CNV) is a form of structural alteration in the mammalian DNA sequence, which are associated with many complex neurological diseases as well as cancer. The development of next generation sequencing (NGS) technology provides us a new dimension towards detection of genomic locations with copy number variations. Here we develop an algorithm for detecting CNVs, which is based on depth of coverage data generated by NGS technology. In this work, we have used a novel way to represent the read count data as a two dimensional geometrical point. A key aspect of detecting the regions with CNVs, is to devise a proper segmentation algorithm that will distinguish the genomic locations having a significant difference in read count data. We have designed a new segmentation approach in this context, using convex hull algorithm on the geometrical representation of read count data. To our knowledge, most algorithms have used a single distribution model of read count data, but here in our approach, we have considered the read count data to follow two different distribution models independently, which adds to the robustness of detection of CNVs. In addition, our algorithm calls CNVs based on the multiple sample analysis approach resulting in a low false discovery rate with high precision.

  1. Genome-Wide Identification and Expression Analysis of WRKY Transcription Factors under Multiple Stresses in Brassica napus

    PubMed Central

    He, Yajun; Mao, Shaoshuai; Gao, Yulong; Zhu, Liying; Wu, Daoming; Cui, Yixin; Li, Jiana; Qian, Wei

    2016-01-01

    WRKY transcription factors play important roles in responses to environmental stress stimuli. Using a genome-wide domain analysis, we identified 287 WRKY genes with 343 WRKY domains in the sequenced genome of Brassica napus, 139 in the A sub-genome and 148 in the C sub-genome. These genes were classified into eight groups based on phylogenetic analysis. In the 343 WRKY domains, a total of 26 members showed divergence in the WRKY domain, and 21 belonged to group I. This finding suggested that WRKY genes in group I are more active and variable compared with genes in other groups. Using genome-wide identification and analysis of the WRKY gene family in Brassica napus, we observed genome duplication, chromosomal/segmental duplications and tandem duplication. All of these duplications contributed to the expansion of the WRKY gene family. The duplicate segments that were detected indicated that genome duplication events occurred in the two diploid progenitors B. rapa and B. olearecea before they combined to form B. napus. Analysis of the public microarray database and EST database for B. napus indicated that 74 WRKY genes were induced or preferentially expressed under stress conditions. According to the public QTL data, we identified 77 WRKY genes in 31 QTL regions related to various stress tolerance. We further evaluated the expression of 26 BnaWRKY genes under multiple stresses by qRT-PCR. Most of the genes were induced by low temperature, salinity and drought stress, indicating that the WRKYs play important roles in B. napus stress responses. Further, three BnaWRKY genes were strongly responsive to the three multiple stresses simultaneously, which suggests that these 3 WRKY may have multi-functional roles in stress tolerance and can potentially be used in breeding new rapeseed cultivars. We also found six tandem repeat pairs exhibiting similar expression profiles under the various stress conditions, and three pairs were mapped in the stress related QTL regions, indicating tandem duplicate WRKYs in the adaptive responses to environmental stimuli during the evolution process. Our results provide a framework for future studies regarding the function of WRKY genes in response to stress in B. napus. PMID:27322342

  2. Genome-Wide Identification and Expression Analysis of WRKY Transcription Factors under Multiple Stresses in Brassica napus.

    PubMed

    He, Yajun; Mao, Shaoshuai; Gao, Yulong; Zhu, Liying; Wu, Daoming; Cui, Yixin; Li, Jiana; Qian, Wei

    2016-01-01

    WRKY transcription factors play important roles in responses to environmental stress stimuli. Using a genome-wide domain analysis, we identified 287 WRKY genes with 343 WRKY domains in the sequenced genome of Brassica napus, 139 in the A sub-genome and 148 in the C sub-genome. These genes were classified into eight groups based on phylogenetic analysis. In the 343 WRKY domains, a total of 26 members showed divergence in the WRKY domain, and 21 belonged to group I. This finding suggested that WRKY genes in group I are more active and variable compared with genes in other groups. Using genome-wide identification and analysis of the WRKY gene family in Brassica napus, we observed genome duplication, chromosomal/segmental duplications and tandem duplication. All of these duplications contributed to the expansion of the WRKY gene family. The duplicate segments that were detected indicated that genome duplication events occurred in the two diploid progenitors B. rapa and B. olearecea before they combined to form B. napus. Analysis of the public microarray database and EST database for B. napus indicated that 74 WRKY genes were induced or preferentially expressed under stress conditions. According to the public QTL data, we identified 77 WRKY genes in 31 QTL regions related to various stress tolerance. We further evaluated the expression of 26 BnaWRKY genes under multiple stresses by qRT-PCR. Most of the genes were induced by low temperature, salinity and drought stress, indicating that the WRKYs play important roles in B. napus stress responses. Further, three BnaWRKY genes were strongly responsive to the three multiple stresses simultaneously, which suggests that these 3 WRKY may have multi-functional roles in stress tolerance and can potentially be used in breeding new rapeseed cultivars. We also found six tandem repeat pairs exhibiting similar expression profiles under the various stress conditions, and three pairs were mapped in the stress related QTL regions, indicating tandem duplicate WRKYs in the adaptive responses to environmental stimuli during the evolution process. Our results provide a framework for future studies regarding the function of WRKY genes in response to stress in B. napus.

  3. Progress in Understanding and Sequencing the Genome of Brassica rapa

    PubMed Central

    Hong, Chang Pyo; Kwon, Soo-Jin; Kim, Jung Sun; Yang, Tae-Jin; Park, Beom-Seok; Lim, Yong Pyo

    2008-01-01

    Brassica rapa, which is closely related to Arabidopsis thaliana, is an important crop and a model plant for studying genome evolution via polyploidization. We report the current understanding of the genome structure of B. rapa and efforts for the whole-genome sequencing of the species. The tribe Brassicaceae, which comprises ca. 240 species, descended from a common hexaploid ancestor with a basic genome similar to that of Arabidopsis. Chromosome rearrangements, including fusions and/or fissions, resulted in the present-day “diploid” Brassica species with variation in chromosome number and phenotype. Triplicated genomic segments of B. rapa are collinear to those of A. thaliana with InDels. The genome triplication has led to an approximately 1.7-fold increase in the B. rapa gene number compared to that of A. thaliana. Repetitive DNA of B. rapa has also been extensively amplified and has diverged from that of A. thaliana. For its whole-genome sequencing, the Brassica rapa Genome Sequencing Project (BrGSP) consortium has developed suitable genomic resources and constructed genetic and physical maps. Ten chromosomes of B. rapa are being allocated to BrGSP consortium participants, and each chromosome will be sequenced by a BAC-by-BAC approach. Genome sequencing of B. rapa will offer a new perspective for plant biology and evolution in the context of polyploidization. PMID:18288250

  4. DNA Data Visualization (DDV): Software for Generating Web-Based Interfaces Supporting Navigation and Analysis of DNA Sequence Data of Entire Genomes.

    PubMed

    Neugebauer, Tomasz; Bordeleau, Eric; Burrus, Vincent; Brzezinski, Ryszard

    2015-01-01

    Data visualization methods are necessary during the exploration and analysis activities of an increasingly data-intensive scientific process. There are few existing visualization methods for raw nucleotide sequences of a whole genome or chromosome. Software for data visualization should allow the researchers to create accessible data visualization interfaces that can be exported and shared with others on the web. Herein, novel software developed for generating DNA data visualization interfaces is described. The software converts DNA data sets into images that are further processed as multi-scale images to be accessed through a web-based interface that supports zooming, panning and sequence fragment selection. Nucleotide composition frequencies and GC skew of a selected sequence segment can be obtained through the interface. The software was used to generate DNA data visualization of human and bacterial chromosomes. Examples of visually detectable features such as short and long direct repeats, long terminal repeats, mobile genetic elements, heterochromatic segments in microbial and human chromosomes, are presented. The software and its source code are available for download and further development. The visualization interfaces generated with the software allow for the immediate identification and observation of several types of sequence patterns in genomes of various sizes and origins. The visualization interfaces generated with the software are readily accessible through a web browser. This software is a useful research and teaching tool for genetics and structural genomics.

  5. Single-cell sequencing provides clues about the host interactions of segmented filamentous bacteria (SFB)

    PubMed Central

    Pamp, Sünje J.; Harrington, Eoghan D.; Quake, Stephen R.; Relman, David A.; Blainey, Paul C.

    2012-01-01

    Segmented filamentous bacteria (SFB) are host-specific intestinal symbionts that comprise a distinct clade within the Clostridiaceae, designated Candidatus Arthromitus. SFB display a unique life cycle within the host, involving differentiation into multiple cell types. The latter include filaments that attach intimately to intestinal epithelial cells, and from which “holdfasts” and spores develop. SFB induce a multifaceted immune response, leading to host protection from intestinal pathogens. Cultivation resistance has hindered characterization of these enigmatic bacteria. In the present study, we isolated five SFB filaments from a mouse using a microfluidic device equipped with laser tweezers, generated genome sequences from each, and compared these sequences with each other, as well as to recently published SFB genome sequences. Based on the resulting analyses, SFB appear to be dependent on the host for a variety of essential nutrients. SFB have a relatively high abundance of predicted proteins devoted to cell cycle control and to envelope biogenesis, and have a group of SFB-specific autolysins and a dynamin-like protein. Among the five filament genomes, an average of 8.6% of predicted proteins were novel, including a family of secreted SFB-specific proteins. Four ADP-ribosyltransferase (ADPRT) sequence types, and a myosin-cross-reactive antigen (MCRA) protein were discovered; we hypothesize that they are involved in modulation of host responses. The presence of polymorphisms among mouse SFB genomes suggests the evolution of distinct SFB lineages. Overall, our results reveal several aspects of SFB adaptation to the mammalian intestinal tract. PMID:22434425

  6. A novel variant associated with HDL-C levels by modifying DAGLB expression levels: An annotation-based genome-wide association study.

    PubMed

    Zhou, Dan; Zhang, Dandan; Sun, Xiaohui; Li, Zhiqiang; Ni, Yaqin; Shan, Zhongyan; Li, Hong; Liu, Chengguo; Zhang, Shuai; Liu, Yi; Zheng, Ruizhi; Pan, Feixia; Zhu, Yimin; Shi, Yongyong; Lai, Maode

    2018-06-01

    Although numbers of genome-wide association studies (GWAS) have been performed for serum lipid levels, limited heritability has been explained. Studies showed that combining data from GWAS and expression quantitative trait loci (eQTLs) signals can both enhance the discovery of trait-associated SNPs and gain a better understanding of the mechanism. We performed an annotation-based, multistage genome-wide screening for serum-lipid-level-associated loci in totally 6863 Han Chinese. A serum high-density lipoprotein cholesterol (HDL-C) associated variant rs1880118 (hg19 chr7:g. 6435220G>C) was replicated (P combined  = 1.4E-10). rs1880118 was associated with DAGLB (diacylglycerol lipase, beta) expression levels in subcutaneous adipose tissue (P = 5.9E-42) and explained 47.7% of the expression variance. After the replication, an active segment covering variants tagged by rs1880118 near 5' of DAGLB was annotated using histone modification and transcription factor binding signals. The luciferase report assay revealed that the segment containing the minor alleles showed increased transcriptional activity compared with segment contains the major alleles, which was consistent with the eQTL analyses. The expression-trait association tests indicated the association between the DAGLB and serum HDL-C levels using gene-based approaches called "TWAS" (P = 3.0E-8), "SMR" (P = 1.1E-4), and "Sherlock" (P = 1.6E-6). To summarize, we identified a novel HDL-C-associated variant which explained nearly half of the expression variance of DAGLB. Integrated analyses established a genotype-gene-phenotype three-way association and expanded our knowledge of DAGLB in lipid metabolism.

  7. Whole-genome analysis of piscine reovirus (PRV) shows PRV represents a new genus in family Reoviridae and its genome segment S1 sequences group it into two separate sub-genotypes.

    PubMed

    Kibenge, Molly J T; Iwamoto, Tokinori; Wang, Yingwei; Morton, Alexandra; Godoy, Marcos G; Kibenge, Frederick S B

    2013-07-11

    Piscine reovirus (PRV) is a newly discovered fish reovirus of anadromous and marine fish ubiquitous among fish in Norwegian salmon farms, and likely the causative agent of heart and skeletal muscle inflammation (HSMI). HSMI is an increasingly economically significant disease in Atlantic salmon (Salmo salar) farms. The nucleotide sequence data available for PRV are limited, and there is no genetic information on this virus outside of Norway and none from wild fish. RT-PCR amplification and sequencing were used to obtain the complete viral genome of PRV (10 segments) from western Canada and Chile. The genetic diversity among the PRV strains and their relationship to Norwegian PRV isolates were determined by phylogenetic analyses and sequence identity comparisons. PRV is distantly related to members of the genera Orthoreovirus and Aquareovirus and an unambiguous new genus within the family Reoviridae. The Canadian and Norwegian PRV strains are most divergent in the segment S1 and S4 encoded proteins. Phylogenetic analysis of PRV S1 sequences, for which the largest number of complete sequences from different "isolates" is available, grouped Norwegian PRV strains into a single genotype, Genotype I, with sub-genotypes, Ia and Ib. The Canadian PRV strains matched sub-genotype Ia and Chilean PRV strains matched sub-genotype Ib. PRV should be considered as a member of a new genus within the family Reoviridae with two major Norwegian sub-genotypes. The Canadian PRV diverged from Norwegian sub-genotype Ia around 2007 ± 1, whereas the Chilean PRV diverged from Norwegian sub-genotype Ib around 2008 ± 1.

  8. Whole-genome analysis of piscine reovirus (PRV) shows PRV represents a new genus in family Reoviridae and its genome segment S1 sequences group it into two separate sub-genotypes

    PubMed Central

    2013-01-01

    Background Piscine reovirus (PRV) is a newly discovered fish reovirus of anadromous and marine fish ubiquitous among fish in Norwegian salmon farms, and likely the causative agent of heart and skeletal muscle inflammation (HSMI). HSMI is an increasingly economically significant disease in Atlantic salmon (Salmo salar) farms. The nucleotide sequence data available for PRV are limited, and there is no genetic information on this virus outside of Norway and none from wild fish. Methods RT-PCR amplification and sequencing were used to obtain the complete viral genome of PRV (10 segments) from western Canada and Chile. The genetic diversity among the PRV strains and their relationship to Norwegian PRV isolates were determined by phylogenetic analyses and sequence identity comparisons. Results PRV is distantly related to members of the genera Orthoreovirus and Aquareovirus and an unambiguous new genus within the family Reoviridae. The Canadian and Norwegian PRV strains are most divergent in the segment S1 and S4 encoded proteins. Phylogenetic analysis of PRV S1 sequences, for which the largest number of complete sequences from different “isolates” is available, grouped Norwegian PRV strains into a single genotype, Genotype I, with sub-genotypes, Ia and Ib. The Canadian PRV strains matched sub-genotype Ia and Chilean PRV strains matched sub-genotype Ib. Conclusions PRV should be considered as a member of a new genus within the family Reoviridae with two major Norwegian sub-genotypes. The Canadian PRV diverged from Norwegian sub-genotype Ia around 2007 ± 1, whereas the Chilean PRV diverged from Norwegian sub-genotype Ib around 2008 ± 1. PMID:23844948

  9. Regional Heritability Mapping Provides Insights into Dry Matter Content in African White and Yellow Cassava Populations.

    PubMed

    Okeke, Uche Godfrey; Akdemir, Deniz; Rabbi, Ismail; Kulakow, Peter; Jannink, Jean-Luc

    2018-03-01

    The HarvestPlus program for cassava ( Crantz) fortifies cassava with β-carotene by breeding for carotene-rich tubers (yellow cassava). However, a negative correlation between yellowness and dry matter (DM) content has been identified. We investigated the genetic control of DM in white and yellow cassava. We used regional heritability mapping (RHM) to associate DM with genomic segments in both subpopulations. Significant segments were subjected to candidate gene analysis and candidates were validated with prediction accuracies. The RHM procedure was validated via a simulation approach and revealed significant hits for white cassava on chromosomes 1, 4, 5, 10, 17, and 18, whereas hits for the yellow were on chromosome 1. Candidate gene analysis revealed genes in the carbohydrate biosynthesis pathway including plant serine-threonine protein kinases (SnRKs), UDP (uridine diphosphate)-glycosyltransferases, UDP-sugar transporters, invertases, pectinases, and regulons. Validation using 1252 unique identifiers from the SnRK gene family genome-wide recovered 50% of the predictive accuracy of whole-genome single nucleotide polymorphisms for DM, whereas validation using 53 likely genes (extracted from the literature) from significant segments recovered 32%. Genes including an acid invertase, a neutral or alkaline invertase, and a glucose-6-phosphate isomerase were validated on the basis of an a priori list for the cassava starch pathway, and also a fructose-biphosphate aldolase from the Calvin cycle pathway. The power of the RHM procedure was estimated as 47% when the causal quantitative trait loci generated 10% of the phenotypic variance (sample size = 451). Cassava DM genetics are complex and RHM may be useful for complex traits. Copyright © 2018 Crop Science Society of America.

  10. Interspecific Y chromosome variation is sufficient to rescue hybrid male sterility and is influenced by the grandparental origin of the chromosomes.

    PubMed

    Araripe, L O; Tao, Y; Lemos, B

    2016-06-01

    Y chromosomes display population variation within and between species. Co-evolution within populations is expected to produce adaptive interactions between Y chromosomes and the rest of the genome. One consequence is that Y chromosomes from disparate populations could disrupt harmonious interactions between co-evolved genetic elements and result in reduced male fertility, sterility or inviability. Here we address the contribution of 'heterospecific Y chromosomes' to fertility in hybrid males carrying a homozygous region of Drosophila mauritiana introgressed in the Drosophila simulans background. In order to detect Y chromosome-autosome interactions, which may go unnoticed in a single-species background of autosomes, we constructed hybrid genotypes involving three sister species: Drosophila simulans, D. mauritiana, and D. sechellia. These engineered strains varied due to: (i) species origin of the Y chromosome (D. simulans or D. sechellia); (ii) location of the introgressed D. mauritiana segment on the D. simulans third chromosome, and (iii) grandparental genomic background (three genotypes of D. simulans). We find complex interactions between the species origin of the Y chromosome, the identity of the D. mauritiana segment and the grandparental genetic background donating the chromosomes. Unexpectedly, the interaction of the Y chromosome and one segment of D. mauritiana drastically reduced fertility in the presence of Ysim, whereas the fertility is partially rescued by the Y chromosome of D. sechellia when it descends from a specific grandparental genotype. The restoration of fertility occurs in spite of an autosomal and X-linked genome that is mostly of D. simulans origin. These results illustrate the multifactorial basis of genetic interactions involving the Y chromosome. Our study supports the hypothesis that the Y chromosome can contribute significantly to the evolution of reproductive isolation and highlights the conditional manifestation of infertility in specific genotypic combinations.

  11. Segmental Duplication, Microinversion, and Gene Loss Associated with a Complex Inversion Breakpoint Region in Drosophila

    PubMed Central

    Calvete, Oriol; González, Josefa; Betrán, Esther; Ruiz, Alfredo

    2012-01-01

    Chromosomal inversions are usually portrayed as simple two-breakpoint rearrangements changing gene order but not gene number or structure. However, increasing evidence suggests that inversion breakpoints may often have a complex structure and entail gene duplications with potential functional consequences. Here, we used a combination of different techniques to investigate the breakpoint structure and the functional consequences of a complex rearrangement fixed in Drosophila buzzatii and comprising two tandemly arranged inversions sharing the middle breakpoint: 2m and 2n. By comparing the sequence in the breakpoint regions between D. buzzatii (inverted chromosome) and D. mojavensis (noninverted chromosome), we corroborate the breakpoint reuse at the molecular level and infer that inversion 2m was associated with a duplication of a ∼13 kb segment and likely generated by staggered breaks plus repair by nonhomologous end joining. The duplicated segment contained the gene CG4673, involved in nuclear transport, and its two nested genes CG5071 and CG5079. Interestingly, we found that other than the inversion and the associated duplication, both breakpoints suffered additional rearrangements, that is, the proximal breakpoint experienced a microinversion event associated at both ends with a 121-bp long duplication that contains a promoter. As a consequence of all these different rearrangements, CG5079 has been lost from the genome, CG5071 is now a single copy nonnested gene, and CG4673 has a transcript ∼9 kb shorter and seems to have acquired a more complex gene regulation. Our results illustrate the complex effects of chromosomal rearrangements and highlight the need of complementing genomic approaches with detailed sequence-level and functional analyses of breakpoint regions if we are to fully understand genome structure, function, and evolutionary dynamics. PMID:22328714

  12. Simultaneous non-contiguous deletions using large synthetic DNA and site-specific recombinases

    PubMed Central

    Krishnakumar, Radha; Grose, Carissa; Haft, Daniel H.; Zaveri, Jayshree; Alperovich, Nina; Gibson, Daniel G.; Merryman, Chuck; Glass, John I.

    2014-01-01

    Toward achieving rapid and large scale genome modification directly in a target organism, we have developed a new genome engineering strategy that uses a combination of bioinformatics aided design, large synthetic DNA and site-specific recombinases. Using Cre recombinase we swapped a target 126-kb segment of the Escherichia coli genome with a 72-kb synthetic DNA cassette, thereby effectively eliminating over 54 kb of genomic DNA from three non-contiguous regions in a single recombination event. We observed complete replacement of the native sequence with the modified synthetic sequence through the action of the Cre recombinase and no competition from homologous recombination. Because of the versatility and high-efficiency of the Cre-lox system, this method can be used in any organism where this system is functional as well as adapted to use with other highly precise genome engineering systems. Compared to present-day iterative approaches in genome engineering, we anticipate this method will greatly speed up the creation of reduced, modularized and optimized genomes through the integration of deletion analyses data, transcriptomics, synthetic biology and site-specific recombination. PMID:24914053

  13. Screening of duplicated loci reveals hidden divergence patterns in a complex salmonid genome

    USGS Publications Warehouse

    Limborg, Morten T.; Larson, Wesley; Seeb, Lisa W.; Seeb, James E.

    2017-01-01

    A whole-genome duplication (WGD) doubles the entire genomic content of a species and is thought to have catalysed adaptive radiation in some polyploid-origin lineages. However, little is known about general consequences of a WGD because gene duplicates (i.e., paralogs) are commonly filtered in genomic studies; such filtering may remove substantial portions of the genome in data sets from polyploid-origin species. We demonstrate a new method that enables genome-wide scans for signatures of selection at both nonduplicated and duplicated loci by taking locus-specific copy number into account. We apply this method to RAD sequence data from different ecotypes of a polyploid-origin salmonid (Oncorhynchus nerka) and reveal signatures of divergent selection that would have been missed if duplicated loci were filtered. We also find conserved signatures of elevated divergence at pairs of homeologous chromosomes with residual tetrasomic inheritance, suggesting that joint evolution of some nondiverged gene duplicates may affect the adaptive potential of these genes. These findings illustrate that including duplicated loci in genomic analyses enables novel insights into the evolutionary consequences of WGDs and local segmental gene duplications.

  14. A Parthenogenesis Gene Candidate and Evidence for Segmental Allopolyploidy in Apomictic Brachiaria decumbens

    PubMed Central

    Worthington, Margaret; Heffelfinger, Christopher; Bernal, Diana; Quintero, Constanza; Zapata, Yeny Patricia; Perez, Juan Guillermo; De Vega, Jose; Miles, John; Dellaporta, Stephen; Tohme, Joe

    2016-01-01

    Apomixis, asexual reproduction through seed, enables breeders to identify and faithfully propagate superior heterozygous genotypes by seed without the disadvantages of vegetative propagation or the expense and complexity of hybrid seed production. The availability of new tools such as genotyping by sequencing and bioinformatics pipelines for species lacking reference genomes now makes the construction of dense maps possible in apomictic species, despite complications including polyploidy, multisomic inheritance, self-incompatibility, and high levels of heterozygosity. In this study, we developed saturated linkage maps for the maternal and paternal genomes of an interspecific Brachiaria ruziziensis (R. Germ. and C. M. Evrard) × B. decumbens Stapf. F1 mapping population in order to identify markers linked to apomixis. High-resolution molecular karyotyping and comparative genomics with Setaria italica (L.) P. Beauv provided conclusive evidence for segmental allopolyploidy in B. decumbens, with strong preferential pairing of homologs across the genome and multisomic segregation relatively more common in chromosome 8. The apospory-specific genomic region (ASGR) was mapped to a region of reduced recombination on B. decumbens chromosome 5. The Pennisetum squamulatum (L.) R.Br. PsASGR-BABY BOOM-like (psASGR–BBML)-specific primer pair p779/p780 was in perfect linkage with the ASGR in the F1 mapping population and diagnostic for reproductive mode in a diversity panel of known sexual and apomict Brachiaria (Trin.) Griseb. and P. maximum Jacq. germplasm accessions and cultivars. These findings indicate that ASGR–BBML gene sequences are highly conserved across the Paniceae and add further support for the postulation of the ASGR–BBML as candidate genes for the apomictic function of parthenogenesis. PMID:27206716

  15. A Parthenogenesis Gene Candidate and Evidence for Segmental Allopolyploidy in Apomictic Brachiaria decumbens.

    PubMed

    Worthington, Margaret; Heffelfinger, Christopher; Bernal, Diana; Quintero, Constanza; Zapata, Yeny Patricia; Perez, Juan Guillermo; De Vega, Jose; Miles, John; Dellaporta, Stephen; Tohme, Joe

    2016-07-01

    Apomixis, asexual reproduction through seed, enables breeders to identify and faithfully propagate superior heterozygous genotypes by seed without the disadvantages of vegetative propagation or the expense and complexity of hybrid seed production. The availability of new tools such as genotyping by sequencing and bioinformatics pipelines for species lacking reference genomes now makes the construction of dense maps possible in apomictic species, despite complications including polyploidy, multisomic inheritance, self-incompatibility, and high levels of heterozygosity. In this study, we developed saturated linkage maps for the maternal and paternal genomes of an interspecific Brachiaria ruziziensis (R. Germ. and C. M. Evrard) × B. decumbens Stapf. F1 mapping population in order to identify markers linked to apomixis. High-resolution molecular karyotyping and comparative genomics with Setaria italica (L.) P. Beauv provided conclusive evidence for segmental allopolyploidy in B. decumbens, with strong preferential pairing of homologs across the genome and multisomic segregation relatively more common in chromosome 8. The apospory-specific genomic region (ASGR) was mapped to a region of reduced recombination on B. decumbens chromosome 5. The Pennisetum squamulatum (L.) R.Br. PsASGR-BABY BOOM-like (psASGR-BBML)-specific primer pair p779/p780 was in perfect linkage with the ASGR in the F1 mapping population and diagnostic for reproductive mode in a diversity panel of known sexual and apomict Brachiaria (Trin.) Griseb. and P. maximum Jacq. germplasm accessions and cultivars. These findings indicate that ASGR-BBML gene sequences are highly conserved across the Paniceae and add further support for the postulation of the ASGR-BBML as candidate genes for the apomictic function of parthenogenesis. Copyright © 2016 by the Genetics Society of America.

  16. The Midline Protein Regulates Axon Guidance by Blocking the Reiteration of Neuroblast Rows within the Drosophila Ventral Nerve Cord

    PubMed Central

    Manavalan, Mary Ann; Gaziova, Ivana; Bhat, Krishna Moorthi

    2013-01-01

    Guiding axon growth cones towards their targets is a fundamental process that occurs in a developing nervous system. Several major signaling systems are involved in axon-guidance, and disruption of these systems causes axon-guidance defects. However, the specific role of the environment in which axons navigate in regulating axon-guidance has not been examined in detail. In Drosophila, the ventral nerve cord is divided into segments, and half-segments and the precursor neuroblasts are formed in rows and columns in individual half-segments. The row-wise expression of segment-polarity genes within the neuroectoderm provides the initial row-wise identity to neuroblasts. Here, we show that in embryos mutant for the gene midline, which encodes a T-box DNA binding protein, row-2 neuroblasts and their neuroectoderm adopt a row-5 identity. This reiteration of row-5 ultimately creates a non-permissive zone or a barrier, which prevents the extension of interneuronal longitudinal tracts along their normal anterior-posterior path. While we do not know the nature of the barrier, the axon tracts either stall when they reach this region or project across the midline or towards the periphery along this zone. Previously, we had shown that midline ensures ancestry-dependent fate specification in a neuronal lineage. These results provide the molecular basis for the axon guidance defects in midline mutants and the significance of proper specification of the environment to axon-guidance. These results also reveal the importance of segmental polarity in guiding axons from one segment to the next, and a link between establishment of broad segmental identity and axon guidance. PMID:24385932

  17. Establishment of segment polarity in the ectoderm of the leech Helobdella

    NASA Technical Reports Server (NTRS)

    Seaver, E. C.; Shankland, M.

    2001-01-01

    The segmented ectoderm and mesoderm of the leech arise via a stereotyped cell lineage from embryonic stem cells called teloblasts. Each teloblast gives rise to a column of primary blast cell daughters, and the blast cells generate descendant clones that serve as the segmental repeats of their particular teloblast lineage. We have examined the mechanism by which the leech primary blast cell clones acquire segment polarity - i.e. a fixed sequence of positional values ordered along the anteroposterior axis of the segmental repeat. In the O and P teloblast lineages, the earliest divisions of the primary blast cell segregate anterior and posterior cell fates along the anteroposterior axis. Using a laser microbeam, we ablated single cells from both o and p blast cell clones at stages when the clone was two to four cells in length. The developmental fate of the remaining cells was characterized with rhodamine-dextran lineage tracer. Twelve different progeny cells were ablated, and in every case the ablation eliminated the normal descendants of the ablated cell while having little or no detectable effect on the developmental fate of the remaining cells. This included experiments in which we specifically ablated those blast cell progeny that are known to express the engrailed gene, or their lineal precursors. These findings confirm and extend a previous study by showing that the establishment of segment polarity in the leech ectoderm is largely independent of cell interactions conveyed along the anteroposterior axis. Both intercellular signaling and engrailed expression play an important role in the segment polarity specification of the Drosophila embryo, and our findings suggest that there may be little or no conservation of this developmental mechanism between those two organisms.

  18. The Genome and Methylome of a Subsocial Small Carpenter Bee, Ceratina calcarata

    PubMed Central

    Rehan, Sandra M.; Glastad, Karl M.; Lawson, Sarah P.; Hunt, Brendan G.

    2016-01-01

    Understanding the evolution of animal societies, considered to be a major transition in evolution, is a key topic in evolutionary biology. Recently, new gateways for understanding social evolution have opened up due to advances in genomics, allowing for unprecedented opportunities in studying social behavior on a molecular level. In particular, highly eusocial insect species (caste-containing societies with nonreproductives that care for siblings) have taken center stage in studies of the molecular evolution of sociality. Despite advances in genomic studies of both solitary and eusocial insects, we still lack genomic resources for early insect societies. To study the genetic basis of social traits requires comparison of genomes from a diversity of organisms ranging from solitary to complex social forms. Here we present the genome of a subsocial bee, Ceratina calcarata. This study begins to address the types of genomic changes associated with the earliest origins of simple sociality using the small carpenter bee. Genes associated with lipid transport and DNA recombination have undergone positive selection in C. calcarata relative to other bee lineages. Furthermore, we provide the first methylome of a noneusocial bee. Ceratina calcarata contains the complete enzymatic toolkit for DNA methylation. As in the honey bee and many other holometabolous insects, DNA methylation is targeted to exons. The addition of this genome allows for new lines of research into the genetic and epigenetic precursors to complex social behaviors. PMID:27048475

  19. Novel Insights on Hantavirus Evolution: The Dichotomy in Evolutionary Pressures Acting on Different Hantavirus Segments.

    PubMed

    Sankar, Sathish; Upadhyay, Mohita; Ramamurthy, Mageshbabu; Vadivel, Kumaran; Sagadevan, Kalaiselvan; Nandagopal, Balaji; Vivekanandan, Perumal; Sridharan, Gopalan

    2015-01-01

    Hantaviruses are important emerging zoonotic pathogens. The current understanding of hantavirus evolution is complicated by the lack of consensus on co-divergence of hantaviruses with their animal hosts. In addition, hantaviruses have long-term associations with their reservoir hosts. Analyzing the relative abundance of dinucleotides may shed new light on hantavirus evolution. We studied the relative abundance of dinucleotides and the evolutionary pressures shaping different hantavirus segments. A total of 118 sequences were analyzed; this includes 51 sequences of the S segment, 43 sequences of the M segment and 23 sequences of the L segment. The relative abundance of dinucleotides, effective codon number (ENC), codon usage biases were analyzed. Standard methods were used to investigate the relative roles of mutational pressure and translational selection on the three hantavirus segments. All three segments of hantaviruses are CpG depleted. Mutational pressure is the predominant evolutionary force leading to CpG depletion among hantaviruses. Interestingly, the S segment of hantaviruses is GpU depleted and in contrast to CpG depletion, the depletion of GpU dinucleotides from the S segment is driven by translational selection. Our findings also suggest that mutational pressure is the primary evolutionary pressure acting on the S and the M segments of hantaviruses. While translational selection plays a key role in shaping the evolution of the L segment. Our findings highlight how different evolutionary pressures may contribute disproportionally to the evolution of the three hantavirus segments. These findings provide new insights on the current understanding of hantavirus evolution. There is a dichotomy among evolutionary pressures shaping a) the relative abundance of different dinucleotides in hantavirus genomes b) the evolution of the three hantavirus segments.

  20. Nucleation Process of a Fibril Precursor in the C-Terminal Segment of Amyloid-β

    NASA Astrophysics Data System (ADS)

    Baftizadeh, Fahimeh; Pietrucci, Fabio; Biarnés, Xevi; Laio, Alessandro

    2013-04-01

    By extended atomistic simulations in explicit solvent and bias-exchange metadynamics, we study the aggregation process of 18 chains of the C-terminal segment of amyloid-β, an intrinsically disordered protein involved in Alzheimer’s disease and prone to form fibrils. Starting from a disordered aggregate, we are able to observe the formation of an ordered nucleus rich in beta sheets. The rate limiting step in the nucleation pathway involves crossing a barrier of approximately 40kcal/mol and is associated with the formation of a very specific interdigitation of the side chains belonging to different sheets. This structural pattern is different from the one observed experimentally in a microcrystal of the same system, indicating that the structure of a “nascent” fibril may differ from the one of an “extended” fibril.

  1. High resolution melting analysis: rapid and precise characterisation of recombinant influenza A genomes

    PubMed Central

    2013-01-01

    Background High resolution melting analysis (HRM) is a rapid and cost-effective technique for the characterisation of PCR amplicons. Because the reverse genetics of segmented influenza A viruses allows the generation of numerous influenza A virus reassortants within a short time, methods for the rapid selection of the correct recombinants are very useful. Methods PCR primer pairs covering the single nucleotide polymorphism (SNP) positions of two different influenza A H5N1 strains were designed. Reassortants of the two different H5N1 isolates were used as a model to prove the suitability of HRM for the selection of the correct recombinants. Furthermore, two different cycler instruments were compared. Results Both cycler instruments generated comparable average melting peaks, which allowed the easy identification and selection of the correct cloned segments or reassorted viruses. Conclusions HRM is a highly suitable method for the rapid and precise characterisation of cloned influenza A genomes. PMID:24028349

  2. Modified screening and ranking algorithm for copy number variation detection.

    PubMed

    Xiao, Feifei; Min, Xiaoyi; Zhang, Heping

    2015-05-01

    Copy number variation (CNV) is a type of structural variation, usually defined as genomic segments that are 1 kb or larger, which present variable copy numbers when compared with a reference genome. The screening and ranking algorithm (SaRa) was recently proposed as an efficient approach for multiple change-points detection, which can be applied to CNV detection. However, some practical issues arise from application of SaRa to single nucleotide polymorphism data. In this study, we propose a modified SaRa on CNV detection to address these issues. First, we use the quantile normalization on the original intensities to guarantee that the normal mean model-based SaRa is a robust method. Second, a novel normal mixture model coupled with a modified Bayesian information criterion is proposed for candidate change-point selection and further clustering the potential CNV segments to copy number states. Simulations revealed that the modified SaRa became a robust method for identifying change-points and achieved better performance than the circular binary segmentation (CBS) method. By applying the modified SaRa to real data from the HapMap project, we illustrated its performance on detecting CNV segments. In conclusion, our modified SaRa method improves SaRa theoretically and numerically, for identifying CNVs with high-throughput genotyping data. The modSaRa package is implemented in R program and freely available at http://c2s2.yale.edu/software/modSaRa. Supplementary data are available at Bioinformatics online. © The Author 2014. Published by Oxford University Press. All rights reserved. For Permissions, please e-mail: journals.permissions@oup.com.

  3. Deciphering the fine-structure of tribal admixture in the Bedouin population using genomic data

    PubMed Central

    Markus, B; Alshafee, I; Birk, O S

    2014-01-01

    The Bedouin Israeli population is highly inbred and structured with a very high prevalence of recessive diseases. Many studies in the past two decades focused on linkage analysis in large, multiple consanguineous pedigrees of this population. The advent of high-throughput technologies motivated researchers to search for rare variants shared between smaller pedigrees, integrating data from clinically similar yet seemingly non-related sporadic cases. However, such analyses are challenging because, without pedigree data, there is no prior knowledge regarding possible relatedness between the sporadic cases. Here, we describe models and techniques for the study of relationships between pedigrees and use them for the inference of tribal co-ancestry, delineating the complex social interactions between different tribes in the Negev Bedouins of southern Israel. Through our analysis, we differentiate between tribes that share many yet small genomic segments because of co-ancestry versus tribes that share larger segments because of recent admixture. The emergent pattern is well correlated with the prevalence of rare mutations in the different tribes. Tribes that do not intermarry, mostly because of social restrictions, hold private mutations, whereas tribes that do intermarry demonstrate a genetic flow of mutations between them. Thus, social structure within an inbred community can be delineated through genomic data, with implications to genetic counseling and genetic mapping. PMID:24084643

  4. Deciphering the fine-structure of tribal admixture in the Bedouin population using genomic data.

    PubMed

    Markus, B; Alshafee, I; Birk, O S

    2014-02-01

    The Bedouin Israeli population is highly inbred and structured with a very high prevalence of recessive diseases. Many studies in the past two decades focused on linkage analysis in large, multiple consanguineous pedigrees of this population. The advent of high-throughput technologies motivated researchers to search for rare variants shared between smaller pedigrees, integrating data from clinically similar yet seemingly non-related sporadic cases. However, such analyses are challenging because, without pedigree data, there is no prior knowledge regarding possible relatedness between the sporadic cases. Here, we describe models and techniques for the study of relationships between pedigrees and use them for the inference of tribal co-ancestry, delineating the complex social interactions between different tribes in the Negev Bedouins of southern Israel. Through our analysis, we differentiate between tribes that share many yet small genomic segments because of co-ancestry versus tribes that share larger segments because of recent admixture. The emergent pattern is well correlated with the prevalence of rare mutations in the different tribes. Tribes that do not intermarry, mostly because of social restrictions, hold private mutations, whereas tribes that do intermarry demonstrate a genetic flow of mutations between them. Thus, social structure within an inbred community can be delineated through genomic data, with implications to genetic counseling and genetic mapping.

  5. [Microbial community structure in bio-ceramics and biological activated carbon analyzed by PCR-SSCP technique].

    PubMed

    Liu, Xiao-Lin; Liu, Wen-Jun

    2007-04-01

    Analyses of microbial community structure in bio-ceramics (BC) and biological activated carbon (BAC), which widely used in drinking water treatment were performed by polymerase-chain-reaction-single-strand-conformation-polymorphism (PCR-SSCP) targeted eubacterial 16S ribosomal RNA gene. Microorganisms on bio-ceramics and biological activated carbon were detached by ultrasonic, culturing on R2A and LB agar, respectively, followed by genome DNA extracting. Results show that larger than 10 kb genome DNA could be extracted from all the samples except the BAC samples processed by ultrasonic. However, quantities of the extracted DNA were different. 408 bp gene fragments were observed after PCR using the extracted genome DNA as templates. These gene fragments were digested with lambda exonuclease followed by SSCP electrophoresis. Same SSCP profiles were observed between ultrasonic eluting, R2A and LB agar culturing. The identity of the segment from bio-ceramics with uncultured Pseudomonas sp. Clone FTL201 16S rDNA (GenBank, AF509293.1) fragment was 92%, and identities of the two segments from BAC with Bacillus sp. JH19 16S rDNA (GenBank , DQ232748.1) fragment and Bacterium VA-S-11 16S rDNA (GenBank, AY395279.1) fragment were 100% and 99%, respectively.

  6. The retina visual cycle is driven by cis retinol oxidation in the outer segments of cones

    PubMed Central

    Sato, Shinya; Frederiksen, Rikard; Cornwall, M. Carter; Kefalov, Vladimir J.

    2017-01-01

    Vertebrate rod and cone photoreceptors require continuous supply of chromophore for regenerating their visual pigments after photoactivation. Cones, which mediate our daytime vision, demand a particularly rapid supply of 11-cis retinal chromophore in order to maintain their function in bright light. An important contribution to this process is thought to be the chromophore precursor 11-cis retinol, which is supplied to cones from Müller cells in the retina and subsequently oxidized to 11-cis retinal as part of the retina visual cycle. However, the molecular identity of the cis retinol oxidase in cones remains unclear. Here, as a first step in characterizing this enzymatic reaction, we sought to determine the subcellular localization of this activity in salamander red cones. We found that the onset of dark adaptation of isolated salamander red cones was substantially faster when exposing directly their outer vs. their inner segment to 9-cis retinol, an analogue of 11-cis retinol. In contrast, this difference was not observed when treating the outer vs. inner segment with 9-cis retinal, a chromophore analogue which can directly support pigment regeneration. These results suggest, surprisingly, that the cis-retinol oxidation occurs in the outer segments of cone photoreceptors. Confirming this notion, pigment regeneration with exogenously added 9-cis retinol was directly observed in the truncated outer segments of cones, but not in rods. We conclude that the enzymatic machinery required for the oxidation of recycled cis retinol as part of the retina visual cycle is present in the outer segments of cones. PMID:28359344

  7. A sequence-based survey of the complex structural organization of tumor genomes

    DOE Office of Scientific and Technical Information (OSTI.GOV)

    Collins, Colin; Raphael, Benjamin J.; Volik, Stanislav

    2008-04-03

    The genomes of many epithelial tumors exhibit extensive chromosomal rearrangements. All classes of genome rearrangements can be identified using End Sequencing Profiling (ESP), which relies on paired-end sequencing of cloned tumor genomes. In this study, brain, breast, ovary and prostate tumors along with three breast cancer cell lines were surveyed with ESP yielding the largest available collection of sequence-ready tumor genome breakpoints and providing evidence that some rearrangements may be recurrent. Sequencing and fluorescence in situ hybridization (FISH) confirmed translocations and complex tumor genome structures that include coamplification and packaging of disparate genomic loci with associated molecular heterogeneity. Comparison ofmore » the tumor genomes suggests recurrent rearrangements. Some are likely to be novel structural polymorphisms, whereas others may be bona fide somatic rearrangements. A recurrent fusion transcript in breast tumors and a constitutional fusion transcript resulting from a segmental duplication were identified. Analysis of end sequences for single nucleotide polymorphisms (SNPs) revealed candidate somatic mutations and an elevated rate of novel SNPs in an ovarian tumor. These results suggest that the genomes of many epithelial tumors may be far more dynamic and complex than previously appreciated and that genomic fusions including fusion transcripts and proteins may be common, possibly yielding tumor-specific biomarkers and therapeutic targets.« less

  8. Managing Risk on a Technology Development Project/Advanced Mirror System Demonstrator

    NASA Technical Reports Server (NTRS)

    Byberg, Alicia; Russell, J. Kevin; Stahl, Phil (Technical Monitor)

    2002-01-01

    The risk management study applied to the Advanced Mirror System Demonstrator (AMSD), a precursor mirror technology development for the Next Generation Space Telescope (NGST) is documented. The AMSD will be developed as a segment of a lightweight primary mirror system that can be produced at a low cost and with a short manufacturing schedule. The technology gained from the program will support the risk mitigation strategy for the NGST, as well as other government agency space mirror programs.

  9. Field Testing and Performance Evaluation of the Long-Range Acoustic Real-Time Sensor for Polar Areas (LARA)

    DTIC Science & Technology

    2015-09-30

    Valley Ridge segment in the northeast Pacific Ocean. Both areas have seafloor volcanic eruptions forecast for the near future, and the LARA moorings...useful for real-time monitoring of deep-ocean seismic and volcanic activity (e.g., Dziak et al., 2012) - especially in areas where SOSUS coverage no...2012): Seismic precursors and magma ascent before the April 2011 eruption at Axial Seamount. Nature Geoscience, 5, pp. 478-482. Klinck, H., and

  10. MRIVIEW: An interactive computational tool for investigation of brain structure and function

    DOE Office of Scientific and Technical Information (OSTI.GOV)

    Ranken, D.; George, J.

    MRIVIEW is a software system which uses image processing and visualization to provide neuroscience researchers with an integrated environment for combining functional and anatomical information. Key features of the software include semi-automated segmentation of volumetric head data and an interactive coordinate reconciliation method which utilizes surface visualization. The current system is a precursor to a computational brain atlas. We describe features this atlas will incorporate, including methods under development for visualizing brain functional data obtained from several different research modalities.

  11. Genetic makeup of amantadine-resistant and oseltamivir-resistant human influenza A/H1N1 viruses.

    PubMed

    Zaraket, Hassan; Saito, Reiko; Suzuki, Yasushi; Baranovich, Tatiana; Dapat, Clyde; Caperig-Dapat, Isolde; Suzuki, Hiroshi

    2010-04-01

    The emergence and widespread occurrence of antiviral drug-resistant seasonal human influenza A viruses, especially oseltamivir-resistant A/H1N1 virus, are major concerns. To understand the genetic background of antiviral drug-resistant A/H1N1 viruses, we performed full genome sequencing of prepandemic A/H1N1 strains. Seasonal influenza A/H1N1 viruses, including antiviral-susceptible viruses, amantadine-resistant viruses, and oseltamivir-resistant viruses, obtained from several areas in Japan during the 2007-2008 and 2008-2009 influenza seasons were analyzed. Sequencing of the full genomes of these viruses was performed, and the phylogenetic relationships among the sequences of each individual genome segment were inferred. Reference genome sequences from the Influenza Virus Resource database were included to determine the closest ancestor for each segment. Phylogenetic analysis revealed that the oseltamivir-resistant strain evolved from a reassortant oseltamivir-susceptible strain (clade 2B) which circulated in the 2007-2008 season by acquiring the H275Y resistance-conferring mutation in the NA gene. The oseltamivir-resistant lineage (corresponding to the Northern European resistant lineage) represented 100% of the H1N1 isolates from the 2008-2009 season and further acquired at least one mutation in each of the polymerase basic protein 2 (PB2), polymerase basic protein 1 (PB1), hemagglutinin (HA), and neuraminidase (NA) genes. Therefore, a reassortment event involving two distinct oseltamivir-susceptible lineages, followed by the H275Y substitution in the NA gene and other mutations elsewhere in the genome, contributed to the emergence of the oseltamivir-resistant lineage. In contrast, amantadine-resistant viruses from the 2007-2008 season distinctly clustered in clade 2C and were characterized by extensive amino acid substitutions across their genomes, suggesting that a fitness gap among its genetic components might have driven these mutations to maintain it in the population.

  12. Plasmid Characterization and Chromosome Analysis of Two netF+ Clostridium perfringens Isolates Associated with Foal and Canine Necrotizing Enteritis.

    PubMed

    Mehdizadeh Gohari, Iman; Kropinski, Andrew M; Weese, Scott J; Parreira, Valeria R; Whitehead, Ashley E; Boerlin, Patrick; Prescott, John F

    2016-01-01

    The recent discovery of a novel beta-pore-forming toxin, NetF, which is strongly associated with canine and foal necrotizing enteritis should improve our understanding of the role of type A Clostridium perfringens associated disease in these animals. The current study presents the complete genome sequence of two netF-positive strains, JFP55 and JFP838, which were recovered from cases of foal necrotizing enteritis and canine hemorrhagic gastroenteritis, respectively. Genome sequencing was done using Single Molecule, Real-Time (SMRT) technology-PacBio and Illumina Hiseq2000. The JFP55 and JFP838 genomes include a single 3.34 Mb and 3.53 Mb chromosome, respectively, and both genomes include five circular plasmids. Plasmid annotation revealed that three plasmids were shared by the two newly sequenced genomes, including a NetF/NetE toxins-encoding tcp-conjugative plasmid, a CPE/CPB2 toxins-encoding tcp-conjugative plasmid and a putative bacteriocin-encoding plasmid. The putative beta-pore-forming toxin genes, netF, netE and netG, were located in unique pathogenicity loci on tcp-conjugative plasmids. The C. perfringens JFP55 chromosome carries 2,825 protein-coding genes whereas the chromosome of JFP838 contains 3,014 protein-encoding genes. Comparison of these two chromosomes with three available reference C. perfringens chromosome sequences identified 48 (~247 kb) and 81 (~430 kb) regions unique to JFP55 and JFP838, respectively. Some of these divergent genomic regions in both chromosomes are phage- and plasmid-related segments. Sixteen of these unique chromosomal regions (~69 kb) were shared between the two isolates. Five of these shared regions formed a mosaic of plasmid-integrated segments, suggesting that these elements were acquired early in a clonal lineage of netF-positive C. perfringens strains. These results provide significant insight into the basis of canine and foal necrotizing enteritis and are the first to demonstrate that netF resides on a large and unique plasmid-encoded locus.

  13. Family genome browser: visualizing genomes with pedigree information.

    PubMed

    Juan, Liran; Liu, Yongzhuang; Wang, Yongtian; Teng, Mingxiang; Zang, Tianyi; Wang, Yadong

    2015-07-15

    Families with inherited diseases are widely used in Mendelian/complex disease studies. Owing to the advances in high-throughput sequencing technologies, family genome sequencing becomes more and more prevalent. Visualizing family genomes can greatly facilitate human genetics studies and personalized medicine. However, due to the complex genetic relationships and high similarities among genomes of consanguineous family members, family genomes are difficult to be visualized in traditional genome visualization framework. How to visualize the family genome variants and their functions with integrated pedigree information remains a critical challenge. We developed the Family Genome Browser (FGB) to provide comprehensive analysis and visualization for family genomes. The FGB can visualize family genomes in both individual level and variant level effectively, through integrating genome data with pedigree information. Family genome analysis, including determination of parental origin of the variants, detection of de novo mutations, identification of potential recombination events and identical-by-decent segments, etc., can be performed flexibly. Diverse annotations for the family genome variants, such as dbSNP memberships, linkage disequilibriums, genes, variant effects, potential phenotypes, etc., are illustrated as well. Moreover, the FGB can automatically search de novo mutations and compound heterozygous variants for a selected individual, and guide investigators to find high-risk genes with flexible navigation options. These features enable users to investigate and understand family genomes intuitively and systematically. The FGB is available at http://mlg.hit.edu.cn/FGB/. © The Author 2015. Published by Oxford University Press. All rights reserved. For Permissions, please e-mail: journals.permissions@oup.com.

  14. Defining functional DNA elements in the human genome

    PubMed Central

    Kellis, Manolis; Wold, Barbara; Snyder, Michael P.; Bernstein, Bradley E.; Kundaje, Anshul; Marinov, Georgi K.; Ward, Lucas D.; Birney, Ewan; Crawford, Gregory E.; Dekker, Job; Dunham, Ian; Elnitski, Laura L.; Farnham, Peggy J.; Feingold, Elise A.; Gerstein, Mark; Giddings, Morgan C.; Gilbert, David M.; Gingeras, Thomas R.; Green, Eric D.; Guigo, Roderic; Hubbard, Tim; Kent, Jim; Lieb, Jason D.; Myers, Richard M.; Pazin, Michael J.; Ren, Bing; Stamatoyannopoulos, John A.; Weng, Zhiping; White, Kevin P.; Hardison, Ross C.

    2014-01-01

    With the completion of the human genome sequence, attention turned to identifying and annotating its functional DNA elements. As a complement to genetic and comparative genomics approaches, the Encyclopedia of DNA Elements Project was launched to contribute maps of RNA transcripts, transcriptional regulator binding sites, and chromatin states in many cell types. The resulting genome-wide data reveal sites of biochemical activity with high positional resolution and cell type specificity that facilitate studies of gene regulation and interpretation of noncoding variants associated with human disease. However, the biochemically active regions cover a much larger fraction of the genome than do evolutionarily conserved regions, raising the question of whether nonconserved but biochemically active regions are truly functional. Here, we review the strengths and limitations of biochemical, evolutionary, and genetic approaches for defining functional DNA segments, potential sources for the observed differences in estimated genomic coverage, and the biological implications of these discrepancies. We also analyze the relationship between signal intensity, genomic coverage, and evolutionary conservation. Our results reinforce the principle that each approach provides complementary information and that we need to use combinations of all three to elucidate genome function in human biology and disease. PMID:24753594

  15. Spatiotemporal dynamics of HSV genome nuclear entry and compaction state transitions using bioorthogonal chemistry and super-resolution microscopy

    PubMed Central

    2017-01-01

    We investigated the spatiotemporal dynamics of HSV genome transport during the initiation of infection using viruses containing bioorthogonal traceable precursors incorporated into their genomes (HSVEdC). In vitro assays revealed a structural alteration in the capsid induced upon HSVEdC binding to solid supports that allowed coupling to external capture agents and demonstrated that the vast majority of individual virions contained bioorthogonally-tagged genomes. Using HSVEdC in vivo we reveal novel aspects of the kinetics, localisation, mechanistic entry requirements and morphological transitions of infecting genomes. Uncoating and nuclear import was observed within 30 min, with genomes in a defined compaction state (ca. 3-fold volume increase from capsids). Free cytosolic uncoated genomes were infrequent (7–10% of the total uncoated genomes), likely a consequence of subpopulations of cells receiving high particle numbers. Uncoated nuclear genomes underwent temporal transitions in condensation state and while ICP4 efficiently associated with condensed foci of initial infecting genomes, this relationship switched away from residual longer lived condensed foci to increasingly decondensed genomes as infection progressed. Inhibition of transcription had no effect on nuclear entry but in the absence of transcription, genomes persisted as tightly condensed foci. Ongoing transcription, in the absence of protein synthesis, revealed a distinct spatial clustering of genomes, which we have termed genome congregation, not seen with non-transcribing genomes. Genomes expanded to more decondensed forms in the absence of DNA replication indicating additional transitional steps. During full progression of infection, genomes decondensed further, with a diffuse low intensity signal dissipated within replication compartments, but frequently with tight foci remaining peripherally, representing unreplicated genomes or condensed parental strands of replicated DNA. Uncoating and nuclear entry was independent of proteasome function and resistant to inhibitors of nuclear export. Together with additional data our results reveal new insight into the spatiotemporal dynamics of HSV genome uncoating, transport and organisation. PMID:29121649

  16. Evolutionary conservation of the presumptive neural plate markers AmphiSox1/2/3 and AmphiNeurogenin in the invertebrate chordate amphioxus

    NASA Technical Reports Server (NTRS)

    Holland, L. Z.; Schubert, M.; Holland, N. D.; Neuman, T.

    2000-01-01

    Amphioxus, as the closest living invertebrate relative of the vertebrates, can give insights into the evolutionary origin of the vertebrate body plan. Therefore, to investigate the evolution of genetic mechanisms for establishing and patterning the neuroectoderm, we cloned and determined the embryonic expression of two amphioxus transcription factors, AmphiSox1/2/3 and AmphiNeurogenin. These genes are the earliest known markers for presumptive neuroectoderm in amphioxus. By the early neurula stage, AmphiNeurogenin expression becomes restricted to two bilateral columns of segmentally arranged neural plate cells, which probably include precursors of motor neurons. This is the earliest indication of segmentation in the amphioxus nerve cord. Later, expression extends to dorsal cells in the nerve cord, which may include precursors of sensory neurons. By the midneurula, AmphiSox1/2/3 expression becomes limited to the dorsal part of the forming neural tube. These patterns resemble those of their vertebrate and Drosophila homologs. Taken together with the evolutionarily conserved expression of the dorsoventral patterning genes, BMP2/4 and chordin, in nonneural and neural ectoderm, respectively, of chordates and Drosophila, our results are consistent with the evolution of the chordate dorsal nerve cord and the insect ventral nerve cord from a longitudinal nerve cord in a common bilaterian ancestor. However, AmphiSox1/2/3 differs from its vertebrate homologs in not being expressed outside the CNS, suggesting that additional roles for this gene have evolved in connection with gene duplication in the vertebrate lineage. In contrast, expression in the midgut of AmphiNeurogenin together with the gene encoding the insulin-like peptide suggests that amphioxus may have homologs of vertebrate pancreatic islet cells, which express neurogenin3. In addition, AmphiNeurogenin, like its vertebrate and Drosophila homologs, is expressed in apparent precursors of epidermal chemosensory and possibly mechanosensory cells, suggesting a common origin for protostome and deuterostome epidermal sensory cells in the ancestral bilaterian. Copyright 2000 Academic Press.

  17. Recovery of Recombinant Crimean Congo Hemorrhagic Fever Virus Reveals a Function for Non-structural Glycoproteins Cleavage by Furin.

    PubMed

    Bergeron, Éric; Zivcec, Marko; Chakrabarti, Ayan K; Nichol, Stuart T; Albariño, César G; Spiropoulou, Christina F

    2015-05-01

    Crimean Congo hemorrhagic fever virus (CCHFV) is a negative-strand RNA virus of the family Bunyaviridae (genus: Nairovirus). In humans, CCHFV causes fever, hemorrhage, severe thrombocytopenia, and high fatality. A major impediment in precisely determining the basis of CCHFV's high pathogenicity has been the lack of methodology to produce recombinant CCHFV. We developed a reverse genetics system based on transfecting plasmids into BSR-T7/5 and Huh7 cells. In our system, bacteriophage T7 RNA polymerase produced complementary RNA copies of the viral S, M, and L segments that were encapsidated with the support, in trans, of CCHFV nucleoprotein and L polymerase. The system was optimized to systematically recover high yields of infectious CCHFV. Additionally, we tested the ability of the system to produce specifically designed CCHFV mutants. The M segment encodes a polyprotein that is processed by host proprotein convertases (PCs), including the site-1 protease (S1P) and furin-like PCs. S1P and furin cleavages are necessary for producing the non-structural glycoprotein GP38, while S1P cleavage yields structural Gn. We studied the role of furin cleavage by rescuing a recombinant CCHFV encoding a virus glycoprotein precursor lacking a functional furin cleavage motif (RSKR mutated to ASKA). The ASKA mutation blocked glycoprotein precursor's maturation to GP38, and Gn precursor's maturation to Gn was slightly diminished. Furin cleavage was not essential for replication, as blocking furin cleavage resulted only in transient reduction of CCHFV titers, suggesting that either GP38 and/or decreased Gn maturation accounted for the reduced virion production. Our data demonstrate that nairoviruses can be produced by reverse genetics, and the utility of our system uncovered a function for furin cleavage. This viral rescue system could be further used to study the CCHFV replication cycle and facilitate the development of efficacious vaccines to counter this biological and public health threat.

  18. Reenacting the birth of an intron

    DOE Office of Scientific and Technical Information (OSTI.GOV)

    Hellsten, Uffe; Aspden, Julie L.; Rio, Donald C.

    2011-07-01

    An intron is an extended genomic feature whose function requires multiple constrained positions - donor and acceptor splice sites, a branch point, a polypyrimidine tract and suitable splicing enhancers - that may be distributed over hundreds or thousands of nucleotides. New introns are therefore unlikely to emerge by incremental accumulation of functional sub-elements. Here we demonstrate that a functional intron can be created de novo in a single step by a segmental genomic duplication. This experiment recapitulates in vivo the birth of an intron that arose in the ancestral jawed vertebrate lineage nearly half a billion years ago.

  19. The Florida manatee (Trichechus manatus latirostris) immunoglobulin heavy chain suggests the importance of clan III variable segments in repertoire diversity

    USGS Publications Warehouse

    Breaux, Breanna; Deiss, Thaddeus C.; Chen, Patricia L.; Cruz-Schneider, Maria Paula; Sena, Leonardo; Hunter, Margaret E.; Bonde, Robert K.; Criscitiello, Michael F.

    2017-01-01

    Manatees are a vulnerable, charismatic sentinel species from the evolutionarily divergent Afrotheria. Manatee health and resistance to infectious disease is of great concern to conservation groups, but little is known about their immune system. To develop manatee-specific tools for monitoring health, we first must have a general knowledge of how the immunoglobulin heavy (IgH) chain locus is organized and transcriptionally expressed. Using the genomic scaffolds of the Florida manatee (Trichechus manatus latirostris), we characterized the potential IgH segmental diversity and constant region isotypic diversity and performed the first Afrotherian repertoire analysis. The Florida manatee has low V(D)J combinatorial diversity (3744 potential combinations) and few constant region isotypes. They also lack clan III V segments, which may have caused reduced VH segment numbers. However, we found productive somatic hypermutation concentrated in the complementarity determining regions. In conclusion, manatees have limited IGHV clan and combinatorial diversity. This suggests that clan III V segments are essential for maintaining IgH locus diversity.

  20. Genetic recombination is associated with intrinsic disorder in plant proteomes.

    PubMed

    Yruela, Inmaculada; Contreras-Moreira, Bruno

    2013-11-09

    Intrinsically disordered proteins, found in all living organisms, are essential for basic cellular functions and complement the function of ordered proteins. It has been shown that protein disorder is linked to the G + C content of the genome. Furthermore, recent investigations have suggested that the evolutionary dynamics of the plant nucleus adds disordered segments to open reading frames alike, and these segments are not necessarily conserved among orthologous genes. In the present work the distribution of intrinsically disordered proteins along the chromosomes of several representative plants was analyzed. The reported results support a non-random distribution of disordered proteins along the chromosomes of Arabidopsis thaliana and Oryza sativa, two model eudicot and monocot plant species, respectively. In fact, for most chromosomes positive correlations between the frequency of disordered segments of 30+ amino acids and both recombination rates and G + C content were observed. These analyses demonstrate that the presence of disordered segments among plant proteins is associated with the rates of genetic recombination of their encoding genes. Altogether, these findings suggest that high recombination rates, as well as chromosomal rearrangements, could induce disordered segments in proteins during evolution.

  1. The Florida manatee (Trichechus manatus latirostris) immunoglobulin heavy chain suggests the importance of clan III variable segments in repertoire diversity.

    PubMed

    Breaux, Breanna; Deiss, Thaddeus C; Chen, Patricia L; Cruz-Schneider, Maria Paula; Sena, Leonardo; Hunter, Margaret E; Bonde, Robert K; Criscitiello, Michael F

    2017-07-01

    Manatees are a vulnerable, charismatic sentinel species from the evolutionarily divergent Afrotheria. Manatee health and resistance to infectious disease is of great concern to conservation groups, but little is known about their immune system. To develop manatee-specific tools for monitoring health, we first must have a general knowledge of how the immunoglobulin heavy (IgH) chain locus is organized and transcriptionally expressed. Using the genomic scaffolds of the Florida manatee (Trichechus manatus latirostris), we characterized the potential IgH segmental diversity and constant region isotypic diversity and performed the first Afrotherian repertoire analysis. The Florida manatee has low V(D)J combinatorial diversity (3744 potential combinations) and few constant region isotypes. They also lack clan III V segments, which may have caused reduced VH segment numbers. However, we found productive somatic hypermutation concentrated in the complementarity determining regions. In conclusion, manatees have limited IGHV clan and combinatorial diversity. This suggests that clan III V segments are essential for maintaining IgH locus diversity. Copyright © 2017 Elsevier Ltd. All rights reserved.

  2. Genome Mining for Ribosomally Synthesized Natural Products

    PubMed Central

    Velásquez, Juan E.; van der Donk, Wilfred

    2011-01-01

    In recent years, the number of known peptide natural products that are synthesized via the ribosomal pathway has rapidly grown. Taking advantage of sequence homology among genes encoding precursor peptides or biosynthetic proteins, in silico mining of genomes combined with molecular biology approaches has guided the discovery of a large number of new ribosomal natural products, including lantipeptides, cyanobactins, linear thiazole/oxazole-containing peptides, microviridins, lasso peptides, amatoxins, cyclotides, and conopeptides. In this review, we describe the strategies used for the identification of these ribosomally-synthesized and posttranslationally modified peptides (RiPPs) and the structures of newly identified compounds. The increasing number of chemical entities and their remarkable structural and functional diversity may lead to novel pharmaceutical applications. PMID:21095156

  3. Evolution of New miRNAs and Cerebro-Cortical Development.

    PubMed

    Kosik, Kenneth S; Nowakowski, Tomasz

    2018-04-04

    The noncoding portion of the genome, including microRNAs, has been fertile evolutionary soil for cortical development in primates. A major contribution to cortical expansion in primates is the generation of novel precursor cell populations. Because miRNA expression profiles track closely with cell identity, it is likely that numerous novel microRNAs have contributed to cellular diversity in the brain. The tools to determine the genomic context within which novel microRNAs emerge and how they become integrated into molecular circuitry are now in hand. Expected final online publication date for the Annual Review of Neuroscience Volume 41 is July 8, 2018. Please see http://www.annualreviews.org/page/journal/pubdates for revised estimates.

  4. Characterization of a novel orthoreovirus isolated from fruit bat, China.

    PubMed

    Hu, Tingsong; Qiu, Wei; He, Biao; Zhang, Yan; Yu, Jing; Liang, Xiu; Zhang, Wendong; Chen, Gang; Zhang, Yingguo; Wang, Yiyin; Zheng, Ying; Feng, Ziliang; Hu, Yonghe; Zhou, Weiguo; Tu, Changchun; Fan, Quanshui; Zhang, Fuqiang

    2014-11-30

    In recent years novel human respiratory disease agents have been described for Southeast Asia and Australia. The causative pathogens were classified as pteropine orthoreoviruses with a strong phylogenetic relationship to orthoreoviruses of bat origin. In this report, we isolated a novel Melaka-like reovirus (named "Cangyuan virus") from intestinal content samples of one fruit bat residing in China's Yunnan province. Phylogenetic analysis of the whole Cangyuan virus genome sequences of segments L, M and S demonstrated the genetic diversity of the Cangyuan virus. In contrast to the L and M segments, the phylogenetic trees for the S segments of Cangyuan virus demonstrated a greater degree of heterogeneity. Phylogenetic analysis indicated that the Cangyuan virus was a novel orthoreovirus and substantially different from currently known members of Pteropine orthoreovirus (PRV) species group.

  5. Comparative analysis of the complete genome of KPC-2-producing Klebsiella pneumoniae Kp13 reveals remarkable genome plasticity and a wide repertoire of virulence and resistance mechanisms

    PubMed Central

    2014-01-01

    Background Klebsiella pneumoniae is an important opportunistic pathogen associated with nosocomial and community-acquired infections. A wide repertoire of virulence and antimicrobial resistance genes is present in K. pneumoniae genomes, which can constitute extra challenges in the treatment of infections caused by some strains. K. pneumoniae Kp13 is a multidrug-resistant strain responsible for causing a large nosocomial outbreak in a teaching hospital located in Southern Brazil. Kp13 produces K. pneumoniae carbapenemase (KPC-2) but is unrelated to isolates belonging to ST 258 and ST 11, the main clusters associated with the worldwide dissemination of KPC-producing K. pneumoniae. In this report, we perform a genomic comparison between Kp13 and each of the following three K. pneumoniae genomes: MGH 78578, NTUH-K2044 and 342. Results We have completely determined the genome of K. pneumoniae Kp13, which comprises one chromosome (5.3 Mbp) and six plasmids (0.43 Mbp). Several virulence and resistance determinants were identified in strain Kp13. Specifically, we detected genes coding for six beta-lactamases (SHV-12, OXA-9, TEM-1, CTX-M-2, SHV-110 and KPC-2), eight adhesin-related gene clusters, including regions coding for types 1 (fim) and 3 (mrk) fimbrial adhesins. The rmtG plasmidial 16S rRNA methyltransferase gene was also detected, as well as efflux pumps belonging to five different families. Mutations upstream the OmpK35 porin-encoding gene were evidenced, possibly affecting its expression. SNPs analysis relative to the compared strains revealed 141 mutations falling within CDSs related to drug resistance which could also influence the Kp13 lifestyle. Finally, the genetic apparatus for synthesis of the yersiniabactin siderophore was identified within a plasticity region. Chromosomal architectural analysis allowed for the detection of 13 regions of difference in Kp13 relative to the compared strains. Conclusions Our results indicate that the plasticity occurring at many hierarchical levels (from whole genomic segments to individual nucleotide bases) may play a role on the lifestyle of K. pneumoniae Kp13 and underlie the importance of whole-genome sequencing to study bacterial pathogens. The general chromosomal structure was somewhat conserved among the compared bacteria, and recombination events with consequent gain/loss of genomic segments appears to be driving the evolution of these strains. PMID:24450656

  6. Rift Valley Fever Virus MP-12 Vaccine Is Fully Attenuated by a Combination of Partial Attenuations in the S, M, and L Segments

    PubMed Central

    Hill, Terence E.; Smith, Jennifer K.; Zhang, Lihong; Juelich, Terry L.; Gong, Bin; Slack, Olga A. L.; Ly, Hoai J.; Lokugamage, Nandadeva; Freiberg, Alexander N.

    2015-01-01

    ABSTRACT Rift Valley fever (RVF) is a mosquito-borne zoonotic disease endemic to Africa and characterized by a high rate of abortion in ruminants and hemorrhagic fever, encephalitis, or blindness in humans. RVF is caused by Rift Valley fever virus (RVFV; family Bunyaviridae, genus Phlebovirus), which has a tripartite negative-stranded RNA genome (consisting of the S, M, and L segments). Further spread of RVF into countries where the disease is not endemic may affect the economy and public health, and vaccination is an effective approach to prevent the spread of RVFV. A live-attenuated MP-12 vaccine is one of the best-characterized RVF vaccines for safety and efficacy and is currently conditionally licensed for use for veterinary purposes in the United States. Meanwhile, as of 2015, no other RVF vaccine has been conditionally or fully licensed for use in the United States. The MP-12 strain is derived from wild-type pathogenic strain ZH548, and its genome encodes 23 mutations in the three genome segments. However, the mechanism of MP-12 attenuation remains unknown. We characterized the attenuation of wild-type pathogenic strain ZH501 carrying a mutation(s) of the MP-12 S, M, or L segment in a mouse model. Our results indicated that MP-12 is attenuated by the mutations in the S, M, and L segments, while the mutations in the M and L segments confer stronger attenuation than those in the S segment. We identified a combination of 3 amino acid changes, Y259H (Gn), R1182G (Gc), and R1029K (L), that was sufficient to attenuate ZH501. However, strain MP-12 with reversion mutations at those 3 sites was still highly attenuated. Our results indicate that MP-12 attenuation is supported by a combination of multiple partial attenuation mutations and a single reversion mutation is less likely to cause a reversion to virulence of the MP-12 vaccine. IMPORTANCE Rift Valley fever (RVF) is a mosquito-transmitted viral disease that is endemic to Africa and that has the potential to spread into other countries. Vaccination is considered an effective way to prevent the disease, and the only available veterinary RVF vaccine in the United States is a live-attenuated MP-12 vaccine, which is conditionally licensed. Strain MP-12 is different from its parental pathogenic RVFV strain, strain ZH548, because of the presence of 23 mutations. This study determined the role of individual mutations in the attenuation of the MP-12 strain. We found that full attenuation of MP-12 occurs by a combination of multiple mutations. Our findings indicate that a single reversion mutation will less likely cause a major reversion to virulence of the MP-12 vaccine. PMID:25948740

  7. Rift Valley Fever Virus MP-12 Vaccine Is Fully Attenuated by a Combination of Partial Attenuations in the S, M, and L Segments.

    PubMed

    Ikegami, Tetsuro; Hill, Terence E; Smith, Jennifer K; Zhang, Lihong; Juelich, Terry L; Gong, Bin; Slack, Olga A L; Ly, Hoai J; Lokugamage, Nandadeva; Freiberg, Alexander N

    2015-07-01

    Rift Valley fever (RVF) is a mosquito-borne zoonotic disease endemic to Africa and characterized by a high rate of abortion in ruminants and hemorrhagic fever, encephalitis, or blindness in humans. RVF is caused by Rift Valley fever virus (RVFV; family Bunyaviridae, genus Phlebovirus), which has a tripartite negative-stranded RNA genome (consisting of the S, M, and L segments). Further spread of RVF into countries where the disease is not endemic may affect the economy and public health, and vaccination is an effective approach to prevent the spread of RVFV. A live-attenuated MP-12 vaccine is one of the best-characterized RVF vaccines for safety and efficacy and is currently conditionally licensed for use for veterinary purposes in the United States. Meanwhile, as of 2015, no other RVF vaccine has been conditionally or fully licensed for use in the United States. The MP-12 strain is derived from wild-type pathogenic strain ZH548, and its genome encodes 23 mutations in the three genome segments. However, the mechanism of MP-12 attenuation remains unknown. We characterized the attenuation of wild-type pathogenic strain ZH501 carrying a mutation(s) of the MP-12 S, M, or L segment in a mouse model. Our results indicated that MP-12 is attenuated by the mutations in the S, M, and L segments, while the mutations in the M and L segments confer stronger attenuation than those in the S segment. We identified a combination of 3 amino acid changes, Y259H (Gn), R1182G (Gc), and R1029K (L), that was sufficient to attenuate ZH501. However, strain MP-12 with reversion mutations at those 3 sites was still highly attenuated. Our results indicate that MP-12 attenuation is supported by a combination of multiple partial attenuation mutations and a single reversion mutation is less likely to cause a reversion to virulence of the MP-12 vaccine. Rift Valley fever (RVF) is a mosquito-transmitted viral disease that is endemic to Africa and that has the potential to spread into other countries. Vaccination is considered an effective way to prevent the disease, and the only available veterinary RVF vaccine in the United States is a live-attenuated MP-12 vaccine, which is conditionally licensed. Strain MP-12 is different from its parental pathogenic RVFV strain, strain ZH548, because of the presence of 23 mutations. This study determined the role of individual mutations in the attenuation of the MP-12 strain. We found that full attenuation of MP-12 occurs by a combination of multiple mutations. Our findings indicate that a single reversion mutation will less likely cause a major reversion to virulence of the MP-12 vaccine. Copyright © 2015, American Society for Microbiology. All Rights Reserved.

  8. The Oxytricha trifallax Macronuclear Genome: A Complex Eukaryotic Genome with 16,000 Tiny Chromosomes

    PubMed Central

    Swart, Estienne C.; Bracht, John R.; Magrini, Vincent; Minx, Patrick; Chen, Xiao; Zhou, Yi; Khurana, Jaspreet S.; Goldman, Aaron D.; Nowacki, Mariusz; Schotanus, Klaas; Jung, Seolkyoung; Fulton, Robert S.; Ly, Amy; McGrath, Sean; Haub, Kevin; Wiggins, Jessica L.; Storton, Donna; Matese, John C.; Parsons, Lance; Chang, Wei-Jen; Bowen, Michael S.; Stover, Nicholas A.; Jones, Thomas A.; Eddy, Sean R.; Herrick, Glenn A.; Doak, Thomas G.; Wilson, Richard K.; Mardis, Elaine R.; Landweber, Laura F.

    2013-01-01

    The macronuclear genome of the ciliate Oxytricha trifallax displays an extreme and unique eukaryotic genome architecture with extensive genomic variation. During sexual genome development, the expressed, somatic macronuclear genome is whittled down to the genic portion of a small fraction (∼5%) of its precursor “silent” germline micronuclear genome by a process of “unscrambling” and fragmentation. The tiny macronuclear “nanochromosomes” typically encode single, protein-coding genes (a small portion, 10%, encode 2–8 genes), have minimal noncoding regions, and are differentially amplified to an average of ∼2,000 copies. We report the high-quality genome assembly of ∼16,000 complete nanochromosomes (∼50 Mb haploid genome size) that vary from 469 bp to 66 kb long (mean ∼3.2 kb) and encode ∼18,500 genes. Alternative DNA fragmentation processes ∼10% of the nanochromosomes into multiple isoforms that usually encode complete genes. Nucleotide diversity in the macronucleus is very high (SNP heterozygosity is ∼4.0%), suggesting that Oxytricha trifallax may have one of the largest known effective population sizes of eukaryotes. Comparison to other ciliates with nonscrambled genomes and long macronuclear chromosomes (on the order of 100 kb) suggests several candidate proteins that could be involved in genome rearrangement, including domesticated MULE and IS1595-like DDE transposases. The assembly of the highly fragmented Oxytricha macronuclear genome is the first completed genome with such an unusual architecture. This genome sequence provides tantalizing glimpses into novel molecular biology and evolution. For example, Oxytricha maintains tens of millions of telomeres per cell and has also evolved an intriguing expansion of telomere end-binding proteins. In conjunction with the micronuclear genome in progress, the O. trifallax macronuclear genome will provide an invaluable resource for investigating programmed genome rearrangements, complementing studies of rearrangements arising during evolution and disease. PMID:23382650

  9. Organization and sequence of four flagellin-encoding genes of Edwardsiella icataluri

    USDA-ARS?s Scientific Manuscript database

    Edwardsiella ictaluri, the cause of enteric septicemia in channel catfish (Ictalurus punctatus), is motile by means of peritrichous flagella. We determined the complete flagellin gene sequences and their organization in E. ictaluri by sequencing genomic segments selected from a lambda-ZAP phage gen...

  10. DETECTION OF GIARDIA IN ENVIRONMENTAL WATERS BY IMMUNO-PCR AMPLIFICATION METHODS

    EPA Science Inventory

    Genomic DNA was extracted either directly from Giardia muris cysts seeded into environmental surface waters or from cysts isolated by immunomagnetic beads (IMB).A 0.171-kbp segment of the giardin gene was PCR-amplified following "direct extraction" of Giardia DNA from seeded Caha...

  11. DETECTION OF GIARDIA IN ENVIRONMENTAL WATERS BY IMMUNO-PCR AMPLIFICATION METHODS

    EPA Science Inventory

    Genomic DNA was extracted either directly from Giardia muris cysts seeded into environmental surface waters or from cysts isolated by immunomagnetic beads (IMB}. A 0.171-kbp segment of the giardin gene was PCR-amplified following "direct extraction" of Giardia DNA from seeded Cah...

  12. Hypotheses on the evolution of hyaluronan: A highly ironic acid

    PubMed Central

    Csoka, Antonei B; Stern, Robert

    2013-01-01

    Hyaluronan is a high-molecular-weight glycosaminoglycan (GAG) prominent in the extracellular matrix. Emerging relatively late in evolution, it may have evolved to evade immune recognition. Chondroitin is a more ancient GAG and a possible hyaluronan precursor. Epimerization of a 4-hydroxyl in N-acetylgalactosamine in chondroitin to N-acetylglucosamine of hyaluronan is the only structural difference other than chain length between these two polymers. The axial 4-hydroxyl group extends out perpendicular from the equatorial plane of N-acetylgalactosamine in chondroitin. We suspect that this hydroxyl is a prime target for immune recognition. Conversion of a thumbs-up hydroxyl group into a thumbs-down position in the plane of the sugar endows hyaluronan with the ability to avoid immune recognition. Chitin is another potential precursor to hyaluronan. But regardless whether of chondroitin or of chitin origin, an ancient chondroitinase enzyme sequence seems to have been commandeered to catalyze the cleavage of the new hyaluronan substrate. The evolution of six hyaluronidase-like sequences in the human genome from a single chondroitinase as found in Caenorhabditis elegans can now be traced. Confirming our previous predictions, two duplication events occurred, with three hyaluronidase-like sequences occurring in the genome of Ciona intestinalis (sea squirt), the earliest known chordate. This was probably followed by en masse duplication, with six such genes present in the genome of zebra fish onwards. These events occurred, however, much earlier than predicted. It is also apparent on an evolutionary time scale that in several species, this gene family is continuing to evolve. PMID:23315448

  13. Pea chloroplast DNA encodes homologues of Escherichia coli ribosomal subunit S2 and the beta'-subunit of RNA polymerase.

    PubMed Central

    Cozens, A L; Walker, J E

    1986-01-01

    The nucleotide sequence has been determined of a segment of 4680 bases of the pea chloroplast genome. It adjoins a sequence described elsewhere that encodes subunits of the F0 membrane domain of the ATP-synthase complex. The sequence contains a potential gene encoding a protein which is strongly related to the S2 polypeptide of Escherichia coli ribosomes. It also encodes an incomplete protein which contains segments that are homologous to the beta'-subunit of E. coli RNA polymerase and to yeast RNA polymerases II and III. PMID:3530249

  14. Genome-wide characterization of microRNA in foxtail millet (Setaria italica)

    PubMed Central

    2013-01-01

    Background MicroRNAs (miRNAs) are a class of short non-coding, endogenous RNAs that play key roles in many biological processes in both animals and plants. Although many miRNAs have been identified in a large number of organisms, the miRNAs in foxtail millet (Setaria italica) have, until now, been poorly understood. Results In this study, two replicate small RNA libraries from foxtail millet shoots were sequenced, and 40 million reads representing over 10 million unique sequences were generated. We identified 43 known miRNAs, 172 novel miRNAs and 2 mirtron precursor candidates in foxtail millet. Some miRNA*s of the known and novel miRNAs were detected as well. Further, eight novel miRNAs were validated by stem-loop RT-PCR. Potential targets of the foxtail millet miRNAs were predicted based on our strict criteria. Of the predicted target genes, 79% (351) had functional annotations in InterPro and GO analyses, indicating the targets of the miRNAs were involved in a wide range of regulatory functions and some specific biological processes. A total of 69 pairs of syntenic miRNA precursors that were conserved between foxtail millet and sorghum were found. Additionally, stem-loop RT-PCR was conducted to confirm the tissue-specific expression of some miRNAs in the four tissues identified by deep-sequencing. Conclusions We predicted, for the first time, 215 miRNAs and 447 miRNA targets in foxtail millet at a genome-wide level. The precursors, expression levels, miRNA* sequences, target functions, conservation, and evolution of miRNAs we identified were investigated. Some of the novel foxtail millet miRNAs and miRNA targets were validated experimentally. PMID:24330712

  15. Genome-wide characterization of microRNA in foxtail millet (Setaria italica).

    PubMed

    Yi, Fei; Xie, Shaojun; Liu, Yuwei; Qi, Xin; Yu, Jingjuan

    2013-12-13

    MicroRNAs (miRNAs) are a class of short non-coding, endogenous RNAs that play key roles in many biological processes in both animals and plants. Although many miRNAs have been identified in a large number of organisms, the miRNAs in foxtail millet (Setaria italica) have, until now, been poorly understood. In this study, two replicate small RNA libraries from foxtail millet shoots were sequenced, and 40 million reads representing over 10 million unique sequences were generated. We identified 43 known miRNAs, 172 novel miRNAs and 2 mirtron precursor candidates in foxtail millet. Some miRNA*s of the known and novel miRNAs were detected as well. Further, eight novel miRNAs were validated by stem-loop RT-PCR. Potential targets of the foxtail millet miRNAs were predicted based on our strict criteria. Of the predicted target genes, 79% (351) had functional annotations in InterPro and GO analyses, indicating the targets of the miRNAs were involved in a wide range of regulatory functions and some specific biological processes. A total of 69 pairs of syntenic miRNA precursors that were conserved between foxtail millet and sorghum were found. Additionally, stem-loop RT-PCR was conducted to confirm the tissue-specific expression of some miRNAs in the four tissues identified by deep-sequencing. We predicted, for the first time, 215 miRNAs and 447 miRNA targets in foxtail millet at a genome-wide level. The precursors, expression levels, miRNA* sequences, target functions, conservation, and evolution of miRNAs we identified were investigated. Some of the novel foxtail millet miRNAs and miRNA targets were validated experimentally.

  16. Advances in synthetic biology of oleaginous yeast Yarrowia lipolytica for producing non-native chemicals.

    PubMed

    Darvishi, Farshad; Ariana, Mehdi; Marella, Eko Roy; Borodina, Irina

    2018-07-01

    Oleaginous yeast Yarrowia lipolytica is an important industrial host for the production of enzymes, oils, fragrances, surfactants, cosmetics, and pharmaceuticals. More recently, improved synthetic biology tools have allowed more extensive engineering of this yeast species, which lead to the production of non-native metabolites. In this review, we summarize the recent advances of genome editing tools for Y. lipolytica, including the application of CRISPR/Cas9 system and discuss case studies, where Y. lipolytica was engineered to produce various non-native chemicals: short-chain fatty alcohols and alkanes as biofuels, polyunsaturated fatty acids for nutritional and pharmaceutical applications, polyhydroxyalkanoates and dicarboxylic acids as precursors for biodegradable plastics, carotenoid-type pigments for food and feed, and campesterol as a precursor for steroid drugs.

  17. The PLUTO plastidial nucleobase transporter also transports the thiamin precursor hydroxymethylpyrimidine.

    PubMed

    Beaudoin, Guillaume A W; Johnson, Timothy S; Hanson, Andrew D

    2018-04-27

    In plants, the hydroxymethylpyrimidine (HMP) and thiazole precursors of thiamin are synthesized and coupled together to form thiamin in plastids. Mutants unable to form HMP can be rescued by exogenous HMP, implying the presence of HMP transporters in the plasma membrane and plastids. Analysis of bacterial genomes revealed a transporter gene that is chromosomally clustered with thiamin biosynthesis and salvage genes. Its closest Arabidopsis homolog, the plastidic nucleobase transporter (PLUTO), is co-expressed with several thiamin biosynthetic enzymes. Heterologous expression of PLUTO in Escherichia coli or Saccharomyces cerevisiae increased sensitivity to a toxic HMP analog, and disrupting PLUTO in an HMP-requiring Arabidopsis line reduced root growth at low HMP concentrations. These data implicate PLUTO in plastidial transport and salvage of HMP. © 2018 The Author(s).

  18. Engineered LINE-1 retrotransposition in nondividing human neurons

    PubMed Central

    Macia, Angela; Widmann, Thomas J.; Heras, Sara R.; Ayllon, Veronica; Sanchez, Laura; Benkaddour-Boumzaouad, Meriem; Muñoz-Lopez, Martin; Rubio, Alejandro; Amador-Cubero, Suyapa; Blanco-Jimenez, Eva; Garcia-Castro, Javier; Menendez, Pablo; Ng, Philip; Muotri, Alysson R.; Goodier, John L.; Garcia-Perez, Jose L.

    2017-01-01

    Half the human genome is made of transposable elements (TEs), whose ongoing activity continues to impact our genome. LINE-1 (or L1) is an autonomous non-LTR retrotransposon in the human genome, comprising 17% of its genomic mass and containing an average of 80–100 active L1s per average genome that provide a source of inter-individual variation. New LINE-1 insertions are thought to accumulate mostly during human embryogenesis. Surprisingly, the activity of L1s can further impact the somatic human brain genome. However, it is currently unknown whether L1 can retrotranspose in other somatic healthy tissues or if L1 mobilization is restricted to neuronal precursor cells (NPCs) in the human brain. Here, we took advantage of an engineered L1 retrotransposition assay to analyze L1 mobilization rates in human mesenchymal (MSCs) and hematopoietic (HSCs) somatic stem cells. Notably, we have observed that L1 expression and engineered retrotransposition is much lower in both MSCs and HSCs when compared to NPCs. Remarkably, we have further demonstrated for the first time that engineered L1s can retrotranspose efficiently in mature nondividing neuronal cells. Thus, these findings suggest that the degree of somatic mosaicism and the impact of L1 retrotransposition in the human brain is likely much higher than previously thought. PMID:27965292

  19. Visualization of A- and B-genome chromosomes in wheat (Triticum aestivum L.) x jointed goatgrass (Aegilops cylindrica Host) backcross progenies.

    PubMed

    Wang, Z N; Hang, A; Hansen, J; Burton, C; Mallory-Smith, C A; Zemetra, R S

    2000-12-01

    Wheat (Triticum aestivum) and jointed goatgrass (Aegilops cylindrica) can cross with each other, and their self-fertile backcross progenies frequently have extra chromosomes and chromosome segments, presumably retained from wheat, raising the possibility that a herbicide resistance gene might transfer from wheat to jointed goatgrass. Genomic in situ hybridization (GISH) was used to clarify the origin of these extra chromosomes. By using T. durum DNA (AABB genome) as a probe and jointed goatgrass DNA (CCDD genome) as blocking DNA, one, two, and three A- or B-genome chromosomes were identified in three BC2S2 individuals where 2n = 29, 30, and 31 chromosomes, respectively. A translocation between wheat and jointed goatgrass chromosomes was also detected in an individual with 30 chromosomes. In pollen mother cells with meiotic configuration of 14 II + 2 I, the two univalents were identified as being retained from the A or B genome of wheat. By using Ae. markgrafii DNA (CC genome) as a probe and wheat DNA (AABBDD genome) as blocking DNA. 14 C-genome chromosomes were visualized in all BC2S2 individuals. The GISH procedure provides a powerful tool to detect the A or B-genome chromatin in a jointed goatgrass background, making it possible to assess the risk of transfer of herbicide resistance genes located on the A or B genome of wheat to jointed goatgrass.

  20. Whole genome sequences in pulse crops: a global community resource to expedite translational genomics and knowledge-based crop improvement.

    PubMed

    Bohra, Abhishek; Singh, Narendra P

    2015-08-01

    Unprecedented developments in legume genomics over the last decade have resulted in the acquisition of a wide range of modern genomic resources to underpin genetic improvement of grain legumes. The genome enabled insights direct investigators in various ways that primarily include unearthing novel structural variations, retrieving the lost genetic diversity, introducing novel/exotic alleles from wider gene pools, finely resolving the complex quantitative traits and so forth. To this end, ready availability of cost-efficient and high-density genotyping assays allows genome wide prediction to be increasingly recognized as the key selection criterion in crop breeding. Further, the high-dimensional measurements of agronomically significant phenotypes obtained by using new-generation screening techniques will empower reference based resequencing as well as allele mining and trait mapping methods to comprehensively associate genome diversity with the phenome scale variation. Besides stimulating the forward genetic systems, accessibility to precisely delineated genomic segments reveals novel candidates for reverse genetic techniques like targeted genome editing. The shifting paradigm in plant genomics in turn necessitates optimization of crop breeding strategies to enable the most efficient integration of advanced omics knowledge and tools. We anticipate that the crop improvement schemes will be bolstered remarkably with rational deployment of these genome-guided approaches, ultimately resulting in expanded plant breeding capacities and improved crop performance.

Top