Sample records for genomic island specific

  1. Islander: A database of precisely mapped genomic islands in tRNA and tmRNA genes

    DOE PAGES

    Hudson, Corey M.; Lau, Britney Y.; Williams, Kelly P.

    2014-11-05

    Genomic islands are mobile DNAs that are major agents of bacterial and archaeal evolution. Integration into prokaryotic chromosomes usually occurs site-specifically at tRNA or tmRNA gene (together, tDNA) targets, catalyzed by tyrosine integrases. This splits the target gene, yet sequences within the island restore the disrupted gene; the regenerated target and its displaced fragment precisely mark the endpoints of the island. We applied this principle to search for islands in genomic DNA sequences. Our algorithm identifies tDNAs, finds fragments of those tDNAs in the same replicon and removes unlikely candidate islands through a series of filters. A search for islandsmore » in 2168 whole prokaryotic genomes produced 3919 candidates. The website Islander (recently moved to http://bioinformatics.sandia.gov/islander/) presents these precisely mapped candidate islands, the gene content and the island sequence. The algorithm further insists that each island encode an integrase, and attachment site sequence identity is carefully noted; therefore, the database also serves in the study of integrase site-specificity and its evolution.« less

  2. Site-Specific Mobilization of Vinyl Chloride Respiration Islands by a Mechanism Common in Dehalococcoides

    PubMed Central

    2011-01-01

    Background Vinyl chloride is a widespread groundwater pollutant and Group 1 carcinogen. A previous comparative genomic analysis revealed that the vinyl chloride reductase operon, vcrABC, of Dehalococcoides sp. strain VS is embedded in a horizontally-acquired genomic island that integrated at the single-copy tmRNA gene, ssrA. Results We targeted conserved positions in available genomic islands to amplify and sequence four additional vcrABC -containing genomic islands from previously-unsequenced vinyl chloride respiring Dehalococcoides enrichments. We identified a total of 31 ssrA-specific genomic islands from Dehalococcoides genomic data, accounting for 47 reductive dehalogenase homologous genes and many other non-core genes. Sixteen of these genomic islands contain a syntenic module of integration-associated genes located adjacent to the predicted site of integration, and among these islands, eight contain vcrABC as genetic 'cargo'. These eight vcrABC -containing genomic islands are syntenic across their ~12 kbp length, but have two phylogenetically discordant segments that unambiguously differentiate the integration module from the vcrABC cargo. Using available Dehalococcoides phylogenomic data we estimate that these ssrA-specific genomic islands are at least as old as the Dehalococcoides group itself, which in turn is much older than human civilization. Conclusions The vcrABC -containing genomic islands are a recently-acquired subset of a diverse collection of ssrA-specific mobile elements that are a major contributor to strain-level diversity in Dehalococcoides, and may have been throughout its evolution. The high similarity between vcrABC sequences is quantitatively consistent with recent horizontal acquisition driven by ~100 years of industrial pollution with chlorinated ethenes. PMID:21635780

  3. Genomic islands link secondary metabolism to functional adaptation in marine Actinobacteria

    PubMed Central

    Penn, Kevin; Jenkins, Caroline; Nett, Markus; Udwary, Daniel W.; Gontang, Erin A.; McGlinchey, Ryan P.; Foster, Brian; Lapidus, Alla; Podell, Sheila; Allen, Eric E.; Moore, Bradley S.; Jensen, Paul R.

    2009-01-01

    Genomic islands have been shown to harbor functional traits that differentiate ecologically distinct populations of environmental bacteria. A comparative analysis of the complete genome sequences of the marine Actinobacteria Salinispora tropica and S. arenicola reveals that 75% of the species-specific genes are located in 21 genomic islands. These islands are enriched in genes associated with secondary metabolite biosynthesis providing evidence that secondary metabolism is linked to functional adaptation. Secondary metabolism accounts for 8.8% and 10.9% of the genes in the S. tropica and S. arenicola genomes, respectively, and represents the major functional category of annotated genes that differentiates the two species. Genomic islands harbor all 25 of the species-specific biosynthetic pathways, the majority of which occur in S. arenicola and may contribute to the cosmopolitan distribution of this species. Genome evolution is dominated by gene duplication and acquisition, which in the case of secondary metabolism provide immediate opportunities for the production of new bioactive products. Evidence that secondary metabolic pathways are exchanged horizontally, coupled with prior evidence for fixation among globally distributed populations, supports a functional role and suggests that the acquisition of natural product biosynthetic gene clusters represents a previously unrecognized force driving bacterial diversification. Species-specific differences observed in CRISPR (clustered regularly interspaced short palindromic repeat) sequences suggest that S. arenicola may possess a higher level of phage immunity, while a highly duplicated family of polymorphic membrane proteins provides evidence of a new mechanism of marine adaptation in Gram-positive bacteria. PMID:19474814

  4. Patterns and architecture of genomic islands in marine bacteria

    PubMed Central

    2012-01-01

    Background Genomic Islands (GIs) have key roles since they modulate the structure and size of bacterial genomes displaying a diverse set of laterally transferred genes. Despite their importance, GIs in marine bacterial genomes have not been explored systematically to uncover possible trends and to analyze their putative ecological significance. Results We carried out a comprehensive analysis of GIs in 70 selected marine bacterial genomes detected with IslandViewer to explore the distribution, patterns and functional gene content in these genomic regions. We detected 438 GIs containing a total of 8152 genes. GI number per genome was strongly and positively correlated with the total GI size. In 50% of the genomes analyzed the GIs accounted for approximately 3% of the genome length, with a maximum of 12%. Interestingly, we found transposases particularly enriched within Alphaproteobacteria GIs, and site-specific recombinases in Gammaproteobacteria GIs. We described specific Homologous Recombination GIs (HR-GIs) in several genera of marine Bacteroidetes and in Shewanella strains among others. In these HR-GIs, we recurrently found conserved genes such as the β-subunit of DNA-directed RNA polymerase, regulatory sigma factors, the elongation factor Tu and ribosomal protein genes typically associated with the core genome. Conclusions Our results indicate that horizontal gene transfer mediated by phages, plasmids and other mobile genetic elements, and HR by site-specific recombinases play important roles in the mobility of clusters of genes between taxa and within closely related genomes, modulating the flexible pool of the genome. Our findings suggest that GIs may increase bacterial fitness under environmental changing conditions by acquiring novel foreign genes and/or modifying gene transcription and/or transduction. PMID:22839777

  5. Microbial genomic island discovery, visualization and analysis.

    PubMed

    Bertelli, Claire; Tilley, Keith E; Brinkman, Fiona S L

    2018-06-03

    Horizontal gene transfer (also called lateral gene transfer) is a major mechanism for microbial genome evolution, enabling rapid adaptation and survival in specific niches. Genomic islands (GIs), commonly defined as clusters of bacterial or archaeal genes of probable horizontal origin, are of particular medical, environmental and/or industrial interest, as they disproportionately encode virulence factors and some antimicrobial resistance genes and may harbor entire metabolic pathways that confer a specific adaptation (solvent resistance, symbiosis properties, etc). As large-scale analyses of microbial genomes increases, such as for genomic epidemiology investigations of infectious disease outbreaks in public health, there is increased appreciation of the need to accurately predict and track GIs. Over the past decade, numerous computational tools have been developed to tackle the challenges inherent in accurate GI prediction. We review here the main types of GI prediction methods and discuss their advantages and limitations for a routine analysis of microbial genomes in this era of rapid whole-genome sequencing. An assessment is provided of 20 GI prediction software methods that use sequence-composition bias to identify the GIs, using a reference GI data set from 104 genomes obtained using an independent comparative genomics approach. Finally, we present guidelines to assist researchers in effectively identifying these key genomic regions.

  6. MobilomeFINDER: web-based tools for in silico and experimental discovery of bacterial genomic islands

    PubMed Central

    Ou, Hong-Yu; He, Xinyi; Harrison, Ewan M.; Kulasekara, Bridget R.; Thani, Ali Bin; Kadioglu, Aras; Lory, Stephen; Hinton, Jay C. D.; Barer, Michael R.; Rajakumar, Kumar

    2007-01-01

    MobilomeFINDER (http://mml.sjtu.edu.cn/MobilomeFINDER) is an interactive online tool that facilitates bacterial genomic island or ‘mobile genome’ (mobilome) discovery; it integrates the ArrayOme and tRNAcc software packages. ArrayOme utilizes a microarray-derived comparative genomic hybridization input data set to generate ‘inferred contigs’ produced by merging adjacent genes classified as ‘present’. Collectively these ‘fragments’ represent a hypothetical ‘microarray-visualized genome (MVG)’. ArrayOme permits recognition of discordances between physical genome and MVG sizes, thereby enabling identification of strains rich in microarray-elusive novel genes. Individual tRNAcc tools facilitate automated identification of genomic islands by comparative analysis of the contents and contexts of tRNA sites and other integration hotspots in closely related sequenced genomes. Accessory tools facilitate design of hotspot-flanking primers for in silico and/or wet-science-based interrogation of cognate loci in unsequenced strains and analysis of islands for features suggestive of foreign origins; island-specific and genome-contextual features are tabulated and represented in schematic and graphical forms. To date we have used MobilomeFINDER to analyse several Enterobacteriaceae, Pseudomonas aeruginosa and Streptococcus suis genomes. MobilomeFINDER enables high-throughput island identification and characterization through increased exploitation of emerging sequence data and PCR-based profiling of unsequenced test strains; subsequent targeted yeast recombination-based capture permits full-length sequencing and detailed functional studies of novel genomic islands. PMID:17537813

  7. Proteus genomic island 1 (PGI1), a new resistance genomic island from two Proteus mirabilis French clinical isolates.

    PubMed

    Siebor, Eliane; Neuwirth, Catherine

    2014-12-01

    To analyse the genetic environment of the antibiotic resistance genes in two clinical Proteus mirabilis isolates resistant to multiple antibiotics. PCR, gene walking and whole-genome sequencing were used to determine the sequence of the resistance regions, the surrounding genetic structure and the flanking chromosomal regions. A genomic island of 81.1 kb named Proteus genomic island 1 (PGI1) located at the 3'-end of trmE (formerly known as thdF) was characterized. The large MDR region of PGI1 (55.4 kb) included a class 1 integron (aadB and aadA2) and regions deriving from several transposons: Tn2 (blaTEM-135), Tn21, Tn6020-like transposon (aphA1b), a hybrid Tn502/Tn5053 transposon, Tn501, a hybrid Tn1696/Tn1721 transposon [tetA(A)] carrying a class 1 integron (aadA1) and Tn5393 (strA and strB). Several ISs were also present (IS4321, IS1R and IS26). The PGI1 backbone (25.7 kb) was identical to that identified in Salmonella Heidelberg SL476 and shared some identity with the Salmonella genomic island 1 (SGI1) backbone. An IS26-mediated recombination event caused the division of the MDR region into two parts separated by a large chromosomal DNA fragment of 197 kb, the right end of PGI1 and this chromosomal sequence being in inverse orientation. PGI1 is a new resistance genomic island from P. mirabilis belonging to the same island family as SGI1. The role of PGI1 in the spread of antimicrobial resistance genes among Enterobacteriaceae of medical importance needs to be evaluated. © The Author 2014. Published by Oxford University Press on behalf of the British Society for Antimicrobial Chemotherapy. All rights reserved. For Permissions, please e-mail: journals.permissions@oup.com.

  8. A Hybrid Approach for CpG Island Detection in the Human Genome.

    PubMed

    Yang, Cheng-Hong; Lin, Yu-Da; Chiang, Yi-Cheng; Chuang, Li-Yeh

    2016-01-01

    CpG islands have been demonstrated to influence local chromatin structures and simplify the regulation of gene activity. However, the accurate and rapid determination of CpG islands for whole DNA sequences remains experimentally and computationally challenging. A novel procedure is proposed to detect CpG islands by combining clustering technology with the sliding-window method (PSO-based). Clustering technology is used to detect the locations of all possible CpG islands and process the data, thus effectively obviating the need for the extensive and unnecessary processing of DNA fragments, and thus improving the efficiency of sliding-window based particle swarm optimization (PSO) search. This proposed approach, named ClusterPSO, provides versatile and highly-sensitive detection of CpG islands in the human genome. In addition, the detection efficiency of ClusterPSO is compared with eight CpG island detection methods in the human genome. Comparison of the detection efficiency for the CpG islands in human genome, including sensitivity, specificity, accuracy, performance coefficient (PC), and correlation coefficient (CC), ClusterPSO revealed superior detection ability among all of the test methods. Moreover, the combination of clustering technology and PSO method can successfully overcome their respective drawbacks while maintaining their advantages. Thus, clustering technology could be hybridized with the optimization algorithm method to optimize CpG island detection. The prediction accuracy of ClusterPSO was quite high, indicating the combination of CpGcluster and PSO has several advantages over CpGcluster and PSO alone. In addition, ClusterPSO significantly reduced implementation time.

  9. GI-POP: a combinational annotation and genomic island prediction pipeline for ongoing microbial genome projects.

    PubMed

    Lee, Chi-Ching; Chen, Yi-Ping Phoebe; Yao, Tzu-Jung; Ma, Cheng-Yu; Lo, Wei-Cheng; Lyu, Ping-Chiang; Tang, Chuan Yi

    2013-04-10

    Sequencing of microbial genomes is important because of microbial-carrying antibiotic and pathogenetic activities. However, even with the help of new assembling software, finishing a whole genome is a time-consuming task. In most bacteria, pathogenetic or antibiotic genes are carried in genomic islands. Therefore, a quick genomic island (GI) prediction method is useful for ongoing sequencing genomes. In this work, we built a Web server called GI-POP (http://gipop.life.nthu.edu.tw) which integrates a sequence assembling tool, a functional annotation pipeline, and a high-performance GI predicting module, in a support vector machine (SVM)-based method called genomic island genomic profile scanning (GI-GPS). The draft genomes of the ongoing genome projects in contigs or scaffolds can be submitted to our Web server, and it provides the functional annotation and highly probable GI-predicting results. GI-POP is a comprehensive annotation Web server designed for ongoing genome project analysis. Researchers can perform annotation and obtain pre-analytic information include possible GIs, coding/non-coding sequences and functional analysis from their draft genomes. This pre-analytic system can provide useful information for finishing a genome sequencing project. Copyright © 2012 Elsevier B.V. All rights reserved.

  10. The Perchlorate Reduction Genomic Island: Mechanisms and Pathways of Evolution by Horizontal Gene Transfer.

    PubMed

    Melnyk, Ryan A; Coates, John D

    2015-10-26

    Perchlorate is a widely distributed anion that is toxic to humans, but serves as a valuable electron acceptor for several lineages of bacteria. The ability to utilize perchlorate is conferred by a horizontally transferred piece of DNA called the perchlorate reduction genomic island (PRI). We compared genomes of perchlorate reducers using phylogenomics, SNP mapping, and differences in genomic architecture to interrogate the evolutionary history of perchlorate respiration. Here we report on the PRI of 13 genomes of perchlorate-reducing bacteria from four different classes of Phylum Proteobacteria (the Alpha-, Beta-, Gamma- and Epsilonproteobacteria). Among the different phylogenetic classes, the island varies considerably in genetic content as well as in its putative mechanism and location of integration. However, the islands of the densely sampled genera Azospira and Magnetospirillum have striking nucleotide identity despite divergent genomes, implying horizontal transfer and positive selection within narrow phylogenetic taxa. We also assess the phylogenetic origin of accessory genes in the various incarnations of the island, which can be traced to chromosomal paralogs from phylogenetically similar organisms. These observations suggest a complex phylogenetic history where the island is rarely transferred at the class level but undergoes frequent and continuous transfer within narrow phylogenetic groups. This restricted transfer is seen directly by the independent integration of near-identical islands within a genus and indirectly due to the acquisition of lineage-specific accessory genes. The genomic reversibility of perchlorate reduction may present a unique equilibrium for a metabolism that confers a competitive advantage only in the presence of an electron acceptor, which although widely distributed, is generally present at low concentrations in nature.

  11. Identification of another module involved in the horizontal transfer of the Haemophilus genomic island ICEHin1056.

    PubMed

    Juhas, Mario; Dimopoulou, Ioanna; Robinson, Esther; Elamin, Abdel; Harding, Rosalind; Hood, Derek; Crook, Derrick

    2013-09-01

    A significant part of horizontal gene transfer is facilitated by genomic islands. Haemophilus influenzae genomic island ICEHin1056 is an archetype of a genomic island that accounts for pandemic spread of antibiotics resistance. ICEHin1056 has modular structure and harbors modules involved in type IV secretion and integration. Previous studies have shown that ICEHin1056 encodes a functional type IV secretion system; however, other modules have not been characterized yet. Here we show that the module on the 5' extremity of ICEHin1056 consists of 15 genes that are well conserved in a number of related genomic islands. Furthermore by disrupting six genes of the investigated module of ICEHin1056 by site-specific mutagenesis we demonstrate that in addition to type IV secretion system module, the investigated module is also important for the successful conjugal transfer of ICEHin1056 from donor to recipient cells. Copyright © 2013 Elsevier Inc. All rights reserved.

  12. Genome Island: A Virtual Science Environment in Second Life

    ERIC Educational Resources Information Center

    Clark, Mary Anne

    2009-01-01

    Mary Anne CLark describes the organization and uses of Genome Island, a virtual laboratory complex constructed in Second Life. Genome Island was created for teaching genetics to university undergraduates but also provides a public space where anyone interested in genetics can spend a few minutes, or a few hours, interacting with genetic…

  13. CRISPR-based screening of genomic island excision events in bacteria.

    PubMed

    Selle, Kurt; Klaenhammer, Todd R; Barrangou, Rodolphe

    2015-06-30

    Genomic analysis of Streptococcus thermophilus revealed that mobile genetic elements (MGEs) likely contributed to gene acquisition and loss during evolutionary adaptation to milk. Clustered regularly interspaced short palindromic repeats-CRISPR-associated genes (CRISPR-Cas), the adaptive immune system in bacteria, limits genetic diversity by targeting MGEs including bacteriophages, transposons, and plasmids. CRISPR-Cas systems are widespread in streptococci, suggesting that the interplay between CRISPR-Cas systems and MGEs is one of the driving forces governing genome homeostasis in this genus. To investigate the genetic outcomes resulting from CRISPR-Cas targeting of integrated MGEs, in silico prediction revealed four genomic islands without essential genes in lengths from 8 to 102 kbp, totaling 7% of the genome. In this study, the endogenous CRISPR3 type II system was programmed to target the four islands independently through plasmid-based expression of engineered CRISPR arrays. Targeting lacZ within the largest 102-kbp genomic island was lethal to wild-type cells and resulted in a reduction of up to 2.5-log in the surviving population. Genotyping of Lac(-) survivors revealed variable deletion events between the flanking insertion-sequence elements, all resulting in elimination of the Lac-encoding island. Chimeric insertion sequence footprints were observed at the deletion junctions after targeting all of the four genomic islands, suggesting a common mechanism of deletion via recombination between flanking insertion sequences. These results established that self-targeting CRISPR-Cas systems may direct significant evolution of bacterial genomes on a population level, influencing genome homeostasis and remodeling.

  14. Position-based scanning for comparative genomics and identification of genetic islands in Haemophilus influenzae type b.

    PubMed

    Bergman, Nicholas H; Akerley, Brian J

    2003-03-01

    Bacteria exhibit extensive genetic heterogeneity within species. In many cases, these differences account for virulence properties unique to specific strains. Several such loci have been discovered in the genome of the type b serotype of Haemophilus influenzae, a human pathogen able to cause meningitis, pneumonia, and septicemia. Here we report application of a PCR-based scanning procedure to compare the genome of a virulent type b (Hib) strain with that of the laboratory-passaged Rd KW20 strain for which a complete genome sequence is available. We have identified seven DNA segments or H. influenzae genetic islands (HiGIs) present in the type b genome and absent from the Rd genome. These segments vary in size and content and show signs of horizontal gene transfer in that their percent G+C content differs from that of the rest of the H. influenzae genome, they contain genes similar to those found on phages or other mobile elements, or they are flanked by DNA repeats. Several of these loci represent potential pathogenicity islands, because they contain genes likely to mediate interactions with the host. These newly identified genetic islands provide areas of investigation into both the evolution and pathogenesis of H. influenzae. In addition, the genome scanning approach developed to identify these islands provides a rapid means to compare the genomes of phenotypically diverse bacterial strains once the genome sequence of one representative strain has been determined.

  15. Identification of a lineage specific zinc responsive genomic island in Mycobacterium avium ssp. paratuberculosis.

    PubMed

    Eckelt, Elke; Jarek, Michael; Frömke, Cornelia; Meens, Jochen; Goethe, Ralph

    2014-12-06

    Maintenance of metal homeostasis is crucial in bacterial pathogenicity as metal starvation is the most important mechanism in the nutritional immunity strategy of host cells. Thus, pathogenic bacteria have evolved sensitive metal scavenging systems to overcome this particular host defence mechanism. The ruminant pathogen Mycobacterium avium ssp. paratuberculosis (MAP) displays a unique gut tropism and causes a chronic progressive intestinal inflammation. MAP possesses eight conserved lineage specific large sequence polymorphisms (LSP), which distinguish MAP from its ancestral M. avium ssp. hominissuis or other M. avium subspecies. LSP14 and LSP15 harbour many genes proposed to be involved in metal homeostasis and have been suggested to substitute for a MAP specific, impaired mycobactin synthesis. In the present study, we found that a LSP14 located putative IrtAB-like iron transporter encoded by mptABC was induced by zinc but not by iron starvation. Heterologous reporter gene assays with the lacZ gene under control of the mptABC promoter in M. smegmatis (MSMEG) and in a MSMEG∆furB deletion mutant revealed a zinc dependent, metalloregulator FurB mediated expression of mptABC via a conserved mycobacterial FurB recognition site. Deep sequencing of RNA from MAP cultures treated with the zinc chelator TPEN revealed that 70 genes responded to zinc limitation. Remarkably, 45 of these genes were located on a large genomic island of approximately 90 kb which harboured LSP14 and LSP15. Thirty-five of these genes were predicted to be controlled by FurB, due to the presence of putative binding sites. This clustering of zinc responsive genes was exclusively found in MAP and not in other mycobacteria. Our data revealed a particular genomic signature for MAP given by a unique zinc specific locus, thereby suggesting an exceptional relevance of zinc for the metabolism of MAP. MAP seems to be well adapted to maintain zinc homeostasis which might contribute to the peculiarity of MAP

  16. Defense islands in bacterial and archaeal genomes and prediction of novel defense systems.

    PubMed

    Makarova, Kira S; Wolf, Yuri I; Snir, Sagi; Koonin, Eugene V

    2011-11-01

    The arms race between cellular life forms and viruses is a major driving force of evolution. A substantial fraction of bacterial and archaeal genomes is dedicated to antivirus defense. We analyzed the distribution of defense genes and typical mobilome components (such as viral and transposon genes) in bacterial and archaeal genomes and demonstrated statistically significant clustering of antivirus defense systems and mobile genes and elements in genomic islands. The defense islands are enriched in putative operons and contain numerous overrepresented gene families. A detailed sequence analysis of the proteins encoded by genes in these families shows that many of them are diverged variants of known defense system components, whereas others show features, such as characteristic operonic organization, that are suggestive of novel defense systems. Thus, genomic islands provide abundant material for the experimental study of bacterial and archaeal antivirus defense. Except for the CRISPR-Cas systems, different classes of defense systems, in particular toxin-antitoxin and restriction-modification systems, show nonrandom clustering in defense islands. It remains unclear to what extent these associations reflect functional cooperation between different defense systems and to what extent the islands are genomic "sinks" that accumulate diverse nonessential genes, particularly those acquired via horizontal gene transfer. The characteristics of defense islands resemble those of mobilome islands. Defense and mobilome genes are nonrandomly associated in islands, suggesting nonadaptive evolution of the islands via a preferential attachment-like mechanism underpinned by the addictive properties of defense systems such as toxins-antitoxins and an important role of horizontal mobility in the evolution of these islands.

  17. Genomic islands of divergence are not affected by geography of speciation in sunflowers.

    PubMed

    Renaut, S; Grassa, C J; Yeaman, S; Moyers, B T; Lai, Z; Kane, N C; Bowers, J E; Burke, J M; Rieseberg, L H

    2013-01-01

    Genomic studies of speciation often report the presence of highly differentiated genomic regions interspersed within a milieu of weakly diverged loci. The formation of these speciation islands is generally attributed to reduced inter-population gene flow near loci under divergent selection, but few studies have critically evaluated this hypothesis. Here, we report on transcriptome scans among four recently diverged pairs of sunflower (Helianthus) species that vary in the geographical context of speciation. We find that genetic divergence is lower in sympatric and parapatric comparisons, consistent with a role for gene flow in eroding neutral differences. However, genomic islands of divergence are numerous and small in all comparisons, and contrary to expectations, island number and size are not significantly affected by levels of interspecific gene flow. Rather, island formation is strongly associated with reduced recombination rates. Overall, our results indicate that the functional architecture of genomes plays a larger role in shaping genomic divergence than does the geography of speciation.

  18. GaussianCpG: a Gaussian model for detection of CpG island in human genome sequences.

    PubMed

    Yu, Ning; Guo, Xuan; Zelikovsky, Alexander; Pan, Yi

    2017-05-24

    As crucial markers in identifying biological elements and processes in mammalian genomes, CpG islands (CGI) play important roles in DNA methylation, gene regulation, epigenetic inheritance, gene mutation, chromosome inactivation and nuclesome retention. The generally accepted criteria of CGI rely on: (a) %G+C content is ≥ 50%, (b) the ratio of the observed CpG content and the expected CpG content is ≥ 0.6, and (c) the general length of CGI is greater than 200 nucleotides. Most existing computational methods for the prediction of CpG island are programmed on these rules. However, many experimentally verified CpG islands deviate from these artificial criteria. Experiments indicate that in many cases %G+C is < 50%, CpG obs /CpG exp varies, and the length of CGI ranges from eight nucleotides to a few thousand of nucleotides. It implies that CGI detection is not just a straightly statistical task and some unrevealed rules probably are hidden. A novel Gaussian model, GaussianCpG, is developed for detection of CpG islands on human genome. We analyze the energy distribution over genomic primary structure for each CpG site and adopt the parameters from statistics of Human genome. The evaluation results show that the new model can predict CpG islands efficiently by balancing both sensitivity and specificity over known human CGI data sets. Compared with other models, GaussianCpG can achieve better performance in CGI detection. Our Gaussian model aims to simplify the complex interaction between nucleotides. The model is computed not by the linear statistical method but by the Gaussian energy distribution and accumulation. The parameters of Gaussian function are not arbitrarily designated but deliberately chosen by optimizing the biological statistics. By using the pseudopotential analysis on CpG islands, the novel model is validated on both the real and artificial data sets.

  19. [Plasticity of bacterial genomes: pathogenicity islands and the locus of enterocyte effacement (LEE)].

    PubMed

    Kirsch, Petra; Jores, Jörg; Wieler, Lothar H

    2004-01-01

    Many bacterial virulence attributes, like toxins, adhesins, invasins, iron uptake systems, are encoded within specific regions of the bacterial genome. These in size varying regions are termed pathogenicity islands (PAIs) since they confer pathogenic properties to the respective micro-organism. Per definition PAIs are exclusively found in pathogenic strains and are often inserted near transfer-RNA genes. Nevertheless, non-pathogenic bacteria also possess foreign DNA elements that confer advantageous features, leading to improved fitness. These additional DNA elements as well as PAIs are termed genomic islands and were acquired during bacterial evolution. Significant G+C content deviation in pathogenicity islands with respect to the rest of the genome, the presence of direct repeat sequences at the flanking regions, the presence of integrase gene determinants as other mobility features,the particular insertion site (tRNA gene) as well as the observed genetic instability suggests that pathogenicity islands were acquired by horizontal gene transfer. PAIs are the fascinating proof of the plasticity of bacterial genomes. PAIs were originally described in human pathogenic Escherichia (E.) coli strains. In the meantime PAIs have been found in various pathogenic bacteria of humans, animals and even plants. The Locus of Enterocyte Effacement (LEE) is one particular widely distributed PAI of E coli. In addition, it also confers pathogenicity to the related species Citrobacter (C.) rodentium and Escherichia (E.) alvei. The LEE is an important virulence feature of several animal pathogens. It is an obligate PAI of all animal and human enteropathogenic E. coli (EPEC), and most enterohaemorrhegic E. coli (EHEC) also harbor the LEE. The LEE encodes a type III secretion system, an adhesion (intimin) that mediates the intimate contact between the bacterium and the epithelial cell, as well as various proteins which are secreted via the type III secretion system. The LEE encoded

  20. Pre_GI: a global map of ontological links between horizontally transferred genomic islands in bacterial and archaeal genomes

    PubMed Central

    Pierneef, Rian; Cronje, Louis; Bezuidt, Oliver; Reva, Oleg N.

    2015-01-01

    Abstract The Predicted Genomic Islands database (Pre_GI) is a comprehensive repository of prokaryotic genomic islands (islands, GIs) freely accessible at http://pregi.bi.up.ac.za/index.php . Pre_GI, Version 2015, catalogues 26 744 islands identified in 2407 bacterial/archaeal chromosomes and plasmids. It provides an easy-to-use interface which allows users the ability to query against the database with a variety of fields, parameters and associations. Pre_GI is constructed to be a web-resource for the analysis of ontological roads between islands and cartographic analysis of the global fluxes of mobile genetic elements through bacterial and archaeal taxonomic borders. Comparison of newly identified islands against Pre_GI presents an alternative avenue to identify their ontology, origin and relative time of acquisition. Pre_GI aims to aid research on horizontal transfer events and materials through providing data and tools for holistic investigation of migration of genes through ecological niches and taxonomic boundaries. Database URL: http://pregi.bi.up.ac.za/index.php , Version 2015 PMID:26200753

  1. Defense Islands in Bacterial and Archaeal Genomes and Prediction of Novel Defense Systems ▿†‡

    PubMed Central

    Makarova, Kira S.; Wolf, Yuri I.; Snir, Sagi; Koonin, Eugene V.

    2011-01-01

    The arms race between cellular life forms and viruses is a major driving force of evolution. A substantial fraction of bacterial and archaeal genomes is dedicated to antivirus defense. We analyzed the distribution of defense genes and typical mobilome components (such as viral and transposon genes) in bacterial and archaeal genomes and demonstrated statistically significant clustering of antivirus defense systems and mobile genes and elements in genomic islands. The defense islands are enriched in putative operons and contain numerous overrepresented gene families. A detailed sequence analysis of the proteins encoded by genes in these families shows that many of them are diverged variants of known defense system components, whereas others show features, such as characteristic operonic organization, that are suggestive of novel defense systems. Thus, genomic islands provide abundant material for the experimental study of bacterial and archaeal antivirus defense. Except for the CRISPR-Cas systems, different classes of defense systems, in particular toxin-antitoxin and restriction-modification systems, show nonrandom clustering in defense islands. It remains unclear to what extent these associations reflect functional cooperation between different defense systems and to what extent the islands are genomic “sinks” that accumulate diverse nonessential genes, particularly those acquired via horizontal gene transfer. The characteristics of defense islands resemble those of mobilome islands. Defense and mobilome genes are nonrandomly associated in islands, suggesting nonadaptive evolution of the islands via a preferential attachment-like mechanism underpinned by the addictive properties of defense systems such as toxins-antitoxins and an important role of horizontal mobility in the evolution of these islands. PMID:21908672

  2. Excess of genomic defects in a woolly mammoth on Wrangel island

    PubMed Central

    Slatkin, Montgomery

    2017-01-01

    Woolly mammoths (Mammuthus primigenius) populated Siberia, Beringia, and North America during the Pleistocene and early Holocene. Recent breakthroughs in ancient DNA sequencing have allowed for complete genome sequencing for two specimens of woolly mammoths (Palkopoulou et al. 2015). One mammoth specimen is from a mainland population 45,000 years ago when mammoths were plentiful. The second, a 4300 yr old specimen, is derived from an isolated population on Wrangel island where mammoths subsisted with small effective population size more than 43-fold lower than previous populations. These extreme differences in effective population size offer a rare opportunity to test nearly neutral models of genome architecture evolution within a single species. Using these previously published mammoth sequences, we identify deletions, retrogenes, and non-functionalizing point mutations. In the Wrangel island mammoth, we identify a greater number of deletions, a larger proportion of deletions affecting gene sequences, a greater number of candidate retrogenes, and an increased number of premature stop codons. This accumulation of detrimental mutations is consistent with genomic meltdown in response to low effective population sizes in the dwindling mammoth population on Wrangel island. In addition, we observe high rates of loss of olfactory receptors and urinary proteins, either because these loci are non-essential or because they were favored by divergent selective pressures in island environments. Finally, at the locus of FOXQ1 we observe two independent loss-of-function mutations, which would confer a satin coat phenotype in this island woolly mammoth. PMID:28253255

  3. Comparative genomic analysis shows that Streptococcus suis meningitis isolate SC070731 contains a unique 105K genomic island.

    PubMed

    Wu, Zongfu; Wang, Weixue; Tang, Min; Shao, Jing; Dai, Chen; Zhang, Wei; Fan, Hongjie; Yao, Huochun; Zong, Jie; Chen, Dai; Wang, Junning; Lu, Chengping

    2014-02-10

    Streptococcus suis (SS) is an important swine pathogen worldwide that occasionally causes serious infections in humans. SS infection may result in meningitis in pigs and humans. The pathogenic mechanisms of SS are poorly understood. Here, we provide the complete genome sequence of S. suis serotype 2 (SS2) strain SC070731 isolated from a pig with meningitis. The chromosome is 2,138,568bp in length. There are 1933 predicted protein coding sequences and 96.7% (57/59) of the known virulence-associated genes are present in the genome. Strain SC070731 showed similar virulence with SS2 virulent strains HA9801 and ZY05719, but was more virulent than SS2 virulent strain P1/7 in the zebrafish infection model. Comparative genomic analysis revealed a unique 105K genomic island in strain SC070731 that is absent in seven other sequenced SS2 strains. Further analysis of the 105K genomic island indicated that it contained a complete nisin locus similar to the nisin U locus in S. uberis strain 42, a prophage similar to S. oralis phage PH10 and several antibiotic resistance genes. Several proteins in the 105K genomic island, including nisin and RelBE toxin-antitoxin system, contribute to the bacterial fitness and virulence in other pathogenic bacteria. Further investigation of newly identified gene products, including four putative new virulence-associated surface proteins, will improve our understanding of SS pathogenesis. Copyright © 2013 Elsevier B.V. All rights reserved.

  4. The role of genomic islands in Escherichia coli K1 interactions with intestinal and kidney epithelial cells.

    PubMed

    Yousuf, Farzana Abubakar; Rafiq, Sahar; Siddiqui, Ruqaiyyah; Khan, Naveed Ahmed

    2016-04-01

    The completion of Escherichia coli K1 genome has identified several genomic islands that are present in meningitis-causing E. coli RS218 but absent in the non-pathogenic E. coli MG1655. In this study, the role of various genomic islands in E. coli K1 interactions with intestinal epithelial cells (Caco-2) and kidney epithelial cells (MA104) was determined. Using association assays, invasion assays, and intracellular survival assays, the findings revealed that the genomic island deletion mutants of RS218 related to P fimbriae, S fimbriae, F17-like fimbriae, non-fimbrial adhesins, Hek and hemagglutinin, protein secretion system (T1SS for hemolysin; T2SS; T5SS for antigen 43), Iro system and hmu system), invasins (CNF1, IbeA), toxins (α-hemolysin), K1 capsule biosynthesis, metabolism (d-serine catabolism, dihydroxyacetone, glycerol, and glyoxylate metabolism), prophage genes, showed reduced interactions with both cell types. Next, we determined the role of various genomic islands in E. coli K1 resistance to serum. When exposed to the normal human serum, the viability of the genomic island deletion mutants related to adhesins such as S fimbriae, P fimbriae, F17-like fimbriae, non-fimbrial adhesins, Hek and hemagglutinin, antigen 43 and T5SS for antigen 43, T2SS, and T1SS for hemolysin, Iro system and hmu system, prophage genes, metabolism (sugar metabolism and d-serine catabolism), K1 capsule biosynthesis, and invasins such as CNF1 was affected, suggesting their role in bacteremia. The characterization of these genomic islands should reveal mechanisms of E. coli K1 pathogenicity that could be of value as therapeutic targets. Copyright © 2016 Elsevier Ltd. All rights reserved.

  5. Genomic Evidence for Island Population Conversion Resolves Conflicting Theories of Polar Bear Evolution

    PubMed Central

    Cahill, James A.; Green, Richard E.; Fulton, Tara L.; Stiller, Mathias; Jay, Flora; Ovsyanikov, Nikita; Salamzade, Rauf; St. John, John; Stirling, Ian; Slatkin, Montgomery; Shapiro, Beth

    2013-01-01

    Despite extensive genetic analysis, the evolutionary relationship between polar bears (Ursus maritimus) and brown bears (U. arctos) remains unclear. The two most recent comprehensive reports indicate a recent divergence with little subsequent admixture or a much more ancient divergence followed by extensive admixture. At the center of this controversy are the Alaskan ABC Islands brown bears that show evidence of shared ancestry with polar bears. We present an analysis of genome-wide sequence data for seven polar bears, one ABC Islands brown bear, one mainland Alaskan brown bear, and a black bear (U. americanus), plus recently published datasets from other bears. Surprisingly, we find clear evidence for gene flow from polar bears into ABC Islands brown bears but no evidence of gene flow from brown bears into polar bears. Importantly, while polar bears contributed <1% of the autosomal genome of the ABC Islands brown bear, they contributed 6.5% of the X chromosome. The magnitude of sex-biased polar bear ancestry and the clear direction of gene flow suggest a model wherein the enigmatic ABC Island brown bears are the descendants of a polar bear population that was gradually converted into brown bears via male-dominated brown bear admixture. We present a model that reconciles heretofore conflicting genetic observations. We posit that the enigmatic ABC Islands brown bears derive from a population of polar bears likely stranded by the receding ice at the end of the last glacial period. Since then, male brown bear migration onto the island has gradually converted these bears into an admixed population whose phenotype and genotype are principally brown bear, except at mtDNA and X-linked loci. This process of genome erosion and conversion may be a common outcome when climate change or other forces cause a population to become isolated and then overrun by species with which it can hybridize. PMID:23516372

  6. Genomic evidence for island population conversion resolves conflicting theories of polar bear evolution.

    PubMed

    Cahill, James A; Green, Richard E; Fulton, Tara L; Stiller, Mathias; Jay, Flora; Ovsyanikov, Nikita; Salamzade, Rauf; St John, John; Stirling, Ian; Slatkin, Montgomery; Shapiro, Beth

    2013-01-01

    Despite extensive genetic analysis, the evolutionary relationship between polar bears (Ursus maritimus) and brown bears (U. arctos) remains unclear. The two most recent comprehensive reports indicate a recent divergence with little subsequent admixture or a much more ancient divergence followed by extensive admixture. At the center of this controversy are the Alaskan ABC Islands brown bears that show evidence of shared ancestry with polar bears. We present an analysis of genome-wide sequence data for seven polar bears, one ABC Islands brown bear, one mainland Alaskan brown bear, and a black bear (U. americanus), plus recently published datasets from other bears. Surprisingly, we find clear evidence for gene flow from polar bears into ABC Islands brown bears but no evidence of gene flow from brown bears into polar bears. Importantly, while polar bears contributed <1% of the autosomal genome of the ABC Islands brown bear, they contributed 6.5% of the X chromosome. The magnitude of sex-biased polar bear ancestry and the clear direction of gene flow suggest a model wherein the enigmatic ABC Island brown bears are the descendants of a polar bear population that was gradually converted into brown bears via male-dominated brown bear admixture. We present a model that reconciles heretofore conflicting genetic observations. We posit that the enigmatic ABC Islands brown bears derive from a population of polar bears likely stranded by the receding ice at the end of the last glacial period. Since then, male brown bear migration onto the island has gradually converted these bears into an admixed population whose phenotype and genotype are principally brown bear, except at mtDNA and X-linked loci. This process of genome erosion and conversion may be a common outcome when climate change or other forces cause a population to become isolated and then overrun by species with which it can hybridize.

  7. Methyl-CpG island-associated genome signature tags

    DOEpatents

    Dunn, John J

    2014-05-20

    Disclosed is a method for analyzing the organismic complexity of a sample through analysis of the nucleic acid in the sample. In the disclosed method, through a series of steps, including digestion with a type II restriction enzyme, ligation of capture adapters and linkers and digestion with a type IIS restriction enzyme, genome signature tags are produced. The sequences of a statistically significant number of the signature tags are determined and the sequences are used to identify and quantify the organisms in the sample. Various embodiments of the invention described herein include methods for using single point genome signature tags to analyze the related families present in a sample, methods for analyzing sequences associated with hyper- and hypo-methylated CpG islands, methods for visualizing organismic complexity change in a sampling location over time and methods for generating the genome signature tag profile of a sample of fragmented DNA.

  8. A novel family of integrases associated with prophages and genomic islands integrated within the tRNA-dihydrouridine synthase A (dusA) gene

    PubMed Central

    Farrugia, Daniel N.; Elbourne, Liam D. H.; Mabbutt, Bridget C.; Paulsen, Ian T.

    2015-01-01

    Genomic islands play a key role in prokaryotic genome plasticity. Genomic islands integrate into chromosomal loci such as transfer RNA genes and protein coding genes, whilst retaining various cargo genes that potentially bestow novel functions on the host organism. A gene encoding a putative integrase was identified at a single site within the 5′ end of the dusA gene in the genomes of over 200 bacteria. This integrase was discovered to be a component of numerous genomic islands, which appear to share a target site within the dusA gene. dusA encodes the tRNA-dihydrouridine synthase A enzyme, which catalyses the post-transcriptional reduction of uridine to dihydrouridine in tRNA. Genomic islands encoding homologous dusA-associated integrases were found at a much lower frequency within the related dusB and dusC genes, and non-dus genes. Excision of these dusA-associated islands from the chromosome as circularized intermediates was confirmed by polymerase chain reaction. Analysis of the dusA-associated islands indicated that they were highly diverse, with the integrase gene representing the only universal common feature. PMID:25883135

  9. CpG island mapping by epigenome prediction.

    PubMed

    Bock, Christoph; Walter, Jörn; Paulsen, Martina; Lengauer, Thomas

    2007-06-01

    CpG islands were originally identified by epigenetic and functional properties, namely, absence of DNA methylation and frequent promoter association. However, this concept was quickly replaced by simple DNA sequence criteria, which allowed for genome-wide annotation of CpG islands in the absence of large-scale epigenetic datasets. Although widely used, the current CpG island criteria incur significant disadvantages: (1) reliance on arbitrary threshold parameters that bear little biological justification, (2) failure to account for widespread heterogeneity among CpG islands, and (3) apparent lack of specificity when applied to the human genome. This study is driven by the idea that a quantitative score of "CpG island strength" that incorporates epigenetic and functional aspects can help resolve these issues. We construct an epigenome prediction pipeline that links the DNA sequence of CpG islands to their epigenetic states, including DNA methylation, histone modifications, and chromatin accessibility. By training support vector machines on epigenetic data for CpG islands on human Chromosomes 21 and 22, we identify informative DNA attributes that correlate with open versus compact chromatin structures. These DNA attributes are used to predict the epigenetic states of all CpG islands genome-wide. Combining predictions for multiple epigenetic features, we estimate the inherent CpG island strength for each CpG island in the human genome, i.e., its inherent tendency to exhibit an open and transcriptionally competent chromatin structure. We extensively validate our results on independent datasets, showing that the CpG island strength predictions are applicable and informative across different tissues and cell types, and we derive improved maps of predicted "bona fide" CpG islands. The mapping of CpG islands by epigenome prediction is conceptually superior to identifying CpG islands by widely used sequence criteria since it links CpG island detection to their characteristic

  10. Prevalence of Avian-Pathogenic Escherichia coli Strain O1 Genomic Islands among Extraintestinal and Commensal E. coli Isolates

    PubMed Central

    Johnson, Timothy J.; Wannemuehler, Yvonne; Kariyawasam, Subhashinie; Johnson, James R.; Logue, Catherine M.

    2012-01-01

    Escherichia coli strains that cause disease outside the intestine are known as extraintestinal pathogenic E. coli (ExPEC) and include pathogens of humans and animals. Previously, the genome of avian-pathogenic E. coli (APEC) O1:K1:H7 strain O1, from ST95, was sequenced and compared to those of several other E. coli strains, identifying 43 genomic islands. Here, the genomic islands of APEC O1 were compared to those of other sequenced E. coli strains, and the distribution of 81 genes belonging to 12 APEC O1 genomic islands among 828 human and avian ExPEC and commensal E. coli isolates was determined. Multiple islands were highly prevalent among isolates belonging to the O1 and O18 serogroups within phylogenetic group B2, which are implicated in human neonatal meningitis. Because of the extensive genomic similarities between APEC O1 and other human ExPEC strains belonging to the ST95 phylogenetic lineage, its ability to cause disease in a rat model of sepsis and meningitis was assessed. Unlike other ST95 lineage strains, APEC O1 was unable to cause bacteremia or meningitis in the neonatal rat model and was significantly less virulent than uropathogenic E. coli (UPEC) CFT073 in a mouse sepsis model, despite carrying multiple neonatal meningitis E. coli (NMEC) virulence factors and belonging to the ST95 phylogenetic lineage. These results suggest that host adaptation or genome modifications have occurred either in APEC O1 or in highly virulent ExPEC isolates, resulting in differences in pathogenicity. Overall, the genomic islands examined provide targets for further discrimination of the different ExPEC subpathotypes, serogroups, phylogenetic types, and sequence types. PMID:22467781

  11. Prevalence of avian-pathogenic Escherichia coli strain O1 genomic islands among extraintestinal and commensal E. coli isolates.

    PubMed

    Johnson, Timothy J; Wannemuehler, Yvonne; Kariyawasam, Subhashinie; Johnson, James R; Logue, Catherine M; Nolan, Lisa K

    2012-06-01

    Escherichia coli strains that cause disease outside the intestine are known as extraintestinal pathogenic E. coli (ExPEC) and include pathogens of humans and animals. Previously, the genome of avian-pathogenic E. coli (APEC) O1:K1:H7 strain O1, from ST95, was sequenced and compared to those of several other E. coli strains, identifying 43 genomic islands. Here, the genomic islands of APEC O1 were compared to those of other sequenced E. coli strains, and the distribution of 81 genes belonging to 12 APEC O1 genomic islands among 828 human and avian ExPEC and commensal E. coli isolates was determined. Multiple islands were highly prevalent among isolates belonging to the O1 and O18 serogroups within phylogenetic group B2, which are implicated in human neonatal meningitis. Because of the extensive genomic similarities between APEC O1 and other human ExPEC strains belonging to the ST95 phylogenetic lineage, its ability to cause disease in a rat model of sepsis and meningitis was assessed. Unlike other ST95 lineage strains, APEC O1 was unable to cause bacteremia or meningitis in the neonatal rat model and was significantly less virulent than uropathogenic E. coli (UPEC) CFT073 in a mouse sepsis model, despite carrying multiple neonatal meningitis E. coli (NMEC) virulence factors and belonging to the ST95 phylogenetic lineage. These results suggest that host adaptation or genome modifications have occurred either in APEC O1 or in highly virulent ExPEC isolates, resulting in differences in pathogenicity. Overall, the genomic islands examined provide targets for further discrimination of the different ExPEC subpathotypes, serogroups, phylogenetic types, and sequence types.

  12. Draft Genome of the Scarab Beetle Oryctes borbonicus on La Réunion Island

    PubMed Central

    Meyer, Jan M.; Markov, Gabriel V.; Baskaran, Praveen; Herrmann, Matthias; Sommer, Ralf J.; Rödelsperger, Christian

    2016-01-01

    Beetles represent the largest insect order and they display extreme morphological, ecological and behavioral diversity, which makes them ideal models for evolutionary studies. Here, we present the draft genome of the scarab beetle Oryctes borbonicus, which has a more basal phylogenetic position than the two previously sequenced pest species Tribolium castaneum and Dendroctonus ponderosae providing the potential for sequence polarization. Oryctes borbonicus is endemic to La Réunion, an island located in the Indian Ocean, and is the host of the nematode Pristionchus pacificus, a well-established model organism for integrative evolutionary biology. At 518 Mb, the O. borbonicus genome is substantially larger and encodes more genes than T. castaneum and D. ponderosae. We found that only 25% of the predicted genes of O. borbonicus are conserved as single copy genes across the nine investigated insect genomes, suggesting substantial gene turnover within insects. Even within beetles, up to 21% of genes are restricted to only one species, whereas most other genes have undergone lineage-specific duplications and losses. We illustrate lineage-specific duplications using detailed phylogenetic analysis of two gene families. This study serves as a reference point for insect/coleopteran genomics, although its original motivation was to find evidence for potential horizontal gene transfer (HGT) between O. borbonicus and P. pacificus. The latter was previously shown to be the recipient of multiple horizontally transferred genes including some genes from insect donors. However, our study failed to provide any clear evidence for additional HGTs between the two species. PMID:27289092

  13. GI-SVM: A sensitive method for predicting genomic islands based on unannotated sequence of a single genome.

    PubMed

    Lu, Bingxin; Leong, Hon Wai

    2016-02-01

    Genomic islands (GIs) are clusters of functionally related genes acquired by lateral genetic transfer (LGT), and they are present in many bacterial genomes. GIs are extremely important for bacterial research, because they not only promote genome evolution but also contain genes that enhance adaption and enable antibiotic resistance. Many methods have been proposed to predict GI. But most of them rely on either annotations or comparisons with other closely related genomes. Hence these methods cannot be easily applied to new genomes. As the number of newly sequenced bacterial genomes rapidly increases, there is a need for methods to detect GI based solely on sequences of a single genome. In this paper, we propose a novel method, GI-SVM, to predict GIs given only the unannotated genome sequence. GI-SVM is based on one-class support vector machine (SVM), utilizing composition bias in terms of k-mer content. From our evaluations on three real genomes, GI-SVM can achieve higher recall compared with current methods, without much loss of precision. Besides, GI-SVM allows flexible parameter tuning to get optimal results for each genome. In short, GI-SVM provides a more sensitive method for researchers interested in a first-pass detection of GI in newly sequenced genomes.

  14. Campylobacter fetus subspecies contain conserved type IV secretion systems on multiple genomic islands and plasmids

    USDA-ARS?s Scientific Manuscript database

    The features contributing to the differences in pathogenicity of the C. fetus subspecies are unknown. Putative factors involved in pathogenesis are located in genomic islands that encode type IV secretion system (T4SS) and fic-domain (filamentation induced by cyclic AMP) proteins. In the genomes of ...

  15. Mitochondrial genomes suggest rapid evolution of dwarf California Channel Islands foxes (Urocyon littoralis).

    PubMed

    Hofman, Courtney A; Rick, Torben C; Hawkins, Melissa T R; Funk, W Chris; Ralls, Katherine; Boser, Christina L; Collins, Paul W; Coonan, Tim; King, Julie L; Morrison, Scott A; Newsome, Seth D; Sillett, T Scott; Fleischer, Robert C; Maldonado, Jesus E

    2015-01-01

    Island endemics are typically differentiated from their mainland progenitors in behavior, morphology, and genetics, often resulting from long-term evolutionary change. To examine mechanisms for the origins of island endemism, we present a phylogeographic analysis of whole mitochondrial genomes from the endangered island fox (Urocyon littoralis), endemic to California's Channel Islands, and mainland gray foxes (U. cinereoargenteus). Previous genetic studies suggested that foxes first appeared on the islands >16,000 years ago, before human arrival (~13,000 cal BP), while archaeological and paleontological data supported a colonization >7000 cal BP. Our results are consistent with initial fox colonization of the northern islands probably by rafting or human introduction ~9200-7100 years ago, followed quickly by human translocation of foxes from the northern to southern Channel Islands. Mitogenomes indicate that island foxes are monophyletic and most closely related to gray foxes from northern California that likely experienced a Holocene climate-induced range shift. Our data document rapid morphological evolution of island foxes (in ~2000 years or less). Despite evidence for bottlenecks, island foxes have generated and maintained multiple mitochondrial haplotypes. This study highlights the intertwined evolutionary history of island foxes and humans, and illustrates a new approach for investigating the evolutionary histories of other island endemics.

  16. Mitochondrial Genomes Suggest Rapid Evolution of Dwarf California Channel Islands Foxes (Urocyon littoralis)

    PubMed Central

    Hofman, Courtney A.; Rick, Torben C.; Hawkins, Melissa T. R.; Funk, W. Chris; Ralls, Katherine; Boser, Christina L.; Collins, Paul W.; Coonan, Tim; King, Julie L.; Morrison, Scott A.; Newsome, Seth D.; Sillett, T. Scott; Fleischer, Robert C.; Maldonado, Jesus E.

    2015-01-01

    Island endemics are typically differentiated from their mainland progenitors in behavior, morphology, and genetics, often resulting from long-term evolutionary change. To examine mechanisms for the origins of island endemism, we present a phylogeographic analysis of whole mitochondrial genomes from the endangered island fox (Urocyon littoralis), endemic to California’s Channel Islands, and mainland gray foxes (U. cinereoargenteus). Previous genetic studies suggested that foxes first appeared on the islands >16,000 years ago, before human arrival (~13,000 cal BP), while archaeological and paleontological data supported a colonization >7000 cal BP. Our results are consistent with initial fox colonization of the northern islands probably by rafting or human introduction ~9200–7100 years ago, followed quickly by human translocation of foxes from the northern to southern Channel Islands. Mitogenomes indicate that island foxes are monophyletic and most closely related to gray foxes from northern California that likely experienced a Holocene climate-induced range shift. Our data document rapid morphological evolution of island foxes (in ~2000 years or less). Despite evidence for bottlenecks, island foxes have generated and maintained multiple mitochondrial haplotypes. This study highlights the intertwined evolutionary history of island foxes and humans, and illustrates a new approach for investigating the evolutionary histories of other island endemics. PMID:25714775

  17. Compositional searching of CpG islands in the human genome

    NASA Astrophysics Data System (ADS)

    Luque-Escamilla, Pedro Luis; Martínez-Aroza, José; Oliver, José L.; Gómez-Lopera, Juan Francisco; Román-Roldán, Ramón

    2005-06-01

    We report on an entropic edge detector based on the local calculation of the Jensen-Shannon divergence with application to the search for CpG islands. CpG islands are pieces of the genome related to gene expression and cell differentiation, and thus to cancer formation. Searching for these CpG islands is a major task in genetics and bioinformatics. Some algorithms have been proposed in the literature, based on moving statistics in a sliding window, but its size may greatly influence the results. The local use of Jensen-Shannon divergence is a completely different strategy: the nucleotide composition inside the islands is different from that in their environment, so a statistical distance—the Jensen-Shannon divergence—between the composition of two adjacent windows may be used as a measure of their dissimilarity. Sliding this double window over the entire sequence allows us to segment it compositionally. The fusion of those segments into greater ones that satisfy certain identification criteria must be achieved in order to obtain the definitive results. We find that the local use of Jensen-Shannon divergence is very suitable in processing DNA sequences for searching for compositionally different structures such as CpG islands, as compared to other algorithms in literature.

  18. Identification of Novel Genomic Islands in Liverpool Epidemic Strain of Pseudomonas aeruginosa Using Segmentation and Clustering

    PubMed Central

    Jani, Mehul; Mathee, Kalai; Azad, Rajeev K.

    2016-01-01

    Pseudomonas aeruginosa is an opportunistic pathogen implicated in a myriad of infections and a leading pathogen responsible for mortality in patients with cystic fibrosis (CF). Horizontal transfers of genes among the microorganisms living within CF patients have led to highly virulent and multi-drug resistant strains such as the Liverpool epidemic strain of P. aeruginosa, namely the LESB58 strain that has the propensity to acquire virulence and antibiotic resistance genes. Often these genes are acquired in large clusters, referred to as “genomic islands (GIs).” To decipher GIs and understand their contributions to the evolution of virulence and antibiotic resistance in P. aeruginosa LESB58, we utilized a recursive segmentation and clustering procedure, presented here as a genome-mining tool, “GEMINI.” GEMINI was validated on experimentally verified islands in the LESB58 strain before examining its potential to decipher novel islands. Of the 6062 genes in P. aeruginosa LESB58, 596 genes were identified to be resident on 20 GIs of which 12 have not been previously reported. Comparative genomics provided evidence in support of our novel predictions. Furthermore, GEMINI unraveled the mosaic structure of islands that are composed of segments of likely different evolutionary origins, and demonstrated its ability to identify potential strain biomarkers. These newly found islands likely have contributed to the hyper-virulence and multidrug resistance of the Liverpool epidemic strain of P. aeruginosa. PMID:27536294

  19. A genomic island in Vibrio cholerae with VPI-1 site-specific recombination characteristics contains CRISPR-Cas and type VI secretion modules

    PubMed Central

    Labbate, Maurizio; Orata, Fabini D.; Petty, Nicola K.; Jayatilleke, Nathasha D.; King, William L.; Kirchberger, Paul C.; Allen, Chris; Mann, Gulay; Mutreja, Ankur; Thomson, Nicholas R.; Boucher, Yan; Charles, Ian G.

    2016-01-01

    Cholera is a devastating diarrhoeal disease caused by certain strains of serogroup O1/O139 Vibrio cholerae. Mobile genetic elements such as genomic islands (GIs) have been pivotal in the evolution of O1/O139 V. cholerae. Perhaps the most important GI involved in cholera disease is the V. cholerae pathogenicity island 1 (VPI-1). This GI contains the toxin-coregulated pilus (TCP) gene cluster that is necessary for colonization of the human intestine as well as being the receptor for infection by the cholera-toxin bearing CTX phage. In this study, we report a GI (designated GIVchS12) from a non-O1/O139 strain of V. cholerae that is present in the same chromosomal location as VPI-1, contains an integrase gene with 94% nucleotide and 100% protein identity to the VPI-1 integrase, and attachment (att) sites 100% identical to those found in VPI-1. However, instead of TCP and the other accessory genes present in VPI-1, GIVchS12 contains a CRISPR-Cas element and a type VI secretion system (T6SS). GIs similar to GIVchS12 were identified in other V. cholerae genomes, also containing CRISPR-Cas elements and/or T6SS’s. This study highlights the diversity of GIs circulating in natural V. cholerae populations and identifies GIs with VPI-1 recombination characteristics as a propagator of CRISPR-Cas and T6SS modules. PMID:27845364

  20. Genome organization of epidemic Acinetobacter baumannii strains.

    PubMed

    Di Nocera, Pier Paolo; Rocco, Francesco; Giannouli, Maria; Triassi, Maria; Zarrilli, Raffaele

    2011-10-10

    Acinetobacter baumannii is an opportunistic pathogen responsible for hospital-acquired infections. A. baumannii epidemics described world-wide were caused by few genotypic clusters of strains. The occurrence of epidemics caused by multi-drug resistant strains assigned to novel genotypes have been reported over the last few years. In the present study, we compared whole genome sequences of three A. baumannii strains assigned to genotypes ST2, ST25 and ST78, representative of the most frequent genotypes responsible for epidemics in several Mediterranean hospitals, and four complete genome sequences of A. baumannii strains assigned to genotypes ST1, ST2 and ST77. Comparative genome analysis showed extensive synteny and identified 3068 coding regions which are conserved, at the same chromosomal position, in all A. baumannii genomes. Genome alignments also identified 63 DNA regions, ranging in size from 4 o 126 kb, all defined as genomic islands, which were present in some genomes, but were either missing or replaced by non-homologous DNA sequences in others. Some islands are involved in resistance to drugs and metals, others carry genes encoding surface proteins or enzymes involved in specific metabolic pathways, and others correspond to prophage-like elements. Accessory DNA regions encode 12 to 19% of the potential gene products of the analyzed strains. The analysis of a collection of epidemic A. baumannii strains showed that some islands were restricted to specific genotypes. The definition of the genome components of A. baumannii provides a scaffold to rapidly evaluate the genomic organization of novel clinical A. baumannii isolates. Changes in island profiling will be useful in genomic epidemiology of A. baumannii population.

  1. Genomic islands of differentiation in two songbird species reveal candidate genes for hybrid female sterility.

    PubMed

    Mořkovský, Libor; Janoušek, Václav; Reif, Jiří; Rídl, Jakub; Pačes, Jan; Choleva, Lukáš; Janko, Karel; Nachman, Michael W; Reifová, Radka

    2018-02-01

    Hybrid sterility is a common first step in the evolution of postzygotic reproductive isolation. According to Haldane's Rule, it affects predominantly the heterogametic sex. While the genetic basis of hybrid male sterility in organisms with heterogametic males has been studied for decades, the genetic basis of hybrid female sterility in organisms with heterogametic females has received much less attention. We investigated the genetic basis of reproductive isolation in two closely related avian species, the common nightingale (Luscinia megarhynchos) and the thrush nightingale (L. luscinia), that hybridize in a secondary contact zone and produce viable hybrid progeny. In accordance with Haldane's Rule, hybrid females are sterile, while hybrid males are fertile, allowing gene flow to occur between the species. Using transcriptomic data from multiple individuals of both nightingale species, we identified genomic islands of high differentiation (F ST ) and of high divergence (D xy ), and we analysed gene content and patterns of molecular evolution within these islands. Interestingly, we found that these islands were enriched for genes related to female meiosis and metabolism. The islands of high differentiation and divergence were also characterized by higher levels of linkage disequilibrium than the rest of the genome in both species indicating that they might be situated in genomic regions of low recombination. This study provides one of the first insights into genetic basis of hybrid female sterility in organisms with heterogametic females. © 2018 John Wiley & Sons Ltd.

  2. Phenotype-specific CpG island methylation events in a murine model of prostate cancer.

    PubMed

    Camoriano, Marta; Kinney, Shannon R Morey; Moser, Michael T; Foster, Barbara A; Mohler, James L; Trump, Donald L; Karpf, Adam R; Smiraglia, Dominic J

    2008-06-01

    Aberrant DNA methylation plays a significant role in nearly all human cancers and may contribute to disease progression to advanced phenotypes. Study of advanced prostate cancer phenotypes in the human disease is hampered by limited availability of tissues. We therefore took advantage of the Transgenic Adenocarcinoma of Mouse Prostate (TRAMP) model to study whether three different phenotypes of TRAMP tumors (PRIM, late-stage primary tumors; AIP, androgen-independent primary tumors; and MET, metastases) displayed specific patterns of CpG island hypermethylation using Restriction Landmark Genomic Scanning. Each tumor phenotype displayed numerous hypermethylation events, with the most homogeneous methylation pattern in AIP and the most heterogeneous pattern in MET. Several loci displayed a phenotype-specific methylation pattern; the most striking pattern being loci methylated at high frequency in PRIM and AIP but rarely in MET. Examination of the mRNA expression of three genes, BC058385, Goosecoid, and Neurexin 2, which exhibited nonpromoter methylation, revealed increased expression associated with downstream methylation. Only methylated samples showed mRNA expression, in which tumor phenotype was a key factor determining the level of expression. The CpG island in the human orthologue of BC058385 was methylated in human AIP but not in primary androgen-stimulated prostate cancer or benign prostate. The clinical data show a proof-of-principle that the TRAMP model can be used to identify targets of aberrant CpG island methylation relevant to human disease. In conclusion, phenotype-specific hypermethylation events were associated with the overexpression of different genes and may provide new markers of prostate tumorigenesis.

  3. Island-Model Genomic Selection for Long-Term Genetic Improvement of Autogamous Crops.

    PubMed

    Yabe, Shiori; Yamasaki, Masanori; Ebana, Kaworu; Hayashi, Takeshi; Iwata, Hiroyoshi

    2016-01-01

    Acceleration of genetic improvement of autogamous crops such as wheat and rice is necessary to increase cereal production in response to the global food crisis. Population and pedigree methods of breeding, which are based on inbred line selection, are used commonly in the genetic improvement of autogamous crops. These methods, however, produce a few novel combinations of genes in a breeding population. Recurrent selection promotes recombination among genes and produces novel combinations of genes in a breeding population, but it requires inaccurate single-plant evaluation for selection. Genomic selection (GS), which can predict genetic potential of individuals based on their marker genotype, might have high reliability of single-plant evaluation and might be effective in recurrent selection. To evaluate the efficiency of recurrent selection with GS, we conducted simulations using real marker genotype data of rice cultivars. Additionally, we introduced the concept of an "island model" inspired by evolutionary algorithms that might be useful to maintain genetic variation through the breeding process. We conducted GS simulations using real marker genotype data of rice cultivars to evaluate the efficiency of recurrent selection and the island model in an autogamous species. Results demonstrated the importance of producing novel combinations of genes through recurrent selection. An initial population derived from admixture of multiple bi-parental crosses showed larger genetic gains than a population derived from a single bi-parental cross in whole cycles, suggesting the importance of genetic variation in an initial population. The island-model GS better maintained genetic improvement in later generations than the other GS methods, suggesting that the island-model GS can utilize genetic variation in breeding and can retain alleles with small effects in the breeding population. The island-model GS will become a new breeding method that enhances the potential of genomic

  4. Island-Model Genomic Selection for Long-Term Genetic Improvement of Autogamous Crops

    PubMed Central

    Yabe, Shiori; Yamasaki, Masanori; Ebana, Kaworu; Hayashi, Takeshi; Iwata, Hiroyoshi

    2016-01-01

    Acceleration of genetic improvement of autogamous crops such as wheat and rice is necessary to increase cereal production in response to the global food crisis. Population and pedigree methods of breeding, which are based on inbred line selection, are used commonly in the genetic improvement of autogamous crops. These methods, however, produce a few novel combinations of genes in a breeding population. Recurrent selection promotes recombination among genes and produces novel combinations of genes in a breeding population, but it requires inaccurate single-plant evaluation for selection. Genomic selection (GS), which can predict genetic potential of individuals based on their marker genotype, might have high reliability of single-plant evaluation and might be effective in recurrent selection. To evaluate the efficiency of recurrent selection with GS, we conducted simulations using real marker genotype data of rice cultivars. Additionally, we introduced the concept of an “island model” inspired by evolutionary algorithms that might be useful to maintain genetic variation through the breeding process. We conducted GS simulations using real marker genotype data of rice cultivars to evaluate the efficiency of recurrent selection and the island model in an autogamous species. Results demonstrated the importance of producing novel combinations of genes through recurrent selection. An initial population derived from admixture of multiple bi-parental crosses showed larger genetic gains than a population derived from a single bi-parental cross in whole cycles, suggesting the importance of genetic variation in an initial population. The island-model GS better maintained genetic improvement in later generations than the other GS methods, suggesting that the island-model GS can utilize genetic variation in breeding and can retain alleles with small effects in the breeding population. The island-model GS will become a new breeding method that enhances the potential of

  5. Genome-wide identification of runs of homozygosity islands and associated genes in local dairy cattle breeds.

    PubMed

    Mastrangelo, S; Sardina, M T; Tolone, M; Di Gerlando, R; Sutera, A M; Fontanesi, L; Portolano, B

    2018-03-26

    Runs of homozygosity (ROH) are widely used as predictors of whole-genome inbreeding levels in cattle. They identify regions that have an unfavorable effect on a phenotype when homozygous, but also identify the genes associated with traits of economic interest present in these regions. Here, the distribution of ROH islands and enriched genes within these regions in four dairy cattle breeds were investigated. Cinisara (71), Modicana (72), Reggiana (168) and Italian Holstein (96) individuals were genotyped using the 50K v2 Illumina BeadChip. The genomic regions most commonly associated with ROHs were identified by selecting the top 1% of the single nucleotide polymorphisms (SNPs) most commonly observed in the ROH of each breed. In total, 11 genomic regions were identified in Cinisara and Italian Holstein, and eight in Modicana and Reggiana, indicating an increased ROH frequency level. Generally, ROH islands differed between breeds. The most homozygous region (>45% of individuals with ROH) was found in Modicana on chromosome 6 within a quantitative trail locus affecting milk fat and protein concentrations. We identified between 126 and 347 genes within ROH islands, which are involved in multiple signaling and signal transduction pathways in a wide variety of biological processes. The gene ontology enrichment provided information on possible molecular functions, biological processes and cellular components under selection related to milk production, reproduction, immune response and resistance/susceptibility to infection and diseases. Thus, scanning the genome for ROH could be an alternative strategy to detect genomic regions and genes related to important economic traits.

  6. Extensive genome rearrangements and multiple horizontal gene transfers in a population of pyrococcus isolates from Vulcano Island, Italy.

    PubMed

    White, James R; Escobar-Paramo, Patricia; Mongodin, Emmanuel F; Nelson, Karen E; DiRuggiero, Jocelyne

    2008-10-01

    The extent of chromosome rearrangements in Pyrococcus isolates from marine hydrothermal vents in Vulcano Island, Italy, was evaluated by high-throughput genomic methods. The results illustrate the dynamic nature of the genomes of the genus Pyrococcus and raise the possibility of a connection between rapidly changing environmental conditions and adaptive genomic properties.

  7. Draft Genome of Rhodococcus rhodochrous TRN7, Isolated from the Coast of Trindade Island, Brazil

    PubMed Central

    Rodrigues, Edmo M.; Pylro, Victor S.; Dobbler, Priscila T.; Victoria, Filipe

    2016-01-01

    Here, we present a draft genome and annotation of Rhodococcus rhodochrous TRN7, isolated from Trindade Island, Brazil, which will provide genetic data to benefit the understanding of its metabolism. PMID:26941155

  8. Reconstructing Demography and Social Behavior During the Neolithic Expansion from Genomic Diversity Across Island Southeast Asia.

    PubMed

    Vallée, François; Luciani, Aurélien; Cox, Murray P

    2016-12-01

    Archaeology, linguistics, and increasingly genetics are clarifying how populations moved from mainland Asia, through Island Southeast Asia, and out into the Pacific during the farming revolution. Yet key features of this process remain poorly understood, particularly how social behaviors intersected with demographic drivers to create the patterns of genomic diversity observed across Island Southeast Asia today. Such questions are ripe for computer modeling. Here, we construct an agent-based model to simulate human mobility across Island Southeast Asia from the Neolithic period to the present, with a special focus on interactions between individuals with Asian, Papuan, and mixed Asian-Papuan ancestry. Incorporating key features of the region, including its complex geography (islands and sea), demographic drivers (fecundity and migration), and social behaviors (marriage preferences), the model simultaneously tracks a full suite of genomic markers (autosomes, X chromosome, mitochondrial DNA, and Y chromosome). Using Bayesian inference, model parameters were determined that produce simulations that closely resemble the admixture profiles of 2299 individuals from 84 populations across Island Southeast Asia. The results highlight that greater propensity to migrate and elevated birth rates are related drivers behind the expansion of individuals with Asian ancestry relative to individuals with Papuan ancestry, that offspring preferentially resulted from marriages between Asian women and Papuan men, and that in contrast to current thinking, individuals with Asian ancestry were likely distributed across large parts of western Island Southeast Asia before the Neolithic expansion. Copyright © 2016 Vallée et al.

  9. Reconstructing Demography and Social Behavior During the Neolithic Expansion from Genomic Diversity Across Island Southeast Asia

    PubMed Central

    Vallée, François; Luciani, Aurélien; Cox, Murray P.

    2016-01-01

    Archaeology, linguistics, and increasingly genetics are clarifying how populations moved from mainland Asia, through Island Southeast Asia, and out into the Pacific during the farming revolution. Yet key features of this process remain poorly understood, particularly how social behaviors intersected with demographic drivers to create the patterns of genomic diversity observed across Island Southeast Asia today. Such questions are ripe for computer modeling. Here, we construct an agent-based model to simulate human mobility across Island Southeast Asia from the Neolithic period to the present, with a special focus on interactions between individuals with Asian, Papuan, and mixed Asian–Papuan ancestry. Incorporating key features of the region, including its complex geography (islands and sea), demographic drivers (fecundity and migration), and social behaviors (marriage preferences), the model simultaneously tracks a full suite of genomic markers (autosomes, X chromosome, mitochondrial DNA, and Y chromosome). Using Bayesian inference, model parameters were determined that produce simulations that closely resemble the admixture profiles of 2299 individuals from 84 populations across Island Southeast Asia. The results highlight that greater propensity to migrate and elevated birth rates are related drivers behind the expansion of individuals with Asian ancestry relative to individuals with Papuan ancestry, that offspring preferentially resulted from marriages between Asian women and Papuan men, and that in contrast to current thinking, individuals with Asian ancestry were likely distributed across large parts of western Island Southeast Asia before the Neolithic expansion. PMID:27683274

  10. Microdiversification of a Pelagic Polynucleobacter Species Is Mainly Driven by Acquisition of Genomic Islands from a Partially Interspecific Gene Pool

    PubMed Central

    Schmidt, Johanna; Jezberová, Jitka; Koll, Ulrike; Hahn, Martin W.

    2016-01-01

    ABSTRACT Microdiversification of a planktonic freshwater bacterium was studied by comparing 37 Polynucleobacter asymbioticus strains obtained from three geographically separated sites in the Austrian Alps. Genome comparison of nine strains revealed a core genome of 1.8 Mb, representing 81% of the average genome size. Seventy-five percent of the remaining flexible genome is clustered in genomic islands (GIs). Twenty-four genomic positions could be identified where GIs are potentially located. These positions are occupied strain specifically from a set of 28 GI variants, classified according to similarities in their gene content. One variant, present in 62% of the isolates, encodes a pathway for the degradation of aromatic compounds, and another, found in 78% of the strains, contains an operon for nitrate assimilation. Both variants were shown in ecophysiological tests to be functional, thus providing the potential for microniche partitioning. In addition, detected interspecific horizontal exchange of GIs indicates a large gene pool accessible to Polynucleobacter species. In contrast to core genes, GIs are spread more successfully across spatially separated freshwater habitats. The mobility and functional diversity of GIs allow for rapid evolution, which may be a key aspect for the ubiquitous occurrence of Polynucleobacter bacteria. IMPORTANCE Assessing the ecological relevance of bacterial diversity is a key challenge for current microbial ecology. The polyphasic approach which was applied in this study, including targeted isolation of strains, genome analysis, and ecophysiological tests, is crucial for the linkage of genetic and ecological knowledge. Particularly great importance is attached to the high number of closely related strains which were investigated, represented by genome-wide average nucleotide identities (ANI) larger than 97%. The extent of functional diversification found on this narrow phylogenetic scale is compelling. Moreover, the transfer of

  11. Identification of genomic islands in six plant pathogens.

    PubMed

    Chen, Ling-Ling

    2006-06-07

    Genomic islands (GIs) play important roles in microbial evolution, which are acquired by horizontal gene transfer. In this paper, the GIs of six completely sequenced plant pathogens are identified using a windowless method based on Z curve representation of DNA sequences. Consequently, four, eight, four, one, two and four GIs are recognized with the length greater than 20-Kb in plant pathogens Agrobacterium tumefaciens str. C58, Rolstonia solanacearum GMI1000, Xanthomonas axonopodis pv. citri str. 306 (Xac), Xanthomonas campestris pv. campestris str. ATCC33913 (Xcc), Xylella fastidiosa 9a5c and Pseudomonas syringae pv. tomato str. DC3000, respectively. Most of these regions share a set of conserved features of GIs, including an abrupt change in GC content compared with that of the rest of the genome, the existence of integrase genes at the junction, the use of tRNA as the integration sites, the presence of genetic mobility genes, the difference of codon usage, codon preference and amino acid usage, etc. The identification of these GIs will benefit the research for the six important phytopathogens.

  12. Comparative analysis of tandem T7-like promoter containing regions in enterobacterial genomes reveals a novel group of genetic islands | Center for Cancer Research

    Cancer.gov

    Twelve prophage-like T7 islands have been discovered in pathogenic bacterial genomes. These islands contain two or three tandem T7-like promoters that should be activated when a bacterial cell is infected by bacteriophage T7 or a related phage. The illustration shows genetic maps for four of the islands, Ty2, BS512, E22 and ECA, which are found in the genomes of S. enterica

  13. Interactions of neuropathogenic Escherichia coli K1 (RS218) and its derivatives lacking genomic islands with phagocytic Acanthamoeba castellanii and nonphagocytic brain endothelial cells.

    PubMed

    Yousuf, Farzana Abubakar; Yousuf, Zuhair; Iqbal, Junaid; Siddiqui, Ruqaiyyah; Khan, Hafsa; Khan, Naveed Ahmed

    2014-01-01

    Here we determined the role of various genomic islands in E. coli K1 interactions with phagocytic A. castellanii and nonphagocytic brain microvascular endothelial cells. The findings revealed that the genomic islands deletion mutants of RS218 related to toxins (peptide toxin, α -hemolysin), adhesins (P fimbriae, F17-like fimbriae, nonfimbrial adhesins, Hek, and hemagglutinin), protein secretion system (T1SS for hemolysin), invasins (IbeA, CNF1), metabolism (D-serine catabolism, dihydroxyacetone, glycerol, and glyoxylate metabolism) showed reduced interactions with both A. castellanii and brain microvascular endothelial cells. Interestingly, the deletion of RS218-derived genomic island 21 containing adhesins (P fimbriae, F17-like fimbriae, nonfimbrial adhesins, Hek, and hemagglutinin), protein secretion system (T1SS for hemolysin), invasins (CNF1), metabolism (D-serine catabolism) abolished E. coli K1-mediated HBMEC cytotoxicity in a CNF1-independent manner. Therefore, the characterization of these genomic islands should reveal mechanisms of evolutionary gain for E. coli K1 pathogenicity.

  14. Interactions of Neuropathogenic Escherichia coli K1 (RS218) and Its Derivatives Lacking Genomic Islands with Phagocytic Acanthamoeba castellanii and Nonphagocytic Brain Endothelial Cells

    PubMed Central

    Yousuf, Farzana Abubakar; Yousuf, Zuhair; Iqbal, Junaid; Siddiqui, Ruqaiyyah; Khan, Hafsa; Khan, Naveed Ahmed

    2014-01-01

    Here we determined the role of various genomic islands in E. coli K1 interactions with phagocytic A. castellanii and nonphagocytic brain microvascular endothelial cells. The findings revealed that the genomic islands deletion mutants of RS218 related to toxins (peptide toxin, α-hemolysin), adhesins (P fimbriae, F17-like fimbriae, nonfimbrial adhesins, Hek, and hemagglutinin), protein secretion system (T1SS for hemolysin), invasins (IbeA, CNF1), metabolism (D-serine catabolism, dihydroxyacetone, glycerol, and glyoxylate metabolism) showed reduced interactions with both A. castellanii and brain microvascular endothelial cells. Interestingly, the deletion of RS218-derived genomic island 21 containing adhesins (P fimbriae, F17-like fimbriae, nonfimbrial adhesins, Hek, and hemagglutinin), protein secretion system (T1SS for hemolysin), invasins (CNF1), metabolism (D-serine catabolism) abolished E. coli K1-mediated HBMEC cytotoxicity in a CNF1-independent manner. Therefore, the characterization of these genomic islands should reveal mechanisms of evolutionary gain for E. coli K1 pathogenicity. PMID:24818136

  15. Draft Genome of Rhodococcus rhodochrous TRN7, Isolated from the Coast of Trindade Island, Brazil.

    PubMed

    Rodrigues, Edmo M; Pylro, Victor S; Dobbler, Priscila T; Victoria, Filipe; Roesch, Luiz F W; Tótola, Marcos R

    2016-03-03

    Here, we present a draft genome and annotation of Rhodococcus rhodochrous TRN7, isolated from Trindade Island, Brazil, which will provide genetic data to benefit the understanding of its metabolism. Copyright © 2016 Rodrigues et al.

  16. Aeromonas salmonicida subsp. salmonicida strains isolated from Chinese freshwater fish contain a novel genomic island and possible regional-specific mobile genetic elements profiles.

    PubMed

    Long, Meng; Nielsen, Tue K; Leisner, Jørgen J; Hansen, Lars H; Shen, Zhi X; Zhang, Qian Q; Li, Aihua

    2016-09-01

    Two strains of Aeromonas salmonicida, YK and BG, were isolated from largemouth bronze gudgeon and northern whitefish in China, and identified as A. salmonicida subsp. salmonicida based on phylogenetic analysis of vapA and 16S rRNA gene sequences. YK and BG originated from freshwater fish, one of which belonged to the cyprinid family, and the strains showed a difference in virulence. Subsequently, we performed whole genome sequencing of the strains, and comparison of their genomic sequences to the genome of the A449 reference strain revealed various genomic rearrangements, including a new variant of the genomic island AsaGEI in BG, designated as AsaGEI2c This is the first report on a GEI of A. salmonicida strain from China. Furthermore, both YK and BG strains contained a Tn7 transposon inserted at the same position in the chromosome. Finally, IS-dependent rearrangements on pAsa5 are deemed likely to have occurred, with omission of the resD gene in both strains as well as omission of genes related to the IncF conjugal transfer system in the YK isolate. This study demonstrates that A. salmonicida subsp. salmonicida can infect non-salmonids (cyprinids) in addition to salmonids, and that AsaGEI2c might be useful as a geographical indicator of Chinese A. salmonicida subsp. salmonicida isolates. © FEMS 2016. All rights reserved. For permissions, please e-mail: journals.permissions@oup.com.

  17. Extensive Genome Rearrangements and Multiple Horizontal Gene Transfers in a Population of Pyrococcus Isolates from Vulcano Island, Italy▿ †

    PubMed Central

    White, James R.; Escobar-Paramo, Patricia; Mongodin, Emmanuel F.; Nelson, Karen E.; DiRuggiero, Jocelyne

    2008-01-01

    The extent of chromosome rearrangements in Pyrococcus isolates from marine hydrothermal vents in Vulcano Island, Italy, was evaluated by high-throughput genomic methods. The results illustrate the dynamic nature of the genomes of the genus Pyrococcus and raise the possibility of a connection between rapidly changing environmental conditions and adaptive genomic properties. PMID:18723649

  18. Genome characterization of Long Island tick rhabdovirus, a new virus identified in Amblyomma americanum ticks.

    PubMed

    Tokarz, Rafal; Sameroff, Stephen; Leon, Maria Sanchez; Jain, Komal; Lipkin, W Ian

    2014-02-11

    Ticks are implicated as hosts to a wide range of animal and human pathogens. The full range of microbes harbored by ticks has not yet been fully explored. As part of a viral surveillance and discovery project in arthropods, we used unbiased high-throughput sequencing to examine viromes of ticks collected on Long Island, New York in 2013. We detected and sequenced the complete genome of a novel rhabdovirus originating from a pool of Amblyomma americanum ticks. This virus, which we provisionally name Long Island tick rhabdovirus, is distantly related to Moussa virus from Africa. The Long Island tick rhabdovirus may represent a novel species within family Rhabdoviridae.

  19. Complete genome sequences of four avian paramyxoviruses of serotype 10 isolated from Rockhopper Penguins on the Falkland Islands

    USDA-ARS?s Scientific Manuscript database

    The first complete genome sequences of four Avian paramyxovirus serotype 10 (APMV-10) isolates are described here. The viruses were isolated from Rockhopper Penguins sampled in 2007 on the Falkland Islands. All four genomes are 15,456 nucleotides in length and phylogenetic analyses show them to be c...

  20. Complete Genome Sequence of Salmonella enterica Serovar Typhimurium Strain YU15 (Sequence Type 19) Harboring the Salmonella Genomic Island 1 and Virulence Plasmid pSTV

    PubMed Central

    Calva, Edmundo; Puente, José L.; Zaidi, Mussaret B.

    2016-01-01

    The complete genome of Salmonella enterica subsp. enterica serovar Typhimurium sequence type 19 (ST19) strain YU15, isolated in Yucatán, Mexico, from a human baby stool culture, was determined using PacBio technology. The chromosome contains five intact prophages and the Salmonella genomic island 1 (SGI1). This strain carries the Salmonella virulence plasmid pSTV. PMID:27081132

  1. Structure of a short-chain dehydrogenase/reductase (SDR) within a genomic island from a clinical strain of Acinetobacter baumannii

    DOE Office of Scientific and Technical Information (OSTI.GOV)

    Shah, Bhumika S., E-mail: bhumika.shah@mq.edu.au; Tetu, Sasha G.; Harrop, Stephen J.

    2014-09-25

    The structure of a short-chain dehydrogenase encoded within genomic islands of A. baumannii strains has been solved to 2.4 Å resolution. This classical SDR incorporates a flexible helical subdomain. The NADP-binding site and catalytic side chains are identified. Over 15% of the genome of an Australian clinical isolate of Acinetobacter baumannii occurs within genomic islands. An uncharacterized protein encoded within one island feature common to this and other International Clone II strains has been studied by X-ray crystallography. The 2.4 Å resolution structure of SDR-WM99c reveals it to be a new member of the classical short-chain dehydrogenase/reductase (SDR) superfamily. Themore » enzyme contains a nucleotide-binding domain and, like many other SDRs, is tetrameric in form. The active site contains a catalytic tetrad (Asn117, Ser146, Tyr159 and Lys163) and water molecules occupying the presumed NADP cofactor-binding pocket. An adjacent cleft is capped by a relatively mobile helical subdomain, which is well positioned to control substrate access.« less

  2. Genome characterization of Long Island tick rhabdovirus, a new virus identified in Amblyomma americanum ticks

    PubMed Central

    2014-01-01

    Background Ticks are implicated as hosts to a wide range of animal and human pathogens. The full range of microbes harbored by ticks has not yet been fully explored. Methods As part of a viral surveillance and discovery project in arthropods, we used unbiased high-throughput sequencing to examine viromes of ticks collected on Long Island, New York in 2013. Results We detected and sequenced the complete genome of a novel rhabdovirus originating from a pool of Amblyomma americanum ticks. This virus, which we provisionally name Long Island tick rhabdovirus, is distantly related to Moussa virus from Africa. Conclusions The Long Island tick rhabdovirus may represent a novel species within family Rhabdoviridae. PMID:24517260

  3. Population genomic analysis uncovers African and European admixture in Drosophila melanogaster populations from the south-eastern United States and Caribbean Islands.

    PubMed

    Kao, Joyce Y; Zubair, Asif; Salomon, Matthew P; Nuzhdin, Sergey V; Campo, Daniel

    2015-04-01

    Drosophila melanogaster is postulated to have colonized North America in the past several 100 years in two waves. Flies from Europe colonized the east coast United States while flies from Africa inhabited the Caribbean, which if true, make the south-east US and Caribbean Islands a secondary contact zone for African and European D. melanogaster. This scenario has been proposed based on phenotypes and limited genetic data. In our study, we have sequenced individual whole genomes of flies from populations in the south-east US and Caribbean Islands and examined these populations in conjunction with population sequences from the west coast US, Africa, and Europe. We find that west coast US populations are closely related to the European population, likely reflecting a rapid westward expansion upon first settlements into North America. We also find genomic evidence of African and European admixture in south-east US and Caribbean populations, with a clinal pattern of decreasing proportions of African ancestry with higher latitude. Our genomic analysis of D. melanogaster populations from the south-east US and Caribbean Islands provides more evidence for the Caribbean Islands as the source of previously reported novel African alleles found in other east coast US populations. We also find the border between the south-east US and the Caribbean island to be the admixture hot zone where distinctly African-like Caribbean flies become genomically more similar to European-like south-east US flies. Our findings have important implications for previous studies examining the generation of east coast US clines via selection. © 2015 John Wiley & Sons Ltd.

  4. Complete Genome Sequences of Four Avian Paramyxoviruses of Serotype 10 Isolated from Rockhopper Penguins on the Falkland Islands

    PubMed Central

    Goraichuk, Iryna V.; Dimitrov, Kiril M.; Sharma, Poonam; Miller, Patti J.; Swayne, David E.; Suarez, David L.

    2017-01-01

    ABSTRACT The first complete genome sequences of four avian paramyxovirus serotype 10 (APMV-10) isolates are described here. The viruses were isolated from rockhopper penguins on the Falkland Islands, sampled in 2007. All four genomes are 15,456 nucleotides in length, and phylogenetic analyses show them to be closely related. PMID:28572332

  5. Microbial Lifestyle and Genome Signatures

    PubMed Central

    Dutta, Chitra; Paul, Sandip

    2012-01-01

    Microbes are known for their unique ability to adapt to varying lifestyle and environment, even to the extreme or adverse ones. The genomic architecture of a microbe may bear the signatures not only of its phylogenetic position, but also of the kind of lifestyle to which it is adapted. The present review aims to provide an account of the specific genome signatures observed in microbes acclimatized to distinct lifestyles or ecological niches. Niche-specific signatures identified at different levels of microbial genome organization like base composition, GC-skew, purine-pyrimidine ratio, dinucleotide abundance, codon bias, oligonucleotide composition etc. have been discussed. Among the specific cases highlighted in the review are the phenomena of genome shrinkage in obligatory host-restricted microbes, genome expansion in strictly intra-amoebal pathogens, strand-specific codon usage in intracellular species, acquisition of genome islands in pathogenic or symbiotic organisms, discriminatory genomic traits of marine microbes with distinct trophic strategies, and conspicuous sequence features of certain extremophiles like those adapted to high temperature or high salinity. PMID:23024607

  6. High-quality draft genome sequence of Ensifer meliloti Mlalz-1, a microsymbiont of Medicago laciniata (L.) miller collected in Lanzarote, Canary Islands, Spain

    DOE Office of Scientific and Technical Information (OSTI.GOV)

    Osman, Wan Adnawani Meor; van Berkum, Peter; León-Barrios, Milagros

    Ensifer meliloti Mlalz-1 (INSDC = ATZD00000000) is an aerobic, motile, Gram-negative, non-spore-forming rod that was isolated from an effective nitrogen-fixing nodule of Medicago laciniata (L.) Miller from a soil sample collected near the town of Guatiza on the island of Lanzarote, the Canary Islands, Spain. This strain nodulates and forms an effective symbiosis with the highly specific host M. laciniata. This rhizobial genome was sequenced as part of the DOE Joint Genome Institute 2010 Genomic Encyclopedia for Bacteria and Archaea-Root Nodule Bacteria (GEBA-RNB) sequencing project. Here in this paper, the features of E. meliloti Mlalz-1 are described, together with high-qualitymore » permanent draft genome sequence information and annotation. The 6,664,116 bp high-quality draft genome is arranged in 99 scaffolds of 100 contigs, containing 6314 protein-coding genes and 74 RNA-only encoding genes. Strain Mlalz-1 is closely related to Ensifer meliloti IAM 12611 T, Ensifer medicae A 321T and Ensifer numidicus ORS 1407 T, based on 16S rRNA gene sequences. gANI values of ≥98.1% support the classification of strain Mlalz-1 as E. meliloti . Nodulation of M. laciniata requires a specific nodC allele, and the nodC gene of strain Mlalz-1 shares ≥98% sequence identity with nodC of M. laciniata-nodulating Ensifer strains, but ≤93% with nodC of Ensifer strains that nodulate other Medicago species. Strain Mlalz-1 is unique among sequenced E. meliloti strains in possessing genes encoding components of a T2SS and in having two versions of the adaptive acid tolerance response lpiA-acvB operon. In E. medicae strain WSM419, lpiA is essential for enhancing survival in lethal acid conditions. The second copy of the lpiA-acvB operon of strain Mlalz-1 has highest sequence identity (> 96%) with that of E. medicae strains, which suggests genetic recombination between strain Mlalz-1 and E. medicae and the horizontal gene transfer of lpiA-acvB.« less

  7. High-quality draft genome sequence of Ensifer meliloti Mlalz-1, a microsymbiont of Medicago laciniata (L.) miller collected in Lanzarote, Canary Islands, Spain

    DOE PAGES

    Osman, Wan Adnawani Meor; van Berkum, Peter; León-Barrios, Milagros; ...

    2017-09-25

    Ensifer meliloti Mlalz-1 (INSDC = ATZD00000000) is an aerobic, motile, Gram-negative, non-spore-forming rod that was isolated from an effective nitrogen-fixing nodule of Medicago laciniata (L.) Miller from a soil sample collected near the town of Guatiza on the island of Lanzarote, the Canary Islands, Spain. This strain nodulates and forms an effective symbiosis with the highly specific host M. laciniata. This rhizobial genome was sequenced as part of the DOE Joint Genome Institute 2010 Genomic Encyclopedia for Bacteria and Archaea-Root Nodule Bacteria (GEBA-RNB) sequencing project. Here in this paper, the features of E. meliloti Mlalz-1 are described, together with high-qualitymore » permanent draft genome sequence information and annotation. The 6,664,116 bp high-quality draft genome is arranged in 99 scaffolds of 100 contigs, containing 6314 protein-coding genes and 74 RNA-only encoding genes. Strain Mlalz-1 is closely related to Ensifer meliloti IAM 12611 T, Ensifer medicae A 321T and Ensifer numidicus ORS 1407 T, based on 16S rRNA gene sequences. gANI values of ≥98.1% support the classification of strain Mlalz-1 as E. meliloti . Nodulation of M. laciniata requires a specific nodC allele, and the nodC gene of strain Mlalz-1 shares ≥98% sequence identity with nodC of M. laciniata-nodulating Ensifer strains, but ≤93% with nodC of Ensifer strains that nodulate other Medicago species. Strain Mlalz-1 is unique among sequenced E. meliloti strains in possessing genes encoding components of a T2SS and in having two versions of the adaptive acid tolerance response lpiA-acvB operon. In E. medicae strain WSM419, lpiA is essential for enhancing survival in lethal acid conditions. The second copy of the lpiA-acvB operon of strain Mlalz-1 has highest sequence identity (> 96%) with that of E. medicae strains, which suggests genetic recombination between strain Mlalz-1 and E. medicae and the horizontal gene transfer of lpiA-acvB.« less

  8. The scope and strength of sex-specific selection in genome evolution

    PubMed Central

    Wright, A E; Mank, J E

    2013-01-01

    Males and females share the vast majority of their genomes and yet are often subject to different, even conflicting, selection. Genomic and transcriptomic developments have made it possible to assess sex-specific selection at the molecular level, and it is clear that sex-specific selection shapes the evolutionary properties of several genomic characteristics, including transcription, post-transcriptional regulation, imprinting, genome structure and gene sequence. Sex-specific selection is strongly influenced by mating system, which also causes neutral evolutionary changes that affect different regions of the genome in different ways. Here, we synthesize theoretical and molecular work in order to provide a cohesive view of the role of sex-specific selection and mating system in genome evolution. We also highlight the need for a combined approach, incorporating both genomic data and experimental phenotypic studies, in order to understand precisely how sex-specific selection drives evolutionary change across the genome. PMID:23848139

  9. Genomic islands 1 and 2 play key roles in the evolution of extensively drug-resistant ST235 isolates of Pseudomonas aeruginosa

    PubMed Central

    Scott, Martin; Worden, Paul; Huntington, Peter; Hudson, Bernard; Karagiannis, Thomas; Charles, Ian G.; Djordjevic, Steven P.

    2016-01-01

    Pseudomonas aeruginosa are noscomially acquired, opportunistic pathogens that pose a major threat to the health of burns patients and the immunocompromised. We sequenced the genomes of P. aeruginosa isolates RNS_PA1, RNS_PA46 and RNS_PAE05, which displayed resistance to almost all frontline antibiotics, including gentamicin, piperacillin, timentin, meropenem, ceftazidime and colistin. We provide evidence that the isolates are representatives of P. aeruginosa sequence type (ST) 235 and carry Tn6162 and Tn6163 in genomic islands 1 (GI1) and 2 (GI2), respectively. GI1 disrupts the endA gene at precisely the same chromosomal location as in P. aeruginosa strain VR-143/97, of unknown ST, creating an identical CA direct repeat. The class 1 integron associated with Tn6163 in GI2 carries a blaGES-5–aacA4–gcuE15–aphA15 cassette array conferring resistance to carbapenems and aminoglycosides. GI2 is flanked by a 12 nt direct repeat motif, abuts a tRNA-gly gene, and encodes proteins with putative roles in integration, conjugative transfer as well as integrative conjugative element-specific proteins. This suggests that GI2 may have evolved from a novel integrative conjugative element. Our data provide further support to the hypothesis that genomic islands play an important role in de novo evolution of multiple antibiotic resistance phenotypes in P. aeruginosa. PMID:26962050

  10. The scope and strength of sex-specific selection in genome evolution.

    PubMed

    Wright, A E; Mank, J E

    2013-09-01

    Males and females share the vast majority of their genomes and yet are often subject to different, even conflicting, selection. Genomic and transcriptomic developments have made it possible to assess sex-specific selection at the molecular level, and it is clear that sex-specific selection shapes the evolutionary properties of several genomic characteristics, including transcription, post-transcriptional regulation, imprinting, genome structure and gene sequence. Sex-specific selection is strongly influenced by mating system, which also causes neutral evolutionary changes that affect different regions of the genome in different ways. Here, we synthesize theoretical and molecular work in order to provide a cohesive view of the role of sex-specific selection and mating system in genome evolution. We also highlight the need for a combined approach, incorporating both genomic data and experimental phenotypic studies, in order to understand precisely how sex-specific selection drives evolutionary change across the genome. © 2013 The Authors. Journal of Evolutionary Biology © 2013 European Society For Evolutionary Biology.

  11. Nanoparticles for Site Specific Genome Editing

    NASA Astrophysics Data System (ADS)

    McNeer, Nicole Ali

    Triplex-forming peptide nucleic acids (PNAs) can be used to coordinate the recombination of short 50-60 by "donor DNA" fragments into genomic DNA, resulting in site-specific correction of genetic mutations or the introduction of advantageous genetic modifications. Site-specific gene editing in hematopoietic stem and progenitor cells (HSPCs) could result in treatment or cure of inherited disorders of the blood such as beta-thalassemia. Gene editing in HSPCs and differentiated T cells could help combat HIV/AIDs by modifying receptors, such as CCR5, necessary for R5-tropic HIV entry. However, translation of genome modification technologies to clinical practice is limited by challenges in intracellular delivery, especially in difficult-to-transfect hematolymphoid cells. In vivo gene editing could also provide novel treatment for systemic monogenic disorders such as cystic fibrosis, an autosomal recessive disorder caused by mutations in the cystic fibrosis transmembrane receptor. Here, we have engineered biodegradable nanoparticles to deliver oligonucleotides for site-specific genome editing of disease-relevant genes in human cells, with high efficiency, low toxicity, and editing of clinically relevant cell types. We designed nanoparticles to edit the human beta-globin and CCR5 genes in hematopoietic cells. We show that poly(lactic-co-glycolic acid) (PLGA) nanoparticles can delivery PNA and donor DNA for site-specific gene modification in human hematopoietic cells in vitro and in vivo in NOD-scid IL2rgammanull mice. Nanoparticles delivered by tail vein localized to hematopoietic compartments in the spleen and bone marrow of humanized mice, resulting in modification of the beta-globin and CCR5 genes. Modification frequencies ranged from 0.005 to 20% of cells depending on the organ and cell type, without detectable toxicity. This project developed highly versatile methods for delivery of therapeutics to hematolymphoid cells and hematopoietic stem cells, and will help to

  12. Genomic analysis and temperature-dependent transcriptome profiles of the rhizosphere originating strain Pseudomonas aeruginosa M18

    PubMed Central

    2011-01-01

    Background Our previously published reports have described an effective biocontrol agent named Pseudomonas sp. M18 as its 16S rDNA sequence and several regulator genes share homologous sequences with those of P. aeruginosa, but there are several unusual phenotypic features. This study aims to explore its strain specific genomic features and gene expression patterns at different temperatures. Results The complete M18 genome is composed of a single chromosome of 6,327,754 base pairs containing 5684 open reading frames. Seven genomic islands, including two novel prophages and five specific non-phage islands were identified besides the conserved P. aeruginosa core genome. Each prophage contains a putative chitinase coding gene, and the prophage II contains a capB gene encoding a putative cold stress protein. The non-phage genomic islands contain genes responsible for pyoluteorin biosynthesis, environmental substance degradation and type I and III restriction-modification systems. Compared with other P. aeruginosa strains, the fewest number (3) of insertion sequences and the most number (3) of clustered regularly interspaced short palindromic repeats in M18 genome may contribute to the relative genome stability. Although the M18 genome is most closely related to that of P. aeruginosa strain LESB58, the strain M18 is more susceptible to several antimicrobial agents and easier to be erased in a mouse acute lung infection model than the strain LESB58. The whole M18 transcriptomic analysis indicated that 10.6% of the expressed genes are temperature-dependent, with 22 genes up-regulated at 28°C in three non-phage genomic islands and one prophage but none at 37°C. Conclusions The P. aeruginosa strain M18 has evolved its specific genomic structures and temperature dependent expression patterns to meet the requirement of its fitness and competitiveness under selective pressures imposed on the strain in rhizosphere niche. PMID:21884571

  13. Genomic evaluation, breed identification, and population structure of North American, English and Island Guernsey dairy cattle

    USDA-ARS?s Scientific Manuscript database

    Genomic evaluations of dairy cattle in the United States have been available for Brown Swiss, Holsteins, and Jerseys since 2009 and for Ayrshires since 2013. As of February 2015, 2,281 Guernsey bulls and cows had genotypes from collaboration between the United States, Canada, England, and the island...

  14. Genome-wide screen of ovary-specific DNA methylation in polycystic ovary syndrome.

    PubMed

    Yu, Ying-Ying; Sun, Cui-Xiang; Liu, Yin-Kun; Li, Yan; Wang, Li; Zhang, Wei

    2015-07-01

    To compare genome-wide DNA methylation profiles in ovary tissue from women with polycystic ovary syndrome (PCOS) and healthy controls. Case-control study matched for age and body mass index. University-affiliated hospital. Ten women with PCOS who underwent ovarian drilling to induce ovulation and 10 healthy women who were undergoing laparoscopic sterilization, hysterectomy for benign conditions, diagnostic laparoscopy for pelvic pain, or oophorectomy for nonovarian indications. None. Genome-wide DNA methylation patterns determined by immunoprecipitation and microarray (MeDIP-chip) analysis. The methylation levels were statistically significantly higher in CpG island shores (CGI shores), which lie outside of core promoter regions, and lower within gene bodies in women with PCOS relative to the controls. In addition, high CpG content promoters were the most frequently hypermethylated promoters in PCOS ovaries but were more often hypomethylated in controls. Second, 872 CGIs, specifically methylated in PCOS, represented 342 genes that could be associated with various molecular functions, including protein binding, hormone activity, and transcription regulator activity. Finally, methylation differences were validated in seven genes by methylation-specific polymerase chain reaction. These genes correlated to several functional families related to the pathogenesis of PCOS and may be potential biomarkers for this disease. Our results demonstrated that epigenetic modification differs between PCOS and normal ovaries, which may help to further understand the pathophysiology of this disease. Copyright © 2015 American Society for Reproductive Medicine. Published by Elsevier Inc. All rights reserved.

  15. Molecular epidemiology and phylogenetic distribution of the Escherichia coli pks genomic island.

    PubMed

    Johnson, James R; Johnston, Brian; Kuskowski, Michael A; Nougayrede, Jean-Philippe; Oswald, Eric

    2008-12-01

    Epidemiological and phylogenetic associations of the pks genomic island of extraintestinal pathogenic Escherichia coli (ExPEC), which encodes the genotoxin colibactin, are incompletely defined. clbB and clbN (as markers for the 5' and 3' regions of the pks island, respectively), clbA and clbQ (as supplemental pks island markers), and 12 other putative ExPEC virulence genes were newly sought by PCR among 131 published E. coli isolates from hospitalized veterans (62 blood isolates and 69 fecal isolates). Blood and fecal isolates and clbB-positive and -negative isolates were compared for 66 newly and previously assessed traits. Among the 14 newly sought traits, clbB and clbN (colibactin polyketide synthesis system), hra (heat-resistant agglutinin), and vat (vacuolating toxin) were significantly associated with bacteremia. clbB and clbN identified a subset within phylogenetic group B2 with extremely high virulence scores and a high proportion of blood isolates. However, by multivariable analysis, other traits were more predictive of blood source than clbB and clbN were; indeed, among the newly sought traits, only pic significantly predicted bacteremia (negative association). By correspondence analysis, clbB and clbN were closely associated with group B2 and multiple B2-associated traits; by principal coordinate analysis, clbB and clbN partitioned the data set better than did blood versus fecal source. Thus, the pks island was significantly associated with bacteremia, multiple ExPEC-associated virulence genes, and group B2, and within group B2, it identified an especially high-virulence subset. This extends previous work regarding the pks island and supports investigation of the colibactin system as a potential therapeutic target.

  16. Family-specific scaling laws in bacterial genomes.

    PubMed

    De Lazzari, Eleonora; Grilli, Jacopo; Maslov, Sergei; Cosentino Lagomarsino, Marco

    2017-07-27

    Among several quantitative invariants found in evolutionary genomics, one of the most striking is the scaling of the overall abundance of proteins, or protein domains, sharing a specific functional annotation across genomes of given size. The size of these functional categories change, on average, as power-laws in the total number of protein-coding genes. Here, we show that such regularities are not restricted to the overall behavior of high-level functional categories, but also exist systematically at the level of single evolutionary families of protein domains. Specifically, the number of proteins within each family follows family-specific scaling laws with genome size. Functionally similar sets of families tend to follow similar scaling laws, but this is not always the case. To understand this systematically, we provide a comprehensive classification of families based on their scaling properties. Additionally, we develop a quantitative score for the heterogeneity of the scaling of families belonging to a given category or predefined group. Under the common reasonable assumption that selection is driven solely or mainly by biological function, these findings point to fine-tuned and interdependent functional roles of specific protein domains, beyond our current functional annotations. This analysis provides a deeper view on the links between evolutionary expansion of protein families and the functional constraints shaping the gene repertoire of bacterial genomes. © The Author(s) 2017. Published by Oxford University Press on behalf of Nucleic Acids Research.

  17. Mammalian-specific genomic functions: Newly acquired traits generated by genomic imprinting and LTR retrotransposon-derived genes in mammals.

    PubMed

    Kaneko-Ishino, Tomoko; Ishino, Fumitoshi

    2015-01-01

    Mammals, including human beings, have evolved a unique viviparous reproductive system and a highly developed central nervous system. How did these unique characteristics emerge in mammalian evolution, and what kinds of changes did occur in the mammalian genomes as evolution proceeded? A key conceptual term in approaching these issues is "mammalian-specific genomic functions", a concept covering both mammalian-specific epigenetics and genetics. Genomic imprinting and LTR retrotransposon-derived genes are reviewed as the representative, mammalian-specific genomic functions that are essential not only for the current mammalian developmental system, but also mammalian evolution itself. First, the essential roles of genomic imprinting in mammalian development, especially related to viviparous reproduction via placental function, as well as the emergence of genomic imprinting in mammalian evolution, are discussed. Second, we introduce the novel concept of "mammalian-specific traits generated by mammalian-specific genes from LTR retrotransposons", based on the finding that LTR retrotransposons served as a critical driving force in the mammalian evolution via generating mammalian-specific genes.

  18. Adaptive divergence despite strong genetic drift: genomic analysis of the evolutionary mechanisms causing genetic differentiation in the island fox (Urocyon littoralis).

    PubMed

    Funk, W Chris; Lovich, Robert E; Hohenlohe, Paul A; Hofman, Courtney A; Morrison, Scott A; Sillett, T Scott; Ghalambor, Cameron K; Maldonado, Jesus E; Rick, Torben C; Day, Mitch D; Polato, Nicholas R; Fitzpatrick, Sarah W; Coonan, Timothy J; Crooks, Kevin R; Dillon, Adam; Garcelon, David K; King, Julie L; Boser, Christina L; Gould, Nicholas; Andelt, William F

    2016-05-01

    The evolutionary mechanisms generating the tremendous biodiversity of islands have long fascinated evolutionary biologists. Genetic drift and divergent selection are predicted to be strong on islands and both could drive population divergence and speciation. Alternatively, strong genetic drift may preclude adaptation. We conducted a genomic analysis to test the roles of genetic drift and divergent selection in causing genetic differentiation among populations of the island fox (Urocyon littoralis). This species consists of six subspecies, each of which occupies a different California Channel Island. Analysis of 5293 SNP loci generated using Restriction-site Associated DNA (RAD) sequencing found support for genetic drift as the dominant evolutionary mechanism driving population divergence among island fox populations. In particular, populations had exceptionally low genetic variation, small Ne (range = 2.1-89.7; median = 19.4), and significant genetic signatures of bottlenecks. Moreover, islands with the lowest genetic variation (and, by inference, the strongest historical genetic drift) were most genetically differentiated from mainland grey foxes, and vice versa, indicating genetic drift drives genome-wide divergence. Nonetheless, outlier tests identified 3.6-6.6% of loci as high FST outliers, suggesting that despite strong genetic drift, divergent selection contributes to population divergence. Patterns of similarity among populations based on high FST outliers mirrored patterns based on morphology, providing additional evidence that outliers reflect adaptive divergence. Extremely low genetic variation and small Ne in some island fox populations, particularly on San Nicolas Island, suggest that they may be vulnerable to fixation of deleterious alleles, decreased fitness and reduced adaptive potential. © 2016 John Wiley & Sons Ltd.

  19. Adaptive divergence despite strong genetic drift: genomic analysis of the evolutionary mechanisms causing genetic differentiation in the island fox (Urocyon littoralis)

    PubMed Central

    FUNK, W. CHRIS; LOVICH, ROBERT E.; HOHENLOHE, PAUL A.; HOFMAN, COURTNEY A.; MORRISON, SCOTT A.; SILLETT, T. SCOTT; GHALAMBOR, CAMERON K.; MALDONADO, JESUS E.; RICK, TORBEN C.; DAY, MITCH D.; POLATO, NICHOLAS R.; FITZPATRICK, SARAH W.; COONAN, TIMOTHY J.; CROOKS, KEVIN R.; DILLON, ADAM; GARCELON, DAVID K.; KING, JULIE L.; BOSER, CHRISTINA L.; GOULD, NICHOLAS; ANDELT, WILLIAM F.

    2016-01-01

    The evolutionary mechanisms generating the tremendous biodiversity of islands have long fascinated evolutionary biologists. Genetic drift and divergent selection are predicted to be strong on islands and both could drive population divergence and speciation. Alternatively, strong genetic drift may preclude adaptation. We conducted a genomic analysis to test the roles of genetic drift and divergent selection in causing genetic differentiation among populations of the island fox (Urocyon littoralis). This species consists of 6 subspecies, each of which occupies a different California Channel Island. Analysis of 5293 SNP loci generated using Restriction-site Associated DNA (RAD) sequencing found support for genetic drift as the dominant evolutionary mechanism driving population divergence among island fox populations. In particular, populations had exceptionally low genetic variation, small Ne (range = 2.1–89.7; median = 19.4), and significant genetic signatures of bottlenecks. Moreover, islands with the lowest genetic variation (and, by inference, the strongest historical genetic drift) were most genetically differentiated from mainland gray foxes, and vice versa, indicating genetic drift drives genome-wide divergence. Nonetheless, outlier tests identified 3.6–6.6% of loci as high FST outliers, suggesting that despite strong genetic drift, divergent selection contributes to population divergence. Patterns of similarity among populations based on high FST outliers mirrored patterns based on morphology, providing additional evidence that outliers reflect adaptive divergence. Extremely low genetic variation and small Ne in some island fox populations, particularly on San Nicolas Island, suggest that they may be vulnerable to fixation of deleterious alleles, decreased fitness, and reduced adaptive potential. PMID:26992010

  20. Genomic Species Are Ecological Species as Revealed by Comparative Genomics in Agrobacterium tumefaciens

    PubMed Central

    Lassalle, Florent; Campillo, Tony; Vial, Ludovic; Baude, Jessica; Costechareyre, Denis; Chapulliot, David; Shams, Malek; Abrouk, Danis; Lavire, Céline; Oger-Desfeux, Christine; Hommais, Florence; Guéguen, Laurent; Daubin, Vincent; Muller, Daniel; Nesme, Xavier

    2011-01-01

    The definition of bacterial species is based on genomic similarities, giving rise to the operational concept of genomic species, but the reasons of the occurrence of differentiated genomic species remain largely unknown. We used the Agrobacterium tumefaciens species complex and particularly the genomic species presently called genomovar G8, which includes the sequenced strain C58, to test the hypothesis of genomic species having specific ecological adaptations possibly involved in the speciation process. We analyzed the gene repertoire specific to G8 to identify potential adaptive genes. By hybridizing 25 strains of A. tumefaciens on DNA microarrays spanning the C58 genome, we highlighted the presence and absence of genes homologous to C58 in the taxon. We found 196 genes specific to genomovar G8 that were mostly clustered into seven genomic islands on the C58 genome—one on the circular chromosome and six on the linear chromosome—suggesting higher plasticity and a major adaptive role of the latter. Clusters encoded putative functional units, four of which had been verified experimentally. The combination of G8-specific functions defines a hypothetical species primary niche for G8 related to commensal interaction with a host plant. This supports that the G8 ancestor was able to exploit a new ecological niche, maybe initiating ecological isolation and thus speciation. Searching genomic data for synapomorphic traits is a powerful way to describe bacterial species. This procedure allowed us to find such phenotypic traits specific to genomovar G8 and thus propose a Latin binomial, Agrobacterium fabrum, for this bona fide genomic species. PMID:21795751

  1. Mammalian-specific genomic functions: Newly acquired traits generated by genomic imprinting and LTR retrotransposon-derived genes in mammals

    PubMed Central

    KANEKO-ISHINO, Tomoko; ISHINO, Fumitoshi

    2015-01-01

    Mammals, including human beings, have evolved a unique viviparous reproductive system and a highly developed central nervous system. How did these unique characteristics emerge in mammalian evolution, and what kinds of changes did occur in the mammalian genomes as evolution proceeded? A key conceptual term in approaching these issues is “mammalian-specific genomic functions”, a concept covering both mammalian-specific epigenetics and genetics. Genomic imprinting and LTR retrotransposon-derived genes are reviewed as the representative, mammalian-specific genomic functions that are essential not only for the current mammalian developmental system, but also mammalian evolution itself. First, the essential roles of genomic imprinting in mammalian development, especially related to viviparous reproduction via placental function, as well as the emergence of genomic imprinting in mammalian evolution, are discussed. Second, we introduce the novel concept of “mammalian-specific traits generated by mammalian-specific genes from LTR retrotransposons”, based on the finding that LTR retrotransposons served as a critical driving force in the mammalian evolution via generating mammalian-specific genes. PMID:26666304

  2. The clc Element of Pseudomonas sp. Strain B13, a Genomic Island with Various Catabolic Properties

    PubMed Central

    Gaillard, Muriel; Vallaeys, Tatiana; Vorhölter, Frank Jörg; Minoia, Marco; Werlen, Christoph; Sentchilo, Vladimir; Pühler, Alfred; van der Meer, Jan Roelof

    2006-01-01

    Pseudomonas sp. strain B13 is a bacterium known to degrade chloroaromatic compounds. The properties to use 3- and 4-chlorocatechol are determined by a self-transferable DNA element, the clc element, which normally resides at two locations in the cell's chromosome. Here we report the complete nucleotide sequence of the clc element, demonstrating the unique catabolic properties while showing its relatedness to genomic islands and integrative and conjugative elements rather than to other known catabolic plasmids. As far as catabolic functions, the clc element harbored, in addition to the genes for chlorocatechol degradation, a complete functional operon for 2-aminophenol degradation and genes for a putative aromatic compound transport protein and for a multicomponent aromatic ring dioxygenase similar to anthranilate hydroxylase. The genes for catabolic functions were inducible under various conditions, suggesting a network of catabolic pathway induction. For about half of the open reading frames (ORFs) on the clc element, no clear functional prediction could be given, although some indications were found for functions that were similar to plasmid conjugation. The region in which these ORFs were situated displayed a high overall conservation of nucleotide sequence and gene order to genomic regions in other recently completed bacterial genomes or to other genomic islands. Most notably, except for two discrete regions, the clc element was almost 100% identical over the whole length to a chromosomal region in Burkholderia xenovorans LB400. This indicates the dynamic evolution of this type of element and the continued transition between elements with a more pathogenic character and those with catabolic properties. PMID:16484212

  3. Complete chloroplast genome of Prunus yedoensis Matsum.(Rosaceae), wild and endemic flowering cherry on Jeju Island, Korea.

    PubMed

    Cho, Myong-Suk; Hyun Cho, Chung; Yeon Kim, Su; Su Yoon, Hwan; Kim, Seung-Chul

    2016-09-01

    The complete chloroplast genome sequences of the wild flowering cherry, Prunus yedoensis Matsum., which is native and endemic to Jeju Island, Korea, is reported in this study. The genome size is 157 786 bp in length with 36.7% GC content, which is composed of LSC region of 85 908 bp, SSC region of 19 120 bp and two IR copies of 26 379 bp each. The cp genome contains 131 genes, including 86 coding genes, 8 rRNA genes and 37 tRNA genes. The maximum likelihood analysis was conducted to verify a phylogenetic position of the newly sequenced cp genome of P. yedoensis using 11 representatives of complete cp genome sequences within the family Rosaceae. The genus Prunus exhibited monophyly and the result of the phylogenetic relationship agreed with the previous phylogenetic analyses within Rosaceae.

  4. Islands of non-essential genes, including a DNA translocation operon, in the genome of bacteriophage 0305ϕ8-36

    PubMed Central

    Pathria, Saurav; Rolando, Mandy; Lieman, Karen; Hayes, Shirley; Hardies, Stephen; Serwer, Philip

    2012-01-01

    We investigate genes of lytic, Bacillus thuringiensis bacteriophage 0305ϕ8-36 that are non-essential for laboratory propagation, but might have a function in the wild. We isolate deletion mutants to identify these genes. The non-permutation of the genome (218.948 Kb, with a 6.479 Kb terminal repeat and 247 identified orfs) simplifies isolation of deletion mutants. We find two islands of non-essential genes. The first island (3.01% of the genomic DNA) has an informatically identified DNA translocation operon. Deletion causes no detectable growth defect during propagation in a dilute agarose overlay. Identification of the DNA translocation operon begins with a DNA relaxase and continues with a translocase and membrane-binding anchor proteins. The relaxase is in a family, first identified here, with homologs in other bacteriophages. The second deleted island (3.71% of the genome) has genes for two metallo-protein chaperonins and two tRNAs. Deletion causes a significant growth defect. In addition, (1) we find by “in situ” (in-plaque) single-particle fluorescence microscopy that adsorption to the host occurs at the tip of the 486 nm long tail, (2) we develop a procedure of 0305ϕ8-36 purification that does not cause tail contraction, and (3) we then find by electron microscopy that 0305ϕ8-36 undergoes tail tip-tail tip dimerization that potentially blocks adsorption to host cells, presumably with effectiveness that increases as the bacteriophage particle concentration increases. These observations provide an explanation of the previous observation that 0305ϕ8-36 does not lyse liquid cultures, even though 0305ϕ8-36 is genomically lytic. PMID:22666654

  5. Genetics of Genome-Wide Recombination Rate Evolution in Mice from an Isolated Island.

    PubMed

    Wang, Richard J; Payseur, Bret A

    2017-08-01

    Recombination rate is a heritable quantitative trait that evolves despite the fundamentally conserved role that recombination plays in meiosis. Differences in recombination rate can alter the landscape of the genome and the genetic diversity of populations. Yet our understanding of the genetic basis of recombination rate evolution in nature remains limited. We used wild house mice ( Mus musculus domesticus ) from Gough Island (GI), which diverged recently from their mainland counterparts, to characterize the genetics of recombination rate evolution. We quantified genome-wide autosomal recombination rates by immunofluorescence cytology in spermatocytes from 240 F 2 males generated from intercrosses between GI-derived mice and the wild-derived inbred strain WSB/EiJ. We identified four quantitative trait loci (QTL) responsible for inter-F 2 variation in this trait, the strongest of which had effects that opposed the direction of the parental trait differences. Candidate genes and mutations for these QTL were identified by overlapping the detected intervals with whole-genome sequencing data and publicly available transcriptomic profiles from spermatocytes. Combined with existing studies, our findings suggest that genome-wide recombination rate divergence is not directional and its evolution within and between subspecies proceeds from distinct genetic loci. Copyright © 2017 by the Genetics Society of America.

  6. Isolation by environment in White-breasted Nuthatches (Sitta carolinensis) of the Madrean Archipelago sky islands: a landscape genomics approach.

    PubMed

    Manthey, Joseph D; Moyle, Robert G

    2015-07-01

    Understanding landscape processes driving patterns of population genetic differentiation and diversity has been a long-standing focus of ecology and evolutionary biology. Gene flow may be reduced by historical, ecological or geographic factors, resulting in patterns of isolation by distance (IBD) or isolation by environment (IBE). Although IBE has been found in many natural systems, most studies investigating patterns of IBD and IBE in nature have used anonymous neutral genetic markers, precluding inference of selection mechanisms or identification of genes potentially under selection. Using landscape genomics, the simultaneous study of genomic and ecological landscapes, we investigated the processes driving population genetic patterns of White-breasted Nuthatches (Sitta carolinensis) in sky islands (montane forest habitat islands) of the Madrean Archipelago. Using more than 4000 single nucleotide polymorphisms and multiple tests to investigate the relationship between genetic differentiation and geographic or ecological distance, we identified IBE, and a lack of IBD, among sky island populations of S. carolinensis. Using three tests to identify selection, we found 79 loci putatively under selection; of these, seven matched CDS regions in the Zebra Finch. The loci under selection were highly associated with climate extremes (maximum temperature of warmest month and minimum precipitation of driest month). These results provide evidence for IBE - disentangled from IBD - in sky island vertebrates and identify potential adaptive genetic variation. © 2015 John Wiley & Sons Ltd.

  7. Whole genome sequencing of a banana wild relative Musa itinerans provides insights into lineage-specific diversification of the Musa genus

    PubMed Central

    Wu, Wei; Yang, Yu-Lan; He, Wei-Ming; Rouard, Mathieu; Li, Wei-Ming; Xu, Meng; Roux, Nicolas; Ge, Xue-Jun

    2016-01-01

    Crop wild relatives are valuable resources for future genetic improvement. Here, we report the de novo genome assembly of Musa itinerans, a disease-resistant wild banana relative in subtropical China. The assembled genome size was 462.1 Mb, covering 75.2% of the genome (615.2Mb) and containing 32, 456 predicted protein-coding genes. Since the approximate divergence around 5.8 million years ago, the genomes of Musa itinerans and Musa acuminata have shown conserved collinearity. Gene family expansions and contractions enrichment analysis revealed that some pathways were associated with phenotypic or physiological innovations. These include a transition from wood to herbaceous in the ancestral Musaceae, intensification of cold and drought tolerances, and reduced diseases resistance genes for subtropical marginally distributed Musa species. Prevalent purifying selection and transposed duplications were found to facilitate the diversification of NBS-encoding gene families for two Musa species. The population genome history analysis of M. itinerans revealed that the fluctuated population sizes were caused by the Pleistocene climate oscillations, and that the formation of Qiongzhou Strait might facilitate the population downsizing on the isolated Hainan Island about 10.3 Kya. The qualified assembly of the M. itinerans genome provides deep insights into the lineage-specific diversification and also valuable resources for future banana breeding. PMID:27531320

  8. Whole genome sequencing of a banana wild relative Musa itinerans provides insights into lineage-specific diversification of the Musa genus.

    PubMed

    Wu, Wei; Yang, Yu-Lan; He, Wei-Ming; Rouard, Mathieu; Li, Wei-Ming; Xu, Meng; Roux, Nicolas; Ge, Xue-Jun

    2016-08-17

    Crop wild relatives are valuable resources for future genetic improvement. Here, we report the de novo genome assembly of Musa itinerans, a disease-resistant wild banana relative in subtropical China. The assembled genome size was 462.1 Mb, covering 75.2% of the genome (615.2Mb) and containing 32, 456 predicted protein-coding genes. Since the approximate divergence around 5.8 million years ago, the genomes of Musa itinerans and Musa acuminata have shown conserved collinearity. Gene family expansions and contractions enrichment analysis revealed that some pathways were associated with phenotypic or physiological innovations. These include a transition from wood to herbaceous in the ancestral Musaceae, intensification of cold and drought tolerances, and reduced diseases resistance genes for subtropical marginally distributed Musa species. Prevalent purifying selection and transposed duplications were found to facilitate the diversification of NBS-encoding gene families for two Musa species. The population genome history analysis of M. itinerans revealed that the fluctuated population sizes were caused by the Pleistocene climate oscillations, and that the formation of Qiongzhou Strait might facilitate the population downsizing on the isolated Hainan Island about 10.3 Kya. The qualified assembly of the M. itinerans genome provides deep insights into the lineage-specific diversification and also valuable resources for future banana breeding.

  9. A distinct and divergent lineage of genomic island-associated Type IV Secretion Systems in Legionella.

    PubMed

    Wee, Bryan A; Woolfit, Megan; Beatson, Scott A; Petty, Nicola K

    2013-01-01

    Legionella encodes multiple classes of Type IV Secretion Systems (T4SSs), including the Dot/Icm protein secretion system that is essential for intracellular multiplication in amoebal and human hosts. Other T4SSs not essential for virulence are thought to facilitate the acquisition of niche-specific adaptation genes including the numerous effector genes that are a hallmark of this genus. Previously, we identified two novel gene clusters in the draft genome of Legionella pneumophila strain 130b that encode homologues of a subtype of T4SS, the genomic island-associated T4SS (GI-T4SS), usually associated with integrative and conjugative elements (ICE). In this study, we performed genomic analyses of 14 homologous GI-T4SS clusters found in eight publicly available Legionella genomes and show that this cluster is unusually well conserved in a region of high plasticity. Phylogenetic analyses show that Legionella GI-T4SSs are substantially divergent from other members of this subtype of T4SS and represent a novel clade of GI-T4SSs only found in this genus. The GI-T4SS was found to be under purifying selection, suggesting it is functional and may play an important role in the evolution and adaptation of Legionella. Like other GI-T4SSs, the Legionella clusters are also associated with ICEs, but lack the typical integration and replication modules of related ICEs. The absence of complete replication and DNA pre-processing modules, together with the presence of Legionella-specific regulatory elements, suggest the Legionella GI-T4SS-associated ICE is unique and may employ novel mechanisms of regulation, maintenance and excision. The Legionella GI-T4SS cluster was found to be associated with several cargo genes, including numerous antibiotic resistance and virulence factors, which may confer a fitness benefit to the organism. The in-silico characterisation of this new T4SS furthers our understanding of the diversity of secretion systems involved in the frequent horizontal gene

  10. A Distinct and Divergent Lineage of Genomic Island-Associated Type IV Secretion Systems in Legionella

    PubMed Central

    Wee, Bryan A.; Woolfit, Megan; Beatson, Scott A.; Petty, Nicola K.

    2013-01-01

    Legionella encodes multiple classes of Type IV Secretion Systems (T4SSs), including the Dot/Icm protein secretion system that is essential for intracellular multiplication in amoebal and human hosts. Other T4SSs not essential for virulence are thought to facilitate the acquisition of niche-specific adaptation genes including the numerous effector genes that are a hallmark of this genus. Previously, we identified two novel gene clusters in the draft genome of Legionella pneumophila strain 130b that encode homologues of a subtype of T4SS, the genomic island-associated T4SS (GI-T4SS), usually associated with integrative and conjugative elements (ICE). In this study, we performed genomic analyses of 14 homologous GI-T4SS clusters found in eight publicly available Legionella genomes and show that this cluster is unusually well conserved in a region of high plasticity. Phylogenetic analyses show that Legionella GI-T4SSs are substantially divergent from other members of this subtype of T4SS and represent a novel clade of GI-T4SSs only found in this genus. The GI-T4SS was found to be under purifying selection, suggesting it is functional and may play an important role in the evolution and adaptation of Legionella. Like other GI-T4SSs, the Legionella clusters are also associated with ICEs, but lack the typical integration and replication modules of related ICEs. The absence of complete replication and DNA pre-processing modules, together with the presence of Legionella-specific regulatory elements, suggest the Legionella GI-T4SS-associated ICE is unique and may employ novel mechanisms of regulation, maintenance and excision. The Legionella GI-T4SS cluster was found to be associated with several cargo genes, including numerous antibiotic resistance and virulence factors, which may confer a fitness benefit to the organism. The in-silico characterisation of this new T4SS furthers our understanding of the diversity of secretion systems involved in the frequent horizontal gene

  11. Pan-cancer stratification of solid human epithelial tumors and cancer cell lines reveals commonalities and tissue-specific features of the CpG island methylator phenotype.

    PubMed

    Sánchez-Vega, Francisco; Gotea, Valer; Margolin, Gennady; Elnitski, Laura

    2015-01-01

    The term CpG island methylator phenotype (CIMP) has been used to describe widespread DNA hypermethylation at CpG-rich genomic regions affecting clinically distinct subsets of cancer patients. Even though there have been numerous studies of CIMP in individual cancer types, a uniform analysis across tissues is still lacking. We analyze genome-wide patterns of CpG island hypermethylation in 5,253 solid epithelial tumors from 15 cancer types from TCGA and 23 cancer cell lines from ENCODE. We identify differentially methylated loci that define CIMP+ and CIMP- samples, and we use unsupervised clustering to provide a robust molecular stratification of tumor methylomes for 12 cancer types and all cancer cell lines. With a minimal set of 89 discriminative loci, we demonstrate accurate pan-cancer separation of the 12 CIMP+/- subpopulations, based on their average levels of methylation. Tumor samples in different CIMP subclasses show distinctive correlations with gene expression profiles and recurrence of somatic mutations, copy number variations, and epigenetic silencing. Enrichment analyses indicate shared canonical pathways and upstream regulators for CIMP-targeted regions across cancer types. Furthermore, genomic alterations showing consistent associations with CIMP+/- status include genes involved in DNA repair, chromatin remodeling genes, and several histone methyltransferases. Associations of CIMP status with specific clinical features, including overall survival in several cancer types, highlight the importance of the CIMP+/- designation for individual tumor evaluation and personalized medicine. We present a comprehensive computational study of CIMP that reveals pan-cancer commonalities and tissue-specific differences underlying concurrent hypermethylation of CpG islands across tumors. Our stratification of solid tumors and cancer cell lines based on CIMP status is data-driven and agnostic to tumor type by design, which protects against known biases that have hindered

  12. Pyrosequencing-based comparative genome analysis of the nosocomial pathogen Enterococcus faecium and identification of a large transferable pathogenicity island

    PubMed Central

    2010-01-01

    Background The Gram-positive bacterium Enterococcus faecium is an important cause of nosocomial infections in immunocompromized patients. Results We present a pyrosequencing-based comparative genome analysis of seven E. faecium strains that were isolated from various sources. In the genomes of clinical isolates several antibiotic resistance genes were identified, including the vanA transposon that confers resistance to vancomycin in two strains. A functional comparison between E. faecium and the related opportunistic pathogen E. faecalis based on differences in the presence of protein families, revealed divergence in plant carbohydrate metabolic pathways and oxidative stress defense mechanisms. The E. faecium pan-genome was estimated to be essentially unlimited in size, indicating that E. faecium can efficiently acquire and incorporate exogenous DNA in its gene pool. One of the most prominent sources of genomic diversity consists of bacteriophages that have integrated in the genome. The CRISPR-Cas system, which contributes to immunity against bacteriophage infection in prokaryotes, is not present in the sequenced strains. Three sequenced isolates carry the esp gene, which is involved in urinary tract infections and biofilm formation. The esp gene is located on a large pathogenicity island (PAI), which is between 64 and 104 kb in size. Conjugation experiments showed that the entire esp PAI can be transferred horizontally and inserts in a site-specific manner. Conclusions Genes involved in environmental persistence, colonization and virulence can easily be aquired by E. faecium. This will make the development of successful treatment strategies targeted against this organism a challenge for years to come. PMID:20398277

  13. Pathogenicity Island-Directed Transfer of Unlinked Chromosomal Virulence Genes

    PubMed Central

    Chen, John; Ram, Geeta; Penadés, José R.; Brown, Stuart; Novick, Richard P.

    2014-01-01

    Summary In recent decades, the notorious pathogen Staphylococcus aureus has become progressively more contagious, more virulent and more resistant to antibiotics. This implies a rather dynamic evolutionary capability, representing a remarkable level of genomic plasticity, most probably maintained by horizontal gene transfer. Here we report that the staphylococcal pathogenicity islands have a dual role in gene transfer: they not only mediate their own transfer, but they can independently direct the transfer of unlinked chromosomal segments containing virulence genes. While transfer of the island itself requires specific helper phages, transfer of unlinked chromosomal segments does not, so that potentially any pac-type phage will serve. These results reveal that SaPIs can increase the horizontal exchange of accessory genes associated with disease, and may shape pathogen genomes beyond the confines of their attachment sites. PMID:25498143

  14. Strain/species identification in metagenomes using genome-specific markers

    PubMed Central

    Tu, Qichao; He, Zhili; Zhou, Jizhong

    2014-01-01

    Shotgun metagenome sequencing has become a fast, cheap and high-throughput technology for characterizing microbial communities in complex environments and human body sites. However, accurate identification of microorganisms at the strain/species level remains extremely challenging. We present a novel k-mer-based approach, termed GSMer, that identifies genome-specific markers (GSMs) from currently sequenced microbial genomes, which were then used for strain/species-level identification in metagenomes. Using 5390 sequenced microbial genomes, 8 770 321 50-mer strain-specific and 11 736 360 species-specific GSMs were identified for 4088 strains and 2005 species (4933 strains), respectively. The GSMs were first evaluated against mock community metagenomes, recently sequenced genomes and real metagenomes from different body sites, suggesting that the identified GSMs were specific to their targeting genomes. Sensitivity evaluation against synthetic metagenomes with different coverage suggested that 50 GSMs per strain were sufficient to identify most microbial strains with ≥0.25× coverage, and 10% of selected GSMs in a database should be detected for confident positive callings. Application of GSMs identified 45 and 74 microbial strains/species significantly associated with type 2 diabetes patients and obese/lean individuals from corresponding gastrointestinal tract metagenomes, respectively. Our result agreed with previous studies but provided strain-level information. The approach can be directly applied to identify microbial strains/species from raw metagenomes, without the effort of complex data pre-processing. PMID:24523352

  15. Methods for Optimizing CRISPR-Cas9 Genome Editing Specificity

    PubMed Central

    Tycko, Josh; Myer, Vic E.; Hsu, Patrick D.

    2016-01-01

    Summary Advances in the development of delivery, repair, and specificity strategies for the CRISPR-Cas9 genome engineering toolbox are helping researchers understand gene function with unprecedented precision and sensitivity. CRISPR-Cas9 also holds enormous therapeutic potential for the treatment of genetic disorders by directly correcting disease-causing mutations. Although the Cas9 protein has been shown to bind and cleave DNA at off-target sites, the field of Cas9 specificity is rapidly progressing with marked improvements in guide RNA selection, protein and guide engineering, novel enzymes, and off-target detection methods. We review important challenges and breakthroughs in the field as a comprehensive practical guide to interested users of genome editing technologies, highlighting key tools and strategies for optimizing specificity. The genome editing community should now strive to standardize such methods for measuring and reporting off-target activity, while keeping in mind that the goal for specificity should be continued improvement and vigilance. PMID:27494557

  16. Symbiosis Island Shuffling with Abundant Insertion Sequences in the Genomes of Extra-Slow-Growing Strains of Soybean Bradyrhizobia

    PubMed Central

    Iida, Takayuki; Itakura, Manabu; Anda, Mizue; Sugawara, Masayuki; Isawa, Tsuyoshi; Okubo, Takashi; Sato, Shusei; Chiba-Kakizaki, Kaori

    2015-01-01

    Extra-slow-growing bradyrhizobia from root nodules of field-grown soybeans harbor abundant insertion sequences (ISs) and are termed highly reiterated sequence-possessing (HRS) strains. We analyzed the genome organization of HRS strains with the focus on IS distribution and symbiosis island structure. Using pulsed-field gel electrophoresis, we consistently detected several plasmids (0.07 to 0.4 Mb) in the HRS strains (NK5, NK6, USDA135, 2281, USDA123, and T2), whereas no plasmids were detected in the non-HRS strain USDA110. The chromosomes of the six HRS strains (9.7 to 10.7 Mb) were larger than that of USDA110 (9.1 Mb). Using MiSeq sequences of 6 HRS and 17 non-HRS strains mapped to the USDA110 genome, we found that the copy numbers of ISRj1, ISRj2, ISFK1, IS1632, ISB27, ISBj8, and IS1631 were markedly higher in HRS strains. Whole-genome sequencing showed that the HRS strain NK6 had four small plasmids (136 to 212 kb) and a large chromosome (9,780 kb). Strong colinearity was found between 7.4-Mb core regions of the NK6 and USDA110 chromosomes. USDA110 symbiosis islands corresponded mainly to five small regions (S1 to S5) within two variable regions, V1 (0.8 Mb) and V2 (1.6 Mb), of the NK6 chromosome. The USDA110 nif gene cluster (nifDKENXSBZHQW-fixBCX) was split into two regions, S2 and S3, where ISRj1-mediated rearrangement occurred between nifS and nifB. ISs were also scattered in NK6 core regions, and ISRj1 insertion often disrupted some genes important for survival and environmental responses. These results suggest that HRS strains of soybean bradyrhizobia were subjected to IS-mediated symbiosis island shuffling and core genome degradation. PMID:25862225

  17. GSP: A web-based platform for designing genome-specific primers in polyploids

    USDA-ARS?s Scientific Manuscript database

    The sequences among subgenomes in a polyploid species have high similarity. This makes difficult to design genome-specific primers for sequence analysis. We present a web-based platform named GSP for designing genome-specific primers to distinguish subgenome sequences in the polyploid genome backgr...

  18. Prediction of CpG-island function: CpG clustering vs. sliding-window methods

    PubMed Central

    2010-01-01

    Background Unmethylated stretches of CpG dinucleotides (CpG islands) are an outstanding property of mammal genomes. Conventionally, these regions are detected by sliding window approaches using %G + C, CpG observed/expected ratio and length thresholds as main parameters. Recently, clustering methods directly detect clusters of CpG dinucleotides as a statistical property of the genome sequence. Results We compare sliding-window to clustering (i.e. CpGcluster) predictions by applying new ways to detect putative functionality of CpG islands. Analyzing the co-localization with several genomic regions as a function of window size vs. statistical significance (p-value), CpGcluster shows a higher overlap with promoter regions and highly conserved elements, at the same time showing less overlap with Alu retrotransposons. The major difference in the prediction was found for short islands (CpG islets), often exclusively predicted by CpGcluster. Many of these islets seem to be functional, as they are unmethylated, highly conserved and/or located within the promoter region. Finally, we show that window-based islands can spuriously overlap several, differentially regulated promoters as well as different methylation domains, which might indicate a wrong merge of several CpG islands into a single, very long island. The shorter CpGcluster islands seem to be much more specific when concerning the overlap with alternative transcription start sites or the detection of homogenous methylation domains. Conclusions The main difference between sliding-window approaches and clustering methods is the length of the predicted islands. Short islands, often differentially methylated, are almost exclusively predicted by CpGcluster. This suggests that CpGcluster may be the algorithm of choice to explore the function of these short, but putatively functional CpG islands. PMID:20500903

  19. Genome sequence of Bradyrhizobium sp. WSM1253; a microsymbiont of Ornithopus compressus from the Greek Island of Sifnos

    DOE Office of Scientific and Technical Information (OSTI.GOV)

    Tiwari, Ravi; Howieson, John; Yates, Ron

    Bradyrhizobium sp. WSM1253 is a novel N 2-fixing bacterium isolated from a root nodule of the herbaceous annual legume Ornithopus compressus that was growing on the Greek Island of Sifnos. WSM1253 emerged as a strain of interest in an Australian program that was selecting inoculant quality bradyrhizobial strains for inoculation of Mediterranean species of lupins ( Lupinus angustifolius, L. princei, L. atlanticus, L. pilosus ). In this report we describe, for the first time, the genome sequence information and annotation of this legume microsymbiont. The 8,719,808 bp genome has a G + C content of 63.09 % with 71 contigsmore » arranged into two scaffolds. The assembled genome contains 8,432 protein-coding genes, 66 RNA genes and a single rRNA operon. In conclusion, this improved-high-quality draft rhizobial genome is one of 20 sequenced through a DOE Joint Genome Institute 2010 Community Sequencing Project.« less

  20. Genome sequence of Bradyrhizobium sp. WSM1253; a microsymbiont of Ornithopus compressus from the Greek Island of Sifnos

    DOE PAGES

    Tiwari, Ravi; Howieson, John; Yates, Ron; ...

    2015-11-30

    Bradyrhizobium sp. WSM1253 is a novel N 2-fixing bacterium isolated from a root nodule of the herbaceous annual legume Ornithopus compressus that was growing on the Greek Island of Sifnos. WSM1253 emerged as a strain of interest in an Australian program that was selecting inoculant quality bradyrhizobial strains for inoculation of Mediterranean species of lupins ( Lupinus angustifolius, L. princei, L. atlanticus, L. pilosus ). In this report we describe, for the first time, the genome sequence information and annotation of this legume microsymbiont. The 8,719,808 bp genome has a G + C content of 63.09 % with 71 contigsmore » arranged into two scaffolds. The assembled genome contains 8,432 protein-coding genes, 66 RNA genes and a single rRNA operon. In conclusion, this improved-high-quality draft rhizobial genome is one of 20 sequenced through a DOE Joint Genome Institute 2010 Community Sequencing Project.« less

  1. High-quality draft genome sequence of Ensifer meliloti Mlalz-1, a microsymbiont of Medicago laciniata (L.) miller collected in Lanzarote, Canary Islands, Spain.

    PubMed

    Osman, Wan Adnawani Meor; van Berkum, Peter; León-Barrios, Milagros; Velázquez, Encarna; Elia, Patrick; Tian, Rui; Ardley, Julie; Gollagher, Margaret; Seshadri, Rekha; Reddy, T B K; Ivanova, Natalia; Woyke, Tanja; Pati, Amrita; Markowitz, Victor; Baeshen, Mohamed N; Baeshen, Naseebh Nabeeh; Kyrpides, Nikos; Reeve, Wayne

    2017-01-01

    10.1601/nm.1335 Mlalz-1 (INSDC = ATZD00000000) is an aerobic, motile, Gram-negative, non-spore-forming rod that was isolated from an effective nitrogen-fixing nodule of Medicago laciniata (L.) Miller from a soil sample collected near the town of Guatiza on the island of Lanzarote, the Canary Islands, Spain. This strain nodulates and forms an effective symbiosis with the highly specific host M. laciniata . This rhizobial genome was sequenced as part of the DOE Joint Genome Institute 2010 Genomic Encyclopedia for Bacteria and Archaea-Root Nodule Bacteria (GEBA-RNB) sequencing project. Here the features of 10.1601/nm.1335 Mlalz-1 are described, together with high-quality permanent draft genome sequence information and annotation. The 6,664,116 bp high-quality draft genome is arranged in 99 scaffolds of 100 contigs, containing 6314 protein-coding genes and 74 RNA-only encoding genes. Strain Mlalz-1 is closely related to 10.1601/nm.1335 10.1601/strainfinder?urlappend=%3Fid%3DIAM+12611 T , 10.1601/nm.1334 A 321 T and 10.1601/nm.17831 10.1601/strainfinder?urlappend=%3Fid%3DORS+1407 T , based on 16S rRNA gene sequences. gANI values of ≥98.1% support the classification of strain Mlalz-1 as 10.1601/nm.1335. Nodulation of M. laciniata requires a specific nodC allele, and the nodC gene of strain Mlalz-1 shares ≥98% sequence identity with nodC of M. laciniata -nodulating 10.1601/nm.1328 strains, but ≤93% with nodC of 10.1601/nm.1328 strains that nodulate other Medicago species. Strain Mlalz-1 is unique among sequenced 10.1601/nm.1335 strains in possessing genes encoding components of a T2SS and in having two versions of the adaptive acid tolerance response lpiA-acvB operon. In 10.1601/nm.1334 strain 10.1601/strainfinder?urlappend=%3Fid%3DWSM+419, lpiA is essential for enhancing survival in lethal acid conditions. The second copy of the lpiA-acvB operon of strain Mlalz-1 has highest sequence identity (> 96%) with that of 10.1601/nm.1334 strains, which suggests genetic

  2. Alignment-free genome tree inference by learning group-specific distance metrics.

    PubMed

    Patil, Kaustubh R; McHardy, Alice C

    2013-01-01

    Understanding the evolutionary relationships between organisms is vital for their in-depth study. Gene-based methods are often used to infer such relationships, which are not without drawbacks. One can now attempt to use genome-scale information, because of the ever increasing number of genomes available. This opportunity also presents a challenge in terms of computational efficiency. Two fundamentally different methods are often employed for sequence comparisons, namely alignment-based and alignment-free methods. Alignment-free methods rely on the genome signature concept and provide a computationally efficient way that is also applicable to nonhomologous sequences. The genome signature contains evolutionary signal as it is more similar for closely related organisms than for distantly related ones. We used genome-scale sequence information to infer taxonomic distances between organisms without additional information such as gene annotations. We propose a method to improve genome tree inference by learning specific distance metrics over the genome signature for groups of organisms with similar phylogenetic, genomic, or ecological properties. Specifically, our method learns a Mahalanobis metric for a set of genomes and a reference taxonomy to guide the learning process. By applying this method to more than a thousand prokaryotic genomes, we showed that, indeed, better distance metrics could be learned for most of the 18 groups of organisms tested here. Once a group-specific metric is available, it can be used to estimate the taxonomic distances for other sequenced organisms from the group. This study also presents a large scale comparison between 10 methods--9 alignment-free and 1 alignment-based.

  3. Intersectional gene flow between insular endemics of Ilex (Aquifoliaceae) on the Bonin Islands and the Ryukyu Islands.

    PubMed

    Setoguchi, H; Watanabe, I

    2000-06-01

    Hybridization and introgression play important roles in plant evolution, and their occurrence on the oceanic islands provides good examples of plant speciation and diversification. Restriction fragment length polymorphisms (RFLPs) and trnL (UAA) 3'exon-trnF (GAA) intergenic spacer (IGS) sequences of chloroplast DNA (cpDNA), and the sequences of internal transcribed spacer (ITS) of nuclear ribosomal DNA were examined to investigate the occurrence of gene transfer in Ilex species on the Bonin Islands and the Ryukyu Islands in Japan. A gene phylogeny for the plastid genome is in agreement with the morphologically based taxonomy, whereas the nuclear genome phylogeny clusters putatively unrelated endemics both on the Bonin and the Ryukyu Islands. Intersectional hybridization and nuclear gene flow were independently observed in insular endemics of Ilex on both sets of islands without evidence of plastid introgression. Gene flow observed in these island systems can be explained by ecological features of insular endemics, i.e., limits of distribution range or sympatric distribution in a small land area.

  4. Genomic Diversity of Burkholderia pseudomallei Clinical Isolates: Subtractive Hybridization Reveals a Burkholderia mallei-Specific Propage in B. pseudomallei 1026b

    DTIC Science & Technology

    2004-06-01

    identification of several new virulence gene candidates. In particular, K96243 harbors multiple genomic islands with relatively low GC contents, suggesting...coli, Streptococcus pyogenes, Staphylococcus aureus, S. enterica, and Xylella fastidiosa (11, 16, 17). The genomic sequencing results for multiple... virulence genes by subtractive hybridization: identifica- tion of capsular polysaccharide of Burkholderia pseudomallei as a major virulence determinant

  5. The minimum information about a genome sequence (MIGS) specification

    PubMed Central

    Field, Dawn; Garrity, George; Gray, Tanya; Morrison, Norman; Selengut, Jeremy; Sterk, Peter; Tatusova, Tatiana; Thomson, Nicholas; Allen, Michael J; Angiuoli, Samuel V; Ashburner, Michael; Axelrod, Nelson; Baldauf, Sandra; Ballard, Stuart; Boore, Jeffrey; Cochrane, Guy; Cole, James; Dawyndt, Peter; De Vos, Paul; dePamphilis, Claude; Edwards, Robert; Faruque, Nadeem; Feldman, Robert; Gilbert, Jack; Gilna, Paul; Glöckner, Frank Oliver; Goldstein, Philip; Guralnick, Robert; Haft, Dan; Hancock, David; Hermjakob, Henning; Hertz-Fowler, Christiane; Hugenholtz, Phil; Joint, Ian; Kagan, Leonid; Kane, Matthew; Kennedy, Jessie; Kowalchuk, George; Kottmann, Renzo; Kolker, Eugene; Kravitz, Saul; Kyrpides, Nikos; Leebens-Mack, Jim; Lewis, Suzanna E; Li, Kelvin; Lister, Allyson L; Lord, Phillip; Maltsev, Natalia; Markowitz, Victor; Martiny, Jennifer; Methe, Barbara; Mizrachi, Ilene; Moxon, Richard; Nelson, Karen; Parkhill, Julian; Proctor, Lita; White, Owen; Sansone, Susanna-Assunta; Spiers, Andrew; Stevens, Robert; Swift, Paul; Taylor, Chris; Tateno, Yoshio; Tett, Adrian; Turner, Sarah; Ussery, David; Vaughan, Bob; Ward, Naomi; Whetzel, Trish; Gil, Ingio San; Wilson, Gareth; Wipat, Anil

    2008-01-01

    With the quantity of genomic data increasing at an exponential rate, it is imperative that these data be captured electronically, in a standard format. Standardization activities must proceed within the auspices of open-access and international working bodies. To tackle the issues surrounding the development of better descriptions of genomic investigations, we have formed the Genomic Standards Consortium (GSC). Here, we introduce the minimum information about a genome sequence (MIGS) specification with the intent of promoting participation in its development and discussing the resources that will be required to develop improved mechanisms of metadata capture and exchange. As part of its wider goals, the GSC also supports improving the ‘transparency’ of the information contained in existing genomic databases. PMID:18464787

  6. Characterization of Genomic Island 3 and Genetic Variability of Chilean Field Strains of Brucella abortus▿

    PubMed Central

    Céspedes, Sandra; Salgado, Paulina; Valenzuela, Patricio; Vidal, Roberto; Oñate, Angel A.

    2011-01-01

    One of the capabilities developed by bacteria is the ability to gain large fragments of DNA from other bacteria or to lose portions of their own genomes. Among these exchangeable fragments are the genomic islands (GIs). Nine GIs have been identified in Brucella, and genomic island 3 (GI-3) is shared by two pathogenic species, B. melitensis and B. abortus. GI-3 encodes mostly unknown proteins. One of the aims of this study was to perform pulsed-field gel electrophoresis (PFGE) on field isolates of B. abortus from Chile to determine whether these isolates are clonally related. Furthermore, we focused on the characterization of GI-3, studying its organization and the genetic conservation of the GI-3 sequence using techniques such as tiling-path PCR (TP-PCR) and restriction fragment length polymorphism-PCR (RFLP-PCR). Our results, after PFGE was performed on 69 field isolates of B. abortus from Chile, showed that the strains were genetically homogeneous. To increase the power of genetic discrimination among these strains, we used multiple locus variable-number tandem-repeat (VNTR) analysis with 16 loci (MLVA-16). The results obtained by MLVA-16 showed that the strains of B. abortus were genetically heterogeneous and that most of them clustered according to their geographic origin. Of the genetic loci studied, panel 2B was the one describing the highest diversity in the analysis, as well as locus Bruce19 in panel 2A. In relation to the study of GI-3, our experimental analysis by TP-PCR identified and confirmed that GI-3 is present in all wild strains of B. abortus, demonstrating the high stability of gene cluster GI-3 in Chilean field strains. PMID:21543580

  7. Unique DNA methylome profiles in CpG island methylator phenotype colon cancers

    PubMed Central

    Xu, Yaomin; Hu, Bo; Choi, Ae-Jin; Gopalan, Banu; Lee, Byron H.; Kalady, Matthew F.; Church, James M.; Ting, Angela H.

    2012-01-01

    A subset of colorectal cancers was postulated to have the CpG island methylator phenotype (CIMP), a higher propensity for CpG island DNA methylation. The validity of CIMP, its molecular basis, and its prognostic value remain highly controversial. Using MBD-isolated genome sequencing, we mapped and compared genome-wide DNA methylation profiles of normal, non-CIMP, and CIMP colon specimens. Multidimensional scaling analysis revealed that each specimen could be clearly classified as normal, non-CIMP, and CIMP, thus signifying that these three groups have distinctly different global methylation patterns. We discovered 3780 sites in various genomic contexts that were hypermethylated in both non-CIMP and CIMP colon cancers when compared with normal colon. An additional 2026 sites were found to be hypermethylated in CIMP tumors only; and importantly, 80% of these sites were located in CpG islands. These data demonstrate on a genome-wide level that the additional hypermethylation seen in CIMP tumors occurs almost exclusively at CpG islands and support definitively that these tumors were appropriately named. When these sites were examined more closely, we found that 25% were adjacent to sites that were also hypermethylated in non-CIMP tumors. Thus, CIMP is also characterized by more extensive methylation of sites that are already prone to be hypermethylated in colon cancer. These observations indicate that CIMP tumors have specific defects in controlling both DNA methylation seeding and spreading and serve as an important first step in delineating molecular mechanisms that control these processes. PMID:21990380

  8. Stability, Entrapment and Variant Formation of Salmonella Genomic Island 1

    PubMed Central

    Kiss, János; Nagy, Béla; Olasz, Ferenc

    2012-01-01

    Background The Salmonella genomic island 1 (SGI1) is a 42.4 kb integrative mobilizable element containing several antibiotic resistance determinants embedded in a complex integron segment In104. The numerous SGI1 variants identified so far, differ mainly in this segment and the explanations of their emergence were mostly based on comparative structure analyses. Here we provide experimental studies on the stability, entrapment and variant formation of this peculiar gene cluster originally found in S. Typhimurium. Methodology/Principal Findings Segregation and conjugation tests and various molecular techniques were used to detect the emerging SGI1 variants in Salmonella populations of 17 Salmonella enterica serovar Typhimurium DT104 isolates from Hungary. The SGI1s in these isolates proved to be fully competent in excision, conjugal transfer by the IncA/C helper plasmid R55, and integration into the E. coli chromosome. A trap vector has been constructed and successfully applied to capture the island on a plasmid. Monitoring of segregation of SGI1 indicated high stability of the island. SGI1-free segregants did not accumulate during long-term propagation, but several SGI1 variants could be obtained. Most of them appeared to be identical to SGI1-B and SGI1-C, but two new variants caused by deletions via a short-homology-dependent recombination process have also been detected. We have also noticed that the presence of the conjugation helper plasmid increased the formation of these deletion variants considerably. Conclusions/Significance Despite that excision of SGI1 from the chromosome was proven in SGI1+ Salmonella populations, its complete loss could not be observed. On the other hand, we demonstrated that several variants, among them two newly identified ones, arose with detectable frequencies in these populations in a short timescale and their formation was promoted by the helper plasmid. This reflects that IncA/C helper plasmids are not only involved in the

  9. The use of genomic coancestry matrices in the optimisation of contributions to maintain genetic diversity at specific regions of the genome.

    PubMed

    Gómez-Romano, Fernando; Villanueva, Beatriz; Fernández, Jesús; Woolliams, John A; Pong-Wong, Ricardo

    2016-01-13

    Optimal contribution methods have proved to be very efficient for controlling the rates at which coancestry and inbreeding increase and therefore, for maintaining genetic diversity. These methods have usually relied on pedigree information for estimating genetic relationships between animals. However, with the large amount of genomic information now available such as high-density single nucleotide polymorphism (SNP) chips that contain thousands of SNPs, it becomes possible to calculate more accurate estimates of relationships and to target specific regions in the genome where there is a particular interest in maximising genetic diversity. The objective of this study was to investigate the effectiveness of using genomic coancestry matrices for: (1) minimising the loss of genetic variability at specific genomic regions while restricting the overall loss in the rest of the genome; or (2) maximising the overall genetic diversity while restricting the loss of diversity at specific genomic regions. Our study shows that the use of genomic coancestry was very successful at minimising the loss of diversity and outperformed the use of pedigree-based coancestry (genetic diversity even increased in some scenarios). The results also show that genomic information allows a targeted optimisation to maintain diversity at specific genomic regions, whether they are linked or not. The level of variability maintained increased when the targeted regions were closely linked. However, such targeted management leads to an important loss of diversity in the rest of the genome and, thus, it is necessary to take further actions to constrain this loss. Optimal contribution methods also proved to be effective at restricting the loss of diversity in the rest of the genome, although the resulting rate of coancestry was higher than the constraint imposed. The use of genomic matrices when optimising contributions permits the control of genetic diversity and inbreeding at specific regions of the

  10. Pangenome Analysis of Burkholderia pseudomallei: Genome Evolution Preserves Gene Order despite High Recombination Rates.

    PubMed

    Spring-Pearson, Senanu M; Stone, Joshua K; Doyle, Adina; Allender, Christopher J; Okinaka, Richard T; Mayo, Mark; Broomall, Stacey M; Hill, Jessica M; Karavis, Mark A; Hubbard, Kyle S; Insalaco, Joseph M; McNew, Lauren A; Rosenzweig, C Nicole; Gibbons, Henry S; Currie, Bart J; Wagner, David M; Keim, Paul; Tuanyok, Apichai

    2015-01-01

    The pangenomic diversity in Burkholderia pseudomallei is high, with approximately 5.8% of the genome consisting of genomic islands. Genomic islands are known hotspots for recombination driven primarily by site-specific recombination associated with tRNAs. However, recombination rates in other portions of the genome are also high, a feature we expected to disrupt gene order. We analyzed the pangenome of 37 isolates of B. pseudomallei and demonstrate that the pangenome is 'open', with approximately 136 new genes identified with each new genome sequenced, and that the global core genome consists of 4568±16 homologs. Genes associated with metabolism were statistically overrepresented in the core genome, and genes associated with mobile elements, disease, and motility were primarily associated with accessory portions of the pangenome. The frequency distribution of genes present in between 1 and 37 of the genomes analyzed matches well with a model of genome evolution in which 96% of the genome has very low recombination rates but 4% of the genome recombines readily. Using homologous genes among pairs of genomes, we found that gene order was highly conserved among strains, despite the high recombination rates previously observed. High rates of gene transfer and recombination are incompatible with retaining gene order unless these processes are either highly localized to specific sites within the genome, or are characterized by symmetrical gene gain and loss. Our results demonstrate that both processes occur: localized recombination introduces many new genes at relatively few sites, and recombination throughout the genome generates the novel multi-locus sequence types previously observed while preserving gene order.

  11. Carnivore-specific SINEs (Can-SINEs): distribution, evolution, and genomic impact.

    PubMed

    Walters-Conte, Kathryn B; Johnson, Diana L E; Allard, Marc W; Pecon-Slattery, Jill

    2011-01-01

    Short interspersed nuclear elements (SINEs) are a type of class 1 transposable element (retrotransposon) with features that allow investigators to resolve evolutionary relationships between populations and species while providing insight into genome composition and function. Characterization of a Carnivora-specific SINE family, Can-SINEs, has, has aided comparative genomic studies by providing rare genomic changes, and neutral sequence variants often needed to resolve difficult evolutionary questions. In addition, Can-SINEs constitute a significant source of functional diversity with Carnivora. Publication of the whole-genome sequence of domestic dog, domestic cat, and giant panda serves as a valuable resource in comparative genomic inferences gleaned from Can-SINEs. In anticipation of forthcoming studies bolstered by new genomic data, this review describes the discovery and characterization of Can-SINE motifs as well as describes composition, distribution, and effect on genome function. As the contribution of noncoding sequences to genomic diversity becomes more apparent, SINEs and other transposable elements will play an increasingly large role in mammalian comparative genomics.

  12. Carnivore-Specific SINEs (Can-SINEs): Distribution, Evolution, and Genomic Impact

    PubMed Central

    Johnson, Diana L.E.; Allard, Marc W.; Pecon-Slattery, Jill

    2011-01-01

    Short interspersed nuclear elements (SINEs) are a type of class 1 transposable element (retrotransposon) with features that allow investigators to resolve evolutionary relationships between populations and species while providing insight into genome composition and function. Characterization of a Carnivora-specific SINE family, Can-SINEs, has, has aided comparative genomic studies by providing rare genomic changes, and neutral sequence variants often needed to resolve difficult evolutionary questions. In addition, Can-SINEs constitute a significant source of functional diversity with Carnivora. Publication of the whole-genome sequence of domestic dog, domestic cat, and giant panda serves as a valuable resource in comparative genomic inferences gleaned from Can-SINEs. In anticipation of forthcoming studies bolstered by new genomic data, this review describes the discovery and characterization of Can-SINE motifs as well as describes composition, distribution, and effect on genome function. As the contribution of noncoding sequences to genomic diversity becomes more apparent, SINEs and other transposable elements will play an increasingly large role in mammalian comparative genomics. PMID:21846743

  13. Lineage-specific genomics: Frequent birth and death in the human genome: The human genome contains many lineage-specific elements created by both sequence and functional turnover.

    PubMed

    Young, Robert S

    2016-07-01

    Frequent evolutionary birth and death events have created a large quantity of biologically important, lineage-specific DNA within mammalian genomes. The birth and death of DNA sequences is so frequent that the total number of these insertions and deletions in the human population remains unknown, although there are differences between these groups, e.g. transposable elements contribute predominantly to sequence insertion. Functional turnover - where the activity of a locus is specific to one lineage, but the underlying DNA remains conserved - can also drive birth and death. However, this does not appear to be a major driver of divergent transcriptional regulation. Both sequence and functional turnover have contributed to the birth and death of thousands of functional promoters in the human and mouse genomes. These findings reveal the pervasive nature of evolutionary birth and death and suggest that lineage-specific regions may play an important but previously underappreciated role in human biology and disease. © 2016 The Authors BioEssays Published by WILEY Periodicals, Inc.

  14. DNA motifs associated with aberrant CpG island methylation.

    PubMed

    Feltus, F Alex; Lee, Eva K; Costello, Joseph F; Plass, Christoph; Vertino, Paula M

    2006-05-01

    Epigenetic silencing involving the aberrant methylation of promoter region CpG islands is widely recognized as a tumor suppressor silencing mechanism in cancer. However, the molecular pathways underlying aberrant DNA methylation remain elusive. Recently we showed that, on a genome-wide level, CpG island loci differ in their intrinsic susceptibility to aberrant methylation and that this susceptibility can be predicted based on underlying sequence context. These data suggest that there are sequence/structural features that contribute to the protection from or susceptibility to aberrant methylation. Here we use motif elicitation coupled with classification techniques to identify DNA sequence motifs that selectively define methylation-prone or methylation-resistant CpG islands. Motifs common to 28 methylation-prone or 47 methylation-resistant CpG island-containing genomic fragments were determined using the MEME and MAST algorithms (). The five most discriminatory motifs derived from methylation-prone sequences were found to be associated with CpG islands in general and were nonrandomly distributed throughout the genome. In contrast, the eight most discriminatory motifs derived from the methylation-resistant CpG islands were randomly distributed throughout the genome. Interestingly, this latter group tended to associate with Alu and other repetitive sequences. Used together, the frequency of occurrence of these motifs successfully discriminated methylation-prone and methylation-resistant CpG island groups with an accuracy of 87% after 10-fold cross-validation. The motifs identified here are candidate methylation-targeting or methylation-protection DNA sequences.

  15. Inter- and intra-specific pan-genomes of Borrelia burgdorferi sensu lato: genome stability and adaptive radiation

    PubMed Central

    2013-01-01

    Background Lyme disease is caused by spirochete bacteria from the Borrelia burgdorferi sensu lato (B. burgdorferi s.l.) species complex. To reconstruct the evolution of B. burgdorferi s.l. and identify the genomic basis of its human virulence, we compared the genomes of 23 B. burgdorferi s.l. isolates from Europe and the United States, including B. burgdorferi sensu stricto (B. burgdorferi s.s., 14 isolates), B. afzelii (2), B. garinii (2), B. “bavariensis” (1), B. spielmanii (1), B. valaisiana (1), B. bissettii (1), and B. “finlandensis” (1). Results Robust B. burgdorferi s.s. and B. burgdorferi s.l. phylogenies were obtained using genome-wide single-nucleotide polymorphisms, despite recombination. Phylogeny-based pan-genome analysis showed that the rate of gene acquisition was higher between species than within species, suggesting adaptive speciation. Strong positive natural selection drives the sequence evolution of lipoproteins, including chromosomally-encoded genes 0102 and 0404, cp26-encoded ospC and b08, and lp54-encoded dbpA, a07, a22, a33, a53, a65. Computer simulations predicted rapid adaptive radiation of genomic groups as population size increases. Conclusions Intra- and inter-specific pan-genome sizes of B. burgdorferi s.l. expand linearly with phylogenetic diversity. Yet gene-acquisition rates in B. burgdorferi s.l. are among the lowest in bacterial pathogens, resulting in high genome stability and few lineage-specific genes. Genome adaptation of B. burgdorferi s.l. is driven predominantly by copy-number and sequence variations of lipoprotein genes. New genomic groups are likely to emerge if the current trend of B. burgdorferi s.l. population expansion continues. PMID:24112474

  16. Genomic Diversity of Burkholderia pseudomallei Clinical Isolates: Subtractive Hybridization Reveals a Burkholderia mallei-Specific Prophage in B. pseudomallei 1026b

    DTIC Science & Technology

    2004-06-01

    identification of several new virulence gene candidates. In particular, K96243 harbors multiple genomic islands with relatively low GC contents...differences were observed. Prophage-encoded virulence factors in other bacterial species have been described (5), and it was of interest to see if gene ... Xylella fastidiosa (11, 16, 17). The genomic sequencing results for multiple strains of Streptococcus and Xylella suggest that different disease

  17. Transposable element islands facilitate adaptation to novel environments in an invasive species

    PubMed Central

    Schrader, Lukas; Kim, Jay W.; Ence, Daniel; Zimin, Aleksey; Klein, Antonia; Wyschetzki, Katharina; Weichselgartner, Tobias; Kemena, Carsten; Stökl, Johannes; Schultner, Eva; Wurm, Yannick; Smith, Christopher D.; Yandell, Mark; Heinze, Jürgen; Gadau, Jürgen; Oettler, Jan

    2014-01-01

    Adaptation requires genetic variation, but founder populations are generally genetically depleted. Here we sequence two populations of an inbred ant that diverge in phenotype to determine how variability is generated. Cardiocondyla obscurior has the smallest of the sequenced ant genomes and its structure suggests a fundamental role of transposable elements (TEs) in adaptive evolution. Accumulations of TEs (TE islands) comprising 7.18% of the genome evolve faster than other regions with regard to single-nucleotide variants, gene/exon duplications and deletions and gene homology. A non-random distribution of gene families, larvae/adult specific gene expression and signs of differential methylation in TE islands indicate intragenomic differences in regulation, evolutionary rates and coalescent effective population size. Our study reveals a tripartite interplay between TEs, life history and adaptation in an invasive species. PMID:25510865

  18. Experimental evidence supports a sex-specific selective sieve in mitochondrial genome evolution.

    PubMed

    Innocenti, Paolo; Morrow, Edward H; Dowling, Damian K

    2011-05-13

    Mitochondria are maternally transmitted; hence, their genome can only make a direct and adaptive response to selection through females, whereas males represent an evolutionary dead end. In theory, this creates a sex-specific selective sieve, enabling deleterious mutations to accumulate in mitochondrial genomes if they exert male-specific effects. We tested this hypothesis, expressing five mitochondrial variants alongside a standard nuclear genome in Drosophila melanogaster, and found striking sexual asymmetry in patterns of nuclear gene expression. Mitochondrial polymorphism had few effects on nuclear gene expression in females but major effects in males, modifying nearly 10% of transcripts. These were mostly male-biased in expression, with enrichment hotspots in the testes and accessory glands. Our results suggest an evolutionary mechanism that results in mitochondrial genomes harboring male-specific mutation loads.

  19. Limits of variation, specific infectivity, and genome packaging of massively recoded poliovirus genomes.

    PubMed

    Song, Yutong; Gorbatsevych, Oleksandr; Liu, Ying; Mugavero, JoAnn; Shen, Sam H; Ward, Charles B; Asare, Emmanuel; Jiang, Ping; Paul, Aniko V; Mueller, Steffen; Wimmer, Eckard

    2017-10-10

    Computer design and chemical synthesis generated viable variants of poliovirus type 1 (PV1), whose ORF (6,189 nucleotides) carried up to 1,297 "Max" mutations (excess of overrepresented synonymous codon pairs) or up to 2,104 "SD" mutations (randomly scrambled synonymous codons). "Min" variants (excess of underrepresented synonymous codon pairs) are nonviable except for P2 Min , a variant temperature-sensitive at 33 and 39.5 °C. Compared with WT PV1, P2 Min displayed a vastly reduced specific infectivity (si) (WT, 1 PFU/118 particles vs. P2 Min , 1 PFU/35,000 particles), a phenotype that will be discussed broadly. Si of haploid PV presents cellular infectivity of a single genotype. We performed a comprehensive analysis of sequence and structures of the PV genome to determine if evolutionary conserved cis-acting packaging signal(s) were preserved after recoding. We showed that conserved synonymous sites and/or local secondary structures that might play a role in determining packaging specificity do not survive codon pair recoding. This makes it unlikely that numerous "cryptic, sequence-degenerate, dispersed RNA packaging signals mapping along the entire viral genome" [Patel N, et al. (2017) Nat Microbiol 2:17098] play the critical role in poliovirus packaging specificity. Considering all available evidence, we propose a two-step assembly strategy for +ssRNA viruses: step I, acquisition of packaging specificity, either ( a ) by specific recognition between capsid protein(s) and replication proteins (poliovirus), or ( b ) by the high affinity interaction of a single RNA packaging signal (PS) with capsid protein(s) (most +ssRNA viruses so far studied); step II, cocondensation of genome/capsid precursors in which an array of hairpin structures plays a role in virion formation.

  20. Microsatellite Interruptions Stabilize Primate Genomes and Exist as Population-Specific Single Nucleotide Polymorphisms within Individual Human Genomes

    PubMed Central

    Ananda, Guruprasad; Hile, Suzanne E.; Breski, Amanda; Wang, Yanli; Kelkar, Yogeshwar; Makova, Kateryna D.; Eckert, Kristin A.

    2014-01-01

    Interruptions of microsatellite sequences impact genome evolution and can alter disease manifestation. However, human polymorphism levels at interrupted microsatellites (iMSs) are not known at a genome-wide scale, and the pathways for gaining interruptions are poorly understood. Using the 1000 Genomes Phase-1 variant call set, we interrogated mono-, di-, tri-, and tetranucleotide repeats up to 10 units in length. We detected ∼26,000–40,000 iMSs within each of four human population groups (African, European, East Asian, and American). We identified population-specific iMSs within exonic regions, and discovered that known disease-associated iMSs contain alleles present at differing frequencies among the populations. By analyzing longer microsatellites in primate genomes, we demonstrate that single interruptions result in a genome-wide average two- to six-fold reduction in microsatellite mutability, as compared with perfect microsatellites. Centrally located interruptions lowered mutability dramatically, by two to three orders of magnitude. Using a biochemical approach, we tested directly whether the mutability of a specific iMS is lower because of decreased DNA polymerase strand slippage errors. Modeling the adenomatous polyposis coli tumor suppressor gene sequence, we observed that a single base substitution interruption reduced strand slippage error rates five- to 50-fold, relative to a perfect repeat, during synthesis by DNA polymerases α, β, or η. Computationally, we demonstrate that iMSs arise primarily by base substitution mutations within individual human genomes. Our biochemical survey of human DNA polymerase α, β, δ, κ, and η error rates within certain microsatellites suggests that interruptions are created most frequently by low fidelity polymerases. Our combined computational and biochemical results demonstrate that iMSs are abundant in human genomes and are sources of population-specific genetic variation that may affect genome stability. The

  1. A genomic island harboring arsenic resistance genes varies in gene content and is located in different chromosomal loci among Listeria monocytogenes strains

    USDA-ARS?s Scientific Manuscript database

    In the foodborne pathogen Listeria monocytogenes, arsenic resistance has been often encountered among certain clonal groups of serotype 4b and was earlier found to be strongly associated with an arsenic resistance gene cluster within a 35 kb chromosomal region, designated Listeria genomic island 2 (...

  2. Profile analysis and prediction of tissue-specific CpG island methylation classes

    PubMed Central

    2009-01-01

    Background The computational prediction of DNA methylation has become an important topic in the recent years due to its role in the epigenetic control of normal and cancer-related processes. While previous prediction approaches focused merely on differences between methylated and unmethylated DNA sequences, recent experimental results have shown the presence of much more complex patterns of methylation across tissues and time in the human genome. These patterns are only partially described by a binary model of DNA methylation. In this work we propose a novel approach, based on profile analysis of tissue-specific methylation that uncovers significant differences in the sequences of CpG islands (CGIs) that predispose them to a tissue- specific methylation pattern. Results We defined CGI methylation profiles that separate not only between constitutively methylated and unmethylated CGIs, but also identify CGIs showing a differential degree of methylation across tissues and cell-types or a lack of methylation exclusively in sperm. These profiles are clearly distinguished by a number of CGI attributes including their evolutionary conservation, their significance, as well as the evolutionary evidence of prior methylation. Additionally, we assess profile functionality with respect to the different compartments of protein coding genes and their possible use in the prediction of DNA methylation. Conclusion Our approach provides new insights into the biological features that determine if a CGI has a functional role in the epigenetic control of gene expression and the features associated with CGI methylation susceptibility. Moreover, we show that the ability to predict CGI methylation is based primarily on the quality of the biological information used and the relationships uncovered between different sources of knowledge. The strategy presented here is able to predict, besides the constitutively methylated and unmethylated classes, two more tissue specific methylation classes

  3. Characterization of a novel chaperone/usher fimbrial operon present on KpGI-5, a methionine tRNA gene-associated genomic island in Klebsiella pneumoniae

    PubMed Central

    2012-01-01

    Background Several strain-specific Klebsiella pneumoniae virulence determinants have been described, though these have almost exclusively been linked with hypervirulent liver abscess-associated strains. Through PCR interrogation of integration hotspots, chromosome walking, island-tagging and fosmid-based marker rescue we captured and sequenced KpGI-5, a novel genomic island integrated into the met56 tRNA gene of K. pneumoniae KR116, a bloodstream isolate from a patient with pneumonia and neutropenic sepsis. Results The 14.0 kb KpGI-5 island exhibited a genome-anomalous G + C content, possessed near-perfect 46 bp direct repeats, encoded a γ1-chaperone/usher fimbrial cluster (fim2) and harboured seven other predicted genes of unknown function. Transcriptional analysis demonstrated expression of three fim2 genes, and suggested that the fim2A-fim2K cluster comprised an operon. As fimbrial systems are frequently implicated in pathogenesis, we examined the role of fim2 by analysing KR2107, a streptomycin-resistant derivative of KR116, and three isogenic mutants (Δfim, Δfim2 and ΔfimΔfim2) using biofilm assays, human cell adhesion assays and pair-wise competition-based murine models of intestinal colonization, lung infection and ascending urinary tract infection. Although no statistically significant role for fim2 was demonstrable, liver and kidney CFU counts for lung and urinary tract infection models, respectively, hinted at an ordered gradation of virulence: KR2107 (most virulent), KR2107∆fim2, KR2107∆fim and KR2107∆fim∆fim2 (least virulent). Thus, despite lack of statistical evidence there was a suggestion that fim and fim2 contribute additively to virulence in these murine infection models. However, further studies would be necessary to substantiate this hypothesis. Conclusion Although fim2 was present in 13% of Klebsiella spp. strains investigated, no obvious in vitro or in vivo role for the locus was identified, although there were subtle hints of

  4. Comparative Genomics of 12 Strains of Erwinia amylovora Identifies a Pan-Genome with a Large Conserved Core

    PubMed Central

    Mann, Rachel A.; Smits, Theo H. M.; Bühlmann, Andreas; Blom, Jochen; Goesmann, Alexander; Frey, Jürg E.; Plummer, Kim M.; Beer, Steven V.; Luck, Joanne; Duffy, Brion; Rodoni, Brendan

    2013-01-01

    The plant pathogen Erwinia amylovora can be divided into two host-specific groupings; strains infecting a broad range of hosts within the Rosaceae subfamily Spiraeoideae (e.g., Malus, Pyrus, Crataegus, Sorbus) and strains infecting Rubus (raspberries and blackberries). Comparative genomic analysis of 12 strains representing distinct populations (e.g., geographic, temporal, host origin) of E. amylovora was used to describe the pan-genome of this major pathogen. The pan-genome contains 5751 coding sequences and is highly conserved relative to other phytopathogenic bacteria comprising on average 89% conserved, core genes. The chromosomes of Spiraeoideae-infecting strains were highly homogeneous, while greater genetic diversity was observed between Spiraeoideae- and Rubus-infecting strains (and among individual Rubus-infecting strains), the majority of which was attributed to variable genomic islands. Based on genomic distance scores and phylogenetic analysis, the Rubus-infecting strain ATCC BAA-2158 was genetically more closely related to the Spiraeoideae-infecting strains of E. amylovora than it was to the other Rubus-infecting strains. Analysis of the accessory genomes of Spiraeoideae- and Rubus-infecting strains has identified putative host-specific determinants including variation in the effector protein HopX1Ea and a putative secondary metabolite pathway only present in Rubus-infecting strains. PMID:23409014

  5. Comparative genomics of 12 strains of Erwinia amylovora identifies a pan-genome with a large conserved core.

    PubMed

    Mann, Rachel A; Smits, Theo H M; Bühlmann, Andreas; Blom, Jochen; Goesmann, Alexander; Frey, Jürg E; Plummer, Kim M; Beer, Steven V; Luck, Joanne; Duffy, Brion; Rodoni, Brendan

    2013-01-01

    The plant pathogen Erwinia amylovora can be divided into two host-specific groupings; strains infecting a broad range of hosts within the Rosaceae subfamily Spiraeoideae (e.g., Malus, Pyrus, Crataegus, Sorbus) and strains infecting Rubus (raspberries and blackberries). Comparative genomic analysis of 12 strains representing distinct populations (e.g., geographic, temporal, host origin) of E. amylovora was used to describe the pan-genome of this major pathogen. The pan-genome contains 5751 coding sequences and is highly conserved relative to other phytopathogenic bacteria comprising on average 89% conserved, core genes. The chromosomes of Spiraeoideae-infecting strains were highly homogeneous, while greater genetic diversity was observed between Spiraeoideae- and Rubus-infecting strains (and among individual Rubus-infecting strains), the majority of which was attributed to variable genomic islands. Based on genomic distance scores and phylogenetic analysis, the Rubus-infecting strain ATCC BAA-2158 was genetically more closely related to the Spiraeoideae-infecting strains of E. amylovora than it was to the other Rubus-infecting strains. Analysis of the accessory genomes of Spiraeoideae- and Rubus-infecting strains has identified putative host-specific determinants including variation in the effector protein HopX1(Ea) and a putative secondary metabolite pathway only present in Rubus-infecting strains.

  6. Recruiting Human Microbiome Shotgun Data to Site-Specific Reference Genomes

    PubMed Central

    Xie, Gary; Lo, Chien-Chi; Scholz, Matthew; Chain, Patrick S. G.

    2014-01-01

    The human body consists of innumerable multifaceted environments that predispose colonization by a number of distinct microbial communities, which play fundamental roles in human health and disease. In addition to community surveys and shotgun metagenomes that seek to explore the composition and diversity of these microbiomes, there are significant efforts to sequence reference microbial genomes from many body sites of healthy adults. To illustrate the utility of reference genomes when studying more complex metagenomes, we present a reference-based analysis of sequence reads generated from 55 shotgun metagenomes, selected from 5 major body sites, including 16 sub-sites. Interestingly, between 13% and 92% (62.3% average) of these shotgun reads were aligned to a then-complete list of 2780 reference genomes, including 1583 references for the human microbiome. However, no reference genome was universally found in all body sites. For any given metagenome, the body site-specific reference genomes, derived from the same body site as the sample, accounted for an average of 58.8% of the mapped reads. While different body sites did differ in abundant genera, proximal or symmetrical body sites were found to be most similar to one another. The extent of variation observed, both between individuals sampled within the same microenvironment, or at the same site within the same individual over time, calls into question comparative studies across individuals even if sampled at the same body site. This study illustrates the high utility of reference genomes and the need for further site-specific reference microbial genome sequencing, even within the already well-sampled human microbiome. PMID:24454771

  7. Tracing common origins of Genomic Islands in prokaryotes based on genome signature analyses.

    PubMed

    van Passel, Mark Wj

    2011-09-01

    Horizontal gene transfer constitutes a powerful and innovative force in evolution, but often little is known about the actual origins of transferred genes. Sequence alignments are generally of limited use in tracking the original donor, since still only a small fraction of the total genetic diversity is thought to be uncovered. Alternatively, approaches based on similarities in the genome specific relative oligonucleotide frequencies do not require alignments. Even though the exact origins of horizontally transferred genes may still not be established using these compositional analyses, it does suggest that compositionally very similar regions are likely to have had a common origin. These analyses have shown that up to a third of large acquired gene clusters that reside in the same genome are compositionally very similar, indicative of a shared origin. This brings us closer to uncovering the original donors of horizontally transferred genes, and could help in elucidating possible regulatory interactions between previously unlinked sequences.

  8. Genetic variability of psychrotolerant Acidithiobacillus ferrivorans revealed by (meta)genomic analysis.

    PubMed

    González, Carolina; Yanquepe, María; Cardenas, Juan Pablo; Valdes, Jorge; Quatrini, Raquel; Holmes, David S; Dopson, Mark

    2014-11-01

    Acidophilic microorganisms inhabit low pH environments such as acid mine drainage that is generated when sulfide minerals are exposed to air. The genome sequence of the psychrotolerant Acidithiobacillus ferrivorans SS3 was compared to a metagenome from a low temperature acidic stream dominated by an A. ferrivorans-like strain. Stretches of genomic DNA characterized by few matches to the metagenome, termed 'metagenomic islands', encoded genes associated with metal efflux and pH homeostasis. The metagenomic islands were enriched in mobile elements such as phage proteins, transposases, integrases and in one case, predicted to be flanked by truncated tRNAs. Cus gene clusters predicted to be involved in copper efflux and further Cus-like RND systems were predicted to be located in metagenomic islands and therefore, constitute part of the flexible gene complement of the species. Phylogenetic analysis of Cus clusters showed both lineage specificity within the Acidithiobacillus genus as well as niche specificity associated with an acidic environment. The metagenomic islands also contained a predicted copper efflux P-type ATPase system and a polyphosphate kinase potentially involved in polyphosphate mediated copper resistance. This study identifies genetic variability of low temperature acidophiles that likely reflects metal resistance selective pressures in the copper rich environment. Copyright © 2014 Institut Pasteur. Published by Elsevier Masson SAS. All rights reserved.

  9. Novel genomic island modifies DNA with 7-deazaguanine derivatives

    PubMed Central

    Thiaville, Jennifer J.; Kellner, Stefanie M.; Yuan, Yifeng; Hutinet, Geoffrey; Thiaville, Patrick C.; Jumpathong, Watthanachai; Mohapatra, Susovan; Brochier-Armanet, Celine; Letarov, Andrey V.; Hillebrand, Roman; Malik, Chanchal K.; Rizzo, Carmelo J.; Dedon, Peter C.; de Crécy-Lagard, Valérie

    2016-01-01

    The discovery of ∼20-kb gene clusters containing a family of paralogs of tRNA guanosine transglycosylase genes, called tgtA5, alongside 7-cyano-7-deazaguanine (preQ0) synthesis and DNA metabolism genes, led to the hypothesis that 7-deazaguanine derivatives are inserted in DNA. This was established by detecting 2’-deoxy-preQ0 and 2’-deoxy-7-amido-7-deazaguanosine in enzymatic hydrolysates of DNA extracted from the pathogenic, Gram-negative bacteria Salmonella enterica serovar Montevideo. These modifications were absent in the closely related S. enterica serovar Typhimurium LT2 and from a mutant of S. Montevideo, each lacking the gene cluster. This led us to rename the genes of the S. Montevideo cluster as dpdA-K for 7-deazapurine in DNA. Similar gene clusters were analyzed in ∼150 phylogenetically diverse bacteria, and the modifications were detected in DNA from other organisms containing these clusters, including Kineococcus radiotolerans, Comamonas testosteroni, and Sphingopyxis alaskensis. Comparative genomic analysis shows that, in Enterobacteriaceae, the cluster is a genomic island integrated at the leuX locus, and the phylogenetic analysis of the TgtA5 family is consistent with widespread horizontal gene transfer. Comparison of transformation efficiencies of modified or unmodified plasmids into isogenic S. Montevideo strains containing or lacking the cluster strongly suggests a restriction–modification role for the cluster in Enterobacteriaceae. Another preQ0 derivative, 2’-deoxy-7-formamidino-7-deazaguanosine, was found in the Escherichia coli bacteriophage 9g, as predicted from the presence of homologs of genes involved in the synthesis of the archaeosine tRNA modification. These results illustrate a deep and unexpected evolutionary connection between DNA and tRNA metabolism. PMID:26929322

  10. High-quality permanent draft genome sequence of Bradyrhizobium sp. Tv2a.2, a microsymbiont of Tachigali versicolor discovered in Barro Colorado Island of Panama

    DOE PAGES

    Tian, Rui; Parker, Matthew; Seshadri, Rekha; ...

    2015-05-17

    Bradyrhizobiumsp. Tv2a.2 is an aerobic, motile, Gram-negative, non-spore-forming rod that was isolated from an effective nitrogen-fixing root nodule of Tachigali versicolor collected in Barro Colorado Island of Panama. Here we describe the features of Bradyrhizobiumsp. Tv2a.2, together with high-quality permanent draft genome sequence information and annotation. The 8,496,279 bp high-quality draft genome is arranged in 87 scaffolds of 87 contigs, contains 8,109 protein-coding genes and 72 RNA-only encoding genes. In conclusion, this rhizobial genome was sequenced as part of the DOE Joint Genome Institute 2010 Genomic Encyclopedia for Bacteria and Archaea-Root Nodule Bacteria (GEBA-RNB) project.

  11. Complete genome sequence of Leptospirillum ferrooxidans strain C2-3, isolated from a fresh volcanic ash deposit on the island of Miyake, Japan.

    PubMed

    Fujimura, Reiko; Sato, Yoshinori; Nishizawa, Tomoyasu; Oshima, Kenshiro; Kim, Seok-Won; Hattori, Masahira; Kamijo, Takashi; Ohta, Hiroyuki

    2012-08-01

    A diazotrophic, acidophilic, iron-oxidizing bacterium, Leptospirillum ferrooxidans, known to be difficult to cultivate, was isolated from a fresh volcanic ash deposit on the island of Miyake, Japan. Here, we report the complete genome sequence of a cultured strain, C2-3.

  12. Molecular characteristics of Salmonella genomic island 1 in Proteus mirabilis isolates from poultry farms in China.

    PubMed

    Lei, Chang-Wei; Zhang, An-Yun; Liu, Bi-Hui; Wang, Hong-Ning; Guan, Zhong-Bin; Xu, Chang-Wen; Xia, Qing-Qing; Cheng, Han; Zhang, Dong-Dong

    2014-12-01

    Six out of the 64 studied Proteus mirabilis isolates from 11 poultry farms in China contained Salmonella genomic island 1 (SGI1). PCR mapping showed that the complete nucleotide sequences of SGI1s ranged from 33.2 to 42.5 kb. Three novel variants, SGI1-W, SGI1-X, and SGI1-Y, have been characterized. Resistance genes lnuF, dfrA25, and qnrB2 were identified in SGI1 for the first time. Copyright © 2014, American Society for Microbiology. All Rights Reserved.

  13. Comparative genomic analysis of bacteriophages specific to the channel catfish pathogen Edwardsiella ictaluri

    PubMed Central

    2011-01-01

    Background The bacterial pathogen Edwardsiella ictaluri is a primary cause of mortality in channel catfish raised commercially in aquaculture farms. Additional treatment and diagnostic regimes are needed for this enteric pathogen, motivating the discovery and characterization of bacteriophages specific to E. ictaluri. Results The genomes of three Edwardsiella ictaluri-specific bacteriophages isolated from geographically distant aquaculture ponds, at different times, were sequenced and analyzed. The genomes for phages eiAU, eiDWF, and eiMSLS are 42.80 kbp, 42.12 kbp, and 42.69 kbp, respectively, and are greater than 95% identical to each other at the nucleotide level. Nucleotide differences were mostly observed in non-coding regions and in structural proteins, with significant variability in the sequences of putative tail fiber proteins. The genome organization of these phages exhibit a pattern shared by other Siphoviridae. Conclusions These E. ictaluri-specific phage genomes reveal considerable conservation of genomic architecture and sequence identity, even with considerable temporal and spatial divergence in their isolation. Their genomic homogeneity is similarly observed among E. ictaluri bacterial isolates. The genomic analysis of these phages supports the conclusion that these are virulent phages, lacking the capacity for lysogeny or expression of virulence genes. This study contributes to our knowledge of phage genomic diversity and facilitates studies on the diagnostic and therapeutic applications of these phages. PMID:21214923

  14. Generation of non-genomic oligonucleotide tag sequences for RNA template-specific PCR

    PubMed Central

    Pinto, Fernando Lopes; Svensson, Håkan; Lindblad, Peter

    2006-01-01

    Background In order to overcome genomic DNA contamination in transcriptional studies, reverse template-specific polymerase chain reaction, a modification of reverse transcriptase polymerase chain reaction, is used. The possibility of using tags whose sequences are not found in the genome further improves reverse specific polymerase chain reaction experiments. Given the absence of software available to produce genome suitable tags, a simple tool to fulfill such need was developed. Results The program was developed in Perl, with separate use of the basic local alignment search tool, making the tool platform independent (known to run on Windows XP and Linux). In order to test the performance of the generated tags, several molecular experiments were performed. The results show that Tagenerator is capable of generating tags with good priming properties, which will deliberately not result in PCR amplification of genomic DNA. Conclusion The program Tagenerator is capable of generating tag sequences that combine genome absence with good priming properties for RT-PCR based experiments, circumventing the effects of genomic DNA contamination in an RNA sample. PMID:16820068

  15. Complete Genome Sequence of Leptospirillum ferrooxidans Strain C2-3, Isolated from a Fresh Volcanic Ash Deposit on the Island of Miyake, Japan

    PubMed Central

    Fujimura, Reiko; Sato, Yoshinori; Nishizawa, Tomoyasu; Oshima, Kenshiro; Kim, Seok-Won; Hattori, Masahira; Kamijo, Takashi

    2012-01-01

    A diazotrophic, acidophilic, iron-oxidizing bacterium, Leptospirillum ferrooxidans, known to be difficult to cultivate, was isolated from a fresh volcanic ash deposit on the island of Miyake, Japan. Here, we report the complete genome sequence of a cultured strain, C2-3. PMID:22815442

  16. AnnotateGenomicRegions: a web application.

    PubMed

    Zammataro, Luca; DeMolfetta, Rita; Bucci, Gabriele; Ceol, Arnaud; Muller, Heiko

    2014-01-01

    Modern genomic technologies produce large amounts of data that can be mapped to specific regions in the genome. Among the first steps in interpreting the results is annotation of genomic regions with known features such as genes, promoters, CpG islands etc. Several tools have been published to perform this task. However, using these tools often requires a significant amount of bioinformatics skills and/or downloading and installing dedicated software. Here we present AnnotateGenomicRegions, a web application that accepts genomic regions as input and outputs a selection of overlapping and/or neighboring genome annotations. Supported organisms include human (hg18, hg19), mouse (mm8, mm9, mm10), zebrafish (danRer7), and Saccharomyces cerevisiae (sacCer2, sacCer3). AnnotateGenomicRegions is accessible online on a public server or can be installed locally. Some frequently used annotations and genomes are embedded in the application while custom annotations may be added by the user. The increasing spread of genomic technologies generates the need for a simple-to-use annotation tool for genomic regions that can be used by biologists and bioinformaticians alike. AnnotateGenomicRegions meets this demand. AnnotateGenomicRegions is an open-source web application that can be installed on any personal computer or institute server. AnnotateGenomicRegions is available at: http://cru.genomics.iit.it/AnnotateGenomicRegions.

  17. A genomic view of food-related and probiotic Enterococcus strains

    PubMed Central

    Suárez, Nadia; Hormigo, Ricardo; Fadda, Silvina; Saavedra, Lucila

    2017-01-01

    Abstract The study of enterococcal genomes has grown considerably in recent years. While special attention is paid to comparative genomic analysis among clinical relevant isolates, in this study we performed an exhaustive comparative analysis of enterococcal genomes of food origin and/or with potential to be used as probiotics. Beyond common genetic features, we especially aimed to identify those that are specific to enterococcal strains isolated from a certain food-related source as well as features present in a species-specific manner. Thus, the genome sequences of 25 Enterococcus strains, from 7 different species, were examined and compared. Their phylogenetic relationship was reconstructed based on orthologous proteins and whole genomes. Likewise, markers associated with a successful colonization (bacteriocin genes and genomic islands) and genome plasticity (phages and clustered regularly interspaced short palindromic repeats) were investigated for lifestyle specific genetic features. At the same time, a search for antibiotic resistance genes was carried out, since they are of big concern in the food industry. Finally, it was possible to locate 1617 FIGfam families as a core proteome universally present among the genera and to determine that most of the accessory genes code for hypothetical proteins, providing reasonable hints to support their functional characterization. PMID:27773878

  18. Genome analysis of E. coli isolated from Crohn's disease patients.

    PubMed

    Rakitina, Daria V; Manolov, Alexander I; Kanygina, Alexandra V; Garushyants, Sofya K; Baikova, Julia P; Alexeev, Dmitry G; Ladygina, Valentina G; Kostryukova, Elena S; Larin, Andrei K; Semashko, Tatiana A; Karpova, Irina Y; Babenko, Vladislav V; Ismagilova, Ruzilya K; Malanin, Sergei Y; Gelfand, Mikhail S; Ilina, Elena N; Gorodnichev, Roman B; Lisitsyna, Eugenia S; Aleshkin, Gennady I; Scherbakov, Petr L; Khalif, Igor L; Shapina, Marina V; Maev, Igor V; Andreev, Dmitry N; Govorun, Vadim M

    2017-07-19

    Escherichia coli (E. coli) has been increasingly implicated in the pathogenesis of Crohn's disease (CD). The phylogeny of E. coli isolated from Crohn's disease patients (CDEC) was controversial, and while genotyping results suggested heterogeneity, the sequenced strains of E. coli from CD patients were closely related. We performed the shotgun genome sequencing of 28 E. coli isolates from ten CD patients and compared genomes from these isolates with already published genomes of CD strains and other pathogenic and non-pathogenic strains. CDEC was shown to belong to A, B1, B2 and D phylogenetic groups. The plasmid and several operons from the reference CD-associated E. coli strain LF82 were demonstrated to be more often present in CDEC genomes belonging to different phylogenetic groups than in genomes of commensal strains. The operons include carbon-source induced invasion GimA island, prophage I, iron uptake operons I and II, capsular assembly pathogenetic island IV and propanediol and galactitol utilization operons. Our findings suggest that CDEC are phylogenetically diverse. However, some strains isolated from independent sources possess highly similar chromosome or plasmids. Though no CD-specific genes or functional domains were present in all CD-associated strains, some genes and operons are more often found in the genomes of CDEC than in commensal E. coli. They are principally linked to gut colonization and utilization of propanediol and other sugar alcohols.

  19. Integration of the blaNDM-1 carbapenemase gene into Proteus genomic island 1 (PGI1-PmPEL) in a Proteus mirabilis clinical isolate.

    PubMed

    Girlich, Delphine; Dortet, Laurent; Poirel, Laurent; Nordmann, Patrice

    2015-01-01

    To decipher the mechanisms and their associated genetic determinants responsible for β-lactam resistance in a Proteus mirabilis clinical isolate. The entire genetic structure surrounding the β-lactam resistance genes was characterized by PCR, gene walking and DNA sequencing. Genes encoding the carbapenemase NDM-1 and the ESBL VEB-6 were located in a 38.5 kb MDR structure, which itself was inserted into a new variant of the Proteus genomic island 1 (PGI1). This new PGI1-PmPEL variant of 64.4 kb was chromosomally located, as an external circular form in the P. mirabilis isolate, suggesting potential mobility. This is the first known description of the bla(NDM-1) gene in a genomic island structure, which might further enhance the spread of the bla(NDM-1) carbapenemase gene among enteric pathogens. © The Author 2014. Published by Oxford University Press on behalf of the British Society for Antimicrobial Chemotherapy. All rights reserved. For Permissions, please e-mail: journals.permissions@oup.com.

  20. EuGI: a novel resource for studying genomic islands to facilitate horizontal gene transfer detection in eukaryotes.

    PubMed

    Clasen, Frederick Johannes; Pierneef, Rian Ewald; Slippers, Bernard; Reva, Oleg

    2018-05-03

    Genomic islands (GIs) are inserts of foreign DNA that have potentially arisen through horizontal gene transfer (HGT). There are evidences that GIs can contribute significantly to the evolution of prokaryotes. The acquisition of GIs through HGT in eukaryotes has, however, been largely unexplored. In this study, the previously developed GI prediction tool, SeqWord Gene Island Sniffer (SWGIS), is modified to predict GIs in eukaryotic chromosomes. Artificial simulations are used to estimate ratios of predicting false positive and false negative GIs by inserting GIs into different test chromosomes and performing the SWGIS v2.0 algorithm. Using SWGIS v2.0, GIs are then identified in 36 fungal, 22 protozoan and 8 invertebrate genomes. SWGIS v2.0 predicts GIs in large eukaryotic chromosomes based on the atypical nucleotide composition of these regions. Averages for predicting false negative and false positive GIs were 20.1% and 11.01% respectively. A total of 10,550 GIs were identified in 66 eukaryotic species with 5299 of these GIs coding for at least one functional protein. The EuGI web-resource, freely accessible at http://eugi.bi.up.ac.za , was developed that allows browsing the database created from identified GIs and genes within GIs through an interactive and visual interface. SWGIS v2.0 along with the EuGI database, which houses GIs identified in 66 different eukaryotic species, and the EuGI web-resource, provide the first comprehensive resource for studying HGT in eukaryotes.

  1. Site-Specific Integration of Exogenous Genes Using Genome Editing Technologies in Zebrafish.

    PubMed

    Kawahara, Atsuo; Hisano, Yu; Ota, Satoshi; Taimatsu, Kiyohito

    2016-05-13

    The zebrafish (Danio rerio) is an ideal vertebrate model to investigate the developmental molecular mechanism of organogenesis and regeneration. Recent innovation in genome editing technologies, such as zinc finger nucleases (ZFNs), transcription activator-like effector nucleases (TALENs) and the clustered regularly interspaced short palindromic repeats (CRISPR)/CRISPR associated protein 9 (Cas9) system, have allowed researchers to generate diverse genomic modifications in whole animals and in cultured cells. The CRISPR/Cas9 and TALEN techniques frequently induce DNA double-strand breaks (DSBs) at the targeted gene, resulting in frameshift-mediated gene disruption. As a useful application of genome editing technology, several groups have recently reported efficient site-specific integration of exogenous genes into targeted genomic loci. In this review, we provide an overview of TALEN- and CRISPR/Cas9-mediated site-specific integration of exogenous genes in zebrafish.

  2. A genomic island provides Acidithiobacillus ferrooxidans ATCC 53993 additional copper resistance: a possible competitive advantage.

    PubMed

    Orellana, Luis H; Jerez, Carlos A

    2011-11-01

    There is great interest in understanding how extremophilic biomining bacteria adapt to exceptionally high copper concentrations in their environment. Acidithiobacillus ferrooxidans ATCC 53993 genome possesses the same copper resistance determinants as strain ATCC 23270. However, the former strain contains in its genome a 160-kb genomic island (GI), which is absent in ATCC 23270. This GI contains, amongst other genes, several genes coding for an additional putative copper ATPase and a Cus system. A. ferrooxidans ATCC 53993 showed a much higher resistance to CuSO(4) (>100 mM) than that of strain ATCC 23270 (<25 mM). When a similar number of bacteria from each strain were mixed and allowed to grow in the absence of copper, their respective final numbers remained approximately equal. However, in the presence of copper, there was a clear overgrowth of strain ATCC 53993 compared to ATCC 23270. This behavior is most likely explained by the presence of the additional copper-resistance genes in the GI of strain ATCC 53993. As determined by qRT-PCR, it was demonstrated that these genes are upregulated when A. ferrooxidans ATCC 53993 is grown in the presence of copper and were shown to be functional when expressed in copper-sensitive Escherichia coli mutants. Thus, the reason for resistance to copper of two strains of the same acidophilic microorganism could be determined by slight differences in their genomes, which may not only lead to changes in their capacities to adapt to their environment, but may also help to select the more fit microorganisms for industrial biomining operations. © Springer-Verlag 2011

  3. Serratia marcescens harbouring SME-type class A carbapenemases in Canada and the presence of blaSME on a novel genomic island, SmarGI1-1.

    PubMed

    Mataseje, L F; Boyd, D A; Delport, J; Hoang, L; Imperial, M; Lefebvre, B; Kuhn, M; Van Caeseele, P; Willey, B M; Mulvey, M R

    2014-07-01

    An increasing prevalence since 2010 of Serratia marcescens harbouring the Ambler class A carbapenemase SME prompted us to further characterize these isolates. Isolates harbouring bla(SME) were identified by PCR and sequencing. Phenotypic analysis for carbapenemase activity was carried out by a modified Hodge test and a modified Carba NP test. Antimicrobial susceptibilities were determined by Etest and Vitek 2. Typing was by PFGE of macrorestriction digests. Whole-genome sequencing of three isolates was carried out to characterize the genomic region harbouring the bla(SME)-type genes. All S. marcescens harbouring SME-type enzymes could be detected using a modified Carba NP test. Isolates harbouring bla(SME) were resistant to penicillins and carbapenems, but remained susceptible to third-generation cephalosporins, as well as fluoroquinolones and trimethoprim/sulfamethoxazole. Isolates exhibited diverse genetic backgrounds, though 57% of isolates were found in three clusters. Analysis of whole-genome sequence data from three isolates revealed that the bla(SME) gene occurred in a novel cryptic prophage genomic island, SmarGI1-1. There has been an increasing occurrence of S. marcescens harbouring bla(SME) in Canada since 2010. The bla(SME) gene was found on a genomic island, SmarGI1-1, that can be excised and circularized, which probably contributes to its dissemination amongst S. marcescens. © The Author 2014. Published by Oxford University Press on behalf of the British Society for Antimicrobial Chemotherapy. All rights reserved. For Permissions, please e-mail: journals.permissions@oup.com.

  4. Genome-wide CpG island methylation and intergenic demethylation propensities vary among different tumor sites.

    PubMed

    Lee, Seung-Tae; Wiemels, Joseph L

    2016-02-18

    The epigenetic landscape of cancer includes both focal hypermethylation and broader hypomethylation in a genome-wide manner. By means of a comprehensive genomic analysis on 6637 tissues of 21 tumor types, we here show that the degrees of overall methylation in CpG island (CGI) and demethylation in intergenic regions, defined as 'backbone', largely vary among different tumors. Depending on tumor type, both CGI methylation and backbone demethylation are often associated with clinical, epidemiological and biological features such as age, sex, smoking history, anatomic location, histological type and grade, stage, molecular subtype and biological pathways. We found connections between CGI methylation and hypermutability, microsatellite instability, IDH1 mutation, 19p gain and polycomb features, and backbone demethylation with chromosomal instability, NSD1 and TP53 mutations, 5q and 19p loss and long repressive domains. These broad epigenetic patterns add a new dimension to our understanding of tumor biology and its clinical implications. © The Author(s) 2015. Published by Oxford University Press on behalf of Nucleic Acids Research.

  5. Complete genome sequence of bluetongue virus serotype 4 that emerged on the French island of Corsica in December 2016.

    PubMed

    Sailleau, C; Breard, E; Viarouge, C; Gorlier, A; Quenault, H; Hirchaud, E; Touzain, F; Blanchard, Y; Vitour, D; Zientara, S

    2018-02-01

    In November 2016, sheep located in the south of Corsica island exhibited clinical signs suggestive of bluetongue virus (BTV) infection. Laboratory analyses allowed to isolate and identify a BTV strain of serotype 4. The analysis of the full viral genome showed that all the 10 genomic segments were closely related to those of the BTV-4 present in Hungary in 2014 and involved in a large BT outbreak in the Balkan Peninsula. These results together with epidemiological data suggest that BTV-4 has been introduced to Corsica from Italy (Sardinia) where BTV-4 outbreaks have been reported in autumn 2016. This is the first report of the introduction in Corsica of a BTV strain previously spreading in eastern Europe. © 2017 Blackwell Verlag GmbH.

  6. Sequence-specific epigenetic effects of the maternal somatic genome on developmental rearrangements of the zygotic genome in Paramecium primaurelia.

    PubMed Central

    Meyer, E; Butler, A; Dubrana, K; Duharcourt, S; Caron, F

    1997-01-01

    In ciliates, the germ line genome is extensively rearranged during the development of the somatic macronucleus from a mitotic product of the zygotic nucleus. Germ line chromosomes are fragmented in specific regions, and a large number of internal sequence elements are eliminated. It was previously shown that transformation of the vegetative macronucleus of Paramecium primaurelia with a plasmid containing a subtelomeric surface antigen gene can affect the processing of the homologous germ line genomic region during development of a new macronucleus in sexual progeny of transformed clones. The gene and telomere-proximal flanking sequences are deleted from the new macronuclear genome, although the germ line genome remains wild type. Here we show that plasmids containing nonoverlapping segments of the same genomic region are able to induce similar terminal deletions; the locations of deletion end points depend on the particular sequence used. Transformation of the maternal macronucleus with a sequence internal to a macronuclear chromosome also causes the occurrence of internal deletions between short direct repeats composed of alternating thymines and adenines. The epigenetic influence of maternal macronuclear sequences on developmental rearrangements of the zygotic genome thus appears to be both sequence specific and general, suggesting that this trans-nucleus effect is mediated by pairing of homologous sequences. PMID:9199294

  7. Race to the Top. Rhode Island Report. Year 2: School Year 2011-2012. [State-Specific Summary Report

    ERIC Educational Resources Information Center

    US Department of Education, 2013

    2013-01-01

    This State-specific summary report serves as an assessment of Rhode Island's Year 2 Race to the Top implementation, highlighting successes and accomplishments, identifying challenges, and providing lessons learned from implementation from approximately September 2011 through September 2012. In Year 2, Rhode Island Department of Education (RIDE)…

  8. Race to the Top. Rhode Island Report. Year 1: School Year 2010-2011. [State-Specific Summary Report

    ERIC Educational Resources Information Center

    US Department of Education, 2012

    2012-01-01

    This State-specific summary report serves as an assessment of Rhode Island's Year 1 Race to the Top implementation, highlighting successes and accomplishments, identifying challenges, and providing lessons learned from implementation to date. According to the State, in Year 1, Rhode Island greatly increased statewide capacity to begin…

  9. PrimerStation: a highly specific multiplex genomic PCR primer design server for the human genome

    PubMed Central

    Yamada, Tomoyuki; Soma, Haruhiko; Morishita, Shinichi

    2006-01-01

    PrimerStation () is a web service that calculates primer sets guaranteeing high specificity against the entire human genome. To achieve high accuracy, we used the hybridization ratio of primers in liquid solution. Calculating the status of sequence hybridization in terms of the stringent hybridization ratio is computationally costly, and no web service checks the entire human genome and returns a highly specific primer set calculated using a precise physicochemical model. To shorten the response time, we precomputed candidates for specific primers using a massively parallel computer with 100 CPUs (SunFire 15 K) about 3 months in advance. This enables PrimerStation to search and output qualified primers interactively. PrimerStation can select highly specific primers suitable for multiplex PCR by seeking a wider temperature range that minimizes the possibility of cross-reaction. It also allows users to add heuristic rules to the primer design, e.g. the exclusion of single nucleotide polymorphisms (SNPs) in primers, the avoidance of poly(A) and CA-repeats in the PCR products, and the elimination of defective primers using the secondary structure prediction. We performed several tests to verify the PCR amplification of randomly selected primers for ChrX, and we confirmed that the primers amplify specific PCR products perfectly. PMID:16845094

  10. AnnotateGenomicRegions: a web application

    PubMed Central

    2014-01-01

    Background Modern genomic technologies produce large amounts of data that can be mapped to specific regions in the genome. Among the first steps in interpreting the results is annotation of genomic regions with known features such as genes, promoters, CpG islands etc. Several tools have been published to perform this task. However, using these tools often requires a significant amount of bioinformatics skills and/or downloading and installing dedicated software. Results Here we present AnnotateGenomicRegions, a web application that accepts genomic regions as input and outputs a selection of overlapping and/or neighboring genome annotations. Supported organisms include human (hg18, hg19), mouse (mm8, mm9, mm10), zebrafish (danRer7), and Saccharomyces cerevisiae (sacCer2, sacCer3). AnnotateGenomicRegions is accessible online on a public server or can be installed locally. Some frequently used annotations and genomes are embedded in the application while custom annotations may be added by the user. Conclusions The increasing spread of genomic technologies generates the need for a simple-to-use annotation tool for genomic regions that can be used by biologists and bioinformaticians alike. AnnotateGenomicRegions meets this demand. AnnotateGenomicRegions is an open-source web application that can be installed on any personal computer or institute server. AnnotateGenomicRegions is available at: http://cru.genomics.iit.it/AnnotateGenomicRegions. PMID:24564446

  11. A 38-Kilobase Pathogenicity Island Specific for Mycobacterium avium subsp. paratuberculosis Encodes Cell Surface Proteins Expressed in the Host

    PubMed Central

    Stratmann, Janin; Strommenger, Birgit; Goethe, Ralph; Dohmann, Karen; Gerlach, Gerald-F.; Stevenson, Karen; Li, Ling-ling; Zhang, Qing; Kapur, Vivek; Bull, Tim J.

    2004-01-01

    We have used representational difference analysis to identify a novel Mycobacterium avium subsp. paratuberculosis-specific ABC transporter operon (mpt), which comprises six open reading frames designated mptA to -F and is immediately preceded by two putative Fur boxes. Functional genomics revealed that the mpt operon is flanked on one end by a fep cluster encoding proteins involved in the uptake of Fe3+ and on the other end by a sid cluster encoding non-ribosome-dependent heterocyclic siderophore synthases. Together these genes form a 38-kb M. avium subsp. paratuberculosis-specific locus flanked by an insertion sequence similar to IS1110. Expression studies using Western blot analyses showed that MptC is present in the envelope fraction of M. avium subsp. paratuberculosis. The MptD protein was shown to be surface exposed, using a specific phage (fMptD) isolated from a phage-peptide library, by differential screening of Mycobacterium smegmatis transformants. The phage fMptD-derived peptide could be used in a peptide-mediated capture PCR with milk from infected dairy herds, thereby showing surface-exposed expression of the MptD protein in the host. Together, these data suggest that the 38-kb locus constitutes an M. avium subsp. paratuberculosis pathogenicity island. PMID:14977927

  12. TIA: algorithms for development of identity-linked SNP islands for analysis by massively parallel DNA sequencing.

    PubMed

    Farris, M Heath; Scott, Andrew R; Texter, Pamela A; Bartlett, Marta; Coleman, Patricia; Masters, David

    2018-04-11

    Single nucleotide polymorphisms (SNPs) located within the human genome have been shown to have utility as markers of identity in the differentiation of DNA from individual contributors. Massively parallel DNA sequencing (MPS) technologies and human genome SNP databases allow for the design of suites of identity-linked target regions, amenable to sequencing in a multiplexed and massively parallel manner. Therefore, tools are needed for leveraging the genotypic information found within SNP databases for the discovery of genomic targets that can be evaluated on MPS platforms. The SNP island target identification algorithm (TIA) was developed as a user-tunable system to leverage SNP information within databases. Using data within the 1000 Genomes Project SNP database, human genome regions were identified that contain globally ubiquitous identity-linked SNPs and that were responsive to targeted resequencing on MPS platforms. Algorithmic filters were used to exclude target regions that did not conform to user-tunable SNP island target characteristics. To validate the accuracy of TIA for discovering these identity-linked SNP islands within the human genome, SNP island target regions were amplified from 70 contributor genomic DNA samples using the polymerase chain reaction. Multiplexed amplicons were sequenced using the Illumina MiSeq platform, and the resulting sequences were analyzed for SNP variations. 166 putative identity-linked SNPs were targeted in the identified genomic regions. Of the 309 SNPs that provided discerning power across individual SNP profiles, 74 previously undefined SNPs were identified during evaluation of targets from individual genomes. Overall, DNA samples of 70 individuals were uniquely identified using a subset of the suite of identity-linked SNP islands. TIA offers a tunable genome search tool for the discovery of targeted genomic regions that are scalable in the population frequency and numbers of SNPs contained within the SNP island regions

  13. An Enterotoxin-Bearing Pathogenicity Island in Staphylococcus epidermidis▿†

    PubMed Central

    Madhusoodanan, Jyoti; Seo, Keun Seok; Remortel, Brian; Park, Joo Youn; Hwang, Sun Young; Fox, Lawrence K.; Park, Yong Ho; Deobald, Claudia F.; Wang, Dan; Liu, Song; Daugherty, Sean C.; Gill, Ann Lindley; Bohach, Gregory A.; Gill, Steven R.

    2011-01-01

    Cocolonization of human mucosal surfaces causes frequent encounters between various staphylococcal species, creating opportunities for the horizontal acquisition of mobile genetic elements. The majority of Staphylococcus aureus toxins and virulence factors are encoded on S. aureus pathogenicity islands (SaPIs). Horizontal movement of SaPIs between S. aureus strains plays a role in the evolution of virulent clinical isolates. Although there have been reports of the production of toxic shock syndrome toxin 1 (TSST-1), enterotoxin, and other superantigens by coagulase-negative staphylococci, no associated pathogenicity islands have been found in the genome of Staphylococcus epidermidis, a generally less virulent relative of S. aureus. We show here the first evidence of a composite S. epidermidis pathogenicity island (SePI), the product of multiple insertions in the genome of a clinical isolate. The taxonomic placement of S. epidermidis strain FRI909 was confirmed by a number of biochemical tests and multilocus sequence typing. The genome sequence of this strain was analyzed for other unique gene clusters and their locations. This pathogenicity island encodes and expresses staphylococcal enterotoxin C3 (SEC3) and staphylococcal enterotoxin-like toxin L (SElL), as confirmed by quantitative reverse transcription-PCR (qRT-PCR) and immunoblotting. We present here an initial characterization of this novel pathogenicity island, and we establish that it is stable, expresses enterotoxins, and is not obviously transmissible by phage transduction. We also describe the genome sequence, excision, replication, and packaging of a novel bacteriophage in S. epidermidis FRI909, as well as attempts to mobilize the SePI element by this phage. PMID:21317317

  14. A genomic view of food-related and probiotic Enterococcus strains.

    PubMed

    Bonacina, Julieta; Suárez, Nadia; Hormigo, Ricardo; Fadda, Silvina; Lechner, Marcus; Saavedra, Lucila

    2017-02-01

    The study of enterococcal genomes has grown considerably in recent years. While special attention is paid to comparative genomic analysis among clinical relevant isolates, in this study we performed an exhaustive comparative analysis of enterococcal genomes of food origin and/or with potential to be used as probiotics. Beyond common genetic features, we especially aimed to identify those that are specific to enterococcal strains isolated from a certain food-related source as well as features present in a species-specific manner. Thus, the genome sequences of 25 Enterococcus strains, from 7 different species, were examined and compared. Their phylogenetic relationship was reconstructed based on orthologous proteins and whole genomes. Likewise, markers associated with a successful colonization (bacteriocin genes and genomic islands) and genome plasticity (phages and clustered regularly interspaced short palindromic repeats) were investigated for lifestyle specific genetic features. At the same time, a search for antibiotic resistance genes was carried out, since they are of big concern in the food industry. Finally, it was possible to locate 1617 FIGfam families as a core proteome universally present among the genera and to determine that most of the accessory genes code for hypothetical proteins, providing reasonable hints to support their functional characterization. © The Author 2016. Published by Oxford University Press on behalf of Kazusa DNA Research Institute.

  15. GSP: a web-based platform for designing genome-specific primers in polyploids

    USDA-ARS?s Scientific Manuscript database

    The primary goal of this research was to develop a web-based platform named GSP for designing genome-specific primers to distinguish subgenome sequences in the polyploid genome background. GSP uses BLAST to extract homeologous sequences of the subgenomes in the existing databases, performed a multip...

  16. Allele-specific locus binding and genome editing by CRISPR at the p16INK4a locus.

    PubMed

    Fujita, Toshitsugu; Yuno, Miyuki; Fujii, Hodaka

    2016-07-28

    The clustered regularly interspaced short palindromic repeats (CRISPR) system has been adopted for a wide range of biological applications including genome editing. In some cases, dissection of genome functions requires allele-specific genome editing, but the use of CRISPR for this purpose has not been studied in detail. In this study, using the p16INK4a gene in HCT116 as a model locus, we investigated whether chromatin states, such as CpG methylation, or a single-nucleotide gap form in a target site can be exploited for allele-specific locus binding and genome editing by CRISPR in vivo. First, we showed that allele-specific locus binding and genome editing could be achieved by targeting allele-specific CpG-methylated regions, which was successful for one, but not all guide RNAs. In this regard, molecular basis underlying the success remains elusive at this stage. Next, we demonstrated that an allele-specific single-nucleotide gap form could be employed for allele-specific locus binding and genome editing by CRISPR, although it was important to avoid CRISPR tolerance of a single nucleotide mismatch brought about by mismatched base skipping. Our results provide information that might be useful for applications of CRISPR in studies of allele-specific functions in the genomes.

  17. Testing models of speciation from genome sequences: divergence and asymmetric admixture in Island South-East Asian Sus species during the Plio-Pleistocene climatic fluctuations

    PubMed Central

    Frantz, Laurent A F; Madsen, Ole; Megens, Hendrik-Jan; Groenen, Martien A M; Lohse, Konrad

    2014-01-01

    In many temperate regions, ice ages promoted range contractions into refugia resulting in divergence (and potentially speciation), while warmer periods led to range expansions and hybridization. However, the impact these climatic oscillations had in many parts of the tropics remains elusive. Here, we investigate this issue using genome sequences of three pig (Sus) species, two of which are found on islands of the Sunda-shelf shallow seas in Island South-East Asia (ISEA). A previous study revealed signatures of interspecific admixture between these Sus species (Genome biology,14, 2013, R107). However, the timing, directionality and extent of this admixture remain unknown. Here, we use a likelihood-based model comparison to more finely resolve this admixture history and test whether it was mediated by humans or occurred naturally. Our analyses suggest that interspecific admixture between Sunda-shelf species was most likely asymmetric and occurred long before the arrival of humans in the region. More precisely, we show that these species diverged during the late Pliocene but around 23% of their genomes have been affected by admixture during the later Pleistocene climatic transition. In addition, we show that our method provides a significant improvement over D-statistics which are uninformative about the direction of admixture. PMID:25294645

  18. Site-specific recombination in the chicken genome using Flipase recombinase-mediated cassette exchange.

    PubMed

    Lee, Hong Jo; Lee, Hyung Chul; Kim, Young Min; Hwang, Young Sun; Park, Young Hyun; Park, Tae Sub; Han, Jae Yong

    2016-02-01

    Targeted genome recombination has been applied in diverse research fields and has a wide range of possible applications. In particular, the discovery of specific loci in the genome that support robust and ubiquitous expression of integrated genes and the development of genome-editing technology have facilitated rapid advances in various scientific areas. In this study, we produced transgenic (TG) chickens that can induce recombinase-mediated gene cassette exchange (RMCE), one of the site-specific recombination technologies, and confirmed RMCE in TG chicken-derived cells. As a result, we established TG chicken lines that have, Flipase (Flp) recognition target (FRT) pairs in the chicken genome, mediated by piggyBac transposition. The transgene integration patterns were diverse in each TG chicken line, and the integration diversity resulted in diverse levels of expression of exogenous genes in each tissue of the TG chickens. In addition, the replaced gene cassette was expressed successfully and maintained by RMCE in the FRT predominant loci of TG chicken-derived cells. These results indicate that targeted genome recombination technology with RMCE could be adaptable to TG chicken models and that the technology would be applicable to specific gene regulation by cis-element insertion and customized expression of functional proteins at predicted levels without epigenetic influence. © FASEB.

  19. Emergence and Evolution of Hominidae-Specific Coding and Noncoding Genomic Sequences

    PubMed Central

    Saber, Morteza Mahmoudi; Adeyemi Babarinde, Isaac; Hettiarachchi, Nilmini; Saitou, Naruya

    2016-01-01

    Family Hominidae, which includes humans and great apes, is recognized for unique complex social behavior and intellectual abilities. Despite the increasing genome data, however, the genomic origin of its phenotypic uniqueness has remained elusive. Clade-specific genes and highly conserved noncoding sequences (HCNSs) are among the high-potential evolutionary candidates involved in driving clade-specific characters and phenotypes. On this premise, we analyzed whole genome sequences along with gene orthology data retrieved from major DNA databases to find Hominidae-specific (HS) genes and HCNSs. We discovered that Down syndrome critical region 4 (DSCR4) is the only experimentally verified gene uniquely present in Hominidae. DSCR4 has no structural homology to any known protein and was inferred to have emerged in several steps through LTR/ERV1, LTR/ERVL retrotransposition, and transversion. Using the genomic distance as neutral evolution threshold, we identified 1,658 HS HCNSs. Polymorphism coverage and derived allele frequency analysis of HS HCNSs showed that these HCNSs are under purifying selection, indicating that they may harbor important functions. They are overrepresented in promoters/untranslated regions, in close proximity of genes involved in sensory perception of sound and developmental process, and also showed a significantly lower nucleosome occupancy probability. Interestingly, many ancestral sequences of the HS HCNSs showed very high evolutionary rates. This suggests that new functions emerged through some kind of positive selection, and then purifying selection started to operate to keep these functions. PMID:27289096

  20. Recombination rate variation in mice from an isolated island

    PubMed Central

    Wang, Richard J.; Gray, Melissa M.; Parmenter, Michelle D.; Broman, Karl W.; Payseur, Bret A.

    2016-01-01

    Recombination rate is a heritable trait that varies among individuals. Despite the major impact of recombination rate on patterns of genetic diversity and the efficacy of selection, natural variation in this phenotype remains poorly characterized. We present a comparison of genetic maps, sampling 1,212 meioses, from a unique population of wild house mice (Mus musculus domesticus) that recently colonized remote Gough Island. Crosses to a mainland reference strain (WSB/EiJ) reveal pervasive variation in recombination rate among Gough Island mice, including sub-chromosomal intervals spanning up to 28% of the genome. In spite of this high level of polymorphism, the genome-wide recombination rate does not significantly vary. In general, we find that recombination rate varies more when measured in smaller genomic intervals. Using the current standard genetic map of the laboratory mouse to polarize intervals with divergent recombination rates, we infer that the majority of evolutionary change occurred in one of the two tested lines of Gough Island mice. Our results confirm that natural populations harbor a high level of recombination rate polymorphism and highlight the disparities in recombination rate evolution across genomic scales. PMID:27864900

  1. Tissue-specific NETs alter genome organization and regulation even in a heterologous system.

    PubMed

    de Las Heras, Jose I; Zuleger, Nikolaj; Batrakou, Dzmitry G; Czapiewski, Rafal; Kerr, Alastair R W; Schirmer, Eric C

    2017-01-02

    Different cell types exhibit distinct patterns of 3D genome organization that correlate with changes in gene expression in tissue and differentiation systems. Several tissue-specific nuclear envelope transmembrane proteins (NETs) have been found to influence the spatial positioning of genes and chromosomes that normally occurs during tissue differentiation. Here we study 3 such NETs: NET29, NET39, and NET47, which are expressed preferentially in fat, muscle and liver, respectively. We found that even when exogenously expressed in a heterologous system they can specify particular genome organization patterns and alter gene expression. Each NET affected largely different subsets of genes. Notably, the liver-specific NET47 upregulated many genes in HT1080 fibroblast cells that are normally upregulated in hepatogenesis, showing that tissue-specific NETs can favor expression patterns associated with the tissue where the NET is normally expressed. Similarly, global profiling of peripheral chromatin after exogenous expression of these NETs using lamin B1 DamID revealed that each NET affected the nuclear positioning of distinct sets of genomic regions with a significant tissue-specific component. Thus NET influences on genome organization can contribute to gene expression changes associated with differentiation even in the absence of other factors and overt cellular differentiation changes.

  2. Discovery of Nigri/nox and Panto/pox site-specific recombinase systems facilitates advanced genome engineering.

    PubMed

    Karimova, Madina; Splith, Victoria; Karpinski, Janet; Pisabarro, M Teresa; Buchholz, Frank

    2016-07-22

    Precise genome engineering is instrumental for biomedical research and holds great promise for future therapeutic applications. Site-specific recombinases (SSRs) are valuable tools for genome engineering due to their exceptional ability to mediate precise excision, integration and inversion of genomic DNA in living systems. The ever-increasing complexity of genome manipulations and the desire to understand the DNA-binding specificity of these enzymes are driving efforts to identify novel SSR systems with unique properties. Here, we describe two novel tyrosine site-specific recombination systems designated Nigri/nox and Panto/pox. Nigri originates from Vibrio nigripulchritudo (plasmid VIBNI_pA) and recombines its target site nox with high efficiency and high target-site selectivity, without recombining target sites of the well established SSRs Cre, Dre, Vika and VCre. Panto, derived from Pantoea sp. aB, is less specific and in addition to its native target site, pox also recombines the target site for Dre recombinase, called rox. This relaxed specificity allowed the identification of residues that are involved in target site selectivity, thereby advancing our understanding of how SSRs recognize their respective DNA targets.

  3. CpG island methylator phenotype-low (CIMP-low) colorectal cancer shows not only few methylated CIMP-high-specific CpG islands, but also low-level methylation at individual loci.

    PubMed

    Kawasaki, Takako; Ohnishi, Mutsuko; Nosho, Katsuhiko; Suemoto, Yuko; Kirkner, Gregory J; Meyerhardt, Jeffrey A; Fuchs, Charles S; Ogino, Shuji

    2008-03-01

    The CpG island methylator phenotype (CIMP or CIMP-high) with widespread promoter methylation is a distinct phenotype in colorectal cancer. However, the concept of CIMP-low with less extensive CpG island methylation is still evolving. Our aim is to examine whether density of methylation in individual CpG islands was different between CIMP-low and CIMP-high tumors. Utilizing MethyLight technology and 889 population-based colorectal cancers, we quantified DNA methylation (methylation index, percentage of methylated reference) at 14 CpG islands, including 8 CIMP-high-specific loci (CACNA1G, CDKN2A (p16), CRABP1, IGF2, MLH1, NEUROG1, RUNX3 and SOCS1). Methylation positivity in each locus was defined as methylation index>4. Low-level methylation (methylation index>0, <20) in each CIMP-high-specific locus was significantly more common in 340 CIMP-low tumors (1/8-5/8 methylation-positive loci) than 133 CIMP-high tumors (> or =6/8 methylation-positive loci) and 416 CIMP-0 tumors (0/8 methylation-positive loci) (P< or =0.002). In the other six loci (CHFR, HIC1, IGFBP3, MGMT, MINT31 and WRN), which were not highly specific for CIMP-high, low-level methylation, was not persistently more prevalent in CIMP-low tumors. In conclusion, compared to CIMP-high and CIMP-0 tumors, CIMP-low colorectal cancers show not only few methylated CIMP-high-specific CpG islands, but also more frequent low-level methylation at individual loci. Our data may provide supporting evidence for a difference in pathogenesis of DNA methylation between CIMP-low and CIMP-high tumors.

  4. Consequences of Normalizing Transcriptomic and Genomic Libraries of Plant Genomes Using a Duplex-Specific Nuclease and Tetramethylammonium Chloride

    PubMed Central

    Froenicke, Lutz; Lavelle, Dean; Martineau, Belinda; Perroud, Bertrand; Michelmore, Richard

    2013-01-01

    Several applications of high throughput genome and transcriptome sequencing would benefit from a reduction of the high-copy-number sequences in the libraries being sequenced and analyzed, particularly when applied to species with large genomes. We adapted and analyzed the consequences of a method that utilizes a thermostable duplex-specific nuclease for reducing the high-copy components in transcriptomic and genomic libraries prior to sequencing. This reduces the time, cost, and computational effort of obtaining informative transcriptomic and genomic sequence data for both fully sequenced and non-sequenced genomes. It also reduces contamination from organellar DNA in preparations of nuclear DNA. Hybridization in the presence of 3 M tetramethylammonium chloride (TMAC), which equalizes the rates of hybridization of GC and AT nucleotide pairs, reduced the bias against sequences with high GC content. Consequences of this method on the reduction of high-copy and enrichment of low-copy sequences are reported for Arabidopsis and lettuce. PMID:23409088

  5. Consequences of normalizing transcriptomic and genomic libraries of plant genomes using a duplex-specific nuclease and tetramethylammonium chloride.

    PubMed

    Matvienko, Marta; Kozik, Alexander; Froenicke, Lutz; Lavelle, Dean; Martineau, Belinda; Perroud, Bertrand; Michelmore, Richard

    2013-01-01

    Several applications of high throughput genome and transcriptome sequencing would benefit from a reduction of the high-copy-number sequences in the libraries being sequenced and analyzed, particularly when applied to species with large genomes. We adapted and analyzed the consequences of a method that utilizes a thermostable duplex-specific nuclease for reducing the high-copy components in transcriptomic and genomic libraries prior to sequencing. This reduces the time, cost, and computational effort of obtaining informative transcriptomic and genomic sequence data for both fully sequenced and non-sequenced genomes. It also reduces contamination from organellar DNA in preparations of nuclear DNA. Hybridization in the presence of 3 M tetramethylammonium chloride (TMAC), which equalizes the rates of hybridization of GC and AT nucleotide pairs, reduced the bias against sequences with high GC content. Consequences of this method on the reduction of high-copy and enrichment of low-copy sequences are reported for Arabidopsis and lettuce.

  6. Emergence and Evolution of Hominidae-Specific Coding and Noncoding Genomic Sequences.

    PubMed

    Saber, Morteza Mahmoudi; Adeyemi Babarinde, Isaac; Hettiarachchi, Nilmini; Saitou, Naruya

    2016-07-12

    Family Hominidae, which includes humans and great apes, is recognized for unique complex social behavior and intellectual abilities. Despite the increasing genome data, however, the genomic origin of its phenotypic uniqueness has remained elusive. Clade-specific genes and highly conserved noncoding sequences (HCNSs) are among the high-potential evolutionary candidates involved in driving clade-specific characters and phenotypes. On this premise, we analyzed whole genome sequences along with gene orthology data retrieved from major DNA databases to find Hominidae-specific (HS) genes and HCNSs. We discovered that Down syndrome critical region 4 (DSCR4) is the only experimentally verified gene uniquely present in Hominidae. DSCR4 has no structural homology to any known protein and was inferred to have emerged in several steps through LTR/ERV1, LTR/ERVL retrotransposition, and transversion. Using the genomic distance as neutral evolution threshold, we identified 1,658 HS HCNSs. Polymorphism coverage and derived allele frequency analysis of HS HCNSs showed that these HCNSs are under purifying selection, indicating that they may harbor important functions. They are overrepresented in promoters/untranslated regions, in close proximity of genes involved in sensory perception of sound and developmental process, and also showed a significantly lower nucleosome occupancy probability. Interestingly, many ancestral sequences of the HS HCNSs showed very high evolutionary rates. This suggests that new functions emerged through some kind of positive selection, and then purifying selection started to operate to keep these functions. © The Author(s) 2016. Published by Oxford University Press on behalf of the Society for Molecular Biology and Evolution.

  7. GenomeD3Plot: a library for rich, interactive visualizations of genomic data in web applications.

    PubMed

    Laird, Matthew R; Langille, Morgan G I; Brinkman, Fiona S L

    2015-10-15

    A simple static image of genomes and associated metadata is very limiting, as researchers expect rich, interactive tools similar to the web applications found in the post-Web 2.0 world. GenomeD3Plot is a light weight visualization library written in javascript using the D3 library. GenomeD3Plot provides a rich API to allow the rapid visualization of complex genomic data using a convenient standards based JSON configuration file. When integrated into existing web services GenomeD3Plot allows researchers to interact with data, dynamically alter the view, or even resize or reposition the visualization in their browser window. In addition GenomeD3Plot has built in functionality to export any resulting genome visualization in PNG or SVG format for easy inclusion in manuscripts or presentations. GenomeD3Plot is being utilized in the recently released Islandviewer 3 (www.pathogenomics.sfu.ca/islandviewer/) to visualize predicted genomic islands with other genome annotation data. However, its features enable it to be more widely applicable for dynamic visualization of genomic data in general. GenomeD3Plot is licensed under the GNU-GPL v3 at https://github.com/brinkmanlab/GenomeD3Plot/. brinkman@sfu.ca. © The Author 2015. Published by Oxford University Press.

  8. Differential methylation of tissue- and cancer-specific CpG island shores distinguishes human induced pluripotent stem cells, embryonic stem cells and fibroblasts

    PubMed Central

    Doi, Akiko; Park, In-Hyun; Wen, Bo; Murakami, Peter; Aryee, Martin J; Irizarry, Rafael; Herb, Brian; Ladd-Acosta, Christine; Rho, Junsung; Loewer, Sabine; Miller, Justine; Schlaeger, Thorsten; Daley, George Q; Feinberg, Andrew P

    2010-01-01

    Induced pluripotent stem (iPS) cells are derived by epigenetic reprogramming, but their DNA methylation patterns have not yet been analyzed on a genome-wide scale. Here, we find substantial hypermethylation and hypomethylation of cytosine-phosphate-guanine (CpG) island shores in nine human iPS cell lines as compared to their parental fibroblasts. The differentially methylated regions (DMRs) in the reprogrammed cells (denoted R-DMRs) were significantly enriched in tissue-specific (T-DMRs; 2.6-fold, P < 10−4) and cancer-specific DMRs (C-DMRs; 3.6-fold, P < 10−4). Notably, even though the iPS cells are derived from fibroblasts, their R-DMRs can distinguish between normal brain, liver and spleen cells and between colon cancer and normal colon cells. Thus, many DMRs are broadly involved in tissue differentiation, epigenetic reprogramming and cancer. We observed colocalization of hypomethylated R-DMRs with hypermethylated C-DMRs and bivalent chromatin marks, and colocalization of hypermethylated R-DMRs with hypomethylated C-DMRs and the absence of bivalent marks, suggesting two mechanisms for epigenetic reprogramming in iPS cells and cancer. PMID:19881528

  9. Comparative Genomics of Completely Sequenced Lactobacillus helveticus Genomes Provides Insights into Strain-Specific Genes and Resolves Metagenomics Data Down to the Strain Level.

    PubMed

    Schmid, Michael; Muri, Jonathan; Melidis, Damianos; Varadarajan, Adithi R; Somerville, Vincent; Wicki, Adrian; Moser, Aline; Bourqui, Marc; Wenzel, Claudia; Eugster-Meier, Elisabeth; Frey, Juerg E; Irmler, Stefan; Ahrens, Christian H

    2018-01-01

    Although complete genome sequences hold particular value for an accurate description of core genomes, the identification of strain-specific genes, and as the optimal basis for functional genomics studies, they are still largely underrepresented in public repositories. Based on an assessment of the genome assembly complexity for all lactobacilli, we used Pacific Biosciences' long read technology to sequence and de novo assemble the genomes of three Lactobacillus helveticus starter strains, raising the number of completely sequenced strains to 12. The first comparative genomics study for L. helveticus -to our knowledge-identified a core genome of 988 genes and sets of unique, strain-specific genes ranging from about 30 to more than 200 genes. Importantly, the comparison of MiSeq- and PacBio-based assemblies uncovered that not only accessory but also core genes can be missed in incomplete genome assemblies based on short reads. Analysis of the three genomes revealed that a large number of pseudogenes were enriched for functional Gene Ontology categories such as amino acid transmembrane transport and carbohydrate metabolism, which is in line with a reductive genome evolution in the rich natural habitat of L. helveticus . Notably, the functional Clusters of Orthologous Groups of proteins categories "cell wall/membrane biogenesis" and "defense mechanisms" were found to be enriched among the strain-specific genes. A genome mining effort uncovered examples where an experimentally observed phenotype could be linked to the underlying genotype, such as for cell envelope proteinase PrtH3 of strain FAM8627. Another possible link identified for peptidoglycan hydrolases will require further experiments. Of note, strain FAM22155 did not harbor a CRISPR/Cas system; its loss was also observed in other L. helveticus strains and lactobacillus species, thus questioning the value of the CRISPR/Cas system for diagnostic purposes. Importantly, the complete genome sequences proved to be

  10. Comparative Genomics of Completely Sequenced Lactobacillus helveticus Genomes Provides Insights into Strain-Specific Genes and Resolves Metagenomics Data Down to the Strain Level

    PubMed Central

    Schmid, Michael; Muri, Jonathan; Melidis, Damianos; Varadarajan, Adithi R.; Somerville, Vincent; Wicki, Adrian; Moser, Aline; Bourqui, Marc; Wenzel, Claudia; Eugster-Meier, Elisabeth; Frey, Juerg E.; Irmler, Stefan; Ahrens, Christian H.

    2018-01-01

    Although complete genome sequences hold particular value for an accurate description of core genomes, the identification of strain-specific genes, and as the optimal basis for functional genomics studies, they are still largely underrepresented in public repositories. Based on an assessment of the genome assembly complexity for all lactobacilli, we used Pacific Biosciences' long read technology to sequence and de novo assemble the genomes of three Lactobacillus helveticus starter strains, raising the number of completely sequenced strains to 12. The first comparative genomics study for L. helveticus—to our knowledge—identified a core genome of 988 genes and sets of unique, strain-specific genes ranging from about 30 to more than 200 genes. Importantly, the comparison of MiSeq- and PacBio-based assemblies uncovered that not only accessory but also core genes can be missed in incomplete genome assemblies based on short reads. Analysis of the three genomes revealed that a large number of pseudogenes were enriched for functional Gene Ontology categories such as amino acid transmembrane transport and carbohydrate metabolism, which is in line with a reductive genome evolution in the rich natural habitat of L. helveticus. Notably, the functional Clusters of Orthologous Groups of proteins categories “cell wall/membrane biogenesis” and “defense mechanisms” were found to be enriched among the strain-specific genes. A genome mining effort uncovered examples where an experimentally observed phenotype could be linked to the underlying genotype, such as for cell envelope proteinase PrtH3 of strain FAM8627. Another possible link identified for peptidoglycan hydrolases will require further experiments. Of note, strain FAM22155 did not harbor a CRISPR/Cas system; its loss was also observed in other L. helveticus strains and lactobacillus species, thus questioning the value of the CRISPR/Cas system for diagnostic purposes. Importantly, the complete genome sequences

  11. Cytotoxic chromosomal targeting by CRISPR/Cas systems can reshape bacterial genomes and expel or remodel pathogenicity islands.

    PubMed

    Vercoe, Reuben B; Chang, James T; Dy, Ron L; Taylor, Corinda; Gristwood, Tamzin; Clulow, James S; Richter, Corinna; Przybilski, Rita; Pitman, Andrew R; Fineran, Peter C

    2013-04-01

    In prokaryotes, clustered regularly interspaced short palindromic repeats (CRISPRs) and their associated (Cas) proteins constitute a defence system against bacteriophages and plasmids. CRISPR/Cas systems acquire short spacer sequences from foreign genetic elements and incorporate these into their CRISPR arrays, generating a memory of past invaders. Defence is provided by short non-coding RNAs that guide Cas proteins to cleave complementary nucleic acids. While most spacers are acquired from phages and plasmids, there are examples of spacers that match genes elsewhere in the host bacterial chromosome. In Pectobacterium atrosepticum the type I-F CRISPR/Cas system has acquired a self-complementary spacer that perfectly matches a protospacer target in a horizontally acquired island (HAI2) involved in plant pathogenicity. Given the paucity of experimental data about CRISPR/Cas-mediated chromosomal targeting, we examined this process by developing a tightly controlled system. Chromosomal targeting was highly toxic via targeting of DNA and resulted in growth inhibition and cellular filamentation. The toxic phenotype was avoided by mutations in the cas operon, the CRISPR repeats, the protospacer target, and protospacer-adjacent motif (PAM) beside the target. Indeed, the natural self-targeting spacer was non-toxic due to a single nucleotide mutation adjacent to the target in the PAM sequence. Furthermore, we show that chromosomal targeting can result in large-scale genomic alterations, including the remodelling or deletion of entire pre-existing pathogenicity islands. These features can be engineered for the targeted deletion of large regions of bacterial chromosomes. In conclusion, in DNA-targeting CRISPR/Cas systems, chromosomal interference is deleterious by causing DNA damage and providing a strong selective pressure for genome alterations, which may have consequences for bacterial evolution and pathogenicity.

  12. Profiling of gene duplication patterns of sequenced teleost genomes: evidence for rapid lineage-specific genome expansion mediated by recent tandem duplications.

    PubMed

    Lu, Jianguo; Peatman, Eric; Tang, Haibao; Lewis, Joshua; Liu, Zhanjiang

    2012-06-15

    Gene duplication has had a major impact on genome evolution. Localized (or tandem) duplication resulting from unequal crossing over and whole genome duplication are believed to be the two dominant mechanisms contributing to vertebrate genome evolution. While much scrutiny has been directed toward discerning patterns indicative of whole-genome duplication events in teleost species, less attention has been paid to the continuous nature of gene duplications and their impact on the size, gene content, functional diversity, and overall architecture of teleost genomes. Here, using a Markov clustering algorithm directed approach we catalogue and analyze patterns of gene duplication in the four model teleost species with chromosomal coordinates: zebrafish, medaka, stickleback, and Tetraodon. Our analyses based on set size, duplication type, synonymous substitution rate (Ks), and gene ontology emphasize shared and lineage-specific patterns of genome evolution via gene duplication. Most strikingly, our analyses highlight the extraordinary duplication and retention rate of recent duplicates in zebrafish and their likely role in the structural and functional expansion of the zebrafish genome. We find that the zebrafish genome is remarkable in its large number of duplicated genes, small duplicate set size, biased Ks distribution toward minimal mutational divergence, and proportion of tandem and intra-chromosomal duplicates when compared with the other teleost model genomes. The observed gene duplication patterns have played significant roles in shaping the architecture of teleost genomes and appear to have contributed to the recent functional diversification and divergence of important physiological processes in zebrafish. We have analyzed gene duplication patterns and duplication types among the available teleost genomes and found that a large number of genes were tandemly and intrachromosomally duplicated, suggesting their origin of independent and continuous duplication

  13. Meta-analysis of sex-specific genome-wide association studies.

    PubMed

    Magi, Reedik; Lindgren, Cecilia M; Morris, Andrew P

    2010-12-01

    Despite the success of genome-wide association studies, much of the genetic contribution to complex human traits is still unexplained. One potential source of genetic variation that may contribute to this "missing heritability" is that which differs in magnitude and/or direction between males and females, which could result from sexual dimorphism in gene expression. Such sex-differentiated effects are common in model organisms, and are becoming increasingly evident in human complex traits through large-scale male- and female-specific meta-analyses. In this article, we review the methodology for meta-analysis of sex-specific genome-wide association studies, and propose a sex-differentiated test of association with quantitative or dichotomous traits, which allows for heterogeneity of allelic effects between males and females. We perform detailed simulations to compare the power of the proposed sex-differentiated meta-analysis with the more traditional "sex-combined" approach, which is ambivalent to gender. The results of this study highlight only a small loss in power for the sex-differentiated meta-analysis when the allelic effects of the causal variant are the same in males and females. However, over a range of models of heterogeneity in allelic effects between genders, our sex-differentiated meta-analysis strategy offers substantial gains in power, and thus has the potential to discover novel loci contributing effects to complex human traits with existing genome-wide association data. © 2010 Wiley-Liss, Inc.

  14. A site specific model and analysis of the neutral somatic mutation rate in whole-genome cancer data.

    PubMed

    Bertl, Johanna; Guo, Qianyun; Juul, Malene; Besenbacher, Søren; Nielsen, Morten Muhlig; Hornshøj, Henrik; Pedersen, Jakob Skou; Hobolth, Asger

    2018-04-19

    Detailed modelling of the neutral mutational process in cancer cells is crucial for identifying driver mutations and understanding the mutational mechanisms that act during cancer development. The neutral mutational process is very complex: whole-genome analyses have revealed that the mutation rate differs between cancer types, between patients and along the genome depending on the genetic and epigenetic context. Therefore, methods that predict the number of different types of mutations in regions or specific genomic elements must consider local genomic explanatory variables. A major drawback of most methods is the need to average the explanatory variables across the entire region or genomic element. This procedure is particularly problematic if the explanatory variable varies dramatically in the element under consideration. To take into account the fine scale of the explanatory variables, we model the probabilities of different types of mutations for each position in the genome by multinomial logistic regression. We analyse 505 cancer genomes from 14 different cancer types and compare the performance in predicting mutation rate for both regional based models and site-specific models. We show that for 1000 randomly selected genomic positions, the site-specific model predicts the mutation rate much better than regional based models. We use a forward selection procedure to identify the most important explanatory variables. The procedure identifies site-specific conservation (phyloP), replication timing, and expression level as the best predictors for the mutation rate. Finally, our model confirms and quantifies certain well-known mutational signatures. We find that our site-specific multinomial regression model outperforms the regional based models. The possibility of including genomic variables on different scales and patient specific variables makes it a versatile framework for studying different mutational mechanisms. Our model can serve as the neutral null model

  15. Patient-Specific Bacteroides Genome Variants in Pouchitis

    DOE PAGES

    Vineis, Joseph H.; Ringus, Daina L.; Morrison, Hilary G.; ...

    2016-11-15

    Here, a 2-year longitudinal microbiome study of 22 patients who underwent colectomy with an ileal pouch anal anastomosis detected significant increases in distinct populations of Bacteroides during 9 of 11 patient visits that coincided with inflammation (pouchitis). Oligotyping and metagenomic short-read annotation identified Bacteroides populations that occurred in early samples, bloomed during inflammation, and reappeared after antibiotic treatment. Targeted cultivation of Bacteroides isolates from the same individual at multiple time points and from several patients detected subtle genomic changes, including the identification of rapidly evolving genomic elements that differentiate isogenic strains of Bacteroides fragilis from the mucosa versus lumen. Eachmore » patient harbored Bacteroides spp. that are closely related to commonly occurring clinical isolates, including Bacteroides ovatus, B. thetaiotaomicron, B. vulgatus, and B. fragilis, which contained unique loci in different patients for synthesis of capsular polysaccharides. The presence of unique Bacteroides capsular polysaccharide loci within different hosts and between the lumen and mucosa may represent adaptations to stimulate, suppress, and evade host-specific immune responses at different microsites of the ileal pouch.« less

  16. Patient-Specific Bacteroides Genome Variants in Pouchitis

    DOE Office of Scientific and Technical Information (OSTI.GOV)

    Vineis, Joseph H.; Ringus, Daina L.; Morrison, Hilary G.

    Here, a 2-year longitudinal microbiome study of 22 patients who underwent colectomy with an ileal pouch anal anastomosis detected significant increases in distinct populations of Bacteroides during 9 of 11 patient visits that coincided with inflammation (pouchitis). Oligotyping and metagenomic short-read annotation identified Bacteroides populations that occurred in early samples, bloomed during inflammation, and reappeared after antibiotic treatment. Targeted cultivation of Bacteroides isolates from the same individual at multiple time points and from several patients detected subtle genomic changes, including the identification of rapidly evolving genomic elements that differentiate isogenic strains of Bacteroides fragilis from the mucosa versus lumen. Eachmore » patient harbored Bacteroides spp. that are closely related to commonly occurring clinical isolates, including Bacteroides ovatus, B. thetaiotaomicron, B. vulgatus, and B. fragilis, which contained unique loci in different patients for synthesis of capsular polysaccharides. The presence of unique Bacteroides capsular polysaccharide loci within different hosts and between the lumen and mucosa may represent adaptations to stimulate, suppress, and evade host-specific immune responses at different microsites of the ileal pouch.« less

  17. Genome Sequence of Exiguobacterium antarcticum B7, Isolated from a Biofilm in Ginger Lake, King George Island, Antarctica

    PubMed Central

    Carneiro, Adriana Ribeiro; Ramos, Rommel Thiago Jucá; Dall'Agnol, Hivana; Pinto, Anne Cybelle; de Castro Soares, Siomar; Santos, Anderson Rodrigues; Guimarães, Luis Carlos; Almeida, Sintia Silva; Baraúna, Rafael Azevedo; das Graças, Diego Assis; Franco, Luciano Chaves; Ali, Amjad; Hassan, Syed Shah; Nunes, Catarina Isabel P.; Barbosa, Maria Silvanira; Fiaux, Karina Kelly; Aburjaile, Flávia Figueira; Barbosa, Eudes Guilherme Vieira; Bakhtiar, Syeda Marriam; Vilela, Daniella; Nóbrega, Felipe; dos Santos, Adriana Lopes; Carepo, Marta Sofia P.; Azevedo, Vasco; Schneider, Maria Paula Cruz; Pellizari, Vivian Helena

    2012-01-01

    Exiguobacterium antarcticum is a psychotropic bacterium isolated for the first time from microbial mats of Lake Fryxell in Antarctica. Many organisms of the genus Exiguobacterium are extremophiles and have properties of biotechnological interest, e.g., the capacity to adapt to cold, which make this genus a target for discovering new enzymes, such as lipases and proteases, in addition to improving our understanding of the mechanisms of adaptation and survival at low temperatures. This study presents the genome of E. antarcticum B7, isolated from a biofilm sample of Ginger Lake on King George Island, Antarctic peninsula. PMID:23144424

  18. A ddRAD-based genetic map and its integration with the genome assembly of Japanese eel (Anguilla japonica) provides insights into genome evolution after the teleost-specific genome duplication

    PubMed Central

    2014-01-01

    Background Recent advancements in next-generation sequencing technology have enabled cost-effective sequencing of whole or partial genomes, permitting the discovery and characterization of molecular polymorphisms. Double-digest restriction-site associated DNA sequencing (ddRAD-seq) is a powerful and inexpensive approach to developing numerous single nucleotide polymorphism (SNP) markers and constructing a high-density genetic map. To enrich genomic resources for Japanese eel (Anguilla japonica), we constructed a ddRAD-based genetic map using an Ion Torrent Personal Genome Machine and anchored scaffolds of the current genome assembly to 19 linkage groups of the Japanese eel. Furthermore, we compared the Japanese eel genome with genomes of model fishes to infer the history of genome evolution after the teleost-specific genome duplication. Results We generated the ddRAD-based linkage map of the Japanese eel, where the maps for female and male spanned 1748.8 cM and 1294.5 cM, respectively, and were arranged into 19 linkage groups. A total of 2,672 SNP markers and 115 Simple Sequence Repeat markers provide anchor points to 1,252 scaffolds covering 151 Mb (13%) of the current genome assembly of the Japanese eel. Comparisons among the Japanese eel, medaka, zebrafish and spotted gar genomes showed highly conserved synteny among teleosts and revealed part of the eight major chromosomal rearrangement events that occurred soon after the teleost-specific genome duplication. Conclusions The ddRAD-seq approach combined with the Ion Torrent Personal Genome Machine sequencing allowed us to conduct efficient and flexible SNP genotyping. The integration of the genetic map and the assembled sequence provides a valuable resource for fine mapping and positional cloning of quantitative trait loci associated with economically important traits and for investigating comparative genomics of the Japanese eel. PMID:24669946

  19. A ddRAD-based genetic map and its integration with the genome assembly of Japanese eel (Anguilla japonica) provides insights into genome evolution after the teleost-specific genome duplication.

    PubMed

    Kai, Wataru; Nomura, Kazuharu; Fujiwara, Atushi; Nakamura, Yoji; Yasuike, Motoshige; Ojima, Nobuhiko; Masaoka, Tetsuji; Ozaki, Akiyuki; Kazeto, Yukinori; Gen, Koichiro; Nagao, Jiro; Tanaka, Hideki; Kobayashi, Takanori; Ototake, Mitsuru

    2014-03-26

    Recent advancements in next-generation sequencing technology have enabled cost-effective sequencing of whole or partial genomes, permitting the discovery and characterization of molecular polymorphisms. Double-digest restriction-site associated DNA sequencing (ddRAD-seq) is a powerful and inexpensive approach to developing numerous single nucleotide polymorphism (SNP) markers and constructing a high-density genetic map. To enrich genomic resources for Japanese eel (Anguilla japonica), we constructed a ddRAD-based genetic map using an Ion Torrent Personal Genome Machine and anchored scaffolds of the current genome assembly to 19 linkage groups of the Japanese eel. Furthermore, we compared the Japanese eel genome with genomes of model fishes to infer the history of genome evolution after the teleost-specific genome duplication. We generated the ddRAD-based linkage map of the Japanese eel, where the maps for female and male spanned 1748.8 cM and 1294.5 cM, respectively, and were arranged into 19 linkage groups. A total of 2,672 SNP markers and 115 Simple Sequence Repeat markers provide anchor points to 1,252 scaffolds covering 151 Mb (13%) of the current genome assembly of the Japanese eel. Comparisons among the Japanese eel, medaka, zebrafish and spotted gar genomes showed highly conserved synteny among teleosts and revealed part of the eight major chromosomal rearrangement events that occurred soon after the teleost-specific genome duplication. The ddRAD-seq approach combined with the Ion Torrent Personal Genome Machine sequencing allowed us to conduct efficient and flexible SNP genotyping. The integration of the genetic map and the assembled sequence provides a valuable resource for fine mapping and positional cloning of quantitative trait loci associated with economically important traits and for investigating comparative genomics of the Japanese eel.

  20. Improved Prediction of Non-methylated Islands in Vertebrates Highlights Different Characteristic Sequence Patterns

    PubMed Central

    Vingron, Martin

    2016-01-01

    Non-methylated islands (NMIs) of DNA are genomic regions that are important for gene regulation and development. A recent study of genome-wide non-methylation data in vertebrates by Long et al. (eLife 2013;2:e00348) has shown that many experimentally identified non-methylated regions do not overlap with classically defined CpG islands which are computationally predicted using simple DNA sequence features. This is especially true in cold-blooded vertebrates such as Danio rerio (zebrafish). In order to investigate how predictive DNA sequence is of a region’s methylation status, we applied a supervised learning approach using a spectrum kernel support vector machine, to see if a more complex model and supervised learning can be used to improve non-methylated island prediction and to understand the sequence properties of these regions. We demonstrate that DNA sequence is highly predictive of methylation status, and that in contrast to existing CpG island prediction methods our method is able to provide more useful predictions of NMIs genome-wide in all vertebrate organisms that were studied. Our results also show that in cold-blooded vertebrates (Anolis carolinensis, Xenopus tropicalis and Danio rerio) where genome-wide classical CpG island predictions consist primarily of false positives, longer primarily AT-rich DNA sequence features are able to identify these regions much more accurately. PMID:27984582

  1. Discovery of Nigri/nox and Panto/pox site-specific recombinase systems facilitates advanced genome engineering

    PubMed Central

    Karimova, Madina; Splith, Victoria; Karpinski, Janet; Pisabarro, M. Teresa; Buchholz, Frank

    2016-01-01

    Precise genome engineering is instrumental for biomedical research and holds great promise for future therapeutic applications. Site-specific recombinases (SSRs) are valuable tools for genome engineering due to their exceptional ability to mediate precise excision, integration and inversion of genomic DNA in living systems. The ever-increasing complexity of genome manipulations and the desire to understand the DNA-binding specificity of these enzymes are driving efforts to identify novel SSR systems with unique properties. Here, we describe two novel tyrosine site-specific recombination systems designated Nigri/nox and Panto/pox. Nigri originates from Vibrio nigripulchritudo (plasmid VIBNI_pA) and recombines its target site nox with high efficiency and high target-site selectivity, without recombining target sites of the well established SSRs Cre, Dre, Vika and VCre. Panto, derived from Pantoea sp. aB, is less specific and in addition to its native target site, pox also recombines the target site for Dre recombinase, called rox. This relaxed specificity allowed the identification of residues that are involved in target site selectivity, thereby advancing our understanding of how SSRs recognize their respective DNA targets. PMID:27444945

  2. Hematopoietic transcriptional mechanisms: from locus-specific to genome-wide vantage points.

    PubMed

    DeVilbiss, Andrew W; Sanalkumar, Rajendran; Johnson, Kirby D; Keles, Sunduz; Bresnick, Emery H

    2014-08-01

    Hematopoiesis is an exquisitely regulated process in which stem cells in the developing embryo and the adult generate progenitor cells that give rise to all blood lineages. Master regulatory transcription factors control hematopoiesis by integrating signals from the microenvironment and dynamically establishing and maintaining genetic networks. One of the most rudimentary aspects of cell type-specific transcription factor function, how they occupy a highly restricted cohort of cis-elements in chromatin, remains poorly understood. Transformative technologic advances involving the coupling of next-generation DNA sequencing technology with the chromatin immunoprecipitation assay (ChIP-seq) have enabled genome-wide mapping of factor occupancy patterns. However, formidable problems remain; notably, ChIP-seq analysis yields hundreds to thousands of chromatin sites occupied by a given transcription factor, and only a fraction of the sites appear to be endowed with critical, non-redundant function. It has become en vogue to map transcription factor occupancy patterns genome-wide, while using powerful statistical tools to establish correlations to inform biology and mechanisms. With the advent of revolutionary genome editing technologies, one can now reach beyond correlations to conduct definitive hypothesis testing. This review focuses on key discoveries that have emerged during the path from single loci to genome-wide analyses, specifically in the context of hematopoietic transcriptional mechanisms. Copyright © 2014 ISEH - International Society for Experimental Hematology. Published by Elsevier Inc. All rights reserved.

  3. A genomic island integrated into recA of Vibrio cholerae contains a divergent recA and provides multi-pathway protection from DNA damage.

    PubMed

    Rapa, Rita A; Islam, Atiqul; Monahan, Leigh G; Mutreja, Ankur; Thomson, Nicholas; Charles, Ian G; Stokes, Harold W; Labbate, Maurizio

    2015-04-01

    Lateral gene transfer (LGT) has been crucial in the evolution of the cholera pathogen, Vibrio cholerae. The two major virulence factors are present on two different mobile genetic elements, a bacteriophage containing the cholera toxin genes and a genomic island (GI) containing the intestinal adhesin genes. Non-toxigenic V. cholerae in the aquatic environment are a major source of novel DNA that allows the pathogen to morph via LGT. In this study, we report a novel GI from a non-toxigenic V. cholerae strain containing multiple genes involved in DNA repair including the recombination repair gene recA that is 23% divergent from the indigenous recA and genes involved in the translesion synthesis pathway. This is the first report of a GI containing the critical gene recA and the first report of a GI that targets insertion into a specific site within recA. We show that possession of the island in Escherichia coli is protective against DNA damage induced by UV-irradiation and DNA targeting antibiotics. This study highlights the importance of genetic elements such as GIs in the evolution of V. cholerae and emphasizes the importance of environmental strains as a source of novel DNA that can influence the pathogenicity of toxigenic strains. © 2014 The Authors. Environmental Microbiology published by Society for Applied Microbiology and John Wiley & Sons Ltd.

  4. Comparative genomics of defense systems in archaea and bacteria

    PubMed Central

    Makarova, Kira S.; Wolf, Yuri I.; Koonin, Eugene V.

    2013-01-01

    Our knowledge of prokaryotic defense systems has vastly expanded as the result of comparative genomic analysis, followed by experimental validation. This expansion is both quantitative, including the discovery of diverse new examples of known types of defense systems, such as restriction-modification or toxin-antitoxin systems, and qualitative, including the discovery of fundamentally new defense mechanisms, such as the CRISPR-Cas immunity system. Large-scale statistical analysis reveals that the distribution of different defense systems in bacterial and archaeal taxa is non-uniform, with four groups of organisms distinguishable with respect to the overall abundance and the balance between specific types of defense systems. The genes encoding defense system components in bacterial and archaea typically cluster in defense islands. In addition to genes encoding known defense systems, these islands contain numerous uncharacterized genes, which are candidates for new types of defense systems. The tight association of the genes encoding immunity systems and dormancy- or cell death-inducing defense systems in prokaryotic genomes suggests that these two major types of defense are functionally coupled, providing for effective protection at the population level. PMID:23470997

  5. Genome-wide detection of conservative site-specific recombination in bacteria

    PubMed Central

    Mathias Garrett, Elizabeth; Camilli, Andrew

    2018-01-01

    The ability of clonal bacterial populations to generate genomic and phenotypic heterogeneity is thought to be of great importance for many commensal and pathogenic bacteria. One common mechanism contributing to diversity formation relies on the inversion of small genomic DNA segments in a process commonly referred to as conservative site-specific recombination. This phenomenon is known to occur in several bacterial lineages, however it remains notoriously difficult to identify due to the lack of conserved features. Here, we report an easy-to-implement method based on high-throughput paired-end sequencing for genome-wide detection of conservative site-specific recombination on a single-nucleotide level. We demonstrate the effectiveness of the method by successfully detecting several novel inversion sites in an epidemic isolate of the enteric pathogen Clostridium difficile. Using an experimental approach, we validate the inversion potential of all detected sites in C. difficile and quantify their prevalence during exponential and stationary growth in vitro. In addition, we demonstrate that the master recombinase RecV is responsible for the inversion of some but not all invertible sites. Using a fluorescent gene-reporter system, we show that at least one gene from a two-component system located next to an invertible site is expressed in an on-off mode reminiscent of phase variation. We further demonstrate the applicability of our method by mining 209 publicly available sequencing datasets and show that conservative site-specific recombination is common in the bacterial realm but appears to be absent in some lineages. Finally, we show that the gene content associated with the inversion sites is diverse and goes beyond traditionally described surface components. Overall, our method provides a robust platform for detection of conservative site-specific recombination in bacteria and opens a new avenue for global exploration of this important phenomenon. PMID:29621238

  6. Genome-wide histone state profiling of fibroblasts from the opossum, Monodelphis domestica, identifies the first marsupial-specific imprinted gene

    PubMed Central

    2014-01-01

    Background Imprinted genes have been extensively documented in eutherian mammals and found to exhibit significant interspecific variation in the suites of genes that are imprinted and in their regulation between tissues and developmental stages. Much less is known about imprinted loci in metatherian (marsupial) mammals, wherein studies have been limited to a small number of genes previously known to be imprinted in eutherians. We describe the first ab initio search for imprinted marsupial genes, in fibroblasts from the opossum, Monodelphis domestica, based on a genome-wide ChIP-seq strategy to identify promoters that are simultaneously marked by mutually exclusive, transcriptionally opposing histone modifications. Results We identified a novel imprinted gene (Meis1) and two additional monoallelically expressed genes, one of which (Cstb) showed allele-specific, but non-imprinted expression. Imprinted vs. allele-specific expression could not be resolved for the third monoallelically expressed gene (Rpl17). Transcriptionally opposing histone modifications H3K4me3, H3K9Ac, and H3K9me3 were found at the promoters of all three genes, but differential DNA methylation was not detected at CpG islands at any of these promoters. Conclusions In generating the first genome-wide histone modification profiles for a marsupial, we identified the first gene that is imprinted in a marsupial but not in eutherian mammals. This outcome demonstrates the practicality of an ab initio discovery strategy and implicates histone modification, but not differential DNA methylation, as a conserved mechanism for marking imprinted genes in all therian mammals. Our findings suggest that marsupials use multiple epigenetic mechanisms for imprinting and support the concept that lineage-specific selective forces can produce sets of imprinted genes that differ between metatherian and eutherian lines. PMID:24484454

  7. Cell Context Dependent p53 Genome-Wide Binding Patterns and Enrichment at Repeats

    DOE PAGES

    Botcheva, Krassimira; McCorkle, Sean R.

    2014-11-21

    The p53 ability to elicit stress specific and cell type specific responses is well recognized, but how that specificity is established remains to be defined. Whether upon activation p53 binds to its genomic targets in a cell type and stress type dependent manner is still an open question. Here we show that the p53 binding to the human genome is selective and cell context-dependent. We mapped the genomic binding sites for the endogenous wild type p53 protein in the human cancer cell line HCT116 and compared them to those we previously determined in the normal cell line IMR90. We reportmore » distinct p53 genome-wide binding landscapes in two different cell lines, analyzed under the same treatment and experimental conditions, using the same ChIP-seq approach. This is evidence for cell context dependent p53 genomic binding. The observed differences affect the p53 binding sites distribution with respect to major genomic and epigenomic elements (promoter regions, CpG islands and repeats). We correlated the high-confidence p53 ChIP-seq peaks positions with the annotated human repeats (UCSC Human Genome Browser) and observed both common and cell line specific trends. In HCT116, the p53 binding was specifically enriched at LINE repeats, compared to IMR90 cells. The p53 genome-wide binding patterns in HCT116 and IMR90 likely reflect the different epigenetic landscapes in these two cell lines, resulting from cancer-associated changes (accumulated in HCT116) superimposed on tissue specific differences (HCT116 has epithelial, while IMR90 has mesenchymal origin). In conclusion, our data support the model for p53 binding to the human genome in a highly selective manner, mobilizing distinct sets of genes, contributing to distinct pathways.« less

  8. Plant nodulation inducers enhance horizontal gene transfer of Azorhizobium caulinodans symbiosis island

    PubMed Central

    Ling, Jun; Wang, Hui; Wu, Ping; Li, Tao; Tang, Yu; Naseer, Nawar; Zheng, Huiming; Masson-Boivin, Catherine; Zhong, Zengtao

    2016-01-01

    Horizontal gene transfer (HGT) of genomic islands is a driving force of bacterial evolution. Many pathogens and symbionts use this mechanism to spread mobile genetic elements that carry genes important for interaction with their eukaryotic hosts. However, the role of the host in this process remains unclear. Here, we show that plant compounds inducing the nodulation process in the rhizobium-legume mutualistic symbiosis also enhance the transfer of symbiosis islands. We demonstrate that the symbiosis island of the Sesbania rostrata symbiont, Azorhizobium caulinodans, is an 87.6-kb integrative and conjugative element (ICEAc) that is able to excise, form a circular DNA, and conjugatively transfer to a specific site of gly-tRNA gene of other rhizobial genera, expanding their host range. The HGT frequency was significantly increased in the rhizosphere. An ICEAc-located LysR-family transcriptional regulatory protein AhaR triggered the HGT process in response to plant flavonoids that induce the expression of nodulation genes through another LysR-type protein, NodD. Our study suggests that rhizobia may sense rhizosphere environments and transfer their symbiosis gene contents to other genera of rhizobia, thereby broadening rhizobial host-range specificity. PMID:27849579

  9. Plant nodulation inducers enhance horizontal gene transfer of Azorhizobium caulinodans symbiosis island.

    PubMed

    Ling, Jun; Wang, Hui; Wu, Ping; Li, Tao; Tang, Yu; Naseer, Nawar; Zheng, Huiming; Masson-Boivin, Catherine; Zhong, Zengtao; Zhu, Jun

    2016-11-29

    Horizontal gene transfer (HGT) of genomic islands is a driving force of bacterial evolution. Many pathogens and symbionts use this mechanism to spread mobile genetic elements that carry genes important for interaction with their eukaryotic hosts. However, the role of the host in this process remains unclear. Here, we show that plant compounds inducing the nodulation process in the rhizobium-legume mutualistic symbiosis also enhance the transfer of symbiosis islands. We demonstrate that the symbiosis island of the Sesbania rostrata symbiont, Azorhizobium caulinodans, is an 87.6-kb integrative and conjugative element (ICE Ac ) that is able to excise, form a circular DNA, and conjugatively transfer to a specific site of gly-tRNA gene of other rhizobial genera, expanding their host range. The HGT frequency was significantly increased in the rhizosphere. An ICE Ac -located LysR-family transcriptional regulatory protein AhaR triggered the HGT process in response to plant flavonoids that induce the expression of nodulation genes through another LysR-type protein, NodD. Our study suggests that rhizobia may sense rhizosphere environments and transfer their symbiosis gene contents to other genera of rhizobia, thereby broadening rhizobial host-range specificity.

  10. Prevalence of genomic island PAPI-1 in clinical isolates of Pseudomonas aeruginosa in Iran.

    PubMed

    Sadeghifard, Nourkhoda; Rasaei, Seyedeh Zahra; Ghafourian, Sobhan; Zolfaghary, Mohammad Reza; Ranjbar, Reza; Raftari, Mohammad; Mohebi, Reza; Maleki, Abbas; Rahbar, Mohammad

    2012-03-01

    Pseudomonas aeruginosa, a gram-negative rod-shaped bacterium, is an opportunistic pathogen, which causes various serious diseases in humans and animals. The aims of this study were to evaluate of the presence of genomic island PAPI-1 in Pseudomonas aeruginosa isolated from Reference Laboratory of Ilam, Milad Hospital and Emam Khomeini Hospital, Iran and to study the frequency of extended spectrum beta-lactamases (ESBLs) among isolates. Forty-eight clinical isolates of P. aeruginosa were obtained during April to September 2010, and were evaluated for ESBLs by screening and confirmatory disk diffusion methods and PAPI-1 by PCR. Fifteen of 48 P. aeruginosa isolates were positive for ESBLs and 17 isolates positive for PAPI-1. This was first study of the prevalence of PAPI-1 in clinical isolates of P. aeruginosa in Iran, showing that most of PAPI-1 positive strains had high levels of antibiotic resistance and produced ESBLs.

  11. Island biology: looking towards the future

    PubMed Central

    Kueffer, Christoph; Drake, Donald R.; Fernández-Palacios, José María

    2014-01-01

    Oceanic islands are renowned for the profound scientific insights that their fascinating biotas have provided to biologists during the past two centuries. Research presented at Island Biology 2014—an international conference, held in Honolulu, Hawaii (7–11 July 2014), which attracted 253 presenters and 430 participants from at least 35 countries1—demonstrated that islands are reclaiming a leading role in ecology and evolution, especially for synthetic studies at the intersections of macroecology, evolution, community ecology and applied ecology. New dynamics in island biology are stimulated by four major developments. We are experiencing the emergence of a truly global and comprehensive island research community incorporating previously neglected islands and taxa. Macroecology and big-data analyses yield a wealth of global-scale synthetic studies and detailed multi-island comparisons, while other modern research approaches such as genomics, phylogenetic and functional ecology, and palaeoecology, are also dispersing to islands. And, increasingly tight collaborations between basic research and conservation management make islands places where new conservation solutions for the twenty-first century are being tested. Islands are home to a disproportionate share of the world's rare (and extinct) species, and there is an urgent need to develop increasingly collaborative and innovative research to address their conservation requirements. PMID:25339655

  12. Cytotoxic Chromosomal Targeting by CRISPR/Cas Systems Can Reshape Bacterial Genomes and Expel or Remodel Pathogenicity Islands

    PubMed Central

    Vercoe, Reuben B.; Chang, James T.; Dy, Ron L.; Taylor, Corinda; Gristwood, Tamzin; Clulow, James S.; Richter, Corinna; Przybilski, Rita; Pitman, Andrew R.; Fineran, Peter C.

    2013-01-01

    In prokaryotes, clustered regularly interspaced short palindromic repeats (CRISPRs) and their associated (Cas) proteins constitute a defence system against bacteriophages and plasmids. CRISPR/Cas systems acquire short spacer sequences from foreign genetic elements and incorporate these into their CRISPR arrays, generating a memory of past invaders. Defence is provided by short non-coding RNAs that guide Cas proteins to cleave complementary nucleic acids. While most spacers are acquired from phages and plasmids, there are examples of spacers that match genes elsewhere in the host bacterial chromosome. In Pectobacterium atrosepticum the type I-F CRISPR/Cas system has acquired a self-complementary spacer that perfectly matches a protospacer target in a horizontally acquired island (HAI2) involved in plant pathogenicity. Given the paucity of experimental data about CRISPR/Cas–mediated chromosomal targeting, we examined this process by developing a tightly controlled system. Chromosomal targeting was highly toxic via targeting of DNA and resulted in growth inhibition and cellular filamentation. The toxic phenotype was avoided by mutations in the cas operon, the CRISPR repeats, the protospacer target, and protospacer-adjacent motif (PAM) beside the target. Indeed, the natural self-targeting spacer was non-toxic due to a single nucleotide mutation adjacent to the target in the PAM sequence. Furthermore, we show that chromosomal targeting can result in large-scale genomic alterations, including the remodelling or deletion of entire pre-existing pathogenicity islands. These features can be engineered for the targeted deletion of large regions of bacterial chromosomes. In conclusion, in DNA–targeting CRISPR/Cas systems, chromosomal interference is deleterious by causing DNA damage and providing a strong selective pressure for genome alterations, which may have consequences for bacterial evolution and pathogenicity. PMID:23637624

  13. Co-circulation of bluetongue and epizootic haemorrhagic disease viruses in cattle in Reunion Island.

    PubMed

    Sailleau, Corinne; Zanella, Gina; Breard, Emmanuel; Viarouge, Cyril; Desprat, Alexandra; Vitour, Damien; Adam, Micheline; Lasne, Laurent; Martrenchar, Arnaud; Bakkali-Kassimi, Labib; Costes, Laura; Zientara, Stéphan

    2012-03-23

    Bluetongue virus (BTV) and epizootic haemorrhagic disease virus (EHDV) in deer have already been isolated in Reunion Island and have caused more or less severe clinical signs in cattle (EHDV) or in sheep (BTV), as observed in 2003. In January 2009, cattle in Reunion Island showed clinical signs suggesting infection by one or the other of these arboviral diseases. A study was set up to determine the etiology of the disease. Analysis by reverse transcriptase-polymerase chain reaction (RT-PCR) performed on blood samples from 116 cattle from different districts of the island detected the presence of the EHDV genome in 106 samples and, in 5 of them, the simultaneous occurrence of BTV and EHDV. One strain of EHDV (7 isolates) and one of BTV were isolated in embryonated eggs and a BHK-21 cell culture. Group and subgroup primer-pairs were designed on the segment 2 sequences available in GenBank to identify and type the EHDV strains. Phylogenetic analysis of the genomic segment 2 (encoding the VP2 serotype-specific protein) of the isolates confirmed the serotypes of these two orbiviruses as BTV-2 and EHDV-6 and allowed them to be compared with previously isolated strains. Copyright © 2011 Elsevier B.V. All rights reserved.

  14. Genome dynamics and its impact on evolution of Escherichia coli.

    PubMed

    Dobrindt, Ulrich; Chowdary, M Geddam; Krumbholz, G; Hacker, J

    2010-08-01

    The Escherichia coli genome consists of a conserved part, the so-called core genome, which encodes essential cellular functions and of a flexible, strain-specific part. Genes that belong to the flexible genome code for factors involved in bacterial fitness and adaptation to different environments. Adaptation includes increase in fitness and colonization capacity. Pathogenic as well as non-pathogenic bacteria carry mobile and accessory genetic elements such as plasmids, bacteriophages, genomic islands and others, which code for functions required for proper adaptation. Escherichia coli is a very good example to study the interdependency of genome architecture and lifestyle of bacteria. Thus, these species include pathogenic variants as well as commensal bacteria adapted to different host organisms. In Escherichia coli, various genetic elements encode for pathogenicity factors as well as factors, which increase the fitness of non-pathogenic bacteria. The processes of genome dynamics, such as gene transfer, genome reduction, rearrangements as well as point mutations contribute to the adaptation of the bacteria into particular environments. Using Escherichia coli model organisms, such as uropathogenic strain 536 or commensal strain Nissle 1917, we studied mechanisms of genome dynamics and discuss these processes in the light of the evolution of microbes.

  15. Comprehensive analysis of CpG islands in human chromosomes 21 and 22

    NASA Astrophysics Data System (ADS)

    Takai, Daiya; Jones, Peter A.

    2002-03-01

    CpG islands are useful markers for genes in organisms containing 5-methylcytosine in their genomes. In addition, CpG islands located in the promoter regions of genes can play important roles in gene silencing during processes such as X-chromosome inactivation, imprinting, and silencing of intragenomic parasites. The generally accepted definition of what constitutes a CpG island was proposed in 1987 by Gardiner-Garden and Frommer [Gardiner-Garden, M. & Frommer, M. (1987) J. Mol. Biol. 196, 261-282] as being a 200-bp stretch of DNA with a C+G content of 50% and an observed CpG/expected CpG in excess of 0.6. Any definition of a CpG island is somewhat arbitrary, and this one, which was derived before the sequencing of mammalian genomes, will include many sequences that are not necessarily associated with controlling regions of genes but rather are associated with intragenomic parasites. We have therefore used the complete genomic sequences of human chromosomes 21 and 22 to examine the properties of CpG islands in different sequence classes by using a search algorithm that we have developed. Regions of DNA of greater than 500 bp with a G+C equal to or greater than 55% and observed CpG/expected CpG of 0.65 were more likely to be associated with the 5' regions of genes and this definition excluded most Alu-repetitive elements. We also used genome sequences to show strong CpG suppression in the human genome and slight suppression in Drosophila melanogaster and Saccharomyces cerevisiae. This finding is compatible with the recent detection of 5-methylcytosine in Drosophila, and might suggest that S. cerevisiae has, or once had, CpG methylation.

  16. Whole genome sequence analysis of unidentified genetically modified papaya for development of a specific detection method.

    PubMed

    Nakamura, Kosuke; Kondo, Kazunari; Akiyama, Hiroshi; Ishigaki, Takumi; Noguchi, Akio; Katsumata, Hiroshi; Takasaki, Kazuto; Futo, Satoshi; Sakata, Kozue; Fukuda, Nozomi; Mano, Junichi; Kitta, Kazumi; Tanaka, Hidenori; Akashi, Ryo; Nishimaki-Mogami, Tomoko

    2016-08-15

    Identification of transgenic sequences in an unknown genetically modified (GM) papaya (Carica papaya L.) by whole genome sequence analysis was demonstrated. Whole genome sequence data were generated for a GM-positive fresh papaya fruit commodity detected in monitoring using real-time polymerase chain reaction (PCR). The sequences obtained were mapped against an open database for papaya genome sequence. Transgenic construct- and event-specific sequences were identified as a GM papaya developed to resist infection from a Papaya ringspot virus. Based on the transgenic sequences, a specific real-time PCR detection method for GM papaya applicable to various food commodities was developed. Whole genome sequence analysis enabled identifying unknown transgenic construct- and event-specific sequences in GM papaya and development of a reliable method for detecting them in papaya food commodities. Copyright © 2016 Elsevier Ltd. All rights reserved.

  17. Strand-specific transcriptome profiling with directly labeled RNA on genomic tiling microarrays

    PubMed Central

    2011-01-01

    Background With lower manufacturing cost, high spot density, and flexible probe design, genomic tiling microarrays are ideal for comprehensive transcriptome studies. Typically, transcriptome profiling using microarrays involves reverse transcription, which converts RNA to cDNA. The cDNA is then labeled and hybridized to the probes on the arrays, thus the RNA signals are detected indirectly. Reverse transcription is known to generate artifactual cDNA, in particular the synthesis of second-strand cDNA, leading to false discovery of antisense RNA. To address this issue, we have developed an effective method using RNA that is directly labeled, thus by-passing the cDNA generation. This paper describes this method and its application to the mapping of transcriptome profiles. Results RNA extracted from laboratory cultures of Porphyromonas gingivalis was fluorescently labeled with an alkylation reagent and hybridized directly to probes on genomic tiling microarrays specifically designed for this periodontal pathogen. The generated transcriptome profile was strand-specific and produced signals close to background level in most antisense regions of the genome. In contrast, high levels of signal were detected in the antisense regions when the hybridization was done with cDNA. Five antisense areas were tested with independent strand-specific RT-PCR and none to negligible amplification was detected, indicating that the strong antisense cDNA signals were experimental artifacts. Conclusions An efficient method was developed for mapping transcriptome profiles specific to both coding strands of a bacterial genome. This method chemically labels and uses extracted RNA directly in microarray hybridization. The generated transcriptome profile was free of cDNA artifactual signals. In addition, this method requires fewer processing steps and is potentially more sensitive in detecting small amount of RNA compared to conventional end-labeling methods due to the incorporation of more

  18. Genomic Islands in Pathogenic Filamentous Fungus Aspergillus fumigatus

    USDA-ARS?s Scientific Manuscript database

    We present the genome sequences of a new clinical isolate, CEA10, of an important human pathogen, Aspergillus fumigatus, and two closely related, but rarely pathogenic species, Neosartorya fischeri NRRL181 and Aspergillus clavatus NRRL1. Comparative genomic analysis of CEA10 with the recently sequen...

  19. Recombination rate variation in mice from an isolated island.

    PubMed

    Wang, Richard J; Gray, Melissa M; Parmenter, Michelle D; Broman, Karl W; Payseur, Bret A

    2017-01-01

    Recombination rate is a heritable trait that varies among individuals. Despite the major impact of recombination rate on patterns of genetic diversity and the efficacy of selection, natural variation in this phenotype remains poorly characterized. We present a comparison of genetic maps, sampling 1212 meioses, from a unique population of wild house mice (Mus musculus domesticus) that recently colonized remote Gough Island. Crosses to a mainland reference strain (WSB/EiJ) reveal pervasive variation in recombination rate among Gough Island mice, including subchromosomal intervals spanning up to 28% of the genome. In spite of this high level of polymorphism, the genomewide recombination rate does not significantly vary. In general, we find that recombination rate varies more when measured in smaller genomic intervals. Using the current standard genetic map of the laboratory mouse to polarize intervals with divergent recombination rates, we infer that the majority of evolutionary change occurred in one of the two tested lines of Gough Island mice. Our results confirm that natural populations harbour a high level of recombination rate polymorphism and highlight the disparities in recombination rate evolution across genomic scales. © 2016 John Wiley & Sons Ltd.

  20. Hydrologic data for Block Island, Rhode Island

    USGS Publications Warehouse

    Burns, Emily

    1993-01-01

    This report was compiled as part of a study to assess the hydrogeology and the quality and quantity of fresh ground water on Block Island, Rhode Island. Hydrologic data were collected on Block Island during 1988-91. The data are pre- sented in illustrations and tables. Data collec- ted include precipitation, surfae-water, ground- water, lithologic, and well-construction and dis- charge information. Precipitation data include total monthly precipitation values from 11 rain gages and water-quality analyses of 14 precipi- tation samples from one station. Surface-water data include water-level measurements at 12 ponds, water-quality data for five ponds, and field specific-conductance measurements at 56 surface- water sites (streams, ponds, and springs). Ground- water data include water-level measurements at 159 wells, water-quality data at 150 wells, and field specific-conductance data at 52 wells. Lithologic logs for 375 wells and test borings, and construc- tion and location data for 570 wells, springs, and test borings are included. In addition, the data set contains data on water quality of water samples, collected by the Rhode Island Department of Health during 1976-91, from Fresh and Sands Ponds and from wells at the Block Island Water Company well field north of Sands Pond.

  1. Lineage-Specific Biology Revealed by a Finished Genome Assembly of the Mouse

    PubMed Central

    Hillier, LaDeana W.; Zody, Michael C.; Goldstein, Steve; She, Xinwe; Bult, Carol J.; Agarwala, Richa; Cherry, Joshua L.; DiCuccio, Michael; Hlavina, Wratko; Kapustin, Yuri; Meric, Peter; Maglott, Donna; Birtle, Zoë; Marques, Ana C.; Graves, Tina; Zhou, Shiguo; Teague, Brian; Potamousis, Konstantinos; Churas, Christopher; Place, Michael; Herschleb, Jill; Runnheim, Ron; Forrest, Daniel; Amos-Landgraf, James; Schwartz, David C.; Cheng, Ze; Lindblad-Toh, Kerstin; Eichler, Evan E.; Ponting, Chris P.

    2009-01-01

    The mouse (Mus musculus) is the premier animal model for understanding human disease and development. Here we show that a comprehensive understanding of mouse biology is only possible with the availability of a finished, high-quality genome assembly. The finished clone-based assembly of the mouse strain C57BL/6J reported here has over 175,000 fewer gaps and over 139 Mb more of novel sequence, compared with the earlier MGSCv3 draft genome assembly. In a comprehensive analysis of this revised genome sequence, we are now able to define 20,210 protein-coding genes, over a thousand more than predicted in the human genome (19,042 genes). In addition, we identified 439 long, non–protein-coding RNAs with evidence for transcribed orthologs in human. We analyzed the complex and repetitive landscape of 267 Mb of sequence that was missing or misassembled in the previously published assembly, and we provide insights into the reasons for its resistance to sequencing and assembly by whole-genome shotgun approaches. Duplicated regions within newly assembled sequence tend to be of more recent ancestry than duplicates in the published draft, correcting our initial understanding of recent evolution on the mouse lineage. These duplicates appear to be largely composed of sequence regions containing transposable elements and duplicated protein-coding genes; of these, some may be fixed in the mouse population, but at least 40% of segmentally duplicated sequences are copy number variable even among laboratory mouse strains. Mouse lineage-specific regions contain 3,767 genes drawn mainly from rapidly-changing gene families associated with reproductive functions. The finished mouse genome assembly, therefore, greatly improves our understanding of rodent-specific biology and allows the delineation of ancestral biological functions that are shared with human from derived functions that are not. PMID:19468303

  2. Rearrangement of a large novel Pseudomonas aeruginosa gene island in strains isolated from a patient developing ventilator-associated pneumonia.

    PubMed

    Singh, G; Srinivasan, R; Cheng, J; Peng, Z; Fujimura, K; Baek, M S; Panzer, A R; Tringe, S G; Chen, F; Sorek, R; Weng, L; Bristow, J; Wiener-Kronish, J P; Lynch, S V

    2014-07-01

    Bacterial gene islands add to the genetic repertoire of opportunistic pathogens. Here, we perform comparative analyses of three Pseudomonas aeruginosa strains isolated sequentially over a 3-week period from a patient with ventilator-associated pneumonia (VAP) who received clindamycin and piperacillin-tazobactam as part of their treatment regime. While all three strains appeared to be clonal by standard pulsed-field gel electrophoresis, whole-genome sequencing revealed subtle alterations in the chromosomal organization of the last two strains; specifically, an inversion event within a novel 124-kb gene island (PAGI 12) composed of 137 open reading frames [ORFs]. Predicted ORFs in the island included metabolism and virulence genes. Overexpression of a gene island-borne putative β-lactamase gene was observed following piperacillin-tazobactam exposure and only in those strains that had undergone the inversion event, indicating altered gene regulation following genomic remodeling. Examination of a separate cohort of 76 patients with VAP for integration at this tRNA(lys) recombination site demonstrated that patients exhibiting evidence of integration at this site had significantly higher 28-day mortality. These findings provide evidence that P. aeruginosa can integrate, rapidly remodel, and express exogenous genes, which likely contributes to its fitness in a clinical setting. Copyright © 2014, American Society for Microbiology. All Rights Reserved.

  3. Rearrangement of a Large Novel Pseudomonas aeruginosa Gene Island in Strains Isolated from a Patient Developing Ventilator-Associated Pneumonia

    PubMed Central

    Singh, G.; Srinivasan, R.; Cheng, J.; Peng, Z.; Fujimura, K.; Baek, M. S.; Panzer, A. R.; Tringe, S. G.; Chen, F.; Sorek, R.; Weng, L.; Bristow, J.; Wiener-Kronish, J. P.

    2014-01-01

    Bacterial gene islands add to the genetic repertoire of opportunistic pathogens. Here, we perform comparative analyses of three Pseudomonas aeruginosa strains isolated sequentially over a 3-week period from a patient with ventilator-associated pneumonia (VAP) who received clindamycin and piperacillin-tazobactam as part of their treatment regime. While all three strains appeared to be clonal by standard pulsed-field gel electrophoresis, whole-genome sequencing revealed subtle alterations in the chromosomal organization of the last two strains; specifically, an inversion event within a novel 124-kb gene island (PAGI 12) composed of 137 open reading frames [ORFs]. Predicted ORFs in the island included metabolism and virulence genes. Overexpression of a gene island-borne putative β-lactamase gene was observed following piperacillin-tazobactam exposure and only in those strains that had undergone the inversion event, indicating altered gene regulation following genomic remodeling. Examination of a separate cohort of 76 patients with VAP for integration at this tRNAlys recombination site demonstrated that patients exhibiting evidence of integration at this site had significantly higher 28-day mortality. These findings provide evidence that P. aeruginosa can integrate, rapidly remodel, and express exogenous genes, which likely contributes to its fitness in a clinical setting. PMID:24789195

  4. Hierarchical distance-sampling models to estimate population size and habitat-specific abundance of an island endemic

    USGS Publications Warehouse

    Sillett, Scott T.; Chandler, Richard B.; Royle, J. Andrew; Kéry, Marc; Morrison, Scott A.

    2012-01-01

    Population size and habitat-specific abundance estimates are essential for conservation management. A major impediment to obtaining such estimates is that few statistical models are able to simultaneously account for both spatial variation in abundance and heterogeneity in detection probability, and still be amenable to large-scale applications. The hierarchical distance-sampling model of J. A. Royle, D. K. Dawson, and S. Bates provides a practical solution. Here, we extend this model to estimate habitat-specific abundance and rangewide population size of a bird species of management concern, the Island Scrub-Jay (Aphelocoma insularis), which occurs solely on Santa Cruz Island, California, USA. We surveyed 307 randomly selected, 300 m diameter, point locations throughout the 250-km2 island during October 2008 and April 2009. Population size was estimated to be 2267 (95% CI 1613-3007) and 1705 (1212-2369) during the fall and spring respectively, considerably lower than a previously published but statistically problematic estimate of 12 500. This large discrepancy emphasizes the importance of proper survey design and analysis for obtaining reliable information for management decisions. Jays were most abundant in low-elevation chaparral habitat; the detection function depended primarily on the percent cover of chaparral and forest within count circles. Vegetation change on the island has been dramatic in recent decades, due to release from herbivory following the eradication of feral sheep (Ovis aries) from the majority of the island in the mid-1980s. We applied best-fit fall and spring models of habitat-specific jay abundance to a vegetation map from 1985, and estimated the population size of A. insularis was 1400-1500 at that time. The 20-30% increase in the jay population suggests that the species has benefited from the recovery of native vegetation since sheep removal. Nevertheless, this jay's tiny range and small population size make it vulnerable to natural

  5. Genomic evidence of geographically widespread effect of gene flow from polar bears into brown bears

    PubMed Central

    Cahill, James A; Stirling, Ian; Kistler, Logan; Salamzade, Rauf; Ersmark, Erik; Fulton, Tara L; Stiller, Mathias; Green, Richard E; Shapiro, Beth

    2015-01-01

    Polar bears are an arctic, marine adapted species that is closely related to brown bears. Genome analyses have shown that polar bears are distinct and genetically homogeneous in comparison to brown bears. However, these analyses have also revealed a remarkable episode of polar bear gene flow into the population of brown bears that colonized the Admiralty, Baranof and Chichagof islands (ABC islands) of Alaska. Here, we present an analysis of data from a large panel of polar bear and brown bear genomes that includes brown bears from the ABC islands, the Alaskan mainland and Europe. Our results provide clear evidence that gene flow between the two species had a geographically wide impact, with polar bear DNA found within the genomes of brown bears living both on the ABC islands and in the Alaskan mainland. Intriguingly, while brown bear genomes contain up to 8.8% polar bear ancestry, polar bear genomes appear to be devoid of brown bear ancestry, suggesting the presence of a barrier to gene flow in that direction. PMID:25490862

  6. Genomic evidence of geographically widespread effect of gene flow from polar bears into brown bears.

    PubMed

    Cahill, James A; Stirling, Ian; Kistler, Logan; Salamzade, Rauf; Ersmark, Erik; Fulton, Tara L; Stiller, Mathias; Green, Richard E; Shapiro, Beth

    2015-03-01

    Polar bears are an arctic, marine adapted species that is closely related to brown bears. Genome analyses have shown that polar bears are distinct and genetically homogeneous in comparison to brown bears. However, these analyses have also revealed a remarkable episode of polar bear gene flow into the population of brown bears that colonized the Admiralty, Baranof and Chichagof islands (ABC islands) of Alaska. Here, we present an analysis of data from a large panel of polar bear and brown bear genomes that includes brown bears from the ABC islands, the Alaskan mainland and Europe. Our results provide clear evidence that gene flow between the two species had a geographically wide impact, with polar bear DNA found within the genomes of brown bears living both on the ABC islands and in the Alaskan mainland. Intriguingly, while brown bear genomes contain up to 8.8% polar bear ancestry, polar bear genomes appear to be devoid of brown bear ancestry, suggesting the presence of a barrier to gene flow in that direction. © 2014 The Authors. Molecular Ecology Published by John Wiley & Sons Ltd.

  7. Variability among the Most Rapidly Evolving Plastid Genomic Regions is Lineage-Specific: Implications of Pairwise Genome Comparisons in Pyrus (Rosaceae) and Other Angiosperms for Marker Choice

    PubMed Central

    Ter-Voskanyan, Hasmik; Allgaier, Martin; Borsch, Thomas

    2014-01-01

    Plastid genomes exhibit different levels of variability in their sequences, depending on the respective kinds of genomic regions. Genes are usually more conserved while noncoding introns and spacers evolve at a faster pace. While a set of about thirty maximum variable noncoding genomic regions has been suggested to provide universally promising phylogenetic markers throughout angiosperms, applications often require several regions to be sequenced for many individuals. Our project aims to illuminate evolutionary relationships and species-limits in the genus Pyrus (Rosaceae)—a typical case with very low genetic distances between taxa. In this study, we have sequenced the plastid genome of Pyrus spinosa and aligned it to the already available P. pyrifolia sequence. The overall p-distance of the two Pyrus genomes was 0.00145. The intergenic spacers between ndhC–trnV, trnR–atpA, ndhF–rpl32, psbM–trnD, and trnQ–rps16 were the most variable regions, also comprising the highest total numbers of substitutions, indels and inversions (potentially informative characters). Our comparative analysis of further plastid genome pairs with similar low p-distances from Oenothera (representing another rosid), Olea (asterids) and Cymbidium (monocots) showed in each case a different ranking of genomic regions in terms of variability and potentially informative characters. Only two intergenic spacers (ndhF–rpl32 and trnK–rps16) were consistently found among the 30 top-ranked regions. We have mapped the occurrence of substitutions and microstructural mutations in the four genome pairs. High AT content in specific sequence elements seems to foster frequent mutations. We conclude that the variability among the fastest evolving plastid genomic regions is lineage-specific and thus cannot be precisely predicted across angiosperms. The often lineage-specific occurrence of stem-loop elements in the sequences of introns and spacers also governs lineage-specific mutations

  8. Allele-specific control of replication timing and genome organization during development.

    PubMed

    Rivera-Mulia, Juan Carlos; Dimond, Andrew; Vera, Daniel; Trevilla-Garcia, Claudia; Sasaki, Takayo; Zimmerman, Jared; Dupont, Catherine; Gribnau, Joost; Fraser, Peter; Gilbert, David M

    2018-05-07

    DNA replication occurs in a defined temporal order known as the replication-timing (RT) program. RT is regulated during development in discrete chromosomal units, coordinated with transcriptional activity and 3D genome organization. Here, we derived distinct cell types from F1 hybrid musculus X castaneus mouse crosses and exploited the high single nucleotide polymorphism (SNP) density to characterize allelic differences in RT (Repli-seq), genome organization (Hi-C and promoter-capture Hi-C), gene expression (total nuclear RNA-seq) and chromatin accessibility (ATAC-seq). We also present HARP: a new computational tool for sorting SNPs in phased genomes to efficiently measure allele-specific genome-wide data. Analysis of six different hybrid mESC clones with different genomes (C57BL/6, 129/sv and CAST/Ei), parental configurations and gender revealed significant RT asynchrony between alleles across ~12% of the autosomal genome linked to sub-species genomes but not to parental origin, growth conditions or gender. RT asynchrony in mESCs strongly correlated with changes in Hi-C compartments between alleles but not SNP density, gene expression, imprinting or chromatin accessibility. We then tracked mESC RT asynchronous regions during development by analyzing differentiated cell types including extraembryonic endoderm stem (XEN) cells, 4 male and female primary mouse embryonic fibroblasts (MEFs) and neural precursor cells (NPCs) differentiated in vitro from mESCs with opposite parental configurations. We found that RT asynchrony and allelic discordance in Hi-C compartments seen in mESCs was largely lost in all differentiated cell types, coordinated with a more uniform Hi-C compartment arrangement, suggesting that genome organization of homologues converges to similar folding patterns during cell fate commitment. Published by Cold Spring Harbor Laboratory Press.

  9. Comparative genome analysis of Lactobacillus plantarum GB-LP3 provides candidates of survival-related genetic factors.

    PubMed

    Jeon, Soomin; Jung, Jaehoon; Kim, Kwondo; Yoo, DongAhn; Lee, Chanho; Kang, Jungsun; Cho, Kyungjin; Kang, Dae-Kyung; Kwak, Woori; Yoon, Sook Hee; Kim, Heebal; Cho, Seoae

    2017-09-01

    Lactobacillus plantarum is found in various environmental niches such as in the gastrointestinal tract of an animal host or a fermented food. This species isolated from a certain environment is known to possess a variety of properties according to inhabited environment's adaptation. However, a causal relationship of a genetic factor and phenotype affected by a specific environment has not been systematically comprehended. L. plantarum GB-LP3 strain was isolated from Korean traditional fermented vegetable and the whole genome of GB-LP3 was sequenced. Comparative genome analysis of GB-LP3, with other 14 L. plantarum strains, was conducted. In addition, genomic island regions were investigated. The assembled whole GB-LP3 genome contained a single circular chromosome of 3,206,111bp with the GC content of 44.7%. In the phylogenetic tree analysis, GB-LP3 was in the closest distance from ZJ316. The genomes of GB-LP3 and ZJ316 have the high level of synteny. Functional genes that are related to prophage, bacteriocin, and quorum sensing were found through comparative genomic analysis with ZJ316 and investigation of genomic islands. dN/dS analysis identified that the gene coding for phosphonate ABC transporter ATP-binding protein is evolutionarily accelerated in GB-LP3. Our study found that potential candidate genes that are affected by environmental adaptation in Korea traditional fermented vegetable. Copyright © 2017. Published by Elsevier B.V.

  10. GRIDSS: sensitive and specific genomic rearrangement detection using positional de Bruijn graph assembly

    PubMed Central

    Do, Hongdo; Molania, Ramyar

    2017-01-01

    The identification of genomic rearrangements with high sensitivity and specificity using massively parallel sequencing remains a major challenge, particularly in precision medicine and cancer research. Here, we describe a new method for detecting rearrangements, GRIDSS (Genome Rearrangement IDentification Software Suite). GRIDSS is a multithreaded structural variant (SV) caller that performs efficient genome-wide break-end assembly prior to variant calling using a novel positional de Bruijn graph-based assembler. By combining assembly, split read, and read pair evidence using a probabilistic scoring, GRIDSS achieves high sensitivity and specificity on simulated, cell line, and patient tumor data, recently winning SV subchallenge #5 of the ICGC-TCGA DREAM8.5 Somatic Mutation Calling Challenge. On human cell line data, GRIDSS halves the false discovery rate compared to other recent methods while matching or exceeding their sensitivity. GRIDSS identifies nontemplate sequence insertions, microhomologies, and large imperfect homologies, estimates a quality score for each breakpoint, stratifies calls into high or low confidence, and supports multisample analysis. PMID:29097403

  11. Site-Specific Genome Engineering in Human Pluripotent Stem Cells.

    PubMed

    Merkert, Sylvia; Martin, Ulrich

    2016-06-24

    The possibility to generate patient-specific induced pluripotent stem cells (iPSCs) offers an unprecedented potential of applications in clinical therapy and medical research. Human iPSCs and their differentiated derivatives are tools for diseases modelling, drug discovery, safety pharmacology, and toxicology. Moreover, they allow for the engineering of bioartificial tissue and are promising candidates for cellular therapies. For many of these applications, the ability to genetically modify pluripotent stem cells (PSCs) is indispensable, but efficient site-specific and safe technologies for genetic engineering of PSCs were developed only recently. By now, customized engineered nucleases provide excellent tools for targeted genome editing, opening new perspectives for biomedical research and cellular therapies.

  12. High-Affinity Quasi-Specific Sites in the Genome: How the DNA-Binding Proteins Cope with Them

    PubMed Central

    Chakrabarti, J.; Chandra, Navin; Raha, Paromita; Roy, Siddhartha

    2011-01-01

    Many prokaryotic transcription factors home in on one or a few target sites in the presence of a huge number of nonspecific sites. Our analysis of λ-repressor in the Escherichia coli genome based on single basepair substitution experiments shows the presence of hundreds of sites having binding energy within 3 Kcal/mole of the OR1 binding energy, and thousands of sites with binding energy above the nonspecific binding energy. The effect of such sites on DNA-based processes has not been fully explored. The presence of such sites dramatically lowers the occupation probability of the specific site far more than if the genome were composed of nonspecific sites only. Our Brownian dynamics studies show that the presence of quasi-specific sites results in very significant kinetic effects as well. In contrast to λ-repressor, the E. coli genome has orders of magnitude lower quasi-specific sites for GalR, an integral transcription factor, thus causing little competition for the specific site. We propose that GalR and perhaps repressors of the same family have evolved binding modes that lead to much smaller numbers of quasi-specific sites to remove the untoward effects of genomic DNA. PMID:21889449

  13. Developmentally Programmed 3′ CpG Island Methylation Confers Tissue- and Cell-Type-Specific Transcriptional Activation

    PubMed Central

    Yu, Da-Hai; Ware, Carol; Waterland, Robert A.; Zhang, Jiexin; Chen, Miao-Hsueh; Gadkari, Manasi; Kunde-Ramamoorthy, Govindarajan; Nosavanh, Lagina M.

    2013-01-01

    During development, a small but significant number of CpG islands (CGIs) become methylated. The timing of developmentally programmed CGI methylation and associated mechanisms of transcriptional regulation during cellular differentiation, however, remain poorly characterized. Here, we used genome-wide DNA methylation microarrays to identify epigenetic changes during human embryonic stem cell (hESC) differentiation. We discovered a group of CGIs associated with developmental genes that gain methylation after hESCs differentiate. Conversely, erasure of methylation was observed at the identified CGIs during subsequent reprogramming to induced pluripotent stem cells (iPSCs), further supporting a functional role for the CGI methylation. Both global gene expression profiling and quantitative reverse transcription-PCR (RT-PCR) validation indicated opposing effects of CGI methylation in transcriptional regulation during differentiation, with promoter CGI methylation repressing and 3′ CGI methylation activating transcription. By studying diverse human tissues and mouse models, we further confirmed that developmentally programmed 3′ CGI methylation confers tissue- and cell-type-specific gene activation in vivo. Importantly, luciferase reporter assays provided evidence that 3′ CGI methylation regulates transcriptional activation via a CTCF-dependent enhancer-blocking mechanism. These findings expand the classic view of mammalian CGI methylation as a mechanism for transcriptional silencing and indicate a functional role for 3′ CGI methylation in developmental gene regulation. PMID:23459939

  14. Custom-Designed Molecular Scissors for Site-Specific Manipulation of the Plant and Mammalian Genomes

    NASA Astrophysics Data System (ADS)

    Kandavelou, Karthikeyan; Chandrasegaran, Srinivasan

    Zinc finger nucleases (ZFNs) are custom-designed molecular scissors, engineered to cut at specific DNA sequences. ZFNs combine the zinc finger proteins (ZFPs) with the nonspecific cleavage domain of the FokI restriction enzyme. The DNA-binding specificity of ZFNs can be easily altered experimentally. This easy manipulation of the ZFN recognition specificity enables one to deliver a targeted double-strand break (DSB) to a genome. The targeted DSB stimulates local gene targeting by several orders of magnitude at that specific cut site via homologous recombination (HR). Thus, ZFNs have become an important experimental tool to make site-specific and permanent alterations to genomes of not only plants and mammals but also of many other organisms. Engineering of custom ZFNs involves many steps. The first step is to identify a ZFN site at or near the chosen chromosomal target within the genome to which ZFNs will bind and cut. The second step is to design and/or select various ZFP combinations that will bind to the chosen target site with high specificity and affinity. The DNA coding sequence for the designed ZFPs are then assembled by polymerase chain reaction (PCR) using oligonucleotides. The third step is to fuse the ZFP constructs to the FokI cleavage domain. The ZFNs are then expressed as proteins by using the rabbit reticulocyte in vitro transcription/translation system and the protein products assayed for their DNA cleavage specificity.

  15. Genome-wide selective sweeps and gene-specific sweeps in natural bacterial populations

    DOE PAGES

    Bendall, Matthew L.; Stevens, Sarah L.R.; Chan, Leong-Keat; ...

    2016-01-08

    Multiple models describe the formation and evolution of distinct microbial phylogenetic groups. These evolutionary models make different predictions regarding how adaptive alleles spread through populations and how genetic diversity is maintained. Processes predicted by competing evolutionary models, for example, genome-wide selective sweeps vs gene-specific sweeps, could be captured in natural populations using time-series metagenomics if the approach were applied over a sufficiently long time frame. Direct observations of either process would help resolve how distinct microbial groups evolve. Using a 9-year metagenomic study of a freshwater lake (2005–2013), we explore changes in single-nucleotide polymorphism (SNP) frequencies and patterns of genemore » gain and loss in 30 bacterial populations. SNP analyses revealed substantial genetic heterogeneity within these populations, although the degree of heterogeneity varied by >1000-fold among populations. SNP allele frequencies also changed dramatically over time within some populations. Interestingly, nearly all SNP variants were slowly purged over several years from one population of green sulfur bacteria, while at the same time multiple genes either swept through or were lost from this population. Furthermore, these patterns were consistent with a genome-wide selective sweep in progress, a process predicted by the ‘ecotype model’ of speciation but not previously observed in nature. In contrast, other populations contained large, SNP-free genomic regions that appear to have swept independently through the populations prior to the study without purging diversity elsewhere in the genome. Finally, evidence for both genome-wide and gene-specific sweeps suggests that different models of bacterial speciation may apply to different populations coexisting in the same environment.« less

  16. Origins of cattle on Chirikof Island, Alaska, elucidated from genome-wide SNP genotypes

    PubMed Central

    Decker, J E; Taylor, J F; Kantanen, J; Millbrooke, A; Schnabel, R D; Alexander, L J; MacNeil, M D

    2016-01-01

    Feral livestock may harbor genetic variation of commercial, scientific, historical or esthetic value. The origins and uniqueness of feral cattle on Chirikof Island, Alaska, are uncertain. The island is now part of the Alaska Maritime Wildlife Refuge and Federal wildlife managers want grazing to cease, presumably leading to demise of the cattle. Here we characterize the cattle of Chirikof Island relative to extant breeds and discern their origins. Our analyses support the inference that Yakut cattle from Russia arrived first on Chirikof Island, then ~120 years ago the first European taurine cattle were introduced to the island, and finally a large wave of Hereford cattle were introduced on average 40 years ago. In addition, this mixture of European and East-Asian cattle is unique compared with other North American breeds and we find evidence that natural selection in the relatively harsh environment of Chirikof Island has further impacted their genetic architecture. These results provide an objective basis for decisions regarding conservation of the Chirikof Island cattle. PMID:26860198

  17. Along for the ride or missing it altogether: exploring the host specificity and diversity of haemogregarines in the Canary Islands.

    PubMed

    Tomé, Beatriz; Pereira, Ana; Jorge, Fátima; Carretero, Miguel A; Harris, D James; Perera, Ana

    2018-03-19

    Host-parasite relationships are expected to be strongly shaped by host specificity, a crucial factor in parasite adaptability and diversification. Because whole host communities have to be considered to assess host specificity, oceanic islands are ideal study systems given their simplified biotic assemblages. Previous studies on insular parasites suggest host range broadening during colonization. Here, we investigate the association between one parasite group (haemogregarines) and multiple sympatric hosts (of three lizard genera: Gallotia, Chalcides and Tarentola) in the Canary Islands. Given haemogregarine characteristics and insular conditions, we hypothesized low host specificity and/or occurrence of host-switching events. A total of 825 samples were collected from the three host taxa inhabiting the seven main islands of the Canarian Archipelago, including locations where the different lizards occurred in sympatry. Blood slides were screened to assess prevalence and parasitaemia, while parasite genetic diversity and phylogenetic relationships were inferred from 18S rRNA gene sequences. Infection levels and diversity of haplotypes varied geographically and across host groups. Infections were found in all species of Gallotia across the seven islands, in Tarentola from Tenerife, La Gomera and La Palma, and in Chalcides from Tenerife, La Gomera and El Hierro. Gallotia lizards presented the highest parasite prevalence, parasitaemia and diversity (seven haplotypes), while the other two host groups (Chalcides and Tarentola) harbored one haplotype each, with low prevalence and parasitaemia levels, and very restricted geographical ranges. Host-sharing of the same haemogregarine haplotype was only detected twice, but these rare instances likely represent occasional cross-infections. Our results suggest that: (i) Canarian haemogregarine haplotypes are highly host-specific, which might have restricted parasite host expansion; (ii) haemogregarines most probably reached the

  18. Bivalve-specific gene expansion in the pearl oyster genome: implications of adaptation to a sessile lifestyle.

    PubMed

    Takeuchi, Takeshi; Koyanagi, Ryo; Gyoja, Fuki; Kanda, Miyuki; Hisata, Kanako; Fujie, Manabu; Goto, Hiroki; Yamasaki, Shinichi; Nagai, Kiyohito; Morino, Yoshiaki; Miyamoto, Hiroshi; Endo, Kazuyoshi; Endo, Hirotoshi; Nagasawa, Hiromichi; Kinoshita, Shigeharu; Asakawa, Shuichi; Watabe, Shugo; Satoh, Noriyuki; Kawashima, Takeshi

    2016-01-01

    Bivalve molluscs have flourished in marine environments, and many species constitute important aquatic resources. Recently, whole genome sequences from two bivalves, the pearl oyster, Pinctada fucata, and the Pacific oyster, Crassostrea gigas, have been decoded, making it possible to compare genomic sequences among molluscs, and to explore general and lineage-specific genetic features and trends in bivalves. In order to improve the quality of sequence data for these purposes, we have updated the entire P. fucata genome assembly. We present a new genome assembly of the pearl oyster, Pinctada fucata (version 2.0). To update the assembly, we conducted additional sequencing, obtaining accumulated sequence data amounting to 193× the P. fucata genome. Sequence redundancy in contigs that was caused by heterozygosity was removed in silico, which significantly improved subsequent scaffolding. Gene model version 2.0 was generated with the aid of manual gene annotations supplied by the P. fucata research community. Comparison of mollusc and other bilaterian genomes shows that gene arrangements of Hox, ParaHox, and Wnt clusters in the P. fucata genome are similar to those of other molluscs. Like the Pacific oyster, P. fucata possesses many genes involved in environmental responses and in immune defense. Phylogenetic analyses of heat shock protein70 and C1q domain-containing protein families indicate that extensive expansion of genes occurred independently in each lineage. Several gene duplication events prior to the split between the pearl oyster and the Pacific oyster are also evident. In addition, a number of tandem duplications of genes that encode shell matrix proteins are also well characterized in the P. fucata genome. Both the Pinctada and Crassostrea lineages have expanded specific gene families in a lineage-specific manner. Frequent duplication of genes responsible for shell formation in the P. fucata genome explains the diversity of mollusc shell structures. These

  19. Markov models of genome segmentation

    NASA Astrophysics Data System (ADS)

    Thakur, Vivek; Azad, Rajeev K.; Ramaswamy, Ram

    2007-01-01

    We introduce Markov models for segmentation of symbolic sequences, extending a segmentation procedure based on the Jensen-Shannon divergence that has been introduced earlier. Higher-order Markov models are more sensitive to the details of local patterns and in application to genome analysis, this makes it possible to segment a sequence at positions that are biologically meaningful. We show the advantage of higher-order Markov-model-based segmentation procedures in detecting compositional inhomogeneity in chimeric DNA sequences constructed from genomes of diverse species, and in application to the E. coli K12 genome, boundaries of genomic islands, cryptic prophages, and horizontally acquired regions are accurately identified.

  20. Identifying tagging SNPs for African specific genetic variation from the African Diaspora Genome

    PubMed Central

    Johnston, Henry Richard; Hu, Yi-Juan; Gao, Jingjing; O’Connor, Timothy D.; Abecasis, Gonçalo R.; Wojcik, Genevieve L; Gignoux, Christopher R.; Gourraud, Pierre-Antoine; Lizee, Antoine; Hansen, Mark; Genuario, Rob; Bullis, Dave; Lawley, Cindy; Kenny, Eimear E.; Bustamante, Carlos; Beaty, Terri H.; Mathias, Rasika A.; Barnes, Kathleen C.; Qin, Zhaohui S.; Preethi Boorgula, Meher; Campbell, Monica; Chavan, Sameer; Ford, Jean G.; Foster, Cassandra; Gao, Li; Hansel, Nadia N.; Horowitz, Edward; Huang, Lili; Ortiz, Romina; Potee, Joseph; Rafaels, Nicholas; Ruczinski, Ingo; Scott, Alan F.; Taub, Margaret A.; Vergara, Candelaria; Levin, Albert M.; Padhukasahasram, Badri; Williams, L. Keoki; Dunston, Georgia M.; Faruque, Mezbah U.; Gietzen, Kimberly; Deshpande, Aniket; Grus, Wendy E.; Locke, Devin P.; Foreman, Marilyn G.; Avila, Pedro C.; Grammer, Leslie; Kim, Kwang-Youn A.; Kumar, Rajesh; Schleimer, Robert; De La Vega, Francisco M.; Shringarpure, Suyash S.; Musharoff, Shaila; Burchard, Esteban G.; Eng, Celeste; Hernandez, Ryan D.; Pino-Yanes, Maria; Torgerson, Dara G.; Szpiech, Zachary A.; Torres, Raul; Nicolae, Dan L.; Ober, Carole; Olopade, Christopher O; Olopade, Olufunmilayo; Oluwole, Oluwafemi; Arinola, Ganiyu; Song, Wei; Correa, Adolfo; Musani, Solomon; Wilson, James G.; Lange, Leslie A.; Akey, Joshua; Bamshad, Michael; Chong, Jessica; Fu, Wenqing; Nickerson, Deborah; Reiner, Alexander; Hartert, Tina; Ware, Lorraine B.; Bleecker, Eugene; Meyers, Deborah; Ortega, Victor E.; Maul, Pissamai; Maul, Trevor; Watson, Harold; Ilma Araujo, Maria; Riccio Oliveira, Ricardo; Caraballo, Luis; Marrugo, Javier; Martinez, Beatriz; Meza, Catherine; Ayestas, Gerardo; Francisco Herrera-Paz, Edwin; Landaverde-Torres, Pamela; Erazo, Said Omar Leiva; Martinez, Rosella; Mayorga, Alvaro; Mayorga, Luis F.; Mejia-Mejia, Delmy-Aracely; Ramos, Hector; Saenz, Allan; Varela, Gloria; Marina Vasquez, Olga; Ferguson, Trevor; Knight-Madden, Jennifer; Samms-Vaughan, Maureen; Wilks, Rainford J.; Adegnika, Akim; Ateba-Ngoa, Ulysse; Yazdanbakhsh, Maria

    2017-01-01

    A primary goal of The Consortium on Asthma among African-ancestry Populations in the Americas (CAAPA) is to develop an ‘African Diaspora Power Chip’ (ADPC), a genotyping array consisting of tagging SNPs, useful in comprehensively identifying African specific genetic variation. This array is designed based on the novel variation identified in 642 CAAPA samples of African ancestry with high coverage whole genome sequence data (~30× depth). This novel variation extends the pattern of variation catalogued in the 1000 Genomes and Exome Sequencing Projects to a spectrum of populations representing the wide range of West African genomic diversity. These individuals from CAAPA also comprise a large swath of the African Diaspora population and incorporate historical genetic diversity covering nearly the entire Atlantic coast of the Americas. Here we show the results of designing and producing such a microchip array. This novel array covers African specific variation far better than other commercially available arrays, and will enable better GWAS analyses for researchers with individuals of African descent in their study populations. A recent study cataloging variation in continental African populations suggests this type of African-specific genotyping array is both necessary and valuable for facilitating large-scale GWAS in populations of African ancestry. PMID:28429804

  1. Comparative Genomic Analysis Reveals Habitat-Specific Genes and Regulatory Hubs within the Genus Novosphingobium

    PubMed Central

    Kumar, Roshan; Verma, Helianthous; Haider, Shazia; Bajaj, Abhay; Sood, Utkarsh; Ponnusamy, Kalaiarasan; Nagar, Shekhar; Shakarad, Mallikarjun N.; Negi, Ram Krishan; Singh, Yogendra; Khurana, J. P.; Gilbert, Jack A.

    2017-01-01

    ABSTRACT Species belonging to the genus Novosphingobium are found in many different habitats and have been identified as metabolically versatile. Through comparative genomic analysis, we identified habitat-specific genes and regulatory hubs that could determine habitat selection for Novosphingobium spp. Genomes from 27 Novosphingobium strains isolated from diverse habitats such as rhizosphere soil, plant surfaces, heavily contaminated soils, and marine and freshwater environments were analyzed. Genome size and coding potential were widely variable, differing significantly between habitats. Phylogenetic relationships between strains were less likely to describe functional genotype similarity than the habitat from which they were isolated. In this study, strains (19 out of 27) with a recorded habitat of isolation, and at least 3 representative strains per habitat, comprised four ecological groups—rhizosphere, contaminated soil, marine, and freshwater. Sulfur acquisition and metabolism were the only core genomic traits to differ significantly in proportion between these ecological groups; for example, alkane sulfonate (ssuABCD) assimilation was found exclusively in all of the rhizospheric isolates. When we examined osmolytic regulation in Novosphingobium spp. through ectoine biosynthesis, which was assumed to be marine habitat specific, we found that it was also present in isolates from contaminated soil, suggesting its relevance beyond the marine system. Novosphingobium strains were also found to harbor a wide variety of mono- and dioxygenases, responsible for the metabolism of several aromatic compounds, suggesting their potential to act as degraders of a variety of xenobiotic compounds. Protein-protein interaction analysis revealed β-barrel outer membrane proteins as habitat-specific hubs in each of the four habitats—freshwater (Saro_1868), marine water (PP1Y_AT17644), rhizosphere (PMI02_00367), and soil (V474_17210). These outer membrane proteins could play a

  2. The Genome of the “Great Speciator” Provides Insights into Bird Diversification

    PubMed Central

    Cornetti, Luca; Valente, Luis M.; Dunning, Luke T.; Quan, Xueping; Black, Richard A.; Hébert, Olivier; Savolainen, Vincent

    2015-01-01

    Among birds, white-eyes (genus Zosterops) have diversified so extensively that Jared Diamond and Ernst Mayr referred to them as the “great speciator.” The Zosterops lineage exhibits some of the fastest rates of species diversification among vertebrates, and its members are the most prolific passerine island colonizers. We present a high-quality genome assembly for the silvereye (Zosterops lateralis), a white-eye species consisting of several subspecies distributed across multiple islands. We investigate the genetic basis of rapid diversification in white-eyes by conducting genomic analyses at varying taxonomic levels. First, we compare the silvereye genome with those of birds from different families and searched for genomic features that may be unique to Zosterops. Second, we compare the genomes of different species of white-eyes from Lifou island (South Pacific), using whole genome resequencing and restriction site associated DNA. Third, we contrast the genomes of two subspecies of silvereye that differ in plumage color. In accordance with theory, we show that white-eyes have high rates of substitutions, gene duplication, and positive selection relative to other birds. Below genus level, we find that genomic differentiation accumulates rapidly and reveals contrasting demographic histories between sympatric species on Lifou, indicative of past interspecific interactions. Finally, we highlight genes possibly involved in color polymorphism between the subspecies of silvereye. By providing the first whole-genome sequence resources for white-eyes and by conducting analyses at different taxonomic levels, we provide genomic evidence underpinning this extraordinary bird radiation. PMID:26338191

  3. USE OF COMPETITIVE GENOMIC HYBRIDIZATION TO ENRICH FOR GENOME-SPECIFIC DIFFERENCES BETWEEN TWO CLOSELY RELATED HUMAN FECAL INDICATOR BACTERIA

    EPA Science Inventory

    Enterococci are frequently used as indicators of fecal pollution in surface waters. To accelerate the identification of Enterococcus faecalis-specific DNA sequences, we employed a comparative genomic strategy utilizing a positive selection process to compare E. faec...

  4. Identification and characterization of insect-specific proteins by genome data analysis

    PubMed Central

    Zhang, Guojie; Wang, Hongsheng; Shi, Junjie; Wang, Xiaoling; Zheng, Hongkun; Wong, Gane Ka-Shu; Clark, Terry; Wang, Wen; Wang, Jun; Kang, Le

    2007-01-01

    Background Insects constitute the vast majority of known species with their importance including biodiversity, agricultural, and human health concerns. It is likely that the successful adaptation of the Insecta clade depends on specific components in its proteome that give rise to specialized features. However, proteome determination is an intensive undertaking. Here we present results from a computational method that uses genome analysis to characterize insect and eukaryote proteomes as an approximation complementary to experimental approaches. Results Homologs in common to Drosophila melanogaster, Anopheles gambiae, Bombyx mori, Tribolium castaneum, and Apis mellifera were compared to the complete genomes of three non-insect eukaryotes (opisthokonts) Homo sapiens, Caenorhabditis elegans and Saccharomyces cerevisiae. This operation yielded 154 groups of orthologous proteins in Drosophila to be insect-specific homologs; 466 groups were determined to be common to eukaryotes (represented by three opisthokonts). ESTs from the hemimetabolous insect Locust migratoria were also considered in order to approximate their corresponding genes in the insect-specific homologs. Stress and stimulus response proteins were found to constitute a higher fraction in the insect-specific homologs than in the homologs common to eukaryotes. Conclusion The significant representation of stress response and stimulus response proteins in proteins determined to be insect-specific, along with specific cuticle and pheromone/odorant binding proteins, suggest that communication and adaptation to environments may distinguish insect evolution relative to other eukaryotes. The tendency for low Ka/Ks ratios in the insect-specific protein set suggests purifying selection pressure. The generally larger number of paralogs in the insect-specific proteins may indicate adaptation to environment changes. Instances in our insect-specific protein set have been arrived at through experiments reported in the

  5. Strategic Environmental Assessment practices in European small islands: Insights from Azores and Orkney islands

    DOE Office of Scientific and Technical Information (OSTI.GOV)

    Polido, Alexandra, E-mail: a.polido@campus.fct.unl.pt; João, Elsa, E-mail: elsa.joao@strath.ac.uk; Ramos, Tomás B., E-mail: tabr@fct.unl.pt

    The literature concerning Strategic Environmental Assessment (SEA) often refers to the importance of context-specific approaches. However, there is a lack of systematised and consistent studies that enhance tailor-made SEA practices and procedures. Small islands are bounded units of study which may help explore SEA theory and practice in special territories. Small islands present particular features and unique values, such as, small size and population, geographic isolation, limited resources and vulnerable ecosystems. Hence, the main goal of this research was to profile SEA practices and procedures in European small islands and provide a background for future research aiming to improve context-specificmore » SEA applications. To achieve this goal, an exploratory case study was developed using Azores (Portugal) and Orkney (Scotland) archipelagos. An analysis of the corresponding mainland was also carried out to contextualise both case studies. The data collection was achieved through a qualitative content analysis of 43 Environmental Reports. The research found that there is not an SEA context-specific approach used within these European small islands, including guidelines, assessment topics, assessment techniques, follow-up and stakeholders engagement. The debate concerning specific approaches to small islands must be re-focused on the enhancement of SEA capacity-building amongst different stakeholders (including decision-makers), on the development and implementation of collaborative approaches, and on the exchange of knowledge and experiences between small islands networks. - Highlights: • Reviewed the differences between the Portuguese and Scottish SEA system • Showed a low integration of SEA specific features in reports of European small islands • Provides background for future SEA research for small islands approaches.« less

  6. Specific detection of Mycobacterium sp. genomic DNA using dual labeled gold nanoparticle based electrochemical biosensor.

    PubMed

    Thiruppathiraja, Chinnasamy; Kamatchiammal, Senthilkumar; Adaikkappan, Periyakaruppan; Santhosh, Devakirubakaran Jayakar; Alagar, Muthukaruppan

    2011-10-01

    The present study was aimed at the development and evaluation of a DNA electrochemical biosensor for Mycobacterium sp. genomic DNA detection in a clinical specimen using a signal amplifier as dual-labeled AuNPs. The DNA electrochemical biosensors were fabricated using a sandwich detection strategy involving two kinds of DNA probes specific to Mycobacterium sp. genomic DNA. The probes of enzyme ALP and the detector probe both conjugated on the AuNPs and subsequently hybridized with target DNA immobilized in a SAM/ITO electrode followed by characterization with CV, EIS, and DPV analysis using the electroactive species para-nitrophenol generated by ALP through hydrolysis of para-nitrophenol phosphate. The effect of enhanced sensitivity was obtained due to the AuNPs carrying numerous ALPs per hybridization and a detection limit of 1.25 ng/ml genomic DNA was determined under optimized conditions. The dual-labeled AuNP-facilitated electrochemical sensor was also evaluated by clinical sputum samples, showing a higher sensitivity and specificity and the outcome was in agreement with the PCR analysis. In conclusion, the developed electrochemical sensor demonstrated unique sensitivity and specificity for both genomic DNA and sputum samples and can be employed as a regular diagnostics tool for Mycobacterium sp. monitoring in clinical samples. Copyright © 2011 Elsevier Inc. All rights reserved.

  7. A new genome of Acidithiobacillus thiooxidans provides insights into adaptation to a bioleaching environment.

    PubMed

    Travisany, Dante; Cortés, María Paz; Latorre, Mauricio; Di Genova, Alex; Budinich, Marko; Bobadilla-Fazzini, Roberto A; Parada, Pilar; González, Mauricio; Maass, Alejandro

    2014-11-01

    Acidithiobacillus thiooxidans is a sulfur oxidizing acidophilic bacterium found in many sulfur-rich environments. It is particularly interesting due to its role in bioleaching of sulphide minerals. In this work, we report the genome sequence of At. thiooxidans Licanantay, the first strain from a copper mine to be sequenced and currently used in bioleaching industrial processes. Through comparative genomic analysis with two other At. thiooxidans non-metal mining strains (ATCC 19377 and A01) we determined that these strains share a large core genome of 2109 coding sequences and a high average nucleotide identity over 98%. Nevertheless, the presence of 841 strain-specific genes (absent in other At. thiooxidans strains) suggests a particular adaptation of Licanantay to its specific biomining environment. Among this group, we highlight genes encoding for proteins involved in heavy metal tolerance, mineral cell attachment and cysteine biosynthesis. Several of these genes were located near genetic motility genes (e.g. transposases and integrases) in genomic regions of over 10 kbp absent in the other strains, suggesting the presence of genomic islands in the Licanantay genome probably produced by horizontal gene transfer in mining environments. Copyright © 2014 Institut Pasteur. Published by Elsevier Masson SAS. All rights reserved.

  8. Genomic Pangea: coordinate gene regulation and cell-specific chromosomal topologies.

    PubMed

    Laster, Kyle; Kosak, Steven T

    2010-06-01

    The eukaryotic nucleus is functionally organized. Gene loci, for example, often reveal altered localization patterns according to their developmental regulation. Whole chromosomes also demonstrate non-random nuclear positions, correlated with inherent characteristics such as gene density or size. Given that hundreds to thousands of genes are coordinately regulated in any given cell type, interest has grown in whether chromosomes may be specifically localized according to gene regulation. A synthesis of the evidence for preferential chromosomal organization suggests that, beyond basic characteristics, chromosomes can assume positions functionally related to gene expression. Moreover, analysis of total chromosome organization during cellular differentiation indicates that unique chromosome topologies, albeit probabilistic, in effect define a cell lineage. Future work with new techniques, including the advanced forms of the chromosome conformation capture (3C), and the development of next-generation whole-genome imaging approaches, will help to refine our view of chromosomal organization. We suggest that genomic organization during cellular differentiation should be viewed as a dynamic process, with gene expression patterns leading to chromosome associations that feed back on themselves, leading to the self-organization of the genome according to coordinate gene regulation. Copyright 2010 Elsevier Ltd. All rights reserved.

  9. The Arsenic Resistance-Associated Listeria Genomic Island LGI2 Exhibits Sequence and Integration Site Diversity and a Propensity for Three Listeria monocytogenes Clones with Enhanced Virulence.

    PubMed

    Lee, Sangmi; Ward, Todd J; Jima, Dereje D; Parsons, Cameron; Kathariou, Sophia

    2017-11-01

    In the foodborne pathogen Listeria monocytogenes , arsenic resistance is encountered primarily in serotype 4b clones considered to have enhanced virulence and is associated with an arsenic resistance gene cluster within a 35-kb chromosomal region, Listeria genomic island 2 (LGI2). LGI2 was first identified in strain Scott A and includes genes putatively involved in arsenic and cadmium resistance, DNA integration, conjugation, and pathogenicity. However, the genomic localization and sequence content of LGI2 remain poorly characterized. Here we investigated 85 arsenic-resistant L. monocytogenes strains, mostly of serotype 4b. All but one of the 70 serotype 4b strains belonged to clonal complex 1 (CC1), CC2, and CC4, three major clones associated with enhanced virulence. PCR analysis suggested that 53 strains (62.4%) harbored an island highly similar to LGI2 of Scott A, frequently (42/53) in the same location as Scott A ( LMOf2365_2257 homolog). Random-primed PCR and whole-genome sequencing revealed seven novel insertion sites, mostly internal to chromosomal coding sequences, among strains harboring LGI2 outside the LMOf2365_2257 homolog. Interestingly, many CC1 strains harbored a noticeably diversified LGI2 (LGI2-1) in a unique location ( LMOf2365_0902 homolog) and with a novel additional gene. With few exceptions, the tested LGI2 genes were not detected in arsenic-resistant strains of serogroup 1/2, which instead often harbored a Tn 554 -associated arsenic resistance determinant not encountered in serotype 4b. These findings indicate that in L. monocytogenes , LGI2 has a propensity for certain serotype 4b clones, exhibits content diversity, and is highly promiscuous, suggesting an ability to mobilize various accessory genes into diverse chromosomal loci. IMPORTANCE Listeria monocytogenes is widely distributed in the environment and causes listeriosis, a foodborne disease with high mortality and morbidity. Arsenic and other heavy metals can powerfully shape the

  10. The distribution of intra-genomically variable dinoflagellate symbionts at Lord Howe Island, Australia

    NASA Astrophysics Data System (ADS)

    Wilkinson, Shaun P.; Pontasch, Stefanie; Fisher, Paul L.; Davy, Simon K.

    2016-06-01

    The symbiotic dinoflagellates of corals and other marine invertebrates ( Symbiodinium) are essential to the development of shallow-water coral reefs. This genus contains considerable genetic diversity and a corresponding range of physiological and ecological traits. Most genetic variation arises through the accumulation of somatic mutations that arise during asexual reproduction. Yet growing evidence suggests that occasional sexual reproductive events also occur within, and perhaps between, Symbiodinium lineages, further contributing to the pool of genetic variation available for evolutionary adaptation. Intra-genomic variation can therefore arise from both sexual and asexual reproductive processes, making it difficult to discern its underlying causes and consequences. We used quantitative PCR targeting the ITS2 locus to estimate proportions of genetically homogeneous symbionts and intra-genomically variable Symbiodinium (IGV Symbiodinium) in the reef-building coral Pocillopora damicornis at Lord Howe Island, Australia. We then sampled colonies through time and at a variety of spatial scales to find out whether the distribution of these symbionts followed patterns consistent with niche partitioning. Estimated ratios of homogeneous to IGV Symbiodinium varied between colonies within sites (metres to tens of metres) and between sites separated by hundreds to thousands of metres, but remained stable within colonies through time. Symbiont ratios followed a temperature gradient, with the local thermal maximum emerging as a negative predictor for the estimated proportional abundance of IGV Symbiodinium. While this pattern may result from fine-scale spatial population structure, it is consistent with an increased susceptibility to thermal stress, suggesting that the evolutionary processes that generate IGV (such as inter-lineage recombination and the accumulation of somatic mutations at the ITS2 locus) may have important implications for the fitness of the symbiont and

  11. Inclusion of Population-specific Reference Panel from India to the 1000 Genomes Phase 3 Panel Improves Imputation Accuracy.

    PubMed

    Ahmad, Meraj; Sinha, Anubhav; Ghosh, Sreya; Kumar, Vikrant; Davila, Sonia; Yajnik, Chittaranjan S; Chandak, Giriraj R

    2017-07-27

    Imputation is a computational method based on the principle of haplotype sharing allowing enrichment of genome-wide association study datasets. It depends on the haplotype structure of the population and density of the genotype data. The 1000 Genomes Project led to the generation of imputation reference panels which have been used globally. However, recent studies have shown that population-specific panels provide better enrichment of genome-wide variants. We compared the imputation accuracy using 1000 Genomes phase 3 reference panel and a panel generated from genome-wide data on 407 individuals from Western India (WIP). The concordance of imputed variants was cross-checked with next-generation re-sequencing data on a subset of genomic regions. Further, using the genome-wide data from 1880 individuals, we demonstrate that WIP works better than the 1000 Genomes phase 3 panel and when merged with it, significantly improves the imputation accuracy throughout the minor allele frequency range. We also show that imputation using only South Asian component of the 1000 Genomes phase 3 panel works as good as the merged panel, making it computationally less intensive job. Thus, our study stresses that imputation accuracy using 1000 Genomes phase 3 panel can be further improved by including population-specific reference panels from South Asia.

  12. Methylobacterium genome sequences: a reference blueprint to investigate microbial metabolism of C1 compounds from natural and industrial sources.

    PubMed

    Vuilleumier, Stéphane; Chistoserdova, Ludmila; Lee, Ming-Chun; Bringel, Françoise; Lajus, Aurélie; Zhou, Yang; Gourion, Benjamin; Barbe, Valérie; Chang, Jean; Cruveiller, Stéphane; Dossat, Carole; Gillett, Will; Gruffaz, Christelle; Haugen, Eric; Hourcade, Edith; Levy, Ruth; Mangenot, Sophie; Muller, Emilie; Nadalig, Thierry; Pagni, Marco; Penny, Christian; Peyraud, Rémi; Robinson, David G; Roche, David; Rouy, Zoé; Saenampechek, Channakhone; Salvignol, Grégory; Vallenet, David; Wu, Zaining; Marx, Christopher J; Vorholt, Julia A; Olson, Maynard V; Kaul, Rajinder; Weissenbach, Jean; Médigue, Claudine; Lidstrom, Mary E

    2009-01-01

    Methylotrophy describes the ability of organisms to grow on reduced organic compounds without carbon-carbon bonds. The genomes of two pink-pigmented facultative methylotrophic bacteria of the Alpha-proteobacterial genus Methylobacterium, the reference species Methylobacterium extorquens strain AM1 and the dichloromethane-degrading strain DM4, were compared. The 6.88 Mb genome of strain AM1 comprises a 5.51 Mb chromosome, a 1.26 Mb megaplasmid and three plasmids, while the 6.12 Mb genome of strain DM4 features a 5.94 Mb chromosome and two plasmids. The chromosomes are highly syntenic and share a large majority of genes, while plasmids are mostly strain-specific, with the exception of a 130 kb region of the strain AM1 megaplasmid which is syntenic to a chromosomal region of strain DM4. Both genomes contain large sets of insertion elements, many of them strain-specific, suggesting an important potential for genomic plasticity. Most of the genomic determinants associated with methylotrophy are nearly identical, with two exceptions that illustrate the metabolic and genomic versatility of Methylobacterium. A 126 kb dichloromethane utilization (dcm) gene cluster is essential for the ability of strain DM4 to use DCM as the sole carbon and energy source for growth and is unique to strain DM4. The methylamine utilization (mau) gene cluster is only found in strain AM1, indicating that strain DM4 employs an alternative system for growth with methylamine. The dcm and mau clusters represent two of the chromosomal genomic islands (AM1: 28; DM4: 17) that were defined. The mau cluster is flanked by mobile elements, but the dcm cluster disrupts a gene annotated as chelatase and for which we propose the name "island integration determinant" (iid). These two genome sequences provide a platform for intra- and interspecies genomic comparisons in the genus Methylobacterium, and for investigations of the adaptive mechanisms which allow bacterial lineages to acquire methylotrophic

  13. Methylobacterium Genome Sequences: A Reference Blueprint to Investigate Microbial Metabolism of C1 Compounds from Natural and Industrial Sources

    PubMed Central

    Lee, Ming-Chun; Bringel, Françoise; Lajus, Aurélie; Zhou, Yang; Gourion, Benjamin; Barbe, Valérie; Chang, Jean; Cruveiller, Stéphane; Dossat, Carole; Gillett, Will; Gruffaz, Christelle; Haugen, Eric; Hourcade, Edith; Levy, Ruth; Mangenot, Sophie; Muller, Emilie; Nadalig, Thierry; Pagni, Marco; Penny, Christian; Peyraud, Rémi; Robinson, David G.; Roche, David; Rouy, Zoé; Saenampechek, Channakhone; Salvignol, Grégory; Vallenet, David; Wu, Zaining; Marx, Christopher J.; Vorholt, Julia A.; Olson, Maynard V.; Kaul, Rajinder; Weissenbach, Jean; Médigue, Claudine; Lidstrom, Mary E.

    2009-01-01

    Background Methylotrophy describes the ability of organisms to grow on reduced organic compounds without carbon-carbon bonds. The genomes of two pink-pigmented facultative methylotrophic bacteria of the Alpha-proteobacterial genus Methylobacterium, the reference species Methylobacterium extorquens strain AM1 and the dichloromethane-degrading strain DM4, were compared. Methodology/Principal Findings The 6.88 Mb genome of strain AM1 comprises a 5.51 Mb chromosome, a 1.26 Mb megaplasmid and three plasmids, while the 6.12 Mb genome of strain DM4 features a 5.94 Mb chromosome and two plasmids. The chromosomes are highly syntenic and share a large majority of genes, while plasmids are mostly strain-specific, with the exception of a 130 kb region of the strain AM1 megaplasmid which is syntenic to a chromosomal region of strain DM4. Both genomes contain large sets of insertion elements, many of them strain-specific, suggesting an important potential for genomic plasticity. Most of the genomic determinants associated with methylotrophy are nearly identical, with two exceptions that illustrate the metabolic and genomic versatility of Methylobacterium. A 126 kb dichloromethane utilization (dcm) gene cluster is essential for the ability of strain DM4 to use DCM as the sole carbon and energy source for growth and is unique to strain DM4. The methylamine utilization (mau) gene cluster is only found in strain AM1, indicating that strain DM4 employs an alternative system for growth with methylamine. The dcm and mau clusters represent two of the chromosomal genomic islands (AM1: 28; DM4: 17) that were defined. The mau cluster is flanked by mobile elements, but the dcm cluster disrupts a gene annotated as chelatase and for which we propose the name “island integration determinant” (iid). Conclusion/Significance These two genome sequences provide a platform for intra- and interspecies genomic comparisons in the genus Methylobacterium, and for investigations of the adaptive

  14. Procainamide Is a Specific Inhibitor of DNA Methyltransferase 1*

    PubMed Central

    Lee, Byron H.; Yegnasubramanian, Srinivasan; Lin, Xiaohui; Nelson, William G.

    2007-01-01

    CpG island hypermethylation occurs in most cases of cancer, typically resulting in the transcriptional silencing of critical cancer genes. Procainamide has been shown to inhibit DNA methyltransferase activity and reactivate silenced gene expression in cancer cells by reversing CpG island hypermethylation. We report here that procainamide specifically inhibits the hemimethylase activity of DNA methyltransferase 1 (DNMT1), the mammalian enzyme thought to be responsible for maintaining DNA methylation patterns during replication. At micromolar concentrations, procainamide was found to be a partial competitive inhibitor of DNMT1, reducing the affinity of the enzyme for its two substrates, hemimethylated DNA and S-adenosyl-l-methionine. By doing so, procainamide significantly decreased the processivity of DNMT1 on hemimethylated DNA. Procainamide was not a potent inhibitor of the de novo methyltransferases DNMT3a and DNMT3b2. As further evidence of the specificity of procainamide for DNMT1, procainamide failed to lower genomic 5-methyl-2′-deoxycytidine levels in HCT116 colorectal cancer cells when DNMT1 was genetically deleted but significantly reduced genomic 5-methyl-2′-deoxycyti-dine content in parental HCT116 cells and in HCT116 cells where DNMT3b was genetically deleted. Because many reports have strongly linked DNMT1 with epigenetic alterations in carcinogenesis, procainamide may be a useful drug in the prevention of cancer. PMID:16230360

  15. Comprehensive molecular, genomic and phenotypic analysis of a major clone of Enterococcus faecalis MLST ST40.

    PubMed

    Zischka, Melanie; Künne, Carsten T; Blom, Jochen; Wobser, Dominique; Sakιnç, Türkân; Schmidt-Hohagen, Kerstin; Dabrowski, P Wojtek; Nitsche, Andreas; Hübner, Johannes; Hain, Torsten; Chakraborty, Trinad; Linke, Burkhard; Goesmann, Alexander; Voget, Sonja; Daniel, Rolf; Schomburg, Dietmar; Hauck, Rüdiger; Hafez, Hafez M; Tielen, Petra; Jahn, Dieter; Solheim, Margrete; Sadowy, Ewa; Larsen, Jesper; Jensen, Lars B; Ruiz-Garbajosa, Patricia; Quiñones Pérez, Dianelys; Mikalsen, Theresa; Bender, Jennifer; Steglich, Matthias; Nübel, Ulrich; Witte, Wolfgang; Werner, Guido

    2015-03-12

    Enterococcus faecalis is a multifaceted microorganism known to act as a beneficial intestinal commensal bacterium. It is also a dreaded nosocomial pathogen causing life-threatening infections in hospitalised patients. Isolates of a distinct MLST type ST40 represent the most frequent strain type of this species, distributed worldwide and originating from various sources (animal, human, environmental) and different conditions (colonisation/infection). Since enterococci are known to be highly recombinogenic we determined to analyse the microevolution and niche adaptation of this highly distributed clonal type. We compared a set of 42 ST40 isolates by assessing key molecular determinants, performing whole genome sequencing (WGS) and a number of phenotypic assays including resistance profiling, formation of biofilm and utilisation of carbon sources. We generated the first circular closed reference genome of an E. faecalis isolate D32 of animal origin and compared it with the genomes of other reference strains. D32 was used as a template for detailed WGS comparisons of high-quality draft genomes of 14 ST40 isolates. Genomic and phylogenetic analyses suggest a high level of similarity regarding the core genome, also demonstrated by similar carbon utilisation patterns. Distribution of known and putative virulence-associated genes did not differentiate between ST40 strains from a commensal and clinical background or an animal or human source. Further analyses of mobile genetic elements (MGE) revealed genomic diversity owed to: (1) a modularly structured pathogenicity island; (2) a site-specifically integrated and previously unknown genomic island of 138 kb in two strains putatively involved in exopolysaccharide synthesis; and (3) isolate-specific plasmid and phage patterns. Moreover, we used different cell-biological and animal experiments to compare the isolate D32 with a closely related ST40 endocarditis isolate whose draft genome sequence was also generated. D32

  16. Complete genome sequence of the fire blight pathogen Erwinia pyrifoliae DSM 12163T and comparative genomic insights into plant pathogenicity

    PubMed Central

    2010-01-01

    Background Erwinia pyrifoliae is a newly described necrotrophic pathogen, which causes fire blight on Asian (Nashi) pear and is geographically restricted to Eastern Asia. Relatively little is known about its genetics compared to the closely related main fire blight pathogen E. amylovora. Results The genome of the type strain of E. pyrifoliae strain DSM 12163T, was sequenced using both 454 and Solexa pyrosequencing and annotated. The genome contains a circular chromosome of 4.026 Mb and four small plasmids. Based on their respective role in virulence in E. amylovora or related organisms, we identified several putative virulence factors, including type III and type VI secretion systems and their effectors, flagellar genes, sorbitol metabolism, iron uptake determinants, and quorum-sensing components. A deletion in the rpoS gene covering the most conserved region of the protein was identified which may contribute to the difference in virulence/host-range compared to E. amylovora. Comparative genomics with the pome fruit epiphyte Erwinia tasmaniensis Et1/99 showed that both species are overall highly similar, although specific differences were identified, for example the presence of some phage gene-containing regions and a high number of putative genomic islands containing transposases in the E. pyrifoliae DSM 12163T genome. Conclusions The E. pyrifoliae genome is an important addition to the published genome of E. tasmaniensis and the unfinished genome of E. amylovora providing a foundation for re-sequencing additional strains that may shed light on the evolution of the host-range and virulence/pathogenicity of this important group of plant-associated bacteria. PMID:20047678

  17. Genome Sequence of Vibrio cholerae Strain O1 Ogawa El Tor, Isolated in Mexico, 2013

    PubMed Central

    Hernández-Monroy, Irma; López-Martínez, Irma; Ortiz-Alcántara, Joanna; González-Durán, Elizabeth; Ruiz-Matus, Cuitláhuac; Kuri-Morales, Pablo; Ramírez-González, José Ernesto

    2014-01-01

    We present the draft genome sequence of Vibrio cholerae InDRE 3140 recovered in 2013 during a cholera outbreak in Mexico. The genome showed the Vibrio 7th pandemic islands VSP1 and VSP2, the pathogenic islands VPI-1 and VPI-2, the integrative and conjugative element SXT/R391 (ICE-SXT), and both prophages CTXφ and RS1φ. PMID:25359919

  18. Whole-Genome Sequence Variation among Multiple Isolates of Pseudomonas aeruginosa

    PubMed Central

    Spencer, David H.; Kas, Arnold; Smith, Eric E.; Raymond, Christopher K.; Sims, Elizabeth H.; Hastings, Michele; Burns, Jane L.; Kaul, Rajinder; Olson, Maynard V.

    2003-01-01

    Whole-genome shotgun sequencing was used to study the sequence variation of three Pseudomonas aeruginosa isolates, two from clonal infections of cystic fibrosis patients and one from an aquatic environment, relative to the genomic sequence of reference strain PAO1. The majority of the PAO1 genome is represented in these strains; however, at least three prominent islands of PAO1-specific sequence are apparent. Conversely, ∼10% of the sequencing reads derived from each isolate fail to align with the PAO1 backbone. While average sequence variation among all strains is roughly 0.5%, regions of pronounced differences were evident in whole-genome scans of nucleotide diversity. We analyzed two such divergent loci, the pyoverdine and O-antigen biosynthesis regions, by complete resequencing. A thorough analysis of isolates collected over time from one of the cystic fibrosis patients revealed independent mutations resulting in the loss of O-antigen synthesis alternating with a mucoid phenotype. Overall, we conclude that most of the PAO1 genome represents a core P. aeruginosa backbone sequence while the strains addressed in this study possess additional genetic material that accounts for at least 10% of their genomes. Approximately half of these additional sequences are novel. PMID:12562802

  19. A distinct group of CpG islands shows differential DNA methylation between replicas of the same cell line in vitro

    PubMed Central

    2013-01-01

    Background CpG dinucleotide-rich genomic DNA regions, known as CpG islands (CGIs), can be methylated at their cytosine residues as an epigenetic mark that is stably inherited during cell mitosis. Differentially methylated regions (DMRs) are genomic regions showing different degrees of DNA methylation in multiple samples. In this study, we focused our attention on CGIs showing different DNA methylation between two culture replicas of the same cell line. Results We used methylation data of 35 cell lines from the Encyclopedia of DNA Elements (ENCODE) consortium to identify CpG islands that were differentially methylated between replicas of the same cell line and denoted them Inter Replicas Differentially Methylated CpG islands (IRDM-CGIs). We identified a group of IRDM-CGIs that was consistently shared by different cell lines, and denoted it common IRDM-CGIs. X chromosome CGIs were overrepresented among common IRDM-CGIs. Autosomal IRDM-CGIs were preferentially located in gene bodies and intergenic regions had a lower G + C content, a smaller mean length, and a reduced CpG percentage. Functional analysis of the genes associated with autosomal IRDM-CGIs showed that many of them are involved in DNA binding and development. Conclusions Our results show that several specific functional and structural features characterize common IRDM-CGIs. They may represent a specific subset of CGIs that are more prone to being differentially methylated for their intrinsic characteristics. PMID:24106769

  20. Comparative Genomics Reveals High Genomic Diversity in the Genus Photobacterium.

    PubMed

    Machado, Henrique; Gram, Lone

    2017-01-01

    Vibrionaceae is a large marine bacterial family, which can constitute up to 50% of the prokaryotic population in marine waters. Photobacterium is the second largest genus in the family and we used comparative genomics on 35 strains representing 16 of the 28 species described so far, to understand the genomic diversity present in the Photobacterium genus. Such understanding is important for ecophysiology studies of the genus. We used whole genome sequences to evaluate phylogenetic relationships using several analyses (16S rRNA, MLSA, fur , amino-acid usage, ANI), which allowed us to identify two misidentified strains. Genome analyses also revealed occurrence of higher and lower GC content clades, correlating with phylogenetic clusters. Pan- and core-genome analysis revealed the conservation of 25% of the genome throughout the genus, with a large and open pan-genome. The major source of genomic diversity could be traced to the smaller chromosome and plasmids. Several of the physiological traits studied in the genus did not correlate with phylogenetic data. Since horizontal gene transfer (HGT) is often suggested as a source of genetic diversity and a potential driver of genomic evolution in bacterial species, we looked into evidence of such in Photobacterium genomes. Genomic islands were the source of genomic differences between strains of the same species. Also, we found transposase genes and CRISPR arrays that suggest multiple encounters with foreign DNA. Presence of genomic exchange traits was widespread and abundant in the genus, suggesting a role in genomic evolution. The high genetic variability and indications of genetic exchange make it difficult to elucidate genome evolutionary paths and raise the awareness of the roles of foreign DNA in the genomic evolution of environmental organisms.

  1. Comparative Genome Analysis Provides Insights into Both the Lifestyle of Acidithiobacillus ferrivorans Strain CF27 and the Chimeric Nature of the Iron-Oxidizing Acidithiobacilli Genomes.

    PubMed

    Tran, Tam T T; Mangenot, Sophie; Magdelenat, Ghislaine; Payen, Emilie; Rouy, Zoé; Belahbib, Hassiba; Grail, Barry M; Johnson, D Barrie; Bonnefoy, Violaine; Talla, Emmanuel

    2017-01-01

    The iron-oxidizing species Acidithiobacillus ferrivorans is one of few acidophiles able to oxidize ferrous iron and reduced inorganic sulfur compounds at low temperatures (<10°C). To complete the genome of At. ferrivorans strain CF27, new sequences were generated, and an update assembly and functional annotation were undertaken, followed by a comparative analysis with other Acidithiobacillus species whose genomes are publically available. The At. ferrivorans CF27 genome comprises a 3,409,655 bp chromosome and a 46,453 bp plasmid. At. ferrivorans CF27 possesses genes allowing its adaptation to cold, metal(loid)-rich environments, as well as others that enable it to sense environmental changes, allowing At. ferrivorans CF27 to escape hostile conditions and to move toward favorable locations. Interestingly, the genome of At. ferrivorans CF27 exhibits a large number of genomic islands (mostly containing genes of unknown function), suggesting that a large number of genes has been acquired by horizontal gene transfer over time. Furthermore, several genes specific to At. ferrivorans CF27 have been identified that could be responsible for the phenotypic differences of this strain compared to other Acidithiobacillus species. Most genes located inside At. ferrivorans CF27-specific gene clusters which have been analyzed were expressed by both ferrous iron-grown and sulfur-attached cells, indicating that they are not pseudogenes and may play a role in both situations. Analysis of the taxonomic composition of genomes of the Acidithiobacillia infers that they are chimeric in nature, supporting the premise that they belong to a particular taxonomic class, distinct to other proteobacterial subgroups.

  2. Genome-wide identification of lineage-specific genes in Arabidopsis, Oryza and Populus

    DOE Office of Scientific and Technical Information (OSTI.GOV)

    Yang, Xiaohan; Jawdy, Sara; Tschaplinski, Timothy J

    2009-01-01

    Protein sequences were compared among Arabidopsis, Oryza and Populus to identify differential gene (DG) sets that are in one but not the other two genomes. The DG sets were screened against a plant transcript database, the NR protein database and six newly-sequenced genomes (Carica, Glycine, Medicago, Sorghum, Vitis and Zea) to identify a set of species-specific genes (SS). Gene expression, protein motif and intron number were examined. 192, 641 and 109 SS genes were identified in Arabidopsis, Oryza and Populus, respectively. Some SS genes were preferentially expressed in flowers, roots, xylem and cambium or up-regulated by stress. Six conserved motifsmore » in Arabidopsis and Oryza SS proteins were found in other distant lineages. The SS gene sets were enriched with intronless genes. The results reflect functional and/or anatomical differences between monocots and eudicots or between herbaceous and woody plants. The Populus-specific genes are candidates for carbon sequestration and biofuel research.« less

  3. Enhanced sensitivity of CpG island search and primer design based on predicted CpG island position.

    PubMed

    Park, Hyun-Chul; Ahn, Eu-Ree; Jung, Ju Yeon; Park, Ji-Hye; Lee, Jee Won; Lim, Si-Keun; Kim, Won

    2018-05-01

    DNA methylation has important biological roles, such as gene expression regulation, as well as practical applications in forensics, such as in body fluid identification and age estimation. DNA methylation often occurs in the CpG site, and methylation within the CpG islands affects various cellular functions and is related to tissue-specific identification. Several programs have been developed to identify CpG islands; however, the size, location, and number of predicted CpG islands are not identical due to different search algorithms. In addition, they only provide structural information for predicted CpG islands without experimental information, such as primer design. We developed an analysis pipeline package, CpGPNP, to integrate CpG island prediction and primer design. CpGPNP predicts CpG islands more accurately and sensitively than other programs, and designs primers easily based on the predicted CpG island locations. The primer design function included standard, bisulfite, and methylation-specific PCR to identify the methylation of particular CpG sites. In this study, we performed CpG island prediction on all chromosomes and compared CpG island search performance of CpGPNP with other CpG island prediction programs. In addition, we compared the position of primers designed for a specific region within the predicted CpG island using other bisulfite PCR primer programs. The primers designed by CpGPNP were used to experimentally verify the amplification of the target region of markers for body fluid identification and age estimation. CpGPNP is freely available at http://forensicdna.kr/cpgpnp/. Copyright © 2018 Elsevier B.V. All rights reserved.

  4. Predicting aberrant CpG island methylation

    PubMed Central

    Feltus, F. A.; Lee, E. K.; Costello, J. F.; Plass, C.; Vertino, P. M.

    2003-01-01

    Epigenetic silencing associated with aberrant methylation of promoter region CpG islands is one mechanism leading to loss of tumor suppressor function in human cancer. Profiling of CpG island methylation indicates that some genes are more frequently methylated than others, and that each tumor type is associated with a unique set of methylated genes. However, little is known about why certain genes succumb to this aberrant event. To address this question, we used Restriction Landmark Genome Scanning to analyze the susceptibility of 1,749 unselected CpG islands to de novo methylation driven by overexpression of DNA cytosine-5-methyltransferase 1 (DNMT1). We found that although the overall incidence of CpG island methylation was increased in cells overexpressing DNMT1, not all loci were equally affected. The majority of CpG islands (69.9%) were resistant to de novo methylation, regardless of DNMT1 overexpression. In contrast, we identified a subset of methylation-prone CpG islands (3.8%) that were consistently hypermethylated in multiple DNMT1 overexpressing clones. Methylation-prone and methylation-resistant CpG islands were not significantly different with respect to size, C+G content, CpG frequency, chromosomal location, or promoter association. We used DNA pattern recognition and supervised learning techniques to derive a classification function based on the frequency of seven novel sequence patterns that was capable of discriminating methylation-prone from methylation-resistant CpG islands with 82% accuracy. The data indicate that CpG islands differ in their intrinsic susceptibility to de novo methylation, and suggest that the propensity for a CpG island to become aberrantly methylated can be predicted based on its sequence context. PMID:14519846

  5. Predicting aberrant CpG island methylation.

    PubMed

    Feltus, F A; Lee, E K; Costello, J F; Plass, C; Vertino, P M

    2003-10-14

    Epigenetic silencing associated with aberrant methylation of promoter region CpG islands is one mechanism leading to loss of tumor suppressor function in human cancer. Profiling of CpG island methylation indicates that some genes are more frequently methylated than others, and that each tumor type is associated with a unique set of methylated genes. However, little is known about why certain genes succumb to this aberrant event. To address this question, we used Restriction Landmark Genome Scanning to analyze the susceptibility of 1,749 unselected CpG islands to de novo methylation driven by overexpression of DNA cytosine-5-methyltransferase 1 (DNMT1). We found that although the overall incidence of CpG island methylation was increased in cells overexpressing DNMT1, not all loci were equally affected. The majority of CpG islands (69.9%) were resistant to de novo methylation, regardless of DNMT1 overexpression. In contrast, we identified a subset of methylation-prone CpG islands (3.8%) that were consistently hypermethylated in multiple DNMT1 overexpressing clones. Methylation-prone and methylation-resistant CpG islands were not significantly different with respect to size, C+G content, CpG frequency, chromosomal location, or promoter association. We used DNA pattern recognition and supervised learning techniques to derive a classification function based on the frequency of seven novel sequence patterns that was capable of discriminating methylation-prone from methylation-resistant CpG islands with 82% accuracy. The data indicate that CpG islands differ in their intrinsic susceptibility to de novo methylation, and suggest that the propensity for a CpG island to become aberrantly methylated can be predicted based on its sequence context.

  6. Genomic selection for crossbred performance accounting for breed-specific effects.

    PubMed

    Lopes, Marcos S; Bovenhuis, Henk; Hidalgo, André M; van Arendonk, Johan A M; Knol, Egbert F; Bastiaansen, John W M

    2017-06-26

    Breed-specific effects are observed when the same allele of a given genetic marker has a different effect depending on its breed origin, which results in different allele substitution effects across breeds. In such a case, single-breed breeding values may not be the most accurate predictors of crossbred performance. Our aim was to estimate the contribution of alleles from each parental breed to the genetic variance of traits that are measured in crossbred offspring, and to compare the prediction accuracies of estimated direct genomic values (DGV) from a traditional genomic selection model (GS) that are trained on purebred or crossbred data, with accuracies of DGV from a model that accounts for breed-specific effects (BS), trained on purebred or crossbred data. The final dataset was composed of 924 Large White, 924 Landrace and 924 two-way cross (F1) genotyped and phenotyped animals. The traits evaluated were litter size (LS) and gestation length (GL) in pigs. The genetic correlation between purebred and crossbred performance was higher than 0.88 for both LS and GL. For both traits, the additive genetic variance was larger for alleles inherited from the Large White breed compared to alleles inherited from the Landrace breed (0.74 and 0.56 for LS, and 0.42 and 0.40 for GL, respectively). The highest prediction accuracies of crossbred performance were obtained when training was done on crossbred data. For LS, prediction accuracies were the same for GS and BS DGV (0.23), while for GL, prediction accuracy for BS DGV was similar to the accuracy of GS DGV (0.53 and 0.52, respectively). In this study, training on crossbred data resulted in higher prediction accuracy than training on purebred data and evidence of breed-specific effects for LS and GL was demonstrated. However, when training was done on crossbred data, both GS and BS models resulted in similar prediction accuracies. In future studies, traits with a lower genetic correlation between purebred and crossbred

  7. Race to the Top. Rhode Island Report. Year 4: School Year 2013-2014. [State-Specific Summary Report

    ERIC Educational Resources Information Center

    US Department of Education, 2015

    2015-01-01

    This State-specific summary report serves as an assessment of Rhode Island's Year 3 Race to the Top implementation, highlighting successes and accomplishments, identifying challenges, and providing lessons learned from implementation from approximately September 2013 through September 2014. Building upon the successes of Years 1 through 3, in Year…

  8. Race to the Top. Rhode Island Report. Year 3: School Year 2012-2013. [State-Specific Summary Report

    ERIC Educational Resources Information Center

    US Department of Education, 2014

    2014-01-01

    This State-specific summary report serves as an assessment of Rhode Island's Year 3 Race to the Top implementation, highlighting successes and accomplishments, identifying challenges, and providing lessons learned from implementation from approximately September 2012 through September 2013. In Year 3, many initiatives that were in the development…

  9. Whole-genome relationships among Francisella bacteria of diverse origins define new species and provide specific regions for detection

    DOE PAGES

    Challacombe, Jean Faust; Petersen, Jeannine M.; Gallegos-Graves, La Verne A.; ...

    2016-11-23

    Francisella tularensis is a highly virulent zoonotic pathogen that causes tularemia and, because of weaponization efforts in past world wars, is considered a tier 1 biothreat agent. Detection and surveillance of F. tularensis may be confounded by the presence of uncharacterized, closely related organisms. Through DNA-based diagnostics and environmental surveys, novel clinical and environmental Francisella isolates have been obtained in recent years. Here we present 7 new Francisella genomes and a comparison of their characteristics to each other and to 24 publicly available genomes as well as a comparative analysis of 16S rRNA and sdhA genes from over 90 Francisellamore » strains. Delineation of new species in bacteria is challenging, especially when isolates having very close genomic characteristics exhibit different physiological features—for example, when some are virulent pathogens in humans and animals while others are nonpathogenic or are opportunistic pathogens. Species resolution within Francisella varies with analyses of single genes, multiple gene or protein sets, or whole-genome comparisons of nucleic acid and amino acid sequences. Analyses focusing on single genes (16S rRNA, sdhA), multiple gene sets (virulence genes, lipopolysaccharide [LPS] biosynthesis genes, pathogenicity island), and whole-genome comparisons (nucleotide and protein) gave congruent results, but with different levels of discrimination confidence. We designate four new species within the genus; Francisella opportunistica sp. nov. (MA06-7296), Francisella salina sp. nov. (TX07-7308), Francisella uliginis sp. nov. (TX07-7310), and Francisella frigiditurris sp. nov. (CA97-1460). Lastly, this study provides a robust comparative framework to discern species and virulence features of newly detected Francisella bacteria.« less

  10. Whole-genome relationships among Francisella bacteria of diverse origins define new species and provide specific regions for detection

    DOE Office of Scientific and Technical Information (OSTI.GOV)

    Challacombe, Jean Faust; Petersen, Jeannine M.; Gallegos-Graves, La Verne A.

    Francisella tularensis is a highly virulent zoonotic pathogen that causes tularemia and, because of weaponization efforts in past world wars, is considered a tier 1 biothreat agent. Detection and surveillance of F. tularensis may be confounded by the presence of uncharacterized, closely related organisms. Through DNA-based diagnostics and environmental surveys, novel clinical and environmental Francisella isolates have been obtained in recent years. Here we present 7 new Francisella genomes and a comparison of their characteristics to each other and to 24 publicly available genomes as well as a comparative analysis of 16S rRNA and sdhA genes from over 90 Francisellamore » strains. Delineation of new species in bacteria is challenging, especially when isolates having very close genomic characteristics exhibit different physiological features—for example, when some are virulent pathogens in humans and animals while others are nonpathogenic or are opportunistic pathogens. Species resolution within Francisella varies with analyses of single genes, multiple gene or protein sets, or whole-genome comparisons of nucleic acid and amino acid sequences. Analyses focusing on single genes (16S rRNA, sdhA), multiple gene sets (virulence genes, lipopolysaccharide [LPS] biosynthesis genes, pathogenicity island), and whole-genome comparisons (nucleotide and protein) gave congruent results, but with different levels of discrimination confidence. We designate four new species within the genus; Francisella opportunistica sp. nov. (MA06-7296), Francisella salina sp. nov. (TX07-7308), Francisella uliginis sp. nov. (TX07-7310), and Francisella frigiditurris sp. nov. (CA97-1460). Lastly, this study provides a robust comparative framework to discern species and virulence features of newly detected Francisella bacteria.« less

  11. Comparative Genomics and Identification of an Enterotoxin-Bearing Pathogenicity Island, SEPI-1/SECI-1, in Staphylococcus epidermidis Pathogenic Strains.

    PubMed

    Argemi, Xavier; Nanoukon, Chimène; Affolabi, Dissou; Keller, Daniel; Hansmann, Yves; Riegel, Philippe; Baba-Moussa, Lamine; Prévost, Gilles

    2018-02-25

    Staphylococcus epidermidis is a leading cause of nosocomial infections, majorly resistant to beta-lactam antibiotics, and may transfer several mobile genetic elements among the members of its own species, as well as to Staphylococcus aureus ; however, a genetic exchange from S. aureus to S. epidermidis remains controversial. We recently identified two pathogenic clinical strains of S. epidermidis that produce a staphylococcal enterotoxin C3-like (SEC) similar to that by S. aureus pathogenicity islands. This study aimed to determine the genetic environment of the SEC-coding sequence and to identify the mobile genetic elements. Whole-genome sequencing and annotation of the S. epidermidis strains were performed using Illumina technology and a bioinformatics pipeline for assembly, which provided evidence that the SEC-coding sequences were located in a composite pathogenicity island that was previously described in the S. epidermidis strain FRI909, called SePI-1/SeCI-1, with 83.8-89.7% nucleotide similarity. Various other plasmids were identified, particularly p_3_95 and p_4_95, which carry antibiotic resistance genes ( hsrA and dfrG , respectively), and share homologies with SAP085A and pUSA04-2-SUR11, two plasmids described in S. aureus . Eventually, one complete prophage was identified, ΦSE90, sharing 30 out of 52 coding sequences with the Acinetobacter phage vB_AbaM_IME200. Thus, the SePI-1/SeCI-1 pathogenicity island was identified in two pathogenic strains of S. epidermidis that produced a SEC enterotoxin causing septic shock. These findings suggest the existence of in vivo genetic exchange from S. aureus to S. epidermidis .

  12. Comparative Genomics and Identification of an Enterotoxin-Bearing Pathogenicity Island, SEPI-1/SECI-1, in Staphylococcus epidermidis Pathogenic Strains

    PubMed Central

    Nanoukon, Chimène; Affolabi, Dissou; Keller, Daniel; Hansmann, Yves; Riegel, Philippe; Baba-Moussa, Lamine; Prévost, Gilles

    2018-01-01

    Staphylococcus epidermidis is a leading cause of nosocomial infections, majorly resistant to beta-lactam antibiotics, and may transfer several mobile genetic elements among the members of its own species, as well as to Staphylococcus aureus; however, a genetic exchange from S. aureus to S. epidermidis remains controversial. We recently identified two pathogenic clinical strains of S. epidermidis that produce a staphylococcal enterotoxin C3-like (SEC) similar to that by S. aureus pathogenicity islands. This study aimed to determine the genetic environment of the SEC-coding sequence and to identify the mobile genetic elements. Whole-genome sequencing and annotation of the S. epidermidis strains were performed using Illumina technology and a bioinformatics pipeline for assembly, which provided evidence that the SEC-coding sequences were located in a composite pathogenicity island that was previously described in the S. epidermidis strain FRI909, called SePI-1/SeCI-1, with 83.8–89.7% nucleotide similarity. Various other plasmids were identified, particularly p_3_95 and p_4_95, which carry antibiotic resistance genes (hsrA and dfrG, respectively), and share homologies with SAP085A and pUSA04-2-SUR11, two plasmids described in S. aureus. Eventually, one complete prophage was identified, ΦSE90, sharing 30 out of 52 coding sequences with the Acinetobacter phage vB_AbaM_IME200. Thus, the SePI-1/SeCI-1 pathogenicity island was identified in two pathogenic strains of S. epidermidis that produced a SEC enterotoxin causing septic shock. These findings suggest the existence of in vivo genetic exchange from S. aureus to S. epidermidis. PMID:29495323

  13. Versatile Gene-Specific Sequence Tags for Arabidopsis Functional Genomics: Transcript Profiling and Reverse Genetics Applications

    PubMed Central

    Hilson, Pierre; Allemeersch, Joke; Altmann, Thomas; Aubourg, Sébastien; Avon, Alexandra; Beynon, Jim; Bhalerao, Rishikesh P.; Bitton, Frédérique; Caboche, Michel; Cannoot, Bernard; Chardakov, Vasil; Cognet-Holliger, Cécile; Colot, Vincent; Crowe, Mark; Darimont, Caroline; Durinck, Steffen; Eickhoff, Holger; de Longevialle, Andéol Falcon; Farmer, Edward E.; Grant, Murray; Kuiper, Martin T.R.; Lehrach, Hans; Léon, Céline; Leyva, Antonio; Lundeberg, Joakim; Lurin, Claire; Moreau, Yves; Nietfeld, Wilfried; Paz-Ares, Javier; Reymond, Philippe; Rouzé, Pierre; Sandberg, Goran; Segura, Maria Dolores; Serizet, Carine; Tabrett, Alexandra; Taconnat, Ludivine; Thareau, Vincent; Van Hummelen, Paul; Vercruysse, Steven; Vuylsteke, Marnik; Weingartner, Magdalena; Weisbeek, Peter J.; Wirta, Valtteri; Wittink, Floyd R.A.; Zabeau, Marc; Small, Ian

    2004-01-01

    Microarray transcript profiling and RNA interference are two new technologies crucial for large-scale gene function studies in multicellular eukaryotes. Both rely on sequence-specific hybridization between complementary nucleic acid strands, inciting us to create a collection of gene-specific sequence tags (GSTs) representing at least 21,500 Arabidopsis genes and which are compatible with both approaches. The GSTs were carefully selected to ensure that each of them shared no significant similarity with any other region in the Arabidopsis genome. They were synthesized by PCR amplification from genomic DNA. Spotted microarrays fabricated from the GSTs show good dynamic range, specificity, and sensitivity in transcript profiling experiments. The GSTs have also been transferred to bacterial plasmid vectors via recombinational cloning protocols. These cloned GSTs constitute the ideal starting point for a variety of functional approaches, including reverse genetics. We have subcloned GSTs on a large scale into vectors designed for gene silencing in plant cells. We show that in planta expression of GST hairpin RNA results in the expected phenotypes in silenced Arabidopsis lines. These versatile GST resources provide novel and powerful tools for functional genomics. PMID:15489341

  14. EG-13GENOME-WIDE METHYLATION ANALYSIS IDENTIFIES GENOMIC DNA DEMETHYLATION DURING MALIGNANT PROGRESSION OF GLIOMAS

    PubMed Central

    Saito, Kuniaki; Mukasa, Akitake; Nagae, Genta; Aihara, Koki; Otani, Ryohei; Takayanagi, Shunsaku; Omata, Mayu; Tanaka, Shota; Shibahara, Junji; Takahashi, Miwako; Momose, Toshimitsu; Shimamura, Teppei; Miyano, Satoru; Narita, Yoshitaka; Ueki, Keisuke; Nishikawa, Ryo; Nagane, Motoo; Aburatani, Hiroyuki; Saito, Nobuhito

    2014-01-01

    Low-grade gliomas often undergo malignant progression, and these transformations are a leading cause of death in patients with low-grade gliomas. However, the molecular mechanisms underlying malignant tumor progression are still not well understood. Recent evidence indicates that epigenetic deregulation is an important cause of gliomagenesis; therefore, we examined the impact of epigenetic changes during malignant progression of low-grade gliomas. Specifically, we used the Illumina Infinium Human Methylation 450K BeadChip to perform genome-wide DNA methylation analysis of 120 gliomas and four normal brains. This study sample included 25 matched-pairs of initial low-grade gliomas and recurrent tumors (temporal heterogeneity) and 20 of the 25 recurring tumors recurred as malignant progressions, and one matched-pair of newly emerging malignant lesions and pre-existing lesions (spatial heterogeneity). Analyses of methylation profiles demonstrated that most low-grade gliomas in our sample (43/51; 84%) had a CpG island methylator phenotype (G-CIMP). Remarkably, approximately 50% of secondary glioblastomas that had progressed from low-grade tumors with the G-CIMP status exhibited a characteristic partial demethylation of genomic DNA during malignant progression, but other recurrent gliomas showed no apparent change in DNA methylation pattern. Interestingly, we found that most loci that were demethylated during malignant progression were located outside of CpG islands. The information of histone modifications patterns in normal human astrocytes and embryonal stem cells also showed that the ratio of active marks at the site corresponding to DNA demethylated loci in G-CIMP-demethylated tumors was significantly lower; this finding indicated that most demethylated loci in G-CIMP-demethylated tumors were likely transcriptionally inactive. A small number of the genes that were upregulated and had demethylated CpG islands were associated with cell cycle-related pathway. In

  15. Involvement of β-carbonic anhydrase (β-CA) genes in bacterial genomic islands and horizontal transfer to protists.

    PubMed

    Zolfaghari Emameh, Reza; Barker, Harlan R; Hytönen, Vesa P; Parkkila, Seppo

    2018-05-25

    Genomic islands (GIs) are a type of mobile genetic element (MGE) that are present in bacterial chromosomes. They consist of a cluster of genes which produce proteins that contribute to a variety of functions, including, but not limited to, regulation of cell metabolism, anti-microbial resistance, pathogenicity, virulence, and resistance to heavy metals. The genes carried in MGEs can be used as a trait reservoir in times of adversity. Transfer of genes using MGEs, occurring outside of reproduction, is called horizontal gene transfer (HGT). Previous literature has shown that numerous HGT events have occurred through endosymbiosis between prokaryotes and eukaryotes.Beta carbonic anhydrase (β-CA) enzymes play a critical role in the biochemical pathways of many prokaryotes and eukaryotes. We have previously suggested horizontal transfer of β-CA genes from plasmids of some prokaryotic endosymbionts to their protozoan hosts. In this study, we set out to identify β-CA genes that might have transferred between prokaryotic and protist species through HGT in GIs. Therefore, we investigated prokaryotic chromosomes containing β-CA-encoding GIs and utilized multiple bioinformatics tools to reveal the distinct movements of β-CA genes among a wide variety of organisms. Our results identify the presence of β-CA genes in GIs of several medically and industrially relevant bacterial species, and phylogenetic analyses reveal multiple cases of likely horizontal transfer of β-CA genes from GIs of ancestral prokaryotes to protists. IMPORTANCE The evolutionary process is mediated by mobile genetic elements (MGEs), such as genomic islands (GIs). A gene or set of genes in the GIs are exchanged between and within various species through horizontal gene transfer (HGT). Based on the crucial role that GIs can play in bacterial survival and proliferation, they were introduced as the environmental- and pathogen-associated factors. Carbonic anhydrases (CAs) are involved in many critical

  16. Legionella pneumophila pangenome reveals strain-specific virulence factors.

    PubMed

    D'Auria, Giuseppe; Jiménez-Hernández, Nuria; Peris-Bondia, Francesc; Moya, Andrés; Latorre, Amparo

    2010-03-17

    Legionella pneumophila subsp. pneumophila is a gram-negative gamma-Proteobacterium and the causative agent of Legionnaires' disease, a form of epidemic pneumonia. It has a water-related life cycle. In industrialized cities L. pneumophila is commonly encountered in refrigeration towers and water pipes. Infection is always via infected aerosols to humans. Although many efforts have been made to eradicate Legionella from buildings, it still contaminates the water systems. The town of Alcoy (Valencian Region, Spain) has had recurrent outbreaks since 1999. The strain "Alcoy 2300/99" is a particularly persistent and recurrent strain that was isolated during one of the most significant outbreaks between the years 1999-2000. We have sequenced the genome of the particularly persistent L. pneumophila strain Alcoy 2300/99 and have compared it with four previously sequenced strains known as Philadelphia (USA), Lens (France), Paris (France) and Corby (England).Pangenome analysis facilitated the identification of strain-specific features, as well as some that are shared by two or more strains. We identified: (1) three islands related to anti-drug resistance systems; (2) a system for transport and secretion of heavy metals; (3) three systems related to DNA transfer; (4) two CRISPR (Clustered Regularly Interspaced Short Palindromic Repeats) systems, known to provide resistance against phage infections, one similar in the Lens and Alcoy strains, and another specific to the Paris strain; and (5) seven islands of phage-related proteins, five of which seem to be strain-specific and two shared. The dispensable genome disclosed by the pangenomic analysis seems to be a reservoir of new traits that have mainly been acquired by horizontal gene transfer and could confer evolutionary advantages over strains lacking them.

  17. New Insights into the Classification and Integration Specificity of Streptococcus Integrative Conjugative Elements through Extensive Genome Exploration.

    PubMed

    Ambroset, Chloé; Coluzzi, Charles; Guédon, Gérard; Devignes, Marie-Dominique; Loux, Valentin; Lacroix, Thomas; Payot, Sophie; Leblond-Bourget, Nathalie

    2015-01-01

    Recent genome analyses suggest that integrative and conjugative elements (ICEs) are widespread in bacterial genomes and therefore play an essential role in horizontal transfer. However, only a few of these elements are precisely characterized and correctly delineated within sequenced bacterial genomes. Even though previous analysis showed the presence of ICEs in some species of Streptococci, the global prevalence and diversity of ICEs was not analyzed in this genus. In this study, we searched for ICEs in the completely sequenced genomes of 124 strains belonging to 27 streptococcal species. These exhaustive analyses revealed 105 putative ICEs and 26 slightly decayed elements whose limits were assessed and whose insertion site was identified. These ICEs were grouped in seven distinct unrelated or distantly related families, according to their conjugation modules. Integration of these streptococcal ICEs is catalyzed either by a site-specific tyrosine integrase, a low-specificity tyrosine integrase, a site-specific single serine integrase, a triplet of site-specific serine integrases or a DDE transposase. Analysis of their integration site led to the detection of 18 target-genes for streptococcal ICE insertion including eight that had not been identified previously (ftsK, guaA, lysS, mutT, rpmG, rpsI, traG, and ebfC). It also suggests that all specificities have evolved to minimize the impact of the insertion on the host. This overall analysis of streptococcal ICEs emphasizes their prevalence and diversity and demonstrates that exchanges or acquisitions of conjugation and recombination modules are frequent.

  18. Tourism and Specific Risk Areas for Cryptococcus gattii, Vancouver Island, Canada

    PubMed Central

    Chambers, Catharine; MacDougall, Laura; Li, Min

    2008-01-01

    We compared travel histories of case-patients with Cryptococcus gattii infection during 1999–2006 to travel destinations of the general public on Vancouver Island, British Columbia, Canada. Findings validated and refined estimates of risk on the basis of place of residence and showed no spatial progression of risk areas on this island over time. PMID:18976570

  19. Comparative (Meta)genomic Analysis and Ecological Profiling of Human Gut-Specific Bacteriophage φB124-14

    PubMed Central

    Ogilvie, Lesley A.; Caplin, Jonathan; Dedi, Cinzia; Diston, David; Cheek, Elizabeth; Bowler, Lucas; Taylor, Huw; Ebdon, James; Jones, Brian V.

    2012-01-01

    Bacteriophage associated with the human gut microbiome are likely to have an important impact on community structure and function, and provide a wealth of biotechnological opportunities. Despite this, knowledge of the ecology and composition of bacteriophage in the gut bacterial community remains poor, with few well characterized gut-associated phage genomes currently available. Here we describe the identification and in-depth (meta)genomic, proteomic, and ecological analysis of a human gut-specific bacteriophage (designated φB124-14). In doing so we illuminate a fraction of the biological dark matter extant in this ecosystem and its surrounding eco-genomic landscape, identifying a novel and uncharted bacteriophage gene-space in this community. φB124-14 infects only a subset of closely related gut-associated Bacteroides fragilis strains, and the circular genome encodes functions previously found to be rare in viral genomes and human gut viral metagenome sequences, including those which potentially confer advantages upon phage and/or host bacteria. Comparative genomic analyses revealed φB124-14 is most closely related to φB40-8, the only other publically available Bacteroides sp. phage genome, whilst comparative metagenomic analysis of both phage failed to identify any homologous sequences in 136 non-human gut metagenomic datasets searched, supporting the human gut-specific nature of this phage. Moreover, a potential geographic variation in the carriage of these and related phage was revealed by analysis of their distribution and prevalence within 151 human gut microbiomes and viromes from Europe, America and Japan. Finally, ecological profiling of φB124-14 and φB40-8, using both gene-centric alignment-driven phylogenetic analyses, as well as alignment-free gene-independent approaches was undertaken. This not only verified the human gut-specific nature of both phage, but also indicated that these phage populate a distinct and unexplored ecological landscape

  20. Comparative Genomics Reveals High Genomic Diversity in the Genus Photobacterium

    PubMed Central

    Machado, Henrique; Gram, Lone

    2017-01-01

    Vibrionaceae is a large marine bacterial family, which can constitute up to 50% of the prokaryotic population in marine waters. Photobacterium is the second largest genus in the family and we used comparative genomics on 35 strains representing 16 of the 28 species described so far, to understand the genomic diversity present in the Photobacterium genus. Such understanding is important for ecophysiology studies of the genus. We used whole genome sequences to evaluate phylogenetic relationships using several analyses (16S rRNA, MLSA, fur, amino-acid usage, ANI), which allowed us to identify two misidentified strains. Genome analyses also revealed occurrence of higher and lower GC content clades, correlating with phylogenetic clusters. Pan- and core-genome analysis revealed the conservation of 25% of the genome throughout the genus, with a large and open pan-genome. The major source of genomic diversity could be traced to the smaller chromosome and plasmids. Several of the physiological traits studied in the genus did not correlate with phylogenetic data. Since horizontal gene transfer (HGT) is often suggested as a source of genetic diversity and a potential driver of genomic evolution in bacterial species, we looked into evidence of such in Photobacterium genomes. Genomic islands were the source of genomic differences between strains of the same species. Also, we found transposase genes and CRISPR arrays that suggest multiple encounters with foreign DNA. Presence of genomic exchange traits was widespread and abundant in the genus, suggesting a role in genomic evolution. The high genetic variability and indications of genetic exchange make it difficult to elucidate genome evolutionary paths and raise the awareness of the roles of foreign DNA in the genomic evolution of environmental organisms. PMID:28706512

  1. Genomic Science in Understanding Cholera Outbreaks and Evolution of Vibrio cholerae as a Human Pathogen

    PubMed Central

    Mekalanos, John J.

    2014-01-01

    Modern genomic and bioinformatic approaches have been applied to interrogate the V. cholerae genome, the role of genomic elements in cholera disease, and the origin, relatedness, and dissemination of epidemic strains. A universal attribute of choleragenic strains includes a repertoire of pathogenicity islands and virulence genes, namely the CTX–ϕ prophage and Toxin Co-regulated Pilus (TCP) in addition to other virulent genetic elements including those referred to as Seventh Pandemic Islands. During the last decade, the advent of Next Generation Sequencing (NGS) has provided highly resolved and often complete genomic sequences of epidemic isolates in addition to both clinical and environmental strains isolated from geographically unconnected regions. Genomic comparisons of these strains, as was completed during and following the Haitian outbreak in 2010, reveals that most epidemic strains appear closely related, regardless of region of origin. Non-O1 clinical or environmental strains may also possess some virulence islands, but phylogenic analysis of the core genome suggests they are more diverse and distantly related than those isolated during epidemics. Like Haiti, genomic studies that examine both the Vibrio core- and pan-genome in addition to Single Nucleotide Polymorphisms (SNPs) conclude that a number of epidemics are caused by strains that closely resemble those in Asia, and often appear to originate there and then spread globally. The accumulation of SNPs in the epidemic strains over time can then be applied to better understand the evolution of the V. cholerae genome as an etiological agent. PMID:24590676

  2. The CpG island searcher: a new WWW resource.

    PubMed

    Takai, Daiya; Jones, Peter A

    2003-01-01

    Clusters of CpG dinucleotides in GC rich regions of the genome called "CpG islands" frequently occur in the 5' ends of genes. Methylation of CpG islands plays a role in transcriptional silencing in higher organisms in certain situations. We have established a CpG-island-extraction algorithm, which we previously developed [Takai and Jones, 2002], on a web site which has a simple user interface to identify CpG islands from submitted sequences of up to 50kb. The web site determines the locations of CpG islands using parameters (lower limit of %GC, ObsCpG/ExpCpG, length) set by the user, to display the value of parameters on each CpG island, and provides a graphical map of CpG dinucleotide distribution and borders of CpG islands. A command-line version of the CpG islands searcher has also been developed for larger sequences. The CpG Island Searcher was applied to the latest sequence and mapping information of human chromosomes 20, 21 and 22, and a total of 2345 CpG islands were extracted and 534 (23%) of them contained first coding exons and 650 (28%) contained other exons. The CpG Island Searcher is available on the World Wide Web at http://www.cpgislands.com or http://www.uscnorris.com/cpgislands/cpg.cgi.

  3. Polyphyletic Nature of Salmonella enterica Serotype Derby and Lineage-Specific Host-Association Revealed by Genome-Wide Analysis

    PubMed Central

    Sévellec, Yann; Vignaud, Marie-Léone; Granier, Sophie A.; Lailler, Renaud; Feurer, Carole; Le Hello, Simon; Mistou, Michel-Yves; Cadel-Six, Sabrina

    2018-01-01

    In France, Salmonella Derby is one of the most prevalent serotypes in pork and poultry meat. Since 2006, it has ranked among the 10 most frequent Salmonella serotypes isolated in humans. In previous publications, Salmonella Derby isolates have been characterized by pulsed field gel electrophoresis (PFGE) and antimicrobial resistance (AMR) profiles revealing the existence of different pulsotypes and AMR phenotypic groups. However, these results suffer from the low discriminatory power of these typing methods. In the present study, we built a collection of 140 strains of S. Derby collected in France from 2014 to 2015 representative of the pork and poultry food sectors. The whole collection was characterized using whole genome sequencing (WGS), providing a significant contribution to the knowledge of this underrepresented serotype, with few genomes available in public databases. The genetic diversity of the S. Derby strains was analyzed by single-nucleotide polymorphism (SNP). We also investigated AMR by both genome and phenotype, the main Salmonella pathogenicity island (SPI) and the fimH gene sequences. Our results show that this S. Derby collection is spread across four different lineages genetically distant by an average of 15k SNPs. These lineages correspond to four multilocus sequence typing (MLST) types (ST39, ST40, ST71, and ST682), which were found to be associated with specific animal hosts: pork and poultry. While the ST71 and ST682 strains are pansusceptible, ST40 isolates are characterized by the multidrug resistant profile STR-SSS-TET. Considering virulence determinants, only ST39 and ST40 present the SPI-23, which has previously been associated with pork enterocyte invasion. Furthermore, the pork ST682 isolates were found to carry mutations in the fimH sequence that could participate in the host tropism of this group. Our phylogenetic analysis demonstrates the polyphyletic nature of the Salmonella serotype Derby and provides an opportunity to identify

  4. Genome Sequence of Vibrio cholerae Strain O1 Ogawa El Tor, Isolated in Mexico, 2013.

    PubMed

    Díaz-Quiñonez, José Alberto; Hernández-Monroy, Irma; López-Martínez, Irma; Ortiz-Alcántara, Joanna; González-Durán, Elizabeth; Ruiz-Matus, Cuitláhuac; Kuri-Morales, Pablo; Ramírez-González, José Ernesto

    2014-10-30

    We present the draft genome sequence of Vibrio cholerae InDRE 3140 recovered in 2013 during a cholera outbreak in Mexico. The genome showed the Vibrio 7th pandemic islands VSP1 and VSP2, the pathogenic islands VPI-1 and VPI-2, the integrative and conjugative element SXT/R391 (ICE-SXT), and both prophages CTXφ and RS1φ. Copyright © 2014 Díaz-Quiñonez et al.

  5. Identification of a recently active Prunus-specific non-autonomous Mutator element with considerable genome shaping force.

    PubMed

    Halász, Júlia; Kodad, Ossama; Hegedűs, Attila

    2014-07-01

    Miniature inverted-repeat transposable elements (MITEs) are known to contribute to the evolution of plants, but only limited information is available for MITEs in the Prunus genome. We identified a MITE that has been named Falling Stones, FaSt. All structural features (349-bp size, 82-bp terminal inverted repeats and 9-bp target site duplications) are consistent with this MITE being a putative member of the Mutator transposase superfamily. FaSt showed a preferential accumulation in the short AT-rich segments of the euchromatin region of the peach genome. DNA sequencing and pollination experiments have been performed to confirm that the nested insertion of FaSt into the S-haplotype-specific F-box gene of apricot resulted in the breakdown of self-incompatibility (SI). A bioinformatics-based survey of the known Rosaceae and other genomes and a newly designed polymerase chain reaction (PCR) assay verified the Prunoideae-specific occurrence of FaSt elements. Phylogenetic analysis suggested a recent activity of FaSt in the Prunus genome. The occurrence of a nested insertion in the apricot genome further supports the recent activity of FaSt in response to abiotic stress conditions. This study reports on a presumably active non-autonomous Mutator element in Prunus that exhibits a major indirect genome shaping force through inducing loss-of-function mutation in the SI locus. © 2014 The Authors The Plant Journal © 2014 John Wiley & Sons Ltd.

  6. Collective Dynamics of Specific Gene Ensembles Crucial for Neutrophil Differentiation: The Existence of Genome Vehicles Revealed

    PubMed Central

    Giuliani, Alessandro; Tomita, Masaru

    2010-01-01

    Cell fate decision remarkably generates specific cell differentiation path among the multiple possibilities that can arise through the complex interplay of high-dimensional genome activities. The coordinated action of thousands of genes to switch cell fate decision has indicated the existence of stable attractors guiding the process. However, origins of the intracellular mechanisms that create “cellular attractor” still remain unknown. Here, we examined the collective behavior of genome-wide expressions for neutrophil differentiation through two different stimuli, dimethyl sulfoxide (DMSO) and all-trans-retinoic acid (atRA). To overcome the difficulties of dealing with single gene expression noises, we grouped genes into ensembles and analyzed their expression dynamics in correlation space defined by Pearson correlation and mutual information. The standard deviation of correlation distributions of gene ensembles reduces when the ensemble size is increased following the inverse square root law, for both ensembles chosen randomly from whole genome and ranked according to expression variances across time. Choosing the ensemble size of 200 genes, we show the two probability distributions of correlations of randomly selected genes for atRA and DMSO responses overlapped after 48 hours, defining the neutrophil attractor. Next, tracking the ranked ensembles' trajectories, we noticed that only certain, not all, fall into the attractor in a fractal-like manner. The removal of these genome elements from the whole genomes, for both atRA and DMSO responses, destroys the attractor providing evidence for the existence of specific genome elements (named “genome vehicle”) responsible for the neutrophil attractor. Notably, within the genome vehicles, genes with low or moderate expression changes, which are often considered noisy and insignificant, are essential components for the creation of the neutrophil attractor. Further investigations along with our findings might

  7. Genome-wide and caste-specific DNA methylomes of the ants Camponotus floridanus and Harpegnathos saltator

    PubMed Central

    Bonasio, Roberto; Li, Qiye; Lian, Jinmin; Mutti, Navdeep S.; Jin, Lijun; Zhao, Hongmei; Zhang, Pei; Wen, Ping; Xiang, Hui; Ding, Yun; Jin, Zonghui; Shen, Steven S.; Wang, Zongji; Wang, Wen; Wang, Jun; Berger, Shelley L.; Liebig, Jürgen; Zhang, Guojie; Reinberg, Danny

    2012-01-01

    SUMMARY Background Ant societies comprise individuals belonging to different castes characterized by specialized morphologies and behaviors. Because ant embryos can follow different developmental trajectories, epigenetic mechanisms must play a role in caste determination. Ants have a full set of DNA methyltransferase and their genomes contain methylcytosine. To determine the relationship between DNA methylation and phenotypic plasticity in ants, we obtained and compared the genome-wide methylomes of different castes and developmental stages of Camponotus floridanus and Harpegnathos saltator. Results In the ant genomes, methylcytosines are found both in CpG and non-CpG contexts and are strongly enriched at exons of active genes. Changes in exonic DNA methylation correlate with alternative splicing events such as exon skipping and alternative splice site selection. Several genes exhibit caste-specific and developmental changes in DNA methylation that are conserved between the two species, including genes involved in reproduction, telomere maintenance, and noncoding RNA metabolism. Several loci are methylated and expressed monoallelically, and in some cases the choice of methylated allele depends on the caste. Conclusions These first ant methylomes and their intra- and inter-species comparison reveal an exonic methylation pattern that points to a connection between DNA methylation and splicing. The presence of monoallelic DNA methylation and the methylation of non-CpG sites in all samples suggest roles in genome regulation in these social insects, including the intriguing possibility of parental or caste-specific genomic imprinting. PMID:22885060

  8. Genome-wide specificity of DNA binding, gene regulation, and chromatin remodeling by TALE- and CRISPR/Cas9-based transcriptional activators

    PubMed Central

    Polstein, Lauren R.; Perez-Pinera, Pablo; Kocak, D. Dewran; Vockley, Christopher M.; Bledsoe, Peggy; Song, Lingyun; Safi, Alexias; Crawford, Gregory E.; Reddy, Timothy E.; Gersbach, Charles A.

    2015-01-01

    Genome engineering technologies based on the CRISPR/Cas9 and TALE systems are enabling new approaches in science and biotechnology. However, the specificity of these tools in complex genomes and the role of chromatin structure in determining DNA binding are not well understood. We analyzed the genome-wide effects of TALE- and CRISPR-based transcriptional activators in human cells using ChIP-seq to assess DNA-binding specificity and RNA-seq to measure the specificity of perturbing the transcriptome. Additionally, DNase-seq was used to assess genome-wide chromatin remodeling that occurs as a result of their action. Our results show that these transcription factors are highly specific in both DNA binding and gene regulation and are able to open targeted regions of closed chromatin independent of gene activation. Collectively, these results underscore the potential for these technologies to make precise changes to gene expression for gene and cell therapies or fundamental studies of gene function. PMID:26025803

  9. Analysis of RET promoter CpG island methylation using methylation-specific PCR (MSP), pyrosequencing, and methylation-sensitive high-resolution melting (MS-HRM): impact on stage II colon cancer patient outcome.

    PubMed

    Draht, Muriel X G; Smits, Kim M; Jooste, Valérie; Tournier, Benjamin; Vervoort, Martijn; Ramaekers, Chantal; Chapusot, Caroline; Weijenberg, Matty P; van Engeland, Manon; Melotte, Veerle

    2016-01-01

    Already since the 1990s, promoter CpG island methylation markers have been considered promising diagnostic, prognostic, and predictive cancer biomarkers. However, so far, only a limited number of DNA methylation markers have been introduced into clinical practice. One reason why the vast majority of methylation markers do not translate into clinical applications is lack of independent validation of methylation markers, often caused by differences in methylation analysis techniques. We recently described RET promoter CpG island methylation as a potential prognostic marker in stage II colorectal cancer (CRC) patients of two independent series. In the current study, we analyzed the RET promoter CpG island methylation of 241 stage II colon cancer patients by direct methylation-specific PCR (MSP), nested-MSP, pyrosequencing, and methylation-sensitive high-resolution melting (MS-HRM). All primers were designed as close as possible to the same genomic region. In order to investigate the effect of different DNA methylation assays on patient outcome, we assessed the clinical sensitivity and specificity as well as the association of RET methylation with overall survival for three and five years of follow-up. Using direct-MSP and nested-MSP, 12.0 % (25/209) and 29.6 % (71/240) of the patients showed RET promoter CpG island methylation. Methylation frequencies detected by pyrosequencing were related to the threshold for positivity that defined RET methylation. Methylation frequencies obtained by pyrosequencing (threshold for positivity at 20 %) and MS-HRM were 13.3 % (32/240) and 13.8 % (33/239), respectively. The pyrosequencing threshold for positivity of 20 % showed the best correlation with MS-HRM and direct-MSP results. Nested-MSP detected RET promoter CpG island methylation in deceased patients with a higher sensitivity (33.1 %) compared to direct-MSP (10.7 %), pyrosequencing (14.4 %), and MS-HRM (15.4 %). While RET methylation frequencies detected by nested

  10. Polycomb-like proteins link the PRC2 complex to CpG islands

    DOE Office of Scientific and Technical Information (OSTI.GOV)

    Li, Haojie; Liefke, Robert; Jiang, Junyi

    The Polycomb repressive complex 2 (PRC2) mainly mediates transcriptional repression1,2 and has essential roles in various biological processes including the maintenance of cell identity and proper differentiation. Polycomb-like (PCL) proteins, such as PHF1, MTF2 and PHF19, are PRC2-associated factors that form sub-complexes with PRC2 core components3, and have been proposed to modulate the enzymatic activity of PRC2 or the recruitment of PRC2 to specific genomic loci4,5,6,7,8,9,10,11,12,13. Mammalian PRC2-binding sites are enriched in CG content, which correlates with CpG islands that display a low level of DNA methylation14. However, the mechanism of PRC2 recruitment to CpG islands is not fully understood.more » Here we solve the crystal structures of the N-terminal domains of PHF1 and MTF2 with bound CpG-containing DNAs in the presence of H3K36me3-containing histone peptides. We show that the extended homologous regions of both proteins fold into a winged-helix structure, which specifically binds to the unmethylated CpG motif but in a completely different manner from the canonical winged-helix DNA recognition motif. We also show that the PCL extended homologous domains are required for efficient recruitment of PRC2 to CpG island-containing promoters in mouse embryonic stem cells. Our research provides the first, to our knowledge, direct evidence to demonstrate that PCL proteins are crucial for PRC2 recruitment to CpG islands, and further clarifies the roles of these proteins in transcriptional regulation in vivo.« less

  11. Comparative genomics of Fructobacillus spp. and Leuconostoc spp. reveals niche-specific evolution of Fructobacillus spp.

    DOE PAGES

    Endo, Akihito; Tanizawa, Yasuhiro; Tanaka, Naoto; ...

    2015-12-29

    In this study, Fructobacillus spp. in fructose-rich niches belong to the family Leuconostocaceae. They were originally classified as Leuconostoc spp., but were later grouped into a novel genus, Fructobacillus , based on their phylogenetic position, morphology and specific biochemical characteristics. The unique characters, so called fructophilic characteristics, had not been reported in the group of lactic acid bacteria, suggesting unique evolution at the genome level. Here we studied four draft genome sequences of Fructobacillus spp. and compared their metabolic properties against those of Leuconostoc spp. As a result, Fructobacillus species possess significantly less protein coding sequences in their small genomes.more » The number of genes was significantly smaller in carbohydrate transport and metabolism. Several other metabolic pathways, including TCA cycle, ubiquinone and other terpenoid-quinone biosynthesis and phosphotransferase systems, were characterized as discriminative pathways between the two genera. The adhE gene for bifunctional acetaldehyde/alcohol dehydrogenase, and genes for subunits of the pyruvate dehydrogenase complex were absent in Fructobacillus spp. The two genera also show different levels of GC contents, which are mainly due to the different GC contents at the third codon position. In conclusion, the present genome characteristics in Fructobacillus spp. suggest reductive evolution that took place to adapt to specific niches.« less

  12. Comparative genomics of Fructobacillus spp. and Leuconostoc spp. reveals niche-specific evolution of Fructobacillus spp.

    DOE Office of Scientific and Technical Information (OSTI.GOV)

    Endo, Akihito; Tanizawa, Yasuhiro; Tanaka, Naoto

    In this study, Fructobacillus spp. in fructose-rich niches belong to the family Leuconostocaceae. They were originally classified as Leuconostoc spp., but were later grouped into a novel genus, Fructobacillus , based on their phylogenetic position, morphology and specific biochemical characteristics. The unique characters, so called fructophilic characteristics, had not been reported in the group of lactic acid bacteria, suggesting unique evolution at the genome level. Here we studied four draft genome sequences of Fructobacillus spp. and compared their metabolic properties against those of Leuconostoc spp. As a result, Fructobacillus species possess significantly less protein coding sequences in their small genomes.more » The number of genes was significantly smaller in carbohydrate transport and metabolism. Several other metabolic pathways, including TCA cycle, ubiquinone and other terpenoid-quinone biosynthesis and phosphotransferase systems, were characterized as discriminative pathways between the two genera. The adhE gene for bifunctional acetaldehyde/alcohol dehydrogenase, and genes for subunits of the pyruvate dehydrogenase complex were absent in Fructobacillus spp. The two genera also show different levels of GC contents, which are mainly due to the different GC contents at the third codon position. In conclusion, the present genome characteristics in Fructobacillus spp. suggest reductive evolution that took place to adapt to specific niches.« less

  13. 33 CFR 117.615 - Plum Island River.

    Code of Federal Regulations, 2013 CFR

    2013-07-01

    ... DRAWBRIDGE OPERATION REGULATIONS Specific Requirements Massachusetts § 117.615 Plum Island River. The draw of the Plum Island Turnpike Bridge, mile 3.3 between Newburyport and Plum Island, shall operate as... 33 Navigation and Navigable Waters 1 2013-07-01 2013-07-01 false Plum Island River. 117.615...

  14. 33 CFR 117.615 - Plum Island River.

    Code of Federal Regulations, 2012 CFR

    2012-07-01

    ... DRAWBRIDGE OPERATION REGULATIONS Specific Requirements Massachusetts § 117.615 Plum Island River. The draw of the Plum Island Turnpike Bridge, mile 3.3 between Newburyport and Plum Island, shall operate as... 33 Navigation and Navigable Waters 1 2012-07-01 2012-07-01 false Plum Island River. 117.615...

  15. 33 CFR 117.615 - Plum Island River.

    Code of Federal Regulations, 2011 CFR

    2011-07-01

    ... DRAWBRIDGE OPERATION REGULATIONS Specific Requirements Massachusetts § 117.615 Plum Island River. The draw of the Plum Island Turnpike Bridge, mile 3.3 between Newburyport and Plum Island, shall operate as... 33 Navigation and Navigable Waters 1 2011-07-01 2011-07-01 false Plum Island River. 117.615...

  16. 33 CFR 117.615 - Plum Island River.

    Code of Federal Regulations, 2010 CFR

    2010-07-01

    ... DRAWBRIDGE OPERATION REGULATIONS Specific Requirements Massachusetts § 117.615 Plum Island River. The draw of the Plum Island Turnpike Bridge, mile 3.3 between Newburyport and Plum Island, shall operate as... 33 Navigation and Navigable Waters 1 2010-07-01 2010-07-01 false Plum Island River. 117.615...

  17. 33 CFR 117.615 - Plum Island River.

    Code of Federal Regulations, 2014 CFR

    2014-07-01

    ... DRAWBRIDGE OPERATION REGULATIONS Specific Requirements Massachusetts § 117.615 Plum Island River. The draw of the Plum Island Turnpike Bridge, mile 3.3 between Newburyport and Plum Island, shall operate as... 33 Navigation and Navigable Waters 1 2014-07-01 2014-07-01 false Plum Island River. 117.615...

  18. Spontaneous Transformation of Murine Epithelial Cells Requires the Early Acquisition of Specific Chromosomal Aneuploidies and Genomic Imbalances

    PubMed Central

    Padilla-Nash, Hesed M.; Hathcock, Karen; McNeil, Nicole E.; Mack, David; Hoeppner, Daniel; Ravin, Rea; Knutsen, Turid; Yonescu, Raluca; Wangsa, Danny; Dorritie, Kathleen; Barenboim, Linda; Hu, Yue; Ried, Thomas

    2011-01-01

    Human carcinomas are defined by recurrent chromosomal aneuploidies, which result in tissue-specific distribution of genomic imbalances. In order to develop models for these genome mutations and determine their role in tumorigenesis, we generated 45 spontaneously transformed murine cell lines from normal epithelial cells derived from bladder, cervix, colon, kidney, lung, and mammary gland. Phenotypic changes, chromosomal aberrations, centrosome number, and telomerase activity were assayed in control uncultured cells and in three subsequent stages of transformation. Supernumerary centrosomes, bi-nucleate cells, and tetraploidy were observed as early as 48 hr after explantation. In addition, telomerase activity increased throughout progression. Live-cell imaging revealed that failure of cytokinesis, not cell fusion, promoted genome duplication. Spectral karyotyping demonstrated that aneuploidy preceded immortalization, consisting predominantly of whole chromosome losses (4, 9, 12, 13, 16, and Y) and gains (1, 10, 15, and 19). After transformation, focal amplifications of the oncogenes Myc and Mdm2 were frequently detected. Fifty percent of the transformed lines resulted in tumors upon injection into immuno-compromised mice. The phenotypic and genomic alterations observed in spontaneously transformed murine epithelial cells recapitulated the aberration pattern observed during human carcinogenesis. The dominant aberration of these cell lines was the presence of specific chromosomal aneuploidies. We propose that our newly derived cancer models will be useful tools to dissect the sequential steps of genome mutations during malignant transformation, and also to identify cancer-specific genes, signaling pathways, and the role of chromosomal instability in this process. PMID:22161874

  19. Comparative analyses of Xanthomonas and Xylella complete genomes.

    PubMed

    Moreira, Leandro M; De Souza, Robson F; Digiampietri, Luciano A; Da Silva, Ana C R; Setubal, João C

    2005-01-01

    Computational analyses of four bacterial genomes of the Xanthomonadaceae family reveal new unique genes that may be involved in adaptation, pathogenicity, and host specificity. The Xanthomonas genus presents 3636 unique genes distributed in 1470 families, while Xylella genus presents 1026 unique genes distributed in 375 families. Among Xanthomonas-specific genes, we highlight a large number of cell wall degrading enzymes, proteases, and iron receptors, a set of energy metabolism genes, second copy of the type II secretion system, type III secretion system, flagella and chemotactic machinery, and the xanthomonadin synthesis gene cluster. Important genes unique to the Xylella genus are an additional copy of a type IV pili gene cluster and the complete machinery of colicin V synthesis and secretion. Intersections of gene sets from both genera reveal a cluster of genes homologous to Salmonella's SPI-7 island in Xanthomonas axonopodis pv citri and Xylella fastidiosa 9a5c, which might be involved in host specificity. Each genome also presents important unique genes, such as an HMS cluster, the kdgT gene, and O-antigen in Xanthomonas axonopodis pv citri; a number of avrBS genes and a distinct O-antigen in Xanthomonas campestris pv campestris, a type I restriction-modification system and a nickase gene in Xylella fastidiosa 9a5c, and a type II restriction-modification system and two genes related to peptidoglycan biosynthesis in Xylella fastidiosa temecula 1. All these differences imply a considerable number of gene gains and losses during the divergence of the four lineages, and are associated with structural genome modifications that may have a direct relation with the mode of transmission, adaptation to specific environments and pathogenicity of each organism.

  20. New Insights into the Classification and Integration Specificity of Streptococcus Integrative Conjugative Elements through Extensive Genome Exploration

    PubMed Central

    Ambroset, Chloé; Coluzzi, Charles; Guédon, Gérard; Devignes, Marie-Dominique; Loux, Valentin; Lacroix, Thomas; Payot, Sophie; Leblond-Bourget, Nathalie

    2016-01-01

    Recent genome analyses suggest that integrative and conjugative elements (ICEs) are widespread in bacterial genomes and therefore play an essential role in horizontal transfer. However, only a few of these elements are precisely characterized and correctly delineated within sequenced bacterial genomes. Even though previous analysis showed the presence of ICEs in some species of Streptococci, the global prevalence and diversity of ICEs was not analyzed in this genus. In this study, we searched for ICEs in the completely sequenced genomes of 124 strains belonging to 27 streptococcal species. These exhaustive analyses revealed 105 putative ICEs and 26 slightly decayed elements whose limits were assessed and whose insertion site was identified. These ICEs were grouped in seven distinct unrelated or distantly related families, according to their conjugation modules. Integration of these streptococcal ICEs is catalyzed either by a site-specific tyrosine integrase, a low-specificity tyrosine integrase, a site-specific single serine integrase, a triplet of site-specific serine integrases or a DDE transposase. Analysis of their integration site led to the detection of 18 target-genes for streptococcal ICE insertion including eight that had not been identified previously (ftsK, guaA, lysS, mutT, rpmG, rpsI, traG, and ebfC). It also suggests that all specificities have evolved to minimize the impact of the insertion on the host. This overall analysis of streptococcal ICEs emphasizes their prevalence and diversity and demonstrates that exchanges or acquisitions of conjugation and recombination modules are frequent. PMID:26779141

  1. Methylation detection oligonucleotide microarray analysis: a high-resolution method for detection of CpG island methylation

    PubMed Central

    Kamalakaran, Sitharthan; Kendall, Jude; Zhao, Xiaoyue; Tang, Chunlao; Khan, Sohail; Ravi, Kandasamy; Auletta, Theresa; Riggs, Michael; Wang, Yun; Helland, Åslaug; Naume, Bjørn; Dimitrova, Nevenka; Børresen-Dale, Anne-Lise; Hicks, Jim; Lucito, Robert

    2009-01-01

    Methylation of CpG islands associated with genes can affect the expression of the proximal gene, and methylation of non-associated CpG islands correlates to genomic instability. This epigenetic modification has been shown to be important in many pathologies, from development and disease to cancer. We report the development of a novel high-resolution microarray that detects the methylation status of over 25 000 CpG islands in the human genome. Experiments were performed to demonstrate low system noise in the methodology and that the array probes have a high signal to noise ratio. Methylation measurements between different cell lines were validated demonstrating the accuracy of measurement. We then identified alterations in CpG islands, both those associated with gene promoters, as well as non-promoter-associated islands in a set of breast and ovarian tumors. We demonstrate that this methodology accurately identifies methylation profiles in cancer and in principle it can differentiate any CpG methylation alterations and can be adapted to analyze other species. PMID:19474344

  2. CpG island methylator phenotype (CIMP) in cancer: causes and implications.

    PubMed

    Teodoridis, Jens M; Hardie, Catriona; Brown, Robert

    2008-09-18

    Strong evidence exists for a subgroup of tumours, from a variety of tissue types, exhibiting concordant tumour specific DNA methylation: the "CpG island methylator phenotype" (CIMP). Occurrence of CIMP is associated with a range of genetic and environmental factors, although the molecular causes are not well-understood. Both increased expression and aberrant targeting of DNA methyltransferases (DNMTs) could contribute to the occurrence of CIMP. One under-explored area is the possibility that DNA damage may induce or select for CIMP during carcinogenesis or treatment of tumours with chemotherapy. DNA damaging agents can induce DNA damage at guanine rich regions throughout the genome, including CpG islands. This DNA damage can result in stalled DNA synthesis, which will lead to localised increased DNMT1 concentration and therefore potentially increased DNA methylation at these sites. Chemotherapy can select for cells which have increased tolerance to DNA damage due to increased lesion bypass, in some cases by mechanisms which involve inactivation of genes by CpG island methylation. CIMP has been associated with worse patient prognosis, probably due to increased epigenetic plasticity. Therefore, further clinical testing of the diagnostic and prognostic value of the current CIMP markers, as well as increasing our understanding of the molecular causes underlying CIMP are required.

  3. The Chlamydia suis Genome Exhibits High Levels of Diversity, Plasticity, and Mobile Antibiotic Resistance: Comparative Genomics of a Recent Livestock Cohort Shows Influence of Treatment Regimes.

    PubMed

    Seth-Smith, Helena M B; Wanninger, Sabrina; Bachmann, Nathan; Marti, Hanna; Qi, Weihong; Donati, Manuela; di Francesco, Antonietta; Polkinghorne, Adam; Borel, Nicole

    2017-03-01

    Chlamydia suis is an endemic pig pathogen, belonging to a fascinating genus of obligate intracellular pathogens. Of particular interest, this is the only chlamydial species to have naturally acquired genes encoding for tetracycline resistance. To date, the distribution and mobility of the Tet-island are not well understood. Our study focused on whole genome sequencing of 29 C. suis isolates from a recent porcine cohort within Switzerland, combined with data from USA tetracycline-resistant isolates. Our findings show that the genome of C. suis is very plastic, with unprecedented diversity, highly affected by recombination and plasmid exchange. A large diversity of isolates circulates within Europe, even within individual Swiss farms, suggesting that C. suis originated around Europe. New World isolates have more restricted diversity and appear to derive from European isolates, indicating that historical strain transfers to the United States have occurred. The architecture of the Tet-island is variable, but the tetA(C) gene is always intact, and recombination has been a major factor in its transmission within C. suis. Selective pressure from tetracycline use within pigs leads to a higher number of Tet-island carrying isolates, which appear to be lost in the absence of such pressure, whereas the loss or gain of the Tet-island from individual strains is not observed. The Tet-island appears to be a recent import into the genome of C. suis, with a possible American origin. © The Author(s) 2017. Published by Oxford University Press on behalf of the Society for Molecular Biology and Evolution.

  4. Intra-specific variation in genome size in maize: cytological and phenotypic correlates

    PubMed Central

    Realini, María Florencia; Poggio, Lidia; Cámara-Hernández, Julián; González, Graciela Esther

    2016-01-01

    Genome size variation accompanies the diversification and evolution of many plant species. Relationships between DNA amount and phenotypic and cytological characteristics form the basis of most hypotheses that ascribe a biological role to genome size. The goal of the present research was to investigate the intra-specific variation in the DNA content in maize populations from Northeastern Argentina and further explore the relationship between genome size and the phenotypic traits seed weight and length of the vegetative cycle. Moreover, cytological parameters such as the percentage of heterochromatin as well as the number, position and sequence composition of knobs were analysed and their relationships with 2C DNA values were explored. The populations analysed presented significant differences in 2C DNA amount, from 4.62 to 6.29 pg, representing 36.15 % of the inter-populational variation. Moreover, intra-populational genome size variation was found, varying from 1.08 to 1.63-fold. The variation in the percentage of knob heterochromatin as well as in the number, chromosome position and sequence composition of the knobs was detected among and within the populations. Although a positive relationship between genome size and the percentage of heterochromatin was observed, a significant correlation was not found. This confirms that other non-coding repetitive DNA sequences are contributing to the genome size variation. A positive relationship between DNA amount and the seed weight has been reported in a large number of species, this relationship was not found in the populations studied here. The length of the vegetative cycle showed a positive correlation with the percentage of heterochromatin. This result allowed attributing an adaptive effect to heterochromatin since the length of this cycle would be optimized via selection for an appropriate percentage of heterochromatin. PMID:26644343

  5. Identification of a functional toxin-antitoxin system located in the genomic island PYG1 of piezophilic hyperthermophilic archaeon Pyrococcus yayanosii.

    PubMed

    Li, Zhen; Song, Qinghao; Wang, Yinzhao; Xiao, Xiang; Xu, Jun

    2018-05-01

    Toxin-antitoxin (TA) system is bacterial or archaeal genetic module consisting of toxin and antitoxin gene that be organized as a bicistronic operon. TA system could elicit programmed cell death, which is supposed to play important roles for the survival of prokaryotic population under various physiological stress conditions. The phage abortive infection system (AbiE family) belongs to bacterial type IV TA system. However, no archaeal AbiE family TA system has been reported so far. In this study, a putative AbiE TA system (PygAT), which is located in a genomic island PYG1 in the chromosome of Pyrococcus yayanosii CH1, was identified and characterized. In Escherichia coli, overexpression of the toxin gene pygT inhibited its growth while the toxic effect can be suppressed by introducing the antitoxin gene pygA in the same cell. PygAT also enhances the stability of shuttle plasmids with archaeal plasmid replication protein Rep75 in E. coli. In P. yayanosii, disruption of antitoxin gene pygA cause a significantly growth delayed under high hydrostatic pressure (HHP). The antitoxin protein PygA can specifically bind to the PygAT promoter region and regulate the transcription of pygT gene in vivo. These results show that PygAT is a functional TA system in P. yayanosii, and also may play a role in the adaptation to HHP environment.

  6. DNA methylation of intragenic CpG islands depends on their transcriptional activity during differentiation and disease

    PubMed Central

    Jeziorska, Danuta M.; Murray, Robert J. S.; De Gobbi, Marco; Gaentzsch, Ricarda; Garrick, David; Ayyub, Helena; Chen, Taiping; Li, En; Telenius, Jelena; Lynch, Magnus; Graham, Bryony; Smith, Andrew J. H.; Lund, Jonathan N.; Hughes, Jim R.; Higgs, Douglas R.

    2017-01-01

    The human genome contains ∼30,000 CpG islands (CGIs). While CGIs associated with promoters nearly always remain unmethylated, many of the ∼9,000 CGIs lying within gene bodies become methylated during development and differentiation. Both promoter and intragenic CGIs may also become abnormally methylated as a result of genome rearrangements and in malignancy. The epigenetic mechanisms by which some CGIs become methylated but others, in the same cell, remain unmethylated in these situations are poorly understood. Analyzing specific loci and using a genome-wide analysis, we show that transcription running across CGIs, associated with specific chromatin modifications, is required for DNA methyltransferase 3B (DNMT3B)-mediated DNA methylation of many naturally occurring intragenic CGIs. Importantly, we also show that a subgroup of intragenic CGIs is not sensitive to this process of transcription-mediated methylation and that this correlates with their individual intrinsic capacity to initiate transcription in vivo. We propose a general model of how transcription could act as a primary determinant of the patterns of CGI methylation in normal development and differentiation, and in human disease. PMID:28827334

  7. Novel genomic tools for specific and real-time detection of biothreat and frequently encountered foodborne pathogens.

    PubMed

    Woubit, Abdela; Yehualaeshet, Teshome; Habtemariam, Tsegaye; Samuel, Temesgen

    2012-04-01

    The bacterial genera Escherichia, Salmonella, Shigella, Vibrio, Yersinia, and Francisella include important food safety and biothreat agents. By extensive mining of the whole genome and protein databases of diverse, closely and distantly related bacterial species and strains, we have identified novel genome regions, which we utilized to develop a rapid detection platform for these pathogens. The specific genomic targets we have identified to design the primers in Francisella tularensis subsp. tularensis, F. tularensis subsp. novicida, Shigella dysenteriae, Salmonella enterica serovar Typhimurium, Vibrio cholerae, Yersinia pestis, and Yersinia pseudotuberculosis contained either known genes or putative proteins. Primer sets were designed from the target regions for use in real-time PCR assays to detect specific biothreat pathogens at species or strain levels. The primer sets were first tested by in silico PCR against whole-genome sequences of different species, subspecies, or strains and then by in vitro PCR against genomic DNA preparations from 23 strains representing six biothreat agents (Escherichia coli O157:H7 strain EDL 933, Shigella dysenteriae, S. enterica serovar Typhi, F. tularensis subsp. tularensis, V. cholerae, and Y. pestis) and six foodborne pathogens (Salmonella Typhimurium, Salmonella Saintpaul, Shigella sonnei, F. tularensis subsp. novicida, Vibrio parahaemolyticus, and Y. pseudotuberculosis). Each pathogen was specifically identifiable at the genus and species levels. Sensitivity assays performed with purified DNA showed the lowest detection limit of 128 fg of DNA/μl for F. tularensis subsp. tularensis. A preliminary test to detect Shigella organisms in a milk matrix also enabled the detection of 6 to 60 CFU/ml. These new tools could ultimately be used to develop platforms to simultaneously detect these pathogens.

  8. Population Genomics of Infectious and Integrated Wolbachia pipientis Genomes in Drosophila ananassae

    PubMed Central

    Choi, Jae Young; Bubnell, Jaclyn E.; Aquadro, Charles F.

    2015-01-01

    Coevolution between Drosophila and its endosymbiont Wolbachia pipientis has many intriguing aspects. For example, Drosophila ananassae hosts two forms of W. pipientis genomes: One being the infectious bacterial genome and the other integrated into the host nuclear genome. Here, we characterize the infectious and integrated genomes of W. pipientis infecting D. ananassae (wAna), by genome sequencing 15 strains of D. ananassae that have either the infectious or integrated wAna genomes. Results indicate evolutionarily stable maternal transmission for the infectious wAna genome suggesting a relatively long-term coevolution with its host. In contrast, the integrated wAna genome showed pseudogene-like characteristics accumulating many variants that are predicted to have deleterious effects if present in an infectious bacterial genome. Phylogenomic analysis of sequence variation together with genotyping by polymerase chain reaction of large structural variations indicated several wAna variants among the eight infectious wAna genomes. In contrast, only a single wAna variant was found among the seven integrated wAna genomes examined in lines from Africa, south Asia, and south Pacific islands suggesting that the integration occurred once from a single infectious wAna genome and then spread geographically. Further analysis revealed that for all D. ananassae we examined with the integrated wAna genomes, the majority of the integrated wAna genomic regions is represented in at least two copies suggesting a double integration or single integration followed by an integrated genome duplication. The possible evolutionary mechanism underlying the widespread geographical presence of the duplicate integration of the wAna genome is an intriguing question remaining to be answered. PMID:26254486

  9. Determining coding CpG islands by identifying regions significant for pattern statistics on Markov chains.

    PubMed

    Singer, Meromit; Engström, Alexander; Schönhuth, Alexander; Pachter, Lior

    2011-09-23

    Recent experimental and computational work confirms that CpGs can be unmethylated inside coding exons, thereby showing that codons may be subjected to both genomic and epigenomic constraint. It is therefore of interest to identify coding CpG islands (CCGIs) that are regions inside exons enriched for CpGs. The difficulty in identifying such islands is that coding exons exhibit sequence biases determined by codon usage and constraints that must be taken into account. We present a method for finding CCGIs that showcases a novel approach we have developed for identifying regions of interest that are significant (with respect to a Markov chain) for the counts of any pattern. Our method begins with the exact computation of tail probabilities for the number of CpGs in all regions contained in coding exons, and then applies a greedy algorithm for selecting islands from among the regions. We show that the greedy algorithm provably optimizes a biologically motivated criterion for selecting islands while controlling the false discovery rate. We applied this approach to the human genome (hg18) and annotated CpG islands in coding exons. The statistical criterion we apply to evaluating islands reduces the number of false positives in existing annotations, while our approach to defining islands reveals significant numbers of undiscovered CCGIs in coding exons. Many of these appear to be examples of functional epigenetic specialization in coding exons.

  10. Chromosome arm-specific BAC end sequences permit comparative analysis of homoeologous chromosomes and genomes of polyploid wheat

    PubMed Central

    2012-01-01

    Background Bread wheat, one of the world’s staple food crops, has the largest, highly repetitive and polyploid genome among the cereal crops. The wheat genome holds the key to crop genetic improvement against challenges such as climate change, environmental degradation, and water scarcity. To unravel the complex wheat genome, the International Wheat Genome Sequencing Consortium (IWGSC) is pursuing a chromosome- and chromosome arm-based approach to physical mapping and sequencing. Here we report on the use of a BAC library made from flow-sorted telosomic chromosome 3A short arm (t3AS) for marker development and analysis of sequence composition and comparative evolution of homoeologous genomes of hexaploid wheat. Results The end-sequencing of 9,984 random BACs from a chromosome arm 3AS-specific library (TaaCsp3AShA) generated 11,014,359 bp of high quality sequence from 17,591 BAC-ends with an average length of 626 bp. The sequence represents 3.2% of t3AS with an average DNA sequence read every 19 kb. Overall, 79% of the sequence consisted of repetitive elements, 1.38% as coding regions (estimated 2,850 genes) and another 19% of unknown origin. Comparative sequence analysis suggested that 70-77% of the genes present in both 3A and 3B were syntenic with model species. Among the transposable elements, gypsy/sabrina (12.4%) was the most abundant repeat and was significantly more frequent in 3A compared to homoeologous chromosome 3B. Twenty novel repetitive sequences were also identified using de novo repeat identification. BESs were screened to identify simple sequence repeats (SSR) and transposable element junctions. A total of 1,057 SSRs were identified with a density of one per 10.4 kb, and 7,928 junctions between transposable elements (TE) and other sequences were identified with a density of one per 1.39 kb. With the objective of enhancing the marker density of chromosome 3AS, oligonucleotide primers were successfully designed from 758 SSRs and 695

  11. Reconstructing Austronesian population history in Island Southeast Asia

    PubMed Central

    Lipson, Mark; Loh, Po-Ru; Patterson, Nick; Moorjani, Priya; Ko, Ying-Chin; Stoneking, Mark; Berger, Bonnie; Reich, David

    2014-01-01

    Austronesian languages are spread across half the globe, from Easter Island to Madagascar. Evidence from linguistics and archaeology indicates that the ‘Austronesian expansion,’ which began 4,000–5,000 years ago, likely had roots in Taiwan, but the ancestry of present-day Austronesian-speaking populations remains controversial. Here, we analyse genome-wide data from 56 populations using new methods for tracing ancestral gene flow, focusing primarily on Island Southeast Asia. We show that all sampled Austronesian groups harbour ancestry that is more closely related to aboriginal Taiwanese than to any present-day mainland population. Surprisingly, western Island Southeast Asian populations have also inherited ancestry from a source nested within the variation of present-day populations speaking Austro-Asiatic languages, which have historically been nearly exclusive to the mainland. Thus, either there was once a substantial Austro-Asiatic presence in Island Southeast Asia, or Austronesian speakers migrated to and through the mainland, admixing there before continuing to western Indonesia. PMID:25137359

  12. Reconstructing Austronesian population history in Island Southeast Asia.

    PubMed

    Lipson, Mark; Loh, Po-Ru; Patterson, Nick; Moorjani, Priya; Ko, Ying-Chin; Stoneking, Mark; Berger, Bonnie; Reich, David

    2014-08-19

    Austronesian languages are spread across half the globe, from Easter Island to Madagascar. Evidence from linguistics and archaeology indicates that the 'Austronesian expansion,' which began 4,000-5,000 years ago, likely had roots in Taiwan, but the ancestry of present-day Austronesian-speaking populations remains controversial. Here, we analyse genome-wide data from 56 populations using new methods for tracing ancestral gene flow, focusing primarily on Island Southeast Asia. We show that all sampled Austronesian groups harbour ancestry that is more closely related to aboriginal Taiwanese than to any present-day mainland population. Surprisingly, western Island Southeast Asian populations have also inherited ancestry from a source nested within the variation of present-day populations speaking Austro-Asiatic languages, which have historically been nearly exclusive to the mainland. Thus, either there was once a substantial Austro-Asiatic presence in Island Southeast Asia, or Austronesian speakers migrated to and through the mainland, admixing there before continuing to western Indonesia.

  13. Cre/lox-recombinase-mediated cassette exchange for reversible site-specific genomic targeting of the disease vector, Aedes aegypti

    USDA-ARS?s Scientific Manuscript database

    Site-specific genome modification is an important tool for mosquito functional genomics studies that help to uncover gene functions, identify gene regulatory elements, and perform comparative gene expression studies, all of which contribute to a better understanding of mosquito biology and are thus ...

  14. Comprehensive Genome-Wide Classification Reveals That Many Plant-Specific Transcription Factors Evolved in Streptophyte Algae

    PubMed Central

    Wilhelmsson, Per K I; Mühlich, Cornelia; Ullrich, Kristian K

    2017-01-01

    Abstract Plant genomes encode many lineage-specific, unique transcription factors. Expansion of such gene families has been previously found to coincide with the evolution of morphological complexity, although comparative analyses have been hampered by severe sampling bias. Here, we make use of the recently increased availability of plant genomes. We have updated and expanded previous rule sets for domain-based classification of transcription associated proteins (TAPs), comprising transcription factors and transcriptional regulators. The genome-wide annotation of these protein families has been analyzed and made available via the novel TAPscan web interface. We find that many TAP families previously thought to be specific for land plants actually evolved in streptophyte (charophyte) algae; 26 out of 36 TAP family gains are inferred to have occurred in the common ancestor of the Streptophyta (uniting the land plants—Embryophyta—with their closest algal relatives). In contrast, expansions of TAP families were found to occur throughout streptophyte evolution. 17 out of 76 expansion events were found to be common to all land plants and thus probably evolved concomitant with the water-to-land-transition. PMID:29216360

  15. The island rule: made to be broken?

    PubMed Central

    Meiri, Shai; Cooper, Natalie; Purvis, Andy

    2007-01-01

    The island rule is a hypothesis whereby small mammals evolve larger size on islands while large insular mammals dwarf. The rule is believed to emanate from small mammals growing larger to control more resources and enhance metabolic efficiency, while large mammals evolve smaller size to reduce resource requirements and increase reproductive output. We show that there is no evidence for the existence of the island rule when phylogenetic comparative methods are applied to a large, high-quality dataset. Rather, there are just a few clade-specific patterns: carnivores; heteromyid rodents; and artiodactyls typically evolve smaller size on islands whereas murid rodents usually grow larger. The island rule is probably an artefact of comparing distantly related groups showing clade-specific responses to insularity. Instead of a rule, size evolution on islands is likely to be governed by the biotic and abiotic characteristics of different islands, the biology of the species in question and contingency. PMID:17986433

  16. Genome-wide specificity of DNA binding, gene regulation, and chromatin remodeling by TALE- and CRISPR/Cas9-based transcriptional activators.

    PubMed

    Polstein, Lauren R; Perez-Pinera, Pablo; Kocak, D Dewran; Vockley, Christopher M; Bledsoe, Peggy; Song, Lingyun; Safi, Alexias; Crawford, Gregory E; Reddy, Timothy E; Gersbach, Charles A

    2015-08-01

    Genome engineering technologies based on the CRISPR/Cas9 and TALE systems are enabling new approaches in science and biotechnology. However, the specificity of these tools in complex genomes and the role of chromatin structure in determining DNA binding are not well understood. We analyzed the genome-wide effects of TALE- and CRISPR-based transcriptional activators in human cells using ChIP-seq to assess DNA-binding specificity and RNA-seq to measure the specificity of perturbing the transcriptome. Additionally, DNase-seq was used to assess genome-wide chromatin remodeling that occurs as a result of their action. Our results show that these transcription factors are highly specific in both DNA binding and gene regulation and are able to open targeted regions of closed chromatin independent of gene activation. Collectively, these results underscore the potential for these technologies to make precise changes to gene expression for gene and cell therapies or fundamental studies of gene function. © 2015 Polstein et al.; Published by Cold Spring Harbor Laboratory Press.

  17. Genome-wide oxidative bisulfite sequencing identifies sex-specific methylation differences in the human placenta

    PubMed Central

    Johnson, Michelle D; Dopierala, Justyna

    2018-01-01

    ABSTRACT DNA methylation is an important regulator of gene function. Fetal sex is associated with the risk of several specific pregnancy complications related to placental function. However, the association between fetal sex and placental DNA methylation remains poorly understood. We carried out whole-genome oxidative bisulfite sequencing in the placentas of two healthy female and two healthy male pregnancies generating an average genome depth of coverage of 25x. Most highly ranked differentially methylated regions (DMRs) were located on the X chromosome but we identified a 225 kb sex-specific DMR in the body of the CUB and Sushi Multiple Domains 1 (CSMD1) gene on chromosome 8. The sex-specific differential methylation pattern observed in this region was validated in additional placentas using in-solution target capture. In a new RNA-seq data set from 64 female and 67 male placentas, CSMD1 mRNA was 1.8-fold higher in male than in female placentas (P value = 8.5 × 10−7, Mann-Whitney test). Exon-level quantification of CSMD1 mRNA from these 131 placentas suggested a likely placenta-specific CSMD1 isoform not detected in the 21 somatic tissues analyzed. We show that the gene body of an autosomal gene, CSMD1, is differentially methylated in a sex- and placental-specific manner, displaying sex-specific differences in placental transcript abundance. PMID:29376485

  18. Systematic, genome-wide, sex-specific linkage of cardiovascular traits in French Canadians.

    PubMed

    Seda, Ondrej; Tremblay, Johanne; Gaudet, Daniel; Brunelle, Pierre-Luc; Gurau, Alexandru; Merlo, Ettore; Pilote, Louise; Orlov, Sergei N; Boulva, Francis; Petrovich, Milan; Kotchen, Theodore A; Cowley, Allen W; Hamet, Pavel

    2008-04-01

    The sexual dimorphism of cardiovascular traits, as well as susceptibility to a variety of related diseases, has long been recognized, yet their sex-specific genomic determinants are largely unknown. We systematically assessed the sex-specific heritability and linkage of 539 hemodynamic, metabolic, anthropometric, and humoral traits in 120 French-Canadian families from the Saguenay-Lac-St-Jean region of Quebec, Canada. We performed multipoint linkage analysis using microsatellite markers followed by peak-wide linkage scan based on Affymetrix Human Mapping 50K Array Xba240 single nucleotide polymorphism genotypes in 3 settings, including the entire sample and then separately in men and women. Nearly one half of the traits were age and sex independent, one quarter were both age and sex dependent, and one eighth were exclusively age or sex dependent. Sex-specific phenotypes are most frequent in heart rate and blood pressure categories, whereas sex- and age-independent determinants are predominant among humoral and biochemical parameters. Twenty sex-specific loci passing multiple testing criteria were corroborated by 2-point single nucleotide polymorphism linkage. Several resting systolic blood pressure measurements showed significant genotype-by-sex interaction, eg, male-specific locus at chromosome 12 (male-female logarithm of odds difference: 4.16; interaction P=0.0002), which was undetectable in the entire population, even after adjustment for sex. Detailed interrogation of this locus revealed a 220-kb block overlapping parts of TAO-kinase 3 and SUDS3 genes. In summary, a large number of complex cardiovascular traits display significant sexual dimorphism, for which we have demonstrated genomic determinants at the haplotype level. Many of these would have been missed in a traditional, sex-adjusted setting.

  19. Genome-Wide Comparison of Magnaporthe Species Reveals a Host-Specific Pattern of Secretory Proteins and Transposable Elements

    PubMed Central

    Gowda, Malali

    2016-01-01

    Blast disease caused by the Magnaporthe species is a major factor affecting the productivity of rice, wheat and millets. This study was aimed at generating genomic information for rice and non-rice Magnaporthe isolates to understand the extent of genetic variation. We have sequenced the whole genome of the Magnaporthe isolates, infecting rice (leaf and neck), finger millet (leaf and neck), foxtail millet (leaf) and buffel grass (leaf). Rice and finger millet isolates infecting both leaf and neck tissues were sequenced, since the damage and yield loss caused due to neck blast is much higher as compared to leaf blast. The genome-wide comparison was carried out to study the variability in gene content, candidate effectors, repeat element distribution, genes involved in carbohydrate metabolism and SNPs. The analysis of repeat element footprints revealed some genes such as naringenin, 2-oxoglutarate 3-dioxygenase being targeted by Pot2 and Occan, in isolates from different host species. Some repeat insertions were host-specific while other insertions were randomly shared between isolates. The distributions of repeat elements, secretory proteins, CAZymes and SNPs showed significant variation across host-specific lineages of Magnaporthe indicating an independent genome evolution orchestrated by multiple genomic factors. PMID:27658241

  20. PSE-HMM: genome-wide CNV detection from NGS data using an HMM with Position-Specific Emission probabilities.

    PubMed

    Malekpour, Seyed Amir; Pezeshk, Hamid; Sadeghi, Mehdi

    2016-11-03

    Copy Number Variation (CNV) is envisaged to be a major source of large structural variations in the human genome. In recent years, many studies apply Next Generation Sequencing (NGS) data for the CNV detection. However, still there is a necessity to invent more accurate computational tools. In this study, mate pair NGS data are used for the CNV detection in a Hidden Markov Model (HMM). The proposed HMM has position specific emission probabilities, i.e. a Gaussian mixture distribution. Each component in the Gaussian mixture distribution captures a different type of aberration that is observed in the mate pairs, after being mapped to the reference genome. These aberrations may include any increase (decrease) in the insertion size or change in the direction of mate pairs that are mapped to the reference genome. This HMM with Position-Specific Emission probabilities (PSE-HMM) is utilized for the genome-wide detection of deletions and tandem duplications. The performance of PSE-HMM is evaluated on a simulated dataset and also on a real data of a Yoruban HapMap individual, NA18507. PSE-HMM is effective in taking observation dependencies into account and reaches a high accuracy in detecting genome-wide CNVs. MATLAB programs are available at http://bs.ipm.ir/softwares/PSE-HMM/ .

  1. Complete Genome Analysis of Thermus parvatiensis and Comparative Genomics of Thermus spp. Provide Insights into Genetic Variability and Evolution of Natural Competence as Strategic Survival Attributes

    PubMed Central

    Tripathi, Charu; Mishra, Harshita; Khurana, Himani; Dwivedi, Vatsala; Kamra, Komal; Negi, Ram K.; Lal, Rup

    2017-01-01

    Thermophilic environments represent an interesting niche. Among thermophiles, the genus Thermus is among the most studied genera. In this study, we have sequenced the genome of Thermus parvatiensis strain RL, a thermophile isolated from Himalayan hot water springs (temperature >96°C) using PacBio RSII SMRT technique. The small genome (2.01 Mbp) comprises a chromosome (1.87 Mbp) and a plasmid (143 Kbp), designated in this study as pTP143. Annotation revealed a high number of repair genes, a squeezed genome but containing highly plastic plasmid with transposases, integrases, mobile elements and hypothetical proteins (44%). We performed a comparative genomic study of the group Thermus with an aim of analysing the phylogenetic relatedness as well as niche specific attributes prevalent among the group. We compared the reference genome RL with 16 Thermus genomes to assess their phylogenetic relationships based on 16S rRNA gene sequences, average nucleotide identity (ANI), conserved marker genes (31 and 400), pan genome and tetranucleotide frequency. The core genome of the analyzed genomes contained 1,177 core genes and many singleton genes were detected in individual genomes, reflecting a conserved core but adaptive pan repertoire. We demonstrated the presence of metagenomic islands (chromosome:5, plasmid:5) by recruiting raw metagenomic data (from the same niche) against the genomic replicons of T. parvatiensis. We also dissected the CRISPR loci wide all genomes and found widespread presence of this system across Thermus genomes. Additionally, we performed a comparative analysis of competence loci wide Thermus genomes and found evidence for recent horizontal acquisition of the locus and continued dispersal among members reflecting that natural competence is a beneficial survival trait among Thermus members and its acquisition depicts unending evolution in order to accomplish optimal fitness. PMID:28798737

  2. The Qatar genome: a population-specific tool for precision medicine in the Middle East

    PubMed Central

    Fakhro, Khalid A; Staudt, Michelle R; Ramstetter, Monica Denise; Robay, Amal; Malek, Joel A; Badii, Ramin; Al-Marri, Ajayeb Al-Nabet; Khalil, Charbel Abi; Al-Shakaki, Alya; Chidiac, Omar; Stadler, Dora; Zirie, Mahmoud; Jayyousi, Amin; Salit, Jacqueline; Mezey, Jason G; Crystal, Ronald G; Rodriguez-Flores, Juan L

    2016-01-01

    Reaching the full potential of precision medicine depends on the quality of personalized genome interpretation. In order to facilitate precision medicine in regions of the Middle East and North Africa (MENA), a population-specific genome for the indigenous Arab population of Qatar (QTRG) was constructed by incorporating allele frequency data from sequencing of 1,161 Qataris, representing 0.4% of the population. A total of 20.9 million single nucleotide polymorphisms (SNPs) and 3.1 million indels were observed in Qatar, including an average of 1.79% novel variants per individual genome. Replacement of the GRCh37 standard reference with QTRG in a best practices genome analysis workflow resulted in an average of 7* deeper coverage depth (an improvement of 23%) and 756,671 fewer variants on average, a reduction of 16% that is attributed to common Qatari alleles being present in QTRG. The benefit for using QTRG varies across ancestries, a factor that should be taken into consideration when selecting an appropriate reference for analysis. PMID:27408750

  3. Further evidence of an Amerindian contribution to the Polynesian gene pool on Easter Island.

    PubMed

    Thorsby, E; Flåm, S T; Woldseth, B; Dupuy, B M; Sanchez-Mazas, A; Fernandez-Vina, M A

    2009-06-01

    Available evidence suggests a Polynesian origin of the Easter Island population. We recently found that some native Easter Islanders also carried some common American Indian (Amerindian) human leukocyte antigen (HLA) alleles, which probably were introduced before Europeans discovered the island in 1722. In this study, we report molecular genetic investigations of 21 other selected native Easter Islanders. Analysis of mitochondrial DNA and Y chromosome markers showed no traces of an Amerindian contribution. However, high-resolution genomic HLA typing showed that two individuals carried some other common Amerindian HLA alleles, different from those found in our previous investigations. The new data support our previous evidence of an Amerindian contribution to the gene pool on Easter Island.

  4. Brucellosis vaccines based on the open reading frames from genomic island 3 of Brucella abortus.

    PubMed

    Gómez, Leonardo; Alvarez, Francisco; Betancur, Daniel; Oñate, Angel

    2018-05-17

    Brucella abortus is the etiological agent of brucellosis, a zoonotic disease affecting cattle and humans. This disease has been partially controlled in cattle by immunization with live attenuated B. abortus S19 and RB51 strains. However, use of these vaccine strains has been associated with safety issues in animals and humans. New vaccines have since emerged in the prevention of brucellosis, particularly DNA vaccines, which have shown effectiveness and a good safety profile. Their protection efficacy in mice is associated with the induction of Th1 type and cytotoxic T cell mediated immune response against structural antigens and virulence factors expressed during B. abortus infection. Some antigenic candidate for vaccine design against brucellosis (mainly DNA vaccines) have been obtained from genomic island 3 (GI-3) of B. abortus, which encodes several open reading frames (ORFs) involved in the intracellular survival and virulence of this pathogen. The immunogenicity and protection conferred by these DNA vaccines in a murine model is reviewed in this article, suggesting that some of them could be safe and effective vaccine candidates against to prevent B. abortus infection. Copyright © 2018 Elsevier Ltd. All rights reserved.

  5. The genome-wide DNA sequence specificity of the anti-tumour drug bleomycin in human cells.

    PubMed

    Murray, Vincent; Chen, Jon K; Tanaka, Mark M

    2016-07-01

    The cancer chemotherapeutic agent, bleomycin, cleaves DNA at specific sites. For the first time, the genome-wide DNA sequence specificity of bleomycin breakage was determined in human cells. Utilising Illumina next-generation DNA sequencing techniques, over 200 million bleomycin cleavage sites were examined to elucidate the bleomycin genome-wide DNA selectivity. The genome-wide bleomycin cleavage data were analysed by four different methods to determine the cellular DNA sequence specificity of bleomycin strand breakage. For the most highly cleaved DNA sequences, the preferred site of bleomycin breakage was at 5'-GT* dinucleotide sequences (where the asterisk indicates the bleomycin cleavage site), with lesser cleavage at 5'-GC* dinucleotides. This investigation also determined longer bleomycin cleavage sequences, with preferred cleavage at 5'-GT*A and 5'- TGT* trinucleotide sequences, and 5'-TGT*A tetranucleotides. For cellular DNA, the hexanucleotide DNA sequence 5'-RTGT*AY (where R is a purine and Y is a pyrimidine) was the most highly cleaved DNA sequence. It was striking that alternating purine-pyrimidine sequences were highly cleaved by bleomycin. The highest intensity cleavage sites in cellular and purified DNA were very similar although there were some minor differences. Statistical nucleotide frequency analysis indicated a G nucleotide was present at the -3 position (relative to the cleavage site) in cellular DNA but was absent in purified DNA.

  6. Chromosomal targeting by CRISPR-Cas systems can contribute to genome plasticity in bacteria

    PubMed Central

    Dy, Ron L; Pitman, Andrew R; Fineran, Peter C

    2013-01-01

    The clustered regularly interspaced short palindromic repeats (CRISPR) and their associated (Cas) proteins form adaptive immune systems in bacteria to combat phage and other foreign genetic elements. Typically, short spacer sequences are acquired from the invader DNA and incorporated into CRISPR arrays in the bacterial genome. Small RNAs are generated that contain these spacer sequences and enable sequence-specific destruction of the foreign nucleic acids. Occasionally, spacers are acquired from the chromosome, which instead leads to targeting of the host genome. Chromosomal targeting is highly toxic to the bacterium, providing a strong selective pressure for a variety of evolutionary routes that enable host cell survival. Mutations that inactivate the CRISPR-Cas functionality, such as within the cas genes, CRISPR repeat, protospacer adjacent motifs (PAM), and target sequence, mediate escape from toxicity. This self-targeting might provide some explanation for the incomplete distribution of CRISPR-Cas systems in less than half of sequenced bacterial genomes. More importantly, self-genome targeting can cause large-scale genomic alterations, including remodeling or deletion of pathogenicity islands and other non-mobile chromosomal regions. While control of horizontal gene transfer is perceived as their main function, our recent work illuminates an alternative role of CRISPR-Cas systems in causing host genomic changes and influencing bacterial evolution. PMID:24251073

  7. Visualization of specific repetitive genomic sequences with fluorescent TALEs in Arabidopsis thaliana

    PubMed Central

    Fujimoto, Satoru; Sugano, Shigeo S.; Kuwata, Keiko; Osakabe, Keishi; Matsunaga, Sachihiro

    2016-01-01

    Live imaging of the dynamics of nuclear organization provides the opportunity to uncover the mechanisms responsible for four-dimensional genome architecture. Here, we describe the use of fluorescent protein (FP) fusions of transcription activator-like effectors (TALEs) to visualize endogenous genomic sequences in Arabidopsis thaliana. The ability to engineer sequence-specific TALEs permits the investigation of precise genomic sequences. We could detect TALE-FP signals associated with centromeric, telomeric, and rDNA repeats and the signal distribution was consistent with that observed by fluorescent in situ hybridization. TALE-FPs are advantageous because they permit the observation of intact tissues. We used our TALE-FP method to investigate the nuclei of several multicellular plant tissues including roots, hypocotyls, leaves, and flowers. Because TALE-FPs permit live-cell imaging, we successfully observed the temporal dynamics of centromeres and telomeres in plant organs. Fusing TALEs to multimeric FPs enhanced the signal intensity when observing telomeres. We found that the mobility of telomeres was different in sub-nuclear regions. Transgenic plants stably expressing TALE-FPs will provide new insights into chromatin organization and dynamics in multicellular organisms. PMID:27811079

  8. Autumn monitoring of resident avifauna on Guana Island, British Virgin Islands

    USGS Publications Warehouse

    Boal, Clint W.; Wunderle, Joseph M.; Arendt, Wayne J.

    2013-01-01

    Although the Caribbean region is considered a biodiversity hotspot and a priority for ecological conservation efforts, little information exists on population trends of West Indian landbirds. We combined avian survey data collected from three studies spanning a 16-year period on a small island with a minimal human presence in the British Virgin Islands. Although abundances varied among surveys, the same species were detected with rare exceptions. Despite stability in species composition, the resident landbirds were variable in their individual detectabilities. Survey detections relatively mirrored net captures for some species, but are quite different for others. We suspect that this is likely due to differences in detectability due to species-specific behaviors mediated by environmental conditions, such as rainfall, during the month or months prior to our surveys. It is difficult to assess the influence of timing or amount of precipitation on bird detections rates among our surveys due to a lack of consistent collection of location-specific weather data in the British Virgin Islands. Our study suggests monitoring efforts conducted in concert with collection of site-specific climate data would facilitate improved interpretation of survey data and a better understanding of avian species response to climate mediated changes.

  9. Genomic characterization of a fructophilic bee symbiont Lactobacillus kunkeei reveals its niche-specific adaptation.

    PubMed

    Maeno, Shintaro; Tanizawa, Yasuhiro; Kanesaki, Yu; Kubota, Eri; Kumar, Himanshu; Dicks, Leon; Salminen, Seppo; Nakagawa, Junichi; Arita, Masanori; Endo, Akihito

    2016-12-01

    Lactobacillus kunkeei is classified as a sole obligate fructophilic lactic acid bacterium that is found in fructose-rich niches, including the guts of honeybees. The species is differentiated from other lactobacilli based on its poor growth with glucose, enhanced growth in the presence of oxygen and other electron acceptors, and production of high concentrations of acetate from the metabolism of glucose. These characteristics are similar to phylogenetically distant Fructobacillus spp. In the present study, the genomic structure of L. kunkeei was characterized by using 16 different strains, and it had significantly less genes and smaller genomes when compared with other lactobacilli. Functional gene classification revealed that L. kunkeei had lost genes specifically involved in carbohydrate transport and metabolism. The species also lacked most of the genes for respiration, although growth was enhanced in the presence of oxygen. The adhE gene of L. kunkeei, encoding a bifunctional alcohol dehydrogenase (ADH)/aldehyde dehydrogenase (ALDH) protein, lacked the part encoding the ADH domain, which is reported here for the first time in lactic acid bacteria. The deletion resulted in the lack of ADH activity, implying a requirement for electron acceptors in glucose assimilation. These results clearly indicated that L. kunkeei had undergone a specific reductive evolution in order to adapt to fructose-rich environments. The reduction characteristics were similar to those of Fructobacillus spp., but distinct from other lactobacilli with small genomes, such as Lactobacillus gasseri and Lactobacillus vaginalis. Fructose-richness thus induced an environment-specific gene reduction in phylogenetically distant microorganisms. Copyright © 2016 Elsevier GmbH. All rights reserved.

  10. Peroxisome Proliferator-Activated Receptor Subtype- and Cell-Type-Specific Activation of Genomic Target Genes upon Adenoviral Transgene Delivery

    PubMed Central

    Nielsen, Ronni; Grøntved, Lars; Stunnenberg, Hendrik G.; Mandrup, Susanne

    2006-01-01

    Investigations of the molecular events involved in activation of genomic target genes by peroxisome proliferator-activated receptors (PPARs) have been hampered by the inability to establish a clean on/off state of the receptor in living cells. Here we show that the combination of adenoviral delivery and chromatin immunoprecipitation (ChIP) is ideal for dissecting these mechanisms. Adenoviral delivery of PPARs leads to a rapid and synchronous expression of the PPAR subtypes, establishment of transcriptional active complexes at genomic loci, and immediate activation of even silent target genes. We demonstrate that PPARγ2 possesses considerable ligand-dependent as well as independent transactivation potential and that agonists increase the occupancy of PPARγ2/retinoid X receptor at PPAR response elements. Intriguingly, by direct comparison of the PPARs (α, γ, and β/δ), we show that the subtypes have very different abilities to gain access to target sites and that in general the genomic occupancy correlates with the ability to activate the corresponding target gene. In addition, the specificity and potency of activation by PPAR subtypes are highly dependent on the cell type. Thus, PPAR subtype-specific activation of genomic target genes involves an intricate interplay between the properties of the subtype- and cell-type-specific settings at the individual target loci. PMID:16847324

  11. Coverage Bias and Sensitivity of Variant Calling for Four Whole-genome Sequencing Technologies

    PubMed Central

    Lasitschka, Bärbel; Jones, David; Northcott, Paul; Hutter, Barbara; Jäger, Natalie; Kool, Marcel; Taylor, Michael; Lichter, Peter; Pfister, Stefan; Wolf, Stephan; Brors, Benedikt; Eils, Roland

    2013-01-01

    The emergence of high-throughput, next-generation sequencing technologies has dramatically altered the way we assess genomes in population genetics and in cancer genomics. Currently, there are four commonly used whole-genome sequencing platforms on the market: Illumina’s HiSeq2000, Life Technologies’ SOLiD 4 and its completely redesigned 5500xl SOLiD, and Complete Genomics’ technology. A number of earlier studies have compared a subset of those sequencing platforms or compared those platforms with Sanger sequencing, which is prohibitively expensive for whole genome studies. Here we present a detailed comparison of the performance of all currently available whole genome sequencing platforms, especially regarding their ability to call SNVs and to evenly cover the genome and specific genomic regions. Unlike earlier studies, we base our comparison on four different samples, allowing us to assess the between-sample variation of the platforms. We find a pronounced GC bias in GC-rich regions for Life Technologies’ platforms, with Complete Genomics performing best here, while we see the least bias in GC-poor regions for HiSeq2000 and 5500xl. HiSeq2000 gives the most uniform coverage and displays the least sample-to-sample variation. In contrast, Complete Genomics exhibits by far the smallest fraction of bases not covered, while the SOLiD platforms reveal remarkable shortcomings, especially in covering CpG islands. When comparing the performance of the four platforms for calling SNPs, HiSeq2000 and Complete Genomics achieve the highest sensitivity, while the SOLiD platforms show the lowest false positive rate. Finally, we find that integrating sequencing data from different platforms offers the potential to combine the strengths of different technologies. In summary, our results detail the strengths and weaknesses of all four whole-genome sequencing platforms. It indicates application areas that call for a specific sequencing platform and disallow other platforms

  12. Strand-specific, real-time RT-PCR assays for quantification of genomic and positive-sense RNAs of the fish rhabdovirus, Infectious hematopoietic necrosis virus

    USGS Publications Warehouse

    Purcell, Maureen K.; Hart, S. Alexandra; Kurath, Gael; Winton, James R.

    2006-01-01

    The fish rhabdovirus, Infectious hematopoietic necrosis virus (IHNV), is an important pathogen of salmonids. Cell culture assays have traditionally been used to quantify levels of IHNV in samples; however, real-time or quantitative RT-PCR assays have been proposed as a rapid alternative. For viruses having a single-stranded, negative-sense RNA genome, standard qRT-PCR assays do not distinguish between the negative-sense genome and positive-sense RNA species including mRNA and anti-genome. Thus, these methods do not determine viral genome copy number. This study reports development of strand-specific, qRT-PCR assays that use tagged primers for enhancing strand specificity during cDNA synthesis and quantitative PCR. Protocols were developed for positive-strand specific (pss-qRT-PCR) and negative-strand specific (nss-qRT-PCR) assays for IHNV glycoprotein (G) gene sequences. Validation with synthetic RNA transcripts demonstrated the assays could discriminate the correct strand with greater than 1000-fold fidelity. The number of genome copies in livers of IHNV-infected fish determined by nss-qRT-PCR was, on average, 8000-fold greater than the number of infectious units as determined by plaque assay. We also compared the number of genome copies with the quantity of positive-sense RNA and determined that the ratio of positive-sense molecules to negative-sense genome copies was, on average, 2.7:1. Potential future applications of these IHNV strand-specific qRT-PCR assays are discussed.

  13. Adaptive introgression from distant Caribbean islands contributed to the diversification of a microendemic adaptive radiation of trophic specialist pupfishes

    PubMed Central

    2017-01-01

    Rapid diversification often involves complex histories of gene flow that leave variable and conflicting signatures of evolutionary relatedness across the genome. Identifying the extent and source of variation in these evolutionary relationships can provide insight into the evolutionary mechanisms involved in rapid radiations. Here we compare the discordant evolutionary relationships associated with species phenotypes across 42 whole genomes from a sympatric adaptive radiation of Cyprinodon pupfishes endemic to San Salvador Island, Bahamas and several outgroup pupfish species in order to understand the rarity of these trophic specialists within the larger radiation of Cyprinodon. 82% of the genome depicts close evolutionary relationships among the San Salvador Island species reflecting their geographic proximity, but the vast majority of variants fixed between specialist species lie in regions with discordant topologies. Top candidate adaptive introgression regions include signatures of selective sweeps and adaptive introgression of genetic variation from a single population in the northwestern Bahamas into each of the specialist species. Hard selective sweeps of genetic variation on San Salvador Island contributed 5 times more to speciation of trophic specialists than adaptive introgression of Caribbean genetic variation; however, four of the 11 introgressed regions came from a single distant island and were associated with the primary axis of oral jaw divergence within the radiation. For example, standing variation in a proto-oncogene (ski) known to have effects on jaw size introgressed into one San Salvador Island specialist from an island 300 km away approximately 10 kya. The complex emerging picture of the origins of adaptive radiation on San Salvador Island indicates that multiple sources of genetic variation contributed to the adaptive phenotypes of novel trophic specialists on the island. Our findings suggest that a suite of factors, including rare adaptive

  14. Genomic comparison between pathogenic Streptococcus agalactiae isolated from Nile tilapia in Thailand and fish-derived ST7 strains.

    PubMed

    Kayansamruaj, Pattanapon; Pirarat, Nopadon; Kondo, Hidehiro; Hirono, Ikuo; Rodkhum, Channarong

    2015-12-01

    Streptococcus agalactiae, or Group B streptococcus (GBS), is a highly virulent pathogen in aquatic animals, causing huge mortalities worldwide. In Thailand, the serotype Ia, β-hemolytic GBS, belonging to sequence type (ST) 7 of clonal complex (CC) 7, was found to be the major cause of streptococcosis outbreaks in fish farms. In this study, we performed an in silico genomic comparison, aiming to investigate the phylogenetic relationship between the pathogenic fish strains of Thai ST7 and other ST7 from different hosts and geographical origins. In general, the genomes of Thai ST7 strains are closely related to other fish ST7s, as the core genome is shared by 92-95% of any individual fish ST7 genome. Among the fish ST7 genomes, we observed only small dissimilarities, based on the analysis of clustered regularly interspaced short palindromic repeats (CRISPRs), surface protein markers, insertions sequence (IS) elements and putative virulence genes. The phylogenetic tree based on single nucleotide polymorphisms (SNPs) of the core genome sequences clearly categorized the ST7 strains according to their geographical and host origins, with the human ST7 being genetically distant from other fish ST7 strains. A pan-genome analysis of ST7 strains detected a 48-kb gene island specifically in the Thai ST7 isolates. The orientations and predicted amino acid sequences of the genes in the island closely matched those of Tn5252, a streptococcal conjugative transposon, in GBS 2603V/R serotype V, Streptococcus pneumoniae and Streptococcus suis. Thus, it was presumed that Thai ST7 acquired this Tn5252 homologue from related streptococci. The close phylogenetic relationship between the fish ST7 strains suggests that these strains were derived from a common ancestor and have diverged in different geographical regions and in different hosts. Copyright © 2015 Elsevier B.V. All rights reserved.

  15. Enhanced annotations and features for comparing thousands of Pseudomonas genomes in the Pseudomonas genome database.

    PubMed

    Winsor, Geoffrey L; Griffiths, Emma J; Lo, Raymond; Dhillon, Bhavjinder K; Shay, Julie A; Brinkman, Fiona S L

    2016-01-04

    The Pseudomonas Genome Database (http://www.pseudomonas.com) is well known for the application of community-based annotation approaches for producing a high-quality Pseudomonas aeruginosa PAO1 genome annotation, and facilitating whole-genome comparative analyses with other Pseudomonas strains. To aid analysis of potentially thousands of complete and draft genome assemblies, this database and analysis platform was upgraded to integrate curated genome annotations and isolate metadata with enhanced tools for larger scale comparative analysis and visualization. Manually curated gene annotations are supplemented with improved computational analyses that help identify putative drug targets and vaccine candidates or assist with evolutionary studies by identifying orthologs, pathogen-associated genes and genomic islands. The database schema has been updated to integrate isolate metadata that will facilitate more powerful analysis of genomes across datasets in the future. We continue to place an emphasis on providing high-quality updates to gene annotations through regular review of the scientific literature and using community-based approaches including a major new Pseudomonas community initiative for the assignment of high-quality gene ontology terms to genes. As we further expand from thousands of genomes, we plan to provide enhancements that will aid data visualization and analysis arising from whole-genome comparative studies including more pan-genome and population-based approaches. © The Author(s) 2015. Published by Oxford University Press on behalf of Nucleic Acids Research.

  16. The tad locus: postcards from the widespread colonization island.

    PubMed

    Tomich, Mladen; Planet, Paul J; Figurski, David H

    2007-05-01

    The Tad (tight adherence) macromolecular transport system, which is present in many bacterial and archaeal species, represents an ancient and major new subtype of type II secretion. The tad genes are present on a genomic island named the widespread colonization island (WCI), and encode the machinery that is required for the assembly of adhesive Flp (fimbrial low-molecular-weight protein) pili. The tad genes are essential for biofilm formation, colonization and pathogenesis in the genera Aggregatibacter (Actinobacillus), Haemophilus, Pasteurella, Pseudomonas, Yersinia, Caulobacter and perhaps others. Here we review the structure, function and evolution of the Tad secretion system.

  17. Genomic organization of the neurofibromatosis 1 gene (NF1)

    DOE Office of Scientific and Technical Information (OSTI.GOV)

    Li, Y.; O`Connell, P.; Huntsman Breidenbach, H.

    Neurofibromatosis 1 maps to chromosome band 17q11.2, and the NF1 locus has been partially characterized. Even though the full-length NF1 cDNA has been sequenced, the complete genomic structure of the NF1 gene has not been elucidated. The 5{prime} end of NF1 is embedded in a CpG island containing a NotI restriction site, and the remainder of the gene lies in the adjacent 350-kb NotI fragment. In our efforts to develop a comprehensive screen for NF1 mutations, we have isolated genomic DNA clones that together harbor the entire NF1 cDNA sequence. We have identified all intron-exon boundaries of the coding regionmore » and established that it is composed of 59 exons. Furthermore, we have defined the 3{prime}-untranslated region (3{prime}-UTR) of the NF1 gene; it spans approximately 3.5 kb of genomic DNA sequence and is continuous with the stop codon. Oligonucleotide primer pairs synthesized from exon-flanking DNA sequences were used in the polymerase chain reaction with cloned, chromosome 17-specific genomic DNA as template to amplify NF1 exons 1 through 27b and the exon containing the 3{prime}-UTR separately. This information should be useful for implementing a comprehensive NF1 mutation screen using genomic DNA as template. 41 refs., 3 figs., 2 tabs.« less

  18. Complete genome analysis of three Acinetobacter baumannii clinical isolates in China for insight into the diversification of drug resistance elements.

    PubMed

    Zhu, Lingxiang; Yan, Zhongqiang; Zhang, Zhaojun; Zhou, Qiming; Zhou, Jinchun; Wakeland, Edward K; Fang, Xiangdong; Xuan, Zhenyu; Shen, Dingxia; Li, Quan-Zhen

    2013-01-01

    The emergence and rapid spreading of multidrug-resistant Acinetobacter baumannii strains has become a major health threat worldwide. To better understand the genetic recombination related with the acquisition of drug-resistant elements during bacterial infection, we performed complete genome analysis on three newly isolated multidrug-resistant A. baumannii strains from Beijing using next-generation sequencing technology. Whole genome comparison revealed that all 3 strains share some common drug resistant elements including carbapenem-resistant bla OXA-23 and tetracycline (tet) resistance islands, but the genome structures are diversified among strains. Various genomic islands intersperse on the genome with transposons and insertions, reflecting the recombination flexibility during the acquisition of the resistant elements. The blood-isolated BJAB07104 and ascites-isolated BJAB0868 exhibit high similarity on their genome structure with most of the global clone II strains, suggesting these two strains belong to the dominant outbreak strains prevalent worldwide. A large resistance island (RI) of about 121-kb, carrying a cluster of resistance-related genes, was inserted into the ATPase gene on BJAB07104 and BJAB0868 genomes. A 78-kb insertion element carrying tra-locus and bla OXA-23 island, can be either inserted into one of the tniB gene in the 121-kb RI on the chromosome, or transformed to conjugative plasmid in the two BJAB strains. The third strains of this study, BJAB0715, which was isolated from spinal fluid, exhibit much more divergence compared with above two strains. It harbors multiple drug-resistance elements including a truncated AbaR-22-like RI on its genome. One of the unique features of this strain is that it carries both bla OXA-23 and bla OXA-58 genes on its genome. Besides, an Acinetobacter lwoffii adeABC efflux element was found inserted into the ATPase position in BJAB0715. Our comparative analysis on currently completed Acinetobacter baumannii

  19. The Neisseria meningitidis CRISPR-Cas9 System Enables Specific Genome Editing in Mammalian Cells

    PubMed Central

    Lee, Ciaran M; Cradick, Thomas J; Bao, Gang

    2016-01-01

    The clustered regularly-interspaced short palindromic repeats (CRISPR)—CRISPR-associated (Cas) system from Streptococcus pyogenes (Spy) has been successfully adapted for RNA-guided genome editing in a wide range of organisms. However, numerous reports have indicated that Spy CRISPR-Cas9 systems may have significant off-target cleavage of genomic DNA sequences differing from the intended on-target site. Here, we report the performance of the Neisseria meningitidis (Nme) CRISPR-Cas9 system that requires a longer protospacer-adjacent motif for site-specific cleavage, and present a comparison between the Spy and Nme CRISPR-Cas9 systems targeting the same protospacer sequence. The results with the native crRNA and tracrRNA as well as a chimeric single guide RNA for the Nme CRISPR-Cas9 system were also compared. Our results suggest that, compared with the Spy system, the Nme CRISPR-Cas9 system has similar or lower on-target cleavage activity but a reduced overall off-target effect on a genomic level when sites containing three or fewer mismatches are considered. Thus, the Nme CRISPR-Cas9 system may represent a safer alternative for precision genome engineering applications. PMID:26782639

  20. The Neisseria meningitidis CRISPR-Cas9 System Enables Specific Genome Editing in Mammalian Cells.

    PubMed

    Lee, Ciaran M; Cradick, Thomas J; Bao, Gang

    2016-03-01

    The clustered regularly-interspaced short palindromic repeats (CRISPR)-CRISPR-associated (Cas) system from Streptococcus pyogenes (Spy) has been successfully adapted for RNA-guided genome editing in a wide range of organisms. However, numerous reports have indicated that Spy CRISPR-Cas9 systems may have significant off-target cleavage of genomic DNA sequences differing from the intended on-target site. Here, we report the performance of the Neisseria meningitidis (Nme) CRISPR-Cas9 system that requires a longer protospacer-adjacent motif for site-specific cleavage, and present a comparison between the Spy and Nme CRISPR-Cas9 systems targeting the same protospacer sequence. The results with the native crRNA and tracrRNA as well as a chimeric single guide RNA for the Nme CRISPR-Cas9 system were also compared. Our results suggest that, compared with the Spy system, the Nme CRISPR-Cas9 system has similar or lower on-target cleavage activity but a reduced overall off-target effect on a genomic level when sites containing three or fewer mismatches are considered. Thus, the Nme CRISPR-Cas9 system may represent a safer alternative for precision genome engineering applications.

  1. Conjugative type IVb pilus recognizes lipopolysaccharide of recipient cells to initiate PAPI-1 pathogenicity island transfer in Pseudomonas aeruginosa

    USDA-ARS?s Scientific Manuscript database

    Pseudomonas aeruginosa pathogenicity island 1 (PAPI-1) is one of the largest genomic islands of this important opportunistic human pathogen. Previous studies have shown that PAPI-1 encodes several putative virulence factors, a major regulator of biofilm formation, and antibiotic-resistance traits, a...

  2. Leadership in Context: Observations from Two Island Communities

    ERIC Educational Resources Information Center

    Billot, Jennie

    2005-01-01

    This article refers to just one example of specificity of context, the small island community. Islands can be viewed as well-bounded communities, often with an identity that seeks to be one step removed from being the politically dependent neighbour. Two islands serve to exemplify the significance of context: The Island of Jersey (Channel…

  3. Measuring specific receptor binding of a PET radioligand in human brain without pharmacological blockade: The genomic plot.

    PubMed

    Veronese, Mattia; Zanotti-Fregonara, Paolo; Rizzo, Gaia; Bertoldo, Alessandra; Innis, Robert B; Turkheimer, Federico E

    2016-04-15

    PET studies allow in vivo imaging of the density of brain receptor species. The PET signal, however, is the sum of the fraction of radioligand that is specifically bound to the target receptor and the non-displaceable fraction (i.e. the non-specifically bound radioligand plus the free ligand in tissue). Therefore, measuring the non-displaceable fraction, which is generally assumed to be constant across the brain, is a necessary step to obtain regional estimates of the specific fractions. The nondisplaceable binding can be directly measured if a reference region, i.e. a region devoid of any specific binding, is available. Many receptors are however widely expressed across the brain, and a true reference region is rarely available. In these cases, the nonspecific binding can be obtained after competitive pharmacological blockade, which is often contraindicated in humans. In this work we introduce the genomic plot for estimating the nondisplaceable fraction using baseline scans only. The genomic plot is a transformation of the Lassen graphical method in which the brain maps of mRNA transcripts of the target receptor obtained from the Allen brain atlas are used as a surrogate measure of the specific binding. Thus, the genomic plot allows the calculation of the specific and nondisplaceable components of radioligand uptake without the need of pharmacological blockade. We first assessed the statistical properties of the method with computer simulations. Then we sought ground-truth validation using human PET datasets of seven different neuroreceptor radioligands, where nonspecific fractions were either obtained separately using drug displacement or available from a true reference region. The population nondisplaceable fractions estimated by the genomic plot were very close to those measured by actual human blocking studies (mean relative difference between 2% and 7%). However, these estimates were valid only when mRNA expressions were predictive of protein levels (i

  4. Measuring specific receptor binding of a PET radioligand in human brain without pharmacological blockade: The genomic plot

    PubMed Central

    Veronese, Mattia; Zanotti-Fregonara, Paolo; Rizzo, Gaia; Bertoldo, Alessandra; Innis, Robert B.; Turkheimer, Federico E.

    2016-01-01

    PET studies allow in vivo imaging of the density of brain receptor species. The PET signal, however, is the sum of the fraction of radioligand that is specifically bound to the target receptor and the non-displaceable fraction (i.e. the non-specifically bound radioligand plus the free ligand in tissue). Therefore, measuring the non-displaceable fraction, which is generally assumed to be constant across the brain, is a necessary step to obtain regional estimates of the specific fractions. The nondisplaceable binding can be directly measured if a reference region, i.e. a region devoid of any specific binding, is available. Many receptors are however widely expressed across the brain, and a true reference region is rarely available. In these cases, the nonspecific binding can be obtained after competitive pharmacological blockade, which is often contraindicated in humans. In this work we introduce the genomic plot for estimating the nondisplaceable fraction using baseline scans only. The genomic plot is a transformation of the Lassen graphical method in which the brain maps of mRNA transcripts of the target receptor obtained from the Allen brain atlas are used as a surrogate measure of the specific binding. Thus, the genomic plot allows the calculation of the specific and nondisplaceable components of radioligand uptake without the need of pharmacological blockade. We first assessed the statistical properties of the method with computer simulations. Then we sought ground-truth validation using human PET datasets of seven different neuroreceptor radioligands, where nonspecific fractions were either obtained separately using drug displacement or available from a true reference region. The population nondisplaceable fractions estimated by the genomic plot were very close to those measured by actual human blocking studies (mean relative difference between 2% and 7%). However, these estimates were valid only when mRNA expressions were predictive of protein levels (i

  5. (Meta)genomic insights into the pathogenome of Cellulosimicrobium cellulans

    DOE PAGES

    Sharma, Anukriti; Gilbert, Jack A.; Lal, Rup

    2016-05-06

    Despite having serious clinical manifestations, Cellulosimicrobium cellulans remain under-reported with only three genome sequences available at the time of writing. Genome sequences of C. cellulans LMG16121, C. cellulans J36 and Cellulosimicrobium sp. strain MM were used to determine distribution of pathogenicity islands (PAIs) across C. cellulans, which revealed 49 potential marker genes with known association to human infections, e.g. Fic and VbhA toxin-antitoxin system. Oligonucleotide composition-based analysis of orthologous proteins (n = 791) across three genomes revealed significant negative correlation (P < 0.05) between frequency of optimal codons ( Fopt) and gene G+C content, highlighting the G+C-biased gene conversion (gBGC)more » effect across Cellulosimicrobium strains. Bayesian molecular-clock analysis performed on three virulent PAI proteins (Fic; D-alanyl-D-alanine-carboxypeptidase; transposase) dated the divergence event at 300 million years ago from the most common recent ancestor. Synteny-based annotation of hypothetical proteins highlighted gene transfers from non-pathogenic bacteria as a key factor in the evolution of PAIs. Additonally, deciphering the metagenomic islands using strain MM's genome with environmental data from the site of isolation (hot-spring biofilm) revealed (an)aerobic respiration as population segregation factor across the in situ cohorts. Furthermore, using reference genomes and metagenomic data, our results highlight the emergence and evolution of PAIs in the genus Cellulosimicrobium.« less

  6. Cre/lox-Recombinase-Mediated Cassette Exchange for Reversible Site-Specific Genomic Targeting of the Disease Vector, Aedes aegypti.

    PubMed

    Häcker, Irina; Harrell Ii, Robert A; Eichner, Gerrit; Pilitt, Kristina L; O'Brochta, David A; Handler, Alfred M; Schetelig, Marc F

    2017-03-07

    Site-specific genome modification (SSM) is an important tool for mosquito functional genomics and comparative gene expression studies, which contribute to a better understanding of mosquito biology and are thus a key to finding new strategies to eliminate vector-borne diseases. Moreover, it allows for the creation of advanced transgenic strains for vector control programs. SSM circumvents the drawbacks of transposon-mediated transgenesis, where random transgene integration into the host genome results in insertional mutagenesis and variable position effects. We applied the Cre/lox recombinase-mediated cassette exchange (RMCE) system to Aedes aegypti, the vector of dengue, chikungunya, and Zika viruses. In this context we created four target site lines for RMCE and evaluated their fitness costs. Cre-RMCE is functional in a two-step mechanism and with good efficiency in Ae. aegypti. The advantages of Cre-RMCE over existing site-specific modification systems for Ae. aegypti, phiC31-RMCE and CRISPR, originate in the preservation of the recombination sites, which 1) allows successive modifications and rapid expansion or adaptation of existing systems by repeated targeting of the same site; and 2) provides reversibility, thus allowing the excision of undesired sequences. Thereby, Cre-RMCE complements existing genomic modification tools, adding flexibility and versatility to vector genome targeting.

  7. Isabela Island, Galapagos Islands

    NASA Image and Video Library

    1996-01-20

    STS072-732-072 (11-20 Jan. 1996) --- Three of the nineteen Galapagos Islands are visible in this image, photographed from the Earth-orbiting Space Shuttle Endeavour. The Galapagos Islands are located 600 miles (1,000 kilometers) to the west of Ecuador. The largest of the islands, Isabela, is at center (north is toward the upper right corner). The numerous circular features on the island, highlighted by clouds, are volcanoes. The Galapagos Islands owe their existence to a hot spot, or persistent heat source in the mantle, which also is located over a rift, or place where plates are separating and new crust is being created. The rift is located between the Cocos and Nazca Plates. The dark linear features on the islands are lava flows from past eruptions. The island to the left of Isabela is Fernandina, while the island to the right is San Salvador. The Galapagos Islands were visited by the English naturalist Charles Darwin in 1835.

  8. Engineered Cpf1 variants with altered PAM specificities increase genome targeting range

    PubMed Central

    Gao, Linyi; Cox, David B.T.; Yan, Winston X.; Manteiga, John C.; Schneider, Martin W.; Yamano, Takashi; Nishimasu, Hiroshi; Nureki, Osamu; Crosetto, Nicola; Zhang, Feng

    2017-01-01

    The RNA-guided endonuclease Cpf1 is a promising tool for genome editing in eukaryotic cells1–7. However, the utility of the commonly used Acidaminococcus sp. BV3L6 Cpf1 (AsCpf1) and Lachnospiraceae bacterium ND2006 Cpf1 (LbCpf1) is limited by their requirement of a TTTV protospacer adjacent motif (PAM) in the DNA substrate. To address this limitation, we performed a structure-guided mutagenesis screen to increase the targeting range of Cpf1. We engineered two AsCpf1 variants carrying the mutations S542R/K607R and S542R/K548V/N552R, which recognize TYCV and TATV PAMs, respectively, with enhanced activities in vitro and in human cells. Genome-wide assessment of off-target activity using BLISS7 assay indicated that these variants retain high DNA targeting specificity, which we further improved by introducing an additional non-PAM-interacting mutation. Introducing the identified mutations at their corresponding positions in LbCpf1 similarly altered its PAM specificity. Together, these variants increase the targeting range of Cpf1 by approximately three-fold in human coding sequences to one cleavage site per ~11 bp. PMID:28581492

  9. Draft genome of agar-degrading marine bacterium Gilvimarinus agarilyticus JEA5.

    PubMed

    Lee, Youngdeuk; Lee, Su-Jin; Park, Gun-Hoo; Heo, Soo-Jin; Umasuthan, Navaneethaiyer; Kang, Do-Hyung; Oh, Chulhong

    2015-06-01

    Gilvimarinus agarilyticus JEA5, which effectively degrades agar, was isolated from the seawater of Jeju Island, Republic of Korea. Here, we report the draft genome sequence of G. agarilyticus JEA5 with a total genome size of 4,179,438bp from 2 scaffolds (21 contigs) with 53.15% G+C content. Various polysaccharidases including 11 predicted agarases were observed from the draft genome of G. agarilyticus JEA5. Copyright © 2015 Elsevier B.V. All rights reserved.

  10. Comparative genomics of Clostridium bolteae and Clostridium clostridioforme reveals species-specific genomic properties and numerous putative antibiotic resistance determinants.

    PubMed

    Dehoux, Pierre; Marvaud, Jean Christophe; Abouelleil, Amr; Earl, Ashlee M; Lambert, Thierry; Dauga, Catherine

    2016-10-21

    Clostridium bolteae and Clostridium clostridioforme, previously included in the complex C. clostridioforme in the group Clostridium XIVa, remain difficult to distinguish by phenotypic methods. These bacteria, prevailing in the human intestinal microbiota, are opportunistic pathogens with various drug susceptibility patterns. In order to better characterize the two species and to obtain information on their antibiotic resistance genes, we analyzed the genomes of six strains of C. bolteae and six strains of C. clostridioforme, isolated from human infection. The genome length of C. bolteae varied from 6159 to 6398 kb, and 5719 to 6059 CDSs were detected. The genomes of C. clostridioforme were smaller, between 5467 and 5927 kb, and contained 5231 to 5916 CDSs. The two species display different metabolic pathways. The genomes of C. bolteae contained lactose operons involving PTS system and complex regulation, which contribute to phenotypic differentiation from C. clostridioforme. The Acetyl-CoA pathway, similar to that of Faecalibacterium prausnitzii, a major butyrate producer in the human gut, was only found in C. clostridioforme. The two species have also developed diverse flagella mobility systems contributing to gut colonization. Their genomes harboured many CDSs involved in resistance to beta-lactams, glycopeptides, macrolides, chloramphenicol, lincosamides, rifampin, linezolid, bacitracin, aminoglycosides and tetracyclines. Overall antimicrobial resistance genes were similar within a species, but strain-specific resistance genes were found. We discovered a new group of genes coding for rifampin resistance in C. bolteae. C. bolteae 90B3 was resistant to phenicols and linezolide in producing a 23S rRNA methyltransferase. C. clostridioforme 90A8 contained the VanB-type Tn1549 operon conferring vancomycin resistance. We also detected numerous genes encoding proteins related to efflux pump systems. Genomic comparison of C. bolteae and C. clostridiofrome revealed

  11. Culturally-specific physical activity measures for Native Hawaiian and Pacific Islanders.

    PubMed

    Moy, Karen L; Sallis, James F; Tanjasiri, Sora P

    2010-05-01

    Physical activity is an important contributor to the health disparities experienced by Native Hawaiian and Pacific Islander (NHPI) populations. A culturally-specific measurement instrument that minimizes interpretation bias is necessary to obtain accurate assessments of this lifestyle behavior. The purpose of this study was to 1) create two versions of the Pacific Islander Physical Activity Questionnaire (PIPAQ-short and PIPAQ-long) for United States NHPI, and 2) pilot test the PIPAQ instruments and two objective physical activity monitors to evaluate cultural-appropriateness and acceptability. Forty NHPI adults (20M, 20F) aged 21-65 years attended focus group discussions addressing cultural perspectives related to physical activity. Feedback from participants, community leaders and physical activity experts guided cultural modifications to existing questionnaires to create PIPAQ-short and PIPAQ-long with accompanying showcards. Pilot testing of both PIPAQs and two objective physical activity monitors, the Actiheart and ActiTrainer, was carried out in another sample of 32 NHPI adults (17M, 15F) aged 18-63 years. Participants were instructed to wear one monitor for ≥10 hours/day for 7 consecutive days. At the follow-up visit, participants completed PIPAQ-short and PIPAQ-long, and a written and verbal exit interview to provide feedback on both subjective and objective instruments. The majority of participants felt PIPAQ-long provided a more accurate reflection of activity levels, compared to PIPAQ-short. The Actiheart was the preferred monitor due to higher comfort and lower participant burden. Self-reported duration of physical activities was most difficult to recall, compared to activity type, frequency and intensity. Both PIPAQ instruments and the Actiheart monitor have demonstrated cultural acceptability and appropriateness for NHPI adults. Future studies will investigate the validity and reliability of both PIPAQ instruments in larger samples of NHPI adults

  12. Deep analysis of wild Vitis flower transcriptome reveals unexplored genome regions associated with sex specification.

    PubMed

    Ramos, Miguel Jesus Nunes; Coito, João Lucas; Fino, Joana; Cunha, Jorge; Silva, Helena; de Almeida, Patrícia Gomes; Costa, Maria Manuela Ribeiro; Amâncio, Sara; Paulo, Octávio S; Rocheta, Margarida

    2017-01-01

    RNA-seq of Vitis during early stages of bud development, in male, female and hermaphrodite flowers, identified new loci outside of annotated gene models, suggesting their involvement in sex establishment. The molecular mechanisms responsible for flower sex specification remain unclear for most plant species. In the case of V. vinifera ssp. vinifera, it is not fully understood what determines hermaphroditism in the domesticated subspecies and male or female flowers in wild dioecious relatives (Vitis vinifera ssp. sylvestris). Here, we describe a de novo assembly of the transcriptome of three flower developmental stages from the three Vitis vinifera flower types. The validation of de novo assembly showed a correlation of 0.825. The main goals of this work were the identification of V. v. sylvestris exclusive transcripts and the characterization of differential gene expression during flower development. RNA from several flower developmental stages was used previously to generate Illumina sequence reads. Through a sequential de novo assembly strategy one comprehensive transcriptome comprising 95,516 non-redundant transcripts was assembled. From this dataset 81,064 transcripts were annotated to V. v. vinifera reference transcriptome and 11,084 were annotated against V. v. vinifera reference genome. Moreover, we found 3368 transcripts that could not be mapped to Vitis reference genome. From all the non-redundant transcripts that were assembled, bioinformatics analysis identified 133 specific of V. v. sylvestris and 516 transcripts differentially expressed among the three flower types. The detection of transcription from areas of the genome not currently annotated suggests active transcription of previously unannotated genomic loci during early stages of bud development.

  13. Analysis of Genome Plasticity in Pathogenic and Commensal Escherichia coli Isolates by Use of DNA Arrays

    PubMed Central

    Dobrindt, Ulrich; Agerer, Franziska; Michaelis, Kai; Janka, Andreas; Buchrieser, Carmen; Samuelson, Martin; Svanborg, Catharina; Gottschalk, Gerhard; Karch, Helge; Hacker, Jörg

    2003-01-01

    Genomes of prokaryotes differ significantly in size and DNA composition. Escherichia coli is considered a model organism to analyze the processes involved in bacterial genome evolution, as the species comprises numerous pathogenic and commensal variants. Pathogenic and nonpathogenic E. coli strains differ in the presence and absence of additional DNA elements contributing to specific virulence traits and also in the presence and absence of additional genetic information. To analyze the genetic diversity of pathogenic and commensal E. coli isolates, a whole-genome approach was applied. Using DNA arrays, the presence of all translatable open reading frames (ORFs) of nonpathogenic E. coli K-12 strain MG1655 was investigated in 26 E. coli isolates, including various extraintestinal and intestinal pathogenic E. coli isolates, 3 pathogenicity island deletion mutants, and commensal and laboratory strains. Additionally, the presence of virulence-associated genes of E. coli was determined using a DNA “pathoarray” developed in our laboratory. The frequency and distributional pattern of genomic variations vary widely in different E. coli strains. Up to 10% of the E. coli K-12-specific ORFs were not detectable in the genomes of the different strains. DNA sequences described for extraintestinal or intestinal pathogenic E. coli are more frequently detectable in isolates of the same origin than in other pathotypes. Several genes coding for virulence or fitness factors are also present in commensal E. coli isolates. Based on these results, the conserved E. coli core genome is estimated to consist of at least 3,100 translatable ORFs. The absence of K-12-specific ORFs was detectable in all chromosomal regions. These data demonstrate the great genome heterogeneity and genetic diversity among E. coli strains and underline the fact that both the acquisition and deletion of DNA elements are important processes involved in the evolution of prokaryotes. PMID:12618447

  14. Long-term in situ persistence of biodiversity in tropical sky islands revealed by landscape genomics.

    PubMed

    Mastretta-Yanes, Alicia; Xue, Alexander T; Moreno-Letelier, Alejandra; Jorgensen, Tove H; Alvarez, Nadir; Piñero, Daniel; Emerson, Brent C

    2018-01-01

    Tropical mountains are areas of high species richness and endemism. Two historical phenomena may have contributed to this: (i) fragmentation and isolation of habitats may have promoted the genetic differentiation of populations and increased the possibility of allopatric divergence and speciation and (ii) the mountain areas may have allowed long-term population persistence during global climate fluctuations. These two phenomena have been studied using either species occurrence data or estimating species divergence times. However, only few studies have used intraspecific genetic data to analyse the mechanisms by which endemism may emerge at the microevolutionary scale. Here, we use landscape analysis of genomic SNP data sampled from two high-elevation plant species from an archipelago of tropical sky islands (the Trans-Mexican Volcanic Belt) to test for population genetic differentiation, synchronous demographic changes and habitat persistence. We show that genetic differentiation can be explained by the degree of glacial habitat connectivity among mountains and that mountains have facilitated the persistence of populations throughout glacial/interglacial cycles. Our results support the ongoing role of tropical mountains as cradles for biodiversity by uncovering cryptic differentiation and limits to gene flow. © 2017 John Wiley & Sons Ltd.

  15. Identification of genomic sites for CRISPR/Cas9-based genome editing in the Vitis vinifera genome

    USDA-ARS?s Scientific Manuscript database

    CRISPR/Cas9 has been recently demonstrated as an effective and popular genome editing tool for modifying genomes of human, animals, microorganisms, and plants. Success of such genome editing is highly dependent on the availability of suitable target sites in the genomes to be edited. Many specific t...

  16. Sequencing of a QTL-rich region of the Theobroma cacao genome using pooled BACs and the identification of trait specific candidate genes

    USDA-ARS?s Scientific Manuscript database

    Background: BAC-based physical maps provide for sequencing across an entire genome or selected sub-genome regions of biological interest. Using the minimum tiling path as a guide, it is possible to select specific BAC clones from prioritized genome sections such as a genetically defined QTL interv...

  17. Modeling and interoperability of heterogeneous genomic big data for integrative processing and querying.

    PubMed

    Masseroli, Marco; Kaitoua, Abdulrahman; Pinoli, Pietro; Ceri, Stefano

    2016-12-01

    While a huge amount of (epi)genomic data of multiple types is becoming available by using Next Generation Sequencing (NGS) technologies, the most important emerging problem is the so-called tertiary analysis, concerned with sense making, e.g., discovering how different (epi)genomic regions and their products interact and cooperate with each other. We propose a paradigm shift in tertiary analysis, based on the use of the Genomic Data Model (GDM), a simple data model which links genomic feature data to their associated experimental, biological and clinical metadata. GDM encompasses all the data formats which have been produced for feature extraction from (epi)genomic datasets. We specifically describe the mapping to GDM of SAM (Sequence Alignment/Map), VCF (Variant Call Format), NARROWPEAK (for called peaks produced by NGS ChIP-seq or DNase-seq methods), and BED (Browser Extensible Data) formats, but GDM supports as well all the formats describing experimental datasets (e.g., including copy number variations, DNA somatic mutations, or gene expressions) and annotations (e.g., regarding transcription start sites, genes, enhancers or CpG islands). We downloaded and integrated samples of all the above-mentioned data types and formats from multiple sources. The GDM is able to homogeneously describe semantically heterogeneous data and makes the ground for providing data interoperability, e.g., achieved through the GenoMetric Query Language (GMQL), a high-level, declarative query language for genomic big data. The combined use of the data model and the query language allows comprehensive processing of multiple heterogeneous data, and supports the development of domain-specific data-driven computations and bio-molecular knowledge discovery. Copyright © 2016 Elsevier Inc. All rights reserved.

  18. Biallelic Mutations of Methionyl-tRNA Synthetase Cause a Specific Type of Pulmonary Alveolar Proteinosis Prevalent on Réunion Island

    PubMed Central

    Hadchouel, Alice; Wieland, Thomas; Griese, Matthias; Baruffini, Enrico; Lorenz-Depiereux, Bettina; Enaud, Laurent; Graf, Elisabeth; Dubus, Jean Christophe; Halioui-Louhaichi, Sonia; Coulomb, Aurore; Delacourt, Christophe; Eckstein, Gertrud; Zarbock, Ralf; Schwarzmayr, Thomas; Cartault, François; Meitinger, Thomas; Lodi, Tiziana; de Blic, Jacques; Strom, Tim M.

    2015-01-01

    Methionyl-tRNA synthetase (MARS) catalyzes the ligation of methionine to tRNA and is critical for protein biosynthesis. We identified biallelic missense mutations in MARS in a specific form of pediatric pulmonary alveolar proteinosis (PAP), a severe lung disorder that is prevalent on the island of Réunion and the molecular basis of which is unresolved. Mutations were found in 26 individuals from Réunion and nearby islands and in two families from other countries. Functional consequences of the mutated alleles were assessed by growth of wild-type and mutant strains and methionine-incorporation assays in yeast. Enzyme activity was attenuated in a liquid medium without methionine but could be restored by methionine supplementation. In summary, identification of a founder mutation in MARS led to the molecular definition of a specific type of PAP and will enable carrier screening in the affected community and possibly open new treatment opportunities. PMID:25913036

  19. Genomic diversity and versatility of Lactobacillus plantarum, a natural metabolic engineer.

    PubMed

    Siezen, Roland J; van Hylckama Vlieg, Johan E T

    2011-08-30

    In the past decade it has become clear that the lactic acid bacterium Lactobacillus plantarum occupies a diverse range of environmental niches and has an enormous diversity in phenotypic properties, metabolic capacity and industrial applications. In this review, we describe how genome sequencing, comparative genome hybridization and comparative genomics has provided insight into the underlying genomic diversity and versatility of L. plantarum. One of the main features appears to be genomic life-style islands consisting of numerous functional gene cassettes, in particular for carbohydrates utilization, which can be acquired, shuffled, substituted or deleted in response to niche requirements. In this sense, L. plantarum can be considered a "natural metabolic engineer".

  20. Testing the 'island rule' for a tenebrionid beetle (Coleoptera, Tenebrionidae)

    NASA Astrophysics Data System (ADS)

    Palmer, Miquel

    2002-05-01

    Insular populations and their closest mainland counterparts commonly display body size differences that are considered to fit the island rule, a theoretical framework to explain both dwarfism and gigantism in isolated animal populations. The island rule is used to explain the pattern of change of body size at the inter-specific level. But the model implicitly makes also a prediction for the body size of isolated populations of a single species. It suggests that, for a hypothetical species covering a wide range of island sizes, there exists a specific island size where this species reaches the largest body size. Body size would be small (in relative terms) in the smallest islets of the species range. It would increase with island size, and reach a maximum at some specific island size. However, additional increases from such a specific island size would instead promote body size reduction, and small (in relative terms) body sizes would be found again on the largest islands. The biogeographical patterns predicted by the island rule have been described and analysed for vertebrates only (mainly mammals), but remain largely untested for insects or other invertebrates. I analyse here the pattern of body size variation between seven isolated insular populations of a flightless beetle, Asida planipennis (Coleoptera, Tenebrionidae). This is an endemic species of Mallorca, Menorca and a number of islands and islets in the Balearic archipelago (western Mediterranean). The study covers seven of the 15 known populations (i.e., there are only 15 islands or islets inhabited by the species). The populations studied fit the pattern advanced above and we could, therefore, extrapolate the island rule to a very different kind of organism. However, the small sample size of some of the populations invites some caution at this early stage.

  1. Lineage-specific rediploidization is a mechanism to explain time-lags between genome duplication and evolutionary diversification.

    PubMed

    Robertson, Fiona M; Gundappa, Manu Kumar; Grammes, Fabian; Hvidsten, Torgeir R; Redmond, Anthony K; Lien, Sigbjørn; Martin, Samuel A M; Holland, Peter W H; Sandve, Simen R; Macqueen, Daniel J

    2017-06-14

    The functional divergence of duplicate genes (ohnologues) retained from whole genome duplication (WGD) is thought to promote evolutionary diversification. However, species radiation and phenotypic diversification are often temporally separated from WGD. Salmonid fish, whose ancestor underwent WGD by autotetraploidization ~95 million years ago, fit such a 'time-lag' model of post-WGD radiation, which occurred alongside a major delay in the rediploidization process. Here we propose a model, 'lineage-specific ohnologue resolution' (LORe), to address the consequences of delayed rediploidization. Under LORe, speciation precedes rediploidization, allowing independent ohnologue divergence in sister lineages sharing an ancestral WGD event. Using cross-species sequence capture, phylogenomics and genome-wide analyses of ohnologue expression divergence, we demonstrate the major impact of LORe on salmonid evolution. One-quarter of each salmonid genome, harbouring at least 4550 ohnologues, has evolved under LORe, with rediploidization and functional divergence occurring on multiple independent occasions >50 million years post-WGD. We demonstrate the existence and regulatory divergence of many LORe ohnologues with functions in lineage-specific physiological adaptations that potentially facilitated salmonid species radiation. We show that LORe ohnologues are enriched for different functions than 'older' ohnologues that began diverging in the salmonid ancestor. LORe has unappreciated significance as a nested component of post-WGD divergence that impacts the functional properties of genes, whilst providing ohnologues available solely for lineage-specific adaptation. Under LORe, which is predicted following many WGD events, the functional outcomes of WGD need not appear 'explosively', but can arise gradually over tens of millions of years, promoting lineage-specific diversification regimes under prevailing ecological pressures.

  2. The Sequenced Angiosperm Genomes and Genome Databases.

    PubMed

    Chen, Fei; Dong, Wei; Zhang, Jiawei; Guo, Xinyue; Chen, Junhao; Wang, Zhengjia; Lin, Zhenguo; Tang, Haibao; Zhang, Liangsheng

    2018-01-01

    Angiosperms, the flowering plants, provide the essential resources for human life, such as food, energy, oxygen, and materials. They also promoted the evolution of human, animals, and the planet earth. Despite the numerous advances in genome reports or sequencing technologies, no review covers all the released angiosperm genomes and the genome databases for data sharing. Based on the rapid advances and innovations in the database reconstruction in the last few years, here we provide a comprehensive review for three major types of angiosperm genome databases, including databases for a single species, for a specific angiosperm clade, and for multiple angiosperm species. The scope, tools, and data of each type of databases and their features are concisely discussed. The genome databases for a single species or a clade of species are especially popular for specific group of researchers, while a timely-updated comprehensive database is more powerful for address of major scientific mysteries at the genome scale. Considering the low coverage of flowering plants in any available database, we propose construction of a comprehensive database to facilitate large-scale comparative studies of angiosperm genomes and to promote the collaborative studies of important questions in plant biology.

  3. The Sequenced Angiosperm Genomes and Genome Databases

    PubMed Central

    Chen, Fei; Dong, Wei; Zhang, Jiawei; Guo, Xinyue; Chen, Junhao; Wang, Zhengjia; Lin, Zhenguo; Tang, Haibao; Zhang, Liangsheng

    2018-01-01

    Angiosperms, the flowering plants, provide the essential resources for human life, such as food, energy, oxygen, and materials. They also promoted the evolution of human, animals, and the planet earth. Despite the numerous advances in genome reports or sequencing technologies, no review covers all the released angiosperm genomes and the genome databases for data sharing. Based on the rapid advances and innovations in the database reconstruction in the last few years, here we provide a comprehensive review for three major types of angiosperm genome databases, including databases for a single species, for a specific angiosperm clade, and for multiple angiosperm species. The scope, tools, and data of each type of databases and their features are concisely discussed. The genome databases for a single species or a clade of species are especially popular for specific group of researchers, while a timely-updated comprehensive database is more powerful for address of major scientific mysteries at the genome scale. Considering the low coverage of flowering plants in any available database, we propose construction of a comprehensive database to facilitate large-scale comparative studies of angiosperm genomes and to promote the collaborative studies of important questions in plant biology. PMID:29706973

  4. Orphan and gene related CpG Islands follow power-law-like distributions in several genomes: evidence of function-related and taxonomy-related modes of distribution.

    PubMed

    Tsiagkas, Giannis; Nikolaou, Christoforos; Almirantis, Yannis

    2014-12-01

    CpG Islands (CGIs) are compositionally defined short genomic stretches, which have been studied in the human, mouse, chicken and later in several other genomes. Initially, they were assigned the role of transcriptional regulation of protein-coding genes, especially the house-keeping ones, while more recently there is found evidence that they are involved in several other functions as well, which might include regulation of the expression of RNA genes, DNA replication etc. Here, an investigation of their distributional characteristics in a variety of genomes is undertaken for both whole CGI populations as well as for CGI subsets that lie away from known genes (gene-unrelated or "orphan" CGIs). In both cases power-law-like linearity in double logarithmic scale is found. An evolutionary model, initially put forward for the explanation of a similar pattern found in gene populations is implemented. It includes segmental duplication events and eliminations of most of the duplicated CGIs, while a moderate rate of non-duplicated CGI eliminations is also applied in some cases. Simulations reproduce all the main features of the observed inter-CGI chromosomal size distributions. Our results on power-law-like linearity found in orphan CGI populations suggest that the observed distributional pattern is independent of the analogous pattern that protein coding segments were reported to follow. The power-law-like patterns in the genomic distributions of CGIs described herein are found to be compatible with several other features of the composition, abundance or functional role of CGIs reported in the current literature across several genomes, on the basis of the proposed evolutionary model. Copyright © 2014 Elsevier Ltd. All rights reserved.

  5. Islands on the edge: housing development and other threats to America's Pacific and Caribbean Island forests: a Forests on the Edge report

    Treesearch

    Susan M. Stein; Mary A. Carr; Greg C. Liknes; Sara J. Comas

    2014-01-01

    This report provides an overview of expected housing density changes and related impacts to private forests on America's islands in the Pacific and Caribbean, specifically Hawaii, Guam, American Samoa, the Commonwealth of the Northern Mariana Islands, Puerto Rico, and the U.S. Virgin Islands. We discuss the vulnerability of island forests to conversion for housing...

  6. Genome-wide study of correlations between genomic features and their relationship with the regulation of gene expression.

    PubMed

    Kravatsky, Yuri V; Chechetkin, Vladimir R; Tchurikov, Nikolai A; Kravatskaya, Galina I

    2015-02-01

    The broad class of tasks in genetics and epigenetics can be reduced to the study of various features that are distributed over the genome (genome tracks). The rapid and efficient processing of the huge amount of data stored in the genome-scale databases cannot be achieved without the software packages based on the analytical criteria. However, strong inhomogeneity of genome tracks hampers the development of relevant statistics. We developed the criteria for the assessment of genome track inhomogeneity and correlations between two genome tracks. We also developed a software package, Genome Track Analyzer, based on this theory. The theory and software were tested on simulated data and were applied to the study of correlations between CpG islands and transcription start sites in the Homo sapiens genome, between profiles of protein-binding sites in chromosomes of Drosophila melanogaster, and between DNA double-strand breaks and histone marks in the H. sapiens genome. Significant correlations between transcription start sites on the forward and the reverse strands were observed in genomes of D. melanogaster, Caenorhabditis elegans, Mus musculus, H. sapiens, and Danio rerio. The observed correlations may be related to the regulation of gene expression in eukaryotes. Genome Track Analyzer is freely available at http://ancorr.eimb.ru/. © The Author 2015. Published by Oxford University Press on behalf of Kazusa DNA Research Institute.

  7. Comparative genomics of multidrug resistance in Acinetobacter baumannii.

    PubMed

    Fournier, Pierre-Edouard; Vallenet, David; Barbe, Valérie; Audic, Stéphane; Ogata, Hiroyuki; Poirel, Laurent; Richet, Hervé; Robert, Catherine; Mangenot, Sophie; Abergel, Chantal; Nordmann, Patrice; Weissenbach, Jean; Raoult, Didier; Claverie, Jean-Michel

    2006-01-01

    Acinetobacter baumannii is a species of nonfermentative gram-negative bacteria commonly found in water and soil. This organism was susceptible to most antibiotics in the 1970s. It has now become a major cause of hospital-acquired infections worldwide due to its remarkable propensity to rapidly acquire resistance determinants to a wide range of antibacterial agents. Here we use a comparative genomic approach to identify the complete repertoire of resistance genes exhibited by the multidrug-resistant A. baumannii strain AYE, which is epidemic in France, as well as to investigate the mechanisms of their acquisition by comparison with the fully susceptible A. baumannii strain SDF, which is associated with human body lice. The assembly of the whole shotgun genome sequences of the strains AYE and SDF gave an estimated size of 3.9 and 3.2 Mb, respectively. A. baumannii strain AYE exhibits an 86-kb genomic region termed a resistance island--the largest identified to date--in which 45 resistance genes are clustered. At the homologous location, the SDF strain exhibits a 20 kb-genomic island flanked by transposases but devoid of resistance markers. Such a switching genomic structure might be a hotspot that could explain the rapid acquisition of resistance markers under antimicrobial pressure. Sequence similarity and phylogenetic analyses confirm that most of the resistance genes found in the A. baumannii strain AYE have been recently acquired from bacteria of the genera Pseudomonas, Salmonella, or Escherichia. This study also resulted in the discovery of 19 new putative resistance genes. Whole-genome sequencing appears to be a fast and efficient approach to the exhaustive identification of resistance genes in epidemic infectious agents of clinical significance.

  8. Genome-wide inference of transcription factor-DNA binding specificity in cell regeneration using a combination strategy.

    PubMed

    Wang, Xiaofeng; Zhang, Aiqun; Ren, Weizheng; Chen, Caiyu; Dong, Jiahong

    2012-11-01

    The cell growth, development, and regeneration of tissue and organ are associated with a large number of gene regulation events, which are mediated in part by transcription factors (TFs) binding to cis-regulatory elements involved in the genome. Predicting the binding affinity and inferring the binding specificity of TF-DNA interactions at the genomic level would be fundamentally helpful for our understanding of the molecular mechanism and biological implication underlying sequence-specific TF-DNA recognition. In this study, we report the development of a combination method to characterize the interaction behavior of a 11-mer oligonucleotide segment and its mutations with the Gcn4p protein, a homodimeric, basic leucine zipper TF, and to predict the binding affinity and specificity of potential Gcn4p binders in the genome-wide scale. In this procedure, a position-mutated energy matrix is created based on molecular modeling analysis of native and mutated Gcn4p-DNA complex structures to describe the position-independent interaction energy profile of Gcn4p with different nucleotide types at each position of the oligonucleotide, and the energy terms extracted from the matrix and their interactives are then correlated with experimentally measured affinities of 19268 distinct oligonucleotides using statistical modeling methodology. Subsequently, the best one of built regression models is successfully applied to screen those of potential high-affinity Gcn4p binders from the complete genome. The findings arising from this study are briefly listed below: (i) The 11 positions of oligonucleotides are highly interactive and non-additive in contribution to Gcn4p-DNA binding affinity; (ii) Indirect conformational effects upon nucleotide mutations as well as associated subtle changes in interfacial atomic contacts, but not the direct nonbonded interactions, are primarily responsible for the sequence-specific recognition; (iii) The intrinsic synergistic effects among the sequence

  9. ZikaVR: An Integrated Zika Virus Resource for Genomics, Proteomics, Phylogenetic and Therapeutic Analysis

    PubMed Central

    Gupta, Amit Kumar; Kaur, Karambir; Rajput, Akanksha; Dhanda, Sandeep Kumar; Sehgal, Manika; Khan, Md. Shoaib; Monga, Isha; Dar, Showkat Ahmad; Singh, Sandeep; Nagpal, Gandharva; Usmani, Salman Sadullah; Thakur, Anamika; Kaur, Gazaldeep; Sharma, Shivangi; Bhardwaj, Aman; Qureshi, Abid; Raghava, Gajendra Pal Singh; Kumar, Manoj

    2016-01-01

    Current Zika virus (ZIKV) outbreaks that spread in several areas of Africa, Southeast Asia, and in pacific islands is declared as a global health emergency by World Health Organization (WHO). It causes Zika fever and illness ranging from severe autoimmune to neurological complications in humans. To facilitate research on this virus, we have developed an integrative multi-omics platform; ZikaVR (http://bioinfo.imtech.res.in/manojk/zikavr/), dedicated to the ZIKV genomic, proteomic and therapeutic knowledge. It comprises of whole genome sequences, their respective functional information regarding proteins, genes, and structural content. Additionally, it also delivers sophisticated analysis such as whole-genome alignments, conservation and variation, CpG islands, codon context, usage bias and phylogenetic inferences at whole genome and proteome level with user-friendly visual environment. Further, glycosylation sites and molecular diagnostic primers were also analyzed. Most importantly, we also proposed potential therapeutically imperative constituents namely vaccine epitopes, siRNAs, miRNAs, sgRNAs and repurposing drug candidates. PMID:27633273

  10. Draft Genome Sequence of Pseudomonas sp. Strain B1, Isolated from a Contaminated Sediment

    DOE Office of Scientific and Technical Information (OSTI.GOV)

    Pathak, Ashish; Jaswal, Rajneesh; Stothard, Paul

    ABSTRACT The draft genome sequence of Pseudomonas sp. strain B1, isolated from a contaminated soil, is reported. The genome comprises 6,706,934 bases, 6,059 coding sequences, and 70 RNAs and has a G+C content of 60.3%. A suite of biodegradative genes, many located on genomic islands, were identified from strain B1, further enhancing our understanding of the versatile pseudomonads.

  11. Draft Genome Sequence of Pseudomonas sp. Strain B1, Isolated from a Contaminated Sediment

    DOE PAGES

    Pathak, Ashish; Jaswal, Rajneesh; Stothard, Paul; ...

    2018-06-21

    ABSTRACT The draft genome sequence of Pseudomonas sp. strain B1, isolated from a contaminated soil, is reported. The genome comprises 6,706,934 bases, 6,059 coding sequences, and 70 RNAs and has a G+C content of 60.3%. A suite of biodegradative genes, many located on genomic islands, were identified from strain B1, further enhancing our understanding of the versatile pseudomonads.

  12. The genomic ancestry, landscape genetics and invasion history of introduced mice in New Zealand

    PubMed Central

    Russell, James C.; King, Carolyn M.

    2018-01-01

    The house mouse (Mus musculus) provides a fascinating system for studying both the genomic basis of reproductive isolation, and the patterns of human-mediated dispersal. New Zealand has a complex history of mouse invasions, and the living descendants of these invaders have genetic ancestry from all three subspecies, although most are primarily descended from M. m. domesticus. We used the GigaMUGA genotyping array (approximately 135 000 loci) to describe the genomic ancestry of 161 mice, sampled from 34 locations from across New Zealand (and one Australian city—Sydney). Of these, two populations, one in the south of the South Island, and one on Chatham Island, showed complete mitochondrial lineage capture, featuring two different lineages of M. m. castaneus mitochondrial DNA but with only M. m. domesticus nuclear ancestry detectable. Mice in the northern and southern parts of the North Island had small traces (approx. 2–3%) of M. m. castaneus nuclear ancestry, and mice in the upper South Island had approximately 7–8% M. m. musculus nuclear ancestry including some Y-chromosomal ancestry—though no detectable M. m. musculus mitochondrial ancestry. This is the most thorough genomic study of introduced populations of house mice yet conducted, and will have relevance to studies of the isolation mechanisms separating subspecies of mice. PMID:29410804

  13. Ancestry-specific and sex-specific risk alleles identified in a genome-wide gene-by-alcohol dependence interaction study of risky sexual behaviors.

    PubMed

    Polimanti, Renato; Zhao, Hongyu; Farrer, Lindsay A; Kranzler, Henry R; Gelernter, Joel

    2017-12-01

    We previously mapped loci for the genome-wide association studies (GWAS) and genome-wide gene-by-alcohol dependence interaction (GW-GxAD) analyses of risky sexual behaviors (RSB). This study extends those findings by analyzing the ancestry- and sex-specific AD-stratified effects on RSB. We examined the concordance of findings for the AD-stratified GWAS and the GW-GxAD analysis of RSB, with concordance defined as genome-wide significance in one analysis and at least nominal significance in the second analysis. A total of 2,173 African-American (AA) and 1,751 European-American (EA) subjects were investigated. Information regarding RSB (lifetime experiences of unprotected sex and multiple sexual partners) and DSM-IV diagnosis of lifetime AD were derived from the Semi-Structured Assessment for Drug Dependence and Alcoholism (SSADDA). In our ancestry- and sex-specific analyses, we identified four independent genome-wide significant (GWS) loci (p < 5*10 -8 ) and one suggestive locus (p < 6*10 -8 ). In men, we observed a GWS signal in FAM162A (rs2002594, p = 4.96*10 -8 ). In women, there was a suggestive locus in PLGRKT (rs3824435, p = 5.52*10 -8 ). In AAs, there was a GWS signal in GRK5 (rs1316543, p = 1.25*10 -9 ). In AA men, we observed an intergenic GWS signal (rs12898370, p = 4.49*10 -8 ) near LINGO1. In EA men, there was a GWS signal in CCSER1 (rs62313897; p = 7.93*10 -10 ). The loci identified in this GWAS implicate molecular mechanisms related to psychiatric illness and personality features, suggesting that the interplay between AD and RSB is mediated by alleles associated with behavioral traits. © 2017 Wiley Periodicals, Inc.

  14. Genome-wide Association Study Identifies African-Specific Susceptibility Loci in African Americans with Inflammatory Bowel Disease

    PubMed Central

    Brant, Steven R.; Okou, David T.; Simpson, Claire L.; Cutler, David J.; Haritunians, Talin; Bradfield, Jonathan P.; Chopra, Pankaj; Prince, Jarod; Begum, Ferdouse; Kumar, Archana; Huang, Chengrui; Venkateswaran, Suresh; Datta, Lisa W.; Wei, Zhi; Thomas, Kelly; Herrinton, Lisa J.; Klapproth, Jan-Micheal A.; Quiros, Antonio J.; Seminerio, Jenifer; Liu, Zhenqiu; Alexander, Jonathan S.; Baldassano, Robert N.; Dudley-Brown, Sharon; Cross, Raymond K.; Dassopoulos, Themistocles; Denson, Lee A.; Dhere, Tanvi A.; Dryden, Gerald W.; Hanson, John S.; Hou, Jason K.; Hussain, Sunny Z.; Hyams, Jeffrey S.; Isaacs, Kim L.; Kader, Howard; Kappelman, Michael D.; Katz, Jeffry; Kellermayer, Richard; Kirschner, Barbara S.; Kuemmerle, John F.; Kwon, John H.; Lazarev, Mark; Li, Ellen; Mack, David; Mannon, Peter; Moulton, Dedrick E.; Newberry, Rodney D.; Osuntokun, Bankole O.; Patel, Ashish S.; Saeed, Shehzad A.; Targan, Stephan R.; Valentine, John F.; Wang, Ming-Hsi; Zonca, Martin; Rioux, John D.; Duerr, Richard H.; Silverberg, Mark S.; Cho, Judy H.; Hakonarson, Hakon; Zwick, Michael E.; McGovern, Dermot P.B.; Kugathasan, Subra

    2016-01-01

    Background & Aims The inflammatory bowel diseases (IBD) ulcerative colitis (UC) and Crohn’s disease (CD) cause significant morbidity and are increasing in prevalence among all populations, including African Americans. More than 200 susceptibility loci have been identified in populations of predominantly European ancestry, but few loci have been associated with IBD in other ethnicities. Methods We performed 2 high-density, genome-wide scans comprising 2345 cases of African Americans with IBD (1646 with CD, 583 with UC, and 116 inflammatory bowel disease unclassified [IBD-U]) and 5002 individuals without IBD (controls, identified from the Health Retirement Study and Kaiser Permanente database). Single-nucleotide polymorphisms (SNPs) associated at P<5.0×10−8 in meta-analysis with a nominal evidence (P<.05) in each scan were considered to have genome-wide significance. Results We detected SNPs at HLA-DRB1, and African-specific SNPs at ZNF649 and LSAMP, with associations of genome-wide significance for UC. We detected SNPs at USP25 with associations of genome-wide significance associations for IBD. No associations of genome-wide significance were detected for CD. In addition, 9 genes previously associated with IBD contained SNPs with significant evidence for replication (P<1.6×10−6): ADCY3, CXCR6, HLA-DRB1 to HLA-DQA1 (genome-wide significance on conditioning), IL12B, PTGER4, and TNC for IBD; IL23R, PTGER4, and SNX20 (in strong linkage disequilibrium with NOD2) for CD; and KCNQ2 (near TNFRSF6B) for UC. Several of these genes, such as TNC (near TNFSF15), CXCR6, and genes associated with IBD at the HLA locus, contained SNPs with unique association patterns with African-specific alleles. Conclusions We performed a genome-wide association study of African Americans with IBD and identified loci associated with CD and UC in only this population; we also replicated loci identified in European populations. The detection of variants associated with IBD risk in only

  15. Genome-Wide Association Study Identifies African-Specific Susceptibility Loci in African Americans With Inflammatory Bowel Disease.

    PubMed

    Brant, Steven R; Okou, David T; Simpson, Claire L; Cutler, David J; Haritunians, Talin; Bradfield, Jonathan P; Chopra, Pankaj; Prince, Jarod; Begum, Ferdouse; Kumar, Archana; Huang, Chengrui; Venkateswaran, Suresh; Datta, Lisa W; Wei, Zhi; Thomas, Kelly; Herrinton, Lisa J; Klapproth, Jan-Micheal A; Quiros, Antonio J; Seminerio, Jenifer; Liu, Zhenqiu; Alexander, Jonathan S; Baldassano, Robert N; Dudley-Brown, Sharon; Cross, Raymond K; Dassopoulos, Themistocles; Denson, Lee A; Dhere, Tanvi A; Dryden, Gerald W; Hanson, John S; Hou, Jason K; Hussain, Sunny Z; Hyams, Jeffrey S; Isaacs, Kim L; Kader, Howard; Kappelman, Michael D; Katz, Jeffry; Kellermayer, Richard; Kirschner, Barbara S; Kuemmerle, John F; Kwon, John H; Lazarev, Mark; Li, Ellen; Mack, David; Mannon, Peter; Moulton, Dedrick E; Newberry, Rodney D; Osuntokun, Bankole O; Patel, Ashish S; Saeed, Shehzad A; Targan, Stephan R; Valentine, John F; Wang, Ming-Hsi; Zonca, Martin; Rioux, John D; Duerr, Richard H; Silverberg, Mark S; Cho, Judy H; Hakonarson, Hakon; Zwick, Michael E; McGovern, Dermot P B; Kugathasan, Subra

    2017-01-01

    The inflammatory bowel diseases (IBD) ulcerative colitis (UC) and Crohn's disease (CD) cause significant morbidity and are increasing in prevalence among all populations, including African Americans. More than 200 susceptibility loci have been identified in populations of predominantly European ancestry, but few loci have been associated with IBD in other ethnicities. We performed 2 high-density, genome-wide scans comprising 2345 cases of African Americans with IBD (1646 with CD, 583 with UC, and 116 inflammatory bowel disease unclassified) and 5002 individuals without IBD (controls, identified from the Health Retirement Study and Kaiser Permanente database). Single-nucleotide polymorphisms (SNPs) associated at P < 5.0 × 10 -8 in meta-analysis with a nominal evidence (P < .05) in each scan were considered to have genome-wide significance. We detected SNPs at HLA-DRB1, and African-specific SNPs at ZNF649 and LSAMP, with associations of genome-wide significance for UC. We detected SNPs at USP25 with associations of genome-wide significance for IBD. No associations of genome-wide significance were detected for CD. In addition, 9 genes previously associated with IBD contained SNPs with significant evidence for replication (P < 1.6 × 10 -6 ): ADCY3, CXCR6, HLA-DRB1 to HLA-DQA1 (genome-wide significance on conditioning), IL12B,PTGER4, and TNC for IBD; IL23R, PTGER4, and SNX20 (in strong linkage disequilibrium with NOD2) for CD; and KCNQ2 (near TNFRSF6B) for UC. Several of these genes, such as TNC (near TNFSF15), CXCR6, and genes associated with IBD at the HLA locus, contained SNPs with unique association patterns with African-specific alleles. We performed a genome-wide association study of African Americans with IBD and identified loci associated with UC in only this population; we also replicated IBD, CD, and UC loci identified in European populations. The detection of variants associated with IBD risk in only people of African descent demonstrates the

  16. Genome-wide Analyses of the Structural Gene Families Involved in the Legume-specific 5-Deoxyisoflavonoid Biosynthesis of Lotus japonicus

    PubMed Central

    Shimada, Norimoto; Sato, Shusei; Akashi, Tomoyoshi; Nakamura, Yasukazu; Tabata, Satoshi; Ayabe, Shin-ichi; Aoki, Toshio

    2007-01-01

    Abstract A model legume Lotus japonicus (Regel) K. Larsen is one of the subjects of genome sequencing and functional genomics programs. In the course of targeted approaches to the legume genomics, we analyzed the genes encoding enzymes involved in the biosynthesis of the legume-specific 5-deoxyisoflavonoid of L. japonicus, which produces isoflavan phytoalexins on elicitor treatment. The paralogous biosynthetic genes were assigned as comprehensively as possible by biochemical experiments, similarity searches, comparison of the gene structures, and phylogenetic analyses. Among the 10 biosynthetic genes investigated, six comprise multigene families, and in many cases they form gene clusters in the chromosomes. Semi-quantitative reverse transcriptase–PCR analyses showed coordinate up-regulation of most of the genes during phytoalexin induction and complex accumulation patterns of the transcripts in different organs. Some paralogous genes exhibited similar expression specificities, suggesting their genetic redundancy. The molecular evolution of the biosynthetic genes is discussed. The results presented here provide reliable annotations of the genes and genetic markers for comparative and functional genomics of leguminous plants. PMID:17452423

  17. Distinct p53 genomic binding patterns in normal and cancer-derived human cells

    PubMed Central

    McCorkle, Sean R; McCombie, WR; Dunn, John J

    2011-01-01

    Here, we report genome-wide analysis of the tumor suppressor p53 binding sites in normal human cells. 743 high-confidence ChIP-seq peaks representing putative genomic binding sites were identified in normal IMR90 fibroblasts using a reference chromatin sample. More than 40% were located within 2 kb of a transcription start site (TSS), a distribution similar to that documented for individually studied, functional p53 binding sites and, to date, not observed by previous p53 genome-wide studies. Nearly half of the high-confidence binding sites in the IMR90 cells reside in CpG islands in marked contrast to sites reported in cancer-derived cells. The distinct genomic features of the IMR90 binding sites do not reflect a distinct preference for specific sequences, since the de novo developed p53 motif based on our study is similar to those reported by genome-wide studies of cancer cells. More likely, the different chromatin landscape in normal, compared with cancer-derived cells, influences p53 binding via modulating availability of the sites. We compared the IMR90 ChIP-seq peaks to the recently published IMR90 methylome1 and demonstrated that they are enriched at hypomethylated DNA. Our study represents the first genome-wide, de novo mapping of p53 binding sites in normal human cells and reveals that p53 binding sites reside in distinct genomic landscapes in normal and cancer-derived human cells. PMID:22127205

  18. The Global Genome Biodiversity Network (GGBN) Data Standard specification

    PubMed Central

    Droege, G.; Barker, K.; Seberg, O.; Coddington, J.; Benson, E.; Berendsohn, W. G.; Bunk, B.; Butler, C.; Cawsey, E. M.; Deck, J.; Döring, M.; Flemons, P.; Gemeinholzer, B.; Güntsch, A.; Hollowell, T.; Kelbert, P.; Kostadinov, I.; Kottmann, R.; Lawlor, R. T.; Lyal, C.; Mackenzie-Dodds, J.; Meyer, C.; Mulcahy, D.; Nussbeck, S. Y.; O'Tuama, É.; Orrell, T.; Petersen, G.; Robertson, T.; Söhngen, C.; Whitacre, J.; Wieczorek, J.; Yilmaz, P.; Zetzsche, H.; Zhang, Y.; Zhou, X.

    2016-01-01

    Genomic samples of non-model organisms are becoming increasingly important in a broad range of studies from developmental biology, biodiversity analyses, to conservation. Genomic sample definition, description, quality, voucher information and metadata all need to be digitized and disseminated across scientific communities. This information needs to be concise and consistent in today’s ever-increasing bioinformatic era, for complementary data aggregators to easily map databases to one another. In order to facilitate exchange of information on genomic samples and their derived data, the Global Genome Biodiversity Network (GGBN) Data Standard is intended to provide a platform based on a documented agreement to promote the efficient sharing and usage of genomic sample material and associated specimen information in a consistent way. The new data standard presented here build upon existing standards commonly used within the community extending them with the capability to exchange data on tissue, environmental and DNA sample as well as sequences. The GGBN Data Standard will reveal and democratize the hidden contents of biodiversity biobanks, for the convenience of everyone in the wider biobanking community. Technical tools exist for data providers to easily map their databases to the standard. Database URL: http://terms.tdwg.org/wiki/GGBN_Data_Standard PMID:27694206

  19. The Global Genome Biodiversity Network (GGBN) Data Standard specification.

    PubMed

    Droege, G; Barker, K; Seberg, O; Coddington, J; Benson, E; Berendsohn, W G; Bunk, B; Butler, C; Cawsey, E M; Deck, J; Döring, M; Flemons, P; Gemeinholzer, B; Güntsch, A; Hollowell, T; Kelbert, P; Kostadinov, I; Kottmann, R; Lawlor, R T; Lyal, C; Mackenzie-Dodds, J; Meyer, C; Mulcahy, D; Nussbeck, S Y; O'Tuama, É; Orrell, T; Petersen, G; Robertson, T; Söhngen, C; Whitacre, J; Wieczorek, J; Yilmaz, P; Zetzsche, H; Zhang, Y; Zhou, X

    2016-01-01

    Genomic samples of non-model organisms are becoming increasingly important in a broad range of studies from developmental biology, biodiversity analyses, to conservation. Genomic sample definition, description, quality, voucher information and metadata all need to be digitized and disseminated across scientific communities. This information needs to be concise and consistent in today's ever-increasing bioinformatic era, for complementary data aggregators to easily map databases to one another. In order to facilitate exchange of information on genomic samples and their derived data, the Global Genome Biodiversity Network (GGBN) Data Standard is intended to provide a platform based on a documented agreement to promote the efficient sharing and usage of genomic sample material and associated specimen information in a consistent way. The new data standard presented here build upon existing standards commonly used within the community extending them with the capability to exchange data on tissue, environmental and DNA sample as well as sequences. The GGBN Data Standard will reveal and democratize the hidden contents of biodiversity biobanks, for the convenience of everyone in the wider biobanking community. Technical tools exist for data providers to easily map their databases to the standard.Database URL: http://terms.tdwg.org/wiki/GGBN_Data_Standard. © The Author(s) 2016. Published by Oxford University Press.

  20. The genome sequence of Sea-Island cotton (Gossypium barbadense) provides insights into the allopolyploidization and development of superior spinnable fibres

    PubMed Central

    Yuan, Daojun; Tang, Zhonghui; Wang, Maojun; Gao, Wenhui; Tu, Lili; Jin, Xin; Chen, Lingling; He, Yonghui; Zhang, Lin; Zhu, Longfu; Li, Yang; Liang, Qiqi; Lin, Zhongxu; Yang, Xiyan; Liu, Nian; Jin, Shuangxia; Lei, Yang; Ding, Yuanhao; Li, Guoliang; Ruan, Xiaoan; Ruan, Yijun; Zhang, Xianlong

    2015-01-01

    Gossypium hirsutum contributes the most production of cotton fibre, but G. barbadense is valued for its better comprehensive resistance and superior fibre properties. However, the allotetraploid genome of G. barbadense has not been comprehensively analysed. Here we present a high-quality assembly of the 2.57 gigabase genome of G. barbadense, including 80,876 protein-coding genes. The double-sized genome of the A (or At) (1.50 Gb) against D (or Dt) (853 Mb) primarily resulted from the expansion of Gypsy elements, including Peabody and Retrosat2 subclades in the Del clade, and the Athila subclade in the Athila/Tat clade. Substantial gene expansion and contraction were observed and rich homoeologous gene pairs with biased expression patterns were identified, suggesting abundant gene sub-functionalization occurred by allopolyploidization. More specifically, the CesA gene family has adapted differentially temporal expression patterns, suggesting an integrated regulatory mechanism of CesA genes from At and Dt subgenomes for the primary and secondary cellulose biosynthesis of cotton fibre in a “relay race”-like fashion. We anticipate that the G. barbadense genome sequence will advance our understanding the mechanism of genome polyploidization and underpin genome-wide comparison research in this genus. PMID:26634818

  1. Fundamental differences in promoter CpG island DNA hypermethylation between human cancer and genetically engineered mouse models of cancer.

    PubMed

    Diede, Scott J; Yao, Zizhen; Keyes, C Chip; Tyler, Ashlee E; Dey, Joyoti; Hackett, Christopher S; Elsaesser, Katrina; Kemp, Christopher J; Neiman, Paul E; Weiss, William A; Olson, James M; Tapscott, Stephen J

    2013-12-01

    Genetic and epigenetic alterations are essential for the initiation and progression of human cancer. We previously reported that primary human medulloblastomas showed extensive cancer-specific CpG island DNA hypermethylation in critical developmental pathways. To determine whether genetically engineered mouse models (GEMMs) of medulloblastoma have comparable epigenetic changes, we assessed genome-wide DNA methylation in three mouse models of medulloblastoma. In contrast to human samples, very few loci with cancer-specific DNA hypermethylation were detected, and in almost all cases the degree of methylation was relatively modest compared with the dense hypermethylation in the human cancers. To determine if this finding was common to other GEMMs, we examined a Burkitt lymphoma and breast cancer model and did not detect promoter CpG island DNA hypermethylation, suggesting that human cancers and at least some GEMMs are fundamentally different with respect to this epigenetic modification. These findings provide an opportunity to both better understand the mechanism of aberrant DNA methylation in human cancer and construct better GEMMs to serve as preclinical platforms for therapy development.

  2. A new species of iguana Brachylophus Cuvier 1829 (Sauria: Iguania: Iguanidae) from Gau Island, Fiji Islands

    USGS Publications Warehouse

    Fisher, Robert N.; Niukula, Jone; Watling, Dick; Harlow, Peter S.

    2017-01-01

    The south Pacific iguanas (Brachylophus) currently have three recognized living species in Fiji.  Recent surveys have uncovered more specific variation (morphological and genetic) within the genus and have better defined the geographic ranges of the named species.  One of these recent discoveries is a strikingly different iguana from all other island populations in Fiji which is restricted to Gau Island of the Lomaiviti Province.  Gau is the fifth largest island in Fiji and maintains excellent upland forests in the higher elevations.  We describe this population from Gau Island as a new species, Brachylophus gau sp. nov., in recognition of its type locality.

  3. Genomics-Enabled Next-Generation Breeding Approaches for Developing System-Specific Drought Tolerant Hybrids in Maize

    PubMed Central

    Nepolean, Thirunavukkarsau; Kaul, Jyoti; Mukri, Ganapati; Mittal, Shikha

    2018-01-01

    Breeding science has immensely contributed to the global food security. Several varieties and hybrids in different food crops including maize have been released through conventional breeding. The ever growing population, decreasing agricultural land, lowering water table, changing climate, and other variables pose tremendous challenge to the researchers to improve the production and productivity of food crops. Drought is one of the major problems to sustain and improve the productivity of food crops including maize in tropical and subtropical production systems. With advent of novel genomics and breeding tools, the way of doing breeding has been tremendously changed in the last two decades. Drought tolerance is a combination of several component traits with a quantitative mode of inheritance. Rapid DNA and RNA sequencing tools and high-throughput SNP genotyping techniques, trait mapping, functional characterization, genomic selection, rapid generation advancement, and other tools are now available to understand the genetics of drought tolerance and to accelerate the breeding cycle. Informatics play complementary role by managing the big-data generated from the large-scale genomics and breeding experiments. Genome editing is the latest technique to alter specific genes to improve the trait expression. Integration of novel genomics, next-generation breeding, and informatics tools will accelerate the stress breeding process and increase the genetic gain under different production systems. PMID:29696027

  4. Development of genomic tools in a widespread tropical tree, Symphonia globulifera L.f.: a new low-coverage draft genome, SNP and SSR markers.

    PubMed

    Olsson, Sanna; Seoane-Zonjic, Pedro; Bautista, Rocío; Claros, M Gonzalo; González-Martínez, Santiago C; Scotti, Ivan; Scotti-Saintagne, Caroline; Hardy, Olivier J; Heuertz, Myriam

    2017-07-01

    Population genetic studies in tropical plants are often challenging because of limited information on taxonomy, phylogenetic relationships and distribution ranges, scarce genomic information and logistic challenges in sampling. We describe a strategy to develop robust and widely applicable genetic markers based on a modest development of genomic resources in the ancient tropical tree species Symphonia globulifera L.f. (Clusiaceae), a keystone species in African and Neotropical rainforests. We provide the first low-coverage (11X) fragmented draft genome sequenced on an individual from Cameroon, covering 1.027 Gbp or 67.5% of the estimated genome size. Annotation of 565 scaffolds (7.57 Mbp) resulted in the prediction of 1046 putative genes (231 of them containing a complete open reading frame) and 1523 exact simple sequence repeats (SSRs, microsatellites). Aligning a published transcriptome of a French Guiana population against this draft genome produced 923 high-quality single nucleotide polymorphisms. We also preselected genic SSRs in silico that were conserved and polymorphic across a wide geographical range, thus reducing marker development tests on rare DNA samples. Of 23 SSRs tested, 19 amplified and 18 were successfully genotyped in four S. globulifera populations from South America (Brazil and French Guiana) and Africa (Cameroon and São Tomé island, F ST  = 0.34). Most loci showed only population-specific deviations from Hardy-Weinberg proportions, pointing to local population effects (e.g. null alleles). The described genomic resources are valuable for evolutionary studies in Symphonia and for comparative studies in plants. The methods are especially interesting for widespread tropical or endangered taxa with limited DNA availability. © 2016 John Wiley & Sons Ltd.

  5. An homolog of the Frz Phosphoenolpyruvate:carbohydrate phosphoTransferase System of extraintestinal pathogenic Escherichia coli is encoded on a genomic island in specific lineages of Streptococcus agalactiae.

    PubMed

    Patron, Kévin; Gilot, Philippe; Camiade, Emilie; Mereghetti, Laurent

    2015-06-01

    We identified a Streptococcus agalactiae metabolic region (fru2) coding for a Phosphoenolpyruvate:carbohydrate phosphoTransferase System (PTS) homologous to the Frz system of extraintestinal pathogenic Escherichia coli strains. The Frz system is involved in environmental sensing and regulation of the expression of adaptation and virulence genes in E. coli. The S. agalactiae fru2 region codes three subunits of a PTS transporter of the fructose-mannitol family, a transcriptional activator of PTSs of the MtlR family, an allulose-6 phosphate-3-epimerase, a transaldolase and a transketolase. We demonstrated that all these genes form an operon. The fru2 operon is present in a 17494-bp genomic island. We analyzed by multilocus sequence typing a population of 492 strains representative of the S. agalactiae population and we showed that the presence of the fru2 operon is linked to the phylogeny of S. agalactiae. The fru2 operon is always present within strains of clonal complexes CC 1, CC 7, CC 10, CC 283 and singletons ST 130 and ST 288, but never found in other CCs and STs. Our results indicate that the fru2 operon was acquired during the evolution of the S. agalactiae species from a common ancestor before the divergence of CC 1, CC 7, CC 10, CC 283, ST 130 and ST 288. As S. agalactiae strains of CC 1 and CC 10 are frequently isolated from adults with invasive disease, we hypothesize that the S. agalactiae Fru2 system senses the environment to allow the bacterium to adapt to new conditions encountered during the infection of adults. Copyright © 2015 Elsevier B.V. All rights reserved.

  6. Educational Challenge to the Island States

    ERIC Educational Resources Information Center

    Saemala, Francis J.

    1973-01-01

    Argues that educational developments in the South Pacific island communities have been such that education itself has become a war against the people's cultural enrichment, and proposes a possible strategy as an initial step towards reorienting them; the discussion focuses specifically on the Solomon Islands scene. (Author/JM)

  7. Genomic diversity and versatility of Lactobacillus plantarum, a natural metabolic engineer

    PubMed Central

    2011-01-01

    In the past decade it has become clear that the lactic acid bacterium Lactobacillus plantarum occupies a diverse range of environmental niches and has an enormous diversity in phenotypic properties, metabolic capacity and industrial applications. In this review, we describe how genome sequencing, comparative genome hybridization and comparative genomics has provided insight into the underlying genomic diversity and versatility of L. plantarum. One of the main features appears to be genomic life-style islands consisting of numerous functional gene cassettes, in particular for carbohydrates utilization, which can be acquired, shuffled, substituted or deleted in response to niche requirements. In this sense, L. plantarum can be considered a “natural metabolic engineer”. PMID:21995294

  8. Genome Sequence of the Shiga Toxin-Producing Escherichia coli Strain NCCP15657

    PubMed Central

    Kim, Byung Kwon; Song, Geun Cheol; Hong, Gun Hyong; Seong, Won-Keun; Kim, Seon-Young; Jeong, Haeyoung; Kang, Sung Gyun; Kwon, Soon-Kyeong; Lee, Choong Hoon; Song, Ju Yeon; Yu, Dong Su; Park, Mi-Sun

    2012-01-01

    Shiga toxin-producing Escherichia coli causes bloody diarrhea and hemolytic-uremic syndrome and serious outbreaks worldwide. Here, we report the draft genome sequence of E. coli NCCP15657 isolated from a patient. The genome has virulence genes, many in the locus of enterocyte effacement (LEE) island, encoding a metalloprotease, the Shiga toxin, and constituents of type III secretion. PMID:22740674

  9. GenomeFingerprinter: the genome fingerprint and the universal genome fingerprint analysis for systematic comparative genomics.

    PubMed

    Ai, Yuncan; Ai, Hannan; Meng, Fanmei; Zhao, Lei

    2013-01-01

    No attention has been paid on comparing a set of genome sequences crossing genetic components and biological categories with far divergence over large size range. We define it as the systematic comparative genomics and aim to develop the methodology. First, we create a method, GenomeFingerprinter, to unambiguously produce a set of three-dimensional coordinates from a sequence, followed by one three-dimensional plot and six two-dimensional trajectory projections, to illustrate the genome fingerprint of a given genome sequence. Second, we develop a set of concepts and tools, and thereby establish a method called the universal genome fingerprint analysis (UGFA). Particularly, we define the total genetic component configuration (TGCC) (including chromosome, plasmid, and phage) for describing a strain as a systematic unit, the universal genome fingerprint map (UGFM) of TGCC for differentiating strains as a universal system, and the systematic comparative genomics (SCG) for comparing a set of genomes crossing genetic components and biological categories. Third, we construct a method of quantitative analysis to compare two genomes by using the outcome dataset of genome fingerprint analysis. Specifically, we define the geometric center and its geometric mean for a given genome fingerprint map, followed by the Euclidean distance, the differentiate rate, and the weighted differentiate rate to quantitatively describe the difference between two genomes of comparison. Moreover, we demonstrate the applications through case studies on various genome sequences, giving tremendous insights into the critical issues in microbial genomics and taxonomy. We have created a method, GenomeFingerprinter, for rapidly computing, geometrically visualizing, intuitively comparing a set of genomes at genome fingerprint level, and hence established a method called the universal genome fingerprint analysis, as well as developed a method of quantitative analysis of the outcome dataset. These have set

  10. A genome-specific repetitive DNA sequence from Oryza eichingeri: characterization, localization, and introgression to O. sativa.

    PubMed

    Yan, H. H.; Liu, G. Q.; Cheng, Z. K.; Li, X. B.; Liu, G. Z.; Min, S. K.; Zhu, L.H.

    2002-02-01

    In the course of transferring the brown planthopper resistance from a diploid, CC-genome wild rice species, Oryza eichingeri (IRGC acc. 105159 and 105163), to the cultivated rice variety 02428, we have isolated many alien addition and introgression lines. The O. eichingeri chromatin in some of these lines has previously been identified using genomic in situ hybridization and molecular-marker analysis. Here we cloned a tandemly repetitive DNA sequence from O. eichingeri IRGC acc105163, and detected it in 25 introgression lines. This repetitive DNA sequence showed high specificity to the rice CC genome, but was absent from all the four tetraploid species with BBCC or CCDD genomes. The monomer in this repetitive DNA sequence is 325-366-bp long, with a copy number of about 5,000 per 1 C of the O. eichingerigenome, showing 88% homology to a repetitive DNA sequence isolated from Oryza officinalis(2n=2 x=24, CC). Fluorescent in situ hybridization revealed 11 signals distributed over eight O. eichingeri chromosomes, mostly in terminal or subterminal regions.

  11. Identifying Specific Genes Controlling Complex Traits Through A Genome-Wide Screen For cis-Acting Regulatory Elements - An Example Using Marek's Disease

    USDA-ARS?s Scientific Manuscript database

    The identification of specific genes underlying phenotypic variation of complex traits remains one of the greatest challenges in biology despite having genome sequences and more powerful tools. Most genome-wide screens lack sufficient resolving power as they typically depend on linkage. One altern...

  12. Spatial organization of the budding yeast genome in the cell nucleus and identification of specific chromatin interactions from multi-chromosome constrained chromatin model.

    PubMed

    Gürsoy, Gamze; Xu, Yun; Liang, Jie

    2017-07-01

    Nuclear landmarks and biochemical factors play important roles in the organization of the yeast genome. The interaction pattern of budding yeast as measured from genome-wide 3C studies are largely recapitulated by model polymer genomes subject to landmark constraints. However, the origin of inter-chromosomal interactions, specific roles of individual landmarks, and the roles of biochemical factors in yeast genome organization remain unclear. Here we describe a multi-chromosome constrained self-avoiding chromatin model (mC-SAC) to gain understanding of the budding yeast genome organization. With significantly improved sampling of genome structures, both intra- and inter-chromosomal interaction patterns from genome-wide 3C studies are accurately captured in our model at higher resolution than previous studies. We show that nuclear confinement is a key determinant of the intra-chromosomal interactions, and centromere tethering is responsible for the inter-chromosomal interactions. In addition, important genomic elements such as fragile sites and tRNA genes are found to be clustered spatially, largely due to centromere tethering. We uncovered previously unknown interactions that were not captured by genome-wide 3C studies, which are found to be enriched with tRNA genes, RNAPIII and TFIIS binding. Moreover, we identified specific high-frequency genome-wide 3C interactions that are unaccounted for by polymer effects under landmark constraints. These interactions are enriched with important genes and likely play biological roles.

  13. Genome build information is an essential part of genomic track files.

    PubMed

    Kanduri, Chakravarthi; Domanska, Diana; Hovig, Eivind; Sandve, Geir Kjetil

    2017-09-14

    Genomic locations are represented as coordinates on a specific genome build version, but the build information is frequently missing when coordinates are provided. We show that this information is essential to correctly interpret and analyse the genomic intervals contained in genomic track files. Although not a substitute for best practices, we also provide a tool to predict the genome build version of genomic track files.

  14. Localized Plasticity in the Streamlined Genomes of Vinyl Chloride Respiring Dehalococcoides

    DOE Office of Scientific and Technical Information (OSTI.GOV)

    McMurdie, Paul J.; Behrens, Sebastien F.; Muller, Jochen A.

    2009-06-30

    Vinyl chloride (VC) is a human carcinogen and widespread priority pollutant. Here we report the first, to our knowledge, complete genome sequences of microorganisms able to respire VC, Dehalococcoides sp. strains VS and BAV1. Notably, the respective VC reductase encoding genes, vcrAB and bvcAB, were found embedded in distinct genomic islands (GEIs) with different predicted integration sites, suggesting that these genes were acquired horizontally and independently by distinct mechanisms. A comparative analysis that included two previously sequenced Dehalococcoides genomes revealed a contextually conserved core that is interrupted by two high plasticity regions (HPRs) near the Ori. These HPRs contain themore » majority of GEIs and strain-specific genes identified in the four Dehalococcoides genomes, an elevated number of repeated elements including insertion sequences (IS), as well as 91 of 96 rdhAB, genes that putatively encode terminal reductases in organohalide respiration. Only three core rdhA orthologous groups were identified, and only one of these groups is supported by synteny. The low number of core rdhAB, contrasted with the high rdhAB numbers per genome (up to 36 in strain VS), as well as their colocalization with GEIs and other signatures for horizontal transfer, suggests that niche adaptation via organohalide respiration is a fundamental ecological strategy in Dehalococccoides. This adaptation has been exacted through multiple mechanisms of recombination that are mainly confined within HPRs of an otherwise remarkably stable, syntenic, streamlined genome among the smallest of any free-living microorganism.« less

  15. The Mouse-colored Tyrannulet (Phaeomyias murina) is a species complex that includes the Cocos Flycatcher (Nesotriccus ridgwayi), an island form that underwent a population bottleneck.

    PubMed

    Zucker, Marc R; Harvey, Michael G; Oswald, Jessica A; Cuervo, Andrés; Derryberry, Elizabeth; Brumfield, Robb T

    2016-08-01

    Simultaneous examination of evolutionary history in island forms and closely related mainland relatives can provide reciprocal insight into the evolution of island and mainland faunas. The Cocos Flycatcher (Nesotriccus ridgwayi) is a small tyrant flycatcher (Tyrannidae) endemic to Cocos Island, an oceanic island in the eastern Pacific Ocean. We first established its close relationship to the mainland species Mouse-colored Tyrannulet (Phaeomyias murina) using a phylogeny from genome-wide ultraconserved elements and exons. We then used mitochondrial DNA to explore the relationships between Nesotriccus and Phaeomyias populations from across its distribution in Central and South America. We found that Nesotriccus is nested within the Phaeomyias evolutionary tree, and that Phaeomyias represents a complex of at least four evolutionarily distinct species that differ in plumage, voice, and habitat association. Nesotriccus underwent a population bottleneck subsequent to its divergence from Central American and northern South American Phaeomyias populations in the middle Pleistocene. The 46 UCE loci containing alleles that are fixed between the two species are widely distributed across the genome, which suggests that selective or neutral processes responsible for divergence have occurred genome-wide. Overall, our simultaneous examination of Phaeomyias and Nesotriccus revealed divergent levels of genetic diversity and evolutionary histories between island and mainland forms. Copyright © 2016 Elsevier Inc. All rights reserved.

  16. Genome-Wide Specific Selection in Three Domestic Sheep Breeds.

    PubMed

    Wang, Huihua; Zhang, Li; Cao, Jiaxve; Wu, Mingming; Ma, Xiaomeng; Liu, Zhen; Liu, Ruizao; Zhao, Fuping; Wei, Caihong; Du, Lixin

    2015-01-01

    Commercial sheep raised for mutton grow faster than traditional Chinese sheep breeds. Here, we aimed to evaluate genetic selection among three different types of sheep breed: two well-known commercial mutton breeds and one indigenous Chinese breed. We first combined locus-specific branch lengths and di statistical methods to detect candidate regions targeted by selection in the three different populations. The results showed that the genetic distances reached at least medium divergence for each pairwise combination. We found these two methods were highly correlated, and identified many growth-related candidate genes undergoing artificial selection. For production traits, APOBR and FTO are associated with body mass index. For meat traits, ALDOA, STK32B and FAM190A are related to marbling. For reproduction traits, CCNB2 and SLC8A3 affect oocyte development. We also found two well-known genes, GHR (which affects meat production and quality) and EDAR (associated with hair thickness) were associated with German mutton merino sheep. Furthermore, four genes (POL, RPL7, MSL1 and SHISA9) were associated with pre-weaning gain in our previous genome-wide association study. Our results indicated that combine locus-specific branch lengths and di statistical approaches can reduce the searching ranges for specific selection. And we got many credible candidate genes which not only confirm the results of previous reports, but also provide a suite of novel candidate genes in defined breeds to guide hybridization breeding.

  17. Towards a Molecular Definition of Enterohemorrhagic Escherichia coli (EHEC): Detection of Genes Located on O Island 57 as Markers To Distinguish EHEC from Closely Related Enteropathogenic E. coli Strains

    PubMed Central

    Delannoy, Sabine; Beutin, Lothar

    2013-01-01

    Among strains of Shiga-toxin (Stx) producing Escherichia coli (STEC), seven serogroups (O26, O45, O103, O111, O121, O145, and O157) are associated with severe clinical illness in humans. These strains are also called enterohemorrhagic E. coli (EHEC), and the development of methods for their reliable detection from food has been challenging thus far. PCR detection of major EHEC virulence genes stx1, stx2, eae, and O-serogroup-specific genes is useful but does not identify EHEC strains specifically. Searching for the presence of additional genes issued from E. coli O157:H7 genomic islands OI-122 and OI-71 increases the specificity but does not clearly discriminate EHEC from enteropathogenic E. coli (EPEC) strains. Here, we identified two putative genes, called Z2098 and Z2099, from the genomic island OI-57 that were closely associated with EHEC and their stx-negative derivative strains (87% for Z2098 and 91% for Z2099). Z2098 and Z2099 were rarely found in EPEC (10% for Z2098 and 12% for Z2099), STEC (2 and 15%), and apathogenic E. coli (1% each) strains. Our findings indicate that Z2098 and Z2099 are useful genetic markers for a more targeted diagnosis of typical EHEC and new emerging EHEC strains. PMID:23325824

  18. Bifidobacterium animalis subsp. lactis ATCC 27673 Is a Genomically Unique Strain within Its Conserved Subspecies

    PubMed Central

    Loquasto, Joseph R.; Barrangou, Rodolphe; Dudley, Edward G.; Stahl, Buffy; Chen, Chun

    2013-01-01

    Many strains of Bifidobacterium animalis subsp. lactis are considered health-promoting probiotic microorganisms and are commonly formulated into fermented dairy foods. Analyses of previously sequenced genomes of B. animalis subsp. lactis have revealed little genetic diversity, suggesting that it is a monomorphic subspecies. However, during a multilocus sequence typing survey of Bifidobacterium, it was revealed that B. animalis subsp. lactis ATCC 27673 gave a profile distinct from that of the other strains of the subspecies. As part of an ongoing study designed to understand the genetic diversity of this subspecies, the genome of this strain was sequenced and compared to other sequenced genomes of B. animalis subsp. lactis and B. animalis subsp. animalis. The complete genome of ATCC 27673 was 1,963,012 bp, contained 1,616 genes and 4 rRNA operons, and had a G+C content of 61.55%. Comparative analyses revealed that the genome of ATCC 27673 contained six distinct genomic islands encoding 83 open reading frames not found in other strains of the same subspecies. In four islands, either phage or mobile genetic elements were identified. In island 6, a novel clustered regularly interspaced short palindromic repeat (CRISPR) locus which contained 81 unique spacers was identified. This type I-E CRISPR-cas system differs from the type I-C systems previously identified in this subspecies, representing the first identification of a different system in B. animalis subsp. lactis. This study revealed that ATCC 27673 is a strain of B. animalis subsp. lactis with novel genetic content and suggests that the lack of genetic variability observed is likely due to the repeated sequencing of a limited number of widely distributed commercial strains. PMID:23995933

  19. Analysis of illegitimate genomic integration mediated by zinc-finger nucleases: implications for specificity of targeted gene correction

    PubMed Central

    2010-01-01

    Background Formation of site specific genomic double strand breaks (DSBs), induced by the expression of a pair of engineered zinc-finger nucleases (ZFNs), dramatically increases the rates of homologous recombination (HR) between a specific genomic target and a donor plasmid. However, for the safe use of ZFN induced HR in practical applications, possible adverse effects of the technology such as cytotoxicity and genotoxicity need to be well understood. In this work, off-target activity of a pair of ZFNs has been examined by measuring the ratio between HR and illegitimate genomic integration in cells that are growing exponentially, and in cells that have been arrested in the G2/M phase. Results A reporter cell line that contained consensus ZFN binding sites in an enhanced green fluorescent protein (EGFP) reporter gene was used to measure ratios between HR and non-homologous integration of a plasmid template. Both in human cells (HEK 293) containing the consensus ZFN binding sites and in cells lacking the ZFN binding sites, a 3.5 fold increase in the level of illegitimate integration was observed upon ZFN expression. Since the reporter gene containing the consensus ZFN target sites was found to be intact in cells where illegitimate integration had occurred, increased rates of illegitimate integration most likely resulted from the formation of off-target genomic DSBs. Additionally, in a fraction of the ZFN treated cells the co-occurrence of both specific HR and illegitimate integration was observed. As a mean to minimize unspecific effects, cell cycle manipulation of the target cells by induction of a transient G2/M cell cycle arrest was shown to stimulate the activity of HR while having little effect on the levels of illegitimate integration, thus resulting in a nearly eight fold increase in the ratio between the two processes. Conclusions The demonstration that ZFN expression, in addition to stimulating specific gene targeting by HR, leads to increased rates of

  20. Tissue-Specific Transcriptomic Profiling of Sorghum propinquum using a Rice Genome Array

    PubMed Central

    Zhang, Ting; Zhao, Xiuqin; Huang, Liyu; Liu, Xiaoyue; Zong, Ying; Zhu, Linghua; Yang, Daichang; Fu, Binying

    2013-01-01

    Sorghum (Sorghum bicolor) is one of the world's most important cereal crops. S. propinquum is a perennial wild relative of S. bicolor with well-developed rhizomes. Functional genomics analysis of S. propinquum, especially with respect to molecular mechanisms related to rhizome growth and development, can contribute to the development of more sustainable grain, forage, and bioenergy cropping systems. In this study, we used a whole rice genome oligonucleotide microarray to obtain tissue-specific gene expression profiles of S. propinquum with special emphasis on rhizome development. A total of 548 tissue-enriched genes were detected, including 31 and 114 unique genes that were expressed predominantly in the rhizome tips (RT) and internodes (RI), respectively. Further GO analysis indicated that the functions of these tissue-enriched genes corresponded to their characteristic biological processes. A few distinct cis-elements, including ABA-responsive RY repeat CATGCA, sugar-repressive TTATCC, and GA-responsive TAACAA, were found to be prevalent in RT-enriched genes, implying an important role in rhizome growth and development. Comprehensive comparative analysis of these rhizome-enriched genes and rhizome-specific genes previously identified in Oryza longistaminata and S. propinquum indicated that phytohormones, including ABA, GA, and SA, are key regulators of gene expression during rhizome development. Co-localization of rhizome-enriched genes with rhizome-related QTLs in rice and sorghum generated functional candidates for future cloning of genes associated with rhizome growth and development. PMID:23536906

  1. The CpG island methylator phenotype (CIMP) in colorectal cancer.

    PubMed

    Nazemalhosseini Mojarad, Ehsan; Kuppen, Peter Jk; Aghdaei, Hamid Asadzadeh; Zali, Mohammad Reza

    2013-01-01

    It is clear that colorectal cancer (CRC) develops through multiple genetic and epigenetic pathways. These pathways may be determined on the basis of three molecular features: (i) mutations in DNA mismatch repair genes, leading to a DNA microsatellite instability (MSI) phenotype, (ii) mutations in APC and other genes that activate Wnt pathway, characterized by chromosomal instability (CIN) phenotype, and (iii) global genome hypermethylation, resulting in switch off of tumor suppressor genes, indicated as CpG island methylator phenotype (CIMP). Each of these pathways is characterized by specific pathological features, mechanisms of carcinogenesis and process of tumor development. The molecular aspects of these pathways have been used clinically in the diagnosis, screening and management of patients with colorectal cancer. In this review we especially describe various aspects of CIMP, one of the important and rather recently discovered pathways that lead to colorectal cancer.

  2. Response of Everglades tree islands to environmental change

    USGS Publications Warehouse

    Willard, Debra A.; Bernhardt, Christopher E.; Holmes, Charles W.; Landacre, Bryan; Marot, Marci E.

    2006-01-01

    Tree islands are centers of biodiversity within the Florida Everglades, USA, but the factors controlling their distribution, formation, and development are poorly understood. We use pollen assemblages from tree islands throughout the greater Everglades ecosystem to reconstruct the timing of tree island formation, patterns of development, and response to specific climatic and environmental stressors. These data indicate that fixed (teardrop-shaped) and strand tree islands developed well before substantial human alteration of the system, with initial tree island vegetation in place between 3500 and 500 calibrated years before present (cal yr BP), depending on the location in the Everglades wetland. Tree island development appears to have been triggered by regional- to global-scale climatic events at 2800 cal yr BP, 1600–1500 cal yr BP, 1200–1000 cal yr BP (early Medieval Warm Period), and 500–200 cal yr BP (Little Ice Age). These periods correspond to drought intervals documented in Central and South America and periods of southward displacement of the Intertropical Convergence Zone. The records indicate a coherence of climate patterns in both subtropical North America and the Northern Hemisphere Neotropics. Water management practices of the 20th century altered plant communities and size of tree islands throughout the Everglades. Responses range from loss of tree islands due to artificially long hydroperiods and deep water to expansion of tree islands after flow reductions. These data provide evidence for the rapidity of tree island response to specific hydrologic change and facilitate prediction of the response to future changes associated with Everglades restoration plans.

  3. Towards pathogenomics: a web-based resource for pathogenicity islands

    PubMed Central

    Yoon, Sung Ho; Park, Young-Kyu; Lee, Soohyun; Choi, Doil; Oh, Tae Kwang; Hur, Cheol-Goo; Kim, Jihyun F.

    2007-01-01

    Pathogenicity islands (PAIs) are genetic elements whose products are essential to the process of disease development. They have been horizontally (laterally) transferred from other microbes and are important in evolution of pathogenesis. In this study, a comprehensive database and search engines specialized for PAIs were established. The pathogenicity island database (PAIDB) is a comprehensive relational database of all the reported PAIs and potential PAI regions which were predicted by a method that combines feature-based analysis and similarity-based analysis. Also, using the PAI Finder search application, a multi-sequence query can be analyzed onsite for the presence of potential PAIs. As of April 2006, PAIDB contains 112 types of PAIs and 889 GenBank accessions containing either partial or all PAI loci previously reported in the literature, which are present in 497 strains of pathogenic bacteria. The database also offers 310 candidate PAIs predicted from 118 sequenced prokaryotic genomes. With the increasing number of prokaryotic genomes without functional inference and sequenced genetic regions of suspected involvement in diseases, this web-based, user-friendly resource has the potential to be of significant use in pathogenomics. PAIDB is freely accessible at . PMID:17090594

  4. Phage-inducible islands in the Gram-positive cocci.

    PubMed

    Martínez-Rubio, Roser; Quiles-Puchalt, Nuria; Martí, Miguel; Humphrey, Suzanne; Ram, Geeta; Smyth, Davida; Chen, John; Novick, Richard P; Penadés, José R

    2017-04-01

    The SaPIs are a cohesive subfamily of extremely common phage-inducible chromosomal islands (PICIs) that reside quiescently at specific att sites in the staphylococcal chromosome and are induced by helper phages to excise and replicate. They are usually packaged in small capsids composed of phage virion proteins, giving rise to very high transfer frequencies, which they enhance by interfering with helper phage reproduction. As the SaPIs represent a highly successful biological strategy, with many natural Staphylococcus aureus strains containing two or more, we assumed that similar elements would be widespread in the Gram-positive cocci. On the basis of resemblance to the paradigmatic SaPI genome, we have readily identified large cohesive families of similar elements in the lactococci and pneumococci/streptococci plus a few such elements in Enterococcus faecalis. Based on extensive ortholog analyses, we found that the PICI elements in the four different genera all represent distinct but parallel lineages, suggesting that they represent convergent evolution towards a highly successful lifestyle. We have characterized in depth the enterococcal element, EfCIV583, and have shown that it very closely resembles the SaPIs in functionality as well as in genome organization, setting the stage for expansion of the study of elements of this type. In summary, our findings greatly broaden the PICI family to include elements from at least three genera of cocci.

  5. Permanent Draft Genome of Strain ESFC-1: Ecological Genomics of a Newly Discovered Lineage of Filamentous Diazotrophic Cyanobacteria

    NASA Technical Reports Server (NTRS)

    Everroad, R. Craig; Stuart, Rhona K.; Bebout, Brad M.; Detweiler, Angela M.; Lee, Jackson Zan; Woebken, Dagmar; Bebout, Leslie E.; Pett-Ridge, Jennifer

    2016-01-01

    The nonheterocystous filamentous cyanobacterium, strain ESFC-1, is a recently described member of the order Oscillatoriales within the Cyanobacteria. ESFC-1 has been shown to be a major diazotroph in the intertidal microbial mat system at Elkhorn Slough, CA, USA. Based on phylogenetic analyses of the 16S RNA gene, ESFC-1 appears to belong to a unique, genus-level divergence; the draft genome sequence of this strain has now been determined. Here we report features of this genome as they relate to the ecological functions and capabilities of strain ESFC-1. The 5,632,035 bp genome sequence encodes 4914 protein-coding genes and 92 RNA genes. One striking feature of this cyanobacterium is the apparent lack of either uptake or bi-directional hydrogenases typically expected within a diazotroph. Additionally, a large genomic island is found that contains numerous low GC-content genes and genes related to extracellular polysaccharide production and cell wall synthesis and maintenance.

  6. Permanent draft genome of strain ESFC-1: ecological genomics of a newly discovered lineage of filamentous diazotrophic cyanobacteria

    DOE PAGES

    Everroad, R. Craig; Stuart, Rhona K.; Bebout, Brad M.; ...

    2016-08-24

    The nonheterocystous filamentous cyanobacterium, strain ESFC-1, is a recently described member of the order Oscillatoriales within the Cyanobacteria. ESFC-1 has been shown to be a major diazotroph in the intertidal microbial mat system at Elkhorn Slough, CA, USA. Based on phylogenetic analyses of the 16S RNA gene, ESFC-1 appears to belong to a unique, genus-level divergence; the draft genome sequence of this strain has now been determined. Here we report features of this genome as they relate to the ecological functions and capabilities of strain ESFC-1. The 5,632,035 bp genome sequence encodes 4914 protein-coding genes and 92 RNA genes. Onemore » striking feature of this cyanobacterium is the apparent lack of either uptake or bi-directional hydrogenases typically expected within a diazotroph. In addition, a large genomic island is found that contains numerous low GC-content genes and genes related to extracellular polysaccharide production and cell wall synthesis and maintenance.« less

  7. The autoinducer synthase LqsA and putative sensor kinase LqsS regulate phagocyte interactions, extracellular filaments and a genomic island of Legionella pneumophila.

    PubMed

    Tiaden, André; Spirig, Thomas; Sahr, Tobias; Wälti, Martin A; Boucke, Karin; Buchrieser, Carmen; Hilbi, Hubert

    2010-05-01

    The amoebae-resistant opportunistic pathogen Legionella pneumophila employs a biphasic life cycle to replicate in host cells and spread to new niches. Upon entering the stationary growth phase, the bacteria switch to a transmissive (virulent) state, which involves a complex regulatory network including the lqs gene cluster (lqsA-lqsR-hdeD-lqsS). LqsR is a putative response regulator that promotes host-pathogen interactions and represses replication. The autoinducer synthase LqsA catalyses the production of the diffusible signalling molecule 3-hydroxypentadecan-4-one (LAI-1) that is presumably recognized by the sensor kinase LqsS. Here, we analysed L. pneumophila strains lacking lqsA or lqsS. Compared with wild-type L. pneumophila, the DeltalqsS strain was more salt-resistant and impaired for the Icm/Dot type IV secretion system-dependent uptake by phagocytes. Legionella pneumophila strains lacking lqsS, lqsR or the alternative sigma factor rpoS sedimented more slowly and produced extracellular filaments. Deletion of lqsA moderately reduced the uptake of L. pneumophila by phagocytes, and the defect was complemented by expressing lqsA in trans. Unexpectedly, the overexpression of lqsA also restored the virulence defect and reduced filament production of L. pneumophila mutant strains lacking lqsS or lqsR, but not the phenotypes of strains lacking rpoS or icmT. These results suggest that LqsA products also signal through sensors not encoded by the lqs gene cluster. A transcriptome analysis of the DeltalqsA and DeltalqsS mutant strains revealed that under the conditions tested, lqsA regulated only few genes, whereas lqsS upregulated the expression of 93 genes at least twofold. These include 52 genes clustered in a 133 kb high plasticity genomic island, which is flanked by putative DNA-mobilizing genes and encodes multiple metal ion efflux pumps. Upon overexpression of lqsA, a cluster of 19 genes in the genomic island was also upregulated, suggesting that LqsA and Lqs

  8. Genome-Wide Cell Type-Specific Mapping of In Vivo Chromatin Protein Binding Using an FLP-Inducible DamID System in Drosophila.

    PubMed

    Pindyurin, Alexey V

    2017-01-01

    A thorough study of the genome-wide binding patterns of chromatin proteins is essential for understanding the regulatory mechanisms of genomic processes in eukaryotic nuclei, including DNA replication, transcription, and repair. The DNA adenine methyltransferase identification (DamID) method is a powerful tool to identify genomic binding sites of chromatin proteins. This method does not require fixation of cells and the use of specific antibodies, and has been used to generate genome-wide binding maps of more than a hundred different proteins in Drosophila tissue culture cells. Recent versions of inducible DamID allow performing cell type-specific profiling of chromatin proteins even in small samples of Drosophila tissues that contain heterogeneous cell types. Importantly, with these methods sorting of cells of interest or their nuclei is not necessary as genomic DNA isolated from the whole tissue can be used as an input. Here, I describe in detail an FLP-inducible DamID method, namely generation of suitable transgenic flies, activation of the Dam transgenes by the FLP recombinase, isolation of DNA from small amounts of dissected tissues, and subsequent identification of the DNA binding sites of the chromatin proteins.

  9. NRAS and EPHB6 mutation rates differ in metastatic melanomas of patients in the North Island versus South Island of New Zealand

    PubMed Central

    Jones, Angela M.; Ferguson, Peter; Gardner, Jacqui; Rooker, Serena; Sutton, Tim; Ahn, Antonio; Chatterjee, Aniruddha; Bickley, Vivienne M.; Sarwar, Makhdoom; Emanuel, Patrick; Kenwright, Diane; Shepherd, Peter R.; Eccles, Michael R.

    2016-01-01

    Melanoma, the most aggressive skin cancer type, is responsible for 75% of skin cancer related deaths worldwide. Given that New Zealand (NZ) has the world's highest melanoma incidence, we sought to determine the frequency of mutations in NZ melanomas in recurrently mutated genes. NZ melanomas were from localities distributed between North (35°S-42°S) and South Islands (41°S-47°S). A total of 529 melanomas were analyzed for BRAF exon 15 mutations by Sanger sequencing, and also by Sequenom MelaCarta MassARRAY. While, a relatively low incidence of BRAFV600E mutations (23.4%) was observed overall in NZ melanomas, the incidence of NRAS mutations in South Island melanomas was high compared to North Island melanomas (38.3% vs. 21.9%, P=0.0005), and to The Cancer Genome Atlas database (TCGA) (38.3% vs. 22%, P=0.0004). In contrast, the incidence of EPHB6G404S mutations was 0% in South Island melanomas, and was 7.8% in North Island (P=0.0002). Overall, these data suggest that melanomas from geographically different regions in NZ have markedly different mutation frequencies, in particular in the NRAS and EPHB6 genes, when compared to TCGA or other populations. These data have implications for the causation and treatment of malignant melanoma in NZ. PMID:27191502

  10. NRAS and EPHB6 mutation rates differ in metastatic melanomas of patients in the North Island versus South Island of New Zealand.

    PubMed

    Jones, Angela M; Ferguson, Peter; Gardner, Jacqui; Rooker, Serena; Sutton, Tim; Ahn, Antonio; Chatterjee, Aniruddha; Bickley, Vivienne M; Sarwar, Makhdoom; Emanuel, Patrick; Kenwright, Diane; Shepherd, Peter R; Eccles, Michael R

    2016-07-05

    Melanoma, the most aggressive skin cancer type, is responsible for 75% of skin cancer related deaths worldwide. Given that New Zealand (NZ) has the world's highest melanoma incidence, we sought to determine the frequency of mutations in NZ melanomas in recurrently mutated genes. NZ melanomas were from localities distributed between North (35°S-42°S) and South Islands (41°S-47°S). A total of 529 melanomas were analyzed for BRAF exon 15 mutations by Sanger sequencing, and also by Sequenom MelaCarta MassARRAY. While, a relatively low incidence of BRAFV600E mutations (23.4%) was observed overall in NZ melanomas, the incidence of NRAS mutations in South Island melanomas was high compared to North Island melanomas (38.3% vs. 21.9%, P=0.0005), and to The Cancer Genome Atlas database (TCGA) (38.3% vs. 22%, P=0.0004). In contrast, the incidence of EPHB6G404S mutations was 0% in South Island melanomas, and was 7.8% in North Island (P=0.0002). Overall, these data suggest that melanomas from geographically different regions in NZ have markedly different mutation frequencies, in particular in the NRAS and EPHB6 genes, when compared to TCGA or other populations. These data have implications for the causation and treatment of malignant melanoma in NZ.

  11. Complete Genomic Structure of the Cultivated Rice Endophyte Azospirillum sp. B510

    PubMed Central

    Kaneko, Takakazu; Minamisawa, Kiwamu; Isawa, Tsuyoshi; Nakatsukasa, Hiroki; Mitsui, Hisayuki; Kawaharada, Yasuyuki; Nakamura, Yasukazu; Watanabe, Akiko; Kawashima, Kumiko; Ono, Akiko; Shimizu, Yoshimi; Takahashi, Chika; Minami, Chiharu; Fujishiro, Tsunakazu; Kohara, Mitsuyo; Katoh, Midori; Nakazaki, Naomi; Nakayama, Shinobu; Yamada, Manabu; Tabata, Satoshi; Sato, Shusei

    2010-01-01

    We determined the nucleotide sequence of the entire genome of a diazotrophic endophyte, Azospirillum sp. B510. Strain B510 is an endophytic bacterium isolated from stems of rice plants (Oryza sativa cv. Nipponbare). The genome of B510 consisted of a single chromosome (3 311 395 bp) and six plasmids, designated as pAB510a (1 455 109 bp), pAB510b (723 779 bp), pAB510c (681 723 bp), pAB510d (628 837 bp), pAB510e (537 299 bp), and pAB510f (261 596 bp). The chromosome bears 2893 potential protein-encoding genes, two sets of rRNA gene clusters (rrns), and 45 tRNA genes representing 37 tRNA species. The genomes of the six plasmids contained a total of 3416 protein-encoding genes, seven sets of rrns, and 34 tRNAs representing 19 tRNA species. Eight genes for plasmid-specific tRNA species are located on either pAB510a or pAB510d. Two out of eight genomic islands are inserted in the plasmids, pAB510b and pAB510e, and one of the islands is inserted into trnfM-CAU in the rrn located on pAB510e. Genes other than the nif gene cluster that are involved in N2 fixation and are homologues of Bradyrhizobium japonicum USDA110 include fixABCX, fixNOQP, fixHIS, fixG, and fixLJK. Three putative plant hormone-related genes encoding tryptophan 2-monooxytenase (iaaM) and indole-3-acetaldehyde hydrolase (iaaH), which are involved in IAA biosynthesis, and ACC deaminase (acdS), which reduces ethylene levels, were identified. Multiple gene-clusters for tripartite ATP-independent periplasmic-transport systems and a diverse set of malic enzymes were identified, suggesting that B510 utilizes C4-dicarboxylate during its symbiotic relationship with the host plant. PMID:20047946

  12. How to kill the honey bee larva: genomic potential and virulence mechanisms of Paenibacillus larvae.

    PubMed

    Djukic, Marvin; Brzuszkiewicz, Elzbieta; Fünfhaus, Anne; Voss, Jörn; Gollnow, Kathleen; Poppinga, Lena; Liesegang, Heiko; Garcia-Gonzalez, Eva; Genersch, Elke; Daniel, Rolf

    2014-01-01

    Paenibacillus larvae, a Gram positive bacterial pathogen, causes American Foulbrood (AFB), which is the most serious infectious disease of honey bees. In order to investigate the genomic potential of P. larvae, two strains belonging to two different genotypes were sequenced and used for comparative genome analysis. The complete genome sequence of P. larvae strain DSM 25430 (genotype ERIC II) consisted of 4,056,006 bp and harbored 3,928 predicted protein-encoding genes. The draft genome sequence of P. larvae strain DSM 25719 (genotype ERIC I) comprised 4,579,589 bp and contained 4,868 protein-encoding genes. Both strains harbored a 9.7 kb plasmid and encoded a large number of virulence-associated proteins such as toxins and collagenases. In addition, genes encoding large multimodular enzymes producing nonribosomally peptides or polyketides were identified. In the genome of strain DSM 25719 seven toxin associated loci were identified and analyzed. Five of them encoded putatively functional toxins. The genome of strain DSM 25430 harbored several toxin loci that showed similarity to corresponding loci in the genome of strain DSM 25719, but were non-functional due to point mutations or disruption by transposases. Although both strains cause AFB, significant differences between the genomes were observed including genome size, number and composition of transposases, insertion elements, predicted phage regions, and strain-specific island-like regions. Transposases, integrases and recombinases are important drivers for genome plasticity. A total of 390 and 273 mobile elements were found in strain DSM 25430 and strain DSM 25719, respectively. Comparative genomics of both strains revealed acquisition of virulence factors by horizontal gene transfer and provided insights into evolution and pathogenicity.

  13. Late Quaternary climate change shapes island biodiversity.

    PubMed

    Weigelt, Patrick; Steinbauer, Manuel Jonas; Cabral, Juliano Sarmento; Kreft, Holger

    2016-04-07

    Island biogeographical models consider islands either as geologically static with biodiversity resulting from ecologically neutral immigration-extinction dynamics, or as geologically dynamic with biodiversity resulting from immigration-speciation-extinction dynamics influenced by changes in island characteristics over millions of years. Present climate and spatial arrangement of islands, however, are rather exceptional compared to most of the Late Quaternary, which is characterized by recurrent cooler and drier glacial periods. These climatic oscillations over short geological timescales strongly affected sea levels and caused massive changes in island area, isolation and connectivity, orders of magnitude faster than the geological processes of island formation, subsidence and erosion considered in island theory. Consequences of these oscillations for present biodiversity remain unassessed. Here we analyse the effects of present and Last Glacial Maximum (LGM) island area, isolation, elevation and climate on key components of angiosperm diversity on islands worldwide. We find that post-LGM changes in island characteristics, especially in area, have left a strong imprint on present diversity of endemic species. Specifically, the number and proportion of endemic species today is significantly higher on islands that were larger during the LGM. Native species richness, in turn, is mostly determined by present island characteristics. We conclude that an appreciation of Late Quaternary environmental change is essential to understand patterns of island endemism and its underlying evolutionary dynamics.

  14. Visualization of genome signatures of eukaryote genomes by batch-learning self-organizing map with a special emphasis on Drosophila genomes.

    PubMed

    Abe, Takashi; Hamano, Yuta; Ikemura, Toshimichi

    2014-01-01

    A strategy of evolutionary studies that can compare vast numbers of genome sequences is becoming increasingly important with the remarkable progress of high-throughput DNA sequencing methods. We previously established a sequence alignment-free clustering method "BLSOM" for di-, tri-, and tetranucleotide compositions in genome sequences, which can characterize sequence characteristics (genome signatures) of a wide range of species. In the present study, we generated BLSOMs for tetra- and pentanucleotide compositions in approximately one million sequence fragments derived from 101 eukaryotes, for which almost complete genome sequences were available. BLSOM recognized phylotype-specific characteristics (e.g., key combinations of oligonucleotide frequencies) in the genome sequences, permitting phylotype-specific clustering of the sequences without any information regarding the species. In our detailed examination of 12 Drosophila species, the correlation between their phylogenetic classification and the classification on the BLSOMs was observed to visualize oligonucleotides diagnostic for species-specific clustering.

  15. A new species of iguana Brachylophus Cuvier 1829 (Sauria: Iguania: Iguanidae) from Gau Island, Fiji Islands.

    PubMed

    Fisher, Robert N; Niukula, Jone; Watling, Dick; Harlow, Peter S

    2017-06-06

    The south Pacific iguanas (Brachylophus) currently have three recognized living species in Fiji.  Recent surveys have uncovered more specific variation (morphological and genetic) within the genus and have better defined the geographic ranges of the named species.  One of these recent discoveries is a strikingly different iguana from all other island populations in Fiji which is restricted to Gau Island of the Lomaiviti Province.  Gau is the fifth largest island in Fiji and maintains excellent upland forests in the higher elevations.  We describe this population from Gau Island as a new species, Brachylophus gau sp. nov., in recognition of its type locality.

  16. Real-time imaging of specific genomic loci in eukaryotic cells using the ANCHOR DNA labelling system.

    PubMed

    Germier, Thomas; Sylvain, Audibert; Silvia, Kocanova; David, Lane; Kerstin, Bystricky

    2018-06-01

    Spatio-temporal organization of the cell nucleus adapts to and regulates genomic processes. Microscopy approaches that enable direct monitoring of specific chromatin sites in single cells and in real time are needed to better understand the dynamics involved. In this chapter, we describe the principle and development of ANCHOR, a novel tool for DNA labelling in eukaryotic cells. Protocols for use of ANCHOR to visualize a single genomic locus in eukaryotic cells are presented. We describe an approach for live cell imaging of a DNA locus during the entire cell cycle in human breast cancer cells. Copyright © 2018 Elsevier Inc. All rights reserved.

  17. Links between DNA methylation and nucleosome occupancy in the human genome.

    PubMed

    Collings, Clayton K; Anderson, John N

    2017-01-01

    DNA methylation is an epigenetic modification that is enriched in heterochromatin but depleted at active promoters and enhancers. However, the debate on whether or not DNA methylation is a reliable indicator of high nucleosome occupancy has not been settled. For example, the methylation levels of DNA flanking CTCF sites are higher in linker DNA than in nucleosomal DNA, while other studies have shown that the nucleosome core is the preferred site of methylation. In this study, we make progress toward understanding these conflicting phenomena by implementing a bioinformatics approach that combines MNase-seq and NOMe-seq data and by comprehensively profiling DNA methylation and nucleosome occupancy throughout the human genome. The results demonstrated that increasing methylated CpG density is correlated with nucleosome occupancy in the total genome and within nearly all subgenomic regions. Features with elevated methylated CpG density such as exons, SINE-Alu sequences, H3K36-trimethylated peaks, and methylated CpG islands are among the highest nucleosome occupied elements in the genome, while some of the lowest occupancies are displayed by unmethylated CpG islands and unmethylated transcription factor binding sites. Additionally, outside of CpG islands, the density of CpGs within nucleosomes was shown to be important for the nucleosomal location of DNA methylation with low CpG frequencies favoring linker methylation and high CpG frequencies favoring core particle methylation. Prominent exceptions to the correlations between methylated CpG density and nucleosome occupancy include CpG islands marked by H3K27me3 and CpG-poor heterochromatin marked by H3K9me3, and these modifications, along with DNA methylation, distinguish the major silencing mechanisms of the human epigenome. Thus, the relationship between DNA methylation and nucleosome occupancy is influenced by the density of methylated CpG dinucleotides and by other epigenomic components in chromatin.

  18. Genomics and functional genomics in Chlamydomonas reinhardtii

    DOE Office of Scientific and Technical Information (OSTI.GOV)

    Blaby, Ian K.; Blaby-Haas, Crysten E.

    The availability of the Chlamydomonas reinhardtii nuclear genome sequence continues to enable researchers to address biological questions relevant to algae, land plants and animals in unprecedented ways. As we continue to characterize and understand biological processes in C. reinhardtii and translate that knowledge to other systems, we are faced with the realization that many genes encode proteins without a defined function. The field of functional genomics aims to close this gap between genome sequence and protein function. Transcriptomes, proteomes and phenomes can each provide layers of gene-specific functional data while supplying a global snapshot of cellular behavior under different conditions.more » Herein we present a brief history of functional genomics, the present status of the C. reinhardtii genome, how genome-wide experiments can aid in supplying protein function inferences, and provide an outlook for functional genomics in C. reinhardtii.« less

  19. Genomics and functional genomics in Chlamydomonas reinhardtii

    DOE PAGES

    Blaby, Ian K.; Blaby-Haas, Crysten E.

    2017-03-21

    The availability of the Chlamydomonas reinhardtii nuclear genome sequence continues to enable researchers to address biological questions relevant to algae, land plants and animals in unprecedented ways. As we continue to characterize and understand biological processes in C. reinhardtii and translate that knowledge to other systems, we are faced with the realization that many genes encode proteins without a defined function. The field of functional genomics aims to close this gap between genome sequence and protein function. Transcriptomes, proteomes and phenomes can each provide layers of gene-specific functional data while supplying a global snapshot of cellular behavior under different conditions.more » Herein we present a brief history of functional genomics, the present status of the C. reinhardtii genome, how genome-wide experiments can aid in supplying protein function inferences, and provide an outlook for functional genomics in C. reinhardtii.« less

  20. Meta-Analysis of Genome-Wide Scans Provides Evidence for Sex- and Site-Specific Regulation of Bone Mass

    PubMed Central

    Sham, Pak C; Zintzaras, Elias; Lewis, Cathryn M; Deng, Hong-Wen; Econs, Michael J; Karasik, David; Devoto, Marcella; Kammerer, Candace M; Spector, Tim; Andrew, Toby; Cupples, L Adrienne; Duncan, Emma L; Foroud, Tatiana; Kiel, Douglas P; Koller, Daniel; Langdahl, Bente; Mitchell, Braxton D; Peacock, Munro; Recker, Robert; Shen, Hui; Sol-Church, Katia; Spotila, Loretta D; Uitterlinden, Andre G; Wilson, Scott G; Kung, Annie WC; Ralston, Stuart H

    2014-01-01

    Several genome-wide scans have been performed to detect loci that regulate BMD, but these have yielded inconsistent results, with limited replication of linkage peaks in different studies. In an effort to improve statistical power for detection of these loci, we performed a meta-analysis of genome-wide scans in which spine or hip BMD were studied. Evidence was gained to suggest that several chromosomal loci regulate BMD in a site-specific and sex-specific manner. Introduction BMD is a heritable trait and an important predictor of osteoporotic fracture risk. Several genome-wide scans have been performed in an attempt to detect loci that regulate BMD, but there has been limited replication of linkage peaks between studies. In an attempt to resolve these inconsistencies, we conducted a collaborative meta-analysis of genome-wide linkage scans in which femoral neck BMD (FN-BMD) or lumbar spine BMD (LS-BMD) had been studied. Materials and Methods Data were accumulated from nine genome-wide scans involving 11,842 subjects. Data were analyzed separately for LS-BMD and FN-BMD and by sex. For each study, genomic bins of 30 cM were defined and ranked according to the maximum LOD score they contained. While various densitometers were used in different studies, the ranking approach that we used means that the results are not confounded by the fact that different measurement devices were used. Significance for high average rank and heterogeneity was obtained through Monte Carlo testing. Results For LS-BMD, the quantitative trait locus (QTL) with greatest significance was on chromosome 1p13.3-q23.3 (p = 0.004), but this exhibited high heterogeneity and the effect was specific for women. Other significant LS-BMD QTLs were on chromosomes 12q24.31-qter, 3p25.3-p22.1, 11p12-q13.3, and 1q32-q42.3, including one on 18p11-q12.3 that had not been detected by individual studies. For FN-BMD, the strongest QTL was on chromosome 9q31.1-q33.3 (p = 0.002). Other significant QTLs were

  1. Nucleosome dynamics and maintenance of epigenetic states of CpG islands

    NASA Astrophysics Data System (ADS)

    Sneppen, Kim; Dodd, Ian B.

    2016-06-01

    Methylation of mammalian DNA occurs primarily at CG dinucleotides. These CpG sites are located nonrandomly in the genome, tending to occur within high density clusters of CpGs (islands) or within large regions of low CpG density. Cluster methylation tends to be bimodal, being dominantly unmethylated or mostly methylated. For CpG clusters near promoters, low methylation is associated with transcriptional activity, while high methylation is associated with gene silencing. Alternative CpG methylation states are thought to be stable and heritable, conferring localized epigenetic memory that allows transient signals to create long-lived gene expression states. Positive feedback where methylated CpG sites recruit enzymes that methylate nearby CpGs, can produce heritable bistability but does not easily explain that as clusters increase in size or density they change from being primarily methylated to primarily unmethylated. Here, we show that an interaction between the methylation state of a cluster and its occupancy by nucleosomes provides a mechanism to generate these features and explain genome wide systematics of CpG islands.

  2. A Genome-Wide Landscape of Retrocopies in Primate Genomes.

    PubMed

    Navarro, Fábio C P; Galante, Pedro A F

    2015-07-29

    Gene duplication is a key factor contributing to phenotype diversity across and within species. Although the availability of complete genomes has led to the extensive study of genomic duplications, the dynamics and variability of gene duplications mediated by retrotransposition are not well understood. Here, we predict mRNA retrotransposition and use comparative genomics to investigate their origin and variability across primates. Analyzing seven anthropoid primate genomes, we found a similar number of mRNA retrotranspositions (∼7,500 retrocopies) in Catarrhini (Old Word Monkeys, including humans), but a surprising large number of retrocopies (∼10,000) in Platyrrhini (New World Monkeys), which may be a by-product of higher long interspersed nuclear element 1 activity in these genomes. By inferring retrocopy orthology, we dated most of the primate retrocopy origins, and estimated a decrease in the fixation rate in recent primate history, implying a smaller number of species-specific retrocopies. Moreover, using RNA-Seq data, we identified approximately 3,600 expressed retrocopies. As expected, most of these retrocopies are located near or within known genes, present tissue-specific and even species-specific expression patterns, and no expression correlation to their parental genes. Taken together, our results provide further evidence that mRNA retrotransposition is an active mechanism in primate evolution and suggest that retrocopies may not only introduce great genetic variability between lineages but also create a large reservoir of potentially functional new genomic loci in primate genomes. © The Author(s) 2015. Published by Oxford University Press on behalf of the Society for Molecular Biology and Evolution.

  3. Identification of genomic sites for CRISPR/Cas9-based genome editing in the Vitis vinifera genome.

    PubMed

    Wang, Yi; Liu, Xianju; Ren, Chong; Zhong, Gan-Yuan; Yang, Long; Li, Shaohua; Liang, Zhenchang

    2016-04-21

    CRISPR/Cas9 has been recently demonstrated as an effective and popular genome editing tool for modifying genomes of humans, animals, microorganisms, and plants. Success of such genome editing is highly dependent on the availability of suitable target sites in the genomes to be edited. Many specific target sites for CRISPR/Cas9 have been computationally identified for several annual model and crop species, but such sites have not been reported for perennial, woody fruit species. In this study, we identified and characterized five types of CRISPR/Cas9 target sites in the widely cultivated grape species Vitis vinifera and developed a user-friendly database for editing grape genomes in the future. A total of 35,767,960 potential CRISPR/Cas9 target sites were identified from grape genomes in this study. Among them, 22,597,817 target sites were mapped to specific genomic locations and 7,269,788 were found to be highly specific. Protospacers and PAMs were found to distribute uniformly and abundantly in the grape genomes. They were present in all the structural elements of genes with the coding region having the highest abundance. Five PAM types, TGG, AGG, GGG, CGG and NGG, were observed. With the exception of the NGG type, they were abundantly present in the grape genomes. Synteny analysis of similar genes revealed that the synteny of protospacers matched the synteny of homologous genes. A user-friendly database containing protospacers and detailed information of the sites was developed and is available for public use at the Grape-CRISPR website ( http://biodb.sdau.edu.cn/gc/index.html ). Grape genomes harbour millions of potential CRISPR/Cas9 target sites. These sites are widely distributed among and within chromosomes with predominant abundance in the coding regions of genes. We developed a publicly-accessible Grape-CRISPR database for facilitating the use of the CRISPR/Cas9 system as a genome editing tool for functional studies and molecular breeding of grapes. Among

  4. Complete genome sequence of the thermotolerant foodborne pathogen Salmonella enterica serovar Senftenberg ATCC 43845 and phylogenetic analysis of loci encoding thermotolerance

    USDA-ARS?s Scientific Manuscript database

    Introduction: Previous studies in Cronobacter sakazakii, Klebsiella spp., and Escherichia coli have identified a genomic island that confers thermotolerance to its hosts. This island has recently been identified in Salmonella enterica serovar Senfentenberg ATCC 43845, a historically important, heat ...

  5. Microdiversification in genome-streamlined ubiquitous freshwater Actinobacteria.

    PubMed

    Neuenschwander, Stefan M; Ghai, Rohit; Pernthaler, Jakob; Salcher, Michaela M

    2018-01-01

    Actinobacteria of the acI lineage are the most abundant microbes in freshwater systems, but there are so far no pure living cultures of these organisms, possibly because of metabolic dependencies on other microbes. This, in turn, has hampered an in-depth assessment of the genomic basis for their success in the environment. Here we present genomes from 16 axenic cultures of acI Actinobacteria. The isolates were not only of minute cell size, but also among the most streamlined free-living microbes, with extremely small genome sizes (1.2-1.4 Mbp) and low genomic GC content. Genome reduction in these bacteria might have led to auxotrophy for various vitamins, amino acids and reduced sulphur sources, thus creating dependencies to co-occurring organisms (the 'Black Queen' hypothesis). Genome analyses, moreover, revealed a surprising degree of inter- and intraspecific diversity in metabolic pathways, especially of carbohydrate transport and metabolism, and mainly encoded in genomic islands. The striking genotype microdiversification of acI Actinobacteria might explain their global success in highly dynamic freshwater environments with complex seasonal patterns of allochthonous and autochthonous carbon sources. We propose a new order within Actinobacteria ('Candidatus Nanopelagicales') with two new genera ('Candidatus Nanopelagicus' and 'Candidatus Planktophila') and nine new species.

  6. Site-specific genome editing for correction of induced pluripotent stem cells derived from dominant dystrophic epidermolysis bullosa.

    PubMed

    Shinkuma, Satoru; Guo, Zongyou; Christiano, Angela M

    2016-05-17

    Genome editing with engineered site-specific endonucleases involves nonhomologous end-joining, leading to reading frame disruption. The approach is applicable to dominant negative disorders, which can be treated simply by knocking out the mutant allele, while leaving the normal allele intact. We applied this strategy to dominant dystrophic epidermolysis bullosa (DDEB), which is caused by a dominant negative mutation in the COL7A1 gene encoding type VII collagen (COL7). We performed genome editing with TALENs and CRISPR/Cas9 targeting the mutation, c.8068_8084delinsGA. We then cotransfected Cas9 and guide RNA expression vectors expressed with GFP and DsRed, respectively, into induced pluripotent stem cells (iPSCs) generated from DDEB fibroblasts. After sorting, 90% of the iPSCs were edited, and we selected four gene-edited iPSC lines for further study. These iPSCs were differentiated into keratinocytes and fibroblasts secreting COL7. RT-PCR and Western blot analyses revealed gene-edited COL7 with frameshift mutations degraded at the protein level. In addition, we confirmed that the gene-edited truncated COL7 could neither associate with normal COL7 nor undergo triple helix formation. Our data establish the feasibility of mutation site-specific genome editing in dominant negative disorders.

  7. Comparative genome analysis in the integrated microbial genomes (IMG) system.

    PubMed

    Markowitz, Victor M; Kyrpides, Nikos C

    2007-01-01

    Comparative genome analysis is critical for the effective exploration of a rapidly growing number of complete and draft sequences for microbial genomes. The Integrated Microbial Genomes (IMG) system (img.jgi.doe.gov) has been developed as a community resource that provides support for comparative analysis of microbial genomes in an integrated context. IMG allows users to navigate the multidimensional microbial genome data space and focus their analysis on a subset of genes, genomes, and functions of interest. IMG provides graphical viewers, summaries, and occurrence profile tools for comparing genes, pathways, and functions (terms) across specific genomes. Genes can be further examined using gene neighborhoods and compared with sequence alignment tools.

  8. The human genome: a multifractal analysis

    PubMed Central

    2011-01-01

    Background Several studies have shown that genomes can be studied via a multifractal formalism. Recently, we used a multifractal approach to study the genetic information content of the Caenorhabditis elegans genome. Here we investigate the possibility that the human genome shows a similar behavior to that observed in the nematode. Results We report here multifractality in the human genome sequence. This behavior correlates strongly on the presence of Alu elements and to a lesser extent on CpG islands and (G+C) content. In contrast, no or low relationship was found for LINE, MIR, MER, LTRs elements and DNA regions poor in genetic information. Gene function, cluster of orthologous genes, metabolic pathways, and exons tended to increase their frequencies with ranges of multifractality and large gene families were located in genomic regions with varied multifractality. Additionally, a multifractal map and classification for human chromosomes are proposed. Conclusions Based on these findings, we propose a descriptive non-linear model for the structure of the human genome, with some biological implications. This model reveals 1) a multifractal regionalization where many regions coexist that are far from equilibrium and 2) this non-linear organization has significant molecular and medical genetic implications for understanding the role of Alu elements in genome stability and structure of the human genome. Given the role of Alu sequences in gene regulation, genetic diseases, human genetic diversity, adaptation and phylogenetic analyses, these quantifications are especially useful. PMID:21999602

  9. Genome specific PPARαB duplicates in salmonids and insights into estrogenic regulation in brown trout.

    PubMed

    Madureira, Tânia Vieira; Pinheiro, Ivone; de Paula Freire, Rafaelle; Rocha, Eduardo; Castro, Luis Filipe; Urbatzka, Ralph

    2017-06-01

    Peroxisome proliferator-activated receptors (PPARs) are key regulators of many processes in vertebrates, such as carbohydrate and lipid metabolism. PPARα, a member of the PPAR nuclear receptor gene subfamily (NR1C1), is involved in fatty acid metabolism, namely in peroxisomal β-oxidation. Two gene paralogues, pparαA and pparαB, were described in several teleost species with their origin dating back to the teleost-specific genome duplication (3R). Given the additional salmonid-specific genome duplication (4R), four genes could be theoretically anticipated for this gene subfamily. In this work, we examined the pparα gene repertoire in brown trout, Salmo trutta f. fario. Data disclosed two pparα-like sequences in brown trout. Phylogenetic analyses further revealed that the isolated genes are most likely genome pparαB duplicates, pparαBa and pparαBb, while pparαA is apparently absent in salmonids. Both genes showed a ubiquitous mRNA expression across a panel of 11 different organs. In vitro exposed primary brown trout hepatocytes strongly suggest that pparα gene paralogues are differently regulated by ethinylestradiol (EE2). PparαBb mRNA expression significantly decreased with dosage, reaching significance after exposure to 50μM EE2, while pparαBa mRNA increased, significant at 1μM EE2. The present data enhances the understanding of pparα function and evolution in teleost, and reinforces the evidence of a potential crosstalk between estrogenic and pparα signaling pathways. Copyright © 2017 Elsevier Inc. All rights reserved.

  10. The complete genome sequencing of Prevotella intermedia strain OMA14 and a subsequent fine-scale, intra-species genomic comparison reveal an unusual amplification of conjugative and mobile transposons and identify a novel Prevotella-lineage-specific repeat

    PubMed Central

    Naito, Mariko; Ogura, Yoshitoshi; Itoh, Takehiko; Shoji, Mikio; Okamoto, Masaaki; Hayashi, Tetsuya; Nakayama, Koji

    2016-01-01

    Prevotella intermedia is a pathogenic bacterium involved in periodontal diseases. Here, we present the complete genome sequence of a clinical strain, OMA14, of this bacterium along with the results of comparative genome analysis with strain 17 of the same species whose genome has also been sequenced, but not fully analysed yet. The genomes of both strains consist of two circular chromosomes: the larger chromosomes are similar in size and exhibit a high overall linearity of gene organizations, whereas the smaller chromosomes show a significant size variation and have undergone remarkable genome rearrangements. Unique features of the Pre. intermedia genomes are the presence of a remarkable number of essential genes on the second chromosomes and the abundance of conjugative and mobilizable transposons (CTns and MTns). The CTns/MTns are particularly abundant in the second chromosomes, involved in its extensive genome rearrangement, and have introduced a number of strain-specific genes into each strain. We also found a novel 188-bp repeat sequence that has been highly amplified in Pre. intermedia and are specifically distributed among the Pre. intermedia-related species. These findings expand our understanding of the genetic features of Pre. intermedia and the roles of CTns and MTns in the evolution of bacteria. PMID:26645327

  11. Bisulfite-independent analysis of CpG island methylation enables genome-scale stratification of single cells.

    PubMed

    Han, Lin; Wu, Hua-Jun; Zhu, Haiying; Kim, Kun-Yong; Marjani, Sadie L; Riester, Markus; Euskirchen, Ghia; Zi, Xiaoyuan; Yang, Jennifer; Han, Jasper; Snyder, Michael; Park, In-Hyun; Irizarry, Rafael; Weissman, Sherman M; Michor, Franziska; Fan, Rong; Pan, Xinghua

    2017-06-02

    Conventional DNA bisulfite sequencing has been extended to single cell level, but the coverage consistency is insufficient for parallel comparison. Here we report a novel method for genome-wide CpG island (CGI) methylation sequencing for single cells (scCGI-seq), combining methylation-sensitive restriction enzyme digestion and multiple displacement amplification for selective detection of methylated CGIs. We applied this method to analyzing single cells from two types of hematopoietic cells, K562 and GM12878 and small populations of fibroblasts and induced pluripotent stem cells. The method detected 21 798 CGIs (76% of all CGIs) per cell, and the number of CGIs consistently detected from all 16 profiled single cells was 20 864 (72.7%), with 12 961 promoters covered. This coverage represents a substantial improvement over results obtained using single cell reduced representation bisulfite sequencing, with a 66-fold increase in the fraction of consistently profiled CGIs across individual cells. Single cells of the same type were more similar to each other than to other types, but also displayed epigenetic heterogeneity. The method was further validated by comparing the CpG methylation pattern, methylation profile of CGIs/promoters and repeat regions and 41 classes of known regulatory markers to the ENCODE data. Although not every minor methylation differences between cells are detectable, scCGI-seq provides a solid tool for unsupervised stratification of a heterogeneous cell population. © The Author(s) 2017. Published by Oxford University Press on behalf of Nucleic Acids Research.

  12. Bisulfite-independent analysis of CpG island methylation enables genome-scale stratification of single cells

    PubMed Central

    Han, Lin; Wu, Hua-Jun; Zhu, Haiying; Kim, Kun-Yong; Marjani, Sadie L.; Riester, Markus; Euskirchen, Ghia; Zi, Xiaoyuan; Yang, Jennifer; Han, Jasper; Snyder, Michael; Park, In-Hyun; Irizarry, Rafael; Weissman, Sherman M.

    2017-01-01

    Abstract Conventional DNA bisulfite sequencing has been extended to single cell level, but the coverage consistency is insufficient for parallel comparison. Here we report a novel method for genome-wide CpG island (CGI) methylation sequencing for single cells (scCGI-seq), combining methylation-sensitive restriction enzyme digestion and multiple displacement amplification for selective detection of methylated CGIs. We applied this method to analyzing single cells from two types of hematopoietic cells, K562 and GM12878 and small populations of fibroblasts and induced pluripotent stem cells. The method detected 21 798 CGIs (76% of all CGIs) per cell, and the number of CGIs consistently detected from all 16 profiled single cells was 20 864 (72.7%), with 12 961 promoters covered. This coverage represents a substantial improvement over results obtained using single cell reduced representation bisulfite sequencing, with a 66-fold increase in the fraction of consistently profiled CGIs across individual cells. Single cells of the same type were more similar to each other than to other types, but also displayed epigenetic heterogeneity. The method was further validated by comparing the CpG methylation pattern, methylation profile of CGIs/promoters and repeat regions and 41 classes of known regulatory markers to the ENCODE data. Although not every minor methylation differences between cells are detectable, scCGI-seq provides a solid tool for unsupervised stratification of a heterogeneous cell population. PMID:28126923

  13. Chromosomal inversions promote genomic islands of concerted evolution of Hsp70 genes in the Drosophila subobscura species subgroup.

    PubMed

    Puig Giribets, Marta; García Guerreiro, María Pilar; Santos, Mauro; Ayala, Francisco J; Tarrío, Rosa; Rodríguez-Trelles, Francisco

    2018-02-07

    Heat-shock (HS) assays to understand the connection between standing inversion variation and evolutionary response to climate change in Drosophila subobscura found that "warm-climate" inversion O 3+4 exhibits non-HS levels of Hsp70 protein like those of "cold-climate" O ST after HS induction. This was unexpected, as overexpression of Hsp70 can incur multiple fitness costs. To understand the genetic basis of this finding, we have determined the genomic sequence organization of the Hsp70 family in four different inversions, including O ST , O 3+4 , O 3+4+8 and O 3+4+16 , using as outgroups the remainder of the subobscura species subgroup, namely Drosophila madeirensis and Drosophila guanche. We found (i) in all the assayed lines, the Hsp70 family resides in cytological locus 94A and consists of only two genes, each with four HS elements (HSEs) and three GAGA sites on its promoter. Yet, in O ST , the family is comparatively more compact; (ii) the two Hsp70 copies evolve in concert through gene conversion, except in D. guanche; (iii) within D. subobscura, the rate of concerted evolution is strongly structured by inversion, being higher in O ST than in O 3+4 ; and (iv) in D. guanche, the two copies accumulated multiple differences, including a newly evolved "gap-type" HSE2. The absence of concerted evolution in this species may be related to a long-gone-unnoticed observation that it lacks Hsp70 HS response, perhaps because it has evolved within a narrow thermal range in an oceanic island. Our results point to a previously unrealized link between inversions and concerted evolution, with potentially major implications for understanding genome evolution. © 2018 John Wiley & Sons Ltd.

  14. An ethnically relevant consensus Korean reference genome is a step towards personal reference genomes

    PubMed Central

    Cho, Yun Sung; Kim, Hyunho; Kim, Hak-Min; Jho, Sungwoong; Jun, JeHoon; Lee, Yong Joo; Chae, Kyun Shik; Kim, Chang Geun; Kim, Sangsoo; Eriksson, Anders; Edwards, Jeremy S.; Lee, Semin; Kim, Byung Chul; Manica, Andrea; Oh, Tae-Kwang; Church, George M.; Bhak, Jong

    2016-01-01

    Human genomes are routinely compared against a universal reference. However, this strategy could miss population-specific and personal genomic variations, which may be detected more efficiently using an ethnically relevant or personal reference. Here we report a hybrid assembly of a Korean reference genome (KOREF) for constructing personal and ethnic references by combining sequencing and mapping methods. We also build its consensus variome reference, providing information on millions of variants from 40 additional ethnically homogeneous genomes from the Korean Personal Genome Project. We find that the ethnically relevant consensus reference can be beneficial for efficient variant detection. Systematic comparison of human assemblies shows the importance of assembly quality, suggesting the necessity of new technologies to comprehensively map ethnic and personal genomic structure variations. In the era of large-scale population genome projects, the leveraging of ethnicity-specific genome assemblies as well as the human reference genome will accelerate mapping all human genome diversity. PMID:27882922

  15. Draft Genome Sequence of Marinobacter sp. Strain ANT_B65, Isolated from Antarctic Marine Sponge.

    PubMed

    de França, Paula; Camilo, Esther; Fantinatti-Garboginni, Fabiana

    2018-01-04

    Marinobacter sp. strain ANT_B65 was isolated from sponge collected in King George Island, Antarctica. The draft genome of 4,173,840 bp encodes 3,743 protein-coding open reading frames. The genome will provide insights into the strain's potential use in the production of natural products. Copyright © 2018 de França et al.

  16. Mechanistically Distinct Pathways of Divergent Regulatory DNA Creation Contribute to Evolution of Human-Specific Genomic Regulatory Networks Driving Phenotypic Divergence of Homo sapiens.

    PubMed

    Glinsky, Gennadi V

    2016-09-19

    Thousands of candidate human-specific regulatory sequences (HSRS) have been identified, supporting the hypothesis that unique to human phenotypes result from human-specific alterations of genomic regulatory networks. Collectively, a compendium of multiple diverse families of HSRS that are functionally and structurally divergent from Great Apes could be defined as the backbone of human-specific genomic regulatory networks. Here, the conservation patterns analysis of 18,364 candidate HSRS was carried out requiring that 100% of bases must remap during the alignments of human, chimpanzee, and bonobo sequences. A total of 5,535 candidate HSRS were identified that are: (i) highly conserved in Great Apes; (ii) evolved by the exaptation of highly conserved ancestral DNA; (iii) defined by either the acceleration of mutation rates on the human lineage or the functional divergence from non-human primates. The exaptation of highly conserved ancestral DNA pathway seems mechanistically distinct from the evolution of regulatory DNA segments driven by the species-specific expansion of transposable elements. Genome-wide proximity placement analysis of HSRS revealed that a small fraction of topologically associating domains (TADs) contain more than half of HSRS from four distinct families. TADs that are enriched for HSRS and termed rapidly evolving in humans TADs (revTADs) comprise 0.8-10.3% of 3,127 TADs in the hESC genome. RevTADs manifest distinct correlation patterns between placements of human accelerated regions, human-specific transcription factor-binding sites, and recombination rates. There is a significant enrichment within revTAD boundaries of hESC-enhancers, primate-specific CTCF-binding sites, human-specific RNAPII-binding sites, hCONDELs, and H3K4me3 peaks with human-specific enrichment at TSS in prefrontal cortex neurons (P < 0.0001 in all instances). Present analysis supports the idea that phenotypic divergence of Homo sapiens is driven by the evolution of human-specific

  17. Mechanistically Distinct Pathways of Divergent Regulatory DNA Creation Contribute to Evolution of Human-Specific Genomic Regulatory Networks Driving Phenotypic Divergence of Homo sapiens

    PubMed Central

    Glinsky, Gennadi V.

    2016-01-01

    Abstract Thousands of candidate human-specific regulatory sequences (HSRS) have been identified, supporting the hypothesis that unique to human phenotypes result from human-specific alterations of genomic regulatory networks. Collectively, a compendium of multiple diverse families of HSRS that are functionally and structurally divergent from Great Apes could be defined as the backbone of human-specific genomic regulatory networks. Here, the conservation patterns analysis of 18,364 candidate HSRS was carried out requiring that 100% of bases must remap during the alignments of human, chimpanzee, and bonobo sequences. A total of 5,535 candidate HSRS were identified that are: (i) highly conserved in Great Apes; (ii) evolved by the exaptation of highly conserved ancestral DNA; (iii) defined by either the acceleration of mutation rates on the human lineage or the functional divergence from non-human primates. The exaptation of highly conserved ancestral DNA pathway seems mechanistically distinct from the evolution of regulatory DNA segments driven by the species-specific expansion of transposable elements. Genome-wide proximity placement analysis of HSRS revealed that a small fraction of topologically associating domains (TADs) contain more than half of HSRS from four distinct families. TADs that are enriched for HSRS and termed rapidly evolving in humans TADs (revTADs) comprise 0.8–10.3% of 3,127 TADs in the hESC genome. RevTADs manifest distinct correlation patterns between placements of human accelerated regions, human-specific transcription factor-binding sites, and recombination rates. There is a significant enrichment within revTAD boundaries of hESC-enhancers, primate-specific CTCF-binding sites, human-specific RNAPII-binding sites, hCONDELs, and H3K4me3 peaks with human-specific enrichment at TSS in prefrontal cortex neurons (P < 0.0001 in all instances). Present analysis supports the idea that phenotypic divergence of Homo sapiens is driven by the evolution of

  18. The CpG island methylator phenotype (CIMP) in colorectal cancer

    PubMed Central

    Mojarad, Ehsan Nazemalhosseini; Kuppen, Peter JK; Aghdaei, Hamid Asadzadeh

    2013-01-01

    It is clear that colorectal cancer (CRC) develops through multiple genetic and epigenetic pathways. These pathways may be determined on the basis of three molecular features: (i) mutations in DNA mismatch repair genes, leading to a DNA microsatellite instability (MSI) phenotype, (ii) mutations in APC and other genes that activate Wnt pathway, characterized by chromosomal instability (CIN) phenotype, and (iii) global genome hypermethylation, resulting in switch off of tumor suppressor genes, indicated as CpG island methylator phenotype (CIMP). Each of these pathways is characterized by specific pathological features, mechanisms of carcinogenesis and process of tumor development. The molecular aspects of these pathways have been used clinically in the diagnosis, screening and management of patients with colorectal cancer. In this review we especially describe various aspects of CIMP, one of the important and rather recently discovered pathways that lead to colorectal cancer. PMID:24834258

  19. A highly specific phage defense system is a conserved feature of the Vibrio cholerae mobilome

    PubMed Central

    O’Hara, Brendan J.

    2017-01-01

    Vibrio cholerae-specific bacteriophages are common features of the microbial community during cholera infection in humans. Phages impose strong selective pressure that favors the expansion of phage-resistant strains over their vulnerable counterparts. The mechanisms allowing virulent V. cholerae strains to defend against the ubiquitous threat of predatory phages have not been established. Here, we show that V. cholerae PLEs (phage-inducible chromosomal island-like elements) are widespread genomic islands dedicated to phage defense. Analysis of V. cholerae isolates spanning a 60-year collection period identified five unique PLEs. Remarkably, we found that all PLEs (regardless of geographic or temporal origin) respond to infection by a myovirus called ICP1, the most prominent V. cholerae phage found in cholera patient stool samples from Bangladesh. We found that PLE activity reduces phage genome replication and accelerates cell lysis following ICP1 infection, killing infected host cells and preventing the production of progeny phage. PLEs are mobilized by ICP1 infection and can spread to neighboring cells such that protection from phage predation can be horizontally acquired. Our results reveal that PLEs are a persistent feature of the V. cholerae mobilome that are adapted to providing protection from a single predatory phage and advance our understanding of how phages influence pathogen evolution. PMID:28594826

  20. A highly specific phage defense system is a conserved feature of the Vibrio cholerae mobilome.

    PubMed

    O'Hara, Brendan J; Barth, Zachary K; McKitterick, Amelia C; Seed, Kimberley D

    2017-06-01

    Vibrio cholerae-specific bacteriophages are common features of the microbial community during cholera infection in humans. Phages impose strong selective pressure that favors the expansion of phage-resistant strains over their vulnerable counterparts. The mechanisms allowing virulent V. cholerae strains to defend against the ubiquitous threat of predatory phages have not been established. Here, we show that V. cholerae PLEs (phage-inducible chromosomal island-like elements) are widespread genomic islands dedicated to phage defense. Analysis of V. cholerae isolates spanning a 60-year collection period identified five unique PLEs. Remarkably, we found that all PLEs (regardless of geographic or temporal origin) respond to infection by a myovirus called ICP1, the most prominent V. cholerae phage found in cholera patient stool samples from Bangladesh. We found that PLE activity reduces phage genome replication and accelerates cell lysis following ICP1 infection, killing infected host cells and preventing the production of progeny phage. PLEs are mobilized by ICP1 infection and can spread to neighboring cells such that protection from phage predation can be horizontally acquired. Our results reveal that PLEs are a persistent feature of the V. cholerae mobilome that are adapted to providing protection from a single predatory phage and advance our understanding of how phages influence pathogen evolution.

  1. Genome-wide methylation analysis identifies a core set of hypermethylated genes in CIMP-H colorectal cancer.

    PubMed

    McInnes, Tyler; Zou, Donghui; Rao, Dasari S; Munro, Francesca M; Phillips, Vicky L; McCall, John L; Black, Michael A; Reeve, Anthony E; Guilford, Parry J

    2017-03-28

    Aberrant DNA methylation profiles are a characteristic of all known cancer types, epitomized by the CpG island methylator phenotype (CIMP) in colorectal cancer (CRC). Hypermethylation has been observed at CpG islands throughout the genome, but it is unclear which factors determine whether an individual island becomes methylated in cancer. DNA methylation in CRC was analysed using the Illumina HumanMethylation450K array. Differentially methylated loci were identified using Significance Analysis of Microarrays (SAM) and the Wilcoxon Signed Rank (WSR) test. Unsupervised hierarchical clustering was used to identify methylation subtypes in CRC. In this study we characterized the DNA methylation profiles of 94 CRC tissues and their matched normal counterparts. Consistent with previous studies, unsupervized hierarchical clustering of genome-wide methylation data identified three subtypes within the tumour samples, designated CIMP-H, CIMP-L and CIMP-N, that showed high, low and very low methylation levels, respectively. Differential methylation between normal and tumour samples was analysed at the individual CpG level, and at the gene level. The distribution of hypermethylation in CIMP-N tumours showed high inter-tumour variability and appeared to be highly stochastic in nature, whereas CIMP-H tumours exhibited consistent hypermethylation at a subset of genes, in addition to a highly variable background of hypermethylated genes. EYA4, TFPI2 and TLX1 were hypermethylated in more than 90% of all tumours examined. One-hundred thirty-two genes were hypermethylated in 100% of CIMP-H tumours studied and these were highly enriched for functions relating to skeletal system development (Bonferroni adjusted p value =2.88E-15), segment specification (adjusted p value =9.62E-11), embryonic development (adjusted p value =1.52E-04), mesoderm development (adjusted p value =1.14E-20), and ectoderm development (adjusted p value =7.94E-16). Our genome-wide characterization of DNA

  2. OI-57, a Genomic Island of Escherichia coli O157, Is Present in Other Seropathotypes of Shiga Toxin-Producing E. coli Associated with Severe Human Disease▿

    PubMed Central

    Imamovic, Lejla; Tozzoli, Rosangela; Michelacci, Valeria; Minelli, Fabio; Marziano, Maria Luisa; Caprioli, Alfredo; Morabito, Stefano

    2010-01-01

    Strains of Shiga toxin-producing Escherichia coli (STEC) are a heterogeneous E. coli group that may cause severe disease in humans. STEC have been categorized into seropathotypes (SPTs) based on their phenotypic and molecular characteristics and the clinical features of the associated diseases. SPTs range from A to E, according to a decreasing rank of pathogenicity. To define the virulence gene asset (“virulome”) characterizing the highly pathogenic SPTs, we used microarray hybridization to compare the whole genomes of STEC belonging to SPTs B, C, and D with that of STEC O157 (SPT A). The presence of the open reading frames (ORFs) associated with SPTs A and B was subsequently investigated by PCR in a larger panel of STEC and in other E. coli strains. A genomic island termed OI-57 was present in SPTs A and B but not in the other SPTs. OI-57 harbors the putative virulence gene adfO, encoding a factor enhancing the adhesivity of STEC O157, and ckf, encoding a putative killing factor for the bacterial cell. PCR analyses showed that OI-57 was present in its entirety in the majority of the STEC genomes examined, indicating that it represents a stable acquisition of the positive clonal lineages. OI-57 was also present in a high proportion of the human enteropathogenic E. coli genomes assayed, suggesting that it could be involved in the attaching-and-effacing colonization of the intestinal mucosa. In conclusion, OI-57 appears to be part of the virulome of pathogenic STEC and further studies are needed to elucidate its role in the pathogenesis of STEC infections. PMID:20823207

  3. Haemonchus contortus: Genome Structure, Organization and Comparative Genomics.

    PubMed

    Laing, R; Martinelli, A; Tracey, A; Holroyd, N; Gilleard, J S; Cotton, J A

    2016-01-01

    One of the first genome sequencing projects for a parasitic nematode was that for Haemonchus contortus. The open access data from the Wellcome Trust Sanger Institute provided a valuable early resource for the research community, particularly for the identification of specific genes and genetic markers. Later, a second sequencing project was initiated by the University of Melbourne, and the two draft genome sequences for H. contortus were published back-to-back in 2013. There is a pressing need for long-range genomic information for genetic mapping, population genetics and functional genomic studies, so we are continuing to improve the Wellcome Trust Sanger Institute assembly to provide a finished reference genome for H. contortus. This review describes this process, compares the H. contortus genome assemblies with draft genomes from other members of the strongylid group and discusses future directions for parasite genomics using the H. contortus model. Copyright © 2016 Elsevier Ltd. All rights reserved.

  4. Comparative genomics and the role of lateral gene transfer in the evolution of bovine adapted Streptococcus agalactiae

    PubMed Central

    Richards, Vincent P.; Lang, Ping; Pavinski Bitar, Paulina D.; Lefébure, Tristan; Schukken, Ynte H.; Zadoks, Ruth N.; Stanhope, Michael J.

    2011-01-01

    In addition to causing severe invasive infections in humans, Streptococcus agalactiae, or group B Streptococcus (GBS), is also a major cause of bovine mastitis. Here we provide the first genome sequence for S. agalactiae isolated from a cow diagnosed with clinical mastitis (strain FSL S3-026). Comparison to eight S. agalactiae genomes obtained from human disease isolates revealed 183 genes specific to the bovine strain. Subsequent polymerase chain reaction (PCR) screening for the presence/absence of a subset of these loci in additional bovine and human strains revealed strong differentiation between the two groups (Fisher exact test: p < 0.0001). The majority of the bovine strain-specific genes (~85%) clustered tightly into eight genomic islands, suggesting these genes were acquired through lateral gene transfer (LGT). This bovine GBS also contained an unusually high proportion of insertion sequences (4.3% of the total genome), suggesting frequent genomic rearrangement. Comparison to other mastitis-causing species of bacteria provided strong evidence for two cases of interspecies LGT within the shared bovine environment: bovine S. agalactiae with Streptococcus uberis (nisin U operon) and Streptococcus dysgalactiae subsp. dysgalactiae (lactose operon). We also found evidence for LGT, involving the salivaricin operon, between the bovine S. agalactiae strain and either Streptococcus pyogenes or Streptococcus salivarius. Our findings provide insight intomechanismsfacilitatingenvironmentaladaptationandacquisitionofpotential virulence factors, while highlighting both the key role LGT has played in the recent evolution of the bovine S. agalactiae strain, and the importance of LGT among pathogens within a shared environment. PMID:21536150

  5. Current and Emerging Technologies for the Analysis of the Genome-Wide and Locus-Specific DNA Methylation Patterns.

    PubMed

    Tost, Jörg

    2016-01-01

    DNA methylation is the most studied epigenetic modification, and altered DNA methylation patterns have been identified in cancer and more recently also in many other complex diseases. Furthermore, DNA methylation is influenced by a variety of environmental factors, and the analysis of DNA methylation patterns might allow deciphering previous exposure. Although a large number of techniques to study DNA methylation either genome-wide or at specific loci have been devised, they all are based on a limited number of principles for differentiating the methylation state, viz., methylation-specific/methylation-dependent restriction enzymes, antibodies or methyl-binding proteins, chemical-based enrichment, or bisulfite conversion. Second-generation sequencing has largely replaced microarrays as readout platform and is also becoming more popular for locus-specific DNA methylation analysis. In this chapter, the currently used methods for both genome-wide and locus-specific analysis of 5-methylcytosine and as its oxidative derivatives, such as 5-hydroxymethylcytosine, are reviewed in detail, and the advantages and limitations of each approach are discussed. Furthermore, emerging technologies avoiding PCR amplification and allowing a direct readout of DNA methylation are summarized, together with novel applications, such as the detection of DNA methylation in single cells or in circulating cell-free DNA.

  6. Introduction to the Special Issue: Advances in island plant biology since Sherwin Carlquist's Island Biology

    PubMed Central

    Traveset, Anna; Fernández-Palacios, José María; Kueffer, Christoph; Bellingham, Peter J.; Morden, Clifford; Drake, Donald R.

    2016-01-01

    Sherwin Carlquist's seminal publications—in particular his classic Island Biology, published in 1974—formulated hypotheses specific to island biology that remain valuable today. This special issue brings together some of the most interesting contributions presented at the First Island Biology Symposium hosted in Honolulu on 7–11 July 2014. We compiled a total of 18 contributions that present data from multiple archipelagos across the world and from different disciplines within the plant sciences. In this introductory paper, we first provide a short overview of Carlquist's life and work and then summarize the main findings of the collated papers. A first group of papers deals with issues to which Carlquist notably contributed: long-distance dispersal, adaptive radiation and plant reproductive biology. The findings of such studies demonstrate the extent to which the field has advanced thanks to (i) the increasing availability and richness of island data, covering many taxonomic groups and islands; (ii) new information from the geosciences, phylogenetics and palaeoecology, which allows us a more realistic understanding of the geological and biological development of islands and their biotas; and (iii) the new theoretical and methodological advances that allow us to assess patterns of abundance, diversity and distribution of island biota over large spatial scales. Most other papers in the issue cover a range of topics related to plant conservation on islands, such as causes and consequences of mutualistic disruptions (due to pollinator or disperser losses, introduction of alien predators, etc.). Island biologists are increasingly considering reintroducing ecologically important species to suitable habitats within their historic range and to neighbouring islands with depauperate communities of vertebrate seed dispersers, and an instructive example is given here. Finally, contributions on ecological networks demonstrate the usefulness of this methodological tool to

  7. Detecting non-orthology in the COGs database and other approaches grouping orthologs using genome-specific best hits.

    PubMed

    Dessimoz, Christophe; Boeckmann, Brigitte; Roth, Alexander C J; Gonnet, Gaston H

    2006-01-01

    Correct orthology assignment is a critical prerequisite of numerous comparative genomics procedures, such as function prediction, construction of phylogenetic species trees and genome rearrangement analysis. We present an algorithm for the detection of non-orthologs that arise by mistake in current orthology classification methods based on genome-specific best hits, such as the COGs database. The algorithm works with pairwise distance estimates, rather than computationally expensive and error-prone tree-building methods. The accuracy of the algorithm is evaluated through verification of the distribution of predicted cases, case-by-case phylogenetic analysis and comparisons with predictions from other projects using independent methods. Our results show that a very significant fraction of the COG groups include non-orthologs: using conservative parameters, the algorithm detects non-orthology in a third of all COG groups. Consequently, sequence analysis sensitive to correct orthology assignments will greatly benefit from these findings.

  8. Genomic and Transcriptomic Analysis of Growth-Supporting Dehalogenation of Chlorinated Methanes in Methylobacterium

    PubMed Central

    Chaignaud, Pauline; Maucourt, Bruno; Weiman, Marion; Alberti, Adriana; Kolb, Steffen; Cruveiller, Stéphane; Vuilleumier, Stéphane; Bringel, Françoise

    2017-01-01

    Bacterial adaptation to growth with toxic halogenated chemicals was explored in the context of methylotrophic metabolism of Methylobacterium extorquens, by comparing strains CM4 and DM4, which show robust growth with chloromethane and dichloromethane, respectively. Dehalogenation of chlorinated methanes initiates growth-supporting degradation, with intracellular release of protons and chloride ions in both cases. The core, variable and strain-specific genomes of strains CM4 and DM4 were defined by comparison with genomes of non-dechlorinating strains. In terms of gene content, adaptation toward dehalogenation appears limited, strains CM4 and DM4 sharing between 75 and 85% of their genome with other strains of M. extorquens. Transcript abundance in cultures of strain CM4 grown with chloromethane and of strain DM4 grown with dichloromethane was compared to growth with methanol as a reference C1 growth substrate. Previously identified strain-specific dehalogenase-encoding genes were the most transcribed with chlorinated methanes, alongside other genes encoded by genomic islands (GEIs) and plasmids involved in growth with chlorinated compounds as carbon and energy source. None of the 163 genes shared by strains CM4 and DM4 but not by other strains of M. extorquens showed higher transcript abundance in cells grown with chlorinated methanes. Among the several thousand genes of the M. extorquens core genome, 12 genes were only differentially abundant in either strain CM4 or strain DM4. Of these, 2 genes of known function were detected, for the membrane-bound proton translocating pyrophosphatase HppA and the housekeeping molecular chaperone protein DegP. This indicates that the adaptive response common to chloromethane and dichloromethane is limited at the transcriptional level, and involves aspects of the general stress response as well as of a dehalogenation-specific response to intracellular hydrochloric acid production. Core genes only differentially abundant in either

  9. Genomic and Transcriptomic Analysis of Growth-Supporting Dehalogenation of Chlorinated Methanes in Methylobacterium.

    PubMed

    Chaignaud, Pauline; Maucourt, Bruno; Weiman, Marion; Alberti, Adriana; Kolb, Steffen; Cruveiller, Stéphane; Vuilleumier, Stéphane; Bringel, Françoise

    2017-01-01

    Bacterial adaptation to growth with toxic halogenated chemicals was explored in the context of methylotrophic metabolism of Methylobacterium extorquens , by comparing strains CM4 and DM4, which show robust growth with chloromethane and dichloromethane, respectively. Dehalogenation of chlorinated methanes initiates growth-supporting degradation, with intracellular release of protons and chloride ions in both cases. The core, variable and strain-specific genomes of strains CM4 and DM4 were defined by comparison with genomes of non-dechlorinating strains. In terms of gene content, adaptation toward dehalogenation appears limited, strains CM4 and DM4 sharing between 75 and 85% of their genome with other strains of M. extorquens . Transcript abundance in cultures of strain CM4 grown with chloromethane and of strain DM4 grown with dichloromethane was compared to growth with methanol as a reference C 1 growth substrate. Previously identified strain-specific dehalogenase-encoding genes were the most transcribed with chlorinated methanes, alongside other genes encoded by genomic islands (GEIs) and plasmids involved in growth with chlorinated compounds as carbon and energy source. None of the 163 genes shared by strains CM4 and DM4 but not by other strains of M. extorquens showed higher transcript abundance in cells grown with chlorinated methanes. Among the several thousand genes of the M. extorquens core genome, 12 genes were only differentially abundant in either strain CM4 or strain DM4. Of these, 2 genes of known function were detected, for the membrane-bound proton translocating pyrophosphatase HppA and the housekeeping molecular chaperone protein DegP. This indicates that the adaptive response common to chloromethane and dichloromethane is limited at the transcriptional level, and involves aspects of the general stress response as well as of a dehalogenation-specific response to intracellular hydrochloric acid production. Core genes only differentially abundant in

  10. Bryophyte Tissue-specific Carbon Isotope Record from Galindez Island, Argentine Islands, Western Antarctic Peninsula

    NASA Astrophysics Data System (ADS)

    Beilman, D. W.; Yumol, L. M.; Yu, Z.; Parnikoza, I.

    2016-12-01

    Mossbank ecosystems of the western Antarctic Peninsula (AP) provide an under-utilized archive of past terrestrial environmental change. We measured the stable carbon isotope values (δ13C) of both modern and subfossil bryophytes to characterize differences between species and tissues and to identify changes over time. Living plants of common species including Polytrichum strictum and Chorisodontium aciphyllum were collected from several populations between 64° 09' and 67°35'S and had a wide range of δ13C values from -22 to -32‰ that were distinct between species and tissues. In particular, leaves were consistently more enriched in 13C than stems on average by 2‰. Radiocarbon-dated subfossil leaf tissue in a mossbank peat core raised from Galindez Island (65° 14' 51.4"S, 64° 15' 2.3" W) showed that peat formation began 2300 years ago, and provided evidence for very slow growth or a hiatus between about 1100 and 600 years ago during a period of colder air temperatures evident in depleted hydrogen and oxygen isotope values in James Ross Island ice on the eastern AP. Bryophyte macrofossil remains showed a relatively simple bryophyte community of mainly P. strictum throughout the core, but several periods when wet-adapted species became dominant. Subfossil leaf δ13C values of P. strictum varied from -24 to -30‰, and revealed source-independent discrimination that was higher in recent decades than any time during the last 2300 years. Changes in species' abundance between P. strictum and Pohlia nutans varied with discrimination, suggesting that mossbanks have been sensitive to hydroclimate variation during the Late Holocene, and that moss growth conditions at this western AP site have been anomalous in recent decades.

  11. Comparative genome analysis of Pseudomonas genomes including Populus-associated isolates

    DOE Office of Scientific and Technical Information (OSTI.GOV)

    Jun, Se Ran; Wassenaar, Trudy; Nookaew, Intawat

    The Pseudomonas genus contains a metabolically versatile group of organisms that are known to occupy numerous ecological niches including the rhizosphere and endosphere of many plants influencing phylogenetic diversity and heterogeneity. In this study, comparative genome analysis was performed on over one thousand Pseudomonas genomes, including 21 Pseudomonas strains isolated from the roots of native Populus deltoides. Based on average amino acid identity, genomic clusters were identified within the Pseudomonas genus, which showed agreements with clades by NCBI and cliques by IMG. The P. fluorescens group was organized into 20 distinct genomic clusters, representing enormous diversity and heterogeneity. The speciesmore » P. aeruginosa showed clear distinction in their genomic relatedness compared to other Pseudomonas species groups based on the pan and core genome analysis. The 19 isolates of our 21 Populus-associated isolates formed three distinct subgroups within the P. fluorescens major group, supported by pathway profiles analysis, while two isolates were more closely related to P. chlororaphis and P. putida. The specific genes to Populus-associated subgroups were identified where genes specific to subgroup 1 include several sensory systems such as proteins which act in two-component signal transduction, a TonB-dependent receptor, and a phosphorelay sensor; specific genes to subgroup 2 contain unique hypothetical genes; and genes specific to subgroup 3 organisms have a different hydrolase activity. IMPORTANCE The comparative genome analyses of the genus Pseudomonas that included Populus-associated isolates resulted in novel insights into high diversity of Pseudomonas. Consistent and robust genomic clusters with phylogenetic homogeneity were identified, which resolved species-clades that are not clearly defined by 16S rRNA gene sequence analysis alone. The genomic clusters may be reflective of distinct ecological niches to which the organisms have adapted, but

  12. Comparative genome analysis of Pseudomonas genomes including Populus-associated isolates

    DOE PAGES

    Jun, Se Ran; Wassenaar, Trudy; Nookaew, Intawat; ...

    2016-01-01

    The Pseudomonas genus contains a metabolically versatile group of organisms that are known to occupy numerous ecological niches including the rhizosphere and endosphere of many plants influencing phylogenetic diversity and heterogeneity. In this study, comparative genome analysis was performed on over one thousand Pseudomonas genomes, including 21 Pseudomonas strains isolated from the roots of native Populus deltoides. Based on average amino acid identity, genomic clusters were identified within the Pseudomonas genus, which showed agreements with clades by NCBI and cliques by IMG. The P. fluorescens group was organized into 20 distinct genomic clusters, representing enormous diversity and heterogeneity. The speciesmore » P. aeruginosa showed clear distinction in their genomic relatedness compared to other Pseudomonas species groups based on the pan and core genome analysis. The 19 isolates of our 21 Populus-associated isolates formed three distinct subgroups within the P. fluorescens major group, supported by pathway profiles analysis, while two isolates were more closely related to P. chlororaphis and P. putida. The specific genes to Populus-associated subgroups were identified where genes specific to subgroup 1 include several sensory systems such as proteins which act in two-component signal transduction, a TonB-dependent receptor, and a phosphorelay sensor; specific genes to subgroup 2 contain unique hypothetical genes; and genes specific to subgroup 3 organisms have a different hydrolase activity. IMPORTANCE The comparative genome analyses of the genus Pseudomonas that included Populus-associated isolates resulted in novel insights into high diversity of Pseudomonas. Consistent and robust genomic clusters with phylogenetic homogeneity were identified, which resolved species-clades that are not clearly defined by 16S rRNA gene sequence analysis alone. The genomic clusters may be reflective of distinct ecological niches to which the organisms have adapted, but

  13. LncRNA/DNA binding analysis reveals losses and gains and lineage specificity of genomic imprinting in mammals.

    PubMed

    Liu, Haihua; Shang, Xiaoxiao; Zhu, Hao

    2017-05-15

    Genomic imprinting is regulated by lncRNAs and is important for embryogenesis, physiology and behaviour in mammals. Aberrant imprinting causes diseases and disorders. Experimental studies have examined genomic imprinting primarily in humans and mice, thus leaving some fundamental issues poorly addressed. The cost of experimentally examining imprinted genes in many tissues in diverse species makes computational analysis of lncRNAs' DNA binding sites valuable. We performed lncRNA/DNA binding analysis in imprinting clusters from multiple mammalian clades and discovered the following: (i) lncRNAs and imprinting sites show significant losses and gains and distinct lineage-specificity; (ii) binding of lncRNAs to promoters of imprinted genes may occur widely throughout the genome; (iii) a considerable number of imprinting sites occur in only evolutionarily more derived species; and (iv) multiple lncRNAs may bind to the same imprinting sites, and some lncRNAs have multiple DNA binding motifs. These results suggest that the occurrence of abundant lncRNAs in mammalian genomes makes genomic imprinting a mechanism of adaptive evolution at the epigenome level. The data and program are available at the database LongMan at lncRNA.smu.edu.cn. zhuhao@smu.edu.cn. Supplementary data are available at Bioinformatics online. © The Author 2017. Published by Oxford University Press. All rights reserved. For Permissions, please e-mail: journals.permissions@oup.com

  14. Component identification of electron transport chains in curdlan-producing Agrobacterium sp. ATCC 31749 and its genome-specific prediction using comparative genome and phylogenetic trees analysis.

    PubMed

    Zhang, Hongtao; Setubal, Joao Carlos; Zhan, Xiaobei; Zheng, Zhiyong; Yu, Lijun; Wu, Jianrong; Chen, Dingqiang

    2011-06-01

    Agrobacterium sp. ATCC 31749 (formerly named Alcaligenes faecalis var. myxogenes) is a non-pathogenic aerobic soil bacterium used in large scale biotechnological production of curdlan. However, little is known about its genomic information. DNA partial sequence of electron transport chains (ETCs) protein genes were obtained in order to understand the components of ETC and genomic-specificity in Agrobacterium sp. ATCC 31749. Degenerate primers were designed according to ETC conserved sequences in other reported species. DNA partial sequences of ETC genes in Agrobacterium sp. ATCC 31749 were cloned by the PCR method using degenerate primers. Based on comparative genomic analysis, nine electron transport elements were ascertained, including NADH ubiquinone oxidoreductase, succinate dehydrogenase complex II, complex III, cytochrome c, ubiquinone biosynthesis protein ubiB, cytochrome d terminal oxidase, cytochrome bo terminal oxidase, cytochrome cbb (3)-type terminal oxidase and cytochrome caa (3)-type terminal oxidase. Similarity and phylogenetic analyses of these genes revealed that among fully sequenced Agrobacterium species, Agrobacterium sp. ATCC 31749 is closest to Agrobacterium tumefaciens C58. Based on these results a comprehensive ETC model for Agrobacterium sp. ATCC 31749 is proposed.

  15. The complete genome sequencing of Prevotella intermedia strain OMA14 and a subsequent fine-scale, intra-species genomic comparison reveal an unusual amplification of conjugative and mobile transposons and identify a novel Prevotella-lineage-specific repeat.

    PubMed

    Naito, Mariko; Ogura, Yoshitoshi; Itoh, Takehiko; Shoji, Mikio; Okamoto, Masaaki; Hayashi, Tetsuya; Nakayama, Koji

    2016-02-01

    Prevotella intermedia is a pathogenic bacterium involved in periodontal diseases. Here, we present the complete genome sequence of a clinical strain, OMA14, of this bacterium along with the results of comparative genome analysis with strain 17 of the same species whose genome has also been sequenced, but not fully analysed yet. The genomes of both strains consist of two circular chromosomes: the larger chromosomes are similar in size and exhibit a high overall linearity of gene organizations, whereas the smaller chromosomes show a significant size variation and have undergone remarkable genome rearrangements. Unique features of the Pre. intermedia genomes are the presence of a remarkable number of essential genes on the second chromosomes and the abundance of conjugative and mobilizable transposons (CTns and MTns). The CTns/MTns are particularly abundant in the second chromosomes, involved in its extensive genome rearrangement, and have introduced a number of strain-specific genes into each strain. We also found a novel 188-bp repeat sequence that has been highly amplified in Pre. intermedia and are specifically distributed among the Pre. intermedia-related species. These findings expand our understanding of the genetic features of Pre. intermedia and the roles of CTns and MTns in the evolution of bacteria. © The Author 2015. Published by Oxford University Press on behalf of Kazusa DNA Research Institute.

  16. CpG islands: algorithms and applications in methylation studies.

    PubMed

    Zhao, Zhongming; Han, Leng

    2009-05-15

    Methylation occurs frequently at 5'-cytosine of the CpG dinucleotides in vertebrate genomes; however, this epigenetic feature is rarely observed in CpG islands (CGIs) or CpG clusters in the promoter regions of genes. Aberrant methylation of the promoter-associated CGIs might influence gene expression and cause carcinogenesis. Because of the functional importance, multiple algorithms have been available for identifying CGIs in a genome or a sequence. They can be categorized into the traditional algorithms (e.g., Gardiner-Garden and Frommer (1987), Takai and Jones (2002), and CpGPRoD (2002)) or statistical property based algorithms (CpGcluster (2006) and CG cluster (2007)). We reviewed the features of these algorithms and evaluated their performance on identifying functional CGIs using genome-wide methylation data. Moreover, identification of CGIs is an initial step in many recent studies for predicting methylation status as well as in the design of methylation detection platforms. We reviewed the benchmarks and features used in these studies.

  17. Genomic insights of Pannonibacter phragmitetus strain 31801 isolated from a patient with a liver abscess.

    PubMed

    Zhou, Yajun; Jiang, Tao; Hu, Shaohua; Wang, Mingxi; Ming, Desong; Chen, Shicheng

    2017-12-01

    Pannonibacter phragmitetus is a bioremediation reagent for the detoxification of heavy metals and polycyclic aromatic compounds (PAHs) while it rarely infects healthy populations. However, infection by the opportunistic pathogen P. phragmitetus complicates diagnosis and treatments, and poses a serious threat to immunocompromised patients owing to its multidrug resistance. Unfortunately, genome features, antimicrobial resistance, and virulence potentials in P. phragmitetus have not been reported before. A predominant colony (31801) was isolated from a liver abscess patient, indicating that it accounted for the infection. To investigate its infection mechanism(s) in depth, we sequenced this bacterial genome and tested its antimicrobial resistance. Average nucleotide identity (ANI) analysis assigned the bacterium to the species P. phragmitetus (ANI, >95%). Comparative genomics analyses among Pannonibacter spp. representing the different living niches were used to describe the Pannonibacter pan-genomes and to examine virulence factors, prophages, CRISPR arrays, and genomic islands. Pannonibacter phragmitetus 31801 consisted of one chromosome and one plasmid, while the plasmid was absent in other Pannonibacter isolates. Pannonibacter phragmitetus 31801 may have a great infection potential because a lot of genes encoding toxins, flagellum formation, iron uptake, and virulence factor secretion systems in its genome. Moreover, the genome has 24 genomic islands and 2 prophages. A combination of antimicrobial susceptibility tests and the detailed antibiotic resistance gene analysis provide useful information about the drug resistance mechanisms and therefore can be used to guide the treatment strategy for the bacterial infection. © 2017 The Authors. MicrobiologyOpen published by John Wiley & Sons Ltd.

  18. Allele-specific copy-number discovery from whole-genome and whole-exome sequencing

    PubMed Central

    Wang, WeiBo; Wang, Wei; Sun, Wei; Crowley, James J.; Szatkiewicz, Jin P.

    2015-01-01

    Copy-number variants (CNVs) are a major form of genetic variation and a risk factor for various human diseases, so it is crucial to accurately detect and characterize them. It is conceivable that allele-specific reads from high-throughput sequencing data could be leveraged to both enhance CNV detection and produce allele-specific copy number (ASCN) calls. Although statistical methods have been developed to detect CNVs using whole-genome sequence (WGS) and/or whole-exome sequence (WES) data, information from allele-specific read counts has not yet been adequately exploited. In this paper, we develop an integrated method, called AS-GENSENG, which incorporates allele-specific read counts in CNV detection and estimates ASCN using either WGS or WES data. To evaluate the performance of AS-GENSENG, we conducted extensive simulations, generated empirical data using existing WGS and WES data sets and validated predicted CNVs using an independent methodology. We conclude that AS-GENSENG not only predicts accurate ASCN calls but also improves the accuracy of total copy number calls, owing to its unique ability to exploit information from both total and allele-specific read counts while accounting for various experimental biases in sequence data. Our novel, user-friendly and computationally efficient method and a complete analytic protocol is freely available at https://sourceforge.net/projects/asgenseng/. PMID:25883151

  19. CpG island methylator phenotype in colorectal cancer

    PubMed Central

    Toyota, Minoru; Ahuja, Nita; Ohe-Toyota, Mutsumi; Herman, James G.; Baylin, Stephen B.; Issa, Jean-Pierre J.

    1999-01-01

    Aberrant methylation of promoter region CpG islands is associated with transcriptional inactivation of tumor-suppressor genes in neoplasia. To understand global patterns of CpG island methylation in colorectal cancer, we have used a recently developed technique called methylated CpG island amplification to examine 30 newly cloned differentially methylated DNA sequences. Of these 30 clones, 19 (63%) were progressively methylated in an age-dependent manner in normal colon, 7 (23%) were methylated in a cancer-specific manner, and 4 (13%) were methylated only in cell lines. Thus, a majority of CpG islands methylated in colon cancer are also methylated in a subset of normal colonic cells during the process of aging. In contrast, methylation of the cancer-specific clones was found exclusively in a subset of colorectal cancers, which appear to display a CpG island methylator phenotype (CIMP). CIMP+ tumors also have a high incidence of p16 and THBS1 methylation, and they include the majority of sporadic colorectal cancers with microsatellite instability related to hMLH1 methylation. We thus define a pathway in colorectal cancer that appears to be responsible for the majority of sporadic tumors with mismatch repair deficiency. PMID:10411935

  20. Phylogenetic analyses suggest a hybrid origin of the figs (Moraceae: Ficus) that are endemic to the Ogasawara (Bonin) Islands, Japan.

    PubMed

    Kusumi, Junko; Azuma, Hiroshi; Tzeng, Hsy-Yu; Chou, Lien-Siang; Peng, Yan-Qiong; Nakamura, Keiko; Su, Zhi-Hui

    2012-04-01

    The Ogasawara Islands are oceanic islands and harbor a unique endemic flora. There are three fig species (Ficus boninsimae, F. nishimurae and F. iidaiana) endemic to the Ogasawara Islands, and these species have been considered to be closely related to Ficus erecta, and to have diverged within the islands. However, this hypothesis remains uncertain. To investigate this issue, we assessed the phylogenetic relationships of the Ogasawara figs and their close relatives occurring in Japan, Taiwan and South China based on six plastid genome regions, nuclear ITS region and two nuclear genes. The plastid genome-based tree indicated a close relationship between the Ogasawara figs and F. erecta, whereas some of the nuclear gene-based trees suggested this relationship was not so close. In addition, the phylogenetic analyses of the pollinating wasps associated with these fig species based on the nuclear 28S rRNA and mitochondrial cytB genes suggested that the fig-pollinating wasps of F. erecta are not sister to those of the Ogasawara figs These results suggest the occurrence of an early hybridization event(s) in the lineage leading to the Ogasawara figs. Copyright © 2012 Elsevier Inc. All rights reserved.

  1. Soybean seed extracts preferentially express genomic loci of Bradyrhizobium japonicum in the initial interaction with soybean, Glycine max (L.) Merr.

    PubMed

    Wei, Min; Yokoyama, Tadashi; Minamisawa, Kiwamu; Mitsui, Hisayuki; Itakura, Manabu; Kaneko, Takakazu; Tabata, Satoshi; Saeki, Kazuhiko; Omori, Hirofumi; Tajima, Shigeyuki; Uchiumi, Toshiki; Abe, Mikiko; Ohwada, Takuji

    2008-08-01

    Initial interaction between rhizobia and legumes actually starts via encounters of both partners in the rhizosphere. In this study, the global expression profiles of Bradyrhizobium japonicum USDA 110 in response to soybean (Glycine max) seed extracts (SSE) and genistein, a major soybean-released isoflavone for nod genes induction of B. japonicum, were compared. SSE induced many genomic loci as compared with genistein (5.0 microM), nevertheless SSE-supplemented medium contained 4.7 microM genistein. SSE markedly induced four predominant genomic regions within a large symbiosis island (681 kb), which include tts genes (type III secretion system) and various nod genes. In addition, SSE-treated cells expressed many genomic loci containing genes for polygalacturonase (cell-wall degradation), exopolysaccharide synthesis, 1-aminocyclopropane-1-carboxylate deaminase, ribosome proteins family and energy metabolism even outside symbiosis island. On the other hand, genistein-treated cells exclusively showed one expression cluster including common nod gene operon within symbiosis island and six expression loci including multidrug resistance, which were shared with SSE-treated cells. Twelve putatively regulated genes were indeed validated by quantitative RT-PCR. Several SSE-induced genomic loci likely participate in the initial interaction with legumes. Thus, these results can provide a basic knowledge for screening novel genes relevant to the B. japonicum- soybean symbiosis.

  2. Comparative genome analysis and characterization of the Salmonella Typhimurium strain CCRJ_26 isolated from swine carcasses using whole-genome sequencing approach.

    PubMed

    Panzenhagen, P H N; Cabral, C C; Suffys, P N; Franco, R M; Rodrigues, D P; Conte-Junior, C A

    2018-04-01

    Salmonella pathogenicity relies on virulence factors many of which are clustered within the Salmonella pathogenicity islands. Salmonella also harbours mobile genetic elements such as virulence plasmids, prophage-like elements and antimicrobial resistance genes which can contribute to increase its pathogenicity. Here, we have genetically characterized a selected S. Typhimurium strain (CCRJ_26) from our previous study with Multiple Drugs Resistant profile and high-frequency PFGE clonal profile which apparently persists in the pork production centre of Rio de Janeiro State, Brazil. By whole-genome sequencing, we described the strain's genome virulent content and characterized the repertoire of bacterial plasmids, antibiotic resistance genes and prophage-like elements. Here, we have shown evidence that strain CCRJ_26 genome possible represent a virulence-associated phenotype which may be potentially virulent in human infection. Whole-genome sequencing technologies are still costly and remain underexplored for applied microbiology in Brazil. Hence, this genomic description of S. Typhimurium strain CCRJ_26 will provide help in future molecular epidemiological studies. The analysis described here reveals a quick and useful pipeline for bacterial virulence characterization using whole-genome sequencing approach. © 2018 The Society for Applied Microbiology.

  3. Unraveling the genomic mosaic of a ubiquitous genus of marine cyanobacteria

    PubMed Central

    Dufresne, Alexis; Ostrowski, Martin; Scanlan, David J; Garczarek, Laurence; Mazard, Sophie; Palenik, Brian P; Paulsen, Ian T; de Marsac, Nicole Tandeau; Wincker, Patrick; Dossat, Carole; Ferriera, Steve; Johnson, Justin; Post, Anton F; Hess, Wolfgang R; Partensky, Frédéric

    2008-01-01

    Background The picocyanobacterial genus Synechococcus occurs over wide oceanic expanses, having colonized most available niches in the photic zone. Large scale distribution patterns of the different Synechococcus clades (based on 16S rRNA gene markers) suggest the occurrence of two major lifestyles ('opportunists'/'specialists'), corresponding to two distinct broad habitats ('coastal'/'open ocean'). Yet, the genetic basis of niche partitioning is still poorly understood in this ecologically important group. Results Here, we compare the genomes of 11 marine Synechococcus isolates, representing 10 distinct lineages. Phylogenies inferred from the core genome allowed us to refine the taxonomic relationships between clades by revealing a clear dichotomy within the main subcluster, reminiscent of the two aforementioned lifestyles. Genome size is strongly correlated with the cumulative lengths of hypervariable regions (or 'islands'). One of these, encompassing most genes encoding the light-harvesting phycobilisome rod complexes, is involved in adaptation to changes in light quality and has clearly been transferred between members of different Synechococcus lineages. Furthermore, we observed that two strains (RS9917 and WH5701) that have similar pigmentation and physiology have an unusually high number of genes in common, given their phylogenetic distance. Conclusion We propose that while members of a given marine Synechococcus lineage may have the same broad geographical distribution, local niche occupancy is facilitated by lateral gene transfers, a process in which genomic islands play a key role as a repository for transferred genes. Our work also highlights the need for developing picocyanobacterial systematics based on genome-derived parameters combined with ecological and physiological data. PMID:18507822

  4. HLA in anthropology: the enigma of Easter Island.

    PubMed

    Sanchez-Mazas, Alicia; Thorsby, Erik

    2013-01-01

    In this article, we first present four significant cases where human leukocyte antigen (HLA) studies have been useful for the reconstruction of human peopling history on the worldwide scale; i.e., the spread of modern humans from East Africa, the colonization of East Asia along two geographic routes, the co-evolution of genes and languages in Africa, and the peopling of Europe through a main northward migration. These examples show that natural selection did not erase the genetic signatures of our past migrations in the HLA genetic diversity patterns observed today. In the second part, we summarize our studies on Easter Island. Using genomic HLA typing, we could trace an introduction of HLA alleles of native American (Amerindian) origin to Easter Island before the Peruvian slave trades; i.e., before the 1860s, and provide suggestive evidence that they may have already been introduced in prehistoric time. Our results give further support to an initial Polynesian population of the island, but also reveal an early contribution by Amerindians. Together, our data illustrate the usefulness of typing for HLA alleles to complement genetic analyses in anthropological investigations.

  5. Identification of Human Lineage-Specific Transcriptional Coregulators Enabled by a Glossary of Binding Modules and Tunable Genomic Backgrounds.

    PubMed

    Mariani, Luca; Weinand, Kathryn; Vedenko, Anastasia; Barrera, Luis A; Bulyk, Martha L

    2017-09-27

    Transcription factors (TFs) control cellular processes by binding specific DNA motifs to modulate gene expression. Motif enrichment analysis of regulatory regions can identify direct and indirect TF binding sites. Here, we created a glossary of 108 non-redundant TF-8mer "modules" of shared specificity for 671 metazoan TFs from publicly available and new universal protein binding microarray data. Analysis of 239 ENCODE TF chromatin immunoprecipitation sequencing datasets and associated RNA sequencing profiles suggest the 8mer modules are more precise than position weight matrices in identifying indirect binding motifs and their associated tethering TFs. We also developed GENRE (genomically equivalent negative regions), a tunable tool for construction of matched genomic background sequences for analysis of regulatory regions. GENRE outperformed four state-of-the-art approaches to background sequence construction. We used our TF-8mer glossary and GENRE in the analysis of the indirect binding motifs for the co-occurrence of tethering factors, suggesting novel TF-TF interactions. We anticipate that these tools will aid in elucidating tissue-specific gene-regulatory programs. Copyright © 2017 Elsevier Inc. All rights reserved.

  6. Comparative genome-scale modelling of Staphylococcus aureus strains identifies strain-specific metabolic capabilities linked to pathogenicity

    PubMed Central

    Bosi, Emanuele; Monk, Jonathan M.; Aziz, Ramy K.; Fondi, Marco; Nizet, Victor; Palsson, Bernhard Ø.

    2016-01-01

    Staphylococcus aureus is a preeminent bacterial pathogen capable of colonizing diverse ecological niches within its human host. We describe here the pangenome of S. aureus based on analysis of genome sequences from 64 strains of S. aureus spanning a range of ecological niches, host types, and antibiotic resistance profiles. Based on this set, S. aureus is expected to have an open pangenome composed of 7,411 genes and a core genome composed of 1,441 genes. Metabolism was highly conserved in this core genome; however, differences were identified in amino acid and nucleotide biosynthesis pathways between the strains. Genome-scale models (GEMs) of metabolism were constructed for the 64 strains of S. aureus. These GEMs enabled a systems approach to characterizing the core metabolic and panmetabolic capabilities of the S. aureus species. All models were predicted to be auxotrophic for the vitamins niacin (vitamin B3) and thiamin (vitamin B1), whereas strain-specific auxotrophies were predicted for riboflavin (vitamin B2), guanosine, leucine, methionine, and cysteine, among others. GEMs were used to systematically analyze growth capabilities in more than 300 different growth-supporting environments. The results identified metabolic capabilities linked to pathogenic traits and virulence acquisitions. Such traits can be used to differentiate strains responsible for mild vs. severe infections and preference for hosts (e.g., animals vs. humans). Genome-scale analysis of multiple strains of a species can thus be used to identify metabolic determinants of virulence and increase our understanding of why certain strains of this deadly pathogen have spread rapidly throughout the world. PMID:27286824

  7. Ceratocystis cacaofunesta genome analysis reveals a large expansion of extracellular phosphatidylinositol-specific phospholipase-C genes (PI-PLC).

    PubMed

    Molano, Eddy Patricia Lopez; Cabrera, Odalys García; Jose, Juliana; do Nascimento, Leandro Costa; Carazzolle, Marcelo Falsarella; Teixeira, Paulo José Pereira Lima; Alvarez, Javier Correa; Tiburcio, Ricardo Augusto; Tokimatu Filho, Paulo Massanari; de Lima, Gustavo Machado Alvares; Guido, Rafael Victório Carvalho; Corrêa, Thamy Lívia Ribeiro; Leme, Adriana Franco Paes; Mieczkowski, Piotr; Pereira, Gonçalo Amarante Guimarães

    2018-01-17

    The Ceratocystis genus harbors a large number of phytopathogenic fungi that cause xylem parenchyma degradation and vascular destruction on a broad range of economically important plants. Ceratocystis cacaofunesta is a necrotrophic fungus responsible for lethal wilt disease in cacao. The aim of this work is to analyze the genome of C. cacaofunesta through a comparative approach with genomes of other Sordariomycetes in order to better understand the molecular basis of pathogenicity in the Ceratocystis genus. We present an analysis of the C. cacaofunesta genome focusing on secreted proteins that might constitute pathogenicity factors. Comparative genome analyses among five Ceratocystidaceae species and 23 other Sordariomycetes fungi showed a strong reduction in gene content of the Ceratocystis genus. However, some gene families displayed a remarkable expansion, in particular, the Phosphatidylinositol specific phospholipases-C (PI-PLC) family. Also, evolutionary rate calculations suggest that the evolution process of this family was guided by positive selection. Interestingly, among the 82 PI-PLCs genes identified in the C. cacaofunesta genome, 70 genes encoding extracellular PI-PLCs are grouped in eight small scaffolds surrounded by transposon fragments and scars that could be involved in the rapid evolution of the PI-PLC family. Experimental secretome using LC-MS/MS validated 24% (86 proteins) of the total predicted secretome (342 proteins), including four PI-PLCs and other important pathogenicity factors. Analysis of the Ceratocystis cacaofunesta genome provides evidence that PI-PLCs may play a role in pathogenicity. Subsequent functional studies will be aimed at evaluating this hypothesis. The observed genetic arsenals, together with the analysis of the PI-PLC family shown in this work, reveal significant differences in the Ceratocystis genome compared to the classical vascular fungi, Verticillium and Fusarium. Altogether, our analyses provide new insights into the

  8. Intraclonal Genome Stability of the Metallo-β-lactamase SPM-1-producing Pseudomonas aeruginosa ST277, an Endemic Clone Disseminated in Brazilian Hospitals.

    PubMed

    Nascimento, Ana P B; Ortiz, Mauro F; Martins, Willames M B S; Morais, Guilherme L; Fehlberg, Lorena C C; Almeida, Luiz G P; Ciapina, Luciane P; Gales, Ana C; Vasconcelos, Ana T R

    2016-01-01

    Carbapenems represent the mainstay therapy for the treatment of serious P. aeruginosa infections. However, the emergence of carbapenem resistance has jeopardized the clinical use of this important class of compounds. The production of SPM-1 metallo-β-lactamase has been the most common mechanism of carbapenem resistance identified in P. aeruginosa isolated from Brazilian medical centers. Interestingly, a single SPM-1-producing P. aeruginosa clone belonging to the ST277 has been widely spread within the Brazilian territory. In the current study, we performed a next-generation sequencing of six SPM-1-producing P. aeruginosa ST277 isolates. The core genome contains 5899 coding genes relative to the reference strain P. aeruginos a PAO1. A total of 26 genomic islands were detected in these isolates. We identified remarkable elements inside these genomic islands, such as copies of the bla SPM-1 gene conferring resistance to carbapenems and a type I-C CRISPR-Cas system, which is involved in protection of the chromosome against foreign DNA. In addition, we identified single nucleotide polymorphisms causing amino acid changes in antimicrobial resistance and virulence-related genes. Together, these factors could contribute to the marked resistance and persistence of the SPM-1-producing P. aeruginosa ST277 clone. A comparison of the SPM-1-producing P. aeruginosa ST277 genomes showed that their core genome has a high level nucleotide similarity and synteny conservation. The variability observed was mainly due to acquisition of genomic islands carrying several antibiotic resistance genes.

  9. AGAPE (Automated Genome Analysis PipelinE) for Pan-Genome Analysis of Saccharomyces cerevisiae

    PubMed Central

    Song, Giltae; Dickins, Benjamin J. A.; Demeter, Janos; Engel, Stacia; Dunn, Barbara; Cherry, J. Michael

    2015-01-01

    The characterization and public release of genome sequences from thousands of organisms is expanding the scope for genetic variation studies. However, understanding the phenotypic consequences of genetic variation remains a challenge in eukaryotes due to the complexity of the genotype-phenotype map. One approach to this is the intensive study of model systems for which diverse sources of information can be accumulated and integrated. Saccharomyces cerevisiae is an extensively studied model organism, with well-known protein functions and thoroughly curated phenotype data. To develop and expand the available resources linking genomic variation with function in yeast, we aim to model the pan-genome of S. cerevisiae. To initiate the yeast pan-genome, we newly sequenced or re-sequenced the genomes of 25 strains that are commonly used in the yeast research community using advanced sequencing technology at high quality. We also developed a pipeline for automated pan-genome analysis, which integrates the steps of assembly, annotation, and variation calling. To assign strain-specific functional annotations, we identified genes that were not present in the reference genome. We classified these according to their presence or absence across strains and characterized each group of genes with known functional and phenotypic features. The functional roles of novel genes not found in the reference genome and associated with strains or groups of strains appear to be consistent with anticipated adaptations in specific lineages. As more S. cerevisiae strain genomes are released, our analysis can be used to collate genome data and relate it to lineage-specific patterns of genome evolution. Our new tool set will enhance our understanding of genomic and functional evolution in S. cerevisiae, and will be available to the yeast genetics and molecular biology community. PMID:25781462

  10. Non-specific filtering of beta-distributed data.

    PubMed

    Wang, Xinhui; Laird, Peter W; Hinoue, Toshinori; Groshen, Susan; Siegmund, Kimberly D

    2014-06-19

    Non-specific feature selection is a dimension reduction procedure performed prior to cluster analysis of high dimensional molecular data. Not all measured features are expected to show biological variation, so only the most varying are selected for analysis. In DNA methylation studies, DNA methylation is measured as a proportion, bounded between 0 and 1, with variance a function of the mean. Filtering on standard deviation biases the selection of probes to those with mean values near 0.5. We explore the effect this has on clustering, and develop alternate filter methods that utilize a variance stabilizing transformation for Beta distributed data and do not share this bias. We compared results for 11 different non-specific filters on eight Infinium HumanMethylation data sets, selected to span a variety of biological conditions. We found that for data sets having a small fraction of samples showing abnormal methylation of a subset of normally unmethylated CpGs, a characteristic of the CpG island methylator phenotype in cancer, a novel filter statistic that utilized a variance-stabilizing transformation for Beta distributed data outperformed the common filter of using standard deviation of the DNA methylation proportion, or its log-transformed M-value, in its ability to detect the cancer subtype in a cluster analysis. However, the standard deviation filter always performed among the best for distinguishing subgroups of normal tissue. The novel filter and standard deviation filter tended to favour features in different genome contexts; for the same data set, the novel filter always selected more features from CpG island promoters and the standard deviation filter always selected more features from non-CpG island intergenic regions. Interestingly, despite selecting largely non-overlapping sets of features, the two filters did find sample subsets that overlapped for some real data sets. We found two different filter statistics that tended to prioritize features with

  11. Typing and comparative genome analysis of Brucella melitensis isolated from Lebanon.

    PubMed

    Abou Zaki, Natalia; Salloum, Tamara; Osman, Marwan; Rafei, Rayane; Hamze, Monzer; Tokajian, Sima

    2017-10-16

    Brucella melitensis is the main causative agent of the zoonotic disease brucellosis. This study aimed at typing and characterizing genetic variation in 33 Brucella isolates recovered from patients in Lebanon. Bruce-ladder multiplex PCR and PCR-RFLP of omp31, omp2a and omp2b were performed. Sixteen representative isolates were chosen for draft-genome sequencing and analyzed to determine variations in virulence, resistance, genomic islands, prophages and insertion sequences. Comparative whole-genome single nucleotide polymorphism analysis was also performed. The isolates were confirmed to be B. melitensis. Genome analysis revealed multiple virulence determinants and efflux pumps. Genome comparisons and single nucleotide polymorphisms divided the isolates based on geographical distribution but revealed high levels of similarity between the strains. Sequence divergence in B. melitensis was mainly due to lateral gene transfer of mobile elements. This is the first report of an in-depth genomic characterization of B. melitensis in Lebanon. © FEMS 2017. All rights reserved. For permissions, please e-mail: journals.permissions@oup.com.

  12. Marked genomic heterogeneity of rat hepatitis E virus strains in Indonesia demonstrated on a full-length genome analysis.

    PubMed

    Mulyanto; Suparyatmo, Joseph Benedictus; Andayani, I Gusti Ayu Sri; Khalid; Takahashi, Masaharu; Ohnishi, Hiroshi; Jirintai, Suljid; Nagashima, Shigeo; Nishizawa, Tsutomu; Okamoto, Hiroaki

    2014-01-22

    Rat hepatitis E virus (HEV) strains have recently been isolated in several areas of Germany, Vietnam, the United States, Indonesia and China. However, genetic information regarding these rat HEV strains is limited. A total of 369 wild rats (Rattus rattus) captured in Central Java (Solo) and on Lombok Island, Indonesia were tested for the presence of rat HEV-specific antibodies and RNA. Overall, 137 rats (37.1%) tested positive for rat anti-HEV antibodies, while 97 (26.3%) had rat HEV RNA detectable on reverse transcription-PCR with primers targeting the ORF1-ORF2 junctional region. The 97 HEV strains recovered from these viremic rats were 76.3-100% identical to each other in an 840-nucleotide sequence and 75.4-88.4% identical to the rat HEV strains reported in Germany and Vietnam. Five representative Indonesian strains, one from each of five phylogenetic clusters, whose entire genomic sequence was determined, were segregated into three genetic groups (a German type, Vietnamese type and novel type), which differed from each other by 19.5-23.5 (22.0 ± 1.7)% over the entire genome. These results suggest the presence of at least three genetic groups of rat HEV and indicate the circulation of polyphyletic strains of rat HEV belonging to three distinct genetic groups in Indonesia. Copyright © 2013 Elsevier B.V. All rights reserved.

  13. Genomic Analysis of Bacillus sp. Strain B25, a Biocontrol Agent of Maize Pathogen Fusarium verticillioides.

    PubMed

    Douriet-Gámez, Nadia R; Maldonado-Mendoza, Ignacio E; Ibarra-Laclette, Enrique; Blom, Jochen; Calderón-Vázquez, Carlos L

    2018-03-01

    Bacillus sp. B25 is an effective biocontrol agent against the maize pathogenic fungus Fusarium verticillioides (Fv). Previous in vitro assays have shown that B25 has protease, glucanase, and chitinase activities and siderophores production; however, specific mechanisms by which B25 controls Fv are still unknown. To determine the genetic traits involved in biocontrol, B25 genome was sequenced and analyzed. B25 genome is composed of 5,113,413 bp and 5251 coding genes. A multilocus phylogenetic analysis (MLPA) suggests that B25 is closely related to the Bacillus cereus group and a high percentage (70-75%) of the genetic information is conserved between B25 and related strains, which include most of the genes associated to fungal antagonism. Some of these genes are shared with some biocontrol agents of the Bacillus genus and less with Pseudomonas and Serratia strains. We performed a genomic comparison between B25 and five Bacillus spp., Pseudomonas and Serratia strains. B25 contains genes involved in a wide variety of antagonistic mechanisms including chitinases, glycoside hydrolases, siderophores, antibiotics, and biofilm production that could be implicated in root colonization. Also, 24 genomic islands and 3 CRISPR sequences were identified in the B25 genome. This is the first comparative genome analysis between strains belonging to the B. cereus group and biocontrol agents of phytopathogenic fungi. These results are the starting point for further studies on B25 gene expression during its interaction with Fv.

  14. Informed consent in direct-to-consumer personal genome testing: the outline of a model between specific and generic consent.

    PubMed

    Bunnik, Eline M; Janssens, A Cecile J W; Schermer, Maartje H N

    2014-09-01

    Broad genome-wide testing is increasingly finding its way to the public through the online direct-to-consumer marketing of so-called personal genome tests. Personal genome tests estimate genetic susceptibilities to multiple diseases and other phenotypic traits simultaneously. Providers commonly make use of Terms of Service agreements rather than informed consent procedures. However, to protect consumers from the potential physical, psychological and social harms associated with personal genome testing and to promote autonomous decision-making with regard to the testing offer, we argue that current practices of information provision are insufficient and that there is a place--and a need--for informed consent in personal genome testing, also when it is offered commercially. The increasing quantity, complexity and diversity of most testing offers, however, pose challenges for information provision and informed consent. Both specific and generic models for informed consent fail to meet its moral aims when applied to personal genome testing. Consumers should be enabled to know the limitations, risks and implications of personal genome testing and should be given control over the genetic information they do or do not wish to obtain. We present the outline of a new model for informed consent which can meet both the norm of providing sufficient information and the norm of providing understandable information. The model can be used for personal genome testing, but will also be applicable to other, future forms of broad genetic testing or screening in commercial and clinical settings. © 2012 John Wiley & Sons Ltd.

  15. Genome health nutrigenomics and nutrigenetics--diagnosis and nutritional treatment of genome damage on an individual basis.

    PubMed

    Fenech, Michael

    2008-04-01

    The term nutrigenomics refers to the effect of diet on gene expression. The term nutrigenetics refers to the impact of inherited traits on the response to a specific dietary pattern, functional food or supplement on a specific health outcome. The specific fields of genome health nutrigenomics and genome health nutrigenetics are emerging as important new research areas because it is becoming increasingly evident that (a) risk for developmental and degenerative disease increases with DNA damage which in turn is dependent on nutritional status and (b) optimal concentration of micronutrients for prevention of genome damage is also dependent on genetic polymorphisms that alter function of genes involved directly or indirectly in uptake and metabolism of micronutrients required for DNA repair and DNA replication. Development of dietary patterns, functional foods and supplements that are designed to improve genome health maintenance in humans with specific genetic backgrounds may provide an important contribution to a new optimum health strategy based on the diagnosis and individualised nutritional treatment of genome instability i.e. Genome Health Clinics.

  16. Genomic timetree and historical biogeography of Caribbean island ameiva lizards (Pholidoscelis: Teiidae).

    PubMed

    Tucker, Derek B; Hedges, Stephen Blair; Colli, Guarino R; Pyron, Robert Alexander; Sites, Jack W

    2017-09-01

    The phylogenetic relationships and biogeographic history of Caribbean island ameivas ( Pholidoscelis ) are not well-known because of incomplete sampling, conflicting datasets, and poor support for many clades. Here, we use phylogenomic and mitochondrial DNA datasets to reconstruct a well-supported phylogeny and assess historical colonization patterns in the group. We obtained sequence data from 316 nuclear loci and one mitochondrial marker for 16 of 19 extant species of the Caribbean endemic genus Pholidoscelis . Phylogenetic analyses were carried out using both concatenation and species tree approaches. To estimate divergence times, we used fossil teiids to calibrate a timetree which was used to elucidate the historical biogeography of these lizards. All phylogenetic analyses recovered four well-supported species groups (clades) recognized previously and supported novel relationships of those groups, including a ( P. auberi + P. lineolatus ) clade (western + central Caribbean), and a ( P. exsul + P. plei ) clade (eastern Caribbean). Divergence between Pholidoscelis and its sister clade was estimated to have occurred ~25 Ma, with subsequent diversification on Caribbean islands occurring over the last 11 Myr. Of the six models compared in the biogeographic analyses, the scenario which considered the distance among islands and allowed dispersal in all directions best fit the data. These reconstructions suggest that the ancestor of this group colonized either Hispaniola or Puerto Rico from Middle America. We provide a well-supported phylogeny of Pholidoscelis with novel relationships not reported in previous studies that were based on significantly smaller datasets. We propose that Pholidoscelis colonized the eastern Greater Antilles from Middle America based on our biogeographic analysis, phylogeny, and divergence time estimates. The closing of the Central American Seaway and subsequent formation of the modern Atlantic meridional overturning circulation may

  17. T-DNA-genome junctions form early after infection and are influenced by the chromatin state of the host genome

    PubMed Central

    Tripathi, Pooja; Muth, Theodore R.

    2017-01-01

    Agrobacterium tumefaciens mediated T-DNA integration is a common tool for plant genome manipulation. However, there is controversy regarding whether T-DNA integration is biased towards genes or randomly distributed throughout the genome. In order to address this question, we performed high-throughput mapping of T-DNA-genome junctions obtained in the absence of selection at several time points after infection. T-DNA-genome junctions were detected as early as 6 hours post-infection. T-DNA distribution was apparently uniform throughout the chromosomes, yet local biases toward AT-rich motifs and T-DNA border sequence micro-homology were detected. Analysis of the epigenetic landscape of previously isolated sites of T-DNA integration in Kanamycin-selected transgenic plants showed an association with extremely low methylation and nucleosome occupancy. Conversely, non-selected junctions from this study showed no correlation with methylation and had chromatin marks, such as high nucleosome occupancy and high H3K27me3, that correspond to three-dimensional-interacting heterochromatin islands embedded within euchromatin. Such structures may play a role in capturing and silencing invading T-DNA. PMID:28742090

  18. Searching whole genome sequences for biochemical identification features of emerging and reemerging pathogenic Corynebacterium species.

    PubMed

    Santos, André S; Ramos, Rommel T; Silva, Artur; Hirata, Raphael; Mattos-Guaraldi, Ana L; Meyer, Roberto; Azevedo, Vasco; Felicori, Liza; Pacheco, Luis G C

    2018-05-11

    Biochemical tests are traditionally used for bacterial identification at the species level in clinical microbiology laboratories. While biochemical profiles are generally efficient for the identification of the most important corynebacterial pathogen Corynebacterium diphtheriae, their ability to differentiate between biovars of this bacterium is still controversial. Besides, the unambiguous identification of emerging human pathogenic species of the genus Corynebacterium may be hampered by highly variable biochemical profiles commonly reported for these species, including Corynebacterium striatum, Corynebacterium amycolatum, Corynebacterium minutissimum, and Corynebacterium xerosis. In order to identify the genomic basis contributing for the biochemical variabilities observed in phenotypic identification methods of these bacteria, we combined a comprehensive literature review with a bioinformatics approach based on reconstruction of six specific biochemical reactions/pathways in 33 recently released whole genome sequences. We used data retrieved from curated databases (MetaCyc, PathoSystems Resource Integration Center (PATRIC), The SEED, TransportDB, UniProtKB) associated with homology searches by BLAST and profile Hidden Markov Models (HMMs) to detect enzymes participating in the various pathways and performed ab initio protein structure modeling and molecular docking to confirm specific results. We found a differential distribution among the various strains of genes that code for some important enzymes, such as beta-phosphoglucomutase and fructokinase, and also for individual components of carbohydrate transport systems, including the fructose-specific phosphoenolpyruvate-dependent sugar phosphotransferase (PTS) and the ribose-specific ATP-binging cassette (ABC) transporter. Horizontal gene transfer plays a role in the biochemical variability of the isolates, as some genes needed for sucrose fermentation were seen to be present in genomic islands. Noteworthy

  19. Identification of rhizome-specific genes by genome-wide differential expression Analysis in Oryza longistaminata

    PubMed Central

    2011-01-01

    Background Rhizomatousness is a key component of perenniality of many grasses that contribute to competitiveness and invasiveness of many noxious grass weeds, but can potentially be used to develop perennial cereal crops for sustainable farmers in hilly areas of tropical Asia. Oryza longistaminata, a perennial wild rice with strong rhizomes, has been used as the model species for genetic and molecular dissection of rhizome development and in breeding efforts to transfer rhizome-related traits into annual rice species. In this study, an effort was taken to get insights into the genes and molecular mechanisms underlying the rhizomatous trait in O. longistaminata by comparative analysis of the genome-wide tissue-specific gene expression patterns of five different tissues of O. longistaminata using the Affymetrix GeneChip Rice Genome Array. Results A total of 2,566 tissue-specific genes were identified in five different tissues of O. longistaminata, including 58 and 61 unique genes that were specifically expressed in the rhizome tips (RT) and internodes (RI), respectively. In addition, 162 genes were up-regulated and 261 genes were down-regulated in RT compared to the shoot tips. Six distinct cis-regulatory elements (CGACG, GCCGCC, GAGAC, AACGG, CATGCA, and TAAAG) were found to be significantly more abundant in the promoter regions of genes differentially expressed in RT than in the promoter regions of genes uniformly expressed in all other tissues. Many of the RT and/or RI specifically or differentially expressed genes were located in the QTL regions associated with rhizome expression, rhizome abundance and rhizome growth-related traits in O. longistaminata and thus are good candidate genes for these QTLs. Conclusion The initiation and development of the rhizomatous trait in O. longistaminata are controlled by very complex gene networks involving several plant hormones and regulatory genes, different members of gene families showing tissue specificity and their

  20. Identification of rhizome-specific genes by genome-wide differential expression analysis in Oryza longistaminata.

    PubMed

    Hu, Fengyi; Wang, Di; Zhao, Xiuqin; Zhang, Ting; Sun, Haixi; Zhu, Linghua; Zhang, Fan; Li, Lijuan; Li, Qiong; Tao, Dayun; Fu, Binying; Li, Zhikang

    2011-01-24

    Rhizomatousness is a key component of perenniality of many grasses that contribute to competitiveness and invasiveness of many noxious grass weeds, but can potentially be used to develop perennial cereal crops for sustainable farmers in hilly areas of tropical Asia. Oryza longistaminata, a perennial wild rice with strong rhizomes, has been used as the model species for genetic and molecular dissection of rhizome development and in breeding efforts to transfer rhizome-related traits into annual rice species. In this study, an effort was taken to get insights into the genes and molecular mechanisms underlying the rhizomatous trait in O. longistaminata by comparative analysis of the genome-wide tissue-specific gene expression patterns of five different tissues of O. longistaminata using the Affymetrix GeneChip Rice Genome Array. A total of 2,566 tissue-specific genes were identified in five different tissues of O. longistaminata, including 58 and 61 unique genes that were specifically expressed in the rhizome tips (RT) and internodes (RI), respectively. In addition, 162 genes were up-regulated and 261 genes were down-regulated in RT compared to the shoot tips. Six distinct cis-regulatory elements (CGACG, GCCGCC, GAGAC, AACGG, CATGCA, and TAAAG) were found to be significantly more abundant in the promoter regions of genes differentially expressed in RT than in the promoter regions of genes uniformly expressed in all other tissues. Many of the RT and/or RI specifically or differentially expressed genes were located in the QTL regions associated with rhizome expression, rhizome abundance and rhizome growth-related traits in O. longistaminata and thus are good candidate genes for these QTLs. The initiation and development of the rhizomatous trait in O. longistaminata are controlled by very complex gene networks involving several plant hormones and regulatory genes, different members of gene families showing tissue specificity and their regulated pathways. Auxin

  1. Investigating the Relatedness of Enteroinvasive Escherichia coli to Other E. coli and Shigella Isolates by Using Comparative Genomics

    PubMed Central

    Hazen, Tracy H.; Leonard, Susan R.; Lampel, Keith A.; Lacher, David W.

    2016-01-01

    Enteroinvasive Escherichia coli (EIEC) is a unique pathovar that has a pathogenic mechanism nearly indistinguishable from that of Shigella species. In contrast to isolates of the four Shigella species, which are widespread and can be frequent causes of human illness, EIEC causes far fewer reported illnesses each year. In this study, we analyzed the genome sequences of 20 EIEC isolates, including 14 first described in this study. Phylogenomic analysis of the EIEC genomes demonstrated that 17 of the isolates are present in three distinct lineages that contained only EIEC genomes, compared to reference genomes from each of the E. coli pathovars and Shigella species. Comparative genomic analysis identified genes that were unique to each of the three identified EIEC lineages. While many of the EIEC lineage-specific genes have unknown functions, those with predicted functions included a colicin and putative proteins involved in transcriptional regulation or carbohydrate metabolism. In silico detection of the Shigella virulence plasmid (pINV), which is essential for the invasion of host cells, demonstrated that a form of pINV was present in nearly all EIEC genomes, but the Mxi-Spa-Ipa region of the plasmid that encodes the invasion-associated proteins was absent from several of the EIEC isolates. The comparative genomic findings in this study support the hypothesis that multiple EIEC lineages have evolved independently from multiple distinct lineages of E. coli via the acquisition of the Shigella virulence plasmid and, in some cases, the Shigella pathogenicity islands. PMID:27271741

  2. Holocentromeres in Rhynchospora are associated with genome-wide centromere-specific repeat arrays interspersed among euchromatin.

    PubMed

    Marques, André; Ribeiro, Tiago; Neumann, Pavel; Macas, Jiří; Novák, Petr; Schubert, Veit; Pellino, Marco; Fuchs, Jörg; Ma, Wei; Kuhlmann, Markus; Brandt, Ronny; Vanzela, André L L; Beseda, Tomáš; Šimková, Hana; Pedrosa-Harand, Andrea; Houben, Andreas

    2015-11-03

    Holocentric chromosomes lack a primary constriction, in contrast to monocentrics. They form kinetochores distributed along almost the entire poleward surface of the chromatids, to which spindle fibers attach. No centromere-specific DNA sequence has been found for any holocentric organism studied so far. It was proposed that centromeric repeats, typical for many monocentric species, could not occur in holocentrics, most likely because of differences in the centromere organization. Here we show that the holokinetic centromeres of the Cyperaceae Rhynchospora pubera are highly enriched by a centromeric histone H3 variant-interacting centromere-specific satellite family designated "Tyba" and by centromeric retrotransposons (i.e., CRRh) occurring as genome-wide interspersed arrays. Centromeric arrays vary in length from 3 to 16 kb and are intermingled with gene-coding sequences and transposable elements. We show that holocentromeres of metaphase chromosomes are composed of multiple centromeric units rather than possessing a diffuse organization, thus favoring the polycentric model. A cell-cycle-dependent shuffling of multiple centromeric units results in the formation of functional (poly)centromeres during mitosis. The genome-wide distribution of centromeric repeat arrays interspersing the euchromatin provides a previously unidentified type of centromeric chromatin organization among eukaryotes. Thus, different types of holocentromeres exist in different species, namely with and without centromeric repetitive sequences.

  3. Detailed mtDNA genotypes permit a reassessment of the settlement and population structure of the Andaman Islands.

    PubMed

    Barik, S S; Sahani, R; Prasad, B V R; Endicott, P; Metspalu, M; Sarkar, B N; Bhattacharya, S; Annapoorna, P C H; Sreenath, J; Sun, D; Sanchez, J J; Ho, S Y W; Chandrasekar, A; Rao, V R

    2008-05-01

    The population genetics of the Indian subcontinent is central to understanding early human prehistory due to its strategic location on the proposed corridor of human movement from Africa to Australia during the late Pleistocene. Previous genetic research using mtDNA has emphasized the relative isolation of the late Pleistocene colonizers, and the physically isolated Andaman Island populations of Island South-East Asia remain the source of claims supporting an early split between the populations that formed the patchy settlement pattern along the coast of the Indian Ocean. Using whole-genome sequencing, combined with multiplexed SNP typing, this study investigates the deep structure of mtDNA haplogroups M31 and M32 in India and the Andaman Islands. The identification of a so far unnoticed rare polymorphism shared between these two lineages suggests that they are actually sister groups within a single haplogroup, M31'32. The enhanced resolution of M31 allows for the inference of a more recent colonization of the Andaman Islands than previously suggested, but cannot reject the very early peopling scenario. We further demonstrate a widespread overlap of mtDNA and cultural markers between the two major language groups of the Andaman archipelago. Given the "completeness" of the genealogy based on whole genome sequences, and the multiple scenarios for the peopling of the Andaman Islands sustained by this inferred genealogy, our study hints that further mtDNA based phylogeographic studies are unlikely to unequivocally support any one of these possibilities. (c) 2008 Wiley-Liss, Inc.

  4. The Genetics of Symbiotic Nitrogen Fixation: Comparative Genomics of 14 Rhizobia Strains by Resolution of Protein Clusters

    PubMed Central

    Black, Michael; Moolhuijzen, Paula; Chapman, Brett; Barrero, Roberto; Howieson, John; Hungria, Mariangela; Bellgard, Matthew

    2012-01-01

    The symbiotic relationship between legumes and nitrogen fixing bacteria is critical for agriculture, as it may have profound impacts on lowering costs for farmers, on land sustainability, on soil quality, and on mitigation of greenhouse gas emissions. However, despite the importance of the symbioses to the global nitrogen cycling balance, very few rhizobial genomes have been sequenced so far, although there are some ongoing efforts in sequencing elite strains. In this study, the genomes of fourteen selected strains of the order Rhizobiales, all previously fully sequenced and annotated, were compared to assess differences between the strains and to investigate the feasibility of defining a core ‘symbiome’—the essential genes required by all rhizobia for nodulation and nitrogen fixation. Comparison of these whole genomes has revealed valuable information, such as several events of lateral gene transfer, particularly in the symbiotic plasmids and genomic islands that have contributed to a better understanding of the evolution of contrasting symbioses. Unique genes were also identified, as well as omissions of symbiotic genes that were expected to be found. Protein comparisons have also allowed the identification of a variety of similarities and differences in several groups of genes, including those involved in nodulation, nitrogen fixation, production of exopolysaccharides, Type I to Type VI secretion systems, among others, and identifying some key genes that could be related to host specificity and/or a better saprophytic ability. However, while several significant differences in the type and number of proteins were observed, the evidence presented suggests no simple core symbiome exists. A more abstract systems biology concept of nitrogen fixing symbiosis may be required. The results have also highlighted that comparative genomics represents a valuable tool for capturing specificities and generalities of each genome. PMID:24704847

  5. Genome and Transcriptome Sequences Reveal the Specific Parasitism of the Nematophagous Purpureocillium lilacinum 36-1

    PubMed Central

    Xie, Jialian; Li, Shaojun; Mo, Chenmi; Xiao, Xueqiong; Peng, Deliang; Wang, Gaofeng; Xiao, Yannong

    2016-01-01

    Purpureocillium lilacinum is a promising nematophagous ascomycete able to adapt diverse environments and it is also an opportunistic fungus that infects humans. A microbial inoculant of P. lilacinum has been registered to control plant parasitic nematodes. However, the molecular mechanism of the toxicological processes is still unclear because of the relatively few reports on the subject. In this study, using Illumina paired-end sequencing, the draft genome sequence and the transcriptome of P. lilacinum strain 36-1 infecting nematode-eggs were determined. Whole genome alignment indicated that P. lilacinum 36-1 possessed a more dynamic genome in comparison with P. lilacinum India strain. Moreover, a phylogenetic analysis showed that the P. lilacinum 36-1 had a closer relation to entomophagous fungi. The protein-coding genes in P. lilacinum 36-1 occurred much more frequently than they did in other fungi, which was a result of the depletion of repeat-induced point mutations (RIP). Comparative genome and transcriptome analyses revealed the genes that were involved in pathogenicity, particularly in the recognition, adhesion of nematode-eggs, downstream signal transduction pathways and hydrolase genes. By contrast, certain numbers of cellulose and xylan degradation genes and a lack of polysaccharide lyase genes showed the potential of P. lilacinum 36-1 as an endophyte. Notably, the expression of appressorium-formation and antioxidants-related genes exhibited similar infection patterns in P. lilacinum strain 36-1 to those of the model entomophagous fungi Metarhizium spp. These results uncovered the specific parasitism of P. lilacinum and presented the genes responsible for the infection of nematode-eggs. PMID:27486440

  6. Frog size on continental islands of the coast of Rio de Janeiro and the generality of the Island Rule

    PubMed Central

    2018-01-01

    Island Rule postulated that individuals on islands tend to dwarfism when individuals from mainland populations are large and to gigantism when mainland populations present small individuals. There has been much discussion about this rule, but only few studies were carried out aiming to reveal this pattern for anurans. Our study focused on measuring the size of individuals on islands and to find a possible pattern of size modification for insular anurans. Individuals were collected on continental islands, measured and compared to mainland populations. We selected four species with different natural history aspects during these analyses. Island parameters were compared to size of individuals in order to find an explanation to size modification. Three of the four species presented size shifting on islands. Ololygon trapicheiroi and Adenomera marmorata showed dwarfism, Boana albomarginata showed gigantism and in Thoropa miliaris there was no evident size modification. Allometric analysis also revealed differential modification, which might be a result of different selective pressures on islands in respect of mainland populations. Regression model explained most of the size modification in B. albomarginata, but not for the other species. Our results indicate that previous assumptions, usually proposed for mammals from older islands, do not fit to the anurans studied here. We support the assumption that size modification on islands are population-specific. Hence, in B. albomarginata some factor associated to competition, living area and isolation time might likely be responsible for gigantism on islands. PMID:29324790

  7. Comparative genome analysis of two Streptococcus phocae subspecies provides novel insights into pathogenicity.

    PubMed

    Bethke, J; Avendaño-Herrera, R

    2017-02-01

    Streptococcus phocae is a beta-hemolytic, Gram-positive bacterium that was first isolated in Norway from clinical specimens of harbor seal (Phoca vitulina) affected by pneumonia or respiratory infection, and in 2005, this bacterium was identified from disease outbreaks at an Atlantic salmon farm. A recent comparative polyphasic study reclassified Streptococcus phocae as subsp. phocae and subsp. salmonis, and there are currently two S. phocae NCBI sequencing projects for the type strains ATCC 51973 T and C-4 T . The present study compared these genome sequences to determine shared properties between the pathogenic mammalian and fish S. phocae subspecies. Both subspecies presented genomic islands, prophages, CRISPRs, and multiple gene activator and RofA regulator regions that could play key roles in the pathogenesis of streptococcal species. Likewise, proteins possibly influencing immune system evasion and virulence strategies were identified in both genomes, including Streptokinases, Streptolysin S, IgG endopeptidase, Fibronectin binding proteins, Daunorubicin, and Penicillin resistance proteins. Comparative differences in phage, non-phage, and genomic island sequences may form the genetic basis for the virulence, pathogenicity, and ability of S. phocae subsp. salmonis to infect and cause disease in Atlantic salmon, in contrast to S. phocae subsp. phocae. This comparative genomic study between two S. phocae subsp. provides novel insights into virulence factors and pathogenicity, offering important information that will facilitate the development of preventive and treatment measures against this pathogen. Copyright © 2016 Elsevier B.V. All rights reserved.

  8. Human-specific protein isoforms produced by novel splice sites in the human genome after the human-chimpanzee divergence.

    PubMed

    Kim, Dong Seon; Hahn, Yoonsoo

    2012-11-13

    Evolution of splice sites is a well-known phenomenon that results in transcript diversity during human evolution. Many novel splice sites are derived from repetitive elements and may not contribute to protein products. Here, we analyzed annotated human protein-coding exons and identified human-specific splice sites that arose after the human-chimpanzee divergence. We analyzed multiple alignments of the annotated human protein-coding exons and their respective orthologous mammalian genome sequences to identify 85 novel splice sites (50 splice acceptors and 35 donors) in the human genome. The novel protein-coding exons, which are expressed either constitutively or alternatively, produce novel protein isoforms by insertion, deletion, or frameshift. We found three cases in which the human-specific isoform conferred novel molecular function in the human cells: the human-specific IMUP protein isoform induces apoptosis of the trophoblast and is implicated in pre-eclampsia; the intronization of a part of SMOX gene exon produces inactive spermine oxidase; the human-specific NUB1 isoform shows reduced interaction with ubiquitin-like proteins, possibly affecting ubiquitin pathways. Although the generation of novel protein isoforms does not equate to adaptive evolution, we propose that these cases are useful candidates for a molecular functional study to identify proteomic changes that might bring about novel phenotypes during human evolution.

  9. Genomic Changes Associated with Reproductive and Migratory Ecotypes in Sockeye Salmon (Oncorhynchus nerka)

    PubMed Central

    Veale, Andrew J.

    2017-01-01

    Mechanisms underlying adaptive evolution can best be explored using paired populations displaying similar phenotypic divergence, illuminating the genomic changes associated with specific life history traits. Here, we used paired migratory [anadromous vs. resident (kokanee)] and reproductive [shore- vs. stream-spawning] ecotypes of sockeye salmon (Oncorhynchus nerka) sampled from seven lakes and two rivers spanning three catchments (Columbia, Fraser, and Skeena) in British Columbia, Canada to investigate the patterns and processes underlying their divergence. Restriction-site associated DNA sequencing was used to genotype this sampling at 7,347 single nucleotide polymorphisms, 334 of which were identified as outlier loci and candidates for divergent selection within at least one ecotype comparison. Sixty-eight of these outliers were present in two or more comparisons, with 33 detected across multiple catchments. Of particular note, one locus was detected as the most significant outlier between shore and stream-spawning ecotypes in multiple comparisons and across catchments (Columbia, Fraser, and Snake). We also detected several genomic islands of divergence, some shared among comparisons, potentially showing linked signals of differential selection. The single nucleotide polymorphisms and genomic regions identified in our study offer a range of mechanistic hypotheses associated with the genetic basis of O. nerka life history variation and provide novel tools for informing fisheries management. PMID:29045601

  10. 33 CFR 117.169 - Mare Island Strait and the Napa River.

    Code of Federal Regulations, 2012 CFR

    2012-07-01

    ... SECURITY BRIDGES DRAWBRIDGE OPERATION REGULATIONS Specific Requirements California § 117.169 Mare Island Strait and the Napa River. (a) The draw of the Mare Island Drawbridge, mile 2.8, at Vallejo shall open on... 33 Navigation and Navigable Waters 1 2012-07-01 2012-07-01 false Mare Island Strait and the Napa...

  11. 33 CFR 117.169 - Mare Island Strait and the Napa River.

    Code of Federal Regulations, 2013 CFR

    2013-07-01

    ... SECURITY BRIDGES DRAWBRIDGE OPERATION REGULATIONS Specific Requirements California § 117.169 Mare Island Strait and the Napa River. (a) The draw of the Mare Island Drawbridge, mile 2.8, at Vallejo shall open on... 33 Navigation and Navigable Waters 1 2013-07-01 2013-07-01 false Mare Island Strait and the Napa...

  12. 33 CFR 117.169 - Mare Island Strait and the Napa River.

    Code of Federal Regulations, 2014 CFR

    2014-07-01

    ... SECURITY BRIDGES DRAWBRIDGE OPERATION REGULATIONS Specific Requirements California § 117.169 Mare Island Strait and the Napa River. (a) The draw of the Mare Island Drawbridge, mile 2.8, at Vallejo shall open on... 33 Navigation and Navigable Waters 1 2014-07-01 2014-07-01 false Mare Island Strait and the Napa...

  13. 33 CFR 117.169 - Mare Island Strait and the Napa River.

    Code of Federal Regulations, 2010 CFR

    2010-07-01

    ... SECURITY BRIDGES DRAWBRIDGE OPERATION REGULATIONS Specific Requirements California § 117.169 Mare Island Strait and the Napa River. (a) The draw of the Mare Island Drawbridge, mile 2.8, at Vallejo shall open on... 33 Navigation and Navigable Waters 1 2010-07-01 2010-07-01 false Mare Island Strait and the Napa...

  14. 33 CFR 117.169 - Mare Island Strait and the Napa River.

    Code of Federal Regulations, 2011 CFR

    2011-07-01

    ... SECURITY BRIDGES DRAWBRIDGE OPERATION REGULATIONS Specific Requirements California § 117.169 Mare Island Strait and the Napa River. (a) The draw of the Mare Island Drawbridge, mile 2.8, at Vallejo shall open on... 33 Navigation and Navigable Waters 1 2011-07-01 2011-07-01 false Mare Island Strait and the Napa...

  15. Allele-specific copy-number discovery from whole-genome and whole-exome sequencing.

    PubMed

    Wang, WeiBo; Wang, Wei; Sun, Wei; Crowley, James J; Szatkiewicz, Jin P

    2015-08-18

    Copy-number variants (CNVs) are a major form of genetic variation and a risk factor for various human diseases, so it is crucial to accurately detect and characterize them. It is conceivable that allele-specific reads from high-throughput sequencing data could be leveraged to both enhance CNV detection and produce allele-specific copy number (ASCN) calls. Although statistical methods have been developed to detect CNVs using whole-genome sequence (WGS) and/or whole-exome sequence (WES) data, information from allele-specific read counts has not yet been adequately exploited. In this paper, we develop an integrated method, called AS-GENSENG, which incorporates allele-specific read counts in CNV detection and estimates ASCN using either WGS or WES data. To evaluate the performance of AS-GENSENG, we conducted extensive simulations, generated empirical data using existing WGS and WES data sets and validated predicted CNVs using an independent methodology. We conclude that AS-GENSENG not only predicts accurate ASCN calls but also improves the accuracy of total copy number calls, owing to its unique ability to exploit information from both total and allele-specific read counts while accounting for various experimental biases in sequence data. Our novel, user-friendly and computationally efficient method and a complete analytic protocol is freely available at https://sourceforge.net/projects/asgenseng/. © The Author(s) 2015. Published by Oxford University Press on behalf of Nucleic Acids Research.

  16. A comprehensive analysis of Helicobacter pylori plasticity zones reveals that they are integrating conjugative elements with intermediate integration specificity.

    PubMed

    Fischer, Wolfgang; Breithaupt, Ute; Kern, Beate; Smith, Stella I; Spicher, Carolin; Haas, Rainer

    2014-04-27

    The human gastric pathogen Helicobacter pylori is a paradigm for chronic bacterial infections. Its persistence in the stomach mucosa is facilitated by several mechanisms of immune evasion and immune modulation, but also by an unusual genetic variability which might account for the capability to adapt to changing environmental conditions during long-term colonization. This variability is reflected by the fact that almost each infected individual is colonized by a genetically unique strain. Strain-specific genes are dispersed throughout the genome, but clusters of genes organized as genomic islands may also collectively be present or absent. We have comparatively analysed such clusters, which are commonly termed plasticity zones, in a high number of H. pylori strains of varying geographical origin. We show that these regions contain fixed gene sets, rather than being true regions of genome plasticity, but two different types and several subtypes with partly diverging gene content can be distinguished. Their genetic diversity is incongruent with variations in the rest of the genome, suggesting that they are subject to horizontal gene transfer within H. pylori populations. We identified 40 distinct integration sites in 45 genome sequences, with a conserved heptanucleotide motif that seems to be the minimal requirement for integration. The significant number of possible integration sites, together with the requirement for a short conserved integration motif and the high level of gene conservation, indicates that these elements are best described as integrating conjugative elements (ICEs) with an intermediate integration site specificity.

  17. Ober's Island: The Mallard Ober's Island, One of the ...

    Library of Congress Historic Buildings Survey, Historic Engineering Record, Historic Landscapes Survey

    Ober's Island: The Mallard - Ober's Island, One of the Review Islands on Rainy Lake, bounded on the south by The Hawk Island and on the north by The Crow Island. These islands are located seven miles east of Ranier, Minnesota, three miles west of Voyageur National Park, and one mile south of the international border of the United States of America and Canada. The legal description of Mallard Island is Lot 6, Section 19, T-17-N, R-22-W, Koochiching County, Minnesota, Ranier, Koochiching County, MN

  18. Pan-genome analysis of human gastric pathogen H. pylori: comparative genomics and pathogenomics approaches to identify regions associated with pathogenicity and prediction of potential core therapeutic targets.

    PubMed

    Ali, Amjad; Naz, Anam; Soares, Siomar C; Bakhtiar, Marriam; Tiwari, Sandeep; Hassan, Syed S; Hanan, Fazal; Ramos, Rommel; Pereira, Ulisses; Barh, Debmalya; Figueiredo, Henrique César Pereira; Ussery, David W; Miyoshi, Anderson; Silva, Artur; Azevedo, Vasco

    2015-01-01

    Helicobacter pylori is a human gastric pathogen implicated as the major cause of peptic ulcer and second leading cause of gastric cancer (~70%) around the world. Conversely, an increased resistance to antibiotics and hindrances in the development of vaccines against H. pylori are observed. Pan-genome analyses of the global representative H. pylori isolates consisting of 39 complete genomes are presented in this paper. Phylogenetic analyses have revealed close relationships among geographically diverse strains of H. pylori. The conservation among these genomes was further analyzed by pan-genome approach; the predicted conserved gene families (1,193) constitute ~77% of the average H. pylori genome and 45% of the global gene repertoire of the species. Reverse vaccinology strategies have been adopted to identify and narrow down the potential core-immunogenic candidates. Total of 28 nonhost homolog proteins were characterized as universal therapeutic targets against H. pylori based on their functional annotation and protein-protein interaction. Finally, pathogenomics and genome plasticity analysis revealed 3 highly conserved and 2 highly variable putative pathogenicity islands in all of the H. pylori genomes been analyzed.

  19. Project 1: Microbial Genomes: A Genomic Approach to Understanding the Evolution of Virulence. Project 2: From Genomes to Life: Drosophilia Development in Space and Time

    DOE Office of Scientific and Technical Information (OSTI.GOV)

    Robert DeSalle

    2004-09-10

    This project seeks to use the genomes of two close relatives, A. actinomycetemcomitans and H. aphrophilus, to understand the evolutionary changes that take place in a genome to make it more or less virulent. Our primary specific aim of this project was to sequence, annotate, and analyze the genomes of Actinobacillus actinomycetemcomitans (CU1000, serotype f) and Haemophilus aphrophilus. With these genome sequences we have then compared the whole genome sequences to each other and to the current Aa (HK1651 www.genome.ou.edu) genome project sequence along with other fully sequenced Pasteurellaceae to determine inter and intra species differences that may account formore » the differences and similarities in disease. We also propose to create and curate a comprehensive database where sequence information and analysis for the Pasteurellaceae (family that includes the genera Actinobacillus and Haemophilus) are readily accessible. And finally we have proposed to develop phylogenetic techniques that can be used to efficiently and accurately examine the evolution of genomes. Below we report on progress we have made on these major specific aims. Progress on the specific aims is reported below under two major headings--experimental approaches and bioinformatics and systematic biology approaches.« less

  20. Comparative genomics reveals cotton-specific virulence factors in flexible genomic regions in Verticillium dahliae and evidence of horizontal gene transfer from Fusarium.

    PubMed

    Chen, Jie-Yin; Liu, Chun; Gui, Yue-Jing; Si, Kai-Wei; Zhang, Dan-Dan; Wang, Jie; Short, Dylan P G; Huang, Jin-Qun; Li, Nan-Yang; Liang, Yong; Zhang, Wen-Qi; Yang, Lin; Ma, Xue-Feng; Li, Ting-Gang; Zhou, Lei; Wang, Bao-Li; Bao, Yu-Ming; Subbarao, Krishna V; Zhang, Geng-Yun; Dai, Xiao-Feng

    2018-01-01

    Verticillium dahliae isolates are most virulent on the host from which they were originally isolated. Mechanisms underlying these dominant host adaptations are currently unknown. We sequenced the genome of V. dahliae Vd991, which is highly virulent on its original host, cotton, and performed comparisons with the reference genomes of JR2 (from tomato) and VdLs.17 (from lettuce). Pathogenicity-related factor prediction, orthology and multigene family classification, transcriptome analyses, phylogenetic analyses, and pathogenicity experiments were performed. The Vd991 genome harbored several exclusive, lineage-specific (LS) genes within LS regions (LSRs). Deletion mutants of the seven genes within one LSR (G-LSR2) in Vd991 were less virulent only on cotton. Integration of G-LSR2 genes individually into JR2 and VdLs.17 resulted in significantly enhanced virulence on cotton but did not affect virulence on tomato or lettuce. Transcription levels of the seven LS genes in Vd991 were higher during the early stages of cotton infection, as compared with other hosts. Phylogenetic analyses suggested that G-LSR2 was acquired from Fusarium oxysporum f. sp. vasinfectum through horizontal gene transfer. Our results provide evidence that horizontal gene transfer from Fusarium to Vd991 contributed significantly to its adaptation to cotton and may represent a significant mechanism in the evolution of an asexual plant pathogen. © 2017 The Authors. New Phytologist © 2017 New Phytologist Trust.

  1. Island Formation: Constructing a Coral Island

    ERIC Educational Resources Information Center

    Austin, Heather; Edd, Amelia

    2009-01-01

    The process of coral island formation is often difficult for middle school students to comprehend. Coral island formation is a dynamic process, and students should have the opportunity to experience this process in a synergistic context. The authors provide instructional guidelines for constructing a coral island. Students play an interactive role…

  2. Comparative genomics of Lactobacillus

    PubMed Central

    Kant, Ravi; Blom, Jochen; Palva, Airi; Siezen, Roland J.; de Vos, Willem M.

    2011-01-01

    Summary The genus Lactobacillus includes a diverse group of bacteria consisting of many species that are associated with fermentations of plants, meat or milk. In addition, various lactobacilli are natural inhabitants of the intestinal tract of humans and other animals. Finally, several Lactobacillus strains are marketed as probiotics as their consumption can confer a health benefit to host. Presently, 154 Lactobacillus species are known and a growing fraction of these are subject to draft genome sequencing. However, complete genome sequences are needed to provide a platform for detailed genomic comparisons. Therefore, we selected a total of 20 genomes of various Lactobacillus strains for which complete genomic sequences have been reported. These genomes had sizes varying from 1.8 to 3.3 Mb and other characteristic features, such as G+C content that ranged from 33% to 51%. The Lactobacillus pan genome was found to consist of approximately 14 000 protein‐encoding genes while all 20 genomes shared a total of 383 sets of orthologous genes that defined the Lactobacillus core genome (LCG). Based on advanced phylogeny of the proteins encoded by this LCG, we grouped the 20 strains into three main groups and defined core group genes present in all genomes of a single group, signature group genes shared in all genomes of one group but absent in all other Lactobacillus genomes, and Group‐specific ORFans present in core group genes of one group and absent in all other complete genomes. The latter are of specific value in defining the different groups of genomes. The study provides a platform for present individual comparisons as well as future analysis of new Lactobacillus genomes. PMID:21375712

  3. Comprehensive meta-analysis of Signal Transducers and Activators of Transcription (STAT) genomic binding patterns discerns cell-specific cis-regulatory modules

    PubMed Central

    2013-01-01

    Background Cytokine-activated transcription factors from the STAT (Signal Transducers and Activators of Transcription) family control common and context-specific genetic programs. It is not clear to what extent cell-specific features determine the binding capacity of seven STAT members and to what degree they share genetic targets. Molecular insight into the biology of STATs was gained from a meta-analysis of 29 available ChIP-seq data sets covering genome-wide occupancy of STATs 1, 3, 4, 5A, 5B and 6 in several cell types. Results We determined that the genomic binding capacity of STATs is primarily defined by the cell type and to a lesser extent by individual family members. For example, the overlap of shared binding sites between STATs 3 and 5 in T cells is greater than that between STAT5 in T cells and non-T cells. Even for the top 1,000 highly enriched STAT binding sites, ~15% of STAT5 binding sites in mouse female liver are shared by other STATs in different cell types while in T cells ~90% of STAT5 binding sites are co-occupied by STAT3, STAT4 and STAT6. In addition, we identified 116 cis-regulatory modules (CRM), which are recognized by all STAT members across cell types defining a common JAK-STAT signature. Lastly, in liver STAT5 binding significantly coincides with binding of the cell-specific transcription factors HNF4A, FOXA1 and FOXA2 and is associated with cell-type specific gene transcription. Conclusions Our results suggest that genomic binding of STATs is primarily determined by the cell type and further specificity is achieved in part by juxtaposed binding of cell-specific transcription factors. PMID:23324445

  4. Big Data and Genome Editing Technology: A New Paradigm of Cardiovascular Genomics.

    PubMed

    Krittanawong, Chayakrit; Sun, Tao; Herzog, Eyal

    2017-01-01

    Opinion Statements: Cardiovascular diseases (CVDs) encompass a range of conditions extending from congenital heart disease to acute coronary syndrome most of which are heterogenous in nature and some of them are multiple genetic loci. However, the pathogenesis of most CVDs remains incompletely understood. The advance in genome-editing technologies, an engineering process of DNA sequences at precise genomic locations, has enabled a new paradigm that human genome can be precisely modified to achieve a therapeutic effect. Genome-editing includes the correction of genetic variants that cause disease, the addition of therapeutic genes to specific sites in the genomic locations, and the removal of deleterious genes or genome sequences. Site-specific genome engineering can be used as nucleases (known as molecular scissors) including zinc finger nucleases (ZFNs), transcription activator-like effector nucleases (TALENs), and the clustered regularly interspaced short palindromic repeats (CRISPR)/CRISPR-associated 9 (Cas9) systems to provide remarkable opportunities for developing novel therapies in cardiovascular clinical care. Here we discuss genetic polymorphisms and mechanistic insights in CVDs with an emphasis on the impact of genome-editing technologies. The current challenges and future prospects for genomeediting technologies in cardiovascular medicine are also discussed. Copyright© Bentham Science Publishers; For any queries, please email at epub@benthamscience.org.

  5. PROMISE: a tool to identify genomic features with a specific biologically interesting pattern of associations with multiple endpoint variables.

    PubMed

    Pounds, Stan; Cheng, Cheng; Cao, Xueyuan; Crews, Kristine R; Plunkett, William; Gandhi, Varsha; Rubnitz, Jeffrey; Ribeiro, Raul C; Downing, James R; Lamba, Jatinder

    2009-08-15

    In some applications, prior biological knowledge can be used to define a specific pattern of association of multiple endpoint variables with a genomic variable that is biologically most interesting. However, to our knowledge, there is no statistical procedure designed to detect specific patterns of association with multiple endpoint variables. Projection onto the most interesting statistical evidence (PROMISE) is proposed as a general procedure to identify genomic variables that exhibit a specific biologically interesting pattern of association with multiple endpoint variables. Biological knowledge of the endpoint variables is used to define a vector that represents the biologically most interesting values for statistics that characterize the associations of the endpoint variables with a genomic variable. A test statistic is defined as the dot-product of the vector of the observed association statistics and the vector of the most interesting values of the association statistics. By definition, this test statistic is proportional to the length of the projection of the observed vector of correlations onto the vector of most interesting associations. Statistical significance is determined via permutation. In simulation studies and an example application, PROMISE shows greater statistical power to identify genes with the interesting pattern of associations than classical multivariate procedures, individual endpoint analyses or listing genes that have the pattern of interest and are significant in more than one individual endpoint analysis. Documented R routines are freely available from www.stjuderesearch.org/depts/biostats and will soon be available as a Bioconductor package from www.bioconductor.org.

  6. The influence of specific neighboring bases on substitution bias in noncoding regions of the plant chloroplast genome.

    PubMed

    Morton, B R; Oberholzer, V M; Clegg, M T

    1997-09-01

    Substitutions occurring in noncoding sequences of the plant chloroplast genome violate the independence of sites that is assumed by substitution models in molecular evolution. The probability that a substitution at a site is a transversion, as opposed to a transition, increases significantly with increasing A + T content of the two adjacent nucleotides. In the present study, this dependency of substitutions on local context is examined further in a number of noncoding regions from the chloroplast genome of members of the grass family (Poaceae). Two features were examined; the influence of specific neighboring bases, as opposed to the general A + T content, on transversion proportion and an influence on substitutions by nucleotides other than the two immediately adjacent to the site of substitution. In both cases, a significant effect was found. In the case of specific nucleotides, transversion proportion is significantly higher at sites with a pyrimidine immediately 5' on either strand. Substitutions at sites of the type YNR, where N is the site of substitution, have the highest rate of transversion. This specific effect is secondary to the A + T content effect such that, in terms of proportion of substitutions that are transversions, the nucleotides are ranked T > A > C > G as to their effect when they are immediately 5' to the site of substitution. In the case of nucleotides other than the immediate neighbors, a significant influence on substitution dynamics is observed in the case where the two neighboring bases are both A and/or T. Thus, substitutions are primarily, but not exclusively, influenced by the composition of the two nucleotides that are immediately adjacent. These results indicate that the pattern of molecular evolution of the plant chloroplast genome is extremely complex as a result of a variety of inter-site dependencies.

  7. GenPlay Multi-Genome, a tool to compare and analyze multiple human genomes in a graphical interface.

    PubMed

    Lajugie, Julien; Fourel, Nicolas; Bouhassira, Eric E

    2015-01-01

    Parallel visualization of multiple individual human genomes is a complex endeavor that is rapidly gaining importance with the increasing number of personal, phased and cancer genomes that are being generated. It requires the display of variants such as SNPs, indels and structural variants that are unique to specific genomes and the introduction of multiple overlapping gaps in the reference sequence. Here, we describe GenPlay Multi-Genome, an application specifically written to visualize and analyze multiple human genomes in parallel. GenPlay Multi-Genome is ideally suited for the comparison of allele-specific expression and functional genomic data obtained from multiple phased genomes in a graphical interface with access to multiple-track operation. It also allows the analysis of data that have been aligned to custom genomes rather than to a standard reference and can be used as a variant calling format file browser and as a tool to compare different genome assembly, such as hg19 and hg38. GenPlay is available under the GNU public license (GPL-3) from http://genplay.einstein.yu.edu. The source code is available at https://github.com/JulienLajugie/GenPlay. © The Author 2014. Published by Oxford University Press. All rights reserved. For Permissions, please e-mail: journals.permissions@oup.com.

  8. Habitat and environment of islands: primary and supplemental island sets

    USGS Publications Warehouse

    Matalas, Nicholas C.; Grossling, Bernardo F.

    2002-01-01

    The original intent of the study was to develop a first-order synopsis of island hydrology with an integrated geologic basis on a global scale. As the study progressed, the aim was broadened to provide a framework for subsequent assessments on large regional or global scales of island resources and impacts on those resources that are derived from global changes. Fundamental to the study was the development of a comprehensive framework?a wide range of parameters that describe a set of 'saltwater' islands sufficiently large to Characterize the spatial distribution of the world?s islands; Account for all major archipelagos; Account for almost all oceanically isolated islands, and Account collectively for a very large proportion of the total area of the world?s islands whereby additional islands would only marginally contribute to the representativeness and accountability of the island set. The comprehensive framework, which is referred to as the ?Primary Island Set,? is built on 122 parameters that describe 1,000 islands. To complement the investigations based on the Primary Island Set, two supplemental island sets, Set A?Other Islands (not in the Primary Island Set) and Set B?Lagoonal Atolls, are included in the study. The Primary Island Set, together with the Supplemental Island Sets A and B, provides a framework that can be used in various scientific disciplines for their island-based studies on broad regional or global scales. The study uses an informal, coherent, geophysical organization of the islands that belong to the three island sets. The organization is in the form of a global island chain, which is a particular sequential ordering of the islands referred to as the 'Alisida.' The Alisida was developed through a trial-and-error procedure by seeking to strike a balance between 'minimizing the length of the global chain' and 'maximizing the chain?s geophysical coherence.' The fact that an objective function cannot be minimized and maximized simultaneously

  9. The Genomic HyperBrowser: an analysis web server for genome-scale data

    PubMed Central

    Sandve, Geir K.; Gundersen, Sveinung; Johansen, Morten; Glad, Ingrid K.; Gunathasan, Krishanthi; Holden, Lars; Holden, Marit; Liestøl, Knut; Nygård, Ståle; Nygaard, Vegard; Paulsen, Jonas; Rydbeck, Halfdan; Trengereid, Kai; Clancy, Trevor; Drabløs, Finn; Ferkingstad, Egil; Kalaš, Matúš; Lien, Tonje; Rye, Morten B.; Frigessi, Arnoldo; Hovig, Eivind

    2013-01-01

    The immense increase in availability of genomic scale datasets, such as those provided by the ENCODE and Roadmap Epigenomics projects, presents unprecedented opportunities for individual researchers to pose novel falsifiable biological questions. With this opportunity, however, researchers are faced with the challenge of how to best analyze and interpret their genome-scale datasets. A powerful way of representing genome-scale data is as feature-specific coordinates relative to reference genome assemblies, i.e. as genomic tracks. The Genomic HyperBrowser (http://hyperbrowser.uio.no) is an open-ended web server for the analysis of genomic track data. Through the provision of several highly customizable components for processing and statistical analysis of genomic tracks, the HyperBrowser opens for a range of genomic investigations, related to, e.g., gene regulation, disease association or epigenetic modifications of the genome. PMID:23632163

  10. The Genomic HyperBrowser: an analysis web server for genome-scale data.

    PubMed

    Sandve, Geir K; Gundersen, Sveinung; Johansen, Morten; Glad, Ingrid K; Gunathasan, Krishanthi; Holden, Lars; Holden, Marit; Liestøl, Knut; Nygård, Ståle; Nygaard, Vegard; Paulsen, Jonas; Rydbeck, Halfdan; Trengereid, Kai; Clancy, Trevor; Drabløs, Finn; Ferkingstad, Egil; Kalas, Matús; Lien, Tonje; Rye, Morten B; Frigessi, Arnoldo; Hovig, Eivind

    2013-07-01

    The immense increase in availability of genomic scale datasets, such as those provided by the ENCODE and Roadmap Epigenomics projects, presents unprecedented opportunities for individual researchers to pose novel falsifiable biological questions. With this opportunity, however, researchers are faced with the challenge of how to best analyze and interpret their genome-scale datasets. A powerful way of representing genome-scale data is as feature-specific coordinates relative to reference genome assemblies, i.e. as genomic tracks. The Genomic HyperBrowser (http://hyperbrowser.uio.no) is an open-ended web server for the analysis of genomic track data. Through the provision of several highly customizable components for processing and statistical analysis of genomic tracks, the HyperBrowser opens for a range of genomic investigations, related to, e.g., gene regulation, disease association or epigenetic modifications of the genome.

  11. Vaccinating Asian Pacific Islander children against hepatitis B: ethnic-specific influences and barriers.

    PubMed

    Pulido, M J; Alvarado, E A; Berger, W; Nelson, A; Todoroff, C

    2001-01-01

    Hepatitis B virus (HBV) is a known cause of liver cancer, especially among Asian and Pacific Islanders (API). Despite national recommendations and school entry requirements for vaccination, many children are not fully vaccinated with the Hepatitis B vaccine (Hep B) before entering school. The purpose of this study was to measure ethnic group-specific hepatitis B vaccination rates among school-aged API children after implementation of universal recommendations and school laws, and quantify ethnic-specific risk factors associated with late and incomplete vaccinations. A multilingual questionnaire was distributed to parents of second and fourth graders in nine Los Angeles County (LAC) elementary schools with high proportions of API students. Data on Hepatitis B vaccination dates, source of health care and health information, cultural factors, and general knowledge and attitudes about HBV and vaccination were collected and analyzed. Overall, 1,696 (77%) of 2,183 questionnaires were returned. Of these, 1,024 were from API children. The API second graders in this survey had a 72% coverage rate, ranging from 46% to 94% among the individual ethnic groups. Fifty-one percent of API fourth graders had three doses of Hep B vaccine, ranging from 38% to 69% among the individual ethnic groups. Factors influencing coverage levels among API fourth graders were speaking limited English at home, living in the United States less than five years, and not having discussed hepatitis B vaccination with a health care provider. Factors influencing low immunization levels differed among the API ethnic groups. Analysis and intervention on a non-aggregate level are necessary for designing both effective and cultural-specific outreach programs for diverse API communities such as LAC's.

  12. Heat Islands

    EPA Pesticide Factsheets

    EPA's Heat Island Effect Site provides information on heat islands, their impacts, mitigation strategies, related research, a directory of heat island reduction initiatives in U.S. communities, and EPA's Heat Island Reduction Program.

  13. Comparative Genomics of an Unusual Biogeographic Disjunction in the Cotton Tribe (Gossypieae) Yields Insights into Genome Downsizing

    PubMed Central

    Arick, Mark A; Conover, Justin L; Thrash, Adam; Sanders, William S; Hsu, Chuan-Yu; Naqvi, Rubab Zahra; Farooq, Muhammad; Li, Xiaochong; Gong, Lei; Mudge, Joann; Ramaraj, Thiruvarangan; Udall, Joshua A; Peterson, Daniel G

    2017-01-01

    Abstract Long-distance insular dispersal is associated with divergence and speciation because of founder effects and strong genetic drift. The cotton tribe (Gossypieae) has experienced multiple transoceanic dispersals, generating an aggregate geographic range that encompasses much of the tropics and subtropics worldwide. Two genera in the Gossypieae, Kokia and Gossypioides, exhibit a remarkable geographic disjunction, being restricted to the Hawaiian Islands and Madagascar/East Africa, respectively. We assembled and use de novo genome sequences to address questions regarding the divergence of these two genera from each other and from their sister-group, Gossypium. In addition, we explore processes underlying the genome downsizing that characterizes Kokia and Gossypioides relative to other genera in the tribe. Using 13,000 gene orthologs and synonymous substitution rates, we show that the two disjuncts last shared a common ancestor ∼5 Ma, or half as long ago as their divergence from Gossypium. We report relative stasis in the transposable element fraction. In comparison to Gossypium, there is loss of ∼30% of the gene content in the two disjunct genera and a history of genome-wide accumulation of deletions. In both genera, there is a genome-wide bias toward deletions over insertions, and the number of gene losses exceeds the number of gains by ∼2- to 4-fold. The genomic analyses presented here elucidate genomic consequences of the demographic and biogeographic history of these closest relatives of Gossypium, and enhance their value as phylogenetic outgroups. PMID:29194487

  14. DLGP: A database for lineage-conserved and lineage-specific gene pairs in animal and plant genomes.

    PubMed

    Wang, Dapeng

    2016-01-15

    The conservation of gene organization in the genome with lineage-specificity is an invaluable resource to decipher their potential functionality with diverse selective constraints, especially in higher animals and plants. Gene pairs appear to be the minimal structure for such kind of gene clusters that tend to reside in their preferred locations, representing the distinctive genomic characteristics in single species or a given lineage. Despite gene families having been investigated in a widespread manner, the definition of gene pair families in various taxa still lacks adequate attention. To address this issue, we report DLGP (http://lcgbase.big.ac.cn/DLGP/) that stores the pre-calculated lineage-based gene pairs in currently available 134 animal and plant genomes and inspect them under the same analytical framework, bringing out a set of innovational features. First, the taxonomy or lineage has been classified into four levels such as Kingdom, Phylum, Class and Order. It adopts all-to-all comparison strategy to identify the possible conserved gene pairs in all species for each gene pair in certain species and reckon those that are conserved in over a significant proportion of species in a given lineage (e.g. Primates, Diptera or Poales) as the lineage-conserved gene pairs. Furthermore, it predicts the lineage-specific gene pairs by retaining the above-mentioned lineage-conserved gene pairs that are not conserved in any other lineages. Second, it carries out pairwise comparison for the gene pairs between two compared species and creates the table including all the conserved gene pairs and the image elucidating the conservation degree of gene pairs in chromosomal level. Third, it supplies gene order browser to extend gene pairs to gene clusters, allowing users to view the evolution dynamics in the gene context in an intuitive manner. This database will be able to facilitate the particular comparison between animals and plants, between vertebrates and arthropods, and

  15. Autosomal and Mitochondrial Adaptation Following Admixture: A Case Study on the Honeybees of Reunion Island

    PubMed Central

    Wragg, David; Techer, Maéva Angélique; Canale-Tabet, Kamila; Basso, Benjamin; Bidanel, Jean-Pierre; Labarthe, Emmanuelle; Bouchez, Olivier; Le Conte, Yves; Clémencet, Johanna; Delatte, Hélène

    2018-01-01

    Abstract The honeybee population of the tropical Reunion Island is a genetic admixture of the Apis mellifera unicolor subspecies, originally described in Madagascar, and of European subspecies, mainly A. m. carnica and A. m. ligustica, regularly imported to the island since the late 19th century. We took advantage of this population to study genetic admixing of the tropical-adapted indigenous and temperate-adapted European genetic backgrounds. Whole genome sequencing of 30 workers and 6 males from Reunion, compared with samples from Europe, Madagascar, Mauritius, Rodrigues, and the Seychelles, revealed the Reunion honeybee population to be composed on an average of 53.2 ± 5.9% A. m. unicolor nuclear genomic background, the rest being mainly composed of A. m. carnica and to a lesser extent A. m. ligustica. In striking contrast to this, only 1 out of the 36 honeybees from Reunion had a mitochondrial genome of European origin, suggesting selection has favored the A. m. unicolor mitotype, which is possibly better adapted to the island’s bioclimate. Local ancestry was determined along the chromosomes for all Reunion samples, and a test for preferential selection for the A. m. unicolor or European background revealed 15 regions significantly associated with the A. m. unicolor lineage and 9 regions with the European lineage. Our results provide insights into the long-term consequences of introducing exotic specimen on the nuclear and mitochondrial genomes of locally adapted populations. PMID:29202174

  16. Genome-wide association study reveals sex-specific selection signals against autosomal nucleotide variants.

    PubMed

    Ryu, Dongchan; Ryu, Jihye; Lee, Chaeyoung

    2016-05-01

    A genome-wide association study (GWAS) was conducted to examine genetic associations of common autosomal nucleotide variants with sex in a Korean population with 4183 males and 4659 females. Nine genetic association signals were identified in four intragenic and five intergenic regions (P<5 × 10(-8)). Further analysis with an independent data set confirmed two intragenic association signals in the genes encoding protein phosphatase 1, regulatory subunit 12B (PPP1R12B, intron 12, rs1819043) and dynein, axonemal, heavy chain 11 (DNAH11, intron 61, rs10255013), which are directly involved in the reproductive system. This study revealed autosomal genetic variants associated with sex ratio by GWAS for the first time. This implies that genetic variants in proximity to the association signals may influence sex-specific selection and contribute to sex ratio variation. Further studies are required to reveal the mechanisms underlying sex-specific selection.

  17. Prostate cancer screening by prostate-specific antigen (PSA); a relevant approach for the small population of the Cayman Islands.

    PubMed

    Jyoti, Shravana Kumar; Blacke, Camille; Patil, Pallavi; Amblihalli, Vibha P; Nicholson, Amanda

    2018-01-01

    The common tool for diagnosing prostate cancer is prostate-specific antigen (PSA), but the high sensitivity and low specificity of PSA testing are the problems in clinical practice. There are no proper guidelines to investigate the suspected prostate cancer in the Cayman Islands. We correlated PSA levels with the incidence of prostate cancers by tissue diagnosis and proposed logical protocol for prostate screening by using PSA test in this small population. A total of 165 Afro Caribbean individuals who had prostate biopsy done after the investigations for PSA levels from year 2005 to 2015 were studied retrospectively. The patients were divided into subgroups by baseline PSA levels as follows: <4, 4.1-10, 10.1-20, 20.1-50, 50.1-100, and >100 ng/mL and were correlated to the age and presence of cancer. Benign lesions had lower PSA levels compared to cancer which generally had higher values. Only three cases that had less than 4 ng/mg were turned out to be malignant. When PSA value was more than 100 ng/mL, all the cases were malignant. Between PSA values of 4-100 ng/mL, the probability of cancer diagnosis was 56.71% (76 cancers out of 134 in this range). Limitation of PSA testing has the risk of over diagnosis and the resultant negative biopsies owing to poor specificity. Whereas the cutoff limit for cancer diagnosis still remains 4 ng/mL from our study, most of the patients can be assured of benign lesion below this level and thus morbidity associated with the biopsy can be prevented. When the PSA value is greater than 100 ng, biopsy procedure was mandatory as there were 100% cancers above this level. On the background of vast literature linking PSA to prostate cancer and its difficulty in implementing in clinical practice, we studied literature of this conflicting and complex topic and tried to bring relevant protocols to the small population of Cayman Islands for the screening of prostate cancer. In this study, a total of 165 Afro Caribbean individuals who

  18. Genetic structure and diversity of the selfing model grass Brachypodium stacei (Poaceae) in Western Mediterranean: out of the Iberian Peninsula and into the islands.

    PubMed

    Shiposha, Valeriia; Catalán, Pilar; Olonova, Marina; Marques, Isabel

    2016-01-01

    Annual Mediterranean species of the genus Brachypodium are promising model plants for energy crops since their selfing nature and short-life cycles are an advantage in breeding programs. The false brome, B. distachyon, has already been sequenced and new genomic initiatives have triggered the de-novo genome sequencing of its close relatives such as B. stacei, a species that was until recently mistaken for B. distachyon. However, the success of these initiatives hinges on detailed knowledge about the distribution of genetic variation within and among populations for the effective use of germplasm in a breeding program. Understanding population genetic diversity and genetic structure is also an important prerequisite for designing effective experimental populations for genomic wide studies. However, population genetic data are still limited in B. stacei. We therefore selected and amplified 10 nuclear microsatellite markers to depict patterns of population structure and genetic variation among 181 individuals from 19 populations of B. stacei occurring in its predominant range, the western Mediterranean area: mainland Iberian Peninsula, continental Balearic Islands and oceanic Canary Islands. Our genetic results support the occurrence of a predominant selfing system with extremely high levels of homozygosity across the analyzed populations. Despite the low level of genetic variation found, two different genetic clusters were retrieved, one clustering all SE Iberian mainland populations and the island of Minorca and another one grouping all S Iberian mainland populations, the Canary Islands and all Majorcan populations except one that clustered with the former group. These results, together with a high sharing of alleles (89%) suggest different colonization routes from the mainland Iberian Peninsula into the islands. A recent colonization scenario could explain the relatively low levels of genetic diversity and low number of alleles found in the Canary Islands

  19. Genetic structure and diversity of the selfing model grass Brachypodium stacei (Poaceae) in Western Mediterranean: out of the Iberian Peninsula and into the islands

    PubMed Central

    Shiposha, Valeriia; Catalán, Pilar; Olonova, Marina

    2016-01-01

    Annual Mediterranean species of the genus Brachypodium are promising model plants for energy crops since their selfing nature and short-life cycles are an advantage in breeding programs. The false brome, B. distachyon, has already been sequenced and new genomic initiatives have triggered the de-novo genome sequencing of its close relatives such as B. stacei, a species that was until recently mistaken for B. distachyon. However, the success of these initiatives hinges on detailed knowledge about the distribution of genetic variation within and among populations for the effective use of germplasm in a breeding program. Understanding population genetic diversity and genetic structure is also an important prerequisite for designing effective experimental populations for genomic wide studies. However, population genetic data are still limited in B. stacei. We therefore selected and amplified 10 nuclear microsatellite markers to depict patterns of population structure and genetic variation among 181 individuals from 19 populations of B. stacei occurring in its predominant range, the western Mediterranean area: mainland Iberian Peninsula, continental Balearic Islands and oceanic Canary Islands. Our genetic results support the occurrence of a predominant selfing system with extremely high levels of homozygosity across the analyzed populations. Despite the low level of genetic variation found, two different genetic clusters were retrieved, one clustering all SE Iberian mainland populations and the island of Minorca and another one grouping all S Iberian mainland populations, the Canary Islands and all Majorcan populations except one that clustered with the former group. These results, together with a high sharing of alleles (89%) suggest different colonization routes from the mainland Iberian Peninsula into the islands. A recent colonization scenario could explain the relatively low levels of genetic diversity and low number of alleles found in the Canary Islands

  20. Genome-wide mapping and analysis of active promoters in mouse embryonic stem cells and adult organs

    PubMed Central

    Barrera, Leah O.; Li, Zirong; Smith, Andrew D.; Arden, Karen C.; Cavenee, Webster K.; Zhang, Michael Q.; Green, Roland D.; Ren, Bing

    2008-01-01

    By integrating genome-wide maps of RNA polymerase II (Polr2a) binding with gene expression data and H3ac and H3K4me3 profiles, we characterized promoters with enriched activity in mouse embryonic stem cells (mES) as well as adult brain, heart, kidney, and liver. We identified ∼24,000 promoters across these samples, including 16,976 annotated mRNA 5′ ends and 5153 additional sites validating cap-analysis of gene expression (CAGE) 5′ end data. We showed that promoters with CpG islands are typically non-tissue specific, with the majority associated with Polr2a and the active chromatin modifications in nearly all the tissues examined. By contrast, the promoters without CpG islands are generally associated with Polr2a and the active chromatin marks in a tissue-dependent way. We defined 4396 tissue-specific promoters by adapting a quantitative index of tissue-specificity based on Polr2a occupancy. While there is a general correspondence between Polr2a occupancy and active chromatin modifications at the tissue-specific promoters, a subset of them appear to be persistently marked by active chromatin modifications in the absence of detectable Polr2a binding, highlighting the complexity of the functional relationship between chromatin modification and gene expression. Our results provide a resource for exploring promoter Polr2a binding and epigenetic states across pluripotent and differentiated cell types in mammals. PMID:18042645