Science.gov

Sample records for affymetrix ath1 genome

  1. Software comparison for evaluating genomic copy number variation for Affymetrix 6.0 SNP array platform

    PubMed Central

    2011-01-01

    Background Copy number data are routinely being extracted from genome-wide association study chips using a variety of software. We empirically evaluated and compared four freely-available software packages designed for Affymetrix SNP chips to estimate copy number: Affymetrix Power Tools (APT), Aroma.Affymetrix, PennCNV and CRLMM. Our evaluation used 1,418 GENOA samples that were genotyped on the Affymetrix Genome-Wide Human SNP Array 6.0. We compared bias and variance in the locus-level copy number data, the concordance amongst regions of copy number gains/deletions and the false-positive rate amongst deleted segments. Results APT had median locus-level copy numbers closest to a value of two, whereas PennCNV and Aroma.Affymetrix had the smallest variability associated with the median copy number. Of those evaluated, only PennCNV provides copy number specific quality-control metrics and identified 136 poor CNV samples. Regions of copy number variation (CNV) were detected using the hidden Markov models provided within PennCNV and CRLMM/VanillaIce. PennCNV detected more CNVs than CRLMM/VanillaIce; the median number of CNVs detected per sample was 39 and 30, respectively. PennCNV detected most of the regions that CRLMM/VanillaIce did as well as additional CNV regions. The median concordance between PennCNV and CRLMM/VanillaIce was 47.9% for duplications and 51.5% for deletions. The estimated false-positive rate associated with deletions was similar for PennCNV and CRLMM/VanillaIce. Conclusions If the objective is to perform statistical tests on the locus-level copy number data, our empirical results suggest that PennCNV or Aroma.Affymetrix is optimal. If the objective is to perform statistical tests on the summarized segmented data then PennCNV would be preferred over CRLMM/VanillaIce. Specifically, PennCNV allows the analyst to estimate locus-level copy number, perform segmentation and evaluate CNV-specific quality-control metrics within a single software package

  2. Global Expression Patterns of Three Festuca Species Exposed to Different Doses of Glyphosate Using the Affymetrix GeneChip Wheat Genome Array

    PubMed Central

    Cebeci, Ozge; Budak, Hikmet

    2009-01-01

    Glyphosate has been shown to act as an inhibitor of an aromatic amino acid biosynthetic pathway, while other pathways that may be affected by glyphosate are not known. Cross species hybridizations can provide a tool for elucidating biological pathways conserved among organisms. Comparative genome analyses have indicated a high level of colinearity among grass species and Festuca, on which we focus here, and showed rearrangements common to the Pooideae family. Based on sequence conservation among grass species, we selected the Affymetrix GeneChip Wheat Genome Array as a tool for the analysis of expression profiles of three Festuca (fescue) species with distinctly different tolerances to varying levels of glyphosate. Differences in transcript expression were recorded upon foliar glyphosate application at 1.58 mM and 6.32 mM, representing 5% and 20%, respectively, of the recommended rate. Differences highlighted categories of general metabolic processes, such as photosynthesis, protein synthesis, stress responses, and a larger number of transcripts responded to 20% glyphosate application. Differential expression of genes encoding proteins involved in the shikimic acid pathway could not be identified by cross hybridization. Microarray data were confirmed by RT-PCR and qRT-PCR analyses. This is the first report to analyze the potential of cross species hybridization in Fescue species and the data and analyses will help extend our knowledge on the cellular processes affected by glyphosate. PMID:20182642

  3. Development and Evaluation of an Affymetrix array for Aspergillus flavus

    Technology Transfer Automated Retrieval System (TEKTRAN)

    A multi-species Affymetrix GeneChip array was developed to study development, metabolism and pathogenicity of A. flavus. This chip based on the whole genome sequence of A. flavus, contains 13,000 A. flavus genes, 8,000 maize genes and 25 human and mouse innate immune response genes, as well as the ...

  4. VIZARD: analysis of Affymetrix Arabidopsis GeneChip data

    NASA Technical Reports Server (NTRS)

    Moseyko, Nick; Feldman, Lewis J.

    2002-01-01

    SUMMARY: The Affymetrix GeneChip Arabidopsis genome array has proved to be a very powerful tool for the analysis of gene expression in Arabidopsis thaliana, the most commonly studied plant model organism. VIZARD is a Java program created at the University of California, Berkeley, to facilitate analysis of Arabidopsis GeneChip data. It includes several integrated tools for filtering, sorting, clustering and visualization of gene expression data as well as tools for the discovery of regulatory motifs in upstream sequences. VIZARD also includes annotation and upstream sequence databases for the majority of genes represented on the Affymetrix Arabidopsis GeneChip array. AVAILABILITY: VIZARD is available free of charge for educational, research, and not-for-profit purposes, and can be downloaded at http://www.anm.f2s.com/research/vizard/ CONTACT: moseyko@uclink4.berkeley.edu.

  5. BLADE-ON-PETIOLE1 and 2 regulate Arabidopsis inflorescence architecture in conjunction with homeobox genes KNAT6 and ATH1

    PubMed Central

    Khan, Madiha; Tabb, Paul; Hepworth, Shelley R.

    2012-01-01

    Inflorescence architecture varies widely among flowering plants, serving to optimize the display of flowers for reproductive success. In Arabidopsis thaliana, internode elongation begins at the floral transition, generating a regular spiral arrangement of upwardly-oriented flowers on the primary stem. Post-elongation, differentiation of lignified interfascicular fibers in the stem provides mechanical support. Correct inflorescence patterning requires two interacting homeodomain transcription factors: the KNOTTED1-like protein BREVIPEDICELLUS (BP) and its BEL1-like interaction partner PENNYWISE (PNY). Mutations in BP and PNY cause short internodes, irregular spacing and/or orientation of lateral organs, and altered lignin deposition in stems. Recently, we showed that these defects are caused by the misexpression of lateral organ boundary genes, BLADE-ON-PETIOLE1 (BOP1) and BOP2, which function downstream of BP-PNY in an antagonistic fashion. BOP1/2 gain-of-function in stems promotes expression of the boundary gene KNOTTED1-LIKE FROM ARABIDOPSIS THALIANA6 (KNAT6) and shown here, ARABIDOPSIS THALIANA HOMEOBOX GENE1 (ATH1), providing KNAT6 with a BEL1-like co-factor. Our further analyses show that defects caused by BOP1/2 gain-of-function require both KNAT6 and ATH1. These data reveal how BOP1/2-dependent activation of a boundary module in stems exerts changes in inflorescence architecture. PMID:22751300

  6. Qualitative assessment of gene expression in affymetrix genechip arrays

    NASA Astrophysics Data System (ADS)

    Nagarajan, Radhakrishnan; Upreti, Meenakshi

    2007-01-01

    Affymetrix Genechip microarrays are used widely to determine the simultaneous expression of genes in a given biological paradigm. Probes on the Genechip array are atomic entities which by definition are randomly distributed across the array and in turn govern the gene expression. In the present study, we make several interesting observations. We show that there is considerable correlation between the probe intensities across the array which defy the independence assumption. While the mechanism behind such correlations is unclear, we show that scaling behavior and the profiles of perfect match (PM) as well as mismatch (MM) probes are similar and immune-to-background subtraction. We believe that the observed correlations are possibly an outcome of inherent non-stationarities or patchiness in the array devoid of biological significance. This is demonstrated by inspecting their scaling behavior and profiles of the PM and MM probe intensities obtained from publicly available Genechip arrays from three eukaryotic genomes, namely: Drosophila melanogaster (fruit fly), Homo sapiens (humans) and Mus musculus (house mouse) across distinct biological paradigms and across laboratories, with and without background subtraction. The fluctuation functions were estimated using detrended fluctuation analysis (DFA) with fourth-order polynomial detrending. The results presented in this study provide new insights into correlation signatures of PM and MM probe intensities and suggests the choice of DFA as a tool for qualitative assessment of Affymetrix Genechip microarrays prior to their analysis. A more detailed investigation is necessary in order to understand the source of these correlations.

  7. Discovery and mapping of single feature polymorphisms in wheat using affymetrix arrays

    Technology Transfer Automated Retrieval System (TEKTRAN)

    Single feature polymorphisms (SFPs) can be a rich source of markers for gene mapping and function studies. To explore the feasibility of using the Affymetrix GeneChip to discover and map SFPs in the large hexaploid wheat genome, six wheat varieties of diverse origins were analyzed for significant pr...

  8. Micro-Analyzer: automatic preprocessing of Affymetrix microarray data.

    PubMed

    Guzzi, Pietro Hiram; Cannataro, Mario

    2013-08-01

    A current trend in genomics is the investigation of the cell mechanism using different technologies, in order to explain the relationship among genes, molecular processes and diseases. For instance, the combined use of gene-expression arrays and genomic arrays has been demonstrated as an effective instrument in clinical practice. Consequently, in a single experiment different kind of microarrays may be used, resulting in the production of different types of binary data (images and textual raw data). The analysis of microarray data requires an initial preprocessing phase, that makes raw data suitable for use on existing analysis platforms, such as the TIGR M4 (TM4) Suite. An additional challenge to be faced by emerging data analysis platforms is the ability to treat in a combined way those different microarray formats coupled with clinical data. In fact, resulting integrated data may include both numerical and symbolic data (e.g. gene expression and SNPs regarding molecular data), as well as temporal data (e.g. the response to a drug, time to progression and survival rate), regarding clinical data. Raw data preprocessing is a crucial step in analysis but is often performed in a manual and error prone way using different software tools. Thus novel, platform independent, and possibly open source tools enabling the semi-automatic preprocessing and annotation of different microarray data are needed. The paper presents Micro-Analyzer (Microarray Analyzer), a cross-platform tool for the automatic normalization, summarization and annotation of Affymetrix gene expression and SNP binary data. It represents the evolution of the μ-CS tool, extending the preprocessing to SNP arrays that were not allowed in μ-CS. The Micro-Analyzer is provided as a Java standalone tool and enables users to read, preprocess and analyse binary microarray data (gene expression and SNPs) by invoking TM4 platform. It avoids: (i) the manual invocation of external tools (e.g. the Affymetrix Power

  9. Arabidopsis transcriptional responses differentiating closely related chemicals (herbicides) and cross-species extrapolation to Brassica

    EPA Science Inventory

    Using whole genome Affymetrix ATH1 GeneChips we characterized the transcriptional response of Arabidopsis thaliana Columbia 24 hours after treatment with five different herbicides. Four of them (chloransulam, imazapyr, primisulfuron, sulfometuron) inhibit acetolactate synthase (A...

  10. Reverse engineering and analysis of large genome-scale gene networks

    PubMed Central

    Aluru, Maneesha; Zola, Jaroslaw; Nettleton, Dan; Aluru, Srinivas

    2013-01-01

    Reverse engineering the whole-genome networks of complex multicellular organisms continues to remain a challenge. While simpler models easily scale to large number of genes and gene expression datasets, more accurate models are compute intensive limiting their scale of applicability. To enable fast and accurate reconstruction of large networks, we developed Tool for Inferring Network of Genes (TINGe), a parallel mutual information (MI)-based program. The novel features of our approach include: (i) B-spline-based formulation for linear-time computation of MI, (ii) a novel algorithm for direct permutation testing and (iii) development of parallel algorithms to reduce run-time and facilitate construction of large networks. We assess the quality of our method by comparison with ARACNe (Algorithm for the Reconstruction of Accurate Cellular Networks) and GeneNet and demonstrate its unique capability by reverse engineering the whole-genome network of Arabidopsis thaliana from 3137 Affymetrix ATH1 GeneChips in just 9 min on a 1024-core cluster. We further report on the development of a new software Gene Network Analyzer (GeNA) for extracting context-specific subnetworks from a given set of seed genes. Using TINGe and GeNA, we performed analysis of 241 Arabidopsis AraCyc 8.0 pathways, and the results are made available through the web. PMID:23042249

  11. Genetic and genomic analysis of Rhizoctonia solani interactions with Arabidopsis; evidence of resistance mediated through NADPH oxidases.

    PubMed

    Foley, Rhonda C; Gleason, Cynthia A; Anderson, Jonathan P; Hamann, Thorsten; Singh, Karam B

    2013-01-01

    Rhizoctonia solani is an important soil-borne necrotrophic fungal pathogen, with a broad host range and little effective resistance in crop plants. Arabidopsis is resistant to R. solani AG8 but susceptible to R. solani AG2-1. A screen of 36 Arabidopsis ecotypes and mutants affected in the auxin, camalexin, salicylic acid, abscisic acid and ethylene/jasmonic acid pathways did not reveal any variation in response to R. solani and demonstrated that resistance to AG8 was independent of these defense pathways. The Arabidopsis Affymetrix ATH1 Genome array was used to assess global gene expression changes in plants infected with AG8 and AG2-1 at seven days post-infection. While there was considerable overlap in the response, some gene families were differentially affected by AG8 or AG2-1 and included those involved in oxidative stress, cell wall associated proteins, transcription factors and heat shock protein genes. Since a substantial proportion of the gene expression changes were associated with oxidative stress responses, we analysed the role of NADPH oxidases in resistance. While single NADPH oxidase mutants had no effect, a NADPH oxidase double mutant atrbohf atrbohd resulted in an almost complete loss of resistance to AG8, suggesting that reactive oxidative species play an important role in Arabidopsis's resistance to R. solani. PMID:23451091

  12. Evaluation of the Affymetrix CytoScan® Dx Assay for Developmental Delay

    PubMed Central

    Webb, Bryn D.; Scharf, Rebecca J.; Spear, Emily A.; Edelmann, Lisa J.; Stroustrup, Annemarie

    2015-01-01

    The goal of molecular cytogenetic testing for children presenting with developmental delay is to identify or exclude genetic abnormalities that are associated with cognitive, behavioral, and/or motor symptoms. Until 2010, chromosome analysis was the standard first-line genetic screening test for evaluation of patients with developmental delay when a specific syndrome was not suspected. In 2010, The American College of Medical Genetics and several other groups recommended chromosomal microarray (CMA) as the first-line test in children with developmental delays, multiple congenital anomalies, and/or autism. This test is able to detect regions of genomic imbalances at a much finer resolution than G-banded karyotyping. Until recently, no CMA testing had been approved by the United States Food and Drug Administration (FDA). This review will focus on the use of the Affymetrix CytoScan® Dx Assay, the first CMA to receive FDA approval for the genetic evaluation of individuals with developmental delay. PMID:25350348

  13. Exon array data analysis using Affymetrix power tools and R statistical software

    PubMed Central

    2011-01-01

    The use of microarray technology to measure gene expression on a genome-wide scale has been well established for more than a decade. Methods to process and analyse the vast quantity of expression data generated by a typical microarray experiment are similarly well-established. The Affymetrix Exon 1.0 ST array is a relatively new type of array, which has the capability to assess expression at the individual exon level. This allows a more comprehensive analysis of the transcriptome, and in particular enables the study of alternative splicing, a gene regulation mechanism important in both normal conditions and in diseases. Some aspects of exon array data analysis are shared with those for standard gene expression data but others present new challenges that have required development of novel tools. Here, I will introduce the exon array and present a detailed example tutorial for analysis of data generated using this platform. PMID:21498550

  14. CEL_INTERROGATOR: A FREE AND OPEN SOURCE PACKAGE FOR AFFYMETRIX CEL FILE PARSING

    Technology Transfer Automated Retrieval System (TEKTRAN)

    CEL_Interrogator Package is a suite of programs designed to extract the average probe intensity and other information for each probe sequence from an Affymetrix GeneChip CEL file and unite them with their human-readable Affymetrix consensus sequence names. The resulting text file is suitable for di...

  15. High Fidelity Copy Number Analysis of Formalin-Fixed and Paraffin-Embedded Tissues Using Affymetrix Cytoscan HD Chip

    PubMed Central

    Yu, Yan P.; Michalopoulos, Amantha; Ding, Ying; Tseng, George; Luo, Jian-Hua

    2014-01-01

    Detection of human genome copy number variation (CNV) is one of the most important analyses in diagnosing human malignancies. Genome CNV detection in formalin-fixed and paraffin-embedded (FFPE) tissues remains challenging due to suboptimal DNA quality and failure to use appropriate baseline controls for such tissues. Here, we report a modified method in analyzing CNV in FFPE tissues using microarray with Affymetrix Cytoscan HD chips. Gel purification was applied to select DNA with good quality and data of fresh frozen and FFPE tissues from healthy individuals were included as baseline controls in our data analysis. Our analysis showed a 91% overlap between CNV detection by microarray with FFPE tissues and chromosomal abnormality detection by karyotyping with fresh tissues on 8 cases of lymphoma samples. The CNV overlap between matched frozen and FFPE tissues reached 93.8%. When the analyses were restricted to regions containing genes, 87.1% concordance between FFPE and fresh frozen tissues was found. The analysis was further validated by Fluorescence In Situ Hybridization on these samples using probes specific for BRAF and CITED2. The results suggested that the modified method using Affymetrix Cytoscan HD chip gave rise to a significant improvement over most of the previous methods in terms of accuracy in detecting CNV in FFPE tissues. This FFPE microarray methodology may hold promise for broad application of CNV analysis on clinical samples. PMID:24699316

  16. Using The Affymetrix Wheat Microarray As An Oat Expression Platform

    Technology Transfer Automated Retrieval System (TEKTRAN)

    Recent advances in sequencing have resulted in the sequence of a large number of plant expressed sequence tags (ESTs) to entire plant genomes. Using these EST sequences, oligonucleotide microarray chips have been developed for several species including barley (Hordeum vulgare), maize (Zea mays), ric...

  17. Genome-wide analysis of hydrogen peroxide-regulated gene expression in Arabidopsis reveals a high light-induced transcriptional cluster involved in anthocyanin biosynthesis.

    PubMed

    Vanderauwera, Sandy; Zimmermann, Philip; Rombauts, Stéphane; Vandenabeele, Steven; Langebartels, Christian; Gruissem, Wilhelm; Inzé, Dirk; Van Breusegem, Frank

    2005-10-01

    In plants, reactive oxygen species and, more particularly, hydrogen peroxide (H(2)O(2)) play a dual role as toxic by-products of normal cell metabolism and as regulatory molecules in stress perception and signal transduction. Peroxisomal catalases are an important sink for photorespiratory H(2)O(2). Using ATH1 Affymetrix microarrays, expression profiles were compared between control and catalase-deficient Arabidopsis (Arabidopsis thaliana) plants. Reduced catalase levels already provoked differences in nuclear gene expression under ambient growth conditions, and these effects were amplified by high light exposure in a sun simulator for 3 and 8 h. This genome-wide expression analysis allowed us to reveal the expression characteristics of complete pathways and functional categories during H(2)O(2) stress. In total, 349 transcripts were significantly up-regulated by high light in catalase-deficient plants and 88 were down-regulated. From this data set, H(2)O(2) was inferred to play a key role in the transcriptional up-regulation of small heat shock proteins during high light stress. In addition, several transcription factors and candidate regulatory genes involved in H(2)O(2) transcriptional gene networks were identified. Comparisons with other publicly available transcriptome data sets of abiotically stressed Arabidopsis revealed an important intersection with H(2)O(2)-deregulated genes, positioning elevated H(2)O(2) levels as an important signal within abiotic stress-induced gene expression. Finally, analysis of transcriptional changes in a combination of a genetic (catalase deficiency) and an environmental (high light) perturbation identified a transcriptional cluster that was strongly and rapidly induced by high light in control plants, but impaired in catalase-deficient plants. This cluster comprises the complete known anthocyanin regulatory and biosynthetic pathway, together with genes encoding unknown proteins. PMID:16183842

  18. SFP Genotyping from Affymetrix Arrays is Robust but Largely Detects Cis-acting Expression Regulators

    Technology Transfer Automated Retrieval System (TEKTRAN)

    The recent development of Affymetrix chips designed from assembled EST sequences has spawned considerable interest in identifying single-feature polymorphisms (SFPs) from transcriptome data. SFPs are valuable genetic markers that potentially offer a physical link to the structural genes themselves....

  19. A composite transcriptional signature differentiates responses towards closely related herbicides in Arabidopsis thaliana and brassica napus

    EPA Science Inventory

    In this study, genome-wide expression profiling based on Affymetrix ATH1 arrays was used to identify discriminating responses of Arabidopsis thaliana to five herbicides, which contain active ingredients targeting two different branches of amino acid biosynthesis. One herbicide co...

  20. An orthologous transcriptional signature differentiates responses towards closely related chemicals in Arabidopsis thaliana and brassica napus

    EPA Science Inventory

    Herbicides are structurally diverse chemicals that inhibit plant-specific targets, however their off-target and potentially differentiating side-effects are less well defined. In this study, genome-wide expression profiling based on Affymetrix AtH1 arrays was used to identify dis...

  1. Motif effects in Affymetrix GeneChips seriously affect probe intensities

    PubMed Central

    Upton, Graham J. G.; Harrison, Andrew P.

    2012-01-01

    An Affymetrix GeneChip consists of an array of hundreds of thousands of probes (each a sequence of 25 bases) with the probe values being used to infer the extent to which genes are expressed in the biological material under investigation. In this article, we demonstrate that these probe values are also strongly influenced by their precise base sequence. We use data from >28 000 CEL files relating to 10 different Affymetrix GeneChip platforms and involving nearly 1000 experiments. Our results confirm known effects (those due to the T7-primer and the formation of G-quadruplexes) but reveal other effects. We show that there can be huge variations from one experiment to another, and that there may also be sizeable disparities between batches within an experiment and between CEL files within a batch. PMID:22904084

  2. Normalization of Affymetrix miRNA Microarrays for the Analysis of Cancer Samples.

    PubMed

    Wu, Di; Gantier, Michael P

    2016-01-01

    microRNA (miRNA) microarray normalization is a critical step for the identification of truly differentially expressed miRNAs. This is particularly important when dealing with cancer samples that have a global miRNA decrease. In this chapter, we provide a simple step-by-step procedure that can be used to normalize Affymetrix miRNA microarrays, relying on robust normal-exponential background correction with cyclic loess normalization. PMID:25971910

  3. A comparison of statistical tests for detecting differential expression using Affymetrix oligonucleotide microarrays.

    PubMed

    Vardhanabhuti, Saran; Blakemore, Steven J; Clark, Steven M; Ghosh, Sujoy; Stephens, Richard J; Rajagopalan, Dilip

    2006-01-01

    Signal quantification and detection of differential expression are critical steps in the analysis of Affymetrix microarray data. Many methods have been proposed in the literature for each of these steps. The goal of this paper is to evaluate several signal quantification methods (GCRMA, RSVD, VSN, MAS5, and Resolver) and statistical methods for differential expression (t test, Cyber-T, SAM, LPE, RankProducts, Resolver RatioBuild). Our particular focus is on the ability to detect differential expression via statistical tests. We have used two different datasets for our evaluation. First, we have used the HG-U133 Latin Square spike in dataset developed by Affymetrix. Second, we have used data from an in-house rat liver transcriptomics study following 30 different drug treatments generated using the Affymetrix RAE230A chip. Our overall recommendation based on this study is to use GCRMA for signal quantification. For detection of differential expression, GCRMA coupled with Cyber-T or SAM is the best approach, as measured by area under the receiver operating characteristic (ROC) curve. The integrated pipeline in Resolver RatioBuild combining signal quantification and detection of differential expression is an equally good alternative for detecting differentially expressed genes. For most of the differential expression algorithms we considered, the performance using MAS5 signal quantification was inferior to that of the other methods we evaluated. PMID:17233564

  4. MMBGX: a method for estimating expression at the isoform level and detecting differential splicing using whole-transcript Affymetrix arrays

    PubMed Central

    Turro, Ernest; Lewin, Alex; Rose, Anna; Dallman, Margaret J.; Richardson, Sylvia

    2010-01-01

    Affymetrix has recently developed whole-transcript GeneChips—‘Gene’ and ‘Exon’ arrays—which interrogate exons along the length of each gene. Although each probe on these arrays is intended to hybridize perfectly to only one transcriptional target, many probes match multiple transcripts located in different parts of the genome or alternative isoforms of the same gene. Existing statistical methods for estimating expression do not take this into account and are thus prone to producing inflated estimates. We propose a method, Multi-Mapping Bayesian Gene eXpression (MMBGX), which disaggregates the signal at ‘multi-match’ probes. When applied to Gene arrays, MMBGX removes the upward bias of gene-level expression estimates. When applied to Exon arrays, it can further disaggregate the signal between alternative transcripts of the same gene, providing expression estimates of individual splice variants. We demonstrate the performance of MMBGX on simulated data and a tissue mixture data set. We then show that MMBGX can estimate the expression of alternative isoforms within one experimental condition, confirming our results by RT-PCR. Finally, we show that our method for detecting differential splicing has a lower error rate than standard exon-level approaches on a previously validated colon cancer data set. PMID:19854940

  5. The efficacy of detecting variants with small effects on the Affymetrix 6.0 platform using pooled DNA

    PubMed Central

    Chiang, Charleston W. K.; Gajdos, Zofia K. Z.; Butler, Johannah L.; Hackett, Rachel; Guiducci, Candace; Nguyen, Thutrang T.; Wilks, Rainford; Forrester, Terrence; Henderson, Katherine D.; Le Marchand, Loic; Henderson, Brian E.; Haiman, Christopher A.; Cooper, Richard S.; Lyon, Helen N.; Zhu, Xiaofeng; McKenzie, Colin A.; Palmer, Mark R.; Hirschhorn, Joel N.

    2012-01-01

    Genome-wide genotyping of a cohort using pools rather than individual samples has long been proposed as a cost-saving alternative for performing genome-wide association (GWA) studies. However, successful disease gene mapping using pooled genotyping has thus far been limited to detecting common variants with large effect sizes, which tend not to exist for many complex common diseases or traits. Therefore, for DNA pooling to be a viable strategy for conducting GWA studies, it is important to determine whether commonly used genome-wide SNP array platforms such as the Affymetrix 6.0 array can reliably detect common variants of small effect sizes using pooled DNA. Taking obesity and age at menarche as examples of human complex traits, we assessed the feasibility of genome-wide genotyping of pooled DNA as a single-stage design for phenotype association. By individually genotyping the top associations identified by pooling, we obtained a 14- to 16-fold enrichment of SNPs nominally associated with the phenotype, but we likely missed the top true associations. In addition, we assessed whether genotyping pooled DNA can serve as an inexpensive screen as the second stage of a multi-stage design with a large number of samples by comparing the most cost-effective 3-stage designs with 80% power to detect common variants with genotypic relative risk of 1.1, with and without pooling. Given the current state of the specific technology we employed and the associated genotyping costs, we showed through simulation that a design involving pooling would be 1.07 times more expensive than a design without pooling. Thus, while a significant amount of information exists within the data from pooled DNA, our analysis does not support genotyping pooled DNA as a means to efficiently identify common variants contributing small effects to phenotypes of interest. While our conclusions were based on the specific technology and study design we employed, the approach presented here will be useful for

  6. Using probe secondary structure information to enhance Affymetrix GeneChip background estimates

    PubMed Central

    Gharaibeh, Raad Z.; Fodor, Anthony A.; Gibas, Cynthia J.

    2007-01-01

    High-density short oligonucleotide microarrays are a primary research tool for assessing global gene expression. Background noise on microarrays comprises a significant portion of the measured raw data. A number of statistical techniques have been developed to correct for this background noise. Here, we demonstrate that probe minimum folding energy and structure can be used to enhance a previously existing model for background noise correction. We estimate that probe secondary structure accounts for up to 3% of all variation on Affymetrix microarrays. PMID:17387043

  7. A model of binding on DNA microarrays: understanding the combined effect of probe synthesis failure, cross-hybridization, DNA fragmentation and other experimental details of affymetrix arrays

    PubMed Central

    2012-01-01

    Background DNA microarrays are used both for research and for diagnostics. In research, Affymetrix arrays are commonly used for genome wide association studies, resequencing, and for gene expression analysis. These arrays provide large amounts of data. This data is analyzed using statistical methods that quite often discard a large portion of the information. Most of the information that is lost comes from probes that systematically fail across chips and from batch effects. The aim of this study was to develop a comprehensive model for hybridization that predicts probe intensities for Affymetrix arrays and that could provide a basis for improved microarray analysis and probe development. The first part of the model calculates probe binding affinities to all the possible targets in the hybridization solution using the Langmuir isotherm. In the second part of the model we integrate details that are specific to each experiment and contribute to the differences between hybridization in solution and on the microarray. These details include fragmentation, wash stringency, temperature, salt concentration, and scanner settings. Furthermore, the model fits probe synthesis efficiency and target concentration parameters directly to the data. All the parameters used in the model have a well-established physical origin. Results For the 302 chips that were analyzed the mean correlation between expected and observed probe intensities was 0.701 with a range of 0.88 to 0.55. All available chips were included in the analysis regardless of the data quality. Our results show that batch effects arise from differences in probe synthesis, scanner settings, wash strength, and target fragmentation. We also show that probe synthesis efficiencies for different nucleotides are not uniform. Conclusions To date this is the most complete model for binding on microarrays. This is the first model that includes both probe synthesis efficiency and hybridization kinetics/cross-hybridization. These

  8. A Single-Array-Based Method for Detecting Copy Number Variants Using Affymetrix High Density SNP Arrays and its Application to Breast Cancer

    PubMed Central

    Li, Ming; Wen, Yalu; Fu, Wenjiang

    2014-01-01

    Cumulative evidence has shown that structural variations, due to insertions, deletions, and inversions of DNA, may contribute considerably to the development of complex human diseases, such as breast cancer. High-throughput genotyping technologies, such as Affymetrix high density single-nucleotide polymorphism (SNP) arrays, have produced large amounts of genetic data for genome-wide SNP genotype calling and copy number estimation. Meanwhile, there is a great need for accurate and efficient statistical methods to detect copy number variants. In this article, we introduce a hidden-Markov-model (HMM)-based method, referred to as the PICR-CNV, for copy number inference. The proposed method first estimates copy number abundance for each single SNP on a single array based on the raw fluorescence values, and then standardizes the estimated copy number abundance to achieve equal footing among multiple arrays. This method requires no between-array normalization, and thus, maintains data integrity and independence of samples among individual subjects. In addition to our efforts to apply new statistical technology to raw fluorescence values, the HMM has been applied to the standardized copy number abundance in order to reduce experimental noise. Through simulations, we show our refined method is able to infer copy number variants accurately. Application of the proposed method to a breast cancer dataset helps to identify genomic regions significantly associated with the disease. PMID:26279618

  9. The Affymetrix DMET Plus Platform Reveals Unique Distribution of ADME-Related Variants in Ethnic Arabs

    PubMed Central

    Wakil, Salma M.; Nguyen, Cao; Muiya, Nzioka P.; Andres, Editha; Lykowska-Tarnowska, Agnieszka; Baz, Batoul; Meyer, Brian F.; Morahan, Grant

    2015-01-01

    Background. The Affymetrix Drug Metabolizing Enzymes and Transporters (DMET) Plus Premier Pack has been designed to genotype 1936 gene variants thought to be essential for screening patients in personalized drug therapy. These variants include the cytochrome P450s (CYP450s), the key metabolizing enzymes, many other enzymes involved in phase I and phase II pharmacokinetic reactions, and signaling mediators associated with variability in clinical response to numerous drugs not only among individuals, but also between ethnic populations. Materials and Methods. We genotyped 600 Saudi individuals for 1936 variants on the DMET platform to evaluate their clinical potential in personalized medicine in ethnic Arabs. Results. Approximately 49% each of the 437 CYP450 variants, 56% of the 581 transporters, 56% of 419 transferases, 48% of the 104 dehydrogenases, and 58% of the remaining 390 variants were detected. Several variants, such as rs3740071, rs6193, rs258751, rs6199, rs11568421, and rs8187797, exhibited significantly either higher or lower minor allele frequencies (MAFs) than those in other ethnic groups. Discussion. The present study revealed some unique distribution trends for several variants in Arabs, which displayed partly inverse allelic prevalence compared to other ethnic populations. The results point therefore to the need to verify and ascertain the prevalence of a variant as a prerequisite for engaging it in clinical routine screening in personalized medicine in any given population. PMID:25802476

  10. AGRONOMICS1: A New Resource for Arabidopsis Transcriptome Profiling1[W][OA

    PubMed Central

    Rehrauer, Hubert; Aquino, Catharine; Gruissem, Wilhelm; Henz, Stefan R.; Hilson, Pierre; Laubinger, Sascha; Naouar, Naira; Patrignani, Andrea; Rombauts, Stephane; Shu, Huan; Van de Peer, Yves; Vuylsteke, Marnik; Weigel, Detlef; Zeller, Georg; Hennig, Lars

    2010-01-01

    Transcriptome profiling has become a routine tool in biology. For Arabidopsis (Arabidopsis thaliana), the Affymetrix ATH1 expression array is most commonly used, but it lacks about one-third of all annotated genes present in the reference strain. An alternative are tiling arrays, but previous designs have not allowed the simultaneous analysis of both strands on a single array. We introduce AGRONOMICS1, a new Affymetrix Arabidopsis microarray that contains the complete paths of both genome strands, with on average one 25mer probe per 35-bp genome sequence window. In addition, the new AGRONOMICS1 array contains all perfect match probes from the original ATH1 array, allowing for seamless integration of the very large existing ATH1 knowledge base. The AGRONOMICS1 array can be used for diverse functional genomics applications such as reliable expression profiling of more than 30,000 genes, detection of alternative splicing, and chromatin immunoprecipitation coupled to microarrays (ChIP-chip). Here, we describe the design of the array and compare its performance with that of the ATH1 array. We find results from both microarrays to be of similar quality, but AGRONOMICS1 arrays yield robust expression information for many more genes, as expected. Analysis of the ATH1 probes on AGRONOMICS1 arrays produces results that closely mirror those of ATH1 arrays. Finally, the AGRONOMICS1 array is shown to be useful for ChIP-chip experiments. We show that heterochromatic H3K9me2 is strongly confined to the gene body of target genes in euchromatic chromosome regions, suggesting that spreading of heterochromatin is limited outside of pericentromeric regions. PMID:20032078

  11. Gene Expression Quantitative Trait Locus Analysis of 16,000 Barley Genes Reveals a Complex Pattern of Genome-wide Transcriptional Regulation

    Technology Transfer Automated Retrieval System (TEKTRAN)

    Transcript abundance data from cRNA hybridizations to Affymetrix microarrays can be used for simultaneous marker development and genome-wide eQTL (expression Quantitative Trait Loci) analysis of crops. We have shown that it is easily possible to use the information from Affymetrix expression arrays ...

  12. Evaluating the Influence of Quality Control Decisions and Software Algorithms on SNP Calling for the Affymetrix 6.0 SNP Array Platform

    PubMed Central

    de Andrade, Mariza; Atkinson, Elizabeth J.; Bamlet, William R.; Matsumoto, Martha E.; Maharjan, Sooraj; Slager, Susan L.; Vachon, Celine M.; Cunningham, Julie M.; Kardia, Sharon L.R.

    2011-01-01

    Objective Our goal was to evaluate the influence of quality control (QC) decisions using two genotype calling algorithms, CRLMM and Birdseed, designed for the Affymetrix SNP Array 6.0. Methods Various QC options were tried using the two algorithms and comparisons were made on subject and call rate and on association results using two data sets. Results For Birdseed, we recommend using the contrast QC instead of QC call rate for sample QC. For CRLMM, we recommend using the signal-to-noise rate ≥4 for sample QC and a posterior probability of 90% for genotype accuracy. For both algorithms, we recommend calling the genotype separately for each plate, and dropping SNPs with a lower call rate (<95%) before evaluating samples with lower call rates. To investigate whether the genotype calls from the two algorithms impacted the genome-wide association results, we performed association analysis using data from the GENOA cohort; we observed that the number of significant SNPs were similar using either CRLMM or Birdseed. Conclusions Using our suggested workflow both algorithms performed similarly; however, fewer samples were removed and CRLMM took half the time to run our 854 study samples (4.2 h) compared to Birdseed (8.4 h). PMID:21734406

  13. Large-scale analysis of antisense transcription in wheat using the Affymetrix GeneChip Wheat Genome Array

    Technology Transfer Automated Retrieval System (TEKTRAN)

    Natural antisense transcripts (NATs) are transcripts of the opposite DNA strand to the sense-strand either at the same locus (cis-encoded) or a different locus (trans-encoded). They can affect gene expression at multiple stages including transcription, RNA processing and transport, and translation....

  14. Starr: Simple Tiling ARRay analysis of Affymetrix ChIP-chip data

    PubMed Central

    2010-01-01

    Background Chromatin immunoprecipitation combined with DNA microarrays (ChIP-chip) is an assay used for investigating DNA-protein-binding or post-translational chromatin/histone modifications. As with all high-throughput technologies, it requires thorough bioinformatic processing of the data for which there is no standard yet. The primary goal is to reliably identify and localize genomic regions that bind a specific protein. Further investigation compares binding profiles of functionally related proteins, or binding profiles of the same proteins in different genetic backgrounds or experimental conditions. Ultimately, the goal is to gain a mechanistic understanding of the effects of DNA binding events on gene expression. Results We present a free, open-source R/Bioconductor package Starr that facilitates comparative analysis of ChIP-chip data across experiments and across different microarray platforms. The package provides functions for data import, quality assessment, data visualization and exploration. Starr includes high-level analysis tools such as the alignment of ChIP signals along annotated features, correlation analysis of ChIP signals with complementary genomic data, peak-finding and comparative display of multiple clusters of binding profiles. It uses standard Bioconductor classes for maximum compatibility with other software. Moreover, Starr automatically updates microarray probe annotation files by a highly efficient remapping of microarray probe sequences to an arbitrary genome. Conclusion Starr is an R package that covers the complete ChIP-chip workflow from data processing to binding pattern detection. It focuses on the high-level data analysis, e.g., it provides methods for the integration and combined statistical analysis of binding profiles and complementary functional genomics data. Starr enables systematic assessment of binding behaviour for groups of genes that are alingned along arbitrary genomic features. PMID:20398407

  15. Development and application of a 6.5 million feature Affymetrix Genechip® for massively parallel discovery of single position polymorphisms in lettuce (Lactuca spp.)

    PubMed Central

    2012-01-01

    Background High-resolution genetic maps are needed in many crops to help characterize the genetic diversity that determines agriculturally important traits. Hybridization to microarrays to detect single feature polymorphisms is a powerful technique for marker discovery and genotyping because of its highly parallel nature. However, microarrays designed for gene expression analysis rarely provide sufficient gene coverage for optimal detection of nucleotide polymorphisms, which limits utility in species with low rates of polymorphism such as lettuce (Lactuca sativa). Results We developed a 6.5 million feature Affymetrix GeneChip® for efficient polymorphism discovery and genotyping, as well as for analysis of gene expression in lettuce. Probes on the microarray were designed from 26,809 unigenes from cultivated lettuce and an additional 8,819 unigenes from four related species (L. serriola, L. saligna, L. virosa and L. perennis). Where possible, probes were tiled with a 2 bp stagger, alternating on each DNA strand; providing an average of 187 probes covering approximately 600 bp for each of over 35,000 unigenes; resulting in up to 13 fold redundancy in coverage per nucleotide. We developed protocols for hybridization of genomic DNA to the GeneChip® and refined custom algorithms that utilized coverage from multiple, high quality probes to detect single position polymorphisms in 2 bp sliding windows across each unigene. This allowed us to detect greater than 18,000 polymorphisms between the parental lines of our core mapping population, as well as numerous polymorphisms between cultivated lettuce and wild species in the lettuce genepool. Using marker data from our diversity panel comprised of 52 accessions from the five species listed above, we were able to separate accessions by species using both phylogenetic and principal component analyses. Additionally, we estimated the diversity between different types of cultivated lettuce and distinguished morphological types

  16. FULL-GENOME ANALYSIS OF ALTERNATIVE SPLICING IN MOUSE LIVER AFTER HEPATOTOXICANT EXPOSURE

    EPA Science Inventory

    Alternative splicing plays a role in determining gene function and protein diversity. We have employed whole genome exon profiling using Affymetrix Mouse Exon 1.0 ST arrays to understand the significance of alternative splicing on a genome-wide scale in response to multiple toxic...

  17. Gene Expression in the Rat Brain during Sleep Deprivation and Recovery Sleep: An Affymetrix GeneChip® Study

    PubMed Central

    Terao, A.; Wisor, J.P.; Peyron, C.; Apte-Deshpande, A.; Wurts, S.W.; Edgar, D.M.; Kilduff, T.S.

    2016-01-01

    Previous studies have demonstrated that macromolecular synthesis in the brain is modulated in association with the occurrence of sleep and wakefulness. Similarly, the spectral composition of electroencephalographic activity that occurs during sleep is dependent on the duration of prior wakefulness. Since this homeostatic relationship between wake and sleep is highly conserved across mammalian species, genes that are truly involved in the electroencephalographic response to sleep deprivation (SD) might be expected to be conserved across mammalian species. Therefore, in the rat cerebral cortex, we have studied the effects of SD on the expression of immediate early gene (IEG) and heat shock protein (HSP) mRNAs previously shown to be upregulated in the mouse brain in SD and in recovery sleep (RS) after SD. We find that the molecular response to SD and RS in the brain is highly conserved between these two mammalian species, at least in terms of expression of IEG and HSP family members. Using Affymetrix Neurobiology U34 GeneChips®, we also screened the rat cerebral cortex, basal forebrain, and hypothalamus for other genes whose expression may be modulated by SD or RS. We find that the response of the basal forebrain to SD is more similar to that of the cerebral cortex than to the hypothalamus. Together, these results suggest that sleep-dependent changes in gene expression in the cerebral cortex are similar across rodent species and therefore may underlie sleep history-dependent changes in sleep electroencephalographic activity. PMID:16257491

  18. Linkage Disequilibrium And Genome-Wide Association Studies In O. sativa

    Technology Transfer Automated Retrieval System (TEKTRAN)

    There is increasing evidence that genome-wide association studies provide a powerful approach to find the genetic basis of complex phenotypic variation in all kinds of species. For this purpose, we developed the first generation 44K Affymetrix SNP array in rice (see Tung et al. poster). We genotyped...

  19. A Microarray Analysis for Differential Gene Expression in the Soybean Genome Using Bioconductor and R

    Technology Transfer Automated Retrieval System (TEKTRAN)

    This paper describes specific procedures for conducting quality assessment of Affymetrix GeneChip® soybean genome data and performing analyses to determine differential gene expression using the open-source R language and environment in conjunction with the open-source Bioconductor package. Procedu...

  20. Computational Integration of Structural and Functional Genomics Data Across Species to Develop Information on Porcine Inflammatory Gene Regulatory Pathway

    Technology Transfer Automated Retrieval System (TEKTRAN)

    Comparative integration of structural and functional genomic data across species holds great promise in finding genes controlling disease resistance. We are investigating the porcine gut immune response to infection through gene expression profiling. We have collected porcine Affymetrix GeneChip da...

  1. BIOINFORMATIC INTEGRATION OF STRUCTURAL AND FUNCTIONAL GENOMICS DATA ACROSS SPECIES TO DEVELOP PORCINE INFLAMMATORY GENE REGULATORY PATHWAY INFORMATION

    Technology Transfer Automated Retrieval System (TEKTRAN)

    Integration of structural and functional genomic data across species holds great promise in finding genes controlling disease resistance. We are investigating the porcine gut immune response to infection through gene expression profiling. We have collected porcine Affymetrix GeneChip data from RNA ...

  2. Computational Integration Of Structural And Functional Genomics Data Across Species To Develop Porcine Inflammatory Gene Regulatory Pathway Information

    Technology Transfer Automated Retrieval System (TEKTRAN)

    Comparative integration of structural and functional genomic data across species holds great promise in finding genes controlling disease resistance. We are investigating the porcine gut immune response to infection through gene expression profiling. We have collected porcine Affymetrix GeneChip da...

  3. Performance of the Affymetrix GeneChip HIV PRT 440 Platform for Antiretroviral Drug Resistance Genotyping of Human Immunodeficiency Virus Type 1 Clades and Viral Isolates with Length Polymorphisms

    PubMed Central

    Vahey, Maryanne; Nau, Martin E.; Barrick, Sandra; Cooley, John D.; Sawyer, Robert; Sleeker, Alex A.; Vickerman, Peter; Bloor, Stuart; Larder, Brendan; Michael, Nelson L.; Wegner, Scott A.

    1999-01-01

    The performance of a silica chip-based resequencing method, the Affymetrix HIV PRT 440 assay (hereafter referred to as the Affymetrix assay), was evaluated on a panel of well-characterized nonclade B viral isolates and on isolates exhibiting length polymorphisms. Sequencing of human immunodeficiency virus type 1 (HIV-1) pol cDNAs from clades A, C, D, E, and F resulted in clade-specific regions of base-calling ambiguities in regions not known to be associated with resistance polymorphisms, as well as a small number of spurious resistance polymorphisms. The Affymetrix assay failed to detect the presence of additional serine codons distal to reverse transcriptase (RT) codon 68 that are associated with multinucleoside RT inhibitor resistance. The increasing prevalence of non-clade B HIV-1 strains in the United States and Europe and the identification of clinically relevant pol gene length polymorphisms will impact the generalizability of the Affymetrix assay, emphasizing the need to accommodate this expanding pool of pol genotypes in future assay versions. PMID:10405396

  4. Genomic Copy Number Variations in the Genomes of Leukocytes Predict Prostate Cancer Clinical Outcomes

    PubMed Central

    Huo, Zhiguang; Martin, Amantha; Nelson, Joel B.; Tseng, George C.; Luo, Jian-Hua

    2015-01-01

    Accurate prediction of prostate cancer clinical courses remains elusive. In this study, we performed whole genome copy number analysis on leukocytes of 273 prostate cancer patients using Affymetrix SNP6.0 chip. Copy number variations (CNV) were found across all chromosomes of the human genome. An average of 152 CNV fragments per genome was identified in the leukocytes from prostate cancer patients. The size distributions of CNV in the genome of leukocytes were highly correlative with prostate cancer aggressiveness. A prostate cancer outcome prediction model was developed based on large size ratio of CNV from the leukocyte genomes. This prediction model generated an average prediction rate of 75.2%, with sensitivity of 77.3% and specificity of 69.0% for prostate cancer recurrence. When combined with Nomogram and the status of fusion transcripts, the average prediction rate was improved to 82.5% with sensitivity of 84.8% and specificity of 78.2%. In addition, the leukocyte prediction model was 62.6% accurate in predicting short prostate specific antigen doubling time. When combined with Gleason’s grade, Nomogram and the status of fusion transcripts, the prediction model generated a correct prediction rate of 77.5% with 73.7% sensitivity and 80.1% specificity. To our knowledge, this is the first study showing that CNVs in leukocyte genomes are predictive of clinical outcomes of a human malignancy. PMID:26295840

  5. Sequencing genomes from single cells by polymerase cloning.

    PubMed

    Zhang, Kun; Martiny, Adam C; Reppas, Nikos B; Barry, Kerrie W; Malek, Joel; Chisholm, Sallie W; Church, George M

    2006-06-01

    Genome sequencing currently requires DNA from pools of numerous nearly identical cells (clones), leaving the genome sequences of many difficult-to-culture microorganisms unattainable. We report a sequencing strategy that eliminates culturing of microorganisms by using real-time isothermal amplification to form polymerase clones (plones) from the DNA of single cells. Two Escherichia coli plones, analyzed by Affymetrix chip hybridization, demonstrate that plonal amplification is specific and the bias is randomly distributed. Whole-genome shotgun sequencing of Prochlorococcus MIT9312 plones showed 62% coverage of the genome from one plone at a sequencing depth of 3.5x, and 66% coverage from a second plone at a depth of 4.7x. Genomic regions not revealed in the initial round of sequencing are recovered by sequencing PCR amplicons derived from plonal DNA. The mutation rate in single-cell amplification is <2 x 10(5), better than that of current genome sequencing standards. Polymerase cloning should provide a critical tool for systematic characterization of genome diversity in the biosphere. PMID:16732271

  6. Arabidopsis transcriptional responses differentiate between O3 and herbicides

    EPA Science Inventory

    Using published data based on Affymetrix ATH1 Gene-Chips we characterized the transcriptional response of Arabidopsis thaliana Columbia to O3 and a few other major environmental stresses including oxidative stress . A set of 101 markers could be extracted which provided a compo...

  7. Genome-wide analysis correlates Ayurveda Prakriti

    PubMed Central

    Govindaraj, Periyasamy; Nizamuddin, Sheikh; Sharath, Anugula; Jyothi, Vuskamalla; Rotti, Harish; Raval, Ritu; Nayak, Jayakrishna; Bhat, Balakrishna K.; Prasanna, B. V.; Shintre, Pooja; Sule, Mayura; Joshi, Kalpana S.; Dedge, Amrish P.; Bharadwaj, Ramachandra; Gangadharan, G. G.; Nair, Sreekumaran; Gopinath, Puthiya M.; Patwardhan, Bhushan; Kondaiah, Paturu; Satyamoorthy, Kapaettu; Valiathan, Marthanda Varma Sankaran; Thangaraj, Kumarasamy

    2015-01-01

    The practice of Ayurveda, the traditional medicine of India, is based on the concept of three major constitutional types (Vata, Pitta and Kapha) defined as “Prakriti”. To the best of our knowledge, no study has convincingly correlated genomic variations with the classification of Prakriti. In the present study, we performed genome-wide SNP (single nucleotide polymorphism) analysis (Affymetrix, 6.0) of 262 well-classified male individuals (after screening 3416 subjects) belonging to three Prakritis. We found 52 SNPs (p ≤ 1 × 10−5) were significantly different between Prakritis, without any confounding effect of stratification, after 106 permutations. Principal component analysis (PCA) of these SNPs classified 262 individuals into their respective groups (Vata, Pitta and Kapha) irrespective of their ancestry, which represent its power in categorization. We further validated our finding with 297 Indian population samples with known ancestry. Subsequently, we found that PGM1 correlates with phenotype of Pitta as described in the ancient text of Caraka Samhita, suggesting that the phenotypic classification of India’s traditional medicine has a genetic basis; and its Prakriti-based practice in vogue for many centuries resonates with personalized medicine. PMID:26511157

  8. FLNA genomic rearrangements cause periventricular nodular heterotopia

    PubMed Central

    Clapham, K.R.; Yu, T.W.; Ganesh, V.S.; Barry, B.; Chan, Y.; Mei, D.; Parrini, E.; Funalot, B.; Dupuis, L.; Nezarati, M.M.; du Souich, C.; van Karnebeek, C.

    2012-01-01

    Objective: To identify copy number variant (CNV) causes of periventricular nodular heterotopia (PNH) in patients for whom FLNA sequencing is negative. Methods: Screening of 35 patients from 33 pedigrees on an Affymetrix 6.0 microarray led to the identification of one individual bearing a CNV that disrupted FLNA. FLNA-disrupting CNVs were also isolated in 2 other individuals by multiplex ligation probe amplification. These 3 cases were further characterized by high-resolution oligo array comparative genomic hybridization (CGH), and the precise junctional breakpoints of the rearrangements were identified by PCR amplification and sequencing. Results: We report 3 cases of PNH caused by nonrecurrent genomic rearrangements that disrupt one copy of FLNA. The first individual carried a 113-kb deletion that removes all but the first exon of FLNA. A second patient harbored a complex rearrangement including a deletion of the 3′ end of FLNA accompanied by a partial duplication event. A third patient bore a 39-kb deletion encompassing all of FLNA and the neighboring gene EMD. High-resolution oligo array CGH of the FLNA locus suggests distinct molecular mechanisms for each of these rearrangements, and implicates nearby low copy repeats in their pathogenesis. Conclusions: These results demonstrate that FLNA is prone to pathogenic rearrangements, and highlight the importance of screening for CNVs in individuals with PNH lacking FLNA point mutations. Neurology® 2012;78:269–278 PMID:22238415

  9. Case-Control Genome-Wide Association of Attention-Deficit / Hyperactivity Disorder

    PubMed Central

    Neale, Benjamin M.; Medland, Sarah; Ripke, Stephan; Anney, Richard J.L.; Asherson, Philip; Buitelaar, Jan; Franke, Barbara; Gill, Michael; Kent, Lindsey; Holmans, Peter; Middleton, Frank; Thapar, Anita; Lesch, Klaus-Peter; Faraone, Stephen V.; Daly, Mark; Nguyen, Thuy Trang; Schäfer, Helmut; Steinhausen, Hans-Christoph; Reif, Andreas; Renner, Tobias J.; Romanos, Marcel; Romanos, Jasmin; Warnke, Andreas; Walitza, Susanne; Freitag, Christine; Meyer, Jobst; Palmason, Haukur; Rothenberger, Aribert; Hawi, Ziarih; Sergeant, Joseph; Roeyers, Herbert; Biederman, Joseph

    2010-01-01

    Objective Although twin and family studies have shown attention deficit/hyperactivity disorder (ADHD) to be highly heritable, genetic variants influencing the trait at a genome-wide significant level have yet to be identified. Thus, additional genomewide association studies (GWAS) are needed. Method We used case-control analyses of 896 cases with DSM-IV ADHD genotyped using the Affymetrix 5.0 array and 2,455 repository controls screened for psychotic and bipolar symptoms genotyped using Affymetrix 6.0 arrays. A consensus SNP set was imputed using BEAGLE 3.0, resulting in an analysis dataset of 1,033,244 SNPs. The data were analyzed using a generalized linear model. Results No genome-wide significant associations were found. The most significant results implicated the following genes: PRKG1, FLNC, TCERG1L, PPM1H, NXPH1, PPM1H, CDH13, HK1 and HKDC1. Conclusions The current analyses are a useful addition to the present literature and will make a valuable contribution to future meta-analyses. The candidate gene findings are consistent with a prior meta-analysis in suggesting that the effects of ADHD risk variants must, individually, be very small and/or include multiple rare alleles. PMID:20732627

  10. Genome walking.

    PubMed

    Shapter, Frances M; Waters, Daniel L E

    2014-01-01

    Genome walking is a method for determining the DNA sequence of unknown genomic regions flanking a region of known DNA sequence. The Genome walking has the potential to capture 6-7 kb of sequence in a single round. Ideal for identifying gene promoter regions where only the coding region. Genome walking also has significant utility for capturing homologous genes in new species when there are areas in the target gene with strong sequence conservation to the characterized species. The increasing use of next-generation sequencing technologies will see the principles of genome walking adapted to in silico methods. However, for smaller projects, PCR-based genome walking will remain an efficient method of characterizing unknown flanking sequence. PMID:24243201

  11. Prophage Genomics

    PubMed Central

    Canchaya, Carlos; Proux, Caroline; Fournous, Ghislain; Bruttin, Anne; Brüssow, Harald

    2003-01-01

    The majority of the bacterial genome sequences deposited in the National Center for Biotechnology Information database contain prophage sequences. Analysis of the prophages suggested that after being integrated into bacterial genomes, they undergo a complex decay process consisting of inactivating point mutations, genome rearrangements, modular exchanges, invasion by further mobile DNA elements, and massive DNA deletion. We review the technical difficulties in defining such altered prophage sequences in bacterial genomes and discuss theoretical frameworks for the phage-bacterium interaction at the genomic level. The published genome sequences from three groups of eubacteria (low- and high-G+C gram-positive bacteria and γ-proteobacteria) were screened for prophage sequences. The prophages from Streptococcus pyogenes served as test case for theoretical predictions of the role of prophages in the evolution of pathogenic bacteria. The genomes from further human, animal, and plant pathogens, as well as commensal and free-living bacteria, were included in the analysis to see whether the same principles of prophage genomics apply for bacteria living in different ecological niches and coming from distinct phylogenetical affinities. The effect of selection pressure on the host bacterium is apparently an important force shaping the prophage genomes in low-G+C gram-positive bacteria and γ-proteobacteria. PMID:12794192

  12. Aquaculture Genomics

    Technology Transfer Automated Retrieval System (TEKTRAN)

    The genomics chapter covers the basics of genome mapping and sequencing and the current status of several relevant species. The chapter briefly describes the development and use of (cDNA, BAC, etc.) libraries for mapping and obtaining specific sequence information. Other topics include comparative ...

  13. Genetics and genomics of Drosophila mating behavior

    PubMed Central

    Mackay, Trudy F. C.; Heinsohn, Stefanie L.; Lyman, Richard F.; Moehring, Amanda J.; Morgan, Theodore J.; Rollmann, Stephanie M.

    2005-01-01

    The first steps of animal speciation are thought to be the development of sexual isolating mechanisms. In contrast to recent progress in understanding the genetic basis of postzygotic isolating mechanisms, little is known about the genetic architecture of sexual isolation. Here, we have subjected Drosophila melanogaster to 29 generations of replicated divergent artificial selection for mating speed. The phenotypic response to selection was highly asymmetrical in the direction of reduced mating speed, with estimates of realized heritability averaging 7%. The selection response was largely attributable to a reduction in female receptivity. We assessed the whole genome transcriptional response to selection for mating speed using Affymetrix GeneChips and a rigorous statistical analysis. Remarkably, >3,700 probe sets (21% of the array elements) exhibited a divergence in message levels between the Fast and Slow replicate lines. Genes with altered transcriptional abundance in response to selection fell into many different biological process and molecular function Gene Ontology categories, indicating substantial pleiotropy for this complex behavior. Future functional studies are necessary to test the extent to which transcript profiling of divergent selection lines accurately predicts genes that directly affect the selected trait. PMID:15851659

  14. A Pooled Genome-Wide Association Study of Asperger Syndrome

    PubMed Central

    Warrier, Varun; Chakrabarti, Bhismadev; Murphy, Laura; Chan, Allen; Craig, Ian; Mallya, Uma; Lakatošová, Silvia; Rehnstrom, Karola; Wheelwright, Sally; Allison, Carrie; Fisher, Simon E.; Baron-Cohen, Simon

    2015-01-01

    Asperger Syndrome (AS) is a neurodevelopmental condition characterized by impairments in social interaction and communication, alongside the presence of unusually repetitive, restricted interests and stereotyped behaviour. Individuals with AS have no delay in cognitive and language development. It is a subset of Autism Spectrum Conditions (ASC), which are highly heritable and has a population prevalence of approximately 1%. Few studies have investigated the genetic basis of AS. To address this gap in the literature, we performed a genome-wide pooled DNA association study to identify candidate loci in 612 individuals (294 cases and 318 controls) of Caucasian ancestry, using the Affymetrix GeneChip Human Mapping version 6.0 array. We identified 11 SNPs that had a p-value below 1x10-5. These SNPs were independently genotyped in the same sample. Three of the SNPs (rs1268055, rs7785891 and rs2782448) were nominally significant, though none remained significant after Bonferroni correction. Two of our top three SNPs (rs7785891 and rs2782448) lie in loci previously implicated in ASC. However, investigation of the three SNPs in the ASC genome-wide association dataset from the Psychiatric Genomics Consortium indicated that these three SNPs were not significantly associated with ASC. The effect sizes of the variants were modest, indicating that our study was not sufficiently powered to identify causal variants with precision. PMID:26176695

  15. Genome-wide association study of periodontal pathogen colonization.

    PubMed

    Divaris, K; Monda, K L; North, K E; Olshan, A F; Lange, E M; Moss, K; Barros, S P; Beck, J D; Offenbacher, S

    2012-07-01

    Pathological shifts of the human microbiome are characteristic of many diseases, including chronic periodontitis. To date, there is limited evidence on host genetic risk loci associated with periodontal pathogen colonization. We conducted a genome-wide association (GWA) study among 1,020 white participants of the Atherosclerosis Risk in Communities Study, whose periodontal diagnosis ranged from healthy to severe chronic periodontitis, and for whom "checkerboard" DNA-DNA hybridization quantification of 8 periodontal pathogens was performed. We examined 3 traits: "high red" and "high orange" bacterial complexes, and "high" Aggregatibacter actinomycetemcomitans (Aa) colonization. Genotyping was performed on the Affymetrix 6.0 platform. Imputation to 2.5 million markers was based on HapMap II-CEU, and a multiple-test correction was applied (genome-wide threshold of p < 5 × 10(-8)). We detected no genome-wide significant signals. However, 13 loci, including KCNK1, FBXO38, UHRF2, IL33, RUNX2, TRPS1, CAMTA1, and VAMP3, provided suggestive evidence (p < 5 × 10(-6)) of association. All associations reported for "red" and "orange" complex microbiota, but not for Aa, had the same effect direction in a second sample of 123 African-American participants. None of these polymorphisms was associated with periodontitis diagnosis. Investigations replicating these findings may lead to an improved understanding of the complex nature of host-microbiome interactions that characterizes states of health and disease. PMID:22699663

  16. Antarctic Genomics

    PubMed Central

    Clarke, Andrew; Cockell, Charles S.; Convey, Peter; Detrich III, H. William; Fraser, Keiron P. P.; Johnston, Ian A.; Methe, Barbara A.; Murray, Alison E.; Peck, Lloyd S.; Römisch, Karin; Rogers, Alex D.

    2004-01-01

    With the development of genomic science and its battery of technologies, polar biology stands on the threshold of a revolution, one that will enable the investigation of important questions of unprecedented scope and with extraordinary depth and precision. The exotic organisms of polar ecosystems are ideal candidates for genomic analysis. Through such analyses, it will be possible to learn not only the novel features that enable polar organisms to survive, and indeed thrive, in their extreme environments, but also fundamental biological principles that are common to most, if not all, organisms. This article aims to review recent developments in Antarctic genomics and to demonstrate the global context of such studies. PMID:18629155

  17. Genomic Testing

    MedlinePlus

    ... Working Group Independent Web site Informing the effective integration of genomics into health practice—Lynch syndrome ACCE Model for Evaluating Genetic Tests Recommendations by the EGAPP Working Group Top of ... ...

  18. Application of Whole Genome Expression Analysis to Assess Bacterial Responses to Environmental Conditions

    NASA Astrophysics Data System (ADS)

    Vukanti, R. V.; Mintz, E. M.; Leff, L. G.

    2005-05-01

    Bacterial responses to environmental signals are multifactorial and are coupled to changes in gene expression. An understanding of bacterial responses to environmental conditions is possible using microarray expression analysis. In this study, the utility of microarrays for examining changes in gene expression in Escherichia coli under different environmental conditions was assessed. RNA was isolated, hybridized to Affymetrix E. coli Genome 2.0 chips and analyzed using Affymetrix GCOS and Genespring software. Major limiting factors were obtaining enough quality RNA (107-108 cells to get 10μg RNA)and accounting for differences in growth rates under different conditions. Stabilization of RNA prior to isolation and taking extreme precautions while handling RNA were crucial. In addition, use of this method in ecological studies is limited by availability and cost of commercial arrays; choice of primers for cDNA synthesis, reproducibility, complexity of results generated and need to validate findings. This method may be more widely applicable with the development of better approaches for RNA recovery from environmental samples and increased number of available strain-specific arrays. Diligent experimental design and verification of results with real-time PCR or northern blots is needed. Overall, there is a great potential for use of this technology to discover mechanisms underlying organisms' responses to environmental conditions.

  19. Integrative genomics identifies molecular alterations that challenge the linear model of melanoma progression.

    PubMed

    Rose, Amy E; Poliseno, Laura; Wang, Jinhua; Clark, Michael; Pearlman, Alexander; Wang, Guimin; Vega Y Saenz de Miera, Eleazar C; Medicherla, Ratna; Christos, Paul J; Shapiro, Richard; Pavlick, Anna; Darvishian, Farbod; Zavadil, Jiri; Polsky, David; Hernando, Eva; Ostrer, Harry; Osman, Iman

    2011-04-01

    Superficial spreading melanoma (SSM) and nodular melanoma (NM) are believed to represent sequential phases of linear progression from radial to vertical growth. Several lines of clinical, pathologic, and epidemiologic evidence suggest, however, that SSM and NM might be the result of independent pathways of tumor development. We utilized an integrative genomic approach that combines single nucleotide polymorphism array (6.0; Affymetrix) with gene expression array (U133A 2.0; Affymetrix) to examine molecular differences between SSM and NM. Pathway analysis of the most differentially expressed genes between SSM and NM (N = 114) revealed significant differences related to metabolic processes. We identified 8 genes (DIS3, FGFR1OP, G3BP2, GALNT7, MTAP, SEC23IP, USO1, and ZNF668) in which NM/SSM-specific copy number alterations correlated with differential gene expression (P < 0.05; Spearman's rank). SSM-specific genomic deletions in G3BP2, MTAP, and SEC23IP were independently verified in two external data sets. Forced overexpression of metabolism-related gene MTAP (methylthioadenosine phosphorylase) in SSM resulted in reduced cell growth. The differential expression of another metabolic-related gene, aldehyde dehydrogenase 7A1 (ALDH7A1), was validated at the protein level by using tissue microarrays of human melanoma. In addition, we show that the decreased ALDH7A1 expression in SSM may be the result of epigenetic modifications. Our data reveal recurrent genomic deletions in SSM not present in NM, which challenge the linear model of melanoma progression. Furthermore, our data suggest a role for altered regulation of metabolism-related genes as a possible cause of the different clinical behavior of SSM and NM. PMID:21343389

  20. Patterns of Positive Selection in Six Mammalian Genomes

    PubMed Central

    Kosiol, Carolin; Vinař, Tomáš; da Fonseca, Rute R.; Hubisz, Melissa J.; Bustamante, Carlos D.; Nielsen, Rasmus; Siepel, Adam

    2008-01-01

    Genome-wide scans for positively selected genes (PSGs) in mammals have provided insight into the dynamics of genome evolution, the genetic basis of differences between species, and the functions of individual genes. However, previous scans have been limited in power and accuracy owing to small numbers of available genomes. Here we present the most comprehensive examination of mammalian PSGs to date, using the six high-coverage genome assemblies now available for eutherian mammals. The increased phylogenetic depth of this dataset results in substantially improved statistical power, and permits several new lineage- and clade-specific tests to be applied. Of ∼16,500 human genes with high-confidence orthologs in at least two other species, 400 genes showed significant evidence of positive selection (FDR<0.05), according to a standard likelihood ratio test. An additional 144 genes showed evidence of positive selection on particular lineages or clades. As in previous studies, the identified PSGs were enriched for roles in defense/immunity, chemosensory perception, and reproduction, but enrichments were also evident for more specific functions, such as complement-mediated immunity and taste perception. Several pathways were strongly enriched for PSGs, suggesting possible co-evolution of interacting genes. A novel Bayesian analysis of the possible “selection histories” of each gene indicated that most PSGs have switched multiple times between positive selection and nonselection, suggesting that positive selection is often episodic. A detailed analysis of Affymetrix exon array data indicated that PSGs are expressed at significantly lower levels, and in a more tissue-specific manner, than non-PSGs. Genes that are specifically expressed in the spleen, testes, liver, and breast are significantly enriched for PSGs, but no evidence was found for an enrichment for PSGs among brain-specific genes. This study provides additional evidence for widespread positive selection in

  1. The Genomic Landscape of Pancreatic and Periampullary Adenocarcinoma.

    PubMed

    Sandhu, Vandana; Wedge, David C; Bowitz Lothe, Inger Marie; Labori, Knut Jørgen; Dentro, Stefan C; Buanes, Trond; Skrede, Martina L; Dalsgaard, Astrid M; Munthe, Else; Myklebost, Ola; Lingjærde, Ole Christian; Børresen-Dale, Anne-Lise; Ikdahl, Tone; Van Loo, Peter; Nord, Silje; Kure, Elin H

    2016-09-01

    Despite advances in diagnostics, less than 5% of patients with periampullary tumors experience an overall survival of five years or more. Periampullary tumors are neoplasms that arise in the vicinity of the ampulla of Vater, an enlargement of liver and pancreas ducts where they join and enter the small intestine. In this study, we analyzed copy number aberrations using Affymetrix SNP 6.0 arrays in 60 periampullary adenocarcinomas from Oslo University Hospital to identify genome-wide copy number aberrations, putative driver genes, deregulated pathways, and potential prognostic markers. Results were validated in a separate cohort derived from The Cancer Genome Atlas Consortium (n = 127). In contrast to many other solid tumors, periampullary adenocarcinomas exhibited more frequent genomic deletions than gains. Genes in the frequently codeleted region 17p13 and 18q21/22 were associated with cell cycle, apoptosis, and p53 and Wnt signaling. By integrating genomics and transcriptomics data from the same patients, we identified CCNE1 and ERBB2 as candidate driver genes. Morphologic subtypes of periampullary adenocarcinomas (i.e., pancreatobiliary or intestinal) harbor many common genomic aberrations. However, gain of 13q and 3q, and deletions of 5q were found specific to the intestinal subtype. Our study also implicated the use of the PAM50 classifier in identifying a subgroup of patients with a high proliferation rate, which had impaired survival. Furthermore, gain of 18p11 (18p11.21-23, 18p11.31-32) and 19q13 (19q13.2, 19q13.31-32) and subsequent overexpression of the genes in these loci were associated with impaired survival. Our work identifies potential prognostic markers for periampullary tumors, the genetic characterization of which has lagged. Cancer Res; 76(17); 5092-102. ©2016 AACR. PMID:27488532

  2. Genome databases

    SciTech Connect

    Courteau, J.

    1991-10-11

    Since the Genome Project began several years ago, a plethora of databases have been developed or are in the works. They range from the massive Genome Data Base at Johns Hopkins University, the central repository of all gene mapping information, to small databases focusing on single chromosomes or organisms. Some are publicly available, others are essentially private electronic lab notebooks. Still others limit access to a consortium of researchers working on, say, a single human chromosome. An increasing number incorporate sophisticated search and analytical software, while others operate as little more than data lists. In consultation with numerous experts in the field, a list has been compiled of some key genome-related databases. The list was not limited to map and sequence databases but also included the tools investigators use to interpret and elucidate genetic data, such as protein sequence and protein structure databases. Because a major goal of the Genome Project is to map and sequence the genomes of several experimental animals, including E. coli, yeast, fruit fly, nematode, and mouse, the available databases for those organisms are listed as well. The author also includes several databases that are still under development - including some ambitious efforts that go beyond data compilation to create what are being called electronic research communities, enabling many users, rather than just one or a few curators, to add or edit the data and tag it as raw or confirmed.

  3. Listeria Genomics

    NASA Astrophysics Data System (ADS)

    Cabanes, Didier; Sousa, Sandra; Cossart, Pascale

    The opportunistic intracellular foodborne pathogen Listeria monocytogenes has become a paradigm for the study of host-pathogen interactions and bacterial adaptation to mammalian hosts. Analysis of L. monocytogenes infection has provided considerable insight into how bacteria invade cells, move intracellularly, and disseminate in tissues, as well as tools to address fundamental processes in cell biology. Moreover, the vast amount of knowledge that has been gathered through in-depth comparative genomic analyses and in vivo studies makes L. monocytogenes one of the most well-studied bacterial pathogens. This chapter provides an overview of progress in the exploration of genomic, transcriptomic, and proteomic data in Listeria spp. to understand genome evolution and diversity, as well as physiological aspects of metabolism used by bacteria when growing in diverse environments, in particular in infected hosts.

  4. Genome Informatics

    PubMed Central

    Winslow, Raimond L.; Boguski, Mark S.

    2005-01-01

    This article reviews recent advances in genomics and informatics relevant to cardiovascular research. In particular, we review the status of (1) whole genome sequencing efforts in human, mouse, rat, zebrafish, and dog; (2) the development of data mining and analysis tools; (3) the launching of the National Heart, Lung, and Blood Institute Programs for Genomics Applications and Proteomics Initiative; (4) efforts to characterize the cardiac transcriptome and proteome; and (5) the current status of computational modeling of the cardiac myocyte. In each instance, we provide links to relevant sources of information on the World Wide Web and critical appraisals of the promises and the challenges of an expanding and diverse information landscape. PMID:12750305

  5. Integrating Sequencing Technologies in Personal Genomics: Optimal Low Cost Reconstruction of Structural Variants

    PubMed Central

    Du, Jiang; Bjornson, Robert D.; Zhang, Zhengdong D.; Kong, Yong; Snyder, Michael; Gerstein, Mark B.

    2009-01-01

    The goal of human genome re-sequencing is obtaining an accurate assembly of an individual's genome. Recently, there has been great excitement in the development of many technologies for this (e.g. medium and short read sequencing from companies such as 454 and SOLiD, and high-density oligo-arrays from Affymetrix and NimbelGen), with even more expected to appear. The costs and sensitivities of these technologies differ considerably from each other. As an important goal of personal genomics is to reduce the cost of re-sequencing to an affordable point, it is worthwhile to consider optimally integrating technologies. Here, we build a simulation toolbox that will help us optimally combine different technologies for genome re-sequencing, especially in reconstructing large structural variants (SVs). SV reconstruction is considered the most challenging step in human genome re-sequencing. (It is sometimes even harder than de novo assembly of small genomes because of the duplications and repetitive sequences in the human genome.) To this end, we formulate canonical problems that are representative of issues in reconstruction and are of small enough scale to be computationally tractable and simulatable. Using semi-realistic simulations, we show how we can combine different technologies to optimally solve the assembly at low cost. With mapability maps, our simulations efficiently handle the inhomogeneous repeat-containing structure of the human genome and the computational complexity of practical assembly algorithms. They quantitatively show how combining different read lengths is more cost-effective than using one length, how an optimal mixed sequencing strategy for reconstructing large novel SVs usually also gives accurate detection of SNPs/indels, how paired-end reads can improve reconstruction efficiency, and how adding in arrays is more efficient than just sequencing for disentangling some complex SVs. Our strategy should facilitate the sequencing of human genomes at

  6. Whole genome association analysis shows that ACE is a risk factor for Alzheimer's disease and fails to replicate most candidates from Meta-analysis.

    PubMed

    Webster, Jennifer; Reiman, Eric M; Zismann, Victoria L; Joshipura, Keta D; Pearson, John V; Hu-Lince, Diane; Huentelman, Matthew J; Craig, David W; Coon, Keith D; Beach, Thomas; Rohrer, Kristen C; Zhao, Alice S; Leung, Doris; Bryden, Leslie; Marlowe, Lauren; Kaleem, Mona; Mastroeni, Diego; Grover, Andrew; Rogers, Joseph; Heun, Reinhard; Jessen, Frank; Kölsch, Heike; Heward, Christopher B; Ravid, Rivka; Hutton, Michael L; Melquist, Stacey; Petersen, Ron C; Caselli, Richard J; Papassotiropoulos, Andreas; Stephan, Dietrich A; Hardy, John; Myers, Amanda

    2010-01-01

    For late onset Alzheimer's disease (LOAD), the only confirmed, genetic association is with the apolipoprotein E (APOE) locus on chromosome 19. Meta-analysis is often employed to sort the true associations from the false positives. LOAD research has the advantage of a continuously updated meta-analysis of candidate gene association studies in the web-based AlzGene database. The top 30 AlzGene loci on May 1(st), 2007 were investigated in our whole genome association data set consisting of 1411 LOAD cases and neuropathoiogicaiiy verified controls genotyped at 312,316 SNPs using the Affymetrix 500K Mapping Platform. Of the 30 "top AlzGenes", 32 SNPs in 24 genes had odds ratios (OR) whose 95% confidence intervals that did not include 1. Of these 32 SNPs, six were part of the Affymetrix 500K Mapping panel and another ten had proxies on the Affymetrix array that had >80% power to detect an association with α=0.001. Two of these 16 SNPs showed significant association with LOAD in our sample series. One was rs4420638 at the APOE locus (uncorrected p-value=4.58E-37) and the other was rs4293, located in the angiotensin converting enzyme (ACE) locus (uncorrected p-value=0.014). Since this result was nominally significant, but did not survive multiple testing correction for 16 independent tests, this association at rs4293 was verified in a geographically distinct German cohort (p-value=0.03). We present the results of our ACE replication aiongwith a discussion of the statistical limitations of multiple test corrections in whole genome studies. PMID:21537449

  7. Admixture mapping identifies introgressed genomic regions in North American canids.

    PubMed

    vonHoldt, Bridgett M; Kays, Roland; Pollinger, John P; Wayne, Robert K

    2016-06-01

    Hybrid zones typically contain novel gene combinations that can be tested by natural selection in a unique genetic context. Parental haplotypes that increase fitness can introgress beyond the hybrid zone, into the range of parental species. We used the Affymetrix canine SNP genotyping array to identify genomic regions tagged by multiple ancestry informative markers that are more frequent in an admixed population than expected. We surveyed a hybrid zone formed in the last 100 years as coyotes expanded their range into eastern North America. Concomitant with expansion, coyotes hybridized with wolves and some populations became more wolflike, such that coyotes in the northeast have the largest body size of any coyote population. Using a set of 3102 ancestry informative markers, we identified 60 differentially introgressed regions in 44 canines across this admixture zone. These regions are characterized by an excess of exogenous ancestry and, in northeastern coyotes, are enriched for genes affecting body size and skeletal proportions. Further, introgressed wolf-derived alleles have penetrated into Southern US coyote populations. Because no wolves currently exist in this area, these alleles are unlikely to have originated from recent hybridization. Instead, they probably originated from intraspecific gene flow or ancient admixture. We show that grey wolf and coyote admixture has far-reaching effects and, in addition to phenotypically transforming admixed populations, allows for the differential movement of alleles from different parental species to be tested in new genomic backgrounds. PMID:27106273

  8. Defining the genomic signature of the parous breast

    PubMed Central

    2012-01-01

    Background It is accepted that a woman's lifetime risk of developing breast cancer after menopause is reduced by early full term pregnancy and multiparity. This phenomenon is thought to be associated with the development and differentiation of the breast during pregnancy. Methods In order to understand the underlying molecular mechanisms of pregnancy induced breast cancer protection, we profiled and compared the transcriptomes of normal breast tissue biopsies from 71 parous (P) and 42 nulliparous (NP) healthy postmenopausal women using Affymetrix Human Genome U133 Plus 2.0 arrays. To validate the results, we performed real time PCR and immunohistochemistry. Results We identified 305 differentially expressed probesets (208 distinct genes). Of these, 267 probesets were up- and 38 down-regulated in parous breast samples; bioinformatics analysis using gene ontology enrichment revealed that up-regulated genes in the parous breast represented biological processes involving differentiation and development, anchoring of epithelial cells to the basement membrane, hemidesmosome and cell-substrate junction assembly, mRNA and RNA metabolic processes and RNA splicing machinery. The down-regulated genes represented biological processes that comprised cell proliferation, regulation of IGF-like growth factor receptor signaling, somatic stem cell maintenance, muscle cell differentiation and apoptosis. Conclusions This study suggests that the differentiation of the breast imprints a genomic signature that is centered in the mRNA processing reactome. These findings indicate that pregnancy may induce a safeguard mechanism at post-transcriptional level that maintains the fidelity of the transcriptional process. PMID:23057841

  9. Whither genomics?

    PubMed Central

    Murray, Andrew W

    2000-01-01

    The flood of data from genome-wide analysis is transforming biology. We need to develop new, interdisciplinary approaches to convert these data into information about the components and structures of individual biological pathways and to use the resulting information to yield knowledge about general principles that explain the functions and evolution of life. PMID:11104516

  10. Comparison of Comparative Genomic Hybridization Technologies across Microarray Platforms

    EPA Science Inventory

    In the 2007 Association of Biomolecular Resource Facilities (ABRF) Microarray Research Group (MARG) project, we analyzed HL-60 DNA with five platforms: Agilent, Affymetrix 500K, Affymetrix U133 Plus 2.0, Illumina, and RPCI 19K BAC arrays. Copy number variation (CNV) was analyzed ...

  11. Citrus Genomics

    PubMed Central

    Talon, Manuel; Gmitter Jr., Fred G.

    2008-01-01

    Citrus is one of the most widespread fruit crops globally, with great economic and health value. It is among the most difficult plants to improve through traditional breeding approaches. Currently, there is risk of devastation by diseases threatening to limit production and future availability to the human population. As technologies rapidly advance in genomic science, they are quickly adapted to address the biological challenges of the citrus plant system and the world's industries. The historical developments of linkage mapping, markers and breeding, EST projects, physical mapping, an international citrus genome sequencing project, and critical functional analysis are described. Despite the challenges of working with citrus, there has been substantial progress. Citrus researchers engaged in international collaborations provide optimism about future productivity and contributions to the benefit of citrus industries worldwide and to the human population who can rely on future widespread availability of this health-promoting and aesthetically pleasing fruit crop. PMID:18509486

  12. Ancient genomics

    PubMed Central

    Der Sarkissian, Clio; Allentoft, Morten E.; Ávila-Arcos, María C.; Barnett, Ross; Campos, Paula F.; Cappellini, Enrico; Ermini, Luca; Fernández, Ruth; da Fonseca, Rute; Ginolhac, Aurélien; Hansen, Anders J.; Jónsson, Hákon; Korneliussen, Thorfinn; Margaryan, Ashot; Martin, Michael D.; Moreno-Mayar, J. Víctor; Raghavan, Maanasa; Rasmussen, Morten; Velasco, Marcela Sandoval; Schroeder, Hannes; Schubert, Mikkel; Seguin-Orlando, Andaine; Wales, Nathan; Gilbert, M. Thomas P.; Willerslev, Eske; Orlando, Ludovic

    2015-01-01

    The past decade has witnessed a revolution in ancient DNA (aDNA) research. Although the field's focus was previously limited to mitochondrial DNA and a few nuclear markers, whole genome sequences from the deep past can now be retrieved. This breakthrough is tightly connected to the massive sequence throughput of next generation sequencing platforms and the ability to target short and degraded DNA molecules. Many ancient specimens previously unsuitable for DNA analyses because of extensive degradation can now successfully be used as source materials. Additionally, the analytical power obtained by increasing the number of sequence reads to billions effectively means that contamination issues that have haunted aDNA research for decades, particularly in human studies, can now be efficiently and confidently quantified. At present, whole genomes have been sequenced from ancient anatomically modern humans, archaic hominins, ancient pathogens and megafaunal species. Those have revealed important functional and phenotypic information, as well as unexpected adaptation, migration and admixture patterns. As such, the field of aDNA has entered the new era of genomics and has provided valuable information when testing specific hypotheses related to the past. PMID:25487338

  13. Genomic Imprinting

    PubMed Central

    Bajrami, Emirjeta; Spiroski, Mirko

    2016-01-01

    BACKGROUND: Genomic imprinting is the inheritance out of Mendelian borders. Many of inherited diseases and human development violates Mendelian law of inheritance, this way of inheriting is studied by epigenetics. AIM: The aim of this review is to analyze current opinions and options regarding to this way of inheriting. RESULTS: Epigenetics shows that gene expression undergoes changes more complex than modifications in the DNA sequence; it includes the environmental influence on the gametes before conception. Humans inherit two alleles from mother and father, both are functional for the majority of the genes, but sometimes one is turned off or “stamped” and doesn’t show in offspring, that gene is imprinted. Imprinting means that that gene is silenced, and gene from other parent is expressed. The mechanisms for imprinting are still incompletely defined, but they involve epigenetic modifications that are erased and then reset during the creation of eggs and sperm. Genomic imprinting is a process of silencing genes through DNA methylation. The repressed allele is methylated, while the active allele is unmethylated. The most well-known conditions include Prader-Willi syndrome, and Angelman syndrome. Both of these syndromes can be caused by imprinting or other errors involving genes on the long arm of chromosome 15. CONCLUSIONS: Genomic imprinting and other epigenetic mechanisms such as environment is shown that plays role in offspring neurodevelopment and autism spectrum disorder. PMID:27275355

  14. A Fast Implementation of a Scan Statistic for Identifying Chromosomal Patterns of Genome Wide Association Studies

    PubMed Central

    Sun, Yan V.; Jacobsen, Douglas M.; Turner, Stephen T.; Boerwinkle, Eric; Kardia, Sharon L.R.

    2009-01-01

    In order to take into account the complex genomic distribution of SNP variations when identifying chromosomal regions with significant SNP effects, a single nucleotide polymorphism (SNP) association scan statistic was developed. To address the computational needs of genome wide association (GWA) studies, a fast Java application, which combines single-locus SNP tests and a scan statistic for identifying chromosomal regions with significant clusters of significant SNP effects, was developed and implemented. To illustrate this application, SNP associations were analyzed in a pharmacogenomic study of the blood pressure lowering effect of thiazide-diuretics (N=195) using the Affymetrix Human Mapping 100K Set. 55,335 tagSNPs (pair-wise linkage disequilibrium R2<0.5) were selected to reduce the frequency correlation between SNPs. A typical workstation can complete the whole genome scan including 10,000 permutation tests within 3 hours. The most significant regions locate on chromosome 3, 6, 13 and 16, two of which contain candidate genes that may be involved in the underlying drug response mechanism. The computational performance of ChromoScan-GWA and its scalability were tested with up to 1,000,000 SNPs and up to 4,000 subjects. Using 10,000 permutations, the computation time grew linearly in these datasets. This scan statistic application provides a robust statistical and computational foundation for identifying genomic regions associated with disease and provides a method to compare GWA results even across different platforms. PMID:20161066

  15. Development of the catfish 250K SNP array for genome-wide association studies

    PubMed Central

    2014-01-01

    Background Quantitative traits, such as disease resistance, are most often controlled by a set of genes involving a complex array of regulation. The dissection of genetic basis of quantitative traits requires large numbers of genetic markers with good genome coverage. The application of next-generation sequencing technologies has allowed discovery of over eight million SNPs in catfish, but the challenge remains as to how to efficiently and economically use such SNP resources for genetic analysis. Results In this work, we developed a catfish 250K SNP array using Affymetrix Axiom genotyping technology. The SNPs were obtained from multiple sources including gene-associated SNPs, anonymous genomic SNPs, and inter-specific SNPs. A set of 640K high-quality SNPs obtained following specific requirements of array design were submitted. A panel of 250,113 SNPs was finalized for inclusion on the array. The performance evaluated by genotyping individuals from wild populations and backcross families suggested the good utility of the catfish 250K SNP array. Conclusions This is the first high-density SNP array for catfish. The array should be a valuable resource for genome-wide association studies (GWAS), fine QTL mapping, high-density linkage map construction, haplotype analysis, and whole genome-based selection. PMID:24618043

  16. Diversity in global gene expression and morphology across a watercress (Nasturtium officinale R. Br.) germplasm collection: first steps to breeding.

    PubMed

    Payne, Adrienne C; Clarkson, Graham J J; Rothwell, Steve; Taylor, Gail

    2015-01-01

    Watercress (Nasturtium officinale R. Br.) is a nutrient intense, leafy crop that is consumed raw or in soups across the globe, but for which, currently no genomic resources or breeding programme exists. Promising morphological, biochemical and functional genomic variation was identified for the first time in a newly established watercress germplasm collection, consisting of 48 watercress accessions sourced from contrasting global locations. Stem length, stem diameter and anti-oxidant (AO) potential varied across the accessions. This variation was used to identify three extreme contrasting accessions for further analysis. Variation in global gene expression was investigated using an Affymetrix Arabidopsis ATH1 microarray gene chip, using the commercial control (C), an accession selected for dwarf phenotype with a high AO potential (dwarfAO, called 'Boldrewood') and one with high AO potential alone. A set of transcripts significantly differentially expressed between these three accessions, were identified, including transcripts involved in the regulation of growth and development and those involved in secondary metabolism. In particular, when differential gene expression was compared between C and dwarfAO, the dwarfAO was characterised by increased expression of genes encoding glucosinolates, which are known precursors of phenethyl isothiocyanate, linked to the anti-carcinogenic effects well-documented in watercress. This study provides the first analysis of natural variation across the watercress genome and has identified important underpinning information for future breeding for enhanced anti-carcinogenic properties and morphology traits in this nutrient-intense crop. PMID:26504575

  17. Diversity in global gene expression and morphology across a watercress (Nasturtium officinale R. Br.) germplasm collection: first steps to breeding

    PubMed Central

    Payne, Adrienne C.; Clarkson, Graham J.J.; Rothwell, Steve; Taylor, Gail

    2015-01-01

    Watercress (Nasturtium officinale R. Br.) is a nutrient intense, leafy crop that is consumed raw or in soups across the globe, but for which, currently no genomic resources or breeding programme exists. Promising morphological, biochemical and functional genomic variation was identified for the first time in a newly established watercress germplasm collection, consisting of 48 watercress accessions sourced from contrasting global locations. Stem length, stem diameter and anti-oxidant (AO) potential varied across the accessions. This variation was used to identify three extreme contrasting accessions for further analysis. Variation in global gene expression was investigated using an Affymetrix Arabidopsis ATH1 microarray gene chip, using the commercial control (C), an accession selected for dwarf phenotype with a high AO potential (dwarfAO, called ‘Boldrewood’) and one with high AO potential alone. A set of transcripts significantly differentially expressed between these three accessions, were identified, including transcripts involved in the regulation of growth and development and those involved in secondary metabolism. In particular, when differential gene expression was compared between C and dwarfAO, the dwarfAO was characterised by increased expression of genes encoding glucosinolates, which are known precursors of phenethyl isothiocyanate, linked to the anti-carcinogenic effects well-documented in watercress. This study provides the first analysis of natural variation across the watercress genome and has identified important underpinning information for future breeding for enhanced anti-carcinogenic properties and morphology traits in this nutrient-intense crop. PMID:26504575

  18. Integrative Genomics Identifies Gene Signature Associated with Melanoma Ulceration

    PubMed Central

    Toth, Reka; Vizkeleti, Laura; Herandez-Vargas, Hector; Lazar, Viktoria; Emri, Gabriella; Szatmari, Istvan; Herceg, Zdenko; Adany, Roza; Balazs, Margit

    2013-01-01

    Background Despite the extensive research approaches applied to characterise malignant melanoma, no specific molecular markers are available that are clearly related to the progression of this disease. In this study, our aims were to define a gene expression signature associated with the clinical outcome of melanoma patients and to provide an integrative interpretation of the gene expression -, copy number alterations -, and promoter methylation patterns that contribute to clinically relevant molecular functional alterations. Methods Gene expression profiles were determined using the Affymetrix U133 Plus2.0 array. The NimbleGen Human CGH Whole-Genome Tiling array was used to define CNAs, and the Illumina GoldenGate Methylation platform was applied to characterise the methylation patterns of overlapping genes. Results We identified two subclasses of primary melanoma: one representing patients with better prognoses and the other being characteristic of patients with unfavourable outcomes. We assigned 1,080 genes as being significantly correlated with ulceration, 987 genes were downregulated and significantly enriched in the p53, Nf-kappaB, and WNT/beta-catenin pathways. Through integrated genome analysis, we defined 150 downregulated genes whose expression correlated with copy number losses in ulcerated samples. These genes were significantly enriched on chromosome 6q and 10q, which contained a total of 36 genes. Ten of these genes were downregulated and involved in cell-cell and cell-matrix adhesion or apoptosis. The expression and methylation patterns of additional genes exhibited an inverse correlation, suggesting that transcriptional silencing of these genes is driven by epigenetic events. Conclusion Using an integrative genomic approach, we were able to identify functionally relevant molecular hotspots characterised by copy number losses and promoter hypermethylation in distinct molecular subtypes of melanoma that contribute to specific transcriptomic silencing

  19. Robust Demographic Inference from Genomic and SNP Data

    PubMed Central

    Excoffier, Laurent; Dupanloup, Isabelle; Huerta-Sánchez, Emilia; Sousa, Vitor C.; Foll, Matthieu

    2013-01-01

    We introduce a flexible and robust simulation-based framework to infer demographic parameters from the site frequency spectrum (SFS) computed on large genomic datasets. We show that our composite-likelihood approach allows one to study evolutionary models of arbitrary complexity, which cannot be tackled by other current likelihood-based methods. For simple scenarios, our approach compares favorably in terms of accuracy and speed with , the current reference in the field, while showing better convergence properties for complex models. We first apply our methodology to non-coding genomic SNP data from four human populations. To infer their demographic history, we compare neutral evolutionary models of increasing complexity, including unsampled populations. We further show the versatility of our framework by extending it to the inference of demographic parameters from SNP chips with known ascertainment, such as that recently released by Affymetrix to study human origins. Whereas previous ways of handling ascertained SNPs were either restricted to a single population or only allowed the inference of divergence time between a pair of populations, our framework can correctly infer parameters of more complex models including the divergence of several populations, bottlenecks and migration. We apply this approach to the reconstruction of African demography using two distinct ascertained human SNP panels studied under two evolutionary models. The two SNP panels lead to globally very similar estimates and confidence intervals, and suggest an ancient divergence (>110 Ky) between Yoruba and San populations. Our methodology appears well suited to the study of complex scenarios from large genomic data sets. PMID:24204310

  20. Genomes on ice.

    PubMed

    Parkhill, Julian

    2016-03-01

    This month's Genome Watch discusses the analysis of a Helicobacter pylori genome from the preserved Copper-Age mummy known as the Iceman and how ancient genomes shed light on the history of bacterial pathogens. PMID:26853114

  1. Whole Genome Sequencing

    MedlinePlus

    ... you want to learn. Search form Search Whole Genome Sequencing You are here Home Testing & Services Testing ... the full story, click here . What is whole genome sequencing? Whole genome sequencing is the mapping out ...

  2. Ensembl Genomes 2016: more genomes, more complexity.

    PubMed

    Kersey, Paul Julian; Allen, James E; Armean, Irina; Boddu, Sanjay; Bolt, Bruce J; Carvalho-Silva, Denise; Christensen, Mikkel; Davis, Paul; Falin, Lee J; Grabmueller, Christoph; Humphrey, Jay; Kerhornou, Arnaud; Khobova, Julia; Aranganathan, Naveen K; Langridge, Nicholas; Lowy, Ernesto; McDowall, Mark D; Maheswari, Uma; Nuhn, Michael; Ong, Chuang Kee; Overduin, Bert; Paulini, Michael; Pedro, Helder; Perry, Emily; Spudich, Giulietta; Tapanari, Electra; Walts, Brandon; Williams, Gareth; Tello-Ruiz, Marcela; Stein, Joshua; Wei, Sharon; Ware, Doreen; Bolser, Daniel M; Howe, Kevin L; Kulesha, Eugene; Lawson, Daniel; Maslen, Gareth; Staines, Daniel M

    2016-01-01

    Ensembl Genomes (http://www.ensemblgenomes.org) is an integrating resource for genome-scale data from non-vertebrate species, complementing the resources for vertebrate genomics developed in the context of the Ensembl project (http://www.ensembl.org). Together, the two resources provide a consistent set of programmatic and interactive interfaces to a rich range of data including reference sequence, gene models, transcriptional data, genetic variation and comparative analysis. This paper provides an update to the previous publications about the resource, with a focus on recent developments. These include the development of new analyses and views to represent polyploid genomes (of which bread wheat is the primary exemplar); and the continued up-scaling of the resource, which now includes over 23 000 bacterial genomes, 400 fungal genomes and 100 protist genomes, in addition to 55 genomes from invertebrate metazoa and 39 genomes from plants. This dramatic increase in the number of included genomes is one part of a broader effort to automate the integration of archival data (genome sequence, but also associated RNA sequence data and variant calls) within the context of reference genomes and make it available through the Ensembl user interfaces. PMID:26578574

  3. Ensembl genomes 2016: more genomes, more complexity

    Technology Transfer Automated Retrieval System (TEKTRAN)

    Ensembl Genomes (http://www.ensemblgenomes.org) is an integrating resource for genome-scale data from non-vertebrate species, complementing the resources for vertebrate genomics developed in the context of the Ensembl project (http://www.ensembl.org). Together, the two resources provide a consistent...

  4. Ensembl Genomes 2016: more genomes, more complexity

    PubMed Central

    Kersey, Paul Julian; Allen, James E.; Armean, Irina; Boddu, Sanjay; Bolt, Bruce J.; Carvalho-Silva, Denise; Christensen, Mikkel; Davis, Paul; Falin, Lee J.; Grabmueller, Christoph; Humphrey, Jay; Kerhornou, Arnaud; Khobova, Julia; Aranganathan, Naveen K.; Langridge, Nicholas; Lowy, Ernesto; McDowall, Mark D.; Maheswari, Uma; Nuhn, Michael; Ong, Chuang Kee; Overduin, Bert; Paulini, Michael; Pedro, Helder; Perry, Emily; Spudich, Giulietta; Tapanari, Electra; Walts, Brandon; Williams, Gareth; Tello–Ruiz, Marcela; Stein, Joshua; Wei, Sharon; Ware, Doreen; Bolser, Daniel M.; Howe, Kevin L.; Kulesha, Eugene; Lawson, Daniel; Maslen, Gareth; Staines, Daniel M.

    2016-01-01

    Ensembl Genomes (http://www.ensemblgenomes.org) is an integrating resource for genome-scale data from non-vertebrate species, complementing the resources for vertebrate genomics developed in the context of the Ensembl project (http://www.ensembl.org). Together, the two resources provide a consistent set of programmatic and interactive interfaces to a rich range of data including reference sequence, gene models, transcriptional data, genetic variation and comparative analysis. This paper provides an update to the previous publications about the resource, with a focus on recent developments. These include the development of new analyses and views to represent polyploid genomes (of which bread wheat is the primary exemplar); and the continued up-scaling of the resource, which now includes over 23 000 bacterial genomes, 400 fungal genomes and 100 protist genomes, in addition to 55 genomes from invertebrate metazoa and 39 genomes from plants. This dramatic increase in the number of included genomes is one part of a broader effort to automate the integration of archival data (genome sequence, but also associated RNA sequence data and variant calls) within the context of reference genomes and make it available through the Ensembl user interfaces. PMID:26578574

  5. Gene family analysis of the Arabidopsis pollen transcriptome reveals biological implications for cell growth, division control, and gene expression regulation.

    PubMed

    Pina, Cristina; Pinto, Francisco; Feijó, José A; Becker, Jörg D

    2005-06-01

    Upon germination, pollen forms a tube that elongates dramatically through female tissues to reach and fertilize ovules. While essential for the life cycle of higher plants, the genetic basis underlying most of the process is not well understood. We previously used a combination of flow cytometry sorting of viable hydrated pollen grains and GeneChip array analysis of one-third of the Arabidopsis (Arabidopsis thaliana) genome to define a first overview of the pollen transcriptome. We now extend that study to approximately 80% of the genome of Arabidopsis by using Affymetrix Arabidopsis ATH1 arrays and perform comparative analysis of gene family and gene ontology representation in the transcriptome of pollen and vegetative tissues. Pollen grains have a smaller and overall unique transcriptome (6,587 genes expressed) with greater proportions of selectively expressed (11%) and enriched (26%) genes than any vegetative tissue. Relative gene ontology category representations in pollen and vegetative tissues reveal a functional skew of the pollen transcriptome toward signaling, vesicle transport, and the cytoskeleton, suggestive of a commitment to germination and tube growth. Cell cycle analysis reveals an accumulation of G2/M-associated factors that may play a role in the first mitotic division of the zygote. Despite the relative underrepresentation of transcription-associated transcripts, nonclassical MADS box genes emerge as a class with putative unique roles in pollen. The singularity of gene expression control in mature pollen grains is further highlighted by the apparent absence of small RNA pathway components. PMID:15908605

  6. A Global Survey of Gene Regulation during Cold Acclimation in Arabidopsis thaliana

    PubMed Central

    Hannah, Matthew A; Heyer, Arnd G; Hincha, Dirk K

    2005-01-01

    Many temperate plant species such as Arabidopsis thaliana are able to increase their freezing tolerance when exposed to low, nonfreezing temperatures in a process called cold acclimation. This process is accompanied by complex changes in gene expression. Previous studies have investigated these changes but have mainly focused on individual or small groups of genes. We present a comprehensive statistical analysis of the genome-wide changes of gene expression in response to 14 d of cold acclimation in Arabidopsis, and provide a large-scale validation of these data by comparing datasets obtained for the Affymetrix ATH1 Genechip and MWG 50-mer oligonucleotide whole-genome microarrays. We combine these datasets with existing published and publicly available data investigating Arabidopsis gene expression in response to low temperature. All data are integrated into a database detailing the cold responsiveness of 22,043 genes as a function of time of exposure at low temperature. We concentrate our functional analysis on global changes marking relevant pathways or functional groups of genes. These analyses provide a statistical basis for many previously reported changes, identify so far unreported changes, and show which processes predominate during different times of cold acclimation. This approach offers the fullest characterization of global changes in gene expression in response to low temperature available to date. PMID:16121258

  7. Genomic analysis of gum disease and hypertrichosis in foxes.

    PubMed

    Clark, J-A B J; Whalen, D; Marshall, H D

    2016-01-01

    Since the 1940s, a proliferative gingival disease called hereditary hyperplastic gingivitis (HHG) has been described in the farmed silver fox, Vulpes vulpes (Dyrendahl and Henricson 1960). HHG displays an autosomal recessive transmission and has a pleiotropic relationship with superior fur quality in terms of length and thickness of guard hairs. An analogous human disease, hereditary gingival fibromatosis (HGF), is characterized by a predominantly autosomal dominant transmission and a complex etiology, occurring either as an isolated condition or as a part of a syndrome. Similar to HHG, the symptom most commonly associated with syndromic HGF is hypertrichosis. Here we explore potential mechanisms involved in HHG by comparison to known genetic information about hypertrichosis co-occurring with HGF, using an Affymetrix canine genome microarray platform, quantitative PCR, and candidate gene sequencing. We conclude that the mitogen-activated protein kinase pathway is involved in HHG, however despite involvement of the mitogen-activated protein kinase kinase 6 gene in congenital hypertrichosis with gingival fibromatosis in humans, this gene did not contain any fixed mutations in exons or exon-intron boundaries in HHG-affected foxes, suggesting that it is not causative of HHG in the farmed silver fox population. Differential up-regulation of MAP2K6 gene in HHG-affected foxes does implicate this gene in the HHG phenotype. PMID:27323055

  8. Rapid Identification of Potential Drugs for Diabetic Nephropathy Using Whole-Genome Expression Profiles of Glomeruli

    PubMed Central

    Shi, Jingsong; Jiang, Song; Qiu, Dandan; Le, Weibo; Wang, Xiao; Lu, Yinhui; Liu, Zhihong

    2016-01-01

    Objective. To investigate potential drugs for diabetic nephropathy (DN) using whole-genome expression profiles and the Connectivity Map (CMAP). Methodology. Eighteen Chinese Han DN patients and six normal controls were included in this study. Whole-genome expression profiles of microdissected glomeruli were measured using the Affymetrix human U133 plus 2.0 chip. Differentially expressed genes (DEGs) between late stage and early stage DN samples and the CMAP database were used to identify potential drugs for DN using bioinformatics methods. Results. (1) A total of 1065 DEGs (FDR < 0.05 and fold change > 1.5) were found in late stage DN patients compared with early stage DN patients. (2) Piperlongumine, 15d-PGJ2 (15-delta prostaglandin J2), vorinostat, and trichostatin A were predicted to be the most promising potential drugs for DN, acting as NF-κB inhibitors, histone deacetylase inhibitors (HDACIs), PI3K pathway inhibitors, or PPARγ agonists, respectively. Conclusion. Using whole-genome expression profiles and the CMAP database, we rapidly predicted potential DN drugs, and therapeutic potential was confirmed by previously published studies. Animal experiments and clinical trials are needed to confirm both the safety and efficacy of these drugs in the treatment of DN. PMID:27069916

  9. Soybean Knowledge Base (SoyKB): a Web Resource for Soybean Translational Genomics

    SciTech Connect

    Joshi, Trupti; Patil, Kapil; Fitzpatrick, Michael R.; Franklin, Levi D.; Yao, Qiuming; Cook, Jeffrey R.; Wang, Zhem; Libault, Marc; Brechenmacher, Laurent; Valliyodan, Babu; Wu, Xiaolei; Cheng, Jianlin; Stacey, Gary; Nguyen, Henry T.; Xu, Dong

    2012-01-17

    Background: Soybean Knowledge Base (SoyKB) is a comprehensive all-inclusive web resource for soybean translational genomics. SoyKB is designed to handle the management and integration of soybean genomics, transcriptomics, proteomics and metabolomics data along with annotation of gene function and biological pathway. It contains information on four entities, namely genes, microRNAs, metabolites and single nucleotide polymorphisms (SNPs). Methods: SoyKB has many useful tools such as Affymetrix probe ID search, gene family search, multiple gene/ metabolite search supporting co-expression analysis, and protein 3D structure viewer as well as download and upload capacity for experimental data and annotations. It has four tiers of registration, which control different levels of access to public and private data. It allows users of certain levels to share their expertise by adding comments to the data. It has a user-friendly web interface together with genome browser and pathway viewer, which display data in an intuitive manner to the soybean researchers, producers and consumers. Conclusions: SoyKB addresses the increasing need of the soybean research community to have a one-stop-shop functional and translational omics web resource for information retrieval and analysis in a user-friendly way. SoyKB can be publicly accessed at http://soykb.org/.

  10. Assessment of bagging GBLUP for whole-genome prediction of broiler chicken traits.

    PubMed

    Abdollahi-Arpanahi, R; Morota, G; Valente, B D; Kranis, A; Rosa, G J M; Gianola, D

    2015-06-01

    Bootstrap aggregation (bagging) is a resampling method known to produce more accurate predictions when predictors are unstable or when the number of markers is much larger than sample size, because of variance reduction capabilities. The purpose of this study was to compare genomic best linear unbiased prediction (GBLUP) with bootstrap aggregated sampling GBLUP (Bagged GBLUP, or BGBLUP) in terms of prediction accuracy. We used a 600 K Affymetrix platform with 1351 birds genotyped and phenotyped for three traits in broiler chickens; body weight, ultrasound measurement of breast muscle and hen house egg production. The predictive performance of GBLUP versus BGBLUP was evaluated in different scenarios consisting of including or excluding the TOP 20 markers from a standard genome-wide association study (GWAS) as fixed effects in the GBLUP model, and varying training sample sizes and allelic frequency bins. Predictive performance was assessed via five replications of a threefold cross-validation using the correlation between observed and predicted values, and prediction mean-squared error. GBLUP overfitted the training set data, and BGBLUP delivered a better predictive ability in testing sets. Treating the TOP 20 markers from the GWAS into the model as fixed effects improved prediction accuracy and added advantages to BGBLUP over GBLUP. The performance of GBLUP and BGBLUP at different allele frequency bins and training sample sizes was similar. In general, results of this study confirm that BGBLUP can be valuable for enhancing genome-enabled prediction of complex traits. PMID:25727456

  11. Genome-Wide Methylation Analysis of Prostate Tissues Reveals Global Methylation Patterns of Prostate Cancer

    PubMed Central

    Luo, Jian-Hua; Ding, Ying; Chen, Rui; Michalopoulos, George; Nelson, Joel; Tseng, George; Yu, Yan P.

    2014-01-01

    Altered genome methylation is a hallmark of human malignancies. In this study, high-throughput analyses of concordant gene methylation and expression events were performed for 91 human prostate specimens, including prostate tumor (T), matched normal adjacent to tumor (AT), and organ donor (OD). Methylated DNA in genomic DNA was immunoprecipitated with anti-methylcytidine antibodies and detected by Affymetrix human whole genome SNP 6.0 chips. Among the methylated CpG islands, 11,481 islands were found located in the promoter and exon 1 regions of 9295 genes. Genes (7641) were methylated frequently across OD, AT, and T samples, whereas 239 genes were differentially methylated in only T and 785 genes in both AT and T but not OD. Genes with promoter methylation and concordantly suppressed expression were identified. Pathway analysis suggested that many of the methylated genes in T and AT are involved in cell growth and mitogenesis. Classification analysis of the differentially methylated genes in T or OD produced a specificity of 89.4% and a sensitivity of 85.7%. The T and AT groups, however, were only slightly separated by the prediction analysis, indicating a strong field effect. A gene methylation prediction model was shown to predict prostate cancer relapse with sensitivity of 80.0% and specificity of 85.0%. These results suggest methylation patterns useful in predicting clinical outcomes of prostate cancer. PMID:23583283

  12. Classification and Subtype Prediction of Adult Soft Tissue Sarcoma by Functional Genomics

    PubMed Central

    Segal, Neil H.; Pavlidis, Paul; Antonescu, Cristina R.; Maki, Robert G.; Noble, William S.; DeSantis, Diann; Woodruff, James M.; Lewis, Jonathan J.; Brennan, Murray F.; Houghton, Alan N.; Cordon-Cardo, Carlos

    2003-01-01

    Adult soft tissue sarcomas are a heterogeneous group of tumors, including well-described subtypes by histological and genotypic criteria, and pleomorphic tumors typically characterized by non-recurrent genetic aberrations and karyotypic heterogeneity. The latter pose a diagnostic challenge, even to experienced pathologists. We proposed that gene expression profiling in soft tissue sarcoma would identify a genomic-based classification scheme that is useful in diagnosis. RNA samples from 51 pathologically confirmed cases, representing nine different histological subtypes of adult soft tissue sarcoma, were examined using the Affymetrix U95A GeneChip. Statistical tests were performed on experimental groups identified by cluster analysis, to find discriminating genes that could subsequently be applied in a support vector machine algorithm. Synovial sarcomas, round-cell/myxoid liposarcomas, clear-cell sarcomas and gastrointestinal stromal tumors displayed remarkably distinct and homogenous gene expression profiles. Pleomorphic tumors were heterogeneous. Notably, a subset of malignant fibrous histiocytomas, a controversialhistological subtype, was identified as a distinct genomic group. The support vector machine algorithm supported a genomic basis for diagnosis, with both high sensitivity and specificity. In conclusion, we showed gene expression profiling to be useful in classification and diagnosis, providing insights into pathogenesis and pointing to potential new therapeutic targets of soft tissue sarcoma. PMID:12875988

  13. Genome-wide patterns of population structure and admixture in West Africans and African Americans.

    PubMed

    Bryc, Katarzyna; Auton, Adam; Nelson, Matthew R; Oksenberg, Jorge R; Hauser, Stephen L; Williams, Scott; Froment, Alain; Bodo, Jean-Marie; Wambebe, Charles; Tishkoff, Sarah A; Bustamante, Carlos D

    2010-01-12

    Quantifying patterns of population structure in Africans and African Americans illuminates the history of human populations and is critical for undertaking medical genomic studies on a global scale. To obtain a fine-scale genome-wide perspective of ancestry, we analyze Affymetrix GeneChip 500K genotype data from African Americans (n = 365) and individuals with ancestry from West Africa (n = 203 from 12 populations) and Europe (n = 400 from 42 countries). We find that population structure within the West African sample reflects primarily language and secondarily geographical distance, echoing the Bantu expansion. Among African Americans, analysis of genomic admixture by a principal component-based approach indicates that the median proportion of European ancestry is 18.5% (25th-75th percentiles: 11.6-27.7%), with very large variation among individuals. In the African-American sample as a whole, few autosomal regions showed exceptionally high or low mean African ancestry, but the X chromosome showed elevated levels of African ancestry, consistent with a sex-biased pattern of gene flow with an excess of European male and African female ancestry. We also find that genomic profiles of individual African Americans afford personalized ancestry reconstructions differentiating ancient vs. recent European and African ancestry. Finally, patterns of genetic similarity among inferred African segments of African-American genomes and genomes of contemporary African populations included in this study suggest African ancestry is most similar to non-Bantu Niger-Kordofanian-speaking populations, consistent with historical documents of the African Diaspora and trans-Atlantic slave trade. PMID:20080753

  14. Funding Opportunity: Genomic Data Centers

    Cancer.gov

    Funding Opportunity CCG, Funding Opportunity Center for Cancer Genomics, CCG, Center for Cancer Genomics, CCG RFA, Center for cancer genomics rfa, genomic data analysis network, genomic data analysis network centers,

  15. High-resolution genomic profiling of chronic lymphocytic leukemia reveals new recurrent genomic alterations.

    PubMed

    Edelmann, Jennifer; Holzmann, Karlheinz; Miller, Florian; Winkler, Dirk; Bühler, Andreas; Zenz, Thorsten; Bullinger, Lars; Kühn, Michael W M; Gerhardinger, Andreas; Bloehdorn, Johannes; Radtke, Ina; Su, Xiaoping; Ma, Jing; Pounds, Stanley; Hallek, Michael; Lichter, Peter; Korbel, Jan; Busch, Raymonde; Mertens, Daniel; Downing, James R; Stilgenbauer, Stephan; Döhner, Hartmut

    2012-12-01

    To identify genomic alterations in chronic lymphocytic leukemia (CLL), we performed single-nucleotide polymorphism-array analysis using Affymetrix Version 6.0 on 353 samples from untreated patients entered in the CLL8 treatment trial. Based on paired-sample analysis (n = 144), a mean of 1.8 copy number alterations per patient were identified; approximately 60% of patients carried no copy number alterations other than those detected by fluorescence in situ hybridization analysis. Copy-neutral loss-of-heterozygosity was detected in 6% of CLL patients and was found most frequently on 13q, 17p, and 11q. Minimally deleted regions were refined on 13q14 (deleted in 61% of patients) to the DLEU1 and DLEU2 genes, on 11q22.3 (27% of patients) to ATM, on 2p16.1-2p15 (gained in 7% of patients) to a 1.9-Mb fragment containing 9 genes, and on 8q24.21 (5% of patients) to a segment 486 kb proximal to the MYC locus. 13q deletions exhibited proximal and distal breakpoint cluster regions. Among the most common novel lesions were deletions at 15q15.1 (4% of patients), with the smallest deletion (70.48 kb) found in the MGA locus. Sequence analysis of MGA in 59 samples revealed a truncating mutation in one CLL patient lacking a 15q deletion. MNT at 17p13.3, which in addition to MGA and MYC encodes for the network of MAX-interacting proteins, was also deleted recurrently. PMID:23047824

  16. Enabling functional genomics with genome engineering

    PubMed Central

    Hilton, Isaac B.; Gersbach, Charles A.

    2015-01-01

    Advances in genome engineering technologies have made the precise control over genome sequence and regulation possible across a variety of disciplines. These tools can expand our understanding of fundamental biological processes and create new opportunities for therapeutic designs. The rapid evolution of these methods has also catalyzed a new era of genomics that includes multiple approaches to functionally characterize and manipulate the regulation of genomic information. Here, we review the recent advances of the most widely adopted genome engineering platforms and their application to functional genomics. This includes engineered zinc finger proteins, TALEs/TALENs, and the CRISPR/Cas9 system as nucleases for genome editing, transcription factors for epigenome editing, and other emerging applications. We also present current and potential future applications of these tools, as well as their current limitations and areas for future advances. PMID:26430154

  17. Exploring Other Genomes: Bacteria.

    ERIC Educational Resources Information Center

    Flannery, Maura C.

    2001-01-01

    Points out the importance of genomes other than the human genome project and provides information on the identified bacterial genomes Pseudomonas aeuroginosa, Leprosy, Cholera, Meningitis, Tuberculosis, Bubonic Plague, and plant pathogens. Considers the computer's use in genome studies. (Contains 14 references.) (YDS)

  18. Navigating yeast genome maintenance with functional genomics.

    PubMed

    Measday, Vivien; Stirling, Peter C

    2016-03-01

    Maintenance of genome integrity is a fundamental requirement of all organisms. To address this, organisms have evolved extremely faithful modes of replication, DNA repair and chromosome segregation to combat the deleterious effects of an unstable genome. Nonetheless, a small amount of genome instability is the driver of evolutionary change and adaptation, and thus a low level of instability is permitted in populations. While defects in genome maintenance almost invariably reduce fitness in the short term, they can create an environment where beneficial mutations are more likely to occur. The importance of this fact is clearest in the development of human cancer, where genome instability is a well-established enabling characteristic of carcinogenesis. This raises the crucial question: what are the cellular pathways that promote genome maintenance and what are their mechanisms? Work in model organisms, in particular the yeast Saccharomyces cerevisiae, has provided the global foundations of genome maintenance mechanisms in eukaryotes. The development of pioneering genomic tools inS. cerevisiae, such as the systematic creation of mutants in all nonessential and essential genes, has enabled whole-genome approaches to identifying genes with roles in genome maintenance. Here, we review the extensive whole-genome approaches taken in yeast, with an emphasis on functional genomic screens, to understand the genetic basis of genome instability, highlighting a range of genetic and cytological screening modalities. By revealing the biological pathways and processes regulating genome integrity, these analyses contribute to the systems-level map of the yeast cell and inform studies of human disease, especially cancer. PMID:26323482

  19. Genome Maps, a new generation genome browser.

    PubMed

    Medina, Ignacio; Salavert, Francisco; Sanchez, Rubén; de Maria, Alejandro; Alonso, Roberto; Escobar, Pablo; Bleda, Marta; Dopazo, Joaquín

    2013-07-01

    Genome browsers have gained importance as more genomes and related genomic information become available. However, the increase of information brought about by new generation sequencing technologies is, at the same time, causing a subtle but continuous decrease in the efficiency of conventional genome browsers. Here, we present Genome Maps, a genome browser that implements an innovative model of data transfer and management. The program uses highly efficient technologies from the new HTML5 standard, such as scalable vector graphics, that optimize workloads at both server and client sides and ensure future scalability. Thus, data management and representation are entirely carried out by the browser, without the need of any Java Applet, Flash or other plug-in technology installation. Relevant biological data on genes, transcripts, exons, regulatory features, single-nucleotide polymorphisms, karyotype and so forth, are imported from web services and are available as tracks. In addition, several DAS servers are already included in Genome Maps. As a novelty, this web-based genome browser allows the local upload of huge genomic data files (e.g. VCF or BAM) that can be dynamically visualized in real time at the client side, thus facilitating the management of medical data affected by privacy restrictions. Finally, Genome Maps can easily be integrated in any web application by including only a few lines of code. Genome Maps is an open source collaborative initiative available in the GitHub repository (https://github.com/compbio-bigdata-viz/genome-maps). Genome Maps is available at: http://www.genomemaps.org. PMID:23748955

  20. Genome Maps, a new generation genome browser

    PubMed Central

    Medina, Ignacio; Salavert, Francisco; Sanchez, Rubén; de Maria, Alejandro; Alonso, Roberto; Escobar, Pablo; Bleda, Marta; Dopazo, Joaquín

    2013-01-01

    Genome browsers have gained importance as more genomes and related genomic information become available. However, the increase of information brought about by new generation sequencing technologies is, at the same time, causing a subtle but continuous decrease in the efficiency of conventional genome browsers. Here, we present Genome Maps, a genome browser that implements an innovative model of data transfer and management. The program uses highly efficient technologies from the new HTML5 standard, such as scalable vector graphics, that optimize workloads at both server and client sides and ensure future scalability. Thus, data management and representation are entirely carried out by the browser, without the need of any Java Applet, Flash or other plug-in technology installation. Relevant biological data on genes, transcripts, exons, regulatory features, single-nucleotide polymorphisms, karyotype and so forth, are imported from web services and are available as tracks. In addition, several DAS servers are already included in Genome Maps. As a novelty, this web-based genome browser allows the local upload of huge genomic data files (e.g. VCF or BAM) that can be dynamically visualized in real time at the client side, thus facilitating the management of medical data affected by privacy restrictions. Finally, Genome Maps can easily be integrated in any web application by including only a few lines of code. Genome Maps is an open source collaborative initiative available in the GitHub repository (https://github.com/compbio-bigdata-viz/genome-maps). Genome Maps is available at: http://www.genomemaps.org. PMID:23748955

  1. High-resolution genomic analysis suggests the absence of recurrent genomic alterations other than SMARCB1 aberrations in atypical teratoid/rhabdoid tumors.

    PubMed

    Hasselblatt, Martin; Isken, Sarah; Linge, Anna; Eikmeier, Kristin; Jeibmann, Astrid; Oyen, Florian; Nagel, Inga; Richter, Julia; Bartelheim, Kerstin; Kordes, Uwe; Schneppenheim, Reinhard; Frühwald, Michael; Siebert, Reiner; Paulus, Werner

    2013-02-01

    Atypical teratoid/rhabdoid tumor (AT/RT) is a rare malignant pediatric brain tumor characterized by genetic alterations affecting the SMARCB1 (hSNF5/INI1) locus in chromosome band 22q11.2. To identify potential additional genetic alterations, high-resolution genome-wide analysis was performed using a molecular inversion probe single-nucleotide polymorphism (MIP SNP) assay (Affymetrix OncoScan formalin-fixed paraffin-embedded express) on DNA isolated from 18 formalin-fixed paraffin-embedded archival samples. Alterations affecting the SMARCB1 locus could be demonstrated by MIP SNP in 15 out of 16 evaluable cases (94%). These comprised five tumors with homozygous deletions, six tumors with heterozygous deletions, and four tumors with copy number neutral loss of heterozygosity (LOH) involving chromosome band 22q11.2. Remarkably, MIB SNP analysis did not yield any further recurrent chromosomal gains, losses, or copy neutral LOH. On MIP SNP screening for somatic mutations, the presence of a SMARCB1 mutation (c.472C>T p.R158X) was confirmed, but no recurrent mutations of other cancer relevant genes could be identified. Results of fluorescence in situ hybridization, multiplex ligation-dependent probe amplification, and SMARCB1 sequencing were highly congruent with that of the MIP SNP assay. In conclusion, these data further suggest the absence of recurrent genomic alterations other than SMARCB1 in AT/RT. PMID:23074045

  2. Genomic Analysis of Stress Response against Arsenic in Caenorhabditis elegans

    PubMed Central

    Sahu, Surasri N.; Lewis, Jada; Patel, Isha; Bozdag, Serdar; Lee, Jeong H.; Sprando, Robert; Cinar, Hediye Nese

    2013-01-01

    Arsenic, a known human carcinogen, is widely distributed around the world and found in particularly high concentrations in certain regions including Southwestern US, Eastern Europe, India, China, Taiwan and Mexico. Chronic arsenic poisoning affects millions of people worldwide and is associated with increased risk of many diseases including arthrosclerosis, diabetes and cancer. In this study, we explored genome level global responses to high and low levels of arsenic exposure in Caenorhabditis elegans using Affymetrix expression microarrays. This experimental design allows us to do microarray analysis of dose-response relationships of global gene expression patterns. High dose (0.03%) exposure caused stronger global gene expression changes in comparison with low dose (0.003%) exposure, suggesting a positive dose-response correlation. Biological processes such as oxidative stress, and iron metabolism, which were previously reported to be involved in arsenic toxicity studies using cultured cells, experimental animals, and humans, were found to be affected in C. elegans. We performed genome-wide gene expression comparisons between our microarray data and publicly available C. elegans microarray datasets of cadmium, and sediment exposure samples of German rivers Rhine and Elbe. Bioinformatics analysis of arsenic-responsive regulatory networks were done using FastMEDUSA program. FastMEDUSA analysis identified cancer-related genes, particularly genes associated with leukemia, such as dnj-11, which encodes a protein orthologous to the mammalian ZRF1/MIDA1/MPP11/DNAJC2 family of ribosome-associated molecular chaperones. We analyzed the protective functions of several of the identified genes using RNAi. Our study indicates that C. elegans could be a substitute model to study the mechanism of metal toxicity using high-throughput expression data and bioinformatics tools such as FastMEDUSA. PMID:23894281

  3. Genome-Wide Association Studies for Comb Traits in Chickens

    PubMed Central

    Ma, Meng; Dou, Taocun; Lu, Jian; Guo, Jun; Hu, Yuping; Yi, Guoqiang; Yuan, Jingwei; Sun, Congjiao; Wang, Kehua; Yang, Ning

    2016-01-01

    The comb, as a secondary sexual character, is an important trait in chicken. Indicators of comb length (CL), comb height (CH), and comb weight (CW) are often selected in production. DNA-based marker-assisted selection could help chicken breeders to accelerate genetic improvement for comb or related economic characters by early selection. Although a number of quantitative trait loci (QTL) and candidate genes have been identified with advances in molecular genetics, candidate genes underlying comb traits are limited. The aim of the study was to use genome-wide association (GWA) studies by 600 K Affymetrix chicken SNP arrays to detect genes that are related to comb, using an F2 resource population. For all comb characters, comb exhibited high SNP-based heritability estimates (0.61–0.69). Chromosome 1 explained 20.80% genetic variance, while chromosome 4 explained 6.89%. Independent univariate genome-wide screens for each character identified 127, 197, and 268 novel significant SNPs with CL, CH, and CW, respectively. Three candidate genes, VPS36, AR, and WNT11B, were determined to have a plausible function in all comb characters. These genes are important to the initiation of follicle development, gonadal growth, and dermal development, respectively. The current study provides the first GWA analysis for comb traits. Identification of the genetic basis as well as promising candidate genes will help us understand the underlying genetic architecture of comb development and has practical significance in breeding programs for the selection of comb as an index for sexual maturity or reproduction. PMID:27427764

  4. A whole genome association study on meat palatability in hanwoo.

    PubMed

    Hyeong, K-E; Lee, Y-M; Kim, Y-S; Nam, K C; Jo, C; Lee, K-H; Lee, J-E; Kim, J-J

    2014-09-01

    A whole genome association (WGA) study was carried out to find quantitative trait loci (QTL) for sensory evaluation traits in Hanwoo. Carcass samples of 250 Hanwoo steers were collected from National Agricultural Cooperative Livestock Research Institute, Ansung, Gyeonggi province, Korea, between 2011 and 2012 and genotyped with the Affymetrix Bovine Axiom Array 640K single nucleotide polymorphism (SNP) chip. Among the SNPs in the chip, a total of 322,160 SNPs were chosen after quality control tests. After adjusting for the effects of age, slaughter-year-season, and polygenic effects using genome relationship matrix, the corrected phenotypes for the sensory evaluation measurements were regressed on each SNP using a simple linear regression additive based model. A total of 1,631 SNPs were detected for color, aroma, tenderness, juiciness and palatability at 0.1% comparison-wise level. Among the significant SNPs, the best set of 52 SNP markers were chosen using a forward regression procedure at 0.05 level, among which the sets of 8, 14, 11, 10, and 9 SNPs were determined for the respectively sensory evaluation traits. The sets of significant SNPs explained 18% to 31% of phenotypic variance. Three SNPs were pleiotropic, i.e. AX-26703353 and AX-26742891 that were located at 101 and 110 Mb of BTA6, respectively, influencing tenderness, juiciness and palatability, while AX-18624743 at 3 Mb of BTA10 affected tenderness and palatability. Our results suggest that some QTL for sensory measures are segregating in a Hanwoo steer population. Additional WGA studies on fatty acid and nutritional components as well as the sensory panels are in process to characterize genetic architecture of meat quality and palatability in Hanwoo. PMID:25178363

  5. Genome-Wide Association Studies for Comb Traits in Chickens.

    PubMed

    Shen, Manman; Qu, Liang; Ma, Meng; Dou, Taocun; Lu, Jian; Guo, Jun; Hu, Yuping; Yi, Guoqiang; Yuan, Jingwei; Sun, Congjiao; Wang, Kehua; Yang, Ning

    2016-01-01

    The comb, as a secondary sexual character, is an important trait in chicken. Indicators of comb length (CL), comb height (CH), and comb weight (CW) are often selected in production. DNA-based marker-assisted selection could help chicken breeders to accelerate genetic improvement for comb or related economic characters by early selection. Although a number of quantitative trait loci (QTL) and candidate genes have been identified with advances in molecular genetics, candidate genes underlying comb traits are limited. The aim of the study was to use genome-wide association (GWA) studies by 600 K Affymetrix chicken SNP arrays to detect genes that are related to comb, using an F2 resource population. For all comb characters, comb exhibited high SNP-based heritability estimates (0.61-0.69). Chromosome 1 explained 20.80% genetic variance, while chromosome 4 explained 6.89%. Independent univariate genome-wide screens for each character identified 127, 197, and 268 novel significant SNPs with CL, CH, and CW, respectively. Three candidate genes, VPS36, AR, and WNT11B, were determined to have a plausible function in all comb characters. These genes are important to the initiation of follicle development, gonadal growth, and dermal development, respectively. The current study provides the first GWA analysis for comb traits. Identification of the genetic basis as well as promising candidate genes will help us understand the underlying genetic architecture of comb development and has practical significance in breeding programs for the selection of comb as an index for sexual maturity or reproduction. PMID:27427764

  6. A Whole Genome Association Study on Meat Palatability in Hanwoo

    PubMed Central

    Hyeong, K.-E.; Lee, Y.-M.; Kim, Y.-S.; Nam, K. C.; Jo, C.; Lee, K.-H.; Lee, J.-E.; Kim, J.-J.

    2014-01-01

    A whole genome association (WGA) study was carried out to find quantitative trait loci (QTL) for sensory evaluation traits in Hanwoo. Carcass samples of 250 Hanwoo steers were collected from National Agricultural Cooperative Livestock Research Institute, Ansung, Gyeonggi province, Korea, between 2011 and 2012 and genotyped with the Affymetrix Bovine Axiom Array 640K single nucleotide polymorphism (SNP) chip. Among the SNPs in the chip, a total of 322,160 SNPs were chosen after quality control tests. After adjusting for the effects of age, slaughter-year-season, and polygenic effects using genome relationship matrix, the corrected phenotypes for the sensory evaluation measurements were regressed on each SNP using a simple linear regression additive based model. A total of 1,631 SNPs were detected for color, aroma, tenderness, juiciness and palatability at 0.1% comparison-wise level. Among the significant SNPs, the best set of 52 SNP markers were chosen using a forward regression procedure at 0.05 level, among which the sets of 8, 14, 11, 10, and 9 SNPs were determined for the respectively sensory evaluation traits. The sets of significant SNPs explained 18% to 31% of phenotypic variance. Three SNPs were pleiotropic, i.e. AX-26703353 and AX-26742891 that were located at 101 and 110 Mb of BTA6, respectively, influencing tenderness, juiciness and palatability, while AX-18624743 at 3 Mb of BTA10 affected tenderness and palatability. Our results suggest that some QTL for sensory measures are segregating in a Hanwoo steer population. Additional WGA studies on fatty acid and nutritional components as well as the sensory panels are in process to characterize genetic architecture of meat quality and palatability in Hanwoo. PMID:25178363

  7. Genomic analysis of stress response against arsenic in Caenorhabditis elegans.

    PubMed

    Sahu, Surasri N; Lewis, Jada; Patel, Isha; Bozdag, Serdar; Lee, Jeong H; Sprando, Robert; Cinar, Hediye Nese

    2013-01-01

    Arsenic, a known human carcinogen, is widely distributed around the world and found in particularly high concentrations in certain regions including Southwestern US, Eastern Europe, India, China, Taiwan and Mexico. Chronic arsenic poisoning affects millions of people worldwide and is associated with increased risk of many diseases including arthrosclerosis, diabetes and cancer. In this study, we explored genome level global responses to high and low levels of arsenic exposure in Caenorhabditis elegans using Affymetrix expression microarrays. This experimental design allows us to do microarray analysis of dose-response relationships of global gene expression patterns. High dose (0.03%) exposure caused stronger global gene expression changes in comparison with low dose (0.003%) exposure, suggesting a positive dose-response correlation. Biological processes such as oxidative stress, and iron metabolism, which were previously reported to be involved in arsenic toxicity studies using cultured cells, experimental animals, and humans, were found to be affected in C. elegans. We performed genome-wide gene expression comparisons between our microarray data and publicly available C. elegans microarray datasets of cadmium, and sediment exposure samples of German rivers Rhine and Elbe. Bioinformatics analysis of arsenic-responsive regulatory networks were done using FastMEDUSA program. FastMEDUSA analysis identified cancer-related genes, particularly genes associated with leukemia, such as dnj-11, which encodes a protein orthologous to the mammalian ZRF1/MIDA1/MPP11/DNAJC2 family of ribosome-associated molecular chaperones. We analyzed the protective functions of several of the identified genes using RNAi. Our study indicates that C. elegans could be a substitute model to study the mechanism of metal toxicity using high-throughput expression data and bioinformatics tools such as FastMEDUSA. PMID:23894281

  8. The First Pilot Genome-Wide Gene-Environment Study of Depression in the Japanese Population

    PubMed Central

    Otowa, Takeshi; Kawamura, Yoshiya; Tsutsumi, Akizumi; Kawakami, Norito; Kan, Chiemi; Shimada, Takafumi; Umekage, Tadashi; Kasai, Kiyoto; Tokunaga, Katsushi; Sasaki, Tsukasa

    2016-01-01

    Stressful events have been identified as a risk factor for depression. Although gene–environment (G × E) interaction in a limited number of candidate genes has been explored, no genome-wide search has been reported. The aim of the present study is to identify genes that influence the association of stressful events with depression. Therefore, we performed a genome-wide G × E interaction analysis in the Japanese population. A genome-wide screen with 320 subjects was performed using the Affymetrix Genome-Wide Human Array 6.0. Stressful life events were assessed using the Social Readjustment Rating Scale (SRRS) and depression symptoms were assessed with self-rating questionnaires using the Center for Epidemiologic Studies Depression (CES-D) scale. The p values for interactions between single nucleotide polymorphisms (SNPs) and stressful events were calculated using the linear regression model adjusted for sex and age. After quality control of genotype data, a total of 534,848 SNPs on autosomal chromosomes were further analyzed. Although none surpassed the level of the genome-wide significance, a marginal significant association of interaction between SRRS and rs10510057 with depression were found (p = 4.5 × 10−8). The SNP is located on 10q26 near Regulators of G-protein signaling 10 (RGS10), which encodes a regulatory molecule involved in stress response. When we investigated a similar G × E interaction between depression (K6 scale) and work-related stress in an independent sample (n = 439), a significant G × E effect on depression was observed (p = 0.015). Our findings suggest that rs10510057, interacting with stressors, may be involved in depression risk. Incorporating G × E interaction into GWAS can contribute to find susceptibility locus that are potentially missed by conventional GWAS. PMID:27529621

  9. Acetaminophen-NAPQI Hepatotoxicity: A Cell Line Model System Genome-Wide Association Study

    PubMed Central

    Moyer, Ann M.; Fridley, Brooke L.; Jenkins, Gregory D.; Batzler, Anthony J.; Pelleymounter, Linda L.; Kalari, Krishna R.; Ji, Yuan; Chai, Yubo; Nordgren, Kendra K. S.; Weinshilboum, Richard M.

    2011-01-01

    Acetaminophen is the leading cause of acute hepatic failure in many developed nations. Acetaminophen hepatotoxicity is mediated by the reactive metabolite N-acetyl-p-benzoquinonimine (NAPQI). We performed a “discovery” genome-wide association study using a cell line–based model system to study the possible contribution of genomics to NAPQI-induced cytotoxicity. A total of 176 lymphoblastoid cell lines from healthy subjects were treated with increasing concentrations of NAPQI. Inhibiting concentration 50 values were determined and were associated with “glutathione pathway” gene single nucleotide polymorphisms (SNPs) and genome-wide basal messenger RNA expression, as well as with 1.3 million genome-wide SNPs. A group of SNPs in linkage disequilibrium on chromosome 3 was highly associated with NAPQI toxicity. The p value for rs2880961, the SNP with the lowest p value, was 1.88 × 10−7. This group of SNPs mapped to a “gene desert,” but chromatin immunoprecipitation assays demonstrated binding of several transcription factor proteins including heat shock factor 1 (HSF1) and HSF2, at or near rs2880961. These chromosome 3 SNPs were not significantly associated with variation in basal expression for any of the genome-wide genes represented on the Affymetrix U133 Plus 2.0 GeneChip. We have used a cell line–based model system to identify a SNP signal associated with NAPQI cytotoxicity. If these observations are validated in future clinical studies, this SNP signal might represent a potential biomarker for risk of acetaminophen hepatotoxicity. The mechanisms responsible for this association remain unclear. PMID:21177773

  10. The First Pilot Genome-Wide Gene-Environment Study of Depression in the Japanese Population.

    PubMed

    Otowa, Takeshi; Kawamura, Yoshiya; Tsutsumi, Akizumi; Kawakami, Norito; Kan, Chiemi; Shimada, Takafumi; Umekage, Tadashi; Kasai, Kiyoto; Tokunaga, Katsushi; Sasaki, Tsukasa

    2016-01-01

    Stressful events have been identified as a risk factor for depression. Although gene-environment (G × E) interaction in a limited number of candidate genes has been explored, no genome-wide search has been reported. The aim of the present study is to identify genes that influence the association of stressful events with depression. Therefore, we performed a genome-wide G × E interaction analysis in the Japanese population. A genome-wide screen with 320 subjects was performed using the Affymetrix Genome-Wide Human Array 6.0. Stressful life events were assessed using the Social Readjustment Rating Scale (SRRS) and depression symptoms were assessed with self-rating questionnaires using the Center for Epidemiologic Studies Depression (CES-D) scale. The p values for interactions between single nucleotide polymorphisms (SNPs) and stressful events were calculated using the linear regression model adjusted for sex and age. After quality control of genotype data, a total of 534,848 SNPs on autosomal chromosomes were further analyzed. Although none surpassed the level of the genome-wide significance, a marginal significant association of interaction between SRRS and rs10510057 with depression were found (p = 4.5 × 10-8). The SNP is located on 10q26 near Regulators of G-protein signaling 10 (RGS10), which encodes a regulatory molecule involved in stress response. When we investigated a similar G × E interaction between depression (K6 scale) and work-related stress in an independent sample (n = 439), a significant G × E effect on depression was observed (p = 0.015). Our findings suggest that rs10510057, interacting with stressors, may be involved in depression risk. Incorporating G × E interaction into GWAS can contribute to find susceptibility locus that are potentially missed by conventional GWAS. PMID:27529621

  11. Genomic Encyclopedia of Fungi

    SciTech Connect

    Grigoriev, Igor

    2012-08-10

    Genomes of fungi relevant to energy and environment are in focus of the Fungal Genomic Program at the US Department of Energy Joint Genome Institute (JGI). Its key project, the Genomics Encyclopedia of Fungi, targets fungi related to plant health (symbionts, pathogens, and biocontrol agents) and biorefinery processes (cellulose degradation, sugar fermentation, industrial hosts), and explores fungal diversity by means of genome sequencing and analysis. Over 150 fungal genomes have been sequenced by JGI to date and released through MycoCosm (www.jgi.doe.gov/fungi), a fungal web-portal, which integrates sequence and functional data with genome analysis tools for user community. Sequence analysis supported by functional genomics leads to developing parts list for complex systems ranging from ecosystems of biofuel crops to biorefineries. Recent examples of such parts suggested by comparative genomics and functional analysis in these areas are presented here.

  12. JGI Fungal Genomics Program

    SciTech Connect

    Grigoriev, Igor V.

    2011-03-14

    Genomes of energy and environment fungi are in focus of the Fungal Genomic Program at the US Department of Energy Joint Genome Institute (JGI). Its key project, the Genomics Encyclopedia of Fungi, targets fungi related to plant health (symbionts, pathogens, and biocontrol agents) and biorefinery processes (cellulose degradation, sugar fermentation, industrial hosts), and explores fungal diversity by means of genome sequencing and analysis. Over 50 fungal genomes have been sequenced by JGI to date and released through MycoCosm (www.jgi.doe.gov/fungi), a fungal web-portal, which integrates sequence and functional data with genome analysis tools for user community. Sequence analysis supported by functional genomics leads to developing parts list for complex systems ranging from ecosystems of biofuel crops to biorefineries. Recent examples of such 'parts' suggested by comparative genomics and functional analysis in these areas are presented here

  13. "Replicated" genome wide association for dependence on illegal substances: genomic regions identified by overlapping clusters of nominally positive SNPs.

    PubMed

    Drgon, Tomas; Johnson, Catherine A; Nino, Michelle; Drgonova, Jana; Walther, Donna M; Uhl, George R

    2011-03-01

    Declaring "replication" from results of genome wide association (GWA) studies is straightforward when major gene effects provide genome-wide significance for association of the same allele of the same SNP in each of multiple independent samples. However, such unambiguous replication may be unlikely when phenotypes display polygenic genetic architecture, allelic heterogeneity, locus heterogeneity, and when different samples display linkage disequilibria with different fine structures. We seek chromosomal regions that are tagged by clustered SNPs that display nominally significant association in each of several independent samples. This approach provides one "nontemplate" approach to identifying overall replication of groups of GWA results in the face of difficult genetic architectures. We apply this strategy to 1 million (1M) SNP Affymetrix and Illumina GWA results for dependence on illegal substances. This approach provides high confidence in rejecting the null hypothesis that chance alone accounts for the extent to which clustered, nominally significant SNPs from samples of the same racial/ethnic background identify the same chromosomal regions. There is more modest confidence in: (a) identification of individual chromosomal regions and genes and (b) overlap between results from samples of different racial/ethnic backgrounds. The strong overlap identified among the samples with similar racial/ethnic backgrounds, together with prior work that identified overlapping results in samples of different racial/ethnic backgrounds, support contributions to individual differences in vulnerability to addictions that come from both relatively older allelic variants that are common in many current human populations and newer allelic variants that are common in fewer current human populations. PMID:21302341

  14. Genomics and Health Impact Update

    MedlinePlus

    ... Genomics in Practice Newborn Screening Pharmacogenomics Reproductive Health Tools and Databases About the Genomics & Health Impact Update The Office of Public Health Genomics provides updated and credible ...

  15. Integrating sequence, evolution and functional genomics in regulatory genomics

    PubMed Central

    Vingron, Martin; Brazma, Alvis; Coulson, Richard; van Helden, Jacques; Manke, Thomas; Palin, Kimmo; Sand, Olivier; Ukkonen, Esko

    2009-01-01

    With genome analysis expanding from the study of genes to the study of gene regulation, 'regulatory genomics' utilizes sequence information, evolution and functional genomics measurements to unravel how regulatory information is encoded in the genome. PMID:19226437

  16. Arabidopsis gene expression patterns during spaceflight

    NASA Astrophysics Data System (ADS)

    Paul, A.-L.; Ferl, R. J.

    The exposure of Arabidopsis thaliana (Arabidopsis) plants to spaceflight environments resulted in the differential expression of hundreds of genes. A 5 day mission on orbiter Columbia in 1999 (STS-93) carried transgenic Arabidopsis plants engineered with a transgene composed of the alcohol dehydrogenase (Adh) gene promoter linked to the β -Glucuronidase (GUS) reporter gene. The plants were used to evaluate the effects of spaceflight on two fronts. First, expression patterns visualized with the Adh/GUS transgene were used to address specifically the possibility that spaceflight induces a hypoxic stress response, and to assess whether any spaceflight response was similar to control terrestrial hypoxia-induced gene expression patterns. (Paul et al., Plant Physiol. 2001, 126:613). Second, genome-wide patterns of native gene expression were evaluated utilizing the Affymetrix ATH1 GeneChip? array of 8,000 Arabidopsis genes. As a control for the veracity of the array analyses, a selection of genes identified with the arrays was further characterized with quantitative Real-Time RT PCR (ABI - TaqmanTM). Comparison of the patterns of expression for arrays of hybridized with RNA isolated from plants exposed to spaceflight compared to the control arrays revealed hundreds of genes that were differentially expressed in response to spaceflight, yet most genes that are hallmarks of hypoxic stress were unaffected. These results will be discussed in light of current models for plant responses to the spaceflight environment, and with regard to potential future flight opportunities.

  17. Genomic Data Commons | Office of Cancer Genomics

    Cancer.gov

    The NCI’s Center for Cancer Genomics launches the Genomic Data Commons (GDC), a unified data sharing platform for the cancer research community. The mission of the GDC is to enable data sharing across the entire cancer research community, to ultimately support precision medicine in oncology.

  18. Harvesting rice's dispensable genome.

    PubMed

    Wing, Rod A

    2015-01-01

    A rapid and cost-effective approach has been developed to harvest and map the dispensable genome, that is, population-level natural sequence variation within a species that is not present in static genome assemblies. PMID:26429765

  19. Libraries for genomic SELEX.

    PubMed Central

    Singer, B S; Shtatland, T; Brown, D; Gold, L

    1997-01-01

    An increasing number of proteins are being identified that regulate gene expression by binding specific nucleic acidsin vivo. A method termed genomic SELEX facilitates the rapid identification of networks of protein-nucleic acid interactions by identifying within the genomic sequences of an organism the highest affinity sites for any protein of the organism. As with its progenitor, SELEX of random-sequence nucleic acids, genomic SELEX involves iterative binding, partitioning, and amplification of nucleic acids. The two methods differ in that the variable region of the nucleic acid library for genomic SELEX is derived from the genome of an organism. We have used a quick and simple method to construct Escherichia coli, Saccharomyces cerevisiae, and human genomic DNA PCR libraries that can be transcribed with T7 RNA polymerase. We present evidence that the libraries contain overlapping inserts starting at most of the positions within the genome, making these libraries suitable for genomic SELEX. PMID:9016629

  20. Genomic Data Commons launches

    Cancer.gov

    The Genomic Data Commons (GDC), a unified data system that promotes sharing of genomic and clinical data between researchers, launched today with a visit from Vice President Joe Biden to the operations center at the University of Chicago.

  1. GENOMICS AND ENVIRONMENTAL RESEARCH

    EPA Science Inventory

    The impact of recently developed and emerging genomics technologies on environmental sciences has significant implications for human and ecological risk assessment issues. The linkage of data generated from genomics, transcriptomics, proteomics, metabalomics, and ecology can be ...

  2. Genome-wide and fine-resolution association analysis of malaria in West Africa

    PubMed Central

    Jallow, Muminatou; Teo, Yik Ying; Small, Kerrin S; Rockett, Kirk A; Deloukas, Panos; Clark, Taane G; Kivinen, Katja; Bojang, Kalifa A; Conway, David J; Pinder, Margaret; Sirugo, Giorgio; Sisay-Joof, Fatou; Usen, Stanley; Auburn, Sarah; Bumpstead, Suzannah J; Campino, Susana; Coffey, Alison; Dunham, Andrew; Fry, Andrew E; Green, Angela; Gwilliam, Rhian; Hunt, Sarah E; Inouye, Michael; Jeffreys, Anna E; Mendy, Alieu; Palotie, Aarno; Potter, Simon; Ragoussis, Jiannis; Rogers, Jane; Rowlands, Kate; Somaskantharajah, Elilan; Whittaker, Pamela; Widden, Claire; Donnelly, Peter; Howie, Bryan; Marchini, Jonathan; Morris, Andrew; SanJoaquin, Miguel; Achidi, Eric Akum; Agbenyega, Tsiri; Allen, Angela; Amodu, Olukemi; Corran, Patrick; Djimde, Abdoulaye; Dolo, Amagana; Doumbo, Ogobara K; Drakeley, Chris; Dunstan, Sarah; Evans, Jennifer; Farrar, Jeremy; Fernando, Deepika; Hien, Tran Tinh; Horstmann, Rolf D; Ibrahim, Muntaser; Karunaweera, Nadira; Kokwaro, Gilbert; Koram, Kwadwo A; Lemnge, Martha; Makani, Julie; Marsh, Kevin; Michon, Pascal; Modiano, David; Molyneux, Malcolm E; Mueller, Ivo; Parker, Michael; Peshu, Norbert; Plowe, Christopher V; Puijalon, Odile; Reeder, John; Reyburn, Hugh; Riley, Eleanor M; Sakuntabhai, Anavaj; Singhasivanon, Pratap; Sirima, Sodiomon; Tall, Adama; Taylor, Terrie E; Thera, Mahamadou; Troye-Blomberg, Marita; Williams, Thomas N; Wilson, Michael; Kwiatkowski, Dominic P

    2009-01-01

    We report a genome-wide association (GWA) study of severe malaria in The Gambia. The initial GWA scan included 2,500 children genotyped on the Affymetrix 500K GeneChip, and a replication study included 3,400 children. We used this to examine the performance of GWA methods in Africa. We found considerable population stratification, and also that signals of association at known malaria resistance loci were greatly attenuated owing to weak linkage disequilibrium (LD). To investigate possible solutions to the problem of low LD, we focused on the HbS locus, sequencing this region of the genome in 62 Gambian individuals and then using these data to conduct multipoint imputation in the GWA samples. This increased the signal of association, from P = 4 × 10−7 to P = 4 × 10−14, with the peak of the signal located precisely at the HbS causal variant. Our findings provide proof of principle that fine-resolution multipoint imputation, based on population-specific sequencing data, can substantially boost authentic GWA signals and enable fine mapping of causal variants in African populations. PMID:19465909

  3. Genome-wide and fine-resolution association analysis of malaria in West Africa.

    PubMed

    Jallow, Muminatou; Teo, Yik Ying; Small, Kerrin S; Rockett, Kirk A; Deloukas, Panos; Clark, Taane G; Kivinen, Katja; Bojang, Kalifa A; Conway, David J; Pinder, Margaret; Sirugo, Giorgio; Sisay-Joof, Fatou; Usen, Stanley; Auburn, Sarah; Bumpstead, Suzannah J; Campino, Susana; Coffey, Alison; Dunham, Andrew; Fry, Andrew E; Green, Angela; Gwilliam, Rhian; Hunt, Sarah E; Inouye, Michael; Jeffreys, Anna E; Mendy, Alieu; Palotie, Aarno; Potter, Simon; Ragoussis, Jiannis; Rogers, Jane; Rowlands, Kate; Somaskantharajah, Elilan; Whittaker, Pamela; Widden, Claire; Donnelly, Peter; Howie, Bryan; Marchini, Jonathan; Morris, Andrew; SanJoaquin, Miguel; Achidi, Eric Akum; Agbenyega, Tsiri; Allen, Angela; Amodu, Olukemi; Corran, Patrick; Djimde, Abdoulaye; Dolo, Amagana; Doumbo, Ogobara K; Drakeley, Chris; Dunstan, Sarah; Evans, Jennifer; Farrar, Jeremy; Fernando, Deepika; Hien, Tran Tinh; Horstmann, Rolf D; Ibrahim, Muntaser; Karunaweera, Nadira; Kokwaro, Gilbert; Koram, Kwadwo A; Lemnge, Martha; Makani, Julie; Marsh, Kevin; Michon, Pascal; Modiano, David; Molyneux, Malcolm E; Mueller, Ivo; Parker, Michael; Peshu, Norbert; Plowe, Christopher V; Puijalon, Odile; Reeder, John; Reyburn, Hugh; Riley, Eleanor M; Sakuntabhai, Anavaj; Singhasivanon, Pratap; Sirima, Sodiomon; Tall, Adama; Taylor, Terrie E; Thera, Mahamadou; Troye-Blomberg, Marita; Williams, Thomas N; Wilson, Michael; Kwiatkowski, Dominic P

    2009-06-01

    We report a genome-wide association (GWA) study of severe malaria in The Gambia. The initial GWA scan included 2,500 children genotyped on the Affymetrix 500K GeneChip, and a replication study included 3,400 children. We used this to examine the performance of GWA methods in Africa. We found considerable population stratification, and also that signals of association at known malaria resistance loci were greatly attenuated owing to weak linkage disequilibrium (LD). To investigate possible solutions to the problem of low LD, we focused on the HbS locus, sequencing this region of the genome in 62 Gambian individuals and then using these data to conduct multipoint imputation in the GWA samples. This increased the signal of association, from P = 4 × 10(-7) to P = 4 × 10(-14), with the peak of the signal located precisely at the HbS causal variant. Our findings provide proof of principle that fine-resolution multipoint imputation, based on population-specific sequencing data, can substantially boost authentic GWA signals and enable fine mapping of causal variants in African populations. PMID:19465909

  4. Whole-genome transcriptional analysis of heavy metal stresses inCaulobacter crescentus

    SciTech Connect

    Hu, Ping; Brodie, Eoin L.; Suzuki, Yohey; McAdams, Harley H.; Andersen, Gary L.

    2005-09-21

    The bacterium Caulobacter crescentus and related stalkbacterial species are known for their distinctive ability to live in lownutrient environments, a characteristic of most heavy metal contaminatedsites. Caulobacter crescentus is a model organism for studying cell cycleregulation with well developed genetics. We have identified the pathwaysresponding to heavy metal toxicity in C. crescentus to provide insightsfor possible application of Caulobacter to environmental restoration. Weexposed C. crescentus cells to four heavy metals (chromium, cadmium,selenium and uranium) and analyzed genome wide transcriptional activitiespost exposure using a Affymetrix GeneChip microarray. C. crescentusshowed surprisingly high tolerance to uranium, a possible mechanism forwhich may be formation of extracellular calcium-uranium-phosphateprecipitates. The principal response to these metals was protectionagainst oxidative stress (up-regulation of manganese-dependent superoxidedismutase, sodA). Glutathione S-transferase, thioredoxin, glutaredoxinsand DNA repair enzymes responded most strongly to cadmium and chromate.The cadmium and chromium stress response also focused on reducing theintracellular metal concentration, with multiple efflux pumps employed toremove cadmium while a sulfate transporter was down-regulated to reducenon-specific uptake of chromium. Membrane proteins were also up-regulatedin response to most of the metals tested. A two-component signaltransduction system involved in the uranium response was identified.Several differentially regulated transcripts from regions previously notknown to encode proteins were identified, demonstrating the advantage ofevaluating the transcriptome using whole genome microarrays.

  5. Genome-wide association studies for multiple diseases of the German Shepherd Dog.

    PubMed

    Tsai, Kate L; Noorai, Rooksana E; Starr-Moss, Alison N; Quignon, Pascale; Rinz, Caitlin J; Ostrander, Elaine A; Steiner, Jörg M; Murphy, Keith E; Clark, Leigh Anne

    2012-02-01

    The German Shepherd Dog (GSD) is a popular working and companion breed for which over 50 hereditary diseases have been documented. Herein, SNP profiles for 197 GSDs were generated using the Affymetrix v2 canine SNP array for a genome-wide association study to identify loci associated with four diseases: pituitary dwarfism, degenerative myelopathy (DM), congenital megaesophagus (ME), and pancreatic acinar atrophy (PAA). A locus on Chr 9 is strongly associated with pituitary dwarfism and is proximal to a plausible candidate gene, LHX3. Results for DM confirm a major locus encompassing SOD1, in which an associated point mutation was previously identified, but do not suggest modifier loci. Several SNPs on Chr 12 are associated with ME and a 4.7 Mb haplotype block is present in affected dogs. Analysis of additional ME cases for a SNP within the haplotype provides further support for this association. Results for PAA indicate more complex genetic underpinnings. Several regions on multiple chromosomes reach genome-wide significance. However, no major locus is apparent and only two associated haplotype blocks, on Chrs 7 and 12 are observed. These data suggest that PAA may be governed by multiple loci with small effects, or it may be a heterogeneous disorder. PMID:22105877

  6. Genome-Wide Association Study of a Varroa-Specific Defense Behavior in Honeybees (Apis mellifera).

    PubMed

    Spötter, Andreas; Gupta, Pooja; Mayer, Manfred; Reinsch, Norbert; Bienefeld, Kaspar

    2016-05-01

    Honey bees are exposed to many damaging pathogens and parasites. The most devastating is Varroa destructor, which mainly affects the brood. A promising approach for preventing its spread is to breed Varroa-resistant honey bees. One trait that has been shown to provide significant resistance against the Varroa mite is hygienic behavior, which is a behavioral response of honeybee workers to brood diseases in general. Here, we report the use of an Affymetrix 44K SNP array to analyze SNPs associated with detection and uncapping of Varroa-parasitized brood by individual worker bees (Apis mellifera). For this study, 22 000 individually labeled bees were video-monitored and a sample of 122 cases and 122 controls was collected and analyzed to determine the dependence/independence of SNP genotypes from hygienic and nonhygienic behavior on a genome-wide scale. After false-discovery rate correction of the P values, 6 SNP markers had highly significant associations with the trait investigated (α < 0.01). Inspection of the genomic regions around these SNPs led to the discovery of putative candidate genes. PMID:26774061

  7. Exploiting the genome

    SciTech Connect

    Block, S.; Cornwall, J.; Dyson, F.; Koonin, S.; Lewis, N.; Schwitters, R.

    1998-09-11

    In 1997, JASON conducted a DOE-sponsored study of the human genome project with special emphasis on the areas of technology, quality assurance and quality control, and informatics. The present study has two aims: first, to update the 1997 Report in light of recent developments in genome sequencing technology, and second, to consider possible roles for the DOE in the ''post-genomic" era, following acquisition of the complete human genome sequence.

  8. COMPARATIVE GENOMICS IN LEGUMES

    Technology Transfer Automated Retrieval System (TEKTRAN)

    The legume plant family will soon include three sequenced genomes. The majority of the gene-containing portions of the model legumes Medicago truncatula and Lotus japonicus have been sequenced in clone-by-clone projects, and the sequencing of the soybean genome is underway in a whole-genome shotgun ...

  9. Whole Genome Selection

    Technology Transfer Automated Retrieval System (TEKTRAN)

    Whole genome selection (WGS) is an approach to using DNA markers that are distributed throughout the entire genome. Genes affecting most economically-important traits are distributed throughout the genome and there are relatively few that have large effects with many more genes with progressively sm...

  10. Family based genome-wide copy number scan identifies complex rearrangements at 17q21.31 in dyslexics.

    PubMed

    Veerappa, Avinash M; Saldanha, Marita; Padakannaya, Prakash; Ramachandra, Nallur B

    2014-10-01

    Developmental dyslexia (DD) is a complex heritable disorder with unexpected difficulty in learning to read and spell despite adequate intelligence, education, environment, and normal senses. We performed genome-wide screening for copy number variations (CNVs) in 10 large Indian dyslexic families using Affymetrix Genome-Wide Human SNP Array 6.0. Results revealed the complex genomic rearrangements due to one non-contiguous deletion and five contiguous micro duplications and micro deletions at 17q21.31 region in three dyslexic families. CNVs in this region harbor the genes KIAA1267, LRRC37A, ARL17A/B, NSFP1, and NSF. The CNVs in case 1 and case 2 at this locus were found to be in homozygous state and case 3 was a de novo CNV. These CNVs were found with at least one CNV having a common break and end points in the parents. This cluster of genes containing NSF is implicated in learning, cognition, and memory, though not formally associated with dyslexia. Molecular network analysis of these and other dyslexia related module genes suggests NSF and other genes to be associated with cellular/vesicular membrane fusion and synaptic transmission. Thus, we suggest that NSF in this cluster would be the nearest gene responsible for the learning disability phenotype. PMID:25139666

  11. A genome-wide study of preferential amplification/hybridization in microarray-based pooled DNA experiments

    PubMed Central

    Yang, H.-C.; Liang, Y.-J.; Huang, M.-C.; Li, L.-H.; Lin, C.-H.; Wu, J.-Y.; Chen, Y.-T.; Fann, C.S.J.

    2006-01-01

    Microarray-based pooled DNA methods overcome the cost bottleneck of simultaneously genotyping more than 100 000 markers for numerous study individuals. The success of such methods relies on the proper adjustment of preferential amplification/hybridization to ensure accurate and reliable allele frequency estimation. We performed a hybridization-based genome-wide single nucleotide polymorphisms (SNPs) genotyping analysis to dissect preferential amplification/hybridization. The majority of SNPs had less than 2-fold signal amplification or suppression, and the lognormal distributions adequately modeled preferential amplification/hybridization across the human genome. Comparative analyses suggested that the distributions of preferential amplification/hybridization differed among genotypes and the GC content. Patterns among different ethnic populations were similar; nevertheless, there were striking differences for a small proportion of SNPs, and a slight ethnic heterogeneity was observed. To fulfill appropriate and gratuitous adjustments, databases of preferential amplification/hybridization for African Americans, Caucasians and Asians were constructed based on the Affymetrix GeneChip Human Mapping 100 K Set. The robustness of allele frequency estimation using this database was validated by a pooled DNA experiment. This study provides a genome-wide investigation of preferential amplification/hybridization and suggests guidance for the reliable use of the database. Our results constitute an objective foundation for theoretical development of preferential amplification/hybridization and provide important information for future pooled DNA analyses. PMID:16931491

  12. Next generation genome-wide association tool: Design and coverage of a high-throughput European-optimized SNP array

    PubMed Central

    Hoffmann, Thomas J.; Kvale, Mark N.; Hesselson, Stephanie E.; Zhan, Yiping; Aquino, Christine; Cao, Yang; Cawley, Simon; Chung, Elaine; Connell, Sheryl; Eshragh, Jasmin; Ewing, Marcia; Gollub, Jeremy; Henderson, Mary; Hubbell, Earl; Iribarren, Carlos; Kaufman, Jay; Lao, Richard Z.; Lu, Yontao; Ludwig, Dana; Mathauda, Gurpreet K.; McGuire, William; Mei, Gangwu; Miles, Sunita; Purdy, Matthew M.; Quesenberry, Charles; Ranatunga, Dilrini; Rowell, Sarah; Sadler, Marianne; Shapero, Michael H.; Shen, Ling; Shenoy, Tanushree R.; Smethurst, David; Van den Eeden, Stephen K.; Walter, Larry; Wan, Eunice; Wearley, Reid; Webster, Teresa; Wen, Christopher C.; Weng, Li; Whitmer, Rachel A.; Williams, Alan; Wong, Simon C.; Zau, Chia; Finn, Andrea; Schaefer, Catherine; Kwok, Pui-Yan; Risch, Neil

    2011-01-01

    The success of genome-wide association studies has paralleled the development of efficient genotyping technologies. We describe the development of a next-generation microarray based on the new highly-efficient Affymetrix Axiom genotyping technology that we are using to genotype individuals of European ancestry from the Kaiser Permanente Research Program on Genes, Environment and Health (RPGEH). The array contains 674,517 SNPs, and provides excellent genome-wide as well as gene-based and candidate-SNP coverage. Coverage was calculated using an approach based on imputation and cross validation. Preliminary results for the first 80,301 saliva-derived DNA samples from the RPGEH demonstrate very high quality genotypes, with sample success rates above 94% and over 98% of successful samples having SNP call rates exceeding 98%. At steady state, we have produced 462 million genotypes per week for each Axiom system. The new array provides a valuable addition to the repertoire of tools for large scale genome-wide association studies. PMID:21565264

  13. Genomics and functional genomics with haloarchaea.

    PubMed

    Soppa, J; Baumann, A; Brenneis, M; Dambeck, M; Hering, O; Lange, C

    2008-09-01

    The first haloarchaeal genome was published in 2000 and today five genome sequences are available. Transcriptome and proteome analyses have been established for two and three haloarchaeal species, respectively, and more than 20 studies using these functional genomic approaches have been published in the last two years. These studies gave global overviews of metabolic regulation (aerobic and anaerobic respiration, phototrophy, carbon source usage), stress response (UV, X-rays, transition metals, osmotic and temperature stress), cell cycle-dependent transcript level regulation, and transcript half-lives. The only translatome analysis available for any prokaryotic species revealed that 10 and 20% of all transcripts are translationally regulated in Haloferax volcanii and Halobacterium salinarum, respectively. Very effective methods for the construction of in frame deletion mutants have been established recently for haloarchaea and are intensively used to unravel the biological roles of genes in this group. Bioinformatic analyses include both cross-genome comparisons as well as integration of genomic data with experimental results. The first systems biology approaches have been performed that used experimental data to construct predictive models of gene expression and metabolism, respectively. In this contribution the current status of genomics, functional genomics, and molecular genetics of haloarchaea is summarized and selected examples are discussed. PMID:18493745

  14. Genomic characterization of esophageal squamous cell carcinoma from a high-risk population in China

    PubMed Central

    Hu, Nan; Wang, Chaoyu; Ng, David; Clifford, Robert; Yang, Howard H; Tang, Ze-Zhong; Wang, Quan-Hong; Han, Xiao-You; Giffen, Carol; Goldstein, Alisa M; Taylor, Philip R; Lee, Maxwell P

    2009-01-01

    Genomic instability plays an important role in most human cancers. To characterize genomic instability in esophageal squamous cell carcinoma (ESCC), we examined loss of heterozygosity (LOH), copy number (CN) loss, CN gain, and gene expression using the Affymetrix GeneChip Human Mapping 500K (n=30 cases) and Human U133A (n=17 cases) arrays in ESCC cases from a high-risk region of China. We found that genomic instability measures varied widely among cases and separated them into two groups: a high-frequency instability group (two-thirds of all cases with one or more instability category ≥ 10%) and a low-frequency instability group (one-third of cases with instability < 10%). Genomic instability also varied widely across chromosomal arms, with the highest frequency of LOH on 9p (33% of informative single nucleotide polymorphisms (SNPs)), CN loss on 3p (33%), and CN gain on 3q (48%). Twenty-two LOH regions were identified: four on 9p, seven on 9q, four on 13q, two on 17p, and five on 17q. Three CN loss regions – 3p12.3, 4p15.1, and 9p21.3 – were detected. Twelve CN gain regions were found, including six on 3q, one on 7q, four on 8q, and one on 11q. One of the most gene-rich of these CN gain regions was 11q13.1-13.4, where 26 genes also had RNA expression data available. CN gain was significantly correlated with increased RNA expression in over 80% of these genes. Our findings demonstrate the potential utility of combining CN analysis and gene expression data to identify genes involved in esophageal carcinogenesis. PMID:19584285

  15. Genome Wide Association for Addiction: Replicated Results and Comparisons of Two Analytic Approaches

    PubMed Central

    Drgon, Tomas; Zhang, Ping-Wu; Johnson, Catherine; Walther, Donna; Hess, Judith; Nino, Michelle; Uhl, George R.

    2010-01-01

    Background Vulnerabilities to dependence on addictive substances are substantially heritable complex disorders whose underlying genetic architecture is likely to be polygenic, with modest contributions from variants in many individual genes. “Nontemplate” genome wide association (GWA) approaches can identity groups of chromosomal regions and genes that, taken together, are much more likely to contain allelic variants that alter vulnerability to substance dependence than expected by chance. Methodology/Principal Findings We report pooled “nontemplate” genome-wide association studies of two independent samples of substance dependent vs control research volunteers (n = 1620), one European-American and the other African-American using 1 million SNP (single nucleotide polymorphism) Affymetrix genotyping arrays. We assess convergence between results from these two samples using two related methods that seek clustering of nominally-positive results and assess significance levels with Monte Carlo and permutation approaches. Both “converge then cluster” and “cluster then converge” analyses document convergence between the results obtained from these two independent datasets in ways that are virtually never found by chance. The genes identified in this fashion are also identified by individually-genotyped dbGAP data that compare allele frequencies in cocaine dependent vs control individuals. Conclusions/Significance These overlapping results identify small chromosomal regions that are also identified by genome wide data from studies of other relevant samples to extents much greater than chance. These chromosomal regions contain more genes related to “cell adhesion” processes than expected by chance. They also contain a number of genes that encode potential targets for anti-addiction pharmacotherapeutics. “Nontemplate” GWA approaches that seek chromosomal regions in which nominally-positive associations are found in multiple independent samples are

  16. The complete genome sequence of a chronic atrophic gastritis Helicobacter pylori strain: Evolution during disease progression

    PubMed Central

    Oh, Jung D.; Kling-Bäckhed, Helene; Giannakis, Marios; Xu, Jian; Fulton, Robert S.; Fulton, Lucinda A.; Cordum, Holland S.; Wang, Chunyan; Elliott, Glendoria; Edwards, Jennifer; Mardis, Elaine R.; Engstrand, Lars G.; Gordon, Jeffrey I.

    2006-01-01

    Helicobacter pylori produces acute superficial gastritis in nearly all of its human hosts. However, a subset of individuals develops chronic atrophic gastritis (ChAG), a condition characterized in part by diminished numbers of acid-producing parietal cells and increased risk for development of gastric adenocarcinoma. Previously, we used a gnotobiotic transgenic mouse model with an engineered ablation of parietal cells to show that loss of parietal cells provides an opportunity for a H. pylori isolate from a patient with ChAG (HPAG1) to bind to, enter, and persist within gastric stem cells. This finding raises the question of how ChAG influences H. pylori genome evolution, physiology, and tumorigenesis. Here we describe the 1,596,366-bp HPAG1 genome. Custom HPAG1 Affymetrix GeneChips, representing 99.6% of its predicted ORFs, were used for whole-genome genotyping of additional H. pylori ChAG isolates obtained from Swedish patients enrolled in a case-control study of gastric cancer, as well as ChAG- and cancer-associated isolates from an individual who progressed from ChAG to gastric adenocarcinoma. The results reveal a shared gene signature among ChAG strains, as well as genes that may have been lost or gained during progression to adenocarcinoma. Whole-genome transcriptional profiling of HPAG1’s response to acid during in vitro growth indicates that genes encoding components of metal uptake and utilization pathways, outer membrane proteins, and virulence factors are among those associated with H. pylori’s adaptation to ChAG. PMID:16788065

  17. Chromium and Genomic Stability

    PubMed Central

    Wise, Sandra S.; Wise, John Pierce

    2014-01-01

    Many metals serve as micronutrients which protect against genomic instability. Chromium is most abundant in its trivalent and hexavalent forms. Trivalent chromium has historically been considered an essential element, though recent data indicate that while it can have pharmacological effects and value, it is not essential. There are no data indicating that trivalent chromium promotes genomic stability and, instead may promote genomic instability. Hexavalent chromium is widely accepted as highly toxic and carcinogenic with no nutritional value. Recent data indicate that it causes genomic instability and also has no role in promoting genomic stability. PMID:22192535

  18. The Genomic Medicine Game.

    PubMed

    Tran, Elvis; de Andrés-Galiana, Enrique J; Benitez, Sonia; Martin-Sanchez, Fernando; Lopez-Campos, Guillermo H

    2016-01-01

    With advancements in genomics technology, health care has been improving and new paradigms of medicine such as genomic medicine have evolved. The education of clinicians, researchers and students to face the challenges posed by these new approaches, however, has been often lagging behind. From this the Genomic Medicine Game, an educational tool, was created for the purpose of conceptualizing the key components of Genomic Medicine. A number of phenotype-genotype associations were found through a literature review, which was used to be a base for the concepts the Genomic Medicine Game would focus on. Built in Java, the game was successfully tested with promising results. PMID:27577486

  19. Microbial genomic taxonomy.

    PubMed

    Thompson, Cristiane C; Chimetto, Luciane; Edwards, Robert A; Swings, Jean; Stackebrandt, Erko; Thompson, Fabiano L

    2013-01-01

    A need for a genomic species definition is emerging from several independent studies worldwide. In this commentary paper, we discuss recent studies on the genomic taxonomy of diverse microbial groups and a unified species definition based on genomics. Accordingly, strains from the same microbial species share >95% Average Amino Acid Identity (AAI) and Average Nucleotide Identity (ANI), >95% identity based on multiple alignment genes, <10 in Karlin genomic signature, and > 70% in silico Genome-to-Genome Hybridization similarity (GGDH). Species of the same genus will form monophyletic groups on the basis of 16S rRNA gene sequences, Multilocus Sequence Analysis (MLSA) and supertree analysis. In addition to the established requirements for species descriptions, we propose that new taxa descriptions should also include at least a draft genome sequence of the type strain in order to obtain a clear outlook on the genomic landscape of the novel microbe. The application of the new genomic species definition put forward here will allow researchers to use genome sequences to define simultaneously coherent phenotypic and genomic groups. PMID:24365132

  20. The Bluejay genome browser.

    PubMed

    Soh, Jung; Gordon, Paul M K; Sensen, Christoph W

    2012-03-01

    The Bluejay genome browser is a stand-alone visualization tool for the multi-scale viewing of annotated genomes and other genomic elements. Bluejay allows users to customize display features to suit their needs, and produces publication-quality graphics. Bluejay provides a multitude of ways to interrelate biological data at the genome scale. Users can load gene expression data into a genome display for expression visualization in context. Multiple genomes can be compared concurrently, including time series expression data, based on Gene Ontology labels. External, context-sensitive biological Web Services are linked to the displayed genomic elements ad hoc for in-depth genomic data analysis and interpretation. Users can mark multiple points of interest in a genome by creating waypoints, and exploit them for easy navigation of single or multiple genomes. Using this comprehensive visual environment, users can study a gene not just in relation to its genome, but also its transcriptome and evolutionary origins. Written in Java, Bluejay is platform-independent and is freely available from http://bluejay.ucalgary.ca. PMID:22389011

  1. Bacterial Genome Instability

    PubMed Central

    Darmon, Elise

    2014-01-01

    SUMMARY Bacterial genomes are remarkably stable from one generation to the next but are plastic on an evolutionary time scale, substantially shaped by horizontal gene transfer, genome rearrangement, and the activities of mobile DNA elements. This implies the existence of a delicate balance between the maintenance of genome stability and the tolerance of genome instability. In this review, we describe the specialized genetic elements and the endogenous processes that contribute to genome instability. We then discuss the consequences of genome instability at the physiological level, where cells have harnessed instability to mediate phase and antigenic variation, and at the evolutionary level, where horizontal gene transfer has played an important role. Indeed, this ability to share DNA sequences has played a major part in the evolution of life on Earth. The evolutionary plasticity of bacterial genomes, coupled with the vast numbers of bacteria on the planet, substantially limits our ability to control disease. PMID:24600039

  2. UCSC genome browser tutorial.

    PubMed

    Zweig, Ann S; Karolchik, Donna; Kuhn, Robert M; Haussler, David; Kent, W James

    2008-08-01

    The University of California Santa Cruz (UCSC) Genome Bioinformatics website consists of a suite of free, open-source, on-line tools that can be used to browse, analyze, and query genomic data. These tools are available to anyone who has an Internet browser and an interest in genomics. The website provides a quick and easy-to-use visual display of genomic data. It places annotation tracks beneath genome coordinate positions, allowing rapid visual correlation of different types of information. Many of the annotation tracks are submitted by scientists worldwide; the others are computed by the UCSC Genome Bioinformatics group from publicly available sequence data. It also allows users to upload and display their own experimental results or annotation sets by creating a custom track. The suite of tools, downloadable data files, and links to documentation and other information can be found at http://genome.ucsc.edu/. PMID:18514479

  3. Variations in genome mass.

    PubMed

    Wachtel, S S; Tiersch, T R

    1993-02-01

    1. Genome size varies considerably among vertebrates, ranging from less than 1 pg to more than 200 pg; the amount of DNA differing among individuals in a population can equal the amount in the entire structural gene complement. 2. Recent technological advances permit evaluation of genome size variation at several levels including sub-chromosomal, chromosomal and cellular. 3. Genome size variation may also be viewed from taxonomic levels, and across evolutionary time frames. 4. As sources of genome size variation are identified and studied, the conundrum of the C-value paradox (lack of correlations among genome size, genomic complexity and phylogenetic status of organisms) may prove to be more apparent than real. 5. For example, the limited and relatively constant genome size of avians may be related to the physiological constraints of flight. PMID:8462275

  4. Genome wide expression profiling of two accession of G. herbaceum L. in response to drought

    PubMed Central

    2012-01-01

    Background Genome-wide gene expression profiling and detailed physiological investigation were used for understanding the molecular mechanism and physiological response of Gossypium herbaceum, which governs the adaptability of plants in drought conditions. Recently, microarray-based gene expression analysis is commonly used to decipher genes and genetic networks controlling the traits of interest. However, the results of such an analysis are often plagued due to a limited number of genes (probe sets) on microarrays. On the other hand, pyrosequencing of a transcriptome has the potential to detect rare as well as a large number of transcripts in the samples quantitatively. We used Affymetrix microarray as well as Roche's GS-FLX transcriptome sequencing for a comparative analysis of cotton transcriptome in leaf tissues under drought conditions. Results Fourteen accessions of Gossypium herbaceum were subjected to mannitol stress for preliminary screening; two accessions, namely Vagad and RAHS-14, were selected as being the most tolerant and most sensitive to osmotic stress, respectively. Affymetrix cotton arrays containing 24,045 probe sets and Roche's GS-FLX transcriptome sequencing of leaf tissue were used to analyze the gene expression profiling of Vagad and RAHS-14 under drought conditions. The analysis of physiological measurements and gene expression profiling showed that Vagad has the inherent ability to sense drought at a much earlier stage and to respond to it in a much more efficient manner than does RAHS-14. Gene Ontology (GO) studies showed that the phenyl propanoid pathway, pigment biosynthesis, polyketide biosynthesis, and other secondary metabolite pathways were enriched in Vagad under control and drought conditions as compared with RAHS-14. Similarly, GO analysis of transcriptome sequencing showed that the GO terms responses to various abiotic stresses were significantly higher in Vagad. Among the classes of transcription factors (TFs) uniquely

  5. Multicentric Genome-Wide Association Study for Primary Spontaneous Pneumothorax.

    PubMed

    Sousa, Inês; Abrantes, Patrícia; Francisco, Vânia; Teixeira, Gilberto; Monteiro, Marta; Neves, João; Norte, Ana; Robalo Cordeiro, Carlos; Moura E Sá, João; Reis, Ernestina; Santos, Patrícia; Oliveira, Manuela; Sousa, Susana; Fradinho, Marta; Malheiro, Filipa; Negrão, Luís; Feijó, Salvato; Oliveira, Sofia A

    2016-01-01

    Despite elevated incidence and recurrence rates for Primary Spontaneous Pneumothorax (PSP), little is known about its etiology, and the genetics of idiopathic PSP remains unexplored. To identify genetic variants contributing to sporadic PSP risk, we conducted the first PSP genome-wide association study. Two replicate pools of 92 Portuguese PSP cases and of 129 age- and sex-matched controls were allelotyped in triplicate on the Affymetrix Human SNP Array 6.0 arrays. Markers passing quality control were ranked by relative allele score difference between cases and controls (|RASdiff|), by a novel cluster method and by a combined Z-test. 101 single nucleotide polymorphisms (SNPs) were selected using these three approaches for technical validation by individual genotyping in the discovery dataset. 87 out of 94 successfully tested SNPs were nominally associated in the discovery dataset. Replication of the 87 technically validated SNPs was then carried out in an independent replication dataset of 100 Portuguese cases and 425 controls. The intergenic rs4733649 SNP in chromosome 8 (between LINC00824 and LINC00977) was associated with PSP in the discovery (P = 4.07E-03, ORC[95% CI] = 1.88[1.22-2.89]), replication (P = 1.50E-02, ORC[95% CI] = 1.50[1.08-2.09]) and combined datasets (P = 8.61E-05, ORC[95% CI] = 1.65[1.29-2.13]). This study identified for the first time one genetic risk factor for sporadic PSP, but future studies are warranted to further confirm this finding in other populations and uncover its functional role in PSP pathogenesis. PMID:27203581

  6. Multicentric Genome-Wide Association Study for Primary Spontaneous Pneumothorax

    PubMed Central

    Abrantes, Patrícia; Francisco, Vânia; Teixeira, Gilberto; Monteiro, Marta; Neves, João; Norte, Ana; Robalo Cordeiro, Carlos; Moura e Sá, João; Reis, Ernestina; Santos, Patrícia; Oliveira, Manuela; Sousa, Susana; Fradinho, Marta; Malheiro, Filipa; Negrão, Luís

    2016-01-01

    Despite elevated incidence and recurrence rates for Primary Spontaneous Pneumothorax (PSP), little is known about its etiology, and the genetics of idiopathic PSP remains unexplored. To identify genetic variants contributing to sporadic PSP risk, we conducted the first PSP genome-wide association study. Two replicate pools of 92 Portuguese PSP cases and of 129 age- and sex-matched controls were allelotyped in triplicate on the Affymetrix Human SNP Array 6.0 arrays. Markers passing quality control were ranked by relative allele score difference between cases and controls (|RASdiff|), by a novel cluster method and by a combined Z-test. 101 single nucleotide polymorphisms (SNPs) were selected using these three approaches for technical validation by individual genotyping in the discovery dataset. 87 out of 94 successfully tested SNPs were nominally associated in the discovery dataset. Replication of the 87 technically validated SNPs was then carried out in an independent replication dataset of 100 Portuguese cases and 425 controls. The intergenic rs4733649 SNP in chromosome 8 (between LINC00824 and LINC00977) was associated with PSP in the discovery (P = 4.07E-03, ORC[95% CI] = 1.88[1.22–2.89]), replication (P = 1.50E-02, ORC[95% CI] = 1.50[1.08–2.09]) and combined datasets (P = 8.61E-05, ORC[95% CI] = 1.65[1.29–2.13]). This study identified for the first time one genetic risk factor for sporadic PSP, but future studies are warranted to further confirm this finding in other populations and uncover its functional role in PSP pathogenesis. PMID:27203581

  7. Genomics of sorghum.

    PubMed

    Paterson, Andrew H

    2008-01-01

    Sorghum (Sorghum bicolor (L.) Moench) is a subject of plant genomics research based on its importance as one of the world's leading cereal crops, a biofuels crop of high and growing importance, a progenitor of one of the world's most noxious weeds, and a botanical model for many tropical grasses with complex genomes. A rich history of genome analysis, culminating in the recent complete sequencing of the genome of a leading inbred, provides a foundation for invigorating progress toward relating sorghum genes to their functions. Further characterization of the genomes other than Saccharinae cereals may shed light on mechanisms, levels, and patterns of evolution of genome size and structure, laying the foundation for further study of sugarcane and other economically important members of the group. PMID:18483564

  8. The tiniest tiny genomes.

    PubMed

    Moran, Nancy A; Bennett, Gordon M

    2014-01-01

    Starting in 2006, surprisingly tiny genomes have been discovered from numerous bacterial symbionts of insect hosts. Despite their size, each retains some genes that enable provisioning of limiting nutrients or other capabilities required by hosts. Genome sequence analyses show that genome reduction is an ongoing process, resulting in a continuum of sizes, with the smallest genome currently known at 112 kilobases. Genome reduction is typical in host-restricted symbionts and pathogens, but the tiniest genomes are restricted to symbionts required by hosts and restricted to specialized host cells, resulting from long coevolution with hosts. Genes are lost in all functional categories, but core genes for central informational processes, including genes encoding ribosomal proteins, are mostly retained, whereas genes underlying production of cell envelope components are especially depleted. Thus, these entities retain cell-like properties but are heavily dependent on coadaptation of hosts, which continuously evolve to support the symbionts upon which they depend. PMID:24995872

  9. Querying genomic databases

    SciTech Connect

    Baehr, A.; Hagstrom, R.; Joerg, D.; Overbeek, R.

    1991-09-01

    A natural-language interface has been developed that retrieves genomic information by using a simple subset of English. The interface spares the biologist from the task of learning database-specific query languages and computer programming. Currently, the interface deals with the E. coli genome. It can, however, be readily extended and shows promise as a means of easy access to other sequenced genomic databases as well.

  10. Genome Aliquoting Revisited

    NASA Astrophysics Data System (ADS)

    Warren, Robert; Sankoff, David

    We prove that the genome aliquoting problem, the problem of finding a recent polyploid ancestor of a genome, with breakpoint distance can be solved in polynomial time. We propose an aliquoting algorithm that is a 2-approximation for the genome aliquoting problem with double cut and join distance, improving upon the previous best solution to this problem, Feijão and Meidanis' 4-approximation algorithm.

  11. Physician Assistant Genomic Competencies.

    PubMed

    Goldgar, Constance; Michaud, Ed; Park, Nguyen; Jenkins, Jean

    2016-09-01

    Genomic discoveries are increasingly being applied to the clinical care of patients. All physician assistants (PAs) need to acquire competency in genomics to provide the best possible care for patients within the scope of their practice. In this article, we present an updated version of PA genomic competencies and learning outcomes in a framework that is consistent with the current medical education guidelines and the collaborative nature of PAs in interprofessional health care teams. PMID:27490287

  12. Filarial and Wolbachia genomics.

    PubMed

    Scott, A L; Ghedin, E; Nutman, T B; McReynolds, L A; Poole, C B; Slatko, B E; Foster, J M

    2012-01-01

    Filarial nematode parasites, the causative agents for a spectrum of acute and chronic diseases including lymphatic filariasis and river blindness, threaten the well-being and livelihood of hundreds of millions of people in the developing regions of the world. The 2007 publication on a draft assembly of the 95-Mb genome of the human filarial parasite Brugia malayi- representing the first helminth parasite genome to be sequenced - has been followed in rapid succession by projects that have resulted in the genome sequencing of six additional filarial species, seven nonfilarial nematode parasites of animals and nearly 30 plant parasitic and free-living species. Parallel to the genomic sequencing, transcriptomic and proteomic projects have facilitated genome annotation, expanded our understanding of stage-associated gene expression and provided a first look at the role of epigenetic regulation of filarial genomes through microRNAs. The expansion in filarial genomics will also provide a significant enrichment in our knowledge of the diversity and variability in the genomes of the endosymbiotic bacterium Wolbachia leading to a better understanding of the genetic principles that govern filarial-Wolbachia mutualism. The goal here is to provide an overview of the trends and advances in filarial and Wolbachia genomics. PMID:22098559

  13. Fungal Genomics Program

    SciTech Connect

    Grigoriev, Igor

    2012-03-12

    The JGI Fungal Genomics Program aims to scale up sequencing and analysis of fungal genomes to explore the diversity of fungi important for energy and the environment, and to promote functional studies on a system level. Combining new sequencing technologies and comparative genomics tools, JGI is now leading the world in fungal genome sequencing and analysis. Over 120 sequenced fungal genomes with analytical tools are available via MycoCosm (www.jgi.doe.gov/fungi), a web-portal for fungal biologists. Our model of interacting with user communities, unique among other sequencing centers, helps organize these communities, improves genome annotation and analysis work, and facilitates new larger-scale genomic projects. This resulted in 20 high-profile papers published in 2011 alone and contributing to the Genomics Encyclopedia of Fungi, which targets fungi related to plant health (symbionts, pathogens, and biocontrol agents) and biorefinery processes (cellulose degradation, sugar fermentation, industrial hosts). Our next grand challenges include larger scale exploration of fungal diversity (1000 fungal genomes), developing molecular tools for DOE-relevant model organisms, and analysis of complex systems and metagenomes.

  14. Genomics of Clostridium tetani.

    PubMed

    Brüggemann, Holger; Brzuszkiewicz, Elzbieta; Chapeton-Montes, Diana; Plourde, Lucile; Speck, Denis; Popoff, Michel R

    2015-05-01

    Genomic information about Clostridium tetani, the causative agent of the tetanus disease, is scarce. The genome of strain E88, a strain used in vaccine production, was sequenced about 10 years ago. One additional genome (strain 12124569) has recently been released. Here we report three new genomes of C. tetani and describe major differences among all five C. tetani genomes. They all harbor tetanus-toxin-encoding plasmids that contain highly conserved genes for TeNT (tetanus toxin), TetR (transcriptional regulator of TeNT) and ColT (collagenase), but substantially differ in other plasmid regions. The chromosomes share a large core genome that contains about 85% of all genes of a given chromosome. The non-core chromosome comprises mainly prophage-like genomic regions and genes encoding environmental interaction and defense functions (e.g. surface proteins, restriction-modification systems, toxin-antitoxin systems, CRISPR/Cas systems) and other fitness functions (e.g. transport systems, metabolic activities). This new genome information will help to assess the level of genome plasticity of the species C. tetani and provide the basis for detailed comparative studies. PMID:25638019

  15. Between two fern genomes.

    PubMed

    Sessa, Emily B; Banks, Jo Ann; Barker, Michael S; Der, Joshua P; Duffy, Aaron M; Graham, Sean W; Hasebe, Mitsuyasu; Langdale, Jane; Li, Fay-Wei; Marchant, D Blaine; Pryer, Kathleen M; Rothfels, Carl J; Roux, Stanley J; Salmi, Mari L; Sigel, Erin M; Soltis, Douglas E; Soltis, Pamela S; Stevenson, Dennis W; Wolf, Paul G

    2014-01-01

    Ferns are the only major lineage of vascular plants not represented by a sequenced nuclear genome. This lack of genome sequence information significantly impedes our ability to understand and reconstruct genome evolution not only in ferns, but across all land plants. Azolla and Ceratopteris are ideal and complementary candidates to be the first ferns to have their nuclear genomes sequenced. They differ dramatically in genome size, life history, and habit, and thus represent the immense diversity of extant ferns. Together, this pair of genomes will facilitate myriad large-scale comparative analyses across ferns and all land plants. Here we review the unique biological characteristics of ferns and describe a number of outstanding questions in plant biology that will benefit from the addition of ferns to the set of taxa with sequenced nuclear genomes. We explain why the fern clade is pivotal for understanding genome evolution across land plants, and we provide a rationale for how knowledge of fern genomes will enable progress in research beyond the ferns themselves. PMID:25324969

  16. [Landscape and ecological genomics].

    PubMed

    2013-10-01

    Landscape genomics is the modern version of landscape genetics, a discipline that arose approximately 10 years ago as a combination of population genetics, landscape ecology, and spatial statistics. It studies the effects of environmental variables on gene flow and other microevolutionary processes that determine genetic connectivity and variations in populations. In contrast to population genetics, it operates at the level of individual specimens rather than at the level of population samples. Another important difference between landscape genetics and genomics and population genetics is that, in the former, the analysis of gene flow and local adaptations takes quantitative account of landforms and features of the matrix, i.e., hostile spaces that separate species habitats. Landscape genomics is a part of population ecogenomics, which, along with community genomics, is a major part of ecological genomics. One of the principal purposes of landscape genomics is the identification and differentiation of various genome-wide and locus-specific effects. The approaches and computation tools developed for combined analysis of genomic and landscape variables make it possible to detect adaptation-related genome fragments, which facilitates the planning of conservation efforts and the prediction of species' fate in response to expected changes in the environment. PMID:25508669

  17. [Landscape and ecological genomics].

    PubMed

    Tetushkin, E Ia

    2013-10-01

    Landscape genomics is the modern version of landscape genetics, a discipline that arose approximately 10 years ago as a combination of population genetics, landscape ecology, and spatial statistics. It studies the effects of environmental variables on gene flow and other microevolutionary processes that determine genetic connectivity and variations in populations. In contrast to population genetics, it operates at the level of individual specimens rather than at the level of population samples. Another important difference between landscape genetics and genomics and population genetics is that, in the former, the analysis of gene flow and local adaptations takes quantitative account of landforms and features of the matrix, i.e., hostile spaces that separate species habitats. Landscape genomics is a part of population ecogenomics, which, along with community genomics, is a major part of ecological genomics. One of the principal purposes of landscape genomics is the identification and differentiation of various genome-wide and locus-specific effects. The approaches and computation tools developed for combined analysis of genomic and landscape variables make it possible to detect adaptation-related genome fragments, which facilitates the planning of conservation efforts and the prediction of species' fate in response to expected changes in the environment. PMID:25474890

  18. Between Two Fern Genomes

    PubMed Central

    2014-01-01

    Ferns are the only major lineage of vascular plants not represented by a sequenced nuclear genome. This lack of genome sequence information significantly impedes our ability to understand and reconstruct genome evolution not only in ferns, but across all land plants. Azolla and Ceratopteris are ideal and complementary candidates to be the first ferns to have their nuclear genomes sequenced. They differ dramatically in genome size, life history, and habit, and thus represent the immense diversity of extant ferns. Together, this pair of genomes will facilitate myriad large-scale comparative analyses across ferns and all land plants. Here we review the unique biological characteristics of ferns and describe a number of outstanding questions in plant biology that will benefit from the addition of ferns to the set of taxa with sequenced nuclear genomes. We explain why the fern clade is pivotal for understanding genome evolution across land plants, and we provide a rationale for how knowledge of fern genomes will enable progress in research beyond the ferns themselves. PMID:25324969

  19. The impact of Converso Jews on the genomes of modern Latin Americans.

    PubMed

    Velez, C; Palamara, P F; Guevara-Aguirre, J; Hao, L; Karafet, T; Guevara-Aguirre, M; Pearlman, A; Oddoux, C; Hammer, M; Burns, E; Pe'er, I; Atzmon, G; Ostrer, H

    2012-02-01

    Modern day Latin America resulted from the encounter of Europeans with the indigenous peoples of the Americas in 1492, followed by waves of migration from Europe and Africa. As a result, the genomic structure of present day Latin Americans was determined both by the genetic structure of the founding populations and the numbers of migrants from these different populations. Here, we analyzed DNA collected from two well-established communities in Colorado (33 unrelated individuals) and Ecuador (20 unrelated individuals) with a measurable prevalence of the BRCA1 c.185delAG and the GHR c.E180 mutations, respectively, using Affymetrix Genome-wide Human SNP 6.0 arrays to identify their ancestry. These mutations are thought to have been brought to these communities by Sephardic Jewish progenitors. Principal component analysis and clustering methods were employed to determine the genome-wide patterns of continental ancestry within both populations using single nucleotide polymorphisms, complemented by determination of Y-chromosomal and mitochondrial DNA haplotypes. When examining the presumed European component of these two communities, we demonstrate enrichment for Sephardic Jewish ancestry not only for these mutations, but also for other segments as well. Although comparison of both groups to a reference Hispanic/Latino population of Mexicans demonstrated proximity and similarity to other modern day communities derived from a European and Native American two-way admixture, identity-by-descent and Y-chromosome mapping demonstrated signatures of Sephardim in both communities. These findings are consistent with historical accounts of Jewish migration from the realms that comprise modern Spain and Portugal during the Age of Discovery. More importantly, they provide a rationale for the occurrence of mutations typically associated with the Jewish Diaspora in Latin American communities. PMID:21789512

  20. Identification of Promising Mutants Associated with Egg Production Traits Revealed by Genome-Wide Association Study

    PubMed Central

    Dou, Taocun; Yi, Guoqiang; Qu, LuJiang; Qu, Liang; Wang, Kehua; Yang, Ning

    2015-01-01

    Egg number (EN), egg laying rate (LR) and age at first egg (AFE) are important production traits related to egg production in poultry industry. To better understand the knowledge of genetic architecture of dynamic EN during the whole laying cycle and provide the precise positions of associated variants for EN, LR and AFE, laying records from 21 to 72 weeks of age were collected individually for 1,534 F2 hens produced by reciprocal crosses between White Leghorn and Dongxiang Blue-shelled chicken, and their genotypes were assayed by chicken 600 K Affymetrix high density genotyping arrays. Subsequently, pedigree and SNP-based genetic parameters were estimated and a genome-wide association study (GWAS) was conducted on EN, LR and AFE. The heritability estimates were similar between pedigree and SNP-based estimates varying from 0.17 to 0.36. In the GWA analysis, we identified nine genome-wide significant loci associated with EN of the laying periods from 21 to 26 weeks, 27 to 36 weeks and 37 to 72 weeks. Analysis of GTF2A1 and CLSPN suggested that they influenced the function of ovary and uterus, and may be considered as relevant candidates. The identified SNP rs314448799 for accumulative EN from 21 to 40 weeks on chromosome 5 created phenotypic differences of 6.86 eggs between two homozygous genotypes, which could be potentially applied to the molecular breeding for EN selection. Moreover, our finding showed that LR was a moderate polygenic trait. The suggestive significant region on chromosome 16 for AFE suggested the relationship between sex maturity and immune in the current population. The present study comprehensively evaluates the role of genetic variants in the development of egg laying. The findings will be helpful to investigation of causative genes function and future marker-assisted selection and genomic selection in chickens. PMID:26496084

  1. Evaluation of Genome Wide Association Study Associated Type 2 Diabetes Susceptibility Loci in Sub Saharan Africans.

    PubMed

    Adeyemo, Adebowale A; Tekola-Ayele, Fasil; Doumatey, Ayo P; Bentley, Amy R; Chen, Guanjie; Huang, Hanxia; Zhou, Jie; Shriner, Daniel; Fasanmade, Olufemi; Okafor, Godfrey; Eghan, Benjamin; Agyenim-Boateng, Kofi; Adeleye, Jokotade; Balogun, Williams; Elkahloun, Abdel; Chandrasekharappa, Settara; Owusu, Samuel; Amoah, Albert; Acheampong, Joseph; Johnson, Thomas; Oli, Johnnie; Adebamowo, Clement; Collins, Francis; Dunston, Georgia; Rotimi, Charles N

    2015-01-01

    Genome wide association studies (GWAS) for type 2 diabetes (T2D) undertaken in European and Asian ancestry populations have yielded dozens of robustly associated loci. However, the genomics of T2D remains largely understudied in sub-Saharan Africa (SSA), where rates of T2D are increasing dramatically and where the environmental background is quite different than in these previous studies. Here, we evaluate 106 reported T2D GWAS loci in continental Africans. We tested each of these SNPs, and SNPs in linkage disequilibrium (LD) with these index SNPs, for an association with T2D in order to assess transferability and to fine map the loci leveraging the generally reduced LD of African genomes. The study included 1775 unrelated Africans (1035 T2D cases, 740 controls; mean age 54 years; 59% female) enrolled in Nigeria, Ghana, and Kenya as part of the Africa America Diabetes Mellitus (AADM) study. All samples were genotyped on the Affymetrix Axiom PanAFR SNP array. Forty-one of the tested loci showed transferability to this African sample (p < 0.05, same direction of effect), 11 at the exact reported SNP and 30 others at SNPs in LD with the reported SNP (after adjustment for the number of tested SNPs). TCF7L2 SNP rs7903146 was the most significant locus in this study (p = 1.61 × 10(-8)). Most of the loci that showed transferability were successfully fine-mapped, i.e., localized to smaller haplotypes than in the original reports. The findings indicate that the genetic architecture of T2D in SSA is characterized by several risk loci shared with non-African ancestral populations and that data from African populations may facilitate fine mapping of risk loci. The study provides an important resource for meta-analysis of African ancestry populations and transferability of novel loci. PMID:26635871

  2. Evaluation of Genome Wide Association Study Associated Type 2 Diabetes Susceptibility Loci in Sub Saharan Africans

    PubMed Central

    Adeyemo, Adebowale A.; Tekola-Ayele, Fasil; Doumatey, Ayo P.; Bentley, Amy R.; Chen, Guanjie; Huang, Hanxia; Zhou, Jie; Shriner, Daniel; Fasanmade, Olufemi; Okafor, Godfrey; Eghan, Benjamin; Agyenim-Boateng, Kofi; Adeleye, Jokotade; Balogun, Williams; Elkahloun, Abdel; Chandrasekharappa, Settara; Owusu, Samuel; Amoah, Albert; Acheampong, Joseph; Johnson, Thomas; Oli, Johnnie; Adebamowo, Clement; Collins, Francis; Dunston, Georgia; Rotimi, Charles N.

    2015-01-01

    Genome wide association studies (GWAS) for type 2 diabetes (T2D) undertaken in European and Asian ancestry populations have yielded dozens of robustly associated loci. However, the genomics of T2D remains largely understudied in sub-Saharan Africa (SSA), where rates of T2D are increasing dramatically and where the environmental background is quite different than in these previous studies. Here, we evaluate 106 reported T2D GWAS loci in continental Africans. We tested each of these SNPs, and SNPs in linkage disequilibrium (LD) with these index SNPs, for an association with T2D in order to assess transferability and to fine map the loci leveraging the generally reduced LD of African genomes. The study included 1775 unrelated Africans (1035 T2D cases, 740 controls; mean age 54 years; 59% female) enrolled in Nigeria, Ghana, and Kenya as part of the Africa America Diabetes Mellitus (AADM) study. All samples were genotyped on the Affymetrix Axiom PanAFR SNP array. Forty-one of the tested loci showed transferability to this African sample (p < 0.05, same direction of effect), 11 at the exact reported SNP and 30 others at SNPs in LD with the reported SNP (after adjustment for the number of tested SNPs). TCF7L2 SNP rs7903146 was the most significant locus in this study (p = 1.61 × 10−8). Most of the loci that showed transferability were successfully fine-mapped, i.e., localized to smaller haplotypes than in the original reports. The findings indicate that the genetic architecture of T2D in SSA is characterized by several risk loci shared with non-African ancestral populations and that data from African populations may facilitate fine mapping of risk loci. The study provides an important resource for meta-analysis of African ancestry populations and transferability of novel loci. PMID:26635871

  3. Genomics of Disease

    Technology Transfer Automated Retrieval System (TEKTRAN)

    This edited book represents the 23rd symposium in the Stadler Genetics Symposia series, and the general theme of this conference was "The Genomics of Disease." The 24 national and international speakers were invited to discuss their world-class research into the advances that genomics has made on c...

  4. Genomics for Weed Science

    Technology Transfer Automated Retrieval System (TEKTRAN)

    Numerous genomic-based studies have provided insight to the physiological and evolutionary processes involved in developmental and environmental processes of model plants such as arabidopsis and rice. However, far fewer efforts have been attempted to use genomic resources to study physiological and ...

  5. Unlocking the bovine genome

    Technology Transfer Automated Retrieval System (TEKTRAN)

    The draft genome sequence of cattle (Bos taurus) has now been analyzed by the Bovine Genome Sequencing and Analysis Consortium and the Bovine HapMap Consortium, which together represent an extensive collaboration involving more than 300 scientists from 25 different countries. ...

  6. Genetics and Genomics

    Technology Transfer Automated Retrieval System (TEKTRAN)

    Good progress is being made on genetics and genomics of sugar beet, however it is in process and the tools are now being generated and some results are being analyzed. The GABI BeetSeq project released a first draft of the sugar beet genome of KWS2320, a dihaploid (see http://bvseq.molgen.mpg.de/Gen...

  7. Development of Genomic GMACE

    Technology Transfer Automated Retrieval System (TEKTRAN)

    The use of genomics to enhance national genetic evaluation systems of dairy cattle is quickly becoming standard practice. The current MACE procedure used by Interbull may not accommodate these new “genomically-enhanced” national evaluations. An important assumption in MACE may no longer be valid in ...

  8. GENOME OF HORSEPOX VIRUS

    Technology Transfer Automated Retrieval System (TEKTRAN)

    Here we present the genomic sequence of horsepox virus (HSPV) isolate MNR-76, an orthopoxvirus (OPV) isolated in 1976 from diseased Mongolian horses. The 212 kbp genome contained 7.5 kbp inverted terminal repeats (ITR) and lacked extensive terminal tandem repetition. HSPV contained 236 ORFs with sim...

  9. Genomic Instability and Cancer

    PubMed Central

    Yao, Yixin; Dai, Wei

    2014-01-01

    Genomic instability is a characteristic of most cancer cells. It is an increased tendency of genome alteration during cell division. Cancer frequently results from damage to multiple genes controlling cell division and tumor suppressors. It is known that genomic integrity is closely monitored by several surveillance mechanisms, DNA damage checkpoint, DNA repair machinery and mitotic checkpoint. A defect in the regulation of any of these mechanisms often results in genomic instability, which predisposes the cell to malignant transformation. Posttranslational modifications of the histone tails are closely associated with regulation of the cell cycle as well as chromatin structure. Nevertheless, DNA methylation status is also related to genomic integrity. We attempt to summarize recent developments in this field and discuss the debate of driving force of tumor initiation and progression. PMID:25541596

  10. Microbial Genomes Multiply

    NASA Technical Reports Server (NTRS)

    Doolittle, Russell F.

    2002-01-01

    The publication of the first complete sequence of a bacterial genome in 1995 was a signal event, underscored by the fact that the article has been cited more than 2,100 times during the intervening seven years. It was a marvelous technical achievement, made possible by automatic DNA-sequencing machines. The feat is the more impressive in that complete genome sequencing has now been adopted in many different laboratories around the world. Four years ago in these columns I examined the situation after a dozen microbial genomes had been completed. Now, with upwards of 60 microbial genome sequences determined and twice that many in progress, it seems reasonable to assess just what is being learned. Are new concepts emerging about how cells work? Have there been practical benefits in the fields of medicine and agriculture? Is it feasible to determine the genomic sequence of every bacterial species on Earth? The answers to these questions maybe Yes, Perhaps, and No, respectively.

  11. An Integrated View of Gene Expression and Solute Profiles of Arabidopsis Tumors: A Genome-Wide Approach[W

    PubMed Central

    Deeken, Rosalia; Engelmann, Julia C.; Efetova, Marina; Czirjak, Tina; Müller, Tobias; Kaiser, Werner M.; Tietz, Olaf; Krischke, Markus; Mueller, Martin J.; Palme, Klaus; Dandekar, Thomas; Hedrich, Rainer

    2006-01-01

    Transformation of plant cells with T-DNA of virulent agrobacteria is one of the most extreme triggers of developmental changes in higher plants. For rapid growth and development of resulting tumors, specific changes in the gene expression profile and metabolic adaptations are required. Increased transport and metabolic fluxes are critical preconditions for growth and tumor development. A functional genomics approach, using the Affymetrix whole genome microarray (∼22,800 genes), was applied to measure changes in gene expression. The solute pattern of Arabidopsis thaliana tumors and uninfected plant tissues was compared with the respective gene expression profile. Increased levels of anions, sugars, and amino acids were correlated with changes in the gene expression of specific enzymes and solute transporters. The expression profile of genes pivotal for energy metabolism, such as those involved in photosynthesis, mitochondrial electron transport, and fermentation, suggested that tumors produce C and N compounds heterotrophically and gain energy mainly anaerobically. Thus, understanding of gene-to-metabolite networks in plant tumors promotes the identification of mechanisms that control tumor development. PMID:17172353

  12. Statistical Genomic Approach Identifies Association between FSHR Polymorphisms and Polycystic Ovary Morphology in Women with Polycystic Ovary Syndrome

    PubMed Central

    Du, Tao; Duan, Yu; Li, Kaiwen; Zhao, Xiaomiao; Ni, Renmin; Li, Yu; Yang, Dongzi

    2015-01-01

    Background. Single-nucleotide polymorphisms (SNPs) in the follicle stimulating hormone receptor (FSHR) gene are associated with PCOS. However, their relationship to the polycystic ovary (PCO) morphology remains unknown. This study aimed to investigate whether PCOS related SNPs in the FSHR gene are associated with PCO in women with PCOS. Methods. Patients were grouped into PCO (n = 384) and non-PCO (n = 63) groups. Genomic genotypes were profiled using Affymetrix human genome SNP chip 6. Two polymorphisms (rs2268361 and rs2349415) of FSHR were analyzed using a statistical approach. Results. Significant differences were found in the allele distributions of the GG genotype of rs2268361 between the PCO and non-PCO groups (27.6% GG, 53.4% GA, and 19.0% AA versus 33.3% GG, 36.5% GA, and 30.2% AA), while no significant differences were found in the allele distributions of the GG genotype of rs2349415. When rs2268361 was considered, there were statistically significant differences of serum follicle stimulating hormone, estradiol, and sex hormone binding globulin between genotypes in the PCO group. In case of the rs2349415 SNP, only serum sex hormone binding globulin was statistically different between genotypes in the PCO group. Conclusions. Functional variants in FSHR gene may contribute to PCO susceptibility in women with PCOS. PMID:26273622

  13. High-resolution genome-wide linkage mapping identifies susceptibility loci for BMI in the Chinese population.

    PubMed

    Zhang, Dong Feng; Pang, Zengchang; Li, Shuxia; Thomassen, Mads; Wang, Shaojie; Jiang, Wengjie; Hjelmborg, Jacob v B; Kruse, Torben A; Kyvik, Kirsten O; Christensen, Kaare; Tan, Qihua

    2012-04-01

    The genetic loci affecting the commonly used BMI have been intensively investigated using linkage approaches in multiple populations. This study aims at performing the first genome-wide linkage scan on BMI in the Chinese population in mainland China with hypothesis that heterogeneity in genetic linkage could exist in different ethnic populations. BMI was measured from 126 dizygotic twins in Qingdao municipality who were genotyped using high-resolution Affymetrix Genome-Wide Human SNP arrays containing about 1 million single-nucleotide polymorphisms (SNPs). Nonparametric linkage analysis was performed with Merlin software package for linkage analysis using variance components approach for quantitative trait loci mapping. We identified a strong linkage peak at the end of chromosome 7 (7q36 at 186 cM) with a lod score of 4.06 which overlaps with that reported by a large multicenter study in western countries. Multiple loci showing suggestive linkage were found on chromosome 1 (lod score 2.38 at 242 cM), chromosome 8 (2.48 at 95 cM), and chromosome 14 (2.2 at 89.4 cM). The strong linkage identified in the Chinese subjects that is consistent with that found in populations of European origin could suggest the existence of evolutionarily preserved genetic mechanisms for BMI whereas the multiple suggestive loci could represent genetic effect from gene-environment interaction as a result of population-specific environmental adaptation. PMID:21273998

  14. Phytozome Comparative Plant Genomics Portal

    SciTech Connect

    Goodstein, David; Batra, Sajeev; Carlson, Joseph; Hayes, Richard; Phillips, Jeremy; Shu, Shengqiang; Schmutz, Jeremy; Rokhsar, Daniel

    2014-09-09

    The Dept. of Energy Joint Genome Institute is a genomics user facility supporting DOE mission science in the areas of Bioenergy, Carbon Cycling, and Biogeochemistry. The Plant Program at the JGI applies genomic, analytical, computational and informatics platforms and methods to: 1. Understand and accelerate the improvement (domestication) of bioenergy crops 2. Characterize and moderate plant response to climate change 3. Use comparative genomics to identify constrained elements and infer gene function 4. Build high quality genomic resource platforms of JGI Plant Flagship genomes for functional and experimental work 5. Expand functional genomic resources for Plant Flagship genomes

  15. Genome size evolution: sizing mammalian genomes.

    PubMed

    Redi, C A; Capanna, E

    2012-01-01

    The study of genome size (GS) and its variation is so fascinating to the scientific community because it constitutes the link between the present-day analytical and molecular studies of the genome and the old trunk of the holistic and synthetic view of the genome. The GS of several taxa vary over a broad range and do not correlate with the complexity of the organisms (the C-value paradox). However, the biology of transposable elements has let us reach a satisfactory view of the molecular mechanisms that give rise to GS variation and novelties, providing a less perplexing view of the significance of the GS (C-enigma). The knowledge of the composition and structure of a genome is a pre-requisite for trying to understand the evolution of the main genome signature: its size. The radiation of mammals provides an approximately 180-million-year test case for theories of how GS evolves. It has been found from data-mining GS databases that GS is a useful cyto-taxonomical instrument at the level of orders/superorders, providing genomic signatures characterizing Monotremata, Marsupialia, Afrotheria, Xenarthra, Laurasiatheria, and Euarchontoglires. A hypothetical ancestral mammalian-like GS of 2.9-3.7 pg has been suggested. This value appears compatible with the average values calculated for the high systematic levels of the extant Monotremata (∼2.97 pg) and Marsupialia (∼4.07 pg), suggesting invasion of mobile DNA elements concurrently with the separation of the older clades of Afrotheria (∼5.5 pg) and Xenarthra (∼4.5 pg) with larger GS, leaving the Euarchontoglires (∼3.4 pg) and Laurasiatheria (∼2.8 pg) genomes with fewer transposable elements. However, the paucity of GS data (546 mammalian species sized from 5,488 living species) for species, genera, and families calls for caution. Considering that mammalian species may be vanished even before they are known, GS data are sorely needed to phenotype the effects brought about by their variation and to validate any

  16. Evolution of genome architecture.

    PubMed

    Koonin, Eugene V

    2009-02-01

    Charles Darwin believed that all traits of organisms have been honed to near perfection by natural selection. The empirical basis underlying Darwin's conclusions consisted of numerous observations made by him and other naturalists on the exquisite adaptations of animals and plants to their natural habitats and on the impressive results of artificial selection. Darwin fully appreciated the importance of heredity but was unaware of the nature and, in fact, the very existence of genomes. A century and a half after the publication of the "Origin", we have the opportunity to draw conclusions from the comparisons of hundreds of genome sequences from all walks of life. These comparisons suggest that the dominant mode of genome evolution is quite different from that of the phenotypic evolution. The genomes of vertebrates, those purported paragons of biological perfection, turned out to be veritable junkyards of selfish genetic elements where only a small fraction of the genetic material is dedicated to encoding biologically relevant information. In sharp contrast, genomes of microbes and viruses are incomparably more compact, with most of the genetic material assigned to distinct biological functions. However, even in these genomes, the specific genome organization (gene order) is poorly conserved. The results of comparative genomics lead to the conclusion that the genome architecture is not a straightforward result of continuous adaptation but rather is determined by the balance between the selection pressure, that is itself dependent on the effective population size and mutation rate, the level of recombination, and the activity of selfish elements. Although genes and, in many cases, multigene regions of genomes possess elaborate architectures that ensure regulation of expression, these arrangements are evolutionarily volatile and typically change substantially even on short evolutionary scales when gene sequences diverge minimally. Thus, the observed genome

  17. Copy Number Variation of UGT 2B Genes in Indian Families Using Whole Genome Scans

    PubMed Central

    Veerappa, Avinash M.; Padakannaya, Prakash; Ramachandra, Nallur B.

    2016-01-01

    Background and Objectives. Uridine diphospho-glucuronosyltransferase 2B (UGT2B) is a family of genes involved in metabolizing steroid hormones and several other xenobiotics. These UGT2B genes are highly polymorphic in nature and have distinct polymorphisms associated with specific regions around the globe. Copy number variations (CNVs) status of UGT2B17 in Indian population is not known and their disease associations have been inconclusive. It was therefore of interest to investigate the CNV profile of UGT2B genes. Methods. We investigated the presence of CNVs in UGT2B genes in 31 members from eight Indian families using Affymetrix Genome-Wide Human SNP Array 6.0 chip. Results. Our data revealed >50% of the study members carried CNVs in UGT2B genes, of which 76% showed deletion polymorphism. CNVs were observed more in UGT2B17 (76.4%) than in UGT2B15 (17.6%). Molecular network and pathway analysis found enrichment related to steroid metabolic process, carboxylesterase activity, and sequence specific DNA binding. Interpretation and Conclusion. We report the presence of UGT2B gene deletion and duplication polymorphisms in Indian families. Network analysis indicates the substitutive role of other possible genes in the UGT activity. The CNVs of UGT2B genes are very common in individuals indicating that the effect is neutral in causing any suspected diseases. PMID:27092269

  18. Who are the Okinawans? Ancestry, genome diversity, and implications for the genetic study of human longevity from a geographically isolated population.

    PubMed

    Bendjilali, Nasrine; Hsueh, Wen-Chi; He, Qimei; Willcox, D Craig; Nievergelt, Caroline M; Donlon, Timothy A; Kwok, Pui-Yan; Suzuki, Makoto; Willcox, Bradley J

    2014-12-01

    Isolated populations have advantages for genetic studies of longevity from decreased haplotype diversity and long-range linkage disequilibrium. This permits smaller sample sizes without loss of power, among other utilities. Little is known about the genome of the Okinawans, a potential population isolate, recognized for longevity. Therefore, we assessed genetic diversity, structure, and admixture in Okinawans, and compared this with Caucasians, Chinese, Japanese, and Africans from HapMap II, genotyped on the same Affymetrix GeneChip Human Mapping 500K array. Principal component analysis, haplotype coverage, and linkage disequilibrium decay revealed a distinct Okinawan genome-more homogeneity, less haplotype diversity, and longer range linkage disequilibrium. Population structure and admixture analyses utilizing 52 global reference populations from the Human Genome Diversity Cell Line Panel demonstrated that Okinawans clustered almost exclusively with East Asians. Sibling relative risk (λs) analysis revealed that siblings of Okinawan centenarians have 3.11 times (females) and 3.77 times (males) more likelihood of centenarianism. These findings suggest that Okinawans are genetically distinct and share several characteristics of a population isolate, which are prone to develop extreme phenotypes (eg, longevity) from genetic drift, natural selection, and population bottlenecks. These data support further exploration of genetic influence on longevity in the Okinawans. PMID:24444611

  19. Integrated genome-based studies of Shewanella ecophysiology

    SciTech Connect

    Segre Daniel; Beg Qasim

    2012-02-14

    This project was a component of the Shewanella Federation and, as such, contributed to the overall goal of applying the genomic tools to better understand eco-physiology and speciation of respiratory-versatile members of Shewanella genus. Our role at Boston University was to perform bioreactor and high throughput gene expression microarrays, and combine dynamic flux balance modeling with experimentally obtained transcriptional and gene expression datasets from different growth conditions. In the first part of project, we designed the S. oneidensis microarray probes for Affymetrix Inc. (based in California), then we identified the pathways of carbon utilization in the metal-reducing marine bacterium Shewanella oneidensis MR-1, using our newly designed high-density oligonucleotide Affymetrix microarray on Shewanella cells grown with various carbon sources. Next, using a combination of experimental and computational approaches, we built algorithm and methods to integrate the transcriptional and metabolic regulatory networks of S. oneidensis. Specifically, we combined mRNA microarray and metabolite measurements with statistical inference and dynamic flux balance analysis (dFBA) to study the transcriptional response of S. oneidensis MR-1 as it passes through exponential, stationary, and transition phases. By measuring time-dependent mRNA expression levels during batch growth of S. oneidensis MR-1 under two radically different nutrient compositions (minimal lactate and nutritionally rich LB medium), we obtain detailed snapshots of the regulatory strategies used by this bacterium to cope with gradually changing nutrient availability. In addition to traditional clustering, which provides a first indication of major regulatory trends and transcription factors activities, we developed and implemented a new computational approach for Dynamic Detection of Transcriptional Triggers (D2T2). This new method allows us to infer a putative topology of transcriptional dependencies

  20. The Banana Genome Hub

    PubMed Central

    Droc, Gaëtan; Larivière, Delphine; Guignon, Valentin; Yahiaoui, Nabila; This, Dominique; Garsmeur, Olivier; Dereeper, Alexis; Hamelin, Chantal; Argout, Xavier; Dufayard, Jean-François; Lengelle, Juliette; Baurens, Franc-Christophe; Cenci, Alberto; Pitollat, Bertrand; D’Hont, Angélique; Ruiz, Manuel; Rouard, Mathieu; Bocs, Stéphanie

    2013-01-01

    Banana is one of the world’s favorite fruits and one of the most important crops for developing countries. The banana reference genome sequence (Musa acuminata) was recently released. Given the taxonomic position of Musa, the completed genomic sequence has particular comparative value to provide fresh insights about the evolution of the monocotyledons. The study of the banana genome has been enhanced by a number of tools and resources that allows harnessing its sequence. First, we set up essential tools such as a Community Annotation System, phylogenomics resources and metabolic pathways. Then, to support post-genomic efforts, we improved banana existing systems (e.g. web front end, query builder), we integrated available Musa data into generic systems (e.g. markers and genetic maps, synteny blocks), we have made interoperable with the banana hub, other existing systems containing Musa data (e.g. transcriptomics, rice reference genome, workflow manager) and finally, we generated new results from sequence analyses (e.g. SNP and polymorphism analysis). Several uses cases illustrate how the Banana Genome Hub can be used to study gene families. Overall, with this collaborative effort, we discuss the importance of the interoperability toward data integration between existing information systems. Database URL: http://banana-genome.cirad.fr/ PMID:23707967

  1. Genomic Insights into Bifidobacteria

    PubMed Central

    Lee, Ju-Hoon; O'Sullivan, Daniel J.

    2010-01-01

    Summary: Since the discovery in 1899 of bifidobacteria as numerically dominant microbes in the feces of breast-fed infants, there have been numerous studies addressing their role in modulating gut microflora as well as their other potential health benefits. Because of this, they are frequently incorporated into foods as probiotic cultures. An understanding of their full interactions with intestinal microbes and the host is needed to scientifically validate any health benefits they may afford. Recently, the genome sequences of nine strains representing four species of Bifidobacterium became available. A comparative genome analysis of these genomes reveals a likely efficient capacity to adapt to their habitats, with B. longum subsp. infantis exhibiting more genomic potential to utilize human milk oligosaccharides, consistent with its habitat in the infant gut. Conversely, B. longum subsp. longum exhibits a higher genomic potential for utilization of plant-derived complex carbohydrates and polyols, consistent with its habitat in an adult gut. An intriguing observation is the loss of much of this genome potential when strains are adapted to pure culture environments, as highlighted by the genomes of B. animalis subsp. lactis strains, which exhibit the least potential for a gut habitat and are believed to have evolved from the B. animalis species during adaptation to dairy fermentation environments. PMID:20805404

  2. Ensembl comparative genomics resources

    PubMed Central

    Muffato, Matthieu; Beal, Kathryn; Fitzgerald, Stephen; Gordon, Leo; Pignatelli, Miguel; Vilella, Albert J.; Searle, Stephen M. J.; Amode, Ridwan; Brent, Simon; Spooner, William; Kulesha, Eugene; Yates, Andrew; Flicek, Paul

    2016-01-01

    Evolution provides the unifying framework with which to understand biology. The coherent investigation of genic and genomic data often requires comparative genomics analyses based on whole-genome alignments, sets of homologous genes and other relevant datasets in order to evaluate and answer evolutionary-related questions. However, the complexity and computational requirements of producing such data are substantial: this has led to only a small number of reference resources that are used for most comparative analyses. The Ensembl comparative genomics resources are one such reference set that facilitates comprehensive and reproducible analysis of chordate genome data. Ensembl computes pairwise and multiple whole-genome alignments from which large-scale synteny, per-base conservation scores and constrained elements are obtained. Gene alignments are used to define Ensembl Protein Families, GeneTrees and homologies for both protein-coding and non-coding RNA genes. These resources are updated frequently and have a consistent informatics infrastructure and data presentation across all supported species. Specialized web-based visualizations are also available including synteny displays, collapsible gene tree plots, a gene family locator and different alignment views. The Ensembl comparative genomics infrastructure is extensively reused for the analysis of non-vertebrate species by other projects including Ensembl Genomes and Gramene and much of the information here is relevant to these projects. The consistency of the annotation across species and the focus on vertebrates makes Ensembl an ideal system to perform and support vertebrate comparative genomic analyses. We use robust software and pipelines to produce reference comparative data and make it freely available. Database URL: http://www.ensembl.org. PMID:26896847

  3. Genome instability and aging.

    PubMed

    Vijg, Jan; Suh, Yousin

    2013-01-01

    Genome instability has long been implicated as the main causal factor in aging. Somatic cells are continuously exposed to various sources of DNA damage, from reactive oxygen species to UV radiation to environmental mutagens. To cope with the tens of thousands of chemical lesions introduced into the genome of a typical cell each day, a complex network of genome maintenance systems acts to remove damage and restore the correct base pair sequence. Occasionally, however, repair is erroneous, and such errors, as well as the occasional failure to correctly replicate the genome during cell division, are the basis for mutations and epimutations. There is now ample evidence that mutations accumulate in various organs and tissues of higher animals, including humans, mice, and flies. What is not known, however, is whether the frequency of these random changes is sufficient to cause the phenotypic effects generally associated with aging. The exception is cancer, an age-related disease caused by the accumulation of mutations and epimutations. Here, we first review current concepts regarding the relationship between DNA damage, repair, and mutation, as well as the data regarding genome alterations as a function of age. We then describe a model for how randomly induced DNA sequence and epigenomic variants in the somatic genomes of animals can result in functional decline and disease in old age. Finally, we discuss the genetics of genome instability in relation to longevity to address the importance of alterations in the somatic genome as a causal factor in aging and to underscore the opportunities provided by genetic approaches to develop interventions that attenuate genome instability, reduce disease risk, and increase life span. PMID:23398157

  4. Ensembl comparative genomics resources.

    PubMed

    Herrero, Javier; Muffato, Matthieu; Beal, Kathryn; Fitzgerald, Stephen; Gordon, Leo; Pignatelli, Miguel; Vilella, Albert J; Searle, Stephen M J; Amode, Ridwan; Brent, Simon; Spooner, William; Kulesha, Eugene; Yates, Andrew; Flicek, Paul

    2016-01-01

    Evolution provides the unifying framework with which to understand biology. The coherent investigation of genic and genomic data often requires comparative genomics analyses based on whole-genome alignments, sets of homologous genes and other relevant datasets in order to evaluate and answer evolutionary-related questions. However, the complexity and computational requirements of producing such data are substantial: this has led to only a small number of reference resources that are used for most comparative analyses. The Ensembl comparative genomics resources are one such reference set that facilitates comprehensive and reproducible analysis of chordate genome data. Ensembl computes pairwise and multiple whole-genome alignments from which large-scale synteny, per-base conservation scores and constrained elements are obtained. Gene alignments are used to define Ensembl Protein Families, GeneTrees and homologies for both protein-coding and non-coding RNA genes. These resources are updated frequently and have a consistent informatics infrastructure and data presentation across all supported species. Specialized web-based visualizations are also available including synteny displays, collapsible gene tree plots, a gene family locator and different alignment views. The Ensembl comparative genomics infrastructure is extensively reused for the analysis of non-vertebrate species by other projects including Ensembl Genomes and Gramene and much of the information here is relevant to these projects. The consistency of the annotation across species and the focus on vertebrates makes Ensembl an ideal system to perform and support vertebrate comparative genomic analyses. We use robust software and pipelines to produce reference comparative data and make it freely available. Database URL: http://www.ensembl.org. PMID:26896847

  5. Center for Cancer Genomics | Office of Cancer Genomics

    Cancer.gov

    The Center for Cancer Genomics (CCG) was established to unify the National Cancer Institute's activities in cancer genomics, with the goal of advancing genomics research and translating findings into the clinic to improve the precise diagnosis and treatment of cancers. In addition to promoting genomic sequencing approach

  6. Genetic Diversity of the Q Fever Agent, Coxiella burnetii, Assessed by Microarray-Based Whole-Genome Comparisons†

    PubMed Central

    Beare, Paul A.; Samuel, James E.; Howe, Dale; Virtaneva, Kimmo; Porcella, Stephen F.; Heinzen, Robert A.

    2006-01-01

    Coxiella burnetii, a gram-negative obligate intracellular bacterium, causes human Q fever and is considered a potential agent of bioterrorism. Distinct genomic groups of C. burnetii are revealed by restriction fragment-length polymorphisms (RFLP). Here we comprehensively define the genetic diversity of C. burnetii by hybridizing the genomes of 20 RFLP-grouped and four ungrouped isolates from disparate sources to a high-density custom Affymetrix GeneChip containing all open reading frames (ORFs) of the Nine Mile phase I (NMI) reference isolate. We confirmed the relatedness of RFLP-grouped isolates and showed that two ungrouped isolates represent distinct genomic groups. Isolates contained up to 20 genomic polymorphisms consisting of 1 to 18 ORFs each. These were mostly complete ORF deletions, although partial deletions, point mutations, and insertions were also identified. A total of 139 chromosomal and plasmid ORFs were polymorphic among all C. burnetii isolates, representing ca. 7% of the NMI coding capacity. Approximately 67% of all deleted ORFs were hypothetical, while 9% were annotated in NMI as nonfunctional (e.g., frameshifted). The remaining deleted ORFs were associated with diverse cellular functions. The only deletions associated with isogenic NMI variants of attenuated virulence were previously described large deletions containing genes involved in lipopolysaccharide (LPS) biosynthesis, suggesting that these polymorphisms alone are responsible for the lower virulence of these variants. Interestingly, a variant of the Australia QD isolate producing truncated LPS had no detectable deletions, indicating LPS truncation can occur via small genetic changes. Our results provide new insight into the genetic diversity and virulence potential of Coxiella species. PMID:16547017

  7. Genome-wide association study identifies a maternal copy-number deletion in PSG11 enriched among preeclampsia patients

    PubMed Central

    2012-01-01

    Background Specific genetic contributions for preeclampsia (PE) are currently unknown. This genome-wide association study (GWAS) aims to identify maternal single nucleotide polymorphisms (SNPs) and copy-number variants (CNVs) involved in the etiology of PE. Methods A genome-wide scan was performed on 177 PE cases (diagnosed according to National Heart, Lung and Blood Institute guidelines) and 116 normotensive controls. White female study subjects from Iowa were genotyped on Affymetrix SNP 6.0 microarrays. CNV calls made using a combination of four detection algorithms (Birdseye, Canary, PennCNV, and QuantiSNP) were merged using CNVision and screened with stringent prioritization criteria. Due to limited DNA quantities and the deleterious nature of copy-number deletions, it was decided a priori that only deletions would be selected for assay on the entire case-control dataset using quantitative real-time PCR. Results The top four SNP candidates had an allelic or genotypic p-value between 10-5 and 10-6, however, none surpassed the Bonferroni-corrected significance threshold. Three recurrent rare deletions meeting prioritization criteria detected in multiple cases were selected for targeted genotyping. A locus of particular interest was found showing an enrichment of case deletions in 19q13.31 (5/169 cases and 1/114 controls), which encompasses the PSG11 gene contiguous to a highly plastic genomic region. All algorithm calls for these regions were assay confirmed. Conclusions CNVs may confer risk for PE and represent interesting regions that warrant further investigation. Top SNP candidates identified from the GWAS, although not genome-wide significant, may be useful to inform future studies in PE genetics. PMID:22748001

  8. Human Genome Project

    SciTech Connect

    Block, S.; Cornwall, J.; Dally, W.; Dyson, F.; Fortson, N.; Joyce, G.; Kimble, H. J.; Lewis, N.; Max, C.; Prince, T.; Schwitters, R.; Weinberger, P.; Woodin, W. H.

    1998-01-04

    The study reviews Department of Energy supported aspects of the United States Human Genome Project, the joint National Institutes of Health/Department of Energy program to characterize all human genetic material, to discover the set of human genes, and to render them accessible for further biological study. The study concentrates on issues of technology, quality assurance/control, and informatics relevant to current effort on the genome project and needs beyond it. Recommendations are presented on areas of the genome program that are of particular interest to and supported by the Department of Energy.

  9. Genomic taxonomy of vibrios

    PubMed Central

    Thompson, Cristiane C; Vicente, Ana Carolina P; Souza, Rangel C; Vasconcelos, Ana Tereza R; Vesth, Tammi; Alves, Nelson; Ussery, David W; Iida, Tetsuya; Thompson, Fabiano L

    2009-01-01

    Background Vibrio taxonomy has been based on a polyphasic approach. In this study, we retrieve useful taxonomic information (i.e. data that can be used to distinguish different taxonomic levels, such as species and genera) from 32 genome sequences of different vibrio species. We use a variety of tools to explore the taxonomic relationship between the sequenced genomes, including Multilocus Sequence Analysis (MLSA), supertrees, Average Amino Acid Identity (AAI), genomic signatures, and Genome BLAST atlases. Our aim is to analyse the usefulness of these tools for species identification in vibrios. Results We have generated four new genome sequences of three Vibrio species, i.e., V. alginolyticus 40B, V. harveyi-like 1DA3, and V. mimicus strains VM573 and VM603, and present a broad analyses of these genomes along with other sequenced Vibrio species. The genome atlas and pangenome plots provide a tantalizing image of the genomic differences that occur between closely related sister species, e.g. V. cholerae and V. mimicus. The vibrio pangenome contains around 26504 genes. The V. cholerae core genome and pangenome consist of 1520 and 6923 genes, respectively. Pangenomes might allow different strains of V. cholerae to occupy different niches. MLSA and supertree analyses resulted in a similar phylogenetic picture, with a clear distinction of four groups (Vibrio core group, V. cholerae-V. mimicus, Aliivibrio spp., and Photobacterium spp.). A Vibrio species is defined as a group of strains that share > 95% DNA identity in MLSA and supertree analysis, > 96% AAI, ≤ 10 genome signature dissimilarity, and > 61% proteome identity. Strains of the same species and species of the same genus will form monophyletic groups on the basis of MLSA and supertree. Conclusion The combination of different analytical and bioinformatics tools will enable the most accurate species identification through genomic computational analysis. This endeavour will culminate in the birth of the online

  10. Human Genome Program

    SciTech Connect

    Not Available

    1993-01-01

    The DOE Human Genome program has grown tremendously, as shown by the marked increase in the number of genome-funded projects since the last workshop held in 1991. The abstracts in this book describe the genome research of DOE-funded grantees and contractors and invited guests, and all projects are represented at the workshop by posters. The 3-day meeting includes plenary sessions on ethical, legal, and social issues pertaining to the availability of genetic data; sequencing techniques, informatics support; and chromosome and cDNA mapping and sequencing.

  11. What Is a Genome?

    PubMed Central

    Goldman, Aaron David; Landweber, Laura F.

    2016-01-01

    The genome is often described as the information repository of an organism. Whether millions or billions of letters of DNA, its transmission across generations confers the principal medium for inheritance of organismal traits. Several emerging areas of research demonstrate that this definition is an oversimplification. Here, we explore ways in which a deeper understanding of genomic diversity and cell physiology is challenging the concepts of physical permanence attached to the genome as well as its role as the sole information source for an organism. PMID:27442251

  12. Comparative primate genomics: emerging patterns of genome content and dynamics

    PubMed Central

    Rogers, Jeffrey; Gibbs, Richard A.

    2014-01-01

    Preface Advances in genome sequencing technologies have created new opportunities for comparative primate genomics. Genome assemblies have been published for several primates, with analyses of several others underway. Whole genome assemblies for the great apes provide remarkable new information about the evolutionary origins of the human genome and the processes involved. Genomic data for macaques and other nonhuman primates provide valuable insight into genetic similarities and differences among species used as models for disease-related research. This review summarizes current knowledge regarding primate genome content and dynamics and offers a series of goals for the near future. PMID:24709753

  13. Comparative primate genomics: emerging patterns of genome content and dynamics.

    PubMed

    Rogers, Jeffrey; Gibbs, Richard A

    2014-05-01

    Advances in genome sequencing technologies have created new opportunities for comparative primate genomics. Genome assemblies have been published for various primate species, and analyses of several others are underway. Whole-genome assemblies for the great apes provide remarkable new information about the evolutionary origins of the human genome and the processes involved. Genomic data for macaques and other non-human primates offer valuable insights into genetic similarities and differences among species that are used as models for disease-related research. This Review summarizes current knowledge regarding primate genome content and dynamics, and proposes a series of goals for the near future. PMID:24709753

  14. GenomeView: a next-generation genome browser

    PubMed Central

    Abeel, Thomas; Van Parys, Thomas; Saeys, Yvan; Galagan, James; Van de Peer, Yves

    2012-01-01

    Due to ongoing advances in sequencing technologies, billions of nucleotide sequences are now produced on a daily basis. A major challenge is to visualize these data for further downstream analysis. To this end, we present GenomeView, a stand-alone genome browser specifically designed to visualize and manipulate a multitude of genomics data. GenomeView enables users to dynamically browse high volumes of aligned short-read data, with dynamic navigation and semantic zooming, from the whole genome level to the single nucleotide. At the same time, the tool enables visualization of whole genome alignments of dozens of genomes relative to a reference sequence. GenomeView is unique in its capability to interactively handle huge data sets consisting of tens of aligned genomes, thousands of annotation features and millions of mapped short reads both as viewer and editor. GenomeView is freely available as an open source software package. PMID:22102585

  15. Hymenoptera Genome Database: integrating genome annotations in HymenopteraMine

    PubMed Central

    Elsik, Christine G.; Tayal, Aditi; Diesh, Colin M.; Unni, Deepak R.; Emery, Marianne L.; Nguyen, Hung N.; Hagen, Darren E.

    2016-01-01

    We report an update of the Hymenoptera Genome Database (HGD) (http://HymenopteraGenome.org), a model organism database for insect species of the order Hymenoptera (ants, bees and wasps). HGD maintains genomic data for 9 bee species, 10 ant species and 1 wasp, including the versions of genome and annotation data sets published by the genome sequencing consortiums and those provided by NCBI. A new data-mining warehouse, HymenopteraMine, based on the InterMine data warehousing system, integrates the genome data with data from external sources and facilitates cross-species analyses based on orthology. New genome browsers and annotation tools based on JBrowse/WebApollo provide easy genome navigation, and viewing of high throughput sequence data sets and can be used for collaborative genome annotation. All of the genomes and annotation data sets are combined into a single BLAST server that allows users to select and combine sequence data sets to search. PMID:26578564

  16. Hymenoptera Genome Database: integrating genome annotations in HymenopteraMine.

    PubMed

    Elsik, Christine G; Tayal, Aditi; Diesh, Colin M; Unni, Deepak R; Emery, Marianne L; Nguyen, Hung N; Hagen, Darren E

    2016-01-01

    We report an update of the Hymenoptera Genome Database (HGD) (http://HymenopteraGenome.org), a model organism database for insect species of the order Hymenoptera (ants, bees and wasps). HGD maintains genomic data for 9 bee species, 10 ant species and 1 wasp, including the versions of genome and annotation data sets published by the genome sequencing consortiums and those provided by NCBI. A new data-mining warehouse, HymenopteraMine, based on the InterMine data warehousing system, integrates the genome data with data from external sources and facilitates cross-species analyses based on orthology. New genome browsers and annotation tools based on JBrowse/WebApollo provide easy genome navigation, and viewing of high throughput sequence data sets and can be used for collaborative genome annotation. All of the genomes and annotation data sets are combined into a single BLAST server that allows users to select and combine sequence data sets to search. PMID:26578564

  17. Clinal distribution of human genomic diversity across the Netherlands despite archaeological evidence for genetic discontinuities in Dutch population history

    PubMed Central

    2013-01-01

    Background The presence of a southeast to northwest gradient across Europe in human genetic diversity is a well-established observation and has recently been confirmed by genome-wide single nucleotide polymorphism (SNP) data. This pattern is traditionally explained by major prehistoric human migration events in Palaeolithic and Neolithic times. Here, we investigate whether (similar) spatial patterns in human genomic diversity also occur on a micro-geographic scale within Europe, such as in the Netherlands, and if so, whether these patterns could also be explained by more recent demographic events, such as those that occurred in Dutch population history. Methods We newly collected data on a total of 999 Dutch individuals sampled at 54 sites across the country at 443,816 autosomal SNPs using the Genome-Wide Human SNP Array 5.0 (Affymetrix). We studied the individual genetic relationships by means of classical multidimensional scaling (MDS) using different genetic distance matrices, spatial ancestry analysis (SPA), and ADMIXTURE software. We further performed dedicated analyses to search for spatial patterns in the genomic variation and conducted simulations (SPLATCHE2) to provide a historical interpretation of the observed spatial patterns. Results We detected a subtle but clearly noticeable genomic population substructure in the Dutch population, allowing differentiation of a north-eastern, central-western, central-northern and a southern group. Furthermore, we observed a statistically significant southeast to northwest cline in the distribution of genomic diversity across the Netherlands, similar to earlier findings from across Europe. Simulation analyses indicate that this genomic gradient could similarly be caused by ancient as well as by the more recent events in Dutch history. Conclusions Considering the strong archaeological evidence for genetic discontinuity in the Netherlands, we interpret the observed clinal pattern of genomic diversity as being caused by

  18. Vita Genomics, Inc.

    PubMed

    Shih-Hsin Wu, Lawrence; Su, Chun-Lin; Chen, Ellson

    2007-06-01

    Vita Genomics, Inc., centered in Taiwan and China, aims to be a premier genomics-based biotechnological and biopharmaceutical company in the Asia-Pacific region. The company focuses on conducting pharmacogenomics research, in vitro diagnosis product development and specialty contract research services in both genomics and pharmacogenomics fields. We are now initiating a drug rescue program designed to resurrect drugs that have failed in the previous clinical trials owing to low efficacies. This program applies pharmacogenomics approaches using biomarkers to screen subsets of patients who may respond better or avoid adverse responses to the test drugs. Vita Genomics, Inc. has envisioned itself as an important player in the healthcare industry offering advanced molecular diagnostic products and services, revolutionizing thedrug-development process and providing pharmacogenomic solutions. PMID:17559355

  19. Lophotrochozoan mitochondrial genomes

    SciTech Connect

    Valles, Yvonne; Boore, Jeffrey L.

    2005-10-01

    Progress in both molecular techniques and phylogeneticmethods has challenged many of the interpretations of traditionaltaxonomy. One example is in the recognition of the animal superphylumLophotrochozoa (annelids, mollusks, echiurans, platyhelminthes,brachiopods, and other phyla), although the relationships within thisgroup and the inclusion of some phyla remain uncertain. While much ofthis progress in phylogenetic reconstruction has been based on comparingsingle gene sequences, we are beginning to see the potential of comparinglarge-scale features of genomes, such as the relative order of genes.Even though tremendous progress is being made on the sequencedetermination of whole nuclear genomes, the dataset of choice forgenome-level characters for many animals across a broad taxonomic rangeremains mitochondrial genomes. We review here what is known aboutmitochondrial genomes of the lophotrochozoans and discuss the promisethat this dataset will enable insight into theirrelationships.

  20. Androgen receptor genomic regulation

    PubMed Central

    Jin, Hong-Jian; Kim, Jung

    2013-01-01

    The transcriptional activity of the androgen receptor (AR) is not only critical for the normal development and function of the prostate but also pivotal to the onset and progression of prostate cancer (PCa). The studies of AR transcriptional regulation were previously limited to a handful of AR-target genes. Owing to the development of various high-throughput genomic technologies, significant advances have been made in recent years. Here we discuss the discoveries of genome-wide androgen-regulated genes in PCa cell lines, animal models and tissues using expression microarray and sequencing, the mapping of genomic landscapes of AR using Combining Chromatin Immunoprecipitation (ChIP)-on-chip and ChIP-seq assays, the interplay of transcriptional cofactors in defining AR binding profiles, and the genomic regulation and AR reprogramming in advanced PCa. PMID:25237629

  1. Mouse genome database 2016

    PubMed Central

    Bult, Carol J.; Eppig, Janan T.; Blake, Judith A.; Kadin, James A.; Richardson, Joel E.

    2016-01-01

    The Mouse Genome Database (MGD; http://www.informatics.jax.org) is the primary community model organism database for the laboratory mouse and serves as the source for key biological reference data related to mouse genes, gene functions, phenotypes and disease models with a strong emphasis on the relationship of these data to human biology and disease. As the cost of genome-scale sequencing continues to decrease and new technologies for genome editing become widely adopted, the laboratory mouse is more important than ever as a model system for understanding the biological significance of human genetic variation and for advancing the basic research needed to support the emergence of genome-guided precision medicine. Recent enhancements to MGD include new graphical summaries of biological annotations for mouse genes, support for mobile access to the database, tools to support the annotation and analysis of sets of genes, and expanded support for comparative biology through the expansion of homology data. PMID:26578600

  2. The genomics of adaptation.

    PubMed

    Radwan, Jacek; Babik, Wiesław

    2012-12-22

    The amount and nature of genetic variation available to natural selection affect the rate, course and outcome of evolution. Consequently, the study of the genetic basis of adaptive evolutionary change has occupied biologists for decades, but progress has been hampered by the lack of resolution and the absence of a genome-level perspective. Technological advances in recent years should now allow us to answer many long-standing questions about the nature of adaptation. The data gathered so far are beginning to challenge some widespread views of the way in which natural selection operates at the genomic level. Papers in this Special Feature of Proceedings of the Royal Society B illustrate various aspects of the broad field of adaptation genomics. This introductory article sets up a context and, on the basis of a few selected examples, discusses how genomic data can advance our understanding of the process of adaptation. PMID:23097510

  3. Genomics and vaccine development

    Technology Transfer Automated Retrieval System (TEKTRAN)

    Genomic-based approaches are driving fundamental changes in our understanding of microbiology. Comparative analysis of microbial strain is providing new insights into pathogen evolution, virulence mechanisms, and host range specificity. Most importantly, gene discovery and genetic variations can now...

  4. Platyzoan mitochondrial genomes.

    PubMed

    Wey-Fabrizius, Alexandra R; Podsiadlowski, Lars; Herlyn, Holger; Hankeln, Thomas

    2013-11-01

    Platyzoa is a putative lophotrochozoan (spiralian) subtaxon within the protostome clade of Metazoa, comprising a range of biologically diverse, mostly small worm-shaped animals. The monophyly of Platyzoa, the relationships between the putative subgroups Platyhelminthes, Gastrotricha and Gnathifera (the latter comprising at least Gnathostomulida, "Rotifera" and Acanthocephala) as well as some aspects of the internal phylogenies of these subgroups are highly debated. Here we review how complete mitochondrial (mt) genome data contribute to these debates. We highlight special features of the mt genomes and discuss problems in mtDNA phylogenies of the clade. Mitochondrial genome data seem to be insufficient to resolve the position of the platyzoan clade within the Spiralia but can help to address internal phylogenetic questions. The present review includes a tabular survey of all published platyzoan mt genomes. PMID:23274056

  5. Mouse genome database 2016.

    PubMed

    Bult, Carol J; Eppig, Janan T; Blake, Judith A; Kadin, James A; Richardson, Joel E

    2016-01-01

    The Mouse Genome Database (MGD; http://www.informatics.jax.org) is the primary community model organism database for the laboratory mouse and serves as the source for key biological reference data related to mouse genes, gene functions, phenotypes and disease models with a strong emphasis on the relationship of these data to human biology and disease. As the cost of genome-scale sequencing continues to decrease and new technologies for genome editing become widely adopted, the laboratory mouse is more important than ever as a model system for understanding the biological significance of human genetic variation and for advancing the basic research needed to support the emergence of genome-guided precision medicine. Recent enhancements to MGD include new graphical summaries of biological annotations for mouse genes, support for mobile access to the database, tools to support the annotation and analysis of sets of genes, and expanded support for comparative biology through the expansion of homology data. PMID:26578600

  6. The rise of genomics.

    PubMed

    Weissenbach, Jean

    2016-01-01

    A brief history of the development of genomics is provided. Complete sequencing of genomes of uni- and multicellular organisms is based on important progress in sequencing and bioinformatics. Evolution of these methods is ongoing and has triggered an explosion in data production and analysis. Initial analyses focused on the inventory of genes encoding proteins. Completeness and quality of gene prediction remains crucial. Genome analyses profoundly modified our views on evolution, biodiversity and contributed to the detection of new functions, yet to be fully elucidated, such as those fulfilled by non-coding RNAs. Genomics has become the basis for the study of biology and provides the molecular support for a bunch of large-scale studies, the omics. PMID:27263360

  7. Epidemiology & Genomics Research Program

    Cancer.gov

    The Epidemiology and Genomics Research Program, in the National Cancer Institute's Division of Cancer Control and Population Sciences, funds research in human populations to understand the determinants of cancer occurrence and outcomes.

  8. Genomic definition of species

    SciTech Connect

    Crkvenjakov, R.; Drmanac, R.

    1991-07-01

    The subject of this paper is the definition of species based on the assumption that genome is the fundamental level for the origin and maintenance of biological diversity. For this view to be logically consistent it is necessary to assume the existence and operation of the new law which we call genome law. For this reason the genome law is included in the explanation of species phenomenon presented here even if its precise formulation and elaboration are left for the future. The intellectual underpinnings of this definition can be traced to Goldschmidt. We wish to explore some philosophical aspects of the definition of species in terms of the genome. The point of proposing the definition on these grounds is that any real advance in evolutionary theory has to be correct in both its philosophy and its science.

  9. Molluscan Evolutionary Genomics

    SciTech Connect

    Simison, W. Brian; Boore, Jeffrey L.

    2005-12-01

    In the last 20 years there have been dramatic advances in techniques of high-throughput DNA sequencing, most recently accelerated by the Human Genome Project, a program that has determined the three billion base pair code on which we are based. Now this tremendous capability is being directed at other genome targets that are being sampled across the broad range of life. This opens up opportunities as never before for evolutionary and organismal biologists to address questions of both processes and patterns of organismal change. We stand at the dawn of a new 'modern synthesis' period, paralleling that of the early 20th century when the fledgling field of genetics first identified the underlying basis for Darwin's theory. We must now unite the efforts of systematists, paleontologists, mathematicians, computer programmers, molecular biologists, developmental biologists, and others in the pursuit of discovering what genomics can teach us about the diversity of life. Genome-level sampling for mollusks to date has mostly been limited to mitochondrial genomes and it is likely that these will continue to provide the best targets for broad phylogenetic sampling in the near future. However, we are just beginning to see an inroad into complete nuclear genome sequencing, with several mollusks and other eutrochozoans having been selected for work about to begin. Here, we provide an overview of the state of molluscan mitochondrial genomics, highlight a few of the discoveries from this research, outline the promise of broadening this dataset, describe upcoming projects to sequence whole mollusk nuclear genomes, and challenge the community to prepare for making the best use of these data.

  10. Biobanks for Genomics and Genomics for Biobanks

    PubMed Central

    Ducournau, Pascal; Gourraud, Pierre-Antoine; Pontille, David

    2003-01-01

    Biobanks include biological samples and attached databases. Human biobanks occur in research, technological development and medical activities. Population genomics is highly dependent on the availability of large biobanks. Ethical issues must be considered: protecting the rights of those people whose samples or data are in biobanks (information, autonomy, confidentiality, protection of private life), assuring the non-commercial use of human body elements and the optimal use of samples and data. They balance other issues, such as protecting the rights of researchers and companies, allowing long-term use of biobanks while detailed information on future uses is not available. At the level of populations, the traditional form of informed consent is challenged. Other dimensions relate to the rights of a group as such, in addition to individual rights. Conditions of return of results and/or benefit to a population need to be defined. With ‘large-scale biobanking’ a marked trend in genomics, new societal dimensions appear, regarding communication, debate, regulation, societal control and valorization of such large biobanks. Exploring how genomics can help health sector biobanks to become more rationally constituted and exploited is an interesting perspective. For example, evaluating how genomic approaches can help in optimizing haematopoietic stem cell donor registries using new markers and high-throughput techniques to increase immunogenetic variability in such registries is a challenge currently being addressed. Ethical issues in such contexts are important, as not only individual decisions or projects are concerned, but also national policies in the international arena and organization of democratic debate about science, medicine and society. PMID:18629026

  11. How the genome folds

    NASA Astrophysics Data System (ADS)

    Lieberman Aiden, Erez

    2012-02-01

    I describe Hi-C, a novel technology for probing the three-dimensional architecture of whole genomes by coupling proximity-based ligation with massively parallel sequencing. Working with collaborators at the Broad Institute and UMass Medical School, we used Hi-C to construct spatial proximity maps of the human genome at a resolution of 1Mb. These maps confirm the presence of chromosome territories and the spatial proximity of small, gene-rich chromosomes. We identified an additional level of genome organization that is characterized by the spatial segregation of open and closed chromatin to form two genome-wide compartments. At the megabase scale, the chromatin conformation is consistent with a fractal globule, a knot-free conformation that enables maximally dense packing while preserving the ability to easily fold and unfold any genomic locus. The fractal globule is distinct from the more commonly used globular equilibrium model. Our results demonstrate the power of Hi-C to map the dynamic conformations of whole genomes.

  12. Human Genome Annotation

    NASA Astrophysics Data System (ADS)

    Gerstein, Mark

    A central problem for 21st century science is annotating the human genome and making this annotation useful for the interpretation of personal genomes. My talk will focus on annotating the 99% of the genome that does not code for canonical genes, concentrating on intergenic features such as structural variants (SVs), pseudogenes (protein fossils), binding sites, and novel transcribed RNAs (ncRNAs). In particular, I will describe how we identify regulatory sites and variable blocks (SVs) based on processing next-generation sequencing experiments. I will further explain how we cluster together groups of sites to create larger annotations. Next, I will discuss a comprehensive pseudogene identification pipeline, which has enabled us to identify >10K pseudogenes in the genome and analyze their distribution with respect to age, protein family, and chromosomal location. Throughout, I will try to introduce some of the computational algorithms and approaches that are required for genome annotation. Much of this work has been carried out in the framework of the ENCODE, modENCODE, and 1000 genomes projects.

  13. An archaeal genomic signature

    NASA Technical Reports Server (NTRS)

    Graham, D. E.; Overbeek, R.; Olsen, G. J.; Woese, C. R.

    2000-01-01

    Comparisons of complete genome sequences allow the most objective and comprehensive descriptions possible of a lineage's evolution. This communication uses the completed genomes from four major euryarchaeal taxa to define a genomic signature for the Euryarchaeota and, by extension, the Archaea as a whole. The signature is defined in terms of the set of protein-encoding genes found in at least two diverse members of the euryarchaeal taxa that function uniquely within the Archaea; most signature proteins have no recognizable bacterial or eukaryal homologs. By this definition, 351 clusters of signature proteins have been identified. Functions of most proteins in this signature set are currently unknown. At least 70% of the clusters that contain proteins from all the euryarchaeal genomes also have crenarchaeal homologs. This conservative set, which appears refractory to horizontal gene transfer to the Bacteria or the Eukarya, would seem to reflect the significant innovations that were unique and fundamental to the archaeal "design fabric." Genomic protein signature analysis methods may be extended to characterize the evolution of any phylogenetically defined lineage. The complete set of protein clusters for the archaeal genomic signature is presented as supplementary material (see the PNAS web site, www.pnas.org).

  14. Ebolavirus comparative genomics

    DOE PAGESBeta

    Jun, Se-Ran; Leuze, Michael R.; Nookaew, Intawat; Uberbacher, Edward C.; Land, Miriam; Zhang, Qian; Wanchai, Visanu; Chai, Juanjuan; Nielsen, Morten; Trolle, Thomas; et al

    2015-07-14

    The 2014 Ebola outbreak in West Africa is the largest documented for this virus. We examine the dynamics of this genome, comparing more than one hundred currently available ebolavirus genomes to each other and to other viral genomes. Based on oligomer frequency analysis, the family Filoviridae forms a distinct group from all other sequenced viral genomes. All filovirus genomes sequenced to date encode proteins with similar functions and gene order, although there is considerable divergence in sequences between the three genera Ebolavirus, Cuevavirus, and Marburgvirus within the family Filoviridae. Whereas all ebolavirus genomes are quite similar (multiple sequences of themore » same strain are often identical), variation is most common in the intergenic regions and within specific areas of the genes encoding the glycoprotein (GP), nucleoprotein (NP), and polymerase (L). We predict regions that could contain epitope-binding sites, which might be good vaccine targets. In conclusion, this information, combined with glycosylation sites and experimentally determined epitopes, can identify the most promising regions for the development of therapeutic strategies.« less

  15. Barley Genomics: An Overview

    PubMed Central

    Sreenivasulu, Nese; Graner, Andreas; Wobus, Ulrich

    2008-01-01

    Barley (Hordeum vulgare), first domesticated in the Near East, is a well-studied crop in terms of genetics, genomics, and breeding and qualifies as a model plant for Triticeae research. Recent advances made in barley genomics mainly include the following: (i) rapid accumulation of EST sequence data, (ii) growing number of studies on transcriptome, proteome, and metabolome, (iii) new modeling techniques, (iv) availability of genome-wide knockout collections as well as efficient transformation techniques, and (v) the recently started genome sequencing effort. These developments pave the way for a comprehensive functional analysis and understanding of gene expression networks linked to agronomically important traits. Here, we selectively review important technological developments in barley genomics and related fields and discuss the relevance for understanding genotype-phenotype relationships by using approaches such as genetical genomics and association studies. High-throughput genotyping platforms that have recently become available will allow the construction of high-density genetic maps that will further promote marker-assisted selection as well as physical map construction. Systems biology approaches will further enhance our knowledge and largely increase our abilities to design refined breeding strategies on the basis of detailed molecular physiological knowledge. PMID:18382615

  16. A Review on Genomics APIs

    PubMed Central

    Swaminathan, Rajeswari; Huang, Yungui; Moosavinasab, Soheil; Buckley, Ronald; Bartlett, Christopher W.; Lin, Simon M.

    2015-01-01

    The constant improvement and falling prices of whole human genome Next Generation Sequencing (NGS) has resulted in rapid adoption of genomic information at both clinics and research institutions. Considered together, the complexity of genomics data, due to its large volume and diversity along with the need for genomic data sharing, has resulted in the creation of Application Programming Interface (API) for secure, modular, interoperable access to genomic data from different applications, platforms, and even organizations. The Genomics APIs are a set of special protocols that assist software developers in dealing with multiple genomic data sources for building seamless, interoperable applications leading to the advancement of both genomic and clinical research. These APIs help define a standard for retrieval of genomic data from multiple sources as well as to better package genomic information for integration with Electronic Health Records. This review covers three currently available Genomics APIs: a) Google Genomics, b) SMART Genomics, and c) 23andMe. The functionalities, reference implementations (if available) and authentication protocols of each API are reviewed. A comparative analysis of the different features across the three APIs is provided in the Discussion section. Though Genomics APIs are still under active development and have yet to reach widespread adoption, they hold the promise to make building of complicated genomics applications easier with downstream constructive effects on healthcare. PMID:26702340

  17. WheatGenome.info: A Resource for Wheat Genomics Resource.

    PubMed

    Lai, Kaitao

    2016-01-01

    An integrated database with a variety of Web-based systems named WheatGenome.info hosting wheat genome and genomic data has been developed to support wheat research and crop improvement. The resource includes multiple Web-based applications, which are implemented as a variety of Web-based systems. These include a GBrowse2-based wheat genome viewer with BLAST search portal, TAGdb for searching wheat second generation genome sequence data, wheat autoSNPdb, links to wheat genetic maps using CMap and CMap3D, and a wheat genome Wiki to allow interaction between diverse wheat genome sequencing activities. This portal provides links to a variety of wheat genome resources hosted at other research organizations. This integrated database aims to accelerate wheat genome research and is freely accessible via the web interface at http://www.wheatgenome.info/ . PMID:26519407

  18. Genome-Wide Association Study for Autism Spectrum Disorder in Taiwanese Han Population

    PubMed Central

    Kuo, Po-Hsiu; Chuang, Li-Chung; Su, Mei-Hsin; Chen, Chia-Hsiang; Chen, Chien-Hsiun; Wu, Jer-Yuarn; Yen, Chung-Jen; Wu, Yu-Yu; Liu, Shih-Kai; Chou, Miao-Chun; Chou, Wen-Jiun; Chiu, Yen-Nan; Tsai, Wen-Che; Gau, Susan Shur-Fen

    2015-01-01

    Background Autism spectrum disorder (ASD) is a neurodevelopmental disorder with strong genetic components. Several recent genome-wide association (GWA) studies in Caucasian samples have reported a number of gene regions and loci correlated with the risk of ASD—albeit with very little consensus across studies. Methods A two-stage GWA study was employed to identify common genetic variants for ASD in the Taiwanese Han population. The discovery stage included 315 patients with ASD and 1,115 healthy controls, using the Affymetrix SNP array 6.0 platform for genotyping. Several gene regions were then selected for fine-mapping and top markers were examined in extended samples. Single marker, haplotype, gene-based, and pathway analyses were conducted for associations. Results Seven SNPs had p-values ranging from 3.4~9.9*10−6, but none reached the genome-wide significant level. Five of them were mapped to three known genes (OR2M4, STYK1, and MNT) with significant empirical gene-based p-values in OR2M4 (p = 3.4*10−5) and MNT (p = 0.0008). Results of the fine-mapping study showed single-marker associations in the GLIS1 (rs12082358 and rs12080993) and NAALADL2 (rs3914502 and rs2222447) genes, and gene-based associations for the OR2M3-OR2T5 (olfactory receptor genes, p = 0.02), and GLIPR1/KRR1 gene regions (p = 0.015). Pathway analyses revealed important pathways for ASD, such as olfactory and G protein–coupled receptors signaling pathways. Conclusions We reported Taiwanese Han specific susceptibility genes and variants for ASD. However, further replication in other Asian populations is warranted to validate our findings. Investigation in the biological functions of our reported genetic variants might also allow for better understanding on the underlying pathogenesis of autism. PMID:26398136

  19. Genome-wide SNP analysis of the Systemic Capillary Leak Syndrome (Clarkson disease)

    PubMed Central

    Xie, Zhihui; Nagarajan, Vijayaraj; Sturdevant, Daniel E; Iwaki, Shoko; Chan, Eunice; Wisch, Laura; Young, Michael; Nelson, Celeste M; Porcella, Stephen F; Druey, Kirk M

    2013-01-01

    The Systemic Capillary Leak Syndrome (SCLS) is an extremely rare, orphan disease that resembles, and is frequently erroneously diagnosed as, systemic anaphylaxis. The disorder is characterized by repeated, transient, and seemingly unprovoked episodes of hypotensive shock and peripheral edema due to transient endothelial hyperpermeability. SCLS is often accompanied by a monoclonal gammopathy of unknown significance (MGUS). Using Affymetrix Single Nucleotide Polymorphism (SNP) microarrays, we performed the first genome-wide SNP analysis of SCLS in a cohort of 12 disease subjects and 18 controls. Exome capture sequencing was performed on genomic DNA from nine of these patients as validation for the SNP-chip discoveries and de novo data generation. We identified candidate susceptibility loci for SCLS, which included a region flanking CAV3 (3p25.3) as well as SNP clusters in PON1 (7q21.3), PSORS1C1 (6p21.3), and CHCHD3 (7q33). Among the most highly ranked discoveries were gene-associated SNPs in the uncharacterized LOC100130480 gene (rs6417039, rs2004296). Top case-associated SNPs were observed in BTRC (rs12355803, 3rs4436485), ARHGEF18 (rs11668246), CDH13 (rs4782779), and EDG2 (rs12552348), which encode proteins with known or suspected roles in B cell function and/or vascular integrity. 61 SNPs that were significantly associated with SCLS by microarray analysis were also detected and validated by exome deep sequencing. Functional annotation of highly ranked SNPs revealed enrichment of cell projections, cell junctions and adhesion, and molecules containing pleckstrin homology, Ras/Rho regulatory, and immunoglobulin Ig-like C2/fibronectin type III domains, all of which involve mechanistic functions that correlate with the SCLS phenotype. These results highlight SNPs with potential relevance to SCLS. PMID:24808988

  20. GenomeVista

    SciTech Connect

    Poliakov, Alexander; Couronne, Olivier

    2002-11-04

    Aligning large vertebrate genomes that are structurally complex poses a variety of problems not encountered on smaller scales. Such genomes are rich in repetitive elements and contain multiple segmental duplications, which increases the difficulty of identifying true orthologous SNA segments in alignments. The sizes of the sequences make many alignment algorithms designed for comparing single proteins extremely inefficient when processing large genomic intervals. We integrated both local and global alignment tools and developed a suite of programs for automatically aligning large vertebrate genomes and identifying conserved non-coding regions in the alignments. Our method uses the BLAT local alignment program to find anchors on the base genome to identify regions of possible homology for a query sequence. These regions are postprocessed to find the best candidates which are then globally aligned using the AVID global alignment program. In the last step conserved non-coding segments are identified using VISTA. Our methods are fast and the resulting alignments exhibit a high degree of sensitivity, covering more than 90% of known coding exons in the human genome. The GenomeVISTA software is a suite of Perl programs that is built on a MySQL database platform. The scheduler gets control data from the database, builds a queve of jobs, and dispatches them to a PC cluster for execution. The main program, running on each node of the cluster, processes individual sequences. A Perl library acts as an interface between the database and the above programs. The use of a separate library allows the programs to function independently of the database schema. The library also improves on the standard Perl MySQL database interfere package by providing auto-reconnect functionality and improved error handling.

  1. GenomeVista

    2002-11-04

    Aligning large vertebrate genomes that are structurally complex poses a variety of problems not encountered on smaller scales. Such genomes are rich in repetitive elements and contain multiple segmental duplications, which increases the difficulty of identifying true orthologous SNA segments in alignments. The sizes of the sequences make many alignment algorithms designed for comparing single proteins extremely inefficient when processing large genomic intervals. We integrated both local and global alignment tools and developed a suitemore » of programs for automatically aligning large vertebrate genomes and identifying conserved non-coding regions in the alignments. Our method uses the BLAT local alignment program to find anchors on the base genome to identify regions of possible homology for a query sequence. These regions are postprocessed to find the best candidates which are then globally aligned using the AVID global alignment program. In the last step conserved non-coding segments are identified using VISTA. Our methods are fast and the resulting alignments exhibit a high degree of sensitivity, covering more than 90% of known coding exons in the human genome. The GenomeVISTA software is a suite of Perl programs that is built on a MySQL database platform. The scheduler gets control data from the database, builds a queve of jobs, and dispatches them to a PC cluster for execution. The main program, running on each node of the cluster, processes individual sequences. A Perl library acts as an interface between the database and the above programs. The use of a separate library allows the programs to function independently of the database schema. The library also improves on the standard Perl MySQL database interfere package by providing auto-reconnect functionality and improved error handling.« less

  2. Genomes to Proteomes

    SciTech Connect

    Panisko, Ellen A.; Grigoriev, Igor; Daly, Don S.; Webb-Robertson, Bobbie-Jo; Baker, Scott E.

    2009-03-01

    Biologists are awash with genomic sequence data. In large part, this is due to the rapid acceleration in the generation of DNA sequence that occurred as public and private research institutes raced to sequence the human genome. In parallel with the large human genome effort, mostly smaller genomes of other important model organisms were sequenced. Projects following on these initial efforts have made use of technological advances and the DNA sequencing infrastructure that was built for the human and other organism genome projects. As a result, the genome sequences of many organisms are available in high quality draft form. While in many ways this is good news, there are limitations to the biological insights that can be gleaned from DNA sequences alone; genome sequences offer only a bird's eye view of the biological processes endemic to an organism or community. Fortunately, the genome sequences now being produced at such a high rate can serve as the foundation for other global experimental platforms such as proteomics. Proteomic methods offer a snapshot of the proteins present at a point in time for a given biological sample. Current global proteomics methods combine enzymatic digestion, separations, mass spectrometry and database searching for peptide identification. One key aspect of proteomics is the prediction of peptide sequences from mass spectrometry data. Global proteomic analysis uses computational matching of experimental mass spectra with predicted spectra based on databases of gene models that are often generated computationally. Thus, the quality of gene models predicted from a genome sequence is crucial in the generation of high quality peptide identifications. Once peptides are identified they can be assigned to their parent protein. Proteins identified as expressed in a given experiment are most useful when compared to other expressed proteins in a larger biological context or biochemical pathway. In this chapter we will discuss the automatic

  3. Genome position specific priors for genomic prediction

    PubMed Central

    2012-01-01

    Background The accuracy of genomic prediction is highly dependent on the size of the reference population. For small populations, including information from other populations could improve this accuracy. The usual strategy is to pool data from different populations; however, this has not proven as successful as hoped for with distantly related breeds. BayesRS is a novel approach to share information across populations for genomic predictions. The approach allows information to be captured even where the phase of SNP alleles and casuative mutation alleles are reversed across populations, or the actual casuative mutation is different between the populations but affects the same gene. Proportions of a four-distribution mixture for SNP effects in segments of fixed size along the genome are derived from one population and set as location specific prior proportions of distributions of SNP effects for the target population. The model was tested using dairy cattle populations of different breeds: 540 Australian Jersey bulls, 2297 Australian Holstein bulls and 5214 Nordic Holstein bulls. The traits studied were protein-, fat- and milk yield. Genotypic data was Illumina 777K SNPs, real or imputed. Results Results showed an increase in accuracy of up to 3.5% for the Jersey population when using BayesRS with a prior derived from Australian Holstein compared to a model without location specific priors. The increase in accuracy was however lower than was achieved when reference populations were combined to estimate SNP effects, except in the case of fat yield. The small size of the Jersey validation set meant that these improvements in accuracy were not significant using a Hotelling-Williams t-test at the 5% level. An increase in accuracy of 1-2% for all traits was observed in the Australian Holstein population when using a prior derived from the Nordic Holstein population compared to using no prior information. These improvements were significant (P<0.05) using the Hotelling

  4. Genome-Wide Association Study to Identify Single Nucleotide Polymorphisms (SNPs) Associated With the Development of Erectile Dysfunction in African-American Men After Radiotherapy for Prostate Cancer

    SciTech Connect

    Kerns, Sarah L.; Ostrer, Harry; Stock, Richard; Li, William; Pearlman, Alexander; Campbell, Christopher; Shao Yongzhao; Stone, Nelson; Kusnetz, Lynda; Rosenstein, Barry S.

    2010-12-01

    Purpose: To identify single nucleotide polymorphisms (SNPs) associated with erectile dysfunction (ED) among African-American prostate cancer patients treated with external beam radiation therapy. Methods and Materials: A cohort of African-American prostate cancer patients treated with external beam radiation therapy was observed for the development of ED by use of the five-item Sexual Health Inventory for Men (SHIM) questionnaire. Final analysis included 27 cases (post-treatment SHIM score {<=}7) and 52 control subjects (post-treatment SHIM score {>=}16). A genome-wide association study was performed using approximately 909,000 SNPs genotyped on Affymetrix 6.0 arrays (Affymetrix, Santa Clara, CA). Results: We identified SNP rs2268363, located in the follicle-stimulating hormone receptor (FSHR) gene, as significantly associated with ED after correcting for multiple comparisons (unadjusted p = 5.46 x 10{sup -8}, Bonferroni p = 0.028). We identified four additional SNPs that tended toward a significant association with an unadjusted p value < 10{sup -6}. Inference of population substructure showed that cases had a higher proportion of African ancestry than control subjects (77% vs. 60%, p = 0.005). A multivariate logistic regression model that incorporated estimated ancestry and four of the top-ranked SNPs was a more accurate classifier of ED than a model that included only clinical variables. Conclusions: To our knowledge, this is the first genome-wide association study to identify SNPs associated with adverse effects resulting from radiotherapy. It is important to note that the SNP that proved to be significantly associated with ED is located within a gene whose encoded product plays a role in male gonad development and function. Another key finding of this project is that the four SNPs most strongly associated with ED were specific to persons of African ancestry and would therefore not have been identified had a cohort of European ancestry been screened. This study

  5. Berkeley Quantitative Genome Browser

    SciTech Connect

    Hechmer, Aaron

    2008-02-29

    The Berkeley Quantitative Genome Browser provides graphical browsing functionality for genomic data organized, at a minimum, by sequence and position. While supporting the annotation browsing features typical of many other genomic browsers, additional emphasis is placed on viewing and utilizing quantitative data. Data may be read from GFF, SGR, FASTA or any column delimited format. Once the data has been read into the browser's buffer, it may be searched. filtered or subjected to mathematical transformation. The browser also supplies some graphical design manipulation functionality geared towards preparing figures for presentations or publication. A plug-in mechanism enables development outside the core functionality that adds more advanced or esoteric analysis capabilities. BBrowse's development and distribution is open-source and has been built to run on Linux, OSX and MS Windows operating systems.

  6. Genomics, health, and society.

    PubMed

    Chan, Chee Khoon

    2002-01-01

    On June 27, 2001, the World Health Organization conducted hearings in Geneva for a Special Report on Genomics & Health. Initially intended as a document to address the ethical, legal, and social implications of the gathering genomics resolution (ELSI), the terms of reference of the report were significantly modified to give primary emphasis to a scientific and technological assessment of the implications of genomics for human health. The Citizens' Health Initiative, one of two NGOs invited to make submissions at these consultations, suggested that no less important than the scientific and technical assessment was a perspective which gave due attention to the social context and political economy of scientific/technological development and its deployment. The article below touches upon neglected health priorities of poor countries, intellectual property rights and patents, risk management, insurance and discrimination, and predictive (prenatal) testing, reproductive choice, and eugenics. PMID:17208760

  7. Berkeley Quantitative Genome Browser

    2008-02-29

    The Berkeley Quantitative Genome Browser provides graphical browsing functionality for genomic data organized, at a minimum, by sequence and position. While supporting the annotation browsing features typical of many other genomic browsers, additional emphasis is placed on viewing and utilizing quantitative data. Data may be read from GFF, SGR, FASTA or any column delimited format. Once the data has been read into the browser's buffer, it may be searched. filtered or subjected to mathematical transformation.more » The browser also supplies some graphical design manipulation functionality geared towards preparing figures for presentations or publication. A plug-in mechanism enables development outside the core functionality that adds more advanced or esoteric analysis capabilities. BBrowse's development and distribution is open-source and has been built to run on Linux, OSX and MS Windows operating systems.« less

  8. Genomics for Weed Science

    PubMed Central

    Horvath, David

    2010-01-01

    Numerous genomic-based studies have provided insight to the physiological and evolutionary processes involved in developmental and environmental processes of model plants such as arabidopsis and rice. However, far fewer efforts have been attempted to use genomic resources to study physiological and evolutionary processes of weedy plants. Genomics-based tools such as extensive EST databases and microarrays have been developed for a limited number of weedy species, although application of information and resources developed for model plants and crops are possible and have been exploited. These tools have just begun to provide insights into the response of these weeds to herbivore and pathogen attack, survival of extreme environmental conditions, and interaction with crops. The potential of these tools to illuminate mechanisms controlling the traits that allow weeds to invade novel habitats, survive extreme environments, and that make weeds difficult to eradicate have potential for both improving crops and developing novel methods to control weeds. PMID:20808523

  9. SINGLE CELL GENOME SEQUENCING

    PubMed Central

    Yilmaz, Suzan; Singh, Anup K.

    2011-01-01

    Whole genome amplification and next-generation sequencing of single cells has become a powerful approach for studying uncultivated microorganisms that represent 90–99 % of all environmental microbes. Single cell sequencing enables not only the identification of microbes but also linking of functions to species, a feat not achievable by metagenomic techniques. Moreover, it allows the analysis of low abundance species that may be missed in community-based analyses. It has also proved very useful in complementing metagenomics in the assembly and binning of single genomes. With the advent of drastically cheaper and higher throughput sequencing technologies, it is expected that single cell sequencing will become a standard tool in studying the genome and transcriptome of microbial communities. PMID:22154471

  10. Genomic Southern blot analysis.

    PubMed

    Gebbie, Leigh

    2014-01-01

    This chapter describes a detailed protocol for genomic Southern blot analysis which can be used to detect transgene or endogenous gene sequences in cereal genomes. The protocol follows a standard approach that has been shown to generate high-quality results: size fractionation of genomic DNA; capillary transfer to a nylon membrane; hybridization with a digoxigenin-labelled probe; and detection using a chemiluminescent-based system. High sensitivity and limited background are key to successful Southern blots. The critical steps in this protocol are complete digestion of the right quantity of DNA, careful handling of the membrane to avoid unnecessary background, and optimization of probe concentration and temperatures during the hybridization step. Detailed instructions on how to successfully master these techniques are provided. PMID:24243203

  11. Genomics of Volvocine Algae

    PubMed Central

    Umen, James G.; Olson, Bradley J.S.C.

    2015-01-01

    Volvocine algae are a group of chlorophytes that together comprise a unique model for evolutionary and developmental biology. The species Chlamydomonas reinhardtii and Volvox carteri represent extremes in morphological diversity within the Volvocine clade. Chlamydomonas is unicellular and reflects the ancestral state of the group, while Volvox is multicellular and has evolved numerous innovations including germ-soma differentiation, sexual dimorphism, and complex morphogenetic patterning. The Chlamydomonas genome sequence has shed light on several areas of eukaryotic cell biology, metabolism and evolution, while the Volvox genome sequence has enabled a comparison with Chlamydomonas that reveals some of the underlying changes that enabled its transition to multicellularity, but also underscores the subtlety of this transition. Many of the tools and resources are in place to further develop Volvocine algae as a model for evolutionary genomics. PMID:25883411

  12. Genomic medicine and neurology.

    PubMed

    Vance, Jeffery M; Tekin, Demet

    2011-04-01

    The application of genetics to the understanding of neurology has been highly successful over the past several decades. During the past 10 years, tools were developed to begin genetic investigations into more common disorders such as Alzheimer disease, multiple sclerosis, autism, and Parkinson disease. The era of genomic medicine now has begun and will have an increasing effect on the daily care of common neurologic diseases. Thus it is important for neurologists to have a basic understanding of genomic medicine and how it differs from the traditional clinical genetics of the past. This article provides some basic information about genomic medicine and pharmacogenetics in neurology to help neurologists to begin to adopt these principles into their practice. PMID:22810818

  13. Genomic Imprinting in Mammals

    PubMed Central

    Barlow, Denise P.

    2014-01-01

    Genomic imprinting affects a subset of genes in mammals and results in a monoallelic, parental-specific expression pattern. Most of these genes are located in clusters that are regulated through the use of insulators or long noncoding RNAs (lncRNAs). To distinguish the parental alleles, imprinted genes are epigenetically marked in gametes at imprinting control elements through the use of DNA methylation at the very least. Imprinted gene expression is subsequently conferred through lncRNAs, histone modifications, insulators, and higher-order chromatin structure. Such imprints are maintained after fertilization through these mechanisms despite extensive reprogramming of the mammalian genome. Genomic imprinting is an excellent model for understanding mammalian epigenetic regulation. PMID:24492710

  14. Resequencing rice genomes: an emerging new era of rice genomics.

    PubMed

    Huang, Xuehui; Lu, Tingting; Han, Bin

    2013-04-01

    Rice is a model system for crop genomics studies. Much of the early work on rice genomics focused on analyzing genome-wide genetic variation to further understand rice gene functions in agronomic traits and to generate data and resources for rice research. The advent of next-generation high-throughput DNA sequencing technologies and the completion of high-quality reference genome sequences have enabled the development of sequencing-based genotyping and genome-wide association studies (GWAS) that have significantly advanced rice genetics research. This has led to the emergence of a new era of rice genomics aimed at bridging the knowledge gap between genotype and phenotype in rice. These technologies have also led to pyramid breeding through genomics-assisted selection, which will be useful in breeding elite varieties suitable for sustainable agriculture. Here, we review the recent advances in rice genomics and discuss the future of this line of research. PMID:23295340

  15. Brief Guide to Genomics: DNA, Genes and Genomes

    MedlinePlus

    ... guía de genómica A Brief Guide to Genomics DNA, Genes and Genomes Deoxyribonucleic acid (DNA) is the ... and lead to a disease such as cancer. DNA Sequencing Sequencing simply means determining the exact order ...

  16. Haemonchus contortus: Genome Structure, Organization and Comparative Genomics.

    PubMed

    Laing, R; Martinelli, A; Tracey, A; Holroyd, N; Gilleard, J S; Cotton, J A

    2016-01-01

    One of the first genome sequencing projects for a parasitic nematode was that for Haemonchus contortus. The open access data from the Wellcome Trust Sanger Institute provided a valuable early resource for the research community, particularly for the identification of specific genes and genetic markers. Later, a second sequencing project was initiated by the University of Melbourne, and the two draft genome sequences for H. contortus were published back-to-back in 2013. There is a pressing need for long-range genomic information for genetic mapping, population genetics and functional genomic studies, so we are continuing to improve the Wellcome Trust Sanger Institute assembly to provide a finished reference genome for H. contortus. This review describes this process, compares the H. contortus genome assemblies with draft genomes from other members of the strongylid group and discusses future directions for parasite genomics using the H. contortus model. PMID:27238013

  17. A Genome-Wide Association Study Uncovers a Genetic Locus Associated with Thoracic-to-Hip Ratio in Koreans

    PubMed Central

    Cha, Seongwon; Park, Ah Yeon; Kang, Changsoo

    2015-01-01

    The thoracic-to-hip circumference ratio (THR) is an anthropometric marker recently described as a predictor of type 2 diabetes. In this study, we performed a genome-wide association study (GWAS) followed by confirmatory analyses to identify genetic markers associated with THR. A total of 7,240 Korean subjects (4,988 for the discovery stage and 2,252 for the confirmatory analyses) were recruited for this study, and genome-wide single nucleotide polymorphism (SNP) genotyping of the initial 4,988 individuals was performed using Affymetrix Human SNP array 5.0. Linear regression analysis was then performed to adjust for the effects of age, sex, and current diabetes medication status on the THR of the study subjects. In the initial discovery stage, there was a statistically nominal association between minor alleles of SNP markers on chromosomes 4, 8, 10, and 12, and THR changes (p < 5.0 × 10−6). The subsequent confirmatory analyses of these markers, however, only detected a significant association between two SNPs in the HECTD4 gene and decreased THRs. Notably, this association was detected in male (rs11066280: p = 1.14 × 10−2; rs2074356: p = 1.10 × 10−2), but not in female subjects. Meanwhile, the combined results from the two analyses (initial and confirmatory) indicated that minor alleles of these two intronic variants exhibited a significant genome-wide association with decreased THR in the male subjects (n = 3,155; rs11066280: effect size = −0.008624, p = 6.19 × 10−9; rs2074356: effect size = −0.008762, p = 1.89 × 10−8). Furthermore, minor alleles of these two SNPs exhibited protective effects on patients’ risks for developing type 2 diabetes. In conclusion, we have identified two genetic variations in HECTD4 that are associated with THR, particularly in men. PMID:26675016

  18. Genomic occupancy of Runx2 with global expression profiling identifies a novel dimension to control of osteoblastogenesis

    PubMed Central

    2014-01-01

    Background Osteogenesis is a highly regulated developmental process and continues during the turnover and repair of mature bone. Runx2, the master regulator of osteoblastogenesis, directs a transcriptional program essential for bone formation through genetic and epigenetic mechanisms. While individual Runx2 gene targets have been identified, further insights into the broad spectrum of Runx2 functions required for osteogenesis are needed. Results By performing genome-wide characterization of Runx2 binding at the three major stages of osteoblast differentiation - proliferation, matrix deposition and mineralization - we identify Runx2-dependent regulatory networks driving bone formation. Using chromatin immunoprecipitation followed by high-throughput sequencing over the course of these stages, we identify approximately 80,000 significantly enriched regions of Runx2 binding throughout the mouse genome. These binding events exhibit distinct patterns during osteogenesis, and are associated with proximal promoters and also non-promoter regions: upstream, introns, exons, transcription termination site regions, and intergenic regions. These peaks were partitioned into clusters that are associated with genes in complex biological processes that support bone formation. Using Affymetrix expression profiling of differentiating osteoblasts depleted of Runx2, we identify novel Runx2 targets including Ezh2, a critical epigenetic regulator; Crabp2, a retinoic acid signaling component; Adamts4 and Tnfrsf19, two remodelers of the extracellular matrix. We demonstrate by luciferase assays that these novel biological targets are regulated by Runx2 occupancy at non-promoter regions. Conclusions Our data establish that Runx2 interactions with chromatin across the genome reveal novel genes, pathways and transcriptional mechanisms that contribute to the regulation of osteoblastogenesis. PMID:24655370

  19. Ebolavirus comparative genomics

    PubMed Central

    Jun, Se-Ran; Leuze, Michael R.; Nookaew, Intawat; Uberbacher, Edward C.; Land, Miriam; Zhang, Qian; Wanchai, Visanu; Chai, Juanjuan; Nielsen, Morten; Trolle, Thomas; Lund, Ole; Buzard, Gregory S.; Pedersen, Thomas D.; Wassenaar, Trudy M.; Ussery, David W.

    2015-01-01

    The 2014 Ebola outbreak in West Africa is the largest documented for this virus. To examine the dynamics of this genome, we compare more than 100 currently available ebolavirus genomes to each other and to other viral genomes. Based on oligomer frequency analysis, the family Filoviridae forms a distinct group from all other sequenced viral genomes. All filovirus genomes sequenced to date encode proteins with similar functions and gene order, although there is considerable divergence in sequences between the three genera Ebolavirus, Cuevavirus and Marburgvirus within the family Filoviridae. Whereas all ebolavirus genomes are quite similar (multiple sequences of the same strain are often identical), variation is most common in the intergenic regions and within specific areas of the genes encoding the glycoprotein (GP), nucleoprotein (NP) and polymerase (L). We predict regions that could contain epitope-binding sites, which might be good vaccine targets. This information, combined with glycosylation sites and experimentally determined epitopes, can identify the most promising regions for the development of therapeutic strategies. This manuscript has been authored by UT-Battelle, LLC under Contract No. DE-AC05-00OR22725 with the U.S. Department of Energy. The United States Government retains and the publisher, by accepting the article for publication, acknowledges that the United States Government retains a non-exclusive, paid-up, irrevocable, world-wide license to publish or reproduce the published form of this manuscript, or allow others to do so, for United States Government purposes. The Department of Energy will provide public access to these results of federally sponsored research in accordance with the DOE Public Access Plan (http://energy.gov/downloads/doe-public-access-plan). PMID:26175035

  20. Ebolavirus comparative genomics.

    PubMed

    Jun, Se-Ran; Leuze, Michael R; Nookaew, Intawat; Uberbacher, Edward C; Land, Miriam; Zhang, Qian; Wanchai, Visanu; Chai, Juanjuan; Nielsen, Morten; Trolle, Thomas; Lund, Ole; Buzard, Gregory S; Pedersen, Thomas D; Wassenaar, Trudy M; Ussery, David W

    2015-09-01

    The 2014 Ebola outbreak in West Africa is the largest documented for this virus. To examine the dynamics of this genome, we compare more than 100 currently available ebolavirus genomes to each other and to other viral genomes. Based on oligomer frequency analysis, the family Filoviridae forms a distinct group from all other sequenced viral genomes. All filovirus genomes sequenced to date encode proteins with similar functions and gene order, although there is considerable divergence in sequences between the three genera Ebolavirus, Cuevavirus and Marburgvirus within the family Filoviridae. Whereas all ebolavirus genomes are quite similar (multiple sequences of the same strain are often identical), variation is most common in the intergenic regions and within specific areas of the genes encoding the glycoprotein (GP), nucleoprotein (NP) and polymerase (L). We predict regions that could contain epitope-binding sites, which might be good vaccine targets. This information, combined with glycosylation sites and experimentally determined epitopes, can identify the most promising regions for the development of therapeutic strategies.This manuscript has been authored by UT-Battelle, LLC under Contract No. DE-AC05-00OR22725 with the U.S. Department of Energy. The United States Government retains and the publisher, by accepting the article for publication, acknowledges that the United States Government retains a non-exclusive, paid-up, irrevocable, world-wide license to publish or reproduce the published form of this manuscript, or allow others to do so, for United States Government purposes. The Department of Energy will provide public access to these results of federally sponsored research in accordance with the DOE Public Access Plan (http://energy.gov/downloads/doe-public-access-plan). PMID:26175035

  1. Genome Size and Species Diversification

    PubMed Central

    2010-01-01

    Theoretically, there are reasons to believe that large genome size should favour speciation. Several major factors contributing to genome size, such as duplications and transposable element activity have been proposed to facilitate the formation of new species. However, it is also possible that small genome size promotes speciation. For example, selection for genome reduction may be resolved in different ways in incipient species, leading to incompatibilities. Mutations and chromosomal rearrangements may also be more stably inherited in smaller genomes. Here I review the following lines of empirical evidence bearing on this question: (i) Correlations between genome size and species richness of taxa are often negative. (ii) Fossil evidence in lungfish shows that the accumulation of DNA in the genomes of this group coincided with a reduction in species diversity. (iii) Estimates of speciation interval in mammals correlate positively with genome size. (iv) Genome reductions are inferred at the base of particular species radiations and genome expansions at the base of others. (v) Insect clades that have been increasing in diversity up to the present have smaller genomes than clades that have remained stable or have decreased in diversity. The general pattern emerging from these observations is that higher diversification rates are generally found in small-genome taxa. Since diversification rates are the net effect of speciation and extinction, large genomes may thus either constrain speciation rate, increase extinction rate, or both. I argue that some of the cited examples are unlikely to be explained by extinction alone. PMID:22140283

  2. The cancer genome

    PubMed Central

    Stratton, Michael R.; Campbell, Peter J.; Futreal, P. Andrew

    2010-01-01

    All cancers arise as a result of changes that have occurred in the DNA sequence of the genomes of cancer cells. Over the past quarter of a century much has been learnt about these mutations and the abnormal genes that operate in human cancers. We are now, however, moving into an era in which it will be possible to obtain the complete DNA sequence of large numbers of cancer genomes. These studies will provide us with a detailed and comprehensive perspective on how individual cancers have developed. PMID:19360079

  3. Methanococcus jannaschii genome: revisited

    NASA Technical Reports Server (NTRS)

    Kyrpides, N. C.; Olsen, G. J.; Klenk, H. P.; White, O.; Woese, C. R.

    1996-01-01

    Analysis of genomic sequences is necessarily an ongoing process. Initial gene assignments tend (wisely) to be on the conservative side (Venter, 1996). The analysis of the genome then grows in an iterative fashion as additional data and more sophisticated algorithms are brought to bear on the data. The present report is an emendation of the original gene list of Methanococcus jannaschii (Bult et al., 1996). By using a somewhat more updated database and more relaxed (and operator-intensive) pattern matching methods, we were able to add significantly to, and in a few cases amend, the gene identification table originally published by Bult et al. (1996).

  4. Genomic standards consortium projects.

    PubMed

    Field, Dawn; Sterk, Peter; Kottmann, Renzo; De Smet, J Wim; Amaral-Zettler, Linda; Cochrane, Guy; Cole, James R; Davies, Neil; Dawyndt, Peter; Garrity, George M; Gilbert, Jack A; Glöckner, Frank Oliver; Hirschman, Lynette; Klenk, Hans-Peter; Knight, Rob; Kyrpides, Nikos; Meyer, Folker; Karsch-Mizrachi, Ilene; Morrison, Norman; Robbins, Robert; San Gil, Inigo; Sansone, Susanna; Schriml, Lynn; Tatusova, Tatiana; Ussery, Dave; Yilmaz, Pelin; White, Owen; Wooley, John; Caporaso, Gregory

    2014-06-15

    The Genomic Standards Consortium (GSC) is an open-membership community that was founded in 2005 to work towards the development, implementation and harmonization of standards in the field of genomics. Starting with the defined task of establishing a minimal set of descriptions the GSC has evolved into an active standards-setting body that currently has 18 ongoing projects, with additional projects regularly proposed from within and outside the GSC. Here we describe our recently enacted policy for proposing new activities that are intended to be taken on by the GSC, along with the template for proposing such new activities. PMID:25197446

  5. The Brachypodium genome sequence: a resource for oat genomics research

    Technology Transfer Automated Retrieval System (TEKTRAN)

    Oat (Avena sativa) is an important cereal crop used as both an animal feed and for human consumption. Genetic and genomic research on oat is hindered because it is hexaploid and possesses a large (13 Gb) genome. Diploid Avena relatives have been employed for genetic and genomic studies, but only mod...

  6. Tick Genomics: The Ixodes genome project and beyond

    Technology Transfer Automated Retrieval System (TEKTRAN)

    Ticks and mites (subphylum Chelicerata; subclass Acari) are important pests of animals and plants worldwide. The Ixodes scapularis (black-legged tick) genome sequencing project marks the beginning of the genomics era for the field of acarology. This project is the first to sequence the genome of a...

  7. Multiplexed Fragaria Chloroplast Genome Sequencing

    Technology Transfer Automated Retrieval System (TEKTRAN)

    A method to sequence multiple chloroplast genomes that uses the sequencing depth of ultra high throughput sequencing technologies was recently described. Sequencing complete chloroplast genomes can resolve phylogenetic relationships at low taxonomic levels and identify point mutations and indels tha...

  8. The diversity of fungal genome.

    PubMed

    Mohanta, Tapan Kumar; Bae, Hanhong

    2015-01-01

    The genome size of an organism varies from species to species. The C-value paradox enigma is a very complex puzzle with regards to vast diversity in genome sizes in eukaryotes. Here we reported the detailed genomic information of 172 fungal species among different fungal genomes and found that fungal genomes are very diverse in nature. In fungi, the diversity of genomes varies from 8.97 Mb to 177.57 Mb. The average genome sizes of Ascomycota and Basidiomycota fungi are 36.91 and 46.48 Mb respectively. But higher genome size is observed in Oomycota (74.85 Mb) species, a lineage of fungus-like eukaryotic microorganisms. The average coding genes of Oomycota species are almost doubled than that of Acomycota and Basidiomycota fungus. PMID:25866485

  9. Company profile: Complete Genomics Inc.

    PubMed

    Reid, Clifford

    2011-02-01

    Complete Genomics Inc. is a life sciences company that focuses on complete human genome sequencing. It is taking a completely different approach to DNA sequencing than other companies in the industry. Rather than building a general-purpose platform for sequencing all organisms and all applications, it has focused on a single application - complete human genome sequencing. The company's Complete Genomics Analysis Platform (CGA™ Platform) comprises an integrated package of biochemistry, instrumentation and software that sequences human genomes at the highest quality, lowest cost and largest scale available. Complete Genomics offers a turnkey service that enables customers to outsource their human genome sequencing to the company's genome sequencing center in Mountain View, CA, USA. Customers send in their DNA samples, the company does all the library preparation, DNA sequencing, assembly and variant analysis, and customers receive research-ready data that they can use for biological discovery. PMID:21345140

  10. On genomics, kin, and privacy

    PubMed Central

    Telenti, Amalio; Ayday, Erman; Hubaux, Jean Pierre

    2014-01-01

    The storage of greater numbers of exomes or genomes raises the question of loss of privacy for the individual and for families if genomic data are not properly protected. Access to genome data may result from a personal decision to disclose, or from gaps in protection. In either case, revealing genome data has consequences beyond the individual, as it compromises the privacy of family members. Increasing availability of genome data linked or linkable to metadata through online social networks and services adds one additional layer of complexity to the protection of genome privacy.  The field of computer science and information technology offers solutions to secure genomic data so that individuals, medical personnel or researchers can access only the subset of genomic information required for healthcare or dedicated studies. PMID:25254097

  11. National Human Genome Research Institute

    MedlinePlus

    ... for Patient Care Education All About the Human Genome Project Fact Sheets Genetic Education Resources for Teachers ... Education Kit Online Genetics Education Resources Smithsonian NHGRI Genome Exhibition Talking Glossary: English Talking Glossary: Español Issues ...

  12. Snat: a SNP annotation tool for bovine by integrating various sources of genomic information

    PubMed Central

    2011-01-01

    Background Most recently, with maturing of bovine genome sequencing and high throughput SNP genotyping technologies, a large number of significant SNPs associated with economic important traits can be identified by genome-wide association studies (GWAS). To further determine true association findings in GWAS, the common strategy is to sift out most promising SNPs for follow-up replication studies. Hence it is crucial to explore the functional significance of the candidate SNPs in order to screen and select the potential functional ones. To systematically prioritize these statistically significant SNPs and facilitate follow-up replication studies, we developed a bovine SNP annotation tool (Snat) based on a web interface. Results With Snat, various sources of genomic information are integrated and retrieved from several leading online databases, including SNP information from dbSNP, gene information from Entrez Gene, protein features from UniProt, linkage information from AnimalQTLdb, conserved elements from UCSC Genome Browser Database and gene functions from Gene Ontology (GO), KEGG PATHWAY and Online Mendelian Inheritance in Animals (OMIA). Snat provides two different applications, including a CGI-based web utility and a command-line version, to access the integrated database, target any single nucleotide loci of interest and perform multi-level functional annotations. For further validation of the practical significance of our study, SNPs involved in two commercial bovine SNP chips, i.e., the Affymetrix Bovine 10K chip array and the Illumina 50K chip array, have been annotated by Snat, and the corresponding outputs can be directly downloaded from Snat website. Furthermore, a real dataset involving 20 identified SNPs associated with milk yield in our recent GWAS was employed to demonstrate the practical significance of Snat. Conclusions To our best knowledge, Snat is one of first tools focusing on SNP annotation for livestock. Snat confers researchers with a

  13. Computational method for estimating DNA copy numbers in normal samples, cancer cell lines, and solid tumors using array comparative genomic hybridization.

    PubMed

    Abkevich, Victor; Iliev, Diana; Timms, Kirsten M; Tran, Thanh; Skolnick, Mark; Lanchbury, Jerry S; Gutin, Alexander

    2010-01-01

    Genomic copy number variations are a typical feature of cancer. These variations may influence cancer outcomes as well as effectiveness of treatment. There are many computational methods developed to detect regions with deletions and amplifications without estimating actual copy numbers (CN) in these regions. We have developed a computational method capable of detecting regions with deletions and amplifications as well as estimating actual copy numbers in these regions. The method is based on determining how signal intensity from different probes is related to CN, taking into account changes in the total genome size, and incorporating into analysis contamination of the solid tumors with benign tissue. Hidden Markov Model is used to obtain the most likely CN solution. The method has been implemented for Affymetrix 500K GeneChip arrays and Agilent 244K oligonucleotide arrays. The results of CN analysis for normal cell lines, cancer cell lines, and tumor samples are presented. The method is capable of detecting copy number alterations in tumor samples with up to 80% contamination with benign tissue. Analysis of 178 cancer cell lines reveals multiple regions of common homozygous deletions and strong amplifications encompassing known tumor suppressor genes and oncogenes as well as novel cancer related genes. PMID:20706610

  14. Who Are the Okinawans? Ancestry, Genome Diversity, and Implications for the Genetic Study of Human Longevity From a Geographically Isolated Population

    PubMed Central

    Hsueh, Wen-Chi; He, Qimei; Willcox, D. Craig; Nievergelt, Caroline M.; Donlon, Timothy A.; Kwok, Pui-Yan; Suzuki, Makoto; Willcox, Bradley J.

    2014-01-01

    Isolated populations have advantages for genetic studies of longevity from decreased haplotype diversity and long-range linkage disequilibrium. This permits smaller sample sizes without loss of power, among other utilities. Little is known about the genome of the Okinawans, a potential population isolate, recognized for longevity. Therefore, we assessed genetic diversity, structure, and admixture in Okinawans, and compared this with Caucasians, Chinese, Japanese, and Africans from HapMap II, genotyped on the same Affymetrix GeneChip Human Mapping 500K array. Principal component analysis, haplotype coverage, and linkage disequilibrium decay revealed a distinct Okinawan genome—more homogeneity, less haplotype diversity, and longer range linkage disequilibrium. Population structure and admixture analyses utilizing 52 global reference populations from the Human Genome Diversity Cell Line Panel demonstrated that Okinawans clustered almost exclusively with East Asians. Sibling relative risk (λs) analysis revealed that siblings of Okinawan centenarians have 3.11 times (females) and 3.77 times (males) more likelihood of centenarianism. These findings suggest that Okinawans are genetically distinct and share several characteristics of a population isolate, which are prone to develop extreme phenotypes (eg, longevity) from genetic drift, natural selection, and population bottlenecks. These data support further exploration of genetic influence on longevity in the Okinawans. PMID:24444611

  15. The auxin response factor transcription factor family in soybean: genome-wide identification and expression analyses during development and water stress.

    PubMed

    Ha, Chien Van; Le, Dung Tien; Nishiyama, Rie; Watanabe, Yasuko; Sulieman, Saad; Tran, Uyen Thi; Mochida, Keiichi; Dong, Nguyen Van; Yamaguchi-Shinozaki, Kazuko; Shinozaki, Kazuo; Tran, Lam-Son Phan

    2013-10-01

    In plants, the auxin response factor (ARF) transcription factors play important roles in regulating diverse biological processes, including development, growth, cell division and responses to environmental stimuli. An exhaustive search of soybean genome revealed 51 GmARFs, many of which were formed by genome duplications. The typical GmARFs (43 members) contain a DNA-binding domain, an ARF domain and an auxin/indole acetic acid (AUX/IAA) dimerization domain, whereas the remaining eight members lack the dimerization domain. Phylogenetic analysis of the ARFs from soybean and Arabidopsis revealed both similarity and divergence between the two ARF families, as well as enabled us to predict the functions of the GmARFs. Using quantitative real-time polymerase chain reaction (qRT-PCR) and available soybean Affymetrix array and Illumina transcriptome sequence data, a comprehensive expression atlas of GmARF genes was obtained in various organs and tissues, providing useful information about their involvement in defining the precise nature of individual tissues. Furthermore, expression profiling using qRT-PCR and microarray data revealed many water stress-responsive GmARFs in soybean, albeit with different patterns depending on types of tissues and/or developmental stages. Our systematic analysis has identified excellent tissue-specific and/or stress-responsive candidate GmARF genes for in-depth in planta functional analyses, which would lead to potential applications in the development of genetically modified soybean cultivars with enhanced drought tolerance. PMID:23810914

  16. The Auxin Response Factor Transcription Factor Family in Soybean: Genome-Wide Identification and Expression Analyses During Development and Water Stress

    PubMed Central

    Van Ha, Chien; Le, Dung Tien; Nishiyama, Rie; Watanabe, Yasuko; Sulieman, Saad; Tran, Uyen Thi; Mochida, Keiichi; Van Dong, Nguyen; Yamaguchi-Shinozaki, Kazuko; Shinozaki, Kazuo; Tran, Lam-Son Phan

    2013-01-01

    In plants, the auxin response factor (ARF) transcription factors play important roles in regulating diverse biological processes, including development, growth, cell division and responses to environmental stimuli. An exhaustive search of soybean genome revealed 51 GmARFs, many of which were formed by genome duplications. The typical GmARFs (43 members) contain a DNA-binding domain, an ARF domain and an auxin/indole acetic acid (AUX/IAA) dimerization domain, whereas the remaining eight members lack the dimerization domain. Phylogenetic analysis of the ARFs from soybean and Arabidopsis revealed both similarity and divergence between the two ARF families, as well as enabled us to predict the functions of the GmARFs. Using quantitative real-time polymerase chain reaction (qRT-PCR) and available soybean Affymetrix array and Illumina transcriptome sequence data, a comprehensive expression atlas of GmARF genes was obtained in various organs and tissues, providing useful information about their involvement in defining the precise nature of individual tissues. Furthermore, expression profiling using qRT-PCR and microarray data revealed many water stress-responsive GmARFs in soybean, albeit with different patterns depending on types of tissues and/or developmental stages. Our systematic analysis has identified excellent tissue-specific and/or stress-responsive candidate GmARF genes for in-depth in planta functional analyses, which would lead to potential applications in the development of genetically modified soybean cultivars with enhanced drought tolerance. PMID:23810914

  17. Importance of anchor genomes for any plant genome project

    PubMed Central

    Messing, Joachim; Llaca, Victor

    1998-01-01

    Progress in agricultural and environmental technologies is hampered by a slower rate of gene discovery in plants than animals. The vast pool of genes in plants, however, will be an important resource for insertion of genes, via biotechnological procedures, into an array of plants, generating unique germ plasms not achievable by conventional breeding. It just became clear that genomes of grasses have evolved in a manner analogous to Lego blocks. Large chromosome segments have been reshuffled and stuffer pieces added between genes. Although some genomes have become very large, the genome with the fewest stuffer pieces, the rice genome, is the Rosetta Stone of all the bigger grass genomes. This means that sequencing the rice genome as anchor genome of the grasses will provide instantaneous access to the same genes in the same relative physical position in other grasses (e.g., corn and wheat), without the need to sequence each of these genomes independently. (i) The sequencing of the entire genome of rice as anchor genome for the grasses will accelerate plant gene discovery in many important crops (e.g., corn, wheat, and rice) by several orders of magnitudes and reduce research and development costs for government and industry at a faster pace. (ii) Costs for sequencing entire genomes have come down significantly. Because of its size, rice is only 12% of the human or the corn genome, and technology improvements by the human genome project are completely transferable, translating in another 50% reduction of the costs. (iii) The physical mapping of the rice genome by a group of Japanese researchers provides a jump start for sequencing the genome and forming an international consortium. Otherwise, other countries would do it alone and own proprietary positions. PMID:9482827

  18. Genomics in Cardiovascular Disease

    PubMed Central

    Roberts, Robert; Marian, A.J.; Dandona, Sonny; Stewart, Alexandre F.R.

    2013-01-01

    A paradigm shift towards biology occurred in the 1990’s subsequently catalyzed by the sequencing of the human genome in 2000. The cost of DNA sequencing has gone from millions to thousands of dollars with sequencing of one’s entire genome costing only $1,000. Rapid DNA sequencing is being embraced for single gene disorders, particularly for sporadic cases and those from small families. Transmission of lethal genes such as associated with Huntington’s disease can, through in-vitro fertilization, avoid passing it on to one’s offspring. DNA sequencing will meet the challenge of elucidating the genetic predisposition for common polygenic diseases, especially in determining the function of the novel common genetic risk variants and identifying the rare variants, which may also partially ascertain the source of the missing heritability. The challenge for DNA sequencing remains great, despite human genome sequences being 99.5% identical, the 3 million single nucleotide polymorphisms (SNPs) responsible for most of the unique features add up to 60 new mutations per person which, for 7 billion people, is 420 billion mutations. It is claimed that DNA sequencing has increased 10,000 fold while information storage and retrieval only 16 fold. The physician and health user will be challenged by the convergence of two major trends, whole genome sequencing and the storage/retrieval and integration of the data. PMID:23524054

  19. Poster: the macaque genome.

    PubMed

    2007-04-13

    The rhesus macaque (Macaca mulatta) facilitates an extraordinary range of biomedical and basic research, and the publication of the genome only makes it a more powerful model for studies of human disease; moreover, the macaque's position relative to humans and chimpanzees affords the opportunity to learn about the processes that have shaped the last 25 million years of primate evolution. To allow users to explore these themes of the macaque genome, Science has created a special interactive version of the poster published in the print edition of the 13 April 2007 issue. The interactive version includes additional text and exploration, as well as embedded video featuring seven scientists discussing the importance of the macaque and its genome sequence in studies of biomedicine and evolution. We have also created an accompanying teaching resource, including a lesson plan aimed at teachers of advanced high school life science students, for exploring what a comparison of the macaque and human genomes can tell us about human biology and evolution. These items are free to all site visitors. PMID:17431172

  20. (Genomic variation in maize)

    SciTech Connect

    Rivin, C.J.

    1991-01-01

    These studies have sought to learn how different DNA sequences and sequence arrangements contribute to genome plasticity in maize. We describe quantitative variation among maize inbred lines for tandemly arrayed and dispersed repeated DNA sequences and gene families, and qualitative variation for sequences homologous to the Mutator family of transposons. The potential of these sequences to undergo unequal crossing over, non-allelic (ectopic) recombination and transposition makes them a source of genome instability. We have found examples of rapid genomic change involving these sequences in Fl hybrids, tissue culture cells and regenerated plants. We describe the repetitive portion of the maize genome as composed primarily of sequences that vary markedly in copy number among different genetic stocks. The most highly variable is the 185 bp repeat associated with the heterochromatic chromosome knobs. Even in lines without visible knobs, there is a considerable quantity of tandemly arrayed repeats. We also found a high degree of variability for the tandemly arrayed 5S and ribosomal DNA repeats. While such variation might be expected as the result of unequal cross-over, we were surprised to find considerable variation among lower copy number, dispersed repeats as well. One highly repeated sequence that showed a complex tandem and dispersed arrangement stood out as showing no detectable variability among the maize lines. In striking contrast to the variability seen between the inbred stocks, individuals within a stock were indistinguishable with regard to their repeated sequence multiplicities.

  1. Better chocolate through genomics

    Technology Transfer Automated Retrieval System (TEKTRAN)

    Theobroma cacao, the cacao or chocolate tree, is a tropical understory tree whose seeds are used to make chocolate. And like any important crop, cacao is the subject of much research. On September 15, 2010, scientists publicly released a preliminary sequence of the cacao genome--which contains all o...

  2. The Nostoc punctiforme Genome

    SciTech Connect

    John C. Meeks

    2001-12-31

    Nostoc punctiforme is a filamentous cyanobacterium with extensive phenotypic characteristics and a relatively large genome, approaching 10 Mb. The phenotypic characteristics include a photoautotrophic, diazotrophic mode of growth, but N. punctiforme is also facultatively heterotrophic; its vegetative cells have multiple development alternatives, including terminal differentiation into nitrogen-fixing heterocysts and transient differentiation into spore-like akinetes or motile filaments called hormogonia; and N. punctiforme has broad symbiotic competence with fungi and terrestrial plants, including bryophytes, gymnosperms and an angiosperm. The shotgun-sequencing phase of the N. punctiforme strain ATCC 29133 genome has been completed by the Joint Genome Institute. Annotation of an 8.9 Mb database yielded 7432 open reading frames, 45% of which encode proteins with known or probable known function and 29% of which are unique to N. punctiforme. Comparative analysis of the sequence indicates a genome that is highly plastic and in a state of flux, with numerous insertion sequences and multilocus repeats, as well as genes encoding transposases and DNA modification enzymes. The sequence also reveals the presence of genes encoding putative proteins that collectively define almost all characteristics of cyanobacteria as a group. N. punctiforme has an extensive potential to sense and respond to environmental signals as reflected by the presence of more than 400 genes encoding sensor protein kinases, response regulators and other transcriptional factors. The signal transduction systems and any of the large number of unique genes may play essential roles in the cell differentiation and symbiotic interaction properties of N. punctiforme.

  3. The human genome project.

    PubMed Central

    Olson, M V

    1993-01-01

    The Human Genome Project in the United States is now well underway. Its programmatic direction was largely set by a National Research Council report issued in 1988. The broad framework supplied by this report has survived almost unchanged despite an upheaval in the technology of genome analysis. This upheaval has primarily affected physical and genetic mapping, the two dominant activities in the present phase of the project. Advances in mapping techniques have allowed good progress toward the specific goals of the project and are also providing strong corollary benefits throughout biomedical research. Actual DNA sequencing of the genomes of the human and model organisms is still at an early stage. There has been little progress in the intrinsic efficiency of DNA-sequence determination. However, refinements in experimental protocols, instrumentation, and project management have made it practical to acquire sequence data on an enlarged scale. It is also increasingly apparent that DNA-sequence data provide a potent means of relating knowledge gained from the study of model organisms to human biology. There is as yet little indication that the infusion of technology from outside biology into the Human Genome Project has been effectively stimulated. Opportunities in this area remain large, posing substantial technical and policy challenges. PMID:8506271

  4. Genetics, genomics and fertility

    Technology Transfer Automated Retrieval System (TEKTRAN)

    In order to enhance the sustainability of dairy businesses, new management tools are needed to increase the fertility of dairy cattle. Genomic selection has been successfully used by AI studs to screen potential sires and significantly decrease the generation interval of bulls. Buoyed by the success...

  5. Dairy genomics in application

    Technology Transfer Automated Retrieval System (TEKTRAN)

    Implementation of genomic evaluation has caused profound changes in dairy cattle breeding. All young bulls bought by major artificial-insemination organizations now are selected based on these evaluation. Evaluation reliability can reach ~75% for yield traits, which is adequate for marketing semen o...

  6. Genomic selection in plant breeding

    Technology Transfer Automated Retrieval System (TEKTRAN)

    Genomic selection (GS) is a method to predict the genetic value of selection candidates based on the genomic estimated breeding value (GEBV) predicted from high-density markers positioned throughout the genome. Unlike marker-assisted selection, the GEBV is based on all markers including both minor ...

  7. Whole genome co-expression analysis of soybean cytochrome P450 genes identifies nodulation-specific P450 monooxygenases

    PubMed Central

    2010-01-01

    Background Cytochrome P450 monooxygenases (P450s) catalyze oxidation of various substrates using oxygen and NAD(P)H. Plant P450s are involved in the biosynthesis of primary and secondary metabolites performing diverse biological functions. The recent availability of the soybean genome sequence allows us to identify and analyze soybean putative P450s at a genome scale. Co-expression analysis using an available soybean microarray and Illumina sequencing data provides clues for functional annotation of these enzymes. This approach is based on the assumption that genes that have similar expression patterns across a set of conditions may have a functional relationship. Results We have identified a total number of 332 full-length P450 genes and 378 pseudogenes from the soybean genome. From the full-length sequences, 195 genes belong to A-type, which could be further divided into 20 families. The remaining 137 genes belong to non-A type P450s and are classified into 28 families. A total of 178 probe sets were found to correspond to P450 genes on the Affymetrix soybean array. Out of these probe sets, 108 represented single genes. Using the 28 publicly available microarray libraries that contain organ-specific information, some tissue-specific P450s were identified. Similarly, stress responsive soybean P450s were retrieved from 99 microarray soybean libraries. We also utilized Illumina transcriptome sequencing technology to analyze the expressions of all 332 soybean P450 genes. This dataset contains total RNAs isolated from nodules, roots, root tips, leaves, flowers, green pods, apical meristem, mock-inoculated and Bradyrhizobium japonicum-infected root hair cells. The tissue-specific expression patterns of these P450 genes were analyzed and the expression of a representative set of genes were confirmed by qRT-PCR. We performed the co-expression analysis on many of the 108 P450 genes on the Affymetrix arrays. First we confirmed that CYP93C5 (an isoflavone synthase gene) is

  8. Plant functional genomics

    NASA Astrophysics Data System (ADS)

    Holtorf, Hauke; Guitton, Marie-Christine; Reski, Ralf

    2002-04-01

    Functional genome analysis of plants has entered the high-throughput stage. The complete genome information from key species such as Arabidopsis thaliana and rice is now available and will further boost the application of a range of new technologies to functional plant gene analysis. To broadly assign functions to unknown genes, different fast and multiparallel approaches are currently used and developed. These new technologies are based on known methods but are adapted and improved to accommodate for comprehensive, large-scale gene analysis, i.e. such techniques are novel in the sense that their design allows researchers to analyse many genes at the same time and at an unprecedented pace. Such methods allow analysis of the different constituents of the cell that help to deduce gene function, namely the transcripts, proteins and metabolites. Similarly the phenotypic variations of entire mutant collections can now be analysed in a much faster and more efficient way than before. The different methodologies have developed to form their own fields within the functional genomics technological platform and are termed transcriptomics, proteomics, metabolomics and phenomics. Gene function, however, cannot solely be inferred by using only one such approach. Rather, it is only by bringing together all the information collected by different functional genomic tools that one will be able to unequivocally assign functions to unknown plant genes. This review focuses on current technical developments and their impact on the field of plant functional genomics. The lower plant Physcomitrella is introduced as a new model system for gene function analysis, owing to its high rate of homologous recombination.

  9. Thinking laterally about genomes.

    PubMed

    Ragan, Mark A

    2009-10-01

    Perhaps the most-surprising discovery of the genome era has been the extent to which prokaryotic and many eukaryotic genomes incorporate genetic material from sources other than their parent(s). Lateral genetic transfer (LGT) among bacteria was first observed about 100 years ago, and is now accepted to underlie important phenomena including the spread of antibiotic resistance and ability to degrade xenobiotics. LGT is invoked, perhaps too readily, to explain a breadth of awkward data including compositional heterogeneity of genomes, disagreement among gene-sequence trees, and mismatch between physiology and systematics. At the same time many details of LGT remain unknown or controversial, and some key questions have scarcely been asked. Here I critically review what we think we know about the existence, extent, mechanism and impact of LGT; identify important open questions; and point to research directions that hold particular promise for elucidating the role of LGT in genome evolution. Evidence for LGT in nature is not only inferential but also direct, and potential vectors are ubiquitous. Genetic material can pass between diverse habitats and be significantly altered during residency in viruses, complicating the inference of donors, In prokaryotes about twice as many genes are interrupted by LGT as are transferred intact, and about 5Short protein domains can be privileged units of transfer. Unresolved phylogenetic issues include the correct null hypothesis, and genes as units of analysis. Themes are beginning to emerge regarding the effect of LGT on cellular networks, but I show why generalization is premature. LGT can associate with radical changes in physiology and ecological niche. Better quantitative models of genome evolution are needed, and theoretical frameworks remain to be developed for some observations including chromosome assembly by LGT. PMID:20180279

  10. TUTORIAL ON NETWORK GENOMICS.

    SciTech Connect

    Forst, C.

    2001-01-01

    With the ever-increasing genomic information pouring into the databases researchers start to look for pattern in genomes. Key questions are the identification of function. In the past function was mainly understood to be assigned to a single gene isolated from other cellular components or mechanisms. Sequence comparison fo single genes and their products (proteins) as well as of intergenic space are a consequence of a well established one-gene one-function interpretation. prediction of function solely by sequence similarity searches are powerful techniques that initiated the advent of bioinformatics and computational biology. Seminal work on sequence alignment by Temple Smith and Michael Waterman [33] and sequence searches with the BLAST algorithm by Altschul et al. [2] provide essential methods for sequence based determination of function. Similar outstanding contributions to determination of function have been archived in the area of structure prediction, molecular modeling and molecular dynamics. Techniques covering ab initio and homology modeling up to biophysical interpretation of long-run molecular dynamics simulations are mentioned ehre. With the ever-increasing number of information of different genetic/genomic origin, new aspect are looked for that deviate from the single gene at a time method. Especially with the identification of surprisingly few human genes the emerging perception in the scientific community that the concept of function has to be extended to include other sequence based as well as non-sequenced based information. A schema of determination of function by different concepts is shown in Figure 1. The tutorial is comprised of the following sections: The first two sections discuss the differences between genomic and non-genomic based context information, section three will cover combined methods. Finally, section four lsits web-resources and databases. All presented approaches extensively employ comparative methods.

  11. USH1G with unique retinal findings caused by a novel truncating mutation identified by genome-wide linkage analysis

    PubMed Central

    Taibah, Khalid; Bin-Khamis, Ghada; Kennedy, Shelley; Hemidan, Amal; Al-Qahtani, Faisal; Tabbara, Khalid; Mubarak, Bashayer Al; Ramzan, Khushnooda; Meyer, Brian F.; Al-Owain, Mohammed

    2012-01-01

    Purpose Usher syndrome (USH) is an autosomal recessive disorder divided into three distinct clinical subtypes based on the severity of the hearing loss, manifestation of vestibular dysfunction, and the age of onset of retinitis pigmentosa and visual symptoms. To date, mutations in seven different genes have been reported to cause USH type 1 (USH1), the most severe form. Patients diagnosed with USH1 are known to be ideal candidates to benefit from cochlear implantation. Methods Genome-wide linkage analysis using Affymetrix GeneChip Human Mapping 10K arrays were performed in three cochlear implanted Saudi siblings born from a consanguineous marriage, clinically diagnosed with USH1 by comprehensive clinical, audiological, and ophthalmological examinations. From the linkage results, the USH1G gene was screened for mutations by direct sequencing of the coding exons. Results We report the identification of a novel p.S243X truncating mutation in USH1G that segregated with the disease phenotype and was not present in 300 ethnically matched normal controls. We also report on the novel retinal findings and the outcome of cochlear implantation in the affected individuals. Conclusions In addition to reporting a novel truncating mutation, this report expands the retinal phenotype in USH1G and presents the first report of successful cochlear implants in this disease. PMID:22876113

  12. 3' tag digital gene expression profiling of human brain and universal reference RNA using Illumina Genome Analyzer

    PubMed Central

    2009-01-01

    Background Massive parallel sequencing has the potential to replace microarrays as the method for transcriptome profiling. Currently there are two protocols: full-length RNA sequencing (RNA-SEQ) and 3'-tag digital gene expression (DGE). In this preliminary effort, we evaluated the 3' DGE approach using two reference RNA samples from the MicroArray Quality Control Consortium (MAQC). Results Using Brain RNA sample from multiple runs, we demonstrated that the transcript profiles from 3' DGE were highly reproducible between technical and biological replicates from libraries constructed by the same lab and even by different labs, and between two generations of Illumina's Genome Analyzers. Approximately 65% of all sequence reads mapped to mitochondrial genes, ribosomal RNAs, and canonical transcripts. The expression profiles of brain RNA and universal human reference RNA were compared which demonstrated that DGE was also highly quantitative with excellent correlation of differential expression with quantitative real-time PCR. Furthermore, one lane of 3' DGE sequencing, using the current sequencing chemistry and image processing software, had wider dynamic range for transcriptome profiling and was able to detect lower expressed genes which are normally below the detection threshold of microarrays. Conclusion 3' tag DGE profiling with massive parallel sequencing achieved high sensitivity and reproducibility for transcriptome profiling. Although it lacks the ability of detecting alternative splicing events compared to RNA-SEQ, it is much more affordable and clearly out-performed microarrays (Affymetrix) in detecting lower abundant transcripts. PMID:19917133

  13. Whole genome single nucleotide polymorphism based phylogeny of Francisella tularensis and its application to the development of a strain typing assay

    PubMed Central

    2009-01-01

    Background A low genetic diversity in Francisella tularensis has been documented. Current DNA based genotyping methods for typing F. tularensis offer a limited and varying degree of subspecies, clade and strain level discrimination power. Whole genome sequencing is the most accurate and reliable method to identify, type and determine phylogenetic relationships among strains of a species. However, lower cost typing schemes are necessary in order to enable typing of hundreds or even thousands of isolates. Results We have generated a high-resolution phylogenetic tree from 40 Francisella isolates, including 13 F. tularensis subspecies holarctica (type B) strains, 26 F. tularensis subsp. tularensis (type A) strains and a single F. novicida strain. The tree was generated from global multi-strain single nucleotide polymorphism (SNP) data collected using a set of six Affymetrix GeneChip® resequencing arrays with the non-repetitive portion of LVS (type B) as the reference sequence complemented with unique sequences of SCHU S4 (type A). Global SNP based phylogenetic clustering was able to resolve all non-related strains. The phylogenetic tree was used to guide the selection of informative SNPs specific to major nodes in the tree for development of a genotyping assay for identification of F. tularensis subspecies and clades. We designed and validated an assay that uses these SNPs to accurately genotype 39 additional F. tularensis strains as type A (A1, A2, A1a or A1b) or type B (B1 or B2). Conclusion Whole-genome SNP based clustering was shown to accurately identify SNPs for differentiation of F. tularensis subspecies and clades, emphasizing the potential power and utility of this methodology for selecting SNPs for typing of F. tularensis to the strain level. Additionally, whole genome sequence based SNP information gained from a representative population of strains may be used to perform evolutionary or phylogenetic comparisons of strains, or selection of unique strains for

  14. Polygenic transmission and complex neuro developmental network for attention deficit hyperactivity disorder: genome-wide association study of both common and rare variants.

    PubMed

    Yang, Li; Neale, Benjamin M; Liu, Lu; Lee, S Hong; Wray, Naomi R; Ji, Ning; Li, Haimei; Qian, Qiujin; Wang, Dongliang; Li, Jun; Faraone, Stephen V; Wang, Yufeng; Doyle, Alysa E; Reif, Andreas; Rothenberger, Aribert; Franke, Barbara; Sonuga-Barke, Edmund J S; Steinhausen, Hans-Christoph; Buitelaar, Jan K; Kuntsi, Jonna; Biederman, Joseph; Lesch, Klaus-Peter; Kent, Lindsey; Asherson, Philip; Oades, Robert D; Loo, Sandra K; Nelson, Stan F; Faraone, Stephen V; Smalley, Susan L; Banaschewski, Tobias; Arias Vasquez, Alejandro; Todorov, Alexandre; Charach, Alice; Miranda, Ana; Warnke, Andreas; Thapar, Anita; Neale, Benjamin M; Cormand, Bru; Freitag, Christine; Mick, Eric; Mulas, Fernando; Middleton, Frank; HakonarsonHakonarson, Hakon; Palmason, Haukur; Schäfer, Helmut; Roeyers, Herbert; McGough, James J; Romanos, Jasmin; Crosbie, Jennifer; Meyer, Jobst; Ramos-Quiroga, Josep Antoni; Sergeant, Joseph; Elia, Josephine; Langely, Kate; Nisenbaum, Laura; Romanos, Marcel; Daly, Mark J; Ribasés, Marta; Gill, Michael; O'Donovan, Michael; Owen, Michael; Casas, Miguel; Bayés, Mònica; Lambregts-Rommelse, Nanda; Williams, Nigel; Holmans, Peter; Anney, Richard J L; Ebstein, Richard P; Schachar, Russell; Medland, Sarah E; Ripke, Stephan; Walitza, Susanne; Nguyen, Thuy Trang; Renner, Tobias J; Hu, Xiaolan

    2013-07-01

    Attention-deficit hyperactivity disorder (ADHD) is a complex polygenic disorder. This study aimed to discover common and rare DNA variants associated with ADHD in a large homogeneous Han Chinese ADHD case-control sample. The sample comprised 1,040 cases and 963 controls. All cases met DSM-IV ADHD diagnostic criteria. We used the Affymetrix6.0 array to assay both single nucleotide polymorphisms (SNPs) and copy number variants (CNVs). Genome-wide association analyses were performed using PLINK. SNP-heritability and SNP-genetic correlations with ADHD in Caucasians were estimated with genome-wide complex trait analysis (GCTA). Pathway analyses were performed using the Interval enRICHment Test (INRICH), the Disease Association Protein-Protein Link Evaluator (DAPPLE), and the Genomic Regions Enrichment of Annotations Tool (GREAT). We did not find genome-wide significance for single SNPs but did find an increased burden of large, rare CNVs in the ADHD sample (P = 0.038). SNP-heritability was estimated to be 0.42 (standard error, 0.13, P = 0.0017) and the SNP-genetic correlation with European Ancestry ADHD samples was 0.39 (SE 0.15, P = 0.0072). The INRICH, DAPPLE, and GREAT analyses implicated several gene ontology cellular components, including neuron projections and synaptic components, which are consistent with a neurodevelopmental pathophysiology for ADHD. This study suggested the genetic architecture of ADHD comprises both common and rare variants. Some common causal variants are likely to be shared between Han Chinese and Caucasians. Complex neurodevelopmental networks may underlie ADHD's etiology. PMID:23728934

  15. Genome size evolution in macroparasites.

    PubMed

    Sundberg, Lotta-Riina; Pulkkinen, Katja

    2015-04-01

    Reduction in genome size has been associated not only with a parasitic lifestyle in intracellular microparasites but also in some macroparasitic insects and nematodes. We collected the available data on genome size for flatworms, annelids, nematodes and arthropods, compared those with available data for the phylogenetically closest free-living taxa and found evidence of smaller genome sizes for parasites in six of nine comparisons. Our results suggest that despite great differences in evolutionary history and life cycles, parasitism as a lifestyle promotes convergent genome size reduction in macroparasites. We discuss factors that could be associated with small genome size in parasites which require further exploration in the future. PMID:25724591

  16. Professional medical education and genomics.

    PubMed

    Demmer, Laurie A; Waggoner, Darrel J

    2014-01-01

    Genomic medicine is a relatively new concept that involves using individual patients' genomic results in their clinical care. Genetic technology has advanced swiftly over the past decade, and most providers have been left behind without an understanding of this complex field. To realize its full potential, genomic medicine must be both understood and accepted by the greater medical community. The current state of professional medical education in genomics and genomic medicine is reviewed, including ongoing plans to expand educational efforts for medical students, clinical geneticists, and nongeneticist physicians. PMID:24635717

  17. Evolution of plant genome architecture.

    PubMed

    Wendel, Jonathan F; Jackson, Scott A; Meyers, Blake C; Wing, Rod A

    2016-01-01

    We have witnessed an explosion in our understanding of the evolution and structure of plant genomes in recent years. Here, we highlight three important emergent realizations: (1) that the evolutionary history of all plant genomes contains multiple, cyclical episodes of whole-genome doubling that were followed by myriad fractionation processes; (2) that the vast majority of the variation in genome size reflects the dynamics of proliferation and loss of lineage-specific transposable elements; and (3) that various classes of small RNAs help shape genomic architecture and function. We illustrate ways in which understanding these organism-level and molecular genetic processes can be used for crop plant improvement. PMID:26926526

  18. Rice: The First Crop Genome.

    PubMed

    Jackson, Scott A

    2016-12-01

    Rice was the first sequenced crop genome, paving the way for the sequencing of additional and more complicated crop genomes. The impact that the genome sequence made on rice genetics and breeding research was immediate, as evidence by citations and DNA marker use. The impact on other crop genomes was evident too, particularly for those within the grass family. As we celebrate 10 years since the completion of the rice genome sequence, we look forward to new empowering tool sets that will further revolutionize research in rice genetics and breeding and result in varieties that will continue to feed a growing population. PMID:27003180

  19. Nongenetic functions of the genome.

    PubMed

    Bustin, Michael; Misteli, Tom

    2016-05-01

    The primary function of the genome is to store, propagate, and express the genetic information that gives rise to a cell's architectural and functional machinery. However, the genome is also a major structural component of the cell. Besides its genetic roles, the genome affects cellular functions by nongenetic means through its physical and structural properties, particularly by exerting mechanical forces and by serving as a scaffold for binding of cellular components. Major cellular processes affected by nongenetic functions of the genome include establishment of nuclear structure, signal transduction, mechanoresponses, cell migration, and vision in nocturnal animals. We discuss the concept, mechanisms, and implications of nongenetic functions of the genome. PMID:27151873

  20. Genome of Crocodilepox Virus

    PubMed Central

    Afonso, C. L.; Tulman, E. R.; Delhon, G.; Lu, Z.; Viljoen, G. J.; Wallace, D. B.; Kutish, G. F.; Rock, D. L.

    2006-01-01

    Here, we present the genome sequence, with analysis, of a poxvirus infecting Nile crocodiles (Crocodylus niloticus) (crocodilepox virus; CRV). The genome is 190,054 bp (62% G+C) and predicted to contain 173 genes encoding proteins of 53 to 1,941 amino acids. The central genomic region contains genes conserved and generally colinear with those of other chordopoxviruses (ChPVs). CRV is distinct, as the terminal 33-kbp (left) and 13-kbp (right) genomic regions are largely CRV specific, containing 48 unique genes which lack similarity to other poxvirus genes. Notably, CRV also contains 14 unique genes which disrupt ChPV gene colinearity within the central genomic region, including 7 genes encoding GyrB-like ATPase domains similar to those in cellular type IIA DNA topoisomerases, suggestive of novel ATP-dependent functions. The presence of 10 CRV proteins with similarity to components of cellular multisubunit E3 ubiquitin-protein ligase complexes, including 9 proteins containing F-box motifs and F-box-associated regions and a homologue of cellular anaphase-promoting complex subunit 11 (Apc11), suggests that modification of host ubiquitination pathways may be significant for CRV-host cell interaction. CRV encodes a novel complement of proteins potentially involved in DNA replication, including a NAD+-dependent DNA ligase and a protein with similarity to both vaccinia virus F16L and prokaryotic serine site-specific resolvase-invertases. CRV lacks genes encoding proteins for nucleotide metabolism. CRV shares notable genomic similarities with molluscum contagiosum virus, including genes found only in these two viruses. Phylogenetic analysis indicates that CRV is quite distinct from other ChPVs, representing a new genus within the subfamily Chordopoxvirinae, and it lacks recognizable homologues of most ChPV genes involved in virulence and host range, including those involving interferon response, intracellular signaling, and host immune response modulation. These data reveal

  1. Genome of crocodilepox virus.

    PubMed

    Afonso, C L; Tulman, E R; Delhon, G; Lu, Z; Viljoen, G J; Wallace, D B; Kutish, G F; Rock, D L

    2006-05-01

    Here, we present the genome sequence, with analysis, of a poxvirus infecting Nile crocodiles (Crocodylus niloticus) (crocodilepox virus; CRV). The genome is 190,054 bp (62% G+C) and predicted to contain 173 genes encoding proteins of 53 to 1,941 amino acids. The central genomic region contains genes conserved and generally colinear with those of other chordopoxviruses (ChPVs). CRV is distinct, as the terminal 33-kbp (left) and 13-kbp (right) genomic regions are largely CRV specific, containing 48 unique genes which lack similarity to other poxvirus genes. Notably, CRV also contains 14 unique genes which disrupt ChPV gene colinearity within the central genomic region, including 7 genes encoding GyrB-like ATPase domains similar to those in cellular type IIA DNA topoisomerases, suggestive of novel ATP-dependent functions. The presence of 10 CRV proteins with similarity to components of cellular multisubunit E3 ubiquitin-protein ligase complexes, including 9 proteins containing F-box motifs and F-box-associated regions and a homologue of cellular anaphase-promoting complex subunit 11 (Apc11), suggests that modification of host ubiquitination pathways may be significant for CRV-host cell interaction. CRV encodes a novel complement of proteins potentially involved in DNA replication, including a NAD(+)-dependent DNA ligase and a protein with similarity to both vaccinia virus F16L and prokaryotic serine site-specific resolvase-invertases. CRV lacks genes encoding proteins for nucleotide metabolism. CRV shares notable genomic similarities with molluscum contagiosum virus, including genes found only in these two viruses. Phylogenetic analysis indicates that CRV is quite distinct from other ChPVs, representing a new genus within the subfamily Chordopoxvirinae, and it lacks recognizable homologues of most ChPV genes involved in virulence and host range, including those involving interferon response, intracellular signaling, and host immune response modulation. These data

  2. Evolution of small prokaryotic genomes

    PubMed Central

    Martínez-Cano, David J.; Reyes-Prieto, Mariana; Martínez-Romero, Esperanza; Partida-Martínez, Laila P.; Latorre, Amparo; Moya, Andrés; Delaye, Luis

    2015-01-01

    As revealed by genome sequencing, the biology of prokaryotes with reduced genomes is strikingly diverse. These include free-living prokaryotes with ∼800 genes as well as endosymbiotic bacteria with as few as ∼140 genes. Comparative genomics is revealing the evolutionary mechanisms that led to these small genomes. In the case of free-living prokaryotes, natural selection directly favored genome reduction, while in the case of endosymbiotic prokaryotes neutral processes played a more prominent role. However, new experimental data suggest that selective processes may be at operation as well for endosymbiotic prokaryotes at least during the first stages of genome reduction. Endosymbiotic prokaryotes have evolved diverse strategies for living with reduced gene sets inside a host-defined medium. These include utilization of host-encoded functions (some of them coded by genes acquired by gene transfer from the endosymbiont and/or other bacteria); metabolic complementation between co-symbionts; and forming consortiums with other bacteria within the host. Recent genome sequencing projects of intracellular mutualistic bacteria showed that previously believed universal evolutionary trends like reduced G+C content and conservation of genome synteny are not always present in highly reduced genomes. Finally, the simplified molecular machinery of some of these organisms with small genomes may be used to aid in the design of artificial minimal cells. Here we review recent genomic discoveries of the biology of prokaryotes endowed with small gene sets and discuss the evolutionary mechanisms that have been proposed to explain their peculiar nature. PMID:25610432

  3. Advances in plant chromosome genomics.

    PubMed

    Doležel, Jaroslav; Vrána, Jan; Cápal, Petr; Kubaláková, Marie; Burešová, Veronika; Simková, Hana

    2014-01-01

    Next generation sequencing (NGS) is revolutionizing genomics and is providing novel insights into genome organization, evolution and function. The number of plant genomes targeted for sequencing is rising. For the moment, however, the acquisition of full genome sequences in large genome species remains difficult, largely because the short reads produced by NGS platforms are inadequate to cope with repeat-rich DNA, which forms a large part of these genomes. The problem of sequence redundancy is compounded in polyploids, which dominate the plant kingdom. An approach to overcoming some of these difficulties is to reduce the full nuclear genome to its individual chromosomes using flow-sorting. The DNA acquired in this way has proven to be suitable for many applications, including PCR-based physical mapping, in situ hybridization, forming DNA arrays, the development of DNA markers, the construction of BAC libraries and positional cloning. Coupling chromosome sorting with NGS offers opportunities for the study of genome organization at the single chromosomal level, for comparative analyses between related species and for the validation of whole genome assemblies. Apart from the primary aim of reducing the complexity of the template, taking a chromosome-based approach enables independent teams to work in parallel, each tasked with the analysis of a different chromosome(s). Given that the number of plant species tractable for chromosome sorting is increasing, the likelihood is that chromosome genomics - the marriage of cytology and genomics - will make a significant contribution to the field of plant genetics. PMID:24406816

  4. Informational laws of genome structures

    PubMed Central

    Bonnici, Vincenzo; Manca, Vincenzo

    2016-01-01

    In recent years, the analysis of genomes by means of strings of length k occurring in the genomes, called k-mers, has provided important insights into the basic mechanisms and design principles of genome structures. In the present study, we focus on the proper choice of the value of k for applying information theoretic concepts that express intrinsic aspects of genomes. The value k = lg2(n), where n is the genome length, is determined to be the best choice in the definition of some genomic informational indexes that are studied and computed for seventy genomes. These indexes, which are based on information entropies and on suitable comparisons with random genomes, suggest five informational laws, to which all of the considered genomes obey. Moreover, an informational genome complexity measure is proposed, which is a generalized logistic map that balances entropic and anti-entropic components of genomes and is related to their evolutionary dynamics. Finally, applications to computational synthetic biology are briefly outlined. PMID:27354155

  5. Informational laws of genome structures.

    PubMed

    Bonnici, Vincenzo; Manca, Vincenzo

    2016-01-01

    In recent years, the analysis of genomes by means of strings of length k occurring in the genomes, called k-mers, has provided important insights into the basic mechanisms and design principles of genome structures. In the present study, we focus on the proper choice of the value of k for applying information theoretic concepts that express intrinsic aspects of genomes. The value k = lg2(n), where n is the genome length, is determined to be the best choice in the definition of some genomic informational indexes that are studied and computed for seventy genomes. These indexes, which are based on information entropies and on suitable comparisons with random genomes, suggest five informational laws, to which all of the considered genomes obey. Moreover, an informational genome complexity measure is proposed, which is a generalized logistic map that balances entropic and anti-entropic components of genomes and is related to their evolutionary dynamics. Finally, applications to computational synthetic biology are briefly outlined. PMID:27354155

  6. Informational laws of genome structures

    NASA Astrophysics Data System (ADS)

    Bonnici, Vincenzo; Manca, Vincenzo

    2016-06-01

    In recent years, the analysis of genomes by means of strings of length k occurring in the genomes, called k-mers, has provided important insights into the basic mechanisms and design principles of genome structures. In the present study, we focus on the proper choice of the value of k for applying information theoretic concepts that express intrinsic aspects of genomes. The value k = lg2(n), where n is the genome length, is determined to be the best choice in the definition of some genomic informational indexes that are studied and computed for seventy genomes. These indexes, which are based on information entropies and on suitable comparisons with random genomes, suggest five informational laws, to which all of the considered genomes obey. Moreover, an informational genome complexity measure is proposed, which is a generalized logistic map that balances entropic and anti-entropic components of genomes and is related to their evolutionary dynamics. Finally, applications to computational synthetic biology are briefly outlined.

  7. Comparative genomics of Brassicaceae crops

    PubMed Central

    Sharma, Ashutosh; Li, Xiaonan; Lim, Yong Pyo

    2014-01-01

    The family Brassicaceae is one of the major groups of the plant kingdom and comprises diverse species of great economic, agronomic and scientific importance, including the model plant Arabidopsis. The sequencing of the Arabidopsis genome has revolutionized our knowledge in the field of plant biology and provides a foundation in genomics and comparative biology. Genomic resources have been utilized in Brassica for diversity analyses, construction of genetic maps and identification of agronomic traits. In Brassicaceae, comparative sequence analysis across the species has been utilized to understand genome structure, evolution and the detection of conserved genomic segments. In this review, we focus on the progress made in genetic resource development, genome sequencing and comparative mapping in Brassica and related species. The utilization of genomic resources and next-generation sequencing approaches in improvement of Brassica crops is also discussed. PMID:24987286

  8. Toward genome-enabled mycology.

    PubMed

    Hibbett, David S; Stajich, Jason E; Spatafora, Joseph W

    2013-01-01

    Genome-enabled mycology is a rapidly expanding field that is characterized by the pervasive use of genome-scale data and associated computational tools in all aspects of fungal biology. Genome-enabled mycology is integrative and often requires teams of researchers with diverse skills in organismal mycology, bioinformatics and molecular biology. This issue of Mycologia presents the first complete fungal genomes in the history of the journal, reflecting the ongoing transformation of mycology into a genome-enabled science. Here, we consider the prospects for genome-enabled mycology and the technical and social challenges that will need to be overcome to grow the database of complete fungal genomes and enable all fungal biologists to make use of the new data. PMID:23928422

  9. Comparative genomics of Brassicaceae crops.

    PubMed

    Sharma, Ashutosh; Li, Xiaonan; Lim, Yong Pyo

    2014-05-01

    The family Brassicaceae is one of the major groups of the plant kingdom and comprises diverse species of great economic, agronomic and scientific importance, including the model plant Arabidopsis. The sequencing of the Arabidopsis genome has revolutionized our knowledge in the field of plant biology and provides a foundation in genomics and comparative biology. Genomic resources have been utilized in Brassica for diversity analyses, construction of genetic maps and identification of agronomic traits. In Brassicaceae, comparative sequence analysis across the species has been utilized to understand genome structure, evolution and the detection of conserved genomic segments. In this review, we focus on the progress made in genetic resource development, genome sequencing and comparative mapping in Brassica and related species. The utilization of genomic resources and next-generation sequencing approaches in improvement of Brassica crops is also discussed. PMID:24987286

  10. eGenomics: Cataloguing Our Complete Genome Collection III

    PubMed Central

    Field, Dawn; Garrity, George; Gray, Tanya; Selengut, Jeremy; Sterk, Peter; Thomson, Nick; Tatusova, Tatiana; Cochrane, Guy; Glöckner, Frank Oliver; Kottmann, Renzo; Lister, Allyson L.; Tateno, Yoshio; Vaughan, Robert

    2007-01-01

    This meeting report summarizes the proceedings of the “eGenomics: Cataloguing our Complete Genome Collection III” workshop held September 11–13, 2006, at the National Institute for Environmental eScience (NIEeS), Cambridge, United Kingdom. This 3rd workshop of the Genomic Standards Consortium was divided into two parts. The first half of the three-day workshop was dedicated to reviewing the genomic diversity of our current and future genome and metagenome collection, and exploring linkages to a series of existing projects through formal presentations. The second half was dedicated to strategic discussions. Outcomes of the workshop include a revised “Minimum Information about a Genome Sequence” (MIGS) specification (v1.1), consensus on a variety of features to be added to the Genome Catalogue (GCat), agreement by several researchers to adopt MIGS for imminent genome publications, and an agreement by the EBI and NCBI to input their genome collections into GCat for the purpose of quantifying the amount of optional data already available (e.g., for geographic location coordinates) and working towards a single, global list of all public genomes and metagenomes.

  11. The Genomic Standards Consortium

    PubMed Central

    Field, Dawn; Amaral-Zettler, Linda; Cochrane, Guy; Cole, James R.; Dawyndt, Peter; Garrity, George M.; Gilbert, Jack; Glöckner, Frank Oliver; Hirschman, Lynette; Karsch-Mizrachi, Ilene; Klenk, Hans-Peter; Knight, Rob; Kottmann, Renzo; Kyrpides, Nikos; Meyer, Folker; San Gil, Inigo; Sansone, Susanna-Assunta; Schriml, Lynn M.; Sterk, Peter; Tatusova, Tatiana; Ussery, David W.; White, Owen; Wooley, John

    2011-01-01

    A vast and rich body of information has grown up as a result of the world's enthusiasm for 'omics technologies. Finding ways to describe and make available this information that maximise its usefulness has become a major effort across the 'omics world. At the heart of this effort is the Genomic Standards Consortium (GSC), an open-membership organization that drives community-based standardization activities, Here we provide a short history of the GSC, provide an overview of its range of current activities, and make a call for the scientific community to join forces to improve the quality and quantity of contextual information about our public collections of genomes, metagenomes, and marker gene sequences. PMID:21713030

  12. The dog genome.

    PubMed

    Galibert, F; André, C

    2006-01-01

    Over the last few centuries, several hundred dog breeds have been artificially selected through intense breeding, resulting in the modern dog population having the widest polymorphism spectrum in terms of body shape, behavior and aptitude among mammals. Unfortunately, this diversification has predisposed most breeds to specific diseases of genetic origin. The highly fragmented nature of the dog population offers a great opportunity to track the genes and alleles responsible for these diseases as well as for the various phenotypic traits. This has led to a thorough analysis of the dog genome. Here, we report the main results obtained during the last ten years, culminating in the recent publication of a complete dog genome sequence. PMID:18753768

  13. Big cat genomics.

    PubMed

    O'Brien, Stephen J; Johnson, Warren E

    2005-01-01

    Advances in population and quantitative genomics, aided by the computational algorithms that employ genetic theory and practice, are now being applied to biological questions that surround free-ranging species not traditionally suitable for genetic enquiry. Here we review how applications of molecular genetic tools have been used to describe the natural history, present status, and future disposition of wild cat species. Insight into phylogenetic hierarchy, demographic contractions, geographic population substructure, behavioral ecology, and infectious diseases have revealed strategies for survival and adaptation of these fascinating predators. Conservation, stabilization, and management of the big cats are important areas that derive benefit from the genome resources expanded and applied to highly successful species, imperiled by an expanding human population. PMID:16124868

  14. Mapping the human genome

    SciTech Connect

    Annas, G.C.; Elias, S.

    1992-01-01

    This article is a review of the book Mapping the Human Genome: Using Law and Ethics as Guides, edited by George C. Annas and Sherman Elias. The book is a collection of essays on the subject of using ethics and laws as guides to justify human gene mapping. It addresses specific issues such problems related to eugenics, patents, insurance as well as broad issues such as the societal definitions of normality.

  15. Genomic landscape of liposarcoma

    PubMed Central

    Kanojia, Deepika; Nagata, Yasunobu; Garg, Manoj; Lee, Dhong Hyun; Sato, Aiko; Yoshida, Kenichi; Sato, Yusuke; Sanada, Masashi; Mayakonda, Anand; Bartenhagen, Christoph; Klein, Hans-Ulrich; Doan, Ngan B.; Said, Jonathan W.; Mohith, S.; Gunasekar, Swetha; Shiraishi, Yuichi; Chiba, Kenichi; Tanaka, Hiroko; Miyano, Satoru; Myklebost, Ola; Yang, Henry; Dugas, Martin; Meza-Zepeda, Leonardo A.; Silberman, Allan W.; Forscher, Charles; Tyner, Jeffrey W.; Ogawa, Seishi; Koeffler, H. Phillip

    2015-01-01

    Liposarcoma (LPS) is the most common type of soft tissue sarcoma accounting for 20% of all adult sarcomas. Due to absence of clinically effective treatment options in inoperable situations and resistance to chemotherapeutics, a critical need exists to identify novel therapeutic targets. We analyzed LPS genomic landscape using SNP arrays, whole exome sequencing and targeted exome sequencing to uncover the genomic information for development of specific anti-cancer targets. SNP array analysis indicated known amplified genes (MDM2, CDK4, HMGA2) and important novel genes (UAP1, MIR557, LAMA4, CPM, IGF2, ERBB3, IGF1R). Carboxypeptidase M (CPM), recurrently amplified gene in well-differentiated/de-differentiated LPS was noted as a putative oncogene involved in the EGFR pathway. Notable deletions were found at chromosome 1p (RUNX3, ARID1A), chromosome 11q (ATM, CHEK1) and chromosome 13q14.2 (MIR15A, MIR16-1). Significantly and recurrently mutated genes (false discovery rate < 0.05) included PLEC (27%), MXRA5 (21%), FAT3 (24%), NF1 (20%), MDC1 (10%), TP53 (7%) and CHEK2 (6%). Further, in vitro and in vivo functional studies provided evidence for the tumor suppressor role for Neurofibromin 1 (NF1) gene in different subtypes of LPS. Pathway analysis of recurrent mutations demonstrated signaling through MAPK, JAK-STAT, Wnt, ErbB, axon guidance, apoptosis, DNA damage repair and cell cycle pathways were involved in liposarcomagenesis. Interestingly, we also found mutational and copy number heterogeneity within a primary LPS tumor signifying the importance of multi-region sequencing for cancer-genome guided therapy. In summary, these findings provide insight into the genomic complexity of LPS and highlight potential druggable pathways for targeted therapeutic approach. PMID:26643872

  16. Bioinformatics and genomic medicine.

    PubMed

    Kim, Ju Han

    2002-01-01

    Bioinformatics is a rapidly emerging field of biomedical research. A flood of large-scale genomic and postgenomic data means that many of the challenges in biomedical research are now challenges in computational science. Clinical informatics has long developed methodologies to improve biomedical research and clinical care by integrating experimental and clinical information systems. The informatics revolution in both bioinformatics and clinical informatics will eventually change the current practice of medicine, including diagnostics, therapeutics, and prognostics. Postgenome informatics, powered by high-throughput technologies and genomic-scale databases, is likely to transform our biomedical understanding forever, in much the same way that biochemistry did a generation ago. This paper describes how these technologies will impact biomedical research and clinical care, emphasizing recent advances in biochip-based functional genomics and proteomics. Basic data preprocessing with normalization and filtering, primary pattern analysis, and machine-learning algorithms are discussed. Use of integrative biochip informatics technologies, including multivariate data projection, gene-metabolic pathway mapping, automated biomolecular annotation, text mining of factual and literature databases, and the integrated management of biomolecular databases, are also discussed. PMID:12544491

  17. Exploring genomes for glycosyltransferases.

    PubMed

    Hansen, Sara Fasmer; Bettler, Emmanuel; Rinnan, Asmund; Engelsen, Søren B; Breton, Christelle

    2010-10-01

    Glycosyltransferases are one of the largest and most diverse enzyme groups in Nature. They catalyse the synthesis of glycosidic linkages by the transfer of a sugar residue from a donor to an acceptor substrate. These enzymes have been classified into families on the basis of amino acid sequence similarity that are kept updated in the Carbohydrate Active enZyme database (CAZy, ). The repertoire of glycosyltransferases in genomes is believed to determine the diversity of cellular glycan structures, and current estimates suggest that for most genomes about 1% of the coding regions are glycosyltransferases. However, plants tend to have far more glycosyltransferase genes than any other organism sequenced to date, and this can be explained by the highly complex polysaccharide network that form the cell wall and also by the numerous glycosylated secondary metabolites. In recent years, various bioinformatics strategies have been used to search bacterial and plant genomes for new glycosyltransferase genes. These are based on the use of remote homology detection methods that act at the 1D, 2D, and 3D level. The combined use of methods such as profile Hidden Markov Model (HMM) and fold recognition appears to be appropriate for this class of enzyme. Chemometric tools are also particularly well suited for obtaining an overview of multivariate data and revealing hidden latent information when dealing with large and highly complex datasets. PMID:20556308

  18. Cancer Genome Landscapes

    PubMed Central

    Vogelstein, Bert; Papadopoulos, Nickolas; Velculescu, Victor E.; Zhou, Shibin; Diaz, Luis A.; Kinzler, Kenneth W.

    2013-01-01

    Over the past decade, comprehensive sequencing efforts have revealed the genomic landscapes of common forms of human cancer. For most cancer types, this landscape consists of a small number of “mountains” (genes altered in a high percentage of tumors) and a much larger number of “hills” (genes altered infrequently). To date, these studies have revealed ~140 genes that, when altered by intragenic mutations, can promote or “drive” tumorigenesis. A typical tumor contains two to eight of these “driver gene” mutations; the remaining mutations are passengers that confer no selective growth advantage. Driver genes can be classified into 12 signaling pathways that regulate three core cellular processes: cell fate, cell survival, and genome maintenance. A better understanding of these pathways is one of the most pressing needs in basic cancer research. Even now, however, our knowledge of cancer genomes is sufficient to guide the development of more effective approaches for reducing cancer morbidity and mortality. PMID:23539594

  19. Mapping the human genome

    SciTech Connect

    Cantor, Charles R.

    1989-06-01

    The following pages aim to lay a foundation for understanding the excitement surrounding the ''human genome project,'' as well as to convey a flavor of the ongoing efforts and plans at the Human Genome Center at the Lawrence Berkeley Laboratory. Our own work, of course, is only part of a broad international effort that will dramatically enhance our understanding of human molecular genetics before the end of this century. In this country, the bulk of the effort will be carried out under the auspices of the Department of Energy and the National Institutes of Health, but significant contributions have already been made both by nonprofit private foundations and by private corporation. The respective roles of the DOE and the NIH are being coordinated by an inter-agency committee, the aims of which are to emphasize the strengths of each agency, to facilitate cooperation, and to avoid unnecessary duplication of effort. The NIH, for example, will continue its crucial work in medical genetics and in mapping the genomes of nonhuman species. The DOE, on the other hand, has unique experience in managing large projects, and its national laboratories are repositories of expertise in physics, engineering, and computer science, as well as the life sciences. The tools and techniques the project will ultimately rely on are thus likely to be developed in multidisciplinary efforts at laboratories like LBL. Accordingly, we at LBL take great pride in this enterprise -- an enterprise that will eventually transform our understanding of ourselves.

  20. Translational genomics for plant breeding with the genome sequence explosion.

    PubMed

    Kang, Yang Jae; Lee, Taeyoung; Lee, Jayern; Shim, Sangrea; Jeong, Haneul; Satyawan, Dani; Kim, Moon Young; Lee, Suk-Ha

    2016-04-01

    The use of next-generation sequencers and advanced genotyping technologies has propelled the field of plant genomics in model crops and plants and enhanced the discovery of hidden bridges between genotypes and phenotypes. The newly generated reference sequences of unstudied minor plants can be annotated by the knowledge of model plants via translational genomics approaches. Here, we reviewed the strategies of translational genomics and suggested perspectives on the current databases of genomic resources and the database structures of translated information on the new genome. As a draft picture of phenotypic annotation, translational genomics on newly sequenced plants will provide valuable assistance for breeders and researchers who are interested in genetic studies. PMID:26269219

  1. A genome-wide association study identifies novel single nucleotide polymorphisms associated with dermal shank pigmentation in chickens.

    PubMed

    Li, Guangqi; Li, Dongfeng; Yang, Ning; Qu, Lujiang; Hou, Zhuocheng; Zheng, Jiangxia; Xu, Guiyun; Chen, Sirui

    2014-12-01

    Shank color of domestic chickens varies from black to blue, green, yellow, or white, which is controlled by the combination of melanin and xanthophylls in dermis and epidermis. Dermal shank pigmentation of chickens is determined by sex-linked inhibitor of dermal melanin (Id), which is located on the distal end of the long arm of Z chromosome, through controlling dermal melanin pigmentation. Although previous studies have focused on the identification of Id and the linear relationship with barring and recessive white skin, no causal mutations have yet been identified in relation to the mutant dermal pigment inhibiting allele at the Id locus. In this study, we first used the 600K Affymetrix Axiom HD genotyping array, which includes ~580,961 SNP of which 26,642 SNP were on the Z chromosome to perform a genome-wide association study on pure lines of 19 Tibetan hens with dermal pigmentation shank and 21 Tibetan hens with yellow shank to refine the Id location. Association analysis was conducted by the PLINK software using the standard chi-squared test, and then Bonferroni correction was used to adjust multiple testing. The genome-wide study revealed that 3 SNP located at 78.5 to 79.2 Mb on the Z chromosome in the current assembly of chicken genome (galGal4) were significantly associated with dermal shank pigmentation of chickens, but none of them were located in known genes. The interval we refined was partly converged with previous results, suggesting that the Id gene is in or near our refined genome region. However, the genomic context of this region was complex. There were only 15 SNP markers developed by the genotyping array within the interval region, in which only 1 SNP marker passed quality control. Additionally, there were about 5.8-Mb gaps on both sides of the refined interval. The follow-up replication studies may be needed to further confirm the functional significance for these newly identified SNP. PMID:25260525

  2. Exploring genome-wide – dietary heme iron intake interactions and the risk of type 2 diabetes

    PubMed Central

    Pasquale, Louis R.; Loomis, Stephanie J.; Aschard, Hugues; Kang, Jae H.; Cornelis, Marilyn C.; Qi, Lu; Kraft, Peter; Hu, Frank B.

    2013-01-01

    Aims/hypothesis: Genome-wide association studies have identified over 50 new genetic loci for type 2 diabetes (T2D). Several studies conclude that higher dietary heme iron intake increases the risk of T2D. Therefore we assessed whether the relation between genetic loci and T2D is modified by dietary heme iron intake. Methods: We used Affymetrix Genome-Wide Human 6.0 array data [681,770 single nucleotide polymorphisms (SNPs)] and dietary information collected in the Health Professionals Follow-up Study (n = 725 cases; n = 1,273 controls) and the Nurses’ Health Study (n = 1,081 cases; n = 1,692 controls). We assessed whether genome-wide SNPs or iron metabolism SNPs interacted with dietary heme iron intake in relation to T2D, testing for associations in each cohort separately and then meta-analyzing to pool the results. Finally, we created 1,000 synthetic pathways matched to an iron metabolism pathway on number of genes, and number of SNPs in each gene. We compared the iron metabolic pathway SNPs with these synthetic SNP assemblies in their relation to T2D to assess if the pathway as a whole interacts with dietary heme iron intake. Results: Using a genomic approach, we found no significant gene–environment interactions with dietary heme iron intake in relation to T2D at a Bonferroni corrected genome-wide significance level of 7.33 ×10-8 (top SNP in pooled analysis: intergenic rs10980508; p = 1.03 × 10-6). Furthermore, no SNP in the iron metabolic pathway significantly interacted with dietary heme iron intake at a Bonferroni corrected significance level of 2.10 × 10-4 (top SNP in pooled analysis: rs1805313; p = 1.14 × 10-3). Finally, neither the main genetic effects (pooled empirical p by SNP = 0.41), nor gene – dietary heme–iron interactions (pooled empirical p-value for the interactions = 0.72) were significant for the iron metabolic pathway as a whole. Conclusions: We found no significant interactions between dietary heme iron intake and common SNPs in

  3. A Genome-Wide Association Search for Type 2 Diabetes Genes in African Americans

    PubMed Central

    Palmer, Nicholette D.; McDonough, Caitrin W.; Hicks, Pamela J.; Roh, Bong H.; Wing, Maria R.; An, S. Sandy; Hester, Jessica M.; Cooke, Jessica N.; Bostrom, Meredith A.; Rudock, Megan E.; Talbert, Matthew E.; Lewis, Joshua P.; Ferrara, Assiamira; Lu, Lingyi; Ziegler, Julie T.; Sale, Michele M.; Divers, Jasmin; Shriner, Daniel; Adeyemo, Adebowale; Rotimi, Charles N.; Ng, Maggie C. Y.; Langefeld, Carl D.; Freedman, Barry I.; Bowden, Donald W.

    2012-01-01

    African Americans are disproportionately affected by type 2 diabetes (T2DM) yet few studies have examined T2DM using genome-wide association approaches in this ethnicity. The aim of this study was to identify genes associated with T2DM in the African American population. We performed a Genome Wide Association Study (GWAS) using the Affymetrix 6.0 array in 965 African-American cases with T2DM and end-stage renal disease (T2DM-ESRD) and 1029 population-based controls. The most significant SNPs (n = 550 independent loci) were genotyped in a replication cohort and 122 SNPs (n = 98 independent loci) were further tested through genotyping three additional validation cohorts followed by meta-analysis in all five cohorts totaling 3,132 cases and 3,317 controls. Twelve SNPs had evidence of association in the GWAS (P<0.0071), were directionally consistent in the Replication cohort and were associated with T2DM in subjects without nephropathy (P<0.05). Meta-analysis in all cases and controls revealed a single SNP reaching genome-wide significance (P<2.5×10−8). SNP rs7560163 (P = 7.0×10−9, OR (95% CI) = 0.75 (0.67–0.84)) is located intergenically between RND3 and RBM43. Four additional loci (rs7542900, rs4659485, rs2722769 and rs7107217) were associated with T2DM (P<0.05) and reached more nominal levels of significance (P<2.5×10−5) in the overall analysis and may represent novel loci that contribute to T2DM. We have identified novel T2DM-susceptibility variants in the African-American population. Notably, T2DM risk was associated with the major allele and implies an interesting genetic architecture in this population. These results suggest that multiple loci underlie T2DM susceptibility in the African-American population and that these loci are distinct from those identified in other ethnic populations. PMID:22238593

  4. A genome-wide association search for type 2 diabetes genes in African Americans.

    PubMed

    Palmer, Nicholette D; McDonough, Caitrin W; Hicks, Pamela J; Roh, Bong H; Wing, Maria R; An, S Sandy; Hester, Jessica M; Cooke, Jessica N; Bostrom, Meredith A; Rudock, Megan E; Talbert, Matthew E; Lewis, Joshua P; Ferrara, Assiamira; Lu, Lingyi; Ziegler, Julie T; Sale, Michele M; Divers, Jasmin; Shriner, Daniel; Adeyemo, Adebowale; Rotimi, Charles N; Ng, Maggie C Y; Langefeld, Carl D; Freedman, Barry I; Bowden, Donald W; Voight, Benjamin F; Scott, Laura J; Steinthorsdottir, Valgerdur; Morris, Andrew P; Dina, Christian; Welch, Ryan P; Zeggini, Eleftheria; Huth, Cornelia; Aulchenko, Yurii S; Thorleifsson, Gudmar; McCulloch, Laura J; Ferreira, Teresa; Grallert, Harald; Amin, Najaf; Wu, Guanming; Willer, Cristen J; Raychaudhuri, Soumya; McCarroll, Steve A; Langenberg, Claudia; Hofmann, Oliver M; Dupuis, Josée; Qi, Lu; Segrè, Ayellet V; van Hoek, Mandy; Navarro, Pau; Ardlie, Kristin; Balkau, Beverley; Benediktsson, Rafn; Bennett, Amanda J; Blagieva, Roza; Boerwinkle, Eric; Bonnycastle, Lori L; Boström, Kristina Bengtsson; Bravenboer, Bert; Bumpstead, Suzannah; Burtt, Noël P; Charpentier, Guillaume; Chines, Peter S; Cornelis, Marilyn; Couper, David J; Crawford, Gabe; Doney, Alex S F; Elliott, Katherine S; Elliott, Amanda L; Erdos, Michael R; Fox, Caroline S; Franklin, Christopher S; Ganser, Martha; Gieger, Christian; Grarup, Niels; Green, Todd; Griffin, Simon; Groves, Christopher J; Guiducci, Candace; Hadjadj, Samy; Hassanali, Neelam; Herder, Christian; Isomaa, Bo; Jackson, Anne U; Johnson, Paul R V; Jørgensen, Torben; Kao, Wen H L; Klopp, Norman; Kong, Augustine; Kraft, Peter; Kuusisto, Johanna; Lauritzen, Torsten; Li, Man; Lieverse, Aloysius; Lindgren, Cecilia M; Lyssenko, Valeriya; Marre, Michel; Meitinger, Thomas; Midthjell, Kristian; Morken, Mario A; Narisu, Narisu; Nilsson, Peter; Owen, Katharine R; Payne, Felicity; Perry, John R B; Petersen, Ann-Kristin; Platou, Carl; Proença, Christine; Prokopenko, Inga; Rathmann, Wolfgang; Rayner, N William; Robertson, Neil R; Rocheleau, Ghislain; Roden, Michael; Sampson, Michael J; Saxena, Richa; Shields, Beverley M; Shrader, Peter; Sigurdsson, Gunnar; Sparsø, Thomas; Strassburger, Klaus; Stringham, Heather M; Sun, Qi; Swift, Amy J; Thorand, Barbara; Tichet, Jean; Tuomi, Tiinamaija; van Dam, Rob M; van Haeften, Timon W; van Herpt, Thijs; van Vliet-Ostaptchouk, Jana V; Walters, G Bragi; Weedon, Michael N; Wijmenga, Cisca; Witteman, Jacqueline; Bergman, Richard N; Cauchi, Stephane; Collins, Francis S; Gloyn, Anna L; Gyllensten, Ulf; Hansen, Torben; Hide, Winston A; Hitman, Graham A; Hofman, Albert; Hunter, David J; Hveem, Kristian; Laakso, Markku; Mohlke, Karen L; Morris, Andrew D; Palmer, Colin N A; Pramstaller, Peter P; Rudan, Igor; Sijbrands, Eric; Stein, Lincoln D; Tuomilehto, Jaakko; Uitterlinden, Andre; Walker, Mark; Wareham, Nicholas J; Watanabe, Richard M; Abecasis, Goncalo R; Boehm, Bernhard O; Campbell, Harry; Daly, Mark J; Hattersley, Andrew T; Hu, Frank B; Meigs, James B; Pankow, James S; Pedersen, Oluf; Wichmann, H-Erich; Barroso, Inês; Florez, Jose C; Frayling, Timothy M; Groop, Leif; Sladek, Rob; Thorsteinsdottir, Unnur; Wilson, James F; Illig, Thomas; Froguel, Philippe; van Duijn, Cornelia M; Stefansson, Kari; Altshuler, David; Boehnke, Michael; McCarthy, Mark I; Soranzo, Nicole; Wheeler, Eleanor; Glazer, Nicole L; Bouatia-Naji, Nabila; Mägi, Reedik; Randall, Joshua; Johnson, Toby; Elliott, Paul; Rybin, Denis; Henneman, Peter; Dehghan, Abbas; Hottenga, Jouke Jan; Song, Kijoung; Goel, Anuj; Egan, Josephine M; Lajunen, Taina; Doney, Alex; Kanoni, Stavroula; Cavalcanti-Proença, Christine; Kumari, Meena; Timpson, Nicholas J; Zabena, Carina; Ingelsson, Erik; An, Ping; O'Connell, Jeffrey; Luan, Jian'an; Elliott, Amanda; McCarroll, Steven A; Roccasecca, Rosa Maria; Pattou, François; Sethupathy, Praveen; Ariyurek, Yavuz; Barter, Philip; Beilby, John P; Ben-Shlomo, Yoav; Bergmann, Sven; Bochud, Murielle; Bonnefond, Amélie; Borch-Johnsen, Knut; Böttcher, Yvonne; Brunner, Eric; Bumpstead, Suzannah J; Chen, Yii-Der Ida; Chines, Peter; Clarke, Robert; Coin, Lachlan J M; Cooper, Matthew N; Crisponi, Laura; Day, Ian N M; de Geus, Eco J C; Delplanque, Jerome; Fedson, Annette C; Fischer-Rosinsky, Antje; Forouhi, Nita G; Frants, Rune; Franzosi, Maria Grazia; Galan, Pilar; Goodarzi, Mark O; Graessler, Jürgen; Grundy, Scott; Gwilliam, Rhian; Hallmans, Göran; Hammond, Naomi; Han, Xijing; Hartikainen, Anna-Liisa; Hayward, Caroline; Heath, Simon C; Hercberg, Serge; Hicks, Andrew A; Hillman, David R; Hingorani, Aroon D; Hui, Jennie; Hung, Joe; Jula, Antti; Kaakinen, Marika; Kaprio, Jaakko; Kesaniemi, Y Antero; Kivimaki, Mika; Knight, Beatrice; Koskinen, Seppo; Kovacs, Peter; Kyvik, Kirsten Ohm; Lathrop, G Mark; Lawlor, Debbie A; Le Bacquer, Olivier; Lecoeur, Cécile; Li, Yun; Mahley, Robert; Mangino, Massimo; Manning, Alisa K; Martínez-Larrad, María Teresa; McAteer, Jarred B; McPherson, Ruth; Meisinger, Christa; Melzer, David; Meyre, David; Mitchell, Braxton D; Mukherjee, Sutapa; Naitza, Silvia; Neville, Matthew J; Oostra, Ben A; Orrù, Marco; Pakyz, Ruth; Paolisso, Giuseppe; Pattaro, Cristian; Pearson, Daniel; Peden, John F; Pedersen, Nancy L; Perola, Markus; Pfeiffer, Andreas F H; Pichler, Irene; Polasek, Ozren; Posthuma, Danielle; Potter, Simon C; Pouta, Anneli; Province, Michael A; Psaty, Bruce M; Rayner, Nigel W; Rice, Kenneth; Ripatti, Samuli; Rivadeneira, Fernando; Rolandsson, Olov; Sandbaek, Annelli; Sandhu, Manjinder; Sanna, Serena; Sayer, Avan Aihie; Scheet, Paul; Seedorf, Udo; Sharp, Stephen J; Shields, Beverley; Sijbrands, Eric J G; Silveira, Angela; Simpson, Laila; Singleton, Andrew; Smith, Nicholas L; Sovio, Ulla; Swift, Amy; Syddall, Holly; Syvänen, Ann-Christine; Tanaka, Toshiko; Tönjes, Anke; Uitterlinden, André G; van Dijk, Ko Willems; Varma, Dhiraj; Visvikis-Siest, Sophie; Vitart, Veronique; Vogelzangs, Nicole; Waeber, Gérard; Wagner, Peter J; Walley, Andrew; Ward, Kim L; Watkins, Hugh; Wild, Sarah H; Willemsen, Gonneke; Witteman, Jaqueline C M; Yarnell, John W G; Zelenika, Diana; Zethelius, Björn; Zhai, Guangju; Zhao, Jing Hua; Zillikens, M Carola; Borecki, Ingrid B; Loos, Ruth J F; Meneton, Pierre; Magnusson, Patrik K E; Nathan, David M; Williams, Gordon H; Silander, Kaisa; Salomaa, Veikko; Smith, George Davey; Bornstein, Stefan R; Schwarz, Peter; Spranger, Joachim; Karpe, Fredrik; Shuldiner, Alan R; Cooper, Cyrus; Dedoussis, George V; Serrano-Ríos, Manuel; Lind, Lars; Palmer, Lyle J; Franks, Paul W; Ebrahim, Shah; Marmot, Michael; Kao, W H Linda; Pramstaller, Peter Paul; Wright, Alan F; Stumvoll, Michael; Hamsten, Anders; Buchanan, Thomas A; Valle, Timo T; Rotter, Jerome I; Siscovick, David S; Penninx, Brenda W J H; Boomsma, Dorret I; Deloukas, Panos; Spector, Timothy D; Ferrucci, Luigi; Cao, Antonio; Scuteri, Angelo; Schlessinger, David; Uda, Manuela; Ruokonen, Aimo; Jarvelin, Marjo-Riitta; Waterworth, Dawn M; Vollenweider, Peter; Peltonen, Leena; Mooser, Vincent; Sladek, Robert

    2012-01-01

    African Americans are disproportionately affected by type 2 diabetes (T2DM) yet few studies have examined T2DM using genome-wide association approaches in this ethnicity. The aim of this study was to identify genes associated with T2DM in the African American population. We performed a Genome Wide Association Study (GWAS) using the Affymetrix 6.0 array in 965 African-American cases with T2DM and end-stage renal disease (T2DM-ESRD) and 1029 population-based controls. The most significant SNPs (n = 550 independent loci) were genotyped in a replication cohort and 122 SNPs (n = 98 independent loci) were further tested through genotyping three additional validation cohorts followed by meta-analysis in all five cohorts totaling 3,132 cases and 3,317 controls. Twelve SNPs had evidence of association in the GWAS (P<0.0071), were directionally consistent in the Replication cohort and were associated with T2DM in subjects without nephropathy (P<0.05). Meta-analysis in all cases and controls revealed a single SNP reaching genome-wide significance (P<2.5×10(-8)). SNP rs7560163 (P = 7.0×10(-9), OR (95% CI) = 0.75 (0.67-0.84)) is located intergenically between RND3 and RBM43. Four additional loci (rs7542900, rs4659485, rs2722769 and rs7107217) were associated with T2DM (P<0.05) and reached more nominal levels of significance (P<2.5×10(-5)) in the overall analysis and may represent novel loci that contribute to T2DM. We have identified novel T2DM-susceptibility variants in the African-American population. Notably, T2DM risk was associated with the major allele and implies an interesting genetic architecture in this population. These results suggest that multiple loci underlie T2DM susceptibility in the African-American population and that these loci are distinct from those identified in other ethnic populations. PMID:22238593

  5. Detection of selective sweeps in cattle using genome-wide SNP data

    PubMed Central

    2013-01-01

    Background The domestication and subsequent selection by humans to create breeds and biological types of cattle undoubtedly altered the patterning of variation within their genomes. Strong selection to fix advantageous large-effect mutations underlying domesticability, breed characteristics or productivity created selective sweeps in which variation was lost in the chromosomal region flanking the selected allele. Selective sweeps have now been identified in the genomes of many animal species including humans, dogs, horses, and chickens. Here, we attempt to identify and characterise regions of the bovine genome that have been subjected to selective sweeps. Results Two datasets were used for the discovery and validation of selective sweeps via the fixation of alleles at a series of contiguous SNP loci. BovineSNP50 data were used to identify 28 putative sweep regions among 14 diverse cattle breeds. Affymetrix BOS 1 prescreening assay data for five breeds were used to identify 85 regions and validate 5 regions identified using the BovineSNP50 data. Many genes are located within these regions and the lack of sequence data for the analysed breeds precludes the nomination of selected genes or variants and limits the prediction of the selected phenotypes. However, phenotypes that we predict to have historically been under strong selection include horned-polled, coat colour, stature, ear morphology, and behaviour. Conclusions The bias towards common SNPs in the design of the BovineSNP50 assay led to the identification of recent selective sweeps associated with breed formation and common to only a small number of breeds rather than ancient events associated with domestication which could potentially be common to all European taurines. The limited SNP density, or marker resolution, of the BovineSNP50 assay significantly impacted the rate of false discovery of selective sweeps, however, we found sweeps in common between breeds which were confirmed using an ultra

  6. Genome-wide temporal-spatial gene expression profiling of drought responsiveness in rice

    PubMed Central

    2011-01-01

    Background Rice is highly sensitive to drought, and the effect of drought may vary with the different genotypes and development stages. Genome-wide gene expression profiling was used as the initial point to dissect molecular genetic mechanism of this complex trait and provide valuable information for the improvement of drought tolerance in rice. Affymetrix rice genome array containing 48,564 japonica and 1,260 indica sequences was used to analyze the gene expression pattern of rice exposed to drought stress. The transcriptome from leaf, root, and young panicle at three developmental stages was comparatively analyzed combined with bioinformatics exploring drought stress related cis-elements. Results There were 5,284 genes detected to be differentially expressed under drought stress. Most of these genes were tissue- or stage-specific regulated by drought. The tissue-specific down-regulated genes showed distinct function categories as photosynthesis-related genes prevalent in leaf, and the genes involved in cell membrane biogenesis and cell wall modification over-presented in root and young panicle. In a drought environment, several genes, such as GA2ox, SAP15, and Chitinase III, were regulated in a reciprocal way in two tissues at the same development stage. A total of 261 transcription factor genes were detected to be differentially regulated by drought stress. Most of them were also regulated in a tissue- or stage-specific manner. A cis-element containing special CGCG box was identified to over-present in the upstream of 55 common induced genes, and it may be very important for rice plants responding to drought environment. Conclusions Genome-wide gene expression profiling revealed that most of the drought differentially expressed genes (DEGs) were under temporal and spatial regulation, suggesting a crosstalk between various development cues and environmental stimuli. The identification of the differentially regulated DEGs, including TF genes and unique candidate

  7. Expression patterns of a novel AtCHX gene family highlight potential roles in osmotic adjustment and K+ homeostasis in pollen development.

    PubMed

    Sze, Heven; Padmanaban, Senthilkumar; Cellier, Françoise; Honys, David; Cheng, Ning-Hui; Bock, Kevin W; Conéjéro, Genevieve; Li, Xiyan; Twell, David; Ward, John M; Hirschi, Kendal D

    2004-09-01

    A combined bioinformatic and experimental approach is being used to uncover the functions of a novel family of cation/H(+) exchanger (CHX) genes in plants using Arabidopsis as a model. The predicted protein (85-95 kD) of 28 AtCHX genes after revision consists of an amino-terminal domain with 10 to 12 transmembrane spans (approximately 440 residues) and a hydrophilic domain of approximately 360 residues at the carboxyl end, which is proposed to have regulatory roles. The hydrophobic, but not the hydrophilic, domain of plant CHX is remarkably similar to monovalent cation/proton antiporter-2 (CPA2) proteins, especially yeast (Saccharomyces cerevisiae) KHA1 and Synechocystis NhaS4. Reports of characterized fungal and prokaryotic CPA2 indicate that they have various transport modes, including K(+)/H(+) (KHA1), Na(+)/H(+)-K(+) (GerN) antiport, and ligand-gated ion channel (KefC). The expression pattern of AtCHX genes was determined by reverse transcription PCR, promoter-driven beta-glucuronidase expression in transgenic plants, and Affymetrix ATH1 genome arrays. Results show that 18 genes are specifically or preferentially expressed in the male gametophyte, and six genes are highly expressed in sporophytic tissues. Microarray data revealed that several AtCHX genes were developmentally regulated during microgametogenesis. An exciting idea is that CHX proteins allow osmotic adjustment and K(+) homeostasis as mature pollen desiccates and then rehydrates at germination. The multiplicity of CHX-like genes is conserved in higher plants but is not found in animals. Only 17 genes, OsCHX01 to OsCHX17, were identified in rice (Oryza sativa) subsp. japonica, suggesting diversification of CHX in Arabidopsis. These results reveal a novel CHX gene family in flowering plants with potential functions in pollen development, germination, and tube growth. PMID:15347787

  8. An extensive (co-)expression analysis tool for the cytochrome P450 superfamily in Arabidopsis thaliana

    PubMed Central

    Ehlting, Jürgen; Sauveplane, Vincent; Olry, Alexandre; Ginglinger, Jean-François; Provart, Nicholas J; Werck-Reichhart, Danièle

    2008-01-01

    Background Sequencing of the first plant genomes has revealed that cytochromes P450 have evolved to become the largest family of enzymes in secondary metabolism. The proportion of P450 enzymes with characterized biochemical function(s) is however very small. If P450 diversification mirrors evolution of chemical diversity, this points to an unexpectedly poor understanding of plant metabolism. We assumed that extensive analysis of gene expression might guide towards the function of P450 enzymes, and highlight overlooked aspects of plant metabolism. Results We have created a comprehensive database, 'CYPedia', describing P450 gene expression in four data sets: organs and tissues, stress response, hormone response, and mutants of Arabidopsis thaliana, based on public Affymetrix ATH1 microarray expression data. P450 expression was then combined with the expression of 4,130 re-annotated genes, predicted to act in plant metabolism, for co-expression analyses. Based on the annotation of co-expressed genes from diverse pathway annotation databases, co-expressed pathways were identified. Predictions were validated for most P450s with known functions. As examples, co-expression results for P450s related to plastidial functions/photosynthesis, and to phenylpropanoid, triterpenoid and jasmonate metabolism are highlighted here. Conclusion The large scale hypothesis generation tools presented here provide leads to new pathways, unexpected functions, and regulatory networks for many P450s in plant metabolism. These can now be exploited by the community to validate the proposed functions experimentally using reverse genetics, biochemistry, and metabolic profiling. PMID:18433503

  9. The fungal genome initiative and lessons learned from genome sequencing.

    PubMed

    Cuomo, Christina A; Birren, Bruce W

    2010-01-01

    The sequence of Saccharomyces cerevisiae enabled systematic genome-wide experimental approaches, demonstrating the power of having the complete genome of an organism. The rapid impact of these methods on research in yeast mobilized an effort to expand genomic resources for other fungi. The "fungal genome initiative" represents an organized genome sequencing effort to promote comparative and evolutionary studies across the fungal kingdom. Through such an approach, scientists can not only better understand specific organisms but also illuminate the shared and unique aspects of fungal biology that underlie the importance of fungi in biomedical research, health, food production, and industry. To date, assembled genomes for over 100 fungi are available in public databases, and many more sequencing projects are underway. Here, we discuss both examples of findings from comparative analysis of fungal sequences, with a specific emphasis on yeast genomes, and on the analytical approaches taken to mine fungal genomes. New sequencing methods are accelerating comparative studies of fungi by reducing the cost and difficulty of sequencing. This has driven more common use of sequencing applications, such as to study genome-wide variation in populations or to deeply profile RNA transcripts. These and further technological innovations will continue to be piloted in yeasts and other fungi, and will expand the applications of sequencing to study fungal biology. PMID:20946837

  10. Genomic Data Commons and Genomic Cloud Pilots - Google Hangout

    Cancer.gov

    Join us for a live, moderated discussion about two NCI efforts to expand access to cancer genomics data: the Genomic Data Commons and Genomic Cloud Pilots. NCI subject matters experts will include Louis M. Staudt, M.D., Ph.D., Director Center for Cancer Genomics, Warren Kibbe, Ph.D., Director, NCI Center for Biomedical Informatics and Information Technology, and moderated by Anthony Kerlavage, Ph.D., Chief, Cancer Informatics Branch, Center for Biomedical Informatics and Information Technology. We welcome your questions before and during the Hangout on Twitter using the hashtag #AskNCI.

  11. The coffee genome hub: a resource for coffee genomes

    PubMed Central

    Dereeper, Alexis; Bocs, Stéphanie; Rouard, Mathieu; Guignon, Valentin; Ravel, Sébastien; Tranchant-Dubreuil, Christine; Poncet, Valérie; Garsmeur, Olivier; Lashermes, Philippe; Droc, Gaëtan

    2015-01-01

    The whole genome sequence of Coffea canephora, the perennial diploid species known as Robusta, has been recently released. In the context of the C. canephora genome sequencing project and to support post-genomics efforts, we developed the Coffee Genome Hub (http://coffee-genome.org/), an integrative genome information system that allows centralized access to genomics and genetics data and analysis tools to facilitate translational and applied research in coffee. We provide the complete genome sequence of C. canephora along with gene structure, gene product information, metabolism, gene families, transcriptomics, syntenic blocks, genetic markers and genetic maps. The hub relies on generic software (e.g. GMOD tools) for easy querying, visualizing and downloading research data. It includes a Genome Browser enhanced by a Community Annotation System, enabling the improvement of automatic gene annotation through an annotation editor. In addition, the hub aims at developing interoperability among other existing South Green tools managing coffee data (phylogenomics resources, SNPs) and/or supporting data analyses with the Galaxy workflow manager. PMID:25392413

  12. The Saccharomyces Genome Database: Exploring Genome Features and Their Annotations.

    PubMed

    Cherry, J Michael

    2015-12-01

    Genomic-scale assays result in data that provide information over the entire genome. Such base pair resolution data cannot be summarized easily except via a graphical viewer. A genome browser is a tool that displays genomic data and experimental results as horizontal tracks. Genome browsers allow searches for a chromosomal coordinate or a feature, such as a gene name, but they do not allow searches by function or upstream binding site. Entry into a genome browser requires that you identify the gene name or chromosomal coordinates for a region of interest. A track provides a representation for genomic results and is displayed as a row of data shown as line segments to indicate regions of the chromosome with a feature. Another type of track presents a graph or wiggle plot that indicates the processed signal intensity computed for a particular experiment or set of experiments. Wiggle plots are typical for genomic assays such as the various next-generation sequencing methods (e.g., chromatin immunoprecipitation [ChIP]-seq or RNA-seq), where it represents a peak of DNA binding, histone modification, or the mapping of an RNA sequence. Here we explore the browser that has been built into the Saccharomyces Genome Database (SGD). PMID:26631126

  13. Flexible genomic islands as drivers of genome evolution.

    PubMed

    Rodriguez-Valera, Francisco; Martin-Cuadrado, Ana-Belen; López-Pérez, Mario

    2016-06-01

    Natural prokaryotic populations are composed of multiple clonal lineages that are different in their core genomes in a range that varies typically between 95 and 100% nucleotide identity. Each clonal lineage also carries a complement of not shared flexible genes that can be very large. The compounded flexible genome provides polyclonal populations with enormous gene diversity that can be used to efficiently exploit resources. This has fundamental repercussions for interpreting individual bacterial genomes. They are better understood as parts rather than the whole. Multiple genomes are required to understand how the population interacts with its biotic and abiotic environment. PMID:27085300

  14. Defining Genome Maintenance Pathways using Functional Genomic Approaches

    PubMed Central

    Bansbach, Carol E.; Cortez, David

    2011-01-01

    Genome maintenance activities including DNA repair, cell division cycle control, and checkpoint signaling pathways preserve genome integrity and prevent disease. Defects in these pathways cause birth defects, neurodegeneration, premature aging, and cancer. Recent technical advances in functional genomic approaches such as expression profiling, proteomics, and RNA interference (RNAi) technologies have rapidly expanded our knowledge of the proteins that work in these pathways. In this review, we examine the use of these high-throughput methodologies in higher eukaryotic organisms for the interrogation of genome maintenance activities. PMID:21787120

  15. Genome Projector: zoomable genome map with multiple views

    PubMed Central

    Arakawa, Kazuharu; Tamaki, Satoshi; Kono, Nobuaki; Kido, Nobuhiro; Ikegami, Keita; Ogawa, Ryu; Tomita, Masaru

    2009-01-01

    Background Molecular biology data exist on diverse scales, from the level of molecules to -omics. At the same time, the data at each scale can be categorised into multiple layers, such as the genome, transcriptome, proteome, metabolome, and biochemical pathways. Due to the highly multi-layer and multi-dimensional nature of biological information, software interfaces for database browsing should provide an intuitive interface that allows for rapid migration across different views and scales. The Zoomable User Interface (ZUI) and tabbed browsing have proven successful for this purpose in other areas, especially to navigate the vast information in the World Wide Web. Results This paper presents Genome Projector, a Web-based gateway for genomics information with a zoomable user interface using Google Maps API, equipped with four seamlessly accessible and searchable views: a circular genome map, a traditional genome map, a biochemical pathways map, and a DNA walk map. The Web application for 320 bacterial genomes is available at . All data and software including the source code, documentations, and development API are freely available under the GNU General Public License. Zoomable maps can be easily created from any image file using the development API, and an online data mapping service for Genome Projector is also available at our Web site. Conclusion Genome Projector is an intuitive Web application for browsing genomics information, implemented with a zoomable user interface and tabbed browsing utilising Google Maps API and Asynchronous JavaScript and XML (AJAX) technology. PMID:19166610

  16. The Anolis Lizard Genome: An Amniote Genome without Isochores?

    PubMed Central

    Costantini, Maria; Greif, Gonzalo; Alvarez-Valin, Fernando; Bernardi, Giorgio

    2016-01-01

    Two articles published 5 years ago concluded that the genome of the lizard Anolis carolinensis is an amniote genome without isochores. This claim was apparently contradicting previous results on the general presence of an isochore organization in all vertebrate genomes tested (including Anolis). In this investigation, we demonstrate that the Anolis genome is indeed heterogeneous in base composition, since its macrochromosomes comprise isochores mainly from the L2 and H1 families (a moderately GC-poor and a moderately GC-rich family, respectively), and since the majority of the sequenced microchromosomes consists of H1 isochores. These families are associated with different features of genome structure, including gene density and compositional correlations (e.g., GC3 vs flanking sequence GC and intron GC), as in the case of mammalian and avian genomes. Moreover, the assembled Anolis chromosomes have an enormous number of gaps, which could be due to sequencing problems in GC-rich regions of the genome. In conclusion, the Anolis genome is no exception to the general rule of an isochore organization in the genomes of vertebrates (and other eukaryotes). PMID:26992416

  17. The coffee genome hub: a resource for coffee genomes.

    PubMed

    Dereeper, Alexis; Bocs, Stéphanie; Rouard, Mathieu; Guignon, Valentin; Ravel, Sébastien; Tranchant-Dubreuil, Christine; Poncet, Valérie; Garsmeur, Olivier; Lashermes, Philippe; Droc, Gaëtan

    2015-01-01

    The whole genome sequence of Coffea canephora, the perennial diploid species known as Robusta, has been recently released. In the context of the C. canephora genome sequencing project and to support post-genomics efforts, we developed the Coffee Genome Hub (http://coffee-genome.org/), an integrative genome information system that allows centralized access to genomics and genetics data and analysis tools to facilitate translational and applied research in coffee. We provide the complete genome sequence of C. canephora along with gene structure, gene product information, metabolism, gene families, transcriptomics, syntenic blocks, genetic markers and genetic maps. The hub relies on generic software (e.g. GMOD tools) for easy querying, visualizing and downloading research data. It includes a Genome Browser enhanced by a Community Annotation System, enabling the improvement of automatic gene annotation through an annotation editor. In addition, the hub aims at developing interoperability among other existing South Green tools managing coffee data (phylogenomics resources, SNPs) and/or supporting data analyses with the Galaxy workflow manager. PMID:25392413

  18. Genome-wide screening of loci associated with drug resistance to 5-fluorouracil-based drugs.

    PubMed

    Ooyama, Akio; Okayama, Yoshihiro; Takechi, Teiji; Sugimoto, Yoshikazu; Oka, Toshinori; Fukushima, Masakazu

    2007-04-01

    Resistance to chemotherapeutic agents represents the chief cause of mortality in cancer patients with advanced disease. Chromosomal aberration and altered gene expression are the main genetic mechanisms of tumor chemoresistance. In this study, we have established an algorithm to calculate DNA copy number using the Affymetrix 10K array, and performed a genome-wide correlation analysis between DNA copy number and antitumor activity against 5-fluorouracil (5-FU)-based drugs (S-1, tegafur + uracil [UFT], 5'-DFUR and capecitabine) to screen for loci influencing drug resistance using 27 human cancer xenografts. A correlation analysis confirmed that the single nucleotide polymorphism (SNP) showing significant associations with drug sensitivity were concentrated in some cytogenetic regions (18p, 17p13.2, 17p12, 11q14.1, 11q11 and 11p11.12), and we identified some genes that have been indicated their relations to drug sensitivity. Among these regions, 18p11.32 at the location of the thymidylate synthase gene (TYMS) was strongly associated with resistance to 5-FU-based drugs. A change in copy number of the TYMS gene was reflected in the TYMS expression level, and showed a significant negative correlation with sensitivity against 5-FU-based drugs. These results suggest that amplification of the TYMS gene is associated with innate resistance, supporting the possibility that TYMS copy number might be a predictive marker of drug sensitivity to fluoropyrimidines. Further study is necessary to clarify the functional roles of other genes coded in significant cytogenetic regions. These promising data suggest that a comprehensive DNA copy number analysis might aid in the quest for optimal markers of drug response. PMID:17425594

  19. Genome-wide expression profiling in the peripheral blood of patients with fibromyalgia

    PubMed Central

    Jones, Kim D.; Gelbart, Terri; Whisenant, Thomas C.; Waalen, Jill; Mondala, Tony S.; Iklé, David N.; Salomon, Daniel R.; Bennett, Robert M.; Kurian, Sunil M.

    2016-01-01

    Objective Fibromyalgia (FM) is a common pain disorder characterised by nociceptive dysregulation. The basic biology of FM is poorly understood. Herein we have used agnostic gene expression as a potential probe for informing its underlying biology and the development of a proof-of-concept diagnostic gene expression signature. Methods We analysed RNA expression in 70 FM patients and 70 healthy controls. The isolated RNA was amplified and hybridised to Affymetrix® Human Gene 1.1 ST Peg arrays. The data was analysed using Partek Genomics Suite v. 6.6. Results Fibromyalgia patients exhibited a differential expression of 421 genes (p<0.001), several relevant to pathways for pain processing, such as glutamine/glutamate signaling and axonal development. There was also an upregulation of several inflammatory pathways and downregulation of pathways related to hypersensitivity and allergy. Using rigorous diagnostic modeling strategies, we show “locked” gene signatures discovered on Training and Test cohorts, that have a mean Area Under the Curve (AUC) of 0.81 on randomised, independent external data cohorts. Lastly, we identified a subset of 10 probesets that provided a diagnostic sensitivity for FM of 95% and a specificity of 96%. We also show that the signatures for FM were very specific to FM rather than common FM comorbidities. Conclusion These findings provide new insights relevant to the pathogenesis of FM, and provide several testable hypotheses that warrant further exploration and also establish the foundation for a first blood-based molecular signature in FM that needs to be validated in larger cohorts of patients. PMID:27157394

  20. Analysis of genomic aberrations and gene expression profiling identifies novel lesions and pathways in myeloproliferative neoplasms

    PubMed Central

    Rice, K L; Lin, X; Wolniak, K; Ebert, B L; Berkofsky-Fessler, W; Buzzai, M; Sun, Y; Xi, C; Elkin, P; Levine, R; Golub, T; Gilliland, D G; Crispino, J D; Licht, J D; Zhang, W

    2011-01-01

    Polycythemia vera (PV), essential thrombocythemia and primary myelofibrosis, are myeloproliferative neoplasms (MPNs) with distinct clinical features and are associated with the JAK2V617F mutation. To identify genomic anomalies involved in the pathogenesis of these disorders, we profiled 87 MPN patients using Affymetrix 250K single-nucleotide polymorphism (SNP) arrays. Aberrations affecting chr9 were the most frequently observed and included 9pLOH (n=16), trisomy 9 (n=6) and amplifications of 9p13.3–23.3 (n=1), 9q33.1–34.13 (n=1) and 9q34.13 (n=6). Patients with trisomy 9 were associated with elevated JAK2V617F mutant allele burden, suggesting that gain of chr9 represents an alternative mechanism for increasing JAK2V617F dosage. Gene expression profiling of patients with and without chr9 abnormalities (+9, 9pLOH), identified genes potentially involved in disease pathogenesis including JAK2, STAT5B and MAPK14. We also observed recurrent gains of 1p36.31–36.33 (n=6), 17q21.2–q21.31 (n=5) and 17q25.1–25.3 (n=5) and deletions affecting 18p11.31–11.32 (n=8). Combined SNP and gene expression analysis identified aberrations affecting components of a non-canonical PRC2 complex (EZH1, SUZ12 and JARID2) and genes comprising a ‘HSC signature' (MLLT3, SMARCA2 and PBX1). We show that NFIB, which is amplified in 7/87 MPN patients and upregulated in PV CD34+ cells, protects cells from apoptosis induced by cytokine withdrawal. PMID:22829077

  1. Genetic associations with neuroendocrine tumor risk: results from a genome-wide association study.

    PubMed

    Du, Yeting; Ter-Minassian, Monica; Brais, Lauren; Brooks, Nichole; Waldron, Amanda; Chan, Jennifer A; Lin, Xihong; Kraft, Peter; Christiani, David C; Kulke, Matthew H

    2016-08-01

    The etiology of neuroendocrine tumors remains poorly defined. Although neuroendocrine tumors are in some cases associated with inherited genetic syndromes, such syndromes are rare. The majority of neuroendocrine tumors are thought to be sporadic. We performed a genome-wide association study (GWAS) to identify potential genetic risk factors for sporadic neuroendocrine tumors. Using germline DNA from blood specimens, we genotyped 909,622 SNPs using the Affymetrix 6.0 GeneChip, in a cohort comprising 832 neuroendocrine tumor cases from Dana-Farber Cancer Institute and Massachusetts General Hospital and 4542 controls from the Harvard School of Public Health. An additional 241 controls from Dana-Farber Cancer Institute were used for quality control. We assessed risk associations in the overall cohort, and in neuroendocrine tumor subgroups. We identified no potential risk associations in the cohort overall. In the small intestine neuroendocrine tumor subgroup, comprising 293 cases, we identified risk associations with three SNPs on chromosome 12, all in strong LD. The three SNPs are located upstream of ELK3, a transcription factor implicated in angiogenesis. We did not identify clear risk associations in the bronchial or pancreatic neuroendocrine subgroups. This large-scale study provides initial evidence that presumed sporadic small intestine neuroendocrine tumors may have a genetic etiology. Our results provide a basis for further exploring the role of genes implicated in this analysis, and for replication studies to confirm the observed associations. Additional studies to evaluate potential genetic risk factors for sporadic pancreatic and bronchial neuroendocrine tumors are warranted. PMID:27492634

  2. Identification of genes promoting skin youthfulness by genome-wide association study.

    PubMed

    Chang, Anne L S; Atzmon, Gil; Bergman, Aviv; Brugmann, Samantha; Atwood, Scott X; Chang, Howard Y; Barzilai, Nir

    2014-03-01

    To identify genes that promote facial skin youthfulness (SY), a genome-wide association study on an Ashkenazi Jewish discovery group (n=428) was performed using Affymetrix 6.0 Single-Nucleotide Polymorphism (SNP) Array. After SNP quality controls, 901,470 SNPs remained for analysis. The eigenstrat method showed no stratification. Cases and controls were identified by global facial skin aging severity including intrinsic and extrinsic parameters. Linear regression adjusted for age and gender, with no significant differences in smoking history, body mass index, menopausal status, or personal or family history of centenarians. Six SNPs met the Bonferroni threshold with Pallele<10(-8); two of these six had Pgenotype<10(-8). Quantitative trait loci mapping confirmed linkage disequilibrium. The six SNPs were interrogated by MassARRAY in a replication group (n=436) with confirmation of rs6975107, an intronic region of KCND2 (potassium voltage-gated channel, Shal-related family member 2) (Pgenotype=0.023). A second replication group (n=371) confirmed rs318125, downstream of DIAPH2 (diaphanous homolog 2 (Drosophila)) (Pallele=0.010, Pgenotype=0.002) and rs7616661, downstream of EDEM1 (ER degradation enhancer, mannosidase α-like 1) (Pgenotype=0.042). DIAPH2 has been associated with premature ovarian insufficiency, an aging phenotype in humans. EDEM1 associates with lifespan in animal models, although not humans. KCND2 is expressed in human skin, but has not been associated with aging. These genes represent new candidate genes to study the molecular basis of healthy skin aging. PMID:24037343

  3. Genome-Wide Analysis Reveals Novel Genes Essential for Heme Homeostasis in Caenorhabditis elegans

    PubMed Central

    Rao, Anita U.; Cerqueira, Gustavo C.; Mitreva, Makedonka; El-Sayed, Najib M.; Krause, Michael; Hamza, Iqbal

    2010-01-01

    Heme is a cofactor in proteins that function in almost all sub-cellular compartments and in many diverse biological processes. Heme is produced by a conserved biosynthetic pathway that is highly regulated to prevent the accumulation of heme—a cytotoxic, hydrophobic tetrapyrrole. Caenorhabditis elegans and related parasitic nematodes do not synthesize heme, but instead require environmental heme to grow and develop. Heme homeostasis in these auxotrophs is, therefore, regulated in accordance with available dietary heme. We have capitalized on this auxotrophy in C. elegans to study gene expression changes associated with precisely controlled dietary heme concentrations. RNA was isolated from cultures containing 4, 20, or 500 µM heme; derived cDNA probes were hybridized to Affymetrix C. elegans expression arrays. We identified 288 heme-responsive genes (hrgs) that were differentially expressed under these conditions. Of these genes, 42% had putative homologs in humans, while genomes of medically relevant heme auxotrophs revealed homologs for 12% in both Trypanosoma and Leishmania and 24% in parasitic nematodes. Depletion of each of the 288 hrgs by RNA–mediated interference (RNAi) in a transgenic heme-sensor worm strain identified six genes that regulated heme homeostasis. In addition, seven membrane-spanning transporters involved in heme uptake were identified by RNAi knockdown studies using a toxic heme analog. Comparison of genes that were positive in both of the RNAi screens resulted in the identification of three genes in common that were vital for organismal heme homeostasis in C. elegans. Collectively, our results provide a catalog of genes that are essential for metazoan heme homeostasis and demonstrate the power of C. elegans as a genetic animal model to dissect the regulatory circuits which mediate heme trafficking in both vertebrate hosts and their parasites, which depend on environmental heme for survival. PMID:20686661

  4. Comparative Genomic Profiling of Synovium Versus Skin Lesions in Psoriatic Arthritis

    PubMed Central

    Belasco, Jennifer; Louie, James S; Gulati, Nicholas; Wei, Nathan; Nograles, Kristine; Fuentes-Duculan, Judilyn; Mitsui, Hiroshi; Suárez-Fariñas, Mayte; Krueger, James G

    2015-01-01

    Objective To our knowledge, there is no broad genomic analysis comparing skin and synovium in psoriatic arthritis (PsA). Also, there is little understanding of the relative levels of cytokines and chemokines in skin and synovium. The purpose of this study was to better define inflammatory pathways in paired lesional skin and affected synovial tissue in patients with PsA. Methods We conducted a comprehensive analysis of cytokine and chemokine activation and genes representative of the inflammatory processes in PsA. Paired PsA synovial tissue and skin samples were obtained from 12 patients on the same day. Gene expression studies were performed using Affymetrix HGU133 Plus 2.0 arrays. Confirmatory quantitative real-time polymerase chain reaction (PCR) was performed on selected transcripts. Cell populations were assessed by immunohistochemistry and immunofluorescence. Results Globally, gene expression in PsA synovium was more closely related to gene expression in PsA skin than to gene expression in synovium in other forms of arthritis. However, PsA gene expression patterns in skin and synovium were clearly distinct, showing a stronger interleukin-17 (IL-17) gene signature in skin than in synovium and more equivalent tumor necrosis factor (TNF) and interferon-γ gene signatures in both tissues. These results were confirmed with real-time PCR. Conclusion This is the first comprehensive molecular comparison of paired lesional skin and affected synovial tissue samples in PsA. Our results support clinical trial data showing that PsA skin and joint disease are similarly responsive to TNF antagonists, while IL-17 antagonists have better results in PsA skin than in PsA joints. Genes selectively expressed in PsA synovium might direct future therapies for PsA. PMID:25512250

  5. Genome-Wide Survey of Cold Stress Regulated Alternative Splicing in Arabidopsis thaliana with Tiling Microarray

    PubMed Central

    Leviatan, Noam; Alkan, Noam; Leshkowitz, Dena; Fluhr, Robert

    2013-01-01

    Alternative splicing plays a major role in expanding the potential informational content of eukaryotic genomes. It is an important post-transcriptional regulatory mechanism that can increase protein diversity and affect mRNA stability. Alternative splicing is often regulated in a tissue-specific and stress-responsive manner. Cold stress, which adversely affects plant growth and development, regulates the transcription and splicing of plant splicing factors. This can affect the pre-mRNA processing of many genes. To identify cold regulated alternative splicing we applied Affymetrix Arabidopsis tiling arrays to survey the transcriptome under cold treatment conditions. A novel algorithm was used for detection of statistically relevant changes in intron expression within a transcript between control and cold growth conditions. A reverse transcription polymerase chain reaction (RT-PCR) analysis of a number of randomly selected genes confirmed the changes in splicing patterns under cold stress predicted by tiling array. Our analysis revealed new types of cold responsive genes. While their expression level remains relatively unchanged under cold stress their splicing pattern shows detectable changes in the relative abundance of isoforms. The majority of cold regulated alternative splicing introduced a premature termination codon (PTC) into the transcripts creating potential targets for degradation by the nonsense mediated mRNA decay (NMD) process. A number of these genes were analyzed in NMD-defective mutants by RT-PCR and shown to evade NMD. This may result in new and truncated proteins with altered functions or dominant negative effects. The results indicate that cold affects both quantitative and qualitative aspects of gene expression. PMID:23776682

  6. Genome-wide age-related changes in DNA methylation and gene expression in human PBMCs.

    PubMed

    Steegenga, Wilma T; Boekschoten, Mark V; Lute, Carolien; Hooiveld, Guido J; de Groot, Philip J; Morris, Tiffany J; Teschendorff, Andrew E; Butcher, Lee M; Beck, Stephan; Müller, Michael

    2014-06-01

    Aging is a progressive process that results in the accumulation of intra- and extracellular alterations that in turn contribute to a reduction in health. Age-related changes in DNA methylation have been reported before and may be responsible for aging-induced changes in gene expression, although a causal relationship has yet to be shown. Using genome-wide assays, we analyzed age-induced changes in DNA methylation and their effect on gene expression with and without transient induction with the synthetic transcription modulating agent WY14,643. To demonstrate feasibility of the approach, we isolated peripheral blood mononucleated cells (PBMCs) from five young and five old healthy male volunteers and cultured them with or without WY14,643. Infinium 450K BeadChip and Affymetrix Human Gene 1.1 ST expression array analysis revealed significant differential methylation of at least 5 % (ΔYO > 5 %) at 10,625 CpG sites between young and old subjects, but only a subset of the associated genes were also differentially expressed. Age-related differential methylation of previously reported epigenetic biomarkers of aging including ELOVL2, FHL2, PENK, and KLF14 was confirmed in our study, but these genes did not display an age-related change in gene expression in PBMCs. Bioinformatic analysis revealed that differentially methylated genes that lack an age-related expression change predominantly represent genes involved in carcinogenesis and developmental processes, and expression of most of these genes were silenced in PBMCs. No changes in DNA methylation were found in genes displaying transiently induced changes in gene expression. In conclusion, aging-induced differential methylation often targets developmental genes and occurs mostly without change in gene expression. PMID:24789080

  7. Population substructure and control selection in genome-wide association studies.

    PubMed

    Yu, Kai; Wang, Zhaoming; Li, Qizhai; Wacholder, Sholom; Hunter, David J; Hoover, Robert N; Chanock, Stephen; Thomas, Gilles

    2008-01-01

    Determination of the relevance of both demanding classical epidemiologic criteria for control selection and robust handling of population stratification (PS) represents a major challenge in the design and analysis of genome-wide association studies (GWAS). Empirical data from two GWAS in European Americans of the Cancer Genetic Markers of Susceptibility (CGEMS) project were used to evaluate the impact of PS in studies with different control selection strategies. In each of the two original case-control studies nested in corresponding prospective cohorts, a minor confounding effect due to PS (inflation factor lambda of 1.025 and 1.005) was observed. In contrast, when the control groups were exchanged to mimic a cost-effective but theoretically less desirable control selection strategy, the confounding effects were larger (lambda of 1.090 and 1.062). A panel of 12,898 autosomal SNPs common to both the Illumina and Affymetrix commercial platforms and with low local background linkage disequilibrium (pair-wise r(2)<0.004) was selected to infer population substructure with principal component analysis. A novel permutation procedure was developed for the correction of PS that identified a smaller set of principal components and achieved a better control of type I error (to lambda of 1.032 and 1.006, respectively) than currently used methods. The overlap between sets of SNPs in the bottom 5% of p-values based on the new test and the test without PS correction was about 80%, with the majority of discordant SNPs having both ranks close to the threshold. Thus, for the CGEMS GWAS of prostate and breast cancer conducted in European Americans, PS does not appear to be a major problem in well-designed studies. A study using suboptimal controls can have acceptable type I error when an effective strategy for the correction of PS is employed. PMID:18596976

  8. Genome-wide association study of 14,000 cases of seven common diseases and 3,000 shared controls.

    PubMed

    2007-06-01

    There is increasing evidence that genome-wide association (GWA) studies represent a powerful approach to the identification of genes involved in common human diseases. We describe a joint GWA study (using the Affymetrix GeneChip 500K Mapping Array Set) undertaken in the British population, which has examined approximately 2,000 individuals for each of 7 major diseases and a shared set of approximately 3,000 controls. Case-control comparisons identified 24 independent association signals at P < 5 x 10(-7): 1 in bipolar disorder, 1 in coronary artery disease, 9 in Crohn's disease, 3 in rheumatoid arthritis, 7 in type 1 diabetes and 3 in type 2 diabetes. On the basis of prior findings and replication studies thus-far completed, almost all of these signals reflect genuine susceptibility effects. We observed association at many previously identified loci, and found compelling evidence that some loci confer risk for more than one of the diseases studied. Across all diseases, we identified a large number of further signals (including 58 loci with single-point P values between 10(-5) and 5 x 10(-7)) likely to yield additional susceptibility loci. The importance of appropriately large samples was confirmed by the modest effect sizes observed at most loci identified. This study thus represents a thorough validation of the GWA approach. It has also demonstrated that careful use of a shared control group represents a safe and effective approach to GWA analyses of multiple disease phenotypes; has generated a genome-wide genotype database for future studies of common diseases in the British population; and shown that, provided individuals with non-European ancestry are excluded, the extent of population stratification in the British population is generally modest. Our findings offer new avenues for exploring the pathophysiology of these important disorders. We anticipate that our data, results and software, which will be widely available to other investigators, will provide a

  9. Genome-wide association study of 14,000 cases of seven common diseases and 3,000 shared controls

    PubMed Central

    2009-01-01

    There is increasing evidence that genome-wide association (GWA) studies represent a powerful approach to the identification of genes involved in common human diseases. We describe a joint GWA study (using the Affymetrix GeneChip 500K Mapping Array Set) undertaken in the British population, which has examined ~2,000 individuals for each of 7 major diseases and a shared set of ~3,000 controls. Case-control comparisons identified 24 independent association signals at P<5×10-7: 1 in bipolar disorder, 1 in coronary artery disease, 9 in Crohn’s disease, 3 in rheumatoid arthritis, 7 in type 1 diabetes and 3 in type 2 diabetes. On the basis of prior findings and replication studies thus-far completed, almost all of these signals reflect genuine susceptibility effects. We observed association at many previously identified loci, and found compelling evidence that some loci confer risk for more than one of the diseases studied. Across all diseases, we identified a large number of further signals (including 58 loci with single-point P values between 10-5 and 5×10-7) likely to yield additional susceptibility loci. The importance of appropriately large samples was confirmed by the modest effect sizes observed at most loci identified. This study thus represents a thorough validation of the GWA approach. It has also demonstrated that careful use of a shared control group represents a safe and effective approach to GWA analyses of multiple disease phenotypes; has generated a genome-wide genotype database for future studies of common diseases in the British population; and shown that, provided individuals with non-European ancestry are excluded, the extent of population stratification in the British population is generally modest. Our findings offer new avenues for exploring the pathophysiology of these important disorders. We anticipate that our data, results and software, which will be widely available to other investigators, will provide a powerful resource for human genetics

  10. Antiphospholipid antibodies in a large population-based cohort: genome-wide associations and effects on monocyte gene expression.

    PubMed

    Müller-Calleja, Nadine; Rossmann, Heidi; Müller, Christian; Wild, Philipp; Blankenberg, Stefan; Pfeiffer, Norbert; Binder, Harald; Beutel, Manfred E; Manukyan, Davit; Zeller, Tanja; Lackner, Karl J

    2016-07-01

    The antiphospholipid syndrome (APS) is characterised by venous and/or arterial thrombosis and pregnancy morbidity in women combined with the persistent presence of antiphospholipid antibodies (aPL). We aimed to identify genetic factors associated with the presence of aPL in a population based cohort. Furthermore, we wanted to clarify if the presence of aPL affects gene expression in circulating monocytes. Titres of IgG and IgM against cardiolipin, β2glycoprotein 1 (anti-β2GPI), and IgG against domain 1 of β2GPI (anti-domain 1) were determined in approx. 5,000 individuals from the Gutenberg Health Study (GHS) a population based cohort of German descent. Genotyping was conducted on Affymetrix Genome-Wide Human SNP 6.0 arrays. Monocyte gene expression was determined in a subgroup of 1,279 individuals by using the Illumina HT-12 v3 BeadChip. Gene expression data were confirmed in vitro and ex vivo by qRT-PCR. Genome wide analysis revealed significant associations of anti-β2GPI IgG and APOH on chromosome 17, which had been previously identified by candidate gene approaches, and of anti-domain1 and MACROD2 on chromosome 20 which has been listed in a previous GWAS as a suggestive locus associated with the occurrence of anti-β2GPI antibodies. Expression analysis confirmed increased expression of TNFα in monocytes and identified and confirmed neuron navigator 3 (NAV3) as a novel gene induced by aPL. In conclusion, MACROD2 represents a novel genetic locus associated with aPL. Furthermore, we show that aPL induce the expression of NAV3 in monocytes and endothelial cells. This will stimulate further research into the role of these genes in the APS. PMID:27098658

  11. A Genome-wide Association Analysis of a Broad Psychosis Phenotype Identifies Three Loci for Further Investigation

    PubMed Central

    2014-01-01

    Background Genome-wide association studies (GWAS) have identified several loci associated with schizophrenia and/or bipolar disorder. We performed a GWAS of psychosis as a broad syndrome rather than within specific diagnostic categories. Methods 1239 cases with schizophrenia, schizoaffective disorder, or psychotic bipolar disorder; 857 of their unaffected relatives, and 2739 healthy controls were genotyped with the Affymetrix 6.0 single nucleotide polymorphism (SNP) array. Analyses of 695,193 SNPs were conducted using UNPHASED, which combines information across families and unrelated individuals. We attempted to replicate signals found in 23 genomic regions using existing data on nonoverlapping samples from the Psychiatric GWAS Consortium and Schizophrenia-GENE-plus cohorts (10,352 schizophrenia patients and 24,474 controls). Results No individual SNP showed compelling evidence for association with psychosis in our data. However, we observed a trend for association with same risk alleles at loci previously associated with schizophrenia (one-sided p = .003). A polygenic score analysis found that the Psychiatric GWAS Consortium’s panel of SNPs associated with schizophrenia significantly predicted disease status in our sample (p = 5 × 10–14) and explained approximately 2% of the phenotypic variance. Conclusions Although narrowly defined phenotypes have their advantages, we believe new loci may also be discovered through meta-analysis across broad phenotypes. The novel statistical methodology we introduced to model effect size heterogeneity between studies should help future GWAS that combine association evidence from related phenotypes. Applying these approaches, we highlight three loci that warrant further investigation. We found that SNPs conveying risk for schizophrenia are also predictive of disease status in our data. PMID:23871474

  12. Genome-Wide Gene Expression Profiling Reveals Conserved and Novel Molecular Functions of the Stigma in Rice1[W

    PubMed Central

    Li, Meina; Xu, Wenying; Yang, Wenqiang; Kong, Zhaosheng; Xue, Yongbiao

    2007-01-01

    In angiosperms, the stigma provides initial nutrients and guidance cues for pollen grain germination and tube growth. However, little is known about the genes that regulate these processes in rice (Oryza sativa). Here, we generate rice stigma-specific or -preferential gene expression profiles through comparing genome-wide expression patterns of hand-dissected, unpollinated stigma at anthesis with seven tissues, including seedling shoot, seedling root, mature anther, ovary at anthesis, seeds 5 d after pollination, 10-d-old embryo, 10-d-old endosperm, and suspension-cultured cells by using both 57 K Affymetrix rice whole-genome array and 10 K rice cDNA microarray. A high reproducibility of the microarray results was detected between the two different technology platforms. In total, we identified 548 genes to be expressed specifically or predominantly in the stigma papillar cells of rice. Real-time quantitative reverse transcription-polymerase chain reaction analysis of 34 selected genes all confirmed their stigma-specific expression. The expression of five selected genes was further validated by RNA in situ hybridization. Gene Ontology analysis shows that several auxin-signaling components, transcription, and stress-related genes are significantly overrepresented in the rice stigma gene set. Interestingly, most of them also share several cis-regulatory elements with known stress-responsive genes, supporting the notion of an overlap of genetic programs regulating pollination and stress/defense responses. We also found that genes involved in cell wall metabolism and cellular communication appear to be conserved in the stigma between rice and Arabidopsis (Arabidopsis thaliana). Our results indicate that the stigmas appear to have conserved and novel molecular functions between rice and Arabidopsis. PMID:17556504

  13. Computational Systems Biology Approach Predicts Regulators and Targets of microRNAs and Their Genomic Hotspots in Apoptosis Process.

    PubMed

    Alanazi, Ibrahim O; Ebrahimie, Esmaeil

    2016-07-01

    Novel computational systems biology tools such as common targets analysis, common regulators analysis, pathway discovery, and transcriptomic-based hotspot discovery provide new opportunities in understanding of apoptosis molecular mechanisms. In this study, after measuring the global contribution of microRNAs in the course of apoptosis by Affymetrix platform, systems biology tools were utilized to obtain a comprehensive view on the role of microRNAs in apoptosis process. Network analysis and pathway discovery highlighted the crosstalk between transcription factors and microRNAs in apoptosis. Within the transcription factors, PRDM1 showed the highest upregulation during the course of apoptosis, with more than 9-fold expression increase compared to non-apoptotic condition. Within the microRNAs, MIR1208 showed the highest expression in non-apoptotic condition and downregulated by more than 6 fold during apoptosis. Common regulators algorithm showed that TNF receptor is the key upstream regulator with a high number of regulatory interactions with the differentially expressed microRNAs. BCL2 and AKT1 were the key downstream targets of differentially expressed microRNAs. Enrichment analysis of the genomic locations of differentially expressed microRNAs led us to the discovery of chromosome bands which were highly enriched (p < 0.01) with the apoptosis-related microRNAs, such as 13q31.3, 19p13.13, and Xq27.3 This study opens a new avenue in understanding regulatory mechanisms and downstream functions in the course of apoptosis as well as distinguishing genomic-enriched hotspots for apoptosis process. PMID:27178576

  14. Mosquito genomics: progress and challenges.

    PubMed

    Severson, David W; Behura, Susanta K

    2012-01-01

    The whole-genome sequencing of mosquitoes has facilitated our understanding of fundamental biological processes at their basic molecular levels and holds potential for application to mosquito control and prevention of mosquito-borne disease transmission. Draft genome sequences are available for Anopheles gambiae, Aedes aegypti, and Culex quinquefasciatus. Collectively, these represent the major vectors of African malaria, dengue fever and yellow fever viruses, and lymphatic filariasis, respectively. Rapid advances in genome technologies have revealed detailed information on genome architecture as well as phenotype-specific transcriptomics and proteomics. These resources allow for detailed comparative analyses within and across populations as well as species. Next-generation sequencing technologies will likely promote a proliferation of genome sequences for additional mosquito species as well as for individual insects. Here we review the current status of genome research in mosquitoes and identify potential areas for further investigations. PMID:21942845

  15. Invariants of DNA genomic signals

    NASA Astrophysics Data System (ADS)

    Cristea, Paul Dan A.

    2005-02-01

    For large scale analysis purposes, the conversion of genomic sequences into digital signals opens the possibility to use powerful signal processing methods for handling genomic information. The study of complex genomic signals reveals large scale features, maintained over the scale of whole chromosomes, that would be difficult to find by using only the symbolic representation. Based on genomic signal methods and on statistical techniques, the paper defines parameters of DNA sequences which are invariant to transformations induced by SNPs, splicing or crossover. Re-orienting concatenated coding regions in the same direction, regularities shared by the genomic material in all exons are revealed, pointing towards the hypothesis of a regular ancestral structure from which the current chromosome structures have evolved. This property is not found in non-nuclear genomic material, e.g., plasmids.

  16. The genome of Eucalyptus grandis.

    PubMed

    Myburg, Alexander A; Grattapaglia, Dario; Tuskan, Gerald A; Hellsten, Uffe; Hayes, Richard D; Grimwood, Jane; Jenkins, Jerry; Lindquist, Erika; Tice, Hope; Bauer, Diane; Goodstein, David M; Dubchak, Inna; Poliakov, Alexandre; Mizrachi, Eshchar; Kullan, Anand R K; Hussey, Steven G; Pinard, Desre; van der Merwe, Karen; Singh, Pooja; van Jaarsveld, Ida; Silva-Junior, Orzenil B; Togawa, Roberto C; Pappas, Marilia R; Faria, Danielle A; Sansaloni, Carolina P; Petroli, Cesar D; Yang, Xiaohan; Ranjan, Priya; Tschaplinski, Timothy J; Ye, Chu-Yu; Li, Ting; Sterck, Lieven; Vanneste, Kevin; Murat, Florent; Soler, Marçal; Clemente, Hélène San; Saidi, Naijib; Cassan-Wang, Hua; Dunand, Christophe; Hefer, Charles A; Bornberg-Bauer, Erich; Kersting, Anna R; Vining, Kelly; Amarasinghe, Vindhya; Ranik, Martin; Naithani, Sushma; Elser, Justin; Boyd, Alexander E; Liston, Aaron; Spatafora, Joseph W; Dharmwardhana, Palitha; Raja, Rajani; Sullivan, Christopher; Romanel, Elisson; Alves-Ferreira, Marcio; Külheim, Carsten; Foley, William; Carocha, Victor; Paiva, Jorge; Kudrna, David; Brommonschenkel, Sergio H; Pasquali, Giancarlo; Byrne, Margaret; Rigault, Philippe; Tibbits, Josquin; Spokevicius, Antanas; Jones, Rebecca C; Steane, Dorothy A; Vaillancourt, René E; Potts, Brad M; Joubert, Fourie; Barry, Kerrie; Pappas, Georgios J; Strauss, Steven H; Jaiswal, Pankaj; Grima-Pettenati, Jacqueline; Salse, Jérôme; Van de Peer, Yves; Rokhsar, Daniel S; Schmutz, Jeremy

    2014-06-19

    Eucalypts are the world's most widely planted hardwood trees. Their outstanding diversity, adaptability and growth have made them a global renewable resource of fibre and energy. We sequenced and assembled >94% of the 640-megabase genome of Eucalyptus grandis. Of 36,376 predicted protein-coding genes, 34% occur in tandem duplications, the largest proportion thus far in plant genomes. Eucalyptus also shows the highest diversity of genes for specialized metabolites such as terpenes that act as chemical defence and provide unique pharmaceutical oils. Genome sequencing of the E. grandis sister species E. globulus and a set of inbred E. grandis tree genomes reveals dynamic genome evolution and hotspots of inbreeding depression. The E. grandis genome is the first reference for the eudicot order Myrtales and is placed here sister to the eurosids. This resource expands our understanding of the unique biology of large woody perennials and provides a powerful tool to accelerate comparative biology, breeding and biotechnology. PMID:24919147

  17. Big Data: Astronomical or Genomical?

    PubMed

    Stephens, Zachary D; Lee, Skylar Y; Faghri, Faraz; Campbell, Roy H; Zhai, Chengxiang; Efron, Miles J; Iyer, Ravishankar; Schatz, Michael C; Sinha, Saurabh; Robinson, Gene E

    2015-07-01

    Genomics is a Big Data science and is going to get much bigger, very soon, but it is not known whether the needs of genomics will exceed other Big Data domains. Projecting to the year 2025, we compared genomics with three other major generators of Big Data: astronomy, YouTube, and Twitter. Our estimates show that genomics is a "four-headed beast"--it is either on par with or the most demanding of the domains analyzed here in terms of data acquisition, storage, distribution, and analysis. We discuss aspects of new technologies that will need to be developed to rise up and meet the computational challenges that genomics poses for the near future. Now is the time for concerted, community-wide planning for the "genomical" challenges of the next decade. PMID:26151137

  18. Genomics Nursing Faculty Champion Initiative

    PubMed Central

    Jenkins, Jean; Calzone, Kathleen A.

    2016-01-01

    Nurse faculty are challenged to keep up with the emerging and fast-paced field of genomics and the mandate to prepare the nursing workforce to be able to translate genomic research advances into routine clinical care. Using Faculty Champions and other options, the initiative stimulated curriculum development and promoted genomics curriculum integration. The authors summarize this yearlong initiative for undergraduate and graduate nursing faculty. PMID:24300251

  19. Cactus Graphs for Genome Comparisons

    NASA Astrophysics Data System (ADS)

    Paten, Benedict; Diekhans, Mark; Earl, Dent; St. John, John; Ma, Jian; Suh, Bernard; Haussler, David

    We introduce a data structure, analysis and visualization scheme called a cactus graph for comparing sets of related genomes. Cactus graphs capture some of the advantages of de Bruijn and breakpoint graphs in one unified framework. They naturally decompose the common substructures in a set of related genomes into a hierarchy of chains that can be visualized as multiple alignments and nets that can be visualized in circular genome plots.

  20. Programs | Office of Cancer Genomics

    Cancer.gov

    OCG facilitates cancer genomics research through a series of highly-focused programs. These programs generate and disseminate genomic data for use by the cancer research community. OCG programs also promote advances in technology-based infrastructure and create valuable experimental reagents and tools. OCG programs encourage collaboration by interconnecting with other genomics and cancer projects in order to accelerate translation of findings into the clinic. Below are OCG’s current, completed, and initiated programs:

  1. Genome walking by Klenow polymerase.

    PubMed

    Volpicella, Mariateresa; Leoni, Claudia; Fanizza, Immacolata; Rius, Sebastian; Gallerani, Raffaele; Ceci, Luigi R

    2012-11-15

    Genome walking procedures are all based on a final polymerase chain reaction amplification, regardless of the strategy employed for the synthesis of the substrate molecule. Here we report a modification of an already established genome walking strategy in which a single-strand DNA substrate is obtained by primer extension driven by Klenow polymerase and which results suitable for the direct sequencing of complex eukaryotic genomes. The efficacy of the method is demonstrated by the identification of nucleotide sequences in the case of two gene families (chiA and P1) in the genomes of several maize species. PMID:22922302

  2. Global efforts in structural genomics.

    PubMed

    Stevens, R C; Yokoyama, S; Wilson, I A

    2001-10-01

    A worldwide initiative in structural genomics aims to capitalize on the recent successes of the genome projects. Substantial new investments in structural genomics in the past 2 years indicate the high level of support for these international efforts. Already, enormous progress has been made on high-throughput methodologies and technologies that will speed up macromolecular structure determinations. Recent international meetings have resulted in the formation of an International Structural Genomics Organization to formulate policy and foster cooperation between the public and private efforts. PMID:11588249

  3. Genomic medicine and neurological disease

    PubMed Central

    Boone, Philip M.; Wiszniewski, Wojciech; Lupski, James R.

    2011-01-01

    Genomic medicine” refers to the diagnosis, optimized management, and treatment of disease—as well as screening, counseling, and disease gene identification—in the context of information provided by an individual patient’s personal genome. Genomic medicine, to some extent synonymous with “personalized medicine,” has been made possible by recent advances in genome technologies. Genomic medicine represents a new approach to health care and disease management that attempts to optimize the care of a patient based upon information gleaned from his or her personal genome sequence. In this review, we describe recent progress in genomic medicine as it relates to neurological disease. Many neurological disorders either segregate as Mendelian phenotypes or occur sporadically in association with a new mutation in a single gene. Heritability also contributes to other neurological conditions that appear to exhibit more complex genetics. In addition to discussing current knowledge in this field, we offer suggestions for maximizing the utility of genomic information in clinical practice as the field of genomic medicine unfolds. PMID:21594611

  4. Genomics of Bacillus Species

    NASA Astrophysics Data System (ADS)

    Økstad, Ole Andreas; Kolstø, Anne-Brit

    Members of the genus Bacillus are rod-shaped spore-forming bacteria belonging to the Firmicutes, the low G+C gram-positive bacteria. The Bacillus genus was first described and classified by Ferdinand Cohn in Cohn (1872), and Bacillus subtilis was defined as the type species (Soule, 1932). Several Bacilli may be linked to opportunistic infections. However, pathogenicity among Bacillus spp. is mainly a feature of bacteria belonging to the Bacillus cereus group, including B. cereus, Bacillus anthracis, and Bacillus thuringiensis. Here we review the genomics of B. cereus group bacteria in relation to their roles as etiological agents of two food poisoning syndromes (emetic and diarrhoeal).

  5. The human genome project

    SciTech Connect

    Bell, G.I.

    1991-06-01

    The Human Genome Project will obtain high-resolution genetic and physical maps of each human chromosome and, somewhat later, of the complete nucleotide sequence of the deoxyribonucleic acid (DNA) in a human cell. The talk will begin with an extended introduction to explain the Project to nonbiologists and to show that map construction and sequence determination require extensive computation in order to determine the correct order of the mapped entities and to provide estimates of uncertainty. Computational analysis of the sequence data will become an increasingly important part of the project, and some computational challenges are described. 5 refs.

  6. Beyond the dna: a prototype for functional genomics

    SciTech Connect

    Albala, J

    2000-03-02

    A prototype oligonucleotide ''functional chip'' has been developed to screen novel DNA repair proteins for their ability to bind or alter different forms of DNA. This chip has been developed as a functional genomics screen for analysis of protein-DNA interactions for novel proteins identified from the Human Genome Project The process of novel gene identification that has ensued as a consequence of available sequence information is remarkable. The challenge how lies in determining the function of newly identified gene products in a time-and cost-effective high-throughput manner. The functional chip is generated by the robotic application of DNA spotted in a microarray format onto a glass slide. Individual proteins are then analyzed against the different form of DNA bound to the slide. Several prototype functional chips were designed to contain various DNA fragments tethered to a glass slide for analysis of protein-DNA binding or enzymatic activity of known proteins. The technology has been developed to screen novel, putative DNA repair proteins for their ability to bind various types of DNA alone and in concert with protein partners. An additional scheme has been devised to screen putative repair enzymes for their ability to process different types of DNA molecules. Current methods to analyze gene expression primarily utilize either of two technologies. The oligonucleotide chip, pioneered by Fodor and co-workers and Affymetrix, Inc., consists of greater than 64,000 oligonucleotides attached in situ to a glass support. The oligonucleotide chip has been used primarily to identify specific mutations in a given gene by hybridization against a fluorescently-labeled substrate. The second method is the microarray, whereby DNA targets are systematically arranged on a glass slide and then hybridized with fluorescently-labeled complex targets for gene expression analysis (Jordan, 1998). By this technique, a large amount of information can be obtained examining global

  7. Human Genome Program Image Gallery (from genomics.energy.gov)

    DOE Data Explorer

    This collection contains approximately 240 images from the genome programs of DOE's Office of Science. The images are divided into galleries related to biofuels research, systems biology, and basic genomics. Each image has a title, a basic citation, and a credit or source. Most of the images are original graphics created by the Genome Management Information System (GMIS). GMIS images are recognizable by their credit line. Permission to use these graphics is not needed, but please credit the U.S. Department of Energy Genome Programs and provide the website http://genomics.energy.gov. Other images were provided by third parties and not created by the U.S. Department of Energy. Users must contact the person listed in the credit line before using those images. The high-resolution images can be downloaded.

  8. A Taste of Algal Genomes from the Joint Genome Institute

    SciTech Connect

    Kuo, Alan; Grigoriev, Igor

    2012-06-17

    Algae play profound roles in aquatic food chains and the carbon cycle, can impose health and economic costs through toxic blooms, provide models for the study of symbiosis, photosynthesis, and eukaryotic evolution, and are candidate sources for bio-fuels; all of these research areas are part of the mission of DOE's Joint Genome Institute (JGI). To date JGI has sequenced, assembled, annotated, and released to the public the genomes of 18 species and strains of algae, sampling almost all of the major clades of photosynthetic eukaryotes. With more algal genomes currently undergoing analysis, JGI continues its commitment to driving forward basic and applied algal science. Among these ongoing projects are the pan-genome of the dominant coccolithophore Emiliania huxleyi, the interrelationships between the 4 genomes in the nucleomorph-containing Bigelowiella natans and Guillardia theta, and the search for symbiosis genes of lichens.

  9. Modifying the Mitochondrial Genome.

    PubMed

    Patananan, Alexander N; Wu, Ting-Hsiang; Chiou, Pei-Yu; Teitell, Michael A

    2016-05-10

    Human mitochondria produce ATP and metabolites to support development and maintain cellular homeostasis. Mitochondria harbor multiple copies of a maternally inherited, non-nuclear genome (mtDNA) that encodes for 13 subunit proteins of the respiratory chain. Mutations in mtDNA occur mainly in the 24 non-coding genes, with specific mutations implicated in early death, neuromuscular and neurodegenerative diseases, cancer, and diabetes. A significant barrier to new insights in mitochondrial biology and clinical applications for mtDNA disorders is our general inability to manipulate the mtDNA sequence. Microinjection, cytoplasmic fusion, nucleic acid import strategies, targeted endonucleases, and newer approaches, which include the transfer of genomic DNA, somatic cell reprogramming, and a photothermal nanoblade, attempt to change the mtDNA sequence in target cells with varying efficiencies and limitations. Here, we discuss the current state of manipulating mammalian mtDNA and provide an outlook for mitochondrial reverse genetics, which could further enable mitochondrial research and therapies for mtDNA diseases. PMID:27166943

  10. Parsing of genomic graffiti

    SciTech Connect

    Tibbetts, C.; Golden, J. III; Torgersen, D.

    1996-12-31

    A focal point of modern biology is investigation of wide varieties of phenomena at the level of molecular genetics. The nucleotide sequences of deoxyribonucleic acid (DNA) and ribonucleic acid (RNA) define the ultimate resolution of this reductionist approach to understand the determinants of heritable traits. The structure and function of genes, their composite genomic organization, and their regulated expression have been studied in systems representing every class of organism. Many human diseases or pathogenic syndromes can be directly attributed to inherited defects in either the regulated expression, or the quality of the products of specific genes. Genetic determinants of susceptibility to infectious agents or environmental hazards are amply documented. Mapping and sequencing of the DNA molecules encoding human genes have provided powerful technology for pharmaceutical bioengineering and forensic investigations. From an alternative perspective, we may anticipate that voluminous archives of singular DNA sequences alone will not suffice to define and understand the functional determinants of genome organization, allelic diversity and evolutionary plasticity of living organisms. New insights will accumulate pertaining to human evolutionary origins and relationships of human biology to models based on other mammals. Investigators of population genetics and epidemiology now exploit the technology of molecular genetics to more powerfully probe variation within the human gene pool at the level of DNA sequences. 40 refs., 7 figs., 2 tabs.

  11. Genomic imprinting and cancer.

    PubMed

    Brenton, J D; Viville, S; Surani, M A

    1995-01-01

    Imprinting is vital for normal development, and disruption of imprinting mechanisms on syntenic chromosomes gives very similar phenotypes in mouse and humans. In addition, disruption of normal imprinting provides a plausible explanation for preferential LOH in some embryonal tumours. Moreover, there is evidence that in Wilms' tumour, dysregulation of specific imprinted genes may give rise to the cancer phenotype. Many more questions regarding genomic imprinting need to be answered before the associations described in this review can be properly understood. The most basic issues, such as when and how the imprint is established, can still only be speculated upon. Further study of new imprinted genes and the relationship between their domains and differential replication may show us higher control mechanisms than methylation alone. It remains to be seen if these epigenetic modifications are amenable to therapeutic change in the treatment of inherited syndromes and cancer, or if they can be used to assess individuals at risk of disease. Until then it is probably unwise to speculate on a single unifying theory that explains why a subset of the genome shows such a peculiar non-Mendelian form of inheritance. PMID:8718517

  12. Genomic Analysis of Reactive Astrogliosis

    PubMed Central

    Zamanian, JL; Xu, L; Foo, LC; Nouri, N; Zhou, L; Giffard, RG; Barres, BA

    2012-01-01

    Reactive astrogliosis is characterized by a profound change in astrocyte phenotype in response to all CNS injuries and diseases. To better understand the reactive astrocyte state, we used Affymetrix GeneChip arrays to profile gene expression in populations of reactive astrocytes isolated at various time points after induction using two mouse injury models, ischemic stroke and neuroinflammation. We find reactive gliosis consists of a rapid, but quickly attenuated induction of gene expression after insult and identify two induced genes, Lcn2 and Serpina3n, as strong markers of reactive astrocytes. Strikingly, reactive astrocyte phenotype strongly depended on the type of inducing injury. Although there is a core set of genes that is up-regulated in reactive astrocytes from both injury models, at least 50% of the altered gene expression is specific to a given injury type. Reactive astrocytes in ischemia exhibited a molecular phenotype that suggests that they may be beneficial or protective, whereas reactive astrocytes induced by LPS exhibited a phenotype that suggests that they may be detrimental. These findings demonstrate that, despite well established commonalities, astrocyte reactive gliosis is a highly heterogeneous state in which astrocyte activities are altered to respond to the specific injury. This raises the question of how many subtypes of reactive astrocytes exist. Our findings provide transcriptome databases for two subtypes of reactive astrocytes that will be highly useful in generating new and testable hypotheses of their function, as well as for providing new markers to detect different types of reactive astrocytes in human neurological diseases. PMID:22553043

  13. A 2-Stage Genome-Wide Association Study to Identify Single Nucleotide Polymorphisms Associated With Development of Erectile Dysfunction Following Radiation Therapy for Prostate Cancer

    SciTech Connect

    Kerns, Sarah L.; Stock, Richard; Stone, Nelson; Buckstein, Michael; Shao, Yongzhao; Campbell, Christopher; Rath, Lynda; De Ruysscher, Dirk; Lammering, Guido; Hixson, Rosetta; Cesaretti, Jamie; Terk, Mitchell; Ostrer, Harry; Rosenstein, Barry S.

    2013-01-01

    Purpose: To identify single nucleotide polymorphisms (SNPs) associated with development of erectile dysfunction (ED) among prostate cancer patients treated with radiation therapy. Methods and Materials: A 2-stage genome-wide association study was performed. Patients were split randomly into a stage I discovery cohort (132 cases, 103 controls) and a stage II replication cohort (128 cases, 102 controls). The discovery cohort was genotyped using Affymetrix 6.0 genome-wide arrays. The 940 top ranking SNPs selected from the discovery cohort were genotyped in the replication cohort using Illumina iSelect custom SNP arrays. Results: Twelve SNPs identified in the discovery cohort and validated in the replication cohort were associated with development of ED following radiation therapy (Fisher combined P values 2.1 Multiplication-Sign 10{sup -5} to 6.2 Multiplication-Sign 10{sup -4}). Notably, these 12 SNPs lie in or near genes involved in erectile function or other normal cellular functions (adhesion and signaling) rather than DNA damage repair. In a multivariable model including nongenetic risk factors, the odds ratios for these SNPs ranged from 1.6 to 5.6 in the pooled cohort. There was a striking relationship between the cumulative number of SNP risk alleles an individual possessed and ED status (Sommers' D P value = 1.7 Multiplication-Sign 10{sup -29}). A 1-allele increase in cumulative SNP score increased the odds for developing ED by a factor of 2.2 (P value = 2.1 Multiplication-Sign 10{sup -19}). The cumulative SNP score model had a sensitivity of 84% and specificity of 75% for prediction of developing ED at the radiation therapy planning stage. Conclusions: This genome-wide association study identified a set of SNPs that are associated with development of ED following radiation therapy. These candidate genetic predictors warrant more definitive validation in an independent cohort.

  14. Genome-wide scan with nearly 700 000 SNPs in two Sardinian sub-populations suggests some regions as candidate targets for positive selection

    PubMed Central

    Piras, Ignazio Stefano; De Montis, Antonella; Calò, Carla Maria; Marini, Monica; Atzori, Manuela; Corrias, Laura; Sazzini, Marco; Boattini, Alessio; Vona, Giuseppe; Contu, Licinio

    2012-01-01

    This paper explores the genetic structure and signatures of natural selection in different sub-populations from the Island of Sardinia, exploiting information from nearly 700 000 autosomal SNPs genotyped with the Affymetrix Genome-Wide Human SNP 6.0 Array. The genetic structure of the Sardinian population and its position within the context of other Mediterranean and European human groups were investigated in depth by comparing our data with publicly available data sets. Principal components and admixture analyses suggest a clustering of the examined samples in two significantly differentiated sub-populations (Ogliastra and Southern Sardinia), as confirmed by AMOVA (FST=0.011; P<0.001). Differentiation of these sub-populations was still evident when they were pooled together with supplementary Sardinian samples from HGDP and compared with several other European, North-African and Near Eastern populations, confirming the uniqueness of the Sardinian genetic background. Moreover, by applying several statistical approaches aimed at assessing differences at the SNP level, the highest differentiated genomic regions between Ogliastra and Southern Sardinia were thus investigated via an extended haplotype homozygosity (EHH)-based test to point out potential selective sweeps. Using this approach, 40 genomic regions were detected, with significant differences between Ogliastra and Southern Sardinia. These regions were subsequently investigated using a long-range haplotype test, which found significant REHH values for SNPs rs11070188 and rs11070192 in the Ogliastra sub-population. In the light of these results and the overlap of the different computed statistics, the region encompassing these loci can be considered a strong candidate to have undergone selective pressure in Ogliastra. PMID:22535185

  15. OryzaGenome: Genome Diversity Database of Wild Oryza Species

    PubMed Central

    Ohyanagi, Hajime; Ebata, Toshinobu; Huang, Xuehui; Gong, Hao; Fujita, Masahiro; Mochizuki, Takako; Toyoda, Atsushi; Fujiyama, Asao; Kaminuma, Eli; Nakamura, Yasukazu; Feng, Qi; Wang, Zi-Xuan; Han, Bin; Kurata, Nori

    2016-01-01

    The species in the genus Oryza, encompassing nine genome types and 23 species, are a rich genetic resource and may have applications in deeper genomic analyses aiming to understand the evolution of plant genomes. With the advancement of next-generation sequencing (NGS) technology, a flood of Oryza species reference genomes and genomic variation information has become available in recent years. This genomic information, combined with the comprehensive phenotypic information that we are accumulating in our Oryzabase, can serve as an excellent genotype–phenotype association resource for analyzing rice functional and structural evolution, and the associated diversity of the Oryza genus. Here we integrate our previous and future phenotypic/habitat information and newly determined genotype information into a united repository, named OryzaGenome, providing the variant information with hyperlinks to Oryzabase. The current version of OryzaGenome includes genotype information of 446 O. rufipogon accessions derived by imputation and of 17 accessions derived by imputation-free deep sequencing. Two variant viewers are implemented: SNP Viewer as a conventional genome browser interface and Variant Table as a text-based browser for precise inspection of each variant one by one. Portable VCF (variant call format) file or tab-delimited file download is also available. Following these SNP (single nucleotide polymorphism) data, reference pseudomolecules/scaffolds/contigs and genome-wide variation information for almost all of the closely and distantly related wild Oryza species from the NIG Wild Rice Collection will be available in future releases. All of the resources can be accessed through http://viewer.shigen.info/oryzagenome/. PMID:26578696

  16. All about the Human Genome Project (HGP)

    MedlinePlus

    ... full human sequence All About The Human Genome Project (HGP) The Human Genome Project (HGP) was one of the great feats of ... Organisms A Quarter Century after the Human Genome Project's Launch: Lessons Beyond the Base Pairs October 1, ...

  17. International genomic evaluation methods for dairy cattle

    Technology Transfer Automated Retrieval System (TEKTRAN)

    Background Genomic evaluations are rapidly replacing traditional evaluation systems used for dairy cattle selection. Economies of scale in genomics promote cooperation across country borders. Genomic information can be transferred across countries using simple conversion equations, by modifying mult...

  18. Surveying Breast Cancer's Genomic Landscape.

    PubMed

    2016-07-01

    An in-depth analysis has produced the most comprehensive portrait to date of the myriad genomic alterations involved in breast cancer. In sequencing the whole genomes of 560 breast cancers and combining this information with published data from another 772 breast tumors, the research team uncovered several new genes and mutational signatures that potentially influence this disease. PMID:27225883

  19. CROP GENOME DATABASES -- CRITICAL ISSUES

    Technology Transfer Automated Retrieval System (TEKTRAN)

    Crop genome databases, see www.agron.missouri.edu/bioservers.html of the past decade have had designed and implemented (1) models and schema for the genome and related domains; (2) methodologies for input of data by expert biologists and high-throughput projects; and (3) various text, graphical, and...

  20. Cocoa/Cotton Comparative Genomics

    Technology Transfer Automated Retrieval System (TEKTRAN)

    With genome sequence from two members of the Malvaceae family recently made available, we are exploring syntenic relationships, gene content, and evolutionary trajectories between the cacao and cotton genomes. An assembly of cacao (Theobroma cacao) using Illumina and 454 sequence technology yielded ...

  1. Genomics and Weeds: A Synthesis

    Technology Transfer Automated Retrieval System (TEKTRAN)

    Genomics can be used to solve many problems associated with the management of weeds. New target sites for herbicides have been discovered through functional genomic approaches to determine gene function. Modes of action of herbicides can be clarified or discovered by transcriptome analysis. Under...

  2. Plant cytogenetics in genome databases

    Technology Transfer Automated Retrieval System (TEKTRAN)

    Cytogenetic maps provide an integrated representation of genetic and cytological information that can be used to enhance genome and chromosome research. As genome analysis technologies become more affordable, the density of markers on cytogenetic maps increases, making these resources more useful a...

  3. From genes to genome biology

    SciTech Connect

    Pennisi, E.

    1996-06-21

    This article describes a change in the approach to mapping genomes, from looking at one gene at a time, to other approaches. Strategies include everything from lab techniques to computer programs designed to analyze whole batches of genes at once. Also included is a update on the work on the human genome.

  4. Fueling Future with Algal Genomics

    SciTech Connect

    Grigoriev, Igor

    2012-07-05

    Algae constitute a major component of fundamental eukaryotic diversity, play profound roles in the carbon cycle, and are prominent candidates for biofuel production. The US Department of Energy Joint Genome Institute (JGI) is leading the world in algal genome sequencing (http://jgi.doe.gov/Algae) and contributes of the algal genome projects worldwide (GOLD database, 2012). The sequenced algal genomes offer catalogs of genes, networks, and pathways. The sequenced first of its kind genomes of a haptophyte E.huxleyii, chlorarachniophyte B.natans, and cryptophyte G.theta fill the gaps in the eukaryotic tree of life and carry unique genes and pathways as well as molecular fossils of secondary endosymbiosis. Natural adaptation to conditions critical for industrial production is encoded in algal genomes, for example, growth of A.anophagefferens at very high cell densities during the harmful algae blooms or a global distribution across diverse environments of E.huxleyii, able to live on sparse nutrients due to its expanded pan-genome. Communications and signaling pathways can be derived from simple symbiotic systems like lichens or complex marine algae metagenomes. Collectively these datasets derived from algal genomics contribute to building a comprehensive parts list essential for algal biofuel development.

  5. Genomic Evaluations: Past, Present, Future

    Technology Transfer Automated Retrieval System (TEKTRAN)

    Genomic evaluation has been implemented in dairy cattle causing profound changes in dairy cattle breeding. All young bulls purchased by major AI organizations are selected based on genomic evaluations. The reliability of these evaluations reaches the mid seventies for yield traits and is adequate to...

  6. Functional Genomics Tools for Papaya

    Technology Transfer Automated Retrieval System (TEKTRAN)

    With the genome of papaya (Carica papaya L.) sequenced, the study of gene function is becoming an increasing priority. Our research is to develop an RNA-induced gene silencing tool for the study of functional genomics in papaya. We employed agrobacterium leaf infiltration to induce PTGS in '-glucuro...

  7. Quantitative Genomics of Male Reproduction

    Technology Transfer Automated Retrieval System (TEKTRAN)

    The objective of the review was to establish the current status of quantitative genomics for male reproduction. Genetic variation exists for male reproduction traits. These traits are expensive and time consuming traits to evaluate through conventional breeding schemes. Genomics is an alternative to...

  8. How Can Genomics Inform Education?

    ERIC Educational Resources Information Center

    Grigorenko, Elena L.

    2007-01-01

    This article offers some thoughts on possible connections between genomics and education. Genomics is already revolutionizing the way medical care is delivered and distributed; it will inevitably affect children's developmental trajectories by introducing more pharmacological and behavioral therapies. Educators should be prepared to understand the…

  9. Mycobacterium avium subsp. paratuberculosis Genome

    Technology Transfer Automated Retrieval System (TEKTRAN)

    The completion of the MAP K-10 genome sequence has opened the doors to many new avenues of research. In the few years since the publication of the genome sequence, the manuscript describing the completed sequence has been cited in the scientific literature more than 85 times. The public availabili...

  10. Future Health Applications of Genomics

    PubMed Central

    McBride, Colleen M.; Bowen, Deborah; Brody, Lawrence C.; Condit, Celeste M.; Croyle, Robert T.; Gwinn, Marta; Khoury, Muin J.; Koehly, Laura M.; Korf, Bruce R.; Marteau, Theresa M.; McLeroy, Kenneth; Patrick, Kevin; Valente, Thomas W.

    2014-01-01

    Despite the quickening momentum of genomic discovery, the communication, behavioral, and social sciences research needed for translating this discovery into public health applications has lagged behind. The National Human Genome Research Institute held a 2-day workshop in October 2008 convening an interdisciplinary group of scientists to recommend forward-looking priorities for translational research. This research agenda would be designed to redress the top three risk factors (tobacco use, poor diet, and physical inactivity) that contribute to the four major chronic diseases (heart disease, type 2 diabetes, lung disease, and many cancers) and account for half of all deaths worldwide. Three priority research areas were identified: (1) improving the public’s genetic literacy in order to enhance consumer skills; (2) gauging whether genomic information improves risk communication and adoption of healthier behaviors more than current approaches; and (3) exploring whether genomic discovery in concert with emerging technologies can elucidate new behavioral intervention targets. Important crosscutting themes also were identified, including the need to: (1) anticipate directions of genomic discovery; (2) take an agnostic scientific perspective in framing research questions asking whether genomic discovery adds value to other health promotion efforts; and (3) consider multiple levels of influence and systems that contribute to important public health problems. The priorities and themes offer a framework for a variety of stakeholders, including those who develop priorities for research funding, interdisciplinary teams engaged in genomics research, and policymakers grappling with how to use the products born of genomics research to address public health challenges. PMID:20409503

  11. Big Data: Astronomical or Genomical?

    PubMed Central

    Stephens, Zachary D.; Lee, Skylar Y.; Faghri, Faraz; Campbell, Roy H.; Zhai, Chengxiang; Efron, Miles J.; Iyer, Ravishankar; Schatz, Michael C.; Sinha, Saurabh; Robinson, Gene E.

    2015-01-01

    Genomics is a Big Data science and is going to get much bigger, very soon, but it is not known whether the needs of genomics will exceed other Big Data domains. Projecting to the year 2025, we compared genomics with three other major generators of Big Data: astronomy, YouTube, and Twitter. Our estimates show that genomics is a “four-headed beast”—it is either on par with or the most demanding of the domains analyzed here in terms of data acquisition, storage, distribution, and analysis. We discuss aspects of new technologies that will need to be developed to rise up and meet the computational challenges that genomics poses for the near future. Now is the time for concerted, community-wide planning for the “genomical” challenges of the next decade. PMID:26151137

  12. Genome-Wide Gene-Sodium Interaction Analyses on Blood Pressure: The Genetic Epidemiology Network of Salt-Sensitivity Study.

    PubMed

    Li, Changwei; He, Jiang; Chen, Jing; Zhao, Jinying; Gu, Dongfeng; Hixson, James E; Rao, Dabeeru C; Jaquish, Cashell E; Gu, Charles C; Chen, Jichun; Huang, Jianfeng; Chen, Shufeng; Kelly, Tanika N

    2016-08-01

    We performed genome-wide analyses to identify genomic loci that interact with sodium to influence blood pressure (BP) using single-marker-based (1 and 2 df joint tests) and gene-based tests among 1876 Chinese participants of the Genetic Epidemiology Network of Salt-Sensitivity (GenSalt) study. Among GenSalt participants, the average of 3 urine samples was used to estimate sodium excretion. Nine BP measurements were taken using a random zero sphygmomanometer. A total of 2.05 million single-nucleotide polymorphisms were imputed using Affymetrix 6.0 genotype data and the Chinese Han of Beijing and Japanese of Tokyo HapMap reference panel. Promising findings (P<1.00×10(-4)) from GenSalt were evaluated for replication among 775 Chinese participants of the Multi-Ethnic Study of Atherosclerosis (MESA). Single-nucleotide polymorphism and gene-based results were meta-analyzed across the GenSalt and MESA studies to determine genome-wide significance. The 1 df tests identified interactions for UST rs13211840 on diastolic BP (P=3.13×10(-9)). The 2 df tests additionally identified associations for CLGN rs2567241 (P=3.90×10(-12)) and LOC105369882 rs11104632 (P=4.51×10(-8)) with systolic BP. The CLGN variant rs2567241 was also associated with diastolic BP (P=3.11×10(-22)) and mean arterial pressure (P=2.86×10(-15)). Genome-wide gene-based analysis identified MKNK1 (P=6.70×10(-7)), C2orf80 (P<1.00×10(-12)), EPHA6 (P=2.88×10(-7)), SCOC-AS1 (P=4.35×10(-14)), SCOC (P=6.46×10(-11)), CLGN (P=3.68×10(-13)), MGAT4D (P=4.73×10(-11)), ARHGAP42 (P≤1.00×10(-12)), CASP4 (P=1.31×10(-8)), and LINC01478 (P=6.75×10(-10)) that were associated with at least 1 BP phenotype. In summary, we identified 8 novel and 1 previously reported BP loci through the examination of single-nucleotide polymorphism and gene-based interactions with sodium. PMID:27271309

  13. Meta-analysis of genome-wide association studies in five cohorts reveals common variants in RBFOX1, a regulator of tissue-specific splicing, associated with refractive error

    PubMed Central

    Stambolian, Dwight; Wojciechowski, Robert; Oexle, Konrad; Pirastu, Mario; Li, Xiaohui; Raffel, Leslie J.; Cotch, Mary Frances; Chew, Emily Y.; Klein, Barbara; Klein, Ronald; Wong, Tien Y.; Simpson, Claire L.; Klaver, Caroline C.W.; van Duijn, Cornelia M.; Verhoeven, Virginie J.M.; Baird, Paul N.; Vitart, Veronique; Paterson, Andrew D.; Mitchell, Paul; Saw, Seang Mei; Fossarello, Maurizio; Kazmierkiewicz, Krista; Murgia, Federico; Portas, Laura; Schache, Maria; Richardson, Andrea; Xie, Jing; Wang, Jie Jin; Rochtchina, Elena; Viswanathan, Ananth C.; Hayward, Caroline; Wright, Alan F.; Polašek, Ozren; Campbell, Harry; Rudan, Igor; Oostra, Ben A.; Uitterlinden, André G.; Hofman, Albert; Rivadeneira, Fernando; Amin, Najaf; Karssen, Lennart C.; Vingerling, Johannes R.; Hosseini, S.M.; Döring, Angela; Bettecken, Thomas; Vatavuk, Zoran; Gieger, Christian; Wichmann, H.-Erich; Wilson, James F.; Fleck, Brian; Foster, Paul J.; Topouzis, Fotis; McGuffin, Peter; Sim, Xueling; Inouye, Michael; Holliday, Elizabeth G.; Attia, John; Scott, Rodney J.; Rotter, Jerome I.; Meitinger, Thomas; Bailey-Wilson, Joan E.

    2013-01-01

    Visual refractive errors (REs) are complex genetic traits with a largely unknown etiology. To date, genome-wide association studies (GWASs) of moderate size have identified several novel risk markers for RE, measured here as mean spherical equivalent (MSE). We performed a GWAS using a total of 7280 samples from five cohorts: the Age-Related Eye Disease Study (AREDS); the KORA study (‘Cooperative Health Research in the Region of Augsburg’); the Framingham Eye Study (FES); the Ogliastra Genetic Park-Talana (OGP-Talana) Study and the Multiethnic Study of Atherosclerosis (MESA). Genotyping was performed on Illumina and Affymetrix platforms with additional markers imputed to the HapMap II reference panel. We identified a new genome-wide significant locus on chromosome 16 (rs10500355, P = 3.9 × 10−9) in a combined discovery and replication set (26 953 samples). This single nucleotide polymorphism (SNP) is located within the RBFOX1 gene which is a neuron-specific splicing factor regulating a wide range of alternative splicing events implicated in neuronal development and maturation, including transcription factors, other splicing factors and synaptic proteins. PMID:23474815

  14. Privacy in the Genomic Era

    PubMed Central

    NAVEED, MUHAMMAD; AYDAY, ERMAN; CLAYTON, ELLEN W.; FELLAY, JACQUES; GUNTER, CARL A.; HUBAUX, JEAN-PIERRE; MALIN, BRADLEY A.; WANG, XIAOFENG

    2015-01-01

    Genome sequencing technology has advanced at a rapid pace and it is now possible to generate highly-detailed genotypes inexpensively. The collection and analysis of such data has the potential to support various applications, including personalized medical services. While the benefits of the genomics revolution are trumpeted by the biomedical community, the increased availability of such data has major implications for personal privacy; notably because the genome has certain essential features, which include (but are not limited to) (i) an association with traits and certain diseases, (ii) identification capability (e.g., forensics), and (iii) revelation of family relationships. Moreover, direct-to-consumer DNA testing increases the likelihood that genome data will be made available in less regulated environments, such as the Internet and for-profit companies. The problem of genome data privacy thus resides at the crossroads of computer science, medicine, and public policy. While the computer scientists have addressed data privacy for various data types, there has been less attention dedicated to genomic data. Thus, the goal of this paper is to provide a systematization of knowledge for the computer science community. In doing so, we address some of the (sometimes erroneous) beliefs of this field and we report on a survey we conducted about genome data privacy with biomedical specialists. Then, after characterizing the genome privacy problem, we review the state-of-the-art regarding privacy attacks on genomic data and strategies for mitigating such attacks, as well as contextualizing these attacks from the perspective of medicine and public policy. This paper concludes with an enumeration of the challenges for genome data privacy and presents a framework to systematize the analysis of threats and the design of countermeasures as the field moves forward. PMID:26640318

  15. Recombination Drives Vertebrate Genome Contraction

    PubMed Central

    Nam, Kiwoong; Ellegren, Hans

    2012-01-01

    Selective and/or neutral processes may govern variation in DNA content and, ultimately, genome size. The observation in several organisms of a negative correlation between recombination rate and intron size could be compatible with a neutral model in which recombination is mutagenic for length changes. We used whole-genome data on small insertions and deletions within transposable elements from chicken and zebra finch to demonstrate clear links between recombination rate and a number of attributes of reduced DNA content. Recombination rate was negatively correlated with the length of introns, transposable elements, and intergenic spacer and with the rate of short insertions. Importantly, it was positively correlated with gene density, the rate of short deletions, the deletion bias, and the net change in sequence length. All these observations point at a pattern of more condensed genome structure in regions of high recombination. Based on the observed rates of small insertions and deletions and assuming that these rates are representative for the whole genome, we estimate that the genome of the most recent common ancestor of birds and lizards has lost nearly 20% of its DNA content up until the present. Expansion of transposable elements can counteract the effect of deletions in an equilibrium mutation model; however, since the activity of transposable elements has been low in the avian lineage, the deletion bias is likely to have had a significant effect on genome size evolution in dinosaurs and birds, contributing to the maintenance of a small genome. We also demonstrate that most of the observed correlations between recombination rate and genome contraction parameters are seen in the human genome, including for segregating indel polymorphisms. Our data are compatible with a neutral model in which recombination drives vertebrate genome size evolution and gives no direct support for a role of natural selection in this process. PMID:22570634

  16. RECORD: Reference-Assisted Genome Assembly for Closely Related Genomes

    PubMed Central

    Buza, Krisztian; Wilczynski, Bartek; Dojer, Norbert

    2015-01-01

    Background. Next-generation sequencing technologies are now producing multiple times the genome size in total reads from a single experiment. This is enough information to reconstruct at least some of the differences between the individual genome studied in the experiment and the reference genome of the species. However, in most typical protocols, this information is disregarded and the reference genome is used. Results. We provide a new approach that allows researchers to reconstruct genomes very closely related to the reference genome (e.g., mutants of the same species) directly from the reads used in the experiment. Our approach applies de novo assembly software to experimental reads and so-called pseudoreads and uses the resulting contigs to generate a modified reference sequence. In this way, it can very quickly, and at no additional sequencing cost, generate new, modified reference sequence that is closer to the actual sequenced genome and has a full coverage. In this paper, we describe our approach and test its implementation called RECORD. We evaluate RECORD on both simulated and real data. We made our software publicly available on sourceforge. Conclusion. Our tests show that on closely related sequences RECORD outperforms more general assisted-assembly software. PMID:26558255

  17. RECORD: Reference-Assisted Genome Assembly for Closely Related Genomes.

    PubMed

    Buza, Krisztian; Wilczynski, Bartek; Dojer, Norbert

    2015-01-01

    Background. Next-generation sequencing technologies are now producing multiple times the genome size in total reads from a single experiment. This is enough information to reconstruct at least some of the differences between the individual genome studied in the experiment and the reference genome of the species. However, in most typical protocols, this information is disregarded and the reference genome is used. Results. We provide a new approach that allows researchers to reconstruct genomes very closely related to the reference genome (e.g., mutants of the same species) directly from the reads used in the experiment. Our approach applies de novo assembly software to experimental reads and so-called pseudoreads and uses the resulting contigs to generate a modified reference sequence. In this way, it can very quickly, and at no additional sequencing cost, generate new, modified reference sequence that is closer to the actual sequenced genome and has a full coverage. In this paper, we describe our approach and test its implementation called RECORD. We evaluate RECORD on both simulated and real data. We made our software publicly available on sourceforge. Conclusion. Our tests show that on closely related sequences RECORD outperforms more general assisted-assembly software. PMID:26558255

  18. A Genome-Wide Landscape of Retrocopies in Primate Genomes

    PubMed Central

    Navarro, Fábio C.P.; Galante, Pedro A.F.

    2015-01-01

    Gene duplication is a key factor contributing to phenotype diversity across and within species. Although the availability of complete genomes has led to the extensive study of genomic duplications, the dynamics and variability of gene duplications mediated by retrotransposition are not well understood. Here, we predict mRNA retrotransposition and use comparative genomics to investigate their origin and variability across primates. Analyzing seven anthropoid primate genomes, we found a similar number of mRNA retrotranspositions (∼7,500 retrocopies) in Catarrhini (Old Word Monkeys, including humans), but a surprising large number of retrocopies (∼10,000) in Platyrrhini (New World Monkeys), which may be a by-product of higher long interspersed nuclear element 1 activity in these genomes. By inferring retrocopy orthology, we dated most of the primate retrocopy origins, and estimated a decrease in the fixation rate in recent primate history, implying a smaller number of species-specific retrocopies. Moreover, using RNA-Seq data, we identified approximately 3,600 expressed retrocopies. As expected, most of these retrocopies are located near or within known genes, present tissue-specific and even species-specific expression patterns, and no expression correlation to their parental genes. Taken together, our results provide further evidence that mRNA retrotransposition is an active mechanism in primate evolution and suggest that retrocopies may not only introduce great genetic variability between lineages but also create a large reservoir of potentially functional new genomic loci in primate genomes. PMID:26224704

  19. A Genome-Wide Landscape of Retrocopies in Primate Genomes.

    PubMed

    Navarro, Fábio C P; Galante, Pedro A F

    2015-08-01

    Gene duplication is a key factor contributing to phenotype diversity across and within species. Although the availability of complete genomes has led to the extensive study of genomic duplications, the dynamics and variability of gene duplications mediated by retrotransposition are not well understood. Here, we predict mRNA retrotransposition and use comparative genomics to investigate their origin and variability across primates. Analyzing seven anthropoid primate genomes, we found a similar number of mRNA retrotranspositions (∼7,500 retrocopies) in Catarrhini (Old Word Monkeys, including humans), but a surprising large number of retrocopies (∼10,000) in Platyrrhini (New World Monkeys), which may be a by-product of higher long interspersed nuclear element 1 activity in these genomes. By inferring retrocopy orthology, we dated most of the primate retrocopy origins, and estimated a decrease in the fixation rate in recent primate history, implying a smaller number of species-specific retrocopies. Moreover, using RNA-Seq data, we identified approximately 3,600 expressed retrocopies. As expected, most of these retrocopies are located near or within known genes, present tissue-specific and even species-specific expression patterns, and no expression correlation to their parental genes. Taken together, our results provide further evidence that mRNA retrotransposition is an active mechanism in primate evolution and suggest that retrocopies may not only introduce great genetic variability between lineages but also create a large reservoir of potentially functional new genomic loci in primate genomes. PMID:26224704

  20. Linking the genomes of nonmodel teleosts through comparative genomics.

    PubMed

    Sarropoulou, E; Nousdili, D; Magoulas, A; Kotoulas, G

    2008-01-01

    Recently the genomes of two more teleost species have been released: the medaka (Oryzias latipes), and the three-spined stickleback (Gasterosteus aculateus). The rapid developments in genomics of fish species paved the way to new and valuable research in comparative genetics and genomics. With the accumulation of information in model species, the genetic and genomic characterization of nonmodel, but economically important species, is now feasible. Furthermore, comparison of low coverage gene maps of aquacultured fish species against fully sequenced fish species will enhance the efficiency of candidate genes identification projected for quantitative trait loci (QTL) scans for traits of commercial interest. This study shows the syntenic relationship between the genomes of six different teleost species, including three fully sequenced model species: Tetraodon nigroviridis, Oryzias latipes, Gasterosteus aculateus, and three marine species of commercial and evolutionary interest: Sparus aurata, Dicentrarchus labrax, Oreochromis spp. All three commercial fish species belong to the order Perciformes, which is the richest in number of species (approximately 10,000) but poor in terms of available genomic information and tools. Syntenic relationships were established by using 800 EST and microsatellites sequences successfully mapped on the RH map of seabream. Comparison to the stickleback genome produced most positive BLAT hits (58%) followed by medaka (32%) and Tetraodon (30%). Thus, stickleback was used as the major stepping stone to compare seabass and tilapia to seabream. In addition to the significance for the aquaculture industry, this approach can encompass important ecological and evolutionary implications. PMID:18297360

  1. Integrated genome browser: visual analytics platform for genomics

    PubMed Central

    Norris, David C.; Loraine, Ann E.

    2016-01-01

    Motivation: Genome browsers that support fast navigation through vast datasets and provide interactive visual analytics functions can help scientists achieve deeper insight into biological systems. Toward this end, we developed Integrated Genome Browser (IGB), a highly configurable, interactive and fast open source desktop genome browser. Results: Here we describe multiple updates to IGB, including all-new capabilities to display and interact with data from high-throughput sequencing experiments. To demonstrate, we describe example visualizations and analyses of datasets from RNA-Seq, ChIP-Seq and bisulfite sequencing experiments. Understanding results from genome-scale experiments requires viewing the data in the context of reference genome annotations and other related datasets. To facilitate this, we enhanced IGB’s ability to consume data from diverse sources, including Galaxy, Distributed Annotation and IGB-specific Quickload servers. To support future visualization needs as new genome-scale assays enter wide use, we transformed the IGB codebase into a modular, extensible platform for developers to create and deploy all-new visualizations of genomic data. Availability and implementation: IGB is open source and is freely available from http://bioviz.org/igb. Contact: aloraine@uncc.edu PMID:27153568

  2. Unlocking hidden genomic sequence

    PubMed Central

    Keith, Jonathan M.; Cochran, Duncan A. E.; Lala, Gita H.; Adams, Peter; Bryant, Darryn; Mitchelson, Keith R.

    2004-01-01

    Despite the success of conventional Sanger sequencing, significant regions of many genomes still present major obstacles to sequencing. Here we propose a novel approach with the potential to alleviate a wide range of sequencing difficulties. The technique involves extracting target DNA sequence from variants generated by introduction of random mutations. The introduction of mutations does not destroy original sequence information, but distributes it amongst multiple variants. Some of these variants lack problematic features of the target and are more amenable to conventional sequencing. The technique has been successfully demonstrated with mutation levels up to an average 18% base substitution and has been used to read previously intractable poly(A), AT-rich and GC-rich motifs. PMID:14973330

  3. Genomics in Neurological Disorders

    PubMed Central

    Han, Guangchun; Sun, Jiya; Wang, Jiajia; Bai, Zhouxian; Song, Fuhai; Lei, Hongxing

    2014-01-01

    Neurological disorders comprise a variety of complex diseases in the central nervous system, which can be roughly classified as neurodegenerative diseases and psychiatric disorders. The basic and translational research of neurological disorders has been hindered by the difficulty in accessing the pathological center (i.e., the brain) in live patients. The rapid advancement of sequencing and array technologies has made it possible to investigate the disease mechanism and biomarkers from a systems perspective. In this review, recent progresses in the discovery of novel risk genes, treatment targets and peripheral biomarkers employing genomic technologies will be discussed. Our major focus will be on two of the most heavily investigated neurological disorders, namely Alzheimer’s disease and autism spectrum disorder. PMID:25108264

  4. Genomics of Atrial Fibrillation.

    PubMed

    Gutierrez, Alejandra; Chung, Mina K

    2016-06-01

    Atrial fibrillation (AF) is a common clinical arrhythmia that appears to be highly heritable, despite representing a complex interplay of several disease processes that generally do not manifest until later in life. In this manuscript, we will review the genetic basis of this complex trait established through studies of familial AF, linkage and candidate gene studies of common AF, genome wide association studies (GWAS) of common AF, and transcriptomic studies of AF. Since AF is associated with a five-fold increase in the risk of stroke, we also review the intersection of common genetic factors associated with both of these conditions. Similarly, we highlight the intersection of common genetic markers associated with some risk factors for AF, such as hypertension and obesity, and AF. Lastly, we describe a paradigm where genetic factors predispose to the risk of AF, but which may require additional stress and trigger factors in older age to allow for the clinical manifestation of AF. PMID:27139902

  5. Genome patent fight erupts

    SciTech Connect

    Roberts, L.

    1991-10-11

    At a Congressional briefing while describing a new project to sequence partially every gene active in the human brain, it was made known that the National Institutes of Health was planning to file patent applications on 1,000 of these sequences a month. The scheme has engendered a firestorm of criticism from genome scientists and project officials alike. The critics argue that these sequences probably can't be patented in the first place - and even if they can, they shouldn't be. The plan would undercut patent protection for those who labor long and hard at the real task of elucidating the function of the proteins encoded by the genes, thereby driving industry away from developing inventions based on that work.

  6. The South Asian Genome

    PubMed Central

    Scott, William R.; Tan, Sian-Tsung; Afzal, Uzma; Afaq, Saima; Loh, Marie; Lehne, Benjamin; O'Reilly, Paul; Gaulton, Kyle J.; Pearson, Richard D.; Li, Xinzhong; Lavery, Anita; Vandrovcova, Jana; Wass, Mark N.; Miller, Kathryn; Sehmi, Joban; Oozageer, Laticia; Kooner, Ishminder K.; Al-Hussaini, Abtehale; Mills, Rebecca; Grewal, Jagvir; Panoulas, Vasileios; Lewin, Alexandra M.; Northwood, Korrinne; Wander, Gurpreet S.; Geoghegan, Frank; Li, Yingrui; Wang, Jun; Aitman, Timothy J.; McCarthy, Mark I.

    2014-01-01

    The genetic sequence variation of people from the Indian subcontinent who comprise one-quarter of the world's population, is not well described. We carried out whole genome sequencing of 168 South Asians, along with whole-exome sequencing of 147 South Asians to provide deeper characterisation of coding regions. We identify 12,962,155 autosomal sequence variants, including 2,946,861 new SNPs and 312,738 novel indels. This catalogue of SNPs and indels amongst South Asians provides the first comprehensive map of genetic variation in this major human population, and reveals evidence for selective pressures on genes involved in skin biology, metabolism, infection and immunity. Our results will accelerate the search for the genetic variants underlying susceptibility to disorders such as type-2 diabetes and cardiovascular disease which are highly prevalent amongst South Asians. PMID:25115870

  7. Plantagora: Modeling Whole Genome Sequencing and Assembly of Plant Genomes

    PubMed Central

    Barthelson, Roger; McFarlin, Adam J.; Rounsley, Steven D.; Young, Sarah

    2011-01-01

    Background Genomics studies are being revolutionized by the next generation sequencing technologies, which have made whole genome sequencing much more accessible to the average researcher. Whole genome sequencing with the new technologies is a developing art that, despite the large volumes of data that can be produced, may still fail to provide a clear and thorough map of a genome. The Plantagora project was conceived to address specifically the gap between having the technical tools for genome sequencing and knowing precisely the best way to use them. Methodology/Principal Findings For Plantagora, a platform was created for generating simulated reads from several different plant genomes of different sizes. The resulting read files mimicked either 454 or Illumina reads, with varying paired end spacing. Thousands of datasets of reads were created, most derived from our primary model genome, rice chromosome one. All reads were assembled with different software assemblers, including Newbler, Abyss, and SOAPdenovo, and the resulting assemblies were evaluated by an extensive battery of metrics chosen for these studies. The metrics included both statistics of the assembly sequences and fidelity-related measures derived by alignment of the assemblies to the original genome source for the reads. The results were presented in a website, which includes a data graphing tool, all created to help the user compare rapidly the feasibility and effectiveness of different sequencing and assembly strategies prior to testing an approach in the lab. Some of our own conclusions regarding the different strategies were also recorded on the website. Conclusions/Significance Plantagora provides a substantial body of information for comparing different approaches to sequencing a plant genome, and some conclusions regarding some of the specific approaches. Plantagora also provides a platform of metrics and tools for studying the process of sequencing and assembly further. PMID:22174807

  8. Microbial Genomics Data from the DOE Joint Genome Institute (JGI)

    DOE Data Explorer

    The JGI makes high-quality genome sequencing data freely available to the greater scientific community through its web portal. Having played a significant role in the federally funded Human Genome Project -- generating the complete sequences of Chromosomes 5, 16, and 19--the JGI has now moved on to contributing in other critical areas of genomics research. While NIH-funded genome sequencing activities continue to emphasize human biomedical targets and applications, the JGI has since shifted its focus to the non-human components of the biosphere, particularly those relevant to the science mission of the Department of Energy. With efficiencies of scale established at the PGF, and capacity now exceeding three billion bases generated on a monthly basis, the JGI has tackled scores of additional genomes. These include more than 60 microbial genomes and many important multicellular organisms and communities of microbes. In partnership with other federal institutions and universities, the JGI is in the process of sequencing a frog (Xenopus tropicalis), a green alga (Chlamydomonas reinhardtii), a diatom (Thalassiosira pseudonana) , the cottonwood tree (Populus trichocarpa), and a host of agriculturally important plants and plant pathogens. Microorganisms, for example those that thrive under extreme conditions such as high acidity, radiation, and metal contamination, are of particular interest to the DOE and JGI. Investigations by JGI and its partners are shedding light on the cellular machinery of microbes and how they can be harnessed to clean up contaminated soil or water, capture carbon from the atmosphere, and produce potentially important sources of energy such as hydrogen and methane. [Excerpt from the JGI page "Who We Are" at http://www.jgi.doe.gov/whoweare/whoweare.html] From the JGI webportal users can view a photo grid of organisims, check assemblies for status, access the Integrated Microbial Genomes (IMG) system to do comparative analysis of publicly available

  9. Genome-wide analysis of DNA methylation and gene expression patterns in purified, uncultured human liver cells and activated hepatic stellate cells

    PubMed Central

    Reiner, Andrew H.; Coll, Mar; Verhulst, Stefaan; Mannaerts, Inge; Øie, Cristina I.; Smedsrød, Bård; Najimi, Mustapha; Sokal, Etienne; Luttun, Aernout; Sancho-Bru, Pau; Collas, Philippe; van Grunsven, Leo A.

    2015-01-01

    Background & Aims Liver fibrogenesis – scarring of the liver that can lead to cirrhosis and liver cancer – is characterized by hepatocyte impairment, capillarization of liver sinusoidal endothelial cells (LSECs) and hepatic stellate cell (HSC) activation. To date, the molecular determinants of a healthy human liver cell phenotype remain largely uncharacterized. Here, we assess the transcriptome and the genome-wide promoter methylome specific for purified, non-cultured human hepatocytes, LSECs and HSCs, and investigate the nature of epigenetic changes accompanying transcriptional changes associated with activation of HSCs. Material and methods Gene expression profile and promoter methylome of purified, uncultured human liver cells and culture-activated HSCs were respectively determined using Affymetrix HG-U219 genechips and by methylated DNA immunoprecipitation coupled to promoter array hybridization. Histone modification patterns were assessed at the single-gene level by chromatin immunoprecipitation and quantitative PCR. Results We unveil a DNA-methylation-based epigenetic relationship between hepatocytes, LSECs and HSCs despite their distinct ontogeny. We show that liver cell type-specific DNA methylation targets early developmental and differentiation-associated functions. Integrative analysis of promoter methylome and transcriptome reveals partial concordance between DNA methylation and transcriptional changes associated with human HSC activation. Further, we identify concordant histone methylation and acetylation changes in the promoter and putative novel enhancer elements of genes involved in liver fibrosis. Conclusions Our study provides the first epigenetic blueprint of three distinct freshly isolated, human hepatic cell types and of epigenetic changes elicited upon HSC activation. PMID:26353929

  10. Genome wide expression analysis of the effect of the Chinese patent medicine Zilongjin tablet on four human lung carcinoma cell lines.

    PubMed

    Zhang, Ping; Wang, Xin; Xiong, Songjin; Wen, Shaoping; Gao, Song; Wang, Lei; Cao, Boyang

    2011-10-01

    Zilongjin (ZLJ) tablet, which is a traditional Chinese medicine, has been approved as a new anti-tumor drug by the State Food and Drug Administration of China; however, its anti-cancer mechanisms remain elusive. The goal of this study was to investigate the underlying anti-cancer activities of ZLJ tablet in vitro. In this study, four lung cancer cell lines, A549, H446, H460 and H520, were treated with 2.2 mg/mL of ZLJ solution for 24 h at 37 °C under 5% CO(2) . RNA was isolated and a microarray experiment using the Affymetrix Human Genome U133 plus 2.0 Array was employed to differentiate the expression patterns of cancer-related genes after drug treatment. Of 483 genes in 63 functional categories and 25 different pathways that showed at least a 2-fold change of expression level in the four cancer cell lines, 170 genes were upregulated, and 313 genes were downregulated. Eleven of the 483 genes were cancer-related and belong to the three known pathways: apoptosis, cell cycle regulation and mitogen-activated protein kinase (MAPK) cascade. The microarray data were validated by real-time RT-PCR. The results of this investigation suggest possible anti-cancer mechanisms of the ZLJ tablet, and lay a foundation to further analyse its therapeutic roles. PMID:21953710

  11. Exploring Prostate Cancer Genome Reveals Simultaneous Losses of PTEN, FAS and PAPSS2 in Patients with PSA Recurrence after Radical Prostatectomy

    PubMed Central

    Ibeawuchi, Chinyere; Schmidt, Hartmut; Voss, Reinhard; Titze, Ulf; Abbas, Mahmoud; Neumann, Joerg; Eltze, Elke; Hoogland, Agnes Marije; Jenster, Guido; Brandt, Burkhard; Semjonow, Axel

    2015-01-01

    The multifocal nature of prostate cancer (PCa) creates a challenge to patients’ outcome prediction and their clinical management. An approach that scrutinizes every cancer focus is needed in order to generate a comprehensive evaluation of the disease, and by correlating to patients’ clinico-pathological information, specific prognostic biomarker can be identified. Our study utilized the Affymetrix SNP 6.0 Genome-wide assay to investigate forty-three fresh frozen PCa tissue foci from twenty-three patients. With a long clinical follow-up period that ranged from 2.0–9.7 (mean 5.4) years, copy number variation (CNV) data was evaluated for association with patients’ PSA status during follow-up. From our results, the loss of unique genes on 10q23.31 and 10q23.2–10q23.31 were identified to be significantly associated to PSA recurrence (p < 0.05). The implication of PTEN and FAS loss (10q23.31) support previous reports due to their critical roles in prostate carcinogenesis. Furthermore, we hypothesize that the PAPSS2 gene (10q23.2–10q23.31) may be functionally relevant in post-operative PSA recurrence because of its reported role in androgen biosynthesis. It is suggestive that the loss of the susceptible region on chromosome 10q, which implicates PTEN, FAS and PAPSS2 may serve as genetic predictors of PSA recurrence after radical prostatectomy. PMID:25679447

  12. Insights into conifer giga-genomes.

    PubMed

    De La Torre, Amanda R; Birol, Inanc; Bousquet, Jean; Ingvarsson, Pär K; Jansson, Stefan; Jones, Steven J M; Keeling, Christopher I; MacKay, John; Nilsson, Ove; Ritland, Kermit; Street, Nathaniel; Yanchuk, Alvin; Zerbe, Philipp; Bohlmann, Jörg

    2014-12-01

    Insights from sequenced genomes of major land plant lineages have advanced research in almost every aspect of plant biology. Until recently, however, assembled genome sequences of gymnosperms have been missing from this picture. Conifers of the pine family (Pinaceae) are a group of gymnosperms that dominate large parts of the world's forests. Despite their ecological and economic importance, conifers seemed long out of reach for complete genome sequencing, due in part to their enormous genome size (20-30 Gb) and the highly repetitive nature of their genomes. Technological advances in genome sequencing and assembly enabled the recent publication of three conifer genomes: white spruce (Picea glauca), Norway spruce (Picea abies), and loblolly pine (Pinus taeda). These genome sequences revealed distinctive features compared with other plant genomes and may represent a window into the past of seed plant genomes. This Update highlights recent advances, remaining challenges, and opportunities in light of the publication of the first conifer and gymnosperm genomes. PMID:25349325

  13. Jumbled genomes: missing Apicomplexan synteny.

    PubMed

    DeBarry, Jeremy D; Kissinger, Jessica C

    2011-10-01

    Whole-genome comparisons provide insight into genome evolution by informing on gene repertoires, gene gains/losses, and genome organization. Most of our knowledge about eukaryotic genome evolution is derived from studies of multicellular model organisms. The eukaryotic phylum Apicomplexa contains obligate intracellular protist parasites responsible for a wide range of human and veterinary diseases (e.g., malaria, toxoplasmosis, and theileriosis). We have developed an in silico protein-encoding gene based pipeline to investigate synteny across 12 apicomplexan species from six genera. Genome rearrangement between lineages is extensive. Syntenic regions (conserved gene content and order) are rare between lineages and appear to be totally absent across the phylum, with no group of three genes found on the same chromosome and in the same order within 25 kb up- and downstream of any orthologous genes. Conserved synteny between major lineages is limited to small regions in Plasmodium and Theileria/Babesia species, and within these conserved regions, there are a number of proteins putatively targeted to organelles. The observed overall lack of synteny is surprising considering the divergence times and the apparent absence of transposable elements (TEs) within any of the species examined. TEs are ubiquitous in all other groups of eukaryotes studied to date and have been shown to be involved in genomic rearrangements. It appears that there are different criteria governing genome evolution within the Apicomplexa relative to other well-studied unicellular and multicellular eukaryotes. PMID:21504890

  14. The genome of Eucalyptus grandis

    SciTech Connect

    Myburg, Alexander A.; Grattapaglia, Dario; Tuskan, Gerald A.; Hellsten, Uffe; Hayes, Richard D.; Grimwood, Jane; Jenkins, Jerry; Lindquist, Erika; Tice, Hope; Bauer, Diane; Goodstein, David M.; Dubchak, Inna; Poliakov, Alexandre; Mizrachi, Eshchar; Kullan, Anand R. K.; Hussey, Steven G.; Pinard, Desre; van der Merwe, Karen; Singh, Pooja; van Jaarsveld, Ida; Silva-Junior, Orzenil B.; Togawa, Roberto C.; Pappas, Marilia R.; Faria, Danielle A.; Sansaloni, Carolina P.; Petroli, Cesar D.; Yang, Xiaohan; Ranjan, Priya; Tschaplinski, Timothy J.; Ye, Chu-Yu; Li, Ting; Sterck, Lieven; Vanneste, Kevin; Murat, Florent; Soler, Marçal; Clemente, Hélène San; Saidi, Naijib; Cassan-Wang, Hua; Dunand, Christophe; Hefer, Charles A.; Bornberg-Bauer, Erich; Kersting, Anna R.; Vining, Kelly; Amarasinghe, Vindhya; Ranik, Martin; Naithani, Sushma; Elser, Justin; Boyd, Alexander E.; Liston, Aaron; Spatafora, Joseph W.; Dharmwardhana, Palitha; Raja, Rajani; Sullivan, Christopher; Romanel, Elisson; Alves-Ferreira, Marcio; Külheim, Carsten; Foley, William; Carocha, Victor; Paiva, Jorge; Kudrna, David; Brommonschenkel, Sergio H.; Pasquali, Giancarlo; Byrne, Margaret; Rigault, Philippe; Tibbits, Josquin; Spokevicius, Antanas; Jones, Rebecca C.; Steane, Dorothy A.; Vaillancourt, René E.; Potts, Brad M.; Joubert, Fourie; Barry, Kerrie; Pappas, Georgios J.; Strauss, Steven H.; Jaiswal, Pankaj; Grima-Pettenati, Jacqueline; Salse, Jérôme; Van de Peer, Yves; Rokhsar, Daniel S.; Schmutz, Jeremy

    2014-06-11

    Eucalypts are the world s most widely planted hardwood trees. Their broad adaptability, rich species diversity, fast growth and superior multipurpose wood, have made them a global renewable resource of fiber and energy that mitigates human pressures on natural forests. We sequenced and assembled >94% of the 640 Mbp genome of Eucalyptus grandis into its 11 chromosomes. A set of 36,376 protein coding genes were predicted revealing that 34% occur in tandem duplications, the largest proportion found thus far in any plant genome. Eucalypts also show the highest diversity of genes for plant specialized metabolism that act as chemical defence against biotic agents and provide unique pharmaceutical oils. Resequencing of a set of inbred tree genomes revealed regions of strongly conserved heterozygosity, likely hotspots of inbreeding depression. The resequenced genome of the sister species E. globulus underscored the high inter-specific genome colinearity despite substantial genome size variation in the genus. The genome of E. grandis is the first reference for the early diverging Rosid order Myrtales and is placed here basal to the Eurosids. This resource expands knowledge on the unique biology of large woody perennials and provides a powerful tool to accelerate comparative biology, breeding and biotechnology.

  15. Genomics in the ecological arena.

    PubMed

    Orsini, Luisa; Decaestecker, Ellen; De Meester, Luc; Pfrender, Michael E; Colbourne, John K

    2011-02-23

    This meeting report presents the cutting-edge research that is developing around the waterflea Daphnia, an emerging model system in environmental genomics. Daphnia has been a model species in ecology, toxicology and evolution for many years and is supported by a large community of ecologists, evolutionary biologists and ecotoxicologists. Thanks to new advances in genomics and transciptomics and to the sustained efforts of the Daphnia Genomics Consortium (DGC), Daphnia is also rapidly developing as a model system in environmental genomics. Advances in this emerging field were presented at the DGC 2010, held for the first time in a European University. During the meeting, a plethora of elegant studies were presented on the mechanisms of responses to environmental challenges using recently developed genomic tools. The DGC 2010 is a concrete example of the new trends in ecology and evolution. The times are mature for the application of innovative genomic and transcriptomic tools for studies of environmental genomics in non-model organisms. PMID:20702453

  16. Clinical genomics: from a truly personal genome viewpoint.

    PubMed

    Lupski, James R

    2016-06-01

    The path to Clinical Genomics is punctuated by our understanding of what types of DNA structural and sequence variation contribute to disease, the many technical challenges to detect such variation genome-wide, and the initial struggles to interpret personal genome variation in the context of disease. This review describes one perspective of the development of clinical genomics; whereas the experimental challenges, and hurdles to overcoming them, might be deemed readily apparent, the non-technical issues for clinical implementation may be less obvious. Some of these latter challenges, including: (1) informed consent, (2) privacy, (3) what constitutes potentially pathogenic variation contributing to disease, (4) disease penetrance in populations, and (5) the genetic architecture of disease, and the struggles sometimes faced for solutions, are highlighted using illustrative examples. PMID:27221143

  17. Comparative genomics reveals insights into avian genome evolution and adaptation

    PubMed Central

    Zhang, Guojie; Li, Cai; Li, Qiye; Li, Bo; Larkin, Denis M.; Lee, Chul; Storz, Jay F.; Antunes, Agostinho; Greenwold, Matthew J.; Meredith, Robert W.; Ödeen, Anders; Cui, Jie; Zhou, Qi; Xu, Luohao; Pan, Hailin; Wang, Zongji; Jin, Lijun; Zhang, Pei; Hu, Haofu; Yang, Wei; Hu, Jiang; Xiao, Jin; Yang, Zhikai; Liu, Yang; Xie, Qiaolin; Yu, Hao; Lian, Jinmin; Wen, Ping; Zhang, Fang; Li, Hui; Zeng, Yongli; Xiong, Zijun; Liu, Shiping; Zhou, Long; Huang, Zhiyong; An, Na; Wang, Jie; Zheng, Qiumei; Xiong, Yingqi; Wang, Guangbiao; Wang, Bo; Wang, Jingjing; Fan, Yu; da Fonseca, Rute R.; Alfaro-Núñez, Alonzo; Schubert, Mikkel; Orlando, Ludovic; Mourier, Tobias; Howard, Jason T.; Ganapathy, Ganeshkumar; Pfenning, Andreas; Whitney, Osceola; Rivas, Miriam V.; Hara, Erina; Smith, Julia; Farré, Marta; Narayan, Jitendra; Slavov, Gancho; Romanov, Michael N; Borges, Rui; Machado, João Paulo; Khan, Imran; Springer, Mark S.; Gatesy, John; Hoffmann, Federico G.; Opazo, Juan C.; Håstad, Olle; Sawyer, Roger H.; Kim, Heebal; Kim, Kyu-Won; Kim, Hyeon Jeong; Cho, Seoae; Li, Ning; Huang, Yinhua; Bruford, Michael W.; Zhan, Xiangjiang; Dixon, Andrew; Bertelsen, Mads F.; Derryberry, Elizabeth; Warren, Wesley; Wilson, Richard K; Li, Shengbin; Ray, David A.; Green, Richard E.; O’Brien, Stephen J.; Griffin, Darren; Johnson, Warren E.; Haussler, David; Ryder, Oliver A.; Willerslev, Eske; Graves, Gary R.; Alström, Per; Fjeldså, Jon; Mindell, David P.; Edwards, Scott V.; Braun, Edward L.; Rahbek, Carsten; Burt, David W.; Houde, Peter; Zhang, Yong; Yang, Huanming; Wang, Jian; Jarvis, Erich D.; Gilbert, M. Thomas P.; Wang, Jun

    2015-01-01

    Birds are the most species-rich class of tetrapod vertebrates and have wide relevance across many research fields. We explored bird macroevolution using full genomes from 48 avian species representing all major extant clades. The avian genome is principally characterized by its constrained size, which predominantly arose because of lineage-specific erosion of repetitive elements, large segmental deletions, and gene loss. Avian genomes furthermore show a remarkably high degree of evolutionary stasis at the levels of nucleotide sequence, gene synteny, and chromosomal structure. Despite this pattern of conservation, we detected many non-neutral evolutionary changes in protein-coding genes and noncoding regions. These analyses reveal that pan-avian genomic diversity covaries with adaptations to different lifestyles and convergent evolution of traits. PMID:25504712

  18. Orthology for comparative genomics in the mouse genome database.

    PubMed

    Dolan, Mary E; Baldarelli, Richard M; Bello, Susan M; Ni, Li; McAndrews, Monica S; Bult, Carol J; Kadin, James A; Richardson, Joel E; Ringwald, Martin; Eppig, Janan T; Blake, Judith A

    2015-08-01

    The mouse genome database (MGD) is the model organism database component of the mouse genome informatics system at The Jackson Laboratory. MGD is the international data resource for the laboratory mouse and facilitates the use of mice in the study of human health and disease. Since its beginnings, MGD has included comparative genomics data with a particular focus on human-mouse orthology, an essential component of the use of mouse as a model organism. Over the past 25 years, novel algorithms and addition of orthologs from other model organisms have enriched comparative genomics in MGD data, extending the use of orthology data to support the laboratory mouse as a model of human biology. Here, we describe current comparative data in MGD and review the history and refinement of orthology representation in this resource. PMID:26223881

  19. Applied genomics: Tools ranging from genomic prediction to bioconservation

    Technology Transfer Automated Retrieval System (TEKTRAN)

    This invited presentation will provide an overview of the development of genomic tools in cattle and goats, and how these approaches and methodologies can be adapted for bioconservation of endangered ruminant species....

  20. Genome Modeling System: A Knowledge Management Platform for Genomics

    PubMed Central

    Griffith, Malachi; Griffith, Obi L.; Smith, Scott M.; Ramu, Avinash; Callaway, Matthew B.; Brummett, Anthony M.; Kiwala, Michael J.; Coffman, Adam C.; Regier, Allison A.; Oberkfell, Ben J.; Sanderson, Gabriel E.; Mooney, Thomas P.; Nutter, Nathaniel G.; Belter, Edward A.; Du, Feiyu; Long, Robert L.; Abbott, Travis E.; Ferguson, Ian T.; Morton, David L.; Burnett, Mark M.; Weible, James V.; Peck, Joshua B.; Dukes, Adam; McMichael, Joshua F.; Lolofie, Justin T.; Derickson, Brian R.; Hundal, Jasreet; Skidmore, Zachary L.; Ainscough, Benjamin J.; Dees, Nathan D.; Schierding, William S.; Kandoth, Cyriac; Kim, Kyung H.; Lu, Charles; Harris, Christopher C.; Maher, Nicole; Maher, Christopher A.; Magrini, Vincent J.; Abbott, Benjamin S.; Chen, Ken; Clark, Eric; Das, Indraniel; Fan, Xian; Hawkins, Amy E.; Hepler, Todd G.; Wylie, Todd N.; Leonard, Shawn M.; Schroeder, William E.; Shi, Xiaoqi; Carmichael, Lynn K.; Weil, Matthew R.; Wohlstadter, Richard W.; Stiehr, Gary; McLellan, Michael D.; Pohl, Craig S.; Miller, Christopher A.; Koboldt, Daniel C.; Walker, Jason R.; Eldred, James M.; Larson, David E.; Dooling, David J.; Ding, Li; Mardis, Elaine R.; Wilson, Richard K.

    2015-01-01

    In this work, we present the Genome Modeling System (GMS), an analysis information management system capable of executing automated genome analysis pipelines at a massive scale. The GMS framework provides detailed tracking of samples and data coupled with reliable and repeatable analysis pipelines. The GMS also serves as a platform for bioinformatics development, allowing a large team to collaborate on data analysis, or an individual researcher to leverage the work of others effectively within its data management system. Rather than separating ad-hoc analysis from rigorous, reproducible pipelines, the GMS promotes systematic integration between the two. As a demonstration of the GMS, we performed an integrated analysis of whole genome, exome and transcriptome sequencing data from a breast cancer cell line (HCC1395) and matched lymphoblastoid line (HCC1395BL). These data are available for users to test the software, complete tutorials and develop novel GMS pipeline configurations. The GMS is available at https://github.com/genome/gms. PMID:26158448

  1. Comparative genomics reveals insights into avian genome evolution and adaptation.

    PubMed

    Zhang, Guojie; Li, Cai; Li, Qiye; Li, Bo; Larkin, Denis M; Lee, Chul; Storz, Jay F; Antunes, Agostinho; Greenwold, Matthew J; Meredith, Robert W; Ödeen, Anders; Cui, Jie; Zhou, Qi; Xu, Luohao; Pan, Hailin; Wang, Zongji; Jin, Lijun; Zhang, Pei; Hu, Haofu; Yang, Wei; Hu, Jiang; Xiao, Jin; Yang, Zhikai; Liu, Yang; Xie, Qiaolin; Yu, Hao; Lian, Jinmin; Wen, Ping; Zhang, Fang; Li, Hui; Zeng, Yongli; Xiong, Zijun; Liu, Shiping; Zhou, Long; Huang, Zhiyong; An, Na; Wang, Jie; Zheng, Qiumei; Xiong, Yingqi; Wang, Guangbiao; Wang, Bo; Wang, Jingjing; Fan, Yu; da Fonseca, Rute R; Alfaro-Núñez, Alonzo; Schubert, Mikkel; Orlando, Ludovic; Mourier, Tobias; Howard, Jason T; Ganapathy, Ganeshkumar; Pfenning, Andreas; Whitney, Osceola; Rivas, Miriam V; Hara, Erina; Smith, Julia; Farré, Marta; Narayan, Jitendra; Slavov, Gancho; Romanov, Michael N; Borges, Rui; Machado, João Paulo; Khan, Imran; Springer, Mark S; Gatesy, John; Hoffmann, Federico G; Opazo, Juan C; Håstad, Olle; Sawyer, Roger H; Kim, Heebal; Kim, Kyu-Won; Kim, Hyeon Jeong; Cho, Seoae; Li, Ning; Huang, Yinhua; Bruford, Michael W; Zhan, Xiangjiang; Dixon, Andrew; Bertelsen, Mads F; Derryberry, Elizabeth; Warren, Wesley; Wilson, Richard K; Li, Shengbin; Ray, David A; Green, Richard E; O'Brien, Stephen J; Griffin, Darren; Johnson, Warren E; Haussler, David; Ryder, Oliver A; Willerslev, Eske; Graves, Gary R; Alström, Per; Fjeldså, Jon; Mindell, David P; Edwards, Scott V; Braun, Edward L; Rahbek, Carsten; Burt, David W; Houde, Peter; Zhang, Yong; Yang, Huanming; Wang, Jian; Jarvis, Erich D; Gilbert, M Thomas P; Wang, Jun

    2014-12-12

    Birds are the most species-rich class of tetrapod vertebrates and have wide relevance across many research fields. We explored bird macroevolution using full genomes from 48 avian species representing all major extant clades. The avian genome is principally characterized by its constrained size, which predominantly arose because of lineage-specific erosion of repetitive elements, large segmental deletions, and gene loss. Avian genomes furthermore show a remarkably high degree of evolutionary stasis at the levels of nucleotide sequence, gene synteny, and chromosomal structure. Despite this pattern of conservation, we detected many non-neutral evolutionary changes in protein-coding genes and noncoding regions. These analyses reveal that pan-avian genomic diversity covaries with adaptations to different lifestyles and convergent evolution of traits. PMID:25504712

  2. Behavior, Brain, and Genome in Genomic Disorders: Finding the Correspondences

    PubMed Central

    Grigorenko, Elena L.; Urban, Alexander E.; Mencl, Einar

    2014-01-01

    Objective Within the last decade or so, there has been an acceleration of research attempting to connect specific genetic lesions to patterns of brain structure and activation. This article comments on observations that have been made based on these recent data and discusses their importance for the field of investigations into developmental disorders. Method In making these observations, we focus on one specific genomic lesion, the well-studied, yet still incompletely understood, 22q11.2 deletion syndrome (22q11.2DS). Results We demonstrate the degree of variability in the phenotype that occurs at both the brain and behavioral levels of genomic disorders, and describe how this variability is, upon close inspection, represented at the genomic level. Conclusion We emphasize the importance of combining genetic/genomic analyses and neuroimaging for research and for future clinical diagnostic purposes, and for the purposes of developing individualized, patient-tailored treatment and remediation approaches. PMID:20814258

  3. In quest of genomic treasure

    PubMed Central

    INOUE, Kimiko; OGURA, Atsuo

    2015-01-01

    It should be emphasized that “129” is not simply a number but is also the designation of a mouse strain that has made a great contribution to modern biological science and technology. Embryonic stem cells derived from 129 mice were essential components of gene-targeting strategies in early research. More recently, 129 mice have provided superior donor genomes for cloning by nuclear transfer. Some factor or factors conferring genomic plasticity must exist in the 129 genome, but these remain unidentified. PMID:26400375

  4. Human genome. 1993 Program report

    SciTech Connect

    Not Available

    1994-03-01

    The purpose of this report is to update the Human Genome 1991-92 Program Report and provide new information on the DOE genome program to researchers, program managers, other government agencies, and the interested public. This FY 1993 supplement includes abstracts of 60 new or renewed projects and listings of 112 continuing and 28 completed projects. These two reports, taken together, present the most complete published view of the DOE Human Genome Program through FY 1993. Research is progressing rapidly toward 15-year goals of mapping and sequencing the DNA of each of the 24 different human chromosomes.

  5. Challenges in Whole-Genome Annotation of Pyrosequenced Eukaryotic Genomes

    SciTech Connect

    Kuo, Alan; Grigoriev, Igor

    2009-04-17

    Pyrosequencing technologies such as 454/Roche and Solexa/Illumina vastly lower the cost of nucleotide sequencing compared to the traditional Sanger method, and thus promise to greatly expand the number of sequenced eukaryotic genomes. However, the new technologies also bring new challenges such as shorter reads and new kinds and higher rates of sequencing errors, which complicate genome assembly and gene prediction. At JGI we are deploying 454 technology for the sequencing and assembly of ever-larger eukaryotic genomes. Here we describe our first whole-genome annotation of a purely 454-sequenced fungal genome that is larger than a yeast (>30 Mbp). The pezizomycotine (filamentous ascomycote) Aspergillus carbonarius belongs to the Aspergillus section Nigri species complex, members of which are significant as platforms for bioenergy and bioindustrial technology, as members of soil microbial communities and players in the global carbon cycle, and as agricultural toxigens. Application of a modified version of the standard JGI Annotation Pipeline has so far predicted ~;;10k genes. ~;;12percent of these preliminary annotations suffer a potential frameshift error, which is somewhat higher than the ~;;9percent rate in the Sanger-sequenced and conventionally assembled and annotated genome of fellow Aspergillus section Nigri member A. niger. Also,>90percent of A. niger genes have potential homologs in the A. carbonarius preliminary annotation. Weconclude, and with further annotation and comparative analysis expect to confirm, that 454 sequencing strategies provide a promising substrate for annotation of modestly sized eukaryotic genomes. We will also present results of annotation of a number of other pyrosequenced fungal genomes of bioenergy interest.

  6. Genomic structural variants are linked with intellectual disability.

    PubMed

    Bulayeva, Kazima; Lesch, Klaus-Peter; Bulayev, Oleg; Walsh, Christopher; Glatt, Stephen; Gurgenova, Farida; Omarova, Jamilja; Berdichevets, Irina; Thompson, Paul M

    2015-09-01

    Mutations in more than 500 genes have been associated with intellectual disability (ID) and related disorders of cognitive function, such as autism and schizophrenia. Here we aimed to unravel the molecular epidemiology of non-specific ID in a genetic isolate using a combination of population and molecular genetic approaches. A large multigenerational pedigree was ascertained within a Dagestan Genetic Heritage research program in a genetic isolate of indigenous ethnics. Clinical characteristics of the affected members were based on combining diagnoses from regional psychiatric hospitals with our own clinical assessment, using a Russian translation of the structured psychiatric interviews, the Diagnostic Interview for Genetic Studies and the Family Interview for Genetic Studies, based on DSM-IV criteria. Weber/CHLC 9.0 STRs set was used for multipoint parametric linkage analyses (Simwalk2.91). Next, we checked CNVs and LOH (based on Affymetrix SNP 5.0 data) in the linked with ID genomic regions with the aim to identify candidate genes associated with mutations in linked regions. The number of statistically significant (p ≤ 0.05) suggestive linkage peaks with 1.3 < LOD < 3.0 we detected in a total of 10 genomic regions: 1q41, 2p25.3-p24.2, 3p13-p12.1, 4q13.3, 10p11, 11q23, 12q24.22-q24.31, 17q24.2-q25.1, 21q22.13 and 22q12.3-q13.1. Three significant linkage signals with LOD >3 were obtained at 2p25.3-p24.2 under the dominant model, with a peak at 21 cM flanked by loci D2S2976 and D2S2952; at 12q24.22-q24.31 under the recessive model, with a peak at -120 cM flanked by marker D12S2070 and D12S395 and at 22q12.3 under the dominant model, with a peak at 32 cM flanked by marker D22S683 and D22S445. After a set of genes had been designated as possible candidates in these specific chromosomal regions,we conducted an exploratory search for LOH and CNV based on microarray data to detect structural genomic variants within five ID-linked regions with LOD scores between 2.0 and

  7. Radiation Induced Genomic Instability

    SciTech Connect

    Morgan, William F.

    2011-03-01

    Radiation induced genomic instability can be observed in the progeny of irradiated cells multiple generations after irradiation of parental cells. The phenotype is well established both in vivo (Morgan 2003) and in vitro (Morgan 2003), and may be critical in radiation carcinogenesis (Little 2000, Huang et al. 2003). Instability can be induced by both the deposition of energy in irradiated cells as well as by signals transmitted by irradiated (targeted) cells to non-irradiated (non-targeted) cells (Kadhim et al. 1992, Lorimore et al. 1998). Thus both targeted and non-targeted cells can pass on the legacy of radiation to their progeny. However the radiation induced events and cellular processes that respond to both targeted and non-targeted radiation effects that lead to the unstable phenotype remain elusive. The cell system we have used to study radiation induced genomic instability utilizes human hamster GM10115 cells. These cells have a single copy of human chromosome 4 in a background of hamster chromosomes. Instability is evaluated in the clonal progeny of irradiated cells and a clone is considered unstable if it contains three or more metaphase sub-populations involving unique rearrangements of the human chromosome (Marder and Morgan 1993). Many of these unstable clones have been maintained in culture for many years and have been extensively characterized. As initially described by Clutton et al., (Clutton et al. 1996) many of our unstable clones exhibit persistently elevated levels of reactive oxygen species (Limoli et al. 2003), which appear to be due dysfunctional mitochondria (Kim et al. 2006, Kim et al. 2006). Interestingly, but perhaps not surprisingly, our unstable clones do not demonstrate a “mutator phenotype” (Limoli et al. 1997), but they do continue to rearrange their genomes for many years. The limiting factor with this system is the target – the human chromosome. While some clones demonstrate amplification of this chromosome and thus lend

  8. Archaic human genomics.

    PubMed

    Disotell, Todd R

    2012-01-01

    For much of the 20th century, the predominant view of human evolutionary history was derived from the fossil record. Homo erectus was seen arising in Africa from an earlier member of the genus and then spreading throughout the Old World and into the Oceania. A regional continuity model of anagenetic change from H. erectus via various intermediate archaic species into the modern humans in each of the regions inhabited by H. erectus was labeled the multiregional model of human evolution (MRE). A contrasting model positing a single origin, in Africa, of anatomically modern H. sapiens with some populations later migrating out of Africa and replacing the local archaic populations throughout the world with complete replacement became known as the recent African origin (RAO) model. Proponents of both models used different interpretations of the fossil record to bolster their views for decades. In the 1980s, molecular genetic techniques began providing evidence from modern human variation that allowed not only the different models of modern human origins to be tested but also the exploration demographic history and the types of selection that different regions of the genome and even specific traits had undergone. The majority of researchers interpreted these data as strongly supporting the RAO model, especially analyses of mitochondrial DNA (mtDNA). Extrapolating backward from modern patterns of variation and using various calibration points and substitution rates, a consensus arose that saw modern humans evolving from an African population around 200,000 years ago. Much later, around 50,000 years ago, a subset of this population migrated out of Africa replacing Neanderthals in Europe and western Asia as well as archaics in eastern Asia and Oceania. mtDNA sequences from more than two-dozen Neanderthals and early modern humans re-enforced this consensus. In 2010, however, the complete draft genomes of Neanderthals and of heretofore unknown hominins from Siberia, called

  9. The Materials Genome Project

    NASA Astrophysics Data System (ADS)

    Aourag, H.

    2008-09-01

    In the past, the search for new and improved materials was characterized mostly by the use of empirical, trial- and-error methods. This picture of materials science has been changing as the knowledge and understanding of fundamental processes governing a material's properties and performance (namely, composition, structure, history, and environment) have increased. In a number of cases, it is now possible to predict a material's properties before it has even been manufactured thus greatly reducing the time spent on testing and development. The objective of modern materials science is to tailor a material (starting with its chemical composition, constituent phases, and microstructure) in order to obtain a desired set of properties suitable for a given application. In the short term, the traditional "empirical" methods for developing new materials will be complemented to a greater degree by theoretical predictions. In some areas, computer simulation is already used by industry to weed out costly or improbable synthesis routes. Can novel materials with optimized properties be designed by computers? Advances in modelling methods at the atomic level coupled with rapid increases in computer capabilities over the last decade have led scientists to answer this question with a resounding "yes'. The ability to design new materials from quantum mechanical principles with computers is currently one of the fastest growing and most exciting areas of theoretical research in the world. The methods allow scientists to evaluate and prescreen new materials "in silico" (in vitro), rather than through time consuming experimentation. The Materials Genome Project is to pursue the theory of large scale modeling as well as powerful methods to construct new materials, with optimized properties. Indeed, it is the intimate synergy between our ability to predict accurately from quantum theory how atoms can be assembled to form new materials and our capacity to synthesize novel materials atom

  10. Sequencing and mapping of the onion genome

    Technology Transfer Automated Retrieval System (TEKTRAN)

    The cost of DNA sequencing continues to decline and, in the near future, it will become reasonable to undertake sequencing of the enormous nuclear genome of onion. We undertook sequencing of expressed and genomic regions of the onion genome to learn about the structure of the onion genome, as well a...

  11. Genomic Aspects of Research Involving Polyploid Plants

    SciTech Connect

    Yang, Xiaohan; Ye, Chuyu; Tschaplinski, Timothy J; Wullschleger, Stan D; Tuskan, Gerald A

    2011-01-01

    Almost all extant plant species have spontaneously doubled their genomes at least once in their evolutionary histories, resulting in polyploidy which provided a rich genomic resource for evolutionary processes. Moreover, superior polyploid clones have been created during the process of crop domestication. Polyploid plants generated by evolutionary processes and/or crop domestication have been the intentional or serendipitous focus of research dealing with the dynamics and consequences of genome evolution. One of the new trends in genomics research is to create synthetic polyploid plants which provide materials for studying the initial genomic changes/responses immediately after polyploid formation. Polyploid plants are also used in functional genomics research to study gene expression in a complex genomic background. In this review, we summarize the recent progress in genomics research involving ancient, young, and synthetic polyploid plants, with a focus on genome size evolution, genomics diversity, genomic rearrangement, genetic and epigenetic changes in duplicated genes, gene discovery, and comparative genomics. Implications on plant sciences including evolution, functional genomics, and plant breeding are presented. It is anticipated that polyploids will be a regular subject of genomics research in the foreseeable future as the rapid advances in DNA sequencing technology create unprecedented opportunities for discovering and monitoring genomic and transcriptomic changes in polyploid plants. The fast accumulation of knowledge on polyploid formation, maintenance, and divergence at whole-genome and subgenome levels will not only help plant biologists understand how plants have evolved and diversified, but also assist plant breeders in designing new strategies for crop improvement.

  12. 2004 Structural, Function and Evolutionary Genomics

    SciTech Connect

    Douglas L. Brutlag Nancy Ryan Gray

    2005-03-23

    This Gordon conference will cover the areas of structural, functional and evolutionary genomics. It will take a systematic approach to genomics, examining the evolution of proteins, protein functional sites, protein-protein interactions, regulatory networks, and metabolic networks. Emphasis will be placed on what we can learn from comparative genomics and entire genomes and proteomes.

  13. Meeting Highlights: Genome Sequencing and Biology 2001

    PubMed Central

    2001-01-01

    We bring you a report from the CSHL Genome Sequencing and Biology Meeting, which has a long and prestigious history. This year there were sessions on large-scale sequencing and analysis, polymorphisms (covering discovery and technologies and mapping and analysis), comparative genomics of mammalian and model organism genomes, functional genomics and bioinformatics. PMID:18628920

  14. Reannotation of Shewanella oneidensis genome.

    PubMed

    Daraselia, N; Dernovoy, D; Tian, Y; Borodovsky, M; Tatusov, R; Tatusova, T

    2003-01-01

    As more and more complete bacterial genome sequences become available, the genome annotation of previously sequenced genomes may become quickly outdated. This is primarily due to the discovery and functional characterization of new genes. We have reannotated the recently published genome of Shewanella oneidensis with the following results: 51 new genes have been identified, and functional annotation has been added to the 97 genes, including 15 new and 82 existing ones with previously unassigned function. The identification of new genes was achieved by predicting the protein coding regions using the HMM-based program GeneMark.hmm. Subsequent comparison of the predicted gene products to the non-redundant protein database using BLAST and the COG (Clusters of Orthologous Groups) database using COGNITOR provided for the functional annotation. PMID:14506846

  15. Do Echinoderm Genomes Measure Up?

    PubMed Central

    Cameron, R. Andrew; Kudtarkar, Parul; Gordon, Susan M.; Worley, Kim C.; Gibbs, Richard A.

    2015-01-01

    Echinoderm genome sequences are a corpus of useful information about a clade of animals that serve as research models in fields ranging from marine ecology to cell and developmental biology. Genomic information from echinoids has contributed to insights into the gene interactions that drive the developmental process at the molecular level. Such insights often rely heavily on genomic information and the kinds of questions that can be asked thus depend on the quality of the sequence information. Here we describe the history of echinoderm genomic sequence assembly and present details about the quality of the data obtained. All of the sequence information discussed here is posted on the echinoderm information web system, Echinobase.org. PMID:25701080

  16. Genomics and equal opportunity ethics.

    PubMed

    Cappelen, A W; Norheim, O F; Tungodden, B

    2008-05-01

    Genomics provides information on genetic susceptibility to diseases and new possibilities for interventions which can fundamentally alter the design of fair health policies. The aim of this paper is to explore implications of genomics from the perspective of equal opportunity ethics. The ideal of equal opportunity requires that individuals are held responsible for some, but not all, factors that affect their health. Informational problems, however, often make it difficult to implement the ideal of equal opportunity in the context of healthcare. In this paper, examples are considered of how new genetic information may affect the way individual responsibility for choice is assigned. It is also argued that genomics may result in relocation of the responsibility cut by providing both new information and new technology. Finally, how genomics may affect healthcare policies and the market for health insurance is discussed. PMID:18448717

  17. Genomic Resources for Cancer Epidemiology

    Cancer.gov

    This page provides links to research resources, complied by the Epidemiology and Genomics Research Program, that may be of interest to genetic epidemiologists conducting cancer research, but is not exhaustive.

  18. Collaborators | Office of Cancer Genomics

    Cancer.gov

    The TARGET initiative is jointly managed within the National Cancer Institute (NCI) by the Office of Cancer Genomics (OCG)Opens in a New Tab and the Cancer Therapy Evaluation Program (CTEP)Opens in a New Tab.

  19. Genomic Datasets for Cancer Research

    Cancer.gov

    A variety of datasets from genome-wide association studies of cancer and other genotype-phenotype studies, including sequencing and molecular diagnostic assays, are available to approved investigators through the Extramural National Cancer Institute Data Access Committee.

  20. Genome Statute and Legislation Database

    MedlinePlus

    ... of page Last Reviewed: February 29, 2016 Get Email Updates Advancing human health through genomics research Privacy Copyright Contact Accessibility Plug-ins Site Map Staff Directory FOIA Share Top

  1. Genomic understanding of glioblastoma expanded

    Cancer.gov

    Glioblastoma multiforme (GBM) was the first cancer type to be systematically studied by TCGA in 2008. In a new, complementary report, TCGA experts examined more than 590 GBM samples--the largest to date utilizing genomic characterization techniques and ne

  2. Mutational dynamics of aroid chloroplast genomes.

    PubMed

    Ahmed, Ibrar; Biggs, Patrick J; Matthews, Peter J; Collins, Lesley J; Hendy, Michael D; Lockhart, Peter J

    2012-01-01

    A characteristic feature of eukaryote and prokaryote genomes is the co-occurrence of nucleotide substitution and insertion/deletion (indel) mutations. Although similar observations have also been made for chloroplast DNA, genome-wide associations have not been reported. We determined the chloroplast genome sequences for two morphotypes of taro (Colocasia esculenta; family Araceae) and compared these with four publicly available aroid chloroplast genomes. Here, we report the extent of genome-wide association between direct and inverted repeats, indels, and substitutions in these aroid chloroplast genomes. We suggest that alternative but not mutually exclusive hypotheses explain the mutational dynamics of chloroplast genome evolution. PMID:23204304

  3. Genomic imprinting and cancer.

    PubMed Central

    Joyce, J A; Schofield, P N

    1998-01-01

    Genomic imprinting is the phenomenon by which individual alleles of certain genes are expressed differentially according to their parent of origin. The alleles appear to be differentially marked during gametogenesis or during the early part of development. This mark is heritable but reversible from generation to generation, implying a stable epigenetic modification. Approximately 25 imprinted genes have been identified to date, and dysregulation of a number of these has been implicated in tumour development. The normal physiological role of many imprinted genes is in the control of cell proliferation and fetal growth, indicating potential mechanisms of action in tumour formation. Both dominant and recessive modes of action have been postulated for the role of imprinted genes in neoplasia, as a result of effective gene dosage alterations by epigenetic modification of the normal pattern of allele specific transcription. The aim of this review is to assess the importance of imprinted genes in generating tumours and to discuss the implications for novel mechanisms of transforming mutation. PMID:9893743

  4. The soft genome

    PubMed Central

    Anava, Sarit; Posner, Rachel; Rechavi, Oded

    2014-01-01

    Caenorhabditis elegans (C. elegans) nematodes transmit small RNAs across generations, a process that enables transgenerational regulation of genes. In contrast to changes to the DNA sequence, transgenerational transmission of small RNA-mediated responses is reversible, and thus enables “soft” or “flexible” inheritance of acquired characteristics. Until very recently only introduction of foreign genetic material (viruses, transposons, transgenes) was shown to directly lead to inheritance of small RNAs. New discoveries however, demonstrate that starvation also triggers inheritance of endogenous small RNAs in C.elegans. Multiple generations of worms inherit starvation-responsive endogenous small RNAs, and starvation also results in heritable extension of the progeny's lifespan. In this Commentary paper we explore the intriguing possibility that large parts of the genome and many additional traits are similarly subjected to heritable small RNA-mediated regulation, and focus on the potential influence of transgenerational RNAi on the worm's physiology. While the universal relevance of this mechanism remains to be discovered, we will examine how the discoveries made in worms already challenge long held dogmas in genetics and evolution. PMID:26430554

  5. Eukaryotic Genomics Data from the DOE Joint Genome Institute (JGI)

    DOE Data Explorer

    The JGI makes high-quality genome sequencing data freely available to the greater scientific community through its web portal. Having played a significant role in the federally funded Human Genome Project -- generating the complete sequences of Chromosomes 5, 16, and 19--the JGI has now moved on to contributing in other critical areas of genomics research. While NIH-funded genome sequencing activities continue to emphasize human biomedical targets and applications, the JGI has since shifted its focus to the non-human components of the biosphere, particularly those relevant to the science mission of the Department of Energy. With efficiencies of scale established at the PGF, and capacity now exceeding three billion bases generated on a monthly basis, the JGI has tackled scores of additional genomes. These include more than 60 microbial genomes and many important multicellular organisms and communities of microbes. In partnership with other federal institutions and universities, the JGI is in the process of sequencing a frog (Xenopus tropicalis), a green alga (Chlamydomonas reinhardtii), a diatom (Thalassiosira pseudonana) , the cottonwood tree (Populus trichocarpa), and a host of agriculturally important plants and plant pathogens. Microorganisms, for example those that thrive under extreme conditions such as high acidity, radiation, and metal contamination, are of particular interest to the DOE and JGI. Investigations by JGI and its partners are shedding light on the cellular machinery of microbes and how they can be harnessed to clean up contaminated soil or water, capture carbon from the atmosphere, and produce potentially important sources of energy such as hydrogen and methane. [Excerpt from the JGI page "Who We Are" at http://www.jgi.doe.gov/whoweare/whoweare.html] From the JGI webportal users can choose Eukaryotic genomes from a photo list, access the JGI FTP directories to download data files, use the Tree of Life navigation tool, or choose a genome and go

  6. Genomic Landscapes of Pancreatic Neoplasia

    PubMed Central

    Wood, Laura D.; Hruban, Ralph H.

    2015-01-01

    Pancreatic cancer is a deadly disease with a dismal prognosis. However, recent advances in sequencing and bioinformatic technology have led to the systematic characterization of the genomes of all major tumor types in the pancreas. This characterization has revealed the unique genomic landscape of each tumor type. This knowledge will pave the way for improved diagnostic and therapeutic approaches to pancreatic tumors that take advantage of the genetic alterations in these neoplasms. PMID:25812653

  7. IS4 family goes genomic

    PubMed Central

    2008-01-01

    Background Insertion sequences (ISs) are small, mobile DNA entities able to expand in prokaryotic genomes and trigger important rearrangements. To understand their role in evolution, accurate IS taxonomy is essential. The IS4 family is composed of ~70 elements and, like some other families, displays extremely elevated levels of internal divergence impeding its classification. The increasing availability of complete genome sequences provides a valuable source for the discovery of additional IS4 elements. In this study, this genomic database was used to update the structural and functional definition of the IS4 family. Results A total of 227 IS4-related sequences were collected among more than 500 sequenced bacterial and archaeal genomes, representing more than a three fold increase of the initial inventory. A clear division into seven coherent subgroups was discovered as well as three emerging families, which displayed distinct structural and functional properties. The IS4 family was sporadically present in 17 % of analyzed genomes, with most of them displaying single or a small number of IS4 elements. Significant expansions were detected only in some pathogens as well as among certain extremophiles, suggesting the probable involvement of some elements in bacterial and archaeal adaptation and/or evolution. Finally, it should be noted that some IS4 subgroups and two emerging families occurred preferentially in specific phyla or exclusively inside a specific genus. Conclusion The present taxonomic update of IS4 and emerging families will facilitate the classification of future elements as they arise from ongoing genome sequencing. Their narrow genomic impact and the existence of both IS-poor and IS-rich thriving prokaryotes suggested that these families, and probably ISs in general, are occasionally used as a tool for genome flexibility and evolution, rather than just representing self sustaining DNA entities. PMID:18215304

  8. Contact | Office of Cancer Genomics

    Cancer.gov

    For more information about the Office of Cancer Genomics, please contact: Office of Cancer Genomics National Cancer Institute 31 Center Drive, 10A07 Bethesda, Maryland 20892-2580 Phone: (301) 451-8027 Fax: (301) 480-4368 Email: ocg@mail.nih.gov *Please note that this site will not function properly in Internet Explorer unless you completely turn off the Compatibility View*

  9. Genome diversity of Shigella boydii.

    PubMed

    Kania, Dane A; Hazen, Tracy H; Hossain, Anowar; Nataro, James P; Rasko, David A

    2016-06-01

    ITALIC! Shigella boydiiis one of the four ITALIC! Shigellaspecies that causes disease worldwide; however, there are few published studies that examine the genomic variation of this species. This study compares genomes of 72 total isolates; 28 ITALIC! S. boydiifrom Bangladesh and The Gambia that were recently isolated as part of the Global Enteric Multicenter Study (GEMS), 14 historical ITALIC! S. boydiigenomes in the public domain and 30 ITALIC! Escherichia coliand ITALIC! Shigellareference genomes that represent the genomic diversity of these pathogens. This comparative analysis of these 72 genomes identified that the ITALIC! S. boydiiisolates separate into three phylogenomic clades, each with specific gene content. Each of the clades contains ITALIC! S. boydiiisolates from geographic and temporally distant sources, indicating that the ITALIC! S. boydiiisolates from the GEMS are representative of ITALIC! S. boydii.This study describes the genome sequences of a collection of novel ITALIC! S. boydiiisolates and provides insight into the diversity of this species in comparison to the ITALIC! E. coliand other ITALIC! Shigellaspecies. PMID:27056949

  10. Shannon Information in Complete Genomes

    NASA Astrophysics Data System (ADS)

    Hsieh, Li-Ching; Chang, Chang-Heng; Lee, Hoong-Chien

    2004-03-01

    Genomes are books of life and necessarily carry a huge amount of information. This study was first motivated by the question: "How much information do complete genomes have?" As an answer we measured a particular type of Shannon information in all prokaryotes and eukaryotes whose complete genomes have been sequenced and are available in publically assessible database. The Shannon information in complete genome sequences follow an extremely simple pattern. With the exception of one eukaryote the Shannon information in all (more than 200) complete sequences belong to a single universality class given by a simple geometric recursion formula. The data are interpreted in terms of models for genome growth and inferred to suggest that the ancestors of present day genomes began to grow, mainly by stochastic, selectively neutral, duplications and short mutations, most likely when they were not more than 300 nt long. This notion of selective neutralism independently corroborates Kimura's neutral theory of evolution which was based on the investigation of polymorphisms of genes.

  11. Genomic expression during human myelopoiesis

    PubMed Central

    Ferrari, Francesco; Bortoluzzi, Stefania; Coppe, Alessandro; Basso, Dario; Bicciato, Silvio; Zini, Roberta; Gemelli, Claudia; Danieli, Gian Antonio; Ferrari, Sergio

    2007-01-01

    Background Human myelopoiesis is an exciting biological model for cellular differentiation since it represents a plastic process where multipotent stem cells gradually limit their differentiation potential, generating different precursor cells which finally evolve into distinct terminally differentiated cells. This study aimed at investigating the genomic expression during myeloid differentiation through a computational approach that integrates gene expression profiles with functional information and genome organization. Results Gene expression data from 24 experiments for 8 different cell types of the human myelopoietic lineage were used to generate an integrated myelopoiesis dataset of 9,425 genes, each reliably associated to a unique genomic position and chromosomal coordinate. Lists of genes constitutively expressed or silent during myelopoiesis and of genes differentially expressed in commitment phase of myelopoiesis were first identified using a classical data analysis procedure. Then, the genomic distribution of myelopoiesis genes was investigated integrating transcriptional and functional characteristics of genes. This approach allowed identifying specific chromosomal regions significantly highly or weakly expressed, and clusters of differentially expressed genes and of transcripts related to specific functional modules. Conclusion The analysis of genomic expression during human myelopoiesis using an integrative computational approach allowed discovering important relationships between genomic position, biological function and expression patterns and highlighting chromatin domains, including genes with coordinated expression and lineage-specific functions. PMID:17683550

  12. Genomic sequencing in clinical trials

    PubMed Central

    2011-01-01

    Human genome sequencing is the process by which the exact order of nucleic acid base pairs in the 24 human chromosomes is determined. Since the completion of the Human Genome Project in 2003, genomic sequencing is rapidly becoming a major part of our translational research efforts to understand and improve human health and disease. This article reviews the current and future directions of clinical research with respect to genomic sequencing, a technology that is just beginning to find its way into clinical trials both nationally and worldwide. We highlight the currently available types of genomic sequencing platforms, outline the advantages and disadvantages of each, and compare first- and next-generation techniques with respect to capabilities, quality, and cost. We describe the current geographical distributions and types of disease conditions in which these technologies are used, and how next-generation sequencing is strategically being incorporated into new and existing studies. Lastly, recent major breakthroughs and the ongoing challenges of using genomic sequencing in clinical research are discussed. PMID:22206293

  13. The dynamic genome of Hydra

    PubMed Central

    Chapman, Jarrod A.; Kirkness, Ewen F.; Simakov, Oleg; Hampson, Steven E.; Mitros, Therese; Weinmaier, Therese; Rattei, Thomas; Balasubramanian, Prakash G.; Borman, Jon; Busam, Dana; Disbennett, Kathryn; Pfannkoch, Cynthia; Sumin, Nadezhda; Sutton, Granger G.; Viswanathan, Lakshmi Devi; Walenz, Brian; Goodstein, David M.; Hellsten, Uffe; Kawashima, Takeshi; Prochnik, Simon E.; Putnam, Nicholas H.; Shu, Shengquiang; Blumberg, Bruce; Dana, Catherine E.; Gee, Lydia; Kibler, Dennis F.; Law, Lee; Lindgens, Dirk; Martinez, Daniel E.; Peng, Jisong; Wigge, Philip A.; Bertulat, Bianca; Guder, Corina; Nakamura, Yukio; Ozbek, Suat; Watanabe, Hiroshi; Khalturin, Konstantin; Hemmrich, Georg; Franke, André; Augustin, René; Fraune, Sebastian; Hayakawa, Eisuke; Hayakawa, Shiho; Hirose, Mamiko; Hwang, Jung Shan; Ikeo, Kazuho; Nishimiya-Fujisawa, Chiemi; Ogura, Atshushi; Takahashi, Toshio; Steinmetz, Patrick R. H.; Zhang, Xiaoming; Aufschnaiter, Roland; Eder, Marie-Kristin; Gorny, Anne-Kathrin; Salvenmoser, Willi; Heimberg, Alysha M.; Wheeler, Benjamin M.; Peterson, Kevin J.; Böttger, Angelika; Tischler, Patrick; Wolf, Alexander; Gojobori, Takashi; Remington, Karin A.; Strausberg, Robert L.; Venter, J. Craig; Technau, Ulrich; Hobmayer, Bert; Bosch, Thomas C. G.; Holstein, Thomas W.; Fujisawa, Toshitaka; Bode, Hans R.; David, Charles N.; Rokhsar, Daniel S.; Steele, Robert E.

    2015-01-01

    The freshwater cnidarian Hydra was first described in 17021 and has been the object of study for 300 years. Experimental studies of Hydra between 1736 and 1744 culminated in the discovery of asexual reproduction of an animal by budding, the first description of regeneration in an animal, and successful transplantation of tissue between animals2. Today, Hydra is an important model for studies of axial patterning3, stem cell biology4 and regeneration5. Here we report the genome of Hydra magnipapillata and compare it to the genomes of the anthozoan Nematostella vectensis6 and other animals. The Hydra genome has been shaped by bursts of transposable element expansion, horizontal gene transfer, trans-splicing, and simplification of gene structure and gene content that parallel simplification of the Hydra life cycle. We also report the sequence of the genome of a novel bacterium stably associated with H. magnipapillata. Comparisons of the Hydra genome to the genomes of other animals shed light on the evolution of epithelia, contractile tissues, developmentally regulated transcription factors, the Spemann–Mangold organizer, pluripotency genes and the neuromuscular junction. PMID:20228792

  14. Comparative genomic analyses in Asparagus.

    PubMed

    Kuhl, Joseph C; Havey, Michael J; Martin, William J; Cheung, Foo; Yuan, Qiaoping; Landherr, Lena; Hu, Yi; Leebens-Mack, James; Town, Christopher D; Sink, Kenneth C

    2005-12-01

    Garden asparagus (Asparagus officinalis L.) belongs to the monocot family Asparagaceae in the order Asparagales. Onion (Allium cepa L.) and Asparagus officinalis are 2 of the most economically important plants of the core Asparagales, a well supported monophyletic group within the Asparagales. Coding regions in onion have lower GC contents than the grasses. We compared the GC content of 3374 unique expressed sequence tags (ESTs) from A. officinalis with Lycoris longituba and onion (both members of the core Asparagales), Acorus americanus (sister to all other monocots), the grasses, and Arabidopsis. Although ESTs in A. officinalis and Acorus had a higher average GC content than Arabidopsis, Lycoris, and onion, all were clearly lower than the grasses. The Asparagaceae have the smallest nuclear genomes among all plants in the core Asparagales, which typically have huge genomes. Within the Asparagaceae, European Asparagus species have approximately twice the nuclear DNA of that of southern African Asparagus species. We cloned and sequenced 20 genomic amplicons from European A. officinalis and the southern African species Asparagus plumosus and observed no clear evidence for a recent genome doubling in A. officinalis relative to A. plumosus. These results indicate that members of the genus Asparagus with smaller genomes may be useful genomic models for plants in the core Asparagales. PMID:16391674

  15. Genomics and marine microbial ecology.

    PubMed

    Pedrós-Alió, Carlos

    2006-09-01

    Genomics has brought about a revolution in all fields of biology. Before the development of microbial ecology in the 1970s, microbes were not even considered in marine ecological studies. Today we know that half of the total primary production of the planet must be credited to microorganisms. This and other discoveries have changed dramatically the perspective and the focus of marine microbial ecology. The application of genomics-based approaches has provided new challenges and has allowed the discovery of novel functions, an appreciation of the great diversity of microorganisms, and the introduction of controversial ideas regarding the concepts of species, genome, and niche. Nevertheless, thorough knowledge of the traditional disciplines of biology is necessary to explore the possibilities arising from these new insights. This work reviews the different genomic techniques that can be applied to marine microbial ecology, including both sequencing of the complete genomes of microorganisms and metagenomics, which, in turn, can be complemented with the study of mRNAs (transcriptomics) and proteins (proteomics). The example of proteorhodopsin illustrates the type of information that can be gained from these approaches. A genomics perspective constitutes a map that will allow microbiologists to focus their research on potentially more productive aspects. PMID:17061209

  16. Comparative genomic hybridization with single cells after whole genome amplification

    SciTech Connect

    Haddad, B.R.; Baldini, A.; Hughes, M.R.

    1994-09-01

    Conventional karyotype analysis is the ideal way to diagnose chromosomal imbalances. However it requires cell culture and chromosome preparation. There are instances where a very small number of cells are available for cytogenetic evaluation and chromosomes cannot be obtained. Comparative genomic hybridization (CGH) is a novel molecular cytogenetic technique that provides information about genetic imbalances affecting the genome. The power of this technique lies in its ability to detect genetic imbalances using total genomic DNA. We have previously demonstrated the feasibility of whole genome amplification from single cells for subsequent analysis of multiple genetic loci by PCR. In this present work, we combine whole genome amplification with CGH to detect chromosomal imbalances from small numbers of cells. Both cytogenetically normal and abnormal cells were individually picked by micromanipulation and subjected to whole genome amplification using random oligonucleotide primers. Amplified test and control DNA were differentially labeled by incorporation of digoxigenin or biotin, mixed together and hybridized to normal male metaphase spreads. Hybridization was detected with two fluorochromes, rhodamine-anti-digoxigenin and FITC -Avidin. Ratio of intensities of the two fluorochromes along the target chromosomes was analyzed using locally developed computer imaging software. Using the combination of whole genome amplification and CGH, we were able to detect different chromosomal aneuploidies from 30, 20, and 10 cells. It can also be applied to the analysis of fetal cells sorted from maternal circulation, or to tumor cells obtained from needle biopsies or from different body fluids and effusions. Finally, its successful application to single cells will have a great impact on preimplantation diagnosis.

  17. Genomics and museum specimens.

    PubMed

    Nachman, Michael W

    2013-12-01

    Nearly 25 years ago, Allan Wilson and colleagues isolated DNA sequences from museum specimens of kangaroo rats (Dipodomys panamintinus) and compared these sequences with those from freshly collected animals (Thomas et al. 1990). The museum specimens had been collected up to 78 years earlier, so the two samples provided a direct temporal comparison of patterns of genetic variation. This was not the first time DNA sequences had been isolated from preserved material, but it was the first time it had been carried out with a population sample. Population geneticists often try to make inferences about the influence of historical processes such as selection, drift, mutation and migration on patterns of genetic variation in the present. The work of Wilson and colleagues was important in part because it suggested a way in which population geneticists could actually study genetic change in natural populations through time, much the same way that experimentalists can do with artificial populations in the laboratory. Indeed, the work of Thomas et al. (1990) spawned dozens of studies in which museum specimens were used to compare historical and present-day genetic diversity (reviewed in Wandeler et al. 2007). All of these studies, however, were limited by the same fundamental problem: old DNA is degraded into short fragments. As a consequence, these studies mostly involved PCR amplification of short templates, usually short stretches of mitochondrial DNA or microsatellites. In this issue, Bi et al. (2013) report a breakthrough that should open the door to studies of genomic variation in museum specimens. They used target enrichment (exon capture) and next-generation (Illumina) sequencing to compare patterns of genetic variation in historic and present-day population samples of alpine chipmunks (Tamias alpinus) (Fig. 1). The historic samples came from specimens collected in 1915, so the temporal span of this comparison is nearly 100 years. PMID:24138088

  18. Brazil: public health genomics.

    PubMed

    Castilla, E E; Luquetti, D V

    2009-01-01

    Brazil represents half of South America and one third of Latin America, having more than 186 million inhabitants. After China and India it is the third largest developing country in the world. The wealth is unequally distributed among the states and among the people. Brazil has a large and complex health care system. A Universal Public Health System (SUS: Sistema SPACEnico de Saúde) covers the medical expenses for 80% of the population. The genetic structure of the population is very complex, including a large proportion of tri- hybrid persons, genetic isolates, and a panmictic large majority. Genetic services are offered at 64 genetic centers, half of them public and free. Nationwide networks are operating for inborn errors of metabolism, oncogenetics, and craniofacial anomalies. The Brazilian Society of Medical Genetics (SBGM) has granted 120 board certifications since 1986, and 7 recognized residences in medical genetics are operating in the country. Three main public health actions promoted by the federal government have been undertaken in the last decade, ultimately aimed at the prevention of birth defects. Since 1999, birth defects are reported for all 3 million annual live births, several vaccination strategies aim at the eradication of rubella, and wheat and maize flours are fortified with folic acid. Currently, the government distributes over 2 million US dollars to finance 14 research projects aimed at providing the basis for the adequate prevention and care of genetics disorders through the SUS. Continuity of this proactive attitude of the government in the area of genomics in public health is desired. PMID:19023184

  19. Genome-wide analysis of differentially expressed genes and splicing isoforms in clear cell renal cell carcinoma.

    PubMed

    Valletti, Alessio; Gigante, Margherita; Palumbo, Orazio; Carella, Massimo; Divella, Chiara; Sbisà, Elisabetta; Tullo, Apollonia; Picardi, Ernesto; D'Erchia, Anna Maria; Battaglia, Michele; Gesualdo, Loreto; Pesole, Graziano; Ranieri, Elena

    2013-01-01

    Clear cell renal cell carcinoma (ccRCC) is the most common malignant renal epithelial tumor and also the most deadly. To identify molecular changes occurring in ccRCC, in the present study we performed a genome wide analysis of its entire complement of mRNAs. Gene and exon-level analyses were carried out by means of the Affymetrix Exon Array platform. To achieve a reliable detection of differentially expressed cassette exons we implemented a novel methodology that considered contiguous combinations of exon triplets and candidate differentially expressed cassette exons were identified when the expression level was significantly different only in the central exon of the triplet. More detailed analyses were performed for selected genes using quantitative RT-PCR and confocal laser scanning microscopy. Our analysis detected over 2,000 differentially expressed genes, and about 250 genes alternatively spliced and showed differential inclusion of specific cassette exons comparing tumor and non-tumoral tissues. We demonstrated the presence in ccRCC of an altered expression of the PTP4A3, LAMA4, KCNJ1 and TCF21 genes (at both transcript and protein level). Furthermore, we confirmed, at the mRNA level, the involvement of CAV2 and SFRP genes that have previously been identified. At exon level, among potential candidates we validated a differentially included cassette exon in DAB2 gene with a significant increase of DAB2 p96 splice variant as compared to the p67 isoform. Based on the results obtained, and their robustness according to both statistical analysis and literature surveys, we believe that a combination of gene/isoform expression signature may remarkably contribute, after suitable validation, to a more effective and reliable definition of molecular biomarkers for ccRCC early diagnosis, prognosis and prediction of therapeutic response. PMID:24194935

  20. A genome wide analysis of alternative splicing events during the osteogenic differentiation of human cartilage endplate-derived stem cells.

    PubMed

    Shang, Jin; Wang, Honggang; Fan, Xin; Shangguan, Lei; Liu, Huan

    2016-08-01

    Low back pain is a prevalent disease, which leads to suffering and disabilities in a vast number of individuals. Degenerative disc diseases are usually the underlying causes of low back pain. However, the pathogenesis of degenerative disc diseases is highly complex and difficult to determine. Current therapies for degenerative disc diseases are various. In particular, cell-based therapies have proven to be effective and promising. Our research group has previously isolated and identified the cartilage endplate‑derived stem cells. In addition, alternative splicing is a sophisticated regulatory mechanism, which greatly increases cellular complexity and phenotypic diversity of eukaryotic organisms. The present study continued to investigate alternative splicing events in osteogenic differentiation of cartilage endplate‑derived stem cells. An Affymetrix Human Transcriptome Array 2.0 was used to detect splicing changes between the control and differentiated samples. Additionally, molecular function and pathway analysis were also performed. Following rigorous bioinformatics analysis of the data, 3,802 alternatively spliced genes were identified, and 10 of these were selected for validation by reverse transcription‑polymerase chain reaction. Gene ontology (GO) and Kyoto Encyclopedia of Genes and Genomes pathway analysis also revealed numerous enriched GO terms and signaling pathways. To the best of our knowledge, the present study is the first to investigate alternative splicing mechanisms in osteogenic differentiation of stem cells on a genome‑wide scale. The illumination of molecular mechanisms of stem cell osteogenic differentiation may assist the development novel bioengineered methods to treat degenerative disc diseases. PMID:27278552

  1. The mouse genome informatics and the mouse genome database

    SciTech Connect

    Maltais, L.J.; Blackburn, R.E.; Bradt, D.W.

    1994-09-01

    The Mouse Genome Database (MGD) is a centralized, comprehensive database of the mouse genome that includes genetic mapping data, comparative mapping data, gene descriptions, mutant phenotype descriptions, strains and allelic polymorphism data, inbred strain characteristics, physical mapping data, and molecular probes and clones data. Data in MGD are obtained from the published literature and by electronic transfer from laboratories working on large backcross panels of mice. MGD provides tools that enable the user to search the database, retrieve data, generate reports, analyze data, annotate records, and build genetic maps. The Encyclopedia of the Mouse Genome provides a graphic user interface to mouse genome data. It consists of software tools including: LinkMap, a graphic display of genetic linkage maps with the ability to magnify regions of high locus density: CytoMap, a graphic display of cytogenetic maps showing banded chromosomes with cytogenetic locations of genes and chromosomal aberrations; CATS, a catalog searching tool for text retrieval of mouse locus descriptions. These software tools provide access to the following data sets: Chromosome Committee Reports, MIT Genome Center data, GBASE reports, Mouse Locus Catalog (MLC), and Mouse Cytogenetic Mapping Data. The MGD is available to the scientific community through the World Wide Web (WWW) and Gopher. In addition GBASE can be accessed via the Internet.

  2. Genomic repeats, genome plasticity and the dynamics of Mycoplasma evolution

    PubMed Central

    Rocha, Eduardo P. C.; Blanchard, Alain

    2002-01-01

    Mycoplasmas evolved by a drastic reduction in genome size, but their genomes contain numerous repeated sequences with important roles in their evolution. We have established a bioinformatic strategy to detect the major recombination hot-spots in the genomes of Mycoplasma pneumoniae, Mycoplasma genitalium, Ureaplasma urealyticum and Mycoplasma pulmonis. This allowed the identification of large numbers of potentially variable regions, as well as a comparison of the relative recombination potentials of different genomic regions. Different trends are perceptible among mycoplasmas, probably due to different functional and structural constraints. The largest potential for illegitimate recombination in M.pulmonis is found at the vsa locus and its comparison in two different strains reveals numerous changes since divergence. On the other hand, the main M.pneumoniae and M.genitalium adhesins rely on large distant repeats and, hence, homologous recombination for variation. However, the relation between the existence of repeats and antigenic variation is not necessarily straightforward, since repeats of P1 adhesin were found to be anti-correlated with epitopes recognized by patient antibodies. These different strategies have important consequences for the structures of genomes, since large distant repeats correlate well with the major chromosomal rearrangements. Probably to avoid such events, mycoplasmas strongly avoid inverse repeats, in comparison to co-oriented repeats. PMID:11972343

  3. Mapping whole genome shotgun sequence and variant calling in mammalian species without their reference genomes

    Technology Transfer Automated Retrieval System (TEKTRAN)

    Genomics research in mammals has produced reference genome sequences that are essential for identifying variation associated with disease. High quality reference genome sequences are now available for humans, model species, and economically important agricultural animals. Comparisons between these s...

  4. Genome-wide association study of advanced age-related macular degeneration identifies a role of the hepatic lipase gene (LIPC)

    PubMed Central

    Neale, Benjamin M.; Fagerness, Jesen; Reynolds, Robyn; Sobrin, Lucia; Parker, Margaret; Raychaudhuri, Soumya; Tan, Perciliz L.; Oh, Edwin C.; Merriam, Joanna E.; Souied, Eric; Bernstein, Paul S.; Li, Binxing; Frederick, Jeanne M.; Zhang, Kang; Brantley, Milam A.; Lee, Aaron Y.; Zack, Donald J.; Campochiaro, Betsy; Campochiaro, Peter; Ripke, Stephan; Smith, R. Theodore; Barile, Gaetano R.; Katsanis, Nicholas; Allikmets, Rando; Daly, Mark J.; Seddon, Johanna M.

    2010-01-01

    Advanced age-related macular degeneration (AMD) is the leading cause of late onset blindness. We present results of a genome-wide association study of 979 advanced AMD cases and 1,709 controls using the Affymetrix 6.0 platform with replication in seven additional cohorts (totaling 5,789 unrelated cases and 4,234 unrelated controls). We also present a comprehensive analysis of copy-number variations and polymorphisms for AMD. Our discovery data implicated the association between AMD and a variant in the hepatic lipase gene (LIPC) in the high-density lipoprotein cholesterol (HDL) pathway (discovery P = 4.53e-05 for rs493258). Our LIPC association was strongest for a functional promoter variant, rs10468017, (P = 1.34e-08), that influences LIPC expression and serum HDL levels with a protective effect of the minor T allele (HDL increasing) for advanced wet and dry AMD. The association we found with LIPC was corroborated by the Michigan/Penn/Mayo genome-wide association study; the locus near the tissue inhibitor of metalloproteinase 3 was corroborated by our replication cohort for rs9621532 with P = 3.71e-09. We observed weaker associations with other HDL loci (ABCA1, P = 9.73e-04; cholesterylester transfer protein, P = 1.41e-03; FADS1-3, P = 2.69e-02). Based on a lack of consistent association between HDL increasing alleles and AMD risk, the LIPC association may not be the result of an effect on HDL levels, but it could represent a pleiotropic effect of the same functional component. Results implicate different biologic pathways than previously reported and provide new avenues for prevention and treatment of AMD. PMID:20385826

  5. HLA-DPB1 and DPB2 are genetic loci for systemic sclerosis -Genome-wide association study in Koreans with Replication in North Americans

    PubMed Central

    Zhou, Xiaodong; Lee, Jong Eun; Arnett, Frank. C.; Xiong, Momiao; Park, Min Young; Yoo, Yeon Kyeong; Shin, Eun Soon; Reveille, John. D.; Mayes, Maureen D.; Kim, Jin Hyun; Song, Ran; Choi, Ji Yong; Park, Ji Ah; Lee, Yun Jong; Lee, Eun Young; Song, Yeong Wook; Lee, Eun Bong

    2010-01-01

    Objective To investigate the most susceptible genetic loci in systemic sclerosis (SSc) with genome-wide association study (GWAS). Methods A genome-wide association study was performed in 137 patients with systemic sclerosis and 564 controls from Korea using the Affymetrix Human SNP Array 5.0. After fine mapping study, the results were replicated in 1,107 SSc patients and 2,747 controls from a US Caucasian population. Results The SNPs (rs3128930, rs7763822, rs7764491, rs3117230 and rs3128965) of HLA-DPB1 and –DPB2 on chromosome 6 formed a distinctive peak with log p-values (p =8.16 × 10−13) for association with SSc susceptibility. Subtyping analysis of HLA-DPB1 showed that DPB1*1301 (p = 7.61×10−8) and DPB1*0901 (p = 2.56×10−5)were the most susceptible subtypes for SSc in Koreans. In US Caucasians, two pairs of SNPs, rs7763822/rs7764491 and rs3117230/rs3128965, showed strong association with SSc patients who had either circulating anti-DNA topoisomerase I (p = 7.58 × 10−17/4.84 × 10−16) or anti-centromere autoantibodies (p = 1.12 × 10−3/3.2 × 10−5), respectively. Conclusion Our GWAS in Koreans revealed that the region of HLA-DPB1 and –DPB2 contains the most susceptible loci to Korean SSc. The confirmatory studies in US Caucasians indicated that specific SNPs of the HLA-DPB1 and/or –DPB2 were strongly associated with US Caucasian SSc patients who were positive to anti-topoisomerase I or anti-centromere autoantibodies. PMID:19950302

  6. Novel loci for major depression identified by genome-wide association study of Sequenced Treatment Alternatives to Relieve Depression and meta-analysis of three studies.

    PubMed

    Shyn, S I; Shi, J; Kraft, J B; Potash, J B; Knowles, J A; Weissman, M M; Garriock, H A; Yokoyama, J S; McGrath, P J; Peters, E J; Scheftner, W A; Coryell, W; Lawson, W B; Jancic, D; Gejman, P V; Sanders, A R; Holmans, P; Slager, S L; Levinson, D F; Hamilton, S P

    2011-02-01

    We report a genome-wide association study (GWAS) of major depressive disorder (MDD) in 1221 cases from the Sequenced Treatment Alternatives to Relieve Depression (STAR*D) study and 1636 screened controls. No genome-wide evidence for association was detected. We also carried out a meta-analysis of three European-ancestry MDD GWAS data sets: STAR*D, Genetics of Recurrent Early-onset Depression and the publicly available Genetic Association Information Network-MDD data set. These data sets, totaling 3957 cases and 3428 controls, were genotyped using four different platforms (Affymetrix 6.0, 5.0 and 500 K, and Perlegen). For each of 2.4 million HapMap II single-nucleotide polymorphisms (SNPs), using genotyped data where available and imputed data otherwise, single-SNP association tests were carried out in each sample with correction for ancestry-informative principal components. The strongest evidence for association in the meta-analysis was observed for intronic SNPs in ATP6V1B2 (P=6.78 x 10⁻⁷), SP4 (P=7.68 x 10⁻⁷) and GRM7 (P=1.11 x 10⁻⁶). Additional exploratory analyses were carried out for a narrower phenotype (recurrent MDD with onset before age 31, N=2191 cases), and separately for males and females. Several of the best findings were supported primarily by evidence from narrow cases or from either males or females. On the basis of previous biological evidence, we consider GRM7 a strong MDD candidate gene. Larger samples will be required to determine whether any common SNPs are significantly associated with MDD. PMID:20038947

  7. Novel loci for major depression identified by genome-wide association study of STAR*D and meta-analysis of three studies

    PubMed Central

    Shyn, SI; Shi, J; Kraft, JB; Potash, JB; Knowles, JA; Weissman, MM; Garriock, HA; Yokoyama, JS; McGrath, PJ; Peters, EJ; Scheftner, WA; Coryell, W; Lawson, WB; Jancic, D; Gejman, PV; Sanders, AR; Holmans, P; Slager, SL; Levinson, DF; Hamilton, SP

    2009-01-01

    We report a genome-wide association study (GWAS) of major depressive disorder (MDD) in 1,221 cases from the Sequenced Treatment Alternatives to Relieve Depression (STAR*D) study and 1,636 screened controls. No genome-wide evidence for association was detected. We also carried out a meta-analysis of three European-ancestry MDD GWAS datasets: STAR*D, Genetics of Recurrent Early-Onset Depression (GenRED) and the publicly-available Genetic Association Information Network MDD dataset (GAIN-MDD). These datasets, totaling 3,957 cases and 3,428 controls, were genotyped using four different platforms (Affymetrix 6.0, 5.0 and 500K, and Perlegen). For each of 2.4 million HapMap II SNPs, using genotyped data where available and imputed data otherwise, single-SNP association tests were carried out in each sample with correction for ancestry-informative principal components. The strongest evidence for association in the meta-analysis was observed for intronic SNPs in ATP6V1B2 (P = 6.78 × 10−7), SP4 (P = 7.68 × 10−7) and GRM7 (P = 1.11 × 10−6). Additional exploratory analyses were carried out for a narrower phenotype (recurrent MDD with onset before age 31, N = 2,191 cases), and separately for males and females. Several of the best findings were supported primarily by evidence from narrow cases or from either males or females. Based on previous biological evidence, we consider GRM7 a strong MDD candidate gene. Larger samples will be required to determine whether any common SNPs are significantly associated with MDD. PMID:20038947

  8. Genome-Wide Transcriptomic Analysis of Intestinal Tissue to Assess the Impact of Nutrition and a Secondary Nematode Challenge in Lactating Rats

    PubMed Central

    Athanasiadou, Spiridoula; Jones, Leigh A.; Burgess, Stewart T. G.; Kyriazakis, Ilias; Pemberton, Alan D.; Houdijk, Jos G. M.; Huntley, John F.

    2011-01-01

    Background Gastrointestinal nematode infection is a major challenge to the health and welfare of mammals. Although mammals eventually acquire immunity to nematodes, this breaks down around parturition, which renders periparturient mammals susceptible to re-infection and an infection source for their offspring. Nutrient supplementation reduces the extent of periparturient parasitism, but the underlying mechanisms remain unclear. Here, we use a genome wide approach to assess the effects of protein supplementation on gene expression in the small intestine of periparturient rats following nematode re-infection. Methodology/Principal Findings The use of a rat whole genome expression microarray (Affymetrix Gene 1.0ST) showed significant differential regulation of 91 genes in the small intestine of lactating rats, re-infected with Nippostrongylus brasiliensis compared to controls; affected functions included immune cell trafficking, cell-mediated responses and antigen presentation. Genes with a previously described role in immune response to nematodes, such as mast cell proteases, and intelectin, and others newly associated with nematode expulsion, such as anterior gradient homolog 2 were identified. Protein supplementation resulted in significant differential regulation of 64 genes; affected functions included protein synthesis, cellular function and maintenance. It increased cell metabolism, evident from the high number of non-coding RNA and the increased synthesis of ribosomal proteins. It regulated immune responses, through T-cell activation and proliferation. The up-regulation of transcription factor forkhead box P1 in unsupplemented, parasitised hosts may be indicative of a delayed immune response in these animals. Conclusions/Significance This study provides the first evidence for nutritional regulation of genes related to immunity to nematodes at the site of parasitism, during expulsion. Additionally it reveals genes induced following secondary parasite challenge

  9. Contrast enhancement in 1p/19q-codeleted anaplastic oligodendrogliomas is associated with 9p loss, genomic instability, and angiogenic gene expression

    PubMed Central

    Reyes-Botero, German; Dehais, Caroline; Idbaih, Ahmed; Martin-Duverneuil, Nadine; Lahutte, Marion; Carpentier, Catherine; Letouzé, Eric; Chinot, Olivier; Loiseau, Hugues; Honnorat, Jerome; Ramirez, Carole; Moyal, Elisabeth; Figarella-Branger, Dominique; Ducray, François; Desenclos, Christine; Sevestre, Henri; Menei, Philippe; Michalak, Sophie; Al Nader, Edmond; Godard, Joel; Viennet, Gabriel; Carpentier, Antoine; Eimer, Sandrine; Dam-Hieu, Phong; Quintin-Roué, Isabelle; Guillamo, Jean-Sebastien; Lechapt-Zalcman, Emmanuelle; Kemeny, Jean-Louis; Verrelle, Pierre; Faillot, Thierry; Gaultier, Claude; Tortel, Marie Christine; Christov, Christo; Le Guerinel, Caroline; Aubriot-Lorton, Marie-Hélène; Ghiringhelli, Francois; Berger, François; Lacroix, Catherine; Parker, Fabrice; Dubois, François; Maurage, Claude-Alain; Gueye, Edouard-Marcel; Labrousse, Francois; Jouvet, Anne; Bauchet, Luc; Rigau, Valérie; Beauchesne, Patrick; Vignaud, Jean-Michel; Campone, Mario; Loussouarn, Delphine; Fontaine, Denys; Vandenbos, Fanny; Campello, Chantal; Roger, Pascal; Fesneau, Melanie; Heitzmann, Anne; Delattre, Jean-Yves; Elouadhani, Selma; Mokhtari, Karima; Polivka, Marc; Ricard, Damien; Levillain, Pierre-Marie; Wager, Michel; Colin, Philippe; Diebold, Marie-Danièle; Chiforeanu, Dan; Vauleon, Elodie; Langlois, Olivier; Laquerriere, Annie; Motsuo Fotso, Marie Janette; Peoc'h, Michel; Andraud, Marie; Mouton, Servane; Chenard, Marie-Pierre; Noel, Georges; Desse, Nicolas; Soulard, Raoulin; Amiel-Benouaich, Alexandra; Uro-Coste, Emmanuelle; Dhermain, Frederic

    2014-01-01

    Background The aim of this study was to correlate MRI features and molecular characteristics in anaplastic oligodendrogliomas (AOs). Methods The MRI characteristics of 50 AO patients enrolled in the French national network for high-grade oligodendroglial tumors were analyzed. The genomic profiles and IDH mutational statuses were assessed using high-resolution single-nucleotide polymorphism arrays and direct sequencing, respectively. The gene expression profiles of 25 1p/19q-codeleted AOs were studied on Affymetrix expression arrays. Results Most of the cases were frontal lobe contrast-enhanced tumors (52%), but the radiological presentations of these cases were heterogeneous, ranging from low-grade glioma-like aspects (26%) to glioblastoma-like aspects (22%). The 1p/19q codeletion (n = 39) was associated with locations in the frontal lobe (P = .001), with heterogeneous intratumoral signal intensities (P = .003) and with no or nonmeasurable contrast enhancements (P = .01). The IDH wild-type AOs (n = 7) more frequently displayed ringlike contrast enhancements (P = .03) and were more frequently located outside of the frontal lobe (P = .01). However, no specific imaging pattern could be identified for the 1p/19q-codeleted AO or the IDH-mutated AO. Within the 1p/19q-codeleted AO, the contrast enhancement was associated with larger tumor volumes (P = .001), chromosome 9p loss and CDKN2A loss (P = .006), genomic instability (P = .03), and angiogenesis-related gene expression (P < .001), particularly for vascular endothelial growth factor A and angiopoietin 2. Conclusion In AOs, the 1p/19q codeletion and the IDH mutation are associated with preferential (but not with specific) imaging characteristics. Within 1p/19q-codeleted AO, imaging heterogeneity is related to additional molecular alterations, especially chromosome 9p loss, which is associated with contrast enhancement and larger tumor volume. PMID:24353325

  10. GOLD: The Genomes Online Database

    DOE Data Explorer

    Kyrpides, Nikos; Liolios, Dinos; Chen, Amy; Tavernarakis, Nektarios; Hugenholtz, Philip; Markowitz, Victor; Bernal, Alex

    Since its inception in 1997, GOLD has continuously monitored genome sequencing projects worldwide and has provided the community with a unique centralized resource that integrates diverse information related to Archaea, Bacteria, Eukaryotic and more recently Metagenomic sequencing projects. As of September 2007, GOLD recorded 639 completed genome projects. These projects have their complete sequence deposited into the public archival sequence databases such as GenBank EMBL,and DDBJ. From the total of 639 complete and published genome projects as of 9/2007, 527 were bacterial, 47 were archaeal and 65 were eukaryotic. In addition to the complete projects, there were 2158 ongoing sequencing projects. 1328 of those were bacterial, 59 archaeal and 771 eukaryotic projects. Two types of metadata are provided by GOLD: (i) project metadata and (ii) organism/environment metadata. GOLD CARD pages for every project are available from the link of every GOLD_STAMP ID. The information in every one of these pages is organized into three tables: (a) Organism information, (b) Genome project information and (c) External links. [The Genomes On Line Database (GOLD) in 2007: Status of genomic and metagenomic projects and their associated metadata, Konstantinos Liolios, Konstantinos Mavromatis, Nektarios Tavernarakis and Nikos C. Kyrpides, Nucleic Acids Research Advance Access published online on November 2, 2007, Nucleic Acids Research, doi:10.1093/nar/gkm884]

    The basic tables in the GOLD database that can be browsed or searched include the following information:

    • Gold Stamp ID
    • Organism name
    • Domain
    • Links to information sources
    • Size and link to a map, when available
    • Chromosome number, Plas number, and GC content
    • A link for downloading the actual genome data
    • Institution that did the sequencing
    • Funding source
    • Database where information resides
    • Publication status and information

    • GIPSy: Genomic island prediction software.

      PubMed

      Soares, Siomar C; Geyik, Hakan; Ramos, Rommel T J; de Sá, Pablo H C G; Barbosa, Eudes G V; Baumbach, Jan; Figueiredo, Henrique C P; Miyoshi, Anderson; Tauch, Andreas; Silva, Artur; Azevedo, Vasco

      2016-08-20

      Bacteria are highly diverse organisms that are able to adapt to a broad range of environments and hosts due to their high genomic plasticity. Horizontal gene transfer plays a pivotal role in this genome plasticity and in evolution by leaps through the incorporation of large blocks of genome sequences, ordinarily known as genomic islands (GEIs). GEIs may harbor genes encoding virulence, metabolism, antibiotic resistance and symbiosis-related functions, namely pathogenicity islands (PAIs), metabolic islands (MIs), resistance islands (RIs) and symbiotic islands (SIs). Although many software for the prediction of GEIs exist, they only focus on PAI prediction and present other limitations, such as complicated installation and inconvenient user interfaces. Here, we present GIPSy, the genomic island prediction software, a standalone and user-friendly software for the prediction of GEIs, built on our previously developed pathogenicity island prediction software (PIPS). We also present four application cases in which we crosslink data from literature to PAIs, MIs, RIs and SIs predicted by GIPSy. Briefly, GIPSy correctly predicted the following previously described GEIs: 13 PAIs larger than 30kb in Escherichia coli CFT073; 1 MI for Burkholderia pseudomallei K96243, which seems to be a miscellaneous island; 1 RI of Acinetobacter baumannii AYE, named AbaR1; and, 1 SI of Mesorhizobium loti MAFF303099 presenting a mosaic structure. GIPSy is the first life-style-specific genomic island prediction software to perform analyses of PAIs, MIs, RIs and SIs, opening a door for a better understanding of bacterial genome plasticity and the adaptation to new traits. PMID:26376473

    • Genome size: a novel genomic signature in support of Afrotheria.

      PubMed

      Redi, Carlo Alberto; Garagna, Silvia; Zuccotti, Maurizio; Capanna, Ernesto

      2007-04-01

      Molecular phylogenetic analyses suggest an emerging phylogeny for the extant Placentalia (eutherian) that radically departs from morphologically based constructions of the past. Placental mammals are partitioned into four supraordinal clades: Afrotheria, Xenarthra, Laurasiatheria, and Euarchontoglires. Afrotheria form an endemic African clade that includes elephant shrews, golden moles, tenrecs, aardvarks, hyraxes, elephants, dugongs, and manatees. Datamining databases of genome size (GS) shows that till today just one afrotherian GS has been evaluated, that of the aardvark Orycteropus afer. We show that the GSs of six selected representatives across the Afrotheria supraordinal group are among the highest for the extant Placentalia, providing a novel genomic signature of this enigmatic group. The mean GS value of Afrotheria, 5.3 +/- 0.7 pg, is the highest reported for the extant Placentalia. This should assist in planning new genome sequencing initiatives. PMID:17479346

    • Genomics made easier: an introductory tutorial to genome datamining.

      PubMed

      Schattner, Peter

      2009-03-01

      Integrated genome databases--such as the UCSC, Ensembl and NCBI MapViewer databases--and their associated data querying and visualization interfaces (e.g. the genome browsers) have transformed the way that molecular biologists, geneticists and bioinformaticists analyze genomic data. Nevertheless, because of the complexity of these tools, many researchers take advantage of only a fraction of their capabilities. In this tutorial, using examples from medical genetics and alternative splicing, I describe some of the biological questions that can be addressed with these techniques. I also show why doing so typically is more effective than using alternative methods and indicate some of the resources available for learning more about the advanced capabilities of these powerful tools. PMID:19041391

    • Linking genome-scale metabolic modeling and genome annotation

      PubMed Central

      Blais, Edik M.; Chavali, Arvind K.; Papin, Jason A.

      2014-01-01

      Summary Genome-scale metabolic network reconstructions, assembled from annotated genomes, serve as a platform for integrating data from heterogeneous sources and generating hypotheses for further experimental validation. Implementing constraint-based modeling techniques such as Flux Balance Analysis (FBA) on network reconstructions allow for interrogating metabolism at a systems-level, which aids in identifying and rectifying gaps in knowledge. With genome sequences for various organisms from prokaryotes to eukaryotes becoming increasingly available, a significant bottleneck lies in the structural and functional annotation of these sequences. Using topologically-based and biologically-inspired metabolic network refinement, we can better characterize enzymatic functions present in an organism and link annotation of these functions to candidate transcripts, both steps that can be experimentally validated. PMID:23417799

    • Transcriptional Regulation: a Genomic Overview

      PubMed Central

      Riechmann, José Luis

      2002-01-01

      The availability of the Arabidopsis thaliana genome sequence allows a comprehensive analysis of transcriptional regulation in plants using novel genomic approaches and methodologies. Such a genomic view of transcription first necessitates the compilation of lists of elements. Transcription factors are the most numerous of the different types of proteins involved in transcription in eukaryotes, and the Arabidopsis genome codes for more than 1,500 of them, or approximately 6% of its total number of genes. A genome-wide comparison of transcription factors across the three eukaryotic kingdoms reveals the evolutionary generation of diversity in the components of the regulatory machinery of transcription. However, as illustrated by Arabidopsis, transcription in plants follows similar basic principles and logic to those in animals and fungi. A global view and understanding of transcription at a cellular and organismal level requires the characterization of the Arabidopsis transcriptome and promoterome, as well as of the interactome, the localizome, and the phenome of the proteins involved in transcription. PMID:22303220

    • Expanding genomics of mycorrhizal symbiosis

      PubMed Central

      Kuo, Alan; Kohler, Annegret; Martin, Francis M.; Grigoriev, Igor V.

      2014-01-01

      The mycorrhizal symbiosis between soil fungi and plant roots is a ubiquitous mutualism that plays key roles in plant nutrition, soil health, and carbon cycling. The symbiosis evolved repeatedly and independently as multiple morphotypes [e.g., arbuscular mycorrhizae (AM), ectomycorrhizal (ECM)] in multiple fungal clades (e.g., phyla Glomeromycota, Ascomycota, Basidiomycota). The accessibility and cultivability of many mycorrhizal partners make them ideal models for symbiosis studies. Alongside molecular, physiological, and ecological investigations, sequencing led to the first three mycorrhizal fungal genomes, representing two morphotypes and three phyla. The genome of the ECM basidiomycete Laccaria bicolor showed that the mycorrhizal lifestyle can evolve through loss of plant cell wall-degrading enzymes (PCWDEs) and expansion of lineage-specific gene families such as short secreted protein (SSP) effectors. The genome of the ECM ascomycete Tuber melanosporum showed that the ECM type can evolve without expansion of families as in Laccaria, and thus a different set of symbiosis genes. The genome of the AM glomeromycete Rhizophagus irregularis showed that despite enormous phylogenetic distance and morphological difference from the other two fungi, symbiosis can involve similar solutions as symbiosis-induced SSPs and loss of PCWDEs. The three genomes provide a solid base for addressing fundamental questions about the nature and role of a vital mutualism. PMID:25408690

    • Comparative genomics for biodiversity conservation

      PubMed Central

      Grueber, Catherine E.

      2015-01-01

      Genomic approaches are gathering momentum in biology and emerging opportunities lie in the creative use of comparative molecular methods for revealing the processes that influence diversity of wildlife. However, few comparative genomic studies are performed with explicit and specific objectives to aid conservation of wild populations. Here I provide a brief overview of comparative genomic approaches that offer specific benefits to biodiversity conservation. Because conservation examples are few, I draw on research from other areas to demonstrate how comparing genomic data across taxa may be used to inform the characterisation of conservation units and studies of hybridisation, as well as studies that provide conservation outcomes from a better understanding of the drivers of divergence. A comparative approach can also provide valuable insight into the threatening processes that impact rare species, such as emerging diseases and their management in conservation. In addition to these opportunities, I note areas where additional research is warranted. Overall, comparing and contrasting the genomic composition of threatened and other species provide several useful tools for helping to preserve the molecular biodiversity of the global ecosystem. PMID:26106461

    • Manipulating duckweed through genome duplication.

      PubMed

      Vunsh, R; Heinig, U; Malitsky, S; Aharoni, A; Avidov, A; Lerner, A; Edelman, M

      2015-01-01

      Significant inter- and intraspecific genetic variation exists in duckweed, thus the potential for genome plasticity and manipulation is high. Polyploidy is recognised as a major mechanism of adaptation and speciation in plants. We produced several genome-duplicated lines of Landoltia punctata (Spirodela oligorrhiza) from both whole plants and regenerating explants using a colchicine-based cocktail. These lines stably maintained an enlarged frond and root morphology. DNA ploidy levels determined by florescence-activated cell sorting indicated genome duplication. Line A4 was analysed after 75 biomass doublings. Frond area, fresh and dry weights, rhizoid number and length were significantly increased versus wild type, while the growth rate was unchanged. This resulted in accumulation of biomass 17-20% faster in the A4 plants. We sought to determine if specific differences in gene products are found in the genome duplicated lines. Non-targeted ultra performance LC-quadrupole time of flight mass spectrometry was employed to compare some of the lines and the wild type to seek identification of up-regulated metabolites. We putatively identified differential metabolites in Line A65 as caffeoyl hexoses. The combination of directed genome duplication and metabolic profiling might offer a path for producing stable gene expression, leading to altered production of secondary metabolites. PMID:25040392

    • Expanding genomics of mycorrhizal symbiosis

      SciTech Connect

      Kuo, Alan; Kohler, Annegret; Martin, Francis M.; Grigoriev, Igor V.

      2014-11-04

      The mycorrhizal symbiosis between soil fungi and plant roots is a ubiquitous mutualism that plays key roles in plant nutrition, soil health, and carbon cycling. The symbiosis evolved repeatedly and independently as multiple morphotypes [e.g., arbuscular mycorrhizae (AM), ectomycorrhizal (ECM)] in multiple fungal clades (e.g., phyla Glomeromycota, Ascomycota, Basidiomycota). The accessibility and cultivability of many mycorrhizal partners make them ideal models for symbiosis studies. Alongside molecular, physiological, and ecological investigations, sequencing led to the first three mycorrhizal fungal genomes, representing two morphotypes and three phyla. The genome of the ECM basidiomycete Laccaria bicolor showed that the mycorrhizal lifestyle can evolve through loss of plant cell wall-degrading enzymes (PCWDEs) and expansion of lineage-specific gene families such as short secreted protein (SSP) effectors. The genome of the ECM ascomycete Tuber melanosporum showed that the ECM type can evolve without expansion of families as in Laccaria, and thus a different set of symbiosis genes. The genome of the AM glomeromycete Rhizophagus irregularis showed that despite enormous phylogenetic distance and morphological difference from the other two fungi, symbiosis can involve similar solutions as symbiosis-induced SSPs and loss of PCWDEs. The three genomes provide a solid base for addressing fundamental questions about the nature and role of a vital mutualism.

    • Expanding genomics of mycorrhizal symbiosis

      DOE PAGESBeta

      Kuo, Alan; Kohler, Annegret; Martin, Francis M.; Grigoriev, Igor V.

      2014-11-04

      The mycorrhizal symbiosis between soil fungi and plant roots is a ubiquitous mutualism that plays key roles in plant nutrition, soil health, and carbon cycling. The symbiosis evolved repeatedly and independently as multiple morphotypes [e.g., arbuscular mycorrhizae (AM), ectomycorrhizal (ECM)] in multiple fungal clades (e.g., phyla Glomeromycota, Ascomycota, Basidiomycota). The accessibility and cultivability of many mycorrhizal partners make them ideal models for symbiosis studies. Alongside molecular, physiological, and ecological investigations, sequencing led to the first three mycorrhizal fungal genomes, representing two morphotypes and three phyla. The genome of the ECM basidiomycete Laccaria bicolor showed that the mycorrhizal lifestyle can evolvemore » through loss of plant cell wall-degrading enzymes (PCWDEs) and expansion of lineage-specific gene families such as short secreted protein (SSP) effectors. The genome of the ECM ascomycete Tuber melanosporum showed that the ECM type can evolve without expansion of families as in Laccaria, and thus a different set of symbiosis genes. The genome of the AM glomeromycete Rhizophagus irregularis showed that despite enormous phylogenetic distance and morphological difference from the other two fungi, symbiosis can involve similar solutions as symbiosis-induced SSPs and loss of PCWDEs. The three genomes provide a solid base for addressing fundamental questions about the nature and role of a vital mutualism.« less

  1. The Fitness of Genomic Order

    NASA Astrophysics Data System (ADS)

    Zhang, Qiucen; Vyawahare, Saurabh; Austin, Robert

    2012-02-01

    Most bacteria have a single circular chromosome that can range in size from 160,000 to 12,200,000 base pairs. Considering the typical gene density, i.e. 1 gene per 1,000 base pairs, both the number of genes and the ways to arrange are huge. Intuitively, the arrangement of genes on the circle is not important if all of them can be replicated. However, there is typically one origin of replication, and when bacteria is attacked by genotoxic stress during replication, the whole replication process can not be finished. As a result, which gene is replicated first, which is second, ..., becomes very important. Experimentally, we found a broad increase of DNA copy number near the origin of replication (OriC) of bacteria E.coli (˜3200 genes) under genotoxic stress. Since the genes near OriC are mostly efflux pump genes, we propose that there is fitness advantage for those rapid stress response genes got replicated first, because they can facilitate the replication of the rest of genome. Similar to bacterial evolution to present genomic order, in the somatic evolution of cancer, genomic shuffling was also frequently observed, especially under genotoxic chemotherapy. Such re-arrangement of genome can be viewed as a journey to optimal point in the rugged fitness landscape of genomic order.

  2. Comparative genomics of protoploid Saccharomycetaceae.

    PubMed

    Souciet, Jean-Luc; Dujon, Bernard; Gaillardin, Claude; Johnston, Mark; Baret, Philippe V; Cliften, Paul; Sherman, David J; Weissenbach, Jean; Westhof, Eric; Wincker, Patrick; Jubin, Claire; Poulain, Julie; Barbe, Valérie; Ségurens, Béatrice; Artiguenave, François; Anthouard, Véronique; Vacherie, Benoit; Val, Marie-Eve; Fulton, Robert S; Minx, Patrick; Wilson, Richard; Durrens, Pascal; Jean, Géraldine; Marck, Christian; Martin, Tiphaine; Nikolski, Macha; Rolland, Thomas; Seret, Marie-Line; Casarégola, Serge; Despons, Laurence; Fairhead, Cécile; Fischer, Gilles; Lafontaine, Ingrid; Leh, Véronique; Lemaire, Marc; de Montigny, Jacky; Neuvéglise, Cécile; Thierry, Agnès; Blanc-Lenfle, Isabelle; Bleykasten, Claudine; Diffels, Julie; Fritsch, Emilie; Frangeul, Lionel; Goëffon, Adrien; Jauniaux, Nicolas; Kachouri-Lafond, Rym; Payen, Célia; Potier, Serge; Pribylova, Lenka; Ozanne, Christophe; Richard, Guy-Franck; Sacerdot, Christine; Straub, Marie-Laure; Talla, Emmanuel

    2009-10-01

    Our knowledge of yeast genomes remains largely dominated by the extensive studies on Saccharomyces cerevisiae and the consequences of its ancestral duplication, leaving the evolution of the entire class of hemiascomycetes only partly explored. We concentrate here on five species of Saccharomycetaceae, a large subdivision of hemiascomycetes, that we call "protoploid" because they diverged from the S. cerevisiae lineage prior to its genome duplication. We determined the complete genome sequences of three of these species: Kluyveromyces (Lachancea) thermotolerans and Saccharomyces (Lachancea) kluyveri (two members of the newly described Lachancea clade), and Zygosaccharomyces rouxii. We included in our comparisons the previously available sequences of Kluyveromyces lactis and Ashbya (Eremothecium) gossypii. Despite their broad evolutionary range and significant individual variations in each lineage, the five protoploid Saccharomycetaceae share a core repertoire of approximately 3300 protein families and a high degree of conserved synteny. Synteny blocks were used to define gene orthology and to infer ancestors. Far from representing minimal genomes without redundancy, the five protoploid yeasts contain numerous copies of paralogous genes, either dispersed or in tandem arrays, that, altogether, constitute a third of each genome. Ancient, conserved paralogs as well as novel, lineage-specific paralogs were identified. PMID:19525356

  3. Advances in Genome Biology & Technology

    SciTech Connect

    Thomas J. Albert, Jon R. Armstrong, Raymond K. Auerback, W. Brad Barbazuk, et al.

    2007-12-01

    This year's meeting focused on the latest advances in new DNA sequencing technologies and the applications of genomics to disease areas in biology and biomedicine. Daytime plenary sessions highlighted cutting-edge research in areas such as complex genetic diseases, comparative genomics, medical sequencing, massively parallel DNA sequencing, and synthetic biology. Technical approaches being developed and utilized in contemporary genomics research were presented during evening concurrent sessions. Also, as in previous years, poster sessions bridged the morning and afternoon plenary sessions. In addition, for the third year in a row, the Advances in Genome Biology and Technology (AGBT) meeting was preceded by a pre-meeting workshop that aimed to provide an introductory overview for trainees and other meeting attendees. This year, speakers at the workshop focused on next-generation sequencing technologies, including their experiences, findings, and helpful advise for others contemplating using these platforms in their research. Speakers from genome centers and core sequencing facilities were featured and the workshop ended with a roundtable discussion, during which speakers fielded questions from the audience.

  4. The genome of Prunus mume

    PubMed Central

    Zhang, Qixiang; Chen, Wenbin; Sun, Lidan; Zhao, Fangying; Huang, Bangqing; Yang, Weiru; Tao, Ye; Wang, Jia; Yuan, Zhiqiong; Fan, Guangyi; Xing, Zhen; Han, Changlei; Pan, Huitang; Zhong, Xiao; Shi, Wenfang; Liang, Xinming; Du, Dongliang; Sun, Fengming; Xu, Zongda; Hao, Ruijie; Lv, Tian; Lv, Yingmin; Zheng, Zequn; Sun, Ming; Luo, Le; Cai, Ming; Gao, Yike; Wang, Junyi; Yin, Ye; Xu, Xun; Cheng, Tangren; Wang, Jun

    2012-01-01

    Prunus mume (mei), which was domesticated in China more than 3,000 years ago as ornamental plant and fruit, is one of the first genomes among Prunus subfamilies of Rosaceae been sequenced. Here, we assemble a 280M genome by combining 101-fold next-generation sequencing and optical mapping data. We further anchor 83.9% of scaffolds to eight chromosomes with genetic map constructed by restriction-site-associated DNA sequencing. Combining P. mume genome with available data, we succeed in reconstructing nine ancestral chromosomes of Rosaceae family, as well as depicting chromosome fusion, fission and duplication history in three major subfamilies. We sequence the transcriptome of various tissues and perform genome-wide analysis to reveal the characteristics of P. mume, including its regulation of early blooming in endodormancy, immune response against bacterial infection and biosynthesis of flower scent. The P. mume genome sequence adds to our understanding of Rosaceae evolution and provides important data for improvement of fruit trees. PMID:23271652

  5. Evolutionary engineering by genome shuffling.

    PubMed

    Biot-Pelletier, Damien; Martin, Vincent J J

    2014-05-01

    An upsurge in the bioeconomy drives the need for engineering microorganisms with increasingly complex phenotypes. Gains in productivity of industrial microbes depend on the development of improved strains. Classical strain improvement programmes for the generation, screening and isolation of such mutant strains have existed for several decades. An alternative to traditional strain improvement methods, genome shuffling, allows the directed evolution of whole organisms via recursive recombination at the genome level. This review deals chiefly with the technical aspects of genome shuffling. It first presents the diversity of organisms and phenotypes typically evolved using this technology and then reviews available sources of genetic diversity and recombination methodologies. Analysis of the literature reveals that genome shuffling has so far been restricted to microorganisms, both prokaryotes and eukaryotes, with an overepresentation of antibiotics- and biofuel-producing microbes. Mutagenesis is the main source of genetic diversity, with few studies adopting alternative strategies. Recombination is usually done by protoplast fusion or sexual recombination, again with few exceptions. For both diversity and recombination, prospective methods that have not yet been used are also presented. Finally, the potential of genome shuffling for gaining insight into the genetic basis of complex phenotypes is also discussed. PMID:24595425

  6. NCBI prokaryotic genome annotation pipeline.

    PubMed

    Tatusova, Tatiana; DiCuccio, Michael; Badretdin, Azat; Chetvernin, Vyacheslav; Nawrocki, Eric P; Zaslavsky, Leonid; Lomsadze, Alexandre; Pruitt, Kim D; Borodovsky, Mark; Ostell, James

    2016-08-19

    Recent technological advances have opened unprecedented opportunities for large-scale sequencing and analysis of populations of pathogenic species in disease outbreaks, as well as for large-scale diversity studies aimed at expanding our knowledge across the whole domain of prokaryotes. To meet the challenge of timely interpretation of structure, function and meaning of this vast genetic information, a comprehensive approach to automatic genome annotation is critically needed. In collaboration with Georgia Tech, NCBI has developed a new approach to genome annotation that combines alignment based methods with methods of predicting protein-coding and RNA genes and other functional elements directly from sequence. A new gene finding tool, GeneMarkS+, uses the combined evidence of protein and RNA placement by homology as an initial map of annotation to generate and modify ab initio gene predictions across the whole genome. Thus, the new NCBI's Prokaryotic Genome Annotation Pipeline (PGAP) relies more on sequence similarity when confident comparative data are available, while it relies more on statistical predictions in the absence of external evidence. The pipeline provides a framework for generation and analysis of annotation on the full breadth of prokaryotic taxonomy. For additional information on PGAP see https://www.ncbi.nlm.nih.gov/genome/annotation_prok/ and the NCBI Handbook, https://www.ncbi.nlm.nih.gov/books/NBK174280/. PMID:27342282

  7. Human Genome Education Program

    SciTech Connect

    Richard Myers; Lane Conn

    2000-05-01

    The funds from the DOE Human Genome Program, for the project period 2/1/96 through 1/31/98, have provided major support for the curriculum development and field testing efforts for two high school level instructional units: Unit 1, ''Exploring Genetic Conditions: Genes, Culture and Choices''; and Unit 2, ''DNA Snapshots: Peaking at Your DNA''. In the original proposal, they requested DOE support for the partial salary and benefits of a Field Test Coordinator position to: (1) complete the field testing and revision of two high school curriculum units, and (2) initiate the education of teachers using these units. During the project period of this two-year DOE grant, a part-time Field-Test Coordinator was hired (Ms. Geraldine Horsma) and significant progress has been made in both of the original proposal objectives. Field testing for Unit 1 has occurred in over 12 schools (local and non-local sites with diverse student populations). Field testing for Unit 2 has occurred in over 15 schools (local and non-local sites) and will continue in 12-15 schools during the 96-97 school year. For both curricula, field-test sites and site teachers were selected for their interest in genetics education and in hands-on science education. Many of the site teachers had no previous experience with HGEP or the unit under development. Both of these first-year biology curriculum units, which contain genetics, biotechnology, societal, ethical and cultural issues related to HGP, are being implemented in many local and non-local schools (SF Bay Area, Southern California, Nebraska, Hawaii, and Texas) and in programs for teachers. These units will reach over 10,000 students in the SF Bay Area and continues to receive support from local corporate and private philanthropic organizations. Although HGEP unit development is nearing completion for both units, data is still being gathered and analyzed on unit effectiveness and student learning. The final field testing result from this analysis will

  8. Genomics and the origin of species.

    PubMed

    Seehausen, Ole; Butlin, Roger K; Keller, Irene; Wagner, Catherine E; Boughman, Janette W; Hohenlohe, Paul A; Peichel, Catherine L; Saetre, Glenn-Peter; Bank, Claudia; Brännström, Ake; Brelsford, Alan; Clarkson, Chris S; Eroukhmanoff, Fabrice; Feder, Jeffrey L; Fischer, Martin C; Foote, Andrew D; Franchini, Paolo; Jiggins, Chris D; Jones, Felicity C; Lindholm, Anna K; Lucek, Kay; Maan, Martine E; Marques, David A; Martin, Simon H; Matthews, Blake; Meier, Joana I; Möst, Markus; Nachman, Michael W; Nonaka, Etsuko; Rennison, Diana J; Schwarzer, Julia; Watson, Eric T; Westram, Anja M; Widmer, Alex

    2014-03-01

    Speciation is a fundamental evolutionary process, the knowledge of which is crucial for understanding the origins of biodiversity. Genomic approaches are an increasingly important aspect of this research field. We review current understanding of genome-wide effects of accumulating reproductive isolation and of genomic properties that influence the process of speciation. Building on this work, we identify emergent trends and gaps in our understanding, propose new approaches to more fully integrate genomics into speciation research, translate speciation theory into hypotheses that are testable using genomic tools and provide an integrative definition of the field of speciation genomics. PMID:24535286

  9. A physical map of the human genome

    SciTech Connect

    McPherson, J.D.; Marra, M.; Hillier, L.; Waterston, R.H.; Chinwalla, A.; Wallis, J.; Sekhon, M.; Wylie, K.; Mardis, E.R.; Wilson, R.K.; Fulton, R.; Kucaba, T.A.; Wagner-McPherson, C.; Barbazuk, W.B.; Gregory, S.G.; Humphray, S.J.; French, L.; Evans, R.S.; Bethel, G.; Whittaker, A.; Holden, J.L.; McCann, O.T.; Dunham, A.; Soderlund, C.; Scott, C.E.; Bentley, D.R.; Schuler, G.; Chen, H.-C.; Jang, W.; Green, E.D.; Idol, J.R.; Maduro, V.V. Braden; Montgomery, K.T.; Lee, E.; Miller, A.; Emerling, S.; Kucherlapati; Gibbs, R.; Scherer, S.; Gorrell, J.H.; Sodergren, E.; Clerc-Blankenburg, K.; Tabor, P.; Naylor, S.; Garcia, D.; de Jong, P.J.; Catanese, J.J.; Nowak, N.; Osoegawa, K.; Qin, S.; Rowen, L.; Madan, A.; Dors, M.; Hood, L.; Trask, B.; Friedman, C.; Massa, H.; Cheung, V.G.; Kirsch, I.R.; Reid, T.; Yonescu, R.; Weissenbach, J.; Bruls, T.; Heilig, R.; Branscomb, E.; Olsen, A.; Doggett, N.; Cheng, J.F.; Hawkins, T.; Myers, R.M.; Shang, J.; Ramirez, L.; Schmutz, J.; Velasquez, O.; Dixon, K.; Stone, N.E.; Cox, D.R.; Haussler, D.; Kent, W.J.; Furey, T.; Rogic, S.; Kennedy, S.; Jones, S.; Rosenthal, A.; Wen, G.; Schilhabel, M.; Gloeckner, G.; Nyakatura, G.; Siebert, R.; Schlegelberger, B.; Korenberg, J.; Chen, X.N.; Fujiyama, A.; Hattori, M.; Toyoda, A.; Yada, T.; Park, H.S.; Sakaki, Y.; Shimizu, N.; Asakawa, S.; Kawasaki, K.; Sasaki, T.; Shintani, A.; Shimizu, A.; Shibuya, K.; Kudoh, J.; Minoshima, S.; Ramser, J.; Seranski, P.; Hoff, C.; Poustka, A.; Reinhardt, R.; Lehrach, H.

    2001-01-01

    The human genome is by far the largest genome to be sequenced, and its size and complexity present many challenges for sequence assembly. The International Human Genome Sequencing Consortium constructed a map of the whole genome to enable the selection of clones for sequencing and for the accurate assembly of the genome sequence. Here we report the construction of the whole-genome bacterial artificial chromosome (BAC) map and its integration with previous landmark maps and information from mapping efforts focused on specific chromosomal regions. We also describe the integration of sequence data with the map.

  10. PLEXdb: Plant and Pathogen Expression Database and Tools for Comparative and Functional Genomics Analysis

    Technology Transfer Automated Retrieval System (TEKTRAN)

    PLEXdb is a plant expression database that supports all Affymetrix microarray designs for plants and plant pathogens. PLEXdb provides annotation and hand-curated microarray data. Experiments deposited in PLEXdb are checked for MIAME/Plant compliance and completeness, then processed by normalizing th...

  11. Decoding the human genome sequence.

    PubMed

    Bentley, D R

    2000-10-01

    The year 2000 is marked by the production of the sequence of the human genome. A 'working draft' of high quality sequence covering 90% of the genome has been determined and a quarter is in finished form, including the first two completed chromosomes. All sequence data from the project is made freely available to the community via the Internet, for further analysis and exploitation. The challenge which lies ahead is to decipher the information. Knowledge of the human genome sequence will enable us to understand how the genetic information determines the development, structure and function of the human body. We will be able to explore how variations within our DNA sequence cause disease, how they affect our interaction with our environment and ultimately to develop new and effective ways to improve human health. PMID:11005789

  12. Multiscale Representation of Genomic Signals

    PubMed Central

    Knijnenburg, Theo A.; Ramsey, Stephen A.; Berman, Benjamin P.; Kennedy, Kathleen A.; Smit, Arian F.A.; Wessels, Lodewyk F.A.; Laird, Peter W.; Aderem, Alan; Shmulevich, Ilya

    2014-01-01

    Genomic information is encoded on a wide range of distance scales, ranging from tens of base pairs to megabases. We developed a multiscale framework to analyze and visualize the information content of genomic signals. Different types of signals, such as GC content or DNA methylation, are characterized by distinct patterns of signal enrichment or depletion across scales spanning several orders of magnitude. These patterns are associated with a variety of genomic annotations, including genes, nuclear lamina associated domains, and repeat elements. By integrating the information across all scales, as compared to using any single scale, we demonstrate improved prediction of gene expression from Polymerase II chromatin immunoprecipitation sequencing (ChIP-seq) measurements and we observed that gene expression differences in colorectal cancer are not most strongly related to gene body methylation, but rather to methylation patterns that extend beyond the single-gene scale. PMID:24727652

  13. Support Values for Genome Phylogenies

    PubMed Central

    Klötzl, Fabian; Haubold, Bernhard

    2016-01-01

    We have recently developed a distance metric for efficiently estimating the number of substitutions per site between unaligned genome sequences. These substitution rates are called “anchor distances” and can be used for phylogeny reconstruction. Most phylogenies come with bootstrap support values, which are computed by resampling with replacement columns of homologous residues from the original alignment. Unfortunately, this method cannot be applied to anchor distances, as they are based on approximate pairwise local alignments rather than the full multiple sequence alignment necessary for the classical bootstrap. We explore two alternatives: pairwise bootstrap and quartet analysis, which we compare to classical bootstrap. With simulated sequences and 53 human primate mitochondrial genomes, pairwise bootstrap gives better results than quartet analysis. However, when applied to 29 E. coli genomes, quartet analysis comes closer to the classical bootstrap. PMID:26959064

  14. Genomics of Escherichia and Shigella

    NASA Astrophysics Data System (ADS)

    Perna, Nicole T.

    The laboratory workhorse Escherichia coli K-12 is among the most intensively studied living organisms on earth, and this single strain serves as the model system behind much of our understanding of prokaryotic molecular biology. Dense genome sequencing and recent insightful comparative analyses are making the species E. coli, as a whole, an emerging system for studying prokaryotic population genetics and the relationship between system-scale, or genome-scale, molecular evolution and complex traits like host range and pathogenic potential. Genomic perspective has revealed a coherent but dynamic species united by intraspecific gene flow via homologous lateral or horizontal transfer and differentiated by content flux mediated by acquisition of DNA segments from interspecies transfers.

  15. How good is our genome?

    PubMed

    Weill, Jean-Claude; Radman, Miroslav

    2004-01-29

    Our genome has evolved to perpetuate itself through the maintenance of the species via an uninterrupted chain of reproductive somas. Accordingly, evolution is not concerned with diseases occurring after the soma's reproductive stage. Following Richard Dawkins, we would like to reassert that we indeed live as disposable somas, slaves of our germline genome, but could soon start rebelling against such slavery. Cancer and its relation to the TP53 gene may offer a paradigmatic example. The observation that the latency period in cancer can be prolonged in mice by increasing the number of TP53 genes in their genome, suggests that sooner or later we will have to address the question of heritable disease avoidance via the manipulation of the human germline. PMID:15065661

  16. Tripartite genome of all species.

    PubMed

    Long, MengPing; Hu, TaoBo

    2016-01-01

    Neutral theory has dominated the molecular evolution field for more than half a century, but it has been severely challenged by the recently emerged Maximum Genetic Diversity (MGD) theory. However, based on our recent work of tripartite human genome architecture, we found that MGD theory may have overlooked the regulatory but variable genomic regions that increase with species complexity. Here we propose a new molecular evolution theory named Increasing Functional Variation (IFV) hypothesis. According to the IFV hypothesis, the genome of all species is divided into three regions that are 'functional and invariable', 'functional and variable' and 'non-functional and variable'. While the 'non-functional and variable' region decreases as species become more complex, the other two regions increase. PMID:27366319

  17. Tripartite genome of all species

    PubMed Central

    2016-01-01

    Neutral theory has dominated the molecular evolution field for more than half a century, but it has been severely challenged by the recently emerged Maximum Genetic Diversity (MGD) theory. However, based on our recent work of tripartite human genome architecture, we found that MGD theory may have overlooked the regulatory but variable genomic regions that increase with species complexity. Here we propose a new molecular evolution theory named Increasing Functional Variation (IFV) hypothesis. According to the IFV hypothesis, the genome of all species is divided into three regions that are ‘functional and invariable’, ‘functional and variable’ and ‘non-functional and variable’. While the ‘non-functional and variable’ region decreases as species become more complex, the other two regions increase. PMID:27366319

  18. Environmental Influences on Genomic Imprinting

    PubMed Central

    Kappil, Maya; Lambertini, Luca; Chen, Jia

    2015-01-01

    Genomic imprinting refers to the epigenetic mechanism that results in the mono-allelic expression of a subset of genes in a parent-of-origin manner. These haploid genes are highly active in the placenta and are functionally implicated in the appropriate development of the fetus. Furthermore, the epigenetic marks regulating imprinted expression patterns are established early in development. These characteristics make genomic imprinting a potentially useful biomarker for environmental insults, especially during the in utero or early development stages, and for health outcomes later in life. Herein, we critically review the current literature regarding environmental influences on imprinted genes and summarize findings that suggest that imprinted loci are sensitive to known teratogenic agents, such as alcohol and tobacco, as well as less established factors with the potential to manipulate the in utero environment, including assisted reproductive technology. Finally, we discuss the potential of genomic imprinting to serve as an environmental sensor during early development. PMID:26029493

  19. Enhancer Identification through Comparative Genomics

    SciTech Connect

    Visel, Axel; Bristow, James; Pennacchio, Len A.

    2006-10-01

    With the availability of genomic sequence from numerousvertebrates, a paradigm shift has occurred in the identification ofdistant-acting gene regulatory elements. In contrast to traditionalgene-centric studies in which investigators randomly scanned genomicfragments that flank genes of interest in functional assays, the modernapproach begins electronically with publicly available comparativesequence datasets that provide investigators with prioritized lists ofputative functional sequences based on their evolutionary conservation.However, although a large number of tools and resources are nowavailable, application of comparative genomic approaches remains far fromtrivial. In particular, it requires users to dynamically consider thespecies and methods for comparison depending on the specific biologicalquestion under investigation. While there is currently no single generalrule to this end, it is clear that when applied appropriately,comparative genomic approaches exponentially increase our power ingenerating biological hypotheses for subsequent experimentaltesting.

  20. Kaposi's Sarcoma Herpesvirus Genome Persistence.

    PubMed

    Juillard, Franceline; Tan, Min; Li, Shijun; Kaye, Kenneth M

    2016-01-01

    Kaposi's sarcoma-associated herpesvirus (KSHV) has an etiologic role in Kaposi's sarcoma, primary effusion lymphoma, and multicentric Castleman's disease. These diseases are most common in immunocompromised individuals, especially those with AIDS. Similar to all herpesviruses, KSHV infection is lifelong. KSHV infection in tumor cells is primarily latent, with only a small subset of cells undergoing lytic infection. During latency, the KSHV genome persists as a multiple copy, extrachromosomal episome in the nucleus. In order to persist in proliferating tumor cells, the viral genome replicates once per cell cycle and then segregates to daughter cell nuclei. KSHV only expresses several genes during latent infection. Prominent among these genes, is the latency-associated nuclear antigen (LANA). LANA is responsible for KSHV genome persistence and also exerts transcriptional regulatory effects. LANA mediates KSHV DNA replication and in addition, is responsible for segregation of replicated genomes to daughter nuclei. LANA serves as a molecular tether, bridging the viral genome to mitotic chromosomes to ensure that KSHV DNA reaches progeny nuclei. N-terminal LANA attaches to mitotic chromosomes by binding histones H2A/H2B at the surface of the nucleosome. C-terminal LANA binds specific KSHV DNA sequence and also has a role in chromosome attachment. In addition to the essential roles of N- and C-terminal LANA in genome persistence, internal LANA sequence is also critical for efficient episome maintenance. LANA's role as an essential mediator of virus persistence makes it an attractive target for inhibition in order to prevent or treat KSHV infection and disease. PMID:27570517

  1. An Exploration into Fern Genome Space

    PubMed Central

    Wolf, Paul G.; Sessa, Emily B.; Marchant, Daniel Blaine; Li, Fay-Wei; Rothfels, Carl J.; Sigel, Erin M.; Gitzendanner, Matthew A.; Visger, Clayton J.; Banks, Jo Ann; Soltis, Douglas E.; Soltis, Pamela S.; Pryer, Kathleen M.; Der, Joshua P.

    2015-01-01

    Ferns are one of the few remaining major clades of land plants for which a complete genome sequence is lacking. Knowledge of genome space in ferns will enable broad-scale comparative analyses of land plant genes and genomes, provide insights into genome evolution across green plants, and shed light on genetic and genomic features that characterize ferns, such as their high chromosome numbers and large genome sizes. As part of an initial exploration into fern genome space, we used a whole genome shotgun sequencing approach to obtain low-density coverage (∼0.4X to 2X) for six fern species from the Polypodiales (Ceratopteris, Pteridium, Polypodium, Cystopteris), Cyatheales (Plagiogyria), and Gleicheniales (Dipteris). We explore these data to characterize the proportion of the nuclear genome represented by repetitive sequences (including DNA transposons, retrotransposons, ribosomal DNA, and simple repeats) and protein-coding genes, and to extract chloroplast and mitochondrial genome sequences. Such initial sweeps of fern genomes can provide information useful for selecting a promising candidate fern species for whole genome sequencing. We also describe variation of genomic traits across our sample and highlight some differences and similarities in repeat structure between ferns and seed plants. PMID:26311176

  2. An Exploration into Fern Genome Space.

    PubMed

    Wolf, Paul G; Sessa, Emily B; Marchant, Daniel Blaine; Li, Fay-Wei; Rothfels, Carl J; Sigel, Erin M; Gitzendanner, Matthew A; Visger, Clayton J; Banks, Jo Ann; Soltis, Douglas E; Soltis, Pamela S; Pryer, Kathleen M; Der, Joshua P

    2015-09-01

    Ferns are one of the few remaining major clades of land plants for which a complete genome sequence is lacking. Knowledge of genome space in ferns will enable broad-scale comparative analyses of land plant genes and genomes, provide insights into genome evolution across green plants, and shed light on genetic and genomic features that characterize ferns, such as their high chromosome numbers and large genome sizes. As part of an initial exploration into fern genome space, we used a whole genome shotgun sequencing approach to obtain low-density coverage (∼0.4X to 2X) for six fern species from the Polypodiales (Ceratopteris, Pteridium, Polypodium, Cystopteris), Cyatheales (Plagiogyria), and Gleicheniales (Dipteris). We explore these data to characterize the proportion of the nuclear genome represented by repetitive sequences (including DNA transposons, retrotransposons, ribosomal DNA, and simple repeats) and protein-coding genes, and to extract chloroplast and mitochondrial genome sequences. Such initial sweeps of fern genomes can provide information useful for selecting a promising candidate fern species for whole genome sequencing. We also describe variation of genomic traits across our sample and highlight some differences and similarities in repeat structure between ferns and seed plants. PMID:26311176

  3. Fungal genome sequencing: basic biology to biotechnology.

    PubMed

    Sharma, Krishna Kant

    2016-08-01

    The genome sequences provide a first glimpse into the genomic basis of the biological diversity of filamentous fungi and yeast. The genome sequence of the budding yeast, Saccharomyces cerevisiae, with a small genome size, unicellular growth, and rich history of genetic and molecular analyses was a milestone of early genomics in the 1990s. The subsequent completion of fission yeast, Schizosaccharomyces pombe and genetic model, Neurospora crassa initiated a revolution in the genomics of the fungal kingdom. In due course of time, a substantial number of fungal genomes have been sequenced and publicly released, representing the widest sampling of genomes from any eukaryotic kingdom. An ambitious genome-sequencing program provides a wealth of data on metabolic diversity within the fungal kingdom, thereby enhancing research into medical science, agriculture science, ecology, bioremediation, bioenergy, and the biotechnology industry. Fungal genomics have higher potential to positively affect human health, environmental health, and the planet's stored energy. With a significant increase in sequenced fungal genomes, the known diversity of genes encoding organic acids, antibiotics, enzymes, and their pathways has increased exponentially. Currently, over a hundred fungal genome sequences are publicly available; however, no inclusive review has been published. This review is an initiative to address the significance of the fungal genome-sequencing program and provides the road map for basic and applied research. PMID:25721271

  4. Genomic profiling of breast cancer.

    PubMed

    Pandey, Anjita; Singh, Alok Kumar; Maurya, Sanjeev Kumar; Rai, Rajani; Tewari, Mallika; Kumar, Mohan; Shukla, Hari S

    2009-05-01

    Genome study provides significant changes in the advancement of molecular diagnosis and treatment in Breast cancer. Several recent critical advances and high-throughput techniques identified the genomic trouble and dramatically accelerated the pace of research in preventing and curing this malignancy. Tumor-suppressor genes, proto-oncogenes, DNA-repair genes, carcinogen-metabolism genes are critically involved in progression of breast cancer. We reviewed imperative finding in breast genetics, ongoing work to segregate further susceptible genes, and preliminary studies on molecular profiling. PMID:19235775

  5. Genomics screens for metastasis genes

    PubMed Central

    Yan, Jinchun; Huang, Qihong

    2014-01-01

    Metastasis is responsible for most cancer mortality. The process of metastasis is complex, requiring the coordinated expression and fine regulation of many genes in multiple pathways in both the tumor and host tissues. Identification and characterization of the genetic programs that regulate metastasis is critical to understanding the metastatic process and discovering molecular targets for the prevention and treatment of metastasis. Genomic approaches and functional genomic analyses can systemically discover metastasis genes. In this review, we summarize the genetic tools and methods that have been used to identify and characterize the genes that play critical roles in metastasis. PMID:22684367

  6. Genome editing comes of age.

    PubMed

    Kim, Jin-Soo

    2016-09-01

    Genome editing harnesses programmable nucleases to cut and paste genetic information in a targeted manner in living cells and organisms. Here, I review the development of programmable nucleases, including zinc finger nucleases (ZFNs), TAL (transcription-activator-like) effector nucleases (TALENs) and CRISPR (cluster of regularly interspaced palindromic repeats)-Cas9 (CRISPR-associated protein 9) RNA-guided endonucleases (RGENs). I specifically highlight the key advances that set the foundation for the rapid and widespread implementation of CRISPR-Cas9 genome editing approaches that has revolutionized the field. PMID:27490630

  7. Pfizer targets genomics through Pfizergen

    SciTech Connect

    Glaser, V.

    1995-06-01

    Recently, Pfizer (New York) formed Pfizergen to develop and commercialize genomics. For starters, Pfizergen involves investments by Pfizer of more than $115 million - excluding milestone payments and royalties on future products - in four biotech firms. Seeking a strong foothold in genomics, Pfizer is piecing together a multifaceted network of technologies. Through its alliance with Incyte, Pfizer has already accessed gene databases, high-throughput gene sequencing, and transcription analysis. Through Pfizergen, it will access expertise in microbial genetic engineering and combinatorial chemistry, as well as antiviral, antisense, and gene therapy capabilities. Future investments could target firms specializing in such products as positional cloning and bioinformatics.

  8. Translating genomics in cancer care.

    PubMed

    Bombard, Yvonne; Bach, Peter B; Offit, Kenneth

    2013-11-01

    There is increasing enthusiasm for genomics and its promise in advancing personalized medicine. Genomic information has been used to personalize health care for decades, spanning the fields of cardiovascular disease, infectious disease, endocrinology, metabolic medicine, and hematology. However, oncology has often been the first test bed for the clinical translation of genomics for diagnostic, prognostic, and therapeutic applications. Notable hereditary cancer examples include testing for mutations in BRCA1 or BRCA2 in unaffected women to identify those at significantly elevated risk for developing breast and ovarian cancers, and screening patients with newly diagnosed colorectal cancer for mutations in 4 mismatch repair genes to reduce morbidity and mortality in their relatives. Somatic genomic testing is also increasingly used in oncology, with gene expression profiling of breast tumors and EGFR testing to predict treatment response representing commonly used examples. Health technology assessment provides a rigorous means to inform clinical and policy decision-making through systematic assessment of the evidentiary base, along with precepts of clinical effectiveness, cost-effectiveness, and consideration of risks and benefits for health care delivery and society. Although this evaluation is a fundamental step in the translation of any new therapeutic, procedure, or diagnostic test into clinical care, emerging developments may threaten this standard. These include "direct to consumer" genomic risk assessment services and the challenges posed by incidental results generated from next-generation sequencing (NGS) technologies. This article presents a review of the evidentiary standards and knowledge base supporting the translation of key cancer genomic technologies along the continuum of validity, utility, cost-effectiveness, health service impacts, and ethical and societal issues, and offers future research considerations to guide the responsible introduction of

  9. Genomic Signals of Reoriented ORFs

    NASA Astrophysics Data System (ADS)

    Dan Cristea, Paul

    2004-12-01

    Complex representation of nucleotides is used to convert DNA sequences into complex digital genomic signals. The analysis of the cumulated phase and unwrapped phase of DNA genomic signals reveals large-scale features of eukaryote and prokaryote chromosomes that result from statistical regularities of base and base-pair distributions along DNA strands. By reorienting the chromosome coding regions, a "hidden" linear variation of the cumulated phase has been revealed, along with the conspicuous almost linear variation of the unwrapped phase. A model of chromosome longitudinal structure is inferred on these bases.

  10. Structural variations in plant genomes

    PubMed Central

    Edwards, David; Varshney, Rajeev K.

    2014-01-01

    Differences between plant genomes range from single nucleotide polymorphisms to large-scale duplications, deletions and rearrangements. The large polymorphisms are termed structural variants (SVs). SVs have received significant attention in human genetics and were found to be responsible for various chronic diseases. However, little effort has been directed towards understanding the role of SVs in plants. Many recent advances in plant genetics have resulted from improvements in high-resolution technologies for measuring SVs, including microarray-based techniques, and more recently, high-throughput DNA sequencing. In this review we describe recent reports of SV in plants and describe the genomic technologies currently used to measure these SVs. PMID:24907366

  11. Intellectual property issues in genomics.

    PubMed

    Eisenberg, R S

    1996-08-01

    Controversy over intellectual property rights in the results of large-scale cDNA sequencing raises intriguing questions about the roles of the public and private sectors in genomics research, and about who stands to benefit (and who stands to lose) from the private appropriation of genomic information. While the US Patent and Trademark Office has rejected patent applications on cDNA fragments of unknown function from the National Institutes of Health, private firms have pursued three distinct strategies for exploiting unpatented cDNA sequence information: exclusive licensing, non-exclusive licensing and dedication to the public domain. PMID:8987463

  12. [Nutritional genomics: an approach to the genome-environment interaction].

    PubMed

    Xacur-García, Fiona; Castillo-Quan, Jorge I; Hernández-Escalante, Víctor M; Laviada-Molina, Hugo

    2008-11-01

    Nutritional genomics forms part of the genomic sciences and addresses the interaction between genes and the human diet, its influence on metabolism and subsequent susceptibility to develop common diseases. It encompasses both nutrigenomics, which explores the effects of nutrients on the genome, proteome and metabolome; and nutrigenetics, that explores the effects of genetic variations on the diet/disease interaction. A number of mechanisms drive the gene/diet interaction: elements in the diet can act as links for transcription factor receptors and after intermediary concentrations, thereby modifying chromatin and impacting genetic regulation; affect signal pathways, regulating phosphorylation of tyrosine in receptors; decrease signaling through the inositol pathway; and act through epigenetic mechanisms, silencing DNA fragments by methylation of cytosine. The signals generated by polyunsaturated fatty acids are so powerful that they can even bypass insulin mediated lipogenesis, stimulated by carbohydrates. Some fatty acids modify the expression of genes that participate in fatty acid transport by lipoproteins. Nutritional genomics has myriad possible therapeutic and preventive applications: in patients with enzymatic deficiencies; in those with a genetic predisposition to complex diseases such as dyslipidemia, diabetes and cancer; in those that already suffer these diseases; in those with altered mood or memory; during the aging process; in pregnant women; and as a preventive measure in the healthy population. PMID:19301779

  13. Genome editing assessment using CRISPR Genome Analyzer (CRISPR-GA)

    PubMed Central

    Güell, Marc; Yang, Luhan; Church, George M.

    2014-01-01

    Summary: Clustered regularly interspaced short palindromic repeats (CRISPR)-based technologies have revolutionized human genome engineering and opened countless possibilities to basic science, synthetic biology and gene therapy. Albeit the enormous potential of these tools, their performance is far from perfect. It is essential to perform a posterior careful analysis of the gene editing experiment. However, there are no computational tools for genome editing assessment yet, and current experimental tools lack sensitivity and flexibility. We present a platform to assess the quality of a genome editing experiment only with three mouse clicks. The method evaluates next-generation data to quantify and characterize insertions, deletions and homologous recombination. CRISPR Genome Analyzer provides a report for the locus selected, which includes a quantification of the edited site and the analysis of the different alterations detected. The platform maps the reads, estimates and locates insertions and deletions, computes the allele replacement efficiency and provides a report integrating all the information. Availability and implementation: CRISPR-GA Web is available at http://crispr-ga.net. Documentation on CRISPR-GA instructions can be found at http://crispr-ga.net/documentation.html Contact: mguell@genetics.med.harvard.edu PMID:24990609

  14. Cancer Genome Anatomy Project | Office of Cancer Genomics

    Cancer.gov

    The National Cancer Institute (NCI) Cancer Genome Anatomy Project (CGAP) is an online resource designed to provide the research community access to biological tissue characterization data. Request a free copy of the CGAP Website Virtual Tour CD from ocg@mail.nih.gov.

  15. Translational Genomics of Onion: Challenges of an Enormous Nuclear Genome

    Technology Transfer Automated Retrieval System (TEKTRAN)

    The use of high throughput DNA sequencing to address important production constraints has been termed “translational genomics”. Classical breeding of onion (Allium cepa) is expensive and slow due to a long generation time and the high costs of crossing with insects. Translational genomics should r...

  16. Cancer Genome Anatomy Project (CGAP) | Office of Cancer Genomics

    Cancer.gov

    CGAP generated a wide range of genomics data on cancerous cells that are accessible through easy-to-use online tools. Researchers, educators, and students can find "in silico" answers to biological questions through the CGAP website. Request a free copy of the CGAP Website Virtual Tour CD from ocg@mail.nih.gov to learn how to navigate the website.

  17. The Human Genome Project, and recent advances in personalized genomics.

    PubMed

    Wilson, Brenda J; Nicholls, Stuart G

    2015-01-01

    The language of "personalized medicine" and "personal genomics" has now entered the common lexicon. The idea of personalized medicine is the integration of genomic risk assessment alongside other clinical investigations. Consistent with this approach, testing is delivered by health care professionals who are not medical geneticists, and where results represent risks, as opposed to clinical diagnosis of disease, to be interpreted alongside the entirety of a patient's health and medical data. In this review we consider the evidence concerning the application of such personalized genomics within the context of population screening, and potential implications that arise from this. We highlight two general approaches which illustrate potential uses of genomic information in screening. The first is a narrowly targeted approach in which genetic profiling is linked with standard population-based screening for diseases; the second is a broader targeting of variants associated with multiple single gene disorders, performed opportunistically on patients being investigated for unrelated conditions. In doing so we consider the organization and evaluation of tests and services, the challenge of interpretation with less targeted testing, professional confidence, barriers in practice, and education needs. We conclude by discussing several issues pertinent to health policy, namely: avoiding the conflation of genetics with biological determinism, resisting the "technological imperative", due consideration of the organization of screening services, the need for professional education, as well as informed decision making and public understanding. PMID:25733939

  18. Advancing Eucalyptus Genomics: Cytogenomics Reveals Conservation of Eucalyptus Genomes.

    PubMed

    Ribeiro, Teresa; Barrela, Ricardo M; Bergès, Hélène; Marques, Cristina; Loureiro, João; Morais-Cecílio, Leonor; Paiva, Jorge A P

    2016-01-01

    The genus Eucalyptus encloses several species with high ecological and economic value, being the subgenus Symphyomyrtus one of the most important. Species such as E. grandis and E. globulus are well characterized at the molecular level but knowledge regarding genome and chromosome organization is very scarce. Here we characterized and compared the karyotypes of three economically important species, E. grandis, E. globulus, and E. calmadulensis, and three with ecological relevance, E. pulverulenta, E. cornuta, and E. occidentalis, through an integrative approach including genome size estimation, fluorochrome banding, rDNA FISH, and BAC landing comprising genes involved in lignin biosynthesis. All karyotypes show a high degree of conservation with pericentromeric 35S and 5S rDNA loci in the first and third pairs, respectively. GC-rich heterochromatin was restricted to the 35S rDNA locus while the AT-rich heterochromatin pattern was species-specific. The slight differences in karyotype formulas and distribution of AT-rich heterochromatin, along with genome sizes estimations, support the idea of Eucalyptus genome evolution by local expansions of heterochromatin clusters. The unusual co-localization of both rDNA with AT-rich heterochromatin was attributed mainly to the presence of silent transposable elements in those loci. The cinnamoyl CoA reductase gene (CCR1) previously assessed to linkage group 10 (LG10) was clearly localized distally at the long arm of chromosome 9 establishing an unexpected correlation between the cytogenetic chromosome 9 and the LG10. Our work is novel and contributes to the understanding of Eucalyptus genome organization which is essential to develop successful advanced breeding strategies for this genus. PMID:27148332

  19. Advancing Eucalyptus Genomics: Cytogenomics Reveals Conservation of Eucalyptus Genomes

    PubMed Central

    Ribeiro, Teresa; Barrela, Ricardo M.; Bergès, Hélène; Marques, Cristina; Loureiro, João; Morais-Cecílio, Leonor; Paiva, Jorge A. P.

    2016-01-01

    The genus Eucalyptus encloses several species with high ecological and economic value, being the subgenus Symphyomyrtus one of the most important. Species such as E. grandis and E. globulus are well characterized at the molecular level but knowledge regarding genome and chromosome organization is very scarce. Here we characterized and compared the karyotypes of three economically important species, E. grandis, E. globulus, and E. calmadulensis, and three with ecological relevance, E. pulverulenta, E. cornuta, and E. occidentalis, through an integrative approach including genome size estimation, fluorochrome banding, rDNA FISH, and BAC landing comprising genes involved in lignin biosynthesis. All karyotypes show a high degree of conservation with pericentromeric 35S and 5S rDNA loci in the first and third pairs, respectively. GC-rich heterochromatin was restricted to the 35S rDNA locus while the AT-rich heterochromatin pattern was species-specific. The slight differences in karyotype formulas and distribution of AT-rich heterochromatin, along with genome sizes estimations, support the idea of Eucalyptus genome evolution by local expansions of heterochromatin clusters. The unusual co-localization of both rDNA with AT-rich heterochromatin was attributed mainly to the presence of silent transposable elements in those loci. The cinnamoyl CoA reductase gene (CCR1) previously assessed to linkage group 10 (LG10) was clearly localized distally at the long arm of chromosome 9 establishing an unexpected correlation between the cytogenetic chromosome 9 and the LG10. Our work is novel and contributes to the understanding of Eucalyptus genome organization which is essential to develop successful advanced breeding strategies for this genus. PMID:27148332

  20. Joint Genome Institute's Automation Approach and History

    SciTech Connect

    Roberts, Simon

    2006-07-05

    Department of Energy/Joint Genome Institute (DOE/JGI) collaborates with DOE national laboratories and community users, to advance genome science in support of the DOE missions of clean bio-energy, carbon cycling, and bioremediation.